llvm-project

Commit Graph

Author	SHA1	Message	Date
Mogball	72619d101f	[mlir][ods] NFC fix tblgen crash with empty assembly format	2022-02-08 21:13:58 +00:00
tyb0807	d379ec9908	[AArch64] ACLE feature macro for Armv8.8-A MOPS This introduces the new __ARM_FEATURE_MOPS ACLE feature test macro, which signals the availability of the new Armv8.8-A/Armv9.3-A instructions for standardising memcpy, memset and memmove operations. This patch supersedes the one from https://reviews.llvm.org/D116160. Differential Revision: https://reviews.llvm.org/D118199	2022-02-08 21:07:49 +00:00
Krzysztof Parzyszek	7403c02f06	[Hexagon] Fix crash with shuffle_vector of v128f16	2022-02-08 13:05:22 -08:00
Benjamin Kramer	6fde1afecd	[Debuginfod][Symbolizer] Cut dependency cycle after `4a6553f4c2`	2022-02-08 21:53:30 +01:00
Florian Weimer	ef14b78d9a	[sanitizer] Use _thread_db_sizeof_pthread to obtain struct pthread size This symbol has been exported (as an internal GLIBC_PRIVATE symbol) from libc.so.6 starting with glibc 2.34. glibc uses it internally for its libthread_db implementation to enable thread debugging on GDB, so it is unlikely to go away for now. Fixes #52989. Reviewed By: #sanitizers, MaskRay, vitalybuka Differential Revision: https://reviews.llvm.org/D119007	2022-02-08 12:46:41 -08:00
Guillaume Chatelet	7e7ecef980	[libc] Replace type punning with bit_cast Although type punning is defined for union in C, it is UB in C++. This patch introduces a bit_cast function to convert between types in a safe way. This is necessary to get llvm-libc compile with GCC. This patch is extracted from D119002. Differential Revision: https://reviews.llvm.org/D119145	2022-02-08 20:45:59 +00:00
Amir Ayupov	9be6e40d1a	[BOLT][TEST] Add .so instrumentation test Summary: Shared object instrumentation test Test Plan: bin/llvm-lit -a bolt/test/X86/internal-call-instrument-so.s Reviewers: rafauler FBD34064557	2022-02-08 12:43:06 -08:00
Sylvestre Ledru	f2c2e924e7	Fix a typo (occured => occurred) Reported: https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=1005195	2022-02-08 21:35:26 +01:00
Valentin Clement	9b5bb511ad	[flang][codegen] Keep primitive type for extractvalue and insertvalue llvm.insertvalue and llvm.extractvalue need LLVM primitive type for the indexing operands. While upstreaming the TargetRewrite pass the change was made from i32 to index without knowing this restriction. This patch reverts back the types used for indexing in the two ops created in this pass. the error you will receive when lowering to LLVM IR with the current code is the following: ``` 'llvm.insertvalue' op operand #1 must be primitive LLVM type, but got 'index' ``` Reviewed By: jeanPerier, schweitz Differential Revision: https://reviews.llvm.org/D119253	2022-02-08 21:26:38 +01:00
Louis Dionne	817d897b57	[libc++] Remove _LIBCPP_ABI_UNSTABLE Previously, _LIBCPP_ABI_UNSTABLE would be used interchangeably with _LIBCPP_ABI_VERSION >= 2. This was confusing and creating unnecessary complexity. This patch removes _LIBCPP_ABI_UNSTABLE -- instead, the LIBCXX_ABI_UNSTABLE CMake option will result in the LIBCXX_ABI_VERSION being set to '2', the current unstable ABI. As a result, in the code, we only have _LIBCPP_ABI_VERSION to check in order to query the current ABI version. As a fly-by, this also defines the ABI namespace during CMake configuration to reduce complexity in __config. I believe it was previously done this way because we used to try to use __config_site as seldom as possible. Now that we always ship a __config_site, it doesn't really matter and I think being explicit about how the library is configured in the __config_site is actually a feature. Differential Revision: https://reviews.llvm.org/D119173	2022-02-08 15:18:09 -05:00
Louis Dionne	506cf6dc04	[libc++] Fix modules and benchmarks CI builds when incomplete features are disabled Differential Revision: https://reviews.llvm.org/D119036	2022-02-08 15:15:07 -05:00
Mogball	07486395d2	[mlir][ods] Optional Attribute or Type Parameters Implements optional attribute or type parameters, including support for such parameters in the assembly format `struct` directive. Also implements optional groups. Depends on D117971 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D118208	2022-02-08 20:09:44 +00:00
harsh	4a876b13fb	Add case to handle 0-D vectors in FlattenContiguousRowMajorTransferWritePattern and FlattenContiguousRowMajorTransferReadPattern. For 0-D as well as 1-D vectors, both these patterns should return a failure as there is no need to collapse the shape of the source. Currently, only 1-D vectors were handled. This patch handles the 0-D case as well. Reviewed By: Benoit, ThomasRaoux Differential Revision: https://reviews.llvm.org/D119202	2022-02-08 20:00:12 +00:00
Martin Storsjö	079b6d02d1	[clang] [MinGW] Recognize -lcrtdll as a library replacing -lmsvcrt Differential Revision: https://reviews.llvm.org/D119234	2022-02-08 21:57:07 +02:00
James Y Knight	9545976ff1	Revert "[Clang] Propagate guaranteed alignment for malloc and others" The above change assumed that malloc (and friends) would always allocate memory to getNewAlign(), even for allocations which have a smaller size. This is not actually required by spec (a 1-byte allocation may validly have 1-byte alignment). Some real-world malloc implementations do not provide this guarantee, and thus this optimization is breaking programs. Fixes #53540 This reverts commit `c2297544c0`. Differential Revision: https://reviews.llvm.org/D118804	2022-02-08 14:34:44 -05:00
LLVM GN Syncbot	34d557f38a	[gn build] Port `4a6553f4c2`	2022-02-08 19:21:13 +00:00
Daniel Thornburgh	4a6553f4c2	[Debuginfod] [Symbolizer] Break debuginfod out of libLLVM. Debuginfod can pull in libcurl as a dependency, which isn't appropriate for libLLVM. (See https://gitlab.freedesktop.org/mesa/mesa/-/issues/5732). This change breaks out debuginfod into a separate non-component library that can be used directly in llvm-symbolizer. The tool can inject debuginfod into the Symbolizer library via an abstract DebugInfoFetcher interface, breaking the dependency of Symbolizer on debuinfod. See https://github.com/llvm/llvm-project/issues/52731 Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D118413	2022-02-08 19:14:18 +00:00
Amilendra Kodithuwakku	424e850f1e	[clang][ARM] Re-word PACBTI warning. The original warning added in D115501 when pacbti is used with an incompatible architecture was not exactly correct because it was not really ignored and can affect codegen. Therefore reword to say that the pacbti option is incompatible with the given architecture. Reviewed By: chill Differential Revision: https://reviews.llvm.org/D119166	2022-02-08 19:13:02 +00:00
Leonard Chan	4ac58b6102	[clang][Fuchsia] Ensure static sanitizer libs are only linked in after the -nostdlib check Differential Revision: https://reviews.llvm.org/D119201	2022-02-08 10:53:22 -08:00
Steffen Larsen	ead1690d31	Allow parameter pack expansions and initializer lists in annotate attribute These changes make the Clang parser recognize expression parameter pack expansion and initializer lists in attribute arguments. Because expression parameter pack expansion requires additional handling while creating and instantiating templates, the support for them must be explicitly supported through the AcceptsExprPack flag. Handling expression pack expansions may require a delay to when the arguments of an attribute are correctly populated. To this end, attributes that are set to accept these - through setting the AcceptsExprPack flag - will automatically have an additional variadic expression argument member named DelayedArgs. This member is not exposed the same way other arguments are but is set through the new CreateWithDelayedArgs creator function generated for applicable attributes. To illustrate how to implement support for expression pack expansion support, clang::annotate is made to support pack expansions. This is done by making handleAnnotationAttr delay setting the actual attribute arguments until after template instantiation if it was unable to populate the arguments due to dependencies in the parsed expressions.	2022-02-08 13:38:07 -05:00
Alex Brachet	70ae480c82	[libc][NFC] Remove all Linux specific code to respective linux/ directories These were all the non OS agnostic implementations I could find in general directories. Currently none of these functions are actually enabled, but for when they do it makes sense that they be in linux/ specific directories. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D119164	2022-02-08 18:32:18 +00:00
Roman Lebedev	c8ba2b67a0	[SimplifyCFG] 'merge compatible invokes': fully support indirect invokes As long as all the invokes in the set are indirect, we can merge them, but don't merge direct invokes into the set, even though it would be legal to do.	2022-02-08 21:29:38 +03:00
Roman Lebedev	414b47645d	[SimplifyCFG] 'merge compatible invokes': don't create trivial PHI's with all-identical incoming values	2022-02-08 21:29:38 +03:00
Roman Lebedev	e2aed0b047	[NFC][SimplifyCFG] 'merge compatible invokes': tests for indirect invokes.	2022-02-08 21:29:38 +03:00
Stanislav Mekhanoshin	aeaf85b9c2	[AMDGPU] Select VGPR versions of MFMA if possible We can select _vgprcd versions of MAI instructions and have no AGPRs with the whole budget left for VGPRs if: 1. This is a kernel; 2. It has no calls; 3. It runs at least on 2 waves thus having not more that 256 VGPRs. 4. There is no inline asm requesting AGPRs. Differential Revision: https://reviews.llvm.org/D117253	2022-02-08 10:19:41 -08:00
Mahesh Ravishankar	2abd7f13bc	[mlir][Linalg] NFC: Combine elementwise fusion test passes. There are a few different test passes that check elementwise fusion in Linalg. Consolidate them to a single pass controlled by different pass options (in keeping with how `TestLinalgTransforms` exists).	2022-02-08 18:08:37 +00:00
Josh Mottley	5ebbcfa01d	[flang] Upstream partial lowering of GET_ENVIRONMENT_VARIABLE intrinsic This patch adds partial lowering of the "GET_ENVIRONMENT_VARIABLE" intrinsic to the backend runtime hook implemented in patches D111394 and D112698. It also renames the `isPresent` lambda to `isAbsent` and moves it out to its own function in `Command.cpp`. Corresponding comment fixes for this are also modified. Lastly it adds the i1 type to `RuntimeCallTestBash.h`. Differential Revision: https://reviews.llvm.org/D118984	2022-02-08 17:42:53 +00:00
Craig Topper	56d6ccd4cb	[X86] Update register RCL/RCR by 1 and immediate scheduling for Intel CPUs Most Intel CPU scheduler files lumped the immediate and 1 instructions together, but uops.info shows they are quite different. For the most part the by 1 instructions were pretty accurate to the uops.info data except the latency was 3 instead of 2 as uops.info indicates. The by immediate instructions need 7 or 8 uops and have higher latency. It looks like the 8-bit by immediate instructions may need even more uops, but I just lumped them with the 16/32/64. Noticed while checking out PR53648. So mostly I cared about the by 1 instructions. Reviewed By: RKSimon, pengfei Differential Revision: https://reviews.llvm.org/D119217	2022-02-08 09:20:20 -08:00
Corentin Jabot	c151225096	[C++2b] Implement multidimentional subscript operator Implement P2128R6 in C++23 mode. Unlike GCC's implementation, this doesn't try to recover when a user meant to use a comma expression. Because the syntax changes meaning in C++23, the patch is NOT implemented as an extension. Instead, declaring an array with not exactly 1 parameter is an error in older languages modes. There is an off-by-default extension warning in C++23 mode. Unlike the standard, we supports default arguments; Ie, we assume, based on conversations in WG21, that the proposed resolution to CWG2507 will be accepted. We allow arrays OpenMP sections and C++23 multidimensional array to coexist: [a , b] multi dimensional array [a : b] open mp section [a, b: c] // error The rest of the patch is relatively straight forward: we take care to support an arbitrary number of arguments everywhere.	2022-02-08 12:10:47 -05:00
Joseph Huber	caf7f05c1c	[Attributor] Emit fixed-point remark on function list This patch replaces the function we emit the remark on when we run into the fix-point limit. Previously we got a function to emit a remark on from the worklist's associated function. However, the worklist may not always have an associated function in the case of global variables. Replace this with the function set, and if there are no functions don't emit the remark. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D119248	2022-02-08 12:10:21 -05:00
Joseph Huber	99d72ebddf	[Libomptarget] Add header files as a dependency to CMake target This patch manually adds the runtime include files to the list of dependencies when we build the bitcode runtime library. Previously if only the header was changed we would not recompile the source files. The solution used here isn't optimal because every source file not has a dependency on each header file regardless of if it was actually used by that file. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D119254	2022-02-08 12:09:59 -05:00
Krzysztof Parzyszek	2ecda9ec9c	[Hexagon] Alter meaning of versionless -mhvx The documentation for the official (downstream) Qualcomm Hexagon Clang states that -mhvx sets the HVX version to be the same as the CPU version. The current implementation upstream would use the most recent versioned -mhvx= flag first (if present), then the CPU version. Change the upstream behavior to match the documented behavior of the downstream compiler.	2022-02-08 09:06:15 -08:00
Dawid Jurczak	5d8d3a11c4	[NFC] Increase initial size of FoldingSets used in ASTContext and CodeGenTypes Among many FoldingSet users most notable seem to be ASTContext and CodeGenTypes. The reasons that we spend not-so-tiny amount of time in FoldingSet calls from there, are following: 1. Default FoldingSet capacity for 2^6 items very often is not enough. For PointerTypes/ElaboratedTypes/ParenTypes it's not unlikely to observe growing it to 256 or 512 items. FunctionProtoTypes can easily exceed 1k items capacity growing up to 4k or even 8k size. 2. FoldingSetBase::GrowBucketCount cost itself is not very bad (pure reallocations are rather cheap thanks to BumpPtrAllocator). What matters is high collision rate when lot of items end up in same bucket slowing down FoldingSetBase::FindNodeOrInsertPos and trashing CPU cache (as items with same hash are organized in intrusive linked list which need to be traversed). This change address both issues by increasing initial size of FoldingSets used in ASTContext and CodeGenTypes. Extracted from: https://reviews.llvm.org/D118385 Differential Revision: https://reviews.llvm.org/D118608	2022-02-08 17:54:04 +01:00
Benjamin Kramer	78eeda7529	[MLIR][Presburger] Fix linkage of functions in header Static functions in a header cause spurious unused function warnings.	2022-02-08 17:51:34 +01:00
Jacques Pienaar	d15baefafa	[mlir][bazel] Update post 24a1	2022-02-08 08:48:33 -08:00
Bixia Zheng	61a3dd70ff	[mlir][taco] Use sparse_tensor.out to write sparse tensors to files. Add a Python method, output_sparse_tensor, to use sparse_tensor.out to write a sparse tensor value to a file. Modify the method that evaluates a tensor expression to return a pointer of the MLIR sparse tensor for the result to delay the extraction of the coordinates and non-zero values. Implement the Tensor to_file method to evaluate the tensor assignment and write the result to a file. Add unit tests. Modify test golden files to reflect the change that TNS outputs now have a comment line and two meta data lines. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D118956	2022-02-08 08:47:05 -08:00
Balazs Benics	620d99b7ed	Revert "[analyzer] Prevent misuses of -analyze-function" This reverts commit `841817b1ed`. Ah, it still fails on build bots for some reason. Pinning the target triple was not enough.	2022-02-08 17:42:46 +01:00
Mark de Wever	5dc1da3e30	[libc++][nfc] Use TEST_SAFE_STATIC. This avoids using an libc++ internal macro in our tests. Reviewed By: #libc, philnik, ldionne Differential Revision: https://reviews.llvm.org/D118874	2022-02-08 17:39:21 +01:00
Mark de Wever	a0071b9353	[libc++] Removes cpp17_output_iterator's default constructor. This has been suggested in D117950. Reviewed By: ldionne, #libc, philnik Differential Revision: https://reviews.llvm.org/D118971	2022-02-08 17:38:30 +01:00
Andy Yankovsky	3df88ec335	[Support] Don't print stacktrace if DbgHelp.dll hasn't been loaded yet On Windows certain function from `Signals.h` require that `DbgHelp.dll` is loaded. This typically happens when the main program calls `llvm::InitLLVM`, however in some cases main program doesn't do that (e.g. when the application is using LLDB via `liblldb.dll`). This patch adds a safe guard to prevent crashes. More discussion in https://reviews.llvm.org/D119009. Reviewed By: aganea Differential Revision: https://reviews.llvm.org/D119181	2022-02-08 16:37:36 +00:00
Mircea Trofin	5a50ab4d5c	[nfc][mlgo][regalloc] Stop warnings about unused function Added a `NoopSavedModelImpl` type which can be used as a mock AOT-ed saved model, and further minimize conditional compilation cases. This also removes unused function warnings on gcc.	2022-02-08 08:35:33 -08:00
Krzysztof Drewniak	24a1869d00	[MLIR][GPU] Update GPUToROCDL to account for ControlFlow dialect The conversion to the new ControlFlow dialect didn't change the GPUToROCDL pass - this commit fixes this issue. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D119188	2022-02-08 16:34:34 +00:00
Hongtao Yu	34e131b0f2	[llvm-profgen] On-demand track optimized-away inlinees for preinliner. Tracking optimized-away inlinees based on all probes in a binary is expansive in terms of memory usage I'm making the tracking on-demand based on profiled functions only. This saves about 10% memory overall for a medium-sized benchmark. Before: note: After parsePerfTraces note: Thu Jan 27 18:42:09 2022 note: VM: 8.68 GB RSS: 8.39 GB note: After computeSizeForProfiledFunctions note: Thu Jan 27 18:42:41 2022 note: VM: 10.63 GB RSS: 10.20 GB note: After generateProbeBasedProfile note: Thu Jan 27 18:45:49 2022 note: VM: 25.00 GB RSS: 24.95 GB note: After postProcessProfiles note: Thu Jan 27 18:49:29 2022 note: VM: 26.34 GB RSS: 26.27 GB After: note: After parsePerfTraces note: Fri Jan 28 12:04:49 2022 note: VM: 8.68 GB RSS: 7.65 GB note: After computeSizeForProfiledFunctions note: Fri Jan 28 12:05:26 2022 note: VM: 8.68 GB RSS: 8.42 GB note: After generateProbeBasedProfile note: Fri Jan 28 12:08:03 2022 note: VM: 22.93 GB RSS: 22.89 GB note: After postProcessProfiles note: Fri Jan 28 12:11:30 2022 note: VM: 24.27 GB RSS: 24.22 GB This should be a no-diff change in terms of profile quality. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D118515	2022-02-08 08:33:23 -08:00
Mark de Wever	e885b1137b	[libc++][format[[nfc] Use string_view in tests. This change is a preparation for adapting the tests for P2216 std::format improvements Reviewed By: #libc, Quuxplusone, ldionne Differential Revision: https://reviews.llvm.org/D118717	2022-02-08 17:31:18 +01:00
Balazs Benics	841817b1ed	[analyzer] Prevent misuses of -analyze-function Sometimes when I pass the mentioned option I forget about passing the parameter list for c++ sources. It would be also useful newcomers to learn about this. This patch introduces some logic checking common misuses involving `-analyze-function`. Reviewed-By: martong Differential Revision: https://reviews.llvm.org/D118690	2022-02-08 17:27:57 +01:00
Matt Arsenault	f2c99ea47d	AMDGPU: Use reserved VGPR for AGPR spills to memory Previously would reuse the VGPR used for large frame offsets with the one needed for copying from the AGPR. Fix this by reusing the register we already reserved for handling AGPR to AGPR copies.	2022-02-08 11:26:59 -05:00
Philip Reames	c302f1e677	[SCEV] Generalize SCEVEqualsPredicate to any compare [NFC] PredicatedScalarEvolution has a predicate type for representing A == B. This change generalizes it into something which can represent a A <pred> B. This generality is currently unused, but is motivated by a couple of recent cases which have come up. In particular, I'm currently playing around with using this to simplify the runtime checking code in LoopVectorizer. Regardless of the outcome of that prototyping, generalizing the compare node seemed useful.	2022-02-08 08:18:09 -08:00
Nikita Popov	074561a4a2	[Mem2Reg] Check that load type matches alloca type Alloca promotion can only deal with cases where the load/store types match the alloca type (it explicitly does not support bitcasted load/stores). With opaque pointers this is no longer enforced through the pointer type, so add an explicit check.	2022-02-08 17:16:15 +01:00
Matt Arsenault	8b2ca766f0	AMDGPU: Reserve v32 if we may need to copy between AGPRs on gfx908 We need to guarantee cheap copies between AGPRs, and unfortunately gfx908 cannot directly do this. Theoretically we could set the scavenger up with an emergency spill slot, but it also feels unreasonable to pay that cost for what was assumed to be a simple and cheap copy. Pick a register that doesn't conflict with any ABI registers. This does not address the same issue when copying from SGPR to AGPR for gfx90a (this coincidentally fixes it for gfx908), but that's less interesting since the register allocator shouldn't be proactively introducing such copies. One edge case I'm worried about is respecting the VGPR budget implied by amdgpu-waves-per-eu. If the theoretical upper bound of a function is 32 VGPRs, this will force the actual count to be 33. This is also broken if inline assembly uses/defs something in v32. The coalescer will eliminate the intermediate vreg between the def and use, and the introduced copy will clobber the user value. (cherry picked from commit 3335784ac2d587ff4eac04586e189532ae8b2607)	2022-02-08 11:14:52 -05:00
Matt Arsenault	a7f60bfdf6	AMDGPU: Regenerate mir test checks to include -NEXT	2022-02-08 11:14:52 -05:00

1 2 3 4 5 ...

414206 Commits All Branches Search

414206 Commits

All Branches