llvm-project

Commit Graph

Author	SHA1	Message	Date
zoecarver	6ffc41b014	[libcxx][ranges] Add `random_access_{iterator,range}`. Differential Revision: https://reviews.llvm.org/D101316	2021-05-04 21:42:55 -07:00
William S. Moses	f4a2dbfe29	[MLIR][SCF] Combine adjacent scf.if with same condition Differential Revision: https://reviews.llvm.org/D101798	2021-05-05 00:39:58 -04:00
Serguei Katkov	9f631d14c6	[GreedyRA] Add support for invoke statepoint with tied-defs. statepoint instruction uses tied-def registers to represent live gc value which is use and def at the same time on a call. At the same time invoke statepoint instruction is a last split point which can throw and jump to landing pad. As a result we have instructon which is last split point with tied-defs registers and we need to teach Greedy RA to work with it. The option -use-registers-for-gc-values-in-landing-pad controls whether statepoint lowering will generate tied-defs for invoke statepoint and is off by default now. To resolve all issues the following changes has been done. 1) Last Split point for invoke statepoint should be statepoint itself If statepoint has a def it is a relocated gc pointer and it should be available in landing pad. So we cannot split interval after statepoint at end of basic block. 2) Do not split interval on tied-def If end of interval for overlap utility is a use which has tied-def we should not split interval on this instruction due to in this case use and def may have different registers and it breaks tied-def property. 3) Take into account Last Split Point for enterIntvAtEnd If the use after Last Split Point is a def so it should be tied-def and we can take the def of the tied-use as ParentVNI and thus tied-use and tied-def will be live in resulting interval. 4) Handle the case when def is after LIP in InlineSpiller If def of LI is after last insertion point of basic block we cannot hoist in this BB. The example of such instruction is invoke statepoint where def represents the relocated live gc pointer. Invoke is a last insertion point and its def is located after it. In this case there is no place to insert spill and we bail out. 5) Fix removeBackCopies to account empty copies RegAssignMap cannot hold empty interval, so do not set stop to kill value if it produces empty interval. This can happen if we remove back-copy and right before that we have another back-copy. For example, for parent %0 we can get %1 = COPY %0 %2 = COPY %0 while we removing %2 we cannot set kill for %1 due to its empty. 6) Do not hoist copy to BB if its def is after LSP If the parent def is a LastSplitPoint or later we cannot hoist copy to this basic block because inserted copy (or re-materialization) will be located before the def. All parts have been reviewed separately as follows: https://reviews.llvm.org/D100747 https://reviews.llvm.org/D100748 https://reviews.llvm.org/D100750 https://reviews.llvm.org/D100927 https://reviews.llvm.org/D100945 https://reviews.llvm.org/D101028 Reviewers: reames, rnk, void, MatzeB, wmi, qcolombet Reviewed By: reames, qcolombet Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D101150	2021-05-05 11:13:35 +07:00
LLVM GN Syncbot	88ec05b654	[gn build] Port `f2018d6c16`	2021-05-05 03:54:38 +00:00
Lang Hames	f2018d6c16	[ORC] Reintroduce the ORC C API test. This test was removed in `51495fd285` due to broken bots. Its reintroduction is expected to trigger failures on some builders. The test has been modified to print error messages in full, which should aid in tracking these down.	2021-05-04 20:46:00 -07:00
Walter Erquinigo	ade59d5309	[trace] Dedup different source lines when dumping instructions + refactor When dumping the traced instructions in a for loop, like this one 4: for (int a = 0; a < n; a++) 5: do something; there might be multiple LineEntry objects for line 4, but with different address ranges. This was causing the dump command to dump something like this: ``` a.out`main + 11 at main.cpp:4 [1] 0x0000000000400518 movl $0x0, -0x8(%rbp) [2] 0x000000000040051f jmp 0x400529 ; <+28> at main.cpp:4 a.out`main + 28 at main.cpp:4 [3] 0x0000000000400529 cmpl $0x3, -0x8(%rbp) [4] 0x000000000040052d jle 0x400521 ; <+20> at main.cpp:5 ``` which is confusing, as main.cpp:4 appears twice consecutively. This diff fixes that issue by making the line entry comparison strictly about the line, column and file name. Before it was also comparing the address ranges, which we don't need because our output is strictly about what the user sees in the source. Besides, I've noticed that the logic that traverses instructions and calculates symbols and disassemblies had too much coupling, and made my changes harder to implement, so I decided to decouple it. Now there are two methods for iterating over the instruction of a trace. The existing one does it on raw load addresses, but the one provides a SymbolContext and an InstructionSP, and does the calculations efficiently (not as efficient as possible for now though), so the caller doesn't need to care about these details. I think I'll be using that iterator to reconstruct the call stacks. I was able to fix a test with this change. Differential Revision: https://reviews.llvm.org/D100740	2021-05-04 19:40:52 -07:00
Jianzhou Zhao	bf4e1cf80a	Revert "[sanitizer_common] Recycle StackDepot memory" This reverts commit `78804e6b20`.	2021-05-05 00:57:34 +00:00
Jianzhou Zhao	1fb612d060	[dfsan] Add a DFSan allocator This is a part of https://reviews.llvm.org/D101204 Reviewed By: morehouse Differential Revision: https://reviews.llvm.org/D101666	2021-05-05 00:51:45 +00:00
Jianzhou Zhao	78804e6b20	[sanitizer_common] Recycle StackDepot memory This relates to https://reviews.llvm.org/D95835. In DFSan origin tracking we use StackDepot to record stack traces and origin traces (like MSan origin tracking). For at least two reasons, we wanted to control StackDepot's memory cost 1) We may use DFSan origin tracking to monitor programs that run for many days. This may eventually use too much memory for StackDepot. 2) DFSan supports flush shadow memory to reduce overhead. After flush, all existing IDs in StackDepot are not valid because no one will refer to them.	2021-05-05 00:51:45 +00:00
Med Ismail Bennani	d5069dace7	[lldb/Symbol] Fix typo in SymbolFilePDBTests (NFC) Signed-off-by: Med Ismail Bennani <medismail.bennani@gmail.com>	2021-05-05 00:38:41 +00:00
Med Ismail Bennani	30fcdf0b19	[lldb/Symbol] Update SymbolFilePDB unitest with SourceLocationSpec This patch should fix the windows test failure following `3e2ed7440569`. It makes use of a `SourceLocationSpec` object when resolving a symbol context from `SymbolFilePDB` file. Signed-off-by: Med Ismail Bennani <medismail.bennani@gmail.com>	2021-05-05 00:34:44 +00:00
Fangrui Song	96f3a63076	[llvm-objcopy] --dump-section: error if '=' is missing or filename is empty Fix PR45416: the diagnostic when '=' is missing is misleading. `FileOutputBuffer::create` returns successfully when the filename is empty (the temporary file is `.tmp%%%%%%%`), but `FileOutputBuffer::commit` will error when renaming `.tmp%%%%%%%` to the empty name). Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D101697	2021-05-04 17:30:57 -07:00
Giorgis Georgakoudis	f016c06abb	Revert "[OpenMP][NFC] Refactor Clang OpenMP tests using update_cc_test_checks" This reverts commit `956cae2f09`.	2021-05-04 17:12:32 -07:00
Aart Bik	a2c9d4bb04	[mlir][sparse] Introduce proper sparsification passes This revision migrates more code from Linalg into the new permanent home of SparseTensor. It replaces the test passes with proper compiler passes. NOTE: the actual removal of the last glue and clutter in Linalg will follow Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D101811	2021-05-04 17:10:09 -07:00
Han Zhu	da1cdffbb1	[loop-idiom] Hoist loop memcpys to loop preheader For a simple loop like: ``` struct S { int x; int y; char b; }; unsigned foo(S* __restrict__ a, S* b, int n) { for (int i = 0; i < n; i++) a[i] = b[i]; return sizeof(a[0]); } ``` We could eliminate the loop and convert it to a large memcpy of 12n bytes. Currently this is not handled. Output of `opt -loop-idiom -S < memcpy_before.ll` ``` %struct.S = type { i32, i32, i8 } define dso_local i32 @_Z3fooP1SS0_i(%struct.S noalias nocapture %a, %struct.S* nocapture readonly %b, i32 %n) local_unnamed_addr { entry: %cmp7 = icmp sgt i32 %n, 0 br i1 %cmp7, label %for.body.preheader, label %for.cond.cleanup for.body.preheader: ; preds = %entry br label %for.body for.cond.cleanup.loopexit: ; preds = %for.body br label %for.cond.cleanup for.cond.cleanup: ; preds = %for.cond.cleanup.loopexit, %entry ret i32 12 for.body: ; preds = %for.body, %for.body.preheader %i.08 = phi i32 [ %inc, %for.body ], [ 0, %for.body.preheader ] %idxprom = zext i32 %i.08 to i64 %arrayidx = getelementptr inbounds %struct.S, %struct.S* %b, i64 %idxprom %arrayidx2 = getelementptr inbounds %struct.S, %struct.S* %a, i64 %idxprom %0 = bitcast %struct.S* %arrayidx2 to i8* %1 = bitcast %struct.S* %arrayidx to i8* call void @llvm.memcpy.p0i8.p0i8.i64(i8* nonnull align 4 dereferenceable(12) %0, i8* nonnull align 4 dereferenceable(12) %1, i64 12, i1 false) %inc = add nuw nsw i32 %i.08, 1 %cmp = icmp slt i32 %inc, %n br i1 %cmp, label %for.body, label %for.cond.cleanup.loopexit } ; Function Attrs: argmemonly nofree nosync nounwind willreturn declare void @llvm.memcpy.p0i8.p0i8.i64(i8* noalias nocapture writeonly, i8* noalias nocapture readonly, i64, i1 immarg) #0 attributes #0 = { argmemonly nofree nosync nounwind willreturn } ``` The loop idiom pass currently only handles load and store instructions. Since struct S is too big to fit in a register, the loop body contains a memcpy intrinsic. With this change, re-run `opt -loop-idiom -S < memcpy_before.ll`. The loop memcpy is promoted to loop preheader. For this trivial case, the loop is dead and will be removed by another pass. ``` %struct.S = type { i32, i32, i8 } define dso_local i32 @_Z3fooP1SS0_i(%struct.S* noalias nocapture %a, %struct.S* nocapture readonly %b, i32 %n) local_unnamed_addr { entry: %a1 = bitcast %struct.S* %a to i8* %b2 = bitcast %struct.S* %b to i8* %cmp7 = icmp sgt i32 %n, 0 br i1 %cmp7, label %for.body.preheader, label %for.cond.cleanup for.body.preheader: ; preds = %entry %0 = zext i32 %n to i64 %1 = mul nuw nsw i64 %0, 12 call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 4 %a1, i8* align 4 %b2, i64 %1, i1 false) br label %for.body for.cond.cleanup.loopexit: ; preds = %for.body br label %for.cond.cleanup for.cond.cleanup: ; preds = %for.cond.cleanup.loopexit, %entry ret i32 12 for.body: ; preds = %for.body, %for.body.preheader %i.08 = phi i32 [ %inc, %for.body ], [ 0, %for.body.preheader ] %idxprom = zext i32 %i.08 to i64 %arrayidx = getelementptr inbounds %struct.S, %struct.S* %b, i64 %idxprom %arrayidx2 = getelementptr inbounds %struct.S, %struct.S* %a, i64 %idxprom %2 = bitcast %struct.S* %arrayidx2 to i8* %3 = bitcast %struct.S* %arrayidx to i8* %inc = add nuw nsw i32 %i.08, 1 %cmp = icmp slt i32 %inc, %n br i1 %cmp, label %for.body, label %for.cond.cleanup.loopexit } ; Function Attrs: argmemonly nofree nosync nounwind willreturn declare void @llvm.memcpy.p0i8.p0i8.i64(i8* noalias nocapture writeonly, i8* noalias nocapture readonly, i64, i1 immarg) #0 attributes #0 = { argmemonly nofree nosync nounwind willreturn } ``` Reviewed By: zino Differential Revision: https://reviews.llvm.org/D97667	2021-05-04 17:05:04 -07:00
Giorgis Georgakoudis	956cae2f09	[OpenMP][NFC] Refactor Clang OpenMP tests using update_cc_test_checks This patch refactors a subset of Clang OpenMP tests, generating checklines using the update_cc_test_checks script. This refactoring facilitates updating the Clang OpenMP code generation codebase by automating test generation. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D101849	2021-05-04 16:58:45 -07:00
Thomas Lively	f3b769e82f	[WebAssembly] Add codegen test for wasm_simd128.h We previously did not have tests demonstrating that the intrinsics in wasm_simd128.h lower to reasonable LLVM IR. This commit adds such a test. Differential Revision: https://reviews.llvm.org/D101805	2021-05-04 16:11:00 -07:00
Med Ismail Bennani	3e2ed74405	[lldb] Refactor argument group by SourceLocationSpec (NFCI) This patch refactors a good part of the code base turning the usual FileSpec, Line, Column, CheckInlines, ExactMatch arguments into a SourceLocationSpec object. This change is required for a following patch that will add handling of the column line information when doing symbol resolution. Differential Revision: https://reviews.llvm.org/D100965 Signed-off-by: Med Ismail Bennani <medismail.bennani@gmail.com>	2021-05-04 23:04:31 +00:00
Jianzhou Zhao	36cec26b38	[dfsan] move dfsan_flags.h to cc files D101666 needs this change. Reviewed By: morehouse Differential Revision: https://reviews.llvm.org/D101857	2021-05-04 22:54:02 +00:00
Leonard Chan	0277a24f4b	[clang][test] Update -fc++-abi tests This attempts to move driver tests out of Frontend and to Driver, separates RUNs that should fail from RUNs that should succeed, and prevent creating output files or dumping output. Differential Revision: https://reviews.llvm.org/D101867	2021-05-04 15:53:00 -07:00
Louis Dionne	347f69c55f	[libc++] Revert the std::to_address change to avoid relying on element_type. This reverts commit `da456167`, which broke the Clang build. I'm able to reproduce it but I want to give myself a bit more time to investigate. Differential Revision: https://reviews.llvm.org/D101638	2021-05-04 18:50:05 -04:00
Baptiste Saleil	845c8a60e9	[AMDGPU] Add rm line to lit test to cleanup bots	2021-05-04 18:27:50 -04:00
River Riddle	c1c1df6347	[mlir] Fix region successor bug in forward dataflow analysis We weren't properly visiting region successors when the terminator wasn't return like, which could create incorrect results in the analysis. This revision ensures that we properly visit region successors, to avoid optimistically assuming a value is constant when it isn't. Differential Revision: https://reviews.llvm.org/D101783	2021-05-04 14:50:37 -07:00
Florian Hahn	ccebf7a109	[VPlan] Properly handle sinking of replicate regions. This patch updates the code that sinks recipes required for first-order recurrences to properly handle replicate-regions. At the moment, the code would just move the replicate recipe out of its replicate-region, producing an invalid VPlan. When sinking a recipe in a replicate-region, we have to sink the whole region. To do that, we first need to split the block at the target recipe and move the region in between. This patch also adds a splitAt helper to VPBasicBlock to split a VPBasicBlock at a given iterator. Fixes PR50009. Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D100751	2021-05-04 22:36:01 +01:00
Rob Suderman	1f7adf8cb1	[mlir][tosa] Fix tosa.concat by inserting linalg.fill after linalg.init All linalg.init operations must be fed into a linalg operation before subtensor. The inserted linalg.fill guarantees it executes correctly. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D101848	2021-05-04 14:26:28 -07:00
Baptiste Saleil	a018bd5199	[AMDGPU] Fix lit failure introduced by `6a17609157`	2021-05-04 17:25:58 -04:00
Fangrui Song	7cac6a9d7a	[MC] Add MCAsmParser::parseComma to improve diagnostics llvm-mc will error "expected comma" instead of "unexpected token".	2021-05-04 14:13:19 -07:00
Dávid Bolvanský	62fcda9378	Revert "[InstSimplify] Added tests for PR50173, NFC" This reverts commit `4e7a4c73da`. Not needed, pattern is handled by instcombine already.	2021-05-04 23:04:05 +02:00
Arthur O'Dwyer	da456167f5	[libc++] Make sure std::to_address doesn't depend on P::element_type. Differential Revision: https://reviews.llvm.org/D101638	2021-05-04 16:59:25 -04:00
Baptiste Saleil	6a17609157	[AMDGPU] Disable the scalar IR, SDWA and load store vectorizer passes at -O1 This patch disables some of the passes at -O1. These passes have a significant impact on compilation time, so we only want them to be enabled starting from -O2. Differential Revision: https://reviews.llvm.org/D101414	2021-05-04 16:44:39 -04:00
Louis Dionne	17f2d1cb9b	[libc++] Fix QoI bug with construction of std::tuple involving std::any In std::tuple, we should try to avoid calling std::is_copy_constructible whenever we can to avoid surprising interactions with (I believe) compiler builtins. This bug was reported in https://reviews.llvm.org/D96523#2730953. The issue was that when tuple<_Up...> was the same as tuple<_Tp...>, we would short-circuit the _Or (because sizeof...(_Tp) != 1) and go evaluate the following `is_constructible<_Tp, const _Up&>...`. That shouldn't actually be a problem, but see the analysis in https://reviews.llvm.org/D101770#2736470 for why it is with Clang and GCC. Instead, after this patch, we check whether the constructed-from tuple is the same as the current tuple regardless of the number of elements, since we should always prefer the normal copy constructor in that case anyway. Differential Revision: https://reviews.llvm.org/D101770	2021-05-04 16:42:36 -04:00
Fangrui Song	7b1e1fccb0	[MC] Don't capitalize a floating point diagnostic	2021-05-04 13:40:26 -07:00
Matt Arsenault	ccfe017510	GlobalISel: Fix missing newline in debug printing	2021-05-04 16:36:37 -04:00
Matt Arsenault	6dd8834772	X86/GlobalISel: Rely on default assignValueToReg The resulting output is semantically closer to what the DAG emits and is more compatible with the existing CCAssignFns. The returns of f32 in f80 are clearly broken, but they were broken before when using G_ANYEXT to go from f32 to f80.	2021-05-04 16:36:37 -04:00
Fangrui Song	3d473ae72e	[MC] Remove unneeded "in '.xxx' directive" from diagnostics The directive name is not useful because the next line replicates the error line which includes the directive.	2021-05-04 13:30:29 -07:00
Thomas Lively	14ca2e5e22	[WebAssembly] Mark abs of v2i64 as legal We previously had an ISel pattern for i64x2.abs, but because the ISDNode was not marked legal for v2i64, the instruction was not being selected. Differential Revision: https://reviews.llvm.org/D101803	2021-05-04 13:25:32 -07:00
Alina Sbirlea	b14c8f5f6e	Add cal entry for MemorySSA syncs.	2021-05-04 12:56:06 -07:00
Xun Li	def86413d4	[Coroutines] Do not add alloca to the frame if the size is 0 This patch is to address https://bugs.llvm.org/show_bug.cgi?id=49916. When the size of an alloca is 0, it will trigger an assertion in OptimizedStructLayout when being added to the frame. Fix it by not adding it at all. We return index 0 (beginning of the frame) for all 0-sized allocas. Differential Revision: https://reviews.llvm.org/D101841	2021-05-04 12:55:40 -07:00
Adrian Prantl	6c3a10760d	Mark Basic/TargetCXXABI.def as textual in the module map.	2021-05-04 12:52:52 -07:00
Martin Storsjö	70c4930637	[llvm-readobj] [ARMWinEH] Try to resolve label symbols into regular ones Unwind info generated by MSVC tends to have relocations pointing at static "label" symbols like "$LN4" instead of regular ones based on the actual function's name. Try to resolve such symbols to a non-label symbol if possible (ideally to an external symbol), to improve the readability. Differential Revision: https://reviews.llvm.org/D101567	2021-05-04 22:22:18 +03:00
Martin Storsjö	82de4e0753	[LLD] [COFF] Actually include the exported comdat symbols This is a followup to 2b01a417d7ccb001ccc1185ef5fdc967c9fac8d7; previously the RVAs of the exported symbols from comdats were left zero. Thanks to Kleis Auke Wolthuizen for the fix suggestion and pointing out the omission. Differential Revision: https://reviews.llvm.org/D101615	2021-05-04 22:13:08 +03:00
Martin Storsjö	e87fb6d387	[libcxx] Update docs regarding the need for bash/posix tools for tests on Windows. NFC. After `39bbfb7726`, bash is no longer a hard requirement. Differential Revision: https://reviews.llvm.org/D101779	2021-05-04 22:13:08 +03:00
Giorgis Georgakoudis	92f2c39f91	[Utils] Run non-filecheck runlines in-order in update_cc_test_checks The script update_cc_test_checks runs all non-filechecked runlines before the filechecked ones. This creates problems since outputs of those non-filechecked runlines may conflict and that will fail the execution of update_cc_test_checks. This patch executes non-filechecked in the order specified in the test file to avoid this issue. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D101683	2021-05-04 12:06:03 -07:00
Giorgis Georgakoudis	313ee609e1	[OpenMP] Fix non-determinism in clang task codegen (lastprivates) Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D101800	2021-05-04 11:56:31 -07:00
Leonard Chan	9c72a210b5	Fix for test failure caused by `84c4754372`. Reduces the number of targets/triples for this test since not all cmake invocations will build for those targets.	2021-05-04 11:45:32 -07:00
Dan Liew	1971823ecb	[Driver] Fix `ToolChain::getCompilerRTPath()` to return the correct path on Apple platforms. When the target triple was an Apple platform `ToolChain::getOSLibName()` (called by `getCompilerRTPath()`) would return the full OS name including the version number (e.g. `darwin20.3.0`). This is not correct because the library directory for all Apple platforms is `darwin`. This in turn caused * `-print-runtime-dir` to return a non-existant path. * `-print-file-name=<any compiler-rt library>` to return the filename instead of the full path to the library. Two regression tests are included. rdar://77417317 Differential Revision: https://reviews.llvm.org/D101682	2021-05-04 11:28:26 -07:00
Alina Sbirlea	974ff623aa	Add monthly MemorySSA sync.	2021-05-04 11:23:36 -07:00
Fangrui Song	23e2c1b1b3	[llvm-objdump] Delete temporary Hexagon workaround options	2021-05-04 11:05:11 -07:00
Nathan James	61dc0f2b59	[Format] Don't sort includes if DisableFormat is true Fixes https://llvm.org/PR35099. I'm not sure if this decision was intentional but its definitely confusing for users. Reviewed By: MyDeveloperDay, HazardyKnusperkeks, curdeius Differential Revision: https://reviews.llvm.org/D101628	2021-05-04 19:04:12 +01:00
Fangrui Song	e9edd11cda	[Hexagon][test] Migrate llvm-objdump --mv6[0567]t?/--mhvx to --mcpu=hexagonv*/--mattr=+hvx	2021-05-04 11:00:01 -07:00

1 2 3 4 5 ...

387572 Commits All Branches Search

387572 Commits

All Branches