llvm-project

Commit Graph

Author	SHA1	Message	Date
Richard Smith	00e5d38d40	Do not warn that an expression of the form (void)arr; is unused when arr is a volatile non-local array. This fixes a recent regression exposed by removing lvalue-to-rvalue conversion of discarded volatile arrays. In passing, regularize the rules we use to determine whether '(void)expr;' warns when expr is a volatile glvalue.	2020-05-27 17:26:29 -07:00
Philip Reames	c94c5bf9cc	Introduce a GCStatepointInst type analogous to IntrinsicInst subclasses Back when we had CallSite, we implemented the current Statepoint/ImmutableStatepoint structure in analogous manner. Now that CallSite has been removed, the structure used for statepoints looks decidedly out of place. gc.statepoint is one of the small handful of intrinsics which are invokable. Because of this, it can't subclass IntrinsicInst as is idiomatic. This change simply introduces the GCStatepointInst class and restructures the existing Statepoint/ImmutableStatepoint types to wrap it. I will be landing a series of changes to sink functionality into GCStatepointInst and updating callers to be more idiomatic.	2020-05-27 17:25:13 -07:00
Fangrui Song	eca963f244	[gn build] Add MLAnalysisTests after D80579	2020-05-27 17:21:05 -07:00
Mircea Trofin	d14ee1553e	[llvm][NFC] ProfileSummaryInfo - const-ify APIs Follow-up from https://reviews.llvm.org/D79920	2020-05-27 17:14:41 -07:00
Fangrui Song	dee2bb5810	[gn build] Port D80579	2020-05-27 17:12:12 -07:00
Rui Ueyama	54d2896852	[ELF] --wrap: Drop __real_ symbol from the symbol table In D34993, we discussed and concluded that we should drop the `__real_` symbol from the symbol table, but I did the opposite in D50569. This patch drops the `__real_` symbol. MaskRay's note: omitting `__real_` is important if it is undefined: otherwise a subsequent link may error due to the undefined `__real_` in .dynsym Differential Revision: https://reviews.llvm.org/D51283	2020-05-27 16:58:00 -07:00
Layton Kifer	2bf3fe9b6d	[TRE] Allow elimination when the returned value is non-constant Currently we can only eliminate call return pairs that either return the result of the call or a dynamic constant. This patch removes that limitation. Differential Revision: https://reviews.llvm.org/D79660	2020-05-27 16:55:03 -07:00
Stanislav Mekhanoshin	7392bbc301	AMDGPU/GlobalISel: Fixed insert element for non-standard vectors Differential Revision: https://reviews.llvm.org/D80653	2020-05-27 16:26:22 -07:00
Leonard Chan	ef37444058	[Lexer] Fix invalid suffix diagnostic for fixed-point literals Committing on behalf of nagart, who authored this patch. Differential Revision: https://reviews.llvm.org/D80412	2020-05-27 16:16:56 -07:00
Matt Arsenault	5e007fe998	AMDGPU: Support non-entry block static sized allocas OpenMP emits these for some reason, so handle them. Assume these use 4096 bytes by default, with a flag to override this. Also change the related stack assumption for calls to have a flag.	2020-05-27 18:46:10 -04:00
Matt Arsenault	dda82986f9	DAG: Fix expansion of DYNAMIC_STACKALLOC for StackGrowsUp targets Can't test this since I can't directly use the default expansion for AMDGPU. It needs to scale the amount by the wave size, rather than use the raw byte size value.	2020-05-27 18:45:40 -04:00
Stanislav Mekhanoshin	8aa81aaebe	AMDGPU/GlobalISel: Fixed handling of non-standard vectors We do not have register classes for all possible vector sizes, so round it up for extract vector element. Also fixes selection of G_MERGE_VALUES when vectors are not a power of two. This required refactoring getRegSplitParts() so that it can handle more than just power-of-two vectors. Ideally we would like RegSplitParts to be generated by tablegen. Differential Revision: https://reviews.llvm.org/D80457	2020-05-27 15:44:09 -07:00
Fangrui Song	be6bffe729	[CMake] Revert `cf86a234ba` It is unnecessary after `993bbaf6a3`	2020-05-27 15:29:22 -07:00
Fangrui Song	993bbaf6a3	[MLPolicies] Fix dependency and -DBUILD_SHARED_LIBS=on builds after D80579	2020-05-27 15:26:13 -07:00
Mircea Trofin	cf86a234ba	Fix shared libs build break introduced in rG98ef93eabd76	2020-05-27 15:12:16 -07:00
Adrian McCarthy	2d068e534f	Fix Windows command line bug when last token in response file is "" Patch by Neil Dhar <dhar@alumni.duke.edu> Current state machine for parsing tokens from response files in Windows does not correctly handle the case where the last token is "". The current implementation handles the last token by only adding it if it is not empty, however this does not cover the case where the last token is meant to be the empty string. We can cover this case by checking whether the state machine was last in the UNQUOTED state, which indicates that the last character of the input was a non-whitespace character. Differential Revision: https://reviews.llvm.org/D78346	2020-05-27 14:49:30 -07:00
MaheshRavishankar	0a072b8a0d	[mlir][Linalg] Add missing library linkage for shared library builds. Differential Revision: https://reviews.llvm.org/D80664	2020-05-27 14:29:35 -07:00
Adrian Prantl	a57a67c59b	Fix a use-after-free in GetXcodeSDKPath Introduced in https://reviews.llvm.org/D80595. Thanks Jonas for noticing! Differential Revision: https://reviews.llvm.org/D80666	2020-05-27 14:27:16 -07:00
Louis Dionne	f46bb9dd5c	[NFC] Reformat TEST_FOO macros in test_macros.h To make them easier to read and to make it easier to add new ones.	2020-05-27 16:54:43 -04:00
Jonas Devlieghere	f9bea9bc4a	[lldb/Reproducers] Skip & add FIXME to tests failing with unexpected packet. Add skip decorator to tests failing with an unexpected packet during passive replay.	2020-05-27 13:52:48 -07:00
Jonas Devlieghere	5f97a540ad	[lldb/Reproducers] Differentiate active and passive replay unexpected packet.	2020-05-27 13:52:38 -07:00
Sean Silva	25132b36a8	[mlir][shape] Use IndexElementsAttr in Shape dialect. Summary: Index is the proper type for storing shapes when constant folding, so this fixes the previous code (which was using i64). Differential Revision: https://reviews.llvm.org/D80600	2020-05-27 13:39:49 -07:00
Sean Silva	9546d8b108	[mlir][core] Add IndexElementsAttr helpers. Summary: In a follow-up, I'll update the Shape dialect to use this instead of I64ElementsAttr. Differential Revision: https://reviews.llvm.org/D80601	2020-05-27 13:39:48 -07:00
Mircea Trofin	98ef93eabd	[llvm] Add function feature extraction analysis Summary: This patch introduces an analysis pass to extract function features, which will be needed by the ML InlineAdvisor. RFC: http://lists.llvm.org/pipermail/llvm-dev/2020-April/140763.html Reviewers: davidxl, dblaikie, jdoerfert Subscribers: mgorny, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80579	2020-05-27 13:38:50 -07:00
Michael Liao	fa342b5c80	Enable `align <n>` to be used in the intrinsic definition. - This allows us to specify the (minimal) alignment on an intrinsic's arguments and, more importantly, the return value. Differential Revision: https://reviews.llvm.org/D80422	2020-05-27 16:38:18 -04:00
Michael Liao	03481287ca	Refactor argument attribute specification in intrinsic definition. NFC. - Argument attributes need to be specified through `ArgIndex<n>` (corresponding to `FirstArgIndex`) to distinguish them explicitly from the index number in the overloaded type list. - In addition, `RetIndex` (corresponding to `ReturnIndex`) and `FuncIndex` (corresponding to `FunctionIndex`) are introduced for us to associate attributes on the return value and potentially function itself. Differential Revision: https://reviews.llvm.org/D80422	2020-05-27 16:37:53 -04:00
Vitaly Buka	804a39a201	[NFC,StackSafety] Rename some variables	2020-05-27 13:33:28 -07:00
Vitaly Buka	14f3357586	[StackSafety] Bailout more aggressively Many edge cases, e.g. wrapped ranges, can be processed precisely without bailout. However it's very unlikely that memory access with min/max integer offsets will be classified as safe anyway. Early bailout may help with ThinLTO where we can drop unsafe parameters from summaries.	2020-05-27 13:33:28 -07:00
Sean Silva	b277382311	Remove error-prone mlir::ExecutionEngine::invoke overload. I just spent a bunch of time debugging a mysterious bug that ended up being due to my SmallVector getting passed to the Args&... overload instead of the MutableArrayRef overload, with disastrous results. I appreciate the intent of this API, but for a function that does a bunch of unsafe casts, adding in potential overload confusion is just too much C++ footgun. If we end up needing this functionality, having something like a separate `packArgs(Args&...) -> SmallVector` overload would be preferable. Turns out this API is unused and untested (even out of tree as far as I can tell, modulo the optional passing of no args to the other invoke as I fixed in this patch), so it's an easy fix -- just delete it and touch up the other overload. Differential Revision: https://reviews.llvm.org/D80607	2020-05-27 13:26:03 -07:00
Juneyoung Lee	54b6457240	[TargetPassConfig] Add CanonicalizeFreezeInLoops before LSR Summary: This patch adds CanonicalizeFreezeInLoops before LSR. Relevant patch: https://reviews.llvm.org/D77523 Reviewers: spatel, efriedma, jdoerfert, fhahn, nikic, reames, xbolva00 Reviewed By: nikic Subscribers: xbolva00, nikic, lebedev.ri, hiraditya, llvm-commits, sanwou01, nlopes Tags: #llvm Differential Revision: https://reviews.llvm.org/D77524	2020-05-28 05:21:12 +09:00
Nicolas Vasilache	79aa9bfdb8	[mlir] Fix RunnerUtils template specialization Undoing a spurious change that broke SFINAE for some out of core use cases.	2020-05-27 16:14:43 -04:00
MaheshRavishankar	c6fa2efd48	[mlir][Linalg] Fix build failure from D80188 Differential Revision: https://reviews.llvm.org/D80657	2020-05-27 13:06:43 -07:00
Michael Liao	49688b3c30	Fix `-Wpedantic` warning. NFC.	2020-05-27 15:57:03 -04:00
Jessica Paquette	c593bf5342	[GlobalISel] Don't combine instructions which are fed by memory instructions. If we have a memory instruction (e.g. a load), we shouldn't combine it away in some trivial combine. It's possible that, say, a call lives between the instructions. This could modify the value loaded, making the load instructions not safe to fold. Differential Revision: https://reviews.llvm.org/D80053	2020-05-27 12:48:58 -07:00
Dmitry Vyukov	d24dd2b279	tsan: fix test in debug mode sanitizer-x86_64-linux-autoconf has failed after the previous tsan commit: FAIL: ThreadSanitizer-x86_64 :: java_finalizer2.cpp (245 of 403) ****************** TEST 'ThreadSanitizer-x86_64 :: java_finalizer2.cpp' FAILED ****************** Script: -- : 'RUN: at line 1'; /b/sanitizer-x86_64-linux-autoconf/build/tsan_debug_build/./bin/clang --driver-mode=g++ -fsanitize=thread -Wall -m64 -gline-tables-only -I/b/sanitizer-x86_64-linux-autoconf/build/llvm-project/compiler-rt/test/tsan/../ -std=c++11 -I/b/sanitizer-x86_64-linux-autoconf/build/llvm-project/compiler-rt/test/tsan/../ -nostdinc++ -I/b/sanitizer-x86_64-linux-autoconf/build/tsan_debug_build/tools/clang/runtime/compiler-rt-bins/lib/tsan/libcxx_tsan_x86_64/include/c++/v1 -O1 /b/sanitizer-x86_64-linux-autoconf/build/llvm-project/compiler-rt/test/tsan/java_finalizer2.cpp -o /b/sanitizer-x86_64-linux-autoconf/build/tsan_debug_build/tools/clang/runtime/compiler-rt-bins/test/tsan/X86_64Config/Output/java_finalizer2.cpp.tmp && /b/sanitizer-x86_64-linux-autoconf/build/tsan_debug_build/tools/clang/runtime/compiler-rt-bins/test/tsan/X86_64Config/Output/java_finalizer2.cpp.tmp 2>&1 | FileCheck /b/sanitizer-x86_64-linux-autoconf/build/llvm-project/compiler-rt/test/tsan/java_finalizer2.cpp -- Exit Code: 1 Command Output (stderr): -- /b/sanitizer-x86_64-linux-autoconf/build/llvm-project/compiler-rt/test/tsan/java_finalizer2.cpp:82:11: error: CHECK: expected string not found in input // CHECK: DONE ^ <stdin>:1:1: note: scanning from here FATAL: ThreadSanitizer CHECK failed: /b/sanitizer-x86_64-linux-autoconf/build/llvm-project/compiler-rt/lib/tsan/rtl/tsan_sync.cpp:69 "((meta)) == ((0))" (0x4000003e, 0x0) ^ <stdin>:5:12: note: possible intended match here #3 __tsan::OnUserAlloc(__tsan::ThreadState, unsigned long, unsigned long, unsigned long, bool) /b/sanitizer-x86_64-linux-autoconf/build/llvm-project/compiler-rt/lib/tsan/rtl/tsan_mman.cpp:225:16 (java_finalizer2.cpp.tmp+0x4af407) ^ http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-autoconf/builds/51143/steps/test%20tsan%20in%20debug%20compiler-rt%20build/logs/stdio Fix heap object overlap by offsetting java heap as other tests are doing.	2020-05-27 21:48:39 +02:00
alex-t	eb1092ada3	[AMDGPU] Fix for the lost CarryOut/CarryIn register operands in S_ADD/SUB_CO_PSEUDO. Summary: This fixes the `5b898bddff` bug when the carry-in and carry-out registers became lost in lowering S_ADD/SUB_CO_PSEUDO. Reviewers: rampitec, arsenm Reviewed By: arsenm Subscribers: msearles, arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80158	2020-05-27 22:41:04 +03:00
Adrian Prantl	3345521507	Also cache negative results in GetXcodeSDKPath (NFC) This fixes a performance issue in the failure case. rdar://63547920 Differential Revision: https://reviews.llvm.org/D80595	2020-05-27 12:26:04 -07:00
Jonas Devlieghere	c30c2368c7	[lldb/Reproducers] Skip tests relying on timeouts The reproducers don't model timeouts so tests that rely on them end up with unexpected packets during replay. Skip them until we can handle this scenario.	2020-05-27 12:08:41 -07:00
Jonas Devlieghere	fe9d8442e0	[lldb/Test] Generate YAML binary in build directory Although it's not entirely clear to me why, this test was generating its binary in the source directory instead of the build directory. This patch fixes that following the same approach as other tests.	2020-05-27 12:08:41 -07:00
Craig Topper	8e7e6a8d6b	[X86] Restore selection of MULX on BMI2 targets. Looking back over gcc and icc behavior it looks like icc does use mulx32 on 32-bit targets and mulx64 on 64-bit targets. It's also used when dividing i32 by constant on 32-bit targets and i64 by constant on 64-bit targets. gcc uses it for multiplies producing a 64-bit result on 32-bit targets and 128-bit results on a 64-bit target. gcc does not appear to use it for division by constant. After this patch clang is closer to the icc behavior. This basically reverts `d1c61861dd`, but there were no strong feelings at the time. Fixes PR45518. Differential Revision: https://reviews.llvm.org/D80498	2020-05-27 12:01:18 -07:00
Mircea Trofin	fa3b587196	[llvm][NFC] Simplify ProfileSummaryInfo state transitions ProfileSummaryInfo is updated seldom, as a result of very specific triggers. This patch clearly demarcates state updates from read-only uses. This, arguably, improves readability and maintainability.	2020-05-27 11:58:37 -07:00
Sanjay Patel	48cb380abd	[InstCombine] add tests for vector demanded elements of select condition; NFC	2020-05-27 14:49:36 -04:00
Matt Arsenault	4b4496312e	AMDGPU: Start adding MODE register uses to instructions This is the groundwork required to implement strictfp. For now, this should be NFC for regular instructions (many instructions just gain an extra use of a reserved register). Regalloc won't rematerialize instructions with reads of physical registers, but we were suffering from that anyway with the exec reads. Should add it for all the related FP uses (possibly with some extras). I did not add it to either the gpr index mode instructions (or every single VALU instruction) since it's a ridiculous feature already modeled as an arbitrary side effect. Also work towards marking instructions with FP exceptions. This doesn't actually set the bit yet since this would start to change codegen. It seems nofpexcept is currently not implied from the regular IR FP operations. Add it to some MIR tests where I think it might matter.	2020-05-27 14:47:00 -04:00
John Fastabend	13f6c81c5d	[BPF] simplify zero extension with MOV_32_64 The current pattern matching for zext results in the following code snippet being produced, w1 = w0 r1 <<= 32 r1 >>= 32 Because BPF implementations require zero extension on 32bit loads this both adds a few extra unneeded instructions but also makes it a bit harder for the verifier to track the r1 register bounds. For example in this verifier trace we see at the end of the snippet R2 offset is unknown. However, if we track this correctly we see w1 should have the same bounds as r8. R8 smax is less than U32 max value so a zero extend load should keep the same value. Adding a max value of 800 (R8=inv(id=0,smax_value=800)) to an off=0, as seen in R7 should create a max offset of 800. However at the end of the snippet we note the R2 max offset is 0xffffFFFF. R0=inv(id=0,smax_value=800) R1_w=inv(id=0,umax_value=2147483647,var_off=(0x0; 0x7fffffff)) R6=ctx(id=0,off=0,imm=0) R7=map_value(id=0,off=0,ks=4,vs=1600,imm=0) R8_w=inv(id=0,smax_value=800,umax_value=4294967295,var_off=(0x0; 0xffffffff)) R9=inv800 R10=fp0 fp-8=mmmm???? 58: (1c) w9 -= w8 59: (bc) w1 = w8 60: (67) r1 <<= 32 61: (77) r1 >>= 32 62: (bf) r2 = r7 63: (0f) r2 += r1 64: (bf) r1 = r6 65: (bc) w3 = w9 66: (b7) r4 = 0 67: (85) call bpf_get_stack#67 R0=inv(id=0,smax_value=800) R1_w=ctx(id=0,off=0,imm=0) R2_w=map_value(id=0,off=0,ks=4,vs=1600,umax_value=4294967295,var_off=(0x0; 0xffffffff)) R3_w=inv(id=0,umax_value=800,var_off=(0x0; 0x3ff)) R4_w=inv0 R6=ctx(id=0,off=0,imm=0) R7=map_value(id=0,off=0,ks=4,vs=1600,imm=0) R8_w=inv(id=0,smax_value=800,umax_value=4294967295,var_off=(0x0; 0xffffffff)) R9_w=inv(id=0,umax_value=800,var_off=(0x0; 0x3ff)) R10=fp0 fp-8=mmmm???? After this patch R1 bounds are not smashed by the <<=32 >>=32 shift and we get correct bounds on R2 umax_value=800. Further it reduces 3 insns to 1. Signed-off-by: John Fastabend <john.fastabend@gmail.com> Differential Revision: https://reviews.llvm.org/D73985	2020-05-27 11:26:39 -07:00
Lei Huang	2368bf52cd	[PowerPC] Add support for -mcpu=pwr10 in both clang and llvm Summary: This patch simply adds support for the new CPU in anticipation of Power10. There isn't really any functionality added so there are no associated test cases at this time. Reviewers: stefanp, nemanjai, amyk, hfinkel, power-llvm-team, #powerpc Reviewed By: stefanp, nemanjai, amyk, #powerpc Subscribers: NeHuang, steven.zhang, hiraditya, llvm-commits, wuzish, shchenz, cfe-commits, kbarton, echristo Tags: #clang, #powerpc, #llvm Differential Revision: https://reviews.llvm.org/D80020	2020-05-27 13:14:25 -05:00
aartbik	c295a65da4	[mlir] [VectorOps] Add 'vector.flat_transpose' operation Summary: Provides a representation of the linearized LLVM intrinsic. With tests and lowering implementation to LLVM IR dialect. Prepares better lowering for 2-D vector.transpose. Reviewers: nicolasvasilache, ftynse, reidtatge, bkramer, dcaballe Reviewed By: ftynse, dcaballe Subscribers: mehdi_amini, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, stephenneuendorffer, Joonsoo, grosul1, frgossen, Kayjukh, jurahul, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80419	2020-05-27 11:09:48 -07:00
Rithik Sharma	eadf295956	[CodeMoverUtils] Use dominator tree level to decide the direction of code motion Summary: Currently isSafeToMoveBefore uses DFS numbering for determining the relative position of instruction and insert point which is not always correct. This PR proposes the use of Dominator Tree depth for the same. If a node is at a higher level than the insert point then it is safe to say that we want to move in the forward direction. Authored By: RithikSharma Reviewer: Whitney, nikic, bmahjour, etiotto, fhahn Reviewed By: Whitney Subscribers: fhahn, hiraditya, llvm-commits Tag: LLVM Differential Revision: https://reviews.llvm.org/D80084	2020-05-27 18:02:06 +00:00
Fangrui Song	b9c6871a95	[Driver] Support -fsanitize=shadow-call-stack and cfi-icall on aarch64_be D80647 did not fix https://bugs.llvm.org/show_bug.cgi?id=46076 This is the fix.	2020-05-27 10:55:05 -07:00
jasonliu	8d9ff23185	[NFC][XCOFF][AIX] Return function entry point symbol with dedicated function Use getFunctionEntryPointSymbol whenever possible to enclose the implementation detail and reduce duplicate logic. Differential Revision: https://reviews.llvm.org/D80402	2020-05-27 17:54:22 +00:00
Matt Arsenault	d37ce53ad3	AMDGPU: Set StackPointerRegisterToSaveRestore This will enable selecting non-entry block allocas. Skip the SP write check in the base isSchedulingBoundary implementation to preserve the previous scheduling behavior and avoid test churn. It's apparently for compile time reasons, but if we were to use this more work would be needed since in some of the failing tests, we seem to incorrectly get hazard nops inserted.	2020-05-27 13:44:05 -04:00



355667 Commits
