llvm-project

Commit Graph

Author	SHA1	Message	Date
Xin Tong	ebfe01c121	[LoopSimplify] Simplify how we compute UniqueExit Summary: Simplify how we compute UniqueExit. Reuse ExitBlockSet. Reviewers: sanjoy, efriedma, hfinkel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30182 llvm-svn: 295751	2017-02-21 19:10:58 +00:00
Xin Tong	a05a6c101d	More comments for getUniqueExitBlocks. NFCI llvm-svn: 295750	2017-02-21 19:08:03 +00:00
Adrian Prantl	11b2d7dad8	Teach the IR verifier to reject conflicting debug info for function arguments. Conflicting debug info for function arguments causes hard-to-debug assertions in the DWARF backend, so the Verifier should reject it. For performance reasons this only checks function arguments from non-inlined debug intrinsics for now. rdar://problem/30520286 llvm-svn: 295749	2017-02-21 19:03:15 +00:00
Geoff Berry	5d534b6a11	[CodeGenPrepare] Sink and duplicate more 'and' instructions. Summary: Rework the code that was sinking/duplicating (icmp and, 0) sequences into blocks where they were being used by conditional branches to form more tbz instructions on AArch64. The new code is more general in that it just looks for 'and's that have all icmp 0's as users, with a target hook used to select which subset of 'and' instructions to consider. This change also enables 'and' sinking for X86, where it is more widely beneficial than on AArch64. The 'and' sinking/duplicating code is moved into the optimizeInst phase of CodeGenPrepare, where it can take advantage of the fact the OptimizeCmpExpression has already sunk/duplicated any icmps into the blocks where they are used. One minor complication from this change is that optimizeLoadExt needed to be updated to always mark 'and's it has determined should be in the same block as their feeding load in the InsertedInsts set to avoid an infinite loop of hoisting and sinking the same 'and'. This change fixes a regression on X86 in the tsan runtime caused by moving GVNHoist to a later place in the optimization pipeline (see PR31382). Reviewers: t.p.northover, qcolombet, MatzeB Subscribers: aemerson, mcrosier, sebpop, llvm-commits Differential Revision: https://reviews.llvm.org/D28813 llvm-svn: 295746	2017-02-21 18:53:14 +00:00
Wei Ding	16289cfcfc	AMDGPU : AMDGPU : Update AMDGPU Trap Handler ABI. Differential Revision: http://reviews.llvm.org/D29913 llvm-svn: 295745	2017-02-21 18:48:01 +00:00
Dmitry Preobrazhensky	e6e205344e	Test commit llvm-svn: 295740	2017-02-21 18:07:07 +00:00
Simon Pilgrim	8eb515d8c4	[X86] EltsFromConsecutiveLoads SDLoc argument should be const&. There appears never to have been a time that the reference was updated. llvm-svn: 295739	2017-02-21 17:42:28 +00:00
Renato Golin	fc1ccec9bb	[RT ARM] Avoid Linux include with a redefinition To avoid depending on kernel headers, we just repeat the single define we need, which is likely never going to change. Patch by Joakim Sindholt <opensource@zhasha.com> llvm-svn: 295738	2017-02-21 17:40:26 +00:00
Vassil Vassilev	59e5a64435	Do not leak OpenedHandles. Reviewed by Vedant Kumar (D30178) llvm-svn: 295737	2017-02-21 17:30:43 +00:00
Simon Pilgrim	5afda30930	[X86][AVX512] Update VPBROADCASTQ test to combine from VPERMQ instead of VPERMI2Q. VPERMI2Q doesn't have shuffle decoding from re-materializable constants. llvm-svn: 295736	2017-02-21 17:04:11 +00:00
Simon Pilgrim	f321ab6dd2	[X86][AVX] Rename shuffle combine tests to show combined shuffle type. NFCI. llvm-svn: 295735	2017-02-21 16:45:31 +00:00
John Brawn	cfd4f9cfec	[ARM] Correct SP/PC handling in t2MOVr Add a missing test that I forgot to svn add in my previous commit llvm-svn: 295734	2017-02-21 16:45:04 +00:00
Simon Pilgrim	791955819c	[X86][AVX2] Fix VPBROADCASTQ folding on 32-bit targets. As i64 isn't a value type on 32-bit targets, we need to fold the VZEXT_LOAD into VPBROADCASTQ. llvm-svn: 295733	2017-02-21 16:41:44 +00:00
John Brawn	a6e95e1652	[ARM] Correct SP/PC handling in t2MOVr PC isn't allowed in the source operand of t2MOVr, so change the register class to one without PC. SP handling is slightly trickier and changes depending on if we're in ARMv8, so do that in checkTargetMatchPredicate. Differential Revision: https://reviews.llvm.org/D30199 llvm-svn: 295732	2017-02-21 16:41:29 +00:00
Simon Pilgrim	f98a32fa7f	[X86][AVX2] Add AVX512 test targets to AVX2 shuffle combines. llvm-svn: 295731	2017-02-21 16:29:28 +00:00
Etienne Bergeron	fc68c2c777	[compiler-rt][asan] Add support for desallocation of unhandled pointers Summary: On windows 10, the ucrt DLL is performing allocations before the function hooking and there are multiple allocations not handled by Asan. When a free occur at the end of the process, asan is reporting desallocations not malloc-ed. Reviewers: rnk, kcc Reviewed By: rnk, kcc Subscribers: kcc, llvm-commits, kubamracek, chrisha, dberris Differential Revision: https://reviews.llvm.org/D25946 llvm-svn: 295730	2017-02-21 16:09:38 +00:00
Simon Pilgrim	4cc6dd0cf6	[X86][AVX] Add tests showing missed VPBROADCASTQ folding on 32-bit targets. As i64 isn't a value type on 32-bit targets, we fail to fold the VZEXT_LOAD into VPBROADCASTQ. Also shows that we're not decoding VPERMIV3 shuffles very well.... llvm-svn: 295729	2017-02-21 16:05:35 +00:00
Simon Dardis	df827a7165	[mips] Define macros related to -mabicalls in the preprocessor Summary: Historically, NetBSD, FreeBSD and OpenBSD have defined the macro ABICALLS in the preprocessor when -mabicalls is in effect. Mainline GCC later defined __mips_abicalls when -mabicalls is in effect. This patch teaches the preprocessor to define these macros when appropriate. NetBSD does not require the ABICALLS macro. This resolves PR/31694. Thanks to Sean Bruno for highlighting this issue! Reviewers: slthakur, seanbruno Reviewed By: seanbruno Subscribers: joerg, brad, emaste, seanbruno, cfe-commits Differential Revision: https://reviews.llvm.org/D29032 llvm-svn: 295728	2017-02-21 16:01:00 +00:00
George Rimar	78ef645f94	[ELF] - Do not segfault when using --gc-sections with linker script Patch fixes PR32024. Sections that were not marked as Live has null output section. Previously we tried to access that field and segfaulted. Differential revision: https://reviews.llvm.org/D30188 llvm-svn: 295727	2017-02-21 15:46:43 +00:00
Tobias Grosser	cc43087afc	[DependenceInfo] Simplify creation and subsequent use of AccessSchedule [NFC] We only ever use the wrapped domain of AccessSchedule, so stop creating an entire union_map and then pulling the domain out. Reviewers: grosser Tags: #polly Contributed-by: Siddharth Bhat <siddu.druid@gmail.com> Differential Revision: https://reviews.llvm.org/D30179 llvm-svn: 295726	2017-02-21 15:38:31 +00:00
Ed Schouten	c16bc13511	Add a test for the feature introduced in r295240. r295240 tweaked LLD to generate a symbol table when passing in --export-dynamic, even when creating static executables. Add a test to make sure this never regresses. Reviewed by: ruiu, rafael Differential Revision: https://reviews.llvm.org/D30175 llvm-svn: 295725	2017-02-21 15:34:41 +00:00
George Rimar	6d8957b979	[ELF] - Shortify at-addr.s testcase. llvm-svn: 295724	2017-02-21 15:10:30 +00:00
Simon Pilgrim	3546156122	[X86][SSE] Prefer to combine shuffles to VZEXT over VZEXT_MOVL. This matches what is already done during shuffle lowering and helps prevent the need for a zero-vector in cases where shuffles match both patterns. llvm-svn: 295723	2017-02-21 15:09:00 +00:00
George Rimar	ae4761c186	[ELF] - Postpone evaluation of LMA offset. Previously we evaluated the values of LMA incorrectly for next cases: .text : AT(ADDR(.text) - 0xffffffff80000000) { ... } .data : AT(ADDR(.data) - 0xffffffff80000000) { ... } .init.begin : AT(ADDR(.init.begin) - 0xffffffff80000000) { ... } Reason was that we evaluated offset when VA was not assigned. For case above we ended up with 3 loads that has similar LMA and it was incorrect. That is critical for linux kernel. Patch updates the offset after VA calculation. That fixes the issue. Differential revision: https://reviews.llvm.org/D30163 llvm-svn: 295722	2017-02-21 15:08:18 +00:00
Simon Pilgrim	0c094f504c	[X86][SSE] Added SSE41 shuffle combining test file. Currently just contains one case where we combine to VZEXT_MOVL instead of VZEXT which would avoid the need for a zero vector to be generated llvm-svn: 295721	2017-02-21 14:51:15 +00:00
George Rimar	2ee2d2dcb5	[ELF] - Improve diagnostic messages for move location counter errors. Previously LLD would error out just "ld.lld: error: unable to move location counter backward" What does not really reveal the place of issue, Patch adds location to the output. Differential revision: https://reviews.llvm.org/D30187 llvm-svn: 295720	2017-02-21 14:50:38 +00:00
Anna Thomas	ec36f3b79a	[InstCombine] Do not exercise nested max/min pattern on abs Summary: This is a fix for assertion failure in `getInverseMinMaxSelectPattern` when ABS is passed in as a select pattern. We should not be invoking the simplification rule for ABS(MIN(~ x,y))) or ABS(MAX(~x,y)) combinations. Added a test case which would cause an assertion failure without the patch. Reviewers: sanjoy, majnemer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30051 llvm-svn: 295719	2017-02-21 14:40:28 +00:00
Igor Breger	812f319794	[AVX512] Fix EXTRACT_VECTOR_ELT for v2i1/v4i1/v32i1/v64i1 with variable index. Differential Revision: https://reviews.llvm.org/D30189 llvm-svn: 295718	2017-02-21 14:01:25 +00:00
Alexey Bataev	64da79424e	[SLP] Tests for shuffle/blending operations. llvm-svn: 295717	2017-02-21 13:40:55 +00:00
Diana Picus	613b65696a	[ARM] GlobalISel: Lower calls to void() functions For now, we hardcode a BLX instruction, and generate an ADJCALLSTACKDOWN/UP pair with amount 0. llvm-svn: 295716	2017-02-21 11:33:59 +00:00
Benjamin Kramer	ba5df6dea5	[clang-tidy] Reword the "code outside header guard" warning. The check doesn't really know if the code it is warning about came before or after the header guard, so phrase it more neutral instead of complaining about code before the header guard. The location for the warning is still not optimal, but I don't think fixing that is worth the effort, the preprocessor doesn't give us a better location. Differential Revision: https://reviews.llvm.org/D30191 llvm-svn: 295715	2017-02-21 11:25:45 +00:00
Krasimir Georgiev	4b15922838	[clang-format] Remove unused member variables from BreakableToken llvm-svn: 295714	2017-02-21 10:54:50 +00:00
Michael Kruse	9e52c39f0a	[DeLICM] Map values hoisted by LICM back to the array. Implement the -polly-delicm pass. The pass intends to undo the effects of LoopInvariantCodeMotion (LICM) which adds additional scalar dependencies into SCoPs. DeLICM will try to map those scalars back to the array elements they were promoted from, as long as the array element is unused. The is the main patch from the DeLICM/DePRE patch series. It does not yet undo GVN PRE for which additional information about known values is needed and does not handle PHI write accesses that have have no target. As such its usefulness is limited. Patches for these issues including regression tests for error situatons will follow. Reviewers: grosser Differential Revision: https://reviews.llvm.org/D24716 llvm-svn: 295713	2017-02-21 10:20:54 +00:00
Pavel Labath	ba95a28c18	Log: Fix race in accessing the stream variable Summary: The code was attempting to copy the shared pointer member in order to guarantee atomicity, but this is not enough. Instead, protect the pointer with a proper read-write mutex. This bug was present here for a long time, but my recent refactors must have altered the timings slightly, such that now this fails fairly often when running the tests: the test runner runs the "log disable" command just as the thread monitoring the lldb-server child is about to report that the server has exited. I add a test case for this. It's not possible to reproduce the race deterministically in normal circumstances, but I have verified that before the fix, the test failed when run under tsan, and was running fine afterwards. Reviewers: clayborg, zturner Subscribers: lldb-commits Differential Revision: https://reviews.llvm.org/D30168 llvm-svn: 295712	2017-02-21 09:58:23 +00:00
Pavel Labath	52a82e2ec6	tablegen: Fix android build use llvm::to_string instead of std:: version. llvm-svn: 295711	2017-02-21 09:19:41 +00:00
Richard Smith	0cd9c0491e	Fix lookup through injected-class-names in implicit deduction guides in the case where the class template has a parameter pack. Checking of the template arguments expects an "as-written" template argument list, which in particular does not have any parameter packs. So flatten the packs into separate arguments before passing them in. llvm-svn: 295710	2017-02-21 08:42:39 +00:00
Craig Topper	fe78d95a49	[X86] Remove ssse3 intrinsic tests from the avx intrinsics test file. They are all covered by the SSSE3 intrinsics test with SSSE3, AVX, and AVX512 command lines. llvm-svn: 295708	2017-02-21 08:06:08 +00:00
Craig Topper	55e2de869d	[X86] Remove sse4.2 intrinsic tests from the avx intrinsics test file. Fix some other consistency issues. They are all covered by the SSE4.2 intrinsics test with SSE4.2, AVX, and AVX512 command lines. Merge sse42.ll into the other intrinsics test. Rename sse42_64.ll to be named like other intrinsic tests. llvm-svn: 295707	2017-02-21 08:06:05 +00:00
Craig Topper	25191b4ac3	[X86] Remove sse4.1 intrinsic tests from the avx intrinsics test file. They are all covered by the SSE4.1 intrinsics test with SSE4.1, AVX, and AVX512 command lines. llvm-svn: 295706	2017-02-21 08:06:02 +00:00
Craig Topper	da8e6f1337	[X86] Remove sse3 intrinsic tests from the avx intrinsics test file. They are all covered by the SSE3 intrinsics test with SSE2, AVX, and AVX512 command lines. llvm-svn: 295705	2017-02-21 08:05:59 +00:00
Evgeny Stupachenko	9909872e30	The patch introduces new way of narrowing complex (>UINT16 variants) solutions. The new method introduced under "-lsr-exp-narrow" option (currenlty set to true). Summary: The method is based on registers number mathematical expectation and should be generally closer to optimal solution. Please see details in comments to "LSRInstance::NarrowSearchSpaceByDeletingCostlyFormulas()" function (in lib/Transforms/Scalar/LoopStrengthReduce.cpp). Reviewers: qcolombet Differential Revision: http://reviews.llvm.org/D29862 From: Evgeny Stupachenko <evstupac@gmail.com> llvm-svn: 295704	2017-02-21 07:34:40 +00:00
George Rimar	60f1fe8438	[ELF] - Make ASSERT() return Dot instead of evaluated value. Previously ASSERT we implemented returned expression value. Ex: . = ASSERT(0x100); would set Dot value to 0x100 Form of assert when it is assigned to Dot was implemented for compatibility with very old GNU ld which required it. Some scripts in the wild, including linux kernel scripts use such ASSERTs at the end for doing different checks. Currently we fail with "unable to move location counter backward" for such scripts. Patch changes ASSERT to return location counter value to fix that. Differential revision: https://reviews.llvm.org/D30171 llvm-svn: 295703	2017-02-21 07:33:38 +00:00
Craig Topper	002549b8be	[X86] Remove aes intrinsic tests from the avx intrinsics test file. They are all covered by the AES intrinsics test with a legacy command line and an AVX command line. llvm-svn: 295702	2017-02-21 07:32:18 +00:00
Craig Topper	2a71fd95e8	[X86] Add an AVX command line and regenerate AES intrinsics test using the update_llc_test_checks.py llvm-svn: 295701	2017-02-21 07:32:14 +00:00
Craig Topper	dbf6f367e9	[X86] Remove sse2 intrinsic tests from the avx intrinsics test file. They are all covered by the SSE2 intrinsics test with SSE2, AVX, and AVX512 command lines. Also remove an unneeded lfence intrinsic test since it was already covered. llvm-svn: 295700	2017-02-21 07:32:11 +00:00
Craig Topper	0d47fdcf3f	[X86] Remove sse1 intrinsic tests from the avx intrinsics test file. They are all covered by the SSE intrinsics test with SSE, AVX, and AVX512 command lines. Also remove an unneeded sfence intrinsic test since it was already covered. llvm-svn: 295699	2017-02-21 07:32:03 +00:00
Richard Smith	7fa88bb844	When deducing an array bound from the length of an initializer list, don't assume the bound has a non-dependent integral type. llvm-svn: 295698	2017-02-21 07:22:31 +00:00
Craig Topper	d88389aa7e	[X86] Use SHLD with both inputs from the same register to implement rotate on Sandy Bridge and later Intel CPUs Summary: Sandy Bridge and later CPUs have better throughput using a SHLD to implement rotate versus the normal rotate instructions. Additionally it saves one uop and avoids a partial flag update dependency. This patch implements this change on any Sandy Bridge or later processor without BMI2 instructions. With BMI2 we will use RORX as we currently do. Reviewers: zvi Reviewed By: zvi Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30181 llvm-svn: 295697	2017-02-21 06:39:13 +00:00
Richard Smith	b4f9625a7b	PR32010: Fix template argument depth mixup when forming implicit constructor template deduction guides for class template argument deduction. Ensure that we have a local instantiation scope for tracking the instantiated parameters. Additionally, unusually, we're substituting at depth 1 and leaving depth 0 alone; make sure that we don't reduce template parameter depth by 2 for inner parameters in the process. (This is probably also broken for alias templates in the case where they're expanded within a dependent context, but this patch doesn't fix that.) llvm-svn: 295696	2017-02-21 06:30:38 +00:00
Craig Topper	16d9730b86	[X86] Fix formatting. NFC llvm-svn: 295695	2017-02-21 06:27:13 +00:00

1 2 3 4 5 ...

255383 Commits All Branches Search

255383 Commits

All Branches