llvm-project

Commit Graph

Author	SHA1	Message	Date
Alex Bradbury	13ce95b77f	[RISCV] Bugfix createRISCVELFObjectWriter r315275 set the IsLittleEndian parameter incorrectly. This patch corrects this, and adds a test to ensure such mistakes will be caught in the future. llvm-svn: 316091	2017-10-18 16:11:31 +00:00
Justin Bogner	2ac32cc9ce	AArch64/GISel: Fix a couple of tests that were testing the wrong thing Fix a couple of tests that were extending the wrong vreg, and regenerate their checks with update_mir_test_checks. This looks like it was a copy-paste or test update error. llvm-svn: 316087	2017-10-18 15:34:33 +00:00
Andre Vieira	d4a25707f0	[ARM] Fix disassembly for conditional VMRS and VMSR instructions in ARM mode Differential Revision: https://reviews.llvm.org/D38347 llvm-svn: 316085	2017-10-18 14:47:37 +00:00
Simon Dardis	03c2c65b2d	[mips] Fix analyzeBranch to handle debug data In the case where there was a conditional branch followed by a unconditional branch with debug instruction separating them, MipsInstrInfo::analyzeBranch would not skip past debug instruction when searching for the second branch which give erroneous results about the control flow of the block. This could lead to the branch folder to merge the non-fall through case into it's predecessor, leaving the conditional branch with a dangling basic block operand. This resolves PR34975. Thanks to Alexander Richardson for reporting the issue! Reviewers: atanasyan Differential Revision: https://reviews.llvm.org/D39003 llvm-svn: 316084	2017-10-18 14:35:29 +00:00
Simon Dardis	77bf0fd59c	[mips] Move test to correct directory. NFCI llvm-svn: 316081	2017-10-18 13:59:48 +00:00
Michael Zuckerman	7ba046c784	Adding new test for bug fix 316067 https://bugs.llvm.org/show_bug.cgi?id=34978 This test checks that the x86-interleaved ends without any assertion. Change-Id: I1e970482a4d0404516cbc85517fc091bb21c35a8 llvm-svn: 316080	2017-10-18 13:51:31 +00:00
Michael Zuckerman	49293264cc	[AVX512][AVX2]Cost calculation for interleave load/store patterns {v8i8,v16i8,v32i8,v64i8} This patch adds accurate instructions cost. The formula presents two cases(stride 3 and stride 4) and calculates the cost according to the VF and stride. Reviewers: 1. delena 2. Farhana 3. zvi 4. dorit 5. Ayal Differential Revision: https://reviews.llvm.org/D38762 Change-Id: If4cfbd4ac0e63694e8144cb78c7fa34850647ff7 llvm-svn: 316072	2017-10-18 11:41:55 +00:00
Hiroshi Inoue	5388e66d3a	[PowerPC] Use helper functions to check sign-/zero-extended value Helper functions to identify sign- and zero-extending machine instruction is introduced in rL315888. This patch makes PPCInstrInfo::optimizeCompareInstr use the helper functions. It simplifies the code and also makes possible more optimizations since the helper can do more analysis than the original check code; I observed about 5000 more compare instructions are eliminated while building LLVM. Also, this patch fixes a bug in helpers on ANDIo instruction handling due to the order of checks. This bug causes a failure in an existing test case for optimizeCompareInstr. Differential Revision: https://reviews.llvm.org/D38988 llvm-svn: 316071	2017-10-18 10:31:19 +00:00
Nikolai Bozhenov	74c047eabb	Improve lookThroughCast function. Summary: When we have the following case: %cond = cmp iN %x, CmpConst %tr = trunc iN %x to iK %narrowsel = select i1 %cond, iK %t, iK C We could possibly match only min/max pattern after looking through cast. So it is more profitable if widened C constant will be equal CmpConst. That is why just set widened C constant equal to CmpConst, because there is a further check in this function that trunc CmpConst == C. Also description for lookTroughCast function was added. Reviewers: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38536 Patch by: Artur Gainullin <artur.gainullin@intel.com> llvm-svn: 316070	2017-10-18 09:28:09 +00:00
Jatin Bhateja	1fc49627e4	[ScalarEvolution] Handling for ICmp occuring in the evolution chain. Summary: If a compare instruction is same or inverse of the compare in the branch of the loop latch, then return a constant evolution node. Currently scope of evaluation is limited to SCEV computation for PHI nodes. This shall facilitate computations of loop exit counts in cases where compare appears in the evolution chain of induction variables. Will fix PR 34538 Reviewers: sanjoy, hfinkel, junryoungju Reviewed By: junryoungju Subscribers: javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D38494 llvm-svn: 316054	2017-10-18 01:36:16 +00:00
Adrian Prantl	5a82f0a470	Verifier: Ignore CUs pulled in by ODR-uniqued types. When more than one Module is imported into the same context, such as during an LTO build before linking the modules, ODR type uniquing may cause types to point to a different CU. This check does not make sense in this case. This fixes the error reported in PR34944. https://bugs.llvm.org/show_bug.cgi?id=34944 rdar://problem/34940685 This reapplies a cleaner implementation of r316049. llvm-svn: 316052	2017-10-18 01:11:01 +00:00
Adrian Prantl	fe8226fd94	Revert "Verifier: Ignore CUs pulled in by ODR-uniqued types." This reverts commit r316049. llvm-svn: 316050	2017-10-18 00:54:31 +00:00
Adrian Prantl	f9a1cf6dcc	Verifier: Ignore CUs pulled in by ODR-uniqued types. When more than one Module is imported into the same context, such as during an LTO build before linking the modules, ODR type uniquing may cause types to point to a different CU. This check does not make sense in this case. This fixes the error reported in PR34944. https://bugs.llvm.org/show_bug.cgi?id=34944 rdar://problem/34940685 llvm-svn: 316049	2017-10-18 00:49:31 +00:00
Wei Ding	7ab1f7a421	AMDGPU : Fix an error for the llvm.cttz implementation. Differential Revision: http://reviews.llvm.org/D39014 llvm-svn: 316037	2017-10-17 21:49:52 +00:00
Tim Northover	350a87eaf1	AArch64: account for possible frame index operand in compares. If the address of a local is used in a comparison, AArch64 can fold the address-calculation into the comparison via "adds". Unfortunately, a couple of places (both hit in this one test) are not ready to deal with that yet and just assume the first source operand is a register. llvm-svn: 316035	2017-10-17 21:43:52 +00:00
Simon Pilgrim	7cd4e2c96f	[X86][SSE] Tests packuswb/truncation codegen from PR34773 llvm-svn: 316033	2017-10-17 21:14:53 +00:00
Konstantin Zhuravlyov	7dabe9ced7	AMDGPU: Start generating metadata for MaxFlatWorkGroupSize Differential Revision: https://reviews.llvm.org/D38958 llvm-svn: 316024	2017-10-17 20:03:21 +00:00
Sanjay Patel	94c0eb031c	[ARM, AArch64] adjust tests trying to maintain their objective; NFC A smarter compiler will see that these might be better without a jump table if we're just using the constant values of the switch. llvm-svn: 316012	2017-10-17 16:54:56 +00:00
Sanjay Patel	6d172f2d72	[SimplifyCFG] add test for part of PR34471 (switch squashing); NFC llvm-svn: 316008	2017-10-17 15:56:42 +00:00
Sanjay Patel	6ed5c91422	[SimplifyCFG] update test to use auto-generated FileCheck asserts; NFC llvm-svn: 316006	2017-10-17 15:50:47 +00:00
Gadi Haber	85d99b4310	[X86][Broadwell] Added the broadwell cpu to the scheduling regression tests.<NFC> NFC. Added the Broadwell cpu and the BROADWELL prefix to all the scheduling regression tests, as part of prepartion for a larger commit of adding all Broadwell scheduiling. Reviewers: RKSimon, zvi, aaboud Differential Revision: https://reviews.llvm.org/D38994 Change-Id: I54bc9065168844c107b1729fcdc1d311ce3ea0a9 llvm-svn: 315998	2017-10-17 13:45:39 +00:00
Nikolai Bozhenov	346f4329c4	Improve clamp recognition in ValueTracking. Summary: ValueTracking was recognizing not all variations of clamp. Swapping of true value and false value of select was added to fix this problem. This change breaks the canonical form of cmp inside the matchMinMax function, that is why additional checks for compare predicates is needed. Added corresponding test cases. Reviewers: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38531 Patch by: Artur Gainullin <artur.gainullin@intel.com> llvm-svn: 315992	2017-10-17 11:50:48 +00:00
Yichao Yu	a18b0b1817	Fix implicit null check with negative offset Summary: It seems that negative offset was accidentally allowed in D17967. AFAICT small negative offset should be valid (always raise segfault) on all archs that I'm aware of (especially x86, which is the only one with this optimization enabled) and such case can be useful when loading hiden metadata from an object. However, like the positive side, it should only be done within a certain limit. For now, use the same limit on the positive side for the negative side. A separate option can be added if needs appear. Reviewers: mcrosier, skatkov Reviewed By: skatkov Subscribers: sanjoy, llvm-commits Differential Revision: https://reviews.llvm.org/D38925 llvm-svn: 315991	2017-10-17 11:47:36 +00:00
Gadi Haber	3020490aac	[X86][Skylake] fixed/updated regression test mmx-schedule.ll which failed after r315978. Change-Id: I60cd7e03ea6c3d9a3dc661a882458e83feca66e3 llvm-svn: 315985	2017-10-17 10:00:08 +00:00
Andrew V. Tischenko	5bfbfb48b5	More tests with x86 prefixes which work after rL315899 commit llvm-svn: 315983	2017-10-17 08:49:47 +00:00
Gadi Haber	1e0f1f476a	[X86][SKL] Updated scheduling information for the SkylakeClient target Updated the scheduling information for the SkylakeClient target with the following changes: 1. regrouped the instructions after adding load and store latencies. 2. regrouped the instructions after adding identified missing ports in several groups. The changes were made after revisiting the latencies impact of all the load and store uOps. Reviewers: zvi, RKSimon, craig.topper Differential Revision: https://reviews.llvm.org/D38727 Change-Id: I778a308cc11e490e8fa5e27e2047412a1dca029f llvm-svn: 315978	2017-10-17 06:47:04 +00:00
Max Kazantsev	4faa509bb1	Remove a test after revert of rL315440 llvm-svn: 315977	2017-10-17 06:43:31 +00:00
Max Kazantsev	20fc63351d	[NFC] Add test from bug 34937 llvm-svn: 315976	2017-10-17 06:37:58 +00:00
Philip Reames	6a7bbfb2e2	Revert 315440 on behalf of mkazantsev This patch reverts rL315440 because of the bug described at https://bugs.llvm.org/show_bug.cgi?id=34937 The fix for the bug is on review as D38944, but not yet ready. Given this is a regression reverting until a fix is ready is called for. Max would have done the revert himself, but is having trouble doing a build of fresh LLVM for some reason. I did the build and test to ensure the revert worked as expected on his behalf. llvm-svn: 315974	2017-10-17 06:21:07 +00:00
Daniel Sanders	3229217620	[globalisel][tablegen] Add a GIM_CheckIsSameOperand test where OtherInsnID and OtherOpIdx differ llvm-svn: 315972	2017-10-17 05:24:44 +00:00
Craig Topper	341f2ab444	[X86] Add masked palignr tests to vector-shuffle-masked.ll llvm-svn: 315971	2017-10-17 04:17:56 +00:00
Craig Topper	19f2f49ef1	[X86] Add AVX512BW to the vector-shuffle-masked test to prepare for an upcoming commit. llvm-svn: 315970	2017-10-17 04:17:55 +00:00
Shoaib Meenai	a42f60e7f7	[ExecutionEngine] Correct the size of a write in a COFF i386 relocation We want to be writing a 32bit value, so we should be writing 4 bytes instead of 2. Patch by Alex Langford <apl@fb.com>. Differential Revision: https://reviews.llvm.org/D38872 llvm-svn: 315964	2017-10-17 01:41:14 +00:00
Vedant Kumar	4d1969f22b	[llvm-cov] Add one correction to r315960 (PR34962) In r315960, I accidentally assumed that the first line segment is guaranteed to be the non-gap region entry segment (given that one is present). It can actually be any segment on the line, and the test I checked in demonstrates that. llvm-svn: 315963	2017-10-17 01:34:41 +00:00
Reid Kleckner	57b7d4fad7	Try to make crlf portable to other printf implementations llvm-svn: 315961	2017-10-17 00:27:31 +00:00
Vedant Kumar	58548c30da	[llvm-cov] Remove workaround in line execution count calculation (PR34962) Gap areas make it possible to correctly determine when to use counts from deferred regions. Before gap areas were introduced, llvm-cov needed to use a heuristic to do this: it ignored counts from segments that start, but do not end, on a line. This heuristic breaks down on a simple example (see PR34962). This patch removes the heuristic and picks counts from any region entry segment which isn't a gap area. llvm-svn: 315960	2017-10-16 23:47:10 +00:00
Mark Searles	4e3d6160db	Use the return value of UpdateNodeOperands(); in some cases, UpdateNodeOperands() modifies the node in-place and using the return value isn’t strictly necessary. However, it does not necessarily modify the node, but may return a resultant node if it already exists in the DAG. See comments in UpdateNodeOperands(). In that case, the return value must be used to avoid such scenarios as an infinite loop (node is assumed to have been updated, so added back to the worklist, and re-processed; however, node hasn’t changed so it is once again passed to UpdateNodeOperands(), assumed modified, added back to worklist; cycle infinitely repeats). Differential Revision: https://reviews.llvm.org/D38466 llvm-svn: 315957	2017-10-16 23:38:53 +00:00
Simon Pilgrim	a590c74549	[X86][AVX] Add v4x64 vector shuffle test for <0,2,1,3> mask llvm-svn: 315955	2017-10-16 23:20:16 +00:00
Quentin Colombet	0bd2825517	Re-apply [AArch64][RegisterBankInfo] Use the statically computed mappings for COPY This reverts commit r315823, thus re-applying r315781. Also make sure we don't use G_BITCAST mapping for non-generic registers. Non-generic registers don't have a type but do have a reg bank. Something the COPY mapping now how to deal with but the G_BITCAST mapping don't. -- Original Commit Message -- We use to resort on the generic implementation to get the mappings for COPYs. The generic implementation resorts on table lookup and dynamically allocated objects to get the valid mappings. Given we already know how to map G_BITCAST and have the static mappings for them, use that code path for COPY as well. This is much more efficient. Improve the compile time of RegBankSelect by up to 20%. Note: When we eventually generate all the mappings via TableGen, we wouldn't have to do that dance to shave compile time. The intent of this change was to make sure that moving to static structure really pays off. NFC. llvm-svn: 315947	2017-10-16 22:28:40 +00:00
Quentin Colombet	9f20af6135	[AArch64][RegisterBankInfo] Add mapping support for G_BITCAST of s128 Anything bigger than 64-bit just map to FPR. llvm-svn: 315946	2017-10-16 22:28:38 +00:00
Quentin Colombet	7c114d3d70	[AArch64][LegalizerInfo] Mark s128 G_BITCAST legal We used to mark all G_BITCAST of 128-bit legal but only for vector types. Scalars of this size are just fine as well. llvm-svn: 315945	2017-10-16 22:28:27 +00:00
Matthew Simpson	36bbc8ce98	Add !callees metadata This patch adds a new kind of metadata that indicates the possible callees of indirect calls. Differential Revision: https://reviews.llvm.org/D37354 llvm-svn: 315944	2017-10-16 22:22:11 +00:00
Reid Kleckner	b0c9e0d647	[MC] Lex CRLF as one token This will prevent doubling of line endings when parsing assembly and emitting assembly. Otherwise we'd parse the directive, consume the end of statement, hit the next end of statement, and emit a fresh newline. llvm-svn: 315943	2017-10-16 22:20:03 +00:00
Simon Pilgrim	03c89a840a	[X86][3DNow] Add scheduling latency/throughput tests for 3DNow! instructions llvm-svn: 315942	2017-10-16 21:55:09 +00:00
Simon Pilgrim	608e1b57cf	[X86][MMX] Add scheduling latency/throughput tests for MMX instructions llvm-svn: 315939	2017-10-16 21:29:29 +00:00
Tony Tye	d288430c3e	Add base relative relocation record that can be used for the following case (OpenCL example): static __global int Var = 0; __global int* Ptr[] = {&Var}; ... In this case Var is a non premptable symbol and so its address can be used as the value of Ptr, with a base relative relocation that will add the delta between the ELF address and the actual load address. Such relocations do not require a symbol. Differential Revision: https://reviews.llvm.org/D38909 llvm-svn: 315935	2017-10-16 20:44:29 +00:00
Alexander Timofeev	9dff31c769	[AMDGPU] : revert r315908 llvm-svn: 315916	2017-10-16 16:57:37 +00:00
Akira Hatanaka	e8c1a54c07	[ObjCARC] Do not move a release that has the clang.imprecise_release tag above PHI instructions. ARC optimizer has an optimization that moves a call to an ObjC runtime function above a phi instruction when the phi has a null operand and is an argument passed to the function call. This optimization should not kick in when the runtime function is an objc_release that releases an object with precise lifetime semantics. rdar://problem/34959669 llvm-svn: 315914	2017-10-16 16:46:59 +00:00
Sanjay Patel	a4b89ed0b7	[x86] add minmax tests with more predicate coverage; NFC llvm-svn: 315913	2017-10-16 15:20:00 +00:00
Alexander Timofeev	3828242c7e	[AMDGPU] Prevent Machine Copy Propagation from replacing live copy with the dead one Differential revision: https://reviews.llvm.org/D38754 llvm-svn: 315908	2017-10-16 14:35:29 +00:00

1 2 3 4 5 ...

48268 Commits