llvm-project

Commit Graph

Author	SHA1	Message	Date
Igor Breger	6fc00b0acf	autogenerate checks llvm-svn: 278624	2016-08-14 09:34:39 +00:00
Igor Breger	8672408db0	[AVX512] Fix insertelement i1 lowering. 1. Use shuffle to insert element i1 into vector. The previous implementation was incorrect ( dest_bit OR src_bit , it doesn't clear the bit if src_bit=0 ) 2. Improve shuffle i1 vector, use CVT2MASK if supported instead TRUNCATE. Differential Revision: http://reviews.llvm.org/D23347 llvm-svn: 278623	2016-08-14 05:25:07 +00:00
Diana Picus	68be1eb885	Revert "CodeGen: If Convert blocks that would form a diamond when tail-merged." This reverts commit r278287. This commit broke the clang-cmake-thumbv7-a15-full-sh bot. See https://llvm.org/bugs/show_bug.cgi?id=28949 llvm-svn: 278621	2016-08-14 02:10:18 +00:00
Diana Picus	35ccf53e75	Revert "Codegen: Don't tail-duplicate blocks with un-analyzable fallthrough." This reverts commit r278288. r278287 broke the clang-cmake-thumbv7-a15-full-sh bot. Revert this so we can get to r278287. llvm-svn: 278620	2016-08-14 02:10:12 +00:00
Ron Lieberman	822ee88ab8	Fix unsupported relocation type R_HEX_6_X' for symbol .rodata LowerTargetConstantPool is not properly setting the TargetFlag to indicate desired relocation. Coding error, the offset parameter was omitted, so the TargetFlag was used as the offset, and the TargetFlag defaulted to zero. This only affects -fpic compilation, and only those items created in a Constant Pool, for example a vector of constants. Halide ran into this issue. llvm-svn: 278614	2016-08-13 23:41:11 +00:00
Mehdi Amini	8c629ecf3a	Revert "Revert "Invariant start/end intrinsics overloaded for address space"" This reverts commit 32fc6488e48eafc0ca1bac1bd9cbf0008224d530. llvm-svn: 278609	2016-08-13 23:31:24 +00:00
Mehdi Amini	164ac651da	Revert "Invariant start/end intrinsics overloaded for address space" This reverts commit r276447. llvm-svn: 278608	2016-08-13 23:27:32 +00:00
Sanjay Patel	08c876673e	[x86] add tests to show missed 64-bit immediate merging Tests are slightly modified versions of those written by Sunita Marathe in D23391. llvm-svn: 278599	2016-08-13 18:42:14 +00:00
Craig Topper	3f8126e6fa	[AVX-512] Remove an AddedComplexity that was prioritizing basic vzmovl patterns over more complex ones that produce better code. llvm-svn: 278593	2016-08-13 05:43:20 +00:00
Craig Topper	600685d510	[AVX-512] Add patterns to support VZEXT_MOVL from 512-bit vectors with 64-bit and 32-bit elements. Fixes PR28961. llvm-svn: 278592	2016-08-13 05:33:12 +00:00
Matt Arsenault	3cc1e0066d	AMDGPU: Fix missing test for addressing mode with odd offsets Add test if the constant offset looks unaligned. llvm-svn: 278589	2016-08-13 01:43:51 +00:00
Dominic Chen	4a9b99ee92	[WebAssembly] Re-enable disabled debug value test Summary: This test was resulting in asan/valgrind failures due to undefined DWARF register mappings for WebAssembly, and was disabled in r278495. These have been resolved. Reviewers: sunfish, dschuff Subscribers: bkramer, llvm-commits, jfb Differential Revision: https://reviews.llvm.org/D23459 llvm-svn: 278576	2016-08-12 23:14:18 +00:00
Haicheng Wu	7c4535d1e7	Reapply [BranchFolding] Restrict tail merging loop blocks after MBP Fixed a bug in the test case. To fix PR28104, this patch restricts tail merging to blocks that belong to the same loop after MBP. llvm-svn: 278575	2016-08-12 23:13:38 +00:00
Artem Belevich	2f0a3dfe64	[NVPTX] Use untyped (.b) integer registers in PTX. This bring LLVM-generated PTX closer to what nvcc generates and avoids triggering issues in ptxas. For instance, ptxas does not accept .s16 (or .u16) registers as operands for .fp16 instructions. Differential Revision: https://reviews.llvm.org/D23460 llvm-svn: 278568	2016-08-12 22:02:19 +00:00
Eli Friedman	f184e4befc	[AArch64LoadStoreOptimizer] Check aliasing correctly when creating paired loads/stores. The existing code accidentally skipped the aliasing check in edge cases. Differential revision: https://reviews.llvm.org/D23372 llvm-svn: 278562	2016-08-12 20:39:51 +00:00
Eli Friedman	8585e9d33d	[AArch64LoadStoreOpt] Handle offsets correctly for post-indexed paired loads. Trunk would try to create something like "stp x9, x8, [x0], #512", which isn't actually a valid instruction. Differential revision: https://reviews.llvm.org/D23368 llvm-svn: 278559	2016-08-12 20:28:02 +00:00
Artur Pilipenko	87e4038a91	[x86] X86ISelLowering zext(add_nuw(x, C)) --> add(zext(x), C_zext) Currently X86ISelLowering has a similar transformation for sexts: sext(add_nsw(x, C)) --> add(sext(x), C_sext) In this change I extend this code to handle zexts as well. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D23359 llvm-svn: 278520	2016-08-12 16:08:30 +00:00
James Y Knight	2cc9da9a65	Revert "[Sparc] Leon errata fix passes." ...and the two followup commits: Revert "[Sparc][Leon] Missed resetting option flags from check-in 278489." Revert "[Sparc][Leon] Errata fixes for various errata in different versions of the Leon variants of the Sparc 32 bit processor." This reverts commit r274856, r278489, and r278492. llvm-svn: 278511	2016-08-12 14:48:09 +00:00
Simon Pilgrim	687d71e877	[X86][SSE] Add support for combining target shuffles to PSLLDQ/PSRLDQ byte shifts llvm-svn: 278502	2016-08-12 11:24:34 +00:00
Benjamin Kramer	05e760ec4b	[Webassembly] disable unstable test. It reads uninitialized memory and crashes randomly. llvm-svn: 278495	2016-08-12 10:13:45 +00:00
Simon Pilgrim	ed96b9adfb	[X86][SSE] Fixed PALIGNR target shuffle decode The PALIGNR target shuffle decode was not taking into account that DecodePALIGNRMask (rather oddly) expects the operands to be in reverse order, nor was it detecting unary patterns, causing combines to combine with the incorrect input. The cgbuiltin, auto upgrade and instruction comments code correctly swap the operands so are not affected. llvm-svn: 278494	2016-08-12 10:10:51 +00:00
Chris Dewhurst	829f8efe55	[Sparc][Leon] Errata fixes for various errata in different versions of the Leon variants of the Sparc 32 bit processor. The nature of the errata are listed in the comments preceding the errata fix passes. Relevant unit tests are implemented for each of these. These changes update older versions of these errata fixes with improvements to code and unit tests. Differential Revision: https://reviews.llvm.org/D21960 llvm-svn: 278489	2016-08-12 09:34:26 +00:00
Haicheng Wu	d9cbb1608f	Revert "[BranchFolding] Restrict tail merging loop blocks after MBP" This reverts commit r278463 because it hits the bot. llvm-svn: 278484	2016-08-12 08:40:24 +00:00
Wei Mi	7e103d92cc	Recommit 'Remove the restriction that MachineSinking is now stopped by "insert_subreg, subreg_to_reg, and reg_sequence" instructions' after adjusting some unittest checks. This is to solve PR28852. The restriction was added at 2010 to make better register coalescing. We assumed that it was not necessary any more. Testing results on x86 supported the assumption. We will look closely to any performance impact it will bring and will be prepared to help analyzing performance problem found on other architectures. Differential Revision: https://reviews.llvm.org/D23210 llvm-svn: 278466	2016-08-12 03:33:22 +00:00
Haicheng Wu	ea02372059	[BranchFolding] Restrict tail merging loop blocks after MBP To fix PR28014, this patch restricts tail merging to blocks that belong to the same loop after MBP. Differential Revision: https://reviews.llvm.org/D23191 llvm-svn: 278463	2016-08-12 03:30:23 +00:00
Vyacheslav Klochkov	6daefcf626	X86-FMA3: Implemented commute transformation for EVEX/AVX512 FMA3 opcodes. This helped to improved memory-folding and register coalescing optimizations. Also, this patch fixed the tracker #17229. Reviewer: Craig Topper. Differential Revision: https://reviews.llvm.org/D23108 llvm-svn: 278431	2016-08-11 22:07:33 +00:00
Tim Northover	8e0c53a018	GlobalISel: support 'null' constant in translation. It's sharing the integer G_CONSTANT for now since I don't think it creates any ambiguity (even on weird archs). If that turns out wrong we can create a G_PTRCONSTANT or something. llvm-svn: 278423	2016-08-11 21:40:55 +00:00
Krzysztof Parzyszek	1b689da04e	[Hexagon] Allow non-returning calls in hardware loops llvm-svn: 278416	2016-08-11 21:14:25 +00:00
Tim Northover	da6f5f2d0a	Remove empty file left by partial reversion. llvm-svn: 278411	2016-08-11 21:01:15 +00:00
Tim Northover	30e67ce793	GlobalISel: add translation support for shift operations. llvm-svn: 278410	2016-08-11 21:01:13 +00:00
Tim Northover	f1f7bf1279	GlobalISel: support zext & sext during translation phase. llvm-svn: 278409	2016-08-11 21:01:10 +00:00
Wei Ding	70cda07526	AMDGPU : Add intrinsic for instruction v_cvt_pk_u8_f32 Differential Revision: http://reviews.llvm.org/D23336 llvm-svn: 278403	2016-08-11 20:34:48 +00:00
Wei Mi	3ab5816000	Revert rL278384 which caused several buildbot failures (like check failures in CodeGen/X86/clz.ll). llvm-svn: 278402	2016-08-11 20:33:37 +00:00
Wei Mi	ec19b35179	Remove the restriction that MachineSinking is now stopped by "insert_subreg, subreg_to_reg, and reg_sequence" instructions. This is to solve PR28852. The restriction was added at 2010 to make better register coalescing. We assumed that it was not necessary any more. Testing results on x86 supported the assumption. We will look closely to any performance impact it will bring and will be prepared to help analyzing performance problem found on other architectures. Differential Revision: https://reviews.llvm.org/D23210 llvm-svn: 278384	2016-08-11 18:42:56 +00:00
Krzysztof Parzyszek	a003b76391	If-conversion incorrectly calculates liveness of redefined registers Differential Revision: https://reviews.llvm.org/D23207 llvm-svn: 278383	2016-08-11 18:42:06 +00:00
Krzysztof Parzyszek	60f0b51485	[Hexagon] Skip byval arguments when checking parameter attributes From the point of view of register assignment, byval parameters are ignored: a byval parameter is not going to be assigned to a register, and it will not affect the assignments of subsequent parameters. When matching registers with parameters in the bit tracker, make sure to skip byval parameters before advancing the registers. llvm-svn: 278375	2016-08-11 18:15:16 +00:00
Dominic Chen	6ba19659cb	Improve virtual register handling when computing debug information Summary: Some backends, like WebAssembly, use virtual registers instead of physical registers. This crashes the DbgValueHistoryCalculator pass, which assumes that all registers are physical. Instead, skip virtual registers when iterating aliases, and assume that they are clobbered. Reviewers: dexonsmith, dschuff, aprantl Subscribers: yurydelendik, llvm-commits, jfb, sunfish Differential Revision: https://reviews.llvm.org/D22590 llvm-svn: 278371	2016-08-11 17:52:40 +00:00
Michael Kuperstein	e36d7716c3	Make TwoAddressInstructionPass::rescheduleMIBelowKill subreg-aware This fixes PR28824. Differential Revision: https://reviews.llvm.org/D23220 llvm-svn: 278370	2016-08-11 17:38:33 +00:00
Matt Arsenault	56684d4538	AMDGPU: Fix crashes on memory functions llvm-svn: 278369	2016-08-11 17:31:42 +00:00
Wei Ding	d3344378c6	AMDGPU : Fix SAD related instruction LIT tests function atttibute issues. Differential Revision: http://reviews.llvm.org/D23133 llvm-svn: 278360	2016-08-11 17:14:17 +00:00
Wei Ding	34e1753585	AMDGPU : Add LLVM intrinsics for SAD related instructions. Differential Revision: http://reviews.llvm.org/D23133 llvm-svn: 278354	2016-08-11 16:33:53 +00:00
Tim Northover	0d51044b69	GlobalISel: clear vreg mapping after translating each function Otherwise we only materialize (shared) constants in the first function they appear in. This doesn't go well. llvm-svn: 278351	2016-08-11 16:21:29 +00:00
Igor Breger	a77b14d02c	[AVX512] Fix extractelement i1 lowering. The previous implementation (not custom) doesn't enforce zeroing off upper bits. The assumption is that i1 PRODUCER (truncate and extractelement) must zero all upper bits, so i1 CONSUMER instructions ( test, zext, save, etc) can be done without additional zeroing. Make extractelement i1 lowering custom for all vector i1. Differential Revision: http://reviews.llvm.org/D23246 llvm-svn: 278328	2016-08-11 12:13:46 +00:00
Marina Yatsina	88f0c31f13	Avoid false dependencies of undef machine operands This patch helps avoid false dependencies on undef registers by updating the machine instructions' undef operand to use a register that the instruction is truly dependent on, or use a register with clearance higher than Pref. Pseudo example: loop: xmm0 = ... xmm1 = vcvtsi2sdl eax, xmm0<undef> ... = inst xmm0 jmp loop In this example, selecting xmm0 as the undef register creates false dependency between loop iterations. This false dependency cannot be solved by inserting an xor before vcvtsi2sdl because xmm0 is alive at the point of the vcvtsi2sdl instruction. Selecting a different register instead of xmm0, especially a register that is not used in the loop, will eliminate this problem. Differential Revision: https://reviews.llvm.org/D22466 llvm-svn: 278321	2016-08-11 07:32:08 +00:00
Craig Topper	a78b768ed4	[AVX-512] Promote 512-bit integer loads to v8i64 similar to what is done for 128/256-bit vectors for overall consistency. llvm-svn: 278318	2016-08-11 06:04:07 +00:00
Craig Topper	14aa2665d3	[AVX-512] Add patterns to allow EVEX encoded stores of v16i16/v8i16/v16i8/v32i8 even when BWI is not supported. llvm-svn: 278317	2016-08-11 06:04:04 +00:00
Craig Topper	3563d0f622	[AVX-512] Fix the 128-bit and 256-bit nontemporal load patterns with elements type other than i64. These loads have all been promoted to v2i64/v4i64 loads so we need bitcasts or we end up selecting VMOVDQA32/VMOVDQU32 instead. llvm-svn: 278316	2016-08-11 06:04:00 +00:00
Tim Northover	357f1be2ca	GlobalISel: support same ConstantExprs as Instructions. It's more than just inttoptr, but the others can't be tested until we have support for non-trivial constants (they currently get unavoidably folded to a ConstantInt). llvm-svn: 278303	2016-08-10 23:02:41 +00:00
Tim Northover	2ff5935a95	GlobalISel: add tests forgotten in r278293. llvm-svn: 278296	2016-08-10 22:13:48 +00:00
Changpeng Fang	fb9c3818dd	AMDGPU/SI: Implement amdgcn image intrinsics with sampler Summary: This patch define and implement amdgcn image intrinsics with sampler. 1. define vdata type to be llvm_anyfloat_ty, address type to be llvm_anyfloat_ty, and rsrc type to be llvm_anyint_ty. As a result, we expect the intrinsics name to have three suffixes to overload each of these three types; 2. D128 as well as two other flags are implied in the three types, for example, if you use v8i32 as resource type, then r128 is 0! 3. don't expose TFE flag, and other flags are exposed in the instruction order: unrm, glc, slc, lwe and da. Differential Revision: http://reviews.llvm.org/D22838 Reviewed by: arsenm and tstellarAMD llvm-svn: 278291	2016-08-10 21:15:30 +00:00

1 2 3 4 5 ...

16983 Commits