llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	3243b21dae	[X86][SSE] Regenerated vector float tests - fabs / floor(etc.) / fneg / float2double llvm-svn: 265186	2016-04-01 21:30:48 +00:00
Simon Pilgrim	1e5bf0a256	[X86][SSE] Vector i64 load tests llvm-svn: 265185	2016-04-01 21:06:17 +00:00
Simon Pilgrim	275b2bcb76	[X86][SSE] Regenerated comparison mask and float immediate tests llvm-svn: 265184	2016-04-01 21:00:00 +00:00
Simon Pilgrim	a372a0f295	[X86][SSE] Regenerated the vec_extract tests. llvm-svn: 265183	2016-04-01 20:55:19 +00:00
Simon Pilgrim	f739d8a2ed	[X86][SSE] Regenerated the vec_insert tests. llvm-svn: 265179	2016-04-01 19:42:23 +00:00
Simon Pilgrim	6121118663	[X86][SSE] Regenerated vec_partial tests. llvm-svn: 265173	2016-04-01 18:30:29 +00:00
Sanjay Patel	9b5b5c82ca	[x86] add an SSE2 + fast-unaligned accesses run for memset nonzero tests Was there really no other way to splat a byte in SSE2? punpcklbw {{.*#+}} xmm0 = xmm0[0,0,1,1,2,2,3,3,4,4,5,5,6,6,7,7] pshuflw {{.*#+}} xmm0 = xmm0[0,0,0,0,4,5,6,7] pshufd {{.*#+}} xmm0 = xmm0[0,0,1,1] llvm-svn: 265172	2016-04-01 18:29:25 +00:00
Simon Pilgrim	858194640e	[X86][SSE] Regenerated vec_logical tests. llvm-svn: 265171	2016-04-01 18:28:23 +00:00
Tom Stellard	354a43c7bc	AMDGPU: Implement {BUFFER,FLAT}_ATOMIC_CMPSWAP{,_X2} Summary: Implement BUFFER_ATOMIC_CMPSWAP{,_X2} instructions on all GCN targets, and FLAT_ATOMIC_CMPSWAP{,_X2} on CI+. 32-bit instruction variants tested manually on Kabini and Bonaire. Tests and parts of code provided by Jan Veselý. Patch by: Vedran Miletić Reviewers: arsenm, tstellarAMD, nhaehnle Subscribers: jvesely, scchan, kanarayan, arsenm Differential Revision: http://reviews.llvm.org/D17280 llvm-svn: 265170	2016-04-01 18:27:37 +00:00
Simon Pilgrim	1b14082488	[X86][SSE] Regenerated vector sdiv to shifts tests Added SSE + AVX1 tests as well as AVX2 llvm-svn: 265169	2016-04-01 18:18:40 +00:00
Mike Aizatsky	01c0f8d8a3	[sancov] save entry block from pruning (it is always full dominator) llvm-svn: 265168	2016-04-01 18:13:19 +00:00
Sanjay Patel	d3e3d48cb9	[x86] add an SSE1 run for these tests Note however that this is identical to the existing SSE2 run. What we really want is yet another run for an SSE2 machine that also has fast unaligned 16-byte accesses. llvm-svn: 265167	2016-04-01 18:11:30 +00:00
Simon Pilgrim	b8283631a5	[X86][SSE] Regenerated vec_setcc tests. llvm-svn: 265164	2016-04-01 17:55:02 +00:00
Simon Pilgrim	41e31ff6bd	[X86][SSE] Regenerated the vec_set tests. Replaced lots of dodgy greps with actual codegen llvm-svn: 265163	2016-04-01 17:40:25 +00:00
Sanjay Patel	9f413364d5	[x86] avoid intermediate splat for non-zero memsets (PR27100) Follow-up to http://reviews.llvm.org/D18566 and http://reviews.llvm.org/D18676 - where we noticed that an intermediate splat was being generated for memsets of non-zero chars. That was because we told getMemsetStores() to use a 32-bit vector element type, and it happily obliged by producing that constant using an integer multiply. The 16-byte test that was added in D18566 is now equivalent for AVX1 and AVX2 (no splats, just a vector load), but we have PR27141 to track that splat difference. Note that the SSE1 path is not changed in this patch. That can be a follow-up. This patch should resolve PR27100. llvm-svn: 265161	2016-04-01 17:36:45 +00:00
David Majnemer	fe3f9d1721	[InstCombine] Don't sink an instr after a catchswitch A catchswitch is a terminator, instructions cannot be inserted after it. llvm-svn: 265158	2016-04-01 17:28:17 +00:00
David Majnemer	6f1f85f0e1	[SLPVectorizer] Don't insert an extractelement before a catchswitch A catchswitch cannot be preceded by another instruction in the same basic block (other than a PHI node). Instead, insert the extract element right after the materialization of the vectorized value. This isn't optimal but is a reasonable compromise given the constraints of WinEH. This fixes PR27163. llvm-svn: 265157	2016-04-01 17:28:15 +00:00
Sanjay Patel	a05e0ff223	[x86] avoid intermediate splat for non-zero memsets (PR27100) Follow-up to D18566 - where we noticed that an intermediate splat was being generated for memsets of non-zero chars. That was because we told getMemsetStores() to use a 32-bit vector element type, and it happily obliged by producing that constant using an integer multiply. The tests that were added in the last patch are now equivalent for AVX1 and AVX2 (no splats, just a vector load), but we have PR27141 to track that splat difference. In the new tests, the splat via shuffling looks ok to me, but there might be some room for improvement depending on uarch there. Note that the SSE1/2 paths are not changed in this patch. That can be a follow-up. This patch should resolve PR27100. Differential Revision: http://reviews.llvm.org/D18676 llvm-svn: 265148	2016-04-01 16:27:14 +00:00
Vedant Kumar	7784171721	[PGOProfile] Rename a test to make it more reusable, NFC llvm-svn: 265144	2016-04-01 15:45:33 +00:00
Valery Pykhtin	5b3559c1ec	[AMDGPU] fix MADAK/MADMK instructions operand namings to match encoding fields. $vsrc1 -> $src1, $k -> $imm Differential Revision: http://reviews.llvm.org/D18659 llvm-svn: 265141	2016-04-01 13:13:12 +00:00
Simon Pilgrim	7ec092d0f8	[X86][AVX512] Regenerated intrinsics tests llvm-svn: 265135	2016-04-01 11:57:51 +00:00
Sagar Thakur	48973d21e1	[MIPS][LLVM-MC] Fix JR encoding for MIPSR6 ISA Summary: The assembler was picking the wrong JR variant because the pre-R6 one was still enabled at R6. Author: nitesh.jain Reviewers: vkalintiris, dsanders Subscribers: dsanders, llvm-commits, mohit.bhakkad, sagar, bhushan, jaydeep Differential: D18387 llvm-svn: 265134	2016-04-01 11:55:33 +00:00
Andrey Turetskiy	958eb46443	[X86] Introduce Lakemont CPU. Add a new Intel MCU CPU Lakemont, which doesn't support X87. Differential Revision: http://reviews.llvm.org/D18650 llvm-svn: 265128	2016-04-01 10:16:15 +00:00
James Molloy	b876c72bcc	Fix for pr24346: arm asm label calculation error in sub Some ARM instructions encode 32-bit immediates as a 8-bit integer (0-255) and a 4-bit rotation (0-30, even) in its least significant 12 bits. The original fixup, FK_Data_4, patches the instruction by the value bit-to-bit, regardless of the encoding. For example, assuming the label L1 and L2 are 0x0 and 0x104 respectively, the following instruction: add r0, r0, #(L2 - L1) ; expects 0x104, i.e., 260 would be assembled to the following, which adds 1 to r0, instead of 260: e2800104 add r0, r0, #4, 2 ; equivalently 1 The new fixup kind fixup_arm_mod_imm takes care of the encoding: e2800f41 add r0, r0, #260 Patch by Ting-Yuan Huang! llvm-svn: 265122	2016-04-01 09:40:47 +00:00
Oliver Stannard	a5520b02a5	[AArch64] Better errors for out-of-range fixups When a fixup that can be resolved by the assembler is out of range, we should report an error in the source, rather than crashing. Differential Revision: http://reviews.llvm.org/D18402 llvm-svn: 265120	2016-04-01 09:14:50 +00:00
Jeroen Ketema	c110fbc213	[OCaml] Reinstate data_layout Expose LLVMCreateTargetMachineData as data_layout. As r263530 did for go. From that commit: "LLVMGetTargetDataLayout was removed from the C API, and then TargetMachine.TargetData was removed. Later, LLVMCreateTargetMachineData was added to the C API" Differential Revision: http://reviews.llvm.org/D18677 llvm-svn: 265115	2016-04-01 07:54:24 +00:00
Mehdi Amini	d7ad221c16	Add a module Hash in the bitcode and the combined index, implementing a kind of "build-id" This is intended to be used for ThinLTO incremental build. Differential Revision: http://reviews.llvm.org/D18213 This is a recommit of r265095 after fixing the Windows issues. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265111	2016-04-01 05:33:11 +00:00
Sean Silva	3de5ef96c9	Improve CHECK-NOT robustness of dllexport tests This changes some dllexport tests, to verify that some symbols that should not be exported are not, in a way that improves the robustness of CHECK-SAME interaction with CHECK-NOT. We plan to enable dllimport/dllexport support for the PS4, and these changes are for points we noticed in our internal testing. Patch by Warren Ristow! llvm-svn: 265106	2016-04-01 03:54:03 +00:00
Mehdi Amini	85fb9e058e	Revert "Add support for computing SHA1 in LLVM" This reverts commit r265096, r265095, and r265094. Windows build is broken, and the validation does not pass. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265102	2016-04-01 03:03:21 +00:00
Sanjoy Das	f83ab6de56	Don't insert stackrestore on deoptimizing returns They're not necessary (since the stack pointer is trivially restored on return), and the way LLVM inserts the stackrestore calls breaks the IR (we get a stackrestore between the deoptimize call and the return). llvm-svn: 265101	2016-04-01 02:51:30 +00:00
Sanjoy Das	18b92968ea	Don't insert lifetime end markers on deoptimizing returns They're not necessary (since the lifetime of the alloca is trivially over due to the return), and the way LLVM inserts the lifetime.end markers breaks the IR (we get a lifetime end marker between the deoptimize call and the return). llvm-svn: 265100	2016-04-01 02:51:26 +00:00
Sanjoy Das	9d41a8f269	Don't use an i64 return type with webkit_jscc Re-enable an assertion enabled by Justin Lebar in rL265092. rL265092 was breaking test/CodeGen/X86/deopt-intrinsic.ll because webkit_jscc does not like non-i64 return types. Change the test case to not do that. llvm-svn: 265099	2016-04-01 02:51:21 +00:00
Chuang-Yu Cheng	35c6181982	Fix Sub-register Rewriting in Aggressive Anti-Dependence Breaker Previously, HandleLastUse would delete RegRef information for sub-registers if they were dead even if their corresponding super-register were still live. If the super-register were later renamed, then the definitions of the sub-register would not be updated appropriately. This patch alters the behavior so that RegInfo information for sub-registers is only deleted when the sub-register and super-register are both dead. This resolves PR26775. This is the mirror image of Hal's r227311 commit. Author: Tom Jablin (tjablin) Reviewers: kbarton uweigand nemanjai hfinkel http://reviews.llvm.org/D18448 llvm-svn: 265097	2016-04-01 02:05:29 +00:00
Mehdi Amini	4ea9e9c9bb	Add missing test for the "Module hash in bitcode" added in r265095 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265096	2016-04-01 01:37:52 +00:00
Mehdi Amini	4c2ed3337d	Add a module Hash in the bitcode and the combined index, implementing a kind of "build-id" This is intended to be used for ThinLTO incremental build. Differential Revision: http://reviews.llvm.org/D18213 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 265095	2016-04-01 01:30:06 +00:00
Justin Lebar	efcc81cbb4	[NVPTX] Read __CUDA_FTZ from module flags in NVVMReflect. Summary: Previously the NVVMReflect pass would read its configuration from command-line flags or a static configuration given to the pass at instantiation time. This doesn't quite work for clang's use-case. It needs to pass a value for __CUDA_FTZ down on a per-module basis. We use a module flag for this, so the NVVMReflect pass needs to be updated to read said flag. Reviewers: tra, rnk Subscribers: cfe-commits, jholewinski Differential Revision: http://reviews.llvm.org/D18672 llvm-svn: 265090	2016-04-01 01:09:07 +00:00
Akira Hatanaka	e9148dd62f	[LoopVectorize] Don't unconditionally print vectorization diagnostics when compiling with LTO. r244523 added a new class DiagnosticInfoOptimizationRemarkAnalysisAliasing for optimization analysis remarks related to pointer aliasing without guarding it in isDiagnosticEnabled in LLVMContext.cpp. This caused the diagnostic message to be printed unconditionally when compiling with LTO. This commit cleans up isDiagnosticEnabled and makes sure all the vectorization optimization remarks are guarded. rdar://problem/25382153 llvm-svn: 265084	2016-04-01 00:34:39 +00:00
Adrian Prantl	b8089516a5	testcase gardening: update the emissionKind enum to the new syntax. (NFC) llvm-svn: 265081	2016-04-01 00:16:49 +00:00
Adrian Prantl	b939a25707	Move the DebugEmissionKind enum from DIBuilder into DICompileUnit. This mostly cosmetic patch moves the DebugEmissionKind enum from DIBuilder into DICompileUnit. DIBuilder is not the right place for this enum to live in — a metadata consumer should not have to include DIBuilder.h. I also added a Verifier check that checks that the emission kind of a DICompileUnit is actually legal. http://reviews.llvm.org/D18612 <rdar://problem/25427165> llvm-svn: 265077	2016-03-31 23:56:58 +00:00
Peter Collingbourne	acff7d4a25	Create thin archive in GNU format to fix test on OS X. llvm-svn: 265069	2016-03-31 23:07:50 +00:00
Tim Shen	2e24d0c0c1	Move asm-printer-topological-order.ll to PowerPC backend llvm-svn: 265067	2016-03-31 22:32:10 +00:00
Peter Collingbourne	f84646cd95	Object: Correctly read thin archives containing absolute paths. Differential Revision: http://reviews.llvm.org/D18666 llvm-svn: 265065	2016-03-31 22:08:31 +00:00
Tim Shen	800ed436e5	[AsmPrinter] Print aliases in topological order Print aliases in topological order, that is, for any alias a = b, b must be printed before a. This is because on some targets (e.g. PowerPC) linker expects aliases in such an order to generate correct TOC information. GCC also prints aliases in topological order. llvm-svn: 265064	2016-03-31 22:08:19 +00:00
Evgeniy Stepanov	f74f091ea6	Preserve blockaddress use edges in the module splitter. "blockaddress" can not apply to an external function. All blockaddress constant uses must belong to the same module as the definition of the target function. llvm-svn: 265061	2016-03-31 21:55:11 +00:00
David Majnemer	ae272d718e	[NVPTX] Infer __nvvm_reflect as nounwind, readnone This patch simply mirrors the attributes we give to @llvm.nvvm.reflect to the __nvvm_reflect libdevice call. This shaves about 30% of the code in libdevice away because of CSE opportunities. It also helps us figure out that libdevice implementations of transcendental functions don't have side-effects. llvm-svn: 265060	2016-03-31 21:29:57 +00:00
Jun Bum Lim	760afcb338	[AArch64] Allow loads with imp-def to be handled in getMemOpBaseRegImmOfsWidth() Summary: This change will allow loads with imp-def to be clustered in machine-scheduler pass. areMemAccessesTriviallyDisjoint() can also handle loads with imp-def. Reviewers: mcrosier, jmolloy, t.p.northover Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D18665 llvm-svn: 265051	2016-03-31 20:53:47 +00:00
Hal Finkel	0f02ccd955	[PowerPC] Cleanup test/CodeGen/PowerPC/qpx-load-splat.ll Removing unnecessary attributes and metadata... llvm-svn: 265049	2016-03-31 20:45:00 +00:00
Sanjay Patel	61e13249d8	[x86] add memset tests to show another potential improvement llvm-svn: 265048	2016-03-31 20:40:32 +00:00
Hal Finkel	fc35391f2b	[PowerPC] Add a late MI-level pass for QPX load/splat simplification Chapter 3 of the QPX manual states that, "Scalar floating-point load instructions, defined in the Power ISA, cause a replication of the source data across all elements of the target register." Thus, if we have a load followed by a QPX splat (from the first lane), the splat is redundant. This adds a late MI-level pass to remove the redundant splats in some of these cases (specifically when both occur in the same basic block). This optimization is scheduled just prior to post-RA scheduling. It can't happen before anything that might replace the load with some already-computed quantity (i.e. store-to-load forwarding). llvm-svn: 265047	2016-03-31 20:39:41 +00:00
Hans Wennborg	132cd62121	Revert r265039 "[X86] Merge adjacent stack adjustments in eliminateCallFramePseudoInstr (PR27140)" I think it might have caused these build breakages: http://lab.llvm.org:8011/builders/clang-x86-win2008-selfhost/builds/7234/steps/build%20stage%202/logs/stdio http://lab.llvm.org:8011/builders/sanitizer-windows/builds/19566/steps/run%20tests/logs/stdio llvm-svn: 265046	2016-03-31 20:27:30 +00:00
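
The message for b876c72bcc (pr24346) describes the ARM immediate form as an 8-bit integer plus a 4-bit rotation field packed into the low 12 bits of the instruction. The arithmetic behind its example can be sketched in Python as below. This is an illustration only, not the code from the patch: the helper names `encode_mod_imm` and `decode_mod_imm` are hypothetical, and the choice of the smallest usable rotation is an assumption.

```python
MASK32 = 0xFFFFFFFF


def rotl32(value, amount):
    """Rotate a 32-bit value left by `amount` bits."""
    amount %= 32
    return ((value << amount) | (value >> (32 - amount))) & MASK32


def rotr32(value, amount):
    """Rotate a 32-bit value right by `amount` bits."""
    return rotl32(value, (32 - amount) % 32)


def encode_mod_imm(value):
    """Hypothetical helper: pack `value` into a 12-bit field.

    Layout assumed from the commit message: bits 11..8 hold rotation / 2,
    bits 7..0 hold the 8-bit integer. Returns None if no even rotation
    makes the value fit in 8 bits.
    """
    value &= MASK32
    for rot in range(0, 32, 2):
        # Rotating left by `rot` undoes the hardware's rotate right.
        imm8 = rotl32(value, rot)
        if imm8 <= 0xFF:
            return ((rot // 2) << 8) | imm8
    return None


def decode_mod_imm(field):
    """Hypothetical helper: expand a 12-bit field to its 32-bit value."""
    imm8 = field & 0xFF
    rot = ((field >> 8) & 0xF) * 2
    return rotr32(imm8, rot)


# Patching the raw difference 0x104 into the field decodes as 4 ror 2.
print(decode_mod_imm(0x104))
# Encoding 260 properly yields the field seen in e2800f41.
print(hex(encode_mod_imm(260)))
```

Reading the raw value 0x104 as a rotation field of 1 and an integer of 4 gives 4 rotated right by 2, which is 1. This matches the wrong `add r0, r0, #4, 2` in the message, while the encoded field 0xf41 matches the corrected `e2800f41`.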


35377 Commits