llvm-project

Commit Graph

Author	SHA1	Message	Date
Tim Northover	3ef452e572	AArch64 & ARM: disable generic test that relies on no CFG changes. llvm-svn: 209885	2014-05-30 10:56:12 +00:00
Tim Northover	b4ddc0845a	ARM & AArch64: make use of common cmpxchg idioms after expansion The C and C++ semantics for compare_exchange require it to return a bool indicating success. This gets mapped to LLVM IR which follows each cmpxchg with an icmp of the value loaded against the desired value. When lowered to ldxr/stxr loops, this extra comparison is redundant: its results are implicit in the control-flow of the function. This commit makes two changes: it replaces that icmp with appropriate PHI nodes, and then makes sure earlyCSE is called after expansion to actually make use of the opportunities revealed. I've also added -{arm,aarch64}-enable-atomic-tidy options, so that existing fragile tests aren't perturbed too much by the change. Many of them either rely on undef/unreachable too pervasively to be restored to something well-defined (particularly while making sure they test the same obscure assert from many years ago), or depend on a particular CFG shape, which is disrupted by SimplifyCFG. rdar://problem/16227836 llvm-svn: 209883	2014-05-30 10:09:59 +00:00
Tim Northover	ce6538c38d	AArch64 & ARM: remove undefined behaviour from some tests. llvm-svn: 209880	2014-05-30 08:59:55 +00:00
Hao Liu	5c7314b68d	Test cases named with dates is a legacy rule not used now. Rename several test cases. llvm-svn: 209877	2014-05-30 05:58:19 +00:00
Adam Nemet	b3587e98b7	[X86] Move test from r209863 to CodeGen/X86 We should only run this if X86 is in the targets. llvm-svn: 209866	2014-05-29 23:52:53 +00:00
Adam Nemet	35b80eaef1	[X86] Remove AVX1 vbroadcast intrinsics The corresponding CFE patch replaces these intrinsics with vector initializers in avxintrin.h. This patch removes the LLVM intrinsics from the backend. We now stop lowering at X86ISD::VBROADCAST custom node rather than lowering that further to the intrinsics. The patch only changes VBROADCASTS* and leaves VBROADCAST[FI]128 to continue to use intrinsics. As explained in the CFE patch, the reason is that we currently don't generate as good code for them without the intrinsics. CodeGen/X86/avx-vbroadcast.ll already provides coverage for this change. It checks that for a series of insertelements we generate the appropriate vbroadcast instruction. Also verified that there was no assembly change in the test-suite before and after this patch. llvm-svn: 209864	2014-05-29 23:35:36 +00:00
Filipe Cabecinhas	229dc17610	Added tests for shufflevector lowering to blend instrs. These tests ensure that a change I will propose in clang works as expected. Summary: Added tests for the generation of blend+immediate instructions from a shufflevector. These tests were proposed along with a patch that was dropped. I'm committing the tests anyway to protect against possible regressions in codegen. Reviewers: nadav, bkramer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3600 llvm-svn: 209853	2014-05-29 22:04:42 +00:00
Rafael Espindola	04902862a8	[PPC] Use alias symbols in address computation. This seems to match what gcc does for ppc and what every other llvm backend does. This is a fixed version of r209638. The difference is to avoid any change in behavior for functions. The logic for using constant pools for function addresseses is spread over a few places and we have to keep them in sync. llvm-svn: 209821	2014-05-29 15:41:38 +00:00
Rafael Espindola	1613650e90	Add a test showing the ppc code sequence for getting a function pointer. This would have found the miscompile in r209638. llvm-svn: 209820	2014-05-29 15:13:23 +00:00
Hao Liu	d670c7eee0	Rename a test case to contain correct date info. llvm-svn: 209799	2014-05-29 09:21:23 +00:00
Hao Liu	4091450181	Fix an assertion failure caused by v1i64 in DAGCombiner Shrink. llvm-svn: 209798	2014-05-29 09:19:07 +00:00
Michael J. Spencer	f375d80635	[x86] Fold extract_vector_elt of a load into the Load's address computation. An address only use of an extract element of a load can be simplified to a load. Without this the result of the extract element is spilled to the stack so that an address is available. llvm-svn: 209788	2014-05-29 01:42:45 +00:00
Rafael Espindola	59f7eba2b5	[pr19844] Add thread local mode to aliases. This matches gcc's behavior. It also seems natural given that aliases contain other properties that govern how it is accessed (linkage, visibility, dll storage). Clang still has to be updated to expose this feature to C. llvm-svn: 209759	2014-05-28 18:15:43 +00:00
Hal Finkel	2c77fe59d9	Revert "[DAGCombiner] Split up an indexed load if only the base pointer value is live" This reverts r208640 (I've just XFAILed the test) because it broke ppc64/Linux self-hosting. Because nearly every regression test triggers a segfault, I hope this will be easy to fix. llvm-svn: 209747	2014-05-28 15:33:19 +00:00
Hal Finkel	f5c07ada1d	Revert "[PPC] Use alias symbols in address computation." This reverts commit r209638 because it broke self-hosting on ppc64/Linux. (the Clang-compiled TableGen would segfault because it jumped to an invalid address from within _ZNK4llvm17ManagedStaticBase21RegisterManagedStaticEPFPvvEPFvS1_E (which is within the command-line parameter registration process)). llvm-svn: 209745	2014-05-28 15:25:06 +00:00
Tilmann Scheller	7c747fc7c6	[AArch64] Add store post-index update folding regression tests for the load/store optimizer. Add regression tests for the following transformation: str X, [x20] ... add x20, x20, #32 -> str X, [x20], #32 with X being either w0, x0, s0, d0 or q0. llvm-svn: 209715	2014-05-28 06:43:00 +00:00
Tilmann Scheller	35e451461f	[AArch64] Add load post-index update folding regression tests for the load/store optimizer. Add regression tests for the following transformation: ldr X, [x20] ... add x20, x20, #32 -> ldr X, [x20], #32 with X being either w0, x0, s0, d0 or q0. llvm-svn: 209711	2014-05-28 05:44:14 +00:00
Sasa Stankovic	e41db2fe31	[mips] Optimize long branch for MIPS64 by removing %higher and %highest. %higher and %highest can have non-zero values only for offsets greater than 2GB, which is highly unlikely, if not impossible when compiling a single function. This makes long branch for MIPS64 3 instructions smaller. Differential Revision: http://llvm-reviews.chandlerc.com/D3281.diff llvm-svn: 209678	2014-05-27 18:53:06 +00:00
Tim Northover	9a6217b650	AArch64: add test for NZCV cross-copy save. llvm-svn: 209665	2014-05-27 16:50:09 +00:00
Tim Northover	de9402d345	AArch64: add AArch64-specific test for 'c' and 'n'. llvm-svn: 209664	2014-05-27 16:50:03 +00:00
Bill Schmidt	71dddd51d9	[PATCH] Correct type used for VADD_SPLAT optimization on PowerPC In PPCISelLowering.cpp: PPCTargetLowering::LowerBUILD_VECTOR(), there is an optimization for certain patterns to generate one or two vector splats followed by a vector add or subtract. This operation is represented by a VADD_SPLAT in the selection DAG. Prior to this patch, it was possible for the VADD_SPLAT to be assigned the wrong data type, causing incorrect code generation. This patch corrects the problem. Specifically, the code previously assigned the value type of the BUILD_VECTOR node to the newly generated VADD_SPLAT node. This is correct much of the time, but not always. The problem is that the call to isConstantSplat() may return a SplatBitSize that is not the same as the number of bits in the original element vector type. The correct type to assign is a vector type with the same element bit size as SplatBitSize. The included test case shows an example of this, where the BUILD_VECTOR node has a type of v16i8. The vector to be built is {0, 16, 0, 16, 0, 16, 0, 16, 0, 16, 0, 16, 0, 16, 0, 16}. isConstantSplat detects that we can generate a splat of 16 for type v8i16, which is the type we must assign to the VADD_SPLAT node. If we do not, we generate a vspltisb of 8 and a vaddubm, which generates the incorrect result {16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16, 16}. The correct code generation is a vspltish of 8 and a vadduhm. This patch also corrected code generation for CodeGen/PowerPC/2008-07-10-SplatMiscompile.ll, which had been marked as an XFAIL, so we can remove the XFAIL from the test case. llvm-svn: 209662	2014-05-27 15:57:51 +00:00
Amara Emerson	ceeb1c4830	[ARM] Emit correct build attributes for the relocation models. Patch by Asiri Rathnayake. llvm-svn: 209656	2014-05-27 13:30:21 +00:00
Tim Northover	4f1909f1da	ARM: teach AAPCS-VFP to deal with Cortex-M4. Cortex-M4 only has single-precision floating point support, so any LLVM "double" type will have been split into 2 i32s by now. Fortunately, the consecutive-register framework turns out to be precisely what's needed to reconstruct the double and follow AAPCS-VFP correctly! rdar://problem/17012966 llvm-svn: 209650	2014-05-27 10:43:38 +00:00
Filipe Cabecinhas	82ac07c283	Convert some X86 blendv* intrinsics into IR. Summary: Implemented an InstCombine transformation that takes a blendv* intrinsic call and translates it into an IR select, if the mask is constant. This will eventually get lowered into blends with immediates if possible, or pblendvb (with an option to further optimize if we can transform the pblendvb into a blend+immediate instruction, depending on the selector). It will also enable optimizations by the IR passes, which give up on sight of the intrinsic. Both the transformation and the lowering of its result to asm got shiny new tests. The transformation is a bit convoluted because of blendvp[sd]'s definition: Its mask is a floating point value! This forces us to convert it and get the highest bit. I suppose this happened because the mask has type __m128 in Intel's intrinsic and v4sf (for blendps) in gcc's builtin. I will send an email to llvm-dev to discuss if we want to change this or not. Reviewers: grosbach, delena, nadav Differential Revision: http://reviews.llvm.org/D3859 llvm-svn: 209643	2014-05-27 03:42:20 +00:00
Rafael Espindola	ac69cee6a2	[PPC] Use alias symbols in address computation. This seems to match what gcc does for ppc and what every other llvm backend does. llvm-svn: 209638	2014-05-26 19:08:19 +00:00
Tim Northover	68ae503de9	AArch64: force i1 to be zero-extended at an ABI boundary. This commit is debatable. There are two possible approaches, neither of which is really satisfactory: 1. Use "@foo(i1 zeroext)" to mean an extension to 32-bits on Darwin, and 8 bits otherwise. 2. Redefine "@foo(i1)" to mean that the i1 is extended by the caller to 8 bits. This goes against the spirit of "zeroext" I think, but it's a bit of a vague construct anyway (by definition you're going to extend to the amount required by the ABI, that's why it's the ABI!). This implements option 2. The DAG machinery really isn't setup for the first (there's a fairly strong assumption that "zeroext" goes to at least the smallest register size), and even if it was the resulting DAG looks like it would be inferior in many cases. Theoretically we could add AssertZext nodes in the consumers of ABI-passed values too now, but this actually seems to make the code worse in practice by making truncation proceed in two steps. The code produced is equally valid if we continue to assume only the low bit is defined. Should fix PR19850 llvm-svn: 209637	2014-05-26 17:22:07 +00:00
Tim Northover	47e003c65d	AArch64: simplify calling conventions slightly. We can eliminate the custom C++ code in favour of some TableGen to check the same things. Functionality should be identical, except for a buffer overrun that was present in the C++ code and meant webkit failed if any small argument needed to be passed on the stack. llvm-svn: 209636	2014-05-26 17:21:53 +00:00
Tilmann Scheller	cc3ebc8a98	[AArch64] Add store + add folding regression tests for the load/store optimization pass. Add tests for the following transform: str X, [x0, #32] ... add x0, x0, #32 -> str X, [x0, #32]! with X being either w1, x1, s0, d0 or q0. llvm-svn: 209627	2014-05-26 13:36:47 +00:00
Tilmann Scheller	112ada831f	[AArch64] Add more regression tests for the load/store optimization pass. Cover the following cases: ldr X, [x0, #32] ... add x0, x0, #32 -> ldr X, [x0, #32]! with X being either w1, x1, s0, d0 or q0. llvm-svn: 209624	2014-05-26 12:15:51 +00:00
Tilmann Scheller	968d599dc4	Remove accidentally committed whitespace. llvm-svn: 209619	2014-05-26 09:40:40 +00:00
Tilmann Scheller	2d746bcda3	[AArch64] Add a regression test for the load store optimizer. We have a couple of regression tests for load/store pairing, but (to my knowledge) there are no regression tests for the load/store + add/sub folding. As a first step towards increased test coverage of this area, this commit adds a test for one instance of a load + add to pre-indexed load transformation. llvm-svn: 209618	2014-05-26 09:37:19 +00:00
Rafael Espindola	f557536f56	Just check the entire string. Thanks to David Blaikie for the suggestion. llvm-svn: 209610	2014-05-26 04:08:51 +00:00
Rafael Espindola	4a04c4b69c	Emit data or code export directives based on the type. Currently we look at the Aliasee to decide what type of export directive to use. It seems better to use the type of the alias directly. This is similar to how we handle the alias having the same address but other attributes (linkage, visibility) from the aliasee. With this patch it is now possible to do things like target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128" target triple = "x86_64-pc-windows-msvc" @foo = global [6 x i8] c"\B8\00\00\00\C3", section ".text", align 16 @f = dllexport alias i32 (), [6 x i8] @foo !llvm.module.flags = !{!0} !0 = metadata !{i32 6, metadata !"Linker Options", metadata !1} !1 = metadata !{metadata !2, metadata !3} !2 = metadata !{metadata !"/DEFAULTLIB:libcmt.lib"} !3 = metadata !{metadata !"/DEFAULTLIB:oldnames.lib"} llvm-svn: 209600	2014-05-25 12:49:07 +00:00
Rafael Espindola	d234b0f08c	Make these CHECKs a bit more strict. The " at the end of the line makes sure we matched the entire directive. llvm-svn: 209599	2014-05-25 12:43:13 +00:00
Tim Northover	3b0846e8f7	AArch64/ARM64: move ARM64 into AArch64's place This commit starts with a "git mv ARM64 AArch64" and continues out from there, renaming the C++ classes, intrinsics, and other target-local objects for consistency. "ARM64" test directories are also moved, and tests that began their life in ARM64 use an arm64 triple, those from AArch64 use an aarch64 triple. Both should be equivalent though. This finishes the AArch64 merge, and everyone should feel free to continue committing as normal now. llvm-svn: 209577	2014-05-24 12:50:23 +00:00
Tim Northover	cc08e1fe1b	AArch64/ARM64: remove AArch64 from tree prior to renaming ARM64. I'm doing this in two phases for a better "git blame" record. This commit removes the previous AArch64 backend and redirects all functionality to ARM64. It also deduplicates test-lines and removes orphaned AArch64 tests. The next step will be "git mv ARM64 AArch64" and rewire most of the tests. Hopefully LLVM is still functional, though it would be even better if no-one ever had to care because the rename happens straight afterwards. llvm-svn: 209576	2014-05-24 12:42:26 +00:00
Tim Northover	e471e43484	ARM64: extract a 32-bit subreg when selecting an inreg extend After the load/store refactoring, we were sometimes trying to feed a GPR64 into a 32-bit register offset operand. This failed in copyPhysReg. llvm-svn: 209566	2014-05-24 07:05:42 +00:00
Nico Rieck	fe2413692f	Revert part of "Fix broken FileCheck prefixes" This reverts part of commit r209538. llvm-svn: 209544	2014-05-23 19:33:49 +00:00
Rafael Espindola	a5bb2f61cf	Use alias linkage and visibility to decide tls access mode. This matches both what we do for the non-thread case and what gcc does. With this patch clang would match gcc's behaviour in static __thread int a = 42; extern __thread int b __attribute__((alias("a"))); int f(void) { return &a; } int g(void) { return &b; } if not for pr19843. Manually writing the IL does produce the same access modes. It is also a step in the direction of fixing pr19844. llvm-svn: 209543	2014-05-23 19:16:56 +00:00
Nico Rieck	fb4c55fb64	Remove unused CHECK lines llvm-svn: 209539	2014-05-23 19:06:44 +00:00
Nico Rieck	bd15945ab2	Fix broken FileCheck prefixes llvm-svn: 209538	2014-05-23 19:06:24 +00:00
Rafael Espindola	fbe44200ce	Convert test to use FileCheck. llvm-svn: 209528	2014-05-23 16:51:13 +00:00
Daniel Sanders	ac27263512	[mips][mips64r6] [ls][dw][lr] are not available in MIPS32r6/MIPS64r6 Summary: Instead the system is required to provide some means of handling unaligned load/store without special instructions. Options include full hardware support, full trap-and-emulate, and hybrids such as hardware support within a cache line and trap-and-emulate for multi-line accesses. MipsSETargetLowering::allowsUnalignedMemoryAccesses() has been configured to assume that unaligned accesses are 'fast' on the basis that I expect few hardware implementations will opt for pure-software handling of unaligned accesses. The ones that do handle it purely in software can override this. mips64-load-store-left-right.ll has been merged into load-store-left-right.ll The stricter testing revealed a Bits!=Bytes bug in passByValArg(). This has been fixed and the variables renamed to clarify the units they hold. Reviewers: zoran.jovanovic, jkolek, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3872 llvm-svn: 209512	2014-05-23 13:18:02 +00:00
Jiangning Liu	4b5b757d65	[ARM64] Fix a bug in shuffle vector lowering to generate corect vext ISD with swapped input vectors. llvm-svn: 209495	2014-05-23 02:54:50 +00:00
Matt Arsenault	05e96f4444	R600: Try to convert BFE back to standard bit ops when possible. This allows existing DAG combines to work on them, and then we can re-match to BFE if necessary during instruction selection. llvm-svn: 209462	2014-05-22 18:09:12 +00:00
Matt Arsenault	5565f65e13	R600: Add dag combine for BFE llvm-svn: 209461	2014-05-22 18:09:07 +00:00
Matt Arsenault	bf8694d36d	R600: Implement ComputeNumSignBitsForTargetNode for BFE llvm-svn: 209460	2014-05-22 18:09:03 +00:00
Matt Arsenault	493c5f1bc4	R600: Expand mul24 for GPUs without it llvm-svn: 209458	2014-05-22 18:00:24 +00:00
Matt Arsenault	f15a05623e	R600: Expand mad24 for GPUs without it llvm-svn: 209457	2014-05-22 18:00:20 +00:00
Matt Arsenault	eb260206c3	R600: Add intrinsics for mad24 llvm-svn: 209456	2014-05-22 18:00:15 +00:00

1 2 3 4 5 ...

9869 Commits