llvm-project

Commit Graph

Author	SHA1	Message	Date
Nadav Rotem	73ddcfe03f	LoopVectorizer: change debug prints: Print the module identifier when deciding to vectorize. When deciding not to vectorize do not print the called function name because it can be null. llvm-svn: 166989	2012-10-30 00:40:39 +00:00
Jakub Staszak	a3d8e9974a	Re-commit r166971. I reverted it too quickly, when buildbots didn't have a chance to test it with chapni's fix (-mattr=+avx). llvm-svn: 166985	2012-10-30 00:01:57 +00:00
Kevin Enderby	6fd9624843	Fix ARM's b.w instruction for thumb 2 and the encoding T4. The branch target is 24 bits not 20 and the decoding needed to correctly handle converting the J1 and J2 bits to their I1 and I2 values to reconstruct the displacement. llvm-svn: 166982	2012-10-29 23:27:20 +00:00
Jakub Staszak	d74cb61d86	Revert r166971. It causes buildbot failure. To be investigated. llvm-svn: 166979	2012-10-29 23:13:50 +00:00
Jakub Staszak	c3a92131dc	Remove unused variable. llvm-svn: 166973	2012-10-29 22:04:32 +00:00
Jakub Staszak	9c361bdfeb	Simplify code. No functionality change. llvm-svn: 166972	2012-10-29 22:02:26 +00:00
Jakub Staszak	c8f4825ba6	Allow to fold vector load if there is more than one bitcast, so in the case: %0 = load <8 x i16>* %dest %1 = shufflevector <8 x i16> %0, <8 x i16> %in, <8 x i32> < i32 0, i32 1, i32 2, i32 3, i32 13, i32 undef, i32 14, i32 14> store <8 x i16> %1, <8 x i16>* %dest We get: vmovlpd (%eax), %xmm0, %xmm0 instead of: vmovaps (%eax), %xmm1 vmovsd %xmm1, %xmm0, %xmm0 No extra test-case is added. I just fixed the existing one (also it uses FileCheck now). llvm-svn: 166971	2012-10-29 21:56:35 +00:00
Nadav Rotem	5ad045a8c5	LoopVectorize: Update and preserve the dominator tree info. llvm-svn: 166970	2012-10-29 21:52:38 +00:00
Bill Schmidt	bd4ac26973	This patch solves a problem with passing varargs parameters under the PPC64 ELF ABI. A varargs parameter consisting of a single-precision floating-point value, or of a single-element aggregate containing a single-precision floating-point value, must be passed in the low-order (rightmost) four bytes of the doubleword stack slot reserved for that parameter. If there are GPR protocol registers remaining, the parameter must also be mirrored in the low-order four bytes of the reserved GPR. Prior to this patch, such parameters were being passed in the high-order four bytes of the stack slot and the mirrored GPR. The patch adds a new test case to verify the correct code generation. llvm-svn: 166968	2012-10-29 21:18:16 +00:00
Reed Kotler	740981e35c	Implement patterns for extloadi8 and extloadi16 llvm-svn: 166960	2012-10-29 19:39:04 +00:00
Ulrich Weigand	3abb34389d	In various places throughout the code generator, there were special checks to avoid performing compile-time arithmetic on PPCDoubleDouble. Now that APFloat supports arithmetic on PPCDoubleDouble, those checks are no longer needed, and we can treat the type like any other. llvm-svn: 166958	2012-10-29 18:35:49 +00:00
Ulrich Weigand	908c936fa9	APFloat cleanup: Remove now unused "arithmeticOK" logic. llvm-svn: 166954	2012-10-29 18:18:44 +00:00
Ulrich Weigand	e1d62f9c0a	APFloat cleanup: Remove now unused fields "sign2" and "exponent2". llvm-svn: 166952	2012-10-29 18:17:42 +00:00
Ulrich Weigand	d9f7e259aa	Implement arithmetic on APFloat with PPCDoubleDouble semantics by treating it as if it were an IEEE floating-point type with 106-bit mantissa. This makes compile-time arithmetic on "long double" for PowerPC in clang (in particular parsing of floating point constants) work, and fixes all "long double" related failures in the test suite. llvm-svn: 166951	2012-10-29 18:09:01 +00:00
Chad Rosier	1bbaa449ad	[ms-inline asm] Add support for the [] operator. Essentially, [expr1][expr2] is equivalent to [expr1 + expr2]. See test cases for more examples. rdar://12470392 llvm-svn: 166949	2012-10-29 18:01:54 +00:00
Nadav Rotem	39aab03be3	Rename the BB-vectorize flag to match the dragonegg name llvm-svn: 166948	2012-10-29 18:01:14 +00:00
Michael Liao	ad0b69fe3e	Fix PR14204 - Add missing pattern on X86ISD::VZEXT from VR256 to VR256 when AVX2 is enabled. llvm-svn: 166947	2012-10-29 17:57:12 +00:00
Joerg Sonnenberger	2b86e48b3a	Fix typo llvm-svn: 166945	2012-10-29 17:56:15 +00:00
Jakob Stoklund Olesen	9a06696a77	Completely disallow partial copies in adjustCopiesBackFrom(). Partial copies can show up even when CoalescerPair.isPartial() returns false. For example: %vreg24:dsub_0<def> = COPY %vreg31:dsub_0; QPR:%vreg24,%vreg31 Such a partial-partial copy is not good enough for the transformation adjustCopiesBackFrom() needs to do. llvm-svn: 166944	2012-10-29 17:51:52 +00:00
Ulrich Weigand	0de4a1e4ae	Allow i32/i64 for 'f' constraint on PowerPC. This fixes PR12757. llvm-svn: 166943	2012-10-29 17:49:34 +00:00
Duncan Sands	5bdd9dda48	Remove a wrapper around getIntPtrType added to GVN by Hal in commit 166624 (the wrapper returns a vector of integers when passed a vector of pointers) by having getIntPtrType itself return a vector of integers in this case. Outside of this wrapper, I didn't find anywhere in the codebase that was relying on the old behaviour for vectors of pointers, so give this a whirl through the buildbots. llvm-svn: 166939	2012-10-29 17:31:46 +00:00
Bob Wilson	09d16aa87e	Remove code to saturate profile counts. We may need to change the way profile counter values are stored, but saturation is the wrong thing to do. Just remove it for now. Patch by Alastair Murray! llvm-svn: 166938	2012-10-29 17:27:39 +00:00
Nadav Rotem	c59ae207ef	Change the PassManagerBuilder (used by -O3) loop vectorizer flag from -vectorize to -vectorize-loops because we don't want to share the same flag as the bb-vectorizer. llvm-svn: 166937	2012-10-29 16:36:25 +00:00
Hans Wennborg	aad8ad1c36	Minor style fixes for TargetTransformationInfo and TargetTransformImpl llvm-svn: 166936	2012-10-29 16:26:52 +00:00
Reed Kotler	aebb8b034c	Expand all atomic ops for mips16. llvm-svn: 166935	2012-10-29 16:16:54 +00:00
NAKAMURA Takumi	4bd79920be	PPCSubtarget.h: Add explicit braces. llvm-svn: 166932	2012-10-29 15:51:42 +00:00
NAKAMURA Takumi	70b25de24e	PPCSubtarget.h: Whitespace. llvm-svn: 166931	2012-10-29 15:51:35 +00:00
Preston Gurd	52dacca977	This patch addresses a problem with the Post RA scheduler generating an incorrect instruction sequence due to it not being aware that an inline assembly instruction may reference memory. This patch fixes the problem by causing the scheduler to always assume that any inline assembly code instruction could access memory. This is necessary because the internal representation of the inline instruction does not include any information about memory accesses. This should fix PR13504. llvm-svn: 166929	2012-10-29 15:01:23 +00:00
Bill Schmidt	bbc661e572	This patch adds alignment information for long double to the 64-bit PowerPC ELF subtarget. The existing logic is used as a fallback to avoid any changes to the Darwin ABI. PPC64 ELF now has two possible data layout strings: one for FreeBSD, which requires 8-byte alignment, and a default string that requires 16-byte alignment. I've added a test for PPC64 Linux to verify the 16-byte alignment. If somebody wants to add a separate test for FreeBSD, that would be great. Note that there is a companion patch to update the alignment information in Clang, which I am committing now as well. llvm-svn: 166928	2012-10-29 14:59:36 +00:00
Duncan Sands	835e93a231	Factorize code: rather than duplicating the logic in getPointerTypeSizeInBits, just call getPointerTypeSizeInBits. No functionality change. llvm-svn: 166926	2012-10-29 14:30:05 +00:00
Duncan Sands	ac8448e0d0	Silence a GCC warning about comparing signed and unsigned types. llvm-svn: 166922	2012-10-29 11:29:53 +00:00
Tim Northover	94bc73d3d1	Make use of common-symbol alignment info in ELF loader. Patch by Amara Emerson. llvm-svn: 166919	2012-10-29 10:47:04 +00:00
Tim Northover	4f223bf7c4	Add interface for querying object files for symbol values. Currently only implemented for ELF. Patch by Amara Emerson. llvm-svn: 166918	2012-10-29 10:47:00 +00:00
Nadav Rotem	42f73c8e4d	Calling TLI->getNumRegisters creates a circular dependency when building LLVM using cmake. Get the number of registers by calling getTypeLegalizationCost. PR14199. llvm-svn: 166911	2012-10-29 05:28:35 +00:00
Lang Hames	ee6142c36b	Remove unused typedef. llvm-svn: 166910	2012-10-29 04:57:52 +00:00
Rafael Espindola	56183fbe78	llvm-extract changes linkages so that functions on both sides of the split module can see each other. If it is keeping a symbol that already has a non local linkage, it doesn't need to change it. llvm-svn: 166908	2012-10-29 01:59:03 +00:00
Rafael Espindola	9d30d0fc67	llvm-extract was unable to handle aliases. It would leave a copy on the output of both llvm-extract foo.ll -func=bar and llvm-extract foo.ll -func=bar -delete so the two new files could not be linked together anymore. With this change alias are handled almost like functions and global variables. Almost because with alias we cannot just clear the initializer/body, we have to create a new declaration and replace the alias with it. The net result is that now the output of the above commands can be linked even if foo.ll has aliases. llvm-svn: 166907	2012-10-29 00:27:55 +00:00
Reed Kotler	e6c31579be	Implement brind operator for mips16. llvm-svn: 166903	2012-10-28 23:08:07 +00:00
Rafael Espindola	d957cb2584	Remove TargetELFWriterInfo. All the credit goes to Jan Voung for noticing it was dead! llvm-svn: 166902	2012-10-28 21:34:43 +00:00
Reed Kotler	3589dd74ac	This patch is for the implementation of mips16 complex pattern addr16. Previously mips16 was sharing the pattern addr which is used for mips32 and mips64. This had a number of problems: 1) Storing and loading byte and halfword quantities for mips16 has particular problems due to the primarily non mips16 nature of SP. When we must load/store byte/halfword stack objects in a function, we must create a mips16 alias register for SP. This functionality is tested in stchar.ll. 2) We need to have an FP register under certain conditions (such as dynamically sized alloca). We use mips16 register S0 for this purpose. In this case, we also use this register when accessing frame objects so this issue also affects the complex pattern addr16. This functionality is tested in alloca16.ll. The Mips16InstrInfo.td has been updated to use addr16 instead of addr. The complex pattern C++ function for addr has been copied to addr16 and updated to reflect the above issues. llvm-svn: 166897	2012-10-28 06:02:37 +00:00
Jakob Stoklund Olesen	57143f7e78	Never attempt to join an early-clobber def with a regular kill. This fixes PR14194. llvm-svn: 166880	2012-10-27 17:41:27 +00:00
Benjamin Kramer	8d2ee55a0c	LoopIdiom: Add checks to avoid turning memmove into an infinite loop. I don't think this is possible with the current implementation but that may change eventually. llvm-svn: 166877	2012-10-27 15:18:28 +00:00
Benjamin Kramer	1c9e5186c0	LoopIdiom: Recognize memmove loops. This turns loops like for (unsigned i = 0; i != n; ++i) p[i] = p[i+1]; into memmove, which has a highly optimized implementation in most libcs. This was really easy with the new DependenceAnalysis :) llvm-svn: 166875	2012-10-27 14:25:51 +00:00
Benjamin Kramer	d5c9be8247	LoopIdiom: Replace custom dependence analysis with DependenceAnalysis. Requires a lot less code and complexity on loop-idiom's side and the more precise analysis can catch more cases, like the one I included as a test case. This also fixes the edge-case miscompilation from PR9481. Compile time performance seems to be slightly worse, but this is mostly due to an extra LCSSA run scheduled by the PassManager and should be fixed there. llvm-svn: 166874	2012-10-27 14:25:44 +00:00
Benjamin Kramer	5bc077aa88	SCEV validator: Ignore CouldNotCompute/undef on both sides. This is mostly noise and blocks finding more severe bugs. llvm-svn: 166873	2012-10-27 11:36:07 +00:00
Benjamin Kramer	24d270db57	SCEV validator: Add workarounds for some common false positives due to the way it handles strings. llvm-svn: 166872	2012-10-27 10:45:01 +00:00
Hal Finkel	bad10bb2f3	Update BBVectorize to use the new VTTI instr. cost interfaces. The monolithic interface for instruction costs has been split into several functions. This is the corresponding change. No functionality change is intended. llvm-svn: 166865	2012-10-27 04:33:48 +00:00
Nadav Rotem	859366f93f	1. Fix a bug in getTypeConversion. When a simple type is split, we need to return the type of the split result. 2. Change the maximum vectorization width from 4 to 8. 3. A test for both. llvm-svn: 166864	2012-10-27 04:11:32 +00:00
Quentin Colombet	3ee56a3bf5	[code size][ARM] Emit regular call instructions instead of the move, branch sequence llvm-svn: 166854	2012-10-27 01:10:17 +00:00
Reed Kotler	7e4d9969cb	Implement MipsHi for mips16 llvm-svn: 166852	2012-10-27 00:57:14 +00:00
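
The b.w fix in 6fd9624843 above turns on how the T4 branch displacement is rebuilt from the J1 and J2 bits. Below is a minimal sketch of that reconstruction, not the code from the commit: the function name decodeThumb2BranchT4 is hypothetical, and the bit positions are an assumption based on the Thumb-2 B (encoding T4) layout in the ARM architecture reference manual, where I1 = NOT(J1 XOR S) and I2 = NOT(J2 XOR S).

```cpp
#include <cstdint>

// Hypothetical helper (not LLVM's decoder): rebuild the signed byte offset of
// a Thumb-2 B.W (encoding T4) from its two 16-bit halfwords.
// Assumed layout: hw1 = 11110 S imm10, hw2 = 10 J1 1 J2 imm11.
std::int32_t decodeThumb2BranchT4(std::uint16_t hw1, std::uint16_t hw2) {
  const std::uint32_t S = (hw1 >> 10) & 1u;
  const std::uint32_t imm10 = hw1 & 0x3FFu;
  const std::uint32_t J1 = (hw2 >> 13) & 1u;
  const std::uint32_t J2 = (hw2 >> 11) & 1u;
  const std::uint32_t imm11 = hw2 & 0x7FFu;

  // J1 and J2 are stored inverted relative to the sign bit.
  const std::uint32_t I1 = ~(J1 ^ S) & 1u;
  const std::uint32_t I2 = ~(J2 ^ S) & 1u;

  // S:I1:I2:imm10:imm11 is the 24-bit field; the implicit trailing zero
  // (halfword alignment) makes it a 25-bit byte offset.
  const std::uint32_t imm25 =
      (S << 24) | (I1 << 23) | (I2 << 22) | (imm10 << 12) | (imm11 << 1);

  // Sign-extend from bit 24.
  const std::uint32_t signBit = 1u << 24;
  return static_cast<std::int32_t>(imm25 ^ signBit) -
         static_cast<std::int32_t>(signBit);
}
```

Treating the field as 20 bits, or using J1 and J2 directly as I1 and I2, would give wrong targets for most long-range branches, which matches the problem the commit message describes.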

57083 Commits