llvm-project

Commit Graph

Author	SHA1	Message	Date
Owen Anderson	352dfff447	Fix another roundToIntegral bug where very large values could become infinity. Problem and solution identified by Steve Canon. llvm-svn: 161969	2012-08-15 18:28:45 +00:00
Evan Cheng	eec6bc6270	Use vld1/vst1 to load/store f64 if alignment is < 4 and the target allows unaligned access. rdar://12091029 llvm-svn: 161962	2012-08-15 17:44:53 +00:00
Owen Anderson	be7e297b6d	Fix typo in comment. llvm-svn: 161956	2012-08-15 16:42:53 +00:00
Jakob Stoklund Olesen	2ec0c41e01	Add missing Rfalse operand to the predicated pseudo-instructions. When predicating this instruction: Rd = ADD Rn, Rm We need an extra operand to represent the value given to Rd when the predicate is false: Rd = ADDCC Rfalse, Rn, Rm, pred The Rd and Rfalse operands are different registers while in SSA form. Rfalse is tied to Rd to make sure they get the same register during register allocation. Previously, Rd and Rn were tied, but that is not required. Compare to MOVCC: Rd = MOVCC Rfalse, Rtrue, pred llvm-svn: 161955	2012-08-15 16:17:24 +00:00
Bill Wendling	e1c54262f4	Set the branch probability of branching to the 'normal' destination of an invoke instruction to something absurdly high, while setting the probability of branching to the 'unwind' destination to the bare minimum. This should cause the normal destination's invoke blocks to be moved closer to the invoke. PR13612 llvm-svn: 161944	2012-08-15 12:22:35 +00:00
Kostya Serebryany	1e575ab8b2	[asan] implement --asan-always-slow-path, which is a part of the improvement to handle unaligned partially OOB accesses. See http://code.google.com/p/address-sanitizer/issues/detail?id=100 llvm-svn: 161937	2012-08-15 08:58:58 +00:00
Owen Anderson	1ff74b0d2d	Fix a problem with APFloat::roundToIntegral where it would return incorrect results for negative inputs to trunc. Add unit tests to verify this behavior. llvm-svn: 161929	2012-08-15 05:39:46 +00:00
Michael Liao	69e172a6f0	fix infinite loop in instcombine with more than 4GB memcpy - memcpy size is wrongly truncated into 32-bit, so an 8GB memcpy is treated as a 0-sized memcpy - as 0-sized memcpy/memset is already removed before SimplifyMemTransfer and SimplifyMemSet in visitCallInst, replace 0 checking with assertions. - replace getZExtValue() with getLimitedValue() according to Eli Friedman llvm-svn: 161923	2012-08-15 03:49:59 +00:00
Nick Lewycky	58564d5aa6	Fix a typo that led to a failure to correctly verify bitcast instructions. Patch by Stephen Hines! llvm-svn: 161921	2012-08-15 02:37:07 +00:00
Richard Smith	8f3447c032	Fix undefined behavior: don't perform array indexing through a potentially null pointer. llvm-svn: 161919	2012-08-15 01:39:31 +00:00
Anton Korobeynikov	c6d945b11a	The names of VFP variants of half-to-float conversion instructions were reversed. This leads to wrong codegen for float-to-half conversion intrinsics which are used to support storage-only fp16 type. NEON variants of same instructions are fine. llvm-svn: 161907	2012-08-14 23:36:01 +00:00
Eric Christopher	5f61a7498b	This needs braces. Spotted by Bill. llvm-svn: 161906	2012-08-14 23:32:15 +00:00
Michael Liao	06f6fe875a	minor fix of X86ISD::VSEXT_MOVL dump llvm-svn: 161902	2012-08-14 22:53:17 +00:00
Michael Liao	34107b9177	fix PR11334 - FP_EXTEND only supports extending from vectors with matching elements. This results in the scalarization of extending to v2f64 from v2f32, which will be legalized to v4f32 not matching with v2f64. - add X86-specific VFPEXT supporting extending from v4f32 to v2f64. - add BUILD_VECTOR lowering helper to recover back the original extending from v4f32 to v2f64. - test case is enhanced to include different vector width. llvm-svn: 161894	2012-08-14 21:24:47 +00:00
Jim Grosbach	ecaef49f59	Switch the fixed-length disassembler to be table-driven. Refactor the TableGen'erated fixed length disassembler to use a table-driven state machine rather than a massive set of nested switch() statements. As a result, the ARM Disassembler (ARMDisassembler.cpp) builds much more quickly and generates a smaller end result. For a Release+Asserts build on a 16GB 3.4GHz i7 iMac w/ SSD: Time to compile at -O2 (averaged w/ hot caches): Previous: 35.5s New: 8.9s TEXT size: Previous: 447,251 New: 297,661 Builds in 25% of the time previously required and generates code 66% of the size. Execution time of the disassembler is only slightly slower (7% disassembling 10 million ARM instructions, 19.6s vs 21.0s). The new implementation has not yet been tuned, however, so the performance should almost certainly be recoverable should it become a concern. llvm-svn: 161888	2012-08-14 19:06:05 +00:00
Owen Anderson	0b35722533	Fix the construction of the magic constant for roundToIntegral to be 64-bit safe. Fixes c-torture/execute/990826-0.c llvm-svn: 161885	2012-08-14 18:51:15 +00:00
Kostya Serebryany	fda7a138f7	[asan] insert crash basic blocks inline as opposed to inserting them at the end of the function. This doesn't seem to fix or break anything, but is considered to be more friendly to downstream passes llvm-svn: 161870	2012-08-14 14:04:51 +00:00
Craig Topper	925a281b00	Factor duplicate calls to getUNDEF in several functions. llvm-svn: 161860	2012-08-14 08:18:43 +00:00
Craig Topper	d0d4b11f66	Re-factor intrinsic lowering to combine common parts of similar intrinsics. Reduces compiled code size a little bit. llvm-svn: 161859	2012-08-14 07:43:25 +00:00
Craig Topper	2a40418a99	Change greater than to greater than or equal so that an identical sized store to the same offset is treated as completely overwriting. llvm-svn: 161857	2012-08-14 07:32:05 +00:00
Richard Smith	0ff8f0eaf9	Fix undefined behavior: binding null pointer to reference. No functionality change. llvm-svn: 161853	2012-08-14 05:31:26 +00:00
Nadav Rotem	70409991bc	During the CodeGenPrepare we often lower intrinsics (such as objsize) and allow some optimizations to turn conditional branches into unconditional. This commit adds a simple control-flow optimization which merges two consecutive basic blocks which are connected by a single edge. This allows the codegen to operate on larger basic blocks. rdar://11973998 llvm-svn: 161852	2012-08-14 05:19:07 +00:00
Eric Christopher	160522c25a	Grammar. llvm-svn: 161851	2012-08-14 05:13:29 +00:00
Eric Christopher	97f6ea9f34	Typo. llvm-svn: 161826	2012-08-14 01:09:10 +00:00
Owen Anderson	a40319b7f1	Add a roundToIntegral method to APFloat, which can be parameterized over various rounding modes. Use this to implement SelectionDAG constant folding of FFLOOR, FCEIL, and FTRUNC. llvm-svn: 161807	2012-08-13 23:32:49 +00:00
Jakob Stoklund Olesen	396b595b92	Transfer weights in transferSuccessorsAndUpdatePHIs(). llvm-svn: 161805	2012-08-13 23:13:25 +00:00
Jakob Stoklund Olesen	1dc107a84e	Print out MachineBasicBlock successor weights when available. llvm-svn: 161804	2012-08-13 23:13:23 +00:00
Nadav Rotem	8d80452076	LICM uses AliasSet information to hoist and sink instructions. However, other passes, such as LoopRotate may invalidate its AliasSet because SSAUpdater does not update the AliasSet properly. This patch teaches SSAUpdater to notify AliasSet that it made changes. The testcase in PR12901 is too big to be useful and I could not reduce it to a normal size. rdar://11872059 PR12901 llvm-svn: 161803	2012-08-13 23:06:54 +00:00
Nadav Rotem	5d4e205874	MemoryDependenceAnalysis attempts to find the first memory dependency for function calls. Currently, if GetLocation reports that it did not find a valid pointer (this is the case for volatile load/stores), we ignore the result. This patch adds code to handle the cases where we did not obtain a valid pointer. rdar://11872864 PR12899 llvm-svn: 161802	2012-08-13 23:03:43 +00:00
Jakob Stoklund Olesen	702bcc3bcf	Remove the TII::scheduleTwoAddrSource() hook. It never does anything when running 'make check', and it gets in the way of updating live intervals in 2-addr. The hook was originally added to help form IT blocks in Thumb2 code before register allocation, but the pass ordering has changed since then, and we run if-conversion after register allocation now. When the MI scheduler is enabled, there will be no less than two schedulers between 2-addr and Thumb2ITBlockPass, so this hook is unlikely to help anything. llvm-svn: 161794	2012-08-13 21:52:57 +00:00
Manman Ren	d6c8270eaa	ARM: enable struct byval for AAPCS-VFP. This change is to be enabled in clang. rdar://9877866 llvm-svn: 161789	2012-08-13 21:22:50 +00:00
Bill Wendling	49aeb5cc5d	Whitespace cleanup. llvm-svn: 161788	2012-08-13 21:20:43 +00:00
Jakob Stoklund Olesen	d0af1d9657	Count triangles and diamonds in early if-conversion. llvm-svn: 161783	2012-08-13 21:03:27 +00:00
Jakob Stoklund Olesen	62a097d134	Delete dead typedef. llvm-svn: 161782	2012-08-13 21:03:25 +00:00
Jakob Stoklund Olesen	83a927d84a	Handle extra Tail predecessors in if-conversion. It is still possible to if-convert if the tail block has extra predecessors, but the tail phis must be rewritten instead of being removed. llvm-svn: 161781	2012-08-13 20:49:04 +00:00
Arnold Schwaighofer	0bb7f23cfc	[Hexagon] Don't mark callee saved registers as clobbered by a tail call This was causing unnecessary spills/restores of callee saved registers. Fixes PR13572. Patch by Pranav Bhandarkar! llvm-svn: 161778	2012-08-13 19:54:01 +00:00
Nadav Rotem	3a94c545cf	Do not optimize (or (and X,Y), Z) into BFI and other sequences if the AND ISDNode has more than one user. rdar://11876519 llvm-svn: 161775	2012-08-13 18:52:44 +00:00
Manman Ren	959acb106b	X86: move Int_CVTSD2SSrr, Int_CVTSI2SSrr, Int_CVTSI2SDrr, Int_CVTSS2SDrr from OpTbl1 to OpTbl2 since they have 3 operands and the last operand can be changed to a memory operand. PR13576 llvm-svn: 161769	2012-08-13 18:29:41 +00:00
Eric Christopher	7d8b53c1f8	Add support for the %H output modifier. Patch by Weiming Zhao. llvm-svn: 161768	2012-08-13 18:18:52 +00:00
Manman Ren	e90e94f117	X86: when auto-detecting the subtarget features, make sure to use IsIntel to detect Nehalem, Westmere and Sandy Bridge. AMD also has processor family 6. llvm-svn: 161763	2012-08-13 17:26:46 +00:00
Kostya Serebryany	0f7a80d0c3	[asan] remove the code for --asan-merge-callbacks as it appears to be a bad idea. (partly related to Bug 13225) llvm-svn: 161757	2012-08-13 14:08:46 +00:00
Tim Northover	5aaa7fde94	Use correct loads for vector types during extending-load operations. Previously, we used VLD1.32 in all cases, however there are both 16 and 64-bit accesses being selected, so we need to use an appropriate width load in those cases. llvm-svn: 161748	2012-08-13 09:06:31 +00:00
Craig Topper	4e5eb72735	Tidy up VSETCC lowering code a bit more by adding an llvm_unreachable and putting a couple of if conditions in a better order. llvm-svn: 161746	2012-08-13 03:42:38 +00:00
Craig Topper	5145a0d967	Refactor code a bit to share commonalities. No functional change intended. llvm-svn: 161745	2012-08-13 02:34:03 +00:00
Craig Topper	ff6e4d1928	Fix an unused variable warning from r161742. llvm-svn: 161743	2012-08-13 01:26:45 +00:00
Craig Topper	a7aaa62d54	Remove the LowerMMXCONCAT_VECTORS function. It could never execute because there are no legal 64-bit vector types that could be used as inputs to a 128-bit concat_vectors. Remove a target specific SDNode and its patterns that become unused as a result. llvm-svn: 161742	2012-08-13 01:23:55 +00:00
Nick Lewycky	333449cd65	When emitting the PC range in an FDE, use the same data encoding for both ends of the range. Fixes PR13581! llvm-svn: 161739	2012-08-12 08:09:45 +00:00
Craig Topper	3d2b271362	Remove call to setOperationAction for SETCC of v4f32. SETCC returns an integer type not an FP type. llvm-svn: 161738	2012-08-12 05:31:32 +00:00
Craig Topper	498228d089	Remove unnecessary call to setOperationAction for SETCC of v2i64 under SSE42. It was already called for the same under SSE2. llvm-svn: 161737	2012-08-12 05:15:16 +00:00
Arnold Schwaighofer	b73da9453c	Revert 161581: Patch to implement UMLAL/SMLAL instructions for the ARM architecture It broke MultiSource/Applications/JM/ldecod/ldecod on armv7 thumb O0 g and armv7 thumb O3. llvm-svn: 161736	2012-08-12 05:11:56 +00:00

55695 Commits