llvm-project

Commit Graph

Author	SHA1	Message	Date
Benjamin Kramer	1835c13fc2	Remove noisy semicolon. llvm-svn: 112673	2010-08-31 23:38:13 +00:00
Chris Lattner	030f02021b	licm is wasting time hoisting constant foldable operations, instead of hoisting them, just fold them away. This occurs in the testcase for PR8041, for example. llvm-svn: 112669	2010-08-31 23:00:16 +00:00
Kevin Enderby	d94dacf829	This is the second of three patches to implement support for the .loc directive and output the dwarf line number tables. This takes the current loc info after an instruction is assembled and saves the needed info into an object that has vector and for each section. These objects will be used for the final patch to build and emit the encoded dwarf line number tables. Again for now this is only in the Mach-O streamer but at some point will move to a more generic place. llvm-svn: 112668	2010-08-31 22:55:11 +00:00
Dan Gohman	47308d5da3	Reapply r112432, now that the real problem is addressed. llvm-svn: 112667	2010-08-31 22:53:17 +00:00
Dan Gohman	f01a5eed1e	Reapply r112433, now that the real problem is addressed. llvm-svn: 112666	2010-08-31 22:52:12 +00:00
Dan Gohman	aabfc52790	Revert r110916. This patch is buggy because the code inside the inner loop doesn't update all the variables in the outer loop. llvm-svn: 112665	2010-08-31 22:50:31 +00:00
Bill Wendling	6789f8b6ae	We have a chance for an optimization. Consider this code: int x(int t) { if (t & 256) return -26; return 0; } We generate this: tst.w r0, #256 mvn r0, #25 it eq moveq r0, #0 while gcc generates this: ands r0, r0, #256 it ne mvnne r0, #25 bx lr Scandalous really! During ISel time, we can look for this particular pattern. One where we have a "MOVCC" that uses the flag off of a CMPZ that itself is comparing an AND instruction to 0. Something like this (greatly simplified): %r0 = ISD::AND ... ARMISD::CMPZ %r0, 0 @ sets [CPSR] %r0 = ARMISD::MOVCC 0, -26 @ reads [CPSR] All we have to do is convert the "ISD::AND" into an "ARM::ANDS" that sets [CPSR] when it's zero. The zero value will all ready be in the %r0 register and we only need to change it if the AND wasn't zero. Easy! llvm-svn: 112664	2010-08-31 22:41:22 +00:00
Anton Korobeynikov	4e7351d2ba	Some fixes for NetBSD llvm-svn: 112662	2010-08-31 22:38:00 +00:00
Bruno Cardoso Lopes	4b56d87290	Use x86 specific MOVSLDUP node, add more patterns to match it and remove useless load nodes llvm-svn: 112661	2010-08-31 22:35:05 +00:00
Devang Patel	86ec8b3a3f	Reapply r112623. Included additional check for unused byval argument. llvm-svn: 112659	2010-08-31 22:22:42 +00:00
Bruno Cardoso Lopes	61996ef835	Use x86 specific MOVSHDUP node and add more patterns to match it llvm-svn: 112657	2010-08-31 22:22:11 +00:00
Bill Wendling	d657d82597	And ANDS pattern to match the t2ANDS pattern. llvm-svn: 112654	2010-08-31 22:05:37 +00:00
Jakob Stoklund Olesen	94e90b9406	Stack slot access methods are in TargetInstrInfo. llvm-svn: 112653	2010-08-31 22:01:07 +00:00
Dale Johannesen	9b7fdc0483	Comment typo. llvm-svn: 112652	2010-08-31 21:53:15 +00:00
Jakob Stoklund Olesen	33e9fce2d6	Make %EFLAGS unallocatable. No CCR virtual registers should exist, and %EFLAGS is used in ways that can surprise RegAllocFast. llvm-svn: 112650	2010-08-31 21:51:07 +00:00
Jakob Stoklund Olesen	7993dae7bd	Track liveness of unallocatable, unreserved registers in machine DCE. Reserved registers are unpredictable, and are treated as always live by machine DCE. Allocatable registers are never reserved, and can be used for virtual registers. Unreserved, unallocatable registers can not be used for virtual registers, but otherwise behave like a normal allocatable register. Most targets only have the flag register in this set. llvm-svn: 112649	2010-08-31 21:51:05 +00:00
Bruno Cardoso Lopes	5de15ce468	Use MOVHLPS node instead of matching using movhlps and movhlps_undef pattern fragments llvm-svn: 112644	2010-08-31 21:38:49 +00:00
Chris Lattner	daca6f3483	tidy up llvm-svn: 112643	2010-08-31 21:21:25 +00:00
Bruno Cardoso Lopes	03e4c35302	Use MOVLHPS and MOVHLPS x86 nodes whenever possible. Also remove some useless nodes llvm-svn: 112642	2010-08-31 21:15:21 +00:00
Dan Gohman	90f29bcd90	Revert r112432. It appears to be exposing a problem in the emacs build. llvm-svn: 112638	2010-08-31 20:58:44 +00:00
Owen Anderson	a5e6b3eca4	Merge 2010-08-31-InfiniteRecursion.ll into crash.ll. llvm-svn: 112635	2010-08-31 20:27:17 +00:00
Owen Anderson	3c84ecb067	More cleanups of my JumpThreading transforms, including extracting some duplicated code into a helper function. llvm-svn: 112634	2010-08-31 20:26:04 +00:00
Jakob Stoklund Olesen	2c325dc907	Ignore unallocatable registers in RegAllocFast. llvm-svn: 112632	2010-08-31 19:54:25 +00:00
Devang Patel	529f248eb4	Revert r112623. It is causing self host build failures. llvm-svn: 112631	2010-08-31 19:41:03 +00:00
Duncan Sands	11c23fc349	Update the Ada instructions to LLVM 2.7 (from LLVM 2.5). llvm-svn: 112630	2010-08-31 19:40:21 +00:00
Owen Anderson	6fdcb172a9	Add an RAII helper to make cleanup of the RecursionSet more fool-proof. llvm-svn: 112628	2010-08-31 19:24:27 +00:00
Owen Anderson	048efbe225	Only try to clean up the current block if we changed that block already. llvm-svn: 112625	2010-08-31 18:55:52 +00:00
Jim Grosbach	9ce9210e47	SP relative offsets need to be adjusted by the local allocation size when determining if they're likely to be in range of the SP when resolving frame references. llvm-svn: 112624	2010-08-31 18:52:31 +00:00
Devang Patel	8559932d36	Remember byval argument's frame index during argument lowering and use this info to emit debug info. Fixes Radar 8367011. llvm-svn: 112623	2010-08-31 18:50:09 +00:00
Jim Grosbach	6f6b590b99	this assert should just be a condition, since this function is just asking if the offset is legally encodable, not actually trying to do the encoding. llvm-svn: 112622	2010-08-31 18:49:31 +00:00
Owen Anderson	799a08ae48	Add a test for the duplicated-conditional situation illutrated by PR5652. llvm-svn: 112621	2010-08-31 18:49:12 +00:00
Owen Anderson	cd4de7f399	Refactor my fix for PR5652 to terminate the predecessor lookups after the first failure. llvm-svn: 112620	2010-08-31 18:48:48 +00:00
Chris Lattner	e2295f1c80	merge two tests. llvm-svn: 112617	2010-08-31 18:44:03 +00:00
Owen Anderson	3931c85956	Manually reduce this testcase. llvm-svn: 112615	2010-08-31 18:16:29 +00:00
Chris Lattner	fbcd165b59	merge two tests and convert to filecheck. llvm-svn: 112613	2010-08-31 18:05:08 +00:00
Owen Anderson	ada0623725	Add a micro-test for the transforms I added to JumpThreading. I have not been able to find a way to test each in isolation, for a few reasons: 1) The ability to look-through non-i1 BinaryOperator's requires the ability to look through non-constant ICmps in order for it to ever trigger. 2) The ability to do LVI-powered PHI value determination only matters in cases that ProcessBranchOnPHI can't handle. Since it already handles all the cases without other instructions in the def-use chain between the PHI and the branch, it requires the ability to look through ICmps and/or BinaryOperators as well. llvm-svn: 112611	2010-08-31 17:59:07 +00:00
Jim Grosbach	ad9b6de3b6	Update test for 112609 llvm-svn: 112610	2010-08-31 17:58:47 +00:00
Jim Grosbach	365e931f7b	Improve virtual frame base register allocation heuristics. 1. Allocate them in the entry block of the function to enable function-wide re-use. The instructions to create them should be re-materializable, so there shouldn't be additional cost compared to creating them local to the basic blocks where they are used. 2. Collect all of the frame index references for the function and sort them by the local offset referenced. Iterate over the sorted list to allocate the virtual base registers. This enables creation of base registers optimized for positive-offset access of frame references. (Note: This may be appropriate to later be a target hook to do the sorting in a target appropriate manner. For now it's done here for simplicity.) llvm-svn: 112609	2010-08-31 17:58:19 +00:00
Dan Gohman	444c24a9f0	Speculatively revert r112433. llvm-svn: 112608	2010-08-31 17:56:47 +00:00
Benjamin Kramer	4226198a1d	Allow creation of SHT_NULL sections, from Roman Divacky. llvm-svn: 112605	2010-08-31 17:03:33 +00:00
Duncan Sands	bb8a3f9f6d	Stop using the dom frontier in DwarfEHPrepare by not promoting alloca's any more. I plan to reimplement alloca promotion using SSAUpdater later. It looks like Bill's URoR logic really always needs domtree, so the pass now always asks for domtree info. llvm-svn: 112597	2010-08-31 09:05:06 +00:00
Nick Lewycky	68984ede5c	Fix an infinite loop; merging two functions will create a new function (if the two are weak, we make them thunks to a new strong function) so don't iterate through the function list as we're modifying it. Also add back the outermost loop which got removed during the cleanups. llvm-svn: 112595	2010-08-31 08:29:37 +00:00
Owen Anderson	ce401be792	Don't perform an extra traversal of the function just to do cleanup. We can safely simplify instructions after each block has been processed without worrying about iterator invalidation. llvm-svn: 112594	2010-08-31 07:55:56 +00:00
Bill Wendling	b70dc8777e	- Cleanup some whitespaces. - Convert {0,1} and friends into 0b01, which is identical and more consistent. llvm-svn: 112593	2010-08-31 07:50:46 +00:00
Owen Anderson	064b139c8d	Rename test directory to reflect new pass name. llvm-svn: 112592	2010-08-31 07:50:31 +00:00
Owen Anderson	48d58ad64c	Rename ValuePropagation to a more descriptive CorrelatedValuePropagation. llvm-svn: 112591	2010-08-31 07:48:34 +00:00
Owen Anderson	d2918a07bd	Rename file to something more descriptive. llvm-svn: 112590	2010-08-31 07:41:39 +00:00
Owen Anderson	3997a07fb9	More Chris-inspired JumpThreading fixes: use ConstantExpr to correctly constant-fold undef, and be more careful with its return value. This actually exposed an infinite recursion bug in ComputeValueKnownInPredecessors which theoretically already existed (in JumpThreading's handling of and/or of i1's), but never manifested before. This patch adds a tracking set to prevent this case. llvm-svn: 112589	2010-08-31 07:36:34 +00:00
Michael J. Spencer	618d21978d	Cleanup Whitespace. llvm-svn: 112587	2010-08-31 06:36:46 +00:00
Michael J. Spencer	18f5005eb0	System: Fix getMagicNumber on windows. getMagicNumber was treating the _binary_ data it read in as a null terminated string. This resulted in the std::string calculating the length, and causing an assert in other code that assumed that the length it passed was the same as the length of the string it would get back. llvm-svn: 112586	2010-08-31 06:36:33 +00:00
Michael J. Spencer	1279e737f5	Fix spelling/typo. llvm-svn: 112585	2010-08-31 06:36:22 +00:00
Devang Patel	417d72823a	Offset is not always unsigned number. llvm-svn: 112584	2010-08-31 06:12:08 +00:00
Devang Patel	2cfc3af181	Simplify. llvm-svn: 112583	2010-08-31 06:11:28 +00:00
Nick Lewycky	0464d1d7ec	Switch to DenseSet, simplifying much more code. We now have a single iteration where we hash, compare and fold, instead of one iteration where we build up the hash buckets and a second one to fold. llvm-svn: 112582	2010-08-31 05:53:05 +00:00
Owen Anderson	376597c13e	Remove r111665, which implemented store-narrowing in InstCombine. Chris discovered a miscompilation in it, and it's not easily fixable at the optimizer level. I'll investigate reimplementing it in DAGCombine. llvm-svn: 112575	2010-08-31 04:41:06 +00:00
Bruno Cardoso Lopes	d9ef4a1a24	zap unused method. x86 is the only user and already has a more powerfull version llvm-svn: 112571	2010-08-31 02:36:20 +00:00
Bruno Cardoso Lopes	dfd9dd5d75	Use X86ISD::MOVSS and MOVSD to represent the movl mask pattern, also fix the handling of those nodes when seeking for scalars inside vector shuffles llvm-svn: 112570	2010-08-31 02:26:40 +00:00
Eric Christopher	901176a755	Rewrite slightly so we can expand for floating point types easier. llvm-svn: 112568	2010-08-31 01:28:42 +00:00
Jakob Stoklund Olesen	9c39690edf	Add experimental -disable-physical-join command line option. Eventually, we want to disable physreg coalescing completely, and let the register allocator do its job using hints. This option makes it possible to measure the impact of disabling physreg coalescing. llvm-svn: 112567	2010-08-31 01:27:49 +00:00
Owen Anderson	b58b3c0dda	Fix a typo. llvm-svn: 112560	2010-08-30 23:59:30 +00:00
Eric Christopher	bbd1098989	If we have an unhandled type then assert, we shouldn't get here for things we can't handle. llvm-svn: 112559	2010-08-30 23:48:26 +00:00
Dan Gohman	f2d817192b	Update the descriptions of NoModRef and ModRef to be consistent with the descriptions of Mod and Ref. llvm-svn: 112557	2010-08-30 23:47:24 +00:00
Anton Korobeynikov	3a1d87a7ba	Fix borken test llvm-svn: 112555	2010-08-30 23:41:49 +00:00
Owen Anderson	70b17c50e2	Combine these two tests, and make sure there's a newline at the end of the file. llvm-svn: 112554	2010-08-30 23:37:41 +00:00
Owen Anderson	b974dbbdd7	Cleanups suggested by Chris. llvm-svn: 112553	2010-08-30 23:34:17 +00:00
Owen Anderson	c910acb54a	Re-apply r112539, being more careful to respect the return values of the constant folding methods. Additionally, use the ConstantExpr::get*() methods to simplify some constant folding. llvm-svn: 112550	2010-08-30 23:22:36 +00:00
Anton Korobeynikov	48043d0173	Expand MOVi32imm in ARM mode after regalloc. This provides scheduling opportunities (extra instruction can go in between MOVT / MOVW pair removing the stall). llvm-svn: 112546	2010-08-30 22:50:36 +00:00
Owen Anderson	30bacbdfdf	Add statistics to evaluate this pass. llvm-svn: 112545	2010-08-30 22:45:55 +00:00
Owen Anderson	1ddcbbe49c	Revert r112539. It accidentally introduced a miscompilation. llvm-svn: 112543	2010-08-30 22:33:41 +00:00
Owen Anderson	75f6037c7c	Fixes and cleanups pointed out by Chris. In general, be careful to handle 0 results from ComputeValueKnownInPredecessors (indicating undef), and re-use existing constant folding APIs. llvm-svn: 112539	2010-08-30 22:07:52 +00:00
Bill Wendling	87bb14c566	Use the existing T2I_bin_s_irs pattern instead of creating T2I_bin_sw_irs, which is meant to do exactly the same thing. Thanks to Jim Grosbach for pointing this out! :-) llvm-svn: 112538	2010-08-30 22:05:23 +00:00
NAKAMURA Takumi	fe933eb8f6	Fix a comment. llvm-svn: 112535	2010-08-30 21:54:03 +00:00
Jakob Stoklund Olesen	4d30f90e35	Remember to clear the shadow kill flag at the same time as clearing the real kill flag. This could cause duplicate kill flags when the same register was used twice in a continuous sequence of STRs. There is no small test case. <rdar://problem/8218046> llvm-svn: 112534	2010-08-30 21:52:40 +00:00
Dan Gohman	60bcc102d7	Fix llc to run the verifier once, not twice. llvm-svn: 112532	2010-08-30 21:41:20 +00:00
Owen Anderson	f45ee8128c	Remove this from the main tree. I'll host it out of tree. llvm-svn: 112529	2010-08-30 21:34:26 +00:00
Dan Gohman	62ddc15f06	Add comments explaining why it's not necessary to include the is-function-local flag in metadata uniquing bits. llvm-svn: 112528	2010-08-30 21:18:41 +00:00
Bob Wilson	4cd8a126c3	Remove NEON vmovn intrinsic, replacing it with vector truncate operations. Auto-upgrade the old intrinsic and update tests. llvm-svn: 112507	2010-08-30 20:02:30 +00:00
Jim Grosbach	fef37287a8	Make ARM add rN, sp, #imm instructions rematerializable. That's how the address of locals is calculated, so this should help relieve register pressure a bit. Recalculating the local address is almost always going to be better than spilling. llvm-svn: 112503	2010-08-30 19:49:58 +00:00
Eric Christopher	e7a9db16bb	Fix LLVM target initialization to deal with sociopathic outside projects that like to randomly define things like "X86", regenerate autoconf bits and update cmake. Fixes PR7852. Patch by Xerxes Rånby! llvm-svn: 112499	2010-08-30 18:34:48 +00:00
Eric Christopher	470e84cfeb	Kill a couple of unused variables. llvm-svn: 112498	2010-08-30 18:31:44 +00:00
Chris Lattner	3f54837161	nuke dead ivar which was supposed to be committed with r112496 llvm-svn: 112497	2010-08-30 18:16:27 +00:00
Chris Lattner	34bfab0ad5	two changes: 1) nuke ConstDataCoalSection, which is dead. 2) revise my previous patch for rdar://8018335, which was completely wrong. Specifically, it doesn't make sense to mark __TEXT,__const_coal as PURE_INSTRUCTIONS, because it is for readonly data. templates (it turns out) go to const_coal_nt. The real fix for rdar://8018335 was to give ConstTextCoalSection a section kind of ReadOnly instead of Text. llvm-svn: 112496	2010-08-30 18:12:35 +00:00
Bob Wilson	e2f8bdac14	When expanding NEON VST pseudo instructions, if the original super-register operand is killed, add it to the expanded instruction as an implicit kill operand instead of marking the individual subregs with kill flags. This should work better in general and also handles the case for VST3 where one of the subregs was not referenced in the expanded instruction and so was not marked killed. llvm-svn: 112494	2010-08-30 18:10:48 +00:00
Benjamin Kramer	b1b493bcab	MCELF: The value of all common symbols is the offset from the start of the section. Patch by Roman Divacky. llvm-svn: 112492	2010-08-30 17:20:17 +00:00
Owen Anderson	9517943d11	It is possible to try to merge a not-constant with a constantrage, when dealing with ptrtoint ConstantExpr's. Unfortunately, the only testcase I have for this is huge and doesn't reduce well because the error is sensitive to iteration-order issues, since the problem only occurs when merging values in a particular order. llvm-svn: 112489	2010-08-30 17:03:45 +00:00
Michael J. Spencer	2f997cdedf	Partially revert r112480. Caused test failures. llvm-svn: 112486	2010-08-30 15:34:08 +00:00
NAKAMURA Takumi	e53cf6f85d	coff-dump.py: Fix PR7996. Now it is compatible to Python-2.4. llvm-svn: 112485	2010-08-30 15:19:56 +00:00
Michael J. Spencer	7983340465	Fix constant-over-index.ll test on windows. llvm-svn: 112483	2010-08-30 15:08:02 +00:00
Michael J. Spencer	41c18853c8	Test: Fix LLVMC tests on CMake. The CMake build didn't define TEST_COMPILE_CXX_CMD. The tests assumed gcc. llvm-svn: 112480	2010-08-30 14:49:00 +00:00
Benjamin Kramer	8548c892a8	Don't print two "0x" prefixes. Use a raw_ostream overload instead of llvm::format. llvm-svn: 112479	2010-08-30 14:46:53 +00:00
NAKAMURA Takumi	9c0e59de64	EE/JIT: Do not invoke parent's ctors/dtors from main()! (PR3897) On Mingw and Cygwin, the symbol __main is resolved to callee's(eg. tools/lli) one, to invoke wrong duplicated ctors (and register wrong callee's dtors with atexit(3)). We expect, by callee, ExecutionEngine::runStaticConstructorsDestructors() is called before ExecutionEngine::runFunctionAsMain() is called. llvm-svn: 112474	2010-08-30 14:00:29 +00:00
Benjamin Kramer	8199447851	The value is offset from the start of the section for non-common symbols, submitted by Jordan Gordeev. llvm-svn: 112473	2010-08-30 12:00:16 +00:00
Benjamin Kramer	f791b9fc56	Index external symbols by symbol table instead of parent section, by Roman Divacky. llvm-svn: 112472	2010-08-30 11:59:29 +00:00
Benjamin Kramer	6ebea89316	Mark all common symbols external. This is not exactly correct but it lets apps link for now and can be adjusted later. Patch by Roman Divacky. llvm-svn: 112471	2010-08-30 11:56:55 +00:00
Duncan Sands	1b6744a376	Remove a hack that tries to understand incorrect triples from the Triple class constructor. Only valid triples should now be used inside LLVM - front-ends are now responsable for rejecting or correcting invalid target triples. The Triple::normalize method can be used to straighten out funky triples provided by users. Give this a whirl through the buildbots to see if I caught all places where triples enter LLVM. llvm-svn: 112470	2010-08-30 10:57:54 +00:00
Duncan Sands	68c30907cc	Correct bogus module triple specifications. llvm-svn: 112469	2010-08-30 10:48:29 +00:00
Owen Anderson	75c205e7e2	Add a new example to the LLVM distribution: a trace-based Brainfuck compiler that uses LLVM as its code generator. llvm-svn: 112465	2010-08-30 07:33:39 +00:00
Chandler Carruth	1ddaacebf6	Attempt to remove the MSIL backend from CMake as well based on Chris's r112375. llvm-svn: 112464	2010-08-30 07:25:54 +00:00
Bill Wendling	f824489a1d	Revert r112461. It was failing on PPC... llvm-svn: 112463	2010-08-30 04:36:50 +00:00
Bill Wendling	f8dfa461fa	Create Thumb2sI_cpsr and T2sI_cpsr. These new classes indicate that CPSR is the optional modified register (instead of reg0). Along with r112461 it will make sure that the optional define of CPSR is marked as "def" and will thus mark the instructions using these classes (t2ANDS*) as setting the 's' flag. llvm-svn: 112462	2010-08-30 01:47:35 +00:00
Bill Wendling	938f299fa9	When adding a register, we should mark it as "def" if it can optionally define said (physical) register. llvm-svn: 112461	2010-08-30 01:36:05 +00:00
Chris Lattner	ea05bf2259	revert 112457, it looks like it broke selfhost. llvm-svn: 112459	2010-08-29 22:28:18 +00:00
Chris Lattner	c843fca2fd	rewrite DwarfEHPrepare to use SSAUpdater to promote its allocas instead of PromoteMemToReg. This allows it to stop using DF and DT, eliminating a computation of DT and DF from clang -O3. Clang is now down to 2 runs of DomFrontier. llvm-svn: 112457	2010-08-29 19:54:28 +00:00
Chris Lattner	d94a7c3dc1	inline function into its only caller. llvm-svn: 112455	2010-08-29 19:28:28 +00:00
Chris Lattner	f58382ed87	two changes: 1) make AliasSet hold the list of call sites with an assertingvh so we get a violent explosion if the pointer dangles. 2) Fix AliasSetTracker::deleteValue to remove call sites with by-pointer comparisons instead of by-alias queries. Using findAliasSetForCallSite can cause alias sets to get merged when they shouldn't, and can also miss alias sets when the call is readonly. #2 fixes PR6889, which only repros with a .c file :( llvm-svn: 112452	2010-08-29 18:42:23 +00:00
Chris Lattner	263f804699	LICM does get dead instructions input to it. Instead of sinking them out of loops, just delete them. llvm-svn: 112451	2010-08-29 18:22:25 +00:00
Chris Lattner	6ac0659a1c	use moveBefore instead of remove+insert, it avoids some symtab manipulation, so its faster (in addition to being more elegant) llvm-svn: 112450	2010-08-29 18:18:40 +00:00
Chris Lattner	f03b4eac48	revert 112448 for now. llvm-svn: 112449	2010-08-29 18:11:16 +00:00
Chris Lattner	11f8ad8211	optimize LICM::hoist to use moveBefore. Correct its updating of AST to remove the hoisted instruction from the AST, since it is no longer in the loop. llvm-svn: 112448	2010-08-29 18:03:33 +00:00
Chris Lattner	1a1ed69435	fix some bugs (found by inspection) where LICM would not update LICM correctly. When sinking an instruction, it should not add entries for the sunk instruction to the AST, it should remove the entry for the sunk instruction. The blocks being sunk to are not in the loop, so their instructions shouldn't be in the AST (yet)! llvm-svn: 112447	2010-08-29 18:00:00 +00:00
Chris Lattner	cc9cbc66a3	rework the ownership of subloop alias information: instead of keeping them around until the pass is destroyed, keep them around a) just when useful (not for outer loops) and b) destroy them right after we use them. This should reduce memory use and fixes potential bugs where a loop is deleted and another loop gets allocated to the same address. llvm-svn: 112446	2010-08-29 17:46:00 +00:00
Chris Lattner	bc1a65ac6c	apparently unswitch had the same "Feature". Stop its claims that it preserves domfrontier if it doesn't really. llvm-svn: 112445	2010-08-29 17:23:19 +00:00
Chris Lattner	d6f46b8af8	now that loop passes don't use DomFrontier, there is no reason for the unroller to pretend it supports updating it. It still has a horrible hack for DomTree. llvm-svn: 112444	2010-08-29 17:21:35 +00:00
Dan Gohman	3a08ed7904	Make IVUsers iterative instead of recursive. This has the side effect of reversing the order of most of IVUser's results. llvm-svn: 112442	2010-08-29 16:40:03 +00:00
Dan Gohman	002ff89cbd	Optionally rerun dedicated-register filtering after applying other filtering techniques, as those may allow it to filter out more obviously unprofitable candidates. llvm-svn: 112441	2010-08-29 16:39:22 +00:00
Dan Gohman	f031792cc6	Fix several areas in LSR to do a better job keeping the main LSRInstance data structures up to date. This fixes some pessimizations caused by stale data which will be exposed in an upcoming change. llvm-svn: 112440	2010-08-29 16:32:54 +00:00
Dan Gohman	e9e0873b08	Refactor the three main groups of code out of NarrowSearchSpaceUsingHeuristics into separate functions. llvm-svn: 112439	2010-08-29 16:09:42 +00:00
Dan Gohman	37a0f68036	Delete a bogus check. llvm-svn: 112438	2010-08-29 15:30:29 +00:00
Dan Gohman	b6a520d63c	Add some comments. llvm-svn: 112437	2010-08-29 15:27:08 +00:00
Dan Gohman	bf673e0652	Move this debug output into GenerateAllReuseFormula, to declutter the high-level logic. llvm-svn: 112436	2010-08-29 15:21:38 +00:00
Dan Gohman	d366b6d5c8	Delete an unused declaration. llvm-svn: 112435	2010-08-29 15:19:11 +00:00
Dan Gohman	4f13bbfefc	Do one lookup instead of two. llvm-svn: 112434	2010-08-29 15:18:49 +00:00
Dan Gohman	d1da5cdfee	Restructure the {A,+,B}<L> * {C,+,D}<L> folding so that it folds all applicable addrecs before recursing on getMulExpr, instead of recursing on getMulExpr for each one. llvm-svn: 112433	2010-08-29 15:16:58 +00:00
Dan Gohman	3e6fc18943	Batch up subtracts along with adds, when analyzing long chains of operations. llvm-svn: 112432	2010-08-29 15:10:06 +00:00
Dan Gohman	7712d2900d	Micro-optimize GroupByComplexity. llvm-svn: 112431	2010-08-29 15:07:13 +00:00
Dan Gohman	0f2de01355	Hold AddRec->getLoop() in a variable, to make the Mul code more consistent with the Add code. llvm-svn: 112430	2010-08-29 14:55:19 +00:00
Dan Gohman	028c18158a	Rename a variable, for consistency. llvm-svn: 112429	2010-08-29 14:53:34 +00:00
Dan Gohman	28a84d4ba1	Use iterators instead of indices. llvm-svn: 112428	2010-08-29 14:52:02 +00:00
Dan Gohman	dfbed53a84	Don't worry about union types. llvm-svn: 112427	2010-08-29 14:50:21 +00:00
Dan Gohman	6665550bca	Make this test less dependent on register allocation choices. llvm-svn: 112426	2010-08-29 14:49:42 +00:00
Dan Gohman	883fa863f8	Use exec. llvm-svn: 112425	2010-08-29 14:49:00 +00:00
Dan Gohman	42cb9be9fd	Delete an unused declaration. llvm-svn: 112424	2010-08-29 14:48:15 +00:00
Kalle Raiskila	1e616572d9	Fix lowering of INSERT_VECTOR_ELT in SPU. The IDX was treated as byte index, not element index. llvm-svn: 112422	2010-08-29 12:41:50 +00:00
Bill Wendling	8fc2b590b9	Fix whitespaces. No functionality changes. llvm-svn: 112421	2010-08-29 11:31:07 +00:00
Chris Lattner	0c4deab098	Stop explicitly scheduling domfrontier before the loop passes, since none of them use it. With this, we now only run domfrontier (an N^2 analysis) 3 times at clang -O3: once for "early" per-function cleanup, once at the start of the per-function pipeline to support SRoA, and once late because the EHPrepare class uses it. EHPrepare needs to stop using it, this is silly and wasteful. llvm-svn: 112420	2010-08-29 07:05:51 +00:00
Chris Lattner	f94f6bb0ba	licm preserves the cfg, it doesn't have to explicitly say it preserves domfrontier. It does preserve AA though. llvm-svn: 112419	2010-08-29 07:02:56 +00:00
Chris Lattner	abe61ef3b4	now that it doesn't use the PromoteMemToReg function, LICM doesn't require DomFrontier. Dropping this doesn't actually save any runs of the pass though. llvm-svn: 112418	2010-08-29 06:49:44 +00:00
Chris Lattner	1dc98b47b5	completely rewrite the memory promotion algorithm in LICM. Among other things, this uses SSAUpdater instead of PromoteMemToReg. llvm-svn: 112417	2010-08-29 06:43:52 +00:00
Bob Wilson	d0c054886c	Remove NEON vaddl, vaddw, vsubl, and vsubw intrinsics. Instead, use llvm IR add/sub operations with one or both operands sign- or zero-extended. Auto-upgrade the old intrinsics. llvm-svn: 112416	2010-08-29 05:57:34 +00:00
Chris Lattner	9c3931a544	use getUniqueExitBlocks instead of a manual set. llvm-svn: 112412	2010-08-29 05:12:21 +00:00
Eli Friedman	f75de6eae7	A couple of small missed optimizations. llvm-svn: 112411	2010-08-29 05:07:40 +00:00
Chris Lattner	85bf5421e1	reimplement LICM::sink to use SSAUpdater instead of PromoteMemToReg. This leads to much simpler code. llvm-svn: 112410	2010-08-29 04:55:06 +00:00
Chris Lattner	c3fb03e289	implement SSAUpdater::RewriteUseAfterInsertions, a helpful form of RewriteUse. llvm-svn: 112409	2010-08-29 04:54:06 +00:00
Chris Lattner	b50407f104	remove dead proto llvm-svn: 112408	2010-08-29 04:53:24 +00:00
Chris Lattner	cd96b4df56	reduce indentation in LICM::sink by using early exits, use getUniqueExitBlocks instead of getExitBlocks and a manual set to eliminate dupes. llvm-svn: 112405	2010-08-29 04:28:20 +00:00
Chris Lattner	188cc5a0fc	modernize this pass a bit: use efficient set/map and reduce indentation. llvm-svn: 112404	2010-08-29 04:23:04 +00:00
Duncan Sands	77b9df21ba	Flesh out the list of things I've worked on. llvm-svn: 112403	2010-08-29 04:22:35 +00:00
Chris Lattner	dc8070ed6d	when merging two alias sets, the result set is volatile if either of the sets is volatile. We were dropping the volatile bit of the merged in set, leading (luckily) to assertions in cases like PR7535. I cannot produce a testcase that repros with opt, but this is obviously correct. llvm-svn: 112402	2010-08-29 04:14:47 +00:00
Chris Lattner	eef6b19dcb	more cleanup llvm-svn: 112401	2010-08-29 04:13:43 +00:00
Chris Lattner	afb7074f18	clean this up llvm-svn: 112400	2010-08-29 04:06:55 +00:00
Bill Wendling	df9ec17d53	- Add a parameter to T2I_bin_irs for those patterns which set the S bit. - Create T2I_bin_sw_irs to be like T2I_bin_w_irs, but that it sets the S bit. llvm-svn: 112399	2010-08-29 03:55:31 +00:00
Chris Lattner	c2887bc283	merge a bunch of shuffle tests into sse2.ll llvm-svn: 112398	2010-08-29 03:19:04 +00:00
Chris Lattner	38ccc8b884	add a bunch more common shuffles to the instprinter. llvm-svn: 112397	2010-08-29 03:08:08 +00:00
Chris Lattner	b1ff978406	add some nounwind's llvm-svn: 112396	2010-08-29 03:07:47 +00:00
Bill Wendling	b0dc465c04	Name ANDflag to ANDS, which is less stupid. llvm-svn: 112395	2010-08-29 03:06:09 +00:00
Bill Wendling	ac64ed0923	File missing from last commit. llvm-svn: 112394	2010-08-29 03:02:28 +00:00
Bill Wendling	0a65116cce	Create an ARMISD::AND node. This node is exactly like the "ARM::AND" node, but it sets the CPSR register. llvm-svn: 112393	2010-08-29 03:02:11 +00:00
NAKAMURA Takumi	64be20e5a4	Minor change. This is test for git svn dcommit llvm-svn: 112389	2010-08-28 21:12:51 +00:00
Chris Lattner	7a05e6dca2	I have manually decoded the imm field of an insertps one too many times. This patch causes llc and llvm-mc (which both default to verbose-asm) to print out comments after a few common shuffle instructions which indicates the shuffle mask, e.g.: insertps $113, %xmm3, %xmm0 ## xmm0 = zero,xmm0[1,2],xmm3[1] unpcklps %xmm1, %xmm0 ## xmm0 = xmm0[0],xmm1[0],xmm0[1],xmm1[1] pshufd $1, %xmm1, %xmm1 ## xmm1 = xmm1[1,0,0,0] This is carefully factored to keep the information extraction (of the shuffle mask) separate from the printing logic. I plan to move the extraction part out somewhere else at some point for other parts of the x86 backend that want to introspect on the behavior of shuffles. llvm-svn: 112387	2010-08-28 20:42:31 +00:00
Chris Lattner	112b6ee3f2	fixme accomplished llvm-svn: 112386	2010-08-28 20:40:28 +00:00
Chris Lattner	d39d2aad3f	tidy up llvm-svn: 112385	2010-08-28 20:34:35 +00:00
NAKAMURA Takumi	d1ca3e145f	Add me to the "blame list"! And it is my 1st test commit. llvm-svn: 112384	2010-08-28 20:24:43 +00:00
Dan Gohman	d13b1a3581	Remove obsolete keywords which are no longer relevant. llvm-svn: 112382	2010-08-28 20:14:05 +00:00
Dan Gohman	5458b9ddc6	Remove unions from the vim syntax highlighting. llvm-svn: 112381	2010-08-28 20:11:28 +00:00
Chris Lattner	94656b1c8c	fix the buildvector->insertp[sd] logic to not always create a redundant insertp[sd] $0, which is a noop. Before: _f32: ## @f32 pshufd $1, %xmm1, %xmm2 pshufd $1, %xmm0, %xmm3 addss %xmm2, %xmm3 addss %xmm1, %xmm0 ## kill: XMM0<def> XMM0<kill> XMM0<def> insertps $0, %xmm0, %xmm0 insertps $16, %xmm3, %xmm0 ret after: _f32: ## @f32 movdqa %xmm0, %xmm2 addss %xmm1, %xmm2 pshufd $1, %xmm1, %xmm1 pshufd $1, %xmm0, %xmm3 addss %xmm1, %xmm3 movdqa %xmm2, %xmm0 insertps $16, %xmm3, %xmm0 ret The extra movs are due to a random (poor) scheduling decision. llvm-svn: 112379	2010-08-28 17:59:08 +00:00
Chris Lattner	bcb6090ad0	fix the BuildVector -> unpcklps logic to not do pointless shuffles when the top elements of a vector are undefined. This happens all the time for X86-64 ABI stuff because only the low 2 elements of a 4 element vector are defined. For example, on: _Complex float f32(_Complex float A, _Complex float B) { return A+B; } We used to produce (with SSE2, SSE4.1+ uses insertps): _f32: ## @f32 movdqa %xmm0, %xmm2 addss %xmm1, %xmm2 pshufd $16, %xmm2, %xmm2 pshufd $1, %xmm1, %xmm1 pshufd $1, %xmm0, %xmm0 addss %xmm1, %xmm0 pshufd $16, %xmm0, %xmm1 movdqa %xmm2, %xmm0 unpcklps %xmm1, %xmm0 ret We now produce: _f32: ## @f32 movdqa %xmm0, %xmm2 addss %xmm1, %xmm2 pshufd $1, %xmm1, %xmm1 pshufd $1, %xmm0, %xmm3 addss %xmm1, %xmm3 movaps %xmm2, %xmm0 unpcklps %xmm3, %xmm0 ret This implements rdar://8368414 llvm-svn: 112378	2010-08-28 17:28:30 +00:00
Chris Lattner	96db6e66f4	improve comments in the unpcklps generating logic, introduce a new EltStride variable instead of reusing NumElems variable for a non-obvious purpose. No functionality change. llvm-svn: 112377	2010-08-28 17:15:43 +00:00
Michael J. Spencer	d75cf22f72	Don't cast Win32 FILETIME structs to int64. Patch by Dimitry Andric! According to the Microsoft documentation here: http://msdn.microsoft.com/en-us/library/ms724284%28VS.85%29.aspx this cast used in lib/System/Win32/Path.inc: __int64 ft = reinterpret_cast<__int64>(&fi.ftLastWriteTime); should not be done. The documentation says: "Do not cast a pointer to a FILETIME structure to either a ULARGE_INTEGER* or __int64* value because it can cause alignment faults on 64-bit Windows." llvm-svn: 112376	2010-08-28 16:39:32 +00:00
Chris Lattner	bd24404718	remove the MSIL backend. It isn't maintained, is buggy, has no testcases and hasn't kept up with ToT. Approved by Anton. llvm-svn: 112375	2010-08-28 16:33:36 +00:00
Benjamin Kramer	2e5c14713c	Update ocaml test. llvm-svn: 112364	2010-08-28 10:29:41 +00:00
Benjamin Kramer	1d872f5a9f	Remove unions from the ocaml bindings. llvm-svn: 112363	2010-08-28 09:47:42 +00:00
Bob Wilson	950882be07	Use pseudo instructions for VST1 and VST2. llvm-svn: 112357	2010-08-28 05:12:57 +00:00
Chris Lattner	13ee795c42	remove unions from LLVM IR. They are severely buggy and not being actively maintained, improved, or extended. llvm-svn: 112356	2010-08-28 04:09:24 +00:00
Chris Lattner	504e5100d3	remove the ABCD and SSI passes. They don't have any clients that I'm aware of, aren't maintained, and LVI will be replacing their value. nlewycky approved this on irc. llvm-svn: 112355	2010-08-28 03:51:24 +00:00
Chris Lattner	a5217a19a4	remove dead proto llvm-svn: 112354	2010-08-28 03:45:03 +00:00
Chris Lattner	2c9e253ca9	more dead thing zapping. llvm-svn: 112353	2010-08-28 03:43:50 +00:00
Chris Lattner	d069114613	zap dead method llvm-svn: 112352	2010-08-28 03:42:45 +00:00
Chris Lattner	50df36ac0a	for completeness, allow undef also. llvm-svn: 112351	2010-08-28 03:36:51 +00:00
Chris Lattner	95bb297c26	squish dead code. llvm-svn: 112350	2010-08-28 03:21:03 +00:00
Chris Lattner	ca936ac966	zap dead code llvm-svn: 112349	2010-08-28 03:18:45 +00:00
Bruno Cardoso Lopes	a982aa24ef	Clean up the logic of vector shuffles -> vector shifts. Also teach this logic how to handle target specific shuffles if needed, this is necessary while searching recursively for zeroed scalar elements in vector shuffle operands. llvm-svn: 112348	2010-08-28 02:46:39 +00:00
Chris Lattner	d0214f3efe	handle the constant case of vector insertion. For something like this: struct S { float A, B, C, D; }; struct S g; struct S bar() { struct S A = g; ++A.B; A.A = 42; return A; } we now generate: _bar: ## @bar ## BB#0: ## %entry movq _g@GOTPCREL(%rip), %rax movss 12(%rax), %xmm0 pshufd $16, %xmm0, %xmm0 movss 4(%rax), %xmm2 movss 8(%rax), %xmm1 pshufd $16, %xmm1, %xmm1 unpcklps %xmm0, %xmm1 addss LCPI1_0(%rip), %xmm2 pshufd $16, %xmm2, %xmm2 movss LCPI1_1(%rip), %xmm0 pshufd $16, %xmm0, %xmm0 unpcklps %xmm2, %xmm0 ret instead of: _bar: ## @bar ## BB#0: ## %entry movq _g@GOTPCREL(%rip), %rax movss 12(%rax), %xmm0 pshufd $16, %xmm0, %xmm0 movss 4(%rax), %xmm2 movss 8(%rax), %xmm1 pshufd $16, %xmm1, %xmm1 unpcklps %xmm0, %xmm1 addss LCPI1_0(%rip), %xmm2 movd %xmm2, %eax shlq $32, %rax addq $1109917696, %rax ## imm = 0x42280000 movd %rax, %xmm0 ret llvm-svn: 112345	2010-08-28 01:50:57 +00:00
Duncan Sands	3bd97fec8f	Straighten out any triple strings passed on the command line before they hit the rest of the system. llvm-svn: 112344	2010-08-28 01:30:02 +00:00
Chris Lattner	dd6601048e	optimize bitcasts from large integers to vector into vector element insertion from the pieces that feed into the vector. This handles a pattern that occurs frequently due to code generated for the x86-64 abi. We now compile something like this: struct S { float A, B, C, D; }; struct S g; struct S bar() { struct S A = g; ++A.A; ++A.C; return A; } into all nice vector operations: _bar: ## @bar ## BB#0: ## %entry movq _g@GOTPCREL(%rip), %rax movss LCPI1_0(%rip), %xmm1 movss (%rax), %xmm0 addss %xmm1, %xmm0 pshufd $16, %xmm0, %xmm0 movss 4(%rax), %xmm2 movss 12(%rax), %xmm3 pshufd $16, %xmm2, %xmm2 unpcklps %xmm2, %xmm0 addss 8(%rax), %xmm1 pshufd $16, %xmm1, %xmm1 pshufd $16, %xmm3, %xmm2 unpcklps %xmm2, %xmm1 ret instead of icky integer operations: _bar: ## @bar movq _g@GOTPCREL(%rip), %rax movss LCPI1_0(%rip), %xmm1 movss (%rax), %xmm0 addss %xmm1, %xmm0 movd %xmm0, %ecx movl 4(%rax), %edx movl 12(%rax), %esi shlq $32, %rdx addq %rcx, %rdx movd %rdx, %xmm0 addss 8(%rax), %xmm1 movd %xmm1, %eax shlq $32, %rsi addq %rax, %rsi movd %rsi, %xmm1 ret This resolves rdar://8360454 llvm-svn: 112343	2010-08-28 01:20:38 +00:00
Dan Gohman	e06905d1f0	Completely disable tail calls when fast-isel is enabled, as fast-isel doesn't currently support dealing with this. llvm-svn: 112341	2010-08-28 00:51:03 +00:00
Dan Gohman	1e06dbf881	Trim a #include. llvm-svn: 112340	2010-08-28 00:49:13 +00:00
Dan Gohman	fe22f1d3cc	Fix an index calculation thinko. llvm-svn: 112337	2010-08-28 00:39:27 +00:00
Bob Wilson	8ee9394750	We don't need to custom-select VLDMQ and VSTMQ anymore. llvm-svn: 112336	2010-08-28 00:20:11 +00:00
Benjamin Kramer	83f9ff0452	Update CMake build. Add newline at end of file. llvm-svn: 112332	2010-08-28 00:11:12 +00:00
Bob Wilson	ca5af12920	When merging Thumb2 loads/stores, do not give up when the offset is one of the special values that for ARM would be used with IB or DA modes. Fall through and consider materializing a new base address is it would be profitable. llvm-svn: 112329	2010-08-27 23:57:52 +00:00
Owen Anderson	cf7f941121	Add a prototype of a new peephole optimizing pass that uses LazyValue info to simplify PHIs and select's. This pass addresses the missed optimizations from PR2581 and PR4420. llvm-svn: 112325	2010-08-27 23:31:36 +00:00
Owen Anderson	38f6b7fe3b	Improve the precision of getConstant(). llvm-svn: 112323	2010-08-27 23:29:38 +00:00
Bob Wilson	13ce07fa92	Change ARM VFP VLDM/VSTM instructions to use addressing mode #4 , just like all the other LDM/STM instructions. This fixes asm printer crashes when compiling with -O0. I've changed one of the NEON tests (vst3.ll) to run with -O0 to check this in the future. Prior to this change VLDM/VSTM used addressing mode #5, but not really. The offset field was used to hold a count of the number of registers being loaded or stored, and the AM5 opcode field was expanded to specify the IA or DB mode, instead of the standard ADD/SUB specifier. Much of the backend was not aware of these special cases. The crashes occured when rewriting a frameindex caused the AM5 offset field to be changed so that it did not have a valid submode. I don't know exactly what changed to expose this now. Maybe we've never done much with -O0 and NEON. Regardless, there's no longer any reason to keep a count of the VLDM/VSTM registers, so we can use addressing mode #4 and clean things up in a lot of places. llvm-svn: 112322	2010-08-27 23:18:17 +00:00
Chris Lattner	954e9557e3	tidy up test. llvm-svn: 112321	2010-08-27 23:15:21 +00:00
Chris Lattner	b8b7d52631	no really, fix the test. llvm-svn: 112317	2010-08-27 23:05:54 +00:00
Chris Lattner	c8908b4cdb	fix this test. It's not clear what it's really testing. llvm-svn: 112316	2010-08-27 23:05:27 +00:00
Chris Lattner	6c1395f62a	Enhance the shift propagator to handle the case when you have: A = shl x, 42 ... B = lshr ..., 38 which can be transformed into: A = shl x, 4 ... iff we can prove that the would-be-shifted-in bits are already zero. This eliminates two shifts in the testcase and allows eliminate of the whole i128 chain in the real example. llvm-svn: 112314	2010-08-27 22:53:44 +00:00
Devang Patel	f2855b147f	Simplify. llvm-svn: 112305	2010-08-27 22:25:51 +00:00
Chris Lattner	18d7fc8fc6	Implement a pretty general logical shift propagation framework, which is good at ripping through bitfield operations. This generalize a bunch of the existing xforms that instcombine does, such as (x << c) >> c -> and to handle intermediate logical nodes. This is useful for ripping up the "promote to large integer" code produced by SRoA. llvm-svn: 112304	2010-08-27 22:24:38 +00:00
Bob Wilson	aaff8f539a	Fix a comment typo. llvm-svn: 112302	2010-08-27 21:56:59 +00:00
Bob Wilson	af371b49a8	Unsigned value cannot be < 0. llvm-svn: 112300	2010-08-27 21:44:35 +00:00
Dan Gohman	15871f23e3	When merging adjacent operands, scan ahead and merge all equal adjacent operands at once, instead of just two at a time. llvm-svn: 112299	2010-08-27 21:39:59 +00:00
Eric Christopher	ec5030b213	Fix a couple of typos. Patch by Cameron Esfahani! llvm-svn: 112297	2010-08-27 21:38:11 +00:00
Chris Lattner	25a198e72b	remove some special shift cases that have been subsumed into the more general simplify demanded bits logic. llvm-svn: 112291	2010-08-27 21:04:34 +00:00
Dan Gohman	c866bf4fec	Make the {A,+,B}<L> + {C,+,D}<L> --> Other + {A+C,+,B+D}<L> transformation collect all the addrecs with the same loop add combine them at once rather than starting everything over at the first chance. llvm-svn: 112290	2010-08-27 20:45:56 +00:00
Chris Lattner	606b76eba6	merge and filecheckize test llvm-svn: 112289	2010-08-27 20:44:45 +00:00
Chris Lattner	c665156e9f	merge two tests llvm-svn: 112288	2010-08-27 20:42:10 +00:00
Bill Wendling	6628431a91	Remove now unneeded command line flag that enables 'optimize compares.' llvm-svn: 112287	2010-08-27 20:39:09 +00:00
Owen Anderson	99d4cb861b	Fix typos in comments. llvm-svn: 112286	2010-08-27 20:32:56 +00:00
Chris Lattner	7398434675	teach the truncation optimization that an entire chain of computation can be truncated if it is fed by a sext/zext that doesn't have to be exactly equal to the truncation result type. llvm-svn: 112285	2010-08-27 20:32:06 +00:00
Dan Gohman	9bad2fb378	Switch ScalarEvolution's main Value->SCEV map from std::map to DenseMap. llvm-svn: 112281	2010-08-27 18:55:03 +00:00
Chris Lattner	7413e87b6d	get this test passing on linux builders. llvm-svn: 112280	2010-08-27 18:49:08 +00:00
Chris Lattner	90cd746e63	Add an instcombine to clean up a common pattern produced by the SRoA "promote to large integer" code, eliminating some type conversions like this: %94 = zext i16 %93 to i32 ; <i32> [#uses=2] %96 = lshr i32 %94, 8 ; <i32> [#uses=1] %101 = trunc i32 %96 to i8 ; <i8> [#uses=1] This also unblocks other xforms from happening, now clang is able to compile: struct S { float A, B, C, D; }; float foo(struct S A) { return A.A + A.B+A.C+A.D; } into: _foo: ## @foo ## BB#0: ## %entry pshufd $1, %xmm0, %xmm2 addss %xmm0, %xmm2 movdqa %xmm1, %xmm3 addss %xmm2, %xmm3 pshufd $1, %xmm1, %xmm0 addss %xmm3, %xmm0 ret on x86-64, instead of: _foo: ## @foo ## BB#0: ## %entry movd %xmm0, %rax shrq $32, %rax movd %eax, %xmm2 addss %xmm0, %xmm2 movapd %xmm1, %xmm3 addss %xmm2, %xmm3 movd %xmm1, %rax shrq $32, %rax movd %eax, %xmm0 addss %xmm3, %xmm0 ret This seems pretty close to optimal to me, at least without using horizontal adds. This also triggers in lots of other code, including SPEC. llvm-svn: 112278	2010-08-27 18:31:05 +00:00
Bob Wilson	edf722add3	Add alignment arguments to all the NEON load/store intrinsics. Update all the tests using those intrinsics and add support for auto-upgrading bitcode files with the old versions of the intrinsics. llvm-svn: 112271	2010-08-27 17:13:24 +00:00
Owen Anderson	6ebbd92380	Use LVI to eliminate conditional branches where we've tested a related condition previously. Update tests for this change. This fixes PR5652. llvm-svn: 112270	2010-08-27 17:12:29 +00:00
Dan Gohman	2706567c5c	Optimize SCEVComplexityCompare. Use a 3-way return instead of a 2-way return to avoid needing two calls to test for equivalence, and sort addrecs by their degree before examining their operands. llvm-svn: 112267	2010-08-27 15:26:01 +00:00
Dan Gohman	ca15841394	Clarify a comment. llvm-svn: 112266	2010-08-27 15:16:40 +00:00
Dan Gohman	fa4a4705f9	Parse " (Hidden)" and cope with it. llvm-svn: 112265	2010-08-27 15:16:09 +00:00
Dan Gohman	7e64c985fb	Default to looking for clang++ in the PATH, rather than trying to guess a path that will work. llvm-svn: 112264	2010-08-27 15:15:31 +00:00
Anton Korobeynikov	c0b36921c2	Properly handle passing of FP stuff to varargs function on Win64: value should be copied to the corresponding shadow reg as well. Patch by Cameron Esfahani! llvm-svn: 112262	2010-08-27 14:43:06 +00:00
Benjamin Kramer	1f6012479f	MCELF: Port EmitInstruction changes from MachO streamer. Patch by Roman Divacky. llvm-svn: 112260	2010-08-27 10:40:51 +00:00
Benjamin Kramer	05e22982c8	MCELF: Always overwrite FixedValue. llvm-svn: 112259	2010-08-27 10:38:39 +00:00
Michael J. Spencer	788a6079e1	Fix the msvs 2010 build. The Microsoft (R) 32-bit C/C++ Optimizing Compiler Version 16.00.30319.01 implements parts of C++0x based on the draft standard. An old version of the draft had a bug that makes std::pair<T1, T2>(something, 0) fail to compile. This is because the template<class U, class V> pair(U&& x, V&& y) constructor is selected, even though it later fails to implicitly convert U and V to frist_type and second_type. This has been fixed in n3090, but it seems that Microsoft is not going to update msvc. llvm-svn: 112257	2010-08-27 02:49:45 +00:00
Daniel Dunbar	1844a71e66	X86: Fix an encoding issue with LOCK_ADD64mr, which could lead to very hard to find miscompiles with the integrated assembler. llvm-svn: 112250	2010-08-27 01:30:14 +00:00
Devang Patel	b12ff5999e	Revert r112213. It is not needed. llvm-svn: 112242	2010-08-26 23:35:15 +00:00
Jim Grosbach	6a77066913	Simplify eliminateFrameIndex() interface back down now that PEI doesn't need to try to re-use scavenged frame index reference registers. rdar://8277890 llvm-svn: 112241	2010-08-26 23:32:16 +00:00
Devang Patel	ea134f56b1	If node is not available then use FuncInfo.ValueMap to emit debug info for byval parameter. llvm-svn: 112238	2010-08-26 22:53:27 +00:00
Jim Grosbach	2a1915d04b	Remove the now obsolete frame index virtual re-use algorithm from PEI. Pre-RA virtual base registers handle this function, and more. A bit more cleanup to do on the interface to eliminateFrameIndex() after this. llvm-svn: 112237	2010-08-26 22:42:12 +00:00
Chris Lattner	c188b96bbe	filecheckize llvm-svn: 112235	2010-08-26 22:23:39 +00:00
Chris Lattner	387d6bcdcb	rename test. llvm-svn: 112234	2010-08-26 22:20:47 +00:00
Chris Lattner	bfd2228182	optimize "integer extraction out of the middle of a vector" as produced by SRoA. This is part of rdar://7892780, but needs another xform to expose this. llvm-svn: 112232	2010-08-26 22:14:59 +00:00
Jim Grosbach	e82d5b4aaf	tidy up a bit. no functional change. llvm-svn: 112228	2010-08-26 21:56:30 +00:00
Chris Lattner	d4ebd6df5a	optimize bitcast(trunc(bitcast(x))) where the result is a float and 'x' is a vector to be a vector element extraction. This allows clang to compile: struct S { float A, B, C, D; }; float foo(struct S A) { return A.A + A.B+A.C+A.D; } into: _foo: ## @foo ## BB#0: ## %entry movd %xmm0, %rax shrq $32, %rax movd %eax, %xmm2 addss %xmm0, %xmm2 movapd %xmm1, %xmm3 addss %xmm2, %xmm3 movd %xmm1, %rax shrq $32, %rax movd %eax, %xmm0 addss %xmm3, %xmm0 ret instead of: _foo: ## @foo ## BB#0: ## %entry movd %xmm0, %rax movd %eax, %xmm0 shrq $32, %rax movd %eax, %xmm2 addss %xmm0, %xmm2 movd %xmm1, %rax movd %eax, %xmm1 addss %xmm2, %xmm1 shrq $32, %rax movd %eax, %xmm0 addss %xmm1, %xmm0 ret ... eliminating half of the horribleness. llvm-svn: 112227	2010-08-26 21:55:42 +00:00
Chris Lattner	3c19d3d5c3	filecheckize llvm-svn: 112225	2010-08-26 21:51:41 +00:00
Chris Lattner	7717c616bd	rename test llvm-svn: 112224	2010-08-26 21:50:56 +00:00
Chris Lattner	e14ae4ec24	add m_BitCast for matching a bitcast. llvm-svn: 112222	2010-08-26 21:35:52 +00:00
Jim Grosbach	17da935964	Turn off the scavenging based frame reg reuse briefly to measure whether it's still having a significant effect. It shouldn't be now that the pre-RA virtual base reg stuff is in. Assuming that's valididated by the nightly testers, we can simplify a lot of the PEI frame index code. llvm-svn: 112220	2010-08-26 21:29:54 +00:00
Bruno Cardoso Lopes	e25ba0c7c2	zap the now unused MVT::getIntVectorWithNumElements llvm-svn: 112218	2010-08-26 20:53:12 +00:00
Devang Patel	42b4ac7ed3	Speculatively revert r112207. llvm-svn: 112216	2010-08-26 20:33:42 +00:00
Devang Patel	977057f481	80 col. llvm-svn: 112215	2010-08-26 20:32:32 +00:00
Devang Patel	384fa91deb	Update DanglingDebugInfo so that it can be used to track llvm.dbg.declare also. llvm-svn: 112213	2010-08-26 20:06:46 +00:00
Bob Wilson	97919e9c59	Use pseudo instructions for VST3. llvm-svn: 112208	2010-08-26 18:51:29 +00:00
Devang Patel	ab596a637c	Donot forget to resolve dangling debug info in a case where virtual register, used for a value, is initialized after a dbg intrinsic is seen. llvm-svn: 112207	2010-08-26 18:36:14 +00:00
Bill Wendling	a9c03f4fae	Reapply r112176 without removing the other CMN patterns (that was unintentional). llvm-svn: 112206	2010-08-26 18:33:51 +00:00
Dan Gohman	f34728a95c	Experimental clang-based code-completion support for vim. This currently depends on some clang patches which are not yet upstream. llvm-svn: 112204	2010-08-26 18:12:22 +00:00
Benjamin Kramer	2c45f431fa	MCELF: Fix a thinko of mine. llvm-svn: 112203	2010-08-26 18:12:04 +00:00
Bob Wilson	a967c42a3d	Fix comment typos. llvm-svn: 112202	2010-08-26 18:08:11 +00:00
Devang Patel	7b888fbd91	Fix prototypes. llvm-svn: 112200	2010-08-26 17:47:45 +00:00
Owen Anderson	bd2ecc7e68	Make JumpThreading smart enough to properly thread StrSwitch when it's compiled with clang++. llvm-svn: 112198	2010-08-26 17:40:24 +00:00
Benjamin Kramer	929cc7618f	MCELF: Compensate for the addend on i386. Patch by Roman Divacky, with some cleanups. llvm-svn: 112197	2010-08-26 17:23:02 +00:00

... 3 4 5 6 7 ...

64323 Commits