llvm-project

Commit Graph

Author	SHA1	Message	Date
Venkatraman Govindaraju	ee347f8091	Remove SPARC backend getpcx instruction's Uses. Also, insert an assert to ensure %o7 is not assigned as the destination of getpcx instruction. llvm-svn: 123304	2011-01-12 03:52:59 +00:00
Chris Lattner	dd5f60b7a7	revert 123144, reenabling the rest of memset formation. llvm-svn: 123302	2011-01-12 03:25:15 +00:00
Venkatraman Govindaraju	3b71b0ae3d	Fix SPARC backend call instruction so that arguments passed through registers are correctly marked as used instead of passing all possible argument registers as used. llvm-svn: 123301	2011-01-12 03:18:21 +00:00
Chris Lattner	654098f411	revert r123146 which disabled code that wasn't the root cause of the bootstrap miscompare issue. llvm-svn: 123299	2011-01-12 01:52:23 +00:00
Chris Lattner	fa7c29d255	revert r123149, reenabling an improvement to memcpyopt that wasn't the source of the bootstrap problem. llvm-svn: 123298	2011-01-12 01:43:46 +00:00
Matt Beaumont-Gay	ea43172297	Prefer llvm_unreachable to assert(0) llvm-svn: 123297	2011-01-12 01:42:42 +00:00
Jason W Kim	9c5b65d289	1. Support ELF pcrel relocations for movw/movt: R_ARM_MOVT_PREL and R_ARM_MOVW_PREL_NC. 2. Fix minor bug in ARMAsmPrinter - treat bitfield flag as a bitfield, not an enum. 3. Add support for 3 new elf section types (no-ops) llvm-svn: 123294	2011-01-12 00:19:25 +00:00
Jason W Kim	1f7bc0707d	Workaround for bug 8721. .s Test added. llvm-svn: 123292	2011-01-11 23:53:41 +00:00
Jakob Stoklund Olesen	43812bfa92	The world is not ready for LiveDebugVariables yet. llvm-svn: 123290	2011-01-11 23:20:33 +00:00
Jakob Stoklund Olesen	12cc296bd4	Remove the PR8954 workaround. llvm-svn: 123288	2011-01-11 22:56:41 +00:00
Jakob Stoklund Olesen	f2407aa98b	Fix a non-deterministic loop in llvm::MergeBlockIntoPredecessor. DT->changeImmediateDominator() trivially ignores identity updates, so there is really no need for the uniqueing provided by SmallPtrSet. I expect this to fix PR8954. llvm-svn: 123286	2011-01-11 22:54:38 +00:00
Jakob Stoklund Olesen	8c98495f43	Enable LiveDebugVariables by default. llvm-svn: 123282	2011-01-11 22:45:28 +00:00
Venkatraman Govindaraju	4d6ade0e31	SPARC backend: correct ICC/FCC uses for ADDX and SELECT_CC llvm-svn: 123281	2011-01-11 22:38:28 +00:00
Cameron Zwarich	cb9c4f85ec	Dial back the speculative fix for PR8954 a bit, so that we only recompute dominators once at the beginning of GVN instead of once per iteration. llvm-svn: 123278	2011-01-11 22:14:42 +00:00
Jakob Stoklund Olesen	803f48bcd1	Don't insert DBG_VALUE instructions after the first terminator. For one, MachineBasicBlock::getFirstTerminator() doesn't understand what is happening, and it also makes sense to have all control flow run through the DBG_VALUE. llvm-svn: 123277	2011-01-11 22:11:16 +00:00
Evan Cheng	e45d685895	Clean up ARM subtarget code by using Triple ADT. llvm-svn: 123276	2011-01-11 21:46:47 +00:00
Devang Patel	447cb38fbe	Appropriately truncate debug info range in dwarf output. This is not yet completely enabled. llvm-svn: 123274	2011-01-11 21:42:10 +00:00
Jakob Stoklund Olesen	819eb4ed0b	Put the Dominator improvements back in. They were not the cause of bootstrap miscomparisons. llvm-svn: 123273	2011-01-11 21:23:09 +00:00
Cameron Zwarich	51eb403907	Attempt to fix the bootstrap buildbot. Rafael says this works for him on x86-64 Linux. llvm-svn: 123270	2011-01-11 20:23:34 +00:00
Jakob Stoklund Olesen	32bd3a1e9a	Speculatively revert the recent improvements to Dominators.h in an attempt to track down the gcc bootstrap miscompare. llvm-svn: 123254	2011-01-11 19:26:30 +00:00
Daniel Dunbar	09264124c1	McARM: Fill in GetMnemonicAcceptInfo(). llvm-svn: 123253	2011-01-11 19:06:29 +00:00
Daniel Dunbar	6492807291	McARM: Write a silly Python script to compute some hard coded info from the generated ARM match table, which is substantially more efficient than dealing with tblgen. llvm-svn: 123252	2011-01-11 19:06:26 +00:00
Owen Anderson	0022a4b417	Remove dead variable, const-ref-ize an APInt. llvm-svn: 123248	2011-01-11 18:26:37 +00:00
Chris Lattner	d41db8f9cb	this pass claims to preserve scev, make sure to tell it about deletions. llvm-svn: 123247	2011-01-11 18:14:50 +00:00
Bob Wilson	e5863d6639	Fix a comment: We now have intrinsics for vcvtr. llvm-svn: 123246	2011-01-11 17:56:41 +00:00
Chris Lattner	d30de95520	some comment improvements. llvm-svn: 123243	2011-01-11 17:11:59 +00:00
Chris Lattner	abd2dfd3dc	Fix PR8946, a missing reg/reg form of movdqu. llvm-svn: 123242	2011-01-11 17:04:55 +00:00
Daniel Dunbar	5a384c86b2	McARM: Sketch some logic for determining when to add carry set and predication code operands based on the "canonical mnemonic". llvm-svn: 123239	2011-01-11 15:59:53 +00:00
Daniel Dunbar	9d944b3fcc	McARM: Add more hard coded logic to SplitMnemonicAndCC to also split out the carry setting flag from the mnemonic. Note that this currently involves me disabling a number of working cases in arm_instructions.s, this is a hopefully short term evil which will be rapidly fixed (and greatly surpassed), assuming my current approach flies. llvm-svn: 123238	2011-01-11 15:59:50 +00:00
Jay Foad	c8adf5f458	FixedNumOperandTraits and VariadicOperandTraits assumed that, given a "this" pointer for any subclass of User, you could static_cast it to User* and then reinterpret_cast that to Use* to get the end of the operand list. This isn't a safe assumption in general, because the static_cast might adjust the "this" pointer. Fixed by having these OperandTraits classes take an extra template parameter, which is the subclass of User. This is groundwork for PR889. llvm-svn: 123235	2011-01-11 15:07:38 +00:00
Frits van Bommel	8e158495f1	Factor the actual simplification out of SimplifyIndirectBrOnSelect and into a new helper function so it can be reused in e.g. an upcoming SimplifySwitchOnSelect. No functional change. llvm-svn: 123234	2011-01-11 12:52:11 +00:00
Oscar Fuentes	c8cb58e130	Add to the CMake build some options and platform tests supported by the traditional build. Patch by arrowdodger! llvm-svn: 123233	2011-01-11 12:31:54 +00:00
Oscar Fuentes	77c4f70123	Made llvm_replace_compiler_option more robust. Use it on llvm_process_sources. llvm-svn: 123232	2011-01-11 12:31:34 +00:00
Kalle Raiskila	a5538cdbf9	Fix a thinko in 123226 that caused test failures on "other" platforms. llvm-svn: 123229	2011-01-11 11:27:56 +00:00
Eric Christopher	31bb4c5811	Revert the testcase from the previous reverted commit. llvm-svn: 123227	2011-01-11 09:20:44 +00:00
Kalle Raiskila	be9ad1e631	Add a "nop filler" pass to SPU. Filling no-ops is done just before emitting of assembly, when the instruction stream is final. No-ops are inserted to align the instructions so the dual-issue of the pipeline is utilized. This speeds up generated code with a minimum of 1% on a select set of algorithms. This pass may be redundant if the instruction scheduler and all subsequent passes that modify the instruction stream (prolog+epilog inserter, register scavenger, are there others?) are made aware of the instruction alignments. llvm-svn: 123226	2011-01-11 09:07:54 +00:00
Eric Christopher	23bf3bafb7	Temporarily revert 123133, it's causing some regressions and I'm trying to get a testcase. llvm-svn: 123225	2011-01-11 09:02:09 +00:00
Chris Lattner	193ce7c4d1	update memdep when an instruction is deleted. This code isn't actually reached in the testcase in PR8954, but it's safe and good practice. llvm-svn: 123224	2011-01-11 08:19:16 +00:00
Chris Lattner	e2523b287c	when MergeBlockIntoPredecessor merges two blocks, update MemDep if it is floating around in the ether. llvm-svn: 123223	2011-01-11 08:16:49 +00:00
Chris Lattner	f6ae904e34	Fix FoldSingleEntryPHINodes to update memdep and AA when it deletes phi nodes. It is called from MergeBlockIntoPredecessor which is called from GVN, which claims to preserve these. I'm skeptical that this is the actual problem behind PR8954, but this is a stab in the right direction. llvm-svn: 123222	2011-01-11 08:13:40 +00:00
Chris Lattner	dfcfcb49fa	random cleanups llvm-svn: 123221	2011-01-11 08:00:40 +00:00
Chris Lattner	054d2a8525	merge tests into one crash.ll test. llvm-svn: 123220	2011-01-11 07:50:07 +00:00
Chris Lattner	63fe78de68	remove a bogus assertion: the latch block of a loop is not neccesarily an uncond branch to the header. This fixes PR8955 (the assertion tripping). llvm-svn: 123219	2011-01-11 07:47:59 +00:00
Chris Lattner	23109cb319	the GEP faq says that only inbounds geps are guaranteed to not overflow. llvm-svn: 123218	2011-01-11 06:44:41 +00:00
Jakob Stoklund Olesen	087f207009	Revert r123207: "Turn on memdep's verifyRemoved() in an attempt to smoke out the cause of our gcc bootstrap miscompare." It didn't. llvm-svn: 123215	2011-01-11 04:05:39 +00:00
Michael J. Spencer	0d771edeee	Support/Path: Deprecate PathV1::isDirectory and replace all uses with PathV2::is_directory. llvm-svn: 123209	2011-01-11 01:21:55 +00:00
Jakob Stoklund Olesen	9b6853efd6	Turn on memdep's verifyRemoved() in an attempt to smoke out the cause of our gcc bootstrap miscompare. llvm-svn: 123207	2011-01-11 01:18:03 +00:00
Chandler Carruth	b1e7f557b7	Teach constant folding to perform conversions from constant floating point values to their integer representation through the SSE intrinsic calls. This is the last part of a README.txt entry for which I have real world examples. llvm-svn: 123206	2011-01-11 01:07:24 +00:00
Chandler Carruth	fdf4969149	FileCheck-ize a test, and move a no-longer calling test case to another file and make it actually test something... llvm-svn: 123205	2011-01-11 01:07:20 +00:00
Owen Anderson	d490c2d2ae	Fix a random missed optimization by making InstCombine more aggressive when determining which bits are demanded by a comparison against a constant. llvm-svn: 123203	2011-01-11 00:36:45 +00:00
Eric Christopher	1bb2c00f65	Move ExpandAtomic into the integer expansion routines - it's only used there. llvm-svn: 123202	2011-01-11 00:36:08 +00:00
Eric Christopher	3904343c68	Even if we don't have 7 bytes of stack space we may need to save and restore the stack pointer from the frame pointer on thumbv6. Fixes rdar://8819685 llvm-svn: 123196	2011-01-11 00:16:04 +00:00
Eric Christopher	d5bbeba8d0	Expand on the safeness of restoring the sp from the fp a bit more. llvm-svn: 123193	2011-01-10 23:10:59 +00:00
Dale Johannesen	d2b48119b0	Fix PR 8916 (qv for analysis), at least the immediate problem. There's an inherent tension in DAGCombine between assuming that things will be put in canonical form, and the Depth mechanism that disables transformations when recursion gets too deep. It would not surprise me if there's a lot of little bugs like this one waiting to be discovered. The mechanism seems fragile and I'd suggest looking at it from a design viewpoint. llvm-svn: 123191	2011-01-10 21:53:07 +00:00
Chris Lattner	78cdd2a6c6	+0.0 vs -0.0 differences can be handled by looking at the user of the operation in some cases. llvm-svn: 123190	2011-01-10 21:01:17 +00:00
Daniel Dunbar	c0e8756ba9	McARM: Flush out hard coded known non-predicated mnemonic list. llvm-svn: 123189	2011-01-10 21:01:03 +00:00
Daniel Dunbar	2d01239fe7	McARM: Mark some T2 ...s instructions as codegen only, they aren't real instructions but are restricted pseudo forms. llvm-svn: 123177	2011-01-10 15:26:39 +00:00
Daniel Dunbar	6e3aedd830	ARM/MC: Mark several '...S' instructions as codegen only, they aren't real instructions but are restricted pseudo forms. llvm-svn: 123176	2011-01-10 15:26:35 +00:00
Daniel Dunbar	2be732ab5f	MC/ARM/AsmParser: Minor nitty fixes. llvm-svn: 123175	2011-01-10 15:26:21 +00:00
Daniel Dunbar	4035383937	MC/AsmMatcher: Fix indirect 80-col viola. llvm-svn: 123174	2011-01-10 15:26:11 +00:00
Anton Korobeynikov	5fb942a307	Fix merge fallout llvm-svn: 123172	2011-01-10 12:56:18 +00:00
Anton Korobeynikov	441ae5b88c	Update CMake stuff llvm-svn: 123171	2011-01-10 12:39:23 +00:00
Anton Korobeynikov	2f93128109	Rename TargetFrameInfo into TargetFrameLowering. Also, put couple of FIXMEs and fixes here and there. llvm-svn: 123170	2011-01-10 12:39:04 +00:00
Daniel Dunbar	876bb0180f	MC/ARM/AsmParser: Split out SplitMnemonicAndCC(). llvm-svn: 123169	2011-01-10 12:24:52 +00:00
Chandler Carruth	352d9b14b3	Cleanup some of the constant folding code to consistently test intrinsic IDs when available rather than using a mixture of IDs and textual name comparisons. llvm-svn: 123165	2011-01-10 09:02:58 +00:00
Chris Lattner	6c8b8dd522	fit in 80 cols and use MBB::isSuccessor instead of a hand rolled std::find. llvm-svn: 123164	2011-01-10 07:51:31 +00:00
Chandler Carruth	cf414cf0a6	Teach instcombine about the rest of the SSE and SSE2 conversion intrinsics element dependencies. Reviewed by Nick. llvm-svn: 123161	2011-01-10 07:19:37 +00:00
Jakob Stoklund Olesen	2fb5b31578	Simplify a bunch of isVirtualRegister() and isPhysicalRegister() logic. These functions not longer assert when passed 0, but simply return false instead. No functional change intended. llvm-svn: 123155	2011-01-10 02:58:51 +00:00
Chandler Carruth	7bb282ebb1	Fold two related tests into the newly FileCheck-ized test, migrating them to FileCheck as well. llvm-svn: 123154	2011-01-10 02:53:58 +00:00
Chandler Carruth	ef7aac5961	Clean up and FileCheck-ize a test. llvm-svn: 123153	2011-01-10 02:53:54 +00:00
Michael J. Spencer	83bd49d4f9	Fix Whitespace. llvm-svn: 123152	2011-01-10 02:34:40 +00:00
Michael J. Spencer	58df2e00b2	Support/Path: Deprecate PathV1::exists and replace all uses with PathV2::fs::exists. llvm-svn: 123151	2011-01-10 02:34:23 +00:00
Chris Lattner	88bc848ab6	another random stab in the dark trying to fix llvm-gcc-i386-linux-selfhost llvm-svn: 123149	2011-01-10 02:34:11 +00:00
Chris Lattner	ec1387cf4b	fix typo llvm-svn: 123148	2011-01-10 02:33:34 +00:00
Chris Lattner	4662bd4b13	another (more) aggressive attempt to bring llvm-gcc-i386-linux-selfhost back to life. llvm-svn: 123146	2011-01-10 00:47:34 +00:00
Chris Lattner	eef1455020	expand on a note llvm-svn: 123145	2011-01-10 00:33:01 +00:00
Chris Lattner	1017fa6746	temporarily disable memset formation from memsets in an effort to restore buildbot stability. llvm-svn: 123144	2011-01-09 23:52:48 +00:00
Chris Lattner	1032965cbe	add a testcase I missed in previous commit. llvm-svn: 123143	2011-01-09 23:52:31 +00:00
Chris Lattner	5b358c6825	typo llvm-svn: 123142	2011-01-09 23:48:41 +00:00
Chris Lattner	320370e3ca	xref a PR # llvm-svn: 123141	2011-01-09 23:42:22 +00:00
Jakob Stoklund Olesen	8786bda222	Remove TargetRegisterInfo::NoRegister. Fix the TargetRegisterInfo::NoRegister places where someone preferred typing 'TargetRegisterInfo::NoRegister' instead of typing '0'. Note that TableGen is already emitting xx::NoRegister in xxGenRegisterNames.inc. llvm-svn: 123140	2011-01-09 23:20:48 +00:00
Chris Lattner	67f82314af	add a fixme: ir isn't expressive enough. llvm-svn: 123139	2011-01-09 23:02:10 +00:00
Chris Lattner	28f140a33e	Step #4 in improving trip count analysis: HowFarToZero can analyze NUW AddRec's much more aggressively. We now get a trip count for @test2 in nsw.ll llvm-svn: 123138	2011-01-09 22:58:47 +00:00
Jakob Stoklund Olesen	8828e482b6	Change virtual register numbering to make more space for physical registers. The numbering plan is now: 0 NoRegister. [1;2^30) Physical registers. [2^30;2^31) Stack slots. [2^31;2^32) Virtual registers. (With -1u and -2u used by DenseMapInfo.) Each segment is filled from the left, so any mistaken interpretation should quickly cause crashes. FirstVirtualRegister has been removed. TargetRegisterInfo provides predicates conversion functions that should be used instead of interpreting register numbers manually. It is now legal to pass NoRegister to isPhysicalRegister() and isVirtualRegister(). The result is false in both cases. It is quite rare to represent stack slots in this way, so isPhysicalRegister() and isVirtualRegister() require that isStackSlot() be checked first if it can possibly return true. This allows a very fast implementation of the common predicates. llvm-svn: 123137	2011-01-09 22:42:48 +00:00
Chris Lattner	dff679f4b6	rearrange some code, no functionality change. llvm-svn: 123136	2011-01-09 22:39:48 +00:00
Chandler Carruth	d011d5317c	Add a note about the inability to model FP -> int conversions which perform rounding other than truncation in the IR. Common C code for this turns into really an LLVM intrinsic call that blocks a lot of further optimizations. llvm-svn: 123135	2011-01-09 22:36:18 +00:00
Chris Lattner	a44274cb4f	Step #3 to improving trip count analysis: If we fold a + {b,+,stride} into {a+b,+,stride} (because a is LIV), then the resultant AddRec is NUW/NSW if the client says it is. llvm-svn: 123133	2011-01-09 22:31:26 +00:00
Chris Lattner	fc87752d55	Step #2 to improve trip count analysis for loops like this: void f(int* begin, int* end) { std::fill(begin, end, 0); } which turns into a != exit expression where one pointer is strided and (thanks to step #1) known to not overflow, and the other is loop invariant. The observation here is that, though the IV is strided by 4 in this case, that the IV has to become equal to the end value. It cannot "miss" the end value by stepping over it, because if it did, the strided IV expression would eventually wrap around. Handle this by turning A != B into "A-B != 0" where the A-B part is known to be NUW. llvm-svn: 123131	2011-01-09 22:26:35 +00:00
Jakob Stoklund Olesen	d82ac37594	Remove MachineRegisterInfo::getLastVirtReg(), it was giving wrong results when no virtual registers have been allocated. It was only used to resize IndexedMaps, so provide an IndexedMap::resize() method such that Map.grow(MRI.getLastVirtReg()); can be replaced with the simpler Map.resize(MRI.getNumVirtRegs()); This works correctly when no virtuals are allocated, and it bypasses the to/from index conversions. llvm-svn: 123130	2011-01-09 21:58:20 +00:00
Chris Lattner	878665b4bc	sort this. llvm-svn: 123129	2011-01-09 21:31:39 +00:00
Jakob Stoklund Olesen	b83a6b23dc	Teach TargetRegisterInfo how to cram stack slot indexes in with the virtual and physical register numbers. This makes the hack used in LiveInterval official, and lets LiveInterval be oblivious of stack slots. The isPhysicalRegister() and isVirtualRegister() predicates don't know about this, so when a variable may contain a stack slot, isStackSlot() should always be tested first. llvm-svn: 123128	2011-01-09 21:17:37 +00:00
Chandler Carruth	0c68a668fa	Add a note about a missed FP optimization. llvm-svn: 123126	2011-01-09 21:00:19 +00:00
Jakob Stoklund Olesen	8145f85633	Fix comment. llvm-svn: 123125	2011-01-09 19:45:45 +00:00
Chris Lattner	caf5c0d037	fix a few old bugs (found by inspection) where we would zap instructions without informing memdep. This could cause nondeterminstic weirdness based on where instructions happen to get allocated, and will hopefully breath some life into some broken testers. llvm-svn: 123124	2011-01-09 19:26:10 +00:00
Jakob Stoklund Olesen	d65524da0f	Add a forgotten VireReg2IndexFunctor. llvm-svn: 123123	2011-01-09 18:58:33 +00:00
Oscar Fuentes	45539edad2	Apply -fPIC to C sources too. llvm-svn: 123122	2011-01-09 17:38:31 +00:00
Tobias Grosser	cc21c4aa98	Instcombine: Fix pattern where the sext did not dominate the icmp using it llvm-svn: 123121	2011-01-09 16:00:11 +00:00
Tobias Grosser	bc453f6e18	DominatorTree->print() now prints the status of the DFSNumbers correctly llvm-svn: 123120	2011-01-09 16:00:09 +00:00
Oscar Fuentes	edfc184222	Rewrite handling of LLVM_ENABLE_PIC. It was being processed after config.h was generated, so it had no effect on it. Thanks to arrowdodger for pointing out this and a tentative patch. llvm-svn: 123119	2011-01-09 14:34:39 +00:00
Cameron Zwarich	a42e5915bf	LoopInstSimplify preserves LoopSimplify. llvm-svn: 123117	2011-01-09 12:35:16 +00:00
Chandler Carruth	82e6f6a325	Another missed memset in std::vector initialization. llvm-svn: 123116	2011-01-09 11:29:57 +00:00
Cameron Zwarich	8a00d8175b	Eliminate some extra hash table lookups. llvm-svn: 123115	2011-01-09 10:54:21 +00:00
Cameron Zwarich	a5910d1b31	Add an informative comment. llvm-svn: 123114	2011-01-09 10:32:30 +00:00
Chandler Carruth	43f6d1b67e	Fix a cut-paste-o so that the sample code is correct for my last note. Also, switch to a more clear 'sink' function with its declaration to avoid any confusion about 'g'. Thanks for the suggestion Frits. llvm-svn: 123113	2011-01-09 10:10:59 +00:00
Chandler Carruth	ad6e1f0501	Another missed optimization of trivial vector code. llvm-svn: 123112	2011-01-09 09:58:36 +00:00
Chandler Carruth	f32619300a	Add a note about vector's size-constructor producing dead stores. llvm-svn: 123111	2011-01-09 09:58:33 +00:00
Jakob Stoklund Olesen	9adf5e09cd	Simplify LiveDebugVariables by storing MachineOperand copies locations instead of using a Location class with the same information. When making a copy of a MachineOperand that was already stored in a MachineInstr, it is necessary to clear the parent pointer on the copy. Otherwise the register use-def lists become inconsistent. Add MachineOperand::clearParent() to do that. An alternative would be a custom MachineOperand copy constructor that cleared ParentMI. I didn't want to do that because of the performance impact. llvm-svn: 123109	2011-01-09 05:33:21 +00:00
Jakob Stoklund Olesen	3a9e5c29c8	Shrink a BitVector that didn't mean to store bits for all physical registers. llvm-svn: 123108	2011-01-09 03:45:44 +00:00
Jakob Stoklund Olesen	1331a15b0c	Replace TargetRegisterInfo::printReg with a PrintReg class that also works without a TRI instance. Print virtual registers numbered from 0 instead of the arbitrary FirstVirtualRegister. The first virtual register is printed as %vreg0. TRI::NoRegister is printed as %noreg. llvm-svn: 123107	2011-01-09 03:05:53 +00:00
Jakob Stoklund Olesen	7f93d8d62c	Use IndexedMap for MachineRegisterInfo as well. No functional change. llvm-svn: 123106	2011-01-09 03:05:46 +00:00
Chris Lattner	10223a3fbf	teach SCEV analysis of PHI nodes that PHI recurences formed with GEP instructions are always NUW, because PHIs cannot wrap the end of the address space. llvm-svn: 123105	2011-01-09 02:28:48 +00:00
Chris Lattner	a337f5ec5c	reduce indentation. Print <nuw> and <nsw> when dumping SCEV AddRec's that have the bit set. llvm-svn: 123104	2011-01-09 02:16:18 +00:00
Chandler Carruth	5d684c17a7	Add a note about a missed memset optimization from std::fill. llvm-svn: 123103	2011-01-09 01:32:55 +00:00
Jakob Stoklund Olesen	4a7b48d5f4	Fix the last virtual register enumerations. llvm-svn: 123102	2011-01-08 23:11:11 +00:00
Jakob Stoklund Olesen	cf4d5ced0f	Fix VirtRegMap to use TRI::index2VirtReg and TRI::virtReg2Index instead of depending on TRI::FirstVirtualRegister. Also use TRI::printReg instead of printing virtual registers directly. llvm-svn: 123101	2011-01-08 23:11:07 +00:00
Jakob Stoklund Olesen	6ff70ad356	Fix a MachineVerifier loop that probably didn't mean to skip the last two virtual registers. llvm-svn: 123100	2011-01-08 23:11:02 +00:00
Jakob Stoklund Olesen	d3438eb27d	Don't document exactly how virtual registers are represented as integers. Code shouldn't depend directly on that. Give an example of how to iterate over all virtual registers in a function without depending on the representation. llvm-svn: 123099	2011-01-08 23:10:59 +00:00
Jakob Stoklund Olesen	28d76692b6	Use an IndexedMap for LiveVariables::VirtRegInfo. Provide MRI::getNumVirtRegs() and TRI::index2VirtReg() functions to allow iteration over virtual registers without depending on the representation of virtual register numbers. llvm-svn: 123098	2011-01-08 23:10:57 +00:00
Jakob Stoklund Olesen	a1e03cfba7	Do not talk about TargetRegisterInfo::FirstVirtualRegister. llvm-svn: 123097	2011-01-08 23:10:53 +00:00
Jakob Stoklund Olesen	793d7b7626	Use an IndexedMap for LiveOutRegInfo to hide its dependence on TargetRegisterInfo::FirstVirtualRegister. llvm-svn: 123096	2011-01-08 23:10:50 +00:00
Cameron Zwarich	0939bc3709	Fix coding style. llvm-svn: 123093	2011-01-08 22:36:53 +00:00
Chris Lattner	7d6433ae76	fix a latent bug in memcpyoptimizer that my recent patches exposed: it wasn't updating memdep when fusing stores together. This fixes the crash optimizing the bullet benchmark. llvm-svn: 123091	2011-01-08 22:19:21 +00:00
Chris Lattner	ff6ed2ac5f	tryMergingIntoMemset can only handle constant length memsets. llvm-svn: 123090	2011-01-08 22:11:56 +00:00
Chris Lattner	9a1d63ba9f	Merge memsets followed by neighboring memsets and other stores into larger memsets. Among other things, this fixes rdar://8760394 and allows us to handle "Example 2" from http://blog.regehr.org/archives/320, compiling it into a single 4096-byte memset: _mad_synth_mute: ## @mad_synth_mute ## BB#0: ## %entry pushq %rax movl $4096, %esi ## imm = 0x1000 callq ___bzero popq %rax ret llvm-svn: 123089	2011-01-08 21:19:19 +00:00
Chris Lattner	5120ebf184	fix an issue in IsPointerOffset that prevented us from recognizing that P and P+1 are relative to the same base pointer. llvm-svn: 123087	2011-01-08 21:07:56 +00:00
Chris Lattner	4dc1fd938f	enhance memcpyopt to merge a store and a subsequent memset into a single larger memset. llvm-svn: 123086	2011-01-08 20:54:51 +00:00
Chris Lattner	2f2c3351e1	fit in 80 cols llvm-svn: 123085	2011-01-08 20:53:41 +00:00
Chris Lattner	9dbbc49f74	merge two tests and filecheckify llvm-svn: 123082	2011-01-08 20:27:22 +00:00
Chris Lattner	c638147e9f	constify TargetData references. Split memset formation logic out into its own "tryMergingIntoMemset" helper function. llvm-svn: 123081	2011-01-08 20:24:01 +00:00
Chris Lattner	59c82f850d	When loop rotation happens, it is very common for the duplicated condbr to be foldable into an uncond branch. When this happens, we can make a much simpler CFG for the loop, which is important for nested loop cases where we want the outer loop to be aggressively optimized. Handle this case more aggressively. For example, previously on phi-duplicate.ll we would get this: define void @test(i32 %N, double* %G) nounwind ssp { entry: %cmp1 = icmp slt i64 1, 1000 br i1 %cmp1, label %bb.nph, label %for.end bb.nph: ; preds = %entry br label %for.body for.body: ; preds = %bb.nph, %for.cond %j.02 = phi i64 [ 1, %bb.nph ], [ %inc, %for.cond ] %arrayidx = getelementptr inbounds double* %G, i64 %j.02 %tmp3 = load double* %arrayidx %sub = sub i64 %j.02, 1 %arrayidx6 = getelementptr inbounds double* %G, i64 %sub %tmp7 = load double* %arrayidx6 %add = fadd double %tmp3, %tmp7 %arrayidx10 = getelementptr inbounds double* %G, i64 %j.02 store double %add, double* %arrayidx10 %inc = add nsw i64 %j.02, 1 br label %for.cond for.cond: ; preds = %for.body %cmp = icmp slt i64 %inc, 1000 br i1 %cmp, label %for.body, label %for.cond.for.end_crit_edge for.cond.for.end_crit_edge: ; preds = %for.cond br label %for.end for.end: ; preds = %for.cond.for.end_crit_edge, %entry ret void } Now we get the much nicer: define void @test(i32 %N, double* %G) nounwind ssp { entry: br label %for.body for.body: ; preds = %entry, %for.body %j.01 = phi i64 [ 1, %entry ], [ %inc, %for.body ] %arrayidx = getelementptr inbounds double* %G, i64 %j.01 %tmp3 = load double* %arrayidx %sub = sub i64 %j.01, 1 %arrayidx6 = getelementptr inbounds double* %G, i64 %sub %tmp7 = load double* %arrayidx6 %add = fadd double %tmp3, %tmp7 %arrayidx10 = getelementptr inbounds double* %G, i64 %j.01 store double %add, double* %arrayidx10 %inc = add nsw i64 %j.01, 1 %cmp = icmp slt i64 %inc, 1000 br i1 %cmp, label %for.body, label %for.end for.end: ; preds = %for.body ret void } With all of these recent changes, we are now able to compile: void foo(char X) { for (int i = 0; i != 100; ++i) for (int j = 0; j != 100; ++j) X[j+i100] = 0; } into a single memset of 10000 bytes. This series of changes should also be helpful for other nested loop scenarios as well. llvm-svn: 123079	2011-01-08 19:59:06 +00:00
Chris Lattner	5f7734c4a5	make domtree verification print something useful on failure. llvm-svn: 123078	2011-01-08 19:55:55 +00:00
Chris Lattner	30f318e5d1	split ssa updating code out to its own helper function. Don't bother moving the OrigHeader block anymore: we just merge it away anyway so its code layout doesn't matter. llvm-svn: 123077	2011-01-08 19:26:33 +00:00
Chris Lattner	2615130e1d	Implement a TODO: Enhance loopinfo to merge away the unconditional branch that it was leaving in loops after rotation (between the original latch block and the original header. With this change, it is possible for rotated loops to have just a single basic block, which is useful. llvm-svn: 123075	2011-01-08 19:10:28 +00:00
Chris Lattner	930b716e1b	various code cleanups, enhance MergeBlockIntoPredecessor to preserve loop info. llvm-svn: 123074	2011-01-08 19:08:40 +00:00
Chris Lattner	fee37c5fa3	inline preserveCanonicalLoopForm now that it is simple. llvm-svn: 123073	2011-01-08 18:55:50 +00:00
Chris Lattner	063dca0f6a	Three major changes: 1. Rip out LoopRotate's domfrontier updating code. It isn't needed now that LICM doesn't use DF and it is super complex and gross. 2. Make DomTree updating code a lot simpler and faster. The old loop over all the blocks was just to find a block?? 3. Change the code that inserts the new preheader to just use SplitCriticalEdge instead of doing an overcomplex reimplementation of it. No behavior change, except for the name of the inserted preheader. llvm-svn: 123072	2011-01-08 18:52:51 +00:00
Chris Lattner	30d95f9f87	reduce nesting. llvm-svn: 123071	2011-01-08 18:47:43 +00:00
Francois Pichet	7c9eab8fef	On Windows, replace each occurrence of '\' by '\\' on the replacement string. This is necessary to prevent re.sub from replacing escape sequences occurring in path. For example: llvm\tools\clang\test was replaced by llvm <tab> ools\clang <tab> est llvm-svn: 123070	2011-01-08 18:09:48 +00:00
Chris Lattner	7fab23bc1d	LoopRotate requires canonical loop form, so it always has preheaders and latch blocks. Reorder entry conditions to make hte pass faster and more logical. llvm-svn: 123069	2011-01-08 18:06:22 +00:00
Chris Lattner	d62691f4e8	use the LI ivar. llvm-svn: 123068	2011-01-08 17:49:51 +00:00
Chris Lattner	385f2ec6d8	some cleanups: remove dead arguments and eliminate ivars that are just passed to one function. llvm-svn: 123067	2011-01-08 17:48:33 +00:00
Chris Lattner	25ba40a0cc	fix an issue duncan pointed out, which could cause loop rotate to violate LCSSA form llvm-svn: 123066	2011-01-08 17:38:45 +00:00
Cameron Zwarich	b4ab257bcc	Fix coding style issues. llvm-svn: 123065	2011-01-08 17:07:11 +00:00
Cameron Zwarich	84986b298a	Make more passes preserve dominators (or state that they preserve dominators if they all ready do). This removes two dominator recomputations prior to isel, which is a 1% improvement in total llc time for 403.gcc. The only potentially suspect thing is making GCStrategy recompute dominators if it used a custom lowering strategy. llvm-svn: 123064	2011-01-08 17:01:52 +00:00
Rafael Espindola	45e6c195d7	First step in fixing PR8927: Add a unnamed_addr bit to global variables and functions. This will be used to indicate that the address is not significant and therefore the constant or function can be merged with others. If an optimization pass can show that an address is not used, it can set this. Examples of things that can have this set by the FE are globals created to hold string literals and C++ constructors. Adding unnamed_addr to a non-const global should have no effect unless an optimization can transform that global into a constant. Aliases are not allowed to have unnamed_addr since I couldn't figure out any use for it. llvm-svn: 123063	2011-01-08 16:42:36 +00:00
Cameron Zwarich	80bd9af7c5	Contract subloop bodies. However, it is still important to visit the phis at the top of subloop headers, as the phi uses logically occur outside of the subloop. llvm-svn: 123062	2011-01-08 15:52:22 +00:00
Frits van Bommel	6a1fb8f235	Fix a bug in r123034 (trying to sext/zext non-integers) and clean up a little. llvm-svn: 123061	2011-01-08 10:51:36 +00:00
Chris Lattner	8c5defd0b0	Have loop-rotate simplify instructions (yay instsimplify!) as it clones them into the loop preheader, eliminating silly instructions like "icmp i32 0, 100" in fixed tripcount loops. This also better exposes the bigger problem with loop rotate that I'd like to fix: once this has been folded, the duplicated conditional branch often turns into an uncond branch. Not aggressively handling this is pessimizing later loop optimizations somethin' fierce by making "dominates all exit blocks" checks fail. llvm-svn: 123060	2011-01-08 08:24:46 +00:00
Chris Lattner	75c82cb594	make this file properly self contained. llvm-svn: 123059	2011-01-08 08:19:49 +00:00
Chris Lattner	43f8d16482	Revamp the ValueMapper interfaces in a couple ways: 1. Take a flags argument instead of a bool. This makes it more clear to the reader what it is used for. 2. Add a flag that says that "remapping a value not in the map is ok". 3. Reimplement MapValue to share a bunch of code and be a lot more efficient. For lookup failures, don't drop null values into the map. 4. Using the new flag a bunch of code can vaporize in LinkModules and LoopUnswitch, kill it. No functionality change. llvm-svn: 123058	2011-01-08 08:15:20 +00:00
Chris Lattner	2b3f20e6ec	two minor changes: switch to the standard ValueToValueMapTy map from ValueMapper.h (giving us access to its utilities) and add a fastpath in the loop rotation code, avoiding expensive ssa updator manipulation for values with nothing to update. llvm-svn: 123057	2011-01-08 07:21:31 +00:00
Eric Christopher	46779e1983	I don't think I could find a 10.2.x box if I tried. llvm-svn: 123051	2011-01-08 01:52:20 +00:00
Evan Cheng	078b0b095e	Recognize inline asm 'rev /bin/bash, ' as a bswap intrinsic call. llvm-svn: 123048	2011-01-08 01:24:27 +00:00
Evan Cheng	6eb516dbea	Do not model all INLINEASM instructions as having unmodelled side effects. Instead encode llvm IR level property "HasSideEffects" in an operand (shared with IsAlignStack). Added MachineInstrs::hasUnmodeledSideEffects() to check the operand when the instruction is an INLINEASM. This allows memory instructions to be moved around INLINEASM instructions. llvm-svn: 123044	2011-01-07 23:50:32 +00:00
Bob Wilson	006089b761	Use __builtin_shufflevector to implement vget_low and vget_high intrinsics. This was suggested by Edmund Grimley Evans in pr8411. llvm-svn: 123043	2011-01-07 23:40:49 +00:00
Bob Wilson	3fa9c064c2	Add an explanatory message for an assertion. llvm-svn: 123042	2011-01-07 23:40:46 +00:00
Matt Beaumont-Gay	5cc7a1fcad	Eliminate variable only used in debug builds. llvm-svn: 123040	2011-01-07 22:34:58 +00:00
Devang Patel	acbee0b0d9	Speculatively revert r123032. llvm-svn: 123039	2011-01-07 22:33:41 +00:00
Devang Patel	39bd6cebcd	Do not include DataTypes.h in llvm-c/lto.h. This means avoid using uint32_t. This patch reverts r112200 and fixes original problem by fixing argument type in lto.cpp. llvm-svn: 123038	2011-01-07 22:26:25 +00:00
Evan Cheng	36f0a06593	Fix comment. INLINEASM node operand #3 is IsAlignStack bit. llvm-svn: 123036	2011-01-07 21:38:59 +00:00
Bob Wilson	6f2b8966ca	Lower some BUILD_VECTORS using VEXT+shuffle. Patch by Tim Northover. llvm-svn: 123035	2011-01-07 21:37:30 +00:00
Tobias Grosser	fc3d7f664b	InstCombine: Match min/max hidden by sext/zext X = sext x; x >s c ? X : C+1 --> X = sext x; X <s C+1 ? C+1 : X X = sext x; x <s c ? X : C-1 --> X = sext x; X >s C-1 ? C-1 : X X = zext x; x >u c ? X : C+1 --> X = zext x; X <u C+1 ? C+1 : X X = zext x; x <u c ? X : C-1 --> X = zext x; X >u C-1 ? C-1 : X X = sext x; x >u c ? X : C+1 --> X = sext x; X <u C+1 ? C+1 : X X = sext x; x <u c ? X : C-1 --> X = sext x; X >u C-1 ? C-1 : X Instead of calculating this with mixed types promote all to the larger type. This enables scalar evolution to analyze this expression. PR8866 llvm-svn: 123034	2011-01-07 21:33:14 +00:00
Tobias Grosser	411e6eedff	Some whitespace fixes llvm-svn: 123033	2011-01-07 21:33:13 +00:00
Devang Patel	6381e1584c	Appropriately truncate debug info range in dwarf output. Enable live debug variables pass. llvm-svn: 123032	2011-01-07 21:30:41 +00:00
Evan Cheng	0638c20e7c	DBG_VALUE does not have any side effects; it also makes no sense to mark it cheap as a copy. llvm-svn: 123031	2011-01-07 21:08:26 +00:00
Benjamin Kramer	134cde912a	Revert 122959, it needs more thought. Add it back to README.txt with additional notes. llvm-svn: 123030	2011-01-07 20:42:20 +00:00
Oscar Fuentes	9bf259581c	Don't use -O3 on Mingw, as people report it as unreliable. Use -O2 instead. llvm-svn: 123028	2011-01-07 20:31:03 +00:00
Jay Foad	d81f3c9659	Simplify the allocation and freeing of Users' operand lists, now that every BranchInst has a fixed number of operands. llvm-svn: 123027	2011-01-07 20:29:02 +00:00
Jay Foad	814f1bb8e3	Remove the "ugly" method BranchInst::setUnconditionalDest(). llvm-svn: 123026	2011-01-07 20:26:51 +00:00
Jay Foad	89afb43b1e	Remove all uses of the "ugly" method BranchInst::setUnconditionalDest(). llvm-svn: 123025	2011-01-07 20:25:56 +00:00
Evan Cheng	a048c83fe4	Revert r122955. It seems using movups to lower memcpy can cause massive regression (even on Nehalem) in edge cases. I also didn't see any real performance benefit. llvm-svn: 123015	2011-01-07 19:35:30 +00:00
David Greene	2f7cf7fcb4	Rename lisp-like functions as suggested by Gabor Greif as loooong time ago. This is both easier to learn and easier to read. llvm-svn: 123001	2011-01-07 17:05:37 +00:00
Benjamin Kramer	1ec7ecce86	Try to unbreak the arm buildbot. llvm-svn: 122999	2011-01-07 11:35:21 +00:00
Bob Wilson	99da75c17d	Add testcases for PR8411 (vget_low and vget_high implemented as shuffles). llvm-svn: 122997	2011-01-07 06:44:14 +00:00
Bob Wilson	8265d56638	Add ARM patterns to match EXTRACT_SUBVECTOR nodes. Also fix an off-by-one in SelectionDAGBuilder that was preventing shuffle vectors from being translated to EXTRACT_SUBVECTOR. Patch by Tim Northover. The test changes are needed to keep those spill-q tests from testing aligned spills and restores. If the only aligned stack objects are spill slots, we no longer realign the stack frame. Prior to this patch, an EXTRACT_SUBVECTOR was legalized by loading from the stack, which created an aligned frame index. Now, however, there is nothing except the spill slot in the stack frame, so I added an aligned alloca. llvm-svn: 122995	2011-01-07 04:59:04 +00:00
Bob Wilson	d23b3d2dfc	Fix a comment typo. llvm-svn: 122994	2011-01-07 04:58:58 +00:00
Bob Wilson	f291cb268f	Change EXTRACT_SUBVECTOR to require a constant index. We were never generating any of these nodes with variable indices, and there was one legalizer function asserting on a non-constant index. If we ever have a need to support variable indices, we can add this back again. llvm-svn: 122993	2011-01-07 04:58:56 +00:00
Bill Wendling	34e2bc0f08	Early exit if we don't have invokes. The 'Unwinds' vector isn't modified unless we have invokes, so there is no functionality change here. llvm-svn: 122990	2011-01-07 02:54:45 +00:00
Duncan Sands	61c5708b51	Fix the other problem reported in PR8582. Testcase and patch by Nadav Rotem. llvm-svn: 122983	2011-01-06 23:45:22 +00:00
Duncan Sands	64b75da088	Add a testcase for PR8582, which mysteriously fixed itself, in case the problem comes back some day. llvm-svn: 122982	2011-01-06 23:04:29 +00:00
Eric Christopher	e516af753b	Add some fairly duplicated code to let type legalization split illegal typed atomics. This will lower exclusively to libcalls at the moment. llvm-svn: 122979	2011-01-06 22:28:56 +00:00
Chris Lattner	84184b7207	With Benjamin's recent amazing patches, we should be able to do even better things :) llvm-svn: 122978	2011-01-06 22:25:00 +00:00
Chris Lattner	171608e738	use isNullValue() to simplify code, add an assert. llvm-svn: 122977	2011-01-06 22:24:29 +00:00
Devang Patel	70eb982843	Emit 128 bit constant. This fixes PR 8913 crash. llvm-svn: 122971	2011-01-06 21:39:25 +00:00
Bob Wilson	914df82a2e	PR8921: LDM/POP do not support interworking prior to v5t. llvm-svn: 122970	2011-01-06 19:24:41 +00:00
Bob Wilson	e0bafd93b0	Remove extra whitespace. llvm-svn: 122969	2011-01-06 19:24:36 +00:00
Bob Wilson	7c2c626805	Fix comment typo. llvm-svn: 122968	2011-01-06 19:24:32 +00:00
Benjamin Kramer	1e01ade2e8	Add a note from llvmdev, this time with more info. llvm-svn: 122966	2011-01-06 17:35:50 +00:00
Abramo Bagnara	a41d7aebee	Fixed parsing of hex floats. llvm-svn: 122963	2011-01-06 16:55:14 +00:00
Rafael Espindola	9f9a10691a	Correctly disassemble truncated asm. Patch by Richard Simth. llvm-svn: 122962	2011-01-06 16:48:42 +00:00
Benjamin Kramer	ae67cc13a9	InstCombine: Turn _chk functions into the "unsafe" variant if length and max langth are equal. This happens when we take the (non-constant) length from a malloc. llvm-svn: 122961	2011-01-06 14:22:52 +00:00
Benjamin Kramer	605f21a6c8	EarlyCSE does this now (and GVN always did it). llvm-svn: 122960	2011-01-06 13:19:46 +00:00
Benjamin Kramer	799b011276	InstCombine: If we call llvm.objectsize on a malloc call we can replace it with the size passed to malloc. llvm-svn: 122959	2011-01-06 13:11:05 +00:00
Benjamin Kramer	a76cc117e0	InstCombine: Teach llvm.objectsize folding to look through GEPs. llvm-svn: 122958	2011-01-06 13:07:49 +00:00
Benjamin Kramer	3aa955e906	Remove dead code and silence warnings. llvm-svn: 122957	2011-01-06 13:01:02 +00:00
Evan Cheng	7998b1d6fe	Use movups to lower memcpy and memset even if it's not fast (like corei7). The theory is it's still faster than a pair of movq / a quad of movl. This will probably hurt older chips like P4 but should run faster on current and future Intel processors. rdar://8817010 llvm-svn: 122955	2011-01-06 07:58:36 +00:00
Chris Lattner	245de78e06	add a note about object size from drystone, add a poorly optimized loop from 179.art. llvm-svn: 122954	2011-01-06 07:41:22 +00:00
Chris Lattner	73552c2cce	add a trivial instcombine missed in Dhrystone llvm-svn: 122953	2011-01-06 07:09:23 +00:00
Evan Cheng	3ae2b79aa3	Re-implement r122936 with proper target hooks. Now getMaxStoresPerMemcpy etc. takes an option OptSize. If OptSize is true, it would return the inline limit for functions with attribute OptSize. llvm-svn: 122952	2011-01-06 06:52:41 +00:00
Chris Lattner	5858e091a6	implement constant folding support for an exotic constant expr: ret i64 ptrtoint (i8* getelementptr ([1000 x i8]* @X, i64 1, i64 sub (i64 0, i64 ptrtoint ([1000 x i8]* @X to i64))) to i64) to "ret i64 1000". This allows us to correctly compute the trip count on a loop in PR8883, which occurs with std::fill on a char array. This allows us to transform it into a memset with a constant size. llvm-svn: 122950	2011-01-06 06:19:46 +00:00

... 2 3 4 5 6 ...

69215 Commits