llvm-project

Commit Graph

Author	SHA1	Message	Date
Nadav Rotem	d908ddc186	Improve the loading of load-anyext vectors by allowing the codegen to load multiple scalars and insert them into a vector. Next, we shuffle the elements into the correct places, as before. Also fix a small dagcombine bug in SimplifyBinOpWithSameOpcodeHands, when the migration of bitcasts happened too late in the SelectionDAG process. llvm-svn: 159991	2012-07-10 13:25:08 +00:00
Chandler Carruth	e18614dd17	Add an efficient merge operation to LiveInterval and use it to avoid quadratic behavior when performing pathological merges. Fixes the core element of PR12652. There is only one user of addRangeFrom left: join. I'm hoping to refactor further in a future patch and have join use this merge operation as well. llvm-svn: 159982	2012-07-10 05:16:17 +00:00
Chandler Carruth	ac766b9b42	Teach LiveIntervals how to verify themselves and start using it in some of the trick merge routines. This adds a layer of testing that was necessary when implementing more efficient (and complex) merge logic for this datastructure. No functionality changed here. llvm-svn: 159981	2012-07-10 05:06:03 +00:00
Andrew Trick	c50f06487c	indentation llvm-svn: 159958	2012-07-09 20:43:01 +00:00
Owen Anderson	d4b841f8f9	Teach the DAG combiner to turn sitofp/uitofp from i1 into a conditional move, since there are only two possible values. Previously, this would become an integer extension operation, followed by a real integer->float conversion. llvm-svn: 159957	2012-07-09 20:31:12 +00:00
Andrew Trick	87255e340e	I'm introducing a new machine model to simultaneously allow simple subtarget CPU descriptions and support new features of MachineScheduler. MachineModel has three categories of data: 1) Basic properties for coarse grained instruction cost model. 2) Scheduler Read/Write resources for simple per-opcode and operand cost model (TBD). 3) Instruction itineraties for detailed per-cycle reservation tables. These will all live side-by-side. Any subtarget can use any combination of them. Instruction itineraries will not change in the near term. In the long run, I expect them to only be relevant for in-order VLIW machines that have complex contraints and require a precise scheduling/bundling model. Once itineraries are only actively used by VLIW-ish targets, they could be replaced by something more appropriate for those targets. This tablegen backend rewrite sets things up for introducing MachineModel type #2: per opcode/operand cost model. llvm-svn: 159891	2012-07-07 04:00:00 +00:00
Chad Rosier	879c34f45a	Whitespace. llvm-svn: 159839	2012-07-06 17:44:22 +00:00
Chad Rosier	88d53eae56	[fast-isel] Tell fast-isel to do nothing with the new donothing intrinsic. llvm-svn: 159837	2012-07-06 17:33:39 +00:00
Alexey Samsonov	39602781f6	Fix PR13202 and a regtest. DwarfDebug class could generate the same (inlined) DIVariable twice: 1) when trying to find abstract debug variable for a concrete inlined instance. 2) when explicitly collecting info for variables that were optimized out. This change makes sure that this duplication won't happen and makes Clang pass "gdb.opt/inline-locals" test from gdb testsuite. Reviewed by Eric Christopher. llvm-svn: 159811	2012-07-06 08:45:08 +00:00
Jakob Stoklund Olesen	3f1bb93cab	Add some comments suggested in code review. llvm-svn: 159800	2012-07-06 02:31:22 +00:00
Chandler Carruth	1088676476	Optimize extendIntervalEndTo a tiny bit by saving one call through the vector erase. No functionality changed. llvm-svn: 159746	2012-07-05 12:40:45 +00:00
Chandler Carruth	264854f9a0	Finish fixing the MachineOperand hashing, providing a nice modern hash_value overload for MachineOperands. This addresses a FIXME sufficient for me to remove it, and cleans up the code nicely too. The important changes to the hashing logic: - TargetFlags are now included in all of the hashes. These were complete missed. - Register operands have their subregisters and whether they are a def included in the hash. - We now actually hash all of the operand types. Previously, many operand types were simply dropped on the floor. For example: - Floating point immediates - Large integer immediates (>64-bit) - External globals! - Register masks - Metadata operands - It removes the offset from the block-address hash; I'm a bit suspicious of this, but isIdenticalTo doesn't consider the offset for black addresses. Any patterns involving these entities could have triggered extreme slowdowns in MachineCSE or PHIElimination. Let me know if there are PRs you think might be closed now... I'm looking myself, but I may miss them. llvm-svn: 159743	2012-07-05 11:06:22 +00:00
Duncan Sands	71dacd09fe	All cases are covered, no need for a default. This deals with the corresponding clang warning. llvm-svn: 159742	2012-07-05 10:14:33 +00:00
Chandler Carruth	1d5d23106e	The hash function for MI expressions, used by MachineCSE, is really broken. This patch fixes the superficial problems which lead to the intractably slow compile times reported in PR13225. The specific issue is that we were failing to include the offset of a global variable in the hash code. Oops. This would in turn cause all MIs which were only distinguishable due to operating on different offsets of a global variable to produce identical hash functions. In some of the test cases attached to the PR I saw hash table activity where there were O(1000) probes-per-lookup on average. A very few entries were responsible for most of these probes. There is still quite a bit more to do here. The ad-hoc layering of data in MachineOperands makes them extremely brittle to hash correctly. We're missing quite a few other cases, the only ones I've fixed here are the specific MO types which were allowed through the assert() in getOffset(). llvm-svn: 159741	2012-07-05 10:03:57 +00:00
Duncan Sands	0552a2cad2	Use the right kind of booleans: we were emitting 0/1 booleans, instead of 0/-1 booleans. Patch by James Benton. llvm-svn: 159739	2012-07-05 09:32:46 +00:00
Nick Lewycky	765c699370	Remove ParentMap. You can just ask the domnode for its parent. No functionality change. Move the "Not profitable, avoid CSE!" debug message next to where we fail the check for profitability and use a different message for avoiding CSE due to being in different register classes. llvm-svn: 159729	2012-07-05 06:19:21 +00:00
Jakob Stoklund Olesen	c300ef0e50	Allow trailing physreg RegisterSDNode operands on non-variadic instructions. Also allow trailing register mask operands on non-variadic both MachineSDNodes and MachineInstrs. The extra physreg RegisterSDNode operands are added to the MI as <imp-use> operands. This makes it possible to have non-variadic call instructions. Call and return instructions really are non-variadic, the argument registers should only be used implicitly - they are not part of the encoding. llvm-svn: 159727	2012-07-04 23:53:23 +00:00
Jakob Stoklund Olesen	adb50a7a09	Print SlotIndexes when available for -print-machineinstrs. llvm-svn: 159726	2012-07-04 23:53:19 +00:00
Jakob Stoklund Olesen	2d827d628e	Allow multiple terminators to read virtual registers. Find the kill as the last terminator to read SrcReg. Patch by Philipp Brüschweiler! llvm-svn: 159722	2012-07-04 19:52:05 +00:00
Jakob Stoklund Olesen	29506f5e6d	Make sure -print-machineinstrs applies to the first pass as well. llvm-svn: 159720	2012-07-04 19:28:27 +00:00
Stepan Dyatkovskiy	7ff588f986	Reverted r156659, due to probable performance regressions, DenseMap should be used here: IntegersSubsetMapping - Replaced type of Items field from std::list with std::map. In neares future I'll test it with DenseMap and do the correspond replacement if possible. llvm-svn: 159703	2012-07-04 05:53:05 +00:00
Eric Christopher	ef9d710ea6	Reduce some code duplication. llvm-svn: 159701	2012-07-04 02:02:18 +00:00
Matt Beaumont-Gay	11d08b2e22	Fix some ascii art in a comment to not have trailing backslashes (inspiration from IfConversion.cc), and fix some spelling and grammar in the surrounding prose. llvm-svn: 159699	2012-07-04 01:09:45 +00:00
Jakob Stoklund Olesen	f8a63a1507	Add an experimental early if-conversion pass, off by default. This pass performs if-conversion on SSA form machine code by speculatively executing both sides of the branch and using a cmov instruction to select the result. This can help lower the number of branch mispredictions on architectures like x86 that don't have predicable instructions. The current implementation is very aggressive, and causes regressions on mosts tests. It needs good heuristics that have yet to be implemented. llvm-svn: 159694	2012-07-04 00:09:54 +00:00
Stepan Dyatkovskiy	8b0c97e0dd	Part of r159527. Splitted into series of patches and gone with fixed PR13256: IntegersSubsetMapping - Replaced type of Items field from std::list with std::map. In neares future I'll test it with DenseMap and do the correspond replacement if possible. llvm-svn: 159659	2012-07-03 13:46:45 +00:00
Eric Christopher	b65acc61a5	Revert "IntRange:" as it appears to be breaking self hosting. This reverts commit b2833d9dcba88c6f0520cad760619200adc0442c. llvm-svn: 159618	2012-07-02 23:22:21 +00:00
Chandler Carruth	34263a0c95	All glory to address sanitizer. ;] It appears to have caught a use-after-free introduced as by r159567 and/or friends which call 'addPass' from many more places. The bug in 'addPass' doesn't appear to be new, and was spotted by inspection when ASan shown a bright light of a stacktrace at these functions. Hopefully this will fix the ASan failure -- I have no test case other than running an ASan-built clang over the test suite. llvm-svn: 159614	2012-07-02 22:56:41 +00:00
Evan Cheng	39e90029a2	Target option DisableJumpTables is a gross hack. Move it to TargetLowering instead. llvm-svn: 159611	2012-07-02 22:39:56 +00:00
Andrew Trick	2f26b34806	misched: allow NULL InstrItineraries. llvm-svn: 159599	2012-07-02 21:55:12 +00:00
Eric Christopher	dd8638fb3e	Turn an assert into an error to make it a bit more friendly. Part of rdar://6880388 and rdar://11766377 llvm-svn: 159590	2012-07-02 21:16:43 +00:00
Bob Wilson	cac3b90633	Extend TargetPassConfig to allow running only a subset of the normal passes. This is still a work in progress but I believe it is currently good enough to fix PR13122 "Need unit test driver for codegen IR passes". For example, you can run llc with -stop-after=loop-reduce to have it dump out the IR after running LSR. Serializing machine-level IR is not yet supported but we have some patches in progress for that. The plan is to serialize the IR to a YAML file, containing separate sections for the LLVM IR, machine-level IR, and whatever other info is needed. Chad suggested that we stash the stop-after pass in the YAML file and use that instead of the start-after option to figure out where to restart the compilation. I think that's a great idea, but since it's not implemented yet I put the -start-after option into this patch for testing purposes. llvm-svn: 159570	2012-07-02 19:48:45 +00:00
Bob Wilson	a3f9fa710a	Move assertion with TargetPassConfig's Initialized flag. llvm-svn: 159569	2012-07-02 19:48:39 +00:00
Bob Wilson	b9b693650a	Consistently use AnalysisID types in TargetPassConfig. This makes it possible to just use a zero value to represent "no pass", so the phony NoPassID global variable is no longer needed. llvm-svn: 159568	2012-07-02 19:48:37 +00:00
Bob Wilson	bbd38dd9c0	Add all codegen passes to the PassManager via TargetPassConfig. This is a preliminary step toward having TargetPassConfig be able to start and stop the compilation at specified passes for unit testing and debugging. No functionality change. llvm-svn: 159567	2012-07-02 19:48:31 +00:00
Manman Ren	72098b2c91	Added assertion in getVRegDef of MachineRegisterInfo to make sure the virtual register does not have multiple definitions. Modified TwoAddressInstructionPass to use getUniqueVRegDef instead of getVRegDef. llvm-svn: 159545	2012-07-02 18:55:36 +00:00
Andrew Trick	f161e391f8	Reapply "Make NumMicroOps a variable in the subtarget's instruction itinerary." Reapplies r159406 with minor cleanup. The regressions appear to have been spurious. llvm-svn: 159541	2012-07-02 18:10:42 +00:00
Stepan Dyatkovskiy	8b9ecca42d	IntRange: - Changed isSingleNumber method behaviour. Now this flag is calculated on demand. IntegersSubsetMapping - Optimized diff operation. - Replaced type of Items field from std::list with std::map. - Added new methods: bool isOverlapped(self &RHS) void add(self& RHS, SuccessorClass S) void detachCase(self& NewMapping, SuccessorClass Succ) void removeCase(SuccessorClass Succ) SuccessorClass findSuccessor(const IntTy& Val) const IntTy* getCaseSingleNumber(SuccessorClass *Succ) IntegersSubsetTest - DiffTest: Added checks for successors. SimplifyCFG Updated SwitchInst usage (now it is case-ragnes compatible) for - SimplifyEqualityComparisonWithOnlyPredecessor - FoldValueComparisonIntoPredecessors llvm-svn: 159527	2012-07-02 13:02:18 +00:00
Rafael Espindola	a77d31d7fd	Now that RegistersDefinedFromSameValue handles one instruction being an implicit_def, the other instruction can be anything, including instructions that define multiple values. Be careful about that and don't assume what operand 0 is. Fixes pr13249. llvm-svn: 159509	2012-07-01 17:08:01 +00:00
Rafael Espindola	efab16d43b	Handle implicit_defs in the register coalescer. I am still trying to produce a reduced testcase, but this fixes pr13209. llvm-svn: 159479	2012-06-30 01:45:55 +00:00
Manman Ren	6fa76dc0e0	Add SrcReg2 to analyzeCompare and optimizeCompareInstr to handle Compare instructions with two register operands. llvm-svn: 159465	2012-06-29 21:33:59 +00:00
Jakob Stoklund Olesen	3e3cdecf98	Clear kill flags in InstrEmitter::EmitSubregNode(). When a local virtual register is made global, make sure to clear any existing kill flags. llvm-svn: 159461	2012-06-29 21:00:03 +00:00
Jakob Stoklund Olesen	da9ea1d6bc	Check for extra kill flags on live-out virtual registers. This would previously get reported as the misleading "Virtual register def doesn't dominate all uses." llvm-svn: 159460	2012-06-29 21:00:00 +00:00
Manman Ren	c146589aa4	Add getUniqueVRegDef to MachineRegisterInfo. This comes in handy during peephole optimization. llvm-svn: 159453	2012-06-29 19:16:05 +00:00
Alexey Samsonov	6e7e6b646b	Cleanup in DwarfDebug - fix a typo and remove two unused functions llvm-svn: 159433	2012-06-29 16:04:14 +00:00
Chandler Carruth	aafe0918bc	Move llvm/Support/IRBuilder.h -> llvm/IRBuilder.h This was always part of the VMCore library out of necessity -- it deals entirely in the IR. The .cpp file in fact was already part of the VMCore library. This is just a mechanical move. I've tried to go through and re-apply the coding standard's preferred header sort, but at 40-ish files, I may have gotten some wrong. Please let me know if so. I'll be committing the corresponding updates to Clang and Polly, and Duncan has DragonEgg. Thanks to Bill and Eric for giving the green light for this bit of cleanup. llvm-svn: 159421	2012-06-29 12:38:19 +00:00
Bill Wendling	f799efdedc	The DIBuilder class is just a wrapper around debug info creation (a.k.a. MDNodes). The module doesn't belong in Analysis. Move it to the VMCore instead. llvm-svn: 159414	2012-06-29 08:32:07 +00:00
Andrew Trick	51a8cf77b8	Revert "Make NumMicroOps a variable in the subtarget's instruction itinerary." This reverts commit r159406. I noticed a performance regression so I'll back out for now. llvm-svn: 159411	2012-06-29 07:10:41 +00:00
Andrew Trick	8c9e6728b3	misched: avoid scheduling instructions that can't be dispatched. llvm-svn: 159408	2012-06-29 03:23:24 +00:00
Andrew Trick	ce27bb999d	misched: count micro-ops toward the issue limit. llvm-svn: 159407	2012-06-29 03:23:22 +00:00
Andrew Trick	1f50152b2d	Make NumMicroOps a variable in the subtarget's instruction itinerary. The TargetInstrInfo::getNumMicroOps API does not change, but soon it will be used by MachineScheduler. Now each subtarget can specify the number of micro-ops per itinerary class. For ARM, this is currently always dynamic (-1), because it is used for load/store multiple which depends on the number of register operands. Zero is now a valid number of micro-ops. This can be used for nop pseudo-instructions or instructions that the hardware can squash during dispatch. llvm-svn: 159406	2012-06-29 03:23:18 +00:00
Nuno Lopes	ec9653b363	add a new @llvm.donothing intrinsic that, well, does nothing, and teach CodeGen to ignore calls to it llvm-svn: 159383	2012-06-28 22:30:12 +00:00
Jim Grosbach	e0c10d8b86	'Promote' vector [su]int_to_fp should widen elements. Teach vector legalization how to honor Promote for int to float conversions. The code checking whether to promote the operation knew to look at the operand, but the actual promotion code didn't. This fixes that. The operand is promoted up via [zs]ext. rdar://11762659 llvm-svn: 159378	2012-06-28 21:03:44 +00:00
Bill Wendling	e38859dc8e	Move lib/Analysis/DebugInfo.cpp to lib/VMCore/DebugInfo.cpp and include/llvm/Analysis/DebugInfo.h to include/llvm/DebugInfo.h. The reasoning is because the DebugInfo module is simply an interface to the debug info MDNodes and has nothing to do with analysis. llvm-svn: 159312	2012-06-28 00:05:13 +00:00
Jakob Stoklund Olesen	59a0d3243b	Allow targets to inject passes before the virtual register rewriter. Such passes can be used to tweak the register assignments in a target-dependent way, for example to avoid write-after-write dependencies. llvm-svn: 159209	2012-06-26 17:09:29 +00:00
Chandler Carruth	9139f44d23	Update a bunch of stale comments that dated from when this folled the very first (and worst) placement algorithm. These should now more accurately reflect the reality of the pass. llvm-svn: 159185	2012-06-26 05:16:37 +00:00
Andrew Trick	fb2ba3e1cb	Enable the new LoopInfo algorithm by default. The primary advantage is that loop optimizations will be applied in a stable order. This helps debugging and unit test creation. It is also a better overall implementation without pathologically bad performance on deep functions. On large functions (llvm-stress --size=200000 \| opt -loops) Before: 0.1263s After: 0.0225s On deep functions (after tweaking llvm-stress, thanks Nadav): Before: 0.2281s After: 0.0227s See r158790 for more comments. The loop tree is now consistently generated in forward order, but loop passes are applied in reverse order over the program. If we have a loop optimization that prefers forward order, that can easily be achieved by adding a different type of LoopPassManager. llvm-svn: 159183	2012-06-26 04:11:38 +00:00
Evan Cheng	4c6f917d34	Make sure type is not extended or untyped before create a constant of the type. No test case. Found by inspection. llvm-svn: 159179	2012-06-26 01:19:33 +00:00
Jakob Stoklund Olesen	a57fc12ec9	Enforce stricter liveness rules for PHIs. Verify that all paths from the entry block to a virtual register read pass through a def. Enable this check even when MRI->isSSA() is false. Verify that the live range of a virtual register is live out of all predecessor blocks, even for PHI-values. This requires that PHIElimination sometimes inserts IMPLICIT_DEF instruction in predecessor blocks. llvm-svn: 159150	2012-06-25 18:18:27 +00:00
Jakob Stoklund Olesen	eb49566447	Run ProcessImplicitDefs on SSA form where it can be much simpler. Implicitly defined virtual registers can simply have the <undef> bit set on all uses, and copies can be turned into implicit defs recursively. Physical registers are a bit trickier. We handle the common case where a physreg def is used by a nearby instruction in the same basic block. For more complicated cases, just leave the IMPLICIT_DEF instruction in. llvm-svn: 159149	2012-06-25 18:12:18 +00:00
Jakob Stoklund Olesen	70ed924e18	Teach PHIElimination to handle <undef> operands. When a PHI use is <undef>, don't emit a copy in the predecessor block, but insert an IMPLICIT_DEF instruction instead. This ensures that virtual register uses are always jointly dominated by defs, even if some of them are IMPLICIT_DEF. llvm-svn: 159121	2012-06-25 03:36:12 +00:00
Jakob Stoklund Olesen	6b556f824d	Handle <undef> operands in TwoAddressInstructionPass. When the source register to a 2-addr instruction is undefined, there is no need to attempt any transformations - simply replace the source register with the destination register. This also comes up when lowering IMPLICIT_DEF instructions - make sure the <undef> flag is moved to the new partial register def operand: %vreg8<def> = INSERT_SUBREG %vreg9<undef>, %vreg0<kill>, sub_16bit rewrite undef: %vreg8<def> = INSERT_SUBREG %vreg8<undef>, %vreg0<kill>, sub_16bit convert to: %vreg8:sub_16bit<def,read-undef> = COPY %vreg0<kill> llvm-svn: 159120	2012-06-25 03:27:12 +00:00
NAKAMURA Takumi	704de074b8	llvm/lib: [CMake] Add explicit dependency to intrinsics_gen. llvm-svn: 159112	2012-06-24 13:32:01 +00:00
Pete Cooper	fe212e762f	DAG legalisation can now handle illegal fma vector types by scalarisation llvm-svn: 159092	2012-06-24 00:05:44 +00:00
Jakob Stoklund Olesen	502e4c6ac4	Teach LiveVariables to handle <undef> operands. It's simple: Don't treat <undef> operands as uses, and don't assume a virtual register has a defining instruction unless a real use has been seen. llvm-svn: 159061	2012-06-23 02:23:00 +00:00
Jakob Stoklund Olesen	a127fc780a	Remove ProcessImplicitDefs.h which was unused. The ProcessImplicitDefs class can be local to its implementation file. llvm-svn: 159041	2012-06-22 22:27:36 +00:00
Jakob Stoklund Olesen	b033dede17	Also verify the def index for early clobbers. llvm-svn: 159039	2012-06-22 22:23:58 +00:00
Jakob Stoklund Olesen	4fa84ba8b9	Delete a boring statistic. llvm-svn: 159030	2012-06-22 20:40:15 +00:00
Jakob Stoklund Olesen	c61edda0ab	Store live intervals in an IndexedMap. It is both smaller and faster than DenseMap. llvm-svn: 159029	2012-06-22 20:37:52 +00:00
Hal Finkel	8db5547252	Revert r158679 - use case is unclear (and it increases the memory footprint). Original commit message: Allow up to 64 functional units per processor itinerary. This patch changes the type used to hold the FU bitset from unsigned to uint64_t. This will be needed for some upcoming PowerPC itineraries. llvm-svn: 159027	2012-06-22 20:27:13 +00:00
Jakob Stoklund Olesen	48828bb402	Fix a crash in --debug code. Don't try to print out the live range of a physreg. llvm-svn: 159021	2012-06-22 19:51:41 +00:00
Jakob Stoklund Olesen	48a1647c93	Don't depend on live ranges being present. DBG_VALUE instructions could be referring to non-existing virtual registers. llvm-svn: 159020	2012-06-22 18:51:35 +00:00
Jakob Stoklund Olesen	8a833649e5	Simplify handleMove() a bit. There is no need to check for physreg live ranges. They don't exist any more. llvm-svn: 159019	2012-06-22 18:38:57 +00:00
Jakob Stoklund Olesen	37e797fedc	Stop computing physreg live ranges. Everyone is using on-demand regunit ranges now. llvm-svn: 159018	2012-06-22 18:20:50 +00:00
Jakob Stoklund Olesen	bbad269a3e	Remove some redundant LIS->hasInterval() checks. These functions only operate on virtual registers now, and they all have live ranges. llvm-svn: 159015	2012-06-22 17:49:44 +00:00
Jakob Stoklund Olesen	7809578cfe	Use MRI::isConstantPhysReg() to check remat feasibility. Don't depend on LiveIntervals::hasInterval() to determine if a physreg is reserved and constant. llvm-svn: 159013	2012-06-22 17:31:01 +00:00
Jakob Stoklund Olesen	3244963ecc	Use regunit liveness to guide LiveDebugVariables. This should produce the same results as using physreg liveness directly. llvm-svn: 159009	2012-06-22 17:15:32 +00:00
Jakob Stoklund Olesen	b1b3e4aa58	Remove LiveIntervals::trackingRegUnits(). With regunit liveness permanently enabled, this function would always return true. Also remove now obsolete code for checking physreg interference. llvm-svn: 159006	2012-06-22 16:46:44 +00:00
Rafael Espindola	ea59166190	Remove another duplicated variable. We only need one to tell us if the linker knows dwarf or not. llvm-svn: 158993	2012-06-22 13:32:49 +00:00
Rafael Espindola	d7bdaf5795	Fix a FIXME: DwarfRequiresRelocationForSectionOffset is the same as DwarfUsesRelocationsAcrossSections. llvm-svn: 158992	2012-06-22 13:24:07 +00:00
Nick Lewycky	33da33676f	Emit relocations for DW_AT_location entries on systems which need it. This is a recommit of r127757. Fixes PR9493. Patch by Paul Robinson! llvm-svn: 158957	2012-06-22 01:25:12 +00:00
Lang Hames	b8650f106a	Rename -allow-excess-fp-precision flag to -fuse-fp-ops, and switch from a boolean flag to an enum: { Fast, Standard, Strict } (default = Standard). This option controls the creation by optimizations of fused FP ops that store intermediate results in higher precision than IEEE allows (E.g. FMAs). The behavior of this option is intended to match the behaviour specified by a soon-to-be-introduced frontend flag: '-ffuse-fp-ops'. Fast mode - allows formation of fused FP ops whenever they're profitable. Standard mode - allow fusion only for 'blessed' FP ops. At present the only blessed op is the fmuladd intrinsic. In the future more blessed ops may be added. Strict mode - allow fusion only if/when it can be proven that the excess precision won't effect the result. Note: This option only controls formation of fused ops by the optimizers. Fused operations that are explicitly requested (e.g. FMA via the llvm.fma.* intrinsic) will always be honored, regardless of the value of this option. Internally TargetOptions::AllowExcessFPPrecision has been replaced by TargetOptions::AllowFPOpFusion. llvm-svn: 158956	2012-06-22 01:09:09 +00:00
Jack Carter	c457f62033	The inline asm operand modifier 'n' is suppose to be generic across architectures. It has the following description in the gnu sources: Negate the immediate constant Several Architectures such as x86 have local implementations of operand modifier 'n' which go beyond the above description slightly. This won't affect them. Affected files: lib/CodeGen/AsmPrinter/AsmPrinterInlineAsm.cpp Added 'n' to the switch cases. test/CodeGen/Generic/asm-large-immediate.ll Generic compiled test (x86 for me) test/CodeGen/Mips/asm-large-immediate.ll Mips compiled version of the generic one Contributer: Jack Carter llvm-svn: 158939	2012-06-21 21:37:54 +00:00
Pete Cooper	5b61422d80	Fix potential crash if DAGCombine on stores sees a half type llvm-svn: 158927	2012-06-21 18:00:39 +00:00
Jack Carter	b2fd5f66b4	The inline asm operand modifier 'c' is suppose to be generic across architectures. It has the following description in the gnu sources: Substitute immediate value without immediate syntax Several Architectures such as x86 have local implementations of operand modifier 'c' which go beyond the above description slightly. To make use of the generic modifiers without overriding local implementation one can make a call to the base class method for AsmPrinter::PrintAsmOperand() in the locally derived method's "default" case in the switch statement. That way if it is already defined locally the generic version will never get called. This change is needed when test/CodeGen/generic/asm-large-immediate.ll failed on a native Mips board. The test was assuming a generic implementation was in place. Affected files: lib/Target/Mips/MipsAsmPrinter.cpp: Changed the default case to call the base method. lib/CodeGen/AsmPrinter/AsmPrinterInlineAsm.cpp Added 'c' to the switch cases. test/CodeGen/Mips/asm-large-immediate.ll Mips compiled version of the generic one Contributer: Jack Carter llvm-svn: 158925	2012-06-21 17:14:46 +00:00
Evan Cheng	8c2ad81238	Emit a single _udivmodsi4 libcall instead of two separate _udivsi3 and _umodsi3 libcalls if they have the same arguments. This optimization was apparently broken if one of the node was replaced in place. rdar://11714607 llvm-svn: 158900	2012-06-21 05:56:05 +00:00
Jakob Stoklund Olesen	58713de545	Update regunits in RegisterCoalescer::reMaterializeTrivialDef. Old code would only update physreg live intervals. llvm-svn: 158881	2012-06-21 00:09:15 +00:00
Jakob Stoklund Olesen	37a1338a16	Remove spurious typedefs. llvm-svn: 158878	2012-06-20 23:54:18 +00:00
Jakob Stoklund Olesen	1911a0203d	Remove the RenderMachineFunction HTML output pass. I don't think anyone has been using this functionality for a while, and it is getting in the way of refactoring now. llvm-svn: 158876	2012-06-20 23:47:58 +00:00
Jakob Stoklund Olesen	51c63e64e3	Remove the -live-regunits command line option. Register allocators depend on it being permanently enabled now. llvm-svn: 158873	2012-06-20 23:31:34 +00:00
Jakob Stoklund Olesen	781e0b9fd7	Fix some more LiveInterval enumerations. Deterministically enumerate the virtual registers instead. llvm-svn: 158872	2012-06-20 23:23:59 +00:00
Jakob Stoklund Olesen	2d2dec96e0	Remove LiveIntervalUnions from RegAllocBase. They are living in LiveRegMatrix now. llvm-svn: 158868	2012-06-20 22:52:29 +00:00
Jakob Stoklund Olesen	96eebf0b14	Convert RAGreedy to LiveRegMatrix interference checking. Stop depending on the LiveIntervalUnions in RegAllocBase, they are about to be removed. The changes are mostly replacing register alias iterators with regunit iterators, and querying LiveRegMatrix instrad of RegAllocBase. InterferenceCache is converted to work with per-regunit LiveIntervalUnions, and it checks fixed regunit interference separately, using the fixed live intervals provided by LiveIntervalAnalysis. The local splitting helper calcGapWeights() is also considering fixed regunit interference which is kept on the side now. llvm-svn: 158867	2012-06-20 22:52:26 +00:00
Jakob Stoklund Olesen	03b87d5aaa	Convert RABasic to using LiveRegMatrix interference checking. Stop using the LiveIntervalUnions provided by RegAllocBase, they will be removed soon. llvm-svn: 158866	2012-06-20 22:52:24 +00:00
Jakob Stoklund Olesen	effc6b2d18	Enable register unit liveness by default. Soon we won't need to compute live intervals for physical registers. llvm-svn: 158865	2012-06-20 22:52:22 +00:00
Jakob Stoklund Olesen	bfa664eaae	Teach PBQPBuilder::build() about regunit interference. Filter out physreg candidates with regunit interferrence. Also compute regmask interference more efficiently. llvm-svn: 158864	2012-06-20 22:32:05 +00:00
Jakob Stoklund Olesen	a1f43dcdb8	Avoid iterating with LiveIntervals::iterator. That is a DenseMap iterator keyed by pointers, so the iteration order is nondeterministic. I would like to replace the DenseMap with an IndexedMap which doesn't allow iteration. llvm-svn: 158856	2012-06-20 21:25:05 +00:00
Pete Cooper	fe5b84b404	Add users of a MERGE_VALUE node to the worklist to process again when the node is removed. Sorry, no test case. Foudn it by inspection of the code llvm-svn: 158839	2012-06-20 19:35:43 +00:00
Jakob Stoklund Olesen	833308d785	Only update regunit live ranges that have been precomputed. Regunit live ranges are computed on demand, so when mi-sched calls handleMove, some regunits may not have live ranges yet. That makes updating them easier: Just skip the non-existing ranges. They will be computed correctly from the rescheduled machine code when they are needed. llvm-svn: 158831	2012-06-20 18:00:57 +00:00
Jakob Stoklund Olesen	d702e8fddf	Delete dead code. llvm-svn: 158827	2012-06-20 16:38:50 +00:00
Hal Finkel	8a31138521	Fix DAGCombine to deal with ext-conversion of pre/post_inc loads. The test case for this will come with the PPC indexed preinc loads commit. llvm-svn: 158822	2012-06-20 15:42:48 +00:00
Aaron Ballman	421a5ba06d	Fixing a compiler warning in MSVC 10. llvm-svn: 158820	2012-06-20 14:44:44 +00:00
Chandler Carruth	c60fbe6b58	Fix two rather subtle internal vs. external linker issues. I'll admit I'm not entirely satisfied with this change, but it seemed the cleanest option. Other suggestions quite welcome The issue is that the traits specializations have static methods which return the typedef'ed PHI_iterator type. In both the IR and MI layers this is typedef'ed to a custom iterator class defined in an anonymous namespace giving the types and the functions returning them internal linkage. However, because the traits specialization is defined in the 'llvm' namespace (where it has to be, specialized template lives there), and is in turn used in the templated implementation of the SSAUpdater. This led to the linkage conflict that Clang now warns about. The simplest solution to me was just to define the PHI_iterator as a nested class inside the trait specialization. That way it still doesn't get scoped widely, it can't be accidentally reused somewhere, etc. This is a little gross just because nested class definitions are a little gross, but the alternatives seem more ad-hoc. llvm-svn: 158799	2012-06-20 08:39:30 +00:00
Andrew Trick	ff2ed7b687	A new algorithm for computing LoopInfo. Temporarily disabled. -stable-loops enables a new algorithm for generating the Loop forest. It differs from the original algorithm in a few respects: - Not determined by use-list order. - Initially guarantees RPO order of block and subloops. - Linear in the number of CFG edges. - Nonrecursive. I didn't want to change the LoopInfo API yet, so the block lists are still inclusive. This seems strange to me, and it means that building LoopInfo is not strictly linear, but it may not be a problem in practice. At least the block lists start out in RPO order now. In the future we may add an attribute or wrapper analysis that allows other passes to assume RPO order. The primary motivation of this work was not to optimize LoopInfo, but to allow reproducing performance issues by decomposing the compilation stages. I'm often unable to do this with the current LoopInfo, because the loop tree order determines Loop pass order. Serializing the IR tends to invert the order, which reverses the optimization order. This makes it nearly impossible to debug interdependent loop optimizations such as LSR. I also believe this will provide more stable performance results across time. llvm-svn: 158790	2012-06-20 05:23:33 +00:00
Andrew Trick	cda51d430d	Move the implementation of LoopInfo into LoopInfoImpl.h. The implementation only needs inclusion from LoopInfo.cpp and MachineLoopInfo.cpp. Clients of the interface should only include the interface. This makes the interface readable and speeds up rebuilds after modifying the implementation. llvm-svn: 158787	2012-06-20 03:42:09 +00:00
Jakob Stoklund Olesen	3802bbf35e	Add regunit liveness support to LiveIntervals::handleMove(). When LiveIntervals is tracking fixed interference in regunits, make sure to update those intervals as well. Currently guarded by -live-regunits. llvm-svn: 158766	2012-06-19 23:50:18 +00:00
Chad Rosier	651f9a485a	Tidy up. llvm-svn: 158762	2012-06-19 23:37:57 +00:00
Chad Rosier	7369692790	Add an ensureMaxAlignment() function to MachineFrameInfo (analogous to ensureAlignment() in MachineFunction). Also, drop setMaxAlignment() in favor of this new function. This creates a main entry point to setting MaxAlignment, which will be helpful for future work. No functionality change intended. llvm-svn: 158758	2012-06-19 22:59:12 +00:00
Lang Hames	39fb1d08dc	Add DAG-combines for aggressive FMA formation. This patch adds DAG combines to form FMAs from pairs of FADD + FMUL or FSUB + FMUL. The combines are performed when: (a) Either AllowExcessFPPrecision option (-enable-excess-fp-precision for llc) OR UnsafeFPMath option (-enable-unsafe-fp-math) are set, and (b) TargetLoweringInfo::isFMAFasterThanMulAndAdd(VT) is true for the type of the FADD/FSUB, and (c) The FMUL only has one user (the FADD/FSUB). If your target has fast FMA instructions you can make use of these combines by overriding TargetLoweringInfo::isFMAFasterThanMulAndAdd(VT) to return true for types supported by your FMA instruction, and adding patterns to match ISD::FMA to your FMA instructions. llvm-svn: 158757	2012-06-19 22:51:23 +00:00
Jakob Stoklund Olesen	2db1125b15	80 col. llvm-svn: 158755	2012-06-19 22:50:53 +00:00
Jakob Stoklund Olesen	0f855e4263	Implement PPCInstrInfo::isCoalescableExtInstr(). The PPC::EXTSW instruction preserves the low 32 bits of its input, just like some of the x86 instructions. Use it to reduce register pressure when the low 32 bits have multiple uses. This requires a small change to PeepholeOptimizer since EXTSW takes a 64-bit input register. This is related to PR5997. llvm-svn: 158743	2012-06-19 21:14:34 +00:00
Jakob Stoklund Olesen	8eb9905a7c	Style: Don't reuse variables for multiple purposes. No functional change. llvm-svn: 158742	2012-06-19 21:10:18 +00:00
Rafael Espindola	ca3e0ee8b3	Move the support for using .init_array from ARM to the generic TargetLoweringObjectFileELF. Use this to support it on X86. Unlike ARM, on X86 it is not easy to find out if .init_array should be used or not, so the decision is made via TargetOptions and defaults to off. Add a command line option to llc that enables it. llvm-svn: 158692	2012-06-19 00:48:28 +00:00
Hal Finkel	8eac009633	Allow up to 64 functional units per processor itinerary. This patch changes the type used to hold the FU bitset from unsigned to uint64_t. This will be needed for some upcoming PowerPC itineraries. llvm-svn: 158679	2012-06-18 21:08:18 +00:00
Benjamin Kramer	b9f84bb0ce	Guard private fields that are unused in Release builds with #ifndef NDEBUG. llvm-svn: 158608	2012-06-16 21:48:13 +00:00
Jakob Stoklund Olesen	38a6fbf933	Remove final verification in RABasic. We now have a proper machine code verifier pass between register allocation and rewriting. llvm-svn: 158577	2012-06-15 23:48:48 +00:00
Jakob Stoklund Olesen	45c1f9976c	Print out register number in InlineSpiller. llvm-svn: 158575	2012-06-15 23:47:09 +00:00
Jakob Stoklund Olesen	13dffcb766	Accept null PhysReg arguments to checkRegMaskInterference. Calling checkRegMaskInterference(VirtReg) checks if VirtReg crosses any regmask operands, regardless of the registers they clobber. llvm-svn: 158563	2012-06-15 22:24:22 +00:00
Bill Wendling	4fd966347a	Remove assignments which aren't used afterwards. llvm-svn: 158535	2012-06-15 19:30:42 +00:00
Jakob Stoklund Olesen	5767ad727c	Use regunit liveness in RegisterCoalescer when it is available. We only do very limited physreg coalescing now, but we still merge virtual registers into reserved registers. llvm-svn: 158526	2012-06-15 17:36:48 +00:00
Akira Hatanaka	1b420ac4c8	Make machine verifier check the first instruction of the last bundle instead of the last instruction of a basic block. llvm-svn: 158468	2012-06-14 20:51:13 +00:00
Lang Hames	a33db65bd9	Make comment slightly more helpful. llvm-svn: 158467	2012-06-14 20:37:15 +00:00
Andrew Trick	45877fa011	misched: disable SSA check pending PR13112. llvm-svn: 158461	2012-06-14 17:48:49 +00:00
Andrew Trick	344fb64fa3	sched: fix latency of memory dependence chain edges for consistency. For store->load dependencies that may alias, we should always use TrueMemOrderLatency, which may eventually become a subtarget hook. In effect, we should guarantee at least TrueMemOrderLatency on at least one DAG path from a store to a may-alias load. This should fix the standard mode as well as -enable-aa-sched-mi". llvm-svn: 158380	2012-06-13 02:39:03 +00:00
Andrew Trick	5b90645abb	sched: Avoid trivially redundant DAG edges. Take the one with higher latency. llvm-svn: 158379	2012-06-13 02:39:00 +00:00
Andrew Trick	3e465fb225	misched: When querying RegisterPressureTracker, always save current and max pressure. llvm-svn: 158340	2012-06-11 23:42:23 +00:00
Andrew Trick	d054bd833a	misched: regpressure getMaxPressureDelta, revert accidental checkin. llvm-svn: 158339	2012-06-11 23:42:20 +00:00
Benjamin Kramer	0748008df5	Allocate the contents of DwarfDebug's StringMaps in a single big BumpPtrAllocator. llvm-svn: 158265	2012-06-09 10:34:15 +00:00
Andrew Trick	fc8ce08be3	Register pressure: added getPressureAfterInstr. llvm-svn: 158256	2012-06-09 02:16:58 +00:00
Jakob Stoklund Olesen	c26fbbfba5	Sketch a LiveRegMatrix analysis pass. The LiveRegMatrix represents the live range of assigned virtual registers in a Live interval union per register unit. This is not fundamentally different from the interference tracking in RegAllocBase that both RABasic and RAGreedy use. The important differences are: - LiveRegMatrix tracks interference per register unit instead of per physical register. This makes interference checks cheaper and assignments slightly more expensive. For example, the ARM D7 reigster has 24 aliases, so we would check 24 physregs before assigning to one. With unit-based interference, we check 2 units before assigning to 2 units. - LiveRegMatrix caches regmask interference checks. That is currently duplicated functionality in RABasic and RAGreedy. - LiveRegMatrix is a pass which makes it possible to insert target-dependent passes between register allocation and rewriting. Such passes could tweak the register assignments with interference checking support from LiveRegMatrix. Eventually, RABasic and RAGreedy will be switched to LiveRegMatrix. llvm-svn: 158255	2012-06-09 02:13:10 +00:00
Jakob Stoklund Olesen	be336295cd	Also compute MBB live-in lists in the new rewriter pass. This deduplicates some code from the optimizing register allocators, and it means that it is now possible to change the register allocators' solutions simply by editing the VirtRegMap between the register allocator pass and the rewriter. llvm-svn: 158249	2012-06-09 00:14:47 +00:00
Jakob Stoklund Olesen	1224312f5b	Reintroduce VirtRegRewriter. OK, not really. We don't want to reintroduce the old rewriter hacks. This patch extracts virtual register rewriting as a separate pass that runs after the register allocator. This is possible now that CodeGen/Passes.cpp can configure the full optimizing register allocator pipeline. The rewriter pass uses register assignments in VirtRegMap to rewrite virtual registers to physical registers, and it inserts kill flags based on live intervals. These finalization steps are the same for the optimizing register allocators: RABasic, RAGreedy, and PBQP. llvm-svn: 158244	2012-06-08 23:44:45 +00:00
Evan Cheng	c5adccab1a	Start implementing pre-ra if-converter: using speculation and selects to eliminate branches. llvm-svn: 158234	2012-06-08 21:53:50 +00:00
Andrew Trick	423fa6faee	TargetInstrInfo hooks implemented in codegen should be declared pure virtual. llvm-svn: 158233	2012-06-08 21:52:38 +00:00
Andrew Trick	596af1b02e	Fix Target->Codegen dependence. Bulk move of TargetInstrInfo implementation into TargetInstrInfoImpl. This is dirty because the code isn't part of TargetInstrInfoImpl class, nor should it be, because the methods are not target hooks. However, it's the current mechanism for keeping libTarget useful outside the backend. You'll get a not-so-nice link error if you invoke a TargetInstrInfo method that depends on CodeGen. The TargetInstrInfoImpl class should probably be removed since it doesn't really solve this problem. To really fix this, we probably need separate interfaces for the CodeGen/nonCodeGen sides of TargetInstrInfo. llvm-svn: 158212	2012-06-08 17:23:27 +00:00
Pete Cooper	cd72016cab	Move terminator machine verification to check MachineBasicBlock::instr_iterator instead of MBB::iterator llvm-svn: 158154	2012-06-07 17:41:39 +00:00
Manman Ren	9c9641812c	Revert r157755. The commit is intended to fix rdar://11540023. It is implemented as part of peephole optimization. We can actually implement this in the SelectionDAG lowering phase. llvm-svn: 158122	2012-06-06 23:53:03 +00:00
Jakob Stoklund Olesen	00e7dffefb	Properly verify liveness with bundled machine instructions. Bundles should be treated as one atomic transaction when checking liveness. That is how the register allocator (and VLIW targets) treats bundles. llvm-svn: 158116	2012-06-06 22:34:30 +00:00
Andrew Trick	05ff4667eb	Move RegisterClassInfo.h. Allow targets to access this API. It's required for RegisterPressure. llvm-svn: 158102	2012-06-06 20:29:31 +00:00
Andrew Trick	88517f608c	Move RegisterPressure.h. Make it a general utility for use by Targets. llvm-svn: 158097	2012-06-06 19:47:35 +00:00
Benjamin Kramer	009b1c1cf1	Round 2 of dead private variable removal. LLVM is now -Wunused-private-field clean except for - lib/MC/MCDisassembler/Disassembler.h. Not sure why it keeps all those unaccessible fields. - gtest. llvm-svn: 158096	2012-06-06 19:47:08 +00:00
Benjamin Kramer	628a39faa3	Remove unused private fields found by clang's new -Wunused-private-field. There are some that I didn't remove this round because they looked like obvious stubs. There are dead variables in gtest too, they should be fixed upstream. llvm-svn: 158090	2012-06-06 18:25:08 +00:00
Jakob Stoklund Olesen	f435b1867d	Remove dead debug option -disable-rematerialization. Remat has been stable for years, and it isn't done by LiveIntervalAnalysis any longer. (See LiveRangeEdit). llvm-svn: 158079	2012-06-06 16:22:41 +00:00
Benjamin Kramer	3de5d40f4d	Stop leaking RegScavengers from TailDuplication. llvm-svn: 158069	2012-06-06 13:53:41 +00:00
Jakob Stoklund Olesen	c141ba584e	Move LiveUnionArray into LiveIntervalUnion.h It is useful outside RegAllocBase. llvm-svn: 158041	2012-06-05 23:57:30 +00:00
Jakob Stoklund Olesen	46d229c573	Don't print register names in LiveIntervalUnion::print(). Soon we'll be making LiveIntervalUnions for register units as well. This was the only place using the RepReg member, so just remove it. llvm-svn: 158038	2012-06-05 23:07:19 +00:00
Matt Beaumont-Gay	7ba769bedd	Suppress -Wunused-variable in -Asserts build llvm-svn: 158037	2012-06-05 23:00:03 +00:00
Jakob Stoklund Olesen	f3f7d6f6e2	Simplify LiveInterval::print(). Don't print out the register number and spill weight, making the TRI argument unnecessary. This allows callers to interpret the reg field. It can currently be a virtual register, a physical register, a spill slot, or a register unit. llvm-svn: 158031	2012-06-05 22:51:54 +00:00
Jakob Stoklund Olesen	12e03dae44	Add experimental support for register unit liveness. Instead of computing a live interval per physreg, LiveIntervals can compute live intervals per register unit. This makes impossible the confusing situation where aliasing registers could have overlapping live intervals. It should also make fixed interferernce checking cheaper since registers have fewer register units than aliases. Live intervals for regunits are computed on demand, using MRI use-def chains and the new LiveRangeCalc class. Only regunits live in to ABI blocks are precomputed during LiveIntervals::runOnMachineFunction(). The regunit liveness computations don't depend on LiveVariables. llvm-svn: 158029	2012-06-05 22:02:15 +00:00
Jakob Stoklund Olesen	989b3b1516	Implement LiveRangeCalc::extendToUses() and createDeadDefs(). These LiveRangeCalc methods are to be used when computing a live range from scratch. llvm-svn: 158027	2012-06-05 21:54:09 +00:00
Andrew Trick	4b037005d2	MachineInstr::eraseFromParent fix for removing bundled instrs. Patch by Ivan Llopard. llvm-svn: 158025	2012-06-05 21:44:23 +00:00
Andrew Trick	4544606c71	misched: API for minimum vs. expected latency. Minimum latency determines per-cycle scheduling groups. Expected latency determines critical path and cost. llvm-svn: 158021	2012-06-05 21:11:27 +00:00
Lang Hames	a59100cc08	Add a new intrinsic: llvm.fmuladd. This intrinsic represents a multiply-add expression (a * b + c) that can be implemented as a fused multiply-add (fma) if the target determines that this will be more efficient. This intrinsic will be used to implement FP_CONTRACT support and an aggressive FMA formation mode. If your target has a fast FMA instruction you should override the isFMAFasterThanMulAndAdd method in TargetLowering to return true. llvm-svn: 158014	2012-06-05 19:07:46 +00:00
Andrew Trick	73d7736b17	misched: Added MultiIssueItineraries. This allows a subtarget to explicitly specify the issue width and other properties without providing pipeline stage details for every instruction. llvm-svn: 157979	2012-06-05 03:44:40 +00:00
Andrew Trick	a88d46e818	sdsched: Use the right heuristics when -mcpu is not provided and we have no itinerary. Use ILP heuristics for long latency instrs if no scoreboard exists. llvm-svn: 157978	2012-06-05 03:44:34 +00:00
Andrew Trick	ed7c96d7d9	misched: Allow disabling scoreboard hazard checking for subtargets with a valid itinerary but no pipeline stages. An itinerary can contain useful scheduling information without specifying pipeline stages for each instruction. llvm-svn: 157977	2012-06-05 03:44:32 +00:00
Andrew Trick	d36adece50	misched: comments from code review. llvm-svn: 157975	2012-06-05 03:44:26 +00:00
Jakob Stoklund Olesen	345528944c	Remove the last remat-related code from LiveIntervalAnalysis. Rematerialization is handled by LiveRangeEdit now. llvm-svn: 157974	2012-06-05 01:06:15 +00:00
Jakob Stoklund Olesen	9e27e2621a	Stop using LiveIntervals::isReMaterializable(). It is an old function that does a lot more than required by CalcSpillWeights, which was the only remaining caller. The isRematerializable() function never actually sets the isLoad argument, so don't try to compute that. llvm-svn: 157973	2012-06-05 01:06:12 +00:00
Jakob Stoklund Olesen	188d830405	Delete dead code. llvm-svn: 157963	2012-06-04 23:01:41 +00:00
Jakob Stoklund Olesen	11fb248aa6	Switch LiveIntervals member variable to LLVM naming standards. No functional change. llvm-svn: 157957	2012-06-04 22:39:14 +00:00
Jakob Stoklund Olesen	5ef0e0b262	Pass context pointers to LiveRangeCalc::reset(). Remove the same pointers from all the other LiveRangeCalc functions, simplifying the interface. llvm-svn: 157941	2012-06-04 18:21:16 +00:00
Nadav Rotem	b7bb72e4f3	Remove the "-promote-elements" flag. This flag is now enabled by default. llvm-svn: 157925	2012-06-04 11:27:21 +00:00
Benjamin Kramer	bde9176663	Fix typos found by http://github.com/lyda/misspell-check llvm-svn: 157885	2012-06-02 10:20:22 +00:00
Stepan Dyatkovskiy	0e46d8a08c	PR1255: case ranges. IntRange converted from struct to class. So main change everywhere is replacement of ".Low/High" with ".getLow/getHigh()" llvm-svn: 157884	2012-06-02 09:42:43 +00:00
Stepan Dyatkovskiy	9549f5894b	PR1255: case ranges. IntegersSubsetGeneric, IntegersSubsetMapping: added IntTy template parameter, that allows use either APInt or IntItem. This change allows to write unittest for these classes. llvm-svn: 157880	2012-06-02 07:26:00 +00:00
Akira Hatanaka	6f3b2a670f	Fix a bug in the code which custom-lowers truncating stores in LegalizeDAG. Check that the SDValue TargetLowering::LowerOperation returns is not null before replacing the original node with the returned node. llvm-svn: 157873	2012-06-02 01:10:34 +00:00
Jakob Stoklund Olesen	54038d796c	Switch all register list clients to the new MC*Iterator interface. No functional change intended. Sorry for the churn. The iterator classes are supposed to help avoid giant commits like this one in the future. The TableGen-produced register lists are getting quite large, and it may be necessary to change the table representation. This makes it possible to do so without changing all clients (again). llvm-svn: 157854	2012-06-01 23:28:30 +00:00
Jakob Stoklund Olesen	ca487d2183	Remove physreg support from adjustCopiesBackFrom and removeCopyByCommutingDef. After physreg coalescing was disabled, these functions can't do anything useful with physregs anyway. llvm-svn: 157849	2012-06-01 22:38:19 +00:00
Jakob Stoklund Olesen	9b09cf0c11	Simplify some more getAliasSet callers. MCRegAliasIterator can include Reg itself in the list. llvm-svn: 157848	2012-06-01 22:38:17 +00:00
Jakob Stoklund Olesen	92a0083944	Switch some getAliasSet clients to MCRegAliasIterator. MCRegAliasIterator can optionally visit the register itself, allowing for simpler code. llvm-svn: 157837	2012-06-01 20:36:54 +00:00
Manman Ren	e873552091	ARM: properly handle alignment for struct byval. Factor out the expansion code into a function. This change is to be enabled in clang. rdar://9877866 llvm-svn: 157830	2012-06-01 19:33:18 +00:00
Stepan Dyatkovskiy	66305749f1	PR1255: case ranges. IntegersSubset devided into IntegersSubsetGeneric and into IntegersSubset itself. The first has no references to ConstantInt and works with IntItem only. IntegersSubsetMapping also made generic. Here added second template parameter "IntegersSubsetTy" that allows to use on of two IntegersSubset types described below. llvm-svn: 157815	2012-06-01 16:17:57 +00:00
Chris Lattner	cc84e6d2b5	quick fix for PR13006, will check in testcase later. llvm-svn: 157813	2012-06-01 15:02:52 +00:00
Chris Lattner	466076b95f	enhance the logic for looking through tailcalls to look through transparent casts in multiple-return value scenarios, like what happens on X86-64 when returning small structs. llvm-svn: 157800	2012-06-01 05:29:15 +00:00
Chris Lattner	182fe3eef1	enhance getNoopInput to know about vector<->vector bitcasts of legal types, as well as int<->ptr casts. This allows us to tailcall functions with some trivial casts between the call and return (i.e. because the return types disagree). llvm-svn: 157798	2012-06-01 05:16:33 +00:00
Chris Lattner	4f3615de97	rearrange some logic, no functionality change. llvm-svn: 157796	2012-06-01 05:01:15 +00:00
Eric Christopher	1cf3338bb4	Add support for enum forward declarations. Part of rdar://11570854 llvm-svn: 157786	2012-06-01 00:22:32 +00:00
Manman Ren	9bccb64e56	X86: replace SUB with CMP if possible This patch will optimize the following movq %rdi, %rax subq %rsi, %rax cmovsq %rsi, %rdi movq %rdi, %rax to cmpq %rsi, %rdi cmovsq %rsi, %rdi movq %rdi, %rax Perform this optimization if the actual result of SUB is not used. rdar: 11540023 llvm-svn: 157755	2012-05-31 17:20:29 +00:00
Jakob Stoklund Olesen	05e2245fc6	Prioritize smaller register classes for urgent evictions. It helps compile exotic inline asm. In the test case, normal GR32 virtual registers use up eax-edx so the final GR32_ABCD live range has no registers left. Since all the live ranges were tiny, we had no way of prioritizing the smaller register class. This patch allows tiny unspillable live ranges to be evicted by tiny unspillable live ranges from a smaller register class. <rdar://problem/11542429> llvm-svn: 157715	2012-05-30 21:46:58 +00:00
Owen Anderson	0eda3e1de6	Switch the canonical FMA term operand order to match both the comment I wrote and the usual LLVM convention. llvm-svn: 157708	2012-05-30 18:54:50 +00:00
Owen Anderson	c7aaf523e1	Teach DAGCombine to canonicalize the position of a constant in the term operands of an FMA node. llvm-svn: 157707	2012-05-30 18:50:39 +00:00
Chad Rosier	fba46a64aa	Remove extra space. llvm-svn: 157706	2012-05-30 18:47:55 +00:00
Jakob Stoklund Olesen	3a48c06456	Remove some redundant tests. An empty list is not represented as a null pointer. Let TRI do its own shortcuts. llvm-svn: 157702	2012-05-30 18:38:56 +00:00
Evan Cheng	bc2453dd3d	Teach taildup to update livein set. rdar://11538365 llvm-svn: 157663	2012-05-30 00:42:39 +00:00
Evan Cheng	50954fb3e1	If-converter models predicated defs as read + write. The read should be marked as 'undef' since it may not already be live. This appeases -verify-machineinstrs. llvm-svn: 157662	2012-05-30 00:42:02 +00:00
Bob Wilson	33e5188c27	Add an insertPass API to TargetPassConfig. <rdar://problem/11498613> Besides adding the new insertPass function, this patch uses it to enhance the existing -print-machineinstrs so that the MachineInstrs after a specific pass can be printed. Patch by Bin Zeng! llvm-svn: 157655	2012-05-30 00:17:12 +00:00
Evan Cheng	76f6e2671a	Optional def can be either a def or a use (of reg0). llvm-svn: 157640	2012-05-29 19:40:44 +00:00
Lang Hames	e256f71937	Clear the entering, exiting and internal ranges of a bundle before collecting ranges for the instruction about to be bundled. This fixes a bug in an external project where an assertion was triggered due to spurious 'multiple defs' within the bundle. Patch by Ivan Llopard. Thanks Ivan! llvm-svn: 157632	2012-05-29 18:19:54 +00:00
Stepan Dyatkovskiy	58107dd547	ConstantRangesSet renamed to IntegersSubset. CRSBuilder renamed to IntegersSubsetMapping. llvm-svn: 157612	2012-05-29 12:26:47 +00:00
Peter Collingbourne	913869be45	Add llvm.fabs intrinsic. llvm-svn: 157594	2012-05-28 21:48:37 +00:00
Stepan Dyatkovskiy	e3e19cbb13	PR1255: Case Ranges Implemented IntItem - the wrapper around APInt. Why not to use APInt item directly right now? 1. It will very difficult to implement case ranges as series of small patches. We got several large and heavy patches. Each patch will about 90-120 kb. If you replace ConstantInt with APInt in SwitchInst you will need to changes at the same time all Readers,Writers and absolutely all passes that uses SwitchInst. 2. We can implement APInt pool inside and save memory space. E.g. we use several switches that works with 256 bit items (switch on signatures, or strings). We can avoid value duplicates in this case. 3. IntItem can be easyly easily replaced with APInt. 4. Currenly we can interpret IntItem both as ConstantInt and as APInt. It allows to provide SwitchInst methods that works with ConstantInt for non-updated passes. Why I need it right now? Currently I need to update SimplifyCFG pass (EqualityComparisons). I need to work with APInts directly a lot, so peaces of code ConstantInt *V = ...; if (V->getValue().ugt(AnotherV->getValue()) { ... } will look awful. Much more better this way: IntItem V = ConstantIntVal->getValue(); if (AnotherV < V) { } Of course any reviews are welcome. P.S.: I'm also going to rename ConstantRangesSet to IntegersSubset, and CRSBuilder to IntegersSubsetMapping (allows to map individual subsets of integers to the BasicBlocks). Since in future these classes will founded on APInt, it will possible to use them in more generic ways. llvm-svn: 157576	2012-05-28 12:39:09 +00:00
Peter Collingbourne	4d358b55fa	Have getOrCreateSubprogramDIE store the DIE for a subprogram definition in the map before calling itself to retrieve the DIE for the declaration. Without this change, if this causes getOrCreateSubprogramDIE to be recursively called on the definition, it will create multiple DIEs for that definition. Fixes PR12831. llvm-svn: 157541	2012-05-27 18:36:44 +00:00
Benjamin Kramer	abb3fa69b4	Missed parens. llvm-svn: 157527	2012-05-27 10:56:55 +00:00
Benjamin Kramer	4b8f8e75e6	r157525 didn't work, just disable iterator checking. This is obviosly right but I don't see how to do this with proper vector iterators without building a horrible mess of workarounds. llvm-svn: 157526	2012-05-27 10:24:52 +00:00
Benjamin Kramer	48ff2751c1	SDAGBuilder: Avoid iterator invalidation harder. vector.begin()-1 is invalid too. llvm-svn: 157525	2012-05-27 09:44:52 +00:00
Benjamin Kramer	5aad872f8c	SDAGBuilder: Don't create an invalid iterator when there is only one switch case. Found by libstdc++'s debug mode. llvm-svn: 157522	2012-05-26 21:19:12 +00:00
Benjamin Kramer	f2beccf6b4	SelectionDAGBuilder: When emitting small compare chains for switches order them by using edge weights. SimplifyCFG tends to form a lot of 2-3 case switches when merging branches. Move the most likely condition to the front so it is checked first and the others can be skipped. This is currently not as effective as it could be because SimplifyCFG destroys profiling metadata when merging branches and switches. Merging branch weight metadata is tricky though. This code touches at most 3 cases so I didn't use a proper sorting algorithm. llvm-svn: 157521	2012-05-26 20:01:32 +00:00
Benjamin Kramer	484f4247aa	ScoreboardHazardRecognizer: Remove dead conditional in debug code. Negative cycles are filtered out earlier. llvm-svn: 157514	2012-05-26 11:37:37 +00:00
Justin Holewinski	aa58397b3c	Change interface for TargetLowering::LowerCallTo and TargetLowering::LowerCall to pass around a struct instead of a large set of individual values. This cleans up the interface and allows more information to be added to the struct for future targets without requiring changes to each and every target. NV_CONTRIB llvm-svn: 157479	2012-05-25 16:35:28 +00:00
Andrew Trick	4e7f6a7702	misched: trace formatting llvm-svn: 157455	2012-05-25 02:02:39 +00:00
Eli Friedman	315a0c79f3	Simplify code for calling a function where CanLowerReturn fails, fixing a small bug in the process. llvm-svn: 157446	2012-05-25 00:09:29 +00:00
Kaelyn Uhrain	85d8f0cba8	Silence unused variable warnings from when assertions are disabled. llvm-svn: 157438	2012-05-24 23:37:49 +00:00
Andrew Trick	a306a8a844	misched: Use the same scheduling heuristics with -misched-topdown/bottomup. (except the part about choosing direction) llvm-svn: 157437	2012-05-24 23:11:17 +00:00
Andrew Trick	79d3eecbb4	misched: Trace regpressure. llvm-svn: 157429	2012-05-24 22:11:14 +00:00
Andrew Trick	a8ad5f7c7b	misched: Give each ReadyQ a unique ID llvm-svn: 157428	2012-05-24 22:11:12 +00:00
Andrew Trick	61f1a278b8	misched: Added ScoreboardHazardRecognizer. The Hazard checker implements in-order contraints, or interlocked resources. Ready instructions with hazards do not enter the available queue and are not visible to other heuristics. The major code change is the addition of SchedBoundary to encapsulate the state at the top or bottom of the schedule, including both a pending and available queue. The scheduler now counts cycles in sync with the hazard checker. These are minimum cycle counts based on known hazards. Targets with no itinerary (x86_64) currently remain at cycle 0. To fix this, we need to provide some maximum issue width for all targets. We also need to add the concept of expected latency vs. minimum latency. llvm-svn: 157427	2012-05-24 22:11:09 +00:00
Andrew Trick	ca47335461	misched: Release bottom roots in reverse order. llvm-svn: 157426	2012-05-24 22:11:05 +00:00
Andrew Trick	dd375dd34a	misched: rename ReadyQ class llvm-svn: 157425	2012-05-24 22:11:03 +00:00
Andrew Trick	f378617773	misched: copy comments so compareRPDelta is readable by itself. llvm-svn: 157424	2012-05-24 22:11:01 +00:00
Andrew Trick	d5326aea81	regpressure: Added RegisterPressure::dump llvm-svn: 157423	2012-05-24 22:10:59 +00:00
Andrew Trick	b2c172e20a	regpressure: physreg livein/out fix llvm-svn: 157422	2012-05-24 22:10:57 +00:00
Craig Topper	9520719b9b	Mark some static arrays as const. llvm-svn: 157377	2012-05-24 06:35:32 +00:00
Jakob Stoklund Olesen	0ce90494e6	Add a last resort tryInstructionSplit() to RAGreedy. Live ranges with a constrained register class may benefit from splitting around individual uses. It allows the remaining live range to use a larger register class where it may allocate. This is like spilling to a different register class. This is only attempted on constrained register classes. <rdar://problem/11438902> llvm-svn: 157354	2012-05-23 22:37:27 +00:00
Bill Wendling	e351e8c52d	Forgot to reverse conditional. llvm-svn: 157349	2012-05-23 22:12:50 +00:00
Bill Wendling	041793c452	Reduce indentation by early detection of 'continue'. No functionality change. llvm-svn: 157348	2012-05-23 22:09:50 +00:00
Jakob Stoklund Olesen	5b8f476037	Correctly deal with identity copies in RegisterCoalescer. Now that the coalescer keeps live intervals and machine code in sync at all times, it needs to deal with identity copies differently. When merging two virtual registers, all identity copies are removed right away. This means that other identity copies must come from somewhere else, and they are going to have a value number. Deal with such copies by merging the value numbers before erasing the copy instruction. Otherwise, we leave dangling value numbers in the live interval. This fixes PR12927. llvm-svn: 157340	2012-05-23 20:21:06 +00:00
Patrik Hägglund	94537c2a06	Small fix for the debug output from PBQP (PR12822). llvm-svn: 157319	2012-05-23 12:12:58 +00:00
Eric Christopher	c49643586b	Add support for C++11 enum classes in llvm. Part of rdar://11496790 llvm-svn: 157303	2012-05-23 00:09:20 +00:00
Eric Christopher	d42b92f5c3	Untabify and 80-col. llvm-svn: 157274	2012-05-22 18:45:24 +00:00
Eric Christopher	775cbd2b47	Formatting consistency. llvm-svn: 157273	2012-05-22 18:45:18 +00:00
Jakob Stoklund Olesen	924279ca0e	Only erase virtregs with no uses left. Also make sure registers aren't erased twice if the dead def mentions the register twice. This fixes PR12911. llvm-svn: 157254	2012-05-22 14:52:12 +00:00
Owen Anderson	f2118ea826	Fix use of an unitialized value in the LegalizeOps expansion for ISD::SUB. No in-tree targets exercise this path. Patch by Micah Villmow. llvm-svn: 157215	2012-05-21 22:39:20 +00:00
Chad Rosier	5d1f5d2be3	Typo. llvm-svn: 157195	2012-05-21 17:13:41 +00:00
Jakob Stoklund Olesen	29268b50f2	Give a small negative bias to giant edge bundles. This helps compile time when the greedy register allocator splits live ranges in giant functions. Without the bias, we would try to grow regions through the giant edge bundles, usually to find out that the region became too big and expensive. If a live range has many uses in blocks near the giant bundle, the small negative bias doesn't make a big difference, and we still consider regions including the giant edge bundle. Giant edge bundles are usually connected to landing pads or indirect branches. llvm-svn: 157174	2012-05-21 03:11:23 +00:00
Jakob Stoklund Olesen	a7c3d2f902	Clear kill flags on the fly when joining intervals. With physreg joining out of the way, it is easy to recognize the instructions that need their kill flags cleared while testing for interference. This allows us to skip the final scan of all instructions for an 11% speedup of the coalescer pass. llvm-svn: 157169	2012-05-20 21:41:05 +00:00
Jakob Stoklund Olesen	2f06a6579c	Constrain regclasses in PeepholeOptimizer. It can be necessary to restrict to a sub-class before accessing sub-registers. llvm-svn: 157164	2012-05-20 18:42:55 +00:00
Jakob Stoklund Olesen	00f07dec0c	Constrain register classes in TailDup. When rewriting operands, make sure the new registers have a compatible register class. llvm-svn: 157163	2012-05-20 18:42:51 +00:00
Peter Collingbourne	8eb05fd093	When legalising shifts, do not pre-build a list of operands which may be RAUW'd by the recursive call to LegalizeOps; instead, retrieve the other operands when calling UpdateNodeOperands. Fixes PR12889. llvm-svn: 157162	2012-05-20 18:36:15 +00:00
Benjamin Kramer	76004e69a6	Plug a leak when using MCJIT. Found by valgrind. llvm-svn: 157160	2012-05-20 17:24:08 +00:00
Benjamin Kramer	a7c2c41c3c	Use TargetMachine's register info instead of creating a new one and leaking it. llvm-svn: 157155	2012-05-20 11:24:27 +00:00
Jakob Stoklund Olesen	1f1c6add10	Properly constrain register classes for sub-registers. Not all GR64 registers have sub_8bit sub-registers. llvm-svn: 157150	2012-05-20 06:38:37 +00:00
Jakob Stoklund Olesen	a103a516c6	Properly constrain register classes in 2-addr. X86 has 2-addr instructions with different constraints on the tied def and use operands. One is GR32, one is GR32_NOSP. llvm-svn: 157149	2012-05-20 06:38:32 +00:00
Jakob Stoklund Olesen	b8f950650b	Missed a push_back in r157147. llvm-svn: 157148	2012-05-20 05:28:53 +00:00
Jakob Stoklund Olesen	d0a38a8daa	Avoid deleting extra copies when RegistersDefinedFromSameValue is true. This function adds copies to be erased to DupCopies, avoid also adding them to DeadCopies. llvm-svn: 157147	2012-05-20 04:52:48 +00:00
Jakob Stoklund Olesen	64d82b74dd	Fix build bots. Avoid looking at the operands of a potentially erased instruction. llvm-svn: 157146	2012-05-20 03:57:12 +00:00
Jakob Stoklund Olesen	02d83e3b8b	LiveRangeQuery simplifies shrinkToUses(). llvm-svn: 157145	2012-05-20 02:54:52 +00:00
Jakob Stoklund Olesen	abc8c3d3ce	Use LiveRangeQuery in ScheduleDAGInstrs. llvm-svn: 157144	2012-05-20 02:44:38 +00:00
Jakob Stoklund Olesen	58165b92e6	Eliminate some uses of struct LiveRange. That struct ought to be a LiveInterval implementation detail. llvm-svn: 157143	2012-05-20 02:44:36 +00:00
Jakob Stoklund Olesen	2aeead4bf6	Use LiveRangeQuery instead of getLiveRangeContaining(). llvm-svn: 157142	2012-05-20 02:44:33 +00:00
Jakob Stoklund Olesen	4e1e43a355	Simplify overlap check. llvm-svn: 157137	2012-05-19 23:59:27 +00:00
Jakob Stoklund Olesen	a34a69ce0c	Fix 12892. Dead code elimination during coalescing could cause a virtual register to be split into connected components. The following rewriting would be confused about the already joined copies present in the code, but without a corresponding value number in the live range. Erase all joined copies instantly when joining intervals such that the MI and LiveInterval representations are always in sync. llvm-svn: 157135	2012-05-19 23:34:59 +00:00
Jakob Stoklund Olesen	e59d0c3252	Remove the late DCE in RegisterCoalescer. Dead code and joined copies are now eliminated on the fly, and there is no need for a post pass. This makes the coalescer work like other modern register allocator passes: Code is changed on the fly, there is no pending list of changes to be committed. llvm-svn: 157132	2012-05-19 21:02:31 +00:00
Jakob Stoklund Olesen	25ced18407	Erase joined copies immediately. The late dead code elimination is no longer necessary. The test changes are cause by a register hint that can be either %rdi or %rax. The choice depends on the use list order, which this patch changes. llvm-svn: 157131	2012-05-19 20:54:07 +00:00
Jakob Stoklund Olesen	1b707c8817	Fix an ancient bug in removeCopyByCommutingDef(). Before rewriting uses of one value in A to register B, check that there are no tied uses. That would require multiple A values to be rewritten. This bug can't bite in the current version of the code for a fairly subtle reason: A tied use would have caused 2-addr to insert a copy before the use. If the copy has been coalesced, it will be found by the same loop changed by this patch, and the optimization is aborted. This was exposed by 400.perlbench and lua after applying a patch that deletes joined copies aggressively. llvm-svn: 157130	2012-05-19 20:54:03 +00:00
Jakob Stoklund Olesen	d05148ba89	Collect inflatable virtual registers on the fly. There is no reason to defer the collection of virtual registers whose register class may be replaced with a larger class. llvm-svn: 157125	2012-05-19 19:25:00 +00:00
Jakob Stoklund Olesen	900f58441d	Eliminate dead code after remat. This will remove the original def once it has no more uses. llvm-svn: 157104	2012-05-19 05:25:59 +00:00
Jakob Stoklund Olesen	dcffc626c0	Don't remat during updateRegDefsUses(). Remaining virtreg->physreg copies were rematerialized during updateRegDefsUses(), but we already do the same thing in joinCopy() when visiting the physreg copy instruction. Eliminate the preserveSrcInt argument to reMaterializeTrivialDef(). It is now always true. llvm-svn: 157103	2012-05-19 05:25:56 +00:00
Jakob Stoklund Olesen	06dc721203	Immediately erase trivially useless copies. There is no need for these instructions to stick around since they are known to be not dead. llvm-svn: 157102	2012-05-19 05:25:53 +00:00
Jakob Stoklund Olesen	82d77e8145	Run proper recursive dead code elimination during coalescing. Dead copies cause problems because they are trivial to coalesce, but removing them gived the live range a dangling end point. This patch enables full dead code elimination which trims live ranges to their uses so end points don't dangle. DCE may erase multiple instructions. Put the pointers in an ErasedInstrs set so we never risk visiting erased instructions in the work list. There isn't supposed to be any dead copies entering RegisterCoalescer, but they do slip by as evidenced by test/CodeGen/X86/coalescer-dce.ll. llvm-svn: 157101	2012-05-19 05:25:50 +00:00
Jakob Stoklund Olesen	e5bbe37950	Allow LiveRangeEdit to be created with a NULL parent. The dead code elimination with callbacks is still useful. llvm-svn: 157100	2012-05-19 05:25:46 +00:00

... 3 4 5 6 7 ...

13969 Commits