llvm-project

Commit Graph

Author	SHA1	Message	Date
Benjamin Kramer	8da6468a34	Remove unused variable. llvm-svn: 185072	2013-06-27 11:26:41 +00:00
Elena Demikhovsky	fed077be03	Fixed a comment. llvm-svn: 184933	2013-06-26 12:15:53 +00:00
Elena Demikhovsky	6769c50d9e	Optimized integer vector multiplication operation by replacing it with shift/xor/sub when it is possible. Fixed a bug in SDIV, where the const operand is not a splat constant vector. llvm-svn: 184931	2013-06-26 10:55:03 +00:00
Manman Ren	aed30fa3de	Remove unused code. No functionality change. llvm-svn: 184866	2013-06-25 18:49:55 +00:00
Benjamin Kramer	866793109e	BlockFrequency: Bump up the entry frequency a bit. This is a band-aid to fix the most severe regressions we're seeing from basing spill decisions on block frequencies, until we have a better solution. llvm-svn: 184835	2013-06-25 13:34:40 +00:00
Eric Christopher	5fdd68e596	80-column and tab character fixes. llvm-svn: 184792	2013-06-24 23:20:02 +00:00
Eric Christopher	40b6bf6f46	Formatting. llvm-svn: 184788	2013-06-24 21:34:55 +00:00
Eric Christopher	bf2d23c0d7	Use const references instead of pointers to references that are never modified. No functional change. llvm-svn: 184781	2013-06-24 21:07:27 +00:00
Andrew Trick	97a1d7c475	Fix tail merging to assign the (more) correct BasicBlock when splitting. This makes it possible to write unit tests that are less susceptible to minor code motion, particularly copy placement. block-placement.ll covers this case with -pre-RA-sched=source which will soon be default. One incorrectly named block is already fixed, but without this fix, enabling new coalescing and scheduling would cause more failures. llvm-svn: 184680	2013-06-24 01:55:01 +00:00
David Blaikie	5acff7e691	DebugInfo: PR14404: Avoid truncating 64 bit values into 32 bits for ULEB128/SLEB128 generation llvm-svn: 184669	2013-06-23 18:31:11 +00:00
David Blaikie	2b380232c3	DebugInfo: Support (using GNU extensions) for template template parameters and parameter packs llvm-svn: 184643	2013-06-22 18:59:11 +00:00
Chad Rosier	295bd43adb	The getRegForInlineAsmConstraint function should only accept MVT value types. llvm-svn: 184642	2013-06-22 18:37:38 +00:00
Andrew Trick	cbd7305d09	Prevent LiveRangeEdit from deleting bundled instructions. We have no targets on trunk that bundle before regalloc. However, we have been advertising regalloc as bundle safe for use with out-of-tree targets. We need to at least contain the parts of the code that are still unsafe. llvm-svn: 184620	2013-06-22 00:33:48 +00:00
David Blaikie	97c6c5bd98	DebugInfo: Don't lose unreferenced non-trivial by-value parameters A FastISel optimization was causing us to emit no information for such parameters & when they go missing we end up emitting a different function type. By avoiding that shortcut we not only get types correct (very important) but also location information (handy) - even if it's only live at the start of a function & may be clobbered later. Reviewed/discussion by Evan Cheng & Dan Gohman. llvm-svn: 184604	2013-06-21 22:56:30 +00:00
Michael Liao	62ebfd8786	Fix PR16360 When (srl (anyextend x), c) is folded into (anyextend (srl x, c)), the high bits are not cleared. Add 'and' to clear off them. llvm-svn: 184575	2013-06-21 18:45:27 +00:00
Andrew Trick	5749b8be01	Update physreg live intervals during remat. llvm-svn: 184574	2013-06-21 18:33:26 +00:00
Andrew Trick	8d02e917f4	Added -precompute-phys-liveness for testing LiveIntervals updates. llvm-svn: 184573	2013-06-21 18:33:23 +00:00
Andrew Trick	6b9c49a275	Handle more cases in LiveRangeEdit::eliminateDeadDefs. Live intervals for dead physregs may be created during coalescing. We need to update these in the event that their instruction goes away. crash.ll is the unit test that catches it when MI sched is enabled on X86. llvm-svn: 184572	2013-06-21 18:33:20 +00:00
Andrew Trick	530fc1f486	Refactor LiveRangeEdit::eliminateDeadDefs. I want to add logic to handle more cases. llvm-svn: 184571	2013-06-21 18:33:17 +00:00
Andrew Trick	7df3f01703	whitespace llvm-svn: 184570	2013-06-21 18:33:14 +00:00
Andrew Trick	714aec021d	Fix a -join-globalcopies bug; handle undef operands. llvm-svn: 184569	2013-06-21 18:33:11 +00:00
Andrew Trick	75961ecc1a	Modify the -join-globalcopies option (off by default). Always coalesce in forward order to propagate rematerialization. I'm fixing this option so I can enable it by default soon. llvm-svn: 184568	2013-06-21 18:33:09 +00:00
Andrew Trick	3a851a27b8	Make rematerialization in the coalescer less sensitive to LRG order. llvm-svn: 184567	2013-06-21 18:33:06 +00:00
Andrew Trick	b55db58edf	MI-Sched: cleanup DEBUG output. llvm-svn: 184565	2013-06-21 18:33:01 +00:00
Andrew Trick	736dd9a255	MI-Sched: Adjust regpressure limits for reserved regs. llvm-svn: 184564	2013-06-21 18:32:58 +00:00
David Blaikie	ea2605dc1a	DebugInfo: don't use location lists when the location covers the whole function anyway Fix up three tests - one that was relying on abbreviation number, another relying on a location list in this case (& testing raw asm, changed that to use dwarfdump on the debug_info now that that's where the location is), and another which was added in r184368 - exposing a bug in that fix that is exposed when we emit the location inline rather than through a location list. Fix that bug while I'm here. llvm-svn: 184387	2013-06-20 00:25:24 +00:00
Bill Wendling	7ed411062b	This is now a duplicate. llvm-svn: 184376	2013-06-19 22:16:33 +00:00
Bill Wendling	cce21cc347	Make the comparison operators non-member functions. llvm-svn: 184373	2013-06-19 22:09:25 +00:00
David Blaikie	81a4dc75e9	DebugInfo: PR14763/r183329 correct the location of indirect parameters We had been papering over a problem with location info for non-trivial types passed by value by emitting their type as references (this caused the debugger to interpret the location information correctly, but broke the type of the function). r183329 corrected the type information but lead to the debugger interpreting the pointer parameter as the value - the debug info describing the location needed an extra dereference. Use a new flag in DIVariable to add the extra indirection (either by promoting an existing DW_OP_reg (parameter passed in a register) to DW_OP_breg + 0 or by adding DW_OP_deref to an existing DW_OP_breg + n (parameter passed on the stack). llvm-svn: 184368	2013-06-19 21:55:13 +00:00
Bill Wendling	a3cd350249	Access the TargetLoweringInfo from the TargetMachine object instead of caching it. The TLI may change between functions. No functionality change. llvm-svn: 184360	2013-06-19 21:36:55 +00:00
Bill Wendling	7a639ea2a4	Access the TargetLoweringInfo from the TargetMachine object instead of caching it. The TLI may change between functions. No functionality change. llvm-svn: 184352	2013-06-19 21:07:11 +00:00
Bill Wendling	afc1036f3e	Access the TargetLoweringInfo from the TargetMachine object instead of caching it. The TLI may change between functions. No functionality change. llvm-svn: 184349	2013-06-19 20:51:24 +00:00
Bill Wendling	0ccf31007f	Don't cache the TLI object since we have access to it through TargetMachine already. llvm-svn: 184346	2013-06-19 20:32:16 +00:00
Quentin Colombet	b51a68681a	During SelectionDAG building explicitly set a node to constant zero when the value is zero. This allows optmizations to kick in more easily. Fix some test cases so that they remain meaningful (i.e., not completely dead coded) when optimizations apply. <rdar://problem/14096009> superfluous multiply by high part of zero-extended value. llvm-svn: 184222	2013-06-18 20:14:39 +00:00
Timur Iskhodzhanov	302ba2314a	Hopefully fix the MSVS build after r184105 llvm-svn: 184178	2013-06-18 07:57:22 +00:00
Bill Wendling	bc07a8900c	Use pointers to the MCAsmInfo and MCRegInfo. Someone may want to do something crazy, like replace these objects if they change or something. No functionality change intended. llvm-svn: 184175	2013-06-18 07:20:20 +00:00
Bill Wendling	551a6775db	Simplify some of the code. No functionality change. llvm-svn: 184172	2013-06-18 06:07:26 +00:00
Andrew Trick	5d4861867a	MI-Sched: handle ReadAdvance latencies as used by Swift. llvm-svn: 184135	2013-06-17 21:45:18 +00:00
Andrew Trick	71f08a3e74	Give RegMax higher priority. llvm-svn: 184133	2013-06-17 21:45:13 +00:00
Andrew Trick	3c3a40e4c6	Remove compareRPDelta. A complex, expensive heuristic with little value in the current design. llvm-svn: 184132	2013-06-17 21:45:11 +00:00
Andrew Trick	d31db4b45d	MI Sched: fix a typo in RegPressure heuristics. llvm-svn: 184131	2013-06-17 21:45:09 +00:00
Andrew Trick	7e63046ce9	MI-Sched: Remove another heuristic that is sensitive to queue order. llvm-svn: 184130	2013-06-17 21:45:07 +00:00
Andrew Trick	d40d0f2c1b	MI-Sched: Track multiple candidates with the same priority level. This eliminates the MultiPressure scheduling "reason". It was sensitive to queue order. We don't like being sensitive to queue order. llvm-svn: 184129	2013-06-17 21:45:05 +00:00
Bill Wendling	626c991ce9	Directly access objects which may change during compilation. llvm-svn: 184121	2013-06-17 20:41:25 +00:00
Benjamin Kramer	e2a1d89e14	Switch spill weights from a basic loop depth estimation to BlockFrequencyInfo. The main advantages here are way better heuristics, taking into account not just loop depth but also __builtin_expect and other static heuristics and will eventually learn how to use profile info. Most of the work in this patch is pushing the MachineBlockFrequencyInfo analysis into the right places. This is good for a 5% speedup on zlib's deflate (x86_64), there were some very unfortunate spilling decisions in its hottest loop in longest_match(). Other benchmarks I tried were mostly neutral. This changes register allocation in subtle ways, update the tests for it. 2012-02-20-MachineCPBug.ll was deleted as it's very fragile and the instruction it looked for was gone already (but the FileCheck pattern picked up unrelated stuff). llvm-svn: 184105	2013-06-17 19:00:36 +00:00
David Blaikie	b735b4d6db	DebugInfo: remove target-specific Frame Index handling for DBG_VALUE MachineInstrs Frame index handling is now target-agnostic, so delete the target hooks for creation & asm printing of target-specific addressing in DBG_VALUEs and any related functions. llvm-svn: 184067	2013-06-16 20:34:27 +00:00
David Blaikie	0252265be0	Debug Info: Simplify Frame Index handling in DBG_VALUE Machine Instructions Rather than using the full power of target-specific addressing modes in DBG_VALUEs with Frame Indicies, simply use Frame Index + Offset. This reduces the complexity of debug info handling down to two representations of values (reg+offset and frame index+offset) rather than three or four. Ideally we could ensure that frame indicies had been eliminated by the time we reached an assembly or dwarf generation, but I haven't spent the time to figure out where the FIs are leaking through into that & whether there's a good place to convert them. Some FI+offset=>reg+offset conversion is done (see PrologEpilogInserter, for example) which is necessary for some SelectionDAG assumptions about registers, I believe, but it might be possible to make this a more thorough conversion & ensure there are no remaining FIs no matter how instruction selection is performed. llvm-svn: 184066	2013-06-16 20:34:15 +00:00
David Blaikie	c9380db4ba	DebugInfo: print DBG_VALUE MachineInstrs with [] for deref and drop the offset when it's zero llvm-svn: 184045	2013-06-15 15:52:58 +00:00
Andrew Trick	8e8415f5ab	Missing NDEBUGs. llvm-svn: 184039	2013-06-15 05:46:47 +00:00
Andrew Trick	f78e7fa140	MI-Sched: heuristics using the new latency and machine model. llvm-svn: 184038	2013-06-15 05:39:19 +00:00
Andrew Trick	de2109eb4c	Machine Model: Add MicroOpBufferSize and resource BufferSize. Replace the ill-defined MinLatency and ILPWindow properties with with straightforward buffer sizes: MCSchedMode::MicroOpBufferSize MCProcResourceDesc::BufferSize These can be used to more precisely model instruction execution if desired. Disabled some misched tests temporarily. They'll be reenabled in a few commits. llvm-svn: 184032	2013-06-15 04:49:57 +00:00
Andrew Trick	e2ff575bb8	MI-Sched: Rename IssueCount to CurrMOps. "Counts" refer to scaled resource counts within a region. CurrMOps is simply the number of micro-ops to be issue in the current cycle. llvm-svn: 184031	2013-06-15 04:49:49 +00:00
Andrew Trick	0cd8afc547	MI-Sched: Remove the temporary EnableCopyConstrain flag. llvm-svn: 184030	2013-06-15 04:49:46 +00:00
Andrew Trick	f5b8ef2690	MI-Sched: added tracking of dependent latency for better heuristics. Heuristics compare the critical path in the scheduled code, called ExpectedLatency, with the latency of instructions remaining to be scheduled. There are two ways to look at remaining latency: (1) Dependent latency includes the latency between unscheduled and scheduled instructions. (2) Independent latency is simply the height (bottom-up) or depth (top-down) of instructions currently in the ready Q. llvm-svn: 184029	2013-06-15 04:49:44 +00:00
Andrew Trick	5580e5ca80	MI-Sched: DEBUG: print critical resource. llvm-svn: 184028	2013-06-15 04:49:42 +00:00
Stephen Lin	605207fe75	SelectionDAG: slightly refactor DAGCombiner::visitSELECT_CC to avoid redudant checks... This doesn't really effect performance due to all the relevant calls being transparent but is clearer. llvm-svn: 184027	2013-06-15 04:03:33 +00:00
David Blaikie	2dbebcfede	Debug Info: Don't print the display name and colon prefix for DEBUG_VALUE comments if the display name is empty llvm-svn: 184026	2013-06-15 00:33:47 +00:00
Matt Arsenault	d2f0332a29	Introduce getSelect usage and use more getSelectCC llvm-svn: 184012	2013-06-14 22:04:37 +00:00
Stephen Lin	4e69d01b67	SelectionDAG: minor fix to order of operands in comments to match the code llvm-svn: 184008	2013-06-14 21:33:58 +00:00
Tim Northover	059cead5ed	Mark rematerialized super/sub registers as dead. When we're rematerializing into a not-quite-right register we already add the real definition as an imp-def, but we should also be marking the "official" register as dead, since nothing else is going to use it as a result of this remat. Not doing this can affect pressure tracking. rdar://problem/14158833 llvm-svn: 184002	2013-06-14 20:22:21 +00:00
Stephen Lin	e31f2d2d54	SelectionDAG: Fix incorrect condition checks in some cases of folding FADD/FMUL combinations; also improve accuracy of comments llvm-svn: 183993	2013-06-14 18:17:35 +00:00
Derek Schuff	1804b5795a	Make PrologEpilogInserter save/restore all callee saved registers in functions which call __builtin_unwind_init() __builtin_unwind_init() is an undocumented gcc intrinsic which has this effect, and is used in libgcc_eh. Goes part of the way toward fixing PR8541. llvm-svn: 183984	2013-06-14 16:15:29 +00:00
Jakub Staszak	80df8b837f	Move #include from .h to .cpp file. llvm-svn: 183960	2013-06-14 00:00:13 +00:00
Dan Gohman	6a0c9ae4f9	Print ConstantPool entries initialized to Values with WriteAsOperand instead of operator<< so that functions are printed as just their name instead of as their entire definition, which is excessively verbose in this context. llvm-svn: 183871	2013-06-12 22:19:19 +00:00
Eric Christopher	773c155c15	Formatting. llvm-svn: 183807	2013-06-11 23:41:38 +00:00
Eric Christopher	a2f806fd20	Remove a few fixmes, the only work we're doing is getting the string to return and this is done all over. llvm-svn: 183704	2013-06-10 22:24:10 +00:00
Eric Christopher	2280c5baa6	Fix up comment. llvm-svn: 183703	2013-06-10 22:24:07 +00:00
Eric Christopher	8c47bb1a02	Remove unused function. llvm-svn: 183698	2013-06-10 21:59:28 +00:00
Eric Christopher	b7e3b42c01	IndentCount is only used within NDEBUG code. llvm-svn: 183695	2013-06-10 20:58:53 +00:00
David Majnemer	0fc8670cb0	TargetLowering: Clean up method description comments llvm-svn: 183623	2013-06-08 23:51:45 +00:00
Quentin Colombet	249cb6756c	Reapply r183552. This time, use a standard type for the option to avoid template instantiation issue with non-standard type. Add a backend option to warn on a given stack size limit. Option: -mllvm -warn-stack-size=<limit> Output (if limit is exceeded): warning: Stack size limit exceeded (<actual size>) in <functionName>. The longer term plan is to hook that to a clang warning. PR:4072 <rdar://problem/13987214>. llvm-svn: 183595	2013-06-08 00:07:54 +00:00
Quentin Colombet	bd5a201c85	Revert commits related to stack warning. llvm-svn: 183579	2013-06-07 22:14:50 +00:00
Quentin Colombet	6baf581b93	Add a backend option to warn on a given stack size limit. Option: -mllvm -warn-stack-size=<limit> Output (if limit is exceeded): warning: Stack size limit exceeded (<actual size>) in <functionName>. The longer term plan is to hook that to a clang warning. PR:4072 <rdar://problem/13987214> llvm-svn: 183552	2013-06-07 20:18:12 +00:00
Quentin Colombet	ba366011c8	Teach AsmPrinter how to print odd constants. Fix an assertion when the compiler encounters big constants whose bit width is not a multiple of 64-bits. Although clang would never generate something like this, the backend should be able to handle any legal IR. <rdar://problem/13363576> llvm-svn: 183544	2013-06-07 18:36:03 +00:00
Rafael Espindola	aad6c24422	Support OpenBSD's native frame protection conventions. OpenBSD's stack smashing protection differs slightly from other platforms: 1. The smash handler function is "__stack_smash_handler(const char *funcname)" instead of "__stack_chk_fail(void)". 2. There's a hidden "long __guard_local" object that gets linked into each executable and DSO. Patch by Matthew Dempsky. llvm-svn: 183533	2013-06-07 16:35:57 +00:00
Duncan Sands	e0a607e376	Correct wrong register in this example, pointed out by Baoshan Pang. llvm-svn: 183495	2013-06-07 08:30:55 +00:00
David Blaikie	b272a75328	Fix break in r183446 - helps to increment the iterator in a loop llvm-svn: 183454	2013-06-06 22:28:26 +00:00
David Blaikie	36d5d2f08d	Debug Info: simplify parameter ordering preservation Seems we emit the parameter ordering number (spuriously named 'arg number') in the debug info, so there's no need to search through the variable list to figure out the parameter ordering. This implementation does 'always' do the work, even in non-optimized debug info (the previous implementation checked the existence of the 'variables' list on the subprogram which is only present in optimized builds). No intended functionality change. llvm-svn: 183446	2013-06-06 21:04:51 +00:00
Bill Wendling	f77190855d	Cache the TargetLowering info object as a pointer. Caching it as a pointer allows us to reset it if the TargetMachine object changes. llvm-svn: 183361	2013-06-06 00:43:09 +00:00
Bill Wendling	8db01cb262	Don't cache the TargetLoweringInfo object inside of the FunctionLowering object. The TargetLoweringInfo object is owned by the TargetMachine. In the future, the TargetMachine object may change, which may also change the TargetLoweringInfo object. llvm-svn: 183356	2013-06-06 00:11:39 +00:00
Bill Wendling	6fd15ca02e	Add space to assert message. llvm-svn: 183346	2013-06-05 23:13:26 +00:00
David Blaikie	6f1a8067fb	PR15662: Optimized debug info produces out of order function parameters When a function is inlined we lazily construct the variables representing the function's parameters. After that, we add any remaining unused parameters. If the function doesn't use all the parameters, or uses them out of order, then the DWARF would produce them in that order, producing a parameter order that doesn't match the source. This fix causes us to always keep the arg variables at the start of the variable list & in the original order from the source. llvm-svn: 183297	2013-06-05 05:39:59 +00:00
Shuxin Yang	8b8fd2171c	Fix a defect in code-layout pass, improving Benchmarks/Olden/em3d/em3d by about 30% (4.58s vs 3.2s on an oldish Mac Tower). The corresponding src is excerpted bellow. The lopp accounts for about 90% of execution time. -------------------- cat -n test-suite/MultiSource/Benchmarks/Olden/em3d/make_graph.c 90 91 for (k=0; k<j; k++) 92 if (other_node == cur_node->to_nodes[k]) break; The defective layout is sketched bellow, where the two branches need to swap. ------------------------------------------------------------------------ L: ... if (cond) goto out-of-loop goto L While this code sequence is defective, I don't understand why it incurs 1/3 of execution time. CPU-event-profiling indicates the poor laoyout dose not increase in br-misprediction; it dosen't increase stall cycle at all, and it dosen't prevent the CPU detect the loop (i.e. Loop-Stream-Detector seems to be working fine as well)... The root cause of the problem is that the layout pass calls AnalyzeBranch() with basic-block which is not updated to reflect its current layout. rdar://13966341 llvm-svn: 183174	2013-06-04 01:00:57 +00:00
Benjamin Kramer	7c275640e7	Move getRealLinkageName to a common place and remove all the duplicates of it. Also simplify code a bit while there. No functionality change. llvm-svn: 183076	2013-06-01 17:51:14 +00:00
Eric Christopher	65ac02ad78	Const-ify some printing and dumping code for DIEValues. llvm-svn: 183057	2013-05-31 22:50:40 +00:00
Quentin Colombet	bf490d4a32	Loop Strength Reduce: Scaling factor cost. Account for the cost of scaling factor in Loop Strength Reduce when rating the formulae. This uses a target hook. The default implementation of the hook is: if the addressing mode is legal, the scaling factor is free. <rdar://problem/13806271> llvm-svn: 183045	2013-05-31 21:29:03 +00:00
Kai Nacke	e1823b6b85	Remove useless code from transitioning to new EH scheme Removes all uses of the variable UsesNewEH. Simply return false in case that no resume instructions were found. llvm-svn: 183016	2013-05-31 16:30:36 +00:00
Tim Northover	69cd121dd9	Fix rematerialization into physical registers. r182872 introduced a bug in how the register-coalescer's rematerialization handled defining a physical register. It relied on the output of the coalescer's setRegisters method to determine whether the replacement instruction needed an implicit-def. However, this value isn't necessarily the same as the CopyMI's actual destination register which is what the rest of the basic-block expects us to be defining. The commit changes the rematerializer to use the actual register attached to CopyMI in its decision. This will be tested soon by an X86 patch which moves everything to using MOV32r0 instead of other sizes. llvm-svn: 182925	2013-05-30 12:30:50 +00:00
Eric Christopher	d1c5a31721	Rename variable to be more descriptive. llvm-svn: 182903	2013-05-30 00:43:35 +00:00
Eric Christopher	1e1c7f1b15	Formatting. llvm-svn: 182902	2013-05-30 00:43:32 +00:00
Andrew Trick	ad6d08ac6f	Order CALLSEQ_START and CALLSEQ_END nodes. Fixes PR16146: gdb.base__call-ar-st.exp fails after pre-RA-sched=source fixes. Patch by Xiaoyi Guo! This also fixes an unsupported dbg.value test case. Codegen was previously incorrect but the test was passing by luck. llvm-svn: 182885	2013-05-29 22:03:55 +00:00
Bill Wendling	70b1400e6d	Don't reach into the middle of TargetMachine and cache one of its ivars. Not only does this break encapsulation, it's gross. llvm-svn: 182876	2013-05-29 20:37:19 +00:00
Tim Northover	b65f6b0820	Teach ReMaterialization to be more cunning about subregisters This allows rematerialization during register coalescing to handle more cases involving operations like SUBREG_TO_REG which might need to be rematerialized using sub-register indices. For example, code like: v1(GPR64):sub_32 = MOVZ something v2(GPR64) = COPY v1(GPR64) should be convertable to: v2(GPR64):sub_32 = MOVZ something but previously we just gave up in places like this llvm-svn: 182872	2013-05-29 19:32:06 +00:00
Manman Ren	4213c39e3c	LTO+Debug Info: revert r182791. Since the testing case uses ref_addr, which requires version 3+ to work, we will solve the dwarf version issue first. This patch also causes failures in one of the bots. I will update the patch accordingly in my next attempt. rdar://13926659 llvm-svn: 182867	2013-05-29 17:16:59 +00:00
Manman Ren	b5b5453e61	LTO+Debug Info: correctly emit inlined_subroutine when the inlined callee is from a different CU. We used to print out an error message and fail to generate inlined_subroutine. If we use ref_addr in the generated DWARF, the DWARF version should be 3 or above. rdar://13926659 llvm-svn: 182791	2013-05-28 19:01:58 +00:00
Benjamin Kramer	262b154247	Simplify code. No functionality change. llvm-svn: 182779	2013-05-28 16:39:36 +00:00
Benjamin Kramer	351d53c225	Remove double semicolons. llvm-svn: 182778	2013-05-28 16:31:26 +00:00
Preston Gurd	048f99de11	Convert sqrt functions into sqrt instructions when -ffast-math is in effect. When -ffast-math is in effect (on Linux, at least), clang defines __FINITE_MATH_ONLY__ > 0 when including <math.h>. This causes the preprocessor to include <bits/math-finite.h>, which renames the sqrt functions. For instance, "sqrt" is renamed as "__sqrt_finite". This patch adds the 3 new names in such a way that they will be treated as equivalent to their respective original names. llvm-svn: 182739	2013-05-27 15:44:35 +00:00
Andrew Trick	c66d26adf0	Fix PR16143: Insert DEBUG_VALUE before terminator. llvm-svn: 182717	2013-05-26 08:58:50 +00:00
Andrew Trick	e2431c64bc	Track IR ordering of SelectionDAG nodes 3/4. Remove the old IR ordering mechanism and switch to new one. Fix unit test failures. llvm-svn: 182704	2013-05-25 03:08:10 +00:00
Andrew Trick	ef9de2a739	Track IR ordering of SelectionDAG nodes 2/4. Change SelectionDAG::getXXXNode() interfaces as well as call sites of these functions to pass in SDLoc instead of DebugLoc. llvm-svn: 182703	2013-05-25 02:42:55 +00:00
Andrew Trick	175143bf88	Track IR ordering of SelectionDAG nodes 1/4. Use a field in the SelectionDAGNode object to track its IR ordering. This adds fields and utility classes without changing existing interfaces or functionality. llvm-svn: 182701	2013-05-25 02:20:36 +00:00
Michael J. Spencer	df1ecbd734	Replace Count{Leading,Trailing}Zeros_{32,64} with count{Leading,Trailing}Zeros. llvm-svn: 182680	2013-05-24 22:23:49 +00:00
Jakob Stoklund Olesen	43711c51ec	Fix PR16110: Handle DBG_VALUE in ConnectedVNInfoEqClasses::Distribute(). Now that the LiveDebugVariables pass is running after register coalescing, the ConnectedVNInfoEqClasses class needs to deal with DBG_VALUE instructions. This only comes up when rematerialization during coalescing causes the remaining live range of a virtual register to separate into two connected components. llvm-svn: 182592	2013-05-23 17:02:23 +00:00
Benjamin Kramer	bc6666bedf	InlineSpiller: Store bucket pointers instead of iterators. Lets us use a SetVector instead of an explicit set + vector combination. llvm-svn: 182586	2013-05-23 15:42:57 +00:00
David Blaikie	5174c84add	Solidify the assumption that a DW_TAG_subprogram's type is a DW_TAG_subroutine_type There were bits & pieces of code lying around that may've given the impression that debug info metadata supported the possibility that a subprogram's type could be specified by a non-subroutine type describing the return type of a void function. This support was incomplete & unnecessary. Asserts & API have been changed to make the desired usage more clear. llvm-svn: 182532	2013-05-22 23:22:18 +00:00
Chad Rosier	abdb1d69ab	Simplify logic now that r182490 is in place. No functional change intended. llvm-svn: 182531	2013-05-22 23:17:36 +00:00
Chad Rosier	682ae15bb9	Simplify logic now that r182490 is in place. No functional change intended. llvm-svn: 182527	2013-05-22 22:36:55 +00:00
Chad Rosier	c7505ef8ba	Simplify logic now that r182490 is in place. No functional change intended. llvm-svn: 182526	2013-05-22 22:26:05 +00:00
Adrian Prantl	0d1e5592a6	Unify formatting of debug output. llvm-svn: 182495	2013-05-22 18:02:19 +00:00
Justin Holewinski	fff1f5f5e2	Drop @llvm.annotation and @llvm.ptr.annotation intrinsics during codegen. The intrinsic calls are dropped, but the annotated value is propagated. Fixes PR 15253 Original patch by Zeng Bin! llvm-svn: 182387	2013-05-21 14:37:16 +00:00
Benjamin Kramer	8aaf197990	DAGCombine: Avoid an edge case where it tried to create an i0 type for (x & 0) == 0. Fixes PR16083. llvm-svn: 182357	2013-05-21 08:51:09 +00:00
Manman Ren	9d4c735885	Dwarf: use a single line table to generate assembly when .loc is used. This is to fix PR15408 where an undefined symbol Lline_table_start1 is used. Since we do not generate the debug_line section when .loc is used, Lline_table_start1 is not emitted and we can't refer to it when calculating at_stmt_list for a compile unit. llvm-svn: 182344	2013-05-21 00:57:22 +00:00
David Blaikie	e63d5d1633	PR14606: Debug Info for namespace aliases/DW_TAG_imported_module This resolves the last of the PR14606 failures in the GDB 7.5 test suite by implementing an optional name field for DW_TAG_imported_modules/DIImportedEntities and using that to implement C++ namespace aliases (eg: "namespace X = Y;"). llvm-svn: 182328	2013-05-20 22:50:35 +00:00
Bill Wendling	eda5418e89	The DWARF EH pass doesn't need the TargetMachine, only the TargetLoweringBase like the other EH passes. llvm-svn: 182321	2013-05-20 21:54:18 +00:00
Bill Wendling	47447589c9	No need to store the TargetMachine variable in this class. llvm-svn: 182317	2013-05-20 21:28:28 +00:00
Matt Arsenault	75865923c9	Add LLVMContext argument to getSetCCResultType llvm-svn: 182180	2013-05-18 00:21:46 +00:00
Matt Arsenault	04126234e5	Replace redundant code Use EVT::changeExtendedVectorElementTypeToInteger instead of doing the same thing that it does llvm-svn: 182165	2013-05-17 21:43:43 +00:00
Matt Arsenault	52ddb7bcdd	Add missing -- C++ -- to headers llvm-svn: 182164	2013-05-17 21:43:39 +00:00
Adrian Prantl	9c93059aa4	Generate debug info for by-value struct args even if they are not used. radar://problem/13865940 llvm-svn: 182062	2013-05-16 23:44:12 +00:00
Rafael Espindola	b08d2c2db0	Remove addFrameMove. Now that we have good testing, remove addFrameMove and create cfi instructions directly. llvm-svn: 182052	2013-05-16 21:02:15 +00:00
Benjamin Kramer	fc88c3761f	DAGCombine: Also shrink eq compares where the constant is exactly as large as the smaller type. if ((x & 255) == 255) before: movzbl %al, %eax cmpl $255, %eax after: cmpb $-1, %al llvm-svn: 182038	2013-05-16 18:47:58 +00:00
Jakob Stoklund Olesen	9ae96c7aab	Add TargetRegisterInfo::getCoveringLanes(). This lane mask provides information about which register lanes completely cover super-registers. See the block comment before getCoveringLanes(). llvm-svn: 182034	2013-05-16 18:03:08 +00:00
Hal Finkel	1f6a7f53d8	Fix legalization of SETCC with promoted integer intrinsics If the input operands to SETCC are promoted, we need to make sure that we either use the promoted form of both operands (or neither); a mixture is not allowed. This can happen, for example, if a target has a custom promoted i1-returning intrinsic (where i1 is not a legal type). In this case, we need to use the promoted form of both operands. This change only augments the behavior of the existing logic in the case where the input types (which may or may not have already been legalized) disagree, and should not affect existing target code because this case would otherwise cause an assert in the SETCC operand promotion code. This will be covered by (essentially all of the) tests for the new PPCCTRLoops infrastructure. llvm-svn: 181926	2013-05-15 21:37:27 +00:00
Derek Schuff	d2c42d766d	Fix miscompile due to StackColoring incorrectly merging stack slots (PR15707) IR optimisation passes can result in a basic block that contains: llvm.lifetime.start(%buf) ... llvm.lifetime.end(%buf) ... llvm.lifetime.start(%buf) Before this change, calculateLiveIntervals() was ignoring the second lifetime.start() and was regarding %buf as being dead from the lifetime.end() through to the end of the basic block. This can cause StackColoring to incorrectly merge %buf with another stack slot. Fix by removing the incorrect Starts[pos].isValid() and Finishes[pos].isValid() checks. Just doing: Starts[pos] = Indexes->getMBBStartIdx(MBB); Finishes[pos] = Indexes->getMBBEndIdx(MBB); unconditionally would be enough to fix the bug, but it causes some test failures due to stack slots not being merged when they were before. So, in order to keep the existing tests passing, treat LiveIn and LiveOut separately rather than approximating the live ranges by merging LiveIn and LiveOut. This fixes PR15707. Patch by Mark Seaborn. llvm-svn: 181922	2013-05-15 21:15:09 +00:00
David Blaikie	041f1aa3e2	Use only explicit bool conversion operators BitVector/SmallBitVector::reference::operator bool remain implicit since they model more exactly a bool, rather than something else that can be boolean tested. The most common (non-buggy) case are where such objects are used as return expressions in bool-returning functions or as boolean function arguments. In those cases I've used (& added if necessary) a named function to provide the equivalent (or sometimes negative, depending on convenient wording) test. One behavior change (YAMLParser) was made, though no test case is included as I'm not sure how to reach that code path. Essentially any comparison of llvm::yaml::document_iterators would be invalid if neither iterator was at the end. This helped uncover a couple of bugs in Clang - test cases provided for those in a separate commit along with similar changes to `operator bool` instances in Clang. llvm-svn: 181868	2013-05-15 07:36:59 +00:00
Eric Christopher	8fd7ab07ca	Make getCompileUnit non-const and return the current DIE if it happens to be a compile unit. Noticed on inspection and tested via calling on a newly created compile unit. No functional change. llvm-svn: 181835	2013-05-14 21:33:10 +00:00
Kai Nacke	9a224ced0f	Add bitcast to store of personality function. The personality function is user defined and may have an arbitrary result type. The code assumes always i8. This results in an assertion failure if a different type is used. A bitcast to i8 is added to prevent this failure. Reviewed by: Renato Golin, Bob Wilson llvm-svn: 181802	2013-05-14 16:30:51 +00:00
Akira Hatanaka	1f24e6a6a2	StackColoring: don't clear an instruction's mem operand if the underlying object is a PseudoSourceValue and PseudoSourceValue::isConstant returns true (i.e., points to memory that has a constant value). llvm-svn: 181751	2013-05-14 01:42:44 +00:00
David Blaikie	7b770c6aed	Assert that DIEEntries are constructed with non-null DIEs This just brings a crash a little further forward from DWARF emission to DIE construction to make errors easier to diagnose. llvm-svn: 181748	2013-05-14 00:35:19 +00:00
Bob Wilson	c5c0823724	Remove redundant variable introduced by r181682. llvm-svn: 181721	2013-05-13 19:02:31 +00:00
Hao Liu	bc60196951	Fix PR15950 A bug in DAG Combiner about undef mask llvm-svn: 181682	2013-05-13 02:07:05 +00:00
Rafael Espindola	227144c23c	Remove the MachineMove class. It was just a less powerful and more confusing version of MCCFIInstruction. A side effect is that, since MCCFIInstruction uses dwarf register numbers, calls to getDwarfRegNum are pushed out, which should allow further simplifications. I left the MachineModuleInfo::addFrameMove interface unchanged since this patch was already fairly big. llvm-svn: 181680	2013-05-13 01:16:13 +00:00
Benjamin Kramer	63e39eb09d	StringRefize some debug accel table bits. llvm-svn: 181663	2013-05-11 18:24:28 +00:00
Rafael Espindola	1b09836bc3	Change getFrameMoves to return a const reference. To add a frame now there is a dedicated addFrameMove which also takes care of constructing the move itself. llvm-svn: 181657	2013-05-11 02:38:11 +00:00
Rafael Espindola	639890222e	Remove more dead code. llvm-svn: 181656	2013-05-11 02:24:41 +00:00
David Blaikie	a1e813dcd4	PR14492: Debug Info: Support for values of non-integer non-type template parameters. This is only tested for global variables at the moment (& includes tests for the unnamed parameter case, since apparently this entire function was completely untested previously) llvm-svn: 181632	2013-05-10 21:52:07 +00:00
Benjamin Kramer	a5d59333b3	DAGCombiner: Generate a correct constant for vector types when folding (xor (and)) into (and (not)). PR15948. llvm-svn: 181597	2013-05-10 14:09:52 +00:00
Owen Anderson	32baf99b1d	Teach SelectionDAG to constant fold all-constant FMA nodes the same way that it constant folds FADD, FMUL, etc. llvm-svn: 181555	2013-05-09 22:27:13 +00:00
Eric Christopher	f20ff979e9	Revert "Make sure debug info contains linkage names (DW_AT_MIPS_linkage_name)" temporarily while investigating gdb.cp/templates.exp. This reverts commit r181471. llvm-svn: 181496	2013-05-09 00:42:33 +00:00
Eric Christopher	697fa1c8be	Make sure debug info contains linkage names (DW_AT_MIPS_linkage_name) for constructors and destructors since the original declaration given by the AT_specification both won't and can't. Patch by Yacine Belkadi, I've cleaned up the testcases. llvm-svn: 181471	2013-05-08 21:23:22 +00:00
David Majnemer	386ab7f872	DAGCombiner: Simplify inverted bit tests Fold (xor (and x, y), y) -> (and (not x), y) This removes an opportunity for a constant to appear twice. llvm-svn: 181395	2013-05-08 06:44:42 +00:00
David Blaikie	3b6038b6f3	Debug Info: Support DW_TAG_imported_declaration This provides basic functionality for imported declarations. For subprograms and types some amount of lazy construction is supported (so the definition of a function can proceed the using declaration), but it still doesn't handle declared-but-not-defined functions (since we don't generally emit function declarations). Variable support is really rudimentary at the moment - simply looking up the existing definition with no support for out of order (declaration, imported_module, then definition). llvm-svn: 181392	2013-05-08 06:01:41 +00:00
David Blaikie	4dd2de7ae7	Finish renaming constructImportedModuleDIE to constructImportedEntityDIE llvm-svn: 181391	2013-05-08 06:01:38 +00:00
Eric Christopher	c57baeeee0	Pass the MDNode in and do the insertion at compile unit creation time instead of relying upon an extra call to finish initializing. llvm-svn: 181383	2013-05-08 00:58:51 +00:00
Eric Christopher	6156011ee8	Typo. llvm-svn: 181378	2013-05-08 00:11:10 +00:00
David Blaikie	1fd4365f91	Rename DIImportedModule to DIImportedEntity and allow imported declarations DIBuilder::createImportedDeclaration isn't fully plumbed through (note, lacking in AsmPrinter/DwarfDebug support) but this seemed like a sufficiently useful division of code to make the subsequent patch(es) easier to follow. llvm-svn: 181364	2013-05-07 21:35:53 +00:00
Rafael Espindola	9a383405a7	Remove exception handling support from the old JIT. llvm-svn: 181354	2013-05-07 20:53:59 +00:00
Matt Arsenault	a5733dc97e	Fix vselect when getSetCCResultType returns a different type from the operands llvm-svn: 181348	2013-05-07 20:24:18 +00:00
David Blaikie	6baa776173	Debug Info: Fix for break due to r181271 Apparently we didn't keep an association of Compile Unit metadata nodes to DIEs so looking up that parental context failed & thus caused no DW_TAG_imported_modules to be emitted at the CU scope. Fix this by adding the mapping & sure up the test case to verify this. llvm-svn: 181339	2013-05-07 17:57:13 +00:00
Timur Iskhodzhanov	ec4afe6680	Fix the VS2010 build broken by r181271 llvm-svn: 181296	2013-05-07 07:47:47 +00:00
David Blaikie	684fc5331e	DebugInfo: Support imported modules in lexical blocks llvm-svn: 181271	2013-05-06 23:33:07 +00:00
Eric Christopher	6c6de847a8	Remove unnecessary instance variable and rework logic accordingly. llvm-svn: 181227	2013-05-06 17:50:50 +00:00
Eric Christopher	f0303324be	Grammar. llvm-svn: 181226	2013-05-06 17:50:46 +00:00
Michael Kuperstein	ac868757d0	Fix slightly too aggressive conact_vector optimization. (Would sometimes optimize away conacts used to extend a vector with undef values) llvm-svn: 181186	2013-05-06 08:06:13 +00:00
Evan Cheng	8b8e8d88ff	Teach if-converter to avoid removing BBs whose addresses are takne. rdar://13782395 llvm-svn: 181160	2013-05-05 18:03:49 +00:00
Benjamin Kramer	391f5a6e21	InlineSpiller: Remove quadratic behavior. No functionality change. llvm-svn: 181149	2013-05-05 11:29:14 +00:00
Stepan Dyatkovskiy	8c02c98259	For ARM backend, fixed "byval" attribute support. Now even the small structures could be passed within byval (small enough to be stored in GPRs). In regression tests next function prototypes are checked: PR15293: %artz = type { i32 } define void @foo(%artz* byval %s) define void @foo2(%artz* byval %s, i32 %p, %artz* byval %s2) foo: "s" stored in R0 foo2: "s" stored in R0, "s2" stored in R2. Next AAPCS rules are checked: 5.5 Parameters Passing, C.4 and C.5, "ParamSize" is parameter size in 32bit words: -- NSAA != 0, NCRN < R4 and NCRN+ParamSize > R4. Parameter should be sent to the stack; NCRN := R4. -- NSAA != 0, and NCRN < R4, NCRN+ParamSize < R4. Parameter stored in GPRs; NCRN += ParamSize. llvm-svn: 181148	2013-05-05 07:48:36 +00:00
Dmitri Gribenko	3238fb7595	Add ArrayRef constructor from None, and do the cleanups that this constructor enables Patch by Robert Wilhelm. llvm-svn: 181138	2013-05-05 00:40:33 +00:00
Evan Cheng	f85a76f477	TiedTo flag can now be placed on implicit operands. isTwoAddrUse() should look at all of the operands. Previously it was skipping over implicit operands which cause infinite looping when the two-address pass try to reschedule a two-address instruction below the kill of tied operand. I'm unable to come up with a reasonably sized test case. rdar://13747577 llvm-svn: 180906	2013-05-02 02:07:32 +00:00
Bill Wendling	8f2e6feb8e	Revert r180737. The companion patch was reverted, and this is not relevant right now. llvm-svn: 180889	2013-05-01 22:32:08 +00:00
Filip Pizlo	dec20e43c0	This patch breaks up Wrap.h so that it does not have to include all of the things, and renames it to CBindingWrapping.h. I also moved CBindingWrapping.h into Support/. This new file just contains the macros for defining different wrap/unwrap methods. The calls to those macros, as well as any custom wrap/unwrap definitions (like for array of Values for example), are put into corresponding C++ headers. Doing this required some #include surgery, since some .cpp files relied on the fact that including Wrap.h implicitly caused the inclusion of a bunch of other things. This also now means that the C++ headers will include their corresponding C API headers; for example Value.h must include llvm-c/Core.h. I think this is harmless, since the C API headers contain just external function declarations and some C types, so I don't believe there should be any nasty dependency issues here. llvm-svn: 180881	2013-05-01 20:59:00 +00:00
Chad Rosier	8e4824f350	[inline asm] Return an undef SDValue of the expected value type, rather than report a fatal error. This allows us to continue processing the translation unit. Test case to come on the clang side because we need an inline asm diagnostics handler in place. rdar://13446483 llvm-svn: 180873	2013-05-01 19:49:26 +00:00
Nadav Rotem	e5a2dda372	Optimize away nop CONCAT_VECTOR nodes. Optimize CONCAT_VECTOR nodes that merge EXTRACT_SUBVECTOR values that extract from the same vector. rdar://13402653 PR15866 llvm-svn: 180871	2013-05-01 19:18:51 +00:00
Stephen Lin	699808ceb2	Only pass 'returned' to target-specific lowering code when the value of entire register is guaranteed to be preserved. llvm-svn: 180825	2013-04-30 22:49:28 +00:00
Adrian Prantl	a2888e71eb	Temporarily revert "Change the informal convention of DBG_VALUE so that we can express a" because it breaks some buildbots. This reverts commit 180816. llvm-svn: 180819	2013-04-30 22:35:14 +00:00
Adrian Prantl	9a576644e4	Change the informal convention of DBG_VALUE so that we can express a register-indirect address with an offset of 0. It used to be that a DBG_VALUE is a register-indirect value if the offset (operand 1) is nonzero. The new convention is that a DBG_VALUE is register-indirect if the first operand is a register and the second operand is an immediate. For plain registers use the combination reg, reg. rdar://problem/13658587 llvm-svn: 180816	2013-04-30 22:16:46 +00:00
Andrew Trick	dd77014acc	MI Sched: revert a minor heuristic that snuck in with -misched-vcopy. I'll fix the heuristic in a general way in a follow-up commit. llvm-svn: 180815	2013-04-30 22:10:59 +00:00
Hal Finkel	7153251ab5	LocalStackSlotAllocation improvements First, taking advantage of the fact that the virtual base registers are allocated in order of the local frame offsets, remove the quadratic register-searching behavior. Because of the ordering, we only need to check the last virtual base register created. Second, store the frame index in the FrameRef structure, and get the frame index and the local offset from this structure at the top of the loop iteration. This allows us to de-nest the loops in insertFrameReferenceRegisters (and I think makes the code cleaner). I also moved the needsFrameBaseReg check into the first loop over instructions so that we don't bother pushing FrameRefs for instructions that don't want a virtual base register anyway. Lastly, and this is the only functionality change, avoid the creation of single-use virtual base registers. These are currently not useful because, in general, they end up replacing what would be one r+r instruction with an add and a r+i instruction. Committing this removes the XFAIL in CodeGen/PowerPC/2007-09-07-LoadStoreIdxForms.ll Jim has okayed this off-list. llvm-svn: 180799	2013-04-30 20:04:37 +00:00
Bill Wendling	fb7e32ebd6	Emit the TLS initialization function pointers into the correct section. The `llvm.tls_init_funcs' (created by the front-end) holds pointers to the TLS initialization functions. These need to be placed into the correct section so that they are run before `main()'. <rdar://problem/13733006> llvm-svn: 180737	2013-04-29 22:25:40 +00:00
Andrew Trick	85058af650	Generalize the MachineTraceMetrics public API. Naturally, we should be able to pass in extra instructions, not just extra blocks. llvm-svn: 180667	2013-04-27 03:54:20 +00:00
Eric Christopher	203e12bf9e	Use the target triple from the target machine rather than the module to determine whether or not we're on a darwin platform for debug code emitting. Solves the problem of a module with no triple on the command line and no triple in the module using non-gdb ok features on darwin. Fix up the member-pointers test to check the correct things for cross platform (DW_FORM_flag is a good prefix). Unfortunately no testcase because I have no ideas how to test something without a triple and without a triple in the module yet check precisely on two platforms. Ideas welcome. llvm-svn: 180660	2013-04-27 01:07:52 +00:00
Adrian Prantl	d4c0dd4776	Cleanup and document MachineLocation. Clarify documentation and API to make the difference between register and register-indirect addressed locations more explicit. Put in a comment to point out that with the current implementation we cannot specify a register-indirect location with offset 0 (a breg 0 in DWARF). No functionality change intended. rdar://problem/13658587 llvm-svn: 180641	2013-04-26 21:57:17 +00:00
Bill Wendling	55a9c97c9c	Micro-optimization TLVs probably won't be as common as the other types of variables. Check for them last before defaulting to "DATA". llvm-svn: 180631	2013-04-26 21:15:08 +00:00
Silviu Baranga	af7e8c367f	Re-write the address propagation code for pre-indexed loads/stores to take into account some previously misssed cases (PRE_DEC addressing mode, the offset and base address are swapped, etc). This should fix PR15581. llvm-svn: 180609	2013-04-26 15:52:24 +00:00
Benjamin Kramer	d56ffc709d	DAGCombiner: Canonicalize vector integer abs in the same way we do it for scalars. This already helps SSE2 x86 a lot because it lacks an efficient way to represent a vector select. The long term goal is to enable the backend to match a canonicalized pattern into a single instruction (e.g. vabs or pabs). llvm-svn: 180597	2013-04-26 09:19:19 +00:00
Reid Kleckner	d973ca3c51	[mc-coff] Forward Linker Option flags into the .drectve section Summary: This is modelled on the Mach-O linker options implementation and should support a Clang implementation of #pragma comment(lib/linker). Reviewers: rafael CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D724 llvm-svn: 180569	2013-04-25 19:34:41 +00:00
Silviu Baranga	4ad2bc5963	Fix constant folding for one lane vector types. Constant folding one lane vector types not returns a vector instead of a scalar. llvm-svn: 180254	2013-04-25 09:32:33 +00:00
Andrew Trick	2e87517144	Fix for r180193 - MI Sched: eliminate local vreg. Fixes PR15838. Need to check for blocks with nothing but dbg.value. I'm not sure how to force this situation with a unit test. I tried to reduce the test case in PR15838 (1k lines of metadata) but gave up. llvm-svn: 180227	2013-04-24 23:19:56 +00:00
Chad Rosier	108d5a61b7	[inline asm] Fix a crasher for an invalid value type/register class. rdar://13731657 llvm-svn: 180226	2013-04-24 22:53:10 +00:00
Andrew Trick	85a1d4cbc0	MI Sched: eliminate local vreg copies. For now, we just reschedule instructions that use the copied vregs and let regalloc elliminate it. I would really like to eliminate the copies on-the-fly during scheduling, but we need a complete implementation of repairIntervalsInRange() first. The general strategy is for the register coalescer to eliminate as many global copies as possible and shrink live ranges to be extended-basic-block local. The coalescer should not have to worry about resolving local copies (e.g. it shouldn't attemp to reorder instructions). The scheduler is a much better place to deal with local interference. The coalescer side of this equation needs work. llvm-svn: 180193	2013-04-24 15:54:43 +00:00
Andrew Trick	608a698cdf	Register Coalescing: add a flag to disable rescheduling. When MachineScheduler is enabled, this functionality can be removed. Until then, provide a way to disable it for test cases and designing MachineScheduler heuristics. llvm-svn: 180192	2013-04-24 15:54:39 +00:00
Andrew Trick	7c791a3dc4	MI Sched: regpressure tracing. llvm-svn: 180191	2013-04-24 15:54:36 +00:00
Eric Christopher	4eb5eb5bc8	Formatting. llvm-svn: 180186	2013-04-24 12:56:18 +00:00
Owen Anderson	2d4cca35c3	DAGCombine should not aggressively fold SEXT(VSETCC(...)) into a wider VSETCC without first checking the target's vector boolean contents. This exposed an issue with PowerPC AltiVec where it appears it was setting the wrong vector boolean contents. The included change fixes the PowerPC tests, and was OK'd by Hal. llvm-svn: 180129	2013-04-23 18:09:28 +00:00
Stephen Lin	6c70dc7842	Add some constraints to use of 'returned': 1) Disallow 'returned' on parameter that is also 'sret' (no sensible semantics, as far as I can tell). 2) Conservatively disallow tail calls through 'returned' parameters that also are 'zext' or 'sext' (for consistency with treatment of other zero-extending and sign-extending operations in tail call position detection...can be revised later to handle situations that can be determined to be safe). This is a new attribute that is not yet used, so there is no impact. llvm-svn: 180118	2013-04-23 16:31:56 +00:00
Matt Arsenault	034ca0fe41	Remove unused DwarfSectionOffsetDirective string The value isn't actually used, and setting it emits a COFF specific directive. llvm-svn: 180064	2013-04-22 22:49:11 +00:00
Eric Christopher	04d4e9312c	Move C++ code out of the C headers and into either C++ headers or the C++ files themselves. This enables people to use just a C compiler to interoperate with LLVM. llvm-svn: 180063	2013-04-22 22:47:22 +00:00
Eli Bendersky	58b04b7e2e	Optimize MachineBasicBlock::getSymbol by caching the symbol. Since the symbol name computation is expensive, this helps save about 25% of the time spent in this function. llvm-svn: 180049	2013-04-22 21:21:08 +00:00
Rafael Espindola	74f2e46eef	Clarify that llvm.used can contain aliases. Also add a check for llvm.used in the verifier and simplify clients now that they can assume they have a ConstantArray. llvm-svn: 180019	2013-04-22 14:58:02 +00:00
Eric Christopher	44c6aa670f	Tidy. llvm-svn: 180000	2013-04-22 07:51:08 +00:00
Eric Christopher	25e3509c78	Update comment. Whitespace. llvm-svn: 179999	2013-04-22 07:47:40 +00:00
David Blaikie	f55abeaf4c	Revert "Revert "PR14606: debug info imported_module support"" This reverts commit r179840 with a fix to test/DebugInfo/two-cus-from-same-file.ll I'm not sure why that test only failed on ARM & MIPS and not X86 Linux, even though the debug info was clearly invalid on all of them, but this ought to fix it. llvm-svn: 179996	2013-04-22 06:12:31 +00:00
Jim Grosbach	563983c8a3	Legalize vector truncates by parts rather than just splitting. Rather than just splitting the input type and hoping for the best, apply a bit more cleverness. Just splitting the types until the source is legal often leads to an illegal result time, which is then widened and a scalarization step is introduced which leads to truly horrible code generation. With the loop vectorizer, these sorts of operations are much more common, and so it's worth extra effort to do them well. Add a legalization hook for the operands of a TRUNCATE node, which will be encountered after the result type has been legalized, but if the operand type is still illegal. If simple splitting of both types ends up with the result type of each half still being legal, just do that (v16i16 -> v16i8 on ARM, for example). If, however, that would result in an illegal result type (v8i32 -> v8i8 on ARM, for example), we can get more clever with power-two vectors. Specifically, split the input type, but also widen the result element size, then concatenate the halves and truncate again. For example on ARM, To perform a "%res = v8i8 trunc v8i32 %in" we transform to: %inlo = v4i32 extract_subvector %in, 0 %inhi = v4i32 extract_subvector %in, 4 %lo16 = v4i16 trunc v4i32 %inlo %hi16 = v4i16 trunc v4i32 %inhi %in16 = v8i16 concat_vectors v4i16 %lo16, v4i16 %hi16 %res = v8i8 trunc v8i16 %in16 This allows instruction selection to generate three VMOVN instructions instead of a sequences of moves, stores and loads. Update the ARMTargetTransformInfo to take this improved legalization into account. Consider the simplified IR: define <16 x i8> @test1(<16 x i32>* %ap) { %a = load <16 x i32>* %ap %tmp = trunc <16 x i32> %a to <16 x i8> ret <16 x i8> %tmp } define <8 x i8> @test2(<8 x i32>* %ap) { %a = load <8 x i32>* %ap %tmp = trunc <8 x i32> %a to <8 x i8> ret <8 x i8> %tmp } Previously, we would generate the truly hideous: .syntax unified .section __TEXT,__text,regular,pure_instructions .globl _test1 .align 2 _test1: @ @test1 @ BB#0: push {r7} mov r7, sp sub sp, sp, #20 bic sp, sp, #7 add r1, r0, #48 add r2, r0, #32 vld1.64 {d24, d25}, [r0:128] vld1.64 {d16, d17}, [r1:128] vld1.64 {d18, d19}, [r2:128] add r1, r0, #16 vmovn.i32 d22, q8 vld1.64 {d16, d17}, [r1:128] vmovn.i32 d20, q9 vmovn.i32 d18, q12 vmov.u16 r0, d22[3] strb r0, [sp, #15] vmov.u16 r0, d22[2] strb r0, [sp, #14] vmov.u16 r0, d22[1] strb r0, [sp, #13] vmov.u16 r0, d22[0] vmovn.i32 d16, q8 strb r0, [sp, #12] vmov.u16 r0, d20[3] strb r0, [sp, #11] vmov.u16 r0, d20[2] strb r0, [sp, #10] vmov.u16 r0, d20[1] strb r0, [sp, #9] vmov.u16 r0, d20[0] strb r0, [sp, #8] vmov.u16 r0, d18[3] strb r0, [sp, #3] vmov.u16 r0, d18[2] strb r0, [sp, #2] vmov.u16 r0, d18[1] strb r0, [sp, #1] vmov.u16 r0, d18[0] strb r0, [sp] vmov.u16 r0, d16[3] strb r0, [sp, #7] vmov.u16 r0, d16[2] strb r0, [sp, #6] vmov.u16 r0, d16[1] strb r0, [sp, #5] vmov.u16 r0, d16[0] strb r0, [sp, #4] vldmia sp, {d16, d17} vmov r0, r1, d16 vmov r2, r3, d17 mov sp, r7 pop {r7} bx lr .globl _test2 .align 2 _test2: @ @test2 @ BB#0: push {r7} mov r7, sp sub sp, sp, #12 bic sp, sp, #7 vld1.64 {d16, d17}, [r0:128] add r0, r0, #16 vld1.64 {d20, d21}, [r0:128] vmovn.i32 d18, q8 vmov.u16 r0, d18[3] vmovn.i32 d16, q10 strb r0, [sp, #3] vmov.u16 r0, d18[2] strb r0, [sp, #2] vmov.u16 r0, d18[1] strb r0, [sp, #1] vmov.u16 r0, d18[0] strb r0, [sp] vmov.u16 r0, d16[3] strb r0, [sp, #7] vmov.u16 r0, d16[2] strb r0, [sp, #6] vmov.u16 r0, d16[1] strb r0, [sp, #5] vmov.u16 r0, d16[0] strb r0, [sp, #4] ldm sp, {r0, r1} mov sp, r7 pop {r7} bx lr Now, however, we generate the much more straightforward: .syntax unified .section __TEXT,__text,regular,pure_instructions .globl _test1 .align 2 _test1: @ @test1 @ BB#0: add r1, r0, #48 add r2, r0, #32 vld1.64 {d20, d21}, [r0:128] vld1.64 {d16, d17}, [r1:128] add r1, r0, #16 vld1.64 {d18, d19}, [r2:128] vld1.64 {d22, d23}, [r1:128] vmovn.i32 d17, q8 vmovn.i32 d16, q9 vmovn.i32 d18, q10 vmovn.i32 d19, q11 vmovn.i16 d17, q8 vmovn.i16 d16, q9 vmov r0, r1, d16 vmov r2, r3, d17 bx lr .globl _test2 .align 2 _test2: @ @test2 @ BB#0: vld1.64 {d16, d17}, [r0:128] add r0, r0, #16 vld1.64 {d18, d19}, [r0:128] vmovn.i32 d16, q8 vmovn.i32 d17, q9 vmovn.i16 d16, q8 vmov r0, r1, d16 bx lr llvm-svn: 179989	2013-04-21 23:47:41 +00:00
Jim Grosbach	d4db72db61	Tidy up comment grammar. llvm-svn: 179986	2013-04-21 21:23:01 +00:00
Tim Northover	16aba17024	Remove unused ShouldFoldAtomicFences flag. I think it's almost impossible to fold atomic fences profitably under LLVM/C++11 semantics. As a result, this is now unused and just cluttering up the target interface. llvm-svn: 179940	2013-04-20 12:32:43 +00:00
Tim Northover	a2b533906a	Remove unused MEMBARRIER DAG node; it's been replaced by ATOMIC_FENCE. llvm-svn: 179939	2013-04-20 12:32:17 +00:00
Stephen Lin	b8bd232a3d	Add CodeGen support for functions that always return arguments via a new parameter attribute 'returned', which is taken advantage of in target-independent tail call opportunity detection and in ARM call lowering (when placed on an integral first parameter). llvm-svn: 179925	2013-04-20 05:14:40 +00:00
Stephen Lin	ffc445492c	Allow tail call opportunity detection through nested and/or multiple iterations of extractelement/insertelement indirection llvm-svn: 179924	2013-04-20 04:27:51 +00:00
Eli Bendersky	e80691dc0a	Simplify the code in FastISel::tryToFoldLoad, add an assertion and fix a comment. llvm-svn: 179908	2013-04-19 23:26:18 +00:00

... 2 3 4 5 6 ...

15221 Commits