llvm-project

Commit Graph

Author	SHA1	Message	Date
Bill Wendling	ce4fe41d21	Add `.pushsection', `.popsection', and `.previous' directives to Darwin ASM. There are situations where inline ASM may want to change the section -- for instance, to create a variable in the .data section. However, it cannot do this without (potentially) restoring to the wrong section. E.g.: asm volatile (".section __DATA, __data\n\t" ".globl _fnord\n\t" "_fnord: .quad 1f\n\t" ".text\n\t" "1:" :::); This may be wrong if this is inlined into a function that has a "section" attribute. The user should use `.pushsection' and `.popsection' here instead. The addition of `.previous' is added for completeness. <rdar://problem/12048387> llvm-svn: 161477	2012-08-08 06:30:30 +00:00
Andrew Trick	352abc19a5	Added MispredictPenalty to SchedMachineModel. This replaces an existing subtarget hook on ARM and allows standard CodeGen passes to potentially use the property. llvm-svn: 161471	2012-08-08 02:44:16 +00:00
Andrew Trick	db9b1b5e66	Minor cleanup of defaultDefLatency API llvm-svn: 161470	2012-08-08 02:44:11 +00:00
Andrew Trick	207c569cf3	whitespace llvm-svn: 161469	2012-08-08 02:44:08 +00:00
Eli Friedman	08ec0a8122	isAllocLikeFn is allowed to return true for functions which read memory; make sure we account for that correctly in DeadStoreElimination. Fixes a regression from r158919. PR13547. llvm-svn: 161468	2012-08-08 02:17:32 +00:00
Jakob Stoklund Olesen	0556be983d	Revert "Fix a quadratic algorithm in MachineBranchProbabilityInfo." It caused an assertion failure when compiling consumer-typeset. llvm-svn: 161463	2012-08-08 01:10:31 +00:00
Manman Ren	1be131ba27	X86: enable CSE between CMP and SUB We perform the following: 1> Use SUB instead of CMP for i8,i16,i32 and i64 in ISel lowering. 2> Modify MachineCSE to correctly handle implicit defs. 3> Convert SUB back to CMP if possible at peephole. Removed pattern matching of (a>b) ? (a-b):0 and like, since they are handled by peephole now. rdar://11873276 llvm-svn: 161462	2012-08-08 00:51:41 +00:00
Jakob Stoklund Olesen	3b9a442841	Don't scan physreg use-def chains looking for a PIC base. We can't rematerialize a PIC base after register allocation anyway, and scanning physreg use-def chains is very expensive in a function with many calls. <rdar://problem/12047515> llvm-svn: 161461	2012-08-08 00:40:47 +00:00
Jakob Stoklund Olesen	c0b61ff9c7	Fix a quadratic algorithm in MachineBranchProbabilityInfo. The getSumForBlock function was quadratic in the number of successors because getSuccWeight would perform a linear search for an already known iterator. llvm-svn: 161460	2012-08-08 00:20:37 +00:00
Dan Gohman	b948736002	Avoid recomputing the unique exit blocks and their insert points when doing multiple scalar promotions on a single loop. This also has the effect of preserving the order of stores sunk out of loops, which is aesthetically pleasing, and it happens to fix the testcase in PR13542, though it doesn't fix the underlying problem. llvm-svn: 161459	2012-08-08 00:00:26 +00:00
Jakob Stoklund Olesen	fbf45dc2bd	Skip tied operand pairs that already have the same register. llvm-svn: 161454	2012-08-07 22:47:06 +00:00
Jakob Stoklund Olesen	505715d816	Add SelectionDAG::getTargetIndex. This adds support for TargetIndex operands during isel. The meaning of these (index, offset, flags) operands is entirely defined by the target. llvm-svn: 161453	2012-08-07 22:37:05 +00:00
Bob Wilson	61f3ad5759	Fix a serious typo in InstCombine's optimization of comparisons. An unsigned value converted to floating-point will always be greater than a negative constant. Unfortunately InstCombine reversed the check so that unsigned values were being optimized to always be greater than all positive floating-point constants. <rdar://problem/12029145> llvm-svn: 161452	2012-08-07 22:35:16 +00:00
Evan Cheng	fbdd25c135	X86 cmp lowering is looking past truncate on the condition node. It should only do so when the high bits are known zero. This caused a subtle miscompilation. rdar://12027825 llvm-svn: 161451	2012-08-07 22:21:00 +00:00
Bill Wendling	61396b81a4	For non-Darwin platforms, we want to generate stack protectors only for character arrays. This is in line with what GCC does. <rdar://problem/10529227> llvm-svn: 161446	2012-08-07 20:59:05 +00:00
Jakob Stoklund Olesen	84689b0d5a	Add a new kind of MachineOperand: MO_TargetIndex. A target index operand looks a lot like a constant pool reference, but it is completely target-defined. It contains the 8-bit TargetFlags, a 32-bit index, and a 64-bit offset. It is preserved by all code generator passes. TargetIndex operands can be used to carry target-specific information in cases where immediate operands won't suffice. llvm-svn: 161441	2012-08-07 18:56:39 +00:00
Andrew Kaylor	1a568c3a4f	Enable lazy compilation in MCJIT llvm-svn: 161438	2012-08-07 18:33:00 +00:00
Jakob Stoklund Olesen	296448b293	Fix a couple of typos. llvm-svn: 161437	2012-08-07 18:32:57 +00:00
Jakob Stoklund Olesen	75d9d5159e	Add trace accessor methods, implement primitive if-conversion heuristic. Compare the critical paths of the two traces through an if-conversion candidate. If the difference is larger than the branch brediction penalty, reject the if-conversion. If would never pay. llvm-svn: 161433	2012-08-07 18:02:19 +00:00
Rafael Espindola	59564079e9	The dominance computation already has logic for computing if an edge dominates a use or a BB, but it is inline in the handling of the invoke instruction. This patch refactors it so that it can be used in other cases. For example, in define i32 @f(i32 %x) { bb0: %cmp = icmp eq i32 %x, 0 br i1 %cmp, label %bb2, label %bb1 bb1: br label %bb2 bb2: %cond = phi i32 [ %x, %bb0 ], [ 0, %bb1 ] %foo = add i32 %cond, %x ret i32 %foo } GVN should be able to replace %x with 0 in any use that is dominated by the true edge out of bb0. In the above example the only such use is the one in the phi. llvm-svn: 161429	2012-08-07 17:30:46 +00:00
Hal Finkel	895a5f5d12	Add a comment about mftb vs. mfspr on PPC. Thanks to Alex Rosenberg for the suggestion. llvm-svn: 161428	2012-08-07 17:04:20 +00:00
Alexey Samsonov	947228c4f7	Fix the representation of debug line table in DebugInfo LLVM library, and "instruction address -> file/line" lookup. Instead of plain collection of rows, debug line table for compilation unit is now treated as the number of row ranges, describing sequences (series of contiguous machine instructions). The sequences are not always listed in the order of increasing address, so previously used std::lower_bound() sometimes produced wrong results. Now the instruction address lookup consists of two stages: finding the correct sequence, and searching for address in range of rows for this sequence. llvm-svn: 161414	2012-08-07 11:46:57 +00:00
Benjamin Kramer	c99d0e9186	PR13095: Give an inline cost bonus to functions using byval arguments. We give a bonus for every argument because the argument setup is not needed anymore when the function is inlined. With this patch we interpret byval arguments as a compact representation of many arguments. The byval argument setup is implemented in the backend as an inline memcpy, so to model the cost as accurately as possible we take the number of pointer-sized elements in the byval argument and give a bonus of 2 instructions for every one of those. The bonus is capped at 8 elements, which is the number of stores at which the x86 backend switches from an expanded inline memcpy to a real memcpy. It would be better to use the real memcpy threshold from the backend, but it's not available via TargetData. This change brings the performance of c-ray in line with gcc 4.7. The included test case tries to reproduce the c-ray problem to catch regressions for this benchmark early, its performance is dominated by the inline decision of a specific call. This only has a small impact on most code, more on x86 and arm than on x86_64 due to the way the ABI works. When building LLVM for x86 it gives a small inline cost boost to virtually any function using StringRef or STL allocators, but only a 0.01% increase in overall binary size. The size of gcc compiled by clang actually shrunk by a couple bytes with this patch applied, but not significantly. llvm-svn: 161413	2012-08-07 11:13:19 +00:00
Chandler Carruth	2f6cf4884c	Fix PR13412, a nasty miscompile due to the interleaved instsimplify+inline strategy. The crux of the problem is that instsimplify was reasonably relying on an invariant that is true within any single function, but is no longer true mid-inline the way we use it. This invariant is that an argument pointer != a local (alloca) pointer. The fix is really light weight though, and allows instsimplify to be resiliant to these situations: when checking the relation ships to function arguments, ensure that the argumets come from the same function. If they come from different functions, then none of these assumptions hold. All credit to Benjamin Kramer for coming up with this clever solution to the problem. llvm-svn: 161410	2012-08-07 10:59:59 +00:00
Chandler Carruth	881d0a7966	Add a much more conservative strategy for aligning branch targets. Previously, MBP essentially aligned every branch target it could. This bloats code quite a bit, especially non-looping code which has no real reason to prefer aligned branch targets so heavily. As Andy said in review, it's still a bit odd to do this without a real cost model, but this at least has much more plausible heuristics. Fixes PR13265. llvm-svn: 161409	2012-08-07 09:45:24 +00:00
Manman Ren	cb36b8c2e6	MachineCSE: Update the heuristics for isProfitableToCSE. If the result of a common subexpression is used at all uses of the candidate expression, CSE should not increase the live range of the common subexpression. rdar://11393714 and rdar://11819721 llvm-svn: 161396	2012-08-07 06:16:46 +00:00
Bill Wendling	0acd0c0a69	Revert r161371. Removing the 'const' before Type is a "good thing". --- Reverse-merging r161371 into '.': U include/llvm/Target/TargetData.h U lib/Target/TargetData.cpp llvm-svn: 161394	2012-08-07 05:51:59 +00:00
Jack Carter	f4946cfbb9	The define for 64 bit sign extension neglected to initialize fields of the class that it used. The result was nonsense code. Before: 0000000000000000 <foo>: 0: 00441100 0x441100 4: 03e00008 jr ra 8: 00000000 nop After: 0000000000000000 <foo>: 0: 00041000 sll v0,a0,0x0 4: 03e00008 jr ra 8: 00000000 nop llvm-svn: 161377	2012-08-07 00:35:22 +00:00
Bill Wendling	654cd4aaee	Constify the Type parameter to some methods (which are const anyway). llvm-svn: 161371	2012-08-07 00:26:35 +00:00
Andrew Trick	e0c83b1f3b	Allow x86 subtargets to use the GenericModel defined in X86Schedule.td. This allows codegen passes to query properties like InstrItins->SchedModel->IssueWidth. It also ensure's that computeOperandLatency returns the X86 defaults for loads and "high latency ops". This should have no significant impact on existing schedulers because X86 defaults happen to be the same as global defaults. llvm-svn: 161370	2012-08-07 00:25:30 +00:00
Jack Carter	4c58381c3a	Mips relocation R_MIPS_64 relocates a 64 bit double word. I hit this in a very large program (spirit.cpp), but have not figured out how to make a small make check test for it. llvm-svn: 161366	2012-08-07 00:01:14 +00:00
Jack Carter	612c66314c	The Mips64InstrInfo.td definitions DynAlloc64 LEA_ADDiu64 were using a class defined for 32 bit instructions and thus the instruction was for addiu instead of daddiu. This was corrected by adding the instruction opcode as a field in the base class to be filled in by the defs. llvm-svn: 161359	2012-08-06 23:29:06 +00:00
Jack Carter	84491abb20	Mips relocations R_MIPS_HIGHER and R_MIPS_HIGHEST. These 2 relocations gain access to the highest and the second highest 16 bits of a 64 bit object. R_MIPS_HIGHER %higher(A+S) The %higher(x) function is [ (((long long) x + 0x80008000LL) >> 32) & 0xffff ]. R_MIPS_HIGHEST %highest(A+S) The %highest(x) function is [ (((long long) x + 0x800080008000LL) >> 48) & 0xffff ]. llvm-svn: 161348	2012-08-06 21:26:03 +00:00
Hal Finkel	33e529d56b	MFTB on PPC64 should really be encoded using MFSPR. The MFTB instruction itself is being phased out, and its functionality is provided by MFSPR. According to the ISA docs, using MFSPR works on all known chips except for the 601 (which did not have a timebase register anyway) and the POWER3. Thanks to Adhemerval Zanella for pointing this out! llvm-svn: 161346	2012-08-06 21:21:44 +00:00
Eric Christopher	22738d00a3	Add support for the OpenBSD for Bitrig. Patch by David Hill. llvm-svn: 161344	2012-08-06 20:52:18 +00:00
Roman Divacky	7d6e08560b	Remove empty overrides of processFunctionBeforeFrameFinalized(). llvm-svn: 161328	2012-08-06 18:14:18 +00:00
Craig Topper	ab47fe4e16	Implement proper handling for pcmpistri/pcmpestri intrinsics. Requires custom handling in DAGISelToDAG due to limitations in TableGen's implicit def handling. Fixes PR11305. llvm-svn: 161318	2012-08-06 06:22:36 +00:00
Craig Topper	6d0408d3a5	Remove custom inserter for MWAIT. It doesn't do anything that couldn't be represented in a pattern. llvm-svn: 161306	2012-08-05 00:36:57 +00:00
Craig Topper	43ee9fae92	Use a COPY node instead of an explicit MOVA opcode in the custom insterter for pcmpestrm/pcmpistrm. Allows the register allocator to handle it better and prevent wasted identity moves. llvm-svn: 161305	2012-08-05 00:17:48 +00:00
Hal Finkel	70381a7b18	Add readcyclecounter lowering on PPC64. On PPC64, this can be done with a simple TableGen pattern. To enable this, I've added the (otherwise missing) readcyclecounter SDNode definition to TargetSelectionDAG.td. llvm-svn: 161302	2012-08-04 14:10:46 +00:00
Anton Korobeynikov	ef731edf53	Skip impdef regs during eabi save/restore list emission to workaround PR11902 llvm-svn: 161301	2012-08-04 13:25:58 +00:00
Anton Korobeynikov	3a4fdfeceb	Recognize vst1.64 / vld1.64 with 3 and 4 regs as load from / store to stack stuff (this corresponds by spilling/reloading regs in DTriple / DQuad reg classes). No testcase, found by inspection. llvm-svn: 161300	2012-08-04 13:22:14 +00:00
Anton Korobeynikov	218aaf6d04	Add stack spill / reload instructions for DTriple and DQuad register classes, which were missed for no reason. This fixes PR13377 llvm-svn: 161299	2012-08-04 13:16:12 +00:00
Benjamin Kramer	3849fcbe0e	Postpone the deletion of the old name in StructType::setName to allow using a slice of the old name. Fixes PR13522. Add a rudimentary unit test to exercise the behavior. llvm-svn: 161296	2012-08-04 09:47:02 +00:00
Jakob Stoklund Olesen	a9d0b850b3	Delete a dead variable. TwoAddressInstructionPass doesn't remat any more. llvm-svn: 161285	2012-08-04 00:04:03 +00:00
Jakob Stoklund Olesen	a0c72ecf79	TwoAddressInstructionPass refactoring: Extract another method. llvm-svn: 161284	2012-08-03 23:57:58 +00:00
Bob Wilson	874886cd66	Refactor and check "onlyReadsMemory" before optimizing builtins. This patch is mostly just refactoring a bunch of copy-and-pasted code, but it also adds a check that the call instructions are readnone or readonly. That check was already present for sin, cos, sqrt, log2, and exp2 calls, but it was missing for the rest of the builtins being handled in this code. llvm-svn: 161282	2012-08-03 23:29:17 +00:00
Jakob Stoklund Olesen	1162a1548b	TwoAddressInstructionPass refactoring: Extract a method. No functional change intended, except replacing a DenseMap with a SmallDenseMap which should behave identically. llvm-svn: 161281	2012-08-03 23:25:45 +00:00
Jakob Stoklund Olesen	24bc514c0c	Begin adding support for updating LiveIntervals in TwoAddressInstructionPass. This is far from complete, and only changes behavior when the -early-live-intervals flag is passed to llc. llvm-svn: 161273	2012-08-03 22:58:34 +00:00
Akira Hatanaka	22bec282e9	1. Redo mips16 instructions to avoid multiple opcodes for same instruction. Change these to patterns. 2. Add another 16 instructions. Patch by Reed Kotler. llvm-svn: 161272	2012-08-03 22:57:02 +00:00
Jakob Stoklund Olesen	1c46589290	Add an experimental -early-live-intervals option. This option runs LiveIntervals before TwoAddressInstructionPass which will eventually learn to exploit and update the analysis. Eventually, LiveIntervals will run before PHIElimination, and we can get rid of LiveVariables. llvm-svn: 161270	2012-08-03 22:12:54 +00:00
Jakob Stoklund Olesen	918999db95	Delete merged physreg copies in joinReservedPhysReg(). Previously, the identity copy would survive through register allocation before it was removed by the rewriter. llvm-svn: 161269	2012-08-03 22:12:51 +00:00
Bob Wilson	871701c606	Try to reduce the compile time impact of r161232. The previous change caused fast isel to not attempt handling any calls to builtin functions. That included things like "printf" and caused some noticable regressions in compile time. I wanted to avoid having fast isel keep a separate list of functions that had to be kept in sync with what the code in SelectionDAGBuilder.cpp was handling. I've resolved that here by moving the list into TargetLibraryInfo. This is somewhat redundant in SelectionDAGBuilder but it will ensure that we keep things consistent. llvm-svn: 161263	2012-08-03 21:26:24 +00:00
Bob Wilson	fa59485b94	Fix memcmp code-gen to honor -fno-builtin. I noticed that SelectionDAGBuilder::visitCall was missing a check for memcmp in TargetLibraryInfo, so that it would use custom code for memcmp calls even with -fno-builtin. I also had to add a new -disable-simplify-libcalls option to llc so that I could write a test for this. llvm-svn: 161262	2012-08-03 21:26:18 +00:00
Jakob Stoklund Olesen	daae19f785	Completely eliminate VNInfo flags. The 'unused' state of a value number can be represented as an invalid def SlotIndex. This also exposed code that shouldn't have been looking at unused value VNInfos. llvm-svn: 161258	2012-08-03 20:59:32 +00:00
Jakob Stoklund Olesen	21809385a6	Fix a couple of loops that were processing unused value numbers. Unused VNInfos should be left alone. Their def SlotIndex doesn't point to anything. llvm-svn: 161257	2012-08-03 20:59:29 +00:00
Matt Beaumont-Gay	aaba08d503	Silence unused variable warning in -asserts build llvm-svn: 161256	2012-08-03 20:54:11 +00:00
Jakob Stoklund Olesen	9f565e19c5	Eliminate the VNInfo::hasPHIKill() flag. The only real user of the flag was removeCopyByCommutingDef(), and it has been switched to LiveIntervals::hasPHIKill(). All the code changed by this patch was only concerned with computing and propagating the flag. llvm-svn: 161255	2012-08-03 20:19:44 +00:00
Jakob Stoklund Olesen	06d6a5363b	Make the hasPHIKills flag a computed property. The VNInfo::HAS_PHI_KILL is only half supported. We precompute it in LiveIntervalAnalysis, but it isn't properly updated by live range splitting and functions like shrinkToUses(). It is only used in one place: RegisterCoalescer::removeCopyByCommutingDef(). This patch changes that function to use a new LiveIntervals::hasPHIKill() function that computes the flag for a given value number. llvm-svn: 161254	2012-08-03 20:10:24 +00:00
Jakob Stoklund Olesen	19c4596629	Delete dead function. llvm-svn: 161242	2012-08-03 15:21:21 +00:00
Jakob Stoklund Olesen	47ac20d4d6	Don't delete dead code in TwoAddressInstructionPass. This functionality was added before we started running DeadMachineInstructionElim on all targets. It serves no purpose now. llvm-svn: 161241	2012-08-03 15:11:57 +00:00
Gabor Greif	a1529b6ca4	allow 'make CPPFLAGS=<something>' work again this makes this hack a bit more bearable for poor souls who need to pass custom preprocessor flags to the build process llvm-svn: 161240	2012-08-03 13:31:24 +00:00
Bob Wilson	3e6fa462f3	Fall back to selection DAG isel for calls to builtin functions. Fast isel doesn't currently have support for translating builtin function calls to target instructions. For embedded environments where the library functions are not available, this is a matter of correctness and not just optimization. Most of this patch is just arranging to make the TargetLibraryInfo available in fast isel. <rdar://problem/12008746> llvm-svn: 161232	2012-08-03 04:06:28 +00:00
Bob Wilson	c740e3f0d1	Add new getLibFunc method to TargetLibraryInfo. This just provides a way to look up a LibFunc::Func enum value for a function name. Alphabetize the enums and function names so we can use a binary search. llvm-svn: 161231	2012-08-03 04:06:22 +00:00
Jush Lu	4705da9020	[arm-fast-isel] Add support for shl, lshr, and ashr. llvm-svn: 161230	2012-08-03 02:37:48 +00:00
Bill Wendling	8555a37c04	Move the "findUsedStructTypes" functionality outside of the Module class. The "findUsedStructTypes" method is very expensive to run. It needs to be optimized so that LTO can run faster. Splitting this method out of the Module class will help this occur. For instance, it can keep a list of seen objects so that it doesn't process them over and over again. llvm-svn: 161228	2012-08-03 00:30:35 +00:00
Eric Christopher	b3322364e4	Add support for the ARM GHC calling convention, this patch was in 3.0, but somehow managed to be dropped later. Patch by Karel Gardas. llvm-svn: 161226	2012-08-03 00:05:53 +00:00
Jim Grosbach	5d6d015969	ARM: Tidy up. Remove unused template parameters. llvm-svn: 161222	2012-08-02 22:08:27 +00:00
Jim Grosbach	b79c33ef55	ARM: More InstAlias refactors to use #NAME#. llvm-svn: 161220	2012-08-02 21:59:52 +00:00
Jim Grosbach	6d27ad62a8	ARM: Refactor instaliases using TableGen support for #NAME#. Now that TableGen supports references to NAME w/o it being explicitly referenced in the definition's own name, use that to simplify assembly InstAlias definitions in multiclasses. llvm-svn: 161218	2012-08-02 21:50:41 +00:00
Manman Ren	ba8122cc25	X86 Peephole: fold loads to the source register operand if possible. Add more comments and use early returns to reduce nesting in isLoadFoldable. Also disable folding for V_SET0 to avoid introducing a const pool entry and a const pool load. rdar://10554090 and rdar://11873276 llvm-svn: 161207	2012-08-02 19:37:32 +00:00
Jim Grosbach	bc5b61c74d	TableGen: Allow use of #NAME# outside of 'def' names. Previously, def NAME values were only populated, and references to NAME resolved, when NAME was referenced in the 'def' entry of the multiclass sub-entry. e.g., multiclass foo<...> { def prefix_#NAME : ... } It's useful, however, to be able to reference NAME even when the default def name is used. For example, when a multiclass has 'def : Pat<...>' or 'def : InstAlias<...>' entries which refer to earlier instruction definitions in the same multiclass. e.g., multiclass myMulti<RegisterClass rc> { def _r : myI<(outs rc:$d), (ins rc:$r), "r $d, $r", []>; def : InstAlias<\"wilma $r\", (!cast<Instruction>(NAME#\"_r\") rc:$r, rc:$r)>; } llvm-svn: 161198	2012-08-02 18:46:42 +00:00
Jakob Stoklund Olesen	5d30630e22	Compute the critical path length through a trace. Whenever both instruction depths and instruction heights are known in a block, it is possible to compute the length of the critical path as max(depth+height) over the instructions in the block. The stored live-in lists make it possible to accurately compute the length of a critical path that bypasses the current (small) block. llvm-svn: 161197	2012-08-02 18:45:54 +00:00
Akira Hatanaka	fab8929459	Move the code that creates instances of MipsInstrInfo and MipsFrameLowering out of MipsTargetMachine.cpp. llvm-svn: 161191	2012-08-02 18:21:47 +00:00
Akira Hatanaka	fffad897f2	Set transient stack alignment in constructor of MipsFrameLowering and re-enable test o32_cc_vararg.ll. llvm-svn: 161189	2012-08-02 18:15:13 +00:00
Jakob Stoklund Olesen	637c467528	Verify regunit intervals along with virtreg intervals. Don't cause regunit intervals to be computed just to verify them. Only check the already cached intervals. llvm-svn: 161183	2012-08-02 16:36:50 +00:00
Jakob Stoklund Olesen	374071dde2	Avoid creating dangling physreg live ranges during DCE. LiveRangeEdit::eliminateDeadDefs() can delete a dead instruction that reads unreserved physregs. This would leave the corresponding regunit live interval dangling because we don't have shrinkToUses() for physical registers. Fix this problem by turning the instruction into a KILL instead of deleting it. This happens in a landing pad in test/CodeGen/X86/2012-05-19-CoalescerCrash.ll: %vreg27<def,dead> = COPY %EDX<kill>; GR32:%vreg27 becomes: KILL %EDX<kill> An upcoming fix to the machine verifier will catch problems like this by verifying regunit live intervals. This fixes PR13498. I am not including the test case from the PR since we already have one exposing the problem once the verifier is fixed. llvm-svn: 161182	2012-08-02 16:36:47 +00:00
Jakob Stoklund Olesen	bde5dc5e46	Add report() functions that take a LiveInterval argument. llvm-svn: 161178	2012-08-02 14:31:49 +00:00
Hongbin Zheng	bb1d209210	Implement the block_iterator of Region based on df_iterator. llvm-svn: 161177	2012-08-02 14:20:02 +00:00
Nuno Lopes	79a4424f8e	JIT::runFunction(): add a fast path for functions with a single argument that is a pointer. llvm-svn: 161171	2012-08-02 12:09:32 +00:00
Jiangning Liu	fa18005a4c	Support fpv4 for ARM Cortex-M4. llvm-svn: 161163	2012-08-02 08:35:55 +00:00
Jiangning Liu	6a43bf7d74	Fix #13035 , a bug around Thumb instruction LDRD/STRD with negative #0 offset index issue. llvm-svn: 161162	2012-08-02 08:29:50 +00:00
Jiangning Liu	288e1af8c8	Fix #13138 , a bug around ARM instruction DSB encoding and decoding issue. llvm-svn: 161161	2012-08-02 08:21:27 +00:00
Jiangning Liu	10dd40e42d	Fix #13241 , a bug around shift immediate operand for ARM instruction ADR. llvm-svn: 161159	2012-08-02 08:13:13 +00:00
Manman Ren	5759d01230	X86 Peephole: fold loads to the source register operand if possible. Machine CSE and other optimizations can remove instructions so folding is possible at peephole while not possible at ISel. This patch is a rework of r160919 and was tested on clang self-host on my local machine. rdar://10554090 and rdar://11873276 llvm-svn: 161152	2012-08-02 00:56:42 +00:00
Jakob Stoklund Olesen	e736b97eff	Extract some methods from verifyLiveIntervals. No functional change. llvm-svn: 161149	2012-08-02 00:20:20 +00:00
Jakob Stoklund Olesen	a766b4746d	Also verify RegUnit intervals at uses. llvm-svn: 161147	2012-08-01 23:52:40 +00:00
Manman Ren	4059145396	X86: mark GATHER instructios as mayLoad llvm-svn: 161143	2012-08-01 23:28:59 +00:00
Jakob Stoklund Olesen	2db6b65330	Compute instruction heights through a trace. The height on an instruction is the minimum number of cycles from the instruction is issued to the end of the trace. Heights are computed for all instructions in and below the trace center block. The method for computing heights is different from the depth computation. As we visit instructions in the trace bottom-up, heights of used instructions are pushed upwards. This way, we avoid scanning long use lists, looking for uses in the current trace. At each basic block boundary, a list of live-in registers and their minimum heights is saved in the trace block info. These live-in lists are used when restarting depth computations on a trace that converges with an already computed trace. They will also be used to accurately compute the critical path length. llvm-svn: 161138	2012-08-01 22:36:00 +00:00
Jim Grosbach	8724d0fd99	ARM: Remove redundant instalias. llvm-svn: 161134	2012-08-01 20:33:05 +00:00
Jim Grosbach	96e8a8dc6d	Clean up formatting. llvm-svn: 161133	2012-08-01 20:33:02 +00:00
Jim Grosbach	b437a8c5d5	Tidy up. llvm-svn: 161132	2012-08-01 20:33:00 +00:00
Chad Rosier	24c19d20c0	Whitespace. llvm-svn: 161122	2012-08-01 18:39:17 +00:00
Eric Christopher	b1b9451337	Temporarily revert c23b933d5f8be9b51a1d22e717c0311f65f87dcd. It's causing failures in the debug testsuite and possibly PR13486. llvm-svn: 161121	2012-08-01 18:19:01 +00:00
Nuno Lopes	a9a8c62714	remove tabs from my previous commit. Sorry, not used to this editor anymore.. XCode please come back; you're forgiven :) llvm-svn: 161120	2012-08-01 17:13:28 +00:00
Nuno Lopes	e7220312c2	(hopefuly) fix the remaining cases where null wasnt expected (PR13497). I'll commit a test to the clang tree. llvm-svn: 161118	2012-08-01 16:58:51 +00:00
Jakob Stoklund Olesen	5e19d35e9a	Add DataDep constructors. Explicitly check SSA form. llvm-svn: 161115	2012-08-01 16:02:59 +00:00
Elena Demikhovsky	3cb3b0045c	Added FMA functionality to X86 target. llvm-svn: 161110	2012-08-01 12:06:00 +00:00
Nick Lewycky	fb78083b1c	Stay rational; don't assert trying to take the square root of a negative value. If it's negative, the loop is already proven to be infinite. Fixes PR13489! llvm-svn: 161107	2012-08-01 09:14:36 +00:00
Craig Topper	b8aec08819	Add more indirection to the disassembler tables to reduce amount of space used to store the operand types and encodings. Store only the unique combinations in a separate table and store indices in the instruction table. Saves about 32K of static data. llvm-svn: 161101	2012-08-01 07:39:18 +00:00
Nick Kledzik	5fce8c4ffe	Initial commit of new FileOutputBuffer support class. Since the llvm::sys::fs::map_file_pages() support function it relies on is not yet implemented on Windows, the unit tests for FileOutputBuffer are currently conditionalized to run only on unix. llvm-svn: 161099	2012-08-01 02:29:50 +00:00
Akira Hatanaka	0820f0ca8e	Implement MipsJITInfo::replaceMachineCodeForFunction. No new test case is added. This patch makes test JITTest.FunctionIsRecompiledAndRelinked pass on mips platform. Patch by Petar Jovanovic. llvm-svn: 161098	2012-08-01 02:29:24 +00:00
Akira Hatanaka	4a240b0f9d	Remove unused variable. llvm-svn: 161095	2012-08-01 00:37:53 +00:00
Akira Hatanaka	88d76cfd7a	Implement MipsSERegisterInfo::eliminateCallFramePseudoInstr. The function emits instructions that decrement and increment the stack pointer before and after a call when the function does not have a reserved call frame. llvm-svn: 161093	2012-07-31 23:52:55 +00:00
Akira Hatanaka	cb37e13fa7	Add definitions of two subclasses of MipsRegisterInfo, Mips16RegisterInfo and MipsSERegisterInfo. llvm-svn: 161092	2012-07-31 23:41:32 +00:00
Akira Hatanaka	d1c43cee24	Add definitions of two subclasses of MipsFrameLowering, Mips16FrameLowering and MipsSEFrameLowering. Implement MipsSEFrameLowering::hasReservedCallFrame. Call frames will not be reserved if there is a call with a large call frame or there are variable sized objects on the stack. llvm-svn: 161090	2012-07-31 22:50:19 +00:00
Akira Hatanaka	2c64adf672	Add Mips16InstrInfo.cpp and MipsSEInstrInfo.cpp to CMakeLists.txt. llvm-svn: 161083	2012-07-31 22:11:05 +00:00
Akira Hatanaka	b7fa3c9db0	Add definitions of two subclasses of MipsInstrInfo, MipsInstrInfo (for mips16), and MipsSEInstrInfo (for mips32/64). llvm-svn: 161081	2012-07-31 21:49:49 +00:00
Akira Hatanaka	30651805c4	Delete mips64 target machine classes. mips target machines can be used in place of them. llvm-svn: 161080	2012-07-31 21:39:17 +00:00
Akira Hatanaka	02de0e4425	Let PEI::calculateFrameObjectOffsets compute the final stack size rather than computing it in MipsFrameLowering::emitPrologue. llvm-svn: 161078	2012-07-31 21:28:49 +00:00
Akira Hatanaka	33a25af5a8	Expand DYNAMIC_STACKALLOC nodes rather than doing custom-lowering. The frame object which points to the dynamically allocated area will not be needed after changes are made to cease reserving call frames. llvm-svn: 161076	2012-07-31 20:54:48 +00:00
Manman Ren	f288d2f120	MachineSink: Sort the successors before trying to find SuccToSinkTo. Use stable_sort instead of sort. Follow-up to r161062. rdar://11980766 llvm-svn: 161075	2012-07-31 20:45:38 +00:00
Jakob Stoklund Olesen	059e647c6d	Compute instruction depths through the current trace. Assuming infinite issue width, compute the earliest each instruction in the trace can issue, when considering the latency of data dependencies. The issue cycle is record as a 'depth' from the beginning of the trace. This is half the computation required to find the length of the critical path through the trace. Heights are next. llvm-svn: 161074	2012-07-31 20:44:38 +00:00
Jakob Stoklund Olesen	1dfb101835	Rename CT -> MTM. MachineTraceMetrics is abbreviated MTM. llvm-svn: 161072	2012-07-31 20:25:13 +00:00
Akira Hatanaka	a66d676b20	Define ADJCALLSTACKDOWN/UP nodes. These nodes are emitted regardless of whether or not it is in mips16 mode. Define MipsPseudo (mode-independant pseudo) and PseudoSE (mips32/64 pseudo) classes. llvm-svn: 161071	2012-07-31 19:13:07 +00:00
Akira Hatanaka	3a810eda91	Change name of class MipsInst to InstSE to distinguish it from mips16's instruction class. SE stands for standard encoding. llvm-svn: 161069	2012-07-31 18:55:01 +00:00
Akira Hatanaka	beda2241a4	When store nodes or memcpy nodes are created to copy the function call arguments to the stack in MipsISelLowering::LowerCall, use stack pointer and integer offset operands rather than frame object operands. llvm-svn: 161068	2012-07-31 18:46:41 +00:00
Chad Rosier	710be7df71	[x86 frame lowering] In 32-bit mode, use ESI as the base pointer. Previously, we were using EBX, but PIC requires the GOT to be in EBX before function calls via PLT GOT pointer. llvm-svn: 161066	2012-07-31 18:29:21 +00:00
Akira Hatanaka	4ce7c4060d	Fix type of LUXC1 and SUXC1. These instructions were incorrectly defined as single-precision load and store. Also avoid selecting LUXC1 and SUXC1 instructions during isel. It is incorrect to map unaligned floating point load/store nodes to these instructions. llvm-svn: 161063	2012-07-31 18:16:49 +00:00
Manman Ren	8c549b586c	MachineSink: Sort the successors before trying to find SuccToSinkTo. One motivating example is to sink an instruction from a basic block which has two successors: one outside the loop, the other inside the loop. We should try to sink the instruction outside the loop. rdar://11980766 llvm-svn: 161062	2012-07-31 18:10:39 +00:00
Micah Villmow	b67d7a3a33	Conform to LLVM coding style. llvm-svn: 161061	2012-07-31 18:07:43 +00:00
Micah Villmow	6b12f596ef	Don't generate ordered or unordered comparison operations if it is not legal to do so. llvm-svn: 161053	2012-07-31 16:48:03 +00:00
Craig Topper	c2efce404e	Make INSTRUCTION_SPECIFIER_FIELDS match X86DisassemblerCommon.h. Also remove trailing whitespace. llvm-svn: 161029	2012-07-31 05:18:26 +00:00
Craig Topper	fb39f97d4c	Tidy up trailing whitespace llvm-svn: 161027	2012-07-31 04:58:05 +00:00
Craig Topper	5f33d90214	Tidy up trailing whitespace llvm-svn: 161026	2012-07-31 04:38:27 +00:00
Jakob Stoklund Olesen	0c807dfae2	Clear kill flags in removeCopyByCommutingDef(). We are extending live ranges, so kill flags are not accurate. They aren't needed until they are recomputed after RA anyway. <rdar://problem/11950722> llvm-svn: 161023	2012-07-31 02:47:24 +00:00
Manman Ren	2b6a0dfd4c	Reverse order of the two branches at end of a basic block if it is profitable. We branch to the successor with higher edge weight first. Convert from je LBB4_8 --> to outer loop jmp LBB4_14 --> to inner loop to jne LBB4_14 jmp LBB4_8 PR12750 rdar: 11393714 llvm-svn: 161018	2012-07-31 01:11:07 +00:00
Andrew Trick	79795897b3	Use the latest MachineRegisterInfo APIs. No functionality. llvm-svn: 161010	2012-07-30 23:48:17 +00:00
Andrew Trick	535a23c38b	Inline MachineRegisterInfo::hasOneUse llvm-svn: 161007	2012-07-30 23:48:12 +00:00
Jakob Stoklund Olesen	68c2cd059e	Avoid looking at stale data in verifyAnalysis(). llvm-svn: 161004	2012-07-30 23:15:12 +00:00
Jakob Stoklund Olesen	c14cf57ba9	Allow traces to enter nested loops. This lets traces include the final iteration of a nested loop above the center block, and the first iteration of a nested loop below the center block. We still don't allow traces to contain backedges, and traces are truncated where they would leave a loop, as seen from the center block. llvm-svn: 161003	2012-07-30 23:15:10 +00:00
Jim Grosbach	20666162e9	Keep empty assembly macro argument values in the middle of the list. Empty macro arguments at the end of the list should be as-if not specified at all, but those in the middle of the list need to be kept so as not to screw up the positional numbering. E.g.: .macro foo foo_-bash___: nop .endm foo 1, 2, 3, 4 foo 1, , 3, 4 Should create two labels, "foo_1_2_3_4" and "foo_1__3_4". rdar://11948769 llvm-svn: 161002	2012-07-30 22:44:17 +00:00
Jakob Stoklund Olesen	984cfe8322	Clarify invalidation strategy in comment. llvm-svn: 160997	2012-07-30 21:16:22 +00:00
Jakob Stoklund Olesen	f308c128ea	Assert that all trace candidate blocks have been visited by the PO. When computing a trace, all the candidates for pred/succ must have been visited. Filter out back-edges first, though. The PO traversal ignores them. Thanks to Andy for spotting this in review. llvm-svn: 160995	2012-07-30 21:10:27 +00:00
Jakob Stoklund Olesen	a12a7d5f74	Hook into PassManager's analysis verification. By overriding Pass::verifyAnalysis(), the pass contents will be verified by the pass manager. llvm-svn: 160994	2012-07-30 20:57:50 +00:00
Pete Cooper	91244268d7	Consider address spaces for hashing and CSEing DAG nodes. Otherwise two loads from different x86 segments but the same address would get CSEd llvm-svn: 160987	2012-07-30 20:23:19 +00:00
Kevin Enderby	5c490f1b8f	Fix a bug in ARMMachObjectWriter::RecordRelocation() in ARMMachObjectWriter.cpp where the other_half of the movt and movw relocation entries needs to get set and only with the 16 bits of the other half. rdar://10038370 llvm-svn: 160978	2012-07-30 18:46:15 +00:00
Jakob Stoklund Olesen	7361846f32	Add MachineInstr::isTransient(). This is a cleaned up version of the isFree() function in MachineTraceMetrics.cpp. Transient instructions are very unlikely to produce any code in the final output. Either because they get eliminated by RegisterCoalescing, or because they are pseudo-instructions like labels and debug values. llvm-svn: 160977	2012-07-30 18:34:14 +00:00
Jakob Stoklund Olesen	3df6c46fdd	Add MachineTraceMetrics::verify(). This function verifies the consistency of cached data in the MachineTraceMetrics analysis. llvm-svn: 160976	2012-07-30 18:34:11 +00:00
Jakob Stoklund Olesen	eb488fe165	Verify that the CFG hasn't changed during invalidate(). The MachineTraceMetrics analysis must be invalidated before modifying the CFG. This will catch some of the violations of that rule. llvm-svn: 160969	2012-07-30 17:36:49 +00:00
Jakob Stoklund Olesen	fee94ca15b	Add MachineBasicBlock::isPredecessor(). A->isPredecessor(B) is the same as B->isSuccessor(A), but it can tolerate a B that is null or dangling. This shouldn't happen normally, but it it useful for verification code. llvm-svn: 160968	2012-07-30 17:36:47 +00:00
Nadav Rotem	77f1b9c477	When constant folding GEP expressions, keep the address space information of pointers. Together with Ran Chachick <ran.chachick@intel.com> llvm-svn: 160954	2012-07-30 07:25:20 +00:00
Craig Topper	efd97044a3	Mark MOVZX16/MOVSX16 as neverHasSideEffects/mayLoad llvm-svn: 160953	2012-07-30 07:14:07 +00:00
Craig Topper	c6b7ef61f4	Mark MOVZX32_NOREX as isCodeGenOnly and neverHasSideEffects. The isCodeGenOnly change allows special detection of _NOREX instructions to be removed from tablegen disassembler code. llvm-svn: 160951	2012-07-30 06:48:11 +00:00
Craig Topper	14eac5dda8	Give VCVTTPD2DQ priority over CVTTPD2DQ. llvm-svn: 160942	2012-07-30 02:20:32 +00:00
Craig Topper	f881d385da	Fix patterns for CVTTPS2DQ to specify SSE2 instead of SSE1. llvm-svn: 160941	2012-07-30 02:14:02 +00:00
Craig Topper	415b3586d0	Fix up patterns for VCVTSS2SD. Specifically give it priority over SSE form. Add an OptForSpeed to explicitly pair up with an OptForSize that was already on another pattern. llvm-svn: 160939	2012-07-30 01:38:57 +00:00
Craig Topper	28402efcb6	Fix load types on intrinsic forms of SS2SD and SD2SS AVX/SSE convert instruction patterns. llvm-svn: 160938	2012-07-29 23:26:34 +00:00
Craig Topper	b6767f3acd	Move more SSE/AVX convert instruction patterns into their definitions. llvm-svn: 160937	2012-07-29 22:30:06 +00:00
Manman Ren	f87dd7c01b	Revert r160920 and r160919 due to dragonegg and clang selfhost failure llvm-svn: 160927	2012-07-29 02:44:09 +00:00

1 2 3 4 5 ...

55695 Commits