llvm-project

Commit Graph

Author	SHA1	Message	Date
David Blaikie	2e488d1f0d	Generalize debug info test to be resilient to changes in metadata node numbering llvm-svn: 177238	2013-03-17 21:08:22 +00:00
David Blaikie	08fb5457aa	Improve DIFile debug info annotation by letting it fallback to DIScope llvm-svn: 177236	2013-03-17 20:28:12 +00:00
Jakob Stoklund Olesen	13d4a07fa9	Use ArrayRef<MVT::SimpleValueType> when possible. Not passing vector references around makes it possible to use SmallVector in most places. llvm-svn: 177235	2013-03-17 17:26:09 +00:00
Sylvestre Ledru	37ef20d307	To avoid symbol clash, undefine PPC here. PPC may be predefined on some hosts. llvm-svn: 177234	2013-03-17 12:40:42 +00:00
Rafael Espindola	bd5bd89e77	Build LLVMgold.so on FreeBSD using cmake. Patch by Stephen Checkoway. llvm-svn: 177233	2013-03-17 12:01:05 +00:00
Michael Gottesman	9782183126	The promised test case for r175939. This test makes sure that the ObjCARC escape analysis looks at the uses of instructions which copy the block pointer value by checking all four cases where that can occur. llvm-svn: 177232	2013-03-17 08:42:58 +00:00
Hal Finkel	fcc51d4ff1	Improve PPC VR (Altivec) register spilling This change cleans up two issues with Altivec register spilling: 1. The spilling code was inefficient (using two instructions, and add and a load, when just one would do) 2. The code assumed that r0 would always be available (true for now, but this will change) The new code handles VR spilling just like GPR spills but forced into r+r mode. As a result, when any VR spills are present, we must now always allocate the register-scavenger spill slot. llvm-svn: 177231	2013-03-17 04:43:44 +00:00
Hal Finkel	57080382e6	Remove FIXMEs in PPC test cases related to unaligned loads/stores As pointed out by Bill in response to r177160, these two FIXMEs can also be removed. llvm-svn: 177229	2013-03-16 23:02:31 +00:00
Hal Finkel	8b0470393b	Remove PPC avoidWriteAfterWrite callback As a follow-up to r158719, remove PPCRegisterInfo::avoidWriteAfterWrite. Jakob pointed out in response to r158719 that this callback is currently unused and so this has no effect (and the speedups that I thought that I had observed as a result of implementing this function must have been noise). llvm-svn: 177228	2013-03-16 22:50:51 +00:00
Andrew Trick	6057017c68	Change the default latency for implicit defs. Implicit defs are not currently positional and not modeled by the per-operand machine model. Unfortunately, we treat defs that are part of the architectural instruction description, like flags, the same as other implicit defs. Really, they should have a fixed MachineInstr layout and probably shouldn't be "implicit" at all. For now, we'll change the default latency to be the max operand latency. That will give flag setting operands full latency for x86 folded loads. Other kinds of "fake" implicit defs don't occur prior to regalloc anyway, and we would like them to go away postRegAlloc as well. llvm-svn: 177227	2013-03-16 18:58:57 +00:00
Andrew Trick	bf8a28dc52	Machine model. Allow mixed itinerary classes and SchedRW lists. We always supported a mixture of the old itinerary model and new per-operand model, but it required a level of indirection to map itinerary classes to SchedRW lists. This was done for ARM A9. Now we want to define x86 SchedRW lists, with the goal of removing its itinerary classes, but still support the itineraries in the mean time. When I original developed the model, Atom did not have itineraries, so there was no reason to expect this requirement. llvm-svn: 177226	2013-03-16 18:58:55 +00:00
Sean Silva	ca11d2c7ff	[docs] Discuss a potential bug to be aware of. llvm-svn: 177224	2013-03-16 16:58:20 +00:00
Aaron Ballman	fcdf9a8240	Test case for graceful handling of long file names on Windows. Patch thanks to Paul Robinson! llvm-svn: 177223	2013-03-16 15:00:51 +00:00
Craig Topper	612f7bfa4d	Add X86 code emitter support AVX encoded MRMDestReg instructions. Previously we weren't skipping the VVVV encoded register. Based on patch by Michael Liao. llvm-svn: 177221	2013-03-16 03:44:31 +00:00
Jakob Stoklund Olesen	63bff2eb39	Define more SchedWrites for annotating X86 instructions. Since almost all X86 instructions can fold loads, use a multiclass to define register/memory pairs of SchedWrites. An X86FoldableSchedWrite represents the register version of an instruction. It holds a reference to the SchedWrite to use when the instruction folds a load. This will be used inside multiclasses that define rr and rm instruction versions together. llvm-svn: 177210	2013-03-16 00:02:17 +00:00
Jakob Stoklund Olesen	a4a361df5b	Add SchedRW as an Instruction field. Don't require instructions to inherit Sched<...>. Sometimes it is more convenient to say: let SchedRW = ... in { ... } Which is now possible. llvm-svn: 177199	2013-03-15 22:51:13 +00:00
Daniel Dunbar	3145eb8e54	[ADT] Fix StringSet::insert() to not allocate on every lookup. - The previous implementation always constructed the StringMap entry, even if the key was present in the set. llvm-svn: 177178	2013-03-15 20:16:59 +00:00
Michael J. Spencer	d932d41190	[Support][Path][Windows] Fix dangling else. Don't call CloseHandle when CloseFD is false. llvm-svn: 177175	2013-03-15 19:25:47 +00:00
Arnold Schwaighofer	9d7a3827e4	ARM cost model: Fix costs for some vector selects I was too pessimistic in r177105. Vector selects that fit into a legal register type lower just fine. I was mislead by the code fragment that I was using. The stores/loads that I saw in those cases came from lowering the conditional off an address. Changing the code fragment to: %T0_3 = type <8 x i18> %T1_3 = type <8 x i1> define void @func_blend3(%T0_3* %loadaddr, %T0_3* %loadaddr2, %T1_3* %blend, %T0_3* %storeaddr) { %v0 = load %T0_3* %loadaddr %v1 = load %T0_3* %loadaddr2 ==> FROM: ;%c = load %T1_3* %blend ==> TO: %c = icmp slt %T0_3 %v0, %v1 ==> USE: %r = select %T1_3 %c, %T0_3 %v0, %T0_3 %v1 store %T0_3 %r, %T0_3* %storeaddr ret void } revealed this mistake. radar://13403975 llvm-svn: 177170	2013-03-15 18:31:01 +00:00
Silviu Baranga	82dd6ac3bc	Adding an A15 specific optimization pass for interactions between S/D/Q registers. The pass handles all the required transformations pre-regalloc. llvm-svn: 177169	2013-03-15 18:28:25 +00:00
Benjamin Kramer	2f5457141a	ARM: Fix an old refacto. Fixes PR15520. llvm-svn: 177167	2013-03-15 17:27:39 +00:00
Hal Finkel	8d7fbc9dad	Enable unaligned memory access on PPC for scalar types Unaligned access is supported on PPC for non-vector types, and is generally more efficient than manually expanding the loads and stores. A few of the existing test cases were using expanded unaligned loads and stores to test other features (like load/store with update), and for these test cases, unaligned access remains disabled. llvm-svn: 177160	2013-03-15 15:27:13 +00:00
Arnold Schwaighofer	f5284ff61f	ARM cost model: Fix cost of fptrunc and fpext instructions A vector fptrunc and fpext simply gets split into scalar instructions. radar://13192358 llvm-svn: 177159	2013-03-15 15:10:47 +00:00
Hal Finkel	b0fac42987	Protect PPC Altivec patterns with a predicate In preparation for the addition of other SIMD ISA extensions (such as QPX) we need to make sure that all Altivec patterns are properly predicated on having Altivec support. No functionality change intended (one test case needed to be updated b/c it assumed that Altivec intrinsics would be supported without enabling Altivec support). llvm-svn: 177152	2013-03-15 13:21:21 +00:00
Alexey Samsonov	cd27b98d38	Fixup for r176933: more careful setup of path to llvm-symbolizer llvm-svn: 177144	2013-03-15 07:27:49 +00:00
Craig Topper	f6f549ce02	Use NumBaseBits in a few more places in SmallBitVector instead of recalculating it. No functional change. llvm-svn: 177142	2013-03-15 06:01:42 +00:00
Rafael Espindola	ef9d3494b2	Fix the FDE encoding to be relative on ELF. This is a very late complement to r130637 which fixed this on x86_64. Fixes pr15448. Since it looks like that every elf architecture uses this encoding when using cfi, make it the default for elf. Just exclude mips64el. It has a lovely .ll -> .o test (ef_frame.ll) that tests that nothing changes in the binary content of the .eh_frame produced by llc. Oblige it. llvm-svn: 177141	2013-03-15 05:51:57 +00:00
Hal Finkel	bb420f10e9	Allocate the RS spill slot for any PPC function with spills and a large stack frame For spills into a large stack frame, the FI-elimination code uses the register scavenger to obtain a free GPR for use with an r+r-addressed load or store. When there are no available GPRs, the scavenger gets one by using its spill slot. Previously, we were not always allocating that spill slot and the RS would assert when the spill slot was needed. I don't currently have a small test that triggered the assert, but I've created a small regression test that verifies that the spill slot is now added when the stack frame is sufficiently large. llvm-svn: 177140	2013-03-15 05:06:04 +00:00
Eric Christopher	31f4354c75	Turn anonymous type in anonymous union warning back on after cleaning up issues. llvm-svn: 177136	2013-03-15 00:43:00 +00:00
Eric Christopher	8996c5d469	Silence anonymous type in anonymous union warnings. llvm-svn: 177135	2013-03-15 00:42:55 +00:00
Nadav Rotem	4a4827ce21	Add a triple to the test. llvm-svn: 177131	2013-03-15 00:10:23 +00:00
Nadav Rotem	adfa5eaf8c	Unaligned loads should use the VMOVUPS opcode. llvm-svn: 177130	2013-03-14 23:49:44 +00:00
David Blaikie	6e5e0316aa	Remove some unused variables to clean the Clang -Werror build (these were added in r177089) llvm-svn: 177129	2013-03-14 23:11:07 +00:00
Akira Hatanaka	b83b2edae3	[mips] Set isAllocatable bit of unallocatable register classes to 0. llvm-svn: 177128	2013-03-14 23:09:19 +00:00
Andrew Trick	a5c747b0ca	Fix r177112: Add ProcResGroup. This is the other half of r177122 that I meant to commit at the same time. llvm-svn: 177123	2013-03-14 22:47:01 +00:00
Jakob Stoklund Olesen	712366821a	Prepare for adding InstrSchedModel annotations to X86 instructions. The new InstrSchedModel is easier to use than the instruction itineraries. It will be used to model instruction latency and throughput in modern Intel microarchitectures like Sandy Bridge. InstrSchedModel should be able to coexist with instruction itinerary classes, but for cleanliness we should switch the Atom processor model to the new InstrSchedModel as well. llvm-svn: 177122	2013-03-14 22:42:17 +00:00
Reed Kotler	fafaa9d967	Add a new method which enables one to change register classes. See the Mips16ISetLowering.cpp patch to see a use of this. For now now the extra code in Mips16ISetLowering.cpp is a nop but is used for test purposes. Mips32 registers are setup and then removed and then the Mips16 registers are setup. Normally you need to add register classes and then call computeRegisterProperties. llvm-svn: 177120	2013-03-14 22:02:09 +00:00
Arnold Schwaighofer	9b55e31bcb	LoopVectorizer: Insert some white space to make test case more readable Also remove some unneeded function attributes. llvm-svn: 177114	2013-03-14 21:31:09 +00:00
Chad Rosier	4b54f594b4	[fast-isel] The X86FastISel::FastLowerArguments function doesn't properly handle the win64 calling convention. rdar://13423768 llvm-svn: 177113	2013-03-14 21:25:04 +00:00
Andrew Trick	4e67cba8a6	MachineModel: Add a ProcResGroup class. This allows abitrary groups of processor resources. Using something in a subset automatically counts againts the superset. Currently, this only works if the superset is also a ProcResGroup as opposed to a SuperUnit. This allows SandyBridge to be expressed naturally, which will be checked in shortly. def SBPort01 : ProcResGroup<[SBPort0, SBPort1]>; def SBPort15 : ProcResGroup<[SBPort1, SBPort5]>; def SBPort23 : ProcResGroup<[SBPort2, SBPort3]>; def SBPort015 : ProcResGroup<[SBPort0, SBPort1, SBPort5]>; llvm-svn: 177112	2013-03-14 21:21:50 +00:00
Hal Finkel	628ba12823	Move estimateStackSize from ARM into MachineFrameInfo This is a generic function (derived from PEI); moving it into MachineFrameInfo eliminates a current redundancy between the ARM and AArch64 backends, and will allow it to be used by the PowerPC target code. No functionality change intended. llvm-svn: 177111	2013-03-14 21:15:20 +00:00
Hal Finkel	5a765fddb0	Provide the register scavenger to processFunctionBeforeFrameFinalized Add the current PEI register scavenger as a parameter to the processFunctionBeforeFrameFinalized callback. This change is necessary in order to allow the PowerPC target code to set the register scavenger frame index after the save-area offset adjustments performed by processFunctionBeforeFrameFinalized. Only after these adjustments have been made is it possible to estimate the size of the stack frame. llvm-svn: 177108	2013-03-14 20:33:40 +00:00
Hal Finkel	ad26f4ded2	Use frame-index scavenging for PPC register spilling Make requiresFrameIndexScavenging return true, and create virtual registers in the spilling code instead of using the register scavenger directly. This makes the target-level code simpler, and importantly, delays the scavenging until after callee-saved register processing (which will be important for later changes). Also cleans up trackLivenessAfterRegAlloc (makes it inline in the header with the other related functions). This makes it clear that it always returns true. No functionality change intended. llvm-svn: 177107	2013-03-14 20:21:47 +00:00
Hal Finkel	e987a311ba	Not all PPC functions with a frame pointer need a RS spill slot We used to add a spill slot for the register scavenger whenever the function has a frame pointer. This is unnecessarily conservative: We may need the spill slot for dynamic stack allocations, and functions with dynamic stack allocations always have a FP, but we might also have a FP for other reasons (such as the user explicitly disabling frame-pointer elimination), and we don't necessarily need a spill slot for those functions. The structsinregs test needed adjustment because it disables FP elimination. llvm-svn: 177106	2013-03-14 19:34:32 +00:00
Arnold Schwaighofer	8070b382ec	ARM cost model: Increase cost of some vector selects we do terrible on By terrible I mean we store/load from the stack. This matters on PAQp8 in _Z5trainPsS_ii (which is inlined into Mixer::update) where we decide to vectorize a loop with a VF of 8 resulting in a 25% degradation on a cortex-a8. LV: Found an estimated cost of 2 for VF 8 For instruction: icmp slt i32 LV: Found an estimated cost of 2 for VF 8 For instruction: select i1, i32, i32 The bug that tracks the CodeGen part is PR14868. radar://13403975 llvm-svn: 177105	2013-03-14 19:17:02 +00:00
Akira Hatanaka	44ebe00158	[mips] Fix filename in comment and delete unnecessary lines of code. No functionality changes. llvm-svn: 177104	2013-03-14 19:09:52 +00:00
Jyotsna Verma	ec613665c2	Hexagon: Removed asserts regarding alignment and offset. We are warning the user about the alignment, so we should not assert. llvm-svn: 177103	2013-03-14 19:08:03 +00:00
Arnold Schwaighofer	4991ce9d49	Add missing asserts flag to test - it uses debug flags llvm-svn: 177102	2013-03-14 19:01:58 +00:00
Akira Hatanaka	7239a6003f	Android uses cacheflush(long start, long end, long flags) for MIPS. Patch by Stephen Hines. llvm-svn: 177101	2013-03-14 19:01:00 +00:00
Arnold Schwaighofer	c63cf3a0ae	LoopVectorize: Invert case when we use a vector cmp value to query select cost We generate a select with a vectorized condition argument when the condition is NOT loop invariant. Not the other way around. llvm-svn: 177098	2013-03-14 18:54:36 +00:00

1 2 3 4 5 ...

90212 Commits