llvm-project

Commit Graph

Author	SHA1	Message	Date
Arnold Schwaighofer	2a70c69d31	Revert series of sched model patches until I figure out what is going on. llvm-svn: 183273	2013-06-04 22:35:17 +00:00
Arnold Schwaighofer	be3a06c85f	ARM sched model: Cortex A9 - More InstRW sched resources Add more InstRW mappings. llvm-svn: 183266	2013-06-04 22:16:00 +00:00
Arnold Schwaighofer	d5b9794a53	ARM sched model: Add divsion, loads, branches, vfp cvt Add some generic SchedWrites and assign resources for Swift and Cortex A9. llvm-svn: 183257	2013-06-04 22:15:46 +00:00
Arnold Schwaighofer	fb6b9f48d0	ARM scheduler model: Add scheduler info to more instructions and resource descriptions for compares llvm-svn: 178844	2013-04-05 05:01:06 +00:00
Arnold Schwaighofer	6793aebb84	ARM Scheduler Model: Add resources instructions, map resources in subtargets Reapply r177968: After commit 178074 we can now have undefined scheduler variants. Move the CortexA9 resources into the CortexA9 SchedModel namespace. Define resource mappings under the CortexA9 SchedModel. Define resources and mappings for the SwiftModel. Incooperate Andrew's feedback. llvm-svn: 178460	2013-04-01 13:07:05 +00:00
Arnold Schwaighofer	414ef565bb	Revert ARM Scheduler Model: Add resources instructions, map resources This reverts commit r177968. It is causing failures in a local build bot. "fatal error: error in backend: Expected a variant SchedClass" Original commit message: Move the CortexA9 resources into the CortexA9 SchedModel namespace. Define resource mappings under the CortexA9 SchedModel. Define resources and mappings for the SwiftModel. llvm-svn: 178028	2013-03-26 15:14:04 +00:00
Arnold Schwaighofer	ce6392611b	ARM Scheduler Model: Add resources instructions, map resources in subtargets Move the CortexA9 resources into the CortexA9 SchedModel namespace. Define resource mappings under the CortexA9 SchedModel. Define resources and mappings for the SwiftModel. llvm-svn: 177968	2013-03-26 02:01:42 +00:00
Andrew Trick	9f0b95f260	MIsched: add an ILP window property to machine model. This was an experimental option, but needs to be defined per-target. e.g. PPC A2 needs to aggressively hide latency. I converted some in-order scheduling tests to A2. Hal is working on more test cases. llvm-svn: 171946	2013-01-09 03:36:49 +00:00
Andrew Trick	99cc7f6a3b	Cortex-A9 latency fixes (w/ -schedmodel only). Quick review against the manual revealed a few obvious mistakes. llvm-svn: 164361	2012-09-21 05:06:40 +00:00
Andrew Trick	985dc0dd64	Cortex-A9 instruction-level scheduling machine model. This models the A9 processor at the level of instruction operands, as opposed to the itinerary, which models each operation at the level of pipeline stages. The two primary motivations are: 1) Allow MachineScheduler to model A9 as an out-of-order processor. It can now distinguish between hazards that force interlocking vs. buffered resources. 2) Reduce long-term maintenance by allowing the itinerary and target hooks to eventually be removed. Note that almost all of the complexity in the new model exists to model instruction variants, which the itinerary cannot handle. Instead the scheduler previously relied on processor-specific target hooks which are incomplete and buggy. llvm-svn: 163921	2012-09-14 18:31:58 +00:00
Andrew Trick	352abc19a5	Added MispredictPenalty to SchedMachineModel. This replaces an existing subtarget hook on ARM and allows standard CodeGen passes to potentially use the property. llvm-svn: 161471	2012-08-08 02:44:16 +00:00
Andrew Trick	87255e340e	I'm introducing a new machine model to simultaneously allow simple subtarget CPU descriptions and support new features of MachineScheduler. MachineModel has three categories of data: 1) Basic properties for coarse grained instruction cost model. 2) Scheduler Read/Write resources for simple per-opcode and operand cost model (TBD). 3) Instruction itineraties for detailed per-cycle reservation tables. These will all live side-by-side. Any subtarget can use any combination of them. Instruction itineraries will not change in the near term. In the long run, I expect them to only be relevant for in-order VLIW machines that have complex contraints and require a precise scheduling/bundling model. Once itineraries are only actively used by VLIW-ish targets, they could be replaced by something more appropriate for those targets. This tablegen backend rewrite sets things up for introducing MachineModel type #2: per opcode/operand cost model. llvm-svn: 159891	2012-07-07 04:00:00 +00:00
Andrew Trick	f161e391f8	Reapply "Make NumMicroOps a variable in the subtarget's instruction itinerary." Reapplies r159406 with minor cleanup. The regressions appear to have been spurious. llvm-svn: 159541	2012-07-02 18:10:42 +00:00
Andrew Trick	51a8cf77b8	Revert "Make NumMicroOps a variable in the subtarget's instruction itinerary." This reverts commit r159406. I noticed a performance regression so I'll back out for now. llvm-svn: 159411	2012-06-29 07:10:41 +00:00
Andrew Trick	1f50152b2d	Make NumMicroOps a variable in the subtarget's instruction itinerary. The TargetInstrInfo::getNumMicroOps API does not change, but soon it will be used by MachineScheduler. Now each subtarget can specify the number of micro-ops per itinerary class. For ARM, this is currently always dynamic (-1), because it is used for load/store multiple which depends on the number of register operands. Zero is now a valid number of micro-ops. This can be used for nop pseudo-instructions or instructions that the hardware can squash during dispatch. llvm-svn: 159406	2012-06-29 03:23:18 +00:00
Andrew Trick	b2680c718f	ARM itinerary properties. llvm-svn: 157980	2012-06-05 03:44:43 +00:00
Evan Cheng	aca6c822e6	Fix a number of problems with ARM fused multiply add/subtract instructions. 1. The new instruction itinerary entries are not properly described. 2. The asm parser can't handle vfms and vfnms. 3. There were no assembler, disassembler test cases. 4. HasNEON2 has the wrong assembler predicate. rdar://10139676 llvm-svn: 154456	2012-04-11 00:13:00 +00:00
Bob Wilson	f33715e554	Improvements for the Cortex-A9 scheduling itineraries. llvm-svn: 129770	2011-04-19 18:11:36 +00:00
Evan Cheng	7d6cd4902e	Change A9 scheduling itineraries VLD* / VST* entries default to "aligned". That is, it assumes addresses are 64-bit aligned (which should be the more common case). If the alignment is found not to be aligned, then getOperandLatency() would adjust the operand latency computation by one to compensate for it. rdar://9294833 llvm-svn: 129742	2011-04-19 01:21:49 +00:00
Evan Cheng	b8b0ad80a8	Sorry, several patches in one. TargetInstrInfo: Change produceSameValue() to take MachineRegisterInfo as an optional argument. When in SSA form, targets can use it to make more aggressive equality analysis. Machine LICM: 1. Eliminate isLoadFromConstantMemory, use MI.isInvariantLoad instead. 2. Fix a bug which prevent CSE of instructions which are not re-materializable. 3. Use improved form of produceSameValue. ARM: 1. Teach ARM produceSameValue to look pass some PIC labels. 2. Look for operands from different loads of different constant pool entries which have same values. 3. Re-implement PIC GA materialization using movw + movt. Combine the pair with a "add pc" or "ldr [pc]" to form pseudo instructions. This makes it possible to re-materialize the instruction, allow machine LICM to hoist the set of instructions out of the loop and make it possible to CSE them. It's a bit hacky, but it significantly improve code quality. 4. Some minor bug fixes as well. With the fixes, using movw + movt to materialize GAs significantly outperform the load from constantpool method. 186.crafty and 255.vortex improved > 20%, 254.gap and 176.gcc ~10%. llvm-svn: 123905	2011-01-20 08:34:58 +00:00
Andrew Trick	163a24420a	Fix the ARM IIC_iCMPsi itinerary and add an important assert. llvm-svn: 122794	2011-01-04 00:32:57 +00:00
Evan Cheng	7f3e9150d0	Fix an obvious cut-n-paste error. llvm-svn: 121307	2010-12-08 23:01:18 +00:00
Bob Wilson	431ac4ef50	Add support for NEON VLD3-dup instructions. The encoding for alignment in VLD4-dup instructions is still a work in progress. llvm-svn: 120356	2010-11-30 00:00:35 +00:00
Bob Wilson	77ab165afe	Add support for NEON VLD3-dup instructions. llvm-svn: 120312	2010-11-29 19:35:29 +00:00
Bob Wilson	8022367809	Fix copy-and-paste errors in VLD2-dup scheduling itineraries. llvm-svn: 120311	2010-11-29 19:35:23 +00:00
Bob Wilson	2d790df105	Add support for NEON VLD2-dup instructions. llvm-svn: 120236	2010-11-28 06:51:26 +00:00
Bob Wilson	c92eea0175	Add NEON VLD1-dup instructions (load 1 element to all lanes). llvm-svn: 120194	2010-11-27 06:35:16 +00:00
Bob Wilson	3a63f9d852	Fix incorrect scheduling itineraries for NEON vld1/vst1 instructions. I added these instructions recently but I have no idea where these "1" values in the NextCycles field came from. As far as I can tell now, these instruction stages are clearly intended to overlap. llvm-svn: 120193	2010-11-27 06:35:09 +00:00
Evan Cheng	79ff5238e9	Conditional moves are slightly more expensive than moves. llvm-svn: 118985	2010-11-13 05:14:20 +00:00
Evan Cheng	8740ee3637	Fix preload instruction isel. Only v7 supports pli, and only v7 with mp extension supports pldw. Add subtarget attribute to denote mp extension support and legalize illegal ones to nothing. llvm-svn: 118160	2010-11-03 06:34:55 +00:00
Evan Cheng	634ab6c2b7	Modify scheduling itineraries to correct instruction latencies (not operand latencies) of loads. llvm-svn: 118134	2010-11-03 00:40:22 +00:00
Bob Wilson	d80b29d6f7	Add NEON VST1-lane instructions. Partial fix for Radar 8599955. llvm-svn: 118069	2010-11-02 21:18:25 +00:00
Bob Wilson	dc44990c7d	Add NEON VLD1-lane instructions. Partial fix for Radar 8599955. llvm-svn: 117964	2010-11-01 22:04:05 +00:00
Evan Cheng	99cce36cf5	Fix fpscr <-> GPR latency info. llvm-svn: 117737	2010-10-29 23:16:55 +00:00
Evan Cheng	ff310737e5	Re-commit 117518 and 117519 now that ARM MC test failures are out of the way. llvm-svn: 117531	2010-10-28 06:47:08 +00:00
Evan Cheng	e2c211c1b9	Revert 117518 and 117519 for now. They changed scheduling and cause MC tests to fail. Ugh. llvm-svn: 117520	2010-10-28 02:00:25 +00:00
Evan Cheng	ff1c862f8e	- Assign load / store with shifter op address modes the right itinerary classes. - For now, loads of [r, r] addressing mode is the same as the [r, r lsl/lsr/asr #] variants. ARMBaseInstrInfo::getOperandLatency() should identify the former case and reduce the output latency by 1. - Also identify [r, r << 2] case. This special form of shifter addressing mode is "free". llvm-svn: 117519	2010-10-28 01:49:06 +00:00
Andrew Trick	f4ebec03e0	putback r116983 and fix simple-fp-encoding.ll tests llvm-svn: 116992	2010-10-21 03:40:16 +00:00
Owen Anderson	9e00f27e14	Revert r116983, which is breaking all the buildbots. llvm-svn: 116987	2010-10-21 03:11:16 +00:00
Evan Cheng	15c2ac90ec	Add missing scheduling itineraries for transfers between core registers and VFP registers. llvm-svn: 116983	2010-10-21 01:12:00 +00:00
Evan Cheng	3912158997	Limit load / store issues (at least until we have a true multi-issue aware scheduler). llvm-svn: 116389	2010-10-13 01:54:21 +00:00
Evan Cheng	e790afcbe1	More ARM scheduling itinerary fixes. llvm-svn: 116266	2010-10-11 23:41:41 +00:00
Evan Cheng	94ad008beb	Proper VST scheduling itineraries. llvm-svn: 116251	2010-10-11 22:03:18 +00:00
Evan Cheng	d7a404d85f	Add VLD4 scheduling itineraries. llvm-svn: 116143	2010-10-09 04:07:58 +00:00
Evan Cheng	a762400bed	Finish vld3 and vld4. llvm-svn: 116140	2010-10-09 01:45:34 +00:00
Evan Cheng	05f13e94bf	Correct some load / store instruction itinerary mistakes: 1. Cortex-A8 load / store multiplies can only issue on ALU0. 2. Eliminate A8_Issue, A8_LSPipe will correctly limit the load / store issues. 3. Correctly model all vld1 and vld2 variants. llvm-svn: 116134	2010-10-09 01:03:04 +00:00
Evan Cheng	1958cefd69	Model operand cycles of vldm / vstm; also fixes scheduling itineraries of vldr / vstr, etc. llvm-svn: 115898	2010-10-07 01:50:48 +00:00
Evan Cheng	49d4c0bd18	- Add TargetInstrInfo::getOperandLatency() to compute operand latencies. This allow target to correctly compute latency for cases where static scheduling itineraries isn't sufficient. e.g. variable_ops instructions such as ARM::ldm. This also allows target without scheduling itineraries to compute operand latencies. e.g. X86 can return (approximated) latencies for high latency instructions such as division. - Compute operand latencies for those defined by load multiple instructions, e.g. ldm and those used by store multiple instructions, e.g. stm. llvm-svn: 115755	2010-10-06 06:27:31 +00:00
Evan Cheng	73eac2aadf	Major changes to Cortex-A9 itinerary. 1. Model dual issues as two FUs. 2. Model the pipelines correctly: two symmetric ALUs, the multiplier is a dependent pipeline on ALU0. The changes do not have much impact on codegen right now. But I plan to make pre-RA scheduler multi-issue aware which should take good advantage of the changes. llvm-svn: 115457	2010-10-03 02:03:59 +00:00
Evan Cheng	a317815463	Fix r115332: correctly model AGU / NEON mux. llvm-svn: 115365	2010-10-01 22:52:29 +00:00

1 2

71 Commits