llvm-project

Commit Graph

Author	SHA1	Message	Date
Michael Kuperstein	84dff4c94c	[X86][Haswell][SchedModel] Fix patterns for scalar FMA3 variants. llvm-svn: 231073	2015-03-03 15:47:02 +00:00
Michael Kuperstein	4af7449659	[X86][Haswell][SchedModel] Fix WriteMULm latency. The latency for the WriteMULm class was set to 4, which is actually lower than the latency for WriteMULr (5). A better estimate would be 4 added to WriteMULr, that is, 9. llvm-svn: 230634	2015-02-26 14:30:09 +00:00
Andrea Di Biagio	196e873cdc	[X86][SchedModel] SSE reciprocal square root instruction latencies. The SSE rsqrt instruction (a fast reciprocal square root estimate) was grouped in the same scheduling IIC_SSE_SQRT* class as the accurate (but very slow) SSE sqrt instruction. For code which uses rsqrt (possibly with Newton-Raphson iterations) this poor scheduling was hurting performance. This patch splits off the rsqrt instruction from the sqrt instruction scheduling classes and creates new IIC_SSE_RSQER* classes with latency values based on Agner's table. Differential Revision: http://reviews.llvm.org/D5370 Patch by Simon Pilgrim. llvm-svn: 218517	2014-09-26 12:56:44 +00:00
Quentin Colombet	7e939fb431	[X86][Haswell][SchedModel] Tidy up. <rdar://problem/15607571> llvm-svn: 215924	2014-08-18 17:56:01 +00:00
Quentin Colombet	95e053119e	[X86][Haswell][SchedModel] Add architecture specific scheduling models. Group: Floating Point XMM and YMM instructions. Sub-group: Other instructions. <rdar://problem/15607571> llvm-svn: 215923	2014-08-18 17:55:59 +00:00
Quentin Colombet	81db56d931	[X86][Haswell][SchedModel] Add architecture specific scheduling models. Group: Floating Point XMM and YMM instructions. Sub-group: Logic instructions. <rdar://problem/15607571> llvm-svn: 215922	2014-08-18 17:55:56 +00:00
Quentin Colombet	c13c50e0f3	[X86][Haswell][SchedModel] Add architecture specific scheduling models. Group: Floating Point XMM and YMM instructions. Sub-group: Math instructions. <rdar://problem/15607571> llvm-svn: 215921	2014-08-18 17:55:53 +00:00
Quentin Colombet	45c469c0c3	[X86][Haswell][SchedModel] Add architecture specific scheduling models. Group: Floating Point XMM and YMM instructions. Sub-group: Arithmetic instructions. <rdar://problem/15607571> llvm-svn: 215920	2014-08-18 17:55:51 +00:00
Quentin Colombet	ca74f23df7	[X86][Haswell][SchedModel] Add architecture specific scheduling models. Group: Floating Point XMM and YMM instructions. Sub-group: Conversion instructions. <rdar://problem/15607571> llvm-svn: 215919	2014-08-18 17:55:49 +00:00
Quentin Colombet	71cdecd73c	[X86][Haswell][SchedModel] Add architecture specific scheduling models. Group: Floating Point XMM and YMM instructions. Sub-group: Move instructions. <rdar://problem/15607571> llvm-svn: 215918	2014-08-18 17:55:46 +00:00
Quentin Colombet	bd11563742	[X86][Haswell][SchedModel] Add architecture specific scheduling models. Group: Integer MMX and XMM instructions. Sub-group: Other instructions. <rdar://problem/15607571> llvm-svn: 215917	2014-08-18 17:55:43 +00:00
Quentin Colombet	91513d9522	[X86][Haswell][SchedModel] Add architecture specific scheduling models. Group: Integer MMX and XMM instructions. Sub-group: Logic instructions. <rdar://problem/15607571> llvm-svn: 215916	2014-08-18 17:55:41 +00:00
Quentin Colombet	e9f8b4b7ac	[X86][Haswell][SchedModel] Add architecture specific scheduling models. Group: Integer MMX and XMM instructions. Sub-group: Arithmetic instructions. <rdar://problem/15607571> llvm-svn: 215915	2014-08-18 17:55:39 +00:00
Quentin Colombet	f68e09418c	[X86][Haswell][SchedModel] Add architecture specific scheduling models. Group: Integer MMX and XMM instructions. Sub-group: Move instructions. <rdar://problem/15607571> llvm-svn: 215914	2014-08-18 17:55:36 +00:00
Quentin Colombet	33b0bf200d	[X86][Haswell][SchedModel] Add architecture specific scheduling models. Group: Floating Point x87 instructions. Sub-group: Math instructions. <rdar://problem/15607571> llvm-svn: 215913	2014-08-18 17:55:32 +00:00
Quentin Colombet	456c991fb4	[X86][Haswell][SchedModel] Add architecture specific scheduling models. Group: Floating Point x87 instructions. Sub-group: Arithmetic instructions. <rdar://problem/15607571> llvm-svn: 215912	2014-08-18 17:55:29 +00:00
Quentin Colombet	0bc907e5e8	[X86][Haswell][SchedModel] Add architecture specific scheduling models. Group: Floating Point x87 instructions. Sub-group: Move instructions. <rdar://problem/15607571> llvm-svn: 215911	2014-08-18 17:55:26 +00:00
Quentin Colombet	6e62be2f5a	[X86][Haswell][SchedModel] Add architecture specific scheduling models. Group: Integer instructions. Sub-group: Other instructions. <rdar://problem/15607571> llvm-svn: 215910	2014-08-18 17:55:23 +00:00
Quentin Colombet	a6c56f5072	[X86][Haswell][SchedModel] Add architecture specific scheduling models. Group: Integer instructions. Sub-group: Synchronization instructions. <rdar://problem/15607571> llvm-svn: 215909	2014-08-18 17:55:21 +00:00
Quentin Colombet	c58fc449fd	[X86][Haswell][SchedModel] Add architecture specific scheduling models. Group: Integer instructions. Sub-group: String instructions. <rdar://problem/15607571> llvm-svn: 215908	2014-08-18 17:55:19 +00:00
Quentin Colombet	e1b17768a0	[X86][Haswell][SchedModel] Add architecture specific scheduling models. Group: Integer instructions. Sub-group: Control transfer instructions. <rdar://problem/15607571> llvm-svn: 215907	2014-08-18 17:55:16 +00:00
Quentin Colombet	fb887b1c05	[X86][Haswell][SchedModel] Add architecture specific scheduling models. Group: Integer instructions. Sub-group: Logic instructions. <rdar://problem/15607571> llvm-svn: 215906	2014-08-18 17:55:13 +00:00
Quentin Colombet	df26059e13	[X86][Haswell][SchedModel] Add architecture specific scheduling models. Group: Integer instructions. Sub-group: Arithmetic instructions. <rdar://problem/15607571> llvm-svn: 215905	2014-08-18 17:55:11 +00:00
Quentin Colombet	35d37b7571	[X86][Haswell][SchedModel] Add architecture specific scheduling models. Group: Integer instructions. Sub-group: Move instructions. <rdar://problem/15607571> llvm-svn: 215904	2014-08-18 17:55:08 +00:00
Hal Finkel	6532c20faa	Move late partial-unrolling thresholds into the processor definitions. The old method used by X86TTI to determine partial-unrolling thresholds was messy (because it worked by testing target features), and also would not correctly identify the target CPU if certain target features were disabled. After some discussions on IRC with Chandler et al., it was decided that the processor scheduling models were the right containers for this information (because it is often tied to special uop dispatch-buffer sizes). This does represent a small functionality change: - For generic x86-64 (which uses the SB model and, thus, will get some unrolling). - For AMD cores (because they still currently use the SB scheduling model). - For Haswell (based on benchmarking by Louis Gerbarg, it was decided to bump the default threshold to 50; we're working on a test case for this). Otherwise, nothing has changed for any other targets. The logic, however, has been moved into BasicTTI, so other targets may now also opt-in to this functionality simply by setting LoopMicroOpBufferSize in their processor model definitions. llvm-svn: 208289	2014-05-08 09:14:44 +00:00
Quentin Colombet	9c816f39ad	Revert r205599, the commit was not intended to have so many changes llvm-svn: 205600	2014-04-04 02:02:49 +00:00
Quentin Colombet	7ee4e79dec	[RegAllocGreedy][Last Chance Recoloring] Emit diagnostics when last chance recoloring cut-offs are hit. This is related to PR18747. Patch by MAYUR PANDEY <mayur.p@samsung.com> llvm-svn: 205599	2014-04-04 01:58:57 +00:00
Quentin Colombet	ca49851833	[X86][SchedModel] Add missing scheduling model for SSE related instructions. The patch defines new or refines existing generic scheduling classes to match the behavior of the SSE instructions. It also maps those scheduling classes on the related SSE instructions. <rdar://problem/15607571> llvm-svn: 202065	2014-02-24 19:33:51 +00:00
Quentin Colombet	9e16c8ac8a	[X86][SchedModel] Fix typos in the definitions of the ports for Haswell. llvm-svn: 200403	2014-01-29 18:26:59 +00:00
Andrew Trick	b6854d80e3	Mark the x86 machine model as incomplete. PR17367. Ideally, the machine model is added at the time the instructions are defined. But many instructions in X86InstrSSE.td still need a model. Without this workaround the scheduler asserts because x86 already has itinerary classes for these instructions, indicating they should be modeled by the scheduler. Since we use the new machine model for other instructions, it expects a new machine model for these too. llvm-svn: 191391	2013-09-25 18:14:12 +00:00
Andrew Trick	7201f4f7ec	Fix IMULX machine model. Multiple def operands require multiple SchedWrites. llvm-svn: 184566	2013-06-21 18:33:04 +00:00
Andrew Trick	40c4f38071	Support BufferSize on ProcResGroup for unified MOp schedulers. And add Sandybridge/Haswell resource buffers. llvm-svn: 184034	2013-06-15 04:50:06 +00:00
Andrew Trick	18dc3da855	Update machine models. Specify buffer sizes for OOO processors. llvm-svn: 184033	2013-06-15 04:50:02 +00:00
Andrew Trick	de2109eb4c	Machine Model: Add MicroOpBufferSize and resource BufferSize. Replace the ill-defined MinLatency and ILPWindow properties with straightforward buffer sizes: MCSchedModel::MicroOpBufferSize and MCProcResourceDesc::BufferSize. These can be used to more precisely model instruction execution if desired. Disabled some misched tests temporarily. They'll be reenabled in a few commits. llvm-svn: 184032	2013-06-15 04:49:57 +00:00
Andrew Trick	f7fd6b9e3a	X86 machine model: reduce SandyBridge and Haswell ILPWindow. The initial values were arbitrary. I want them to be more conservative. This represents the number of latency cycles hidden by OOO execution. In practice, I think it should be within a small factor of the complex floating point operation latency so the scheduler can make some attempt to hide latency even for smallish blocks. These are by no means the best values, just a starting point for tuning heuristics. Some benchmarks such as TSVC run faster with this lower value for SandyBridge. I haven't run anything on Haswell, but it shouldn't be 2x SB. llvm-svn: 179450	2013-04-13 06:07:43 +00:00
Andrew Trick	e1d88cfb57	The divide unit is not pipelined, but it is still buffered. Buffered means a later divide may be executed out-of-order while a prior divide is sitting (buffered) in a reservation station. You can tell it's not pipelined, because operations that use it reserve it for more than one cycle: def : WriteRes<WriteIDiv, [HWPort0, HWDivider]> { let Latency = 25; let ResourceCycles = [1, 10]; } We don't currently distinguish between an unpipelined operation and one that is split into multiple micro-ops requiring the same unit, except that the latter may have NumMicroOps > 1 if they also consume issue/dispatch resources. llvm-svn: 178519	2013-04-02 01:58:47 +00:00
Nadav Rotem	e7b6a8aa8c	Add the Haswell machine model. llvm-svn: 178301	2013-03-28 22:34:46 +00:00

37 Commits