llvm-project

Commit Graph

Author	SHA1	Message	Date
Tom Stellard	94b7231740	R600/SI: Store immediate offsets > 12-bits in soffset This will save us from having to extend these offsets to 64-bits and storing them in a pair of vgprs. llvm-svn: 228776	2015-02-11 00:34:35 +00:00
Tom Stellard	c53861ab84	R600/SI: Add soffset operand to mubuf addr64 instruction We were previously hard-coding soffset to 0. llvm-svn: 228775	2015-02-11 00:34:32 +00:00
Tom Stellard	096b8c1e6d	R600: Don't promote i64 stores to v2i32 during DAG legalization We take care of this during instruction selection now. This fixes a potential infinite loop when lowering misaligned stores. llvm-svn: 228188	2015-02-04 20:49:49 +00:00
Eric Christopher	7792e32b64	Reuse a bunch of cached subtargets and remove getSubtarget calls without a Function argument. llvm-svn: 227638	2015-01-30 23:24:40 +00:00
Tom Stellard	20f6c0732f	R600/SI: Move i64 -> v2i32 load promotion into AMDGPUDAGToDAGISel::Select() We used to do this promotion during DAG legalization, but this caused an infinite loop in ExpandUnalignedLoad() because it assumed that i64 loads were legal if i64 was a legal type. It also seems better to report i64 loads as legal, since they actually are and we were just promoting them to simplify our tablegen files. llvm-svn: 226945	2015-01-23 22:05:45 +00:00
Tom Stellard	95292bbfcd	R600/SI: Use external symbols for scratch buffer We were passing the scratch buffer address to the shaders via user sgprs, but now we use external symbols and have the driver patch the shader using reloc information. llvm-svn: 226586	2015-01-20 17:49:47 +00:00
Tom Stellard	8058069529	R600/SI: Don't store scratch buffer frame index in MUBUF offset field We don't have a good way of legalizing this if the frame index offset is more than the 12-bits, which is size of MUBUF's offset field, so now we store the frame index in the vaddr field. llvm-svn: 226584	2015-01-20 17:49:43 +00:00
Tom Stellard	b6550529a6	R600/SI: Use RegisterOperands to specify which operands can accept immediates There are some operands which can take either immediates or registers and we were previously using different register class to distinguish between operands that could take immediates and those that could not. This patch switches to using RegisterOperands which should simplify the backend by reducing the number of register classes and also make it easier to implement the assembler. llvm-svn: 225662	2015-01-12 19:33:18 +00:00
Tom Stellard	45c0b3a882	R600/SI: Remove VReg_32 register class Use VGPR_32 register class instead. These two register classes were identical and having separate classes was causing SIInstrInfo::isLegalOperands() to be overly conservative in some cases. This change is necessary to prevent future paches from missing a folding opportunity in fneg-fabs.ll. llvm-svn: 225382	2015-01-07 20:59:25 +00:00
Matt Arsenault	4831ce5491	R600/SI: Add class intrinsic llvm-svn: 225305	2015-01-06 23:00:37 +00:00
Tom Stellard	794c8c0f78	R600/SI: Set the ATC bit on all resource descriptors for the HSA runtime llvm-svn: 223125	2014-12-02 17:05:41 +00:00
Matt Arsenault	1cffa4c191	R600/SI: Get rid of FCLAMP_SI pseudo It's not necessary. Also use complex patterns to allow src modifier usage. llvm-svn: 221916	2014-11-13 19:49:04 +00:00
Matt Arsenault	f3cd4512ac	R600/SI: Move all rsrc building functions to SIISelLowering llvm-svn: 221383	2014-11-05 19:01:19 +00:00
Matt Arsenault	485defe58c	R600/SI: Remove SI_ADDR64_RSRC llvm-svn: 221382	2014-11-05 19:01:17 +00:00
Matt Arsenault	83a535ff6b	R600/SI: Remove SI_BUFFER_RSRC pseudo Just use REG_SEQUENCE directly, so there are fewer instructions to need to deal with later. llvm-svn: 220056	2014-10-17 17:42:56 +00:00
Tom Stellard	c8d7920ad9	R600/SI: Fix bug where immediates were being used in DS addr operands The SelectDS1Addr1Offset complex pattern always tries to store constant lds pointers in the offset operand and store a zero value in the addr operand. Since the addr operand does not accept immediates, the zero value needs to first be copied to a register. This newly created zero value will not go through normal instruction selection, so we need to manually insert a V_MOV_B32_e32 in the complex pattern. This bug was hidden by the fact that if there was another zero value in the DAG that had not been selected yet, then the CSE done by the DAG would use the unselected node for the addr operand rather than the one that was just created. This would lead to the zero value being selected and the DAG automatically inserting a V_MOV_B32_e32 instruction. llvm-svn: 219848	2014-10-15 21:08:59 +00:00
Matt Arsenault	1a74aff846	R600/SI: Also try to use 0 base for misaligned 8-byte DS loads. llvm-svn: 219823	2014-10-15 18:06:43 +00:00
Matt Arsenault	e775f5fe76	R600/SI: Use DS offsets for constant addresses Use 0 as the base address for a constant address, so if we have a constant address we can save moves and form read2/write2s. llvm-svn: 219698	2014-10-14 17:21:19 +00:00
Tom Stellard	3457a8495a	R600/SI: Legalize CopyToReg during instruction selection The instruction emitter will crash if it encounters a CopyToReg node with a non-register operand like FrameIndex. llvm-svn: 219428	2014-10-09 19:06:00 +00:00
Matt Arsenault	272c50a1fe	R600/SI: Update VOP3b to not include obsolete operands abs / neg are now part of the srcN_modifiers operands llvm-svn: 218691	2014-09-30 19:49:43 +00:00
Tom Stellard	7980fc8562	R600/SI: Add support for global atomic add llvm-svn: 218457	2014-09-25 18:30:26 +00:00
Tom Stellard	744b99b476	R600/SI: Enable selecting SALU inside branches We can do this now that the FixSGPRLiveRanges pass is working. llvm-svn: 218353	2014-09-24 01:33:28 +00:00
Tom Stellard	9f73851e39	Revert "R600/SI: Add support for global atomic add" This reverts commit r218254. The global_atomics.ll test fails with asserts disabled. For some reason, the compiler fails to produce the atomic no return variants. llvm-svn: 218257	2014-09-22 16:44:04 +00:00
Tom Stellard	2355a77e74	R600/SI: Add support for global atomic add llvm-svn: 218254	2014-09-22 15:35:35 +00:00
Matt Arsenault	3f98140c87	R600/SI: Add preliminary support for flat address space llvm-svn: 217777	2014-09-15 15:41:53 +00:00
Tom Stellard	80942a1b50	R600/SI: Use S_ADD_U32 and S_SUB_U32 for low half of 64-bit operations https://bugs.freedesktop.org/show_bug.cgi?id=83416 llvm-svn: 217248	2014-09-05 14:07:59 +00:00
Tom Stellard	f3fc555e3b	R600/SI: Use READ2/WRITE2 instructions for 64-bit mem ops with 32-bit alignment llvm-svn: 216279	2014-08-22 18:49:35 +00:00
Tom Stellard	85e8b6d5f9	R600/SI: Use a ComplexPattern for DS loads and stores llvm-svn: 216278	2014-08-22 18:49:33 +00:00
Tom Stellard	162a947160	R600/SI: Make sure SCRATCH_WAVE_OFFSET is added as Live-In to the function This fixes a crash in an ocl conformance test. llvm-svn: 216219	2014-08-21 20:40:58 +00:00
Tom Stellard	155bbb7713	R600/SI: Add a ComplexPattern for selecting MUBUF _OFFSET variant This saves us from having to copy a 64-bit 0 value into VGPRs for BUFFER_* instruction which only have a 12-bit immediate offset. llvm-svn: 215399	2014-08-11 22:18:17 +00:00
Eric Christopher	fc6de428c8	Have MachineFunction cache a pointer to the subtarget to make lookups shorter/easier and have the DAG use that to do the same lookup. This can be used in the future for TargetMachine based caching lookups from the MachineFunction easily. Update the MIPS subtarget switching machinery to update this pointer at the same time it runs. llvm-svn: 214838	2014-08-05 02:39:49 +00:00
Eric Christopher	d913448b38	Remove the TargetMachine forwards for TargetSubtargetInfo based information and update all callers. No functional change. llvm-svn: 214781	2014-08-04 21:25:23 +00:00
Tom Stellard	b4a313a76f	R600/SI: Do abs/neg folding with ComplexPatterns Abs/neg folding has moved out of foldOperands and into the instruction selection phase using complex patterns. As a consequence of this change, we now prefer to select the 64-bit encoding for most instructions and the modifier operands have been dropped from integer VOP3 instructions. llvm-svn: 214467	2014-08-01 00:32:39 +00:00
Tom Stellard	b02094e115	R600/SI: Use scratch memory for large private arrays llvm-svn: 213551	2014-07-21 15:45:01 +00:00
Tom Stellard	b02c268cbd	R600/SI: Use a ComplexPattern for MUBUF stores Now that non-leaf ComplexPatterns are allowed we can fold all the MUBUF store patterns into the instruction definition. We will also be able to reuse this new ComplexPattern for MUBUF loads and atomic operations. llvm-svn: 211644	2014-06-24 23:33:07 +00:00
Matt Arsenault	f2b0aebb8a	R600/SI: Fix div_scale intrinsic. The operand that must match one of the others does matter, and implement selecting for it. llvm-svn: 211523	2014-06-23 18:28:28 +00:00
Matt Arsenault	b8b5153935	R600/SI: Handle i64 sub. We can handle it the same way as add llvm-svn: 211514	2014-06-23 18:00:38 +00:00
Matt Arsenault	9fa3f93173	R600/SI: Move selection of i64 add to separate function. Also don't use a SmallVector for fixed size array. llvm-svn: 211513	2014-06-23 18:00:34 +00:00
Tom Stellard	880a80ad07	R600: Use LDS and vectors for private memory llvm-svn: 211110	2014-06-17 16:53:14 +00:00
Tom Stellard	2e59a45f80	R600: Move AMDGPUInstrInfo from AMDGPUTargetMachine into AMDGPUSubtarget llvm-svn: 210869	2014-06-13 01:32:00 +00:00
Matt Arsenault	064c206d23	R600/SI: Fix selection failure on scalar_to_vector There seem to be only 2 places that produce these, and it's kind of tricky to hit them. Also fixes failure to bitcast between i64 and v2f32, although this for some reason wasn't actually broken in the simple bitcast testcase, but did in the scalar_to_vector one. llvm-svn: 210664	2014-06-11 17:40:32 +00:00
Tom Stellard	73b98ed8cf	R600/SI: Only use SALU instructions for 64-bit add in a block of CF depth 0 llvm-svn: 208886	2014-05-15 14:41:54 +00:00
Tom Stellard	58ac7440e6	R600/SI: Only select SALU instructions in the entry or exit block SALU instructions ignore control flow, so it is not always safe to use them within branches. This is a partial solution to this problem until we can come up with something better. llvm-svn: 207590	2014-04-29 23:12:48 +00:00
Craig Topper	5656db4a8b	[C++11] Add 'override' keywords and remove 'virtual'. Additionally add 'final' and leave 'virtual' on some methods that are marked virtual without overriding anything and have no obvious overrides themselves. R600 edition llvm-svn: 207503	2014-04-29 07:57:24 +00:00
Craig Topper	481fb2879f	Convert SelectionDAG::SelectNodeTo to use ArrayRef. llvm-svn: 207377	2014-04-27 19:21:11 +00:00
Matt Arsenault	209a7b92b5	R600: Minor cleanups. Fix indentation, better line wrapping, unused includes. llvm-svn: 206562	2014-04-18 07:40:20 +00:00
Matt Arsenault	78b8670aac	R600/SI: Try to use scalar BFE. Use scalar BFE with constant shift and offset when possible. This is complicated by the fact that the scalar version packs the two operands of the vector version into one. llvm-svn: 206558	2014-04-18 05:19:26 +00:00
Tom Stellard	1aa6cb4d88	R600/SI: Use SReg_64 instead of VSrc_64 when selecting BUILD_PAIR llvm-svn: 206541	2014-04-18 00:36:21 +00:00
Nick Lewycky	aad475b324	Break PseudoSourceValue out of the Value hierarchy. It is now the root of its own tree containing FixedStackPseudoSourceValue (which you can use isa/dyn_cast on) and MipsCallEntry (which you can't). Anything that needs to use either a PseudoSourceValue* and Value* is strongly encouraged to use a MachinePointerInfo instead. llvm-svn: 206255	2014-04-15 07:22:52 +00:00
Tom Stellard	50122a5890	R600: Match 24-bit arithmetic patterns in a Target DAGCombine Moving these patterns from TableGen files to PerformDAGCombine() should allow us to generate better code by eliminating unnecessary shifts and extensions earlier. This also fixes a bug where the MAD pattern was calling SimplifyDemandedBits with a 24-bit mask on the first operand even when the full pattern wasn't being matched. This occasionally resulted in some instructions being incorrectly deleted from the program. v2: - Fix bug with 64-bit mul llvm-svn: 205731	2014-04-07 19:45:41 +00:00

1 2

69 Commits