llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Arsenault	362f345bab	R600/SI: Fix off by 1 error in used register count The register numbers start at 0, so if only 1 register was used, this was reported as 0. llvm-svn: 217636	2014-09-11 22:51:37 +00:00
Aaron Watry	1885e53a75	R600: Add cmpxchg instruction for evergreen Refactored the R600_LDS_1A2D class a bit to get it to actually work. It seemed to be previously unused and broken. We also have to disable the conversion to the noret variant for now in R600ISelLowering because the getLDSNoRetOp method only handles 1A1D LDS ops. Someone can feel free to modify the AMDGPU::getLDSNoRetOp method to work for more than 1A1D variants of LDS operations. It's being left as a future TODO for now. Signed-off-by: Aaron Watry <awatry at gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217596	2014-09-11 15:02:54 +00:00
Aaron Watry	21591670c9	R600: Add LDS_WRXCHG[_RET] instructions for Evergreen. Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217594	2014-09-11 15:02:49 +00:00
Aaron Watry	564a22e995	R600: Add LDS_MIN_[U]INT[_RET] instructions for Evergreen Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217593	2014-09-11 15:02:47 +00:00
Aaron Watry	e51794f2fa	R600: Add LDS_XOR[_RET] instructions for Evergreen Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217592	2014-09-11 15:02:46 +00:00
Aaron Watry	cffa0114c7	R600: Add LDS_OR[_RET] instructions for Evergreen Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217591	2014-09-11 15:02:44 +00:00
Aaron Watry	a7f122da60	R600: Add LDS_AND[_RET] instructions for Evergreen Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217590	2014-09-11 15:02:43 +00:00
Aaron Watry	62a0af4a0d	R600: Add LDS_MAX_[U]INT[_RET] instructions for Evergreen This was only present for SI before. Cayman may still be missing, but I am unable to test that currently. v2: Don't create atomicrmw max tests in separate file Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> CC: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 217589	2014-09-11 15:02:41 +00:00
Matt Arsenault	61a528adc7	R600/SI: Fix losing chain when fixing reg class of loads. The lost chain resulting in earlier side effecting nodes being deleted. llvm-svn: 217561	2014-09-10 23:26:19 +00:00
Matt Arsenault	2e9911205f	R600/SI: Report offset in correct units for st64 DS instructions Need to convert the 64 element offset into bytes, not just the element size like the normal case instructions. Noticed by inspection. This can't be hit now because st64 instructions aren't emitted during instruction selection, and the post-RA scheduler isn't enabled. llvm-svn: 217560	2014-09-10 23:26:16 +00:00
Matt Arsenault	16e313343d	R600: Custom lower frem llvm-svn: 217553	2014-09-10 21:44:27 +00:00
Sanjay Patel	b653de1ada	Rename getMaximumUnrollFactor -> getMaxInterleaveFactor; also rename option names controlling this variable. "Unroll" is not the appropriate name for this variable. Clang already uses the term "interleave" in pragmas and metadata for this. Differential Revision: http://reviews.llvm.org/D5066 llvm-svn: 217528	2014-09-10 17:58:16 +00:00
Matt Arsenault	69bfb90419	R600/SI: Fix assertion from copying a TargetGlobalAddress Assert in scheduler from an inserted copy_to_regclass from a constant. This only seems to break sometimes when a constant initializer address is forced into VGPRs in a non-entry block. No test since the only case I've managed to hit only happens with a future patch, and that case will also not be a problem once scalar instructions are used in non-entry blocks. llvm-svn: 217380	2014-09-08 15:07:33 +00:00
Matt Arsenault	7ac9c4a074	R600/SI: Replace LDS atomics with no return versions llvm-svn: 217379	2014-09-08 15:07:31 +00:00
Matt Arsenault	9903ccf7ee	R600/SI: Add InstrMapping for noret atomics. Only handles LDS atomics for now, and will be used to replace atomics with no uses with the no return versions. llvm-svn: 217378	2014-09-08 15:07:27 +00:00
Matt Arsenault	76803bd384	R600/SI: Fix register class for some 64-bit atomics llvm-svn: 217323	2014-09-07 00:46:20 +00:00
Matt Arsenault	8ae5961065	R600/SI: Use same complex patterns for DS atomics This fixes hitting the same negative base offset problem that was already fixed for regular loads and stores. llvm-svn: 217256	2014-09-05 16:24:58 +00:00
Jan Vesely	d1d1334064	R600: Fix FROUND round halfway cases away from zero Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 217250	2014-09-05 14:26:54 +00:00
Tom Stellard	0c93c9ecee	R600/SI: Fix bug in SIInstrInfo::legalizeOpWithMove() We must constrain the destination register class of legalized operands to a VGPR class or else the illegal operand may be folded back into the instruction by the register coalescer. This fixes a bug in add.ll that will be uncovered by future commits. llvm-svn: 217249	2014-09-05 14:08:01 +00:00
Tom Stellard	80942a1b50	R600/SI: Use S_ADD_U32 and S_SUB_U32 for low half of 64-bit operations https://bugs.freedesktop.org/show_bug.cgi?id=83416 llvm-svn: 217248	2014-09-05 14:07:59 +00:00
Matt Arsenault	51b7e81d1b	R600/SI: Un-move pattern I forgot to remove in last commit llvm-svn: 217109	2014-09-03 23:28:57 +00:00
Matt Arsenault	869cd07158	R600/SI: Try to keep i32 mul on SALU Also fix bug this exposed where when legalizing an immediate operand, a v_mov_b32 would be created with a VSrc dest register. llvm-svn: 217108	2014-09-03 23:24:35 +00:00
Tom Stellard	102c68786c	R600/SI: Add a pattern for i64 and in a branch llvm-svn: 217041	2014-09-03 15:22:41 +00:00
Tom Stellard	b8b841366a	R600/SI: Fix typos in SIInstrInfo::areLoadsFromSameBasePtr() This fixes a crash in the OpenCV test: ImgprocWarpResizeArea/Resize.Mat/16 There is no test case for this, because this failure depends on a specific ordering of the loads, which could easily change. llvm-svn: 217040	2014-09-03 15:22:39 +00:00
Benjamin Kramer	8c90fd71f7	Add override to overriden virtual methods, remove virtual keywords. No functionality change. Changes made by clang-tidy + some manual cleanup. llvm-svn: 217028	2014-09-03 11:41:21 +00:00
Eric Christopher	79cc1e3ae7	Reinstate "Nuke the old JIT." Approved by Jim Grosbach, Lang Hames, Rafael Espindola. This reinstates commits r215111, 215115, 215116, 215117, 215136. llvm-svn: 216982	2014-09-02 22:28:02 +00:00
Craig Topper	fd38cbebda	Remove 'virtual' keyword from methods markedwith 'override' keyword. llvm-svn: 216823	2014-08-30 16:48:34 +00:00
Matt Arsenault	8675db15da	R600/SI: Use mad for fsub + fmul We can use a negate source modifier to match this for fsub. llvm-svn: 216735	2014-08-29 16:01:14 +00:00
Alexey Samsonov	a253bf9678	Use BitVector instead of int in R600 SIISelLowering. int may not have enough bits in it, which was detected by UBSan bootstrap (it reported left shift by a too large constant). llvm-svn: 216579	2014-08-27 19:36:53 +00:00
Tom Stellard	f3fc555e3b	R600/SI: Use READ2/WRITE2 instructions for 64-bit mem ops with 32-bit alignment llvm-svn: 216279	2014-08-22 18:49:35 +00:00
Tom Stellard	85e8b6d5f9	R600/SI: Use a ComplexPattern for DS loads and stores llvm-svn: 216278	2014-08-22 18:49:33 +00:00
Tom Stellard	ca7ecf3dfa	R600/SI: Wrap local memory pointer in AssertZExt on SI These pointers are really just offsets and they will always be less than 16-bits. Using AssertZExt allows us to use computeKnownBits to prove that these values are positive. We will use this information in a later commit. llvm-svn: 216277	2014-08-22 18:49:31 +00:00
Tom Stellard	0510514e36	R600/SI: Use correct helper class for DS_WRITE2 instructions DS_1A uses a single offset encoding, so offset1 wasn't being encoded. llvm-svn: 216276	2014-08-22 18:49:28 +00:00
Sanjay Patel	2cdea4c41e	name change: isPow2DivCheap -> isPow2SDivCheap isPow2DivCheap That name doesn't specify signed or unsigned. Lazy as I am, I eventually read the function and variable comments. It turns out that this is strictly about signed div. But I discovered that the comments are wrong: srl/add/sra is not the general sequence for signed integer division by power-of-2. We need one more 'sra': sra/srl/add/sra That's the sequence produced in DAGCombiner. The first 'sra' may be removed when dividing by exactly '2', but that's a special case. This patch corrects the comments, changes the name of the flag bit, and changes the name of the accessor methods. No functional change intended. Differential Revision: http://reviews.llvm.org/D5010 llvm-svn: 216237	2014-08-21 22:31:48 +00:00
Tom Stellard	745f2eddef	R600/SI: Teach moveToVALU how to handle more S_LOAD_* instructions llvm-svn: 216220	2014-08-21 20:41:00 +00:00
Tom Stellard	162a947160	R600/SI: Make sure SCRATCH_WAVE_OFFSET is added as Live-In to the function This fixes a crash in an ocl conformance test. llvm-svn: 216219	2014-08-21 20:40:58 +00:00
Tom Stellard	8e52375bb5	R600/SI: Remove unused SGPR spilling code llvm-svn: 216218	2014-08-21 20:40:56 +00:00
Tom Stellard	c5cf2f04d9	R600/SI: Use eliminateFrameIndex() to expand SGPR spill pseudos This will simplify the SGPR spilling and also allow us to use MachineFrameInfo for calculating offsets, which should be more reliable than our custom code. This fixes a crash in some cases where a register would be spilled in a branch such that the VGPR defined for spilling did not dominate all the uses when restoring. This fixes a crash in an ocl conformance test. The test requries register spilling and is too big to include. llvm-svn: 216217	2014-08-21 20:40:54 +00:00
Tom Stellard	11aa80cc4a	R600/SI: Handle VCC in SIRegisterInfo::getPhysRegSubReg() This fixes a crash in an ocl conformance test. The test requries register spilling and is too big to include. llvm-svn: 216216	2014-08-21 20:40:50 +00:00
Alexey Samsonov	ea0aee622e	Cleanup: Delete seemingly unused reference to MachineDominatorTree from ScheduleDAGInstrs. llvm-svn: 216124	2014-08-20 20:57:26 +00:00
Aaron Ballman	f12dc9c802	Silencing an MSVC warning about loop variable conflicting with a variable from an outer scope. NFC. llvm-svn: 215888	2014-08-18 11:51:41 +00:00
Matt Arsenault	fabf545299	R600/SI: Move all fabs / fneg handling to patterns llvm-svn: 215749	2014-08-15 18:42:22 +00:00
Matt Arsenault	13623d0e28	R600/SI: Use source modifiers for f64 fneg llvm-svn: 215748	2014-08-15 18:42:18 +00:00
Matt Arsenault	a147438e37	R600/SI: Use source modifier for f64 fabs llvm-svn: 215747	2014-08-15 18:42:15 +00:00
Matt Arsenault	9e7cf548ea	R600/SI: Refactor fneg / fabs patterns llvm-svn: 215746	2014-08-15 18:42:11 +00:00
Matt Arsenault	b2baffaffd	R600/SI: Fix offset folding in some cases with shifted pointers. Ordinarily (shl (add x, c1), c2) -> (add (shl x, c2), c1 << c2) is only done if the add has one use. If the resulting constant add can be folded into an addressing mode, force this to happen for the pointer operand. This ends up happening a lot because of how LDS objects are allocated. Since the globals are allocated next to each other, acessing the first element of the second object is directly indexed by a shifted pointer. llvm-svn: 215739	2014-08-15 17:49:05 +00:00
Matt Arsenault	2e7cc48baa	R600/SI: Add intrinsic for ldexp llvm-svn: 215734	2014-08-15 17:30:25 +00:00
Matt Arsenault	5015a89aa5	R600/SI: Implement isLegalAddressingMode The default assumes that a 16-bit signed offset is used. LDS instruction use a 16-bit unsigned offset, so it wasn't being used in some cases where it was assumed a negative offset could be used. More should be done here, but first isLegalAddressingMode needs to gain an addressing mode argument. For now, copy most of the rest of the default implementation with the immediate offset change. llvm-svn: 215732	2014-08-15 17:17:07 +00:00
Rafael Espindola	d610ba99cb	Remove HasLEB128. We already require CFI, so it should be safe to require .leb128 and .uleb128. llvm-svn: 215712	2014-08-15 14:01:07 +00:00
Matt Arsenault	74ef277774	R600: Correctly set the src value offset for scalarized kernel args This for some reason fixes v1i64 kernel arguments on pre-SI. This currently breaks some other cases in the kernel-args.ll test for R600, but I'm not particularly confident in the new output. VTX_READ_* are not used for some of the scalarized cases, and the code reading from the constant buffer doesn't make much sense to me. llvm-svn: 215564	2014-08-13 18:14:11 +00:00

1 2 3 4 5 ...

1190 Commits