llvm-project

Commit Graph

Author	SHA1	Message	Date
Aaron Ballman	f086a14d53	Silencing an "enumeral and non-enumeral type in conditional expression" warning. NFC. llvm-svn: 218381	2014-09-24 13:54:56 +00:00
Matt Arsenault	2c41987490	R600/SI: Add new helper isSGPRClassID Move these into header since they are trivial llvm-svn: 218360	2014-09-24 02:17:12 +00:00
Matt Arsenault	262407bc2f	R600/SI: Fix hardcoded and wrong operand numbers. Also fix leftover debug printing llvm-svn: 218359	2014-09-24 02:17:09 +00:00
Matt Arsenault	69612d6027	R600/SI: Enable named operand table for SALU instructions llvm-svn: 218358	2014-09-24 02:17:06 +00:00
Tom Stellard	744b99b476	R600/SI: Enable selecting SALU inside branches We can do this now that the FixSGPRLiveRanges pass is working. llvm-svn: 218353	2014-09-24 01:33:28 +00:00
Tom Stellard	deb3f9e643	R600/SI: Move PHIs that define SGPRs to the VALU in most cases This fixes a bug that is uncovered by a future commit and will be tested by the test/CodeGen/R600/sgpr-control-flow.ll test case. llvm-svn: 218352	2014-09-24 01:33:26 +00:00
Tom Stellard	60024a0558	R600/SI: Fix the FixSGPRLiveRanges pass The previous implementation was extending the live range of SGPRs by modifying the live intervals directly. This was causing a lot of machine verification errors when the machine scheduler was enabled. The new implementation adds pseudo instructions with implicit uses to extend the live ranges of SGPRs, which works much better. llvm-svn: 218351	2014-09-24 01:33:24 +00:00
Tom Stellard	be507fb5d3	R600/SI: Mark EXEC_LO and EXEC_HI as reserved These registers can be allocated and used like other 32-bit registers, but it seems like a likely source for bugs. llvm-svn: 218350	2014-09-24 01:33:23 +00:00
Tom Stellard	9a88593ed0	R600/SI: Fix SIRegisterInfo::getPhysRegSubReg() Correctly handle special registers: EXEC, EXEC_LO, EXEC_HI, VCC_LO, VCC_HI, and M0. The previous implementation would hit an assertion failure when passed these registers. llvm-svn: 218349	2014-09-24 01:33:22 +00:00
Tom Stellard	96468903d4	R600/SI: Implement VGPR register spilling for compute at -O0 v3 VGPRs are spilled to LDS. This still needs more testing, but we need to at least enable it at -O0, because the fast register allocator spills all registers that are live at the end of blocks and without this some future commits will break the flat-address-space.ll test. v2: Only calculate thread id once v3: Move insertion of spill instructions to SIRegisterInfo::eliminateFrameIndex() llvm-svn: 218348	2014-09-24 01:33:17 +00:00
Tom Stellard	73ae1cb59a	R600/SI: Clean up checks for legality of immediate operands There are new register classes VCSrc_* which represent operands that can take an SGPR, VGPR or inline constant. The VSrc_* class is now used to represent operands that can take an SGPR, VGPR, or a 32-bit immediate. This allows us to have more accurate checks for legality of immediates, since before we had no way to distinguish between operands that supported any 32-bit immediate and operands which could only support inline constants. llvm-svn: 218334	2014-09-23 21:26:25 +00:00
Matt Arsenault	4364fef82f	Fix typo llvm-svn: 218324	2014-09-23 18:30:57 +00:00
Tom Stellard	9f73851e39	Revert "R600/SI: Add support for global atomic add" This reverts commit r218254. The global_atomics.ll test fails with asserts disabled. For some reason, the compiler fails to produce the atomic no return variants. llvm-svn: 218257	2014-09-22 16:44:04 +00:00
Tom Stellard	2355a77e74	R600/SI: Add support for global atomic add llvm-svn: 218254	2014-09-22 15:35:35 +00:00
Tom Stellard	5a9a61ed7d	R600/SI: Remove modifier operands from V_CNDMASK_B32_e64 Modifiers don't work for this instruction. llvm-svn: 218253	2014-09-22 15:35:34 +00:00
Tom Stellard	c9965f4186	R600: Don't set BypassSlowDiv for 64-bit division BypassSlowDiv is used by codegen prepare to insert a run-time check to see if the operands to a 64-bit division are really 32-bit values and if they are it will do 32-bit division instead. This is not useful for R600, which has predicated control flow since both the 32-bit and 64-bit paths will be executed in most cases. It also increases code size which can lead to more instruction cache misses. llvm-svn: 218252	2014-09-22 15:35:32 +00:00
Tom Stellard	4349b19efb	R600/SI: Use ISD::MUL instead of ISD::UMULO when lowering division ISD::MUL and ISD::UMULO are the same except that UMULO sets an overflow bit. Since we aren't using the overflow bit, we should use ISD::MUL. llvm-svn: 218251	2014-09-22 15:35:30 +00:00
Tom Stellard	ec2e43c073	R600/SI: Add enums for some hard-coded values llvm-svn: 218250	2014-09-22 15:35:29 +00:00
Matt Arsenault	a9627ae97a	Fix typo llvm-svn: 218223	2014-09-21 17:27:32 +00:00
Matt Arsenault	393366c691	Use llvm_unreachable instead of assert(!) llvm-svn: 218222	2014-09-21 17:27:31 +00:00
Matt Arsenault	3673eba568	R600/SI: Don't use strings for single characters llvm-svn: 218221	2014-09-21 17:27:28 +00:00
Tom Stellard	ff795900eb	R600/SI: Fix config value for number of gprs In r217636, the value stored in KernelInfo.Num[VS]GPRSs was changed from the highest GPR index used to the number of gprs in order to be consistent with the name of the variable. The code writing the config values still assumed that the value in this variable was the highest GPR index used, which caused the compiler to over report the number of GPRs being used. https://bugs.freedesktop.org/show_bug.cgi?id=84089 llvm-svn: 218150	2014-09-19 20:42:37 +00:00
Matt Arsenault	46cbc4367b	R600: Better fix for bug 20982 Just do the left shift as unsigned to avoid the UB. llvm-svn: 218092	2014-09-19 00:42:06 +00:00
Aaron Ballman	0bb041b5f4	Reverting NFC changes from r218050. Instead, the warning was disabled for GCC in r218059, so these changes are no longer required. llvm-svn: 218062	2014-09-18 17:34:23 +00:00
Matt Arsenault	6462f94884	R600: Bug 20982 - Avoid undefined left shift of negative value I'm not sure what the hardware actually does, so don't bother trying to fold it for now. llvm-svn: 218057	2014-09-18 15:52:26 +00:00
Aaron Ballman	11fa97fa32	Fixing a bunch of -Woverloaded-virtual warnings due to hiding getSubtargetImpl from the base class. NFC. llvm-svn: 218050	2014-09-18 13:27:14 +00:00
Eric Christopher	d85ffb1fc0	Add a new pass FunctionTargetTransformInfo. This pass serves as a shim between the TargetTransformInfo immutable pass and the Subtarget via the TargetMachine and Function. Migrate a single call from BasicTargetTransformInfo as an example and provide shims where TargetMachine begins taking a Function to determine the subtarget. No functional change. llvm-svn: 218004	2014-09-18 00:34:14 +00:00
Matt Arsenault	972c12aedc	R600/SI: Remove assert Since read2 / write2 are emitted for 4-byte aligned 8-byte accesses, these are seen by the scheduler. The DAG scheduler is semi-deprecated, so just ignore these for now. llvm-svn: 217969	2014-09-17 17:48:32 +00:00
Matt Arsenault	0e75a06451	R600/SI: Rough first implementation of shouldClusterLoads llvm-svn: 217968	2014-09-17 17:48:30 +00:00
Alexey Samsonov	cce5701cdb	Fix float division-by-zero in R600 scheduler. This bug was reported by UBSan. llvm-svn: 217967	2014-09-17 17:47:21 +00:00
Matt Arsenault	02dc26529e	R600/SI: Change formatting of printed FP immediates Only 1 decimal place should be printed for inline immediates. Other constants should be hex constants. Does not include f64 tests because folding those inline immediates currently does not work. llvm-svn: 217964	2014-09-17 17:32:13 +00:00
Matt Arsenault	253e5da7ad	R600/SI: Remove promotion of instructions to e64 forms. Instructions are now generally selected to the e64 forms originally, and shrunk down later. Rename foldOperands to legalizeOperands, since that's really most of what it tries to do. llvm-svn: 217959	2014-09-17 15:35:43 +00:00
Matt Arsenault	6652403c2d	Fix typo llvm-svn: 217892	2014-09-16 18:00:23 +00:00
Matt Arsenault	49dd4283ed	R600/SI: Prefer selecting more e64 instruction forms. Add some more tests to make sure better operand choices are still made. Leave some cases that seem to have no reason to ever be e64 alone. llvm-svn: 217789	2014-09-15 17:15:02 +00:00
Matt Arsenault	3f98140c87	R600/SI: Add preliminary support for flat address space llvm-svn: 217777	2014-09-15 15:41:53 +00:00
Matt Arsenault	65f67e4dfe	R600/SI: Fix promote alloca pass breaking addrspacecast llvm-svn: 217776	2014-09-15 15:41:44 +00:00
Matt Arsenault	5c4d8409b3	R600/SI: Enable named operand table for MTBUF There is already code trying to use it for getting the offset. llvm-svn: 217775	2014-09-15 15:41:43 +00:00
Matt Arsenault	5d26d04357	Fix typo llvm-svn: 217730	2014-09-13 19:58:27 +00:00
Matt Arsenault	362f345bab	R600/SI: Fix off by 1 error in used register count The register numbers start at 0, so if only 1 register was used, this was reported as 0. llvm-svn: 217636	2014-09-11 22:51:37 +00:00
Aaron Watry	1885e53a75	R600: Add cmpxchg instruction for evergreen Refactored the R600_LDS_1A2D class a bit to get it to actually work. It seemed to be previously unused and broken. We also have to disable the conversion to the noret variant for now in R600ISelLowering because the getLDSNoRetOp method only handles 1A1D LDS ops. Someone can feel free to modify the AMDGPU::getLDSNoRetOp method to work for more than 1A1D variants of LDS operations. It's being left as a future TODO for now. Signed-off-by: Aaron Watry <awatry at gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217596	2014-09-11 15:02:54 +00:00
Aaron Watry	21591670c9	R600: Add LDS_WRXCHG[_RET] instructions for Evergreen. Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217594	2014-09-11 15:02:49 +00:00
Aaron Watry	564a22e995	R600: Add LDS_MIN_[U]INT[_RET] instructions for Evergreen Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217593	2014-09-11 15:02:47 +00:00
Aaron Watry	e51794f2fa	R600: Add LDS_XOR[_RET] instructions for Evergreen Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217592	2014-09-11 15:02:46 +00:00
Aaron Watry	cffa0114c7	R600: Add LDS_OR[_RET] instructions for Evergreen Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217591	2014-09-11 15:02:44 +00:00
Aaron Watry	a7f122da60	R600: Add LDS_AND[_RET] instructions for Evergreen Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217590	2014-09-11 15:02:43 +00:00
Aaron Watry	62a0af4a0d	R600: Add LDS_MAX_[U]INT[_RET] instructions for Evergreen This was only present for SI before. Cayman may still be missing, but I am unable to test that currently. v2: Don't create atomicrmw max tests in separate file Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> CC: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 217589	2014-09-11 15:02:41 +00:00
Matt Arsenault	61a528adc7	R600/SI: Fix losing chain when fixing reg class of loads. The lost chain resulted in earlier side-effecting nodes being deleted. llvm-svn: 217561	2014-09-10 23:26:19 +00:00
Matt Arsenault	2e9911205f	R600/SI: Report offset in correct units for st64 DS instructions Need to convert the 64 element offset into bytes, not just the element size like the normal case instructions. Noticed by inspection. This can't be hit now because st64 instructions aren't emitted during instruction selection, and the post-RA scheduler isn't enabled. llvm-svn: 217560	2014-09-10 23:26:16 +00:00
Matt Arsenault	16e313343d	R600: Custom lower frem llvm-svn: 217553	2014-09-10 21:44:27 +00:00
Sanjay Patel	b653de1ada	Rename getMaximumUnrollFactor -> getMaxInterleaveFactor; also rename option names controlling this variable. "Unroll" is not the appropriate name for this variable. Clang already uses the term "interleave" in pragmas and metadata for this. Differential Revision: http://reviews.llvm.org/D5066 llvm-svn: 217528	2014-09-10 17:58:16 +00:00
Matt Arsenault	69bfb90419	R600/SI: Fix assertion from copying a TargetGlobalAddress Assert in scheduler from an inserted copy_to_regclass from a constant. This only seems to break sometimes when a constant initializer address is forced into VGPRs in a non-entry block. No test since the only case I've managed to hit only happens with a future patch, and that case will also not be a problem once scalar instructions are used in non-entry blocks. llvm-svn: 217380	2014-09-08 15:07:33 +00:00
Matt Arsenault	7ac9c4a074	R600/SI: Replace LDS atomics with no return versions llvm-svn: 217379	2014-09-08 15:07:31 +00:00
Matt Arsenault	9903ccf7ee	R600/SI: Add InstrMapping for noret atomics. Only handles LDS atomics for now, and will be used to replace atomics with no uses with the no return versions. llvm-svn: 217378	2014-09-08 15:07:27 +00:00
Matt Arsenault	76803bd384	R600/SI: Fix register class for some 64-bit atomics llvm-svn: 217323	2014-09-07 00:46:20 +00:00
Matt Arsenault	8ae5961065	R600/SI: Use same complex patterns for DS atomics This fixes hitting the same negative base offset problem that was already fixed for regular loads and stores. llvm-svn: 217256	2014-09-05 16:24:58 +00:00
Jan Vesely	d1d1334064	R600: Fix FROUND round halfway cases away from zero Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 217250	2014-09-05 14:26:54 +00:00
Tom Stellard	0c93c9ecee	R600/SI: Fix bug in SIInstrInfo::legalizeOpWithMove() We must constrain the destination register class of legalized operands to a VGPR class or else the illegal operand may be folded back into the instruction by the register coalescer. This fixes a bug in add.ll that will be uncovered by future commits. llvm-svn: 217249	2014-09-05 14:08:01 +00:00
Tom Stellard	80942a1b50	R600/SI: Use S_ADD_U32 and S_SUB_U32 for low half of 64-bit operations https://bugs.freedesktop.org/show_bug.cgi?id=83416 llvm-svn: 217248	2014-09-05 14:07:59 +00:00
Matt Arsenault	51b7e81d1b	R600/SI: Un-move pattern I forgot to remove in last commit llvm-svn: 217109	2014-09-03 23:28:57 +00:00
Matt Arsenault	869cd07158	R600/SI: Try to keep i32 mul on SALU Also fix bug this exposed where when legalizing an immediate operand, a v_mov_b32 would be created with a VSrc dest register. llvm-svn: 217108	2014-09-03 23:24:35 +00:00
Tom Stellard	102c68786c	R600/SI: Add a pattern for i64 and in a branch llvm-svn: 217041	2014-09-03 15:22:41 +00:00
Tom Stellard	b8b841366a	R600/SI: Fix typos in SIInstrInfo::areLoadsFromSameBasePtr() This fixes a crash in the OpenCV test: ImgprocWarpResizeArea/Resize.Mat/16 There is no test case for this, because this failure depends on a specific ordering of the loads, which could easily change. llvm-svn: 217040	2014-09-03 15:22:39 +00:00
Benjamin Kramer	8c90fd71f7	Add override to overridden virtual methods, remove virtual keywords. No functionality change. Changes made by clang-tidy + some manual cleanup. llvm-svn: 217028	2014-09-03 11:41:21 +00:00
Eric Christopher	79cc1e3ae7	Reinstate "Nuke the old JIT." Approved by Jim Grosbach, Lang Hames, Rafael Espindola. This reinstates commits r215111, 215115, 215116, 215117, 215136. llvm-svn: 216982	2014-09-02 22:28:02 +00:00
Craig Topper	fd38cbebda	Remove 'virtual' keyword from methods marked with 'override' keyword. llvm-svn: 216823	2014-08-30 16:48:34 +00:00
Matt Arsenault	8675db15da	R600/SI: Use mad for fsub + fmul We can use a negate source modifier to match this for fsub. llvm-svn: 216735	2014-08-29 16:01:14 +00:00
Alexey Samsonov	a253bf9678	Use BitVector instead of int in R600 SIISelLowering. int may not have enough bits in it, which was detected by UBSan bootstrap (it reported left shift by a too large constant). llvm-svn: 216579	2014-08-27 19:36:53 +00:00
Tom Stellard	f3fc555e3b	R600/SI: Use READ2/WRITE2 instructions for 64-bit mem ops with 32-bit alignment llvm-svn: 216279	2014-08-22 18:49:35 +00:00
Tom Stellard	85e8b6d5f9	R600/SI: Use a ComplexPattern for DS loads and stores llvm-svn: 216278	2014-08-22 18:49:33 +00:00
Tom Stellard	ca7ecf3dfa	R600/SI: Wrap local memory pointer in AssertZExt on SI These pointers are really just offsets and they will always be less than 16-bits. Using AssertZExt allows us to use computeKnownBits to prove that these values are positive. We will use this information in a later commit. llvm-svn: 216277	2014-08-22 18:49:31 +00:00
Tom Stellard	0510514e36	R600/SI: Use correct helper class for DS_WRITE2 instructions DS_1A uses a single offset encoding, so offset1 wasn't being encoded. llvm-svn: 216276	2014-08-22 18:49:28 +00:00
Sanjay Patel	2cdea4c41e	name change: isPow2DivCheap -> isPow2SDivCheap. The name isPow2DivCheap doesn't specify signed or unsigned. Lazy as I am, I eventually read the function and variable comments. It turns out that this is strictly about signed div. But I discovered that the comments are wrong: srl/add/sra is not the general sequence for signed integer division by power-of-2. We need one more 'sra': sra/srl/add/sra. That's the sequence produced in DAGCombiner. The first 'sra' may be removed when dividing by exactly '2', but that's a special case. This patch corrects the comments, changes the name of the flag bit, and changes the name of the accessor methods. No functional change intended. Differential Revision: http://reviews.llvm.org/D5010 llvm-svn: 216237	2014-08-21 22:31:48 +00:00
Tom Stellard	745f2eddef	R600/SI: Teach moveToVALU how to handle more S_LOAD_* instructions llvm-svn: 216220	2014-08-21 20:41:00 +00:00
Tom Stellard	162a947160	R600/SI: Make sure SCRATCH_WAVE_OFFSET is added as Live-In to the function This fixes a crash in an ocl conformance test. llvm-svn: 216219	2014-08-21 20:40:58 +00:00
Tom Stellard	8e52375bb5	R600/SI: Remove unused SGPR spilling code llvm-svn: 216218	2014-08-21 20:40:56 +00:00
Tom Stellard	c5cf2f04d9	R600/SI: Use eliminateFrameIndex() to expand SGPR spill pseudos This will simplify the SGPR spilling and also allow us to use MachineFrameInfo for calculating offsets, which should be more reliable than our custom code. This fixes a crash in some cases where a register would be spilled in a branch such that the VGPR defined for spilling did not dominate all the uses when restoring. This fixes a crash in an ocl conformance test. The test requires register spilling and is too big to include. llvm-svn: 216217	2014-08-21 20:40:54 +00:00
Tom Stellard	11aa80cc4a	R600/SI: Handle VCC in SIRegisterInfo::getPhysRegSubReg() This fixes a crash in an ocl conformance test. The test requires register spilling and is too big to include. llvm-svn: 216216	2014-08-21 20:40:50 +00:00
Alexey Samsonov	ea0aee622e	Cleanup: Delete seemingly unused reference to MachineDominatorTree from ScheduleDAGInstrs. llvm-svn: 216124	2014-08-20 20:57:26 +00:00
Aaron Ballman	f12dc9c802	Silencing an MSVC warning about loop variable conflicting with a variable from an outer scope. NFC. llvm-svn: 215888	2014-08-18 11:51:41 +00:00
Matt Arsenault	fabf545299	R600/SI: Move all fabs / fneg handling to patterns llvm-svn: 215749	2014-08-15 18:42:22 +00:00
Matt Arsenault	13623d0e28	R600/SI: Use source modifiers for f64 fneg llvm-svn: 215748	2014-08-15 18:42:18 +00:00
Matt Arsenault	a147438e37	R600/SI: Use source modifier for f64 fabs llvm-svn: 215747	2014-08-15 18:42:15 +00:00
Matt Arsenault	9e7cf548ea	R600/SI: Refactor fneg / fabs patterns llvm-svn: 215746	2014-08-15 18:42:11 +00:00
Matt Arsenault	b2baffaffd	R600/SI: Fix offset folding in some cases with shifted pointers. Ordinarily (shl (add x, c1), c2) -> (add (shl x, c2), c1 << c2) is only done if the add has one use. If the resulting constant add can be folded into an addressing mode, force this to happen for the pointer operand. This ends up happening a lot because of how LDS objects are allocated. Since the globals are allocated next to each other, accessing the first element of the second object is directly indexed by a shifted pointer. llvm-svn: 215739	2014-08-15 17:49:05 +00:00
Matt Arsenault	2e7cc48baa	R600/SI: Add intrinsic for ldexp llvm-svn: 215734	2014-08-15 17:30:25 +00:00
Matt Arsenault	5015a89aa5	R600/SI: Implement isLegalAddressingMode The default assumes that a 16-bit signed offset is used. LDS instructions use a 16-bit unsigned offset, so it wasn't being used in some cases where it was assumed a negative offset could be used. More should be done here, but first isLegalAddressingMode needs to gain an addressing mode argument. For now, copy most of the rest of the default implementation with the immediate offset change. llvm-svn: 215732	2014-08-15 17:17:07 +00:00
Rafael Espindola	d610ba99cb	Remove HasLEB128. We already require CFI, so it should be safe to require .leb128 and .uleb128. llvm-svn: 215712	2014-08-15 14:01:07 +00:00
Matt Arsenault	74ef277774	R600: Correctly set the src value offset for scalarized kernel args This for some reason fixes v1i64 kernel arguments on pre-SI. This currently breaks some other cases in the kernel-args.ll test for R600, but I'm not particularly confident in the new output. VTX_READ_* are not used for some of the scalarized cases, and the code reading from the constant buffer doesn't make much sense to me. llvm-svn: 215564	2014-08-13 18:14:11 +00:00
Benjamin Kramer	a7c40ef022	Canonicalize header guards into a common format. Add header guards to files that were missing guards. Remove #endif comments as they don't seem common in LLVM (we can easily add them back if we decide they're useful) Changes made by clang-tidy with minor tweaks. llvm-svn: 215558	2014-08-13 16:26:38 +00:00
Jan Vesely	e5ca27d716	R600: Use optimized 24bit path in udivrem v2: drop enum keyword, use correct extension mode, don't bother computing the sign in unsigned case Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 215462	2014-08-12 17:31:20 +00:00
Jan Vesely	e377a6b59a	R600: Remove unused code. Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 215461	2014-08-12 17:31:19 +00:00
Jan Vesely	4a33bc6206	R600: Use i24 optimized path for SREM v2: add tests, rename LowerSDIV24 to LowerSDIVREM24, handle the rem part in this function Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 215460	2014-08-12 17:31:17 +00:00
NAKAMURA Takumi	5f79ee53ec	R600/SIInstrInfo.cpp: Suppress a warning. [-Wunused-variable] llvm-svn: 215406	2014-08-11 23:03:38 +00:00
Tom Stellard	155bbb7713	R600/SI: Add a ComplexPattern for selecting MUBUF _OFFSET variant This saves us from having to copy a 64-bit 0 value into VGPRs for BUFFER_* instruction which only have a 12-bit immediate offset. llvm-svn: 215399	2014-08-11 22:18:17 +00:00
Tom Stellard	ddea48673f	R600/SI: Add an _OFFEN variant MUBUF_STORE_* and use it for scratch writes llvm-svn: 215398	2014-08-11 22:18:14 +00:00
Tom Stellard	93ba12f163	R600/SI: Clear lds bit on MUBUF instructions used for private stores This bit was left uninitialized, which was causing some random failures of piglit tests. NOTE: This is a candidate for the 3.5 branch. llvm-svn: 215396	2014-08-11 22:18:09 +00:00
Sylvestre Ledru	469de19a09	Fix typos: * libaries => libraries * avaiable => available llvm-svn: 215366	2014-08-11 18:04:46 +00:00
Matt Arsenault	996a0ef99e	R600: Disable FP exceptions. llvm-svn: 215277	2014-08-09 03:46:58 +00:00
Tom Stellard	c0503db9e2	R600/SI: Custom lower CONCAT_VECTORS This will lower them using register copies rather than loads and stores to the stack. llvm-svn: 215270	2014-08-09 01:06:56 +00:00
Eric Christopher	b9fd9ed37e	Temporarily Revert "Nuke the old JIT." as it's not quite ready to be deleted. This will be reapplied as soon as possible and before the 3.6 branch date at any rate. Approved by Jim Grosbach, Lang Hames, Rafael Espindola. This reverts commits r215111, 215115, 215116, 215117, 215136. llvm-svn: 215154	2014-08-07 22:02:54 +00:00
Rafael Espindola	f8b27c41e8	Nuke the old JIT. I am sure we will be finding bits and pieces of dead code for years to come, but this is a good start. Thanks to Lang Hames for making MCJIT a good replacement! llvm-svn: 215111	2014-08-07 14:21:18 +00:00
Eric Christopher	b5217507c7	Remove the target machine from CCState. Previously it was only used to get the subtarget and that's accessible from the MachineFunction now. This helps clear the way for smaller changes where getting a subtarget will require passing in a MachineFunction/Function as well. llvm-svn: 214988	2014-08-06 18:45:26 +00:00
Matt Arsenault	515c24b7e0	Correct comment llvm-svn: 214945	2014-08-06 00:44:25 +00:00
Matt Arsenault	d5f4de27b6	R600: Increase nearby load scheduling threshold. This partially fixes weird looking load scheduling in memcpy test. The load clustering doesn't seem particularly smart, but this method seems to be partially deprecated so it might not be worth trying to fix. llvm-svn: 214943	2014-08-06 00:29:49 +00:00
Matt Arsenault	c10853f29f	R600/SI: Implement areLoadsFromSameBasePtr This currently has a noticeable effect on the kernel argument loads. LDS and global loads are more problematic, I think because of how copies are currently inserted to ensure that the address is a VGPR. llvm-svn: 214942	2014-08-06 00:29:43 +00:00
Matt Arsenault	1070511847	R600/SI: Add definitions for ds_read2st64_ / ds_write2st64_ llvm-svn: 214936	2014-08-05 23:53:20 +00:00
Matt Arsenault	6532520fbf	R600/SI: Use register class instead of list of registers I'm not sure if this has any consequence or not. llvm-svn: 214902	2014-08-05 17:52:40 +00:00
Matt Arsenault	2549bb4b83	R600/SI: Add exec_lo and exec_hi subregisters. This allows accessing an SReg subregister with a normal subregister index, instead of getting a machine verifier error. Also be sure to include all of these subregisters in SReg_32. This fixes inferring SGPR instead of SReg when finding a super register class. llvm-svn: 214901	2014-08-05 17:52:37 +00:00
Tom Stellard	229d5e669b	R600/SI: Update MUBUF assembly string to match AMD proprietary compiler llvm-svn: 214866	2014-08-05 14:48:12 +00:00
Tom Stellard	b37f797678	R600/SI: Avoid generating REGISTER_LOAD instructions. SI doesn't use REGISTER_LOAD anymore, but it was still hitting this code path for 8-bit and 16-bit private loads. llvm-svn: 214865	2014-08-05 14:40:52 +00:00
Eric Christopher	fc6de428c8	Have MachineFunction cache a pointer to the subtarget to make lookups shorter/easier and have the DAG use that to do the same lookup. This can be used in the future for TargetMachine based caching lookups from the MachineFunction easily. Update the MIPS subtarget switching machinery to update this pointer at the same time it runs. llvm-svn: 214838	2014-08-05 02:39:49 +00:00
Eric Christopher	d913448b38	Remove the TargetMachine forwards for TargetSubtargetInfo based information and update all callers. No functional change. llvm-svn: 214781	2014-08-04 21:25:23 +00:00
Matt Arsenault	fa097f8f3d	R600/SI: Fix definitions for ds_read2 / ds_write2 instructions. These were just wrong, using the wrong register classes and store2 was missing an operand. llvm-svn: 214756	2014-08-04 18:49:22 +00:00
Eric Christopher	34aaf970e2	Move the R600 intrinsic support back to the target machine - there's nothing subtarget dependent about the intrinsic support in any backend as far as I can tell. llvm-svn: 214738	2014-08-04 17:37:43 +00:00
Matt Arsenault	329eda3b82	Use the known address space constant rather than checking it llvm-svn: 214729	2014-08-04 16:55:35 +00:00
Matt Arsenault	efb3d5347d	R600: Remove unused include llvm-svn: 214728	2014-08-04 16:55:33 +00:00
Matt Arsenault	9215b17eb7	R600/SI: Fix extra whitespace in asm str This slipped in in r214467, so something like V_MOV_B32_e32 v0, ... is now printed with 2 spaces between the instruction name and first operand. llvm-svn: 214660	2014-08-03 05:27:14 +00:00
Matt Arsenault	a80c8770f9	R600/SI: Fix formatting. Avoid weird line wrapping of BuildMI dest register. llvm-svn: 214608	2014-08-02 01:10:28 +00:00
Chandler Carruth	356665a36c	[SDAG] MorphNodeTo recursively deletes dead operands of the old formulation of the node, which isn't really the desired behavior from within the combiner or legalizer, but is necessary within ISel. I've added a hopefully helpful comment and fixed the only two places where this took place. Yet another step toward the combiner and legalizer not needing to use update listeners with virtual calls to manage the worklists behind legalization and combining. llvm-svn: 214574	2014-08-01 22:09:43 +00:00
Tom Stellard	4973a13680	Revert "R600: Move code for generating REGISTER_LOAD into R600ISelLowering.cpp" This reverts commit r214566. I did not mean to commit this yet. llvm-svn: 214572	2014-08-01 21:55:50 +00:00
Tom Stellard	d44c023b21	R600/SI: Remove leftover debugging code llvm-svn: 214569	2014-08-01 21:51:05 +00:00
Tom Stellard	c16f73d7c5	R600: Move code for generating REGISTER_LOAD into R600ISelLowering.cpp SI doesn't use REGISTER_LOAD anymore, but it was still hitting this code path for 8-bit and 16-bit private loads. llvm-svn: 214566	2014-08-01 21:50:47 +00:00
Matt Arsenault	cdcdb87a62	R600/SI: Don't display GDS bit for read2 This isn't displayed for any other instructions anymore, and isn't ever used. llvm-svn: 214523	2014-08-01 17:00:26 +00:00
Tom Stellard	aa9a1a813e	R600/SI: Fix build warning llvm-svn: 214475	2014-08-01 02:05:57 +00:00
Tom Stellard	b4a313a76f	R600/SI: Do abs/neg folding with ComplexPatterns Abs/neg folding has moved out of foldOperands and into the instruction selection phase using complex patterns. As a consequence of this change, we now prefer to select the 64-bit encoding for most instructions and the modifier operands have been dropped from integer VOP3 instructions. llvm-svn: 214467	2014-08-01 00:32:39 +00:00
Tom Stellard	0e975cf6e5	R600/SI: Simplify and fix handling of VOP2 in SIInstrInfo::legalizeOperands We were incorrectly assuming that all VOP2 instructions can read SGPRs in Src0, but this is not true for instructions that read carry-in from VCC. The old logic has been replaced with new logic which checks the defined register classes of the VOP2 instruction to determine whether or not to legalize the operands. llvm-svn: 214465	2014-08-01 00:32:35 +00:00
Tom Stellard	6407e1e632	R600/SI: Fold immediates when shrinking instructions This will prevent us from using extra MOV instructions once we prefer selecting 64-bit instructions. llvm-svn: 214464	2014-08-01 00:32:33 +00:00
Tom Stellard	86d12ebdbd	R600/SI: Fix incorrect commute operation in shrink instructions pass We were commuting the instruction by still shrinking it using the original opcode. NOTE: This is a candidate for the 3.5 branch. llvm-svn: 214463	2014-08-01 00:32:28 +00:00
Louis Gerbarg	67474e3755	Make sure no loads resulting from load->switch DAGCombine are marked invariant Currently when DAGCombine converts loads feeding a switch into a switch of addresses feeding a load the new load inherits the isInvariant flag of the left side. This is incorrect since invariant loads can be reordered in cases where it is illegal to reorder normal loads. This patch adds an isInvariant parameter to getExtLoad() and updates all call sites to pass in the data if they have it or false if they don't. It also changes the DAGCombine to use that data to make the right decision when creating the new load. llvm-svn: 214449	2014-07-31 21:45:05 +00:00
Matt Arsenault	f2733709ab	R600/SI: Remove redundant setting of bits on instructions. neverHasSideEffects is deprecated, and hasSideEffects = 0 is already set on the base classes of the basic ALU instruction classes. The base classes also already set mayLoad = 0 and mayStore = 0 llvm-svn: 214283	2014-07-30 03:18:57 +00:00
Matt Arsenault	7eb0a1014d	R600/SI: Consider adjacent offsets in getLdStBaseRegImmOfs We can treat ds_read2_* as a single offset if the offsets are adjacent. No test since emission of read2 instructions for partially aligned loads isn't implemented yet. llvm-svn: 214269	2014-07-30 01:01:10 +00:00
Matt Arsenault	1acc72f431	R600/SI: Implement getLdStBaseRegImmOfs llvm-svn: 214225	2014-07-29 21:34:55 +00:00
Matt Arsenault	1eb18309d5	R600/SI: Enable named operand table for DS instructions llvm-svn: 214217	2014-07-29 21:00:56 +00:00
Matt Arsenault	620155fd1d	Remove line with no effect llvm-svn: 214216	2014-07-29 21:00:53 +00:00
Matt Arsenault	e2fabd35b5	R600/SI: Add isMUBUF / isMTBUF Also add missing comments about how the flags work. llvm-svn: 214195	2014-07-29 18:51:56 +00:00
Matt Arsenault	0040f18256	R600/SI: Set bits on SMRD instructions Set mayStore = 0 and enable named operand table. llvm-svn: 214194	2014-07-29 18:51:54 +00:00
Matt Arsenault	57e74d2010	Fix typos / grammar. llvm-svn: 214147	2014-07-29 00:02:40 +00:00
Matt Arsenault	60bd28cefd	Fix header including itself llvm-svn: 214146	2014-07-29 00:02:37 +00:00
Matt Arsenault	b9f46eeff1	R600/SI: Fix return type for isMIMG / isSMRD All the others use bool, so these should too. llvm-svn: 214106	2014-07-28 17:59:38 +00:00
Matt Arsenault	46645fa102	R600/SI: Implement getOptimalMemOpType The default guess uses i32. This needs an address space argument to really do the right thing in all cases. llvm-svn: 214104	2014-07-28 17:49:26 +00:00
Matt Arsenault	86033cae84	R600/SI: Make argument loads invariant llvm-svn: 214101	2014-07-28 17:31:39 +00:00
Matt Arsenault	6f2a526101	Add alignment value to allowsUnalignedMemoryAccess Rename to allowsMisalignedMemoryAccess. On R600, 8 and 16 byte accesses are mostly OK with 4-byte alignment, and don't need to be split into multiple accesses. Vector loads with an alignment of the element type are not uncommon in OpenCL code. llvm-svn: 214055	2014-07-27 17:46:40 +00:00
Matt Arsenault	a5789bb4e1	R600: Move intrinsic lowering to separate functions llvm-svn: 214023	2014-07-26 06:23:37 +00:00
Matt Arsenault	c824458e81	R600/SI: Allow partial unrolling and increase thresholds. llvm-svn: 213985	2014-07-25 23:02:42 +00:00
Eric Christopher	ac4b69e40b	Move R600 subtarget dependent variables onto the subtarget. No functional change. llvm-svn: 213982	2014-07-25 22:22:39 +00:00
Chandler Carruth	3de980d2ff	[SDAG] Enable the new assert for out-of-range result numbers in SDValues, fixing the two bugs left in the regression suite. The key for both of these was the use of a single value type rather than a VTList which caused an unintentionally single-result merge-value node. Fix this by getting the appropriate VTList in place. Doing this exposed that the comments in x86's code about how MUL_LOHI operands are handled are wrong. The bug with the use of out-of-range result numbers was hiding the bug about the order of operands here (as best I can tell). There are more places where the code appears to get this backwards still... llvm-svn: 213931	2014-07-25 09:19:23 +00:00
Matt Arsenault	83592a2d32	R600: Add FMA instructions for Evergreen llvm-svn: 213882	2014-07-24 17:41:01 +00:00
Matt Arsenault	83e60581c3	R600: Add new functions for splitting vector loads and stores. These will be used in future patches and shouldn't change anything yet. llvm-svn: 213877	2014-07-24 17:10:35 +00:00
Matt Arsenault	9acb978105	R600: Match rcp node on pre-SI llvm-svn: 213844	2014-07-24 06:59:24 +00:00
Matt Arsenault	0daeb63f03	R600: Fix LowerSDIV24 Use ComputeNumSignBits instead of checking for i8 / i16 which only worked when AMDIL was lying about having legal i8 / i16. If an integer is known to fit in 24-bits, we can do division faster with float ops. llvm-svn: 213843	2014-07-24 06:59:20 +00:00
Matt Arsenault	034d666bb7	R600: Implement enableClusterLoads() llvm-svn: 213831	2014-07-24 02:10:17 +00:00
Saleem Abdulrasool	913666f9bc	R600: silence GCC warning GCC believes it may be possible to not return a value from the switch: lib/Target/R600/SIRegisterInfo.cpp:187:1: warning: control reaches end of non-void function [-Wreturn-type] Add an unreachable label to indicate that this is not possible and still permit switch coverage checking. llvm-svn: 213572	2014-07-21 17:52:00 +00:00
Tom Stellard	bda32c9e47	R600/SI: Refactor VOP3 instruction definitions llvm-svn: 213571	2014-07-21 17:44:29 +00:00
Tom Stellard	e5a1cdab47	R600/SI: Separate encoding and operand definitions into their own classes llvm-svn: 213570	2014-07-21 17:44:28 +00:00
Tom Stellard	f757b5ddc2	R600/SI: Initialize encoding fields of unused VOP3 modifiers to 0 llvm-svn: 213564	2014-07-21 17:12:40 +00:00
Tom Stellard	ca000c6c7b	R600/SI: Initialize unused VOP3 sources to 0 instead of SIOperand.ZERO llvm-svn: 213563	2014-07-21 17:12:37 +00:00
Tom Stellard	1aaad6970c	R600/SI: Add instruction shrinking pass This pass converts 64-bit instructions to 32-bit when possible. llvm-svn: 213561	2014-07-21 16:55:33 +00:00
Tom Stellard	63797d4a23	R600/SI: VOPC instructions explicitly define VCC Therefore we don't need to add it to the implicit defs list. llvm-svn: 213558	2014-07-21 16:27:24 +00:00
Tom Stellard	e812f2fdd8	R600/SI: Clean up some of the unused REGISTER_{LOAD,STORE} code There are a few more cleanups to do, but I ran into some problems with ext loads and trunc stores, when I tried to change some of the vector loads and stores from custom to legal, so I wasn't able to get rid of everything. llvm-svn: 213552	2014-07-21 15:45:06 +00:00
Tom Stellard	b02094e115	R600/SI: Use scratch memory for large private arrays llvm-svn: 213551	2014-07-21 15:45:01 +00:00
Tom Stellard	42639a57de	R600/SI: Specify wavefront size for SI and CI llvm-svn: 213550	2014-07-21 15:44:58 +00:00
Tom Stellard	8e44d948b6	R600/SI: Remove vaddr operand from BUFFER_LOAD_*_OFFSET instructions This operand is never used. llvm-svn: 213549	2014-07-21 15:44:55 +00:00
Tom Stellard	067c81567b	R600/SI: Store constant initializer data in constant memory This implements a solution for constant initializers suggested by Vadim Girlin, where we store the data after the shader code and then use the S_GETPC instruction to compute its address. This saves us the trouble of creating a new buffer for constant data and then having to pass the pointer to the kernel via user SGPRs or the input buffer. llvm-svn: 213530	2014-07-21 14:01:14 +00:00
Tom Stellard	b2114caf62	R600/SI: Add isCFDepth0 Predicate to SALU addc pattern llvm-svn: 213529	2014-07-21 14:01:12 +00:00
Tom Stellard	54a3b65bb9	R600/SI: Use VALU for i1 XOR llvm-svn: 213528	2014-07-21 14:01:10 +00:00
Tom Stellard	01825afad7	R600/SI: Use a custom encoding method for simm16 in SOPP branch instructions This allows us to explicitly define the type of fixup that is needed, so we can distinguish this from future fixup types. llvm-svn: 213527	2014-07-21 14:01:08 +00:00
Tom Stellard	e08fe68bdd	R600/SI: Rename SOPP operands to match the encoding fields llvm-svn: 213526	2014-07-21 14:01:05 +00:00
NAKAMURA Takumi	45e0a83141	SIISelLowering.cpp: Define _USE_MATH_DEFINES to let M_PI provided on MS <cmath>. FIXME: Would it be better to move it into configure? llvm-svn: 213477	2014-07-20 11:15:07 +00:00
Matt Arsenault	0163e033e2	R600: Remove unused function llvm-svn: 213472	2014-07-20 06:31:06 +00:00
Matt Arsenault	e261b6e853	R600/SI: Remove dead code and add missing tests. This probably was killed by some generic DAGCombiner improvements in checking the TargetBooleanContents instead of just 1. llvm-svn: 213471	2014-07-20 06:11:02 +00:00
Matt Arsenault	1c407fb5a3	Revert accidentally committed r213459 llvm-svn: 213461	2014-07-19 19:17:33 +00:00
Matt Arsenault	b38677ee2f	XXX - Increase unroll threshold llvm-svn: 213459	2014-07-19 19:16:34 +00:00
Matt Arsenault	ad14ce84b7	R600/SI: implement range reduction for sin/cos These instructions can only take a limited input range, and return the constant value 1 out of range. We should do range reduction to be able to process arbitrary values. Use a FRACT instruction after normalization to achieve this. Also add a test for constant folding with the lowered code with unsafe-fp-math enabled. v2: use DAG lowering instead of intrinsic, adapt test v3: calculate constant, fold pattern into instruction definition v4: misc style fixes, add sin-fold testcase, cosmetics Patch by Grigori Goronzy llvm-svn: 213458	2014-07-19 18:44:39 +00:00
Matt Arsenault	a93441fe9c	R600: Implement a few simple TTI queries. I'm not sure if these have any effect right now. llvm-svn: 213455	2014-07-19 18:15:16 +00:00
Tim Northover	00fdbbbf60	R600: support fpext/fptrunc operations to and from f16. llvm-svn: 213376	2014-07-18 13:01:37 +00:00
Tim Northover	f861de3d7b	R600: support f16 -> f64 conversion intrinsic. Unfortunately, we don't seem to have a direct truncation, but the extension can be legally split into two operations so we should support that. llvm-svn: 213357	2014-07-18 08:43:24 +00:00
Matt Arsenault	3dd43fc75d	R600: Implement TTI:getPopcntSupport The test is just copied from X86, and I don't know of a better way to test it. llvm-svn: 213351	2014-07-18 06:07:13 +00:00
Matt Arsenault	97483694e7	Fix typos llvm-svn: 213285	2014-07-17 17:50:22 +00:00
Tim Northover	fd7e424935	CodeGen: extend f16 conversions to permit types > float. This makes the two intrinsics @llvm.convert.from.f16 and @llvm.convert.to.f16 accept types other than simple "float". This is only strictly needed for the truncate operation, since otherwise double rounding occurs and there's no way to represent the strict IEEE conversion. However, for symmetry we allow larger types in the extend too. During legalization, we can expand an "fp16_to_double" operation into two extends for convenience, but abort when the truncate isn't legal. A new libcall is probably needed here. Even after this commit, various target tweaks are needed to actually use the extended intrinsics. I've put these into separate commits for clarity, so there are no actual tests of f64 conversion here. llvm-svn: 213248	2014-07-17 10:51:23 +00:00
Matt Arsenault	ac6e39cf3b	Use range for llvm-svn: 213230	2014-07-17 06:19:06 +00:00
Matt Arsenault	5e2b0f51e7	R600: Short circuit alloca check if address space isn't private. Skip calling GetUnderlyingObject in cases where it obviously isn't from an alloca. This should only be a compile time improvement. llvm-svn: 213229	2014-07-17 06:13:41 +00:00
Matt Arsenault	22ca3f8860	R600/SI: Allow using f32 rcp / rsq when denormals not handled. These are precise enough to use for OpenCL unless denormals are handled. llvm-svn: 213107	2014-07-15 23:50:10 +00:00
Matt Arsenault	0d89e849bd	R600/SI: Fix select on i1 llvm-svn: 213096	2014-07-15 21:44:37 +00:00
Matt Arsenault	e9fa3b8e6b	R600/SI: Implement less wrong f32 fdiv Assuming single precision denormals and accurate sqrt/div are not reported, this passes the OpenCL conformance test. llvm-svn: 213089	2014-07-15 20:18:31 +00:00
Matt Arsenault	1d077749ea	R600: Add predicate for UnsafeFPMath llvm-svn: 213088	2014-07-15 20:18:24 +00:00
Matt Arsenault	84446a026b	R600: Remove intrinsics that appear to be unused llvm-svn: 213087	2014-07-15 20:10:27 +00:00
Jan Vesely	6ddb8dd442	R600: Implement zero undef variants of ctlz/cttz v2: use ffbh/l if available v3: Rebase on top of Matt's SI patches Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 213072	2014-07-15 15:51:09 +00:00
NAKAMURA Takumi	04b8b37f56	Prune Redundant libdeps in CMake's target_link_libraries and LLVMBuild.txt. I checked this with Release+Asserts on x86_64-mingw32. Please restore partially if this were overkill. llvm-svn: 213064	2014-07-15 11:37:03 +00:00
Matt Arsenault	ca3976f7ae	R600: Add dag combine for copy of an illegal type. This helps avoid redundant instructions to unpack, and repack the vectors. Ideally we could recognize that pattern and eliminate it. Currently v4i8 and other small element type vectors are scalarized, so this has the added bonus of avoiding that. llvm-svn: 213031	2014-07-15 02:06:31 +00:00
Matt Arsenault	f171cf23b8	R600: Add denormal handling subtarget features. llvm-svn: 213018	2014-07-14 23:40:49 +00:00
Matt Arsenault	c6ae7b4763	R600/SI: Default to no single precision denormals. llvm-svn: 213017	2014-07-14 23:40:43 +00:00
Matt Arsenault	c3f6a7e44e	Remove unused include llvm-svn: 212898	2014-07-13 03:08:59 +00:00
Matt Arsenault	d32dbb6a10	R600: Use range for and fix missing consts. llvm-svn: 212897	2014-07-13 03:06:43 +00:00
Matt Arsenault	762af96f46	R600: Make ShaderType private llvm-svn: 212896	2014-07-13 03:06:39 +00:00
Matt Arsenault	d9a23ab20d	R600: Add option to disable promote alloca This can make writing some tests harder, so add a flag to disable it. llvm-svn: 212893	2014-07-13 02:08:26 +00:00
Marek Olsak	eac5062cc0	R600/SI: Use i32 vectors for resources and samplers This affects new intrinsics only. What surprises me is that v32i8 still works. llvm-svn: 212831	2014-07-11 17:11:52 +00:00
Marek Olsak	d8ecaeec02	R600/SI: add sample and image intrinsics exposing all instruction fields We need the intrinsics with offsets, so why not just add them all. The R128 parameter will also be useful for reducing SGPR usage. GL_ARB_image_load_store also adds some image GLSL modifiers like "coherent", so Mesa will probably translate those to slc, glc, etc. When LLVM 3.5 is released, I'll switch Mesa to these new intrinsics. llvm-svn: 212830	2014-07-11 17:11:46 +00:00
Marek Olsak	ba77c3e4ed	R600/SI: fix shadow mapping for 1D and 2D array textures It was conflicting with def TEX_SHADOW_ARRAY, which also handles them. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 212829	2014-07-11 17:11:39 +00:00
Jan Vesely	2cb62ce2a0	R600: Implement float to long/ulong Use alg. from LegalizeDAG.cpp Move Expand setting to SIISellowering v2: Extend existing tests instead of creating new ones v3: use separate LowerFPTOSINT function v4: use TargetLowering::expandFP_TO_SINT add comment about using FP_TO_SINT for uints Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 212773	2014-07-10 22:40:21 +00:00
Matt Arsenault	b0df92577d	R600/SI: Add support for llvm.convert.{to|from}.fp16 llvm-svn: 212676	2014-07-10 03:22:20 +00:00
Matt Arsenault	d2c9e08b63	R600: Fix mishandling of load / store chains. Fixes various bugs with reordering loads and stores. Scalarized vector loads weren't collecting the chains at all. llvm-svn: 212473	2014-07-07 18:34:45 +00:00
Matt Arsenault	fda9dad17f	Fix typo, weird indentation llvm-svn: 212472	2014-07-07 18:34:42 +00:00
Matt Arsenault	4261973548	Use cast<> instead of dyn_cast + assert llvm-svn: 212380	2014-07-05 21:16:43 +00:00
Matt Arsenault	258c6e7cd9	Fix grammar llvm-svn: 212379	2014-07-05 21:16:40 +00:00
Chandler Carruth	9d010fffe1	[codegen,aarch64] Add a target hook to the code generator to control vector type legalization strategies in a more fine grained manner, and change the legalization of several v1iN types and v1f32 to be widening rather than scalarization on AArch64. This fixes an assertion failure caused by scalarizing nodes like "v1i32 trunc v1i64". As v1i64 is legal it will fail to scalarize v1i32. This also provides a foundation for other targets to have more granular control over how vector types are legalized. Patch by Hao Liu, reviewed by Tim Northover. I'm committing it to allow some work to start taking place on top of this patch as it adds some really important hooks to the backend that I'd like to immediately start using. =] http://reviews.llvm.org/D4322 llvm-svn: 212242	2014-07-03 00:23:43 +00:00
Tom Stellard	e9219e0026	R600: Add a comment that llvm.AMDGPU.trunc is a legacy intrinsic llvm-svn: 212218	2014-07-02 20:53:57 +00:00
Tom Stellard	7c1838d797	R600/SI: Use a ComplexPattern for ADDR64 addressing of MUBUF loads llvm-svn: 212217	2014-07-02 20:53:56 +00:00
Tom Stellard	10ae6a0e6a	R600: Promote i64 loads to v2i32 llvm-svn: 212216	2014-07-02 20:53:54 +00:00
Tom Stellard	b2de94e0c6	R600/SI: Adjust SGPR live ranges before register allocation SGPRs are written by instructions that sometimes will ignore control flow, which means if you have code like: if (VGPR0) { SGPR0 = S_MOV_B32 0 } else { SGPR0 = S_MOV_B32 1 } The value of SGPR0 will be 1 no matter what the condition is. In order to deal with this situation correctly, we need to view the program as if it were a single basic block when we calculate the live ranges for the SGPRs. The way we actually update the live range is by iterating over all of the segments in each LiveRange object and setting the end of each segment equal to the start of the next segment. So a live range like: [3888r,9312r:0)[10032B,10384B:0) 0@3888r will become: [3888r,10032B:0)[10032B,10384B:0) 0@3888r This change will allow us to use SALU instructions within branches. llvm-svn: 212215	2014-07-02 20:53:48 +00:00
Tom Stellard	a305f93d81	R600/SI: Add verifier check for immediates in register operands. llvm-svn: 212214	2014-07-02 20:53:44 +00:00
Matt Arsenault	c324b95c77	R600: Fix crashes when an illegal type load or store is not handled. I don't think anything hits this now, but will be exposed in future patches. llvm-svn: 212197	2014-07-02 17:44:53 +00:00
Matt Arsenault	d0e0f0aea0	R600: Move mul combine to separate function llvm-svn: 212052	2014-06-30 17:55:48 +00:00
Matt Arsenault	5d293eefda	R600: Remove unused declarations leftover from AMDIL llvm-svn: 212051	2014-06-30 17:37:17 +00:00
Craig Topper	66e588be09	Add ops() method to SDNode that returns an ArrayRef<SDUse>. Use it to simplify some code. llvm-svn: 211993	2014-06-29 00:40:57 +00:00
Matt Arsenault	d782d05666	R600: Move trivial getters into header, use initializer list llvm-svn: 211917	2014-06-27 17:57:00 +00:00
Matt Arsenault	642d2e78b3	R600: Don't crash on unhandled instruction in promote alloca llvm-svn: 211906	2014-06-27 16:52:49 +00:00
Matt Arsenault	6f62cf80d0	Fix missing newline and simplify debug printing. llvm-svn: 211850	2014-06-27 02:36:59 +00:00
Matt Arsenault	961ca43180	R600: Move load/store ReplaceNodeResults to common code. Future patches will want to custom lower loads on SI. llvm-svn: 211848	2014-06-27 02:33:47 +00:00
Matt Arsenault	0989d51520	R600/SI: Add FP mode bits to binary. The default rounding mode to initialize the mode register needs to be reported to the runtime. Fill in other bits a kernel may be interested in setting for future use. llvm-svn: 211791	2014-06-26 17:22:30 +00:00
Aaron Ballman	3c81e46b57	Silencing a warning about isZExtFree hiding an inherited virtual function. No functional change intended. llvm-svn: 211783	2014-06-26 13:45:47 +00:00
Matt Arsenault	c6f8fdb4e5	R600: Fix vector FMA llvm-svn: 211757	2014-06-26 01:28:05 +00:00
Tom Stellard	b02c268cbd	R600/SI: Use a ComplexPattern for MUBUF stores Now that non-leaf ComplexPatterns are allowed we can fold all the MUBUF store patterns into the instruction definition. We will also be able to reuse this new ComplexPattern for MUBUF loads and atomic operations. llvm-svn: 211644	2014-06-24 23:33:07 +00:00
Tom Stellard	9b3816b5ee	R600: Promote i64 stores to v2i32 Now we need only one 64-bit pattern for stores. llvm-svn: 211643	2014-06-24 23:33:04 +00:00
Matt Arsenault	257d48d22c	R600: Fix inconsistency in rsq instructions. R600 was using a clamped version of rsq, but SI was not. Add a new rsq_clamped intrinsic and use them consistently. It's unclear to me from the documentation what behavior the R600 instructions have, so I assume they have the legacy behavior described by the SI documents. For R600, use RECIPSQRT_IEEE for both llvm.AMDGPU.rsq.legacy and llvm.AMDGPU.rsq. R600 also has RECIPSQRT_FF, which I'm not sure how it fits in here. llvm-svn: 211637	2014-06-24 22:13:39 +00:00
Matt Arsenault	d40b970616	R600: Remove DIV_INF This corresponded to an amdil instruction which there is a 2 instruction equivalent for. llvm-svn: 211616	2014-06-24 17:42:16 +00:00
Matt Arsenault	bd469d5e67	R600/SI: Move pattern to instruction definition llvm-svn: 211614	2014-06-24 17:17:06 +00:00
Matt Arsenault	becb140324	R600/SI: Verify restrictions on div_scale operands. llvm-svn: 211524	2014-06-23 18:28:31 +00:00
Matt Arsenault	f2b0aebb8a	R600/SI: Fix div_scale intrinsic. The operand that must match one of the others does matter, and implement selecting for it. llvm-svn: 211523	2014-06-23 18:28:28 +00:00
Matt Arsenault	1d555c4e91	R600: Remove AMDILISelLowering llvm-svn: 211519	2014-06-23 18:00:55 +00:00
Matt Arsenault	d5f91fd883	R600: Select is not expensive. llvm-svn: 211518	2014-06-23 18:00:52 +00:00
Matt Arsenault	c4d3d3a16e	R600: Move add/sub with overflow out of AMDILISelLowering Add more tests for these. llvm-svn: 211517	2014-06-23 18:00:49 +00:00
Matt Arsenault	e54e1c3a21	R600: Move more out of AMDILISelLowering llvm-svn: 211516	2014-06-23 18:00:44 +00:00
Matt Arsenault	72573adbf2	R600: Don't set fp_round_inreg action. There's no point in setting this since it seems to only be created in 1 place for ppcf128. llvm-svn: 211515	2014-06-23 18:00:41 +00:00
Matt Arsenault	b8b5153935	R600/SI: Handle i64 sub. We can handle it the same way as add llvm-svn: 211514	2014-06-23 18:00:38 +00:00
Matt Arsenault	9fa3f93173	R600/SI: Move selection of i64 add to separate function. Also don't use a SmallVector for fixed size array. llvm-svn: 211513	2014-06-23 18:00:34 +00:00
Matt Arsenault	c791f39912	R600: Rename AMDIL file llvm-svn: 211512	2014-06-23 18:00:31 +00:00
Matt Arsenault	f4d871b113	Fix missing words in sentence llvm-svn: 211511	2014-06-23 18:00:26 +00:00
Matt Arsenault	762ef017db	Use helper function llvm-svn: 211510	2014-06-23 18:00:24 +00:00
Matt Arsenault	236d9afd18	Alphabetize forward declarations llvm-svn: 211509	2014-06-23 18:00:20 +00:00
Jan Vesely	343cd6f056	R600: Use LowerSDIVREM for i64 node replace v2: move div/rem node replacement to R600ISelLowering make lowerSDIVREM protected Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 211478	2014-06-22 21:43:01 +00:00
Jan Vesely	109efdff6a	R600: Implement custom SDIVREM. Instead of separate SDIV/SREM. SDIV used UDIV which in turn used UDIVREM anyway. SREM used SDIV(UDIV->UDIVREM)+MUL+SUB, using UDIVREM directly is more efficient. v2: Don't use all caps names Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 211477	2014-06-22 21:43:00 +00:00
Tom Stellard	ae4c9e7bc3	R600/SI: Add patterns for ctpop inside a branch llvm-svn: 211378	2014-06-20 17:06:11 +00:00
Tom Stellard	9c603ebca4	R600/SI: Add a pattern for f32 ftrunc llvm-svn: 211377	2014-06-20 17:06:09 +00:00
Tom Stellard	a79e9f0f6d	R600: Expand vector flog2 llvm-svn: 211376	2014-06-20 17:06:07 +00:00
Tom Stellard	5222a88653	R600: Expand vector fexp2 llvm-svn: 211375	2014-06-20 17:06:05 +00:00
Tom Stellard	de16a2e59f	R600/SI: SI Control Flow Annotation bug fixed Mixing of the AddAvailableValue and GetValueAtEndOfBlock methods of SSAUpdater led to endless loop generation when nested loops were annotated. This fixes a bug in the OCL_ML/KNN OpenCV test. The test case is too complex for FileCheck and would be very fragile. Patch by: Elena Denisova llvm-svn: 211374	2014-06-20 17:06:02 +00:00
Tom Stellard	c9dedb8e29	R600/SI: Add a VALU pattern for i64 xor llvm-svn: 211373	2014-06-20 17:05:57 +00:00
Matt Arsenault	f5e2997aff	R600: Trivial subtarget feature cleanups. Remove an unused AMDIL leftover, correct extra periods appearing in the help menu. llvm-svn: 211341	2014-06-20 06:50:05 +00:00
Alp Toker	1d099d9339	Fix typos llvm-svn: 211304	2014-06-19 19:41:26 +00:00
Craig Topper	35b2f75733	Convert some assert(0) to llvm_unreachable or fold an 'if' condition into the assert. llvm-svn: 211254	2014-06-19 06:10:58 +00:00


1428 Commits
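
Note on commit b2de94e0c6 (llvm-svn: 212215): the message describes extending SGPR live ranges by setting the end of each segment to the start of the next one. The following is a minimal sketch of that single step, not the code from the commit. The Segment struct and the closeGaps function are hypothetical names introduced here for illustration, slot indexes are reduced to plain integers, and segments are assumed to be sorted by start and non-overlapping.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for one segment of a live range, covering [start, end).
// Real slot indexes carry more information; plain integers are an assumption.
struct Segment {
    unsigned start;
    unsigned end;
};

// Extend every segment so that it ends where the following segment begins.
// The last segment is left unchanged because nothing follows it.
void closeGaps(std::vector<Segment> &segments) {
    for (std::size_t i = 0; i + 1 < segments.size(); ++i)
        segments[i].end = segments[i + 1].start;
}
```

Applied to the example in the commit message, the segments [3888,9312) and [10032,10384) become [3888,10032) and [10032,10384). According to the later commit 60024a0558 (llvm-svn: 218351), directly modifying live intervals this way caused machine verification errors and was replaced with pseudo instructions that carry implicit uses.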