llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Arsenault	f9a995d68c	R600: Fix extloads from i8 / i16 to i64. This appears to only be working for global loads. Private and local break for other reasons. llvm-svn: 203135	2014-03-06 17:34:12 +00:00
Matt Arsenault	9fe669c522	R600/SI: Expand selects on vectors. llvm-svn: 203134	2014-03-06 17:34:03 +00:00
Matt Arsenault	e6ed1d796f	Fix missing C++ mode comment llvm-svn: 203133	2014-03-06 17:33:58 +00:00
Chandler Carruth	7da14f1ab9	[Layering] Move InstVisitor.h into the IR library as it is pretty obviously coupled to the IR. llvm-svn: 203064	2014-03-06 03:23:41 +00:00
Matt Arsenault	ca6dcfcf59	Fix typo llvm-svn: 203013	2014-03-05 21:47:22 +00:00
Chandler Carruth	a4ea269f15	[Modules] Move ValueMap to the IR library. While this class does not directly care about the Value class (it is templated so that the key can be any arbitrary Value subclass), it is in fact concretely tied to the Value class through the ValueHandle's CallbackVH interface which relies on the key type being some Value subclass to establish the value handle chain. Ironically, the unittest is already in the right library. llvm-svn: 202824	2014-03-04 11:26:31 +00:00
Benjamin Kramer	b6d0bd48bd	[C++11] Replace llvm::next and llvm::prior with std::next and std::prev. Remove the old functions. llvm-svn: 202636	2014-03-02 12:27:27 +00:00
Craig Topper	73156025e0	Switch all uses of LLVM_OVERRIDE to just use 'override' directly. llvm-svn: 202621	2014-03-02 09:09:27 +00:00
Craig Topper	77dfe45f81	Switch all uses of LLVM_FINAL to just use 'final', and remove the macro. llvm-svn: 202618	2014-03-02 08:08:51 +00:00
Tom Stellard	9b9e926481	R600: Verify all instructions in the AsmPrinter on debug builds Make a call to R600's implementation of verifyInstruction() to check that instructions are only using legal operands. llvm-svn: 202544	2014-02-28 21:36:41 +00:00
Tom Stellard	d61a1c3360	R600/SI: Expand all v16[if]32 operations llvm-svn: 202543	2014-02-28 21:36:37 +00:00
Rafael Espindola	8837995b52	Remove MCPureStreamer. We moved MCJIT to use native object formats a long time ago and R600 now uses ELF, so it was dead. llvm-svn: 202408	2014-02-27 16:17:34 +00:00
Michel Danzer	9e61c4b6cd	R600/SI: Optimize SI_KILL for constant operands If the SI_KILL operand is constant, we can either clear the exec mask if the operand is negative, or do nothing otherwise. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 202337	2014-02-27 01:47:09 +00:00
Michel Danzer	6f273c57db	R600/SI: Allow SI_KILL for geometry shaders Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 202336	2014-02-27 01:47:02 +00:00
Matt Arsenault	530dde4386	R600: Remove unnecessary build_vector pattern. It is already fully handled in AMDGPUISelDAGToDAG. llvm-svn: 202312	2014-02-26 23:00:58 +00:00
Tom Stellard	fd0d86c322	R600: Don't unconditionally unroll loops with private memory accesses This causes the size of the scrypt kernel to explode and eats all the memory on some systems. llvm-svn: 202195	2014-02-25 21:36:21 +00:00
Tom Stellard	1f15bff0df	R600/SI: Custom select 64-bit ADD llvm-svn: 202194	2014-02-25 21:36:18 +00:00
Matt Arsenault	a81aee8277	Fix unused variable llvm-svn: 202080	2014-02-24 21:16:50 +00:00
Matt Arsenault	41e2f2bacd	R600/SI - Add new CI arithmetic instructions. Does not yet include larger part required to match v_mad_i64_i32 / v_mad_u64_u32. llvm-svn: 202077	2014-02-24 21:01:28 +00:00
Matt Arsenault	d0ce2bd8e4	R600: Make check clearer. The check is clearer as southern islands or later, rather than checking for later than northern islands. llvm-svn: 202076	2014-02-24 21:01:23 +00:00
Matt Arsenault	21a3faaf25	Fix DOT4 missing from getTargetOpcodeName llvm-svn: 202075	2014-02-24 21:01:21 +00:00
Tom Stellard	967bf5813f	R600/SI: Expand all v8[if]32 operations llvm-svn: 201371	2014-02-13 23:34:15 +00:00
Tom Stellard	f16d38cbb5	R600/SI: Add a pattern for i32 anyext Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 201370	2014-02-13 23:34:13 +00:00
Tom Stellard	6c7a7e82a7	R600/SI: Completely Disable TypeRewriter on compute llvm-svn: 201369	2014-02-13 23:34:12 +00:00
Tom Stellard	80be9650e3	R600/SI: Split global vector loads with more than 4 elements llvm-svn: 201368	2014-02-13 23:34:10 +00:00
Benjamin Kramer	53f9df4c93	R600: Always implement both versions of isTruncateFree and add a sanity check. llvm-svn: 201222	2014-02-12 10:17:54 +00:00
Matt Arsenault	71b71d25eb	R600/SI: Fix assertion on infinite loops. This isn't the most useful case to fix in the real world, but bugpoint runs into this. llvm-svn: 201177	2014-02-11 21:12:38 +00:00
Matt Arsenault	0cdcd961bf	R600: Implement isTruncateFree Truncation is just accessing a subregister for any multiple of the register size, so it's free. llvm-svn: 201107	2014-02-10 19:57:42 +00:00
Tom Stellard	5d7aaaed7d	R600/SI: Initialize M0 and emit S_WQM_B64 whenever DS instructions are used DS instructions that access local memory can only uses addresses that are less than or equal to the value of M0. When M0 is uninitialized, then we experience undefined behavior. This patch also changes the behavior to emit S_WQM_B64 on pixel shaders no matter what kind of DS instruction is used. llvm-svn: 201097	2014-02-10 16:58:30 +00:00
Tom Stellard	9a32e5f29a	R600/SI: Only use S_WQM_B64 in pixel shaders This doesn't change any functionality, since we only have two shader types (compute and pixel) that use local memory. We're just changing the logic to match the documentation. llvm-svn: 201096	2014-02-10 16:58:27 +00:00
Tom Stellard	e236794578	R600/SI: Add a MUBUF store pattern for Reg+Imm offsets llvm-svn: 200935	2014-02-06 18:36:41 +00:00
Tom Stellard	2937cbc005	R600/SI: Add a MUBUF store pattern for Imm offsets llvm-svn: 200934	2014-02-06 18:36:39 +00:00
Tom Stellard	11624bc577	R600/SI: Add a MUBUF load pattern for Reg+Imm offsets llvm-svn: 200933	2014-02-06 18:36:38 +00:00
Tom Stellard	044e418f15	R600/SI: Use immediates offsets for SMRD instructions whenever possible There was a problem with the old pattern, so we were copying some larger immediates into registers when we could have been encoding them in the instruction. llvm-svn: 200932	2014-02-06 18:36:34 +00:00
Matt Arsenault	25793a3f22	Add address space argument to allowsUnalignedMemoryAccess. On R600, some address spaces have more strict alignment requirements than others. llvm-svn: 200887	2014-02-05 23:15:53 +00:00
Michel Danzer	5d26fdfcba	R600/SI: Add pattern for zero-extending i1 to i32 Fixes opencl-example if_* tests with radeonsi. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=74469 Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 200830	2014-02-05 09:48:05 +00:00
Duncan P. N. Exon Smith	8e661efc00	cleanup: scc_iterator consumers should use isAtEnd No functional change. Updated loops from: for (I = scc_begin(), E = scc_end(); I != E; ++I) to: for (I = scc_begin(); !I.isAtEnd(); ++I) for teh win. llvm-svn: 200789	2014-02-04 19:19:07 +00:00
Rafael Espindola	7cbbd28c67	Every target uses .align. Simplify. llvm-svn: 200782	2014-02-04 18:39:51 +00:00
Tom Stellard	aeb456438c	R600/SI: Expand i1 BR_CC This fixes a crashes in the OpenCV test suite and also the scrypt kernel in bfgminer. I was unable to come up with a reduced test case for this. https://bugs.freedesktop.org/show_bug.cgi?id=72785 llvm-svn: 200776	2014-02-04 17:18:43 +00:00
Tom Stellard	b8725d84d6	R600/SI: Don't assume copies will be coalesced in SIFixSGPRCopies There is no lit test for this, because it would be too big and complicated, but it does fix a crash in the Arithm/Absdiff.* OpenCV test. llvm-svn: 200775	2014-02-04 17:18:42 +00:00
Tom Stellard	0ec134f3d6	R600/SI: Custom lower i64 ISD::SELECT llvm-svn: 200774	2014-02-04 17:18:40 +00:00
Tom Stellard	bfebd1fc7e	R600: Enable vector fpow. The OpenCL specs say: "The vector versions of the math functions operate component-wise. The description is per-component." Patch by: Jan Vesely Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 200773	2014-02-04 17:18:37 +00:00
Michel Danzer	624b02aa67	R600/SI: Fix fneg for 0.0 V_ADD_F32 with source modifier does not produce -0.0 for this. Just manipulate the sign bit directly instead. Also add a pattern for (fneg (fabs ...)). Fixes a bunch of bit encoding piglit tests with radeonsi. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 200743	2014-02-04 07:12:38 +00:00
Matt Arsenault	d5ab971b54	Add DEBUG_TYPE to SIAnnotateControlFlow llvm-svn: 200720	2014-02-03 22:58:05 +00:00
Matt Arsenault	f5958dded4	R600/SI: Fix insertelement with dynamic indices. This didn't work for any integer vectors, and didn't work with some sizes of float vectors. This should now work with all sizes of float and i32 vectors. llvm-svn: 200619	2014-02-02 00:05:35 +00:00
Rafael Espindola	277f9061fc	Remove the last hasRawTextSupport call from R600. There is nothing wrong with printing the disassembly section when printing text. An hypothetical assembler would then produce a .o just like our direct object emission produces. llvm-svn: 200583	2014-01-31 22:14:06 +00:00
Rafael Espindola	887541fe27	Replace another use with hasRawTextSupport+EmitRawText with emitRawComment. llvm-svn: 200582	2014-01-31 22:08:19 +00:00
Rafael Espindola	19656ba7ea	Use emitRawComment to avoid a call to hasRawTextSupport. llvm-svn: 200581	2014-01-31 21:54:49 +00:00
David Woodhouse	d2cca113df	Delete MCSubtargetInfo data members from target MCCodeEmitter classes The subtarget info is explicitly passed to the EncodeInstruction method and we should use that subtarget info to influence any encoding decisions. llvm-svn: 200350	2014-01-28 23:13:25 +00:00
David Woodhouse	3fa98a65e9	Propagate MCSubtargetInfo through TableGen's getBinaryCodeForInstr() llvm-svn: 200349	2014-01-28 23:13:18 +00:00

1 2 3 4 5 ...

729 Commits