llvm-project

Commit Graph

Author	SHA1	Message	Date
Tom Stellard	3a7c34c778	R600: Expand SUB for v2i32/v4i32 Patch by: Aaron Watry Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Aaron Watry <awatry@gmail.com> NOTE: This is a candidate for the 3.3 branch. llvm-svn: 181579	2013-05-10 02:09:39 +00:00
Tom Stellard	3deddc5079	R600: Expand MUL for v4i32/v2i32 Fixes piglit test for OpenCL builtin mul24, and allows mad24 to run. Patch by: Aaron Watry Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Aaron Watry <awatry@gmail.com> NOTE: This is a candidate for the 3.3 branch. llvm-svn: 181578	2013-05-10 02:09:34 +00:00
Tom Stellard	7fb3963498	R600: Expand SRA for v4i32/v2i32 v2: Add v4i32 test Patch by: Aaron Watry Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Aaron Watry <awatry@gmail.com> NOTE: This is a candidate for the 3.3 branch. llvm-svn: 181577	2013-05-10 02:09:29 +00:00
Tom Stellard	a99c6ae47a	R600: Expand vselect for v4i32 and v2i32 v2: Add vselect v4i32 test Patch by: Aaron Watry Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Aaron Watry <awatry@gmail.com> NOTE: This is a candidate for the 3.3 branch. llvm-svn: 181576	2013-05-10 02:09:24 +00:00
Tom Stellard	f787ef1d96	R600/SI: Add intrinsic for MIMG IMAGE_GET_RESINFO opcode Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 181269	2013-05-06 23:02:19 +00:00
Tom Stellard	e363dbf7eb	R600/SI: Handle arbitrary destination type in SITargetLowering::adjustWritemask Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 181268	2013-05-06 23:02:15 +00:00
Tom Stellard	353b336e8c	R600/SI: Add intrinsic for texture image loading Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 181267	2013-05-06 23:02:12 +00:00
Tom Stellard	c932d7329c	R600/SI: Add pattern for uint_to_fp Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 181266	2013-05-06 23:02:07 +00:00
Tom Stellard	cf6452c7d4	R600/SI: Add patterns for integer maxima / minima Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 181265	2013-05-06 23:02:04 +00:00
Tom Stellard	9b3d2535bf	R600/SI: Add pattern for AMDGPU.trunc intrinsic Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 181263	2013-05-06 23:02:00 +00:00
Tom Stellard	d93cede8e4	R600: Remove dead code from the CodeEmitter v2 v2: - Replace switch statement with TSFlags query Reviewed-by: Vincent Lejeune <vljn@ovi.com> Tested-By: Aaron Watry <awatry@gmail.com> llvm-svn: 181229	2013-05-06 17:50:57 +00:00
Tom Stellard	043de4c5af	R600: Emit config values in register / value pairs Reviewed-by: Vincent Lejeune <vljn@ovi.com> Tested-By: Aaron Watry <awatry@gmail.com> llvm-svn: 181228	2013-05-06 17:50:51 +00:00
Tom Stellard	cfe2ef8fea	R600: Stop emitting the instruction type byte before each instruction Reviewed-by: Vincent Lejeune <vljn@ovi.com> Tested-By: Aaron Watry <awatry@gmail.com> llvm-svn: 181225	2013-05-06 17:50:44 +00:00
Tom Stellard	dbbcaf31b6	R600: Emit ISA for CALL_FS_* instructions Reviewed-by: Vincent Lejeune <vljn@ovi.com> Tested-By: Aaron Watry <awatry@gmail.com> llvm-svn: 181223	2013-05-06 17:50:26 +00:00
Tom Stellard	4489b85f2b	R600: Expand vector or, shl, srl, and xor nodes llvm-svn: 181035	2013-05-03 17:21:31 +00:00
Tom Stellard	6a6ecedcb7	R600: BFI_INT is a vector-only instruction llvm-svn: 181034	2013-05-03 17:21:24 +00:00
Tom Stellard	eac65dde30	R600: Add pattern for SHA-256 Ma function This can be optimized using the BFI_INT instruction. llvm-svn: 181033	2013-05-03 17:21:20 +00:00
Tom Stellard	c2516c6e40	R600: Clean up comments in Processors.td llvm-svn: 181032	2013-05-03 17:21:14 +00:00
Vincent Lejeune	ddd43383ef	R600: Signed literals are 64bits wide llvm-svn: 180960	2013-05-02 21:53:03 +00:00
Vincent Lejeune	2a44ae0053	R600: If previous bundle is dot4, PV valid chan is always X llvm-svn: 180959	2013-05-02 21:52:55 +00:00
Vincent Lejeune	b0422e24a9	R600: Improve asmPrint of ALU clause llvm-svn: 180957	2013-05-02 21:52:40 +00:00
Vincent Lejeune	f97af796a9	R600: Prettier asmPrint of Alu llvm-svn: 180956	2013-05-02 21:52:30 +00:00
Tom Stellard	40b7f1f6c3	R600: Use new tablegen syntax for patterns All but two patterns have been converted to the new syntax. The remaining two patterns will require COPY_TO_REGCLASS instructions, which the VLIW DAG Scheduler cannot handle. llvm-svn: 180922	2013-05-02 15:30:12 +00:00
Tom Stellard	5447ae20ff	R600/SI: remove nonsense select pattern Fortunately this pattern never matched, otherwise we would have generated incorrect code. Signed-off-by: Christian K??nig <christian.koenig@amd.com> llvm-svn: 180921	2013-05-02 15:30:07 +00:00
Vincent Lejeune	3a8d78a2c3	R600: Always use texture cache for compute shaders This will improve the performance of memory reads. llvm-svn: 180762	2013-04-30 00:14:44 +00:00
Vincent Lejeune	3abdbf1cad	R600: use native for alu llvm-svn: 180761	2013-04-30 00:14:38 +00:00
Vincent Lejeune	147700b8b4	R600: Packetize instructions llvm-svn: 180760	2013-04-30 00:14:27 +00:00
Vincent Lejeune	076c0b28e3	R600: Rework Scheduling to handle difference between VLIW4 and VLIW5 chips llvm-svn: 180759	2013-04-30 00:14:17 +00:00
Vincent Lejeune	22c4248213	R600: Add a Bank Swizzle operand llvm-svn: 180758	2013-04-30 00:14:08 +00:00
Vincent Lejeune	7c395f77de	R600: Take inner dependency into tex/vtx clauses llvm-svn: 180757	2013-04-30 00:14:00 +00:00
Vincent Lejeune	3f1d136b02	R600: Turn TEX/VTX into native instructions llvm-svn: 180756	2013-04-30 00:13:53 +00:00
Vincent Lejeune	c299164284	R600: Add FetchInst bit to instruction defs to denote vertex/tex instructions v2[Vincent Lejeune]: Split FetchInst into usesTextureCache/usesVertexCache llvm-svn: 180755	2013-04-30 00:13:39 +00:00
Vincent Lejeune	7d820c0bef	R600: Add some new processor variants llvm-svn: 180753	2013-04-30 00:13:27 +00:00
Vincent Lejeune	f501ea298b	R600: Clean up instruction class definitions llvm-svn: 180752	2013-04-30 00:13:20 +00:00
Vincent Lejeune	4a0beb5207	R600: config section now reports use of killgt llvm-svn: 180751	2013-04-30 00:13:13 +00:00
Tom Stellard	119ad03c67	R600: Use correct CF_END instruction on Northern Island GPUs llvm-svn: 180735	2013-04-29 22:23:58 +00:00
Tom Stellard	8367067e02	R600: Fix encoding of CF_END_{EG, R600} instructions The EOP bit was not being encoded. llvm-svn: 180734	2013-04-29 22:23:54 +00:00
Tom Stellard	456adc6c4e	R600: Initialize AMDGPUMachineFunction::ShaderType to ShaderType::COMPUTE We need to intialize this to something and since clang does not set the shader type attribute and clang is used only for compute shaders, initializing it to COMPUTE seems like the best choice. Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 180620	2013-04-26 18:32:24 +00:00
Tom Stellard	87047f69ad	R600: Initialize BooleanVectorContents Fixes test/CodeGen/R600/setcc.ll llvm-svn: 180231	2013-04-24 23:56:18 +00:00
Tom Stellard	34e4068d05	R600: Use SHT_PROGBITS for the .AMDGPU.config section The libelf implementation that is distributed here: http://www.mr511.de/software/english.html will not parse sections that are marked SHT_NULL. llvm-svn: 180230	2013-04-24 23:56:14 +00:00
Vincent Lejeune	117f075f6e	R600: Use .AMDGPU.config section to emit stacksize llvm-svn: 180124	2013-04-23 17:34:12 +00:00
Vincent Lejeune	b6bfe85a07	R600: Add CF_END llvm-svn: 180123	2013-04-23 17:34:00 +00:00
Matt Arsenault	034ca0fe41	Remove unused DwarfSectionOffsetDirective string The value isn't actually used, and setting it emits a COFF specific directive. llvm-svn: 180064	2013-04-22 22:49:11 +00:00
Michael Liao	b53d8963ce	ArrayRefize getMachineNode(). No functionality change. llvm-svn: 179901	2013-04-19 22:22:57 +00:00
Tom Stellard	9d10c4ce86	R600: Add pattern for the BFI_INT instruction llvm-svn: 179830	2013-04-19 02:11:06 +00:00
Tom Stellard	ea977bc0e3	R600/SI: Use InstFlag for VOP3 modifier operands InstFlag has a default value of 0 and will simplify the VOP3 patterns. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 179829	2013-04-19 02:11:00 +00:00
Vincent Lejeune	2d5c341cee	R600: Make Export Instruction not duplicable llvm-svn: 179686	2013-04-17 15:17:39 +00:00
Vincent Lejeune	218093e834	R600: Export is emitted as a CF_NATIVE inst llvm-svn: 179685	2013-04-17 15:17:32 +00:00
Vincent Lejeune	98a7380859	R600: Emit used GPRs count llvm-svn: 179684	2013-04-17 15:17:25 +00:00
Tom Stellard	cb97e3acfa	R600/SI: Emit config values in register value pairs. Instead of emitting config values in a predefined order, the code emitter will now emit a 32-bit register index followed by the 32-bit config value. llvm-svn: 179546	2013-04-15 17:51:35 +00:00
Tom Stellard	3a7beafb32	R600/SI: Emit configuration value in the .AMDGPU.config ELF section llvm-svn: 179545	2013-04-15 17:51:30 +00:00
Tom Stellard	9991659fab	R600: Emit ELF formatted code rather than raw ISA. llvm-svn: 179544	2013-04-15 17:51:21 +00:00
NAKAMURA Takumi	3ee2b1e26f	R600ControlFlowFinalizer.cpp: Fix a warning. [-Wunused-variable] llvm-svn: 179263	2013-04-11 04:16:27 +00:00
NAKAMURA Takumi	3b0853be56	Whitespace. llvm-svn: 179262	2013-04-11 04:16:22 +00:00
Michel Danzer	8caa904bde	R600/SI: Add pattern for AMDGPUurecip 21 more little piglits with radeonsi. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 179186	2013-04-10 17:17:56 +00:00
Vincent Lejeune	04d9aa4822	R600: Add VTX_READ_* and RAT_WRITE_CACHELESS_* when computing cf addr llvm-svn: 179174	2013-04-10 13:29:20 +00:00
Christian Konig	8b1ed28ef1	R600/SI: dynamical figure out the reg class of MIMG Depending on the number of bits set in the writemask. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 179166	2013-04-10 08:39:16 +00:00
Christian Konig	8e06e2a8c4	R600/SI: adjust writemask to only the used components Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 179165	2013-04-10 08:39:08 +00:00
Christian Konig	4ace663255	R600/SI: remove image sample writemask Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 179164	2013-04-10 08:39:01 +00:00
Vincent Lejeune	5f11dd390a	R600: Control Flow support for pre EG gen llvm-svn: 179020	2013-04-08 13:05:49 +00:00
Tom Stellard	754f80ff3a	R600/SI: Add support for buffer stores v2 v2: - Use the ADDR64 bit Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 178931	2013-04-05 23:31:51 +00:00
Tom Stellard	6db08eb42f	R600/SI: Use same names for corresponding MUBUF operands and encoding fields The code emitter knows how to encode operands whose name matches one of the encoding fields. If there is no match, the code emitter relies on the order of the operand and field definitions to determine how operands should be encoding. Matching by order makes it easy to accidentally break the instruction encodings, so we prefer to match by name. Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 178930	2013-04-05 23:31:44 +00:00
Tom Stellard	60174bb9ca	R600: Add RV670 processor This is an R600 GPU with double support. Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 178929	2013-04-05 23:31:40 +00:00
Tom Stellard	2f21c7e551	R600/SI: Add processor types for each SI variant Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 178928	2013-04-05 23:31:35 +00:00
Tom Stellard	edbf1eb42b	R600/SI: Avoid generating S_MOVs with 64-bit immediates v2 SITargetLowering::analyzeImmediate() was converting the 64-bit values to 32-bit and then checking if they were an inline immediate. Some of these conversions caused this check to succeed and produced S_MOV instructions with 64-bit immediates, which are illegal. v2: - Clean up logic Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 178927	2013-04-05 23:31:20 +00:00
Vincent Lejeune	bcbb13d691	R600: Use a mask for offsets when encoding instructions llvm-svn: 178763	2013-04-04 14:00:09 +00:00
Vincent Lejeune	8e377fdba6	R600: Fix wrong address when substituting ENDIF llvm-svn: 178762	2013-04-04 14:00:03 +00:00
Vincent Lejeune	c44fa99719	R600: Take export into account when computing cf address llvm-svn: 178761	2013-04-04 13:59:59 +00:00
Vincent Lejeune	c3d3f9b66e	R600: Fix last ALU of a clause being emitted in a separate clause llvm-svn: 178675	2013-04-03 18:24:47 +00:00
Vincent Lejeune	80031d9fc4	R600: Factorize maximum alu per clause in a single location llvm-svn: 178667	2013-04-03 16:49:34 +00:00
Vincent Lejeune	b6d6c0d458	R600: Simplify data structure and add DEBUG to R600ControlFlowFinalizer llvm-svn: 178665	2013-04-03 16:24:09 +00:00
Vincent Lejeune	9931298b30	R600: Consider KILLGT as an ALU instruction Mesa does not override llvm behavior wrt KILLGT anymore so llvm has to handle KILLGT on its own. llvm-svn: 178664	2013-04-03 16:24:04 +00:00
NAKAMURA Takumi	fd98f7f2b6	Target/R600: Fix CMake build to add missing files. llvm-svn: 178508	2013-04-01 22:05:58 +00:00
Vincent Lejeune	bfaa63a6db	R600: Add support for native control flow llvm-svn: 178505	2013-04-01 21:48:05 +00:00
Vincent Lejeune	ace6f7351e	R600/SI: Share code recording ShaderTypeAttribute between generations llvm-svn: 178504	2013-04-01 21:47:53 +00:00
Vincent Lejeune	f43bc57b66	R600: Emit CF_ALU and use true kcache register. llvm-svn: 178503	2013-04-01 21:47:42 +00:00
Vincent Lejeune	53f3525d35	R600: Emit native instructions for tex llvm-svn: 178452	2013-03-31 19:33:04 +00:00
Eric Christopher	6c75232cf0	These two are default in the constructor for MCAsmInfo. llvm-svn: 178293	2013-03-28 21:37:18 +00:00
Christian Konig	08f5929942	R600/SI: add SETO/SETUO patterns 6 more piglit tests. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 178145	2013-03-27 15:27:31 +00:00
Christian Konig	3c14580acb	R600/SI: add cummuting of rev instructions Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 178127	2013-03-27 09:12:59 +00:00
Christian Konig	70a5032c1b	R600/SI: add mulhu/mulhs patterns Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 178126	2013-03-27 09:12:51 +00:00
Christian Konig	20a7e6b764	R600/SI: add srl/sha patterns for SI Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 178125	2013-03-27 09:12:44 +00:00
NAKAMURA Takumi	3234178bf9	R600/SIMCCodeEmitter.cpp: Prune a couple of unused members, STI and Ctx. [-Wunused-private-field] llvm-svn: 178065	2013-03-26 19:42:48 +00:00
Christian Konig	8370dbbffd	R600/SI: improve post ISel folding Not only fold immediates, but avoid unnecessary copies as well. Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 178024	2013-03-26 14:04:17 +00:00
Christian Konig	082c661f94	R600/SI: improve vector interpolation Prevent loading M0 multiple times. Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 178023	2013-03-26 14:04:12 +00:00
Christian Konig	25ce3e9f4c	R600/SI: avoid unecessary subreg extraction in IMAGE_SAMPLE Just define the address as unknown instead of VReg_32. Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 178022	2013-03-26 14:04:07 +00:00
Christian Konig	eecebd0bab	R600/SI: switch back to RegPressure scheduling Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 178021	2013-03-26 14:04:02 +00:00
Christian Konig	727d06de1d	R600/SI: mark most intrinsics as readnone v2 They read from constant register space anyway. v2: fix lit tests Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 178020	2013-03-26 14:03:57 +00:00
Christian Konig	737d4a1665	R600/SI: replace WQM intrinsic Just enable WQM when we see an LDS interpolation instruction. Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 178019	2013-03-26 14:03:50 +00:00
Christian Konig	6a9d390b6b	R600/SI: fix ELSE pseudo op handling Restore the EXEC mask early, otherwise a copy might end up not beeing executed. Candidate for the mesa stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 178018	2013-03-26 14:03:44 +00:00
Christian Konig	90b45124cd	R600: fix DenseMap with pointer key iteration in the structurizer Use a MapVector on types where the iteration order matters. Otherwise we doesn't always produce a deterministic output. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 177999	2013-03-26 10:24:20 +00:00
Michel Danzer	a2e28156b4	R600: Use legacy (0 * anything = 0) MUL instructions for pow intrinsics Fixes wrong lighting in some corner cases with r600g and radeonsi, e.g. manifested by failure of two piglit/glean tests and intermittent black patches in many apps. Tested on SI and RS880. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=62012 [radeonsi] Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=58150 [r600g] NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 177730	2013-03-22 14:09:10 +00:00
Christian Konig	2989ffcacc	R600/SI: implement indirect adressing for SI Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 177277	2013-03-18 11:34:16 +00:00
Christian Konig	4a1b9c3bb9	R600/SI: add float vector types Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 177276	2013-03-18 11:34:10 +00:00
Christian Konig	082a14a88a	R600/SI: add shl pattern Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 177275	2013-03-18 11:34:05 +00:00
Christian Konig	7a14a47e7a	R600/SI: add BUFFER_LOAD_DWORD pattern Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 177274	2013-03-18 11:34:00 +00:00
Christian Konig	49374087f5	R600/SI: implement SI.load.const intrinsic Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 177273	2013-03-18 11:33:55 +00:00
Christian Konig	9c7afd114f	R600/SI: enable all S_LOAD and S_BUFFER_LOAD opcodes Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 177272	2013-03-18 11:33:50 +00:00
Christian Konig	f1fd5fad93	R600/SI: fix inserting waits for all defines Unfortunately the previous fix for inserting waits for unordered defines wasn't sufficient, cause it's possible that even ordered defines are only partially used (or not used at all). Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 177271	2013-03-18 11:33:45 +00:00
Vincent Lejeune	0a22bc4156	R600: Factorize code handling Const Read Port limitation llvm-svn: 177078	2013-03-14 15:50:45 +00:00
Vincent Lejeune	14c3fd8480	R600: Remove unused Outputs variable llvm-svn: 176967	2013-03-13 20:13:25 +00:00
Vincent Lejeune	e5ecf10a02	R600: Fix JUMP handling so that MachineInstr verification can occur This allows R600 Target to use the newly created -verify-misched llc flag llvm-svn: 176819	2013-03-11 18:15:06 +00:00
NAKAMURA Takumi	756cf8867a	R600MachineScheduler.cpp: Fix use cases of dbgs(). Don't include <iostream> here. llvm-svn: 176797	2013-03-11 08:19:28 +00:00
Tom Stellard	5e524897ed	R600: Optimize another selectcc case fold selectcc (selectcc x, y, a, b, cc), b, a, b, setne -> selectcc x, y, a, b, cc Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 176700	2013-03-08 15:37:11 +00:00
Tom Stellard	2add82de09	R600: Improve custom lowering of select_cc Two changes: 1. Prefer SET* instructions when possible 2. Handle the CND*_INT case with floating-point args Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 176699	2013-03-08 15:37:09 +00:00
Tom Stellard	492ebeabe9	R600: Change operation action from Custom to Expand for BR_CC Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 176698	2013-03-08 15:37:07 +00:00
Tom Stellard	e8f9f2877b	R600: Change operation action from Custom to Expand for SETCC Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 176697	2013-03-08 15:37:05 +00:00
Tom Stellard	b852af5dc4	R600: Set BooleanContents to ZeroOrNegativeOneBooleanContent Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 176696	2013-03-08 15:37:03 +00:00
Michel Danzer	f52a672bf5	R600/SI: Use source scheduler This is certainly not the last word on scheduling for this target, but right now this allows a few apps to run / finish with radeonsi, most notably UT2004 / Lightsmark. They fail to compile some shaders with the default scheduler because it ends up trying to spill registers, which we don't support yet (and which is probably a bad idea in general for performance if it can be avoided). NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176687	2013-03-08 10:58:01 +00:00
Christian Konig	99ee0f4790	R600/SI: rework input interpolation v2 v2: update CMakeLists.txt as well Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176626	2013-03-07 09:04:14 +00:00
Christian Konig	aa9f4e6d3a	R600/SI: remove SI_vs_load_buffer_index Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176625	2013-03-07 09:04:04 +00:00
Christian Konig	189357c6b2	R600/SI: remove SGPR address space v2 v2: fix R600 regressions Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176624	2013-03-07 09:03:59 +00:00
Christian Konig	2c8f6d5376	R600/SI: add proper formal parameter handling for SI Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176623	2013-03-07 09:03:52 +00:00
Christian Konig	3625055b8c	R600/SI: remove shader type intrinsic Just encode the type as target specific attribute. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176622	2013-03-07 09:03:46 +00:00
Christian Konig	2214f14ab9	R600/SI: switch types of SGPRs to v*i8 Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176621	2013-03-07 09:03:38 +00:00
Christian Konig	a0ed657293	R600/SI: fix unused variable warning Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176620	2013-03-07 09:03:30 +00:00
Vincent Lejeune	fe32bd87c2	R600: Do not predicate vector op llvm-svn: 176507	2013-03-05 19:12:06 +00:00
Benjamin Kramer	5dc831801a	Update cmake build. llvm-svn: 176501	2013-03-05 18:54:05 +00:00
Vincent Lejeune	68b6b6ddfb	R600: initial scheduler code This is a skeleton for a pre-RA MachineInstr scheduler strategy. Currently it only tries to expose more parallelism for ALU instructions (this also makes the distribution of GPR channels more uniform and increases the chances of ALU instructions to be packed together in a single VLIW group). Also it tries to reduce clause switching by grouping instruction of the same kind (ALU/FETCH/CF) together. Vincent Lejeune: - Support for VLIW4 Slot assignement - Recomputation of ScheduleDAG to get more parallelism opportunities Tom Stellard: - Fix assertion failure when trying to determine an instruction's slot based on its destination register's class - Fix some compiler warnings Vincent Lejeune: [v2] - Remove recomputation of ScheduleDAG (will be provided in a later patch) - Improve estimation of an ALU clause size so that heuristic does not emit cf instructions at the wrong position. - Make schedule heuristic smarter using SUnit Depth - Take constant read limitations into account Vincent Lejeune: [v3] - Fix some uninitialized values in ConstPair - Add asserts to ensure an ALU slot is always populated llvm-svn: 176498	2013-03-05 18:41:32 +00:00
Vincent Lejeune	0b72f1021d	R600: Remove LowerConstCopyPass and lower CONST_COPY right after ISel. Maintaining CONST_COPY Instructions until Pre Emit may prevent some ifcvt case and taking them in account for scheduling is difficult for no real benefit. llvm-svn: 176488	2013-03-05 15:04:55 +00:00
Vincent Lejeune	3b6f20e944	R600: Turn BUILD_VECTOR into Reg_Sequence Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 176487	2013-03-05 15:04:49 +00:00
Vincent Lejeune	10a5e4773e	R600: CONST_ADDRESS node is not marked as mayLoad anymore Reviewed-by: Tom Stellard <thomas.stellard at amd.com> mayLoad complexify scheduling and does not bring any usefull info as the location is not writeable at all. llvm-svn: 176486	2013-03-05 15:04:42 +00:00
Vincent Lejeune	a199d01e4d	R600: Use MUL_IEEE for trig/fdiv intrinsic Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 176485	2013-03-05 15:04:37 +00:00
Vincent Lejeune	743dca0446	R600: Add support for indirect addressing of non default const buffer NOTE: This is a candidate for the Mesa stable branch. llvm-svn: 176484	2013-03-05 15:04:29 +00:00
Tom Stellard	b2f2f960ce	R600: Clean up datalayout strings so they better match hardware capabilities llvm-svn: 176439	2013-03-04 17:40:28 +00:00
Christian Konig	d0e3da1818	R600/SI: handle all registers in copyPhysReg v2 v2: based on Michels patch, but now allows copying of all registers sizes. Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 176346	2013-03-01 09:46:27 +00:00
Christian Konig	1f344cda53	R600/SI: remove S_MOV immediate patterns They won't match anyway. Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 176345	2013-03-01 09:46:22 +00:00
Christian Konig	8465296420	R600/SI: remove GPR*AlignEncode It's much easier to specify the encoding with tablegen directly. Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 176344	2013-03-01 09:46:17 +00:00
Christian Konig	01fd1f6b36	R600/SI: fix warning about overloaded virtual Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 176343	2013-03-01 09:46:11 +00:00
Christian Konig	862fd9fa2c	R600/SI: fix inserting waits for unordered defines Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 176342	2013-03-01 09:46:04 +00:00
Christian Konig	e500e445c5	R600/SI: Add promotion of e32 to e64 in operand folding Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176105	2013-02-26 17:52:47 +00:00
Christian Konig	f741fbfb1b	R600/SI: add VOP mapping functions Make it possible to map between e32 and e64 encoding opcodes. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176104	2013-02-26 17:52:42 +00:00
Christian Konig	6612ac39c9	R600/SI: swap operands if it helps folding Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176103	2013-02-26 17:52:36 +00:00
Christian Konig	76edd4f2bc	R600/SI: add some more instruction flags Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176102	2013-02-26 17:52:29 +00:00
Christian Konig	f82901af2a	R600/SI: add post ISel folding for SI v2 Include immediate folding and SGPR limit handling for VOP3 instructions. v2: remove leftover hasExtraSrcRegAllocReq Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176101	2013-02-26 17:52:23 +00:00
Christian Konig	d910b7d534	R600/SI: add folding helper Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176100	2013-02-26 17:52:16 +00:00
Christian Konig	d303996918	R600/SI: fix VOP3b encoding v2 v2: document why we hardcode VCC for now. This is a candidate for the mesa-stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176099	2013-02-26 17:52:09 +00:00
Christian Konig	0f0a8fe2dd	R600/SI: fix and cleanup SI register definition v2 Prevent producing real strange tablegen code by using proper register sizes, alignments and hierarchy. Also cleanup the unused definitions and add some comments. v2: add SGPR 512 bit registers, stop registers from wrapping around, fix SGPR alignment This is a candidate for the mesa-stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176098	2013-02-26 17:52:03 +00:00
Christian Konig	d76ed54b60	R600/SI: fix stupid typo This is a candidate for the mesa-stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176097	2013-02-26 17:51:57 +00:00
Michel Danzer	0cc991e17b	R600/SI: Add pattern for sign extension of i1 to i32. 16 more little piglits with radeonsi. NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175887	2013-02-22 11:22:58 +00:00
Michel Danzer	00fb283560	R600/SI: Add pattern for logical or of i1 values. 24 more little piglits with radeonsi. NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175886	2013-02-22 11:22:54 +00:00
Michel Danzer	c3ea4041b9	R600/SI: Add pattern for fceil. 9 more little piglits with radeonsi. NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175885	2013-02-22 11:22:49 +00:00
Christian Konig	71088e68e8	R600/SI: inline V_ADD\|SUB_F32 patterns Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175758	2013-02-21 15:17:41 +00:00
Christian Konig	7c9de8e6e8	R600/SI: replace IMPLICIT_DEF with SIOperand.ZERO Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175757	2013-02-21 15:17:36 +00:00
Christian Konig	2aca043312	R600/SI: replace SI_V_CNDLT with a pattern It actually fixes quite a bunch of piglit tests. This is a candidate for the mesa-stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175756	2013-02-21 15:17:32 +00:00
Christian Konig	8dbe6f617c	R600/SI: use patterns for clamp, fabs, fneg Instead of using custom inserters, it's simpler and should make DAG folding easier. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175755	2013-02-21 15:17:27 +00:00
Christian Konig	bf114b42a8	R600/SI: add all the other missing asm operands v2 v2: put implicit parameters in [] Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175754	2013-02-21 15:17:22 +00:00
Christian Konig	08e768b4cf	R600/SI: add the missing M*BUF\|IMG asm operands Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175753	2013-02-21 15:17:17 +00:00
Christian Konig	e0130a2f25	R600/SI: add the missing S_* asm operands Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175752	2013-02-21 15:17:13 +00:00
Christian Konig	f5754a011d	R600/SI: rework VOP3 classes Order the classes and add asm operands. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175751	2013-02-21 15:17:09 +00:00
Christian Konig	b19849a682	R600/SI: simplify VOPC_* pattern v2 Fixing asm operation names. v2: fix name of the e64 encoding, also add asm operands Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175750	2013-02-21 15:17:04 +00:00
Christian Konig	ae034e63f1	R600/SI: rework VOP2_* pattern v2 Fixing asm operation names. v2: use ZERO constant, also add asm operands Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175749	2013-02-21 15:16:58 +00:00
Christian Konig	3da7017e81	R600/SI: rework VOP1_* patterns v2 Fixing asm operation names. v2: use ZERO constant, also add asm operands Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175748	2013-02-21 15:16:53 +00:00
Christian Konig	eabf8333d6	R600/SI: add constant for inline zero operand Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175747	2013-02-21 15:16:49 +00:00
Christian Konig	72d5d5c754	R600/SI: cleanup SIInstrInfo.td and SIInstrFormat.td Those two files got mixed up. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 175746	2013-02-21 15:16:44 +00:00
Tom Stellard	0d171c8877	R600: Fix for Unigine when MachineSched is enabled Fixes for-loop.cl piglit test Patch By: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> NOTE: This is a candidate for the Mesa stable branch. llvm-svn: 175742	2013-02-21 15:06:59 +00:00
Michel Danzer	7f02a8c7a7	R600/SI: Make sure M0 is loaded for V_INTERP_MOV_F32 NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175733	2013-02-21 08:57:10 +00:00
Jim Grosbach	d15cd2a11c	R600: Update for name changes from r175667. llvm-svn: 175668	2013-02-20 21:31:28 +00:00
Tom Stellard	d4409e2cec	R600: Add AR_X to the R600_TReg_X register class. NOTE: This is a candidate for the Mesa stable branch. llvm-svn: 175519	2013-02-19 15:22:47 +00:00
Tom Stellard	a24a516737	R600: Mark all members of the TRegMem register class as reserved This stops the Machine Verifier from complaining about uses of undefined physical registers. NOTE: This is a candidate for the Mesa stable branch. llvm-svn: 175518	2013-02-19 15:22:45 +00:00
Tom Stellard	8d469edbe3	R600: Fix scheduler crash caused by invalid MachinePointerInfo Kernel function arguments are lowered to loads from the PARAM_I address space. When creating these load instructions, we were initializing their MachinePointerInfo with an Arguement object that was not attached to any function. This was causing the MachineScheduler to crash when it tried to access the parent of the Arguement. This has been fixed by initializing the MachinePointerInfo with a UndefValue instead. NOTE: This is a candidate for the Mesa stable branch. llvm-svn: 175517	2013-02-19 15:22:44 +00:00
Tom Stellard	0f965aaf9b	R600: Fix tracking of implicit defs in the IndirectAddressing pass In some cases, we were losing track of live implicit registers which was creating dead defs and causing the scheduler to produce invalid code. NOTE: This is a candidate for the Mesa stable branch. llvm-svn: 175516	2013-02-19 15:22:42 +00:00
David Blaikie	772d4f75f6	Use LLVM_DELETED_FUNCTION rather than '// do not implement' comments. Also removes some redundant DNI comments on function declarations already using the macro. llvm-svn: 175466	2013-02-18 23:11:17 +00:00
Vincent Lejeune	1ce13f553e	R600/SI: Use MULADD_IEEE/V_MAD_F32 instruction for mad pattern llvm-svn: 175446	2013-02-18 14:11:28 +00:00
Vincent Lejeune	685018009b	R600: Support for TBO NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 175445	2013-02-18 14:11:19 +00:00
Vincent Lejeune	4c1602b5c9	R600: Increase number of ArrayBase Reg to 32 Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 175443	2013-02-18 13:48:09 +00:00
NAKAMURA Takumi	eddbc713e1	Target/R600/CMakeLists.txt: Prune SILowerLiteralConstants.cpp corresponding to r175354. llvm-svn: 175361	2013-02-16 15:30:28 +00:00
Christian Konig	b559b079b4	R600/SI: Add pattern to simplify i64 loading This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175356	2013-02-16 11:28:36 +00:00
Christian Konig	a881179ffe	R600/SI: nuke SReg_1 v3 It's completely unnecessary and can be replace with proper SReg_64 handling instead. This actually fixes a piglit test on SI. v2: use correct register class in addRegisterClass, set special classes as not allocatable v3: revert setting special classes as not allocateable This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175355	2013-02-16 11:28:30 +00:00
Christian Konig	c756cb9901	R600/SI: cleanup literal handling v3 Seems to be allot simpler, and also paves the way for further improvements. v2: rebased on master, use 0 in BUFFER_LOAD_FORMAT_XYZW, use VGPR0 in dummy EXP, avoid compiler warning, break after encoding the first literal. v3: correctly use V_ADD_F32_e64 This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175354	2013-02-16 11:28:22 +00:00
Christian Konig	b9e281a723	R600/SI: replace AllReg_* with [SV]Src_* v2 Mark all the operands that can also have an immediate. v2: SOFFSET is also an SSrc_32 operand This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175353	2013-02-16 11:28:13 +00:00
Christian Konig	3c3a7bfb06	R600/SI: fix VOPC encoding v2 Previously it only worked because of coincident. v2: fix 64bit versions, use 0x80 (inline 0) instead of SGPR0 for the unused SRC2 This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175352	2013-02-16 11:28:07 +00:00
Christian Konig	e3cba88714	R600/SI: move *_Helper definitions to SIInstrFormat.td This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175351	2013-02-16 11:28:02 +00:00
Christian Konig	8590c1e371	R600/SI: remove some more unused code This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175350	2013-02-16 11:27:56 +00:00
Christian Konig	d886099f13	R600/structurizer: improve inverting conditions Stop adding more instructions than necessary. This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175349	2013-02-16 11:27:50 +00:00
Christian Konig	fc6a985c12	R600/structurizer: improve loop handling Generate more than one loop if it seems to make sense. This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175348	2013-02-16 11:27:45 +00:00
Christian Konig	b5d8866b84	R600/structurizer: improve finding condition values Using the new NearestCommonDominator class. This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175347	2013-02-16 11:27:40 +00:00
Christian Konig	0bccf9d60b	R600/structurizer: improve PHI value finding Using the new NearestCommonDominator class. This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175346	2013-02-16 11:27:35 +00:00
Christian Konig	d08e3d753e	R600/structurizer: add class to find the Nearest Common Dominator This is a candidate for the stable branch. Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175345	2013-02-16 11:27:29 +00:00
Michel Danzer	e9bb18b555	R600/SI: Fix int_SI_fs_interp_constant The important fix is that the constant interpolation value is stored in the parameter slot P0, which is encoded as 2. In addition, drop the SI_INTERP_CONST pseudo instruction, pass the parameter slot as an operand to V_INTERP_MOV_F32 instead of hardcoding it there, and add a special operand class for the parameter slots for type checking and pretty printing. NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175193	2013-02-14 19:03:25 +00:00
Vincent Lejeune	f940fd05bd	R600: Do not fold single instruction with more that 3 kcache read It fixes around 100 tfb piglit tests and 16 glean tests. NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 175183	2013-02-14 16:57:19 +00:00
Vincent Lejeune	ea710fe419	R600: Export instructions are no longer terminator This allows MachineInstScheduler to reorder them, and thus make scheduling more efficient. Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 175182	2013-02-14 16:55:11 +00:00
Vincent Lejeune	d80bc1561a	R600: Fold zero/one in export instructions Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 175181	2013-02-14 16:55:06 +00:00
Vincent Lejeune	f694c10c8e	R600: Do not fold modifier/litterals in vector inst This fixes a couple of regressions on (probably not just) cayman NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 175180	2013-02-14 16:55:01 +00:00
Michel Danzer	ae0a403dab	R600/SI: Check for empty stack in SIAnnotateControlFlow::isTopOfStack Fixes assertion failure in newly added lit test. Might just be a bandaid that needs to be revisited. llvm-svn: 175139	2013-02-14 08:00:33 +00:00
Tom Stellard	91da4e9199	R600: Add support for 128-bit parameters NOTE: This is a candidate for the Mesa stable branch. llvm-svn: 175096	2013-02-13 22:05:20 +00:00
Michel Danzer	3bb17ebd93	R600: Fix regression with shadow array sampler on pre-SI GPUs. 'R600/SI: Use proper instructions for array/shadow samplers.' removed two cases from TEX_SHADOW. Vincent Lejeune reported on IRC that this broke some shadow array piglit tests with the r600g driver. Reinstating the removed cases should fix this, and still works with radeonsi as well. I will follow up with some lit tests which would have caught the regression. NOTE: This is a candidate for the Mesa stable branch. Tested-by: Vincent Lejeune <vljn@ovi.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174963	2013-02-12 12:11:23 +00:00
Michel Danzer	10ed47f927	R600/SI: Use V_ADD_F32 instead of V_MOV_B32 for clamp/neg/abs modifiers. The modifiers don't seem to have any effect with V_MOV_B32, supposedly it's meant to just move bits untouched. Fixes 46 piglit tests with radeonsi, though unfortunately 11 of those had just regressed because they started using the clamp modifier. NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174890	2013-02-11 15:58:21 +00:00
Vincent Lejeune	44bf8158c5	Test Commit - Remove some trailing whitespace in R600Instructions.td llvm-svn: 174839	2013-02-10 17:57:33 +00:00
Tom Stellard	47d4201348	R600: Dump the function name when TargetLowering::LowerCall() fails Also output a more useful error message. NOTE: This is a candidate for the Mesa stable branch llvm-svn: 174763	2013-02-08 22:24:40 +00:00
Tom Stellard	7370ede2cd	R600: rework flow creation in the structurizer v2 This fixes a couple of bugs and incorrect assumptions, in total four more piglit tests now pass. v2: fix small bug in the dominator updating Patch by: Christian König Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 174762	2013-02-08 22:24:38 +00:00
Tom Stellard	048f14fd3b	R600: fix loop analyses in the structurizer Patch by: Christian König Intersecting loop handling was wrong. Signed-off-by: Christian König <christian.koenig@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 174761	2013-02-08 22:24:37 +00:00
Tom Stellard	7ec0e4fbe3	R600: fix PHI value adding in the structurizer Otherwise we sometimes produce invalid code. Patch by: Christian König Signed-off-by: Christian König <christian.koenig@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 174760	2013-02-08 22:24:35 +00:00
Tom Stellard	1c822a8929	R600/SI: cleanup VGPR encoding Remove all the unused code. Patch by: Christian König Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174656	2013-02-07 19:39:45 +00:00
Tom Stellard	aac1889a84	R600/SI: Handle VGPR64 destination in copyPhysReg(). Allows nexuiz to run with radeonsi. Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174655	2013-02-07 19:39:43 +00:00
Tom Stellard	ecacb8010d	R600/SI: Add pattern for mul. 20 more little piglits with radeonsi. Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174654	2013-02-07 19:39:42 +00:00
Tom Stellard	8909380e71	R600/SI: simplify and fix SMRD encoding The _SGPR variants where wrong. Patch by: Christian König Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174653	2013-02-07 19:39:40 +00:00
Tom Stellard	26075d58a2	R600/SI: add proper 64bit immediate support v2 v2: rebased on current upstream Patch by: Christian König Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174652	2013-02-07 19:39:38 +00:00
Tom Stellard	4ded0c1c42	R600: Add an explicit default processor This is for the case when no processor is passed to the backend. This prevents the '' is not a recognized processor for this target (ignoring processor) warning from being generated by clang. llvm-svn: 174651	2013-02-07 19:39:34 +00:00
Tom Stellard	462516b737	R600/SI: Use proper instructions for array/shadow samplers. Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174634	2013-02-07 17:02:14 +00:00
Tom Stellard	ae6c06e5de	R600/SI: Make sample intrinsic address parameter type overloaded. Handle vectors of 1 to 16 integers. Change the intrinsic names to prevent the wrong one from being selected at runtime due to the overloading. Patch By: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174633	2013-02-07 17:02:13 +00:00
Tom Stellard	538ceeb6e0	R600/SI: Add basic support for more integer vector types. v1i32, v2i32, v8i32 and v16i32. Only add VGPR register classes for integer vector types, to avoid attempts copying from VGPR to SGPR registers, which is not possible. Patch By: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174632	2013-02-07 17:02:09 +00:00
Michel Danzer	349cabed2f	R600/SI: Add pattern for flog2 22 more little piglits with radeonsi. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174615	2013-02-07 14:55:16 +00:00
Tom Stellard	9355b22180	R600: Consolidate sub register indices. Use sub0-15 everywhere. Patch by: Michel Dänzerr Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 174610	2013-02-07 14:02:37 +00:00
Tom Stellard	e06163a9a6	R600: Add support for SET_DX10 instructions These instructions compare two floating point values and return an integer true (-1) or false (0) value. When compiling code generated by the Mesa GLSL frontend, the SET_DX10 instructions save us four instructions for most branch decisions that use floating-point comparisons. llvm-svn: 174609	2013-02-07 14:02:35 +00:00
Tom Stellard	b40ada9b85	R600: Fix assembly name for SETGT_INT llvm-svn: 174607	2013-02-07 14:02:27 +00:00
Tom Stellard	f3b2a1e8b3	R600: Support for indirect addressing v4 Only implemented for R600 so far. SI is missing implementations of a few callbacks used by the Indirect Addressing pass and needs code to handle frame indices. At the moment R600 only supports array sizes of 16 dwords or less. Register packing of vector types is currently disabled, which means that a vec4 is stored in T0_X, T1_X, T2_X, T3_X, rather than T0_XYZW. In order to correctly pack registers in all cases, we will need to implement an analysis pass for R600 that determines the correct vector width for each array. v2: - Add support for i8 zext load from stack. - Coding style fixes v3: - Don't reserve registers for indirect addressing when it isn't being used. - Fix bug caused by LLVM limiting the number of SubRegIndex declarations. v4: - Fix 64-bit defines llvm-svn: 174525	2013-02-06 17:32:29 +00:00
Jakob Stoklund Olesen	fdc37670f6	Don't use MRI liveouts in R600. Something very strange is going on with the output registers in this target. Its ISelLowering code is inserting dangling CopyToReg nodes, hoping that those physregs won't get clobbered before the RETURN. This patch adds the output registers as implicit uses on RETURN instructions in the custom emission pass. I'd much prefer to have those CopyToReg nodes glued to the RETURNs, but I don't see how. llvm-svn: 174400	2013-02-05 17:53:52 +00:00
Tom Stellard	df063e617f	R600: Fold remaining CONST_COPY after expand pseudo inst Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174395	2013-02-05 17:09:16 +00:00
Tom Stellard	41afe6a6fe	R600: improve inputs/interpolation handling Use one intrinsic for all sorts of interpolation. Use two separate unexpanded instructions to represent INTERP_XY and _ZW - this will allow to eliminate one part if it's not used. Track liveness of special interpolation regs instead of reserving them - this will allow to reuse those regs, lowering reg pressure. Patch By: Vadim Girlin v2[Vincent Lejeune]: Rebased against current llvm master Signed-off-by: Vadim Girlin <vadimgirlin@gmail.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174394	2013-02-05 17:09:14 +00:00
Tom Stellard	2e5e7a5bef	R600: Emit function name in the AsmPrinter Emitting the function name allows us to check for it in the FileCheck tests so we can make sure FileCheck is checking the output of the correct function. llvm-svn: 174392	2013-02-05 17:09:11 +00:00
Tom Stellard	836cdd97fe	R600/SI: Add patterns for fcos and fsin. Fixes 37 piglit tests and allows e.g. FlightGear to run with radeonsi. Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174391	2013-02-05 17:09:10 +00:00
NAKAMURA Takumi	e1137a2058	Update AMDGPURegisterInfo::eliminateFrameIndex() corresponding to r174083. llvm-svn: 174106	2013-01-31 22:55:51 +00:00
Tom Stellard	4926921bd4	R600: Fold clamp, neg, abs Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174099	2013-01-31 22:11:54 +00:00
Tom Stellard	dd04c83a4d	R600: Consider bitcast when folding const_address node. Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174098	2013-01-31 22:11:53 +00:00
Tom Stellard	af1bce7d1d	R600: Make store_dummy intrinsic more general by passing export type Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 174097	2013-01-31 22:11:46 +00:00
NAKAMURA Takumi	978b5a0e02	R600/AMDILPeepholeOptimizer.cpp: Tweak std::make_pair to satisfy C++11. llvm-svn: 173807	2013-01-29 16:31:56 +00:00
Tom Stellard	6f1b8657f9	R600: Add a llvm.R600.store.swizzle intrinsics This intrinsic is translated to ALLOC_EXPORT_WORD1_SWIZ, hence its name. It is used to store vs/fs outputs Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 173297	2013-01-23 21:39:49 +00:00
Tom Stellard	d8ac91d436	R600: Simplify stream outputs intrinsic Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 173296	2013-01-23 21:39:47 +00:00
Tom Stellard	365366f9ef	R600: rework handling of the constants Remove Cxxx registers, add new special register - "ALU_CONST" and new operand for each alu src - "sel". ALU_CONST is used to designate that the new operand contains the value to override src.sel, src.kc_bank, src.chan for constants in the driver. Patch by: Vadim Girlin Vincent Lejeune: - Use pointers for constants - Fold CONST_ADDRESS when possible Tom Stellard: - Give CONSTANT_BUFFER_0 its own address space - Use integer types for constant loads Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 173222	2013-01-23 02:09:06 +00:00
Tom Stellard	ff62c35da0	R600: Add a CONST_ADDRESS node to model constant buf read Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 173221	2013-01-23 02:09:03 +00:00
Tom Stellard	ab28e9a30a	R600: Factorise VTX_WORD0 and VTX_WORD1 in tblgen def Patch by: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 173220	2013-01-23 02:09:01 +00:00
Tom Stellard	c9b903138d	R600/SI: Use unnormalized coordinates for sampling with the RECT target. Patch by: Michel Dänzer Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 173053	2013-01-21 15:40:48 +00:00
Tom Stellard	14421a793f	R600/SI: Take target parameter for sample intrinsics. Patch by: Michel Dänzer Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 173052	2013-01-21 15:40:47 +00:00
Tom Stellard	74dda0da31	R600/SI: Derive all sample intrinsics from a single class. Patch by: Michel Dänzer Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Reviewed-by: Christian König <christian.koenig@amd.com> Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 173051	2013-01-21 15:40:46 +00:00
NAKAMURA Takumi	c96fb1bd36	R600/SILowerControlFlow.cpp: Fix a warning. [-Wunused-variable] llvm-svn: 173040	2013-01-21 14:06:48 +00:00
Tom Stellard	c4cabef782	R600: Proper insert S_WAITCNT instructions Some instructions like memory reads/writes are executed asynchronously, so we need to insert S_WAITCNT instructions to block before accessing their results. Previously we have just inserted S_WAITCNT instructions after each async instruction, this patch fixes this and adds a prober insertion pass. Patch by: Christian König Tested-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 172846	2013-01-18 21:15:53 +00:00
Tom Stellard	be8ebeebf7	R600: Optimize and cleanup KILL on SI We shouldn't insert KILL optimization if we don't have a kill instruction at all. Patch by: Christian König Tested-by: Michel Dänzer <michel.daenzer@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 172845	2013-01-18 21:15:50 +00:00
Dmitri Gribenko	226fea5bd6	Remove redundant 'llvm::' qualifications llvm-svn: 172358	2013-01-13 16:01:15 +00:00
Eli Bendersky	4d9ada036c	Renamed MCInstFragment to MCRelaxableFragment and added some comments. No change in functionality. llvm-svn: 171822	2013-01-08 00:22:56 +00:00
NAKAMURA Takumi	458a8277cc	R600/SIISelLowering.cpp: Suppress a warning. [-Wunused-variable] llvm-svn: 171728	2013-01-07 11:14:44 +00:00
Chandler Carruth	9fb823bbd4	Move all of the header files which are involved in modelling the LLVM IR into their new header subdirectory: include/llvm/IR. This matches the directory structure of lib, and begins to correct a long standing point of file layout clutter in LLVM. There are still more header files to move here, but I wanted to handle them in separate commits to make tracking what files make sense at each layer easier. The only really questionable files here are the target intrinsic tablegen files. But that's a battle I'd rather not fight today. I've updated both CMake and Makefile build systems (I think, and my tests think, but I may have missed something). I've also re-sorted the includes throughout the project. I'll be committing updates to Clang, DragonEgg, and Polly momentarily. llvm-svn: 171366	2013-01-02 11:36:10 +00:00
Chandler Carruth	be81023d74	Resort the #include lines in include/... and lib/... with the utils/sort_includes.py script. Most of these are updating the new R600 target and fixing up a few regressions that have creeped in since the last time I sorted the includes. llvm-svn: 171362	2013-01-02 10:22:59 +00:00
Tom Stellard	09ef8425e9	R600: Coding style - remove empty spaces from the beginning of functions No functionality change. llvm-svn: 170923	2012-12-21 20:12:02 +00:00
Tom Stellard	41398026e7	R600: Fix MAX_UINT definition Patch by: Vadim Girlin Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 170922	2012-12-21 20:12:01 +00:00
Tom Stellard	4fa7ac29f1	R600: Add SHADOWCUBE to TEX_SHADOW pattern Patch by: Vadim Girlin Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 170921	2012-12-21 20:11:59 +00:00
Tom Stellard	a8b0351720	R600: Expand vec4 INT <-> FP conversions llvm-svn: 170901	2012-12-21 16:33:24 +00:00
NAKAMURA Takumi	2a0b40f584	Target/R600: Update MIB according to r170588. llvm-svn: 170620	2012-12-20 00:22:11 +00:00
Tom Stellard	1c315d5411	R600: Remove unecessary VREG alignment. Unlike SGPRs VGPRs doesn't need to be aligned. Patch by: Christian König Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 170593	2012-12-19 22:10:34 +00:00
Tom Stellard	e7b907d85c	R600: control flow optimization Branch if we have enough instructions so that it makes sense. Also remove branches if they don't make sense. Patch by: Christian König Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 170592	2012-12-19 22:10:33 +00:00
Tom Stellard	f8794354b2	R600: New control flow for SI v2 This patch replaces the control flow handling with a new pass which structurize the graph before transforming it to machine instruction. This has a couple of different advantages and currently fixes 20 piglit tests without a single regression. It is now a general purpose transformation that could be not only be used for SI/R6xx, but also for other hardware implementations that use a form of structurized control flow. v2: further cleanup, fixes and documentation Patch by: Christian König Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 170591	2012-12-19 22:10:31 +00:00
Tom Stellard	5a6879466a	R600: enable S_N2_ instructions They seem to work fine. Patch by: Christian König Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 170343	2012-12-17 15:14:56 +00:00
Tom Stellard	9e90b5895d	R600: BB operand support for SI Patch by: Christian König Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 170342	2012-12-17 15:14:54 +00:00
Tom Stellard	16a17c6d3e	R600: remove nonsense setPrefLoopAlignment The Align parameter is a power of two, so 16 results in 64K alignment. Additional to that even 16 byte alignment doesn't make any sense, so just remove it. Patch by: Christian König Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 170341	2012-12-17 15:14:53 +00:00
Tom Stellard	6975d35979	Fix warnings with -DNDEBUG Patch by: NAKAMURA Takumi llvm-svn: 170142	2012-12-13 19:38:52 +00:00
Jakob Stoklund Olesen	436eea9833	Avoid setIsInsideBundle in Target/R600. This function is going to be removed. llvm-svn: 170064	2012-12-13 00:59:38 +00:00
NAKAMURA Takumi	85292a1338	[CMake] Fixup R600. llvm-svn: 169962	2012-12-12 03:34:26 +00:00
Tom Stellard	75aadc2813	Add R600 backend A new backend supporting AMD GPUs: Radeon HD2XXX - HD7XXX llvm-svn: 169915	2012-12-11 21:25:42 +00:00

... 26 27 28 29 30 ...

1598 Commits