llvm-project

Commit Graph

Author	SHA1	Message	Date
Tom Stellard	35bb18c2a7	R600: Add support for vector local memory loads llvm-svn: 189226	2013-08-26 15:06:04 +00:00
Tom Stellard	c6f4a29ed5	R600: Add support for i8 and i16 local memory loads llvm-svn: 189225	2013-08-26 15:05:59 +00:00
Tom Stellard	2ffc330673	R600: Add support for v4i32 and v2i32 local stores llvm-svn: 189222	2013-08-26 15:05:44 +00:00
Tom Stellard	a92ff87929	R600: Expand vector float operations for both SI and R600 Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 188596	2013-08-16 23:51:24 +00:00
Tom Stellard	fbab827e2a	R600: Add support for global vector stores with elements less than 32-bits Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 188520	2013-08-16 01:12:11 +00:00
Tom Stellard	d3ee8c103a	R600: Add support for i16 and i8 global stores Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 188519	2013-08-16 01:12:06 +00:00
Tom Stellard	fc455471c3	R600: Set scheduling preference to Sched::Source R600 doesn't need to do any scheduling on the SelectionDAG now that it has a very good MachineScheduler. Also, using the VLIW SelectionDAG scheduler was having a major impact on compile times. For example with the phatk kernel here are the LLVM IR to machine code compile times: With Sched::VLIW Total Compile Time: 1.4890 Seconds (User + System) SelectionDAG Instruction Scheduling: 1.1670 Seconds (User + System) With Sched::Source Total Compile Time: 0.3330 Seconds (User + System) SelectionDAG Instruction Scheduling: 0.0070 Seconds (User + System) The code ouput was identical with both schedulers. This may not be true for all programs, but it gives me confidence that there won't be much reduction, if any, in code quality by using Sched::Source. llvm-svn: 188215	2013-08-12 22:33:21 +00:00
Tom Stellard	0344cdfe39	R600: Add 64-bit float load/store support * Added R600_Reg64 class * Added T#Index#.XY registers definition * Added v2i32 register reads from parameter and global space * Added f32 and i32 elements extraction from v2f32 and v2i32 * Added v2i32 -> v2f32 conversions Tom Stellard: - Mark vec2 operations as expand. The addition of a vec2 register class made them all legal. Patch by: Dmitry Cherkassov Signed-off-by: Dmitry Cherkassov <dcherkassov@gmail.com> llvm-svn: 187582	2013-08-01 15:23:42 +00:00
Tom Stellard	aa313d0a74	R600/SI: Expand vector fp <-> int conversions llvm-svn: 187421	2013-07-30 14:31:03 +00:00
Quentin Colombet	e2e0548d77	[R600] Replicate old DAGCombiner behavior in target specific DAG combine. build_vector is lowered to REG_SEQUENCE, which is something the register allocator does a good job at optimizing. llvm-svn: 187397	2013-07-30 00:27:16 +00:00
Tom Stellard	840214437b	R600: Move CONST_ADDRESS folding into AMDGPUDAGToDAGISel::Select() This increases the number of opportunites we have for folding. With the previous implementation we were unable to fold into any instructions other than the first when multiple instructions were selected from a single SDNode. Reviewed-by: Vincent Lejeune <vljn at ovi.com> llvm-svn: 186919	2013-07-23 01:48:24 +00:00
Tom Stellard	1e80309ebe	R600: Use KCache for kernel arguments Reviewed-by: Vincent Lejeune <vljn at ovi.com> llvm-svn: 186918	2013-07-23 01:48:18 +00:00
Tom Stellard	acfeebf883	R600: Use the same compute kernel calling convention for all GPUs A side-effect of this is that now the compiler expects kernel arguments to be 4-byte aligned. Reviewed-by: Vincent Lejeune <vljn at ovi.com> llvm-svn: 186916	2013-07-23 01:48:05 +00:00
Tom Stellard	78e012969c	R600: Use correct LoadExtType when lowering kernel arguments Reviewed-by: Vincent Lejeune <vljn at ovi.com> llvm-svn: 186915	2013-07-23 01:47:58 +00:00
Tom Stellard	33dd04bfbe	R600: Clean up extended load patterns Reviewed-by: Vincent Lejeune <vljn at ovi.com> llvm-svn: 186914	2013-07-23 01:47:52 +00:00
Tom Stellard	67ae4762ef	R600: Expand VSELECT for all types llvm-svn: 186613	2013-07-18 21:43:35 +00:00
Michel Danzer	49812b5bbd	R600/SI: Initial local memory support Enough for the radeonsi driver to use it for calculating derivatives. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186012	2013-07-10 16:37:07 +00:00
Vincent Lejeune	b8aac8d720	R600: Fix a rare bug where swizzle optimization returns wrong values llvm-svn: 185942	2013-07-09 15:03:25 +00:00
Vincent Lejeune	b55940cc7d	R600: Use DAG lowering pass to handle fcos/fsin NOTE: This is a candidate for the stable branch. llvm-svn: 185940	2013-07-09 15:03:11 +00:00
Tom Stellard	c026e8bc8e	R600: Add local memory support via LDS Reviewed-by: Vincent Lejeune<vljn at ovi.com> llvm-svn: 185162	2013-06-28 15:47:08 +00:00
Tom Stellard	02661d9605	R600: Use new getNamedOperandIdx function generated by TableGen llvm-svn: 184880	2013-06-25 21:22:18 +00:00
Aaron Watry	0a794a4612	R600: Consolidate expansion of v2i32/v4i32 ops for EG/SI By default, we expand these operations for both EG and SI. Move the duplicated code into a common space for now. If the targets ever actually implement these operations as instructions, we can override that in the relevant target. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184848	2013-06-25 13:55:57 +00:00
Aaron Watry	5527b6c6b6	R600/SI: Expand udiv v[24]i32 for SI and v2i32 for EG Also add lit test for both cases on SI, and v2i32 for evergreen. Note: I followed the guidance of the v4i32 EG check... UDIV produces really complex code, so let's just check that the instruction was lowered successfully. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 184843	2013-06-25 13:55:43 +00:00
Tom Stellard	6ec9e8043c	R600: Expand v2i32 load/store instead of custom lowering The custom lowering causes llc to crash with a segfault. Ideally, the custom lowering can be fixed, but this allows programs which load/store v2i32 to work without crashing. Patch by: Aaron Watry Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Aaron Watry<awatry@gmail.com> llvm-svn: 184480	2013-06-20 21:55:23 +00:00
Benjamin Kramer	193960c822	R600: Make helper functions static. llvm-svn: 183744	2013-06-11 13:32:25 +00:00
Bill Wendling	37e9adb091	Don't cache the instruction and register info from the TargetMachine, because the internals of TargetMachine could change. No functionality change intended. llvm-svn: 183561	2013-06-07 20:28:55 +00:00
Vincent Lejeune	276ceb8d5f	R600: Swizzle texture/export instructions llvm-svn: 183229	2013-06-04 15:04:53 +00:00
Vincent Lejeune	a09873dda7	R600: Constraints input regs of interp_xy,_zw llvm-svn: 183106	2013-06-03 15:44:16 +00:00
Andrew Trick	ef9de2a739	Track IR ordering of SelectionDAG nodes 2/4. Change SelectionDAG::getXXXNode() interfaces as well as call sites of these functions to pass in SDLoc instead of DebugLoc. llvm-svn: 182703	2013-05-25 02:42:55 +00:00
NAKAMURA Takumi	4f328e1c2f	R600ISelLowering.cpp: Avoid "using namespace Intrinsic;" to appease MSC. Specify namespaces explicitly here. MSC is confused about "memcpy" between <cstring> and llvm::Intrinsic::memcpy, when llvm::Intrinsic were exposed. llvm-svn: 182452	2013-05-22 06:37:31 +00:00
NAKAMURA Takumi	18ca09c1cc	R600: Whitespace and untabify. llvm-svn: 182451	2013-05-22 06:37:25 +00:00
Tom Stellard	5643c4ac72	R600: Swap the legality of rotl and rotr The hardware supports rotr and not rotl. llvm-svn: 182285	2013-05-20 15:02:19 +00:00
Matt Arsenault	75865923c9	Add LLVMContext argument to getSetCCResultType llvm-svn: 182180	2013-05-18 00:21:46 +00:00
Vincent Lejeune	d3fcb5016c	R600: Lower int_load_input to copyFromReg instead of Register node It solves a bug uncovered by dot4 patch where the register class of int_load_input use was ignored. llvm-svn: 182130	2013-05-17 16:51:06 +00:00
Vincent Lejeune	519f21eed3	R600: Relax some vector constraints on Dot4. Dot4 now uses 8 scalar operands instead of 2 vectors one which allows register coalescer to remove some unneeded COPY. This patch also defines some structures/functions that can be used to handle every vector instructions (CUBE, Cayman special instructions...) in a similar fashion. llvm-svn: 182126	2013-05-17 16:50:32 +00:00
Vincent Lejeune	d3eed66e8c	R600: Improve texture handling llvm-svn: 182125	2013-05-17 16:50:20 +00:00
Tom Stellard	3a7c34c778	R600: Expand SUB for v2i32/v4i32 Patch by: Aaron Watry Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Aaron Watry <awatry@gmail.com> NOTE: This is a candidate for the 3.3 branch. llvm-svn: 181579	2013-05-10 02:09:39 +00:00
Tom Stellard	3deddc5079	R600: Expand MUL for v4i32/v2i32 Fixes piglit test for OpenCL builtin mul24, and allows mad24 to run. Patch by: Aaron Watry Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Aaron Watry <awatry@gmail.com> NOTE: This is a candidate for the 3.3 branch. llvm-svn: 181578	2013-05-10 02:09:34 +00:00
Tom Stellard	7fb3963498	R600: Expand SRA for v4i32/v2i32 v2: Add v4i32 test Patch by: Aaron Watry Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Aaron Watry <awatry@gmail.com> NOTE: This is a candidate for the 3.3 branch. llvm-svn: 181577	2013-05-10 02:09:29 +00:00
Tom Stellard	a99c6ae47a	R600: Expand vselect for v4i32 and v2i32 v2: Add vselect v4i32 test Patch by: Aaron Watry Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Signed-off-by: Aaron Watry <awatry@gmail.com> NOTE: This is a candidate for the 3.3 branch. llvm-svn: 181576	2013-05-10 02:09:24 +00:00
Tom Stellard	4489b85f2b	R600: Expand vector or, shl, srl, and xor nodes llvm-svn: 181035	2013-05-03 17:21:31 +00:00
Tom Stellard	87047f69ad	R600: Initialize BooleanVectorContents Fixes test/CodeGen/R600/setcc.ll llvm-svn: 180231	2013-04-24 23:56:18 +00:00
Christian Konig	70a5032c1b	R600/SI: add mulhu/mulhs patterns Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 178126	2013-03-27 09:12:51 +00:00
Michel Danzer	a2e28156b4	R600: Use legacy (0 * anything = 0) MUL instructions for pow intrinsics Fixes wrong lighting in some corner cases with r600g and radeonsi, e.g. manifested by failure of two piglit/glean tests and intermittent black patches in many apps. Tested on SI and RS880. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=62012 [radeonsi] Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=58150 [r600g] NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 177730	2013-03-22 14:09:10 +00:00
Vincent Lejeune	e5ecf10a02	R600: Fix JUMP handling so that MachineInstr verification can occur This allows R600 Target to use the newly created -verify-misched llc flag llvm-svn: 176819	2013-03-11 18:15:06 +00:00
Tom Stellard	5e524897ed	R600: Optimize another selectcc case fold selectcc (selectcc x, y, a, b, cc), b, a, b, setne -> selectcc x, y, a, b, cc Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 176700	2013-03-08 15:37:11 +00:00
Tom Stellard	2add82de09	R600: Improve custom lowering of select_cc Two changes: 1. Prefer SET* instructions when possible 2. Handle the CND*_INT case with floating-point args Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 176699	2013-03-08 15:37:09 +00:00
Tom Stellard	492ebeabe9	R600: Change operation action from Custom to Expand for BR_CC Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 176698	2013-03-08 15:37:07 +00:00
Tom Stellard	e8f9f2877b	R600: Change operation action from Custom to Expand for SETCC Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 176697	2013-03-08 15:37:05 +00:00
Tom Stellard	b852af5dc4	R600: Set BooleanContents to ZeroOrNegativeOneBooleanContent Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 176696	2013-03-08 15:37:03 +00:00

1 2

69 Commits