llvm-project

Commit Graph

Author	SHA1	Message	Date
Jon Chesterfield	4b2e7d0215	[amdgpu] Default to code object v3 [amdgpu] Default to code object v3 v4 is not yet readily available, and doesn't appear to be implemented in the back end Reviewed By: t-tye Differential Revision: https://reviews.llvm.org/D93258	2020-12-15 01:11:09 +00:00
Tony	828602c772	[NFC]{AMDGPU] Update AMDGPUUsage with AMD RDNA 2 reference Differential Revision: https://reviews.llvm.org/D93172	2020-12-13 17:21:02 +00:00
Tony	87a4e14e40	[NFC][AMDGPU] AMDGPUUsage updates - Document which processors are supported by which runtimes. - Add missing mappings for code object V2 note records Differential Revision: https://reviews.llvm.org/D93016	2020-12-12 18:19:02 +00:00
Tony	3242eaef27	[NFC][AMDGPU] AMDGPUUsage updates - Document code object V2 gfx800. - Document amdpal is supported by Linux Pro. Differential Revision: https://reviews.llvm.org/D92708	2020-12-05 02:13:17 +00:00
Tony	ac1b2ae9dc	[NFC][AMDGPU] Fix broken link to ClangOffloadBundler in AMDGPUUsage	2020-12-02 03:04:28 +00:00
Tony	04424c69bc	[NFC][AMDGPU] AMDGPU code object V4 ABI documentation - Documantation for AMDGPU code object V4. - Documentation clarification for code object V2 and V3. - Documentation for the clang-offload-bundler. - Numerous other documentation clarifications. Change-Id: I338b327cc9e75da6c987b7e081b496402a5a020e Differential Revision: https://reviews.llvm.org/D92434	2020-12-01 23:31:04 +00:00
Tony	8605d3134c	[NFC][AMDGPU] Document kernel descriptor - Document that the kernel descriptor defined is for code object V3. Document that it also applies to earlier code object formats for CP. - Document the deprecated bits in kernel descriptor. Differential Revision: https://reviews.llvm.org/D91458	2020-11-21 04:54:17 +00:00
Michael Liao	f375885ab8	[InferAddrSpace] Teach to handle assumed address space. - In certain cases, a generic pointer could be assumed as a pointer to the global memory space or other spaces. With a dedicated target hook to query that address space from a given value, infer-address-space pass could infer and propagate that to all its users. Differential Revision: https://reviews.llvm.org/D91121	2020-11-16 17:06:33 -05:00
Sebastian Neubauer	a022b1ccd8	[AMDGPU] Add amdgpu_gfx calling convention Add a calling convention called amdgpu_gfx for real function calls within graphics shaders. For the moment, this uses the same calling convention as other calls in amdgpu, with registers excluded for return address, stack pointer and stack buffer descriptor. Differential Revision: https://reviews.llvm.org/D88540	2020-11-09 16:51:44 +01:00
Tony	45bcbe46d7	[NFC][AMDGPU] Minor editorial improvements to AMDGPUUsage.rst Differential Revision: https://reviews.llvm.org/D90661	2020-11-03 16:56:01 +00:00
Tim Renouf	89d41f3a2b	[AMDGPU] Add gfx1033 target Differential Revision: https://reviews.llvm.org/D90447 Change-Id: If2650fc7f31bbdd49c76e74a9ca8e3734d769761	2020-11-03 16:27:48 +00:00
Tim Renouf	ee3e642627	[AMDGPU] Add gfx90c target This differentiates the Ryzen 4000/4300/4500/4700 series APUs that were previously included in gfx909. Differential Revision: https://reviews.llvm.org/D90419 Change-Id: Ia901a7157eb2f73ccd9f25dbacec38427312377d	2020-11-03 16:27:43 +00:00
Tony	68160789c1	[NFC][AMDGPU] Restructure the AMDGPU memory model description Separate the AMDGPU memory model description into separate sections for each architecture. Differential Revision: https://reviews.llvm.org/D90548	2020-11-02 21:32:20 +00:00
Tony	fccf4f6add	[NFC][AMDGPU] Minor cleanup to AMDGPU memory model table Differential Revision: https://reviews.llvm.org/D90509	2020-10-30 22:50:22 +00:00
Scott Linder	580f99bcff	[NFC][AMDGPU] Resize Memory Model columns in AMDGPUUsage.rst Make all of the "AMDGPU Machine Code GFX*" columns in the Memory Model table a consistent width of 32-characters. Best viewed with something like --word-diff Differential Revision: https://reviews.llvm.org/D89977	2020-10-29 23:07:03 +00:00
Scott Linder	fb37943cc8	[AMDGPU] Update Memory Model in AMDGPUUsage.rst Mostly NFC, but some changes are "bug fixes" rather than just e.g. formatting changes or typo corrections. - Fix typo "competing" -> "completing". - Document why waintcnt is added to stores and not loads for sequentially consistent ordering. - Lowercase some mentions of `buffer_gl{0,1}_inv`. - Make mentions of `*cnt(0)` consistently include the `(0)` count. - Remove some mentions of instructions for incorrect address spaces. For example, remove mention of `flat_load` from `load atomic acquire workgroup global`. - Re-flow some text to get all the target columns to fit in a 32-character wide column. Makes a future NFC patch to make these columns both 32-character wide more straightforward. Modified cherry-pick of patch by Tony Tye Reviewed By: t-tye Differential Revision: https://reviews.llvm.org/D89596	2020-10-29 23:07:03 +00:00
Tony	661797bd76	[AMDGPU] Update AMD GPU documentation - AMDGPUUsage.rst: Correct AMD GPU DWARF address space table address sizes which are in bits and not bytes. - clang/.../Options.td: Improve description of AMD GPU options. - Re-generate ClangComamndLineReference.rst from clang/.../Options.td . Differential Revision: https://reviews.llvm.org/D90364	2020-10-29 20:12:47 +00:00
Tony	bf6518a806	[AMDGPU] Cleanup AMDGPUUsage.rst - Layout and typo improvements. - Add memory spaces section. - reStructure syntax fixes. Differential Revision: https://reviews.llvm.org/D90002	2020-10-24 06:21:27 +00:00
Stanislav Mekhanoshin	173389e16d	[AMDGPU] Fix gfx1032 description in AMDGPUUsage.rst. NFC. Differential Revision: https://reviews.llvm.org/D89565	2020-10-16 13:29:20 -07:00
Stanislav Mekhanoshin	d1beb95d12	[AMDGPU] gfx1032 target Differential Revision: https://reviews.llvm.org/D89487	2020-10-15 12:41:18 -07:00
Konstantin Zhuravlyov	3fdf3b1539	AMDGPU: Update AMDHSA code object version handling Differential Revision: https://reviews.llvm.org/D89076	2020-10-14 13:04:27 -04:00
Tony	fe145b66ec	[AMDGPU] Correct processor names for gfx1010 and gfx1011 Change-Id: Ie409f86876b0437d0b0405aff42872963708d926 Differential Revision: https://reviews.llvm.org/D89259	2020-10-12 20:16:12 +00:00
Tim Renouf	666ef0db20	[AMDGPU] Add gfx602, gfx705, gfx805 targets At AMD, in an internal audit of our code, we found some corner cases where we were not quite differentiating targets enough for some old hardware. This commit is part of fixing that by adding three new targets: * The "Oland" and "Hainan" variants of gfx601 are now split out into gfx602. LLPC (in the GPUOpen driver) and other front-ends could use that to avoid using the shaderZExport workaround on gfx602. * One variant of gfx703 is now split out into gfx705. LLPC and other front-ends could use that to avoid using the shaderSpiCsRegAllocFragmentation workaround on gfx705. * The "TongaPro" variant of gfx802 is now split out into gfx805. TongaPro has a faster 64-bit shift than its former friends in gfx802, and a subtarget feature could be set up for that to take advantage of it. This commit does not make that change; it just adds the target. V2: Add clang changes. Put TargetParser list in order. V3: AMDGCNGPUs table in TargetParser.cpp needs to be in GPUKind order, so fix the GPUKind order. Differential Revision: https://reviews.llvm.org/D88916 Change-Id: Ia901a7157eb2f73ccd9f25dbacec38427312377d	2020-10-10 17:22:22 +01:00
Tony	72e2fbde54	[AMDGPU] Correct gfx1031 XNACK setting documentation - gfx1031 does not support XNACK. Differential Revision: https://reviews.llvm.org/D87198	2020-09-09 19:43:02 +00:00
Tony	b690c1157e	[AMDGPU] Correct DWARF register defintions - Rename AMDGPU SCC DWARF register to STATUS since the scalar condition code is a bit within the STATUS register. - Correct bit size of the VCC_64 register to 64 which is the size in wave64 mode. Differential Revision: https://reviews.llvm.org/D86259	2020-08-20 01:15:04 +00:00
madhur13490	0313c540c2	[NFC] Fix typo in AMDGPU doc Reviewed By: t-tye, arsenm Differential Revision: https://reviews.llvm.org/D86206	2020-08-19 14:33:26 +00:00
Sebastian Neubauer	ca227d73e1	[AMDGPU] Fix typo. NFC	2020-08-13 10:41:48 +02:00
Kazu Hirata	a31b3893c7	[docs] Fix typos	2020-08-09 19:31:49 -07:00
Tony	ce74e97d9b	[AMDGPU] Correct missing sram-ecc target feature for gfx906 Differential Revision: https://reviews.llvm.org/D85476	2020-08-06 22:12:25 +00:00
Stanislav Mekhanoshin	ea7d0e2996	[AMDGPU] gfx1031 target Differential Revision: https://reviews.llvm.org/D85337	2020-08-05 12:36:26 -07:00
Tony	e24f5f3149	[AMDGPU] DWARF proposal changes - Clarify that these are extensions to DWARF 5 and not as yet a proposal. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D70523	2020-07-30 05:07:09 +00:00
Tony	5aa2fd88cf	[AMDGPU] DWARF proposal changes for expression context - Clarify what context is used in DWARF expression evaluation. - Define location descriptions to fully resolve the context and so include the context in their result. - As a consequence of location descriptions being fully resoved, change address spaces so only a swizzled and unswizzled private address space is defined. The lane is now part of the location description context. - Clarify how call frame information is used to fully resolve expressions that specify registers. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D70523	2020-07-30 01:59:22 +00:00
Matt Arsenault	31f4e43f3f	AMDGPU: Remove .value_type from kernel metadata This doesn't appear used for anything, and is emitted incorrectly based on the description. This also depends on the IR type, and pointee element type.	2020-07-10 18:16:31 -04:00
Tony	76b2d9cbeb	[AMDGPU] Correct AMDGPUUsage.rst DW_AT_LLVM_lane_pc example - Correct typo of DW_OP_xaddr to DW_OP_addrx in AMDGPUUsage.rst for DW_AT_LLVM_lane_pc example. Change-Id: I1b0ee2b24362a0240388e4c2f044c1d4883509b9	2020-07-01 08:23:15 +00:00
Tony	990f8702c9	[AMDGPU] Define DWARF encoding for condition code registers Summary: - Define DWARF register numbers for vector and scalar condition codes. - Document intended purpose of reserved DWARF register numbers. Reviewers: yaxunl, kzhuravl, arsenm, rampitec, b-sumner Subscribers: jvesely, wdng, nhaehnle, aprantl, dstuttard, tpr, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82519	2020-06-26 17:53:55 -04:00
Tony	ea6df2fb8f	[AMDGPU] Update AMD GPU processor information Summary: - Add product names for some processors. - Correct XNACK support for a processor. Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82348	2020-06-23 18:47:56 -04:00
Matt Arsenault	ae5adb8da5	AMDGPU: Update private null pointer value in documentation Private pointers used to workaround IR semantics by artifically reserving an object at offset 0 so no user object would be allocated there. Since alloca now uses a non-0 address space, that workaround is unnecssary and 0 can be treated as a valid pointer.	2020-06-18 17:27:19 -04:00
Stanislav Mekhanoshin	9ee272f13d	[AMDGPU] Add gfx1030 target Differential Revision: https://reviews.llvm.org/D81886	2020-06-15 16:18:05 -07:00
madhur13490	bca413b036	Fix a typo in AMDGPU docs Reviewers: t-tye, arsenm Reviewed By: arsenm Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81247	2020-06-05 13:30:17 +00:00
Tony	7318e24000	[AMDGPU] Add loaded code object path URI definition to AMDGPUUsage Differential Revision: https://reviews.llvm.org/D80407	2020-05-29 19:52:52 -04:00
Tony	e36be90c82	[AMDGPU] Correct formatting typos in documentation Summary: - Correct missing space in some "note" and "TODO" directives in AMDGPUUsage.rst - Correct warning for heading underline being too short in BitCodeFormat.rst Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80407	2020-05-21 20:36:46 -04:00
Jinsong Ji	628f008b20	[docs] Fix buildbot failures Buildbot has been failing since http://lab.llvm.org:8011/builders/llvm-sphinx-docs/builds/44711 This patch fix the minor issues that cause warnings.	2020-05-21 22:07:33 +00:00
Christudasan Devadasan	7c4e711ef8	[AMDGPU] Enable base pointer. When the callee requires a dynamic stack realignment, it is not possible to correcty access the incoming stack arguments using the stack pointer. We reserve a base pointer in such cases to access the function arguments inside the callee. The base pointer will hold the incoming stack pointer value before any kind of delta added to it. Reviewed By: arsenm, scott.linder Differential Revision: https://reviews.llvm.org/D78811	2020-05-17 16:13:55 +05:30
Christudasan Devadasan	375cec4b6c	[AMDGPU] Introduce more scratch registers in the ABI. The AMDGPU target has a convention that defined all VGPRs (execept the initial 32 argument registers) as callee-saved. This convention is not efficient always, esp. when the callee requiring more registers, ended up emitting a large number of spills, even though its caller requires only a few. This patch revises the ABI by introducing more scratch registers that a callee can freely use. The 256 vgpr registers now become: 32 argument registers 112 scratch registers and 112 callee saved registers. The scratch registers and the CSRs are intermixed at regular intervals (a split boundary of 8) to obtain a better occupancy. Reviewers: arsenm, t-tye, rampitec, b-sumner, mjbedy, tpr Reviewed By: arsenm, t-tye Differential Revision: https://reviews.llvm.org/D76356	2020-05-05 23:02:58 +05:30
Kazuaki Ishizaki	0312b9f550	[llvm] NFC: Fix trivial typo in rst and td files Differential Revision: https://reviews.llvm.org/D77469	2020-04-23 14:26:32 +09:00
Tony	1eac2c55d8	[AMDGPU] Move DWARF proposal to separate file - Move DWARF proposal for heterogeneous debugging to a separate file. - Add references. Differential Revision: https://reviews.llvm.org/D70523	2020-04-15 17:19:39 -04:00
Tony	b436124010	[AMDGPU] Update DWARF proposal - Unify the sections on DWARF expression and location lists. - Allow a location description to have one or more single location descriptions. - Define context of DWARF expression that includes an initial stack. Allow initial stack to be used when evaluating location list expression with overlapping PC ranges. - Reorganize the DWARF proposal in AMDGPUUsage so suitable for submission to the DWARF site. - Replace CFI instruction DW_CFA_LLVM_def_cfa_aspace with DW_CFA_def_aspace_cfa and DW_CFA_def_aspace_cfa_sf. This is to avoid the problem that DW_CFA_def_cfa and DW_CFA_def_cfa_sf cannot use a register that is not the size of an address in the CFA address space. - Clarify DWARF address class and DWARF address space. Define language values for DWARF address classes and specify how they are used by some common source languages. - Define rules for accessing registers and derefencing memory when the type size and register size or byte size operand do not match. - Numerous cleanups for consistency. Differential Revision: https://reviews.llvm.org/D70523	2020-04-14 20:05:15 -04:00
Sylvestre Ledru	72fd1033ea	Doc: Links should use https	2020-03-22 22:49:33 +01:00
Scott Linder	0e9368cc8c	[AMDGPU] Move frame pointer from s34 to s33 Remove the gap left between the stack pointer (s32) and frame pointer (s34) now that the scratch wave offset is no longer a part of the calling convention ABI. Update llvm/docs/AMDGPUUsage.rst to reflect the change. Tags: #llvm Differential Revision: https://reviews.llvm.org/D75657	2020-03-19 15:35:16 -04:00
Scott Linder	60b1967c39	[AMDGPU] Add Scratch Wave Offset to Scratch Buffer Descriptor in entry functions Add the scratch wave offset to the scratch buffer descriptor (SRSrc) in the entry function prologue. This allows us to removes the scratch wave offset register from the calling convention ABI. As part of this change, allow the use of an inline constant zero for the SOffset of MUBUF instructions accessing the stack in entry functions when a frame pointer is not requested/required. Entry functions with calls still need to set up the calling convention ABI stack pointer register, and reference it in order to address arguments of called functions. The ABI stack pointer register remains unswizzled, but is now wave-relative instead of queue-relative. Non-entry functions also use an inline constant zero SOffset for wave-relative scratch access, but continue to use the stack and frame pointers as before. When the stack or frame pointer is converted to a swizzled offset it is now scaled directly, as the scratch wave offset no longer needs to be subtracted first. Update llvm/docs/AMDGPUUsage.rst to reflect these changes to the calling convention. Tags: #llvm Differential Revision: https://reviews.llvm.org/D75138	2020-03-19 15:35:16 -04:00

1 2 3 4

152 Commits