llvm-project

Commit Graph

Author	SHA1	Message	Date
Changpeng Fang	022c8d4a3f	AMDGPU [NFC]: Fix a few typos in docs AMDGPUUsage.rst Summery: Fix a few typos in docs AMDGPUUsage.rst Differential Revision: https://reviews.llvm.org/D118272	2022-02-02 14:22:52 -08:00
Changpeng Fang	1194b9cdda	AMDGPU {NFC}: Add code object v5 support and generate metadata for implicit kernel args Summary: Add code object v5 support (deafult is still v4) Generate metadata for implicit kernel args for the new ABI Set the metadata version to be 1.2 Reviewers: t-tye, b-sumner, arsenm, and bcahoon Fixes: SWDEV-307188, SWDEV-307189 Differential Revision: https://reviews.llvm.org/D118272	2022-01-31 18:07:47 -08:00
Matt Arsenault	e6564f39c7	AMDGPU: Emit user sgpr count directives in text asm We were emitting these in the object file but not printing them.	2022-01-26 13:51:12 -05:00
Changpeng Fang	4cfea311cb	[AMDGPU][NFC] Update to AMDGPUUsage for default Code Object Version Summary: Update the documentation for default code object version (from v3 to v4). Reviewers: kzhuravl Differential Revision: https://reviews.llvm.org/D117845	2022-01-24 14:33:12 -08:00
Tony Tye	0ac939f3e2	[AMDGPU][NFC] Update to DWARF extension for heterogeneous debugging - Update documentation on the DWARF extension for heterogeneous debugging to better reference the DWARF Version 5 standard. - Numerous other corrections. Reviewed By: kzhuravl Differential Revision: https://reviews.llvm.org/D116275	2021-12-28 17:13:45 +00:00
Tony Tye	c6be2ad73a	[AMDGPU][NFC] Add documentation for location description DWARF extension Add documentation for the DWARF extension to allow location descriptions on the DWARF expression stack. This is part of the "DWARF Extensions For Heterogeneous Debugging" used by the AMD GPU target. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D115587	2021-12-14 00:58:17 +00:00
Jay Foad	5d602120c3	[AMDGPU] Update docs for nontemporal store Update the documented GFX10 code sequence for nontemporal stores after D114351. Differential Revision: https://reviews.llvm.org/D114707	2021-11-30 09:43:42 +00:00
Jay Foad	65d9dc7f1f	[AMDGPU] Fix list indentation in docs	2021-11-29 15:06:01 +00:00
Jay Foad	7319d11586	[AMDGPU] Fix "must generated" typo in docs	2021-11-29 15:01:18 +00:00
Carl Ritson	6d28dffb6b	[AMDGPU] Update GFX10 memory model to account for MALL Document memory attached last level (MALL) cache added in GFX10.3. Reviewed By: t-tye Differential Revision: https://reviews.llvm.org/D114076	2021-11-18 09:29:30 +09:00
Matt Arsenault	8d4b74ac3f	AMDGPU: Don't consider whether amdgpu-flat-work-group-size was set It should be semantically identical if it was set to the same value as the default. Also improve the documentation.	2021-10-22 16:23:50 -04:00
Scott Linder	0022426917	[AMDGPU] Update Call Convention docs for GFX90A Document the CSR AGPRs for GFX90A. Remove the TODO for gfx908, as the answer is that we don't mark any AGPRs as callee-saved except for GFX90A, i.e. the docs as-is are correct for gfx908. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D109009	2021-09-01 20:02:41 +00:00
Kazu Hirata	5294a0f7c3	[llvm] Fix typos in documentation (NFC)	2021-08-28 06:37:03 -07:00
Matt Arsenault	088cc63640	AMDGPU: Invert AMDGPUAttributor Switch to using BitIntegerState for each of the inputs, and invert their meanings. This now diverges more from the old AMDGPUAnnotateKernelFeatures, but this isn't used yet anyway.	2021-08-26 21:32:13 -04:00
RamNalamothu	9b9e7f6f4e	[docs, AMDGPU] Fix typo in dwarf register number mapping Reviewed By: xgupta Differential Revision: https://reviews.llvm.org/D108557	2021-08-26 23:55:29 +05:30
Reshabh Sharma	5173854f19	[AMDGPU] Handle functions in llvm's global ctors and dtors list This patch introduces a new code object metadata field, ".kind" which is used to add support for init and fini kernels. HSAStreamer will use function attributes, "device-init" and "device-fini" to distinguish between init and fini kernels from the regular kernels and will emit metadata with ".kind" set to "init" and "fini" respectively. To reduce the number of init and fini kernels, the ctors and dtors present in the llvm's global.ctors and global.dtors lists are called from a single init and fini kernel respectively. Reviewed by: yaxunl Differential Revision: https://reviews.llvm.org/D105682	2021-08-06 15:53:33 +05:30
Reshabh Sharma	dce35ef104	Revert "[AMDGPU] Handle functions in llvm's global ctors and dtors list" This reverts commit `d42e70b3d3`.	2021-08-04 23:33:31 +05:30
Reshabh Sharma	d42e70b3d3	[AMDGPU] Handle functions in llvm's global ctors and dtors list This patch introduces a new code object metadata field, ".kind" which is used to add support for init and fini kernels. HSAStreamer will use function attributes, "device-init" and "device-fini" to distinguish between init and fini kernels from the regular kernels and will emit metadata with ".kind" set to "init" and "fini" respectively. To reduce the number of init and fini kernels, the ctors and dtors present in the llvm's global.ctors and global.dtors lists are called from a single init and fini kernel respectively. Reviewed by: yaxunl Differential Revision: https://reviews.llvm.org/D105682	2021-08-04 19:53:33 +05:30
Tony Tye	51e62e56f7	[AMDGPU] Reserve AMDGPU ELF e_flags machine 0x45 Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D106249	2021-07-19 20:17:35 +00:00
Tony Tye	53fed88159	[AMDGPU] Reserve AMDGPU ELF e_flags machine 0x44 Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D106034	2021-07-15 06:46:27 +00:00
Hafiz Abid Qadeer	b205f2bb89	[AMDGPU] Handle s_branch to another section. Currently, if target of s_branch instruction is in another section, it will fail with the error of undefined label. Although in this case, the label is not undefined but present in another section. This patch tries to handle this issue. So while handling fixup_si_sopp_br fixup in getRelocType, if the target label is undefined we issue an error as before. If it is defined, a new relocation type R_AMDGPU_REL16 is returned. This issue has been reported in https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100181 and https://bugs.llvm.org/show_bug.cgi?id=45887. Before https://reviews.llvm.org/D79943, we used to get an crash for this scenario. The crash is fixed now but the we still get an undefined label error. Jumps to other section can arise with hold/cold splitting. A patch to handle the relocation in lld will follow shortly. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D105760	2021-07-13 12:17:47 +01:00
Krzysztof Drewniak	8ba53152d7	Add newline to fix documentation build Reviewed By: xgupta Differential Revision: https://reviews.llvm.org/D105825	2021-07-12 19:00:58 +00:00
Krzysztof Drewniak	bef5ed1eea	[AMDGPU][Docs] Update Code Object V3 example to includes args section The documentation for the AMDGPU assembler's examples don't show the .args section, which, if ommitted, will cause arguments to silently not be passed into the kernel. This commit fixes this issue. Reviewed By: #amdgpu, scott.linder Differential Revision: https://reviews.llvm.org/D105222	2021-07-09 17:42:29 +00:00
Tony Tye	8d69635ed9	[NFC][AMDGPU] Add link to AMD GPU gfx906 instruction set architecture Reviewed By: kzhuravl Differential Revision: https://reviews.llvm.org/D105377	2021-07-06 20:21:26 +00:00
Sebastian Neubauer	db646de3ee	[AMDGPU] Set optional PAL metadata Set informational fields in the .shader_functions table. Also correct the documentation, .scratch_memory_size and .lds_size are integers. Differential Revision: https://reviews.llvm.org/D105116	2021-07-06 11:58:00 +02:00
Tony Tye	7f19aa73c2	[AMDGPU] Update gfx90a memory model support Update AMDGPU gfx90a memory model to make coarse grain memory allocations consistent when fine grained system scope atomic acquire and release is performed. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D105137	2021-06-30 04:05:22 +00:00
Tony Tye	a1526af464	[AMDGPU] Reserve AMDGPU ELF e_flags machine 0x43 Reviewed By: kzhuravl, rampitec Differential Revision: https://reviews.llvm.org/D104872	2021-06-24 22:51:47 +00:00
Aakanksha Patil	3453f3dd46	[AMDGPU] Add gfx1035 target Differential Revision: https://reviews.llvm.org/D104804	2021-06-24 14:32:41 -04:00
Brendon Cahoon	294efbbd3e	Reland "[AMDGPU] Add gfx1013 target" This reverts commit `211e584fa2`. Fixed a use-after-free error that caused the sanitizers to fail.	2021-06-08 21:15:35 -04:00
Brendon Cahoon	211e584fa2	Revert "[AMDGPU] Add gfx1013 target" This reverts commit `ea10a86984`. A sanitizer buildbot reports an error.	2021-06-08 16:29:41 -04:00
Brendon Cahoon	ea10a86984	[AMDGPU] Add gfx1013 target Differential Revision: https://reviews.llvm.org/D103663	2021-06-08 12:49:49 -04:00
Tony Tye	355114a753	[NFC][AMDGPU] Add documentation for AMD Instinct MI100 accelerator Add link to documentation for "AMD Instinct MI100 Instruction Set Architecture" to AMDGPUUsage.rst. Reviewed By: kzhuravl, rampitec, dp Differential Revision: https://reviews.llvm.org/D102859	2021-05-21 16:51:13 +00:00
Tony Tye	b408efe4ff	[NFC][AMDGPU] Mark C code in AMDGPUUsage.rst Reviewed By: foad Differential Revision: https://reviews.llvm.org/D102910	2021-05-21 10:08:05 +00:00
Konstantin Zhuravlyov	4e297dcd18	AMDGPU/Docs: Remove reserved MACH 0x3E (it is no longer reserved), sort MACHs by value	2021-05-18 16:57:56 -04:00
Stanislav Mekhanoshin	6fb02596a2	[AMDGPU] Add support for architected flat scratch Add support for the readonly flat Scratch register initialized by the SPI. Differential Revision: https://reviews.llvm.org/D102432	2021-05-14 10:53:48 -07:00
Dmitry Preobrazhensky	434b278cde	[AMDGPU][MC][NFC][DOC] Updated AMD GPU assembler syntax description. Summary of changes: - added description of GFX90A; - minor bugfixing and improvements.	2021-05-14 16:13:30 +03:00
Aakanksha Patil	464e4dc50f	[AMDGPU] Add gfx1034 target Differential Revision: https://reviews.llvm.org/D102306	2021-05-13 14:25:18 -04:00
Tony Tye	d6a228cba4	[NFC][AMDGPU] Correct product name for gfx908 The product name for gfx908 is "AMD Instinct MI100 Accelerator". Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D102209	2021-05-11 15:17:04 +00:00
Konstantin Zhuravlyov	4fae63c612	AMDGPU: Add gfx90c support to code object v2 for backwards compatibility Differential Revision: https://reviews.llvm.org/D100126	2021-04-08 16:42:43 -04:00
Tony Tye	2e9465ce2e	[NFC][AMDGPU] Correct indentation in AMDGPUUsage.rst Correct indentation that results in rST syntax error.	2021-04-08 01:00:13 +00:00
Tony Tye	4658cd4c18	[AMDGPU] Update gfx90a memory model support Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D100070	2021-04-07 22:17:58 +00:00
Tony	4c70f56ec6	[NFC][AMDGPU] Add product names for gfx908 and gfx10 processors Reviewed By: msearles Differential Revision: https://reviews.llvm.org/D99781	2021-04-02 00:58:11 +00:00
Tim Renouf	083b0f1b40	[AMDGPU] Update AMDGPU PAL usage documentation Change-Id: I65f3edcfe5063551cad5aab0da1374c3a6ccd3a2	2021-03-30 08:33:18 +01:00
Tony	850fcedb27	[NFC][AMDGPU] Corrections to AMD GPU initial kernel launch documentation Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D99223	2021-03-26 02:05:45 +00:00
Tony	c181724a9b	[NFC][AMDGPU] Reserve AMD GPU ELF machine number 0x41 Reviewed By: foad Differential Revision: https://reviews.llvm.org/D99196	2021-03-23 17:53:02 +00:00
Tony	1e04706adb	[AMDGPU] Reserve ELF code Reserve AMD GPU ELF machine code 0x040. Minor AMDGPUUsage format consistency change. Reviewed By: kzhuravl Differential Revision: https://reviews.llvm.org/D99122	2021-03-23 04:30:38 +00:00
Tony Tye	2da13f1246	[NFC][AMDGPU] Document the AMDGPU target feature defaults Document the default for the XNACK and SRAMECC target features for code object V2-V3 and V4. Reviewed By: kzhuravl Differential Revision: https://reviews.llvm.org/D97598	2021-02-27 18:28:15 +00:00
Kazu Hirata	e8fa9014cc	[llvm] Fix typos in documentation (NFC)	2021-02-27 10:09:23 -08:00
Konstantin Zhuravlyov	71d1f785a5	AMDGPU/ELF: Sort MACHs by value and add missing reserved MACHs - Sort MACHs by its value - Add missing reserved MACHs - EF_AMDGPU_MACH_AMDGCN_RESERVED_0X3D - EF_AMDGPU_MACH_AMDGCN_RESERVED_0X3E Differential Revision: https://reviews.llvm.org/D97010	2021-02-18 20:46:27 -05:00
Stanislav Mekhanoshin	a8d9d50762	[AMDGPU] gfx90a support Differential Revision: https://reviews.llvm.org/D96906	2021-02-17 16:01:32 -08:00

1 2 3 4 5

212 Commits