llvm-project


Author	SHA1	Message	Date
Matt Arsenault	870397739e	AMDGPU: Preserve undef flag when expanding SI_IF Fixes undefined value verifier error. llvm-svn: 355426	2019-03-05 18:38:00 +00:00
Carl Ritson	9e3f7d8ad0	[AMDGPU] Fix DPP operand order in atomic optimizer Summary: Ensure order of operands in DPP atomic optimizer final WWM step is appropriate for sub instructions. Change-Id: I631d050e1c00a3b4bc7c11a90437064403c4cf30 Reviewers: sheredom, tpr Reviewed By: sheredom Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, t-tye, jfb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58900 llvm-svn: 355394	2019-03-05 12:21:44 +00:00
David Stuttard	81eec58a0d	[AMDGPU] Omit KILL instructions from hazard recognizer Summary: In some cases the KILL was causing a hazard to be introduced as these were scheduled into hazard slots, but don't result in an instruction. KILL shouldn't be considered for hazard recognition. Change-Id: Ib6d2a2160f8c94cd0ce611ab198c7e4f46aeffcf Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, tpr, t-tye, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58898 llvm-svn: 355384	2019-03-05 10:25:16 +00:00
Scott Linder	efec1396ac	[AMDGPU] Implement AMDGPUMCInstrAnalysis Implement MCInstrAnalysis for AMDGPU, with default implementations save for `evaluateBranch`. Differential Revision: https://reviews.llvm.org/D58400 llvm-svn: 355373	2019-03-05 03:02:00 +00:00
Dmitry Preobrazhensky	6023d5990d	[AMDGPU][MC] Enable lds_direct operand for v_readfirstlane_b32, v_readlane_b32 and v_writelane_b32 See bug 40662: https://bugs.llvm.org/show_bug.cgi?id=40662 Reviewers: artem.tamazov, arsenm, rampitec Differential Revision: https://reviews.llvm.org/D58713 llvm-svn: 355312	2019-03-04 12:48:32 +00:00
Stanislav Mekhanoshin	bb98841399	[AMDGPU] Mark ds instructions as maybeAtomic These were not recognized as potential atomics by memory legalizer. The test was working not because legalizer did the right thing, but because it has skipped all these instructions. When I have fixed DS description test started to fail because region address has changed from 4 to 2 a while ago. Differential Revision: https://reviews.llvm.org/D58802 llvm-svn: 355179	2019-03-01 07:59:17 +00:00
Tom Stellard	33634d1b25	AMDGPU/GlobalISel: Implement select for G_INSERT Re-commit r344310. Reviewers: arsenm Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D53116 llvm-svn: 355159	2019-03-01 00:50:26 +00:00
Tom Stellard	41f32196a0	AMDGPU/GlobalISel: Implement select for G_EXTRACT Reviewers: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D49714 llvm-svn: 355156	2019-02-28 23:37:48 +00:00
Matt Arsenault	09a09ef8b7	AMDGPU: Fix typo llvm-svn: 355056	2019-02-28 00:52:33 +00:00
Matt Arsenault	5d567dc137	AMDGPU: Enable function calls by default Fixes some crashes on illegal call situations which are unfortunately still valid IR. llvm-svn: 355051	2019-02-28 00:40:32 +00:00
Matt Arsenault	aa03bcd23c	AMDGPU: Fix crashes in invalid call cases We have to at least tolerate calls to kernels, possibly with a mismatched calling convention on the callsite. llvm-svn: 355049	2019-02-28 00:28:44 +00:00
Matt Arsenault	d3093c2f1f	GlobalISel: Implement fewerElementsVector for phi llvm-svn: 355048	2019-02-28 00:16:32 +00:00
Matt Arsenault	72bcf15dbf	GlobalISel: Implement moreElementsVector for phi llvm-svn: 355047	2019-02-28 00:01:05 +00:00
Dmitry Preobrazhensky	7904231edb	[AMDGPU][MC] Added register size check for VOP3/SDWA/DPP operands See bug 37943: https://bugs.llvm.org/show_bug.cgi?id=37943 Reviewers: artem.tamazov, arsenm, rampitec Differential Revision: https://reviews.llvm.org/D58287 llvm-svn: 354974	2019-02-27 13:58:48 +00:00
Dmitry Preobrazhensky	ef92035827	[AMDGPU][MC][GFX8+] Added syntactic sugar for 'vgpr index' operand of instructions s_set_gpr_idx_on and s_set_gpr_idx_mode See bug 39331: https://bugs.llvm.org/show_bug.cgi?id=39331 Reviewers: artem.tamazov, arsenm Differential Revision: https://reviews.llvm.org/D58288 llvm-svn: 354969	2019-02-27 13:12:12 +00:00
Stanislav Mekhanoshin	da1628eb67	[AMDGPU] Fixed hang during DAG combine SITargetLowering::reassociateScalarOps() does not touch constants so that DAGCombiner::ReassociateOps() does not revert the combine. However a global address is not a ConstantSDNode. Switched to the method used by DAGCombiner::ReassociateOps() itself to detect constants. Differential Revision: https://reviews.llvm.org/D58695 llvm-svn: 354926	2019-02-26 20:56:25 +00:00
Matt Arsenault	752579736e	RegBankSelect: Handle slightly more complex value mappings Try to use concat_vectors. Also remove unnecessary assert on pointers. Fixes asserting for <4 x s16> operations and 64-bit pointers for AMDGPU. llvm-svn: 354828	2019-02-25 22:24:13 +00:00
Matt Arsenault	f4bfe4cd17	AMDGPU/GlobalISel: Fix bit ops for non-power-of-2 sizes llvm-svn: 354825	2019-02-25 21:32:48 +00:00
Matt Arsenault	82b103998b	AMDGPU/GlobalISel: Clamp max implicit_def elements llvm-svn: 354818	2019-02-25 20:46:06 +00:00
Matt Arsenault	f97ace5639	AMDGPU: Remove IntrReadMem from memtime/memrealtime intrinsics EarlyCSE with MemorySSA was able to use this to merge multiple calls with no intervening store. llvm-svn: 354814	2019-02-25 20:16:11 +00:00
Matt Arsenault	fd6fd00773	AMDGPU: Correct definitions for bitset instructions These really read and write the result register, so these need a tied input. llvm-svn: 354809	2019-02-25 19:24:46 +00:00
Konstantin Zhuravlyov	9a278bf6b5	Revert "AMDGPU/NFC: Cleanup subtarget predicates" It breaks one of our downstream merges, so revert it temporarily while investigating failures downstream llvm-svn: 354700	2019-02-22 23:21:06 +00:00
Matt Arsenault	476e26b5d3	AMDGPU: Use removeAllRegUnitsForPhysReg llvm-svn: 354686	2019-02-22 19:03:36 +00:00
Matt Arsenault	aa6fb4c45e	AMDGPU: Remove debugger related subtarget features As far as I know these aren't needed anymore. llvm-svn: 354634	2019-02-21 23:27:46 +00:00
Konstantin Zhuravlyov	c2650178a1	AMDGPU/NFC: Cleanup subtarget predicates Differential Revision: https://reviews.llvm.org/D58522 llvm-svn: 354620	2019-02-21 20:43:43 +00:00
Mark Searles	599ce44d3f	[AMDGPU] remove unused AssemblerPredicates An internal build is hitting asserts complaining about too many subtarget features: llvm/utils/TableGen/Types.cpp:42: const char* llvm::getMinimalTypeForEnumBitfield(uint64_t): Assertion `MaxIndex <= 64 && "Too many bits"' failed. llvm/utils/TableGen/AsmMatcherEmitter.cpp:1476: void {anonymous}::AsmMatcherInfo::buildInfo(): Assertion `SubtargetFeatures.size() <= 64 && "Too many subtarget features!"' failed. The short-term solution is to remove a few unused AssemblerPredicates to get under the limit. The long-term solution seems to be to revisit these asserts. E.g., rather than hardcoded '64', use the standard sized std::bitset like the other places that track subtarget features. Differential Revision: https://reviews.llvm.org/D58516 llvm-svn: 354604	2019-02-21 18:19:54 +00:00
Matt Arsenault	2e0ee47712	AMDGPU/GlobalISel: Make phis legal llvm-svn: 354592	2019-02-21 15:48:13 +00:00
Matt Arsenault	b10fa8df3f	AMDGPU/GlobalISel: Fix bit count ops for non-power-of-2 types llvm-svn: 354587	2019-02-21 15:22:20 +00:00
Stanislav Mekhanoshin	42e229e130	[AMDGPU] fix commuted case of sub combine Differential Revision: https://reviews.llvm.org/D58481 llvm-svn: 354543	2019-02-21 02:58:00 +00:00
Tom Stellard	79b5c3842b	AMDGPU/GlobalISel: Move SMRD selection logic to TableGen Reviewers: arsenm Reviewed By: arsenm Subscribers: volkan, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D52922 llvm-svn: 354516	2019-02-20 21:02:37 +00:00
Matt Arsenault	75e30c4d5d	GlobalISel: Fix fewerElementsVector for ctlz with different result type Also complete the set of related operations. llvm-svn: 354480	2019-02-20 16:42:52 +00:00
Matt Arsenault	c4d07554e4	GlobalISel: Implement moreElementsVector for g_insert results llvm-svn: 354477	2019-02-20 16:11:22 +00:00
Matt Arsenault	b4c95b338b	GlobalISel: Implement moreElementsVector for select llvm-svn: 354354	2019-02-19 17:03:09 +00:00
Matt Arsenault	4d88427a58	GlobalISel: Implement moreElementsVector for G_EXTRACT source llvm-svn: 354348	2019-02-19 16:44:22 +00:00
Matt Arsenault	26b7e859ef	GlobalISel: Implement moreElementsVector for bit ops llvm-svn: 354345	2019-02-19 16:30:19 +00:00
Changpeng Fang	4cabf6d3b5	AMDGPU: Use MachineInstr::mayAlias to replace areMemAccessesTriviallyDisjoint in LoadStoreOptimizer pass. Summary: This is to fix a memory dependence bug in LoadStoreOptimizer. Reviewers: arsenm, rampitec Differential Revision: https://reviews.llvm.org/D58295 llvm-svn: 354295	2019-02-18 23:00:26 +00:00
Matt Arsenault	fbe92a53d0	GlobalISel: Implement widenScalar for g_extract scalar results llvm-svn: 354293	2019-02-18 22:39:27 +00:00
Konstantin Zhuravlyov	1e126c503b	AMDGPU: Set ABI version to 1 for code object v3 Differential Revision: https://reviews.llvm.org/D57811 llvm-svn: 354085	2019-02-14 23:56:04 +00:00
Matt Arsenault	530d05e94a	GlobalISel: Add alignment to LegalityQuery MMOs This allows targets to specify the minimum alignment required for the load/store. llvm-svn: 354071	2019-02-14 22:41:09 +00:00
Matt Arsenault	9e5e868d95	AMDGPU/GlobalISel: Fix RegBankSelect for GEP. This is basically a pointer typed add, so shouldn't be any different. This was assuming everything was an SGPR, which is not true. Also cleanup legality for GEP. I don't seem to be seeing the problem the hack marking s64 as a legal pointer type the comment mentions. llvm-svn: 354067	2019-02-14 22:24:28 +00:00
Stanislav Mekhanoshin	871821f786	[AMDGPU] Reassociate 'add (add x, y), z' to use SALU Reassociate adds to collect scalar operands in a single instruction when possible. That will result in a scalar add followed by vector instead of two vector adds, thus better utilizing SALU. Differential Revision: https://reviews.llvm.org/D58220 llvm-svn: 354066	2019-02-14 22:11:25 +00:00
Matt Arsenault	d3d496338e	AMDGPU/GlobalISel: Handle split for 64-bit VALU select llvm-svn: 354065	2019-02-14 21:58:12 +00:00
Matt Arsenault	4cd9509e1d	AMDGPU: Try to use function specific ST Subtargets are a function level property, so ideally we would eliminate everywhere that needs to check the global one. Rename the function to try avoiding confusion. llvm-svn: 353900	2019-02-12 23:44:13 +00:00
Matt Arsenault	d24296e282	AMDGPU: Ignore CodeObjectV3 when inlining This was inhibiting inlining of library functions when clang was invoking the inliner directly. This is covering a bit of a mess with subtarget feature handling, and this shouldn't be a subtarget feature. The behavior is different depending on whether you are using a -mattr flag in clang, or llc, opt. llvm-svn: 353899	2019-02-12 23:30:11 +00:00
Konstantin Zhuravlyov	6220d62e5c	AMDGPU/NFC: Remove SubtargetFeatureISAVersion since it is not used anywhere llvm-svn: 353892	2019-02-12 22:49:49 +00:00
Konstantin Zhuravlyov	acb231c8d8	AMDGPU: Remove duplicate processor (gfx900) llvm-svn: 353889	2019-02-12 22:29:25 +00:00
Matt Arsenault	00ccd13c73	AMDGPU/GlobalISel: Only make f16 constants legal on f16 targets We could deal with it, but there's no real point. llvm-svn: 353845	2019-02-12 14:54:55 +00:00
Matt Arsenault	18ec382698	GlobalISel: Implement moreElementsVector for implicit_def llvm-svn: 353754	2019-02-11 22:00:39 +00:00
Matt Arsenault	9dba67f431	GlobalISel: Add G_FCANONICALIZE instruction llvm-svn: 353719	2019-02-11 17:05:20 +00:00
Benjamin Kramer	582c16013d	[AMDGPU] Remove unused variable llvm-svn: 353704	2019-02-11 14:49:54 +00:00


3177 Commits