llvm-project

Commit Graph

Author	SHA1	Message	Date
Aditya Nandakumar	1023a2eca3	[GlobalISel]: Allow backends to custom legalize Intrinsics https://reviews.llvm.org/D31359 Add a hook "legalizeIntrinsic" to allow backends to override this and custom lower/legalize intrinsics. llvm-svn: 364821	2019-07-01 17:53:50 +00:00
Matt Arsenault	b2ea20eedd	AMDGPU/GlobalISel: RegBankSelect for sendmsg/sendmsghalt llvm-svn: 364819	2019-07-01 17:40:18 +00:00
Matt Arsenault	40d1faf38f	AMDGPU/GlobalISel: Legalize s16 fcmp llvm-svn: 364817	2019-07-01 17:35:53 +00:00
Matt Arsenault	6f74f55750	GlobalISel: Implement lower for min/max llvm-svn: 364816	2019-07-01 17:18:03 +00:00
Nicolai Haehnle	10c911db63	AMDGPU/GFX10: implement ds_ordered_count changes Summary: ds_ordered_count can now simultaneously operate on up to 4 dwords in a single instruction, which are taken from (and returned to) lanes 0..3 of a single VGPR. Change-Id: I19b6e7b0732b617c10a779a7f9c0303eec7dd276 Reviewers: mareko, arsenm, rampitec Subscribers: kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63716 llvm-svn: 364815	2019-07-01 17:17:52 +00:00
Nicolai Haehnle	4dc3b2bf95	AMDGPU: Support GDS atomics Summary: Original patch by Marek Olšák Change-Id: Ia97d5d685a63a377d86e82942436d1fe6e429bab Reviewers: mareko, arsenm, rampitec Subscribers: kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, jfb, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63452 llvm-svn: 364814	2019-07-01 17:17:45 +00:00
Matt Arsenault	1094e6a814	AMDGPU/GlobalISel: RegBankSelect for DS ordered add/swap llvm-svn: 364811	2019-07-01 17:04:57 +00:00
Matt Arsenault	732149b24e	AArch64/GlobalISel: Fix trying to select invalid MIR Physical registers are not allowed to be a phi operand. llvm-svn: 364810	2019-07-01 17:02:24 +00:00
Matt Arsenault	265059eaf6	AMDGPU/GlobalISel: RegBankSelect for amdgcn.writelane llvm-svn: 364808	2019-07-01 16:41:36 +00:00
Matt Arsenault	a310727830	AMDGPU/GlobalISel: Fail instead of assert when selecting loads llvm-svn: 364807	2019-07-01 16:36:39 +00:00
Matt Arsenault	0a52e9d026	AMDGPU/GlobalISel: Complete implementation of G_GEP Also works around tablegen defect in selecting add with unused carry, but if we have to manually select GEP, might as well handle add manually. llvm-svn: 364806	2019-07-01 16:34:48 +00:00
Matt Arsenault	e1006259d8	AMDGPU/GlobalISel: Select G_PHI llvm-svn: 364805	2019-07-01 16:32:47 +00:00
Matt Arsenault	d810ff2588	AMDGPU/GlobalISel: Try to select VOP3 form of add There are several things broken, but at least emit the right thing for gfx9. The import of the pattern with the unused carry out seems to not work. Needs a special class for clamp, because OperandWithDefaultOps doesn't really work. llvm-svn: 364804	2019-07-01 16:27:32 +00:00
Simon Pilgrim	e3e38cce4a	[X86] Add widenSubVector to size in bits helper. NFCI. We can already widenSubVector to a specific type (of the same scalar type) - this variant just specifies the target vector size. This will be useful when CombineShuffleWithExtract relaxes the need to have the same scalar type for all shuffle operand subvector sources. llvm-svn: 364803	2019-07-01 16:20:47 +00:00
Matt Arsenault	62d64b0c30	AMDGPU/GlobalISel: RegBankSelect for readlane/readfirstlane llvm-svn: 364801	2019-07-01 16:19:39 +00:00
Tom Stellard	9e9dd30de3	AMDGPU/GlobalISel: Implement select for 32-bit G_ADD Reviewers: arsenm Reviewed By: arsenm Subscribers: hiraditya, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58804 llvm-svn: 364797	2019-07-01 16:09:33 +00:00
Mikhail Maltsev	8b2e304bc5	[ARM] Fix MVE_VQxDMLxDH instruction class Summary: According to the ARMARM, the VQDMLADH, VQRDMLADH, VQDMLSDH and VQRDMLSDH instructions handle their results as follows: "The base variant writes the results into the lower element of each pair of elements in the destination register, whereas the exchange variant writes to the upper element in each pair". I.e., the initial content of the output register affects the result, as usual, we model this with an additional input. Also, for 32-bit variants Qd is not allowed to be the same register as Qm and Qn, we use @earlyclobber to indicate this. This patch also changes vpred_r to vpred_n because the instructions don't have an explicit 'inactive' operand. Reviewers: dmgreen, ostannard, simon_tatham Reviewed By: simon_tatham Subscribers: javed.absar, kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64007 llvm-svn: 364796	2019-07-01 16:07:58 +00:00
Matt Arsenault	2ab25f9ceb	AMDGPU/GlobalISel: Select G_BRCOND for vcc llvm-svn: 364795	2019-07-01 16:06:02 +00:00
Mikhail Maltsev	4a9e3f15bb	[ARM] MVE: support QQPRRegClass and QQQQPRRegClass Summary: QQPRRegClass and QQQQPRRegClass are used by the interleaving/deinterleaving loads/stores to represent sequences of consecutive SIMD registers. Reviewers: ostannard, simon_tatham, dmgreen Reviewed By: simon_tatham Subscribers: javed.absar, kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D64009 llvm-svn: 364794	2019-07-01 16:05:23 +00:00
Roman Lebedev	04d3d3bbff	[InstCombine] (Y + ~X) + 1 --> Y - X fold (PR42459) Summary: Note that this pattern is not unhandled by instcombine per se: it does somehow end up being folded when one runs opt -O3, but not with just -instcombine. Regardless, that fold is indirect, depends on some other folds, and is thus blind when there are extra uses. This does address the regression being exposed in D63992. https://godbolt.org/z/7DGltU https://rise4fun.com/Alive/EPO0 Fixes PR42459 (https://bugs.llvm.org/show_bug.cgi?id=42459) Reviewers: spatel, nikic, huihuiz Reviewed By: spatel Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63993 llvm-svn: 364792	2019-07-01 15:55:24 +00:00
Roman Lebedev	72b8d41ce8	[InstCombine] Shift amount reassociation in bittest (PR42399) Summary: Given pattern: `icmp eq/ne (and ((x shift Q), (y oppositeshift K))), 0` we should move shifts to the same hand of 'and', i.e. rewrite as `icmp eq/ne (and (x shift (Q+K)), y), 0` iff `(Q+K) u< bitwidth(x)` It might be tempting to not restrict this to situations where we know we'd fold two shifts together, but I'm not sure what rules there should be to avoid endless combine loops. We pick the same shift that was originally used to shift the variable we picked to shift: https://rise4fun.com/Alive/6x1v Should fix PR42399 (https://bugs.llvm.org/show_bug.cgi?id=42399). Reviewers: spatel, nikic, RKSimon Reviewed By: spatel Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63829 llvm-svn: 364791	2019-07-01 15:55:15 +00:00
Krzysztof Parzyszek	5abf80cdfa	[Hexagon] Custom-lower UADDO(x, 1) and USUBO(x, 1) llvm-svn: 364790	2019-07-01 15:50:09 +00:00
Matt Arsenault	cda82f0bb6	AMDGPU/GlobalISel: Select G_FRAME_INDEX llvm-svn: 364789	2019-07-01 15:48:18 +00:00
Nicolai Haehnle	7cfd99ab15	AMDGPU/GFX10: fix scratch resource descriptor Summary: The stride should depend on the wave size, not the hardware generation. Also, the 32_FLOAT format is 0x16, not 16; though that shouldn't be relevant. Change-Id: I088f93bf6708974d085d1c50967f119061da6dc6 Reviewers: arsenm, rampitec, mareko Subscribers: kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D63808 llvm-svn: 364788	2019-07-01 15:43:00 +00:00
Matt Arsenault	fdf36729c7	AMDGPU/GlobalISel: Make s16 select legal This is easy to handle and avoids legalization artifacts which are likely to obscure combines. llvm-svn: 364787	2019-07-01 15:42:47 +00:00
Matt Arsenault	6464280eb0	AMDGPU/GlobalISel: Select G_BRCOND for scc conditions llvm-svn: 364786	2019-07-01 15:39:27 +00:00
Matt Arsenault	1daad91af6	AMDGPU/GlobalISel: Tolerate copies with no type set isVCC has the same bug, but isn't used in a context where it can cause a problem. llvm-svn: 364784	2019-07-01 15:23:04 +00:00
Matt Arsenault	4f64ade04c	AMDGPU/GlobalISel: Select src modifiers llvm-svn: 364782	2019-07-01 15:18:56 +00:00
Diana Picus	2ba16011c1	Fixup r364512 Fix stack-use-after-scope errors from r364512. One instance was already fixed in r364611 - this patch simplifies that fix and addresses one more instance of similar code. Discussed in: https://reviews.llvm.org/D63905 llvm-svn: 364778	2019-07-01 15:07:38 +00:00
Krzysztof Parzyszek	511ad50db4	[Hexagon] Rework VLCR algorithm Add code to catch pattern for commutative instructions for VLCR. Patch by Suyog Sarda. llvm-svn: 364770	2019-07-01 13:50:47 +00:00
Matt Arsenault	1b317685e9	AMDGPU: Convert some places to Register llvm-svn: 364769	2019-07-01 13:44:46 +00:00
Matt Arsenault	5bf850d52e	AMDGPU/GlobalISel: Fix RegBankSelect for G_FCANONICALIZE llvm-svn: 364768	2019-07-01 13:40:18 +00:00
Matt Arsenault	b5fc94f3e7	AMDGPU/GlobalISel: Fix RegBankSelect for G_BUILD_VECTOR llvm-svn: 364767	2019-07-01 13:40:17 +00:00
Matt Arsenault	89fc8bcdd6	AMDGPU/GlobalISel: Fail on store to 32-bit address space llvm-svn: 364766	2019-07-01 13:37:39 +00:00
Matt Arsenault	3b7668ae4b	AMDGPU/GlobalISel: Improve icmp selection coverage. Select s64 eq/ne scalar icmp. llvm-svn: 364765	2019-07-01 13:34:26 +00:00
Matt Arsenault	c23149f612	AMDGPU/GlobalISel: RegBankSelect for WWM/WQM llvm-svn: 364763	2019-07-01 13:30:12 +00:00
Matt Arsenault	facf69e844	AMDGPU/GlobalISel: Use vcc reg bank for amdgcn.wqm.vote llvm-svn: 364762	2019-07-01 13:30:09 +00:00
Matt Arsenault	9f992c238a	AMDGPU/GlobalISel: Fix scc->vcc copy handling This was checking the size of the register with the value of the size, which happens to be exec. Also fix assuming VCC is 64-bit to fix wave32. Also remove some untested handling for physical registers which is skipped. This doesn't insert the V_CNDMASK_B32 if SCC is the physical copy source. I'm not sure if this should be trying to handle this special case instead of dealing with this in copyPhysReg. llvm-svn: 364761	2019-07-01 13:22:07 +00:00
Matt Arsenault	5dafcb9b11	AMDGPU/GlobalISel: Use and instead of BFE with inline immediate Zext from s1 is the only case where this should do anything with the current legal extensions. llvm-svn: 364760	2019-07-01 13:22:06 +00:00
Simon Atanasyan	ceb9da5bc7	[mips] Add missing schedinfo for MSA and ASE instructions llvm-svn: 364757	2019-07-01 13:21:05 +00:00
Simon Atanasyan	c0121bf874	[mips] Add missing schedinfo for atomic instructions llvm-svn: 364756	2019-07-01 13:20:56 +00:00
Simon Atanasyan	3a10810b7a	[mips] Add missing schedinfo for ADJCALLSTACKDOWN, ADJCALLSTACKUP llvm-svn: 364755	2019-07-01 13:20:48 +00:00
Florian Hahn	33c8c0ea27	[AMDGPU] Call isLoopExiting for blocks in the loop. isLoopExiting should only be called for blocks in the loop. A follow up patch makes this requirement an assertion. I've updated the usage here, to only match for actual exit blocks. Previously, it would also match blocks not in the loop. Reviewers: arsenm, nhaehnle Reviewed By: nhaehnle Differential Revision: https://reviews.llvm.org/D63980 llvm-svn: 364750	2019-07-01 12:36:44 +00:00
Fangrui Song	92e78b7bed	[RISCV] Add break; to the last switch case As suggested by jrtc27 in the post-commit review of D60528. llvm-svn: 364746	2019-07-01 11:41:07 +00:00
Simon Pilgrim	172fe5dd19	[X86] CombineShuffleWithExtract - updated description comments. NFCI. CombineShuffleWithExtract no longer requires that both shuffle ops are extract_subvectors, from the same type or from the same size. llvm-svn: 364745	2019-07-01 11:33:45 +00:00
Benjamin Kramer	ed13fef477	[SelectionDAG] Do minnum->minimum at legalization time instead of building time The SDAGBuilder behavior stems from the days when we didn't have fast math flags available in SDAG. We do now and doing the transformation in the legalizer has the advantage that it also works for vector types. llvm-svn: 364743	2019-07-01 11:00:23 +00:00
Roman Lebedev	f55818e3a7	[InstCombine] Omit 'urem' where possible This was added in D63390 / rL364286 to backend, but it makes sense to also handle it in middle-end. https://rise4fun.com/Alive/Zsln llvm-svn: 364738	2019-07-01 09:41:43 +00:00
Jeremy Morse	d2b6665e33	[DebugInfo] Avoid adding too much indirection to pointer-valued variables This patch addresses PR41675, where a stack-pointer variable is dereferenced too many times by its location expression, presenting a value on the stack as the pointer to the stack. The difference between a stack pointer DBG_VALUE and one that refers to a value on the stack, is currently the indirect flag. However the DWARF backend will also try to guess whether something is a memory location or not, based on whether there is any computation in the location expression. By simply prepending the stack offset to existing expressions, we can accidentally convert a register location into a memory location, which introduces a surprise (and unintended) dereference. The solution is to add DW_OP_stack_value whenever we add a DIExpression computation to a stack pointer. It's an implicit location computed on the expression stack, thus needs to be flagged as a stack_value. For the edge case where the offset is zero and the location could be a register location, DIExpression::prepend will still generate opcodes, and thus DW_OP_stack_value must still be added. Differential Revision: https://reviews.llvm.org/D63429 llvm-svn: 364736	2019-07-01 09:38:23 +00:00
Yevgeny Rouban	d4097b4a93	[SimpleLoopUnswitch] Implement handling of prof branch_weights metadata for SwitchInst Differential Revision: https://reviews.llvm.org/D60606 llvm-svn: 364734	2019-07-01 08:43:53 +00:00
Sam Parker	98722691b0	[ARM] WLS/LE Code Generation Backend changes to enable WLS/LE low-overhead loops for armv8.1-m: 1) Use TTI to communicate to the HardwareLoop pass that we should try to generate intrinsics that guard the loop entry, as well as setting the loop trip count. 2) Lower the BRCOND that uses said intrinsic to an Arm specific node: ARMWLS. 3) ISelDAGToDAG the node to a new pseudo instruction: t2WhileLoopStart. 4) Add support in ArmLowOverheadLoops to handle the new pseudo instruction. Differential Revision: https://reviews.llvm.org/D63816 llvm-svn: 364733	2019-07-01 08:21:28 +00:00


124261 Commits