llvm-project

Commit Graph

Author	SHA1	Message	Date
Alex Bradbury	46db78b743	[ARM][NFC] Avoid recreating MCSubtargetInfo in ARMAsmBackend After D41349, we can now directly access MCSubtargetInfo from createARM*AsmBackend. This patch makes use of this, avoiding the need to create a fresh MCSubtargetInfo (which was previously always done with a blank CPU and feature string). Given the total size of the change remains pretty tiny and we're removing the old explicit destructor, I changed the STI field to a reference rather than a pointer. Differential Revision: https://reviews.llvm.org/D41693 llvm-svn: 321707	2018-01-03 13:46:21 +00:00
Florian Hahn	dcc0ba9bbb	[InstCombine] Add test to remove VarArg casts (NFC) llvm-svn: 321706	2018-01-03 13:35:43 +00:00
Hal Finkel	8b4bdfdbc4	[TableGen] Add support of Intrinsics with multiple returns This change deals with intrinsics with multiple outputs, for example load instrinsic with address updated. DAG selection for Instrinsics could be done either through source code or tablegen. Handling all intrinsics in source code would introduce a huge chunk of repetitive code if we have a large number of intrinsic that return multiple values (see NVPTX as an example). While intrinsic class in tablegen supports multiple outputs, tablegen only supports Intrinsics with zero or one output on TreePattern. This appears to be a simple bug in tablegen that is fixed by this change. For Intrinsics defined as: def int_xxx_load_addr_updated: Intrinsic<[llvm_i32_ty, llvm_ptr_ty], [llvm_ptr_ty, llvm_i32_ty], []>; Instruction will be defined as: def L32_X: Inst<(outs reg:$d1, reg:$d2), (ins reg:$s1, reg:$s2), "ld32_x $d1, $d2, $s2", [(set i32:$d1, i32:$d2, (int_xxx_load_addr_updated i32:$s1, i32:$s2))]>; Patch by Wenbo Sun, thanks! Differential Revision: https://reviews.llvm.org/D32888 llvm-svn: 321704	2018-01-03 11:35:09 +00:00
Sander de Smalen	dc5e081b93	[AArch64][SVE] Asm: Add restricted register classes for SVE predicate vectors. Summary: Add a register class for SVE predicate operands that can only be p0-p7 (as opposed to p0-p15) Patch [1/3] in a series to add predicated ADD/SUB instructions for SVE. Reviewers: rengolin, mcrosier, evandro, fhahn, echristo, olista01, SjoerdMeijer, javed.absar Reviewed By: fhahn Subscribers: aemerson, javed.absar, tschuett, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D41441 llvm-svn: 321699	2018-01-03 10:15:46 +00:00
Alex Bradbury	7c093bf1cf	Fix build of WebAssembly and AVR backends after r321692 As experimental backends, I didn't have them configured to build in my local build config. llvm-svn: 321696	2018-01-03 09:30:39 +00:00
Alex Bradbury	86b99cb5c4	Fix incorrect documentation comment left after r321692 TargetRegistryInfo::createMCAsmBackend no longer takes a TheTriple parameter. The majory of the TargetRegistryInfo::create* functions have no or very limitied per-parameter doc comments, and adding a comment for the MCSubtargetInfo, MCRegisterInfo and MCTargetOptions parameters seems like it would add no real value beyond reading the function signature. As such, I've just deleted the doc comment for TheTriple. llvm-svn: 321694	2018-01-03 09:14:02 +00:00
Alex Bradbury	b22f751fa7	Thread MCSubtargetInfo through Target::createMCAsmBackend Currently it's not possible to access MCSubtargetInfo from a TgtMCAsmBackend. D20830 threaded an MCSubtargetInfo reference through MCAsmBackend::relaxInstruction, but this isn't the only function that would benefit from access. This patch removes the Triple and CPUString arguments from createMCAsmBackend and replaces them with MCSubtargetInfo. This patch just changes the interface without making any intentional functional changes. Once in, several cleanups are possible: * Get rid of the awkward MCSubtargetInfo handling in ARMAsmBackend * Support 16-bit instructions when valid in MipsAsmBackend::writeNopData * Get rid of the CPU string parsing in X86AsmBackend and just use a SubtargetFeature for HasNopl * Emit 16-bit nops in RISCVAsmBackend::writeNopData if the compressed instruction set extension is enabled (see D41221) This change initially exposed PR35686, which has since been resolved in r321026. Differential Revision: https://reviews.llvm.org/D41349 llvm-svn: 321692	2018-01-03 08:53:05 +00:00
Amara Emerson	9de62130fd	[GlobalISel][Legalizer] Fix legalization of llvm.smul.with.overflow Previously the code for handling G_SMULO didn't properly check for the signed multiply overflow, instead treating it the same as the unsigned G_UMULO. Fixes PR35800. llvm-svn: 321690	2018-01-03 04:56:56 +00:00
Jake Ehrlich	30d927a128	[llvm-objcopy] Add support for visibility I have no clue how this was missed when symbol table support was added. This change ensures that the visibility of symbols is preserved by default. llvm-svn: 321681	2018-01-02 23:01:24 +00:00
Andrew Kaylor	e12e08c680	Handle the case of live 16-bit subregisters in X86FixupBWInsts Differential Revision: https://reviews.llvm.org/D40524 Change-Id: Ie3a405b28503ceae999f5f3ba07a68fa733a2400 llvm-svn: 321674	2018-01-02 21:04:38 +00:00
Sanjay Patel	24e6a8bde0	[AArch64] fix typos in comments; NFC llvm-svn: 321673	2018-01-02 21:04:08 +00:00
Sanjay Patel	7811430588	[ValueTracking] recognize min/max of min/max patterns This is part of solving PR35717: https://bugs.llvm.org/show_bug.cgi?id=35717 The larger IR optimization is proposed in D41603, but we can show the improvement in ValueTracking using codegen tests because SelectionDAG creates min/max nodes based on ValueTracking. Any target with min/max ops should show wins here. I chose AArch64 vector ops because they're clean and uniform. Some Alive proofs for the tests (can't put more than 2 tests in 1 page currently because the web app says it's too long): https://rise4fun.com/Alive/WRN https://rise4fun.com/Alive/iPm https://rise4fun.com/Alive/HmY https://rise4fun.com/Alive/CNm https://rise4fun.com/Alive/LYf llvm-svn: 321672	2018-01-02 20:56:45 +00:00
Sanjay Patel	35a6ee86af	[AArch64] add tests for min/max of min/max (PR35717); NFC llvm-svn: 321668	2018-01-02 20:16:45 +00:00
Amara Emerson	913918cbef	[AArch64][GlobalISel] Fix assert fail with unknown intrinsic. A call may have an intrinsic name but not have a valid intrinsic ID, for example with llvm.invariant.group.barrier. If so, treat it as a normal call like FastISel does. llvm-svn: 321662	2018-01-02 18:56:39 +00:00
Jonas Hahnfeld	9e4431fff0	[opt-viewer] Check for pygments.lexer.c_cpp Some systems still don't have this module which was introduced in version 2.0 (CentOS 7, sigh). Differential Revision: https://reviews.llvm.org/D41611 llvm-svn: 321659	2018-01-02 17:53:08 +00:00
Sanjay Patel	9a80871ffe	[x86] allow pairs of PCMPEQ for vector-sized integer equality comparisons (PR33325) This is an extension of D31156 with the goal that we'll allow memcmp() == 0 expansion for x86 to use 2 pairs of loads per block. The memcmp expansion pass (formerly part of CGP) will generate this kind of pattern with oversized integer compares, so we want to transform these into x86-specific vector nodes before legalization splits things into scalar chunks. See PR33325 for more details: https://bugs.llvm.org/show_bug.cgi?id=33325 Differential Revision: https://reviews.llvm.org/D41618 llvm-svn: 321656	2018-01-02 16:38:29 +00:00
Amara Emerson	854d10d10b	[AArch64][GlobalISel] Enable GlobalISel at -O0 by default Tests updated to explicitly use fast-isel at -O0 instead of implicitly. This change also allows an explicit -fast-isel option to override an implicitly enabled global-isel. Otherwise -fast-isel would have no effect at -O0. Differential Revision: https://reviews.llvm.org/D41362 llvm-svn: 321655	2018-01-02 16:30:47 +00:00
Anna Thomas	bdb9430917	[BasicBlockUtils] Check for unreachable preds before updating LI in UpdateAnalysisInformation Summary: We are incorrectly updating the LI when loop-simplify generates dedicated exit blocks for a loop. The issue is that there's an implicit assumption that the Preds passed into UpdateAnalysisInformation are reachable. However, this is not true and breaks LI by incorrectly updating the header of a loop. One such case is when we generate dedicated exits when the exit block is a landing pad (through SplitLandingPadPredecessors). There maybe other cases as well, since we do not guarantee that Preds passed in are reachable basic blocks. The added test case shows how loop-simplify breaks LI for the outer loop (and DT in turn) after we try to generate the LoopSimplifyForm. Reviewers: davide, chandlerc, sanjoy Reviewed By: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41519 llvm-svn: 321653	2018-01-02 16:25:50 +00:00
Krzysztof Parzyszek	cfe4a3616f	[Hexagon] Fix generation of vector sign extensions llvm-svn: 321650	2018-01-02 15:28:49 +00:00
Daniel Jasper	cc4903e2ba	Revert r321089: "[DAG] Elide overlapping store" (and subsequent fix in r321204) Our internal testing has revealed has discovered bugs in PPC builds. I have forward reproduction instructions to the original author (Nirav). llvm-svn: 321649	2018-01-02 14:38:52 +00:00
Dmitry Venikov	527784b30f	NFC. Add description comments to Function header Reviewers: ruiu, davidxl, silvas, brzycki Reviewed By: brzycki Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D41609 llvm-svn: 321648	2018-01-02 14:13:16 +00:00
Sander de Smalen	c9b3e1cf03	[AArch64][AsmParser] Add isScalarReg() and repurpose isReg() Summary: isReg() in AArch64AsmParser.cpp is a bit of a misnomer, and would be better named 'isScalarReg()' instead. Patch [1/3] in a series to add operand constraint checks for SVE's predicated ADD/SUB. Reviewers: rengolin, mcrosier, evandro, fhahn, echristo Reviewed By: fhahn Subscribers: aemerson, javed.absar, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D41445 llvm-svn: 321646	2018-01-02 13:39:44 +00:00
Simon Pilgrim	39f50e103b	Strip trailing whitespace. NFCI llvm-svn: 321644	2018-01-02 12:41:29 +00:00
Alex Bradbury	8cb894b34b	[RISCV] Add Defs Uses information for c.jal and c.addi4spn Differential Revision: https://reviews.llvm.org/D41339 Patch by Shiva Chen. llvm-svn: 321643	2018-01-02 12:09:29 +00:00
Alex Bradbury	3633d1205f	[RISCV][NFC] Resolve unused variable warning in RISCVISelLowering XLenVT in LowerFormalArguments is used only in an assert. llvm-svn: 321642	2018-01-02 11:54:59 +00:00
Sam Parker	3570c554b5	[DAGCombine] Fix for PR35765 Remove the acceptance of ANY_EXTEND nodes while trying to move and nodes back to loads. Bugzilla: https://bugs.llvm.org/show_bug.cgi?id=35765 Differential Revision: https://reviews.llvm.org/D41625 llvm-svn: 321641	2018-01-02 10:19:01 +00:00
Sam Parker	2dea5e0f3c	[X86] Codegen test for pr35765 Committing reproducer test for pr35765, fix to follow. llvm-svn: 321640	2018-01-02 10:14:00 +00:00
Craig Topper	e3b6bd337a	[SelectionDAG] Teach WidenVecOp_Convert to widen the operation if a widened result type would still be legal. llvm-svn: 321638	2018-01-02 07:30:53 +00:00
Dmitry Venikov	a58d8deb3a	[InstCombine] Missed optimization in math expression: squashing sqrt functions Summary: This patch enables folding under -ffast-math flag sqrt(a) * sqrt(b) -> sqrt(a*b) Reviewers: hfinkel, spatel, davide Reviewed By: spatel, davide Subscribers: davide, llvm-commits Differential Revision: https://reviews.llvm.org/D41322 llvm-svn: 321637	2018-01-02 05:58:11 +00:00
Dmitry Venikov	d2257be8b7	Test commit Reviewers: Quolyk Reviewed By: Quolyk Differential Revision: https://reviews.llvm.org/D41561 llvm-svn: 321636	2018-01-02 05:47:42 +00:00
Craig Topper	3e528c7518	[SelectionDAG] Remove ifs on getTypeAction being TypeWidenVector from some of the WideVecOp handlers. We should only be in the handler if the tyep action is TypeWidenVector. There's no reason to try to do anything else. llvm-svn: 321635	2018-01-02 01:55:07 +00:00
Simon Pilgrim	6720726d27	[ValueTracking] Don't assume shift values are in range Reduced (as best I could...) from oss-fuzz #4857 test case llvm-svn: 321634	2018-01-01 22:44:59 +00:00
Simon Pilgrim	af35f5ec1d	[InstCombine] Regenerate udiv tests. llvm-svn: 321633	2018-01-01 22:27:49 +00:00
Craig Topper	c8898b3640	[X86] Promote vXi1 fp_to_uint/fp_to_sint to vXi32 to avoid scalarization. llvm-svn: 321632	2018-01-01 21:12:18 +00:00
Craig Topper	bb8b79b0a0	[X86] Add test cases for vXi1 fptosi/fptoui. Currently we do a lot of scalarization in these test cases. llvm-svn: 321631	2018-01-01 21:12:10 +00:00
Craig Topper	e5943bb337	[X86] Replace custom lowering of vXi1 SINT_TO_FP/UINT_TO_FP with promotion. The custom lowering was just doing the same thing promotion would do. llvm-svn: 321630	2018-01-01 20:08:43 +00:00
Craig Topper	a4f9997675	[SelectionDAG][X86][AArch64] Require targets to specify the promotion type when using setOperationAction Promote for INT_TO_FP and FP_TO_INT Currently the promotion for these ignores the normal getTypeToPromoteTo and instead just tries to double the element width. This is because the default behavior of getTypeToPromote to just adds 1 to the SimpleVT, which has the affect of increasing the element count while keeping the scalar size the same. If multiple steps are required to get to a legal operation type, int_to_fp will be promoted multiple times. And fp_to_int will keep trying wider types in a loop until it finds one that works. getTypeToPromoteTo does have the ability to query a promotion map to get the type and not do the increasing behavior. It seems better to just let the target specify the promotion type in the map explicitly instead of letting the legalizer iterate via widening. FWIW, it's worth I think for any other vector operations that need to be promoted, we have to specify the type explicitly because the default behavior of getTypeToPromote isn't useful for vectors. The other types of promotion already require either the element count is constant or the total vector width is constant, but neither happens by incrementing the SimpleVT enum. Differential Revision: https://reviews.llvm.org/D40664 llvm-svn: 321629	2018-01-01 19:21:35 +00:00
Sanjay Patel	18962dabb7	[x86] add runs for more vector variants; NFC Preliminary step to see what the effects of D41618 look like. llvm-svn: 321624	2018-01-01 16:36:47 +00:00
Simon Pilgrim	e337268df7	[X86][SSE] Add test case from PR32160 llvm-svn: 321620	2018-01-01 13:04:04 +00:00
Uriel Korach	c06596ced4	[X86] Regenerate test checks in sse-intrinsics-x86-upgrade with update-llc Removing outdated checks. NFC llvm-svn: 321619	2018-01-01 09:00:13 +00:00
Uriel Korach	e87d240699	[X86] Regenerate test checks in sse2-intrinsics-x86-upgrade with update-llc Removing outdated checks. NFC llvm-svn: 321618	2018-01-01 08:47:50 +00:00
Craig Topper	0d35edda90	[X86] In LowerTruncateVecI1, don't add SHL if the input is known to be all sign bits. If the input is all sign bits then the LSB through MSB are all the same so we don't need to be move the LSB to the MSB. llvm-svn: 321617	2018-01-01 04:52:58 +00:00
Craig Topper	694c73adc2	[X86] Add missing NoVLX predicate around some patterns that use zmm registers to implement 128/256-bit operations without VLX. llvm-svn: 321613	2018-01-01 01:11:32 +00:00
Craig Topper	fc3ce4993c	[X86] Add patterns for using zmm registers for v8i32/v8f32 vselect with the false input being zero. We can use zmm move with zero masking for this. We already had patterns for using a masked move, but we didn't check for the zero masking case separately. llvm-svn: 321612	2018-01-01 01:11:29 +00:00
Craig Topper	f78b75fb59	[X86] Use CONCAT_VECTORS instead of INSERT_SUBVECTOR for padding v4i1/v2i1 vector to v8i1 pre-legalize. The CONCAT_VECTORS will be lowered to INSERT_SUBVECTOR later. In the modified cases this seems to be enough to trick a later DAG combine into running in a different order than allows the ANDs to be removed. I'll admit this is a bit of a hack that happens to work, but using CONCAT_VECTORS is more consistent with other legalization code anyway. llvm-svn: 321611	2017-12-31 19:17:52 +00:00
Simon Pilgrim	b000675374	[X86][AVX2] Combine extract(broadcast(scalar_value)) --> scalar_value As it has a scalar source we don't treat it as a target shuffle so needs special handling. llvm-svn: 321610	2017-12-31 18:59:30 +00:00
Simon Pilgrim	e940b86c5f	[X86][AVX] Add test case from PR33740 llvm-svn: 321608	2017-12-31 17:16:48 +00:00
Simon Pilgrim	f205ec716b	[X86][SSE] Don't vectorize splat buildvector of binops (PR30780) Don't combine buildvector(binop(),binop(),binop(),binop()) -> binop(buildvector(), buildvector()) if its a splat - keep the binop scalar and just splat the result to avoid large vector constants. llvm-svn: 321607	2017-12-31 17:07:47 +00:00
Davide Italiano	86b7949f62	[SimplifyCFG] Return to the pass manager the correct value. I wanted to commit this with r321603, but I failed to squash the two commits. llvm-svn: 321606	2017-12-31 16:54:03 +00:00
Davide Italiano	0512bf5af2	[Utils/Local] Use `auto` when the type is obvious. NFCI. llvm-svn: 321605	2017-12-31 16:51:50 +00:00

1 2 3 4 5 ...

158500 Commits