llvm-project

Commit Graph

Author	SHA1	Message	Date
Krzysztof Parzyszek	db60e64036	[Hexagon] Handle additional shuffles that can be made perfect	2020-10-29 19:09:00 -05:00
Craig Topper	74b078294f	[RISCV] Improve worklist management in the DAG combine for SLLW/SRLW/SRAW This combine makes two calls to SimplifyDemandedBits, one for the LHS and one for the RHS. If the LHS call returns true, we don't make the RHS call. When SimplifyDemandedBits makes a change, it will add the nodes around the change to the DAG combiner worklist. If the simplification happens on the first recursion step, the N will get added to the worklist. But if the simplification happens deeper in the recursion, then N will not be revisited until the next time the DAG combiner runs. This patch explicitly addes N to the worklist anytime a Simplification is made. Without this we might miss additional simplifications on the LHS or never simplify the RHS. Special care also needs to be taken to not add N if it has been CSEd by the simplification. There are similar examples in DAGCombiner and the X86 target, but I don't have a test for it for RISC-V. I've also returned SDValue(N, 0) instead of SDValue() so DAGCombiner knows a change was made and will update its Statistic variable. The test here was constructed so that 2 simplifications happen to the LHS. Without this fix one happens in the post type legalization DAG combine and the other happens after LegalizeDAG. This prevents the RHS from ever being simplified causing the left and right shift to clear the upper 32 bits of the RHS to be left behind. Differential Revision: https://reviews.llvm.org/D90339	2020-10-29 14:52:53 -07:00
Nikita Popov	6c2ad4cf87	[SDAG] Extract helper to determine neutral element (NFC) Make the existing VECREDUCE based code more generic, but expressing it in terms of the neutral value of the base opcode instead.	2020-10-29 22:05:06 +01:00
Stefanos Baziotis	a3345300b6	[LCSSA] Doc for special treatment of PHIs Differential Revision: https://reviews.llvm.org/D89739	2020-10-29 22:50:07 +02:00
Nikita Popov	20b386aae0	[LoopUtils] Fix neutral value for vector.reduce.fadd Use -0.0 instead of 0.0 as the start value. The previous use of 0.0 was fine for all existing uses of this function though, as it is always generated with fast flags right now, and thus nsz.	2020-10-29 21:45:13 +01:00
Florian Hahn	1922570489	[SLP] Consider alternatives for cost of select instructions. Some architectures do not have general vector select instructions (e.g. AArch64). But some cmp/select patterns can be vectorized using other instructions/intrinsics. One example is using min/max instructions for certain patterns. This patch updates the cost calculations for selects in the SLP vectorizer to consider using min/max intrinsics. This patch does not change SLP vectorizer's codegen itself to actually generate those intrinsics, but relies on the backends to lower the vector cmps & selects. This keeps things simple on the SLP side and works well in practice for AArch64. This exposes additional SLP vectorization opportunities in some benchmarks on AArch64 (-O3 -flto). Metric: SLP.NumVectorInstructions Program base slp diff test-suite...ications/JM/ldecod/ldecod.test 502.00 697.00 38.8% test-suite...ications/JM/lencod/lencod.test 1023.00 1414.00 38.2% test-suite...-typeset/consumer-typeset.test 56.00 65.00 16.1% test-suite...6/464.h264ref/464.h264ref.test 804.00 822.00 2.2% test-suite...006/453.povray/453.povray.test 3335.00 3357.00 0.7% test-suite...CFP2000/177.mesa/177.mesa.test 2110.00 2121.00 0.5% test-suite...:: External/Povray/povray.test 2378.00 2382.00 0.2% Reviewed By: RKSimon, samparker Differential Revision: https://reviews.llvm.org/D89969	2020-10-29 20:39:50 +00:00
Nikita Popov	a5f172927d	[SDAG] Fix neutral value for vecreduce_fadd The neutral value for FADD is -0.0, not 0.0, so this is what we need to pad vectors with.	2020-10-29 21:27:59 +01:00
Nikita Popov	91bf172088	[SDAG] Extract helper to get vecreduce base opcode (NFC)	2020-10-29 20:22:22 +01:00
Dávid Bolvanský	7a2abf5aca	[InferAttrs] Add nocapture/writeonly to string/mem libcalls One step closer to fix PR47644. Differential Revision: https://reviews.llvm.org/D89645	2020-10-29 20:06:43 +01:00
Craig Topper	22c3837634	[RISCV] Remove include of RISCVRegisterInfo.h from RISCVBaseInfo.h RISCVRegisterInfo.h is part of the CodeGen layer. The Utils library is intended to be shared with the MC layer so shouldn't use files from the CodeGen layer. The register enum names are already available from RISCVMCTargetDesc.h. It appears what was coming from this include was a transitive include of the Register class which I've replaced with MCRegister. Register has a constructor from MCRegister so it should be convertible.	2020-10-29 11:39:19 -07:00
Thomas Lively	be6f50798e	[WebAssembly] Implement SIMD signselect instructions As proposed in https://github.com/WebAssembly/simd/pull/124, using the opcodes adopted by V8 in https://chromium-review.googlesource.com/c/v8/v8/+/2486235/2/src/wasm/wasm-opcodes.h. Uses new builtin functions and a new target intrinsic exclusively to ensure that the new instructions are only emitted when a user explicitly opts in to using them since they are still in the prototyping and evaluation phase. Differential Revision: https://reviews.llvm.org/D90357	2020-10-29 11:06:20 -07:00
Jay Foad	9cee87d72a	[AMDGPU] Fix double space in disassembly of ds_gws_sema_* with gds By setting up the AsmStrings correctly we can remove some special cases from AMDGPUInstPrinter::printOffset. Differential Revision: https://reviews.llvm.org/D90307	2020-10-29 17:31:59 +00:00
Fangrui Song	8f8b5e5587	[MC] Error for .globl/.local which change the symbol binding and warn for .weak GNU as let .weak override .globl since binutils-gdb 5ca547dc2399a0a5d9f20626d4bf5547c3ccfddd (1996) while MC lets the last directive win (PR38921). This caused an issue to Linux's powerpc port which has been fixed by http://git.kernel.org/linus/968339fad422a58312f67718691b717dac45c399 Binding overriding is error-prone. This patch disallows a changed binding. (https://sourceware.org/pipermail/binutils/2020-March/000299.html ) Our behavior regarding `.globl x; .weak x` matches GNU as. Such usage is still suspicious but we issue a warning for now. We may upgrade it to an error in the future. Reviewed By: jhenderson, nickdesaulniers Differential Revision: https://reviews.llvm.org/D90108	2020-10-29 09:03:56 -07:00
Jay Foad	58de4b2053	[AMDGPU] Use pseudo instructions for readlane/writelane This reverts r227987 "R600/SI: Determine target-specific encoding of READLANE and WRITELANE early v2". All the codegen changes are caused by the post-RA scheduler no longer treating readlane/writelane as scheduling barriers due to having unmodelled side effects. (The pseudos are hasSideEffects = 0, but the real instructions are hasSideEffects = ? which TableGen conservatively treats as 1.) Differential Revision: https://reviews.llvm.org/D90401	2020-10-29 16:00:53 +00:00
Simon Pilgrim	dcb3dc101d	[InstCombine] visitShl - ensure inner shifts have inrange amounts Noticed when fixing OSS Fuzz #26716	2020-10-29 15:28:15 +00:00
Nicholas Guy	eb9fe24eaf	[ARM] Fix IT block generation after Thumb2SizeReduce with -Oz Fixes a regression caused by D82439, in which IT blocks were no longer being generated when -Oz is present. Differential Revision: https://reviews.llvm.org/D88496	2020-10-29 15:17:31 +00:00
Jay Foad	7a79921edd	[AMDGPU] Remove gds operand from ds_gws_* MachineInstrs The operand value was always 1 (except in some bad MIR tests) so it was redundant. Differential Revision: https://reviews.llvm.org/D90378	2020-10-29 15:04:23 +00:00
Jay Foad	a442fad911	[AMDGPU] Fix double space in disassembly of s_set_gpr_idx_mode Differential Revision: https://reviews.llvm.org/D90374	2020-10-29 14:54:33 +00:00
Jay Foad	e9dd2c4fe2	[AMDGPU] Fix double space in disassembly of some DPP instructions Differential Revision: https://reviews.llvm.org/D90373	2020-10-29 14:54:33 +00:00
Kazushi (Jam) Marukawa	58a6b7bcde	[VE] Add missing BCR format Add missing "BCR %sy, 0, target" format instruction and a regression test for this format. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D90387	2020-10-29 23:30:49 +09:00
Kazushi (Jam) Marukawa	07d1996601	[VE] Support register aliases in llvm-mc Support register aliases in MC layer to compile existing assembly files with clang and integrated assembler. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D90383	2020-10-29 23:28:32 +09:00
dfukalov	b3cdaef518	[MIR] Fix out of bounds access in MIRPrinter. Fixes: SWDEV-256460 Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D90239	2020-10-29 14:35:06 +03:00
Max Kazantsev	3fc601b641	[NFC][SCEV] Use generic predicate checkers to simplify code	2020-10-29 18:12:28 +07:00
Jay Foad	69f5105f5c	[AMDGPU] Simplify insertNoops functions. NFC.	2020-10-29 10:55:20 +00:00
Alok Kumar Sharma	930a8c60b6	[DebugInfo] [NFCI] Adding a missed out line in support for DW_TAG_generic_subrange. This commit adds a missed out line in earlier commit for DW_TAG_generic_subrange. Previous commit ID: `a6dd01afa3` Differential Revision: https://reviews.llvm.org/D89218 Thanks markus for pointing this out.	2020-10-29 16:18:20 +05:30
Florian Hahn	88d6421e4c	[SCEV] Match 'zext (trunc A to iB) to iY' as URem. URem operations with constant power-of-2 second operands are modeled as such. This patch on its own has very little impact (e.g. no changes in CodeGen for MultiSource/SPEC2000/SPEC2006 on X86 -O3 -flto), but I'll soon post follow-up patches that make use of it to more accurately determine the trip multiple. Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D89821	2020-10-29 10:46:52 +00:00
Kazushi (Jam) Marukawa	9c82944b2d	[VE] Add vector control instructions Add LVL/SVL/SMVL/LVIX isntructions. Add regression tests too. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D90355	2020-10-29 19:24:31 +09:00
Max Kazantsev	ef129f01e9	[SCEV][NFC] Use general predicate checkers in monotonicity check This makes the code more compact and readable.	2020-10-29 16:45:52 +07:00
Georgii Rymar	fcf6287916	[yaml2obj] - Improve handling of SectionHeaderTable::NoHeaders flag. When `NoHeaders` is set, we still have following issues: 1) We emit the `.shstrtab` implicit section of size 1 (empty string table). 2) We still align the start of the section header table, what affects the output size. 3) We still write section header table bytes. This patch fixes all of these issues. Differential revision: https://reviews.llvm.org/D90295	2020-10-29 12:16:52 +03:00
David Green	a4b6b1e1c8	[InterleaveAccess] Recognise Interleave loads through binary operations Instcombine will currently sink identical shuffles though vector binary operations. This is probably generally useful, but can break up the code pattern we use to represent an interleaving load group. This patch reverses that in the InterleaveAccessPass to re-recognise the pattern of shuffles sunk past binary operations and folds them back if an interleave group can be created. Differential Revision: https://reviews.llvm.org/D89489	2020-10-29 09:13:23 +00:00
Max Kazantsev	a5b2e795c3	[NFC][SCEV] Refactor monotonic predicate checks to return enums instead of bools This patch gets rid of output parameter which is not needed for most users and prepares this API for further refactoring.	2020-10-29 16:01:25 +07:00
Johannes Doerfert	d39f574dcc	[Attributor][FIX] Properly promote arguments pointers to arrays When we promote pointer arguments we did compute a wrong offset and use a wrong type for the array case. Bug reported and reduced by Whitney Tsang <whitneyt@ca.ibm.com>.	2020-10-29 00:45:32 -05:00
Ben Shi	076a8d915b	[NFC][AVR] Improve device list Reviewed By: dylanmckay https://reviews.llvm.org/D87968	2020-10-29 10:54:17 +08:00
Fangrui Song	39856d5d0b	[Debugify] Move global namespace functions into llvm:: Also move exportDebugifyStats from tools/opt to Debugify.cpp	2020-10-28 19:11:41 -07:00
Vedant Kumar	ffba94a9ac	Revert "[DebugInfo] Fix legacy ZExt emission when FromBits >= 64 (PR47927)" This reverts commit `9905346221`. It breaks the compiler-rt build, see https://reviews.llvm.org/D89838	2020-10-28 18:57:17 -07:00
Vedant Kumar	4fe81b6b6a	Revert "[DebugInfo] Shorten legacy [s\|z]ext dwarf expressions" This reverts commit `2ce36ebca5`. It depends on https://reviews.llvm.org/D89838, which needs to be reverted.	2020-10-28 18:57:17 -07:00
Mircea Trofin	735ab4be35	[ThinLTO] Fix .llvmcmd emission llvm::EmbedBitcodeInModule needs (what used to be called) EmbedMarker set, in order to emit .llvmcmd. EmbedMarker is really about embedding the command line, so renamed the parameter accordingly, too. This was not caught at test because the check-prefix was incorrect, but FileCheck does not report that when multiple prefixes are provided. A separate patch will address that. Differential Revision: https://reviews.llvm.org/D90278	2020-10-28 17:45:30 -07:00
Derek Schuff	77973f8dee	[WebAssembly] Add support for DWARF type units Since Wasm comdat sections work similarly to ELF, we can use that mechanism to eliminate duplicate dwarf type information in the same way. Differential Revision: https://reviews.llvm.org/D88603	2020-10-28 17:41:22 -07:00
Kazushi (Jam) Marukawa	7942960199	[VE] Add vector mask operation instructions Add VFMK/VFMS/VFMF/ANDM/ORM/XORM/EQVM/NNDM/NEGM/PCVM/LZVM/TOVM isntructions. Add regression tests too. Also add new patterns to parse VFMK/VFMS/VFMF mnemonics. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D90297	2020-10-29 08:42:41 +09:00
Amy Huang	7669f3c0f6	Recommit "[CodeView] Emit static data members as S_CONSTANTs." We used to only emit static const data members in CodeView as S_CONSTANTS when they were used; this patch makes it so they are always emitted. This changes CodeViewDebug.cpp to find the static const members from the class debug info instead of creating DIGlobalVariables in the IR whenever a static const data member is used. Bug: https://bugs.llvm.org/show_bug.cgi?id=47580 Differential Revision: https://reviews.llvm.org/D89072 This reverts commit `504615353f`.	2020-10-28 16:35:59 -07:00
Austin Kerbow	de51867343	[AMDGPU] Add Reset function to GCNHazardRecognizer Reset the tracked emitted instructions when starting scheduling on a new region. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D90347	2020-10-28 16:32:32 -07:00
Gaurav Jain	f719fd7ade	[NFC] Use [MC]Register in CSE & LICM Differential Revision: https://reviews.llvm.org/D90327	2020-10-28 15:53:26 -07:00
Craig Disselkoen	c3783847ae	C API: support scalable vectors This adds support for scalable vector types in the C API and in llvm-c-test, and also adds a test to ensure that llvm-c-test can properly roundtrip operations involving scalable vectors. While creating this diff, I discovered that the C API cannot properly roundtrip _constant expressions_ involving shufflevector / scalable vectors, but that seems to be a separate enough issue that I plan to address it in a future diff (unless reviewers feel it should be addressed here). Differential Revision: https://reviews.llvm.org/D89816	2020-10-28 18:19:34 -04:00
Jay Foad	5b91a6a88b	[AMDGPU] Allow some modifiers on VOP3B instructions V_DIV_SCALE_F32/F64 are VOP3B encoded so they can't use the ABS src modifier, but they can still use NEG and the usual output modifiers. This partially reverts `3b99f12a4e` "AMDGPU: Remove modifiers from v_div_scale_*". Differential Revision: https://reviews.llvm.org/D90296	2020-10-28 21:54:14 +00:00
Florian Hahn	53f4c4b2cc	[InstCombine] Do not introduce bitcasts for swifterror arguments. The following constraints hold for swifterror values: A swifterror value (either the parameter or the alloca) can only be loaded and stored from, or used as a swifterror argument. This patch updates instcombine to not try to convert a bitcast of a function into a bitcast of a swifterror argument. Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D90258	2020-10-28 21:52:12 +00:00
Jay Foad	50ee22d791	[AMDGPU] Fix double space in disassembly of SDWA instructions with vcc Differential Revision: https://reviews.llvm.org/D90317	2020-10-28 21:39:39 +00:00
Florian Hahn	772aaa6023	[AArch64] Improve lowering of insert_vector_elt with 0.0 consts. When moving +0.0 into a float vector, we can use to vi*gpr variants of INS. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D90176	2020-10-28 21:35:33 +00:00
Austin Kerbow	8b127a8661	[AMDGPU] Fix inserting combined s_nop in bundles Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D90334	2020-10-28 14:34:04 -07:00
Philip Reames	4e4abd16a7	[Deref] Use maximum trip count instead of exact trip count When trying to prove that a memory access touches only dereferenceable memory across all iterations of a loop, use the maximum exit count rather than an exact one. In many cases we can't prove exact exit counts whereas we can prove an upper bound. The test included is for a single exit loop with a min(C,V) exit count, but the true motivation is support for multiple exits loops. It's just really hard to write a test case for multiple exits because the vectorizer (the primary user of this API), bails far before this. For multiple exits, this allows a mix of analyzeable and unanalyzable exits when only analyzeable exits are needed to prove deref.	2020-10-28 14:33:30 -07:00
Dávid Bolvanský	49cddb90f6	[MemLoc] Adjust memccpy support in MemoryLocation::getForArgument Use LocationSize::upperBound instead of precise since we only know an upper bound on the number of bytes read/written. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D89885	2020-10-28 21:26:10 +01:00

1 2 3 4 5 ...

140615 Commits