llvm-project

Commit Graph

Author	SHA1	Message	Date
Valentin Clement	40626184cf	[flang][NFC] Unify OpenMP and OpenACC structure checker This patch remove duplicated code between the check-omp-structure and the check-acc-structure and unify it into a check-directive-structure templated class. Reviewed By: kiranchandramohan, sscalpone, ichoyjx Differential Revision: https://reviews.llvm.org/D85104	2020-08-05 14:25:49 -04:00
Evgenii Stepanov	f2c0423995	[msan] Remove readnone and friends from call sites. MSan removes readnone/readonly and similar attributes from callees, because after MSan instrumentation those attributes no longer apply. This change removes the attributes from call sites, as well. Failing to do this may cause DSE of paramTLS stores before calls to readonly/readnone functions. Differential Revision: https://reviews.llvm.org/D85259	2020-08-05 10:34:45 -07:00
Simon Pilgrim	b60f998859	[X86][SSE] Fold 128-bit PACK(EXTEND(X),EXTEND(Y)) -> CONCAT(X,Y) subvectors This is seen in the sub-128-bit vector trunc(ext()) of comparison results Fixes pr46585.ll regression in D66004	2020-08-05 18:27:40 +01:00
Jordan Rupprecht	3c39db0c44	Revert "[LoopVectorizer] Inloop vector reductions" This reverts commit `e9761688e4`. It breaks the build: ``` ~/src/llvm-project/llvm/lib/Analysis/IVDescriptors.cpp:868:10: error: no viable conversion from returned value of type 'SmallVector<[...], 8>' to function return type 'SmallVector<[...], 4>' return ReductionOperations; ```	2020-08-05 10:24:15 -07:00
David Green	e9761688e4	[LoopVectorizer] Inloop vector reductions Arm MVE has multiple instructions such as VMLAVA.s8, which (in this case) can take two 128bit vectors, sign extend the inputs to i32, multiplying them together and sum the result into a 32bit general purpose register. So taking 16 i8's as inputs, they can multiply and accumulate the result into a single i32 without any rounding/truncating along the way. There are also reduction instructions for plain integer add and min/max, and operations that sum into a pair of 32bit registers together treated as a 64bit integer (even though MVE does not have a plain 64bit addition instruction). So giving the vectorizer the ability to use these instructions both enables us to vectorize at higher bitwidths, and to vectorize things we previously could not. In order to do that we need a way to represent that the reduction operation, specified with a llvm.experimental.vector.reduce when vectorizing for Arm, occurs inside the loop not after it like most reductions. This patch attempts to do that, teaching the vectorizer about in-loop reductions. It does this through a vplan recipe representing the reductions that the original chain of reduction operations is replaced by. Cost modelling is currently just done through a prefersInloopReduction TTI hook (which follows in a later patch). Differential Revision: https://reviews.llvm.org/D75069	2020-08-05 18:14:05 +01:00
Roman Lebedev	3a3c9519e2	[InstCombine] Negator: 0 - (X + Y) --> (-X) - Y iff a single operand negated This was the most obvious regression in f5df5cd5586ae9cfb2d9e53704dfc76f47aff149.f5df5cd5586ae9cfb2d9e53704dfc76f47aff149 We really don't want to do this if the original/outermost subtraction isn't a negation, and therefore doesn't go away - just sinking negation isn't a win. We are actually appear to be missing folds so hoist it. https://rise4fun.com/Alive/tiVe	2020-08-05 20:01:13 +03:00
Roman Lebedev	26f79e258f	[NFC][InstCombine] Tests for negation of `add` w/ single negatible operand	2020-08-05 20:01:13 +03:00
Sanjay Patel	719954eacb	[InstSimplify] add tests for icmp with 'mul nuw' operand; NFC	2020-08-05 12:46:45 -04:00
Lang Hames	47cfffe893	[JITLink][AArch64] Handle addends on PAGE21 / PAGEOFF12 relocations.	2020-08-05 08:50:46 -07:00
Simon Pilgrim	a57bfb44bc	[X86][AVX] Fold CONCAT(HOP(X,Y),HOP(Z,W)) -> HOP(CONCAT(X,Z),CONCAT(Y,W)) for integer types	2020-08-05 15:09:51 +01:00
Denis Antrushin	d21ce40821	[Statepoints] Operand folding in presense of tied registers. Implement proper folding of statepoint meta operands (deopt and GC) when statepoint uses tied registers. For deopt operands it is just about properly preserving tiedness in new instruction. For tied GC operands folding is a little bit more tricky. We can fold tied GC operands only from InlineSpiller, because it knows how to properly reload tied def after it was turned into memory operand. Other users (e.g. peephole) cannot properly fold such operands as they do not know how (or when) to reload them from memory. We do this by un-tieing operand we want to fold in InlineSpiller and allowing to fold only untied operands in foldPatchpoint.	2020-08-05 20:18:28 +07:00
Roman Lebedev	f5df5cd558	Recommit "[InstCombine] Negator: -(X << C) --> X * (-1 << C)" This reverts commit `ac70b37a00` which reverted commit `8aeb2fe13a` because codegen tests got broken and i needed time to investigate. This shows some regressions in tests, but they are all around GEP's, so i'm not really sure how important those are. https://rise4fun.com/Alive/1Gn	2020-08-05 15:59:13 +03:00
Sam Parker	f2675ab45f	[ARM][CostModel] Implement getCFInstrCost As with other targets, set the throughput cost of control-flow instructions to free so that we don't miss out of vectorization opportunities. Differential Revision: https://reviews.llvm.org/D85283	2020-08-05 12:44:51 +01:00
Xing GUO	bd7f3f8a3e	[obj2yaml] Add support for dumping the .debug_aranges section. This patch adds support for dumping DWARF sections to obj2yaml. The .debug_aranges section is used to illustrate the basic idea. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D85094	2020-08-05 19:19:05 +08:00
Simon Pilgrim	300899b9c4	[X86][AVX] Add test showing unnecessary duplicate HADD instructions Taken from internal fuzz test	2020-08-05 12:00:27 +01:00
Hans Wennborg	3ab01550b6	Revert "[CMake] Simplify CMake handling for zlib" This quietly disabled use of zlib on Windows even when building with -DLLVM_ENABLE_ZLIB=FORCE_ON. > Rather than handling zlib handling manually, use find_package from CMake > to find zlib properly. Use this to normalize the LLVM_ENABLE_ZLIB, > HAVE_ZLIB, HAVE_ZLIB_H. Furthermore, require zlib if LLVM_ENABLE_ZLIB is > set to YES, which requires the distributor to explicitly select whether > zlib is enabled or not. This simplifies the CMake handling and usage in > the rest of the tooling. > > This is a reland of `abb0075` with all followup changes and fixes that > should address issues that were reported in PR44780. > > Differential Revision: https://reviews.llvm.org/D79219 This reverts commit `10b1b4a231` and follow-ups `64d99cc6ab` and `f9fec0447e`.	2020-08-05 12:31:44 +02:00
Paul Walker	927fc536ca	[SVE] Add lowering for fixed length vector and, or & xor operations. Since there are no ill effects when performing these operations with undefined elements, they are lowered to the already supported unpredicated scalable vector equivalents. Differential Revision: https://reviews.llvm.org/D85117	2020-08-05 11:28:34 +01:00
Simon Pilgrim	4aaf301fb8	[DAG] Fold vector (aext (load x)) -> (zext (truncate (zextload x))) We currently don't do anything to fold any_extend vector loads as no target has such an instruction. Instead I've added support for folding to a zextload, SimplifyDemandedBits does a good job of adjusting the zext(truncate(()) stages as required later on. We still need the custom scalar extload handling instead of using the tryToFoldExtOfLoad helper as it has different legality tests - we can probably tweak that to reduce most of the code duplication. Fixes the regression I mentioned in rG99a971cadff7 Differential Revision: https://reviews.llvm.org/D85129	2020-08-05 11:22:23 +01:00
Georgii Rymar	f97019ad6e	[llvm-readobj/elf] - Add a testing for --stackmap and refine the implementation. Currently, we only test the `--stackmap` option here: https://github.com/llvm/llvm-project/blob/master/llvm/test/Object/stackmap-dump.test it uses a precompiled MachO binary currently and I've found no tests for this option for ELF. The implementation also has issues. For example, it might assert on a wrong version of the .llvm-stackmaps section. Or it might crash on an empty or truncated section. This patch introduces a new tools/llvm-readobj/ELF test file as well as implements a few basic checks to catch simple crashes/issues It also eliminates `unwrapOrError` calls in `printStackMap()`. Differential revision: https://reviews.llvm.org/D85208	2020-08-05 13:09:04 +03:00
Benjamin Kramer	c558c22cab	[llvm-symbolizer] Add legacy aliases -demangle=true and -demangle=false. This is used in the wild, don't break compatibility for no good reason. https://github.com/google/pprof/blob/master/internal/binutils/addr2liner_llvm.go	2020-08-05 12:07:46 +02:00
Sander de Smalen	f2916636f8	[AArch64][SVE] Disable tail calls if callee does not preserve SVE regs. This fixes an issue triggered by the following code, where emitEpilogue got confused when trying to restore the SVE registers after the call, whereas the call to bar() is implemented as a TCReturn: int non_sve(); int sve(svint32_t x) { return non_sve(); } Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D84869	2020-08-05 09:38:54 +01:00
Jay Foad	8cbf4a17ac	[AMDGPU] Propagate fast math flags in frem lowering Differential Revision: https://reviews.llvm.org/D84518	2020-08-05 09:09:38 +01:00
Jay Foad	1bb07e1b91	[AMDGPU] Precommit tests for D84518 Propagate fast math flags in frem lowering	2020-08-05 09:09:02 +01:00
Jay Foad	04cf4a5a65	[AMDGPU] Lower frem f16 Without this it would fail to select on subtargets that have 16-bit instructions. Differential Revision: https://reviews.llvm.org/D84517	2020-08-05 09:08:40 +01:00
Yevgeny Rouban	bc10888dcd	DomTree: Make PostDomTree indifferent to block successors swap Fixed the commit `c35585e209`. This is a fix for the bug 46098 where PostDominatorTree is unexpectedly changed by InstCombine's branch swapping transformation. This patch fixes PostDomTree builder. While looking for the furthest away node in a reverse unreachable subgraph this patch runs DFS with successors in their function order. This order is indifferent to the order of successors, so is the furthest away node. Reviewers: kuhar, nikic, lebedev.ri Differential Revision: https://reviews.llvm.org/D84763	2020-08-05 14:26:32 +07:00
Martin Storsjö	b989fcbae6	[llvm-rc] Allow string table values split into multiple string literals This can practically easily be a product of combining strings with macros in resource files. This fixes https://github.com/mstorsjo/llvm-mingw/issues/140. As string literals within llvm-rc are handled as StringRefs, each referencing an uninterpreted slice of the input file, with actual interpretation of the input string (codepage handling, unescaping etc) done only right before writing them out to disk, it's hard to concatenate them other than just bundling them up in a vector, without rearchitecting a large part of llvm-rc. This matches how the same already is supported in VersionInfoValue, with a std::vector<IntOrString> Values. MS rc.exe only supports concatenated string literals in version info values (already supported), string tables (implemented in this patch) and user data resources (easily implemented in a separate patch, but hasn't been requested by any end user yet), while GNU windres supports string immediates split into multiple strings anywhere (e.g. like (100 ICON "myicon" ".ico"). Not sure if concatenation in other statements actually is used in the wild though, in resource files normally built by GNU windres. Differential Revision: https://reviews.llvm.org/D85183	2020-08-05 08:59:32 +03:00
Juneyoung Lee	e0d99e9aaf	[JumpThreading] Consider freeze as a zero-cost instruction This is a simple patch that makes freeze as a zero-cost instruction, as bitcast already is. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D85023	2020-08-05 14:42:36 +09:00
Juneyoung Lee	3401f9706b	[JumpThreading] Add a test for D85023; NFC	2020-08-05 13:40:14 +09:00
Mehdi Amini	1366d66a22	Revert "DomTree: Make PostDomTree immune to block successors swap" This reverts commit `c35585e209`. The MLIR is broken with this patch, reproduce by adding -DLLVM_ENABLE_PROJECTS=mlir to the cmake configuration and build `ninja tools/mlir/lib/IR/CMakeFiles/obj.MLIRIR.dir/Dominance.cpp.o`	2020-08-05 04:32:44 +00:00
Yevgeny Rouban	c35585e209	DomTree: Make PostDomTree immune to block successors swap This is another fix for the bug 46098 where PostDominatorTree is unexpectedly changed by InstCombine's branch swapping transformation. This patch fixes PostDomTree builder. While looking for the furthest away node in a reverse unreachable subgraph this patch runs DFS with successors in their function order. This order is indifferent to the order of successors, so is the furthest away node. Reviewers: kuhar, nikic, lebedev.ri Differential Revision: https://reviews.llvm.org/D84763	2020-08-05 11:06:54 +07:00
Zequan Wu	e3df947175	[llvm-cov] reset executation count to 0 after wrapped segment Fix the bug: https://bugs.llvm.org/show_bug.cgi?id=36979. It also fixes this bug: https://bugs.llvm.org/show_bug.cgi?id=35404, which I think is caused by the same problem. Differential Revision: https://reviews.llvm.org/D85036	2020-08-04 18:38:44 -07:00
Vitaly Buka	fe74f731e7	[StackSafety,NFC] Add combined index test Missing file for the previous patch	2020-08-04 18:31:58 -07:00
Vitaly Buka	03a822f7a6	[StackSafety,NFC] Add combined index test	2020-08-04 17:40:40 -07:00
Roman Lebedev	ac70b37a00	Revert "[InstCombine] Negator: -(X << C) --> X * (-1 << C)" Breaks codegen tests, will recommit later. This reverts commit `8aeb2fe13a`.	2020-08-05 03:19:38 +03:00
Roman Lebedev	8aeb2fe13a	[InstCombine] Negator: -(X << C) --> X * (-1 << C) This shows some regressions in tests, but they are all around GEP's, so i'm not really sure how important those are. https://rise4fun.com/Alive/1Gn	2020-08-05 03:13:14 +03:00
Roman Lebedev	8fd57b06a4	[NFC][InstCombine] Fix value names (s/%tmp/%i/) and autogenerate a few tests being affected by negator change	2020-08-05 03:12:14 +03:00
Roman Lebedev	3f3303324e	[NFC][InstCombine] Negator: add tests for negation of left-shift by constant	2020-08-05 03:12:14 +03:00
Yonghong Song	00602ee7ef	BPF: simplify IR generation for __builtin_btf_type_id() This patch simplified IR generation for __builtin_btf_type_id(). For __builtin_btf_type_id(obj, flag), previously IR builtin looks like if (obj is a lvalue) llvm.bpf.btf.type.id(obj.ptr, 1, flag) !type else llvm.bpf.btf.type.id(obj, 0, flag) !type The purpose of the 2nd argument is to differentiate __builtin_btf_type_id(obj, flag) where obj is a lvalue vs. __builtin_btf_type_id(obj.ptr, flag) Note that obj or obj.ptr is never used by the backend and the `obj` argument is only used to derive the type. This code sequence is subject to potential llvm CSE when - obj is the same .e.g., nullptr - flag is the same - metadata type is different, e.g., typedef of struct "s" and strust "s". In the above, we don't want CSE since their metadata is different. This patch change IR builtin to llvm.bpf.btf.type.id(seq_num, flag) !type and seq_num is always increasing. This will prevent potential llvm CSE. Also report an error if the type name is empty for remote relocation since remote relocation needs non-empty type name to do relocation against vmlinux. Differential Revision: https://reviews.llvm.org/D85174	2020-08-04 16:29:42 -07:00
Adrian Prantl	bf82ff61a6	Teach SROA to handle allocas with more than one dbg.declare. It is technically legal for optimizations to create an alloca that is used by more than one dbg.declare, if one or both of them are inlined instances of aliasing variables. Differential Revision: https://reviews.llvm.org/D85172	2020-08-04 15:54:51 -07:00
Arthur Eubanks	f50b3ff02e	[Hexagon] Use InstSimplify instead of ConstantProp This is the last remaining use of ConstantProp, migrate it to InstSimplify in the goal of removing ConstantProp. Add -hexagon-instsimplify option to enable skipping of instsimplify in tests that can't handle the extra optimization. Differential Revision: https://reviews.llvm.org/D85047	2020-08-04 15:42:39 -07:00
Eli Friedman	4a47f1c4ce	[SelectionDAG][SVE] Support scalable vectors in getConstantFP() Differential Revision: https://reviews.llvm.org/D85249	2020-08-04 15:32:43 -07:00
Matt Arsenault	89011fc3c9	AMDGPU/GlobalISel: Select llvm.returnaddress	2020-08-04 17:14:38 -04:00
Matt Arsenault	f8fb7835d6	GlobalISel: Add utilty for getting function argument live ins Get the argument register and ensure there's a copy to the virtual register. AMDGPU and AArch64 have similarish code to get the livein value, and I also want to use this in multiple places. This is a bit more aggressive about setting the register class than the original function, but that's probably OK. I think we're missing a few verifier checks for function live ins. I noticed AArch64's calling convention code is not actually adding liveins to functions, only the entry block (which apparently might not matter that much?). There should probably be a verifier check that entry block live ins are also live into the function. We also might need a verifier check that the copy to the livein virtual register is in the entry block.	2020-08-04 16:55:55 -04:00
Eli Friedman	95efea4b93	[AArch64][SVE] Widen narrow sdiv/udiv operations. The SVE instruction set only supports sdiv/udiv for 32-bit and 64-bit integers. If we see an 8-bit or 16-bit divide, widen the operands to 32 bits, and narrow the result. Differential Revision: https://reviews.llvm.org/D85170	2020-08-04 13:22:15 -07:00
AK	f0f68c6e6c	[HotColdSplit] Add test case for unlikely attribute in outlined function Differential Revision: https://reviews.llvm.org/D85232	2020-08-04 13:16:33 -07:00
Ilya Leoshkevich	153df1373e	[SanitizerCoverage] Fix types of __stop* and __start* symbols If a section is supposed to hold elements of type T, then the corresponding CreateSecStartEnd()'s Ty parameter represents T. Forwarding it to GlobalVariable constructor causes the resulting GlobalVariable's type to be T, and its SSA value type to be T**, which is one indirection too many. This issue is mostly masked by pointer casts, however, the global variable still gets an incorrect alignment, which causes SystemZ to choose wrong instructions to access the section.	2020-08-04 21:53:27 +02:00
Cameron McInally	0f2b47b6da	[FastISel] Don't transform FSUB(-0, X) -> FNEG(X) in FastISel This corresponds with the SelectionDAGISel change in D84056. Also, rename some poorly named tests in CodeGen/X86/fast-isel-fneg.ll with NFC. Differential Revision: https://reviews.llvm.org/D85149	2020-08-04 14:42:53 -05:00
Yonghong Song	6d218b4adb	BPF: support type exist/size and enum exist/value relocations Four new CO-RE relocations are introduced: - TYPE_EXISTENCE: whether a typedef/record/enum type exists - TYPE_SIZE: the size of a typedef/record/enum type - ENUM_VALUE_EXISTENCE: whether an enum value of an enum type exists - ENUM_VALUE: the enum value of an enum type These additional relocations will make CO-RE bpf programs more adaptive for potential kernel internal data structure changes. Differential Revision: https://reviews.llvm.org/D83878	2020-08-04 12:35:39 -07:00
Fangrui Song	0729a77280	[llvm-symbolizer][test] Fix pdb/pdb.test after D83530 This is a Windows only test which requires HAVE_DIA_SDK, so I failed to notice it.	2020-08-04 12:23:39 -07:00
Matt Arsenault	3e16e2152c	GlobalISel: Handle llvm.localescape This one is pretty easy and shrinks the list of unhandled intrinsics. I'm not sure how relevant the insert point is. Using the insert position of EntryBuilder will place this after constants. SelectionDAG seems to end up emitting these after argument copies and before anything else, but I don't think it really matters. This also ends up emitting these in the opposite order from SelectionDAG, but I don't think that matters either. This also needs a fix to stop the later passes dropping this as a dead instruction. DeadMachineInstructionElim's version of isDead special cases LOCAL_ESCAPE for some reason, and I'm not sure why it's excluded from MachineInstr::isLabel (or why isDead doesn't check it). I also noticed DeadMachineInstructionElim never considers inline asm as dead, but GlobalISel will drop asm with no constraints.	2020-08-04 15:19:02 -04:00

1 2 3 4 5 ...

73811 Commits