llvm-project

Commit Graph

Author	SHA1	Message	Date
Roman Lebedev	b8121b2e62	[SimplifyCFG] Teach SinkCommonCodeFromPredecessors() to preserve DomTree	2020-12-30 00:48:11 +03:00
Roman Lebedev	18c407bf4c	[SimplifyCFG] Teach HoistThenElseCodeToIf() to preserve DomTree	2020-12-30 00:48:10 +03:00
Roman Lebedev	fe9bdd9621	[SimplifyCFG] Teach SimplifyEqualityComparisonWithOnlyPredecessor() to preserve DomTree, part 2	2020-12-30 00:48:10 +03:00
Roman Lebedev	6027e05dbf	[SimplifyCFG] Teach SimplifyEqualityComparisonWithOnlyPredecessor() to preserve DomTree, part 1	2020-12-30 00:48:10 +03:00
James Y Knight	145cbef587	Copy demangle changes from libcxxabi to llvm with cp_to_llvm.sh. This includes changes from these commits: `5641b1dfdd` `8d31392753`	2020-12-29 16:18:10 -05:00
Sanjay Patel	8d18bc8e6d	[Utils] reduce code in createTargetReduction(); NFC The switch duplicated the translation in getRecurrenceBinOp(). This code is still weird because it translates to the TTI ReductionFlags for min/max, but then createSimpleTargetReduction() converts that back to RecurrenceDescriptor::MinMaxRecurrenceKind.	2020-12-29 15:56:19 -05:00
Stella Laurenzo	df7ddeea66	[mlir][python] Add FlatSymbolRef attribute. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D93909	2020-12-29 12:24:28 -08:00
Sanjay Patel	21a3a0225d	[SLP] replace local reduction enum with RecurrenceKind; NFCI I'm not sure if the SLP enum was created before the IVDescriptor RecurrenceDescriptor / RecurrenceKind existed, but the code in SLP is now redundant with that class, so it just makes things more complicated to have both. We eventually call LoopUtils createSimpleTargetReduction() to create reduction ops, so we might as well standardize on those enum names. There's still a question of whether we need to use TTI::ReductionFlags vs. MinMaxRecurrenceKind, but that can be another clean-up step. Another option would just be to flatten the enums in RecurrenceDescriptor into a single enum. There isn't much benefit (smaller switches?) to having a min/max subset.	2020-12-29 14:52:11 -05:00
Fraser Cormack	f7f09e2b1c	[RISCV] Fill out basic integer RVV ISel patterns This complements the existing RVV ISel patterns for arithmetic, bitwise and shifts with the remaining operations in those categories: sub, and, xor, sra. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93852	2020-12-29 19:32:18 +00:00
Juneyoung Lee	278aa65cc4	[IR] Let IRBuilder's CreateVectorSplat/CreateShuffleVector use poison as placeholder This patch updates IRBuilder to create insertelement/shufflevector using poison as a placeholder. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D93793	2020-12-30 04:21:04 +09:00
Andrew Litteken	6df161a2fb	[IROutliner] Adding a cost model, and debug option to turn the model off. This adds a cost model that takes into account the total number of machine instructions to be removed from each region, the number of instructions added by adding a new function with a set of instructions, and the instructions added by handling arguments. Tests not adding flags: llvm/test/Transforms/IROutliner/outlining-cost-model.ll Reviewers: jroelofs, paquette Differential Revision: https://reviews.llvm.org/D87299	2020-12-29 12:43:41 -06:00
Craig Topper	79cbb003c5	[RISCV] Don't use tail agnostic policy on instructions where destination is tied to source If the destination is tied, then the user has some control of the register used for input. They would have the ability to control the value of any tail elements. By using tail agnostic we take this option away from them. It's not clear that the intrinsics are defined such that this isn't supposed to work. And undisturbed is a valid implementation for agnostic so code wouldn't even fail to work on all systems if we always used agnostic. The vcompress intrinsic is defined to require tail undisturbed so at minimum we need this for that instruction or need to redefine the intrinsic. I've made an exception here for vmv.s.x/fmv.s.f and reduction instructions which only write to element 0 regardless of the tail policy. This allows us to keep the agnostic policy on those which should allow better redundant vsetvli removal. An enhancement would be to check for undef input and keep the agnostic policy, but we don't have good test coverage for that yet. Reviewed By: khchen Differential Revision: https://reviews.llvm.org/D93878	2020-12-29 10:37:58 -08:00
Arthur Eubanks	7ecbe0c7a0	[NewPM][AMDGPU] Port amdgpu-lower-kernel-attributes And add it to the AMDGPU opt pipeline. This is a function pass instead of a module pass (like the legacy pass) because it's getting added to a CGSCCPassManager, and you can't put a module pass in a CGSCCPassManager. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D93885	2020-12-29 10:26:06 -08:00
Florian Hahn	b980bed34b	[MSSAUpdater] Skip renaming when inserting def in unreachable block. This fixes an updater crash when moving memory defs between unreachable blocks. Fixes PR48616.	2020-12-29 18:22:12 +00:00
Craig Topper	2ae760e27e	[RISCV] Add earlyclobber of destination register to vmsbf.m/vmsif.m/vmsof.m instructions The spec for these instructions includes this note. "The destination register cannot overlap either the source register or the mask register ('v0') if the instruction is masked." So we need earlyclobber to enforce this constraint. I've regenerated the tests with update_llc_test_checks.py to show the effects of the earlyclobber. Reviewed By: khchen, frasercrmck Differential Revision: https://reviews.llvm.org/D93867	2020-12-29 10:00:04 -08:00
Michael Stapelberg	c1e85b6c1b	sanitizer: fix typo/spelling: Dissassemble → Disassemble Differential Revision: https://reviews.llvm.org/D93902	2020-12-29 12:26:57 -05:00
Thomas Raoux	cf216670a0	[mlir][linalg] Add vectorization for linalg on tensor ops Support vectorization of linalg ops using tensor inputs/outputs. Differential Revision: https://reviews.llvm.org/D93890	2020-12-29 09:02:23 -08:00
Nicolai Hähnle	b76014a4f1	RegionInfo: use a range-based for loop [NFCI] Change-Id: I9985d72191a2b0680195032acf8a14ad2ba954ed Differential Revision: https://reviews.llvm.org/D92932	2020-12-29 16:00:26 +01:00
Roman Lebedev	374ef57f13	[InstCombine] 'hoist xor-by-constant from xor-by-value': completely give up on constant exprs As Mikael Holmén is noting in the post-commit review for the first fix https://reviews.llvm.org/rGd4ccef38d0bb#967466 not hoisting constantexprs is not enough, because if the xor originally was a constantexpr (i.e. X is a constantexpr), `SimplifyAssociativeOrCommutative()` in `visitXor()` will immediately undo this transform, thus again causing an infinite combine loop. This transform has resulted in a surprising number of constantexpr failures.	2020-12-29 16:28:18 +03:00
Nemanja Ivanovic	7486de1b2e	[PowerPC] Provide patterns for permuted scalar to vector for pre-P8 We will emit these permuted nodes on all VSX little endian subtargets but don't have the patterns available to match them on subtargets that don't have direct moves. Fixes: https://bugs.llvm.org/show_bug.cgi?id=47916	2020-12-29 06:49:25 -06:00
Fraser Cormack	aebb4a6052	[RISCV] Rewrite and simplify helper function. NFC. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D93851	2020-12-29 11:29:44 +00:00
Nemanja Ivanovic	0a19fc3088	[PowerPC] Disable CTR loops containing operations on half-precision On subtargets prior to Power9, conversions to/from half precision are lowered to libcalls. This makes loops containing such operations invalid candidates for HW loops. Fixes: https://bugs.llvm.org/show_bug.cgi?id=48519	2020-12-29 05:12:50 -06:00
Mark Murray	5abfeccf10	[ARM][AArch64] Add Cortex-A78C Support for Clang and LLVM This patch upstreams support for the Armv8-a Cortex-A78C processor for AArch64 and ARM. In detail: Adding cortex-a78c as cpu option for aarch64 and arm targets in clang Adding Cortex-A78C CPU name and ProcessorModel in llvm Details of the CPU can be found here: https://www.arm.com/products/silicon-ip-cpu/cortex-a/cortex-a78c	2020-12-29 10:18:59 +00:00
Pavel Labath	a0b68a2925	[lldb] Deduplicate some tests in TestLldbGdbServer Use the new gdb-remote test case factory to generate debugserver and llgs variants, handling the simple cases first.	2020-12-29 09:37:01 +01:00
Juneyoung Lee	ae6e89327b	Precommit tests that have poison as shufflevector's placeholder This commit copies existing tests at llvm/Transforms containing 'shufflevector X, undef' and replaces them with 'shufflevector X, poison'. The new copied tests have the -inseltpoison.ll suffix in their file names (as `db7a2f347f` did). See https://reviews.llvm.org/D93793 Test files listed using: grep -R -E "^[^;]*shufflevector <.*> .*, <.*> undef" | cut -d":" -f1 | uniq Test files copied & updated using: file_org=llvm/test/Transforms/$1 if [[ "$file_org" = *-inseltpoison.ll ]]; then file=$file_org else file=${file_org%.ll}-inseltpoison.ll if [ ! -f $file ]; then cp $file_org $file fi fi sed -i -E 's/^([^;]*)shufflevector <(.*)> (.*), <(.*)> undef/\1shufflevector <\2> \3, <\4> poison/g' $file head -1 $file | grep "Assertions have been autogenerated by utils/update_test_checks.py" -q if [ "$?" == 1 ]; then echo "$file : should be manually updated" # The test is manually updated exit 1 fi python3 ./llvm/utils/update_test_checks.py --opt-binary=./build-releaseassert/bin/opt $file	2020-12-29 17:09:31 +09:00
Georgii Rymar	ed146d6291	[LLD][ELF] - Use LLVM_ELF_IMPORT_TYPES_ELFT instead of multiple types definitions. NFCI. We can reduce the number of "using" declarations. `LLVM_ELF_IMPORT_TYPES_ELFT` was extended in D93801. Differential revision: https://reviews.llvm.org/D93856	2020-12-29 10:50:07 +03:00
Pavel Labath	53f80d6b3a	[lldb] Fix logging in lldb-server tests	2020-12-29 08:33:12 +01:00
Fangrui Song	a8970dff1a	[ubsan][test] Flush stdout before checking interleaved stdout/stderr Detected by musl.	2020-12-28 20:30:32 -08:00
Arthur Eubanks	c5d100fdf2	[test] Fix conditional-temporaries.cpp Broken by https://reviews.llvm.org/D93880. (but now the test is much better :) )	2020-12-28 20:17:31 -08:00
Arthur Eubanks	c2ef06d3dd	[NewPM] Port infer-address-spaces And add it to the AMDGPU opt pipeline. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D93880	2020-12-28 19:58:12 -08:00
Fangrui Song	55d13e6a86	[asan][test] Annotate glibc specific tests with REQUIRES: glibc-2.27	2020-12-28 19:56:08 -08:00
Kazu Hirata	2883cd98f3	[CFGPrinter] Use succ_empty (NFC)	2020-12-28 19:55:20 -08:00
Kazu Hirata	5d2529f28f	[Scalar] Construct SmallVector with iterator ranges (NFC)	2020-12-28 19:55:18 -08:00
Kazu Hirata	1e3ed09165	[CodeGen] Use llvm::append_range (NFC)	2020-12-28 19:55:16 -08:00
Andrew Litteken	1e23802507	[IROutliner] Merging identical output blocks for extracted functions. Many of the sets of output stores will be the same. When a block is created, we check if there is an output block with the same set of store instructions. If there is, we map the output block of the region back to the block, so that the extra argument controlling the switch statement can be set to the appropriate block value. Tests: - llvm/test/Transforms/IROutliner/outlining-same-output-blocks.ll Reviewers: jroelofs, paquette Differential Revision: https://reviews.llvm.org/D87298	2020-12-28 21:01:48 -06:00
Zakk Chen	f3f9ce3b79	[RISCV] Define vmclr.m/vmset.m intrinsics. Define vmclr.m/vmset.m intrinsics and lower to vmxor.mm/vmxnor.mm. Ideally all rvv pseudo instructions could be implemented in a C header, but those two instructions don't take an input, so codegen cannot guarantee that the source register becomes the same as the destination. We expand pseudo-v-inst into corresponding v-inst in RISCVExpandPseudoInsts pass. Reviewed By: craig.topper, frasercrmck Differential Revision: https://reviews.llvm.org/D93849	2020-12-28 18:57:17 -08:00
Juneyoung Lee	8b67c98c47	[UpdateTestChecks] Fix update_analyze_test_checks.py failure	2020-12-29 11:56:59 +09:00
Nemanja Ivanovic	4f568fbd21	[PowerPC] Do not emit HW loop when TLS var accessed in PHI of loop exit If any PHI nodes in loop exit blocks have incoming values from the loop that are accesses of TLS variables with local dynamic or general dynamic TLS model, the address will be computed inside the loop. Since this includes a call to __tls_get_addr, this will in turn cause the CTR loops verifier to complain. Disable CTR loops in such cases. Fixes: https://bugs.llvm.org/show_bug.cgi?id=48527	2020-12-28 20:36:16 -06:00
Arthur Eubanks	0e9abcfc19	[AMDGPU][NewPM] Port amdgpu-promote-alloca(-to-vector) And add to AMDGPU opt pipeline. Don't pin an opt run to the legacy PM when -enable-new-pm=1 if these passes (or passes introduced in https://reviews.llvm.org/D93863) are in the list of passes. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D93875	2020-12-28 17:52:31 -08:00
Yuanfang Chen	94427af60c	Revert "[NewPM][CodeGen] Introduce CodeGenPassBuilder to help build codegen pipeline" This reverts commit `4646de5d75`. Some bots have build failure.	2020-12-28 17:44:22 -08:00
Yuanfang Chen	4646de5d75	[NewPM][CodeGen] Introduce CodeGenPassBuilder to help build codegen pipeline Following up on D67687. Please refer to the RFC here http://lists.llvm.org/pipermail/llvm-dev/2020-July/143309.html `CodeGenPassBuilder` is the NPM counterpart of `TargetPassConfig` with below differences. - Debugging features (MIR print/verify, disable pass, start/stop-before/after, etc.) living in `TargetPassConfig` are moved to use PassInstrument as much as possible. (Implementation also lives in `TargetPassConfig.cpp`) - `TargetPassConfig` is a polymorphic base (virtual inheritance) to build the target-dependent pipeline whereas `CodeGenPassBuilder` is the CRTP base/helper to implement the target-dependent pipeline. The motivation is flexibility for targets to customize the pipeline, inlining opportunity, and fits the overall NPM value semantics design. - `TargetPassConfig` is a legacy immutable pass to declare hooks for targets to customize some target-independent codegen layer behavior. This is partially ported to TargetMachine::options. The rest, such as `createMachineScheduler/createPostMachineScheduler`, are left out for now. They should be implemented in LLVMTargetMachine in the future. Reviewed By: arsenm, aeubanks Differential Revision: https://reviews.llvm.org/D83608	2020-12-28 17:36:36 -08:00
Tim Keith	f782d5ea86	[flang] Detect call to abstract interface A subroutine call or function reference to an abstract interface is not legal. Differential Revision: https://reviews.llvm.org/D93872	2020-12-28 16:36:34 -08:00
Arthur O'Dwyer	c0a2d3b90b	[libc++] Fix a test failure in `7b00e9fae3` (D93815). "LLVM Buildbot on libcxx-libcxxabi-libunwind-armv7-linux" is not happy with comparing `unsigned` and `int` [-Werror,-Wsign-compare].	2020-12-28 18:46:07 -05:00
James Y Knight	4ddf140c00	Fix PR35902: incorrect alignment used for ubsan check. UBSan was using the complete-object align rather than nv alignment when checking the "this" pointer of a method. Furthermore, CGF.CXXABIThisAlignment was also being set incorrectly, due to an incorrectly negated test. The latter doesn't appear to have had any impact, due to it not really being used anywhere. Differential Revision: https://reviews.llvm.org/D93072	2020-12-28 18:11:17 -05:00
Arthur Eubanks	85af1d6257	[test] Fix pr45360.ll under NPM The IR is the same under the NPM, but some basic block labels and value names are different.	2020-12-28 14:42:52 -08:00
Andrew Litteken	e6ae623314	[IROutliner] Adding support for consolidating functions with different output arguments. Certain regions can have values introduced inside the region that are used outside of the region. These may not be the same for each similar region, so we must create one overarching set of arguments for the consolidated function. We do this by iterating over the outputs for each extracted function, and creating as many different arguments as needed to encapsulate the different output sets. For each output set, we create a different block with the necessary stores from the value to the output register. There is then one switch statement, controlled by an argument to the function, to differentiate which block to use. Changed Tests for consistency: llvm/test/Transforms/IROutliner/extraction.ll llvm/test/Transforms/IROutliner/illegal-assumes.ll llvm/test/Transforms/IROutliner/illegal-memcpy.ll llvm/test/Transforms/IROutliner/illegal-memmove.ll llvm/test/Transforms/IROutliner/illegal-vaarg.ll Tests to test new functionality: llvm/test/Transforms/IROutliner/outlining-different-output-blocks.ll llvm/test/Transforms/IROutliner/outlining-remapped-outputs.ll llvm/test/Transforms/IROutliner/outlining-same-output-blocks.ll Reviewers: jroelofs, paquette Differential Revision: https://reviews.llvm.org/D87296	2020-12-28 16:17:07 -06:00
Rahul Joshi	25007b4d7e	[MLIR][NFC] Change FunctionLike::setAllArgAttrs/setAllResultAttrs to do a one-shot attribute update. - Change FunctionLike::setAllArgAttrs() and setAllResultAttrs() to rebuild the new list of function attributes locally and call setAttr() just once instead of calling setArgAttr()/setResultAttrs() for each argument which incrementally build the attribute dictionary and can end up creating a lot of unused DictionaryAttr's (which are uniqued and not garbage collected). Differential Revision: https://reviews.llvm.org/D93870	2020-12-28 14:15:47 -08:00
Thomas Lively	5e09e9979b	[WebAssembly] Prototype extending pairwise add instructions As proposed in https://github.com/WebAssembly/simd/pull/380. This commit makes the new instructions available only via clang builtins and LLVM intrinsics to make their use opt-in while they are still being evaluated for inclusion in the SIMD proposal. Depends on D93771. Differential Revision: https://reviews.llvm.org/D93775	2020-12-28 14:11:14 -08:00
Arthur Eubanks	6c36286a2e	[NewPM] Fix CGSCCOptimizerLateEPCallbacks place in pipeline CGSCCOptimizerLateEPCallbacks are supposed to be run before the function simplification pipeline, like in the legacy PM and as specified in the comments for registerCGSCCOptimizerLateEPCallback(). Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D93871	2020-12-28 14:03:10 -08:00
Thomas Lively	44ee14f993	[WebAssembly][NFC] Finish cleaning up SIMD tablegen This commit is a follow-on to c2c2e9119e73, using the `Vec` records introduced in that commit in the rest of the SIMD instruction definitions. Also removes unnecessary types in output patterns. Differential Revision: https://reviews.llvm.org/D93771	2020-12-28 13:59:23 -08:00

375779 Commits, All Branches