llvm-project

Commit Graph

Author	SHA1	Message	Date
Arthur Eubanks	b8f22f9d30	[NewPM][AMDGPU] Run InternalizePass when -amdgpu-internalize-symbols The legacy PM doesn't run EP_ModuleOptimizerEarly on -O0, so skip running it here when given O0. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D93886	2021-01-04 11:34:40 -08:00
Craig Topper	dc9ac0e820	[RISCV] Replace i32 with XLenVT in (add AddrFI, simm12) isel patterns. With the i32 these patterns will only fire on RV32, but they don't look RV32 specific. Reviewed By: lenary Differential Revision: https://reviews.llvm.org/D93843	2021-01-04 10:53:27 -08:00
Sanjay Patel	6976812129	[InstCombine] add tests for ashr+icmp; NFC	2021-01-04 13:35:07 -05:00
Matt Arsenault	d8938c8bb5	CodeGen: Use Register	2021-01-04 12:53:06 -05:00
Philip Reames	7c63aac7bd	Revert "[LoopDeletion] Break backedge of loops when known not taken" This reverts commit `dd6bb367d1`. Multi-stage builders are showing an assertion failure w/LCSSA not being preserved on entry to IndVars. Reason isn't clear, reverting while investigating.	2021-01-04 09:50:47 -08:00
Philip Reames	dd6bb367d1	[LoopDeletion] Break backedge of loops when known not taken The basic idea is that if SCEV can prove the backedge isn't taken, we can go ahead and get rid of the backedge (and thus the loop) while leaving the rest of the control in place. This nicely handles cases with dispatch between multiple exits and internal side effects. Differential Revision: https://reviews.llvm.org/D93906	2021-01-04 09:19:29 -08:00
Joseph Huber	fe5d51a489	[OpenMP] Add using bit flags to select Libomptarget Information Summary: This patch adds more fine-grained support over which information is output from the libomptarget runtime when run with the environment variable LIBOMPTARGET_INFO set. An extensible set of flags can be used to pick and choose which information the user is interested in. Reviewers: jdoerfert JonChesterfield grokos Differential Revision: https://reviews.llvm.org/D93727	2021-01-04 12:03:15 -05:00
Simon Pilgrim	f7463ca3cc	[ProfileData] GCOVFile::readGCNO - silence undefined pointer warning. NFCI. Silence clang static analyzer warning that 'fn' could still be in an undefined state - this shouldn't happen depending on the likely tag order, but the analyzer can't know that.	2021-01-04 16:50:05 +00:00
Jon Chesterfield	76bfbb74d3	[libomptarget][amdgpu] Call into deviceRTL instead of ockl [libomptarget][amdgpu] Call into deviceRTL instead of ockl Amdgpu codegen presently emits a call into ockl. The same functionality is already present in the deviceRTL. Adds an amdgpu specific entry point to avoid the dependency. This lets simple openmp code (specifically, that which doesn't use libm) run without rocm device libraries installed. Reviewed By: ronlieb Differential Revision: https://reviews.llvm.org/D93356	2021-01-04 16:48:47 +00:00
Krzysztof Parzyszek	c55b609b77	[Hexagon] Fix bad SDNodeXForm Fixes https://llvm.org/PR48651	2021-01-04 10:43:01 -06:00
Florian Hahn	c367258b5c	[SimplifyCFG] Enabled hoisting late in LTO pipeline. `bb7d3af113` disabled hoisting in SimplifyCFG by default, but enabled it late in the pipeline. But it appears as if the LTO pipelines got missed. This patch adjusts the LTO pipelines to also enable hoisting in the later stages. Unfortunately there's no easy way to add a test for the change I think. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D93684	2021-01-04 16:26:58 +00:00
Michael Munday	e2d3d501ef	[RISCV][NFC] Add additional cmov tests One or more cmov instructions could be generated for these functions when the Zbt extension is present. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93768	2021-01-04 16:01:40 +00:00
Florian Hahn	ed936aad78	[InterleavedAccess] Return correct 'modified' status. Both tryReplaceExtracts and replaceBinOpShuffles may modify the IR, even if no interleaved loads are generated, but currently the pass pretends no changes were made. This patch updates the pass to return true if either of the functions made any changes. In case of tryReplaceExtracts, changes are made if there are any Extracts and true is returned. `replaceBinOpShuffles` always makes changes if BinOpShuffles is not empty. It also always returned true, so I went ahead and change it to just `replaceBinOpShuffles`. Fixes PR48208. Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D93997	2021-01-04 15:49:47 +00:00
Simon Pilgrim	e9f401d8a2	[IR] CallBase::getBundleOpInfoForOperand - ensure Current iterator is defined. NFCI. Fix clang static analyzer undefined pointer warning in the case Begin == End.	2021-01-04 15:30:15 +00:00
Simon Pilgrim	9f8c0d15c7	DeclCXX - Fix getAs<> null-dereference static analyzer warnings. NFCI. getAs<> can return null if the cast is invalid, which can lead to null pointer deferences. Use castAs<> instead which will assert that the cast is valid.	2021-01-04 15:12:55 +00:00
Hansang Bae	82a29a62ab	[OpenMP] Add definition/interface for target memory routines The change includes new routines introduced in 5.1 and Fortran interface. Differential Revision: https://reviews.llvm.org/D93505	2021-01-04 08:12:57 -06:00
Simon Pilgrim	4d7cb6da9f	[Sparc] SparcMCExpr::printVariantKind - fix Wcovered-switch-default gcc warning. NFCI.	2021-01-04 14:08:44 +00:00
Caroline Concatto	060cfd9795	[AArch64][SVE]Add cost model for masked gather and scatter for scalable vector. A new TTI interface has been added 'Optional <unsigned>getMaxVScale' that returns the maximum vscale for a given target. When known getMaxVScale is used to compute the cost of masked gather scatter for scalable vector. Depends on D92094 Differential Revision: https://reviews.llvm.org/D93030	2021-01-04 13:59:58 +00:00
Florian Hahn	d38a0258a5	[AArch64] Add patterns for FMCLA*_indexed. This patch adds patterns for the indexed variants of FCMLA. Mostly based on a patch by Tim Northover. Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D92947	2021-01-04 13:45:51 +00:00
Simon Pilgrim	23b4198652	[Support] Add KnownBits::icmp helpers. Check if all possible values for a pair of knownbits give the same icmp result - these are based off the checks performed in InstCombineCompares.cpp and D86578. Add exhaustive unit test coverage - a followup will update InstCombineCompares.cpp to use this.	2021-01-04 12:46:27 +00:00
David Green	901cc9b6f3	[ARM] Extend lowering for i64 reductions The lowering of a <4 x i16> or <4 x i8> vecreduce.add into an i64 would previously be expanded, due to the i64 not being legal. This patch adjusts our reduction matchers, making it produce a VADDLV(sext A to v4i32) instead. Differential Revision: https://reviews.llvm.org/D93622	2021-01-04 12:44:43 +00:00
LemonBoy	42652c1d6e	[Sparc] Fixes for the internal assembler * Prevent the generation of invalid shift instructions by constraining the immediate field. I've limited the shift field to constant values only, adding the `R_SPARC_5`/`R_SPARC_6` relocations is trivial if needed (but I can't really think of a use case for those). * Fix the generation of PC-relative `call` * Fix the transformation of `jmp sym` into `jmpl` * Emit fixups for simm13 operands I moved the choice of the correct relocation into the code emitter as I've seen the other backends do, it can be definitely cleaner but the aim was to reduce the scope of the patch as much as possible. Fixes the problems raised by joerg in L254199 Reviewed By: dcederman Differential Revision: https://reviews.llvm.org/D78193	2021-01-04 13:25:37 +01:00
Chih-Ping Chen	975b64b293	[docs] Release notes for IsDecl in DIModule. Please see https://reviews.llvm.org/D93462 for the actual code change. Differential Revision: https://reviews.llvm.org/D93558	2021-01-04 07:03:34 -05:00
David Green	6c89f6fae4	[AArch64] Attempt to fix Mac tests with a more specific triple. NFC	2021-01-04 11:29:18 +00:00
Kazushi (Jam) Marukawa	c287f90ccd	[VE] Change default CPU name to "generic" Change default CPU name of SX-Aurora VE from "ve" to "generic" similar to other architectures. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93836	2021-01-04 20:09:57 +09:00
Usman Nadeem	685c8b537a	[AARCH64] Improve accumulator forwarding for Cortex-A57 model The old CPU model only had MLA->MLA forwarding. I added some missing MUL->MLA read advances and a missing absolute diff accumulator read advance according to the Cortex A57 Software Optimization Guide. The patch improves performance in EEMBC rgbyiqv2 by about 6%-7% and spec2006/milc by 8% (repeated runs on multiple devices), causes no significant regressions (none in SPEC). Differential Revision: https://reviews.llvm.org/D92296	2021-01-04 10:58:43 +00:00
Florian Hahn	e0905553b4	[ArgPromotion] Delay dead GEP removal until doPromotion. Currently ArgPromotion removes dead GEPs as part of the legality check in isSafeToPromoteArgument. If no promotion happens, this means the pass claims no modifications happened, even though GEPs were removed. This patch fixes the issue by delaying removal of dead GEPs until doPromotion: isSafeToPromoteArgument can simply skips dead GEPs and the code in doPromotion dealing with GEPs is updated to account for dead GEPs. Once we committed to promotion, it should be safe to remove dead GEPs. Alternatively isSafeToPromoteArgument could return an additional boolean to indicate whether it made changes, but this is quite cumbersome and there should be no real benefit of weeding out some dead GEPs here if we do not perform promotion. I added a test for the case where dead GEPs need to be removed when promotion happens in `578c5a0c6e`. Fixes PR47477. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D93991	2021-01-04 09:51:20 +00:00
Yang Fan	e43b3d1f5e	Revert "[Sema] Fix deleted function problem in implicitly movable test" This reverts commit `89b0972a`	2021-01-04 17:21:19 +08:00
Kazushi (Jam) Marukawa	74e7cb26b9	[VE] Remove VA.needsCustom checks Remove VA.needsCustom checks which are copied from Sparc implementation at the very beginning of VE implementation. Add assert to sanity-check VA.needsCustom flag, also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93847	2021-01-04 18:19:18 +09:00
David Sherwood	a65092040a	[SVE] Fix inline assembly parsing crash This patch fixes a crash encountered when compiling this code: ... float16_t a; __asm__("fminv %h[a], %[b], %[c].h" : [a] "=r" (a) : [b] "Upl" (b), [c] "w" (c)) The issue here is when using the 'h' modifier for a register constraint 'r'. Differential Revision: https://reviews.llvm.org/D93537	2021-01-04 09:11:05 +00:00
Craig Topper	94257d12cb	[RISCV] Remove unused method isUImm5NonZero() from RISCVAsmParser.cpp. NFC The operand predicate that used this has been gone for a while.	2021-01-04 00:17:39 -08:00
AnZhong Huang	09b3f3f22c	[benchmark] Fixed a build error when using CMake 3.15.1 + NDK-R20 std::decay_t used by llvm/utils/benchmark/include/benchmark/benchmark.h is a c++14 feature, but the CMakelist uses c++11, it's the root-cause of build error. There are two options to fix the error. 1) change the CMakelist to support c++14. 2) change std::decay_t to std::decay, it's what the patch done. This bug can only be reproduced by CMake 3.15, we didn't observer the bug with CMake 3.16. But based on the code's logic, it's an obvious bug of LLVM. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D93794	2021-01-04 11:00:57 +03:00
Kai Luo	f6515b0520	[PowerPC] Do not fold `cmp(d\|w)` and `subf` instruction to `subf.` if `nsw` is not present In `PPCInstrInfo::optimizeCompareInstr` we seek opportunities to fold `cmp(d\|w)` and `subf` as an `subf.`. However, if `subf.` gets overflow, `cr0` can't reflect the correct order, violating the semantics of `cmp(d\|w)`. Fixed https://bugs.llvm.org/show_bug.cgi?id=47830. Reviewed By: #powerpc, nemanjai Differential Revision: https://reviews.llvm.org/D90156	2021-01-04 07:54:15 +00:00
Andrew Litteken	5c951623bc	[IROutliner] Refactoring errors in the cost model from past patches. There were was the reuse of a variable that should not have been occurred due to confusion during committing patches.	2021-01-04 00:11:18 -06:00
Andrew Litteken	05e6ac4eb8	[IROutliner] Removing a duplicate addition, causing overestimates in IROutliner. There was an extra addition left over from a previous commit for the cost model, this removes it.	2021-01-03 23:36:28 -06:00
sameeran joshi	6280bc1cc3	[Flang][openmp][5.0] Add task_reduction clause. See OMP-5.0 2.19.5.5 task_reduction Clause. To add a positive test case we need `taskgroup` directive which is not added hence skipping the test. This is a dependency for `taskgroup` construct. Reviewed By: clementval Differential Revision: https://reviews.llvm.org/D93105 Co-authored-by: Valentin Clement <clementval@gmail.com>	2021-01-04 08:48:11 +05:30
Jan Kratochvil	6988f7a6f4	[compiler-rt] [Sanitizers] Extend ThreadDescriptorSize() for glibc-2.32-2.fc33.x86_64+i686 before: $ echo 'int main(){}'\|clang -g -fsanitize=leak -x c++ -;./a.out Tracer caught signal 11: addr=0x7f4f73da5f40 pc=0x4222c8 sp=0x7f4f72cffd40 ==1164171==LeakSanitizer has encountered a fatal error. ==1164171==HINT: For debugging, try setting environment variable LSAN_OPTIONS=verbosity=1:log_threads=1 ==1164171==HINT: LeakSanitizer does not work under ptrace (strace, gdb, etc) $ _ after: $ echo 'int main(){}'\|clang -g -fsanitize=leak -x c++ -;./a.out) $ _ I haven't verified the size cannot be affected by Fedora patches of upstream glibc-2.32 - but I do not expect upstream glibc-2.32 would have the last sizes `(1216, 2304)` from 2013 around glibc-2.12. Differential Revision: https://reviews.llvm.org/D93386	2021-01-04 03:21:04 +01:00
Nathan James	59810c51e7	[clang-tidy] Fix windows tests Attempt to fix the 2 failing tests identifier in 48646. Appears that python3 doesn't like nested double quotes in single quoted strings, hopefully nested single quotes in double quoted strings is a-ok. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D93979	2021-01-04 00:39:34 +00:00
Roman Lebedev	98cd1c33e3	[NFC][SimplifyCFG] Hoist 'original' DomTree verification from simplifyOnce() into run() This is NFC since SimplifyCFG still currently defaults to not preserving DomTree. SimplifyCFGOpt::simplifyOnce() is only be called from SimplifyCFGOpt::run(), and can not be called externally, since SimplifyCFGOpt is defined in .cpp This avoids some needless verifications, and is thus a bit faster without sacrificing precision.	2021-01-04 01:02:02 +03:00
Roman Lebedev	a7684940f0	[SimplifyCFG] SimplifyTerminatorOnSelect(): fix/tune DomTree updates We only need to remove non-TrueBB/non-FalseBB successors, and we only need to do that once. We don't need to insert any new edges, because no new successors will be added.	2021-01-04 01:02:02 +03:00
Roman Lebedev	70935b9595	[NFC][SimplifyCFG] SimplifyTerminatorOnSelect(): pull out OldTerm->getParent() into a variable	2021-01-04 01:02:02 +03:00
Roman Lebedev	4fc908025f	[NFC][SimplifyCFG] Add a test where we fail to preserve DomTree validity	2021-01-04 01:02:01 +03:00
Nikita Popov	3715c99be9	[InstSimplify] Fold nnan/ninf violation to poison As the comment already indicates, performing an operation with nnan/ninf flags on a nan/inf or undef results in poison. Now that we have a proper poison value, we no longer need to relax it to undef.	2021-01-03 22:05:40 +01:00
Florian Hahn	edb52c626b	[LoopUnswitch] Precommit initial partial unswitching test cases.	2021-01-03 20:34:32 +00:00
Nikita Popov	766cf7f32e	[InstSimplify] Fold division by zero to poison Div/rem by zero is immediate undefined behavior and anything goes. Currently we fold it to undef, this patch changes it to fold to poison instead, which is slightly stronger. Differential Revision: https://reviews.llvm.org/D93995	2021-01-03 20:52:45 +01:00
Kazu Hirata	ba82c0b315	[llvm] Call *(Set\|Map)::erase directly (NFC) We can erase an item in a set or map without checking its membership first.	2021-01-03 09:57:47 -08:00
Kazu Hirata	0e219b6443	[Target] Construct SmallVector with iterator ranges (NFC)	2021-01-03 09:57:45 -08:00
Kazu Hirata	985f899bf2	[Target] Use llvm::append_range (NFC)	2021-01-03 09:57:43 -08:00
Nikita Popov	f094d65bea	[InstSimplify] Fix addo/subo with undef (PR43188) We can't fold the first result to undef, because not all values may be reachable under the constraint that no overflow occurred. Use the same folds we do for saturated math instead. Proofs: uaddo: https://alive2.llvm.org/ce/z/zf55N_ saddo: https://alive2.llvm.org/ce/z/a_xPgS usubo: https://alive2.llvm.org/ce/z/DmRqwt ssubo: https://alive2.llvm.org/ce/z/8ag7U-	2021-01-03 18:51:49 +01:00
Nikita Popov	c6ad00d709	[InstSimplify] Return poison for out of bounds extractelement This is the same change as D93990, but for extractelement rather than insertelement. > If idx exceeds the length of val for a fixed-length vector, the > result is a poison value. For a scalable vector, if the value of > idx exceeds the runtime length of the vector, the result is a > poison value.	2021-01-03 18:15:58 +01:00

1 2 3 4 5 ...

376105 Commits All Branches Search

376105 Commits

All Branches