llvm-project

Commit Graph

Author	SHA1	Message	Date
Adrian Prantl	035217ff51	Allow signposts to take advantage of deferred string substitution One nice feature of the os_signpost API is that format string substitutions happen in the consumer, not the logging application. LLVM's current Signpost class doesn't take advantage of this though and instead always uses a static "Begin/End %s" format string. This patch uses variadic macros to allow the API to be used as intended. Unfortunately, the primary use-case I had in mind (the LLDB_SCOPED_TIMER() macro) does not get much better from this, because __PRETTY_FUNCTION__ is not a macro, but a static string, so signposts created by LLDB_SCOPED_TIMER() still use a static "%s" format string. At least LLDB_SCOPED_TIMERF() works as intended. This reapplies the previously reverted patch with additional include order fixes for non-modular builds of LLDB. Differential Revision: https://reviews.llvm.org/D103575	2021-06-14 16:53:41 -07:00
Huihui Zhang	1c096bf09f	[SVE][LSR] Teach LSR to enable simple scaled-index addressing mode generation for SVE. Currently, loop strength reduction does not handle loops with scalable stride very well. Take a loop vectorized with scalable vector type <vscale x 8 x i16>, for instance (refer to test/CodeGen/AArch64/sve-lsr-scaled-index-addressing-mode.ll added). Memory accesses are incremented by "16*vscale", while the induction variable is incremented by "8*vscale". The scaling factor "2" needs to be extracted to build the candidate formula, i.e., "reg(%in) + 2*reg({0,+,(8 * %vscale)})", so that the addrec register reg({0,+,(8*vscale)}) can be reused among Address and ICmpZero LSRUses to enable optimal solution selection. This patch allows LSR getExactSDiv to recognize special cases like "C1*X*Y /s C2*X*Y", and pull out "C1 /s C2" as the scaling factor whenever possible. Without this change, LSR is missing the candidate formula with the proper scaling factor to leverage the target scaled-index addressing mode. Note: This patch doesn't fully fix AArch64 isLegalAddressingMode for scalable vectors, but allows simple valid scales to pass through. Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D103939	2021-06-14 16:42:34 -07:00
Adrian Prantl	7a7c00761f	Revert "Allow signposts to take advantage of deferred string substitution" This reverts commit `03841edde7`. Unfortunately this still breaks the LLDB standalone bot.	2021-06-14 16:09:04 -07:00
Krzysztof Parzyszek	0577f4b178	[Hexagon] Add HVX and control register names to Hexagon target	2021-06-14 17:14:37 -05:00
Siva Chandra Reddy	a58b2827fe	[libc] Add hardware implementations of x86_64 sqrt functions.	2021-06-14 21:25:37 +00:00
Matt Morehouse	b87894a1d2	[HWASan] Enable globals support for LAM. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D104265	2021-06-14 14:20:44 -07:00
Adrian Prantl	03841edde7	Allow signposts to take advantage of deferred string substitution One nice feature of the os_signpost API is that format string substitutions happen in the consumer, not the logging application. LLVM's current Signpost class doesn't take advantage of this though and instead always uses a static "Begin/End %s" format string. This patch uses variadic macros to allow the API to be used as intended. Unfortunately, the primary use-case I had in mind (the LLDB_SCOPED_TIMER() macro) does not get much better from this, because __PRETTY_FUNCTION__ is not a macro, but a static string, so signposts created by LLDB_SCOPED_TIMER() still use a static "%s" format string. At least LLDB_SCOPED_TIMERF() works as intended. This reapplies the previously reverted patch with additional MachO.h macro #undefs. Differential Revision: https://reviews.llvm.org/D103575	2021-06-14 14:19:41 -07:00
George Balatsouras	98504959a6	[dfsan] Add stack-trace printing functions to dfsan interface Reviewed By: stephan.yichao.zhao Differential Revision: https://reviews.llvm.org/D104165	2021-06-14 14:09:00 -07:00
Roman Lebedev	585e65d330	[TLI] SimplifyDemandedVectorElts(): handle SCALAR_TO_VECTOR(EXTRACT_VECTOR_ELT(?, 0)) Iff we have `SCALAR_TO_VECTOR` (and we demand it's only defined 0'th element), and said scalar was produced by `EXTRACT_VECTOR_ELT` from the 0'th element of some vector, then we can just continue traversal into said source vector. This comes up in X86 vector uniform shift lowering. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D104250	2021-06-14 23:52:53 +03:00
Aaron Ballman	00dbf8c832	Adding some of the documents for C11. This is not the complete set of language-related documents for C11, but is about 75% complete.	2021-06-14 16:43:44 -04:00
Hanhan Wang	e3bc4dbe8e	[mlir][Linalg] Make printer/parser have the same behavior. The parser of generic op did not recognize the output from mlir-opt when there are multiple outputs. One would wrap the result types with braces, and one would not. The patch makes the behavior the same. Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D104256	2021-06-14 13:38:30 -07:00
Piotr Sobczak	e0c382a9d5	[AMDGPU] Limit runs of fixLdsBranchVmemWARHazard The code in fixLdsBranchVmemWARHazard looks for patterns of a vmem/lds access followed by a branch, followed by an lds/vmem access. The handling of the hazard requires an arbitrary number of instructions to process. In the worst case where a function has a vmem access, but no lds accesses, all instructions are examined only to conclude that the hazard cannot occur. Add a pre-processing stage which detects if both lds and vmem are present in the function and only then does the more costly search. This patch significantly improves compilation time in cases where the hazard cannot happen. In one pathological case I looked at, IsHazardInst is needlessly called 88.6 million times. The numbers could also be improved by introducing a map around the inner calls to ::getWaitStatesSince in fixLdsBranchVmemWARHazard, but nothing will beat not running fixLdsBranchVmemWARHazard at all in the cases detected by shouldRunLdsBranchVmemWARHazardFixup(). Differential Revision: https://reviews.llvm.org/D104219	2021-06-14 22:30:23 +02:00
Xing Xue	ecb68f1c8b	[libc++abi] NFC: avoid a -Wunused-parameter warning Summary: A -Wunused-parameter warning was introduced by patch rG7f0244afa828 [libc++abi] NFC: adding a new parameter base to functions for calculating… (authored by xingxue). The unused parameter base will be used in a follow-on patch D101298. This patch is to avoid the warning before D101298 is landed. Reviewers: ldionne, sfertile, compnerd, libc++abi Reviewed by: ldionne Differential Revision: https://reviews.llvm.org/D104235	2021-06-14 16:04:02 -04:00
Louis Dionne	d9d20802d0	[libc++] Clean up scripts to setup CI on macOS	2021-06-14 15:55:36 -04:00
Alexey Bataev	4e15560879	[OPENMP][C++20]Add support for CXXRewrittenBinaryOperator in ranged for loops. Added support for CXXRewrittenBinaryOperator as a condition in ranged for loops. This is a new kind of expression, need to extend support for C++20 constructs. It fixes PR49970: range-based for compilation fails for libstdc++ vector with -std=c++20. Differential Revision: https://reviews.llvm.org/D104240	2021-06-14 11:50:27 -07:00
Chris Lattner	a490ca8e01	[PassManager] Save compile time by not running the verifier unnecessarily. NFC This changes the pass manager to not rerun the verifier when a pass says it didn't change anything or after an OpToOpPassAdaptor, since neither of those cases need verification (and if the pass lied, then there will be much larger semantic problems than will be caught by the verifier). This maintains behavior in EXPENSIVE_CHECKS mode. Differential Revision: https://reviews.llvm.org/D104243	2021-06-14 11:43:52 -07:00
Arthur Eubanks	cc8d32ae7d	Move some code under NDEBUG from D103135	2021-06-14 11:39:12 -07:00
River Riddle	66e2708205	[mlir:Linalg] Populate LinalgOp patterns on LinalgDialect as opposed to each op Interface patterns are unique in that they get added to every operation that also implements that interface, given that they aren't tied to individual operations. When the same interface pattern gets added to multiple operations (such as the current behavior with Linalg), a reference to each of these patterns is added to every op (meaning that an operation will now have N references to effectively the same pattern). This revision fixes this problematic behavior in Linalg, and can bring upwards of a 25% reduction in compile time in Linalg based workloads. Differential Revision: https://reviews.llvm.org/D104160	2021-06-14 11:20:15 -07:00
Arthur Eubanks	75d3b46ad2	Remove accidentally added debugging code from D103135	2021-06-14 11:11:40 -07:00
Saleem Abdulrasool	8c8dbc1082	X86: pass swift_async context in R14 on Win64 Pass swift_async context in a callee-saved register rather than as a regular parameter. This is similar to the Swift `self` and `error` parameters.	2021-06-14 11:02:21 -07:00
Arthur Eubanks	0e31e22ed9	[docs][OpaquePtr] Shuffle around the transition plan section Emphasize that this is basically an attempt to remove ``PointerType::getElementType`` and ``Type::getPointerElementType()``. Add a couple more subtasks. Differential Revision: https://reviews.llvm.org/D104151	2021-06-14 10:59:41 -07:00
Vitaly Buka	d650ccf639	[NFC] Remove unused variable To fix 'set but not used' warning on sanitizer-x86_64-linux-android bot.	2021-06-14 10:57:26 -07:00
Arthur Eubanks	8c5a44901c	[OpaquePtr] Remove existing support for forward compatibility It assumes that PointerType will keep having an optional pointee type, but we'd like to remove the pointee type in PointerType at some point. I feel like the current implementation could be simplified anyway, although perhaps I'm underestimating the amount of work needed throughout BitcodeReader. We will still need a side table to keep track of pointee types. This will be reimplemented at some point. This is essentially a revert of `a4771e9d` (which doesn't look like it was reviewed anyway). Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D103135	2021-06-14 10:52:56 -07:00
Jez Ng	cc17bfe489	[lld-macho] Fix "shift exponent too large" UBSAN error UBSAN seems to have added this check somewhere along the way... This might also fix the PPC buildbot, which is failing on the same test	2021-06-14 13:47:25 -04:00
Jez Ng	e06b9ba485	[lld-macho] Reword comment for clarity	2021-06-14 13:47:25 -04:00
Alexey Bataev	44f197e94b	[OpenMP] Fix C-only clang assert on parsing use_allocator clause of target directive The parser code assumes building with C++ compiler and asserts when using clang (not clang++) on C file. I made the code dependent on input language. This shows up for amdgpu target. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D103899	2021-06-14 10:36:27 -07:00
wlei	863184dd69	[CSSPGO] Aggregation by the last K context frames for cold profiles This change provides the option to merge and aggregate cold context by the last k frames instead of context-less name. By default K = 1 means the context-less one. This is for better perf tuning. The more selective merging and trimming will rely on llvm-profgen's preinliner. Reviewed By: wenlei, hoy Differential Revision: https://reviews.llvm.org/D104131	2021-06-14 10:33:43 -07:00
Michael Benfield	20f7b5f3f9	[Clang] Test case for -Wunused-but-set-variable, warn for volatile. Differential Revision: https://reviews.llvm.org/D103623	2021-06-14 10:25:59 -07:00
Fraser Cormack	c75e454cb9	[RISCV] Transform unaligned RVV vector loads/stores to aligned ones This patch adds support for loading and storing unaligned vectors via an equivalently-sized i8 vector type, which has support in the RVV specification for byte-aligned access. This offers a more optimal path for handling of unaligned fixed-length vector accesses, which are currently scalarized. It also prevents crashing when `LegalizeDAG` sees an unaligned scalable-vector load/store operation. Future work could be to investigate loading/storing via the largest vector element type for the given alignment, in case that would be more optimal on hardware. For instance, a 4-byte-aligned nxv2i64 vector load could be loaded as nxv4i32 instead of as nxv16i8. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D104032	2021-06-14 18:12:18 +01:00
Asher Mancinelli	c58cf692f4	[flang] Move buffer runtime test to GTest Move buffer unit test from Runtime directory to RuntimeGtest directory and use GTest. Test coverage is only maintained. Differential Revision: https://reviews.llvm.org/D102335 Reviewed By: awarzynski, klausler	2021-06-14 10:13:32 -07:00
Chris Lattner	ce77039596	[Verifier] Parallelize verification and dom checking. NFC. This changes the outer verification loop to not recurse into IsolatedFromAbove operations - instead return them up to a place where a parallel for loop can process them all in parallel. This also changes Dominance checking to happen on IsolatedFromAbove chunks of the region tree, which makes it easy to fold operation and dominance verification into a single simple parallel regime. This speeds up firtool in CIRCT from ~40s to 31s on a large testcase in -verify-each mode (the default). The .fir parser and module passes in particular benefit from this - FModule passes (roughly analogous to function passes) were already running the verifier in parallel as part of the pass manager. This allows the whole-module passes to verify their enclosed functions / FModules in parallel. -verify-each mode is still faster (26.3s on the same testcase), but we do expect the verifier to take some time. Differential Revision: https://reviews.llvm.org/D104207	2021-06-14 10:03:07 -07:00
Sanjay Patel	8591640379	[InstCombine] add DeMorgan folds for logical ops in select form We canonicalized to these select patterns (poison-safe logic) with D101191, so we need to reduce 'not' ops when possible as we would with 'and'/'or' instructions. This is shown in a secondary example in: https://llvm.org/PR50389 https://alive2.llvm.org/ce/z/BvsESh	2021-06-14 12:54:35 -04:00
Sanjay Patel	56ae4f23b2	[InstCombine] add tests for logical and/or with not ops; NFC	2021-06-14 12:54:35 -04:00
Florian Hahn	ee9bb258bb	[LoopDeletion] Add test with irreducible control flow in loop. Currently the irreducible cycles in the loops are ignored. The irreducible cycle may loop infinitely in irreducible_subloop_no_mustprogress, which is allowed and the loop should not be removed. Discussed in D103382.	2021-06-14 17:42:32 +01:00
Christian Sigg	abe501f240	[mlir] Mark gpu dialect illegal in gpu-to-llvm conversion Reviewed By: herhut, bondhugula Differential Revision: https://reviews.llvm.org/D104208	2021-06-14 17:45:44 +02:00
Florian Hahn	96ca03493a	[VectorCombine] Limit scalarization to non-poison indices for now. As Eli mentioned post-commit in D103378, the result of the freeze may still be out-of-range according to Alive2. So for now, just limit the transform to indices that are non-poison.	2021-06-14 16:40:14 +01:00
Saleem Abdulrasool	5b5833b9e0	SelectionDAG: repair the Windows build `6e5628354e` regressed the Windows build as the return type no longer matched in both branches for the return value type deduction. This uses a bit more compiler magic to deal with that.	2021-06-14 08:25:36 -07:00
zhijian	7ed515d168	[AIX][XCOFF] emit vector info of traceback table. Summary: emit vector info of traceback table. Reviewers: Jason Liu,Hubert Tong Differential Revision: https://reviews.llvm.org/D93659	2021-06-14 11:15:22 -04:00
Florian Hahn	d767d1dd2c	[ADT] Use unnamed argument for unused arg in StringMapEntryStorage. This silences an 'unused argument' warning. Similar to `c2006f857d`.	2021-06-14 15:54:57 +01:00
Jingu Kang	08ce52ef5e	[AArch64] Improve SAD pattern Given a vecreduce_add node, detect the below pattern and convert it to the node sequence with UABDL, [S|U]ABD and UADDLP. i32 vecreduce_add( v16i32 abs( v16i32 sub( v16i32 [sign|zero]_extend(v16i8 a), v16i32 [sign|zero]_extend(v16i8 b)))) =================> i32 vecreduce_add( v4i32 UADDLP( v8i16 add( v8i16 zext( v8i8 [S|U]ABD low8:v16i8 a, low8:v16i8 b v8i16 zext( v8i8 [S|U]ABD high8:v16i8 a, high8:v16i8 b Differential Revision: https://reviews.llvm.org/D104042	2021-06-14 15:48:51 +01:00
Hans Wennborg	c60dd3b262	Revert "[clang] NRVO: Improvements and handling of more cases." This change caused build errors related to move-only __block variables, see discussion on https://reviews.llvm.org/D99696 > This expands NRVO propagation for more cases: > > Parse analysis improvement: > * Lambdas and Blocks with dependent return type can have their variables > marked as NRVO Candidates. > > Variable instantiation improvements: > * Fixes crash when instantiating NRVO variables in Blocks. > * Functions, Lambdas, and Blocks which have auto return type have their > variables' NRVO status propagated. For Blocks with non-auto return type, > as a limitation, this propagation does not consider the actual return > type. > > This also implements exclusion of VarDecls which are references to > dependent types. > > Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> > > Reviewed By: Quuxplusone > > Differential Revision: https://reviews.llvm.org/D99696 This also reverts the follow-on change which was hard to tease apart from the one above: > "[clang] Implement P2266 Simpler implicit move" > > This Implements [[http://www.open-std.org/jtc1/sc22/wg21/docs/papers/2021/p2266r1.html|P2266 Simpler implicit move]]. > > Signed-off-by: Matheus Izvekov <mizvekov@gmail.com> > > Reviewed By: Quuxplusone > > Differential Revision: https://reviews.llvm.org/D99005 This reverts commits `1e50c3d785` and `bf20631782`.	2021-06-14 16:46:58 +02:00
LLVM GN Syncbot	bfd451a0ca	[gn build] Port `c820b494d6`	2021-06-14 14:41:33 +00:00
zoecarver	c820b494d6	[libcxx][ranges] Implement views::all. Differential Revision: https://reviews.llvm.org/D102028	2021-06-14 10:41:00 -04:00
Raphael Isemann	d94ce1a391	[lldb][docs] Add the missing rst anchors to the Python enum docs	2021-06-14 16:31:28 +02:00
Raphael Isemann	e3d5e3193f	[lldb][docs] Fix section name for InputReaderGranularity	2021-06-14 16:21:40 +02:00
Arthur O'Dwyer	bbd717b9a3	[libc++] [test] No longer rely on std::hash<T>::argument_type. Differential Revision: https://reviews.llvm.org/D104166	2021-06-14 10:14:42 -04:00
Denys Shabalin	c83e696732	Add AutomaticAllocationScope to memref.alloca_scope This change adds `AutomaticAllocationScope` to the memref.alloca_scope op. Additionally, it also clarifies that alloca_scope is conceptually a passthrough operation. Reviewed By: ftynse, bondhugula Differential Revision: https://reviews.llvm.org/D104227	2021-06-14 16:09:06 +02:00
Peter Steinfeld	b88fa0e39f	[flang] Fix compilation problem with rename of "MemRefDataFlow" Revision https://reviews.llvm.org/D104190 renamed MemRefDataFlow -> AffineScalarReplacement. After this rename, mlir failed to build. With this change, all of clang, mlir, and flang build and test correctly. Differential Revision: https://reviews.llvm.org/D104223	2021-06-14 07:01:11 -07:00
David Spickett	31b9acaec5	Reland "[lldb] Set return status to failed when adding a command error" This reverts commit `ac031c8db2`. SB API usage has been corrected.	2021-06-14 14:26:47 +01:00
Roman Lebedev	0f94c3c80d	[NFC][DAGCombine] Extract getFirstIndexOf() lambda back into a function Not all supported compilers like such lambdas, at least one buildbot is unhappy.	2021-06-14 16:25:59 +03:00


391386 Commits, All Branches
