llvm-project

Commit Graph

Author	SHA1	Message	Date
Kazu Hirata	720c48b58e	[AMDGPU] Fix an unused variable warning (NFC) This patch fixes: llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp:2245:12: error: unused variable 'Ins' [-Werror,-Wunused-variable]	2022-01-10 08:49:46 -08:00
Philip Reames	332642e693	Add test coverage for D116851	2022-01-10 08:48:49 -08:00
Philip Reames	f4c54683d6	[instcombine] Infer alignment for aligned_alloc with potentially zero size This change removes a previous restriction where we had to prove the allocation performed by aligned_alloc was non-zero in size before using the align parameter to annotate the result. I believe this was conservatism around the C11 specification of this routine which allowed UB when size was not a multiple of alignment, but if so, it was a partial one at best. (ex: align 32, size 16 was equally UB, but not restricted) The spec has since been clarified to require nullptr return, not UB. A nullptr - the documented return for this function on failure for all cases after UB mentioned above was removed - is trivially aligned for any power of two. This isn't totally new behavior even for this transform, we'd previously annotate potentially failing allocs (e.g. huge sizes) meaning we were putting align on potentially null pointers anyways. This change simpy does the same for all failure modes.	2022-01-10 08:48:49 -08:00
Arthur O'Dwyer	4b5d59ffd0	[libc++] libcxx/utils: s/preambule/preamble/g. NFC.	2022-01-10 11:38:25 -05:00
Simon Pilgrim	4e77868c7c	[SemaDecl] Use castAs<> instead of getAs<> to avoid dereference of nullptr This will assert the cast is correct instead of returning nullptr	2022-01-10 16:31:08 +00:00
James Y Knight	39f75ab6a7	Fix unused-variable warnings after `d0ee094d6a`.	2022-01-10 16:28:54 +00:00
Jeremy Morse	0f92c113a3	[Dexter] Allow DexUnreachable in supplementary .dex files DexUnreachable is a useful tool for specifying that lines shouldn't be stepped on. Right now they have to be placed in the source file; lets allow them to be placed instead in a detached .dex file, by adding on_line and line-range keyword arguments to the command. Differential Revision: https://reviews.llvm.org/D115449	2022-01-10 16:22:53 +00:00
Roman Lebedev	c22fb4b521	[NFC] clang-format the whole ScalarEvolutionExpressions.h This file has completely wrong formatting, and modifying it leads to having to fight around that. every time. This is a pure reformatting, there are NO other changes here.	2022-01-10 19:18:14 +03:00
Johannes Doerfert	7b39dccbe4	[Attributor][FIX] Ensure "IsExact" is false for non-exact accesses If we look at potentially interfering accesses we need to ensure the "IsExact" flag is set appropriately. Accesses that have an "unknown" size or offset cannot be exact matches and we missed to flag that. Error and test reported by Serguei N. Dmitriev.	2022-01-10 10:09:36 -06:00
Simon Pilgrim	c1f1359882	[PGOInstrumentation] populateEHOperandBundle - earlyout if !isa<CallBase> All paths (that actually do anything) require a successful dyn_cast<CallBase> - so just earlyout if the cast fails Fixes static analyzer nullptr deference warning	2022-01-10 15:34:37 +00:00
Simon Pilgrim	353484d191	[LowerExpectIntrinsic] Use cast<> instead of dyn_cast<> to avoid dereference of nullptr. NFC	2022-01-10 15:34:37 +00:00
Pavel Labath	8ccfcab34f	[lldb/platform-gdb] Clear cached protocol state upon disconnection Previously we would persist the flags indicating whether the remote side supports a particular feature across reconnects, which is obviously not a good idea. I implement the clearing by nuking (its the only way to be sure :) the entire GDBRemoteCommunication object in the disconnect operation and creating a new one upon connection. This allows us to maintain a nice invariant that the GDBRemoteCommunication object (which is now a pointer) exists only if it is connected. The downside to that is that a lot of functions now needs to check the validity of the pointer instead of blindly accessing the object. The process communication does not suffer from the same issue because we always destroy the entire Process object for a relaunch. Differential Revision: https://reviews.llvm.org/D116539	2022-01-10 16:27:30 +01:00
Nicolas Vasilache	d0ee094d6a	[mlir][Bufferize] Fix incorrect bufferization of rank-reducing tensor ops. This revision fixes SubviewOp, InsertSliceOp, ExtractSliceOp construction during bufferization where not all offset/size/stride operands were properly specified. A test that exhibited problematic behaviors related to incorrect memref casts is introduced. Init tensor optimization is disabled in teh testing func bufferize pass. Differential Revision: https://reviews.llvm.org/D116899	2022-01-10 10:14:55 -05:00
Nikita Popov	7543365449	[Thumb2] Regenerate test checks (NFC)	2022-01-10 16:12:41 +01:00
Yitzhak Mandelbaum	64f7b2d4bf	[clang][dataflow] Change `transfer` function to update lattice element in place. Currently, the transfer function returns a new lattice element, which forces an unnecessary copy on processing each CFG statement. Differential Revision: https://reviews.llvm.org/D116834	2022-01-10 14:45:30 +00:00
Matthias Springer	0696ad9563	[mlir][linalg][bufferize][NFC] Pass missing BufferizationState objs as const ref These should have been updated as part of D116742. Differential Revision: https://reviews.llvm.org/D116937	2022-01-10 23:38:04 +09:00
Nicolas Vasilache	1a2474b786	[mlir][Linalg] Disable init_tensor elimination by default init_tensor elimination is arguably a pre-optimization that should be separated from comprehensive bufferization. In any case it is still experimental and easily results in wrong IR with violated SSA def-use orderings. Isolate the optimization behind a flag, separate the test cases and add a test case that would results in wrong IR. Differential Revision: https://reviews.llvm.org/D116936	2022-01-10 09:19:18 -05:00
Haojian Wu	c2293bc17d	Revert "[AST] Add RParen loc for decltype AutoTypeloc." This breaks a clang-tidy check, needs to investigate and fix. Revert them to bring the buildbot back. This reverts commit `55d96ac3dc` and `37ec65e1d7`	2022-01-10 15:18:41 +01:00
David Sherwood	b0922a9dcd	[LoopVectorize] Make VPWidenCanonicalIVRecipe::execute work for scalable vectors The code in VPWidenCanonicalIVRecipe::execute only worked for fixed-width vectors due to the way we generate the values per lane. This patch changes the code to use a combination of vector splats and step vectors to get the same result. This then works for both fixed-width and scalable vectors. Tests that exercise this code path for scalable vectors have been added here: Transforms/LoopVectorize/AArch64/sve-tail-folding.ll Differential Revision: https://reviews.llvm.org/D113180	2022-01-10 14:12:32 +00:00
Nuno Lopes	7b1cb72ad9	[SROA] Switch replacement of dead/UB/unreachable ops from undef to poison SROA has 3 data-structures where it stores sets of instructions that should be deleted: - DeadUsers -> instructions that are UB or have no users - DeadOperands -> instructions that are UB or operands of useless phis - DeadInsts -> "dead" instructions, including loads of uninitialized memory with users The first 2 sets can be RAUW with poison instead of undef. No brainer as UB can be replaced with poison, and for instructions with no users RAUW is a NOP. The 3rd case cannot be currently replaced with poison because the set mixes the loads of uninit memory. I leave that alone for now. Another case where we can use poison is in the construction of vectors from multiple loads. The base vector for the first insertelement is now poison as it doesn't matter as it is fully overwritten by inserts. Differential Revision: https://reviews.llvm.org/D116887	2022-01-10 14:04:26 +00:00
Nikita Popov	7725331ccd	[CodeGen] Avoid some pointer element type accesses Possibly this is sufficient to fix PR53089.	2022-01-10 15:02:55 +01:00
Louis Dionne	7893bb7408	[libc++] Refactor the test for join_view's default constructor In particular, this removes the need for adding a ad-hoc `operator==` to forward_iterator. Differential Revision: https://reviews.llvm.org/D116614	2022-01-10 09:02:45 -05:00
Serge Guelton	d2cc6c2d0c	Use a sorted array instead of a map to store AttrBuilder string attributes Using and std::map<SmallString, SmallString> for target dependent attributes is inefficient: it makes its constructor slightly heavier, and involves extra allocation for each new string attribute. Storing the attribute key/value as strings implies extra allocation/copy step. Use a sorted vector instead. Given the low number of attributes generally involved, this is cheaper, as showcased by https://llvm-compile-time-tracker.com/compare.php?from=5de322295f4ade692dc4f1823ae4450ad3c48af2&to=05bc480bf641a9e3b466619af43a2d123ee3f71d&stat=instructions Differential Revision: https://reviews.llvm.org/D116599	2022-01-10 14:49:53 +01:00
Nikita Popov	2c0fb96254	[TypeFinder] Support opaque pointers We need to explicitly visit a number of types, as these are no longer reachable through the pointer type if opaque pointers are enabled. This is similar to ValueEnumerator changes that have been done previously.	2022-01-10 14:46:45 +01:00
Alex Xu (Hello71)	f282b68091	set __NO_MATH_ERRNO__ if -fno-math-errno This causes modern glibc to unset math_errhandling MATH_ERRNO. gcc 12 also sets some other macros, but most of them are associated with flags ignored by clang, so without library examples, it is difficult to determine whether they should be set. I think setting this one macro is OK for now.	2022-01-10 08:45:46 -05:00
Louis Dionne	c01260a781	[libc++] Add missing `return 0` to main() functions in the tests	2022-01-10 08:38:18 -05:00
Matthias Springer	446f0c609f	[mlir][linalg][bufferize][NFC] Update comments in BufferizableOpInterface Differential Revision: https://reviews.llvm.org/D116932	2022-01-10 22:34:26 +09:00
Louis Dionne	e6d7b2f0de	[libc++] Fix link to bug tracker	2022-01-10 08:33:28 -05:00
Simon Pilgrim	5eb47961c4	[CostModel][X86] Update ROTL/ROTR vXi8/vXi16 costs on AVX512BW targets Refresh based off recent improvements to codegen and the helper script from D103695	2022-01-10 13:18:25 +00:00
Louis Dionne	49b9b9dc9c	[libc++] Fix the documentation and re-enable documentation CI The documentation CI job is very cheap, so we can afford to keep it around even with reduced capacity. This commit fixes the documentation (which had an invalid reference in it) and re-enables that CI step. Differential Revision: https://reviews.llvm.org/D116897	2022-01-10 08:05:58 -05:00
Matthias Springer	cb64c541f9	[mlir][linalg][bufferize][NFC] Clean up bufferization entry point Differential Revision: https://reviews.llvm.org/D116892	2022-01-10 21:50:36 +09:00
Haojian Wu	37ec65e1d7	[clangd] Enable expand-auto for decltype(auto). Based on https://reviews.llvm.org/D116919. Fixes https://github.com/clangd/clangd/issues/121 Differential Revision: https://reviews.llvm.org/D116921	2022-01-10 13:46:56 +01:00
Martin Storsjö	392aa97acc	[llvm-objcopy] Implement the PE-COFF specific --subsystem option This implements the parsing of the highly PE-COFF specific option in ConfigManager.cpp, setting Optional<> values in COFFConfig, which then are used in COFFObjcopy. This should fix https://github.com/mstorsjo/llvm-mingw/issues/239. Differential Revision: https://reviews.llvm.org/D116556	2022-01-10 14:44:15 +02:00
Tim Northover	581e855623	AArch64: don't claim to preserve registers used by prologue code	2022-01-10 12:27:04 +00:00
Petar Avramovic	d9d2516aaf	AMDGPU/GlobalISel: Rework legalization for extract/insert vector elt Use G_MERGE_VALUES and G_UNMERGE_VALUES on vector elements instead of G_EXTRACT and G_INSERT when doing custom legalization for G_EXTRACT_VECTOR_ELT and G_INSERT_VECTOR_ELT. With this approach legalization artifact combiner gets direct access to all vector elements. Differential Revision: https://reviews.llvm.org/D116115	2022-01-10 13:15:20 +01:00
Florian Hahn	003ac239d8	[SROA] Reduce the number of times a IRBuilder is constructed (NFC). This patch reduces the number of times IRBuilders need to be constructed in SROA.cpp by passing existing ones by reference to the appropriate places.	2022-01-10 12:09:13 +00:00
Haojian Wu	55d96ac3dc	[AST] Add RParen loc for decltype AutoTypeloc. Differential Revision: https://reviews.llvm.org/D116919	2022-01-10 12:46:27 +01:00
Haojian Wu	abe3003ead	[AST] Use recovery-expr to preserve incomplete-type-member-access expression. Fixes https://github.com/clangd/clangd/issues/502 Differential Revision: https://reviews.llvm.org/D116922	2022-01-10 12:45:20 +01:00
Alex Zinenko	c44d521b30	[mlir] address post-commit review for D116759	2022-01-10 12:40:58 +01:00
Florian Hahn	aecad5828e	[SCEVExpander] Only create trunc when needed. `9345ab3a45` updated generateOverflowCheck to skip creating checks that always evaluate to false. This in turn means that we only need to create TruncTripCount if it is actually used. Sink the TruncTripCount creating into ComputeEndCheck, so it is only created when there's an actual check.	2022-01-10 11:31:27 +00:00
Jeremy Morse	3a094d8b27	[Dexter] Allow tests to specify command line options This patch adds a "DexCommandLine" command, allowing dexter tests to specify what command line options the test should be started with. I've also plumbed it through into the debuggers. This eases the matter of pointing Dexter at larger tests, or controlling different paths through a single binary from a Dexter test. Differential Revision: https://reviews.llvm.org/D115330	2022-01-10 11:30:06 +00:00
Mubashar Ahmad	d17fb46e89	[Clang][AArch64][ARM] PMUv3.4 Option Added An option has been added to Clang to enable or disable the PMU v3.4 architecture extension. Differential Revision: https://reviews.llvm.org/D116748	2022-01-10 11:28:19 +00:00
Sam McCall	1ab13793be	[clangd] Include fixer for missing functions in C A function call `unresolved()` in C will generate an implicit declaration of the missing function and warn `ext_implicit_function_decl` or so. (Compared to in C++ where we get `err_undeclared_var_use`). We want to try to resolve these names. Unfortunately typo correction is disabled in sema for performance reasons unless this warning is promoted to error. (We need typo correction for include-fixer.) It's not clear to me where a switch to force this correction on should go, include-fixer is kind of a hack. So hack more by telling sema we're promoting them to error. Fixes https://github.com/clangd/clangd/issues/937 Differential Revision: https://reviews.llvm.org/D115490	2022-01-10 12:17:19 +01:00
Alex Zinenko	2f672e2ffa	[mlir] Don't inline calls from dead SCCs During iterative inlining of the functions in a multi-step call chain, the inliner could add the same call operation several times to the worklist, which led to use-after-free when this op was considered more than once. Closes #52887. Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D116820	2022-01-10 12:07:14 +01:00
David Green	e92d63b467	Revert "[LoopFlatten] Move it to a LoopPassManager" This commit caused performance regressions due to differences in the expected code during loop flattening. Reverting it until the fix is ready, which hopefully wont take too long. This reverts commit `86825fc2fb`.	2022-01-10 11:03:49 +00:00
Stephan Herhut	33cec20dbd	[mlir][memref] Tighten verification of memref.reinterpret_cast We allow the omission of a map in memref.reinterpret_cast under the assumption, that the cast might cast to an identity layout. This change adds verification that the static knowledge that is present in the reinterpret_cast supports this assumption. Differential Revision: https://reviews.llvm.org/D116601	2022-01-10 11:55:47 +01:00
David Sherwood	e3c84fb948	[LoopVectorize] Add support for tail folding using scalable vectors This patch fixes up an issue with InnerLoopVectorizer::getOrCreateVectorTripCount whereby we weren't correctly generating the runtime trip count for scalable vectors when tail-folding. It also removes some asserts in the tail-folding path for cases when the VF is not scalable. In this patch I have only permitted tail-folding to be enabled explicitly for scalable vectors when the user has specified one of the following flags: -prefer-predicate-over-epilogue=predicate-dont-vectorize -prefer-predicate-over-epilogue=predicate-else-scalar-epilogue For now it's best not to enable tail-folding with scalable vectors for low trip counts or when optimising for code size, since there has been no analysis on whether this is worth it. Various tests have been added here: Transforms/LoopVectorize/AArch64/sve-tail-folding.ll Transforms/LoopVectorize/AArch64/sve-tail-folding-forced.ll The tests cannot be target independent because they require masked load/store support, i.e. TTI.isLegalMaskedLoad and TTI.isLegalMaskedStore need to return true. Differential Revision: https://reviews.llvm.org/D113003	2022-01-10 10:55:40 +00:00
Cullen Rhodes	eee993ae4c	[AArch64][SVE] Fold predicate into compare Codegen of added testcase before this patch: ptrue p0.s cmpgt p1.s, p0/z, z0.s, z1.s cmpge p2.s, p0/z, z2.s, z1.s and p0.b, p0/z, p1.b, p2.b ret Patterns originally authored by Will Lovett. Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D116749	2022-01-10 10:52:06 +00:00
Shraiysh Vaishay	a8586b573e	[mlir][OpenMP] Change the syntax of omp.atomic.read op This patch changes the syntax of omp.atomic.read to take the address of destination, instead of having the value in a result. This will allow using omp.atomic.read operation within an omp.atomic.capture operation thus making its implementation less complex. Reviewed By: peixin Differential Revision: https://reviews.llvm.org/D116396	2022-01-10 16:19:45 +05:30
David Sherwood	8e773f4491	[NFC] Add tests for splats of illegal integer vector types Differential Revision: https://reviews.llvm.org/D116810	2022-01-10 10:37:36 +00:00

1 2 3 4 5 ...

409372 Commits All Branches Search

409372 Commits

All Branches