Add a shape.func op for use (primarily) in the shape.function_library op. This
allows setting a default dialect for somewhat simpler authoring. It is a
minimal version of the ops needed.
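A hedged sketch of the intended authoring style (the op and type syntax here
is assumed from the description above and may differ in detail):

  shape.function_library @lib {
    shape.func @same(%arg0 : !shape.shape) -> !shape.shape {
      return %arg0 : !shape.shape
    }
  }

With a default dialect set on the library, ops in the body could be written
with less qualification.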
Differential Revision: https://reviews.llvm.org/D124055
If there is only a single element in the vector, then we can
just extract that element to compute the final result.
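One plausible instance (pseudo-IR; the exact op and syntax depend on the
pattern in question):

  // A reduction over a one-element vector...
  %r = vector.reduction <add>, %v : vector<1xf32> into f32
  // ...is just that element.
  %r = vector.extract %v[0] : vector<1xf32>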
Reviewed By: mravishankar
Differential Revision: https://reviews.llvm.org/D124129
Backwards search
The sext.w removal pass (before the new patch) checks whether the input to sext.w is already in sign-extended form, so it can eliminate it. It does that by checking that every definition/source that reaches the sext.w is an instruction that produces a sign-extended value, either by definition (e.g. ADDW) or by propagating sign extension (e.g. OR), in which case we check its sources recursively.
Forward search
Sometimes, one of the sources is an instruction that doesn't always produce a sign-extended value but has a W-version that does (e.g. ADD / ADDW). If we transform the ADD to ADDW, the sext.w can be removed (assuming the other def paths are satisfied), but this transformation is sound only if every use of this ADD/ADDW requires only the lower 32 bits, either directly (like sll %x, 32) or by propagating the dependency (the lower word of the output depends only on the lower word of the input), so we check its uses recursively.
When searching backwards, if an instruction that can be replaced with its W-variant is encountered, the pass runs the forward search to verify it can be replaced, then adds it to a list of fixable instructions. After verifying all paths, it replaces the instructions and removes the sext.w.
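As an illustration (register names arbitrary), given:

  add    a0, a1, a2
  sext.w a0, a0

if every use of the add result needs only the lower 32 bits, the pass
rewrites this to:

  addw   a0, a1, a2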
Reviewed By: craig.topper
Differential Revision: https://reviews.llvm.org/D119928
At some point in instruction selection, A2_tfrsi Constant:i32<...> was
created, where the "Constant" came from SelectAnyInt. Since it wasn't
a TargetConstant, it was selected again, leading to
%vreg = A2_tfrsi ...
... = A2_tfrsi %vreg
which is not valid code.
The functionality of restoreStatOnFile may be reused. Move it into
FileUtilities.cpp and create a helper class, FilePermissionsApplier,
to store and apply permissions.
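A hedged usage sketch (the create/apply method names are assumed from the
description, not copied from the header):

  // Capture the permissions of the input file...
  Expected<FilePermissionsApplier> PA = FilePermissionsApplier::create(InputPath);
  if (!PA)
    return PA.takeError();
  // ...then, after writing the output, apply them to it.
  if (Error E = PA->apply(OutputPath))
    return E;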
Differential Revision: https://reviews.llvm.org/D123821
NamedDecl::getIdentifier can return a nullptr when
DeclarationName::isIdentifier is false, which leads to a null pointer
dereference when TypePrinter::printTemplateId calls ->getName().
NamedDecl::getName does the same thing in the successful case and
returns an empty string in the failure case.
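Roughly, the change amounts to the following (simplified; OS and TD are
stand-ins for the printer's stream and the printed declaration):

  // Before: getIdentifier() returns nullptr for non-identifier names.
  OS << TD->getIdentifier()->getName();
  // After: getName() returns an empty string in that case.
  OS << TD->getName();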
This crash affects the llvm 14 packages on llvm.org.
> Includes regression test for problem noted by @hans.
> This reverts commit 973de71.
>
> Differential Revision: https://reviews.llvm.org/D106898
The feature as implemented is fairly expensive and hasn't been used by
libc++. A reimplementation is possible if libc++ becomes
interested in this feature again.
Differential Revision: https://reviews.llvm.org/D123885
This patch adds custom MIR operand comments to VTYPE immediate operands
in VSETVLI instructions and SEW/LMUL operands in vector codegen pseudo
instructions. The result is intended to be more human-readable and
hopefully maintainable when working with MIR, particularly when
writing or reading test cases.
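For example (operand values illustrative), a VSETVLI immediate now prints
with its decoded VTYPE fields as a comment:

  dead $x0 = PseudoVSETVLI killed renamable $x10, 80 /* e32, m1, ta, mu */, implicit-def $vl, implicit-def $vtype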
Reviewed By: craig.topper
Differential Revision: https://reviews.llvm.org/D124187
This makes the API easier to use. Also allows us to check for incorrect API usage for easier debugging.
Differential Revision: https://reviews.llvm.org/D124265
Reduce peak memory usage by tearing down the intermediate representation
as we build the final one, rather than deleting it at the end.
Differential Revision: https://reviews.llvm.org/D124240
This is a very specific fold to fix a poor codegen issue reported upstream.
InstCombine has the much more flexible pushFreezeToPreventPoisonFromPropagating, but I don't think we're quite there with DAG/TLI handling for canCreateUndefOrPoison / isGuaranteedNotToBeUndefOrPoison value tracking yet.
Fixes #54911
Differential Revision: https://reviews.llvm.org/D124185
The `hasFilter` field is not needed. Instead, the filter accepts ops by default if no ALLOW rule was specified.
Differential Revision: https://reviews.llvm.org/D124264
Currently metadata is inserted in a late pass which is lowered
to an AssertZext. The metadata would be more useful if it was
inserted earlier after inlining, but before codegen.
Probably shouldn't change anything now. Fully replacing the
late metadata annotation needs more work, since we lose
out on optimizations after these are lowered to CopyFromReg.
Seems to be slightly better than relying on the AssertZext from the
metadata. The test change in cvt_f32_ubyte.ll is a quirk from it using
-start-before=amdgpu-isel instead of running the usual codegen
pipeline.
The most common situation where G_ASSERT_ZEXT appears for AMDGPU is a
copy from a physical register, which happens to set the actual
register class on the virtual register. After copy coalescing, the
assert's source operand had a vreg with a set class. The verifier was
strictly rejecting cases where the set class/bank weren't an exact
match. Additionally, RegBankSelect was also expecting a register bank
to be set on the register, not a class.
This is much stricter than for regular copies, so relax this behavior. This
now allows these 2 cases:
1. Source register has either a class or a bank, and the result does not
2. Source register has a register class, and the result is a register
with a matching bank.
This should avoid needing some kind of special handling to avoid
violating this constraint when folding copies.
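For example (sketch; the class and bank names are illustrative), the
verifier now accepts cases like:

  %src:sgpr_32 = COPY $sgpr0
  %dst:sgpr(s32) = G_ASSERT_ZEXT %src, 16

where the source carries a register class and the result only a matching
bank (case 2 above).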
The original patch (https://reviews.llvm.org/D121354) targets x86 and adjusts
the lookahead score of splat loads, as they can be done by the `movddup`
instruction, which combines the load and the broadcast and is cheap to execute.
A similar issue shows up on AArch64. The `ld1r` instruction performs a broadcast
load and is cheap to execute.
This patch implements the TargetTransformInfo hooks for AArch64.
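For illustration (hedged AArch64 snippets), a scalar load plus broadcast:

  ldr  s0, [x0]
  dup  v0.4s, v0.s[0]

becomes a single broadcast load:

  ld1r { v0.4s }, [x0]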
Differential Revision: https://reviews.llvm.org/D123638
This reverts part of https://reviews.llvm.org/D124224 that causes
an assert because the register allocator triggers a pathological
situation where there's no safe way to insert a zeroing MOVPFRX
instruction.
These command-line flags are alternatives to providing the -x
c++-*-header indicators that we are building a header unit.
Act on -fmodule-header= for headers on the command line:
If we have x.hh -fmodule-header, then we should treat that header
as a header unit input (equivalent to -xc++-header-unit-header x.hh).
Likewise, for -fmodule-header={user,system} the source should now be
recognised as a header unit input (since this can affect the job list
that we need).
It's not practical to recognise a header without any suffix, so
-fmodule-header=system foo isn't going to happen, although
-fmodule-header=system foo.hh will work OK. However, we can make it
work if the user indicates that the item without a suffix is a valid
header (so -fmodule-header=system -xc++-header vector).
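Illustrative invocations (assuming a C++20 compile; flag spellings as above):

  clang -std=c++20 -fmodule-header x.hh
  clang -std=c++20 -fmodule-header=system -xc++-header vector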
Differential Revision: https://reviews.llvm.org/D121589
vector.broadcast can inject all the size-one dimensions. If it's
followed by a vector.shape_cast back to the original type, we can
cancel the op pair, like cancelling consecutive shape_cast ops.
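For example (hedged MLIR sketch):

  %b = vector.broadcast %v : vector<4xf32> to vector<1x1x4xf32>
  %c = vector.shape_cast %b : vector<1x1x4xf32> to vector<4xf32>

Here %c can be replaced by %v directly.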
Reviewed By: mravishankar
Differential Revision: https://reviews.llvm.org/D124094
This patch extends cc1as to export the build version load command with
LC_VERSION_MIN_MACOSX.
This is especially important for Mac Catalyst, as Mac Catalyst uses
macOS's compiler-rt builtins.
Differential Revision: https://reviews.llvm.org/D121868
Add DestructiveBinaryComm* patterns for ORR, EOR, AND and BIC.
The above instructions require that the source and destination registers are
equal, so using movprfx should be beneficial to performance.
Note: BIC (i.e. A & ~B) is not a commutative operation.
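For example (illustrative registers), when the destination differs from the
first source:

  movprfx z0, z1
  orr     z0.d, p0/m, z0.d, z2.d

This gives the effect of a non-destructive z0 = z1 | z2 for the active
lanes, with the movprfx expected to fuse with the orr.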
Reviewed By: paulwalker-arm, david-arm
Differential Revision: https://reviews.llvm.org/D124224
The autiasp and autibsp instructions are the counterparts of paciasp/pacibsp,
so let's emit .cfi_negate_ra_state for these too.
With the Armv8.3 instruction set, retaa/retab perform the return and the
authentication in one step; there we can't emit .cfi_negate_ra_state because
it would point after the ret* instruction.
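A sketch of the intended output (unwind directives interleaved with the
prologue/epilogue):

  paciasp
  .cfi_negate_ra_state
  ...
  autiasp
  .cfi_negate_ra_state
  ret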
Reviewed By: nickdesaulniers, MaskRay
Differential Revision: https://reviews.llvm.org/D111780
`__attribute__((__aligned__))` is present but ignored.
In the original code, the 'getDeclAlignIfRequired' function is used.
That function returns the max alignment of all aligned attributes
if the type has any, but it doesn't consider the type itself at all.
The 'getTypeAlignIfRequired' function uses the type's alignment value,
which is also what 'alignof' uses. I think we should use
'getTypeAlignIfRequired'.
Reviewed By: dblaikie, jmorse, wolfgangp
Differential Revision: https://reviews.llvm.org/D124006
Folds are supposed to always be added in conjugated pairs for `and`
and `or`. Merge the two functions to make the folds for which this is
currently not the case more obvious.
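For example, a known conjugated pair:

  and: (X == 0) && (Y == 0)  ->  (X | Y) == 0
  or:  (X != 0) || (Y != 0)  ->  (X | Y) != 0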
* Move Module Bufferization to the bufferization dialect. The implementation is split into `OneShotModuleBufferize.cpp` and `FuncBufferizableOpInterfaceImpl.cpp`, so that the external model implementation can be easily moved to the func dialect in the future.
* Split and clean up test cases. A few test cases are still remaining in Linalg and will be updated separately.
* `linalg.inplaceable` is renamed to `bufferization.writable` to accurately reflect its current usage; see the sketch after this list.
* Attributes and their verifiers are moved from the Linalg dialect to the Bufferization dialect.
* Expand documentation.
* Add a new flag to One-Shot Bufferize to allow for function boundary bufferization.
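A hedged sketch of the renamed attribute in use (exact syntax may differ):

  func @fn(%t: tensor<?xf32> {bufferization.writable = true}) -> tensor<?xf32>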
Differential Revision: https://reviews.llvm.org/D122229
1d90e53044 switched this code to store
the predicates and operands in variables, but retained a
swapOperands() call here. Thus the commuted cases were no longer
folded. Additionally, as the change was not reported, the next
InstCombine iteration would not pick it up either.
The layout postprocessing step was removed and is now part of the FuncOp bufferization. If the user specified a certain layout map for a tensor function arg, use that layout map directly when bufferizing the function signature. Previously, the bufferization used a generic layout map for every tensor function arg and then updated function signatures and CallOps in a separate step.
Differential Revision: https://reviews.llvm.org/D122228
FuncOps are now less special. They must still be analyzed + bufferized in a certain order, but they are now bufferized the same way as other ops that have a region: bufferize the op first (`bufferize` interface method), then bufferize the region body with other bufferization patterns. In the case of FuncOps, the function signature is bufferized together with ReturnOps, similar to how, e.g., scf.for ops are bufferized together with scf.yield ops.
This change is essentially a reimplementation of the FuncOp bufferization, but mostly NFC from a user's perspective (apart from error messages). This change is in preparation of moving the code to the bufferization dialect.
Differential Revision: https://reviews.llvm.org/D123214