llvm-project

Commit Graph

Author	SHA1	Message	Date
Guillaume Chatelet	71405d90f0	[libc] Select FPUtils implementations via code instead of build We want to simplify the build system and rely on code to do the implementation selection. This is in preparation of adding a Bazel configuration (D114712). Differential Revision: https://reviews.llvm.org/D115034	2021-12-03 15:48:41 +00:00
Balázs Kéri	1cefe91d40	[clang-tidy][docs][NFC] Improve documentation of bugprone-unhandled-exception-at-new Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D114602	2021-12-03 16:53:08 +01:00
Stephen Tozer	98a021fcbf	[DebugInfo] Attempt to preserve more information during tail duplication Prior to this patch, tail duplication handled debug info poorly - specifically, debug instructions would be dropped instead of being set undef, potentially extending the lifetimes of prior debug values that should be killed. The pass was also very aggressive with dropping debug info, dropping debug info even when the SSA value it referred to was still present. This patch attempts to handle debug info more carefully, checking to see whether each affected debug value can still be live, setting it undef if not. Reviewed By: jmorse Differential Revision: https://reviews.llvm.org/D106875	2021-12-03 15:30:05 +00:00
David Green	ab0c5cea0b	[ARM] Use v2i1 for MVE and CDE intrinsics This adjusts all the MVE and CDE intrinsics now that v2i1 is a legal type, to use a <2 x i1> as opposed to emulating the predicate with a <4 x i1>. The v4i1 workarounds have been removed leaving the natural v2i1 types, notably in vctp64 which now generates a v2i1 type. AutoUpgrade code has been added to upgrade old IR, which needs to convert the old v4i1 to a v2i1 be converting it back and forth to an integer with arm.mve.v2i and arm.mve.i2v intrinsics. These should be optimized away in the final assembly. Differential Revision: https://reviews.llvm.org/D114455	2021-12-03 15:27:58 +00:00
Tue Ly	dbed678f4b	[libc] Fix bugs with negative and mixed normal/denormal inputs in hypot implementation. Fix a bug with negative and mixed normal/denormal inputs in hypot implementation. Differential Revision: https://reviews.llvm.org/D114726	2021-12-03 10:14:04 -05:00
Nemanja Ivanovic	d6c0ef7887	[PowerPC] Handle base load with reservation mnemonic The Power ISA defined l[bhwdq]arx as both base and extended mnemonics. The base mnemonic takes the EH bit as an operand and the extended mnemonic omits it, making it implicitly zero. The existing implementation only handles the base mnemonic when EH is 1 and internally produces a different instruction. There are historical reasons for this. This patch simply removes the limitation introduced by this implementation that disallows the base mnemonic with EH = 0 in the ASM parser. This resolves an issue that prevented some files in the Linux kernel from being built with -fintegrated-as. Also fix a crash if the value is not an integer immediate.	2021-12-03 09:13:02 -06:00
Anna Thomas	72750f0012	[TrivialDeadness] Introduce API separating two different usages The earlier usage of wouldInstructionBeTriviallyDead is based on the assumption that the use_count of that instruction being checked will be zero. This patch separates the API into two different ones: 1. The strictly conservative one where the instruction is trivially dead iff the uses are dead. 2. The slightly relaxed form, where an instruction is dead along paths where it is not used. The second form can be used in identifying instructions that are valid to sink down to uses (D109917). Reviewed-By: reames Differential Revision: https://reviews.llvm.org/D114647	2021-12-03 10:09:52 -05:00
Andy Yankovsky	0495301293	[lldb-vscode] Report supportsModulesRequest=true The adapter does support `Modules` request, implemented in `39239f9`. Reviewed By: wallace Differential Revision: https://reviews.llvm.org/D115033	2021-12-03 16:07:48 +01:00
Alexey Bataev	f6279562da	[OPENMP]Fix PR52117: Crash caused by target region inside of task construct. Need to do the analysis of the captured expressions in the clauses. Previously the compiler ignored them and it may lead to a compiler crash trying to get the address of the mapped variables. Differential Revision: https://reviews.llvm.org/D114546	2021-12-03 07:01:00 -08:00
Mehrnoosh Heidarpour	54dc03b97b	[InstSimplify] Add test case for logic 'or' fold; NFC	2021-12-03 09:29:43 -05:00
Matthias Springer	e359a1e548	[mlir][linalg][bufferize][NFC] Map only tensors in BufferizationState BufferizationState had map/lookup overloads for non-tensor values. This was necessary for IREE. There is now a better way to do this, so these overloads can be removed. Differential Revision: https://reviews.llvm.org/D114929	2021-12-03 23:07:09 +09:00
David Green	255ad73424	[ARM] Make MVE v2i1 predicates legal MVE can treat v16i1, v8i1, v4i1 and v2i1 as different views onto the same 16bit VPR.P0 register, with v2i1 holding two 8 bit values for the two halves. This was never treated as a legal type in llvm in the past as there are not many 64bit instructions and no 64bit compares. There are a few instructions that could use it though, notably a VSELECT (as it can handle any size using the underlying v16i8 VPSEL), AND/OR/XOR for similar reasons, some gathers/scatter and long multiplies and VCTP64 instructions. This patch goes through and makes v2i1 a legal type, handling all the cases that fall out of that. It also makes VSELECT legal for v2i64 as a side benefit. A lot of the codegen changes as a result - usually in way that is a little better or a little worse, but still expensive. Costs can change a little too in the process, again in a way that expensive things remain expensive. A lot of the tests that changed are mainly to ensure correctness - the code can hopefully be improved in the future where it comes up in practice. The intrinsics currently remain using the v4i1 they previously did to emulate a v2i1. This will be changed in a followup patch but this one was already large enough. Differential Revision: https://reviews.llvm.org/D114449	2021-12-03 14:05:41 +00:00
Jay Foad	b670dcb81b	[AMDGPU] Add some more GFX10 test coverage	2021-12-03 14:03:31 +00:00
Valentin Clement	d59a0f58f4	[fir] Add fir character builder This patch adds the FIR builder to generate the numeric intrinsic runtime call. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: rovka Differential Revision: https://reviews.llvm.org/D114900 Co-authored-by: Jean Perier <jperier@nvidia.com> Co-authored-by: mleair <leairmark@gmail.com>	2021-12-03 14:58:17 +01:00
Valentin Clement	c32421c925	[fir] Add fir derived type runtime builder This patch adds the builder to generate derived type runtime API calls. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: rovka Differential Revision: https://reviews.llvm.org/D114472 Co-authored-by: Peter Klausler <pklausler@nvidia.com> Co-authored-by: Jean Perier <jperier@nvidia.com>	2021-12-03 14:51:59 +01:00
Jay Foad	b29b6f92af	[AMDGPU] Add some more GFX10 GlobalISel test coverage	2021-12-03 13:40:27 +00:00
Matthias Springer	ed8c63115e	[mlir][linalg][bufferize][NFC] Provide default implementation of getAliasingOpOperand This simplifies op interface implementations. Differential Revision: https://reviews.llvm.org/D115025	2021-12-03 22:36:22 +09:00
Jay Foad	d133a21b71	[SelectionDAG] Add newline to a debug message	2021-12-03 13:33:32 +00:00
Florian Hahn	af86aa7980	[MemoryLocation] Use None instead of {}. (NFC)	2021-12-03 13:19:00 +00:00
Guillaume Chatelet	cca8e1e415	[libc][NFC] Fix typo in CMakeLists documentation	2021-12-03 13:52:09 +01:00
Adrian Kuegel	04d083b19e	[mlir][NFC] Use const reference for loop variables.	2021-12-03 13:07:54 +01:00
Simon Pilgrim	e85667a2fb	[PowerPC] Add non-constant fcopysign f128 test coverage As discussed on D114589 as the constant case gets affected by SimplifyDemandedBits a lot - the non-constant case currently falls back to copysignl libcalls	2021-12-03 12:04:06 +00:00
Petar Avramovic	0b34ffe4a6	AMDGPU/GlobalISel: Add clamp combine Add clamp combine. Source is fminnum(fmaxnum(Val, 0.0), 1.0) or fmaxnum(fminnum(Val, 1.0), 0.0) or fmed3 intrinsic with 0.0 and 1.0 as two out of three operands. Differential Revision: https://reviews.llvm.org/D90052	2021-12-03 12:49:39 +01:00
Petar Avramovic	ec54867d75	AMDGPU/GlobalISel: Add floating point med3 combine Add floating point version of med3 combine. Source is fminnum(fmaxnum(Val, K0), K1) or fmaxnum(fminnum(Val, K1), K0) where K0 and K1 are constants and K0 <= K1. Differential Revision: https://reviews.llvm.org/D90051	2021-12-03 12:49:39 +01:00
Petar Avramovic	ab01f4d264	AMDGPU/GlobalISel: Do not fcanonicalize const splat padded with undef Recognize constant splat padded with undef in isCanonicalized. Fcanonicalize will be removed by RemoveFcanonicalize in post-legalizer combiner. We will treat undef as value that will result in a splat in clamp combine after regbankselect. Differential Revision: https://reviews.llvm.org/D104408	2021-12-03 12:49:38 +01:00
Alex Zinenko	9dd1f8dfdd	[mlir] support recursive type conversion of named LLVM structs A previous commit added support for converting elemental types contained in LLVM dialect types in case they were not compatible with the LLVM dialect. It was missing support for named structs as they could be recursive, which was not supported by the conversion infra. Now that it is, add support for converting such named structs. Depends On D113579 Reviewed By: wsmoses Differential Revision: https://reviews.llvm.org/D113580	2021-12-03 12:41:40 +01:00
Matthias Springer	5e1c038f7d	[mlir][linalg][bufferize][NFC] Move FuncOp boundary bufferization to ModuleBufferization Differential Revision: https://reviews.llvm.org/D114670	2021-12-03 20:29:39 +09:00
Matthias Springer	ad1ba42f68	[mlir][linalg][bufferize] Allow unbufferizable ops in input Allow ops that are not bufferizable in the input IR. (Deactivated by default.) bufferization::ToMemrefOp and bufferization::ToTensorOp are generated at the bufferization boundaries. Differential Revision: https://reviews.llvm.org/D114669	2021-12-03 20:20:46 +09:00
Victor Perez	9eb7322748	[RISCV][VP] Add RVV codegen for vp.select Lower vp.select instrinsic to VSELECT_VL. Reviewed By: rogfer01 Differential Revision: https://reviews.llvm.org/D114629	2021-12-03 11:02:20 +00:00
Diana Picus	919738739a	[flang] Add missing LABEL in test. NFC	2021-12-03 10:56:24 +00:00
Diana Picus	3fd250d258	[fir] TargetRewrite: Rewrite fir.address_of(func) Rewrite AddrOfOp if taking the address of a function. Differential Revision: https://reviews.llvm.org/D114925 Co-authored-by: Eric Schweitz <eschweitz@nvidia.com>	2021-12-03 10:56:24 +00:00
Matthias Springer	867cd948ac	[mlir][linalg][bufferize][NFC] Move BufferizationOptions to op interface Also store a reference to BufferizationOptions in BufferizationState. This is in preparation of adding support for partial bufferization. Differential Revision: https://reviews.llvm.org/D114661	2021-12-03 19:51:34 +09:00
Valentin Clement	1f55103263	[fir] Add fircg.ext_embox conversion Convert a fircg.ext_embox operation to LLVM IR dialect. A fircg.ext_embox is converted to a sequence of operation that create, allocate if needed, and populate a descriptor. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D114148 Co-authored-by: Eric Schweitz <eschweitz@nvidia.com> Co-authored-by: Jean Perier <jperier@nvidia.com>	2021-12-03 11:45:36 +01:00
Kristina Bessonova	0bf2c87785	[llvm-dwarfdump] Do not print preceding :: for local types Reviewed By: dblaikie, jhenderson Differential Revision: https://reviews.llvm.org/D114892	2021-12-03 12:27:29 +02:00
Qiu Chaofan	b9adaa1782	[PowerPC] [Clang] Fix alignment adjustment of single-elemented float128 This does similar thing to `6b1341e`, but fixes single element 128-bit float type: `struct { long double x; }`. Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D114937	2021-12-03 18:07:34 +08:00
Qiu Chaofan	4f94c02616	[Clang] Mutate bulitin names under IEEE128 on PPC64 Glibc 2.32 and newer uses these symbol names to support IEEE-754 128-bit float. GCC transforms name of these builtins to align with Glibc header behavior. Since Clang doesn't have all GCC-compatible builtins implemented, this patch only mutates the implemented part. Note nexttoward is a special case (no nexttowardf128) so it's also handled here. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D112401	2021-12-03 17:50:18 +08:00
Guillaume Chatelet	1479a211d2	Fix typos in FPUtil README	2021-12-03 10:22:00 +01:00
Florian Hahn	f078536f46	[MemoryLocation] Move DSE's logic to new MemLoc::getForDest helper (NFC). DSE has some extra logic to determine the write location of library calls like strcpy and strcat. This patch moves the logic to a new MemoryLocation:getForDest variant, which takes a call and TLI. This patch should be NFC, because no other places take advantage of the new helper yet. Suggested by @reames post-commit `7eec832def`. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D114872	2021-12-03 09:12:01 +00:00
Nikita Popov	49d040ac97	[SCEV] Fix ValuesAtScopesUsers consistency Fixes verification failure reported at: https://reviews.llvm.org/rGc9f9be0381d1 The issue is that getSCEVAtScope() might compute a result without inserting it in the ValuesAtScopes map in degenerate cases, specifically if the ValuesAtScopes entry is invalidated during the calculation. Arguably we should still insert the result if no existing placeholder is found, but for now just tweak the logic to only update ValuesAtScopesUsers if ValuesAtScopes is updated.	2021-12-03 10:03:10 +01:00
Michal Terepeta	1423e8bf5d	[mlir][Vector] Support 0-D vectors in `BitCastOp` The implementation only allows to bit-cast between two 0-D vectors. We could probably support casting from/to vectors like `vector<1xf32>`, but I wasn't convinced that this would be important and it would require breaking the invariant that `BitCastOp` works only on vectors with equal rank. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114854	2021-12-03 08:55:59 +00:00
Michal Terepeta	8e2b373396	[mlir][Vector] Add some missing tests for `broadcast` and `splat` Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D114853	2021-12-03 08:52:51 +00:00
Florian Hahn	829b29b619	[MemoryLocation] strcat/strncat/strcpy read/write after their args. strcpy/strcat/strncat access memory starting from the passed in pointers. Construct memory locations for their args using getAfter. Discussed in D114872. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D114969	2021-12-03 08:48:23 +00:00
Kirill Bobyrev	bab7a30ab6	[clangd] IncludeCleaner: Do not require forward declarations of RecordDecls when definition is available This makes IncludeCleaner more useful in the presense of a large number of forward declarations. If the definition is already in the Translation Unit and visible to the Main File, forward declarations have no effect. The original patch D112707 was split in two: D114864 and this one. Reviewed By: kadircet Differential Revision: https://reviews.llvm.org/D114949	2021-12-03 09:36:50 +01:00
Dmitry Vyukov	4a5086dce3	tsan: disable munmap_invalid.cpp test on darwin It failed on bots: https://green.lab.llvm.org/green//job/clang-stage1-RA/25954/consoleFull#-1417328700a1ca8a51-895e-46c6-af87-ce24fa4cd561 and it doesn't provide the test output. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D114972	2021-12-03 09:03:45 +01:00
Matthias Springer	d30fcadf07	[mlir][linalg][bufferize] Op interface implementation for Bufferization dialect ops This change provides `BufferizableOpInterface` implementations for ops from the Bufferization dialects. These ops are needed at the bufferization boundaries for partial bufferization. Differential Revision: https://reviews.llvm.org/D114618	2021-12-03 16:25:44 +09:00
Jean Perier	1c16b0db9d	[flang] Return arrays in Transfer runtime with SIZE argument In TRANSFER runtime the result was an array only if the MOLD was an array. This is not in line with TRANSFER definition in 16.9.193 that rules that it must also be an array if MOLD is scalar and SIZE if provided. Differential Revision: https://reviews.llvm.org/D114943	2021-12-03 08:23:30 +01:00
Jessica Clarke	c1048e3eb9	[TableGen][SelectionDAG] Use ComplexPattern type for non-leaf nodes When used as a non-leaf node, TableGen does not currently use the type of a ComplexPattern for type inference, which also means it does not check it doesn't conflict with the use. This differs from when used as a leaf value, where the type is used for inference. This addresses that discrepancy. The test case is not representative of most real-world uses but is sufficient to demonstrate inference is working. Some of these uses also make use of ValueTypeByHwMode rather than SimpleValueType and so the existing type inference is extended to support that alongside the new type inference. There are also currently various cases of using ComplexPatterns with an untyped type, but only for non-leaf nodes. For compatibility this is permitted, and uses the old behaviour of not inferring for non-leaf nodes, but the existing logic is still used for leaf values. This remaining discrepancy should eventually be eliminated, either by removing all such uses of untyped so the special case goes away (I imagine Any, or a more specific type in certain cases, would be perfectly sufficient), or by copying it to the leaf value case so they're consistent with one another if this is something that does need to keep being supported. All non-experimental targets have been verified to produce bit-for-bit identical TableGen output with this change applied. Reviewed By: kparzysz Differential Revision: https://reviews.llvm.org/D109035	2021-12-03 07:04:59 +00:00
Jessica Clarke	a3530dc199	[AArch64][NFC] Alter ComplexPattern types to be consistent with their uses When used as a non-leaf node, TableGen does not currently use the type of a ComplexPattern for type inference, which also means it does not check it doesn't conflict with the use. This differs from when used as a leaf value, where the type is used for inference. Fixing that discrepancy is something I intend to upstream as a subsequent review. AArch64 currently has several ComplexPatterns that are used in contexts where they're expected to be an iPTR. The cases that lead to type contradictions are separated out in D108759, but there are additional differences to the TableGen output when using my locally-patched TableGen. None of these appear to matter, at least for passing all the CodeGen tests, but it's safer to avoid such changes (and similar changes were causing issues on some AMDGPU tests, causing failures to select). Changing these additional ComplexPatterns to use iPTR rather than i64 ensures that the TableGen output remains bit-for-bit identical (compared to without having this patch and my TableGen patch, as well as the intermediate state of having this patch but not my TableGen patch), and more accurately captures the higher-level meaning of these patterns. Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D109034	2021-12-03 07:04:59 +00:00
Jessica Clarke	3ee56eed2f	[AMDGPU][NFC] Alter ComplexPattern types to be consistent with their uses When used as a non-leaf node, TableGen does not currently use the type of a ComplexPattern for type inference, which also means it does not check it doesn't conflict with the use. This differs from when used as a leaf value, where the type is used for inference. Fixing that discrepancy is something I intend to upstream as a subsequent review. AMDGPU currently has several ComplexPatterns that are used in contexts where they're expected to be an iPTR, and where using an iPTR instead of a fixed-width integer type matters. With my locally-patched TableGen, none of these mismatches result in type contradictions, but do change the patterns and cause various failures to select. These changes to the ComplexPatterns' types reflect how they are actually used, result in bit-for-bit identical TableGen output (without my local TableGen patch), and ensure that with improved type inference AMDGPU's backend will continue to work. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D109032	2021-12-03 07:04:59 +00:00
Jessica Clarke	0cb44cfbb7	[AArch64][NFC] Fix ComplexPattern types conflicting with uses When used as a non-leaf node, TableGen does not currently use the type of a ComplexPattern for type inference, which also means it does not check it doesn't conflict with the use. This differs from when used as a leaf value, where the type is used for inference. Fixing that discrepancy is something I intend to upstream as a subsequent review, but these are all the type conflicts found (all legitimate) by my locally-patched TableGen. Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D108759	2021-12-03 07:04:59 +00:00

1 2 3 4 5 ...

406487 Commits All Branches Search

406487 Commits

All Branches