llvm-project

Commit Graph

Author	SHA1	Message	Date
Christopher Di Bella	d87f159ab6	[libcxx][NFC] removes `swap`'s dependency on `swap_ranges` Under the as-if rule, we can directly implement the array overload for `std::swap`. By removing this circular dependency where `swap` is implemented in terms of `swap_ranges` and `swap_ranges` is defined in terms of `swap`, we can split them into their own headers. This will: * limit the surface area in which Hyrum's law can bite us; * force users to include the correct headers; * make finding the definitions trivial (`swap` is a utility; `swap_ranges` is an algorithm). Differential Revision: https://reviews.llvm.org/D104760	2021-06-24 17:57:29 +00:00
Mehdi Amini	652f4b5140	Attempt to disable MLIR JIT tests on PowerPC to unbreak the bot This is until we figure how to turn on the large code size model.	2021-06-24 17:48:46 +00:00
zoecarver	9824f86760	[libcxx][nfc] Add one more test case for contiguous_range. If the `data` member function is different enough, `ranges::data` won't pick it, so the range remains a contiguous_range.	2021-06-24 10:45:25 -07:00
zoecarver	3450398738	[libcxx][ranges] Add contiguous_range. Differential Revision: https://reviews.llvm.org/D104262	2021-06-24 10:40:05 -07:00
Roman Lebedev	507df686af	[NFC][SimplifyCFG] Revisit tail-merge-resume.ll test Add an already somewhat-common resume block	2021-06-24 20:31:49 +03:00
Pablo Barrio	571c8c5263	[AArch64][v8.3A] Avoid inserting implicit landing pads (PACISP) PACISP have the advantage that they are in HINT space, meaning they can be run successfully in hardware without PAuth support - they will just behave as a NOP. However, PACISP are also implicit landing pads (think of an extra BTI jc). Therefore, they allow indirect jumps of all kinds into them, potentially inserting new gadgets. This patch replaces PACISP by PACI* LR, SP when compiling explicitly for hardware with full PAuth support. PACI* is not in the HINT space, therefore it will fault when run in hardware without PAuth support, but it is also not a landing pad, making programs safer in newer HW. Differential Revision: https://reviews.llvm.org/D101920	2021-06-24 18:24:32 +01:00
Sanjay Patel	50db987d59	[InstSimplify] move extract with undef index fold; NFC This puts it closer to the other undef query check and will avoid a potential ordering problem if we allow folding non-constant-int indexes.	2021-06-24 13:22:10 -04:00
Anna Thomas	e9a3637c0c	Precommit tests for context senstive attribute dropping Precommit tests from D104641. The patch will fix the callsites by dropping the context-sensitive attributes. Reviewed-By: Self	2021-06-24 13:18:16 -04:00
William S. Moses	44985872b8	[MLIR][SCF] Inline single block ExecuteRegionOp This commit adds a canonicalization pass which inlines any single block execute region Differential Revision: https://reviews.llvm.org/D104865	2021-06-24 13:15:26 -04:00
Sanjay Patel	3ba090e5f6	[InstSimplify][test] add test for extract of splat; NFC This is shown in: https://llvm.org/PR50817	2021-06-24 13:13:35 -04:00
Sanjay Patel	e13c62a103	[InstSimplify][test] move tests that don't require InstCombine; NFC These are existing/missing simplifications, so the tests don't need the full power of InstCombine.	2021-06-24 13:13:34 -04:00
Craig Topper	03f9e04bc3	[TargetLowering][ARM] Don't alter opaque constants in TargetLowering::ShrinkDemandedConstant. We don't constant fold based on demanded bits elsewhere in SimplifyDemandedBits, so I don't think we should shrink them either. The affected ARM test changes because a constant become non-opaque and eventually enabled some constant folding. This no longer happens. I checked and InstCombine is able to simplify this test. I'm not sure exactly what it was trying to test. Reviewed By: lebedev.ri, dmgreen Differential Revision: https://reviews.llvm.org/D104832	2021-06-24 10:09:36 -07:00
Petr Hosek	aac4de989e	[CMake] Don't LTO optimize targets on Darwin either This is a follow up to D102732 which also expands the logic to Darwin. Differential Revision: https://reviews.llvm.org/D104764	2021-06-24 10:02:03 -07:00
Anirudh Prasad	631362665c	[AsmParser][SystemZ][z/OS] Support for emitting labels in upper case - Currently, the emitting of labels in the parsePrimaryExpr function is case independent. It just takes the identifier and emits it. - However, for HLASM the emitting of labels is case independent. We are emitting them in the upper case only, to enforce case independency. So we need to ensure that at the time of parsing the label we are emitting the upper case (in `parseAsHLASMLabel`), but also, when we are processing a PC-relative relocatable expression, we need to ensure we emit it in upper case (in `parsePrimaryExpr`) - To achieve this a new MCAsmInfo attribute has been introduced which corresponding targets can override if needed. Reviewed By: abhina.sreeskantharajan, uweigand Differential Revision: https://reviews.llvm.org/D104715	2021-06-24 12:50:11 -04:00
Geoffrey Martin-Noble	1ca4cf9b24	Update Bazel build for `929189a499` Updates Bazel build files to match https://github.com/llvm/llvm-project/commit/929189a499 Differential Revision: https://reviews.llvm.org/D104864	2021-06-24 09:43:47 -07:00
David Spickett	31f9960c38	[lldb][AArch64] Add "memory tag read" command This new command looks much like "memory read" and mirrors its basic behaviour. (lldb) memory tag read new_buf_ptr new_buf_ptr+32 Logical tag: 0x9 Allocation tags: [0x900fffff7ffa000, 0x900fffff7ffa010): 0x9 [0x900fffff7ffa010, 0x900fffff7ffa020): 0x0 Important proprties: * The end address is optional and defaults to reading 1 tag if ommitted * It is an error to try to read tags if the architecture or process doesn't support it, or if the range asked for is not tagged. * It is an error to read an inverted range (end < begin) (logical tags are removed for this check so you can pass tagged addresses here) * The range will be expanded to fit the tagging granule, so you can get more tags than simply (end-begin)/granule size. Whatever you get back will always cover the original range. Reviewed By: omjavaid Differential Revision: https://reviews.llvm.org/D97285	2021-06-24 17:35:45 +01:00
Alex Zinenko	10b8eb482c	[mlir] remove repeated use of TypeToLLVM.cpp in cmake targets	2021-06-24 18:34:49 +02:00
David Spickett	5d34362001	[lldb][AArch64] Add MTE memory tag reading to lldb This adds GDB client support for the qMemTags packet which reads memory tags. Following the design which was recently committed to GDB. https://sourceware.org/gdb/current/onlinedocs/gdb/General-Query-Packets.html#General-Query-Packets (look for qMemTags) lldb commands will use the new Process methods GetMemoryTagManager and ReadMemoryTags. The former takes a range and checks that: * The current process architecture has an architecture plugin * That plugin provides a MemoryTagManager * That the range of memory requested lies in a tagged range (it will expand it to granules for you) If all that was true you get a MemoryTagManager you can give to ReadMemoryTags. This two step process is done to allow commands to get the tag manager without having to read tags as well. For example you might just want to remove a logical tag, or error early if a range with tagged addresses is inverted. Note that getting a MemoryTagManager doesn't mean that the process or a specific memory range is tagged. Those are seperate checks. Having a tag manager just means this architecture could have a tagging feature enabled. An architecture plugin has been added for AArch64 which will return a MemoryTagManagerAArch64MTE, which was added in a previous patch. Reviewed By: omjavaid Differential Revision: https://reviews.llvm.org/D95602	2021-06-24 17:17:10 +01:00
Alexander Yermolovich	a224c5199b	[LLD][LLVM] CG Graph profile using relocations Currently when .llvm.call-graph-profile is created by llvm it explicitly encodes the symbol indices. This section is basically a black box for post processing tools. For example, if we run strip -s on the object files the symbol table changes, but indices in that section do not. In non-visible behavior indices point to wrong symbols. The visible behavior indices point outside of Symbol table: "invalid symbol index". This patch changes the format by using R_*_NONE relocations to indicate the from/to symbols. The Frequency (Weight) will still be in the .llvm.call-graph-profile, but symbol information will be in relocation section. In LLD information from both sections is used to reconstruct call graph profile. Relocations themselves will never be applied. With this approach post processing tools that handle relocations correctly work for this section also. Tools can add/remove symbols and as long as they handle relocation sections with this approach information stays correct. Doing a quick experiment with clang-13. The size went up from 107KB to 322KB, aggregate of all the input sections. Size of clang-13 binary is ~118MB. For users of -fprofile-use/-fprofile-sample-use the size of object files will go up slightly, it will not impact final binary size. Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D104080	2021-06-24 09:09:33 -07:00
William S. Moses	929189a499	[MLIR][LLVM] Expose type translator from LLVM to MLIR Type This commit moves the type translator from LLVM to MLIR to a public header for use by external projects or other code. Unlike a previous attempt (https://reviews.llvm.org/D104726), this patch moves the type conversion into separate files which remedies the linker error which was only caught by CI. Differential Revision: https://reviews.llvm.org/D104834	2021-06-24 12:06:34 -04:00
David Spickett	da2e614f56	[lldb][AArch64] Add memory tag reading to lldb-server This adds memory tag reading using the new "qMemTags" packet and ptrace on AArch64 Linux. This new packet is following the one used by GDB. (https://sourceware.org/gdb/current/onlinedocs/gdb/General-Query-Packets.html) On AArch64 Linux we use ptrace's PEEKMTETAGS to read tags and we assume that lldb has already checked that the memory region actually has tagging enabled. We do not assume that lldb has expanded the requested range to granules and expand it again to be sure. (although lldb will be sending aligned ranges because it happens to need them client side anyway) Also we don't assume untagged addresses. So for AArch64 we'll remove the top byte before using them. (the top byte includes MTE and other non address data) To do the ptrace read NativeProcessLinux will ask the native register context for a memory tag manager based on the type in the packet. This also gives you the ptrace numbers you need. (it's called a register context but it also has non register data, so it saves adding another per platform sub class) The only supported platform for this is AArch64 Linux and the only supported tag type is MTE allocation tags. Anything else will error. Ptrace can return a partial result but for lldb-server we will be treating that as an error. To succeed we need to get all the tags we expect. (Note that the protocol leaves room for logical tags to be read via qMemTags but this is not going to be implemented for lldb at this time.) Reviewed By: omjavaid Differential Revision: https://reviews.llvm.org/D95601	2021-06-24 17:02:55 +01:00
Florian Hahn	f6ba845da3	[VPlan] Fix indentation of check lines in sinking test (NFC).	2021-06-24 16:39:16 +01:00
Nico Weber	b1061e36d9	[gn build] Fix a comment typo and a comment copy-pasto	2021-06-24 11:06:48 -04:00
Nicolas Vasilache	57fe7fd37d	[mlir][Linalg] Add support for scf::ForOp in comprehensive bufferization (7/n) scf::ForOp bufferization analysis proceeds just like for any other op (including FuncOp) at its boundaries; i.e. if: 1. The tensor operand is inplaceable. 2. The matching result has no subsequent read (i.e. all reads dominate the scf::ForOp). 3. In and does not create a RAW interference. then it can bufferize inplace. Still there are a few differences: 1. bbArgs for an scf::ForOp are always considered inplaceable when seen from ops inside the body. This is because a) either the matching tensor operand is not inplaceable and an alloc will be inserted (which makes bbArg itself inplaceable); or b) the tensor operand and bbArg are both already inplaceable. 2. Bufferization within the scf::ForOp body has implications to the outside world : the scf.yield terminator may well ping-pong values of the same type. This muddies the water for alias analysis and is not supported atm. Such cases result in a pass failure. Differential revision: https://reviews.llvm.org/D104490	2021-06-24 15:03:28 +00:00
Sjoerd Meijer	c74aea4663	[AArch64] Precommit extending load tests for D104782. NFC.	2021-06-24 15:59:53 +01:00
David Spickett	cc05418d98	[lldb][AArch64] Fix unpack tags test case Use %zu to print size_t vars.	2021-06-24 15:53:23 +01:00
Saurabh Jha	cd256c8bcc	Add documentation for compound assignment and type conversion of matrix types	2021-06-24 15:50:58 +01:00
David Spickett	8d58fbd09e	[lldb][AArch64] Add memory-tagging qSupported feature This feature "memory-tagging+" indicates that lldb-server supports memory tagging packets. (added in a later patch) We check HWCAP2_MTE to decide whether to enable this feature for Linux. Reviewed By: omjavaid Differential Revision: https://reviews.llvm.org/D97282	2021-06-24 15:43:20 +01:00
Nico Weber	d57a5879ab	[gn build] Remove an unneeded -I flag Everything includes clang/Config/config.h by qualified "clang/Config/config.h" path, so there's no need for `-Igen/clang/include/clang/Config/clang/include`. No behavior change.	2021-06-24 10:18:49 -04:00
Tobias Gysi	78dc1e4978	[mlir][linalg][python] Add shape-only tensor support to OpDSL. Add an index_dim annotation to specify the shape to loop mapping of shape-only tensors. A shape-only tensor serves is not accessed withing the body of the operation but is required to span the iteration space of certain operations such as pooling. Differential Revision: https://reviews.llvm.org/D104767	2021-06-24 14:11:15 +00:00
David Spickett	e0f2744a11	[lldb][AArch64] Add class for managing memory tags This adds the MemoryTagManager class and a specialisation of that class for AArch64 MTE tags. It provides a generic interface for various tagging operations. Adding/removing tags, diffing tagged pointers, etc. Later patches will use this manager to handle memory tags in generic code in both lldb and lldb-server. Since it will be used in both, the base class header is in lldb/Target. (MemoryRegionInfo is another example of this pattern) Reviewed By: omjavaid Differential Revision: https://reviews.llvm.org/D97281	2021-06-24 15:10:01 +01:00
Brendon Cahoon	927b809783	[GlobalISel] Describe undefined values for G_SBFX/G_UBFX operands Differential Revision: https://reviews.llvm.org/D104245	2021-06-24 09:31:41 -04:00
Florian Hahn	833bdbe93c	[LV] Support sinking recipe in replicate region after another region. This patch handles sinking a replicate region after another replicate region. In that case, we can connect the sink region after the target region. This properly handles the case for which an assertion has been added in `337d765282`. Fixes https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=34842. Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D103514	2021-06-24 13:58:42 +01:00
Tobias Gysi	25bb616490	[mlir][linalg][python] Add attribute support to the YAML codegen. Extend the yaml code generation to support the index attributes that https://reviews.llvm.org/D104711 added to the OpDSL. Differential Revision: https://reviews.llvm.org/D104712	2021-06-24 12:33:48 +00:00
Stephen Tozer	adace79652	[DebugInfo] Enable variadic debug value salvaging This patch enables the salvaging of debug values that may be calculated from more than one SSA value, such as with binary operators that do not use a constant argument. The actual functionality for this behaviour is added in a previous commit (`c7270567`), but with the ability to actually emit the resulting debug values switched off. The reason for this is that the prior patch has been reverted several times due to issues discovered downstream, some time after the actual landing of the patch. The patch in question is rather large and touches several widely used header files, and all issues discovered are more related to the handling of variadic debug values as a whole rather than the details of the patch itself. Therefore, to minimize the build time impact and risk of conflicts involved in any potential future revert/reapply of that patch, this significantly smaller patch (that touches no header files) will instead be used as the capstone to enable variadic debug value salvaging. The review linked to this patch is mostly implemented by the previous commit, `c7270567`, but also contains the changes in this patch. Differential Revision: https://reviews.llvm.org/D91722	2021-06-24 13:16:29 +01:00
David Green	1113e06821	[ARM] Extend narrow values to allow using truncating scatters As a minor adjustment to the existing lowering of offset scatters, this extends any smaller-than-legal vectors into full vectors using a zext, so that the truncating scatters can be used. Due to the way MVE legalizes the vectors this should be cheap in most situations, and will prevent the vector from being scalarized. Differential Revision: https://reviews.llvm.org/D103704	2021-06-24 13:09:11 +01:00
Roman Lebedev	9f5f917787	[NFC][SimplifyCFG] Add basic test for tail-merging `resume` function terminators	2021-06-24 15:08:55 +03:00
Jay Foad	beebe5a056	[MCA] Allow unlimited cycles in the timeline view Change --max-timeline-cycles=0 to mean no limit on the number of cycles. Use this in AMDGPU tests to show all instructions in the timeline view instead of having it arbitrarily truncated. Differential Revision: https://reviews.llvm.org/D104846	2021-06-24 12:54:57 +01:00
Florian Hahn	a54c6fc083	[X86] Exclude invalid element types for bitcast/broadcast folding. It looks like the fold introduced in `63f3383ece` can cause crashes if the type of the bitcasted value is not a valid vector element type, like x86_mmx. To resolve the crash, reject invalid vector element types. The way it is done in the patch is a bit clunky. Perhaps there's a better way to check? Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D104792	2021-06-24 12:39:01 +01:00
Florian Hahn	121ecb05e7	[SCEV] Generalize MatchBinaryAddToConst to support non-add expressions. This patch generalizes MatchBinaryAddToConst to support matching (A + C1), (A + C2), instead of just matching (A + C1), A. The existing cases can be handled by treating non-add expressions A as A + 0. Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D104634	2021-06-24 12:16:15 +01:00
Rosie Sumpter	0c4651f0a8	[CostModel][AArch64] Improve cost model for vector reduction intrinsics OR, XOR and AND entries are added to the cost table. An extra cost is added when vector splitting occurs. This is done to address the issue of a missed SLP vectorization opportunity due to unreasonably high costs being attributed to the vector Or reduction (see: https://bugs.llvm.org/show_bug.cgi?id=44593). Differential Revision: https://reviews.llvm.org/D104538	2021-06-24 12:02:58 +01:00
Nicolas Vasilache	e3ea2d7061	[mlir][Linalg] Add basic lowering test to library calls This test shows how convert-linalg-to-std rewrites named linalg ops as library calls. This can be coupled with a C++ shim to connect to existing libraries such as https://gist.github.com/nicolasvasilache/691ef992404c49dc9b5d543c4aa6db38. Everything can then be linked together with mlir-cpu-runner and MLIR can call C++ (which can itself call MLIR if needed). This should evolve into specific rewrite patterns that can be applied on op instances independently rather than having to use a full conversion. Differential Revision: https://reviews.llvm.org/D104842	2021-06-24 10:56:48 +00:00
Muhammad Omair Javaid	c5028f3473	[Clang] XFAIL sanitize-coverage-old-pm.c on 32bit Armv8l sanitize-coverage-old-pm.c started failing on arm 32 bit where underlying architecture reported is armv8l fore 32bit arm. This patch XFAILS sanitize-coverage-old-pm.c on armv8l similar to armv7 and thumbv7.	2021-06-24 15:48:13 +05:00
Simon Pilgrim	c4d3eedc7f	[X86] Fold nested select_cc to select (cmp*ge/le Cond0, Cond1), LHS, Y) select (cmpeq Cond0, Cond1), LHS, (select (cmpugt Cond0, Cond1), LHS, Y) --> (select (cmpuge Cond0, Cond1), LHS, Y) etc, We already perform this fold in DAGCombiner for MVT::i1 comparison results, but these can still appear after legalization (in x86 case with MVT::i8 results), where we need to be more careful about generating new comparison codes. Pulled out of D101074 to help address the remaining regressions. Differential Revision: https://reviews.llvm.org/D104707	2021-06-24 11:27:57 +01:00
Sander de Smalen	d5e14ba88c	[GlobalISel] NFC: Change LLT::vector to take ElementCount. This also adds new interfaces for the fixed- and scalable case: * LLT::fixed_vector * LLT::scalable_vector The strategy for migrating to the new interfaces was as follows: * If the new LLT is a (modified) clone of another LLT, taking the same number of elements, then use LLT::vector(OtherTy.getElementCount()) or if the number of elements is halfed/doubled, it uses .divideCoefficientBy(2) or operator. That is because there is no reason to specifically restrict the types to 'fixed_vector'. If the algorithm works on the number of elements (as unsigned), then just use fixed_vector. This will need to be fixed up in the future when modifying the algorithm to also work for scalable vectors, and will need then need additional tests to confirm the behaviour works the same for scalable vectors. * If the test used the '/Scalable=/true` flag of LLT::vector, then this is replaced by LLT::scalable_vector. Reviewed By: aemerson Differential Revision: https://reviews.llvm.org/D104451	2021-06-24 11:26:12 +01:00
Roman Lebedev	9c4c2f2472	[SimplifyCFG] Tail-merging all blocks with `ret` terminator Based ontop of D104598, which is a NFCI-ish refactoring. Here, a restriction, that only empty blocks can be merged, is lifted. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D104597	2021-06-24 13:15:39 +03:00
Roman Lebedev	cba4b104a9	[NFC][AArch64] Un-autogenerate swifterror.ll tests It appears the change needed in D104597 is minimal and obvious, so let's not make them so verbose.	2021-06-24 13:11:26 +03:00
Tobias Gysi	31f888ea9a	[mlir][linalg][python] Add attribute support to the OpDSL. Extend the OpDSL with index attributes. After tensors and scalars, index attributes are the third operand type. An index attribute represents a compile-time constant that is limited to index expressions. A use cases are the strides and dilations defined by convolution and pooling operations. The patch only updates the OpDSL. The C++ yaml codegen is updated by a followup patch. Differential Revision: https://reviews.llvm.org/D104711	2021-06-24 09:40:32 +00:00
Denys Petrov	e76c008c90	[analyzer] Added a test case for PR46264 Summary: It's not able to reproduce the issue (https://bugs.llvm.org/show_bug.cgi?id=46264) for the latest sources. Add a reported test case to try to catch the problem if occur es. Differential Revision: https://reviews.llvm.org/D104381 Prevent: https://bugs.llvm.org/show_bug.cgi?id=46264	2021-06-24 12:24:26 +03:00
Fraser Cormack	a4729f7f88	[RISCV] Lower RVV vector SELECTs to VSELECTs This patch optimizes the code generation of vector-type SELECTs (LLVM select instructions with scalar conditions) by custom-lowering to VSELECTs (LLVM select instructions with vector conditions) by splatting the condition to a vector. This avoids the default expansion path which would either introduce control flow or fully scalarize. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D104772	2021-06-24 10:12:51 +01:00

1 2 3 4 5 ...

391974 Commits All Branches Search

391974 Commits

All Branches