llvm-project

Commit Graph

Author	SHA1	Message	Date
Petr Hosek	de98da2ece	[CMake] Only include LTO on Apple targets We only need libLTO when using ld64. Differential Revision: https://reviews.llvm.org/D103916	2021-06-08 11:00:01 -07:00
Nick Desaulniers	3787ee4571	reland [IR] make -stack-alignment= into a module attr Relands commit `433c8d950c` with fixes for MIPS. Similar to D102742, specifying the stack alignment via CodegenOpts means that this flag gets dropped during LTO, unless the command line is re-specified as a plugin opt. Instead, encode this information as a module level attribute so that we don't have to expose this llvm internal flag when linking the Linux kernel with LTO. Looks like external dependencies might need a fix: * https://github.com/llvm-hs/llvm-hs/issues/345 * https://github.com/halide/Halide/issues/6079 Link: https://github.com/ClangBuiltLinux/linux/issues/1377 Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D103048	2021-06-08 10:59:46 -07:00
Daniel McIntosh	ee2a92c29d	[libcxx] Remove VLA from libcxx locale header The buffer size (`__nbuf`) in `num_put::do_put` is currently not an integral/core constant expression. As a result, `__nar` is a Variable Length Array (VLA). VLAs are a GNU extension and not part of the base C++ standard, so unless there is good reason to do so they probably shouldn't be used in any of the standard library headers. The call to `__iob.flags()` is the only thing keeping `__nbuf` from being a compile time constant, so the solution here is to simply err on the side of caution and always allocate a buffer large enough to fit the base prefix. Note that, while the base prefix for hex (`0x`) is slightly longer than the base prefix for octal (`0`), this isn't a concern. The difference in the space needed for the value portion of the string is enough to make up for this. (Unless we're working with small, oddly sized types such as a hypothetical `uint9_t`, the space needed for the value portion in octal is at least 1 more than the space needed for the value portion in hex). This PR also adds `constexpr` to `__nbuf` to enforce compile time const-ness going forward. Reviewed By: Mordante, #libc, Quuxplusone, ldionne Differential Revision: https://reviews.llvm.org/D103558	2021-06-08 13:59:34 -04:00
Louis Dionne	4d680b06c9	[libc++] Add a CI configuration for the modular build Differential Revision: https://reviews.llvm.org/D103559	2021-06-08 13:32:08 -04:00
Justin Bogner	4271e1d2c5	[GlobalISel] Handle non-multiples of the base type in narrowScalarAddSub When narrowing G_ADD and G_SUB, handle types that aren't a multiple of the type we're narrowing to. This allows us to handle types like s96 on 64 bit targets. Note that the test here has a couple of dead instructions because of the way the setup legalizes. I wasn't able to come up with a way to write this test that avoids that easily. Differential Revision: https://reviews.llvm.org/D97811	2021-06-08 10:13:38 -07:00
Justin Bogner	2a7e759734	[GlobalISel] Handle non-multiples of the base type in narrowScalarInsert When narrowing G_INSERT, handle types that aren't a multiple of the type we're narrowing to. This comes up if we're narrowing something like an s96 to fit in 64 bit registers and also for non-byte multiple packed types if they come up. This implementation handles these cases by extending the extra bits to the narrow size and truncating the result back to the destination size. Differential Revision: https://reviews.llvm.org/D97791	2021-06-08 10:13:38 -07:00
Mehdi Amini	30bb5dcb0a	Add missing header <atomic> in lib/IR/Verifier.cpp (NFC) Fix the build on some platform.	2021-06-08 17:08:55 +00:00
Petr Hosek	1dba2a0269	[Fuchsia] Update some of the Fuchsia toolchain flags This should make the build more self-contained. Differential Revision: https://reviews.llvm.org/D103875	2021-06-08 10:04:19 -07:00
Mehdi Amini	a4e2cf712a	Revert "[llvm] Make Sequence reverse-iterable" This reverts commit `e772216e70` (and fixup `7f6c878a2c`). The build is broken with gcc5 host compiler: In file included from from mlir/lib/Dialect/Utils/StructuredOpsUtils.cpp:9: tools/mlir/include/mlir/IR/BuiltinAttributes.h.inc:424:57: error: type/value mismatch at argument 1 in template parameter list for 'template<class ItTy, class FuncTy, class FuncReturnTy> class llvm::mapped_iterator' std::function<T(ptrdiff_t)>>; ^ tools/mlir/include/mlir/IR/BuiltinAttributes.h.inc:424:57: note: expected a type, got 'decltype (seq<ptrdiff_t>(0, 0))::const_iterator'	2021-06-08 17:03:10 +00:00
Simon Pilgrim	52396577a2	Use llvm_unreachable for unsupported integer types. As suggested on rG937c4cffd024, use llvm_unreachable for unhandled integer types (which shouldn't be possible) instead of breaking and dropping down to the existing fatal error handler. Helps silence static analyzer warnings.	2021-06-08 17:59:05 +01:00
Simon Pilgrim	01b77159e3	PPCISelLowering.cpp - don't dereference a dyn_cast<>. dyn_cast<> can return nullptr which we would then dereference - use cast<> which will assert that the type is correct.	2021-06-08 17:59:05 +01:00
Simon Pilgrim	114e712c34	InstrEmitter.cpp - don't dereference a dyn_cast<>. dyn_cast<> can return nullptr which we would then dereference - use cast<> which will assert that the type is correct.	2021-06-08 17:59:04 +01:00
Brendon Cahoon	ea10a86984	[AMDGPU] Add gfx1013 target Differential Revision: https://reviews.llvm.org/D103663	2021-06-08 12:49:49 -04:00
Chris Lattner	92a79dbe91	[Core] Add Twine support for StringAttr and Identifier. NFC. This is both more efficient and more ergonomic than going through an std::string, e.g. when using llvm::utostr and in string concat cases. Unfortunately we can't just overload ::get(). This causes an ambiguity because both twine and stringref implicitly convert from std::string. Differential Revision: https://reviews.llvm.org/D103754	2021-06-08 09:47:07 -07:00
Chris Lattner	08664d005c	[Verifier] Speed up and parallelize dominance checking. NFC One of the key algorithms used in the "mlir::verify(op)" method is the dominance checker, which ensures that operand values properly dominate the operations that use them. The MLIR dominance implementation has a number of algorithmic problems, and is not really set up in general to answer dense queries: it's constant factors are really slow with multiple map lookups and scans, even in the easy cases. Furthermore, when calling mlir::verify(module) or some other high level operation, it makes sense to parallelize the dominator verification of all the functions within the module. This patch has a few changes to enact this: 1) It splits dominance checking into "IsolatedFromAbove" units. Instead of building a monolithic DominanceInfo for everything in a module, for example, it checks dominance for the module to all the functions within it (noop, since there are no operands at this level) then each function gets their own DominanceInfo for each of their scope. 2) It adds the ability for mlir::DominanceInfo (and post dom) to be constrained to an IsolatedFromAbove region. There is no reason to recurse into IsolatedFromAbove regions since use/def relationships can't span this region anyway. This is already checked by the time the verifier gets here. 3) It avoids querying DominanceInfo for trivial checks (e.g. intra Block references) to eliminate constant factor issues). 4) It switches to lazily constructing DominanceInfo because the trivial check case handles the vast majority of the cases and avoids constructing DominanceInfo entirely in some cases (e.g. at the module level or for many Regions's that contain a single Block). 5) It parallelizes analysis of collections IsolatedFromAbove operations, e.g. each of the functions within a Module. All together this is more than a 10% speedup on `firtool` in circt on a large design when run in -verify-each mode (our default) since the verifier is invoked after each pass. Still todo is to parallelize the main verifier pass. I decided to split this out to its own thing since this patch is already large-ish. Differential Revision: https://reviews.llvm.org/D103373	2021-06-08 09:47:07 -07:00
Craig Topper	8b4c80d380	Further improve register allocation for vwadd(u).wv, vwsub(u).wv, vfwadd.wv, and vfwsub.wv. The first source has the same EEW as the destination, but we're using earlyclobber which prevents them from ever being the same register. This patch attempts to work around this. -For unmasked .wv, add a special TIED pseudo that pretends like the first operand and the destination must be the same register. This disables the earlyclobber for that source. Mark the instruction as convertible to 3 address form which will switch it to the original untied pseudo when the TwoAddressInstructionPass decides that keeping them tied would require an extra copy. This uses code in RISCVInstrInfo.cpp to do the conversion to the untied opcode. The untie test case show that we can generate the untied version. Not sure it was profitable to do it in this case, but they have really simple IR. Reviewed By: arcbbb Differential Revision: https://reviews.llvm.org/D103552	2021-06-08 09:43:43 -07:00
Jake Vossen	71fb98e0c1	Fix a typo in the internals manual	2021-06-08 12:32:56 -04:00
Craig Topper	c57bce9cc5	[RISCV] Remove ForceTailAgnostic flag from vmv.s.x, vfmv.s.f and reductions. In 0.9 these were defined to leave elements other than 0 in the destination unmodified. They were changed to use the tail policy in 0.10. I missed that update. I assume no one has noticed because in order cores treat tail agnostic the same as tail undisturbed. I believe Spike and QEMU do the same. Reviewed By: arcbbb, frasercrmck Differential Revision: https://reviews.llvm.org/D103736	2021-06-08 09:22:40 -07:00
Fangrui Song	928a197d26	[ELF] Add a GRP_COMDAT test with a local signature symbol See https://groups.google.com/g/generic-abi/c/2X6mR-s2zoc Test that a local signature symbol does not suppress COMDAT deduplication.	2021-06-08 09:22:30 -07:00
Kiran Chandramohan	cd73af9231	[MLIR] Remove LLVM_AnyInteger type constraint LLVM Dialect uses builtin-integer types. The existing LLVM_AnyInteger type constraint is a dupe of AnyInteger. This patch removes LLVM_AnyInteger and replaces all usage with AnyInteger. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D103839	2021-06-08 17:21:00 +01:00
Louis Dionne	d2eccf9bb7	[libc++] NFC: Add regression tests for some <tuple> PRs that have been fixed	2021-06-08 12:17:10 -04:00
Hans Wennborg	172fcd9600	[clang-cl] Parse the /external: flags (PR36003) They are still unsupported, but at least this makes clang-cl not mistake them for being filenames. As pointed out in the bug, VS 16.10 now uses these flags in new projects by default.	2021-06-08 18:00:03 +02:00
Nick Desaulniers	a596b54d47	Revert "[IR] make -stack-alignment= into a module attr" This reverts commit `433c8d950c`. Breaks the MIPS build.	2021-06-08 08:55:50 -07:00
William S. Moses	965ad79ea7	[MLIR][MemRef] Only allow fold of cast for the pointer operand, not the value Currently canonicalizations of a store and a cast try to fold all casts into the store. In the case where the operand being stored is itself a cast, this is illegal as the type of the value being stored will change. This PR fixes this by not checking the value for folding with a cast. Depends on https://reviews.llvm.org/D103828 Differential Revision: https://reviews.llvm.org/D103829	2021-06-08 11:43:09 -04:00
David Blaikie	49454ebc56	.clang-tidy: Disable misc-no-recursion in general/across the monorepo	2021-06-08 08:31:33 -07:00
Nick Desaulniers	433c8d950c	[IR] make -stack-alignment= into a module attr Similar to D102742, specifying the stack alignment via CodegenOpts means that this flag gets dropped during LTO, unless the command line is re-specified as a plugin opt. Instead, encode this information as a module level attribute so that we don't have to expose this llvm internal flag when linking the Linux kernel with LTO. Looks like external dependencies might need a fix: * https://github.com/llvm-hs/llvm-hs/issues/345 * https://github.com/halide/Halide/issues/6079 Link: https://github.com/ClangBuiltLinux/linux/issues/1377 Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D103048	2021-06-08 08:31:04 -07:00
David Blaikie	c5d56fec50	NFC: .clang-tidy: Inherit configs from parents to improve maintainability In the interests of disabling misc-no-recursion across LLVM (this seems like a stylistic choice that is not consistent with LLVM's style/development approach) this NFC preliminary change adjusts all the .clang-tidy files to inherit from their parents as much as possible. This change specifically preserves all the quirks of the current configs in order to make it easier to review as NFC. I validatad the change is NFC as follows: for X in `cat ../files.txt`; do mkdir -p ../tmp/$(dirname $X) touch $(dirname $X)/blaikie.cpp clang-tidy -dump-config $(dirname $X)/blaikie.cpp > ../tmp/$(dirname $X)/after rm $(dirname $X)/blaikie.cpp done (similarly for the "before" state, without this patch applied) for X in `cat ../files.txt`; do echo $X diff \ ../tmp/$(dirname $X)/before \ <(cat ../tmp/$(dirname $X)/after \ \| sed -e "s/,readability-identifier-naming$.$,-readability-identifier-naming/\1/" \ \| sed -e "s/,-llvm-include-order$.$,llvm-include-order/\1/" \ \| sed -e "s/,-misc-no-recursion$.$,misc-no-recursion/\1/" \ \| sed -e "s/,-clang-diagnostic-\$.$,clang-diagnostic-\/\1/") done (using sed to strip some add/remove pairs to reduce the diff and make it easier to read) The resulting report is: .clang-tidy clang/.clang-tidy 2c2 < Checks: 'clang-diagnostic-,clang-analyzer-,-,clang-diagnostic-,llvm-,misc-,-misc-unused-parameters,-misc-non-private-member-variables-in-classes,-readability-identifier-naming,-misc-no-recursion' --- > Checks: 'clang-diagnostic-,clang-analyzer-,-,clang-diagnostic-,llvm-,misc-,-misc-unused-parameters,-misc-non-private-member-variables-in-classes,-misc-no-recursion' compiler-rt/.clang-tidy 2c2 < Checks: 'clang-diagnostic-,clang-analyzer-,-,clang-diagnostic-,llvm-,-llvm-header-guard,misc-,-misc-unused-parameters,-misc-non-private-member-variables-in-classes' --- > Checks: 'clang-diagnostic-,clang-analyzer-,-,clang-diagnostic-,llvm-,misc-,-misc-unused-parameters,-misc-non-private-member-variables-in-classes,-llvm-header-guard' flang/.clang-tidy 2c2 < Checks: 'clang-diagnostic-,clang-analyzer-,-,llvm-,-llvm-include-order,misc-,-misc-no-recursion,-misc-unused-parameters,-misc-non-private-member-variables-in-classes' --- > Checks: 'clang-diagnostic-,clang-analyzer-,-,llvm-,misc-,-misc-unused-parameters,-misc-non-private-member-variables-in-classes,-llvm-include-order,-misc-no-recursion' flang/include/flang/Lower/.clang-tidy flang/include/flang/Optimizer/.clang-tidy flang/lib/Lower/.clang-tidy flang/lib/Optimizer/.clang-tidy lld/.clang-tidy lldb/.clang-tidy llvm/tools/split-file/.clang-tidy mlir/.clang-tidy The `clang/.clang-tidy` change is a no-op, disabling an option that was never enabled. The compiler-rt and flang changes are no-op reorderings of the same flags. (side note, the .clang-tidy file in parallel-libs is broken and crashes clang-tidy because it uses "lowerCase" as the style instead of "lower_case" - so I'll deal with that separately) Differential Revision: https://reviews.llvm.org/D103842	2021-06-08 08:25:59 -07:00
Louis Dionne	491d045957	[libc++] Remove the old HTML documentation This commit finishes moving the <atomic> design documents to the RST documentation and removes the old documentation. https://libcxx.llvm.org is already pointing to the new documentation only now, so the removal of the old documentation is really a NFC. I went over the old documentation and I don't think we're leaving anything important behind - I think everything important was mentionned in the RST documentation anyway.	2021-06-08 11:18:12 -04:00
patacca	f60ea691a9	Revert "[Polly][Isl] Removing nullptr constructor from C++ bindings. NFC." This reverts commit `be5e2fc7bf`. This introduced a building error for polly. https://lab.llvm.org/buildbot#builders/10/builds/4951	2021-06-08 17:12:10 +02:00
Simon Pilgrim	61a2d6bfe4	[DAG] foldShuffleOfConcatUndefs - ensure shuffles of upper (undef) subvector elements is undef (PR50609) shuffle(concat(x,undef),concat(y,undef)) -> concat(shuffle(x,y),shuffle(x,y)) If the original shuffle references any of the upper (undef) subvector elements, ensure the split shuffle masks uses undef instead of an out-of-bounds value. Fixes PR50609	2021-06-08 15:49:41 +01:00
Whitney Tsang	dee1f0cb34	[LoopNest] Fix Wdeprecated-copy warnings error: definition of implicit copy constructor for 'LoopNest' is deprecated because it has a user-declared copy assignment operator [-Werror,-Wdeprecated-copy] LoopNest &operator=(const LoopNest &) = delete; Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D103752	2021-06-08 14:48:13 +00:00
Yaxun (Sam) Liu	054cc3b1b4	[CUDA][HIP] Fix store of vtbl in ctor vtbl itself is in default global address space. When clang emits ctor, it gets a pointer to the vtbl field based on the this pointer, then stores vtbl to the pointer. Since this pointer can point to any address space (e.g. an object created in stack), this pointer points to default address space, therefore the pointer to vtbl field in this object should also be in default address space. Currently, clang incorrectly casts the pointer to vtbl field in this object to global address space. This caused assertions in backend. This patch fixes that by removing the incorrect addr space cast. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D103835	2021-06-08 10:24:44 -04:00
Guillaume Chatelet	7f6c878a2c	Fix missing header and namespace qualifier in ADT Sequence	2021-06-08 14:11:54 +00:00
patacca	be5e2fc7bf	[Polly][Isl] Removing nullptr constructor from C++ bindings. NFC. [Polly][Isl] Removing nullptr constructor from C++ bindings. NFC. This is part of an effort to reduce the differences between the custom C++ bindings used right now by polly in `lib/External/isl/include/isl/isl-noxceptions.h` and the official isl C++ interface. Changes made: - Removed `std::nullptr_t` constructor from all the classes in the isl C++ bindings. - `isl-noexceptions.h` has been generated by this `a7e00bea38` Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D103751	2021-06-08 15:28:20 +02:00
Guillaume Chatelet	e772216e70	[llvm] Make Sequence reverse-iterable This patch simplifies the implementation of Sequence and makes it compatible with llvm::reverse. It exposes the reverse iterators through rbegin/rend which prevents a dangling reference in std::reverse_iterator::operator++(). Differential Revision: https://reviews.llvm.org/D102679	2021-06-08 13:18:57 +00:00
Hans Wennborg	386b66b2fc	Revert "3rd Reapply "[DebugInfo] Use variadic debug values to salvage BinOps and GEP instrs with non-const operands"" > This reapplies `c0f3dfb9`, which was reverted following the discovery of > crashes on linux kernel and chromium builds - these issues have since > been fixed, allowing this patch to re-land. This reverts commit `36ec97f76a`. The change caused non-determinism in the compiler, see comments on the code review at https://reviews.llvm.org/D91722. Reverting to unbreak people's builds until that can be addressed. This also reverts the follow-up "[DebugInfo] Limit the number of values that may be referenced by a dbg.value" in `a0bd6105d8`.	2021-06-08 14:54:08 +02:00
Fabian Schuiki	41eb2cec47	[MLIR] Mark additional builtin attr methods const * Mark the following methods const: * `ArrayAttr::getAsRange` * `ArrayAttr::getAsValueRange` * `DictionaryAttr::getAs` * Make `DictionarAttr::getAs` generic over the name class, such that `Identifier` and `StringRef` arguments get forwarded to the underlying call to `get`. (Made generic to avoid introducing a dependency on `include/mlir/IR/Identifier.h` as per the diff discussion.) Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D103822	2021-06-08 14:45:19 +02:00
Simon Moll	0f9d299122	[VP] getDeclarationForParams `VPIntrinsic::getDeclarationForParams` creates a vp intrinsic declaration for parameters you want to call it with. This is in preparation of a new builder class that makes emitting vp intrinsic code nearly as convenient as using a plain ir builder (aka `VectorBuilder`, to be used by D99750). Reviewed By: frasercrmck, craig.topper, vkmr Differential Revision: https://reviews.llvm.org/D102686	2021-06-08 14:21:28 +02:00
Timm Bäder	22875b2ce3	[NFC] Remove some include cycles These files include themselves directly.	2021-06-08 14:00:39 +02:00
Simon Moll	2b626aba44	[VE][NFC] IRBuilder<> -> IRBuilderBase VE's TTI broke with the switch from IRBuilder<> to IRBuilderBase. Following that change to compile again.	2021-06-08 13:55:49 +02:00
Nathan Sidwell	012898b92c	[clang] p1099 using enum part 1 This adds support for p1099's 'using SCOPED_ENUM::MEMBER;' functionality, bringing a member of an enumerator into the current scope. The novel feature here, is that there need not be a class hierarchical relationship between the current scope and the scope of the SCOPED_ENUM. That's a new thing, the closest equivalent is a typedef or alias declaration. But this means that Sema::CheckUsingDeclQualifier needs adjustment. (a) one can't call it until one knows the set of decls that are being referenced -- if exactly one is an enumerator, we're in the new territory. Thus it needs calling later in some cases. Also (b) there are two ways we hold the set of such decls. During parsing (or instantiating a dependent scope) we have a lookup result, and during instantiation we have a set of shadow decls. Thus two optional arguments, at most one of which should be non-null. Differential Revision: https://reviews.llvm.org/D100276	2021-06-08 04:40:42 -07:00
maekawatoshiki	09e92c607c	[LoopUnrollAndJam] Change LoopUnrollAndJamPass to LoopNest pass This patch changes LoopUnrollAndJamPass from FunctionPass to LoopNest pass. The next patch will utilize LoopNest to effectively handle loop nests. Also, a crash problem on legacy pass manager is fixed. Reviewed By: Whitney Differential Revision: https://reviews.llvm.org/D99149	2021-06-08 20:30:02 +09:00
Vignesh Balasubramanian	f61602b0d3	[OpenMP][OMPD] Implementation of OMPD debugging library - libompd. This is the first of seven patches that implements OMPD, a debugging interface to support debugging of OpenMP programs. It contains support code required in "openmp/runtime" for OMPD implementation. Reviewed By: @hbae Differential Revision: https://reviews.llvm.org/D100181	2021-06-08 16:44:22 +05:30
Kerry McLaughlin	5db52751a5	[CostModel] Return an invalid cost for memory ops with unsupported types Fixes getTypeConversion to return `TypeScalarizeScalableVector` when a scalable vector type cannot be legalized by widening/splitting. When this is the method of legalization found, getTypeLegalizationCost will return an Invalid cost. The getMemoryOpCost, getMaskedMemoryOpCost & getGatherScatterOpCost functions already call getTypeLegalizationCost and will now also return an Invalid cost for unsupported types. Reviewed By: sdesmalen, david-arm Differential Revision: https://reviews.llvm.org/D102515	2021-06-08 12:07:36 +01:00
Sven van Haastregt	d54e7b731e	[OpenCL] Add memory_scope_all_devices Add the `memory_scope_all_devices` enum value, which is restricted to OpenCL 3.0 or newer and the `__opencl_c_atomic_scope_all_devices` feature. Also guard `memory_scope_all_svm_devices` accordingly, which is already available in OpenCL 2.0. The `__opencl_c_atomic_scope_all_devices` feature is header-only, so set its define to 1 in `opencl-c-base.h`. This is done unconditionally at the moment, as the mechanism for disabling header-only options hasn't been decided yet. This patch only adds a negative test for now. Ideally adding a CL3.0 run line to atomic-ops.cl should suffice as a positive test, but we cannot do that yet until (at least) generic address spaces and program scope variables are supported in OpenCL 3.0 mode. Differential Revision: https://reviews.llvm.org/D103241	2021-06-08 11:51:12 +01:00
Fraser Cormack	ccd1e087f3	[RISCV] Add a test case showing inefficient vector codegen	2021-06-08 11:07:12 +01:00
Caroline Concatto	6fd1604d14	[InstCombine] Add instcombine fold for extractelement + splat for scalable vectors This patch allows that scalable vector can also use the fold that already exists for fixed vector, only when the lane index is lower than the minimum number of elements of the vector. Differential Revision: https://reviews.llvm.org/D102404	2021-06-08 10:43:38 +01:00
Simon Pilgrim	f96b5e801d	OptBisect.cpp - remove unused include. NFCI. StringRef.h is included in OptBisect.h and we have no uses of std::string.	2021-06-08 10:41:03 +01:00
Simon Pilgrim	49d3a367c0	[CostModel][X86] Improve AVX1/AVX2 truncation costs Based off the worse case numbers generated by D103695, we were overestimating the cost of a number of vector truncations: AVX2: v2i32->v2i8, v2i64->v2i16 + v4i64->v4i32 AVX1: v2i32->v2i8, v4i64->v4i16 + v16i16->v16i8 Once we have a working set of conversion costs, the intention is to cleanup the tables and use legalized types a lot more to reduce the number of entries we currently have.	2021-06-08 10:41:03 +01:00
Simon Pilgrim	596004a947	MemCpyOptimizer.cpp - hasUndefContentsMSSA - Pass DataLayout by reference. NFCI.	2021-06-08 10:41:02 +01:00

1 2 3 4 5 ...

390522 Commits All Branches Search

390522 Commits

All Branches