llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	6f45fe9851	[RISCV] Use MxListW instead of MxList[0-5]. NFC Better to use the named list instead of assuming the size of MxList.	2021-12-31 00:22:55 -08:00
Craig Topper	8811a87e8c	[RISCV] Use defvar to simplify some code. NFC Rather than wrapping a def around a list, we can just make a defvar of the list.	2021-12-30 23:48:39 -08:00
wangpc	41454ab256	[RISCV] Use constant pool for large integers For large integers (for example, magic numbers generated by TargetLowering::BuildSDIV when dividing by constant), we may need about 4~8 instructions to build them. In the same time, it just takes two instructions to load constants (with extra cycles to access memory), so it may be profitable to put these integers into constant pool. Reviewed By: asb, craig.topper Differential Revision: https://reviews.llvm.org/D114950	2021-12-31 14:48:48 +08:00
jacquesguan	05f82dc877	[RISCV] Fix incorrect cases of vmv.s.f in the VSETVLI insert pass. Fix incorrect cases of vmv.s.f and add test cases for it. Differential Revision: https://reviews.llvm.org/D116432	2021-12-31 14:17:03 +08:00
Stella Laurenzo	5cd0b817e2	[mlir] Allow IntegerAttr to parse zero width integers. https://reviews.llvm.org/D109555 added support to APInt for this, so the special case to disable it is no longer valid. It is in fact legal to construct these programmatically today, and they print properly but do not parse. Justification: zero bit integers arise naturally in various bit reduction optimization problems, and having them defined for MLIR reduces special casing. I think there is a solid case for i0 and ui0 being supported. I'm less convinced about si0 and opted to just allow the parser to round-trip values that already verify. The counter argument is that the proper singular value for an si0 is -1. But the counter to this counter is that the sign bit is N-1, which does not exist for si0 and it is not unreasonable to consider this non-existent bit to be 0. Various sources consider it having the singular value "0" to be the least surprising. Reviewed By: lattner Differential Revision: https://reviews.llvm.org/D116413	2021-12-30 20:33:00 -08:00
Alexandre Ganea	7cd109b92c	[asan] Additionnal prologue decoding for WinSDK 10.0.22000 Fixes interception of atoi() entry point.	2021-12-30 20:11:45 -05:00
Sam McCall	09f8315bba	[Sema] a[x] has type T when a has type T* or T[], even when T is dependent This more precise type is useful for tools, e.g. fixes https://github.com/clangd/clangd/issues/831 Differential Revision: https://reviews.llvm.org/D107275	2021-12-31 01:30:39 +01:00
Fangrui Song	ed67d5a03a	[ELF] Switch cNamedSections to SmallVector. NFC Make it smaller	2021-12-30 16:08:26 -08:00
Craig Topper	7d659c6ac7	[LegalizeIntegerTypes] Rename NewLHS/NewRHS arguments to DAGTypeLegalizer::PromoteSetCCOperands. NFC The 'New' only makes sense in the context of these being output arguments, but they are also used as inputs first. Drop the 'New' and just call them LHS/RHS. Factored out of D116421.	2021-12-30 15:31:43 -08:00
Ellis Hoag	a699b2f1c0	[InstrProf] Mark counters as used in debug correlation mode In debug info correlation mode we do not emit the data globals so we need to explicitly mark the counter globals as used so they don't get stripped. Reviewed By: kyulee Differential Revision: https://reviews.llvm.org/D115981	2021-12-30 14:50:45 -08:00
MaheshRavishankar	59442a5460	[mlir][Linalg] Change signature of `get(Parallel/Reduce/Window)Dims` method. These method currently takes a SmallVector<AffineExpr> & as an argument to return the dims as AffineExpr. This creation of AffineExpr objects is unnecessary. Differential Revision: https://reviews.llvm.org/D116422	2021-12-30 14:02:15 -08:00
Fangrui Song	441de75f69	[lld][docs] Update _templates/indexsidebar.html after Bugzilla->GitHub issue migration	2021-12-30 13:34:45 -08:00
Alexey Bataev	e0efedd2c3	[SLP][NFC]Fix non-determinism in reordering, NFC. Need to clear CurrentOrder order mask if it is determined that extractelements form identity order and need to use a vector-like construct when iterating over ordered entries in the reorderTopToBottom function.	2021-12-30 13:10:25 -08:00
Krzysztof Parzyszek	db83e3e507	[Hexagon] Generate HVX/FP arithmetic instructions Co-authored-by: Anirudh Sundar Subramaniam <quic_sanirudh@quicinc.com> Co-authored-by: Sumanth Gundapaneni <sgundapa@quicinc.com> Co-authored-by: Joshua Herrera <joshherr@quicinc.com>	2021-12-30 12:47:30 -08:00
Louis Dionne	ee8e81b40e	[libc++][NFC] Fix incorrect synopsis in transform_view test	2021-12-30 15:43:27 -05:00
Mogball	4943cda398	[mlir][arith] fixing dependencies on memref/arith	2021-12-30 20:39:22 +00:00
Krzysztof Parzyszek	9e6afbedb0	[Hexagon] Generate HVX/FP compare instructions Co-authored-by: Anirudh Sundar Subramaniam <quic_sanirudh@quicinc.com>	2021-12-30 12:17:22 -08:00
Fangrui Song	dabac5feec	[ELF][LTO] Cache symbol table of lazy BitcodeFile Similar to D62188: a BitcodeFile's symbol table may be iterated twice, once in --start-lib (lazy) state, and once in the non-lazy state. This patch makes `parseLazy` save `symbols[i]` so that the non-lazy state does not need to re-insert to the global symbol table. Avoiding a redundant `saver.save` may save memory. `Maximum resident set size (kbytes)` for a large --thinlto-index-only link: * without the patch: 10164000 * with the patch: 10095716 (0.6% decrease) Note: we can remove `saver.save` if `BitcodeCompiler::add` does not transfer the ownership of `f.obj` in `checkError(ltoObj->add(std::move(f.obj), resols));`. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D116390	2021-12-30 12:03:29 -08:00
Craig Topper	15787ccd45	[RISCV] Add support for STRICT_LRINT/LLRINT/LROUND/LLROUND. Tests for other strict intrinsics. This patch adds isel support for STRICT_LRINT/LLRINT/LROUND/LLROUND. It also adds test cases for f32 and f64 constrained intrinsics that correspond to the intrinsics in float-intrinsics.ll and double-intrinsics.ll. Support for promoting the integer argument of STRICT_FPOWI was added. I've skipped adding tests for f16 intrinsics, since we don't have libcalls for them and we have inconsistent support for promoting them in LegalizeDAG. This will need to be examined more closely. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D116323	2021-12-30 11:54:32 -08:00
Fangrui Song	95c25fd52a	[Bazel] Make mlir:MemRefOpsTdFiles depend on :ArithmeticOpsTdFiles	2021-12-30 11:47:54 -08:00
long.chen	d295dd10f2	[MLIR] Add explicit `using` to disambiguate between multiple implementations from base classes (NFC) Both of DenseElementsAttr and ElementsAttrTrait define the method of getElementType, this commit makes it available on DenseIntOrFPElementsAttr and DenseStringElementsAttr. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D116389	2021-12-30 19:47:33 +00:00
Benjamin Kramer	4683ce2cd8	[InferAttrs] Give strnlen the same attributes as strlen This moves the only string function out of the big list of math funcs. And let's us CSE strnlen calls.	2021-12-30 20:43:43 +01:00
Fangrui Song	a96fe1bf3b	[ELF][LTO] Call madvise(MADV_DONTNEED) on MemoryBuffer instances @tejohnson noticed that freeing MemoryBuffer instances right before `lto->compile` can save RSS, likely because the memory can be reused by LTO indexing (e.g. ThinLTO import/export lists).). For ELFFileBase instances, symbol and section names are backed by MemoryBuffer, so destroying MemoryBuffer would make some infrequent passes (parseSymbolVersion, reportBackrefs) crash and make debugging difficult. For a BitcodeFile, its content is completely unused, but destroying its MemoryBuffer makes the buffer identifier inaccessible and may introduce constraints for future changes. This patch leverages madvise(MADV_DONTNEED) which achieves the major gain without the latent issues. `Maximum resident set size (kbytes): ` for a large --thinlto-index-only link: * current behavior: 10146104KiB * destroy MemoryBuffer instances: 8555240KiB * madvise(MADV_DONTNEED) just bitcodeFiles and lazyBitcodeFiles: 8737372KiB * madvise(MADV_DONTNEED) all MemoryBuffers: 8739796KiB (16% decrease) Depends on D116366 Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D116367	2021-12-30 11:36:58 -08:00
William S. Moses	a6a583dae4	[MLIR] Move AtomicRMW into MemRef dialect and enum into Arith Per the discussion in https://reviews.llvm.org/D116345 it makes sense to move AtomicRMWOp out of the standard dialect. This was accentuated by the need to add a fold op with a memref::cast. The only dialect that would permit this is the memref dialect (keeping it in the standard dialect or moving it to the arithmetic dialect would require those dialects to have a dependency on the memref dialect, which breaks linking). As the AtomicRMWKind enum is used throughout, this has been moved to Arith. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D116392	2021-12-30 14:31:33 -05:00
Jack Andersen	9d37d0ea34	[Support] Expand `<CFGDIR>` as the base directory in configuration files. Extends response file expansion to recognize `<CFGDIR>` and expand to the current file's directory. This makes it much easier to author clang config files rooted in portable, potentially not-installed SDK directories. A typical use case may be something like the following: ``` # sample_sdk.cfg --target=sample -isystem <CFGDIR>/include -L <CFGDIR>/lib -T <CFGDIR>/ldscripts/link.ld ``` Reviewed By: sepavloff Differential Revision: https://reviews.llvm.org/D115604	2021-12-30 13:43:47 -05:00
Fangrui Song	890e8c8f7e	[Support] Add MemoryBuffer::dontNeedIfMmap On *NIX systems, this API calls madvise(MADV_DONTNEED) on read-only file mappings. It should not be used on a writable buffer. The API is used to implement ld.lld LTO memory saving trick (D116367). Note: on read-only file mappings, Linux's MADV_DONTNEED semantics match POSIX POSIX_MADV_DONTNEED and BSD systems' MADV_DONTNEED. On Windows, VirtualAllocEx MEM_COMMIT/MEM_RESET have similar semantics but are unfortunately not drop-in replacements. dontNeedIfMmap is currently a no-op. Reviewed By: aganea Differential Revision: https://reviews.llvm.org/D116366	2021-12-30 10:42:28 -08:00
Fangrui Song	25ff448aac	[docs][llvm-profdata] Prefer double-dash long options To match the `--help` message and most other utilities. While here, change `option:: -output=output` to `option:: --output=<output>` and omit the value name for the short options (convention of other utilities). Reviewed By: snehasish Differential Revision: https://reviews.llvm.org/D116353	2021-12-30 10:37:17 -08:00
Krzysztof Parzyszek	e107374e40	[Hexagon] Explicitly use integer types when rescaling a mask	2021-12-30 10:14:00 -08:00
Krzysztof Parzyszek	eb574259b6	[Hexagon] Handle HVX/FP {masked,wide} loads/stores Co-authored-by: Rahul Utkoor <quic_rutkoor@quicinc.com> Co-authored-by: Anirudh Sundar Subramaniam <quic_sanirudh@quicinc.com>	2021-12-30 10:14:00 -08:00
Luís Ferreira	8792cd75d0	Revert "[lld] Add support for other demanglers other than Itanium" This reverts commit `e60d6dfd5a`. clang-ppc64le-rhel buildbot failed (https://lab.llvm.org/buildbot#builders/57/builds/13424): tools/lld/MachO/CMakeFiles/lldMachO.dir/Symbols.cpp.o: In function `lld::demangle(llvm::StringRef, bool)': Symbols.cpp:(.text._ZN3lld8demangleEN4llvm9StringRefEb[_ZN3lld8demangleEN4llvm9StringRefEb]+0x90): undefined reference to `llvm::demangle(std::string const&)'	2021-12-30 18:04:21 +00:00
Krzysztof Parzyszek	cd997689f2	[Hexagon] Fix isTypeForHVX to recognize floating point types Co-authored-by: Sumanth Gundapaneni <sgundapa@quicinc.com>	2021-12-30 10:01:05 -08:00
Jacques Pienaar	4a8cef157b	[mlir] Change SCF/Complex to prefixed (NFC) See https://llvm.discourse.group/t/psa-ods-generated-accessors-will-change-to-have-a-get-prefix-update-you-apis/4476	2021-12-30 09:57:51 -08:00
Luís Ferreira	e60d6dfd5a	[lld] Add support for other demanglers other than Itanium LLVM core library supports demangling other mangled symbols other than itanium, such as D and Rust. LLD should use those demanglers in order to output pretty demangled symbols on error messages. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D116279	2021-12-30 17:52:38 +00:00
Sanjay Patel	0c6979b2d6	[InstCombine] fold opposite shifts around an add ((X << C) + Y) >>u C --> (X + (Y >>u C)) & (-1 >>u C) https://alive2.llvm.org/ce/z/DY9DPg This replaces a shift with an 'and', and in the case where the add has a constant operand, it eliminates both shifts. As noted in the TODO comment, we already have this fold when the shifts are in the opposite order (and that code handles bitwise logic ops too). Fixes #52851	2021-12-30 12:01:06 -05:00
Sanjay Patel	fd9cd3408b	Revert "[InstCombine] fold opposite shifts around an add" This reverts commit `2e3e0a5c28`. Some unintended diffs snuck into this patch.	2021-12-30 11:54:55 -05:00
Sanjay Patel	2e3e0a5c28	[InstCombine] fold opposite shifts around an add ((X << C) + Y) >>u C --> (X + (Y >>u C)) & (-1 >>u C) https://alive2.llvm.org/ce/z/DY9DPg This replaces a shift with an 'and', and in the case where the add has a constant operand, it eliminates both shifts. As noted in the TODO comment, we already have this fold when the shifts are in the opposite order (and that code handles bitwise logic ops too). Fixes #52851	2021-12-30 11:52:29 -05:00
Krzysztof Parzyszek	23423638cc	[Hexagon] Handle HVX/FP shuffles, insertion and extraction Co-authored-by: Anirudh Sundar Subramaniam <quic_sanirudh@quicinc.com>	2021-12-30 08:44:10 -08:00
Krzysztof Parzyszek	95c7dd8810	Revert "[Hexagon] Don't build two halves of HVX vector in parallel" This reverts commit `ba07f300c6`. A build-vector sequence is made of pairs: rotate+insert. When constructing a single vector, this results in a chain of 2*N instructions. The rotate operation is a permute operation, but the insert uses a multiplication resource: insert and rotate can execute in the same cycle, but obviously they cannot operate on the same vector. The original halving idea is still beneficial since it does allow for insert/rotate overlap, and for hiding insert's latency.	2021-12-30 07:57:11 -08:00
Nuno Lopes	7128bb61fb	[NFC] Pre-commit NewGVN tests for wrong phi(undef, X) optimization	2021-12-30 15:45:20 +00:00
Nicolas Vasilache	2e69f4f012	[mlir][vector] Fix illegal vector.transfer + tensor.insert/extract_slice folding vector.transfer operations do not have rank-reducing semantics. Bail on illegal rank-reduction: we need to check that the rank-reduced dims are exactly the leading dims. I.e. the following is illegal: ``` %0 = vector.transfer_write %v, %t[0,0], %cst : vector<2x4xf32>, tensor<2x4xf32> %1 = tensor.insert_slice %0 into %tt[0,0,0][2,1,4][1,1,1] : tensor<2x4xf32> into tensor<2x1x4xf32> ``` Cannot fold into: ``` %0 = vector.transfer_write %v, %t[0,0,0], %cst : vector<2x4xf32>, tensor<2x1x4xf32> ``` For this, check the trailing `vectorRank` dims of the insert_slice result tensor match the trailing dims of the inferred result tensor. Differential Revision: https://reviews.llvm.org/D116409	2021-12-30 14:55:16 +00:00
Nuno Lopes	84b285d6eb	[GVN] Set phi entries of unreachable predecessors to poison instead of undef This matches NewGVN's behavior.	2021-12-30 14:47:24 +00:00
Pavel Labath	9b8f9d33db	[lldb/qemu] More flexible emulator specification This small patch adds two useful improvements: - allows one to specify the emulator path as a bare filename, and have it be looked up in the PATH - allows one to leave the path empty and have the filename be derived from the architecture.	2021-12-30 15:14:41 +01:00
Nuno Lopes	e5e844b37e	[NFC] Pre-commit test for InstSimplify phi(poison)	2021-12-30 12:37:20 +00:00
Sjoerd Meijer	86825fc2fb	[LoopFlatten] Move it to a LoopPassManager In D109958 it was noticed that we could optimise the pipeline and avoid rerunning LoopSimplify/LCSSA for LoopFlatten by moving it to a LoopPassManager. Differential Revision: https://reviews.llvm.org/D110057	2021-12-30 12:32:14 +00:00
Nuno Lopes	72ea6fbc15	[NewGVN][NFC] Add test for x + poison -> poison	2021-12-30 12:08:07 +00:00
Nuno Lopes	64af9f61c3	[InstSimplify] add 'x + poison -> poison' (needed for NewGVN)	2021-12-30 11:52:42 +00:00
Pavel Labath	d7dbe2c4a0	[lldb] Remove lldbtest.getBuildFlags It was being used only in some very old tests (which pass even without it) and its implementation is highly questionable. These days we have different mechanisms for requesting a build with a particular kind of c++ library (USE_LIB(STD)CPP in the makefile).	2021-12-30 12:19:24 +01:00
Roman Lebedev	62b1682570	[Opaqueptrs][IR Serialization] Improve inlineasm [de]serialization The bitcode reader expected that the pointers are typed, so that it can extract the function type for the assembly so `bitc::CST_CODE_INLINEASM` did not explicitly store said function type. I'm not really sure how the upgrade path will look for existing bitcode, but i think we can easily support opaque pointers going forward, by simply storing the function type. Reviewed By: #opaque-pointers, nikic Differential Revision: https://reviews.llvm.org/D116341	2021-12-30 13:54:37 +03:00
Roman Lebedev	a5337d6a1c	[BitcodeReader] `bitc::CST_CODE_INLINEASM`: un-hardcode offsets	2021-12-30 13:50:02 +03:00
jacquesguan	128c6ed73b	[RISCV] Teach VSETVLInsert to eliminate redundant vsetvli for vmv.s.x and vfmv.s.f. Differential Revision: https://reviews.llvm.org/D116307	2021-12-30 17:16:18 +08:00

1 2 3 4 5 ...

408588 Commits All Branches Search

408588 Commits

All Branches