llvm-project

Commit Graph

Author	SHA1	Message	Date
Sam McCall	6fe20a44fd	[clangd] Fix yet-another gratuitous llvm::Error crash	2020-05-03 22:13:58 +02:00
Shilei Tian	cb038927ef	[OpenMP] Fix an issue of wrong return type of DeviceRTLTy::getNumOfDevices Summary: There is a typo in DeviceRTLTy::getNumOfDevices that the type of its return value is bool. It will lead to a problem of wrong device number returned from omp_get_num_devices. Reviewers: jdoerfert Reviewed By: jdoerfert Subscribers: yaxunl, guansong, openmp-commits Tags: #openmp Differential Revision: https://reviews.llvm.org/D79255	2020-05-03 15:59:06 -04:00
Kadir Cetinkaya	81e48ae2b4	[clangd] Reland LSP latency test	2020-05-03 21:06:57 +02:00
Sergey Dmitriev	0f70f73308	[Attributor] Bitcast constant to the returned value type if it has different type Reviewers: jdoerfert, sstefan1, uenoku Reviewed By: jdoerfert Subscribers: hiraditya, uenoku, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79277	2020-05-03 11:46:13 -07:00
Nikita Popov	46ee652c70	Revert "[InstSimplify] Remove known bits constant folding" This reverts commit `08556afc54`. This breaks some AMDGPU tests.	2020-05-03 20:45:10 +02:00
Nikita Popov	08556afc54	[InstSimplify] Remove known bits constant folding If SimplifyInstruction() does not succeed in simplifying the instruction, it will compute the known bits of the instruction in the hope that all bits are known and the instruction can be folded to a constant. I have removed a similar optimization from InstCombine in D75801, and would like to drop this one as well. On average, we spend ~1% of total compile-time performing this known bits calculation. However, if we introduce some additional statistics for known bits computations and how many of them succeed in simplifying the instruction we get (on test-suite): instsimplify.NumKnownBits: 216 instsimplify.NumKnownBitsComputed: 13828375 valuetracking.NumKnownBitsComputed: 45860806 Out of ~14M known bits calculations (accounting for approximately one third of all known bits calculations), only 0.0015% succeed in producing a constant. Those cases where we do succeed to compute all known bits will get folded by other passes like InstCombine later. On test-suite, only lencod.test and GCC-C-execute-pr44858.test show a hash difference after this change. On lencod we see an improvement (a loop phi is optimized away), on the GCC torture test a regression (a function return value is determined only after IPSCCP, preventing propagation from a noinline function.) There are various regressions in InstSimplify tests. However, all of these cases are already handled by InstCombine, and corresponding tests have already been added there. Differential Revision: https://reviews.llvm.org/D79294	2020-05-03 20:26:58 +02:00
Casey Carter	7e3ef299cb	[libc++][test] Use a non-narrowing conversion in assign_pair.pass.cpp ...to avoid warnings, e.g., from MSVC.	2020-05-03 10:59:10 -07:00
Hongtao Yu	911e06f5eb	[ICP] Handling must tail calls in indirect call promotion Per the IR convention, a musttail call must precede a ret with an optional bitcast. This was violated by the indirect call promotion optimization which could result in IR like: ; <label>:2192: br i1 %2198, label %2199, label %2201, !dbg !226012, !prof !229483 ; <label>:2199: ; preds = %2192 musttail call fastcc void @foo(i8* %2195), !dbg !226012 br label %2202, !dbg !226012 ; <label>:2201: ; preds = %2192 musttail call fastcc void %2197(i8* %2195), !dbg !226012 br label %2202, !dbg !226012 ; <label>:2202: ; preds = %605, %2201, %2199 ret void, !dbg !229485 This is being fixed in this change where the return statement goes together with the promoted indirect call. The code generated is like: ; <label>:2192: br i1 %2198, label %2199, label %2201, !dbg !226012, !prof !229483 ; <label>:2199: ; preds = %2192 musttail call fastcc void @foo(i8* %2195), !dbg !226012 ret void, !dbg !229485 ; <label>:2201: ; preds = %2192 musttail call fastcc void %2197(i8* %2195), !dbg !226012 ret void, !dbg !229485 Differential Revision: https://reviews.llvm.org/D79258	2020-05-03 10:42:22 -07:00
Mircea Trofin	bec4ab95a4	[llvm][NFC] Inliner: factor cost and reporting out of inlining process Summary: This factors cost and reporting out of the inlining workflow, thus making it easier to reuse when driving inlining from the upcoming InliningAdvisor. Depends on: D79215 Reviewers: davidxl, echristo Subscribers: eraman, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79275	2020-05-03 10:38:28 -07:00
Florian Hahn	bbdfcf8f69	[VPlan] Remove unused & undefined print method (NFC).	2020-05-03 18:36:20 +01:00
Johannes Doerfert	8228153f87	[Attributor][NFC] Encode IRPositions in the bits of a single pointer This reduces memory consumption for IRPositions by eliminating the vtable pointer and the `KindOrArgNo` integer. Since each abstract attribute has an associated IRPosition, the 12-16 bytes we save add up quickly. No functional change is intended. --- Single run of the Attributor module and then CGSCC pass (oldPM) for SPASS/clause.c (~10k LLVM-IR loc): Before: ``` calls to allocation functions: 469545 (260135/s) temporary memory allocations: 77137 (42735/s) peak heap memory consumption: 30.50MB peak RSS (including heaptrack overhead): 119.50MB total memory leaked: 269.07KB ``` After: ``` calls to allocation functions: 468999 (274108/s) temporary memory allocations: 77002 (45004/s) peak heap memory consumption: 28.83MB peak RSS (including heaptrack overhead): 118.05MB total memory leaked: 269.07KB ``` Difference: ``` calls to allocation functions: -546 (5808/s) temporary memory allocations: -135 (1436/s) peak heap memory consumption: -1.67MB peak RSS (including heaptrack overhead): 0B total memory leaked: 0B ``` --- CTMark 15 runs Metric: compile_time Program lhs rhs diff test-suite...:: CTMark/sqlite3/sqlite3.test 25.07 24.09 -3.9% test-suite...Mark/mafft/pairlocalalign.test 14.58 14.14 -3.0% test-suite...-typeset/consumer-typeset.test 21.78 21.58 -0.9% test-suite :: CTMark/SPASS/SPASS.test 21.95 22.03 0.4% test-suite :: CTMark/lencod/lencod.test 25.43 25.50 0.3% test-suite...ark/tramp3d-v4/tramp3d-v4.test 23.88 23.83 -0.2% test-suite...TMark/7zip/7zip-benchmark.test 60.24 60.11 -0.2% test-suite :: CTMark/kimwitu++/kc.test 15.69 15.69 -0.0% test-suite...:: CTMark/ClamAV/clamscan.test 25.43 25.42 -0.0% test-suite :: CTMark/Bullet/bullet.test 37.63 37.62 -0.0% Geomean difference -0.8% --- Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D78722	2020-05-03 12:15:19 -05:00
Johannes Doerfert	6bf16ee4c5	[Attributor][NFC] Let AbstractAttribute be an IRPosition Since every AbstractAttribute so far, and for the foreseeable future, corresponds to a single IRPosition we can simplify the class structure. We already did this for IRAttribute but there is no reason to stop there.	2020-05-03 12:13:40 -05:00
Nico Weber	fb5fd74685	Revert "Optimize path::remove_dots" This reverts commit `53913a65b4`. Breaks VFSFromYAMLTest.DirectoryIterationSameDirMultipleEntries in SupportTests on non-Windows.	2020-05-03 12:46:46 -04:00
Simon Pilgrim	ff5094c03f	[X86] Add tests showing failure to fold mul(abs(x),abs(x)) -> mul(x,x) (PR39476)	2020-05-03 17:39:48 +01:00
Mircea Trofin	667f558c3f	[llvm][NFC] Inliner.cpp shouldInline post-commit feedback Discussion is in https://reviews.llvm.org/D79215	2020-05-03 09:31:31 -07:00
Kadir Cetinkaya	7016043d0d	[clangd] Change include to be relative to current directory	2020-05-03 18:09:50 +02:00
Reid Kleckner	53913a65b4	Optimize path::remove_dots LLD calls this on every source file string in every object file when writing PDBs, so it is somewhat hot. Avoid rewriting paths that do not contain path traversal components (./..). Use find_first_not_of(separators) directly instead of using the path iterators. The path component iterators appear to be slow, and directly searching for slashes makes it easier to find double separators that need to be canonicalized. I discovered that the VFS relies on remove_dots to not canonicalize early slashes (/foo or C:/foo) on Windows, so I had to leave that behavior behind with unit tests for it. This is undesirable, but I claim that my change is NFC.	2020-05-03 07:58:05 -07:00
Reid Kleckner	9b7f6146bd	[COFF] Partially inline Symbol::getName, NFC	2020-05-03 07:58:05 -07:00
Sanjay Patel	682f0b366b	[InstCombine] use select-of-constants with set/clear bit mask patterns Cond ? (X & ~C) : (X \| C) --> (X & ~C) \| (Cond ? 0 : C) Cond ? (X \| C) : (X & ~C) --> (X & ~C) \| (Cond ? C : 0) The select-of-constants form results in better codegen. There's an existing test diff that shows a transform that results in an extra IR instruction, but that's an existing problem. This is motivated by code seen in LLVM itself - see PR37581: https://bugs.llvm.org/show_bug.cgi?id=37581 define i8 @src(i8 %x, i8 %C, i1 %b) { %notC = xor i8 %C, -1 %and = and i8 %x, %notC %or = or i8 %x, %C %cond = select i1 %b, i8 %or, i8 %and ret i8 %cond } define i8 @tgt(i8 %x, i8 %C, i1 %b) { %notC = xor i8 %C, -1 %and = and i8 %x, %notC %mul = select i1 %b, i8 %C, i8 0 %or = or i8 %mul, %and ret i8 %or } http://volta.cs.utah.edu:8080/z/Vt2WVm Differential Revision: https://reviews.llvm.org/D78880	2020-05-03 09:44:43 -04:00
Kadir Cetinkaya	af28c74e8f	[clangd] Drop duplicate header	2020-05-03 15:20:20 +02:00
Benjamin Kramer	7a529ad2c1	[Support] Don't initialize buffer allocated by zlib::uncompress This is a somewhat annoying API, but not without precedent in this low level API.	2020-05-03 15:01:52 +02:00
Simon Pilgrim	7c203163c7	[X86] Use splitVector helper in truncateVectorWithPACK/splitVectorStore/combineHorizontalMinMaxResult/combineReductionToHorizontal. NFC. All these locations were performing the same type splitting/extractSubVector calls as the splitVector helper.	2020-05-03 13:40:38 +01:00
LLVM GN Syncbot	f914b500df	[gn build] Port `e64f99c51a`	2020-05-03 12:08:26 +00:00
Nico Weber	c5392e2eaf	[gn build] (manually) port `ad97ccf6b2` more, for include added in `e64f99c51a`	2020-05-03 08:07:52 -04:00
Simon Pilgrim	e8d9794a23	[X86] Don't limit splitVector helper to simple types. It can handle EVT just as well (and so can the extractSubVector calls).	2020-05-03 12:27:37 +01:00
Alexey Lapshin	4f576ea731	[Debuginfo][NFC] Avoid double calling of DWARFDie::find(DW_AT_name). Summary: Current implementation of DWARFDie::getName(DINameKind Kind) could lead to double call to DWARFDie::find(DW_AT_name) in following scenario: getName(LinkageName); getName(ShortName); getName(LinkageName) calls find(DW_AT_name) if linkage name is not found. Then, it is called again in getName(ShortName). This patch allows to request LinkageName and ShortName separately to avoid extra call to find(DW_AT_name). It helps D74169 to parse clang debuginfo faster (~1%). Reviewers: clayborg, dblaikie Differential Revision: https://reviews.llvm.org/D79173	2020-05-03 14:00:25 +03:00
Nikita Popov	7c649b58f0	[InstCombine] Duplicate some InstSimplify tests (NFC) Duplicate some tests in preparation for D79294.	2020-05-03 12:49:36 +02:00
Simon Pilgrim	74e9952c8e	[X86][SSE] splitAndLowerShuffle - use splitVector helper. NFC. The splitVector helper uses extractSubVector which splits build vectors like we do here, so avoid reimplementing it. splitVector could easily be extended to peek through bitcasts as well but I'd prefer to keep this commit NFC.	2020-05-03 11:26:51 +01:00
Simon Pilgrim	4d2b0ebd17	[X86] detectAVGPattern - use matchUnaryPredicate helper. NFC. Use the ISD::matchUnaryPredicate helper to check for inrange constants.	2020-05-03 11:26:51 +01:00
Nikita Popov	7cf0f8568c	[ValueTracking] Convert test to unit test (NFC) Test this directly, rather than going through InstSimplify.	2020-05-03 12:23:57 +02:00
Kadir Cetinkaya	6c24b59ca1	[clangd] Fix name hiding in TestTracer and disable racy test for now	2020-05-03 11:51:23 +02:00
Kadir Cetinkaya	e64f99c51a	[clangd] Metric tracking through Tracer Summary: Introduces an endpoint to Tracer for tracking metrics on internal events. Reviewers: sammccall Subscribers: ilya-biryukov, javed.absar, MaskRay, jkorous, arphaman, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D78429	2020-05-03 10:50:32 +02:00
Ten Tzen	21c1a0c730	Test Commit: add two head comments in WinEHPrepare.cpp This is a Test commit.	2020-05-03 01:15:59 -07:00
Reid Kleckner	1e5793345b	Re-land "[PDB] Avoid calling discoverTypeIndices for a known record kind" Fixed bad usage of slice API causing assertion failures. Reverts `810c8e9b49` Reinstates `bd7ea8641e`	2020-05-02 18:39:33 -07:00
Reid Kleckner	5070cecd72	[PDB] Bypass generic deserialization code for publics sorting The number of public symbols is very large, and each deserialization does a few heap allocations. The public symbols are serialized by the linker, so we can assume they have the expected layout and use it directly. Saves O(#publics) temporary heap allocations and shrinks some data structures.	2020-05-02 18:14:50 -07:00
Nico Weber	810c8e9b49	Revert "[PDB] Avoid calling discoverTypeIndices for a known record kind" This reverts commit `bd7ea8641e`. Breaks check-lld everywhere.	2020-05-02 21:06:06 -04:00
Craig Topper	cd75b74073	[X86] Fix a few issues in the evex-to-vex-compress.mir test. Don't use $noreg for instructions that take register inputs. Only allow $noreg for parts of memory operands. Don't use index register with $rip base. Use RETQ instead of the RET pseudo. This pass is after the ExpandPseudo pass that converts RET to RETQ.	2020-05-02 18:02:12 -07:00
Craig Topper	7867f4c15f	[PDB] Remove a couple asserts that are no longer valid now that C13Builders does not use unique_ptr. These asserts used to check that unique_ptr was not null. This fixes failures from `7af4bb1641`	2020-05-02 17:31:10 -07:00
Reid Kleckner	7af4bb1641	[PDB] Remove unique_ptr wrapper around C13 line table subsections This accounts for a large portion of the memory allocations in LLD. This DebugSubsectionRecordBuilder object can be stored directly in C13Builders, it mostly wraps other subsections. Remove the container kind field from the object. It is always the same for all elements in the vector, and we can pass it in during writing.	2020-05-02 16:35:07 -07:00
Reid Kleckner	bd7ea8641e	[PDB] Avoid calling discoverTypeIndices for a known record kind This particular overload allocates memory, and we do this for every S_[GL]PROC32_ID record. Instead, hardcode the offset of the typeindex that we are looking for in the LF_[MEM]FUNC_ID record. We already assumed that looking up the item index already found a record of this kind.	2020-05-02 15:51:08 -07:00
Thomas Preud'homme	0b85ea8533	[docs][FileCheck] Fix invalid example Summary: FileCheck documentation contains an example of a numeric variable defined and used on the same line. This is not currently supported by FileCheck so this commit fixes the example to use CHECK-SAME for the variable use. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D79253	2020-05-02 23:31:18 +01:00
LemonBoy	6d103ca855	[SelectionDAG] Unify scalarizeVectorLoad and VectorLegalizer::ExpandLoad The two code paths have the same goal, legalizing a load of a non-byte-sized vector by loading the "flattened" representation in memory, slicing off each single element and then building a vector out of those pieces. The technique employed by `ExpandLoad` is slightly more convoluted and produces slightly better codegen on ARM, AMDGPU and x86 but suffers from some bugs (D78480) and is wrong for BE machines. Differential Revision: https://reviews.llvm.org/D79096	2020-05-02 15:18:10 -07:00
Reid Kleckner	3542384ae9	[COFF] Use a global option table to avoid reconstructing it Otherwise an ArgumentParser is constructed for every directive section, and that involves copying the entire table of options into a vector. There is no need for this, just have one option table.	2020-05-02 15:04:19 -07:00
Thomas Preud'homme	d735c7048c	[test] Fix lld's ELF/linkerscript/thunk-gen-mips.s Summary: Lld test ELF/linkerscript/thunk-gen-mips.s was accidentally disabled due to the use of wrong FileCheck directives. As a result the test seems to have bitrotted as it fails to pass if fixing the directive. To ease updates to the test in case of change of the __start address the checks have been changed to use numeric variables to express all the addresses based on the __start address. Reviewed By: atanasyan Differential Revision: https://reviews.llvm.org/D79270	2020-05-02 22:49:23 +01:00
Milian Wolff	08e1812643	[libclang]: visit C++17 if init statements This makes the previously inaccessible AST nodes for C++17 "if with init statements" accessible to consumers of libclang. Differential Revision: https://reviews.llvm.org/D78214	2020-05-02 22:18:36 +02:00
Milian Wolff	4597e3bd47	[libclang]: visit BindingDecl in DecompositionDecl This makes the BindingDecl accessible to consumers of libclang as CXCursor_UnexposedDecl where previously these AST nodes were not visited at all from the libclang API. Differential Revision: https://reviews.llvm.org/D78213	2020-05-02 22:18:31 +02:00
River Riddle	cb9ae0025c	[mlir] Add a new context flag for disabling/enabling multi-threading This is useful for several reasons: * In some situations the user can guarantee that thread-safety isn't necessary and doesn't want to pay the cost of synchronization, e.g., when parsing a very large module. * For things like logging, threading is not desirable as the output is not guaranteed to be in stable order. This flag also subsumes the pass manager flag for multi-threading. Differential Revision: https://reviews.llvm.org/D79266	2020-05-02 12:32:25 -07:00
Simon Pilgrim	a09a3c6d3e	Revert rG8e05ac0a510c - "[DAGCombine] visitTRUNCATE - remove GetDemandedBits call" Causing buildbot failures	2020-05-02 20:08:33 +01:00
Simon Pilgrim	8e05ac0a51	[DAGCombine] visitTRUNCATE - remove GetDemandedBits call rL368553 added SimplifyMultipleUseDemandedBits handling for ISD::TRUNCATE to SimplifyDemandedBits so we don't need to duplicate this (and it gets rid of another GetDemandedBits call which is slowly being replaced with SimplifyMultipleUseDemandedBits anyhow).	2020-05-02 19:52:17 +01:00
mydeveloperday	9e194a3b93	[sema] NFC Unable to build Sema library with MSVC Debug target due to missing /bigobj Summary: Unable to build sema library on MSVC with Debug target ``` C:\clang\llvm-project\clang\lib\Sema\SemaOpenMP.cpp : fatal error C1128: number of sections exceeded object file format limit: compile with /bigobj ``` Reviewed By: aaron.ballman Subscribers: mgorny, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D79292	2020-05-02 19:34:58 +01:00
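
The select-of-constants rewrite described in commit `682f0b366b` ([InstCombine]) rests on a bitwise identity that is small enough to check exhaustively. The following is a minimal sketch in Python, not the InstCombine implementation: it models the `@src` and `@tgt` functions from that commit message on 8-bit values and searches for any input where they disagree. The names `MASK` and `find_counterexample` are hypothetical helpers introduced only for this illustration.

```python
MASK = 0xFF  # assumption: model LLVM i8 values as unsigned 8-bit integers


def src(x: int, c: int, b: bool) -> int:
    """Original pattern: b ? (x | c) : (x & ~c)."""
    not_c = c ^ MASK  # xor with all-ones, i.e. bitwise NOT within 8 bits
    return (x | c) if b else (x & not_c)


def tgt(x: int, c: int, b: bool) -> int:
    """Rewritten pattern: (x & ~c) | (b ? c : 0)."""
    not_c = c ^ MASK
    return (x & not_c) | (c if b else 0)


def find_counterexample():
    """Return the first (x, c, b) where src and tgt differ, or None."""
    for x in range(256):
        for c in range(256):
            for b in (False, True):
                if src(x, c, b) != tgt(x, c, b):
                    return (x, c, b)
    return None


if __name__ == "__main__":
    print(find_counterexample())
```

The two forms agree because `(x & ~c) | c` equals `x | c`: clearing the bits of `c` and then setting them again gives the same result as setting them directly. When `b` is false, both forms reduce to `x & ~c`.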

353256 Commits