llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	84f007f7ef	[CallSite removal][CodeGen] Drop some unneeded includes of CallSite.h. NFC The uses of CallSite were removed in previous patches.	2020-04-16 11:05:35 -07:00
Craig Topper	9c7d917701	[CallSite removal][CodeGen] Remove CallSite use from BasicTTIImpl.h. NFC While there convert iterator loops to range-based. Differential Revision: https://reviews.llvm.org/D78275	2020-04-16 10:56:43 -07:00
Daniel Sanders	d9085f65db	[globalisel] Add lost debug locations verifier Summary: This verifier tries to ensure that DebugLoc's don't just disappear as we transform the MIR. It observes the instructions created, erased, and changed and at checkpoints chosen by the client algorithm verifies the locations affected by those changes. In particular, it verifies that: * Every DebugLoc for an erased/changing instruction is still present on at least one new/changed instruction * Failing that, that there is a line-0 location in the new/changed instructions. It's not possible to confirm which locations were merged so it conservatively assumes all unaccounted for locations are accounted for by any line-0 location to avoid false positives. If that fails, it prints the lost locations in the debug output along with the instructions that should have accounted for them. In theory, this is usable by the legalizer, combiner, selector and any other pass that performs incremental changes to the MIR. However, it has so far only really been tested on the legalizer (not including the artifact combiner) where it has caught lots of lost locations, particularly in Custom legalizations. There's only one example here as my initial testing was on an out-of-tree target and I haven't done a pass over the in-tree targets yet. Depends on D77575, D77446 Reviewers: bogner, aprantl, vsk Subscribers: jvesely, nhaehnle, mgorny, rovka, hiraditya, volkan, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77576	2020-04-16 10:43:35 -07:00
Daniel Sanders	7c6ca18fff	[globalisel] Allow backends to report an issue without triggering fallback. NFC Summary: This will allow us to fix the issue where the lost locations verifier causes CodeGen changes on lost locations because it falls back on DAGISel Reviewers: qcolombet, bogner, aprantl, vsk, paquette Subscribers: rovka, hiraditya, volkan, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78261	2020-04-16 10:43:35 -07:00
Simon Pilgrim	490443f822	MCSchedule.h - replace ArrayRef.h include with forward declaration. NFC.	2020-04-16 17:13:56 +01:00
Simon Pilgrim	e7fc356668	MCInstrDesc.h - move MCSubtargetInfo forward declaration down to MCInstrInfo.h. NFC. Remove unused FeatureBitset forward declaration	2020-04-16 17:13:56 +01:00
Simon Pilgrim	e1dc1ae01b	Wasm.h - remove unnecessary StringMap.h include. NFC	2020-04-16 17:13:55 +01:00
Simon Pilgrim	56e70fe8b7	MCAsmBackend.h - cleanup includes and forward declarations. NFC. Replace StringRef.h include to forward declaration Remove MCFragment/MCRelaxableFragment forward declarations - these are included in MCFragment.h	2020-04-16 17:13:55 +01:00
Simon Pilgrim	f701d8fa5f	MCValue.h - cleanup include and forward declaration. NFC. Remove MCSymbol.h include Remove unused MCAsmInfo forward declaration	2020-04-16 15:18:24 +01:00
Simon Pilgrim	5d3a400463	AntiDepBreaker.h - remove unused MachineOperand.h include. NFC.	2020-04-16 14:59:50 +01:00
Simon Pilgrim	1cbd6a5888	MCObjectWriter.h - remove unnecessary includes. NFC The EndianStream.h/raw_ostream.h headers should be removed as well but we have a lot of other files that are implicitly relying on them being present.	2020-04-16 14:59:49 +01:00
Simon Pilgrim	d9e81aab10	WasmEHFuncInfo.h - reduce BasicBlock.h/MachineBasicBlock.h includes to just forward declarations. NFC.	2020-04-16 14:59:49 +01:00
Bjorn Pettersson	fdf9bad573	[Float2Int] Stop passing around a reference to the class member Roots. NFC The Float2IntPass got a class member called Roots, but Roots was also passed around to member function as a reference. This patch simply remove those references.	2020-04-16 15:24:13 +02:00
Simon Pilgrim	da20740786	yaml2obj.h - cleanup includes and forward declaration. NFC. Reduce StringRef.h/Error.h includes to just the necessary STLExtras.h include and StringRef/Twine forward declarations Remove unused Expected<> forward declaration	2020-04-16 13:15:32 +01:00
Simon Pilgrim	b9a8e787ae	Parser.h/cpp - cleanup includes and forward declaration. NFC. Parser.h - Reduce MemoryBuffer.h include to just the necessary StringRef.h include and MemoryBufferRef forward declaration Parser.cpp - Remove unused raw_ostream.h include	2020-04-16 13:15:32 +01:00
Simon Pilgrim	ee66b5b0da	Pass.h/cpp - cleanup includes and forward declaration. NFC. Remove unused BasicBlock forward declaration from Pass.h and Attributes/BasicBlock includes from Pass.cpp Add BasicBlock forward declaration to UnifyFunctionExitNodes.h which was relying on Pass.h	2020-04-16 13:15:31 +01:00
Matthias Gehre	6d2f73f821	Revert "Revert "[LifetimeAnalysis] Add [[gsl::Pointer]] to llvm::StringRef"" This reverts commit `bac85ab3b5`.	2020-04-16 14:10:22 +02:00
Benjamin Kramer	bac85ab3b5	Revert "[LifetimeAnalysis] Add [[gsl::Pointer]] to llvm::StringRef" This reverts commit `83d5131d87`. Spams llvm/ADT/StringRef.h:57:11: warning: unknown attribute 'Pointer' ignored [-Wunknown-attributes]	2020-04-16 14:06:39 +02:00
Sergej Jaskiewicz	5cef31074f	Introduce llvm::sys::Process::getProcessId() and adopt it Differential Revision: https://reviews.llvm.org/D78022	2020-04-16 15:05:37 +03:00
Georgii Rymar	65a2de7e6c	[FileCheck] - Fix the false positive when -implicit-check-not is used with an unknown -check-prefix. Imagine we have the following invocation: `FileCheck -check-prefix=UNKNOWN-PREFIX -implicit-check-not=something` When the check prefix does not exist it does not fail. This patch fixes the issue. Differential revision: https://reviews.llvm.org/D78024	2020-04-16 15:00:50 +03:00
Konstantin Schwarz	1a3e89aa2b	[MIR] Add comments to INLINEASM immediate flag MachineOperands Summary: The INLINEASM MIR instructions use immediate operands to encode the values of some operands. The MachineInstr pretty printer function already handles those operands and prints human readable annotations instead of the immediates. This patch adds similar annotations to the output of the MIRPrinter, however uses the new MIROperandComment feature. Reviewers: SjoerdMeijer, arsenm, efriedma Reviewed By: arsenm Subscribers: qcolombet, sdardis, jvesely, wdng, nhaehnle, hiraditya, jrtc27, atanasyan, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78088	2020-04-16 13:46:14 +02:00
Carl Ritson	43e2460a89	[LiveIntervals] Replace handleMoveIntoBundle Summary: The current handleMoveIntoBundle implementation is unusable, it attempts to access the slot indexes of bundled instructions. It also leaves bundled instructions with slot indexes assigned. Replace handleMoveIntoBundle this with a more explicit handleMoveIntoNewBundle function which recalculates the live intervals for all instructions moved into a newly formed bundle, and removes slot indexes from these instructions. Reviewers: arsenm, MaskRay, kariddi, tpr, qcolombet Reviewed By: qcolombet Subscribers: MatzeB, wdng, hiraditya, arphaman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77969	2020-04-16 19:58:19 +09:00
David Zarzycki	27e63d9b0e	Fix -Wdocumentation-html warning	2020-04-16 06:33:53 -04:00
Johannes Doerfert	d18bb24749	[Attributor][NFC] Do not create temporary maps during lookup The AAMap.lookup() call created a temporary value if the key was not present. Since the value was another map it was not free to create it. Instead of a lookup we now use find and compare the result against the end iterator explicitly. The result is the same but we never need to create a temporary map.	2020-04-16 02:32:31 -05:00
Dominik Montada	e5d666d768	Revert "Revert "[GlobalISel] Fix invalid combine of unmerge(merge) with intermediate cast"" This reverts commit `1265899c5f`.	2020-04-16 09:30:34 +02:00
Craig Topper	e882ac7c04	[CallSite removal][TargetLowering] Remove ArgListEntry::setAttributes signature that took an ImmutableCallSite. NFC There's another signature that takes a CallBase. The uses of the ImmutableCallSite version were removed in previous patches.	2020-04-16 00:07:59 -07:00
Matthias Gehre	83d5131d87	[LifetimeAnalysis] Add [[gsl::Pointer]] to llvm::StringRef Summary: This detected the bugs fixed in https://reviews.llvm.org/D66442 and https://reviews.llvm.org/D66440 The warning itself was implemented in https://reviews.llvm.org/D63954 https://reviews.llvm.org/D64256 https://reviews.llvm.org/D65120 https://reviews.llvm.org/D65127 https://reviews.llvm.org/D66152 Reviewers: zturner, mehdi_amini, gribozavr Subscribers: dexonsmith, Szelethus, xazax.hun, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D66443	2020-04-16 08:23:30 +02:00
Johannes Doerfert	0741dec27b	[Attributor][FIX] Handle droppable uses when replacing values Since we use the fact that some uses are droppable in the Attributor we need to handle them explicitly when we replace uses. As an example, an assumed dead value can have live droppable users. In those we cannot replace the value simply by an undef. Instead, we either drop the uses (via `dropDroppableUses`) or keep them as they are. In this patch we do both, depending on the situation. For values that are dead but not necessarily removed we keep droppable uses around because they contain information we might be able to use later. For values that are removed we drop droppable uses explicitly to avoid replacement with undef.	2020-04-16 00:56:08 -05:00
Johannes Doerfert	8ad19ffa7b	[MustExecute][NFC] Copy function_ref instead of passing a reference	2020-04-16 00:55:34 -05:00
Craig Topper	8e1408695c	[CallSite removal][TargetLibraryInfo] Replace ImmutableCallSite with CallBase in one of the getLibFunc signatures. NFC Differential Revision: https://reviews.llvm.org/D78083	2020-04-15 22:43:41 -07:00
Fangrui Song	e13a8a1fc5	[MC][COFF][ELF] Reject instructions in IMAGE_SCN_CNT_UNINITIALIZED_DATA/SHT_NOBITS sections For `.bss; nop`, MC inappropriately calls abort() (via report_fatal_error()) with a message `cannot have fixups in virtual section!` It is a bug to crash for invalid user input. Fix it by erroring out early in EmitInstToData(). Similarly, emitIntValue() in a virtual section (SHT_NOBITS in ELF) can crash with the mssage `non-zero initializer found in section '.bss'` (see D4199) It'd be nice to report the location but so many directives can call emitIntValue() and it is difficult to track every location. Note, COFF does not crash because MCAssembler::writeSectionData() is not called for an IMAGE_SCN_CNT_UNINITIALIZED_DATA section. Note, GNU as' arm64 backend reports ``Error: attempt to store non-zero value in section `.bss'`` for a non-zero .inst but fails to do so for other instructions. We simply reject all instructions, even if the encoding is all zeros. The Mach-O counterpart is D48517 (see `test/MC/MachO/zerofill-text.s`) Reviewed By: rnk, skan Differential Revision: https://reviews.llvm.org/D78138	2020-04-15 21:02:47 -07:00
Johannes Doerfert	898bbc252a	[Attributor] Lazily collect function information Before, we eagerly analyzed all the functions to collect information about them, e.g. what instructions may read/write memory. This had multiple drawbacks: - In CGSCC-mode we can end up looking at a callee which is not in the SCC but for which we need an initialized cache. - We end up looking at functions that we deem dead and never need to analyze in the first place. - We have a implicit dependence which is easy to break. This patch moves the function analysis into the information cache and makes it lazy. There is no real functional change expected except due to the first reason above.	2020-04-15 22:26:38 -05:00
Fangrui Song	90a63f6d2d	[MC] Replace MCSection*::getName() with MCSection::getName(). NFC I plan to use MCSection::getName() in D78138. Having the function in the base class is also convenient for debugging. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D78251	2020-04-15 18:35:27 -07:00
Richard Smith	1132c75bd7	Remove vptr dispatch from FoldingSet. Summary: Instead of storing a vptr in each FoldingSet instance, form an equivalent struct and pass it implicitly from FoldingSet into the various FoldingSetBase methods. This has three benefits: * FoldingSet becomes one pointer smaller. * Under LTO, the "virtual" functions are much easier to inline. * The element type no longer needs to be complete when instantiating FoldingSet<T>, only when instantiating an insert / lookup member. Reviewers: rnk Subscribers: hiraditya, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78247	2020-04-15 17:39:35 -07:00
Fangrui Song	7d1ff446b6	[MC] Rename MCSection::getSectionName() to getName(). NFC A pending change will merge MCSection::getName() to MCSection::getName().	2020-04-15 16:48:14 -07:00
Johannes Doerfert	937025757c	[CallGraphUpdater] Remove nodes from their SCC (old PM) Summary: We can and should remove deleted nodes from their respective SCCs. We did not do this before and this was a potential problem even though I couldn't locally trigger an issue. Since the `DeleteNode` would assert if the node was not in the SCC, we know we only remove nodes from their SCC and only once (when run on all the Attributor tests). Reviewers: lebedev.ri, hfinkel, fhahn, probinson, wristow, loladiro, sstefan1, uenoku Subscribers: hiraditya, bollu, uenoku, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77855	2020-04-15 18:38:50 -05:00
Johannes Doerfert	1b34b84ddd	[CallGraphUpdater] Update the ExternalCallingNode for node replacements Summary: While it is uncommon that the ExternalCallingNode needs to be updated, it can happen. It is uncommon because most functions listed as callees have external linkage, modifying them is usually not allowed. That said, there are also internal functions that have, or better had, their "address taken" at construction time. We conservatively assume various uses cause the address "to be taken". Furthermore, the user might have become dead at some point. As a consequence, transformations, e.g., the Attributor, might be able to replace a function that is listed as callee of the ExternalCallingNode. Since there is no function corresponding to the ExternalCallingNode, we did just remove the node from the callee list if we replaced it (so far). Now it would be preferable to replace it if needed and remove it otherwise. However, removing the node has implications on the CGSCC iteration. Locally, that caused some other nodes to be never visited but it is for sure possible other (bad) side effects can occur. As it seems conservatively safe to keep the new node in the callee list we will do that for now. Reviewers: lebedev.ri, hfinkel, fhahn, probinson, wristow, loladiro, sstefan1, uenoku Subscribers: hiraditya, bollu, uenoku, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77854	2020-04-15 18:38:50 -05:00
Roman Lebedev	639b8da8dc	[Attributor] KindToAbstractAttributeMap: use SmallDenseMap Summary: While this is less efficient to allocate huge `SmallDenseMap` for each `IRPosition` in `AAMap`, in the larger picture this is much better, since we'd eventually either fill each `IRPosition`, with each possible attribute, or at least quert for it, which would allocate it anyway. So we are better off pre-allocating. Old: ``` 0.3460 ( 40.7%) 0.0183 ( 33.9%) 0.3643 ( 40.3%) 0.3644 ( 40.3%) Deduce and propagate attributes (CGSCC pass) 0.1135 ( 13.4%) 0.0080 ( 14.7%) 0.1215 ( 13.4%) 0.1215 ( 13.4%) Deduce and propagate attributes ``` ``` total runtime: 19.48s. bytes allocated in total (ignoring deallocations): 575.02MB (29.51MB/s) calls to allocation functions: 908876 (46644/s) temporary memory allocations: 276654 (14198/s) peak heap memory consumption: 26.68MB peak RSS (including heaptrack overhead): 944.78MB total memory leaked: 8.85MB ``` New: ``` 0.3223 ( 38.1%) 0.0299 ( 53.6%) 0.3522 ( 39.1%) 0.3522 ( 39.1%) Deduce and propagate attributes (CGSCC pass) 0.1150 ( 13.6%) 0.0037 ( 6.7%) 0.1188 ( 13.2%) 0.1188 ( 13.2%) Deduce and propagate attributes ``` ``` total runtime: 19.06s. bytes allocated in total (ignoring deallocations): 363.21MB (19.06MB/s) calls to allocation functions: 679660 (35658/s) temporary memory allocations: 83472 (4379/s) peak heap memory consumption: 27.00MB peak RSS (including heaptrack overhead): 931.66MB total memory leaked: 8.85MB ``` Diff: ``` total runtime: -0.42s. bytes allocated in total (ignoring deallocations): -211.81MB (498.38MB/s) calls to allocation functions: -229216 (539331/s) temporary memory allocations: -193182 (454545/s) peak heap memory consumption: 321.54KB peak RSS (including heaptrack overhead): 0B total memory leaked: 0B ``` Reviewers: jdoerfert, sstefan1, uenoku Reviewed By: jdoerfert Subscribers: uenoku, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78231	2020-04-16 00:12:45 +03:00
Roman Lebedev	f54dc12e46	[MustExecute] checkForAllContext(): use pre-increment Summary: You'd think there is no difference, but this halves (yikes!) compiler memory usage on `test-suite/MultiSource/Applications/SPASS/top.c` test, because `MustBeExecutedIterator operator++()` is, well, post-increment, it must create a duplicate of existing `MustBeExecutedIterator`, which involves duplicating `VisitedSetTy Visited` which is `DenseSet`.. Old ``` 0.3573 ( 42.9%) 0.0264 ( 33.7%) 0.3837 ( 42.1%) 0.3837 ( 42.1%) Deduce and propagate attributes (CGSCC pass) 0.1011 ( 12.1%) 0.0199 ( 25.4%) 0.1210 ( 13.3%) 0.1210 ( 13.3%) Deduce and propagate attributes ``` ``` total runtime: 20.04s. bytes allocated in total (ignoring deallocations): 1.09GB (54.63MB/s) calls to allocation functions: 1142410 (57020/s) temporary memory allocations: 500538 (24983/s) peak heap memory consumption: 26.68MB peak RSS (including heaptrack overhead): 944.85MB total memory leaked: 8.85MB ``` New: ``` 0.3309 ( 39.8%) 0.0164 ( 33.3%) 0.3473 ( 39.5%) 0.3473 ( 39.5%) Deduce and propagate attributes (CGSCC pass) 0.1152 ( 13.9%) 0.0076 ( 15.5%) 0.1229 ( 14.0%) 0.1229 ( 14.0%) Deduce and propagate attributes ``` ``` total runtime: 19.49s. bytes allocated in total (ignoring deallocations): 575.07MB (29.51MB/s) calls to allocation functions: 909059 (46651/s) temporary memory allocations: 276923 (14211/s) peak heap memory consumption: 26.68MB peak RSS (including heaptrack overhead): 942.90MB total memory leaked: 8.85MB ``` Diff: ``` total runtime: -0.55s. bytes allocated in total (ignoring deallocations): -519.41MB (946.11MB/s) calls to allocation functions: -233351 (425047/s) temporary memory allocations: -223615 (407313/s) peak heap memory consumption: 0B peak RSS (including heaptrack overhead): 0B total memory leaked: 0B ``` Reviewers: jdoerfert Reviewed By: jdoerfert Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78225	2020-04-16 00:12:17 +03:00
Francesco Petrogalli	89680f25e8	[llvm][CodeGen] Rename SVE gather prefetch intrinsics. [NFC] Summary: The renaming is necessary to make the naming scheme uniform with other gather/scatter load/stores SVE intrinsics. The naming of variables and functions have been adapted to make it explicit whether we are dealing with a scalar offset (which is unscaled) or an index (which is scaled according to the data type of the lanes of the vector). Reviewers: andwar, sdesmalen, rengolin Reviewed By: andwar Subscribers: tschuett, hiraditya, arphaman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77839	2020-04-15 21:49:16 +01:00
Sam Clegg	2a68573a35	Enable finding bitcode in wasm objects This commit fixes using functions in `IRObjectFile` to load bitcode from wasm objects by recognizing the file magic for wasm and also inheriting the default implementation of classifying sections as bitcode. Patch By: alexcrichton Differential Revision: https://reviews.llvm.org/D78199	2020-04-15 12:33:33 -07:00
Davide Italiano	5f87415efc	[LICM] Try to merge debug locations when sinking. The current strategy LICM uses when sinking for debuginfo is that of picking the debug location of one of the uses. This causes stepping to be wrong sometimes, see, e.g. PR45523. This patch introduces a generalization of getMergedLocation(), that operates on a vector of locations instead of two, and try to merge all them together, and use the new API in LICM. <rdar://problem/61750950>	2020-04-15 12:29:34 -07:00
Nikita Popov	8e7d771cf9	[MC] Use subclass data for MCExpr to reduce memory usage MCExpr has a bunch of free space that is currently going to waste. Repurpose it as 24 bits of subclass data, which is enough to reduce the size of all subclasses by 8 bytes. This gives us some respectable savings for debuginfo builds. Here are the max-rss reductions for the fat LTO link step: kc.link 238MiB 231MiB (-2.82%) sqlite3.link 258MiB 250MiB (-3.27%) consumer-typeset.link 152MiB 148MiB (-2.51%) bullet.link 197MiB 192MiB (-2.30%) tramp3d-v4.link 578MiB 567MiB (-1.92%) pairlocalalign.link 92MiB 90MiB (-1.98%) clamscan.link 230MiB 223MiB (-2.81%) lencod.link 242MiB 235MiB (-2.67%) SPASS.link 235MiB 230MiB (-2.23%) 7zip-benchmark.link 450MiB 435MiB (-3.25%) Differential Revision: https://reviews.llvm.org/D77939	2020-04-15 20:02:11 +02:00
Amara Emerson	c22cb5bd31	[GlobalISel] Enable artifact combiner to combine starting from a G_MERGE_VALUES. We generally only combine starting from users to defs in the artifact combiner, but this doesn't catch cases where at the point of combining a G_UNMERGE we don't yet have the opposite G_MERGE on input yet since we haven't legalized that far. This change adds the users of a G_MERGE to the artifact combiner worklist if one of the uses is a G_UNMERGE or G_TRUNC. Differential Revision: https://reviews.llvm.org/D77931	2020-04-15 10:34:13 -07:00
Dominik Montada	1265899c5f	Revert "[GlobalISel] Fix invalid combine of unmerge(merge) with intermediate cast" This reverts commit `bddac41b9f`.	2020-04-15 18:47:39 +02:00
Dominik Montada	bddac41b9f	[GlobalISel] Fix invalid combine of unmerge(merge) with intermediate cast Summary: The combine for unmerge(cast(merge)) is only valid for vectors, but was missing a corresponding check. Add a check that the operands are vectors to avoid an invalid combine. Without this check, the combiner would emit incorrect code for scalars and pointers because the artifact cast (trunc/ext) only affects bits at the end of the type, while this combine assumes that the casted bits appear between meaningful bits. This also uncovered a segmentation fault in the AMDGPU InstructionSelector. The tests triggering this bug have been moved to their own file and a check for the segmentation fault has been added. Reviewers: arsenm, dsanders, aemerson, paquette, aditya_nandakumar Reviewed By: arsenm Subscribers: tpr, jvesely, wdng, nhaehnle, rovka, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78191	2020-04-15 17:19:14 +02:00
Dominik Montada	443c244cff	[GlobalISel] translate freeze to new generic G_FREEZE Summary: As a follow up to https://reviews.llvm.org/D29014, add translation support for freeze. Introduce a new generic instruction G_FREEZE and translate freeze to it. Reviewers: dsanders, aqjune, arsenm, aditya_nandakumar, t.p.northover, lebedev.ri, paquette, aemerson Reviewed By: aqjune, arsenm Subscribers: fhahn, lebedev.ri, wdng, rovka, hiraditya, jfb, volkan, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77795	2020-04-15 16:47:05 +02:00
Xing Xue	4578fa8a1c	[demangler] PPC and S390: Fix parsing of e-prefixed long double literals Summary: This patch is to fix the parsing of long double literals encoded with the e prefix on PowerPC and S390. For both PowerPC and S390, type code e is used for 64-bit long double literals and g is used for 128-bit long double literals. libcxxabi test case test_demangle.pass.cpp fails without the fix. Authored by: xingxue-ibm Reviewers: hubert.reinterpretcast, jasonliu, erik.pilkington, uweigand, mclow.li sts, libc++abi Reviewed by: hubert.reinterpretcast, erik.pilkington Differential Revision: https://reviews.llvm.org/D74163	2020-04-15 09:59:06 -04:00
Victor Campos	d85b3877dc	[CodeGen][ARM] Error when writing to specific reserved registers in inline asm Summary: No error or warning is emitted when specific reserved registers are written to in inline assembly. Therefore, writes to the program counter or to the frame pointer, for instance, were permitted, which could have led to undesirable behaviour. Example: int foo() { register int a __asm__("r7"); // r7 = frame-pointer in M-class ARM __asm__ __volatile__("mov %0, r1" : "=r"(a) : : ); return a; } In contrast, GCC issues an error in the same scenario. This patch detects writes to specific reserved registers in inline assembly for ARM and emits an error in such case. The detection works for output and input operands. Clobber operands are not handled here: they are already covered at a later point in AsmPrinter::emitInlineAsm(const MachineInstr *MI). The registers covered are: program counter, frame pointer and base pointer. This is ARM only. Therefore the implementation of other targets' counterparts remain open to do. Reviewers: efriedma Reviewed By: efriedma Subscribers: kristof.beyls, hiraditya, danielkiss, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76848	2020-04-15 14:40:42 +01:00
Simon Moll	a688a70d58	[nfc] clang-format TargetTransformInfoImpl.h	2020-04-15 14:01:31 +02:00
Simon Moll	b310daea21	[nfc] clang-format TargetTransformInfo.h	2020-04-15 14:00:07 +02:00
Denis Antrushin	edbb27ccb6	[Statepoint] Add getters to StatepointOpers. To simplify future work on statepoint representation, hide direct access to statepoint field indices and provide getters for them. Add getters for couple more statepoint fields. This also fixes two bugs in MachineVerifier for statepoint: First, the `break` statement was falling out of `if` statement scope, thus disabling following checks. Second, it was incorrectly accessing some fields like CallingConv - StatepointOpers gives index to their value directly, not to preceeding field type encoding. Reviewed By: skatkov Differential Revision: https://reviews.llvm.org/D78119	2020-04-15 14:31:42 +03:00
Simon Moll	2eeb6ca7ac	[NFC] clang-format IntrinsicInst.h\|cpp Differential Revision: https://reviews.llvm.org/D78188	2020-04-15 12:05:23 +02:00
Sameer Sahasrabuddhe	8c11bc0cd0	Introduce fix-irreducible pass An irreducible SCC is one which has multiple "header" blocks, i.e., blocks with control-flow edges incident from outside the SCC. This pass converts an irreducible SCC into a natural loop by introducing a single new header block and redirecting all the edges on the original headers to this new block. This is a useful workaround for a limitation in the structurizer which, which produces incorrect control flow in the presence of irreducible regions. The AMDGPU backend provides an option to enable this pass before the structurizer, which may eventually be enabled by default. Reviewed By: nhaehnle Differential Revision: https://reviews.llvm.org/D77198 This restores commit `2ada8e2525`. Originally reverted with commit `44e09b59b8`.	2020-04-15 15:05:51 +05:30
Sameer Sahasrabuddhe	44e09b59b8	Revert "Introduce fix-irreducible pass" This reverts commit `2ada8e2525`. Buildbots produced compilation errors which I was not able to quickly reproduce locally. Need more time to investigate.	2020-04-15 12:19:50 +05:30
Sameer Sahasrabuddhe	2ada8e2525	Introduce fix-irreducible pass An irreducible SCC is one which has multiple "header" blocks, i.e., blocks with control-flow edges incident from outside the SCC. This pass converts an irreducible SCC into a natural loop by introducing a single new header block and redirecting all the edges on the original headers to this new block. This is a useful workaround for a limitation in the structurizer which, which produces incorrect control flow in the presence of irreducible regions. The AMDGPU backend provides an option to enable this pass before the structurizer, which may eventually be enabled by default. Reviewed By: nhaehnle Differential Revision: https://reviews.llvm.org/D77198	2020-04-15 11:29:19 +05:30
QingShan Zhang	c9f9c79c5a	[NFC][DAGCombine] Change the value of NegatibleCost to make it align with the semantics This is a minor NFC change to make the code more clear. We have the NegatibleCost that has cheaper, neutral, and expensive. Typically, the smaller one means the less cost. It is inverse for current implementation, which makes following code not easy to read. If (CostX > CostY) negate(X) Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D77993	2020-04-15 02:20:58 +00:00
River Riddle	229e392b4e	[llvm][StringExtras] Merge StringExtras from MLIR into LLVM Summary: This revision adds two utilities currently present in MLIR to LLVM StringExtras: * convertToSnakeFromCamelCase Convert a string from a camel case naming scheme, to a snake case scheme * convertToCamelFromSnakeCase Convert a string from a snake case naming scheme, to a camel case scheme Differential Revision: https://reviews.llvm.org/D78167	2020-04-14 18:57:22 -07:00
Teresa Johnson	33ffb62e23	Allow disabling of vectorization using internal options Summary: Currently, the internal options -vectorize-loops, -vectorize-slp, and -interleave-loops do not have much practical effect. This is because they are used to initialize the corresponding flags in the pass managers, and those flags are then unconditionally overwritten when compiling via clang or via LTO from the linkers. The only exception was -vectorize-loops via opt because of some special hackery there. While vectorization could still be disabled when compiling via clang, using -fno-[slp-]vectorize, this meant that there was no way to disable it when compiling in LTO mode via the linkers. This only affected ThinLTO, since for regular LTO vectorization is done during the compile step for scalability reasons. For ThinLTO it is invoked in the LTO backends. See also the discussion on PR45434. This patch makes it so the internal options can actually be used to disable these optimizations. Ultimately, the best long term solution is to mark the loops with metadata (similar to the approach used to fix -fno-unroll-loops in D77058), but this enables a shorter term workaround, and actually makes these internal options useful. I constant propagated the initial values of these internal flags into the pass manager flags (for some reasons vectorize-loops and interleave-loops were initialized to true, while vectorize-slp was initialized to false). As mentioned above, they are overwritten unconditionally so this doesn't have any real impact, and these initial values aren't particularly meaningful. I then changed the passes to check the internl values and return without performing the associated optimization when false (I changed the default of -vectorize-slp to true so the options behave similarly). I was able to remove the hackery in opt used to get -vectorize-loops=false to work, as well as a special option there used to disable SLP vectorization. Finally, I changed thinlto-slp-vectorize-pm.c to: a) Only test SLP (moved the loop vectorization checking to a new test). b) Use code that is slp vectorized when it is enabled, and check that instead of whether the pass is enabled. c) Test the new behavior of -vectorize-slp. d) Test both pass managers. The loop vectorization (and associated interleaving) testing I moved to a new thinlto-loop-vectorize-pm.c test, with several changes: a) Changed the flags on the interleaving testing so that it will actually interleave, and check that. b) Test the new behavior of -vectorize-loops and -interleave-loops. c) Test both pass managers. Reviewers: fhahn, wmi Subscribers: hiraditya, steven_wu, dexonsmith, cfe-commits, davezarzycki, llvm-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77989	2020-04-14 18:09:10 -07:00
Eli Friedman	2876b3eef3	[SelectionDAG] Always preserve offset in MachinePointerInfo Previously, getWithOffset() would drop the offset if the base was null. Because of this, MachineMemOperand would return the wrong result from getAlign() in these cases. MachineMemOperand stores the alignment of the pointer without the offset. A bunch of MIR tests changed because we print the offset now. Split off from D77687. Differential Revision: https://reviews.llvm.org/D78049	2020-04-14 15:29:41 -07:00
River Riddle	ebf190fcda	[llvm][ADT] Move TypeSwitch class from MLIR to LLVM This class implements a switch-like dispatch statement for a value of 'T' using dyn_cast functionality. Each `Case<T>` takes a callable to be invoked if the root value isa<T>, the callable is invoked with the result of dyn_cast<T>() as a parameter. Differential Revision: https://reviews.llvm.org/D78070	2020-04-14 15:14:41 -07:00
River Riddle	2f21a57966	[llvm][STLExtras] Move the algorithm `interleave*` methods from MLIR to LLVM These have proved incredibly useful for interleaving values between a range w.r.t to streams. After this revision, the mlir/Support/STLExtras.h is empty. A followup revision will remove it from the tree. Differential Revision: https://reviews.llvm.org/D78067	2020-04-14 15:14:40 -07:00
River Riddle	204c3b5516	[llvm][STLExtras] Move various iterator/range utilities from MLIR to LLVM This revision moves the various range utilities present in MLIR to LLVM to enable greater reuse. This revision moves the following utilities: * indexed_accessor_* This is set of utility iterator/range base classes that allow for building a range class where the iterators are represented by an object+index pair. * make_second_range Given a range of pairs, returns a range iterating over the `second` elements. * hasSingleElement Returns if the given range has 1 element. size() == 1 checks end up being very common, but size() is not always O(1) (e.g., ilist). This method provides O(1) checks for those cases. Differential Revision: https://reviews.llvm.org/D78064	2020-04-14 15:14:40 -07:00
River Riddle	8cbe371c28	[llvm][STLExtras] Add various type_trait utilities currently present in MLIR This revision moves several type_trait utilities from MLIR into LLVM. Namely, this revision adds: is_detected - This matches the experimental std::is_detected is_invocable - This matches the c++17 std::is_invocable function_traits - A utility traits class for getting the argument and result types of a callable type Differential Revision: https://reviews.llvm.org/D78059	2020-04-14 15:14:40 -07:00
River Riddle	f52ec5d5c0	[llvm][DenseMapInfo] Add an info specialization for std::tuple This revision adds a DenseMapInfo overload for std::tuples whose elements all have a DenseMapInfo. The implementation is similar to that of std::pair, and has been used within MLIR for over a year. Differential Revision: https://reviews.llvm.org/D78057	2020-04-14 15:14:40 -07:00
Eli Friedman	c285841a4f	Enable new passmanager plugin support for LTO. This should make both static and dynamic NewPM plugins work with LTO. And as a bonus, it makes static linking of OldPM plugins more reliable for plugins with both an OldPM and NewPM interface. I only implemented the command-line flag to specify NewPM plugins in llvm-lto2, to show it works. Support can be added for other tools later. Differential Revision: https://reviews.llvm.org/D76866	2020-04-14 15:07:07 -07:00
Juneyoung Lee	994543abc9	[ValueTracking] Implement canCreatePoison Summary: This PR adds `canCreatePoison(Instruction *I)` which returns true if `I` can generate poison from non-poison operands. Reviewers: spatel, nikic, lebedev.ri Reviewed By: spatel Subscribers: hiraditya, llvm-commits, regehr, nlopes Tags: #llvm Differential Revision: https://reviews.llvm.org/D77890	2020-04-15 05:58:06 +09:00
Sam Clegg	3ea1c62cba	[WebAssembly] Emit .llvmcmd and .llvmbc as custom sections Fixes: https://bugs.llvm.org/show_bug.cgi?id=45362 Differential Revision: https://reviews.llvm.org/D77115	2020-04-14 13:24:18 -07:00
Thomas Raoux	c228c717aa	[AntidepBreaker] Move AntiDepBreaker to include folder. This allows AntiDepBreaker to be used in target specific postRA scheduler. Differential Revision: https://reviews.llvm.org/D78047	2020-04-14 11:40:57 -07:00
Sourabh Singh Tomar	85b49ecb78	[DWARF5]: Added support for DW_MACRO_import form in llvm-dwarfdump GCC emits this new form along with others forms(supported in llvm-dwardump) and since it's support was missing in llvm-dwarfdump, it was not able to correctly dump the content a debug_macro section for GCC generated binaries. This patch extends llvm-dwarfdump to support this form, now GCC generated debug_macro section can be correctly dumped using llvm-dwarfdump. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D78006	2020-04-14 23:51:46 +05:30
Alina Sbirlea	d5fcb7966e	[STLExtras] Make const the * operator for mapped_iterator. Summary: The current non-const * operator shadows the const operator in iterator_adaptor_base. Reviewers: mehdi_amini, rriddle!, dblaikie, timshen Subscribers: dexonsmith, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, liufengdb, Joonsoo, grosul1, frgossen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78073	2020-04-14 11:04:20 -07:00
Aaron Puchert	e833e58300	[ValueLattice] Remove unused DataLayout parameter of mergeIn, NFC Reviewed By: fhahn, echristo Differential Revision: https://reviews.llvm.org/D78061	2020-04-14 13:32:53 +02:00
Georgii Rymar	1647ff6e27	[ADT/STLExtras.h] - Add llvm::is_sorted wrapper and update callers. It can be used to avoid passing the begin and end of a range. This makes the code shorter and it is consistent with another wrappers we already have. Differential revision: https://reviews.llvm.org/D78016	2020-04-14 14:11:02 +03:00
Craig Topper	3043093822	[CallSite removal][CodeGen] Replace ImmutableCallSite with CallBase in isInTailCallPosition.	2020-04-13 23:04:57 -07:00
Mircea Trofin	fe8a2ad4a0	[llvm][NFC][CallSite] Remove CallSite from CGSCCPassManager Reviewers: craig.topper, dblaikie, davidxl Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D78080	2020-04-13 22:52:15 -07:00
Mircea Trofin	4aae4e3f48	[llvm][NFC] CallSite removal from inliner-related files Summary: This removes CallSite from inliner files. Some dependencies where thus affected. Reviewers: dblaikie, davidxl, craig.topper Subscribers: arsenm, jvesely, nhaehnle, eraman, hiraditya, aheejin, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77991	2020-04-13 21:28:58 -07:00
Matt Arsenault	f48fe2c36e	GlobalISel: Fix casted unmerge of G_CONCAT_VECTORS This was assuming a scalarizing unmerge, and would fail assert if the unmerge was to smaller vector types.	2020-04-13 22:03:05 -04:00
Mehdi Amini	384ca190ae	Revert "Move ModuleSummaryAnalysis from libAnalysis to libObject to break the dependency from Analysis to Object" This reverts commit `10df1563d6`. Some buildbots are broken.	2020-04-14 00:27:08 +00:00
Christopher Tetreault	eab73dfed9	[SVE] Change return type of getNumElements to unsigned Reviewers: efriedma, sdesmalen, craig.topper, dexonsmith Reviewed By: efriedma, sdesmalen Subscribers: tschuett, hiraditya, rkruppe, psnobl, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, grosul1, frgossen, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77763	2020-04-13 16:24:18 -07:00
Mehdi Amini	10df1563d6	Move ModuleSummaryAnalysis from libAnalysis to libObject to break the dependency from Analysis to Object ModuleSummaryAnalysis is the only file in libAnalysis that brings a dependency on the CodeGen layer from libAnalysis, moving it breaks this dependency. Differential Revision: https://reviews.llvm.org/D77994	2020-04-13 23:12:11 +00:00
Craig Topper	113f37a1f9	[CallSite removal][TargetLowering] Replace ImmutableCallSite with CallBase Differential Revision: https://reviews.llvm.org/D77995	2020-04-13 13:50:15 -07:00
Eli Friedman	89e0662dee	Make IRBuilder automatically set alignment on load/store/alloca. This is equivalent in terms of LLVM IR semantics, but we want to transition away from using MaybeAlign to represent the alignment of these instructions. Differential Revision: https://reviews.llvm.org/D77984	2020-04-13 13:43:14 -07:00
Lang Hames	e823068306	[Support] Add support RTTI support for open class hierarchies. This patch extracts the RTTI part of llvm::ErrorInfo into its own class (RTTIExtends) so that it can be used in other non-error hierarchies, and makes it compatible with the existing LLVM RTTI function templates (isa, cast, dyn_cast, dyn_cast_or_null) by adding the classof method. Differential Revision: https://reviews.llvm.org/D39111	2020-04-13 12:52:44 -07:00
Rahman Lavaee	05192e585c	Extend BasicBlock sections to allow specifying clusters of basic blocks in the same section. Differential Revision: https://reviews.llvm.org/D76954	2020-04-13 12:19:59 -07:00
Rahman Lavaee	4ddf7ab454	Revert "Extend BasicBlock sections to allow specifying clusters of basic blocks" This reverts commit `0d4ec16d3d` Because tests were not added to the commit.	2020-04-13 12:19:59 -07:00
Lama	5c7bbe3659	[MachinePipeliner] Refine the RecMII calculation In the case of more than one SDep between two successor SUnits in the Nodeset, the current implementation sums the latencies of the dependencies, which could create a larger RecMII than necessary. for example, in case there is both a data dependency and an output dependency (with latency > 0) between successor nodes: SU(1) inst1: successors: SU(2): out latency = 1 SU(2): data latency = 1 SU(2) inst2: successors: SU(3): out latency = 1 SU(3): data latency = 1 SU(3) inst3: successors: SU(1): out latency = 1 SU(1): data latency = 1 the NodeSet latency returned would be 6, whereas it could be 3 if we take the max for each successor SUnit. In general this can be extended to finding the shortest path in the recurrence.. thoughts? Unfortunately I had a hard time creating a test for this in Hexagon/PowerPC, so help would be appreciated. Reviewed By: bcahoon Differential Revision: https://reviews.llvm.org/D75918	2020-04-13 19:17:15 +00:00
Austin Kerbow	cc4d7dced9	[AMDGPU] Fix comment for llvm.amdgcn.div.scale. NFC. Update comment to reflect actual lowering.	2020-04-13 12:04:55 -07:00
Rahman Lavaee	0d4ec16d3d	Extend BasicBlock sections to allow specifying clusters of basic blocks in the same section. This allows specifying BasicBlock clusters like the following example: !foo !!0 1 2 !!4 This places basic blocks 0, 1, and 2 in one section in this order, and places basic block #4 in a single section of its own.	2020-04-13 11:46:11 -07:00
Vedant Kumar	122a6bfb07	[Debugify] Strip added metadata in the -debugify-each pipeline Summary: Share logic to strip debugify metadata between the IR and MIR level debugify passes. This makes it simpler to hunt for bugs by diffing IR with vs. without -debugify-each turned on. As a drive-by, fix an issue causing CallGraphNodes to become invalid when a dead llvm.dbg.value prototype is deleted. Reviewers: dsanders, aprantl Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77915	2020-04-13 10:55:17 -07:00
Benjamin Kramer	01df04fb59	[ADT] Reduce the requirements for the simple loop in DenseMap::clear We can use it when just the value doesn't require destruction. Empty keys are safe to overwrite always. This gets the important case of std::pair values.	2020-04-13 19:33:45 +02:00
Craig Topper	68eb08646c	[CallSite removal][GlobalISel] Use CallBase instead of CallSite in lowerCall and translateCallBase. Differential Revision: https://reviews.llvm.org/D78001	2020-04-13 10:31:30 -07:00
Matt Arsenault	e6605a209c	DAG: Fix wrong legality check for ISD::FMAD Since `1725f28841`, this should check isFMADLegalForFAddFSub rather than the the plain isOperationLegal. This would assert in a subset of cases due to an oddity in how FMAD is selected. We will allow FMA formation pre-legalize, but not FMAD even in cases where it would be valid. The current hook requires passing in the root fadd/fsub. However, in this distributed case, this would be far more complicated to pass in the relevant operand. AMDGPU doesn't get any value from the node, and only needs the type and is the only implementor, so I'm not sure why we have this complexity. Just rename and expand the assert to avoid the more complicated checks spread through the distribution logic.	2020-04-13 10:25:39 -07:00
Simon Pilgrim	ad57286232	CodeMetrics.h - include and forward declaration cleanup. NFC. Remove SmallPtrSet include, replace with forward declaration and include SmallPtrSet.h in CodeMetrics.cpp directly. Remove unused llvm::DataLayout/Instruction forward declarations.	2020-04-13 13:09:39 +01:00
Simon Pilgrim	39e19588fa	IVDescriptors.h - include and forward declaration cleanup. NFC. Replace unused AliasAnalysis.h and EHPersonalities.h includes Remove unused llvm::AliasSet/AliasSetTracker/BasicBlock/DataLayout/LoopInfo/OptimizationRemarkEmitter/PredIteratorCache/TargetLibraryInfo/TargetTransformInfo forward declarations.	2020-04-13 13:09:39 +01:00
Tyker	813f438baa	[AssumeBundles] adapt Assumption cache to assume bundles Summary: change assumption cache to store an assume along with an index to the operand bundle containing the knowledge. Reviewers: jdoerfert, hfinkel Reviewed By: jdoerfert Subscribers: hiraditya, mgrang, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77402	2020-04-13 12:04:51 +02:00
Craig Topper	dbb272b0a3	[CallSite removal][FastISel] Use CallBase instead of CallSite in fastLowerCall.	2020-04-12 18:02:24 -07:00
Chris Lattner	89c8ffd542	NFC: Clean up the implementation of StringPool a bit, and remove dependence on some "implicitly MallocAllocator" based methods on StringMapEntry. This allows reducing the #includes in StringMapEntry.h. Summary: StringPool has many caveats and isn't used in the monorepo. I will propose removing it as a patch separate from this refactoring patch. Reviewers: rriddle Subscribers: hiraditya, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77976	2020-04-12 16:37:17 -07:00
Huihui Zhang	8e2daa0c7f	[NFC] Silence compiler warning [-Wmissing-braces]. llvm/include/llvm/Support/FormatVariadic.h:146:15: warning: suggest braces around initialization of subobject [-Wmissing-braces] return {&Items...}; ^~~~~~ { }	2020-04-12 14:42:59 -07:00
Craig Topper	95192f548d	[CallSite removal][TargetLowering] Use CallBase instead of CallSite in TargetLowering::ParseConstraints interface. Differential Revision: https://reviews.llvm.org/D77929	2020-04-12 11:26:25 -07:00
Mircea Trofin	d2f1cd5d97	[llvm][NFC] Refactor uses of CallSite to CallBase - call promotion Summary: Updated CallPromotionUtils and impacted sites. Parameters that are expected to be non-null, and return values that are guranteed non-null, were replaced with CallBase references rather than pointers. Left FIXME in places where more changes are facilitated by CallBase, but aren't CallSites: Instruction* parameters or return values, for example, where the contract that they are actually CallBase values. Reviewers: davidxl, dblaikie, wmi Reviewed By: dblaikie Subscribers: arsenm, jvesely, nhaehnle, eraman, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77930	2020-04-12 08:27:29 -07:00
Chris Lattner	617b08ff9b	Refactor StringMap.h, splitting StringMapEntry out to its own header. Summary: StringMapEntry.h can have lower dependencies, than StringMap.h, which is useful for public headers that want to expose inline methods on StringMapEntry<> but don't need to expose all of StringMap.h. One example of this is mlir's Identifier.h, another example is the existing LLVM StringPool.h. StringPool also could use a cleanup, I'll deal with that in a follow-on patch. Reviewers: rriddle Subscribers: hiraditya, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77963	2020-04-12 08:25:17 -07:00
Sanjay Patel	c23cbefd9d	[VectorUtils] add IR-level analysis for widening of shuffle mask This is similar to the recent move/addition of "scaleShuffleMask" (D76508), but there are a couple of differences: 1. The existing x86 helper (canWidenShuffleElements) always tries to divide-by-2, so it gets called iteratively and wouldn't handle the general case of non-pow-2 length. 2. The existing x86 code handles "SM_SentinelZero" - we don't have that in IR, but this code should be safe to use with that or other special (negative) values. The motivation is to enable shuffle folds in instcombine/vector-combine that are similar to D76844 and D76727, but in the reverse-bitcast direction. Those patterns are visible in the tests for D40633. Differential Revision: https://reviews.llvm.org/D77881	2020-04-12 10:14:19 -04:00
Simon Pilgrim	2b74755ec5	TrigramIndex.h - remove unnecessary StringMap.h include. NFC Include StringRef.h inside TrigramIndex.cpp as thats the only part of StringMap.h that is actually required.	2020-04-12 14:30:52 +01:00
Simon Pilgrim	688ac00bb2	IPO.h - replace SmallVector.h include with forward declaration. NFC.	2020-04-12 13:49:57 +01:00
Simon Pilgrim	d02bc5e150	Pass.h - replace StringRef.h include with forward declaration. NFC.	2020-04-12 13:36:20 +01:00
Simon Pilgrim	c46e65fa80	VirtRegMap.h - remove unnecessary MCRegisterInfo.h include. NFC	2020-04-12 13:11:13 +01:00
Simon Pilgrim	9ad6773790	SelectionDAGISel.h - remove Pass.h include and forward declarations. NFC. Remove Pass.h include - we don't use anything direct from the header and its implicitly included by MachineFunctionPass.h. Remove unused forward declarations.	2020-04-12 12:49:29 +01:00
Simon Pilgrim	0bd13e98cb	ParallelCG.h - replace TargetMachine.h include with forward declaration. NFC. We need to include memory.h as it was being implicitly included. Also remove unused llvm::TargetOptions forward declaration.	2020-04-12 12:36:02 +01:00
Chris Lattner	d985b0bf5c	A few cosmetic cleanups to StringMap/StringSet.h, including fixing the indentation of the StringSet.h file and its file comment header, and significantly reduce redundant #includes that are already pulled in transitively. NFC. This is in preparation for a more interesting patch I'll post to phab.	2020-04-11 22:14:35 -07:00
Eli Friedman	4e86e5eedc	[DenseMap] Add assertion that end() iterator isn't derefenced. This would have saved me some time this morning.	2020-04-11 14:53:09 -07:00
Simon Pilgrim	1b76c4cade	ModuleUtils.h - include and forward declaration cleanup. NFC. Replace SmallSet.h include with SmallVector.h - only SmallVector.h is used from inside SmallSet.h Remove llvm::StringRef forward declaration (we include StringRef.h) Remove unused llvm::GlobalVariable forward declaration.	2020-04-11 21:25:53 +01:00
Aaron Puchert	b96b9335ae	Use more LLVM_ENABLE_ABI_BREAKING_CHECKS in Error.h Summary: If we don't read the flag, we don't need to set it. Then we also don't need to filter it out to get the pointer. This should have a (small) positive impact on code size and performance. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D76259	2020-04-11 21:14:18 +02:00
Sanjay Patel	1318ddbc14	[VectorUtils] rename scaleShuffleMask to narrowShuffleMaskElts; NFC As proposed in D77881, we'll have the related widening operation, so this name becomes too vague. While here, change the function signature to take an 'int' rather than 'size_t' for the scaling factor, add an assert for overflow of 32-bits, and improve the documentation comments.	2020-04-11 10:05:49 -04:00
Simon Pilgrim	adb456b8d3	TargetLoweringObjectFileImpl.h - replace MCExpr.h and Module.h includes with forward declarations. NFC. Also remove unused llvm::Mangler forward declaration.	2020-04-11 14:13:39 +01:00
Benjamin Kramer	0292ddc711	[FormatVariadic] Put back return type in an attempt to make GCC happy	2020-04-11 15:11:44 +02:00
Benjamin Kramer	5ef2cb3df4	[FormatVariadic] Reduce allocations - Move Adapters array to the stack, we know the size precisely - Parse format string on demand into a SmallVector. In theory this could lead to parsing it multiple times, but I couldn't find a single instance of that in LLVM. - Make more of the implementation details private.	2020-04-11 14:54:32 +02:00
Nemanja Ivanovic	512600e3c0	[PowerPC] Handle f16 as a storage type only The PPC back end currently crashes (fails to select) with f16 input. This patch expands it on subtargets prior to ISA 3.0 (Power9) and uses the HW conversions on Power9. Fixes https://bugs.llvm.org/show_bug.cgi?id=39865 Differential revision: https://reviews.llvm.org/D68237	2020-04-11 07:34:47 -05:00
Simon Pilgrim	89f6ca05b7	CodeGen/EdgeBundles - move Twine.h include down into EdgeBundles.cpp. NFC. EdgeBundles.h has no use for it.	2020-04-11 12:21:04 +01:00
Simon Pilgrim	9eacd000cb	Local.h - remove unnecessary Twine.h include. NFC.	2020-04-11 12:21:04 +01:00
Simon Pilgrim	142dd80d09	PhiValues.h - remove unused llvm::Use forward declaration. NFC.	2020-04-11 12:21:04 +01:00
Simon Pilgrim	78730a6840	ObjCARCAnalysisUtils.h - remove unused llvm::raw_ostream forward declaration. NFC.	2020-04-11 12:21:04 +01:00
Simon Pilgrim	566359193d	SyntheticCountsUtils.h - remove unused llvm::CallGraph/Function forward declarations. NFC.	2020-04-11 12:21:03 +01:00
Simon Pilgrim	e3b47c5adc	OptimizationRemarkEmitter.h - remove unused llvm::DebugLoc/Loop/Pass/Twine forward declarations. NFC.	2020-04-11 12:21:03 +01:00
Simon Pilgrim	ca23d14fa2	Passes.h - remove unused llvm::LoopPass/Pass/PassInfo forward declarations. NFC.	2020-04-11 12:21:03 +01:00
Simon Pilgrim	6aa85d5214	PredicateInfo.h - remove unused llvm::Instruction/MemoryAccess/LLVMContext forward declarations. NFC.	2020-04-11 11:25:48 +01:00
Simon Pilgrim	595c28e618	IntrinsicLowering.h - remove unused llvm::Module forward declaration. NFC.	2020-04-11 11:25:48 +01:00
Simon Pilgrim	a2519be0e9	ExecutionDomainFix.h - remove unused llvm::MachineBasicBlock forward declaration. NFC.	2020-04-11 11:25:48 +01:00
Simon Pilgrim	fcabd7530f	CallingConvLower.h - remove unused llvm::TargetMachine forward declaration. NFC.	2020-04-11 11:25:48 +01:00
Simon Pilgrim	8340c844ae	Analysis.h - remove unused SDNode/SDValue/SelectionDAG forward declarations. NFC.	2020-04-11 11:25:48 +01:00
Simon Pilgrim	76503cb3e1	[PDB] Remove defunct PDBFileBuilder::commitFpm declaration. NFC.	2020-04-11 09:58:11 +01:00
Craig Topper	9c1842d8af	Change FastISel::CallLoweringInfo::CS to be an ImmutableCallSite instead of a pointer. NFCI. This is the same as what was done to the CallLoweringInfo in TargetLowering.h in r309159. This is just a step on the way to replacing this with CallBase.	2020-04-10 23:45:36 -07:00
Mehdi Amini	ed03d9485e	Revert "[TLI] Per-function fveclib for math library used for vectorization" This reverts commit `60c642e74b`. This patch is making the TLI "closed" for a predefined set of VecLib while at the moment it is extensible for anyone to customize when using LLVM as a library. Reverting while we figure out a way to re-land it without losing the generality of the current API. Differential Revision: https://reviews.llvm.org/D77925	2020-04-11 01:05:01 +00:00
Huihui Zhang	6c989d0248	[BasicAA] Fix aliasGEP/DecomposeGEPExpression for scalable type. Summary: Don't attempt to analyze the decomposed GEP for scalable type. GEP index scale is not compile-time constant for scalable type. Be conservative, return MayAlias. Explicitly call TypeSize::getFixedSize() to assert on places where scalable type doesn't make sense. Add unit tests to check functionality of -basicaa for scalable type. This patch is needed for D76944. Reviewers: sdesmalen, efriedma, spatel, bjope, ctetreau Reviewed By: efriedma Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77828	2020-04-10 16:58:26 -07:00
Daniel Sanders	f71350f05a	Add -debugify-and-strip-all to add debug info before a pass and remove it after Summary: This allows us to test each backend pass under the presence of debug info using pre-existing tests. The tests should not fail as a result of this so long as it's true that debug info does not affect CodeGen. In practice, a few tests are sensitive to this: * Tests that check the pass structure (e.g. O0-pipeline.ll) * Tests that check --debug output. Specifically instruction dumps containing MMO's (e.g. prelegalizercombiner-extends.ll) * Tests that contain debugify metadata as mir-strip-debug will remove it (e.g. fastisel-debugvalue-undef.ll) * Tests with partial debug info (e.g. patchable-function-entry-empty.mir had debug info but no !llvm.dbg.cu) * Tests that check optimization remarks overly strictly (e.g. prologue-epilogue-remarks.mir) * Tests that would inject the pass in an unsafe region (e.g. seqpairspill.mir would inject between register alloc and virt reg rewriter) In all cases, the checks can either be updated or --debugify-and-strip-all-safe=0 can be used to avoid being affected by something like llvm-lit -Dllc='llc --debugify-and-strip-all-safe' I tested this without the lost debug locations verifier to confirm that AArch64 behaviour is unaffected (with the fixes in this patch) and with it to confirm it finds the problems without the additional RUN lines we had before. Depends on D77886, D77887, D77747 Reviewers: aprantl, vsk, bogner Subscribers: qcolombet, kristof.beyls, hiraditya, danielkiss, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77888	2020-04-10 16:36:07 -07:00
Lang Hames	59ed45b483	[ORC] Add an OrcV2 C API function for configuring TargetMachines.	2020-04-10 15:51:29 -07:00
Daniel Sanders	dfca98d6a8	[mir-strip-debug] Optionally preserve debug info that wasn't from debugify/mir-debugify Summary: A few tests start out with debug info and expect it to reach the output. For these tests we shouldn't strip the debug info Reviewers: aprantl, vsk, bogner Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77886	2020-04-10 15:24:14 -07:00
Christopher Tetreault	889f6606ed	Clean up usages of asserting vector getters in Type Summary: Remove usages of asserting vector getters in Type in preparation for the VectorType refactor. The existence of these functions complicates the refactor while adding little value. Reviewers: stoklund, sdesmalen, efriedma Reviewed By: sdesmalen Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77272	2020-04-10 14:53:43 -07:00
Christopher Tetreault	40ed21bb71	Clean up usages of asserting vector getters in Type Summary: Remove usages of asserting vector getters in Type in preparation for the VectorType refactor. The existence of these functions complicates the refactor while adding little value. Reviewers: dexonsmith, sdesmalen, efriedma Reviewed By: efriedma Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77276	2020-04-10 14:18:47 -07:00
Daniel Sanders	c162bc2aed	Make TargetPassConfig and llc add pre/post passes the same way. NFC Summary: At the moment, any changes we make to the passes that can be injected before/after others (e.g. -verify-machineinstrs and -print-after-all) have to be duplicated in both TargetPassConfig (for normal execution, -start-before/ -stop-before/etc) and llc (for -run-pass). Unify this pass injection into addMachinePrePass/addMachinePostPass that both TargetPassConfig and llc can use. Reviewers: vsk, aprantl, bogner Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77887	2020-04-10 13:46:53 -07:00
Lang Hames	92210cb26f	[ORC] Use CodeGenOpt::Default as the default level in JITTargetMachineBuilder. This matches EngineBuilder/MCJIT's default.	2020-04-10 13:25:48 -07:00
Marcello Maggioni	ea11f4726f	Split LiveRangeCalc in LiveRangeCalc/LiveIntervalCalc. NFC Summary: Refactor LiveRangeCalc such that it is now split into two classes The objective is to split all the "register specific" logic away from LiveRangeCalc. The two new classes created are: - LiveRangeCalc - is meant as a generic class to compute and modify live ranges in a generic way. This class should deal only with SlotIndices and VNInfo objects. - LiveIntervalCals - is meant to be equivalent to the old LiveRangeCalc. It computes the liveness virtual registers tracked by a LiveInterval object. With this refactoring LiveRangeCalc can be used to implement tracking of liveness of LiveRanges that represent other things than just registers. Subscribers: MatzeB, qcolombet, mgorny, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76584	2020-04-10 11:26:21 -07:00
Matt Morehouse	bef187c750	Implement `-fsanitize-coverage-whitelist` and `-fsanitize-coverage-blacklist` for clang Summary: This commit adds two command-line options to clang. These options let the user decide which functions will receive SanitizerCoverage instrumentation. This is most useful in the libFuzzer use case, where it enables targeted coverage-guided fuzzing. Patch by Yannis Juglaret of DGA-MI, Rennes, France libFuzzer tests its target against an evolving corpus, and relies on SanitizerCoverage instrumentation to collect the code coverage information that drives corpus evolution. Currently, libFuzzer collects such information for all functions of the target under test, and adds to the corpus every mutated sample that finds a new code coverage path in any function of the target. We propose instead to let the user specify which functions' code coverage information is relevant for building the upcoming fuzzing campaign's corpus. To this end, we add two new command line options for clang, enabling targeted coverage-guided fuzzing with libFuzzer. We see targeted coverage guided fuzzing as a simple way to leverage libFuzzer for big targets with thousands of functions or multiple dependencies. We publish this patch as work from DGA-MI of Rennes, France, with proper authorization from the hierarchy. Targeted coverage-guided fuzzing can accelerate bug finding for two reasons. First, the compiler will avoid costly instrumentation for non-relevant functions, accelerating fuzzer execution for each call to any of these functions. Second, the built fuzzer will produce and use a more accurate corpus, because it will not keep the samples that find new coverage paths in non-relevant functions. The two new command line options are `-fsanitize-coverage-whitelist` and `-fsanitize-coverage-blacklist`. They accept files in the same format as the existing `-fsanitize-blacklist` option <https://clang.llvm.org/docs/SanitizerSpecialCaseList.html#format>. The new options influence SanitizerCoverage so that it will only instrument a subset of the functions in the target. We explain these options in detail in `clang/docs/SanitizerCoverage.rst`. Consider now the woff2 fuzzing example from the libFuzzer tutorial <https://github.com/google/fuzzer-test-suite/blob/master/tutorial/libFuzzerTutorial.md>. We are aware that we cannot conclude much from this example because mutating compressed data is generally a bad idea, but let us use it anyway as an illustration for its simplicity. Let us use an empty blacklist together with one of the three following whitelists: ``` # (a) src:* fun:* # (b) src:SRC/* fun:* # (c) src:SRC/src/woff2_dec.cc fun:* ``` Running the built fuzzers shows how many instrumentation points the compiler adds, the fuzzer will output //XXX PCs//. Whitelist (a) is the instrument-everything whitelist, it produces 11912 instrumentation points. Whitelist (b) focuses coverage to instrument woff2 source code only, ignoring the dependency code for brotli (de)compression; it produces 3984 instrumented instrumentation points. Whitelist (c) focuses coverage to only instrument functions in the main file that deals with WOFF2 to TTF conversion, resulting in 1056 instrumentation points. For experimentation purposes, we ran each fuzzer approximately 100 times, single process, with the initial corpus provided in the tutorial. We let the fuzzer run until it either found the heap buffer overflow or went out of memory. On this simple example, whitelists (b) and (c) found the heap buffer overflow more reliably and 5x faster than whitelist (a). The average execution times when finding the heap buffer overflow were as follows: (a) 904 s, (b) 156 s, and (c) 176 s. We explain these results by the fact that WOFF2 to TTF conversion calls the brotli decompression algorithm's functions, which are mostly irrelevant for finding bugs in WOFF2 font reconstruction but nevertheless instrumented and used by whitelist (a) to guide fuzzing. This results in longer execution time for these functions and a partially irrelevant corpus. Contrary to whitelist (a), whitelists (b) and (c) will execute brotli-related functions without instrumentation overhead, and ignore new code paths found in them. This results in faster bug finding for WOFF2 font reconstruction. The results for whitelist (b) are similar to the ones for whitelist (c). Indeed, WOFF2 to TTF conversion calls functions that are mostly located in SRC/src/woff2_dec.cc. The 2892 extra instrumentation points allowed by whitelist (b) do not tamper with bug finding, even though they are mostly irrelevant, simply because most of these functions do not get called. We get a slightly faster average time for bug finding with whitelist (b), which might indicate that some of the extra instrumentation points are actually relevant, or might just be random noise. Reviewers: kcc, morehouse, vitalybuka Reviewed By: morehouse, vitalybuka Subscribers: pratyai, vitalybuka, eternalsakura, xwlin222, dende, srhines, kubamracek, #sanitizers, lebedev.ri, hiraditya, cfe-commits, llvm-commits Tags: #clang, #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D63616	2020-04-10 10:44:03 -07:00
Matt Arsenault	4593e4131a	AMDGPU: Teach toolchain to link rocm device libs Currently the library is separately linked, but this isn't correct to implement fast math flags correctly. Each module should get the version of the library appropriate for its combination of fast math and related flags, with the attributes propagated into its functions and internalized. HIP already maintains the list of libraries, but this is not used for OpenCL. Unfortunately, HIP uses a separate --hip-device-lib argument, despite both languages using the same bitcode library. Eventually these two searches need to be merged. An additional problem is there are 3 different locations the libraries are installed, depending on which build is used. This also needs to be consolidated (or at least the search logic needs to deal with this unnecessary complexity).	2020-04-10 13:37:32 -04:00
David Blaikie	67a2cc80b6	Fix a few mismatched iterator types revealed from a libc++ + LLVM_EXPENSIVE_CHECKS build These were accidental SCARY iterator uses that weren't guaranteed and in libc++'s debug checking mode were actually distinct types. Use decltype to make it easier to keep these things up to date.	2020-04-10 10:12:51 -07:00
Kevin P. Neal	7f38812d5b	[FPEnv][AArch64] Platform-specific builtin constrained FP enablement When constrained floating point is enabled the AArch64-specific builtins don't use constrained intrinsics in some cases. Fix that. Neon is part of this patch, so ARM is affected as well. Differential Revision: https://reviews.llvm.org/D77074	2020-04-10 13:02:00 -04:00
Simon Pilgrim	c3db138795	TargetOptions.h - remove unused llvm::Module forward declaration. NFC.	2020-04-10 17:36:03 +01:00
Simon Pilgrim	123e0779e5	TargetLoweringObjectFile.h - remove unnecessary ArrayRef.h include. NFC	2020-04-10 17:36:03 +01:00
Simon Pilgrim	7f90af1375	[Orc] Speculation.h - remove unnecessary ArrayRef.h include. NFC	2020-04-10 17:05:07 +01:00
Simon Pilgrim	a88cc20456	ProfileSummaryInfo.h - remove unnecessary includes. NFC Remove a number of includes that aren't necessary (nor are we relying on the remaining includes to provide the declarations), we just needed a llvm::Instruction forward declaration. This exposed a couple of source files that were implicitly replying on the includes for their use of llvm::SmallSet or std::set, requiring local includes to be added there instead.	2020-04-10 16:25:48 +01:00
Simon Pilgrim	43882d9365	PromoteMemToReg.h - remove unused llvm::AliasSetTracker forward declaration. NFC.	2020-04-10 15:47:57 +01:00
Simon Pilgrim	6be9f1931c	SimplifyLibCalls.h - remove unused llvm::BasicBlock forward declaration. NFC.	2020-04-10 15:47:57 +01:00
Simon Pilgrim	75d02a2abf	VNCoercion.h - remove unused llvm::Function forward declaration. NFC.	2020-04-10 15:47:57 +01:00
Simon Pilgrim	8a33920f65	SizeOpts.h - remove ProfileSummaryInfo forward declaration. NFC. We're include the entire ProfileSummaryInfo.h as inline functions use it in the header.	2020-04-10 15:47:56 +01:00
Mehdi Amini	bbeeb35c1f	Revert "[DomTree] Replace ChildrenGetter with GraphTraits over GraphDiff." This reverts commit `0445c64998`. MLIR Build is broken by this change at the moment.	2020-04-10 07:44:06 +00:00
Alina Sbirlea	0445c64998	[DomTree] Replace ChildrenGetter with GraphTraits over GraphDiff. This replaces the ChildrenGetter inside the DominatorTree with GraphTraits over a GraphDiff object, an object which encapsulated the view of the previous CFG. This also simplifies the extentions in clang which use DominatorTree, as GraphDiff also filters nullptrs. Re-land `a90374988e` after moving CFGDiff.h to Support. Differential Revision: https://reviews.llvm.org/D77341	2020-04-10 07:38:53 +00:00
David Blaikie	a838aadae3	Move CFGDiff.h from IR to Support Now that it's generalized to use graph traits, it's no longer dependent on IR.	2020-04-10 00:14:10 -07:00
Mehdi Amini	57d2d48399	Revert "[DomTree] Replace ChildrenGetter with GraphTraits over GraphDiff." This reverts commit `a90374988e` and `5da1671bf8`. A new dependency is introduced here from Support to IR which seems like a layering violation. It also breaks the MLIR build at the moment.	2020-04-10 06:27:59 +00:00
John McCall	8423a6f363	Rename OptimalLayout to OptimizedStructLayout at Chris's request.	2020-04-10 00:14:20 -04:00
Serguei Katkov	4275eb1331	Re-land [Codegen/Statepoint] Allow usage of registers for non gc deopt values. The change introduces the usage of physical registers for non-gc deopt values. This require runtime support to know how to take a value from register. By default usage is off and can be switched on by option. The change also introduces additional fix-up patch which forces the spilling of caller saved registers (clobbered after the call) and re-writes statepoint to use spill slots instead of caller saved registers. Reviewers: reames, danstrushin Reviewed By: dantrushin Subscribers: mgorny, hiraditya, mgrang, llvm-commits Differential Revision: https://reviews.llvm.org/D77797	2020-04-10 10:13:39 +07:00
Alina Sbirlea	5da1671bf8	Update empty() call with namespace to clear ambiguity.	2020-04-09 18:42:10 -07:00
Brad Moody	27f1895f53	Make BitVector::operator== return false for different-sized vectors. This behaviour is in line with SmallBitVector and other vector-like types. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D77027	2020-04-09 20:28:55 -05:00
Wenlei He	60c642e74b	[TLI] Per-function fveclib for math library used for vectorization Summary: Encode `-fveclib` setting as per-function attribute so it can threaded through to LTO backends. Accordingly per-function TLI now reads the attributes and select available vector function list based on that. Now we also populate function list for all supported vector libraries for the shared per-module `TargetLibraryInfoImpl`, so each function can select its available vector list independently but without duplicating the vector function lists. Inlining between incompatbile vectlib attributed is also prohibited now. Subscribers: hiraditya, dexonsmith, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77632	2020-04-09 18:26:38 -07:00
Alina Sbirlea	a90374988e	[DomTree] Replace ChildrenGetter with GraphTraits over GraphDiff. Summary: This replaces the ChildrenGetter inside the DominatorTree with GraphTraits over a GraphDiff object, an object which encapsulated the view of the previous CFG. This also simplifies the extentions in clang which use DominatorTree, as GraphDiff also filters nullptrs. Reviewers: kuhar, dblaikie, NutshellySima Subscribers: hiraditya, cfe-commits, llvm-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77341	2020-04-09 18:08:39 -07:00
Lang Hames	37bcf2df01	[ORC] Require JITDylib to be specified when adding IR and objects in the C API.	2020-04-09 17:59:26 -07:00
Francesco Petrogalli	c846d2682b	[llvm][Codegen] Make `getVectorTypeBreakdownMVT` work with scalable types. Reviewers: efriedma, andwar, sdesmalen Reviewed By: efriedma Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77434	2020-04-10 00:48:27 +01:00
Lang Hames	0d5f15f700	[ORC] Add C API support for adding object files to an LLJIT instance.	2020-04-09 16:18:46 -07:00
Lang Hames	1cd8493e69	[ORC] Expand the OrcV2 C API bindings. Adds basic support for LLJITBuilder and DynamicLibrarySearchGenerator. This allows C API clients to configure LLJIT to expose process symbols to JIT'd code. An example of this is added in llvm/examples/OrcV2CBindingsReflectProcessSymbols.	2020-04-09 16:18:46 -07:00
Daniel Sanders	a79b2fc44b	Add pass to strip debug info from MIR Summary: Removes: * All LLVM-IR level debug info using StripDebugInfo() * All debugify metadata * 'Debug Info Version' module flag * All (valid) DEBUG_VALUE MachineInstrs All DebugLocs from MachineInstrs This is a more complete solution than the previous MIRPrinter option that just causes it to neglect to print debug-locations. * The qualifier 'valid' is used here because AArch64 emits an invalid one and tests depend on it Reviewers: vsk, aprantl, bogner Subscribers: mgorny, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77747	2020-04-09 15:44:38 -07:00
Mircea Trofin	655aa1ae4a	[llvm][NFC] Replace CallSite with CallBase in Inliner Summary: Almost all uses are replaced. Left FIXMEs for the two sites that require refactoring outside of Inliner, to scope this patch. Subscribers: eraman, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77817	2020-04-09 15:01:58 -07:00
Christopher Tetreault	994d84b919	[NFC] Make Type::isVectorTy call isa<VectorType> Reviewers: sdesmalen, efriedma, dexonsmith Reviewed By: efriedma Subscribers: tschuett, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77425	2020-04-09 14:15:43 -07:00
Eli Friedman	836ce9db7f	[opaque pointer types] Remove deprecated Instruction/IRBuilder APIs. Removes deprecated overloads of LoadInst constructor, CallInst::create, InvokeInst::Create, IRBuilder::CreateCall, IRBuilder::CreateInvoke. (Leaving around deprecated IRBuilder::CreateLoad for now.) Differential Revision: https://reviews.llvm.org/D76269	2020-04-09 13:42:36 -07:00
Christopher Tetreault	b96558f5e5	Clean up usages of asserting vector getters in Type Summary: Remove usages of asserting vector getters in Type in preparation for the VectorType refactor. The existence of these functions complicates the refactor while adding little value. Reviewers: sunfish, sdesmalen, efriedma Reviewed By: efriedma Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77273	2020-04-09 12:41:28 -07:00
Stefan Pintilie	75828ef615	[PowerPC][Future] Initial support for PCRel addressing for constant pool loads Add initial support for PC Relative addressing for constant pool loads. This includes adding a new relocation for @pcrel and adding a new PowerPC flag to identify PC relative addressing. Differential Revision: https://reviews.llvm.org/D74486	2020-04-09 11:17:23 -05:00
Serguei Katkov	44f0d7f136	Revert "[Codegen/Statepoint] Allow usage of registers for non gc deopt values." This reverts commit `a0275705bb`. It causes buildbot failures building LLVM with BUILD_SHARED_LIBS due to a linker error.	2020-04-09 18:24:47 +07:00
Serguei Katkov	a0275705bb	[Codegen/Statepoint] Allow usage of registers for non gc deopt values. The change introduces the usage of physical registers for non-gc deopt values. This require runtime support to know how to take a value from register. By default usage is off and can be switched on by option. The change also introduces additional fix-up patch which forces the spilling of caller saved registers (clobbered after the call) and re-writes statepoint to use spill slots instead of caller saved registers. Reviewers: reames, dantrushin Reviewed By: reames, dantrushin Subscribers: mgorny, hiraditya, mgrang, llvm-commits Differential Revision: https://reviews.llvm.org/D77371	2020-04-09 16:57:35 +07:00
Jay Foad	9c7bd94ce8	Fix typo in comment	2020-04-09 10:36:00 +01:00
Jay Foad	c63aed890e	[KnownBits] Move AND, OR and XOR logic into KnownBits Summary: There are at least three clients for KnownBits calculations: ValueTracking, SelectionDAG and GlobalISel. To reduce duplication the common logic should be moved out of these clients and into KnownBits itself. This patch does this for AND, OR and XOR calculations by implementing and using appropriate operator overloads KnownBits::operator& etc. Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74060	2020-04-09 10:10:37 +01:00
Nikita Popov	8f66f25f52	[RDA] Avoid negative shift UB Avoid "left shift of negative value -1" ubsan errors by casting to uintptr_t before performing the shift, rather than after.	2020-04-09 09:47:15 +02:00
Serge Pavlov	c7ff5b38f2	[FPEnv] Use single enum to represent rounding mode Now compiler defines 5 sets of constants to represent rounding mode. These are: 1. `llvm::APFloatBase::roundingMode`. It specifies all 5 rounding modes defined by IEEE-754 and is used in `APFloat` implementation. 2. `clang::LangOptions::FPRoundingModeKind`. It specifies 4 of 5 IEEE-754 rounding modes and a special value for dynamic rounding mode. It is used in clang frontend. 3. `llvm::fp::RoundingMode`. Defines the same values as `clang::LangOptions::FPRoundingModeKind` but in different order. It is used to specify rounding mode in in IR and functions that operate IR. 4. Rounding mode representation used by `FLT_ROUNDS` (C11, 5.2.4.2.2p7). Besides constants for rounding mode it also uses a special value to indicate error. It is convenient to use in intrinsic functions, as it represents platform-independent representation for rounding mode. In this role it is used in some pending patches. 5. Values like `FE_DOWNWARD` and other, which specify rounding mode in library calls `fesetround` and `fegetround`. Often they represent bits of some control register, so they are target-dependent. The same names (not values) and a special name `FE_DYNAMIC` are used in `#pragma STDC FENV_ROUND`. The first 4 sets of constants are target independent and could have the same numerical representation. It would simplify conversion between the representations. Also now `clang::LangOptions::FPRoundingModeKind` and `llvm::fp::RoundingMode` do not contain the value for IEEE-754 rounding direction `roundTiesToAway`, although it is supported natively on some targets. This change defines all the rounding mode type via one `llvm::RoundingMode`, which also contains rounding mode for IEEE rounding direction `roundTiesToAway`. Differential Revision: https://reviews.llvm.org/D77379	2020-04-09 13:26:47 +07:00
Pratyai Mazumder	e8d1c6529b	[SanitizerCoverage] sancov/inline-bool-flag instrumentation. Summary: New SanitizerCoverage feature `inline-bool-flag` which inserts an atomic store of `1` to a boolean (which is an 8bit integer in practice) flag on every instrumented edge. Implementation-wise it's very similar to `inline-8bit-counters` features. So, much of wiring and test just follows the same pattern. Reviewers: kcc, vitalybuka Reviewed By: vitalybuka Subscribers: llvm-commits, hiraditya, jfb, cfe-commits, #sanitizers Tags: #clang, #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D77244	2020-04-08 22:43:52 -07:00
WangTianQing	a3dc949000	[X86] Add TSXLDTRK instructions. Summary: For more details about these instructions, please refer to the latest ISE document: https://software.intel.com/en-us/download/intel-architecture-instruction-set-extensions-programming-reference Reviewers: craig.topper, RKSimon, LuoYuanke Reviewed By: craig.topper Subscribers: mgorny, hiraditya, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D77205	2020-04-09 13:17:29 +08:00
Lang Hames	5877d6f5f4	[ORC] Make mangling convenience methods part of the public API of LLJIT. This saves clients from having to manually construct a MangleAndInterner.	2020-04-08 20:20:13 -07:00
Matt Arsenault	0aa0d70067	MIR: Use Register	2020-04-08 22:07:26 -04:00
Sam Clegg	7baad0c53c	[WebAssembly][MC] Use StringRef over std::string pointer This is followup based on feedback on `5be42f36f5`. See: https://reviews.llvm.org/D77627. Differential Revision: https://reviews.llvm.org/D77674	2020-04-08 18:28:08 -07:00
Johannes Doerfert	0985554b70	[Attributor][NFC] Split AbstractAttributes out of Attributor.cpp Attributor.cpp became quite big and we need to start provide structure. The Attributor code is now in Attributor.cpp and the classes derived from AbstractAttribute are in AttributorAttributes.cpp. Minor changes were required but no intended functional changes. We also minimized includes as part of this. Reviewed By: baziotis Differential Revision: https://reviews.llvm.org/D76873	2020-04-08 19:02:14 -05:00
Amara Emerson	befc788cfa	GlobalISel: Add a setInstrAndDebugLoc(MachineInstr&) convenience helper to MachineIRBuilder. NFC. This saves doing two separate calls to set the Instr and DebugLoc from an existing MI.	2020-04-08 14:38:33 -07:00
Matt Arsenault	e49e33b610	CodeGen: Use Register in MachineInstrBuilder	2020-04-08 17:03:53 -04:00
River Riddle	400ad6f95d	[mlir] Eliminate the remaining usages of cl::opt instead of PassOption. Summary: Pass options are a better choice for various reasons and avoid the need for static constructors. Differential Revision: https://reviews.llvm.org/D77707	2020-04-08 13:05:08 -07:00
Kirill Naumov	8b67853a83	[CFGPrinter] Adding heat coloring to CFGPrinter This patch introduces the heat coloring of the Control Flow Graph which is based on the relative "hotness" of each BB. The patch is a part of sequence of three patches, related to graphs Heat Coloring. Reviewers: rcorcs, apilipenko, davidxl, sfertile, fedor.sergeev, eraman, bollu Differential Revision: https://reviews.llvm.org/D77161	2020-04-08 19:59:51 +00:00
Matt Arsenault	c42cc7fd24	CodeGen: Use Register in MachineSSAUpdater	2020-04-08 14:29:01 -04:00
Matt Arsenault	586769cce2	DAG: Use Register	2020-04-08 13:44:31 -04:00
Nikita Popov	4007e5f32e	[RDA] Try to fix build, again (NFC)	2020-04-08 18:30:53 +02:00
Nikita Popov	b5d941ea58	[RDA] Try to fix build (NFC) fuchsia-x86_64-linux builder fails with: /b/fuchsia-x86_64-linux/llvm.src/llvm/include/llvm/ADT/TinyPtrVector.h:85:15: error: no matching conversion for C-style cast from 'nullptr_t' to 'llvm::ReachingDef' RHS.Val = (EltTy)nullptr; Let's see whether adding an explicit nullptr_t constructor helps.	2020-04-08 18:15:01 +02:00
Matt Arsenault	dcce3ef1d2	FastISel: Partially use Register Doesn't try to convert the cases that depend on generated code.	2020-04-08 12:10:58 -04:00
Matt Arsenault	7a46e36d51	CodeGen: Use Register more in CallLowering Some of these MCPhysReg uses should probably be MCRegister, but right now this would require more invasive changes.	2020-04-08 12:10:58 -04:00
Matt Arsenault	ca0ace7298	CodeGen: Use Register in MachineBasicBlock	2020-04-08 12:10:58 -04:00
Matt Arsenault	84aa58cbe2	CodeGen: Use Register in TargetLowering	2020-04-08 12:10:58 -04:00
Kirill Naumov	0125db9ab2	[TimePasses] Small fix in "-time-passes" flag that makes it more stable Adds StringMap for TimingData. Differential Revision: https://reviews.llvm.org/D76946 Reviewed By: fedor.sergeev	2020-04-08 15:59:45 +00:00
Nikita Popov	952c274159	[RDA] Use TinyPtrVector to store reaching defs (NFCI) RDA currently uses SmallVector<int, 1> to store reaching definitions. A SmallVector<int, 1> is 24 bytes large, and X86 currently has 164 register units, which means we need 3936 bytes per block. If you have a large function with 1000 blocks, that's already 4MB. A large fraction of these reg units will not have any reaching defs (say, those corresponding to zmm registers), and many will have just one. A TinyPtrVector serves this use-case much better, as it only needs 8 bytes per register if it has 0 or 1 reaching defs. As the name implies, TinyPtrVector is designed to work with pointers, so we need to add some boilerplate to treat our reaching def integers as pointers, using an appropriate encoding. We need to keep the low bit free for tagging, and make sure at least one bit is set to distinguish the null pointer. Differential Revision: https://reviews.llvm.org/D77513	2020-04-08 17:46:24 +02:00
Stefan Pintilie	6c4b40def7	[PowerPC][Future] Add Support For Functions That Do Not Use A TOC. On PowerPC most functions require a valid TOC pointer. This is the case because either the function itself needs to use this pointer to access the TOC or because other functions that are called from that function expect a valid TOC pointer in the register R2. The main exception to this is leaf functions that do not access the TOC since they are guaranteed not to need a valid TOC pointer. This patch introduces a feature that will allow more functions to not require a valid TOC pointer in R2. Differential Revision: https://reviews.llvm.org/D73664	2020-04-08 08:07:35 -05:00

... 2 3 4 5 6 ...

40513 Commits