llvm-project

Commit Graph

Author	SHA1	Message	Date
spupyrev	81aedab7dd	introducing some profi flags Differential Revision: https://reviews.llvm.org/D120508	2022-03-08 12:35:15 -08:00
Lang Hames	151f809c55	[JITLink] Demote symbol scope to local during external-to-absolute conversion. When an external symbol is converted to an absolute it should be demoted to local scope so that the symbol does not become a new definition within this LinkGraph.	2022-03-08 10:31:20 -08:00
eopXD	550b2eaaa6	[RISCV] Add combination crypto extensions in ISAInfo The crypto extension have several shorthand extensions that don't consist of any extra instructions. Take `zk` for example, while the extension would imply `zkn, zkr, zkt`. The 3 extensions should also combine back into `zk` to maintain the canonical order in isa strings. This patch addresses the above. Reviewed By: VincentWu Differential Revision: https://reviews.llvm.org/D119530	2022-03-08 09:52:38 -08:00
Philip Reames	8e06058bfe	[MSSA] Add clarifying comment for isOptimized on MemoryUse [nfc]	2022-03-08 09:43:16 -08:00
Philip Reames	52f7578489	[MSSA] Add comments describing optimized uses for MemoryDefs [nfc] As clarified by a recent email chain with Alina.	2022-03-08 09:39:11 -08:00
Hongtao Yu	50bc945a8f	[CSSPGO][SCCIterator] Fix a non-determinism in scc_member_iterator Previously we initialed the work queue with MST roots based on NodeInfoMap which is an unordered map. This could cause a non-determinism. I'm fixing this by initializing the queue based on SortedEdges. I don't see any performance move with this change. However this helps debugging. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D120670	2022-03-08 09:08:47 -08:00
Arthur Eubanks	53e5e58670	[NewPM][Inliner] Make inlined calls to functions in same SCC as callee exponentially expensive Introduce a new attribute "function-inline-cost-multiplier" which multiplies the inline cost of a call site (or all calls to a callee) by the multiplier. When processing the list of calls created by inlining, check each call to see if the new call's callee is in the same SCC as the original callee. If so, set the "function-inline-cost-multiplier" attribute of the new call site to double the original call site's attribute value. This does not happen when the original call site is intra-SCC. This is an alternative to D120584, which marks the call sites as noinline. Hopefully fixes PR45253. Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D121084	2022-03-07 23:51:09 -08:00
Arthur Eubanks	79a1f3e7c6	[NFC] Cleanup StandardInstrumentations	2022-03-07 16:24:36 -08:00
Stanislav Mekhanoshin	932f628121	[AMDGPU] new gfx940 fp atomics Differential Revision: https://reviews.llvm.org/D121028	2022-03-07 12:32:02 -08:00
Mitch Phillips	73df82572a	[MTE] Add NT_ANDROID_TYPE_MEMTAG This ELF note is aarch64 and Android-specific. It specifies to the dynamic loader that specific work should be scheduled to enable MTE protection of stack and heap regions. Current synthesis of the ".note.android.memtag" ELF note is done in the Android build system. We'd like to move that to the compiler, and this is the first step. Reviewed By: MaskRay, jhenderson Differential Revision: https://reviews.llvm.org/D119381	2022-03-07 11:28:56 -08:00
Craig Topper	845bfcede1	[RISCV] Rename 'SplatOperand' to 'ScalarOperand'. NFC vslide1up/down have this flag set, but the value isn't a splat. Rename for clarity. Reviewed By: khchen Differential Revision: https://reviews.llvm.org/D121037	2022-03-07 11:28:32 -08:00
Richard Howell	5917219438	[llvm] remove empty __LLVM segment in llvm-bitcode-strip When running llvm-bitcode-strip we want to remove the __LLVM segment as well as the __bundle section when there are no other sections in the segment. Differential Revision: https://reviews.llvm.org/D120927	2022-03-07 08:52:25 -08:00
Florian Hahn	542c335159	[ConstraintElimination] Remove dead variables when dropping constraints. This patch extends ConstraintElimination to also remove dead variables when removing a constraint. When a constraint is removed because it is out of scope, all new variables added for this constraint can also be removed. This keeps the total size of the systems much smaller, because it reduces the number of variables drastically. It also fixes a bug where variables where removed incorrectly. Fixes https://github.com/llvm/llvm-project/issues/54228	2022-03-07 09:04:07 +00:00
Simon Moll	5f62156762	[VP] Introducing VectorBuilder, the VP intrinsic builder VectorBuilder wraps around an IRBuilder and VectorBuilder::createVectorInstructions emits VP intrinsics as if they were regular instructions. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D105283	2022-03-07 10:02:07 +01:00
Nikita Popov	d1e880acaa	[SCEV] Enable verification in LoopPM Currently, we hardly ever actually run SCEV verification, even in tests with -verify-scev. This is because the NewPM LPM does not verify SCEV. The reason for this is that SCEV verification can actually change the result of subsequent SCEV queries, which means that you see different transformations depending on whether verification is enabled or not. To allow verification in the LPM, this limits verification to BECounts that have actually been cached. It will not calculate new BECounts. BackedgeTakenInfo::getExact() is still not entirely readonly, it still calls getUMinFromMismatchedTypes(). But I hope that this is not problematic in the same way. (This could be avoided by performing the umin in the other SCEV instance, but this would require duplicating some of the code.) Differential Revision: https://reviews.llvm.org/D120551	2022-03-07 09:46:20 +01:00
Johannes Doerfert	5af11ec34b	[Attributor] Determine potentially loaded values through memory We already look through memory to determine where a value that is stored might pop up again (potential copies). This patch introduces the other direction with similar logic. If a value is loaded, we can follow all the accesses to the pointer (or better object) and try to determine what value might have been stored.	2022-03-06 23:26:37 -06:00
Johannes Doerfert	ad26e199ff	[Attributor] Use CFG reasoning also for read accesses With D106397 we used CFG reasoning to filter out writes that will not interfere with a given load instruction. With this patch we use the same logic (modulo the reversal in reachability check order) for store instructions. As an example, we can now proof stores to shared memory are dead if all the loads of the shared memory are not reachable from them.	2022-03-06 23:26:22 -06:00
Qiu Chaofan	b2497e5435	[PowerPC] Add generic fnmsub intrinsic Currently in Clang, we have two types of builtins for fnmsub operation: one for float/double vector, they'll be transformed into IR operations; one for float/double scalar, they'll generate corresponding intrinsics. But for the vector version of builtin, the 3 op chain may be recognized as expensive by some passes (like early cse). We need some way to keep the fnmsub form until code generation. This patch introduces ppc.fnmsub.* intrinsic to unify four fnmsub intrinsics. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D116015	2022-03-07 13:00:06 +08:00
Johannes Doerfert	efedf70aa5	[Attributor][NFC] Expose helper with more generic interface This simply makes the function argument of the `Attributor::checkForAllInstructions` helper explicit so one can iterate over instructions in other functions.	2022-03-06 19:59:23 -06:00
William S. Moses	87ec6f41bb	[OpenMPIRBuilder] Allocate temporary at the correct block in a nested parallel The OpenMPIRBuilder has a bug. Specifically, suppose you have two nested openmp parallel regions (writing with MLIR for ease) ``` omp.parallel { %a = ... omp.parallel { use(%a) } } ``` As OpenMP only permits pointer-like inputs, the builder will wrap all of the inputs into a stack allocation, and then pass this allocation to the inner parallel. For example, we would want to get something like the following: ``` omp.parallel { %a = ... %tmp = alloc store %tmp[] = %a kmpc_fork(outlined, %tmp) } ``` However, in practice, this is not what currently occurs in the context of nested parallel regions. Specifically to the OpenMPIRBuilder, the entirety of the function (at the LLVM level) is currently inlined with blocks marking the corresponding start and end of each region. ``` entry: ... parallel1: %a = ... ... parallel2: use(%a) ... endparallel2: ... endparallel1: ... ``` When the allocation is inserted, it presently inserted into the parent of the entire function (e.g. entry) rather than the parent allocation scope to the function being outlined. If we were outlining parallel2, the corresponding alloca location would be parallel1. This causes a variety of bugs, including https://github.com/llvm/llvm-project/issues/54165 as one example. This PR allows the stack allocation to be created at the correct allocation block, and thus remedies such issues. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D121061	2022-03-06 18:34:25 -05:00
Benjamin Kramer	4d2669002e	[YAML] Simplify code a bit. NFC.	2022-03-05 22:24:35 +01:00
Arthur Eubanks	f909aed671	Revert "[SCEV] Infer ranges for SCC consisting of cycled Phis" This reverts commit `fc539b0004`. Causes miscompiles, see D110620.	2022-03-04 19:52:44 -08:00
Augie Fackler	e1895a46dc	OpenMP: add allocsize(0) attribute to __kmpc_alloc_shared This is the second step in obviating two columns about allocation functions in MemoryBuiltins.cpp. Differential Revision: https://reviews.llvm.org/D119583	2022-03-04 16:26:03 -05:00
Augie Fackler	dba73135c8	getAllocAlignment: respect allocalign attribute if present As with allocsize(), we prefer the table data to attributes. Differential Revision: https://reviews.llvm.org/D118263	2022-03-04 15:57:54 -05:00
Augie Fackler	d664c4b73c	Attributes: add a new allocalign attribute This will let us start moving away from hard-coded attributes in MemoryBuiltins.cpp and put the knowledge about various attribute functions in the compilers that emit those calls where it probably belongs. Differential Revision: https://reviews.llvm.org/D117921	2022-03-04 15:57:53 -05:00
Snehasish Kumar	11314f4059	[memprof] Filter out callstack frames which cannot be symbolized. This patch filters out callstack frames which can't be symbolized or if the frames belong to the runtime. Symbolization may not be possible if debug information is unavailable or if the addresses are from a shared library. For now we only support optimization of the main binary which is statically linked to the compiler runtime. Differential Revision: https://reviews.llvm.org/D120860	2022-03-04 11:10:08 -08:00
Augie Fackler	5e4c75db3b	InstructionCombining: avoid eliding mismatched alloc/free pairs Prior to this change LLVM would happily elide a call to any allocation function and a call to any free function operating on the same unused pointer. This can cause problems in some obscure cases, for example if the body of operator::new can be inlined but the body of operator::delete can't, as in this example from jyknight: #include <stdlib.h> #include <stdio.h> int allocs = 0; void operator new(size_t n) { allocs++; void mem = malloc(n); if (!mem) abort(); return mem; } __attribute__((noinline)) void operator delete(void mem) noexcept { allocs--; free(mem); } void deleteit(inti) { delete i; } int main() { int*i = new int; deleteit(i); if (allocs != 0) printf("MEMORY LEAK! allocs: %d\n", allocs); } This patch addresses the issue by introducing the concept of an allocator function family and uses it to make sure that alloc/free function pairs are only removed if they're in the same family. Differential Revision: https://reviews.llvm.org/D117356	2022-03-04 10:41:10 -05:00
Nathan Sidwell	64221645a8	[demangler] Make OutputBuffer non-copyable In addressing the buffer ownership API, I discovered a rogue member function that returned by value rather than by reference. It clearly intended to return by reference, but because the copy ctor wasn't deleted this wasn't caught. It is not necessary to make this a move-only type, although that would be an alternative. Reviewed By: bruno Differential Revision: https://reviews.llvm.org/D120901	2022-03-04 04:43:37 -08:00
River Riddle	81f2f4dfb2	[PDLL] Add support for tablegen includes and importing ODS information This commit adds support for processing tablegen include files, and importing various information from ODS. This includes operations, attribute+type constraints, attribute/operation/type interfaces, etc. This will allow for much more robust tooling, and also allows for referencing ODS constructs directly within PDLL (imported interfaces can be used as constraints, operation result names can be used for member access, etc). Differential Revision: https://reviews.llvm.org/D119900	2022-03-03 16:14:03 -08:00
River Riddle	e865fa7530	[TableGen] Add a library-based entry point for parsing td files This commit adds a new `TableGenParseFile` entry point for tablegen that parses an input buffer and invokes a callback function with a record keeper (notably without an output buffer). This kind of entry point is very useful for tablegen consuming tools that don't create output, and want invoke tablegen multiple times. The current way that we interact with tablegen is via relative includes to TGParser(not great). Differential Revision: https://reviews.llvm.org/D119899	2022-03-03 16:14:03 -08:00
Matt Arsenault	27712243ab	Revert "Inliner: Correctly merge amdgpu-unsafe-fp-atomics attribute" This reverts commit `169ebf03ab`. This was effectively rendering the attribute useless in the real world, although this is still broken.	2022-03-03 15:25:32 -05:00
Snehasish Kumar	dda7b74967	[memprof] Symbolize and cache stack frames. Currently, symbolization of stack frames occurs on demand when the instrprof writer iterates over all the records in the raw memprof reader. With this change we symbolize and cache the frames immediately after reading the raw profiles. For a large internal binary this results in a runtime reduction of ~50% (2m -> 48s) when merging a memprof raw profile with a raw instr profile to generate an indexed profile. This change also makes it simpler in the future to generate additional calling context metadata to attach to each memprof record. Differential Revision: https://reviews.llvm.org/D120430	2022-03-03 11:00:37 -08:00
Craig Topper	608161225e	[InstCombine][Analysis] Move getFCmpCode and getPredForFCmpCode to CmpInstAnalysis. NFC The similar getICmpCode and getPredForICmpCode are already there. This moves FP for consistency. I think InstCombine is currently the only user of both. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D120754	2022-03-03 09:33:24 -08:00
Paul Robinson	7b85f0f32f	[PS4] isPS4 and isPS4CPU are not meaningfully different	2022-03-03 11:36:59 -05:00
Alexandros Lamprineas	910eb988eb	[FuncSpec][NFC] Refactor internal structures. `ArgInfo` is reduced to only contain a pair of {formal,actual} values. The specialized function `Fn` and the `Partial` flag are redundant in this structure. The `Gain` is moved to a new struct `SpecializationInfo`. The value mappings created by cloneCandidateFunction() are being used by rewriteCallSites() for matching the formal arguments of recursive functions. The list of specializations is passed by reference to calculateGains() instead of being returned by value. The `IsPartial` flag is removed from isArgumentInteresting() and getPossibleConstants() as it's no longer used anywhere in the code. Differential Revision: https://reviews.llvm.org/D120753	2022-03-03 13:08:13 +00:00
Simon Moll	8de8731591	Revert "[VP] Introducing VectorBuilder, the VP intrinsic builder" This reverts commit `8bcbfb50e8`. Taking this patch offline to fix breakage: https://lab.llvm.org/buildbot/#/builders/110/builds/10912	2022-03-03 13:34:37 +01:00
Simon Moll	8bcbfb50e8	[VP] Introducing VectorBuilder, the VP intrinsic builder VectorBuilder wraps around an IRBuilder and VectorBuilder::createVectorInstructions emits VP intrinsics as if they were regular instructions. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D105283	2022-03-03 11:31:57 +01:00
serge-sans-paille	59630917d6	Cleanup includes: Transform/Scalar Estimated impact on preprocessor output line: before: 1062981579 after: 1062494547 Discourse thread: https://discourse.llvm.org/t/include-what-you-use-include-cleanup Differential Revision: https://reviews.llvm.org/D120817	2022-03-03 07:56:34 +01:00
Aakanksha	840695814a	[AMDGPU] Add gfx1036 target Differential Revision: https://reviews.llvm.org/D120846	2022-03-02 23:26:38 +00:00
Stanislav Mekhanoshin	2e2e64df4a	[AMDGPU] Add gfx940 target This is target definition only. Differential Revision: https://reviews.llvm.org/D120688	2022-03-02 13:54:48 -08:00
Pavel Labath	e8784289c0	Revert "Remove a top-level "using namespace" from TargetTransformInfoImpl.h" Causing failures on many bots. This reverts commit `31efecfde9`.	2022-03-02 15:47:41 +01:00
Pavel Labath	31efecfde9	Remove a top-level "using namespace" from TargetTransformInfoImpl.h Move it into the implementation of the function that needs it. Avoids polluting the namespace of all files including the header.	2022-03-02 15:38:20 +01:00
Pavel Labath	11511e9357	Remove "using namespace llvm" from ReleaseModeModelRunner.h A using directive in a header pollutes the namespace of all files which include that header. It seems this snuck in in D115764 by moving some code from a cpp file.	2022-03-02 15:29:12 +01:00
Simon Moll	d05ddb86f6	[VP] vp.sitofp cast intrinsic and docs Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D119922	2022-03-02 10:16:19 +01:00
Martin Storsjö	6ec18aafec	[Object] [COFF] Improve error messages This aids debugging when working with possibly broken files, instead of just flat out erroring out without telling what's wrong. Differential Revision: https://reviews.llvm.org/D120679	2022-03-02 10:44:41 +02:00
Xiang1 Zhang	65588a0776	Revert "TLS loads opimization (hoist)" Revert for more reviews This reverts commit `30e612ebdf`.	2022-03-02 14:10:11 +08:00
Mircea Trofin	cb2160760e	[nfc][codegen] Move RegisterBank[Info].h under CodeGen This wraps up from D119053. The 2 headers are moved as described, fixed file headers and include guards, updated all files where the old paths were detected (simple grep through the repo), and `clang-format`-ed it all. Differential Revision: https://reviews.llvm.org/D119876	2022-03-01 21:53:25 -08:00
Xiang1 Zhang	30e612ebdf	TLS loads opimization (hoist) Reviewed By: Wang Pheobe, Topper Craig Differential Revision: https://reviews.llvm.org/D120000	2022-03-02 10:37:24 +08:00
Lang Hames	34e539dcd7	[ORC] Set ResolverBlockAddr in EPCIndirectionUtils::writeResolverBlock. Without this, EPCIndirectionUtils::getResolverBlockAddr (and lazy compilation via EPC) won't work. No test case: lli is still using LocalLazyCallThroughManager. I'll revisit this soon when I look at adding lazy compilation support to the ORC runtime.	2022-03-01 16:44:55 -08:00
Duncan P. N. Exon Smith	15ab7bc3af	Testing: Make TempFile safe to move; test Temp{Dir,File,Link} Default the moves and delete the copies for TempFile, matching TempDir and TempLink, and add tests for all of them to confirm that the destructor is not harmful after it has been moved from. Differential Revision: https://reviews.llvm.org/D120691	2022-03-01 13:45:51 -08:00

1 2 3 4 5 ...

47600 Commits