llvm-project

Commit Graph

Author	SHA1	Message	Date
Doug Gregor	a773db7d76	Add a command-line flag to control the Swift extended async frame info. Introduce a new command-line flag `-swift-async-fp={auto\|always\|never}` that controls how code generation sets the Swift extended async frame info bit. There are three possibilities: * `auto`: which determines how to set the bit based on deployment target, either statically or dynamically via `swift_async_extendedFramePointerFlags`. * `always`: the default, always set the bit statically, regardless of deployment target. * `never`: never set the bit, regardless of deployment target. Patch by Doug Gregor <dgregor@apple.com> Reviewed By: doug.gregor Differential Revision: https://reviews.llvm.org/D109392	2021-09-16 06:57:45 -07:00
zhijian	baa18ee63e	Add a new API seek for the Cursor class in the DataExtractor.cpp Summary: add a new API seek for the Cursor class in the DataExtractor.cpp Reviewers: James Henderson, Fangrui Song Differential Revision: https://reviews.llvm.org/D109603	2021-09-16 09:39:49 -04:00
Bjorn Pettersson	d9fc3d879e	[NewPM] Replace 'kasan-module' by 'asan-module<kernel>' Change the asan-module pass into a MODULE_PASS_WITH_PARAMS in the pass registry, and add a single parameter called 'kernel' that can be set instead of having a special pass name 'kasan-module' to trigger that special pass config. Main reason is to make sure that we have a unique mapping from ClassName to PassName in the new passmanager framework, making it possible to correctly identify the passes when dealing with options such as -print-after and -print-pipeline-passes. This is a follow-up to D105006 and D105007.	2021-09-16 14:58:42 +02:00
Bjorn Pettersson	8f8616655c	[NewPM] Use a separate struct for ModuleThreadSanitizerPass Split ThreadSanitizerPass into ThreadSanitizerPass (as a function pass) and ModuleThreadSanitizerPass (as a module pass). Main reason is to make sure that we have a unique mapping from ClassName to PassName in the new passmanager framework, making it possible to correctly identify the passes when dealing with options such as -print-after and -print-pipeline-passes. This is a follow-up to D105006 and D105007.	2021-09-16 14:58:42 +02:00
Bjorn Pettersson	ab41eef9ac	[NewPM] Use a separate struct for ModuleMemorySanitizerPass Split MemorySanitizerPass into MemorySanitizerPass (as a function pass) and ModuleMemorySanitizerPass (as a module pass). Main reason is to make sure that we have a unique mapping from ClassName to PassName in the new passmanager framework, making it possible to correctly identify the passes when dealing with options such as -print-after and -print-pipeline-passes. This is a follow-up to D105006 and D105007.	2021-09-16 14:58:42 +02:00
Konstantin Schwarz	d2e66d7fa4	[GlobalISel] Add a combine for and(load , mask) -> zextload This only handles simple masks, not shifted masks, for now. Reviewed By: aemerson Differential Revision: https://reviews.llvm.org/D109357	2021-09-16 10:42:46 +02:00
Alok Kumar Sharma	a5b72abc9e	[DebugInfo] Enhance DIImportedEntity to accept children entities New field `elements` is added to '!DIImportedEntity', representing list of aliased entities. This is needed to dump optimized debugging information where all names in a module are imported, but a few names are imported with overriding aliases. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D109343	2021-09-16 10:41:55 +05:30
Lang Hames	a01f772d19	[ORC] Add MethodWrapperHandler utility for WrapperFunction. MethodWrapperHandler removes some of the boilerplate when writing wrapper functions to wrap method calls. It can be used as a handler for wrapper functions whose first argument is an ExecutorAddress: the address is cast to a pointer of the given class type, then the given method function pointer is called on that object pointer (passing the rest of the arguments). E.g. class MyClass { public: void myMethod(uint32_t, bool) { ... } }; // SPS Method signature for myMethod -- note MyClass object address as first // argument. using SPSMyMethodWrapperSignature = SPSTuple<SPSExecutorAddress, uint32_t, bool>; // Wrapper function for myMethod. WrapperFunctionResult myMethodCallWrapper(const char *ArgData, size_t ArgSize) { return WrapperFunction<SPSMyMethodWrapperSignature>::handle( ArgData, ArgSize, makeMethodWrapperHandler(&MyClass::myMethod)); }	2021-09-16 14:36:05 +10:00
Fangrui Song	03c8b24ace	Delete unused GlobalIndirectSymbol::getBaseObject overloads	2021-09-15 20:44:24 -07:00
Anna Thomas	f9e4aebe4a	Revert "[InstCombine] Improve TryToSinkInstruction with multiple uses" This reverts commit `4ac4e52189`. There are couple of test failures, which needs update of the test cases. Doing a clean revert and will recommit the change along with fixed testcases.	2021-09-15 18:03:11 -04:00
Anna Thomas	3273430406	Re-add getSingleUndroppableUse API The API was removed in `4ac4e52189` in favor of getUniqueUndroppableUser. However, this caused a buildbot failure in AbstractCallSiteTest.cpp, which uses the API and the AbstractCallSite class requires a "use" rather than a user. Retain the API so that the unittest compiles and passes.	2021-09-15 17:06:20 -04:00
Anna Thomas	4ac4e52189	[InstCombine] Improve TryToSinkInstruction with multiple uses This patch allows sinking an instruction which can have multiple uses in a single user. We were previously over-restrictive by looking for exactly one use, rather than one user. Also, the API for retrieving undroppable user has been updated accordingly since in both usecases (Attributor and InstCombine), we seem to care about the user, rather than the use. Reviewed-By: nikic Differential Revision: https://reviews.llvm.org/D109700	2021-09-15 20:39:38 +00:00
Alina Sbirlea	b759381b75	[MemorySSA] Add verification levels to MemorySSA. [NFC] Add two levels of verification for MemorySSA: Fast and Full. The defaults are kept the same. Full verification always occurs under EXPENSIVE_CHECKS, but now it can also be requested in a specific pass for debugging purposes.	2021-09-15 11:09:54 -07:00
David Green	61cc873a8e	[LV] Recognize intrinsic min/max reductions This extends the reduction logic in the vectorizer to handle intrinsic versions of min and max, both the floating point variants already created by instcombine under fastmath and the integer variants from D98152. As a bonus this allows us to match a chain of min or max operations into a single reduction, similar to how add/mul/etc work. Differential Revision: https://reviews.llvm.org/D109645	2021-09-15 10:45:50 +01:00
Amara Emerson	5ec1845cad	[AArch64][GlobalISel] Add a new reassociation for G_PTR_ADDs. G_PTR_ADD (G_PTR_ADD X, C), Y) -> (G_PTR_ADD (G_PTR_ADD(X, Y), C) Improves CTMark -Os on AArch64: Program before after diff sqlite3 286932 287024 0.0% kc 432512 432508 -0.0% SPASS 412788 412764 -0.0% pairlocalalign 249460 249416 -0.0% bullet 475740 475512 -0.0% 7zip-benchmark 568864 568356 -0.1% consumer-typeset 419088 418648 -0.1% tramp3d-v4 367628 367224 -0.1% clamscan 383184 382732 -0.1% lencod 430028 429284 -0.2% Geomean difference -0.1% Differential Revision: https://reviews.llvm.org/D109528	2021-09-14 23:57:41 -07:00
Markus Lavin	1ac209ed76	[NPM] Added -print-pipeline-passes print params for a few passes. Added '-print-pipeline-passes' printing of parameters for those passes declared with _WITH_PARAMS macro in PassRegistry.def. Note that it only prints the parameters declared inside _WITH_PARAMS as in a few cases there appear to be additional parameters not parsable. The following passes are now covered (i.e. all of those with *_WITH_PARAMS in PassRegistry.def). LoopExtractorPass - loop-extract HWAddressSanitizerPass - hwsan EarlyCSEPass - early-cse EntryExitInstrumenterPass - ee-instrument LowerMatrixIntrinsicsPass - lower-matrix-intrinsics LoopUnrollPass - loop-unroll AddressSanitizerPass - asan MemorySanitizerPass - msan SimplifyCFGPass - simplifycfg LoopVectorizePass - loop-vectorize MergedLoadStoreMotionPass - mldst-motion GVN - gvn StackLifetimePrinterPass - print<stack-lifetime> SimpleLoopUnswitchPass - simple-loop-unswitch Differential Revision: https://reviews.llvm.org/D109310	2021-09-15 08:34:04 +02:00
Matt Arsenault	99dfe90695	Attributor: Fix typos	2021-09-14 19:49:04 -04:00
Philip Reames	c4048d8f50	[SCEV] Attempt to define what flags are legal on a SCEV This is an attempt to define what the current semantics are closest too. Unfortunately, the current implementation does appear to be inconsistent with all semantic variants we've considered. This semantics is the one which seems to be closest to the spirit of the code, and that matched several long time contributors mental models of how the code "should work". https://bugs.llvm.org/show_bug.cgi?id=51817 tracks the list of currently known violations of these rules. A series of follow up patches will be addressing each now that we've defined them to be bugs. Differential Revision: https://reviews.llvm.org/D109553	2021-09-14 13:13:49 -07:00
Ben Langmuir	64c8e7489b	[JITLink] Allow moving a Symbol to a Block in a different Section When moving a Symbol between Blocks that are in different Sections, update the symbol tables for each Section. Otherwise symbol.getBlock().getSection() will not match the contents of Section::symbols(), which asserts during linking. Differential Revision: https://reviews.llvm.org/D109724	2021-09-14 08:27:26 -07:00
Craig Topper	2fd180bbb9	[IR] Reduce max supported integer from 2^24-1 to 2^23. SelectionDAG will promote illegal types up to a power of 2 before splitting down to a legal type. This will create an IntegerType with a bit width that must be <= MAX_INT_BITS. This places an effective upper limit on any type of 2^23 so that we don't try create a 2^24 type. I considered putting a fatal error somewhere in the path from TargetLowering::getTypeConversion down to IntegerType::get, but limiting the type in IR seemed better. This breaks backwards compatibility with IR that is using a really large type. I suspect such IR is going to be very rare due to the the compile time costs such a type likely incurs. Prevents the ICE in PR51829. Reviewed By: efriedma, aaron.ballman Differential Revision: https://reviews.llvm.org/D109721	2021-09-14 07:52:10 -07:00
Sam Clegg	ef8c9135ef	[WebAssembly] Allow import and export of TLS symbols between DSOs We previously had a limitation that TLS variables could not be exported (and therefore could also not be imported). This change removed that limitation. Differential Revision: https://reviews.llvm.org/D108877	2021-09-14 06:47:37 -07:00
Anna Thomas	a1ae56d6ad	Remove incorrect comment in getSingleUndroppableUse. NFC We traverse the entire use list to find the undroppable use.	2021-09-14 09:29:55 -04:00
Florian Hahn	e248d69036	Recommit "[LAA] Support pointer phis in loop by analyzing each incoming pointer." SCEV does not look through non-header PHIs inside the loop. Such phis can be analyzed by adding separate accesses for each incoming pointer value. This results in 2 more loops vectorized in SPEC2000/186.crafty and avoids regressions when sinking instructions before vectorizing. Fixes PR50296, PR50288. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D102266	2021-09-14 11:19:12 +01:00
Esme-Yi	b98c3e957f	[yaml2obj][XCOFF] add the SectionIndex field for symbol. Summary: Add the SectionIndex field for symbol. 1: a symbol can reference a section by SectionName or SectionIndex. 2: a symbol can reference a section by both SectionName and SectionIndex. 3: if both Section and SectionIndex are specified, but the two values refer to different sections, an error will be reported. 4: an invalid SectionIndex is allowed. 5: if a symbol references a non-existent section by SectionName, an error will be reported. Reviewed By: jhenderson, Higuoxing Differential Revision: https://reviews.llvm.org/D109566	2021-09-14 06:18:03 +00:00
Chris Lattner	8b4afc5aef	[APInt] Add a concat method, use LLVM_UNLIKELY to help optimizer. Three unrelated changes: 1) Add a concat method as a convenience to help write bitvector use cases in a nicer way. 2) Use LLVM_UNLIKELY as suggested by @xbolva00 in a previous patch. 3) Fix casing of some "slow" methods to follow naming standards. Differential Revision: https://reviews.llvm.org/D109620	2021-09-13 22:02:54 -07:00
Lang Hames	3ca6eee2a9	[ORC] Fix self-assignment. We want to read the page size from EPI->PageSize. Thanks to Simon Pilgrim for spotting this.	2021-09-14 10:21:49 +10:00
Lang Hames	2c8e784915	[ORC] Add Shared/OrcRTBridge, and TargetProcess/OrcRTBootstrap. This is a small first step towards reorganization of the ORC libraries: Declarations for types and function names (as strings) to be found in the "ORC runtime bootstrap" set are moved into OrcRTBridge.h / OrcRTBridge.cpp. The current implementation of the "ORC runtime bootstrap" functions is moved into OrcRTBootstrap.h and OrcRTBootstrap.cpp. It is likely that this code will eventually be moved into ORT-RT proper (in compiler RT). The immediate goal of this change is to make these bootstrap functions usable for clients other than SimpleRemoteEPC/SimpleRemoteEPCServer. The first planned client is a new RuntimeDyld::MemoryManager that will run over EPC, which will allow us to remove the old OrcRemoteTarget code.	2021-09-14 10:19:45 +10:00
Thomas Lively	b2032f18c9	[lld][WebAssembly] Relax limitations on multithreaded instantiation For multithreaded modules (i.e. modules with a shared memory), lld injects a synthetic Wasm start function that is automatically called during instantiation to initialize memory from passive data segments. Even though the module will be instantiated separately on each thread, memory initialization should happen only once. Furthermore, memory initialization should be finished by the time each thread finishes instantiation. Since multiple threads may be instantiating their modules at the same time, the synthetic function must synchronize them. The current synchronization tries to atomically increment a flag from 0 to 1 in memory then enters one of two cases. First, if the increment was successful, the current thread is responsible for initializing memory. It does so, increments the flag to 2 to signify that memory has been initialized, then notifies all threads waiting on the flag. Otherwise, the thread atomically waits on the flag with an expected value of 1 until memory has been initialized. Either the initializer thread finishes initializing memory (i.e. sets the flag to 2) first and the waiter threads do not end up blocking, or the waiter threads succesfully start waiting before memory is initialized so they will be woken by the initializer thread once it has finished. One complication with this scheme is that there are various contexts on the Web, most notably on the main browser thread, that cannot successfully execute a wait. Executing a wait in these contexts causes a trap, and in this case would cause instantiation to fail. The embedder must therefore ensure that these contexts win the race and become responsible for initializing memory, since that is the only code path that does not execute a wait. Unfortunately, since only one thread can win the race and initialize memory, this scheme makes it impossible to have multiple threads in contexts that cannot wait. For example, it is not currently possible to instantiate the module on both the main browser thread as well as in an AudioWorklet. To loosen this restriction, this commit inserts an extra check so that the wait will not be executed at all when memory has already been initialized, i.e. when the flag value is 2. After this change, the module can be instantiated on threads in non-waiting contexts as long as the embedder can guarantee either that the thread will win the race and initialize memory (as before) or that memory has already been initialized when instantiation begins. Threads in contexts that can wait can continue racing to initialize memory. Fixes (or at least improves) PR51702. Reviewed By: dschuff Differential Revision: https://reviews.llvm.org/D109722	2021-09-13 15:03:51 -07:00
Arnamoy Bhattacharyya	b9a8f34d47	[flang][OpenMP] Add parsing support for nontemporal clause. This patch adds parsing support for the nontemporal clause. Also adds a couple of test cases. Reviewed By: clementval Differential Revision: https://reviews.llvm.org/D106896	2021-09-13 15:25:47 -04:00
Fehr Mathieu	b39f6a79ee	[ADT] Extend EnableIfCallable for callables with incomplete returns std::is_convertible has no defined behavior when its arguments are incomplete, even if they are equal. In practice, it returns false. Adding std::is_same allows us to use the constructor using a callable, even if the return value is incomplete. We also check the case where we convert a T into a const T. Reviewed By: DaniilSuchkov Differential Revision: https://reviews.llvm.org/D104703 Committer: Daniil Suchkov <dsuchkov@azul.com>	2021-09-13 19:16:16 +00:00
Simon Pilgrim	9db20822f7	[APInt] Add APIntOps::ScaleBitMask helper APInt is used to describe a bit mask in a variety of value tracking and demanded bits/elts functions. When traversing through dst/src operands, we have a number of places where these masks need to widened/narrowed to translate through bitcasts, reductions etc. to a different type. This patch add a APIntOps::ScaleBitMask common helper, adds unit test coverage, and updates a number of cases to use the the helper instead of their own implementation. This came up on D109065 where we currently have to add yet another implementation of the same code. Differential Revision: https://reviews.llvm.org/D109683	2021-09-13 16:27:12 +01:00
vnalamot	0fc3ebb70a	[SelectionDAG][NFC] Fix typo in VerifyDAGDiverence() function name Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D109674	2021-09-13 20:48:04 +05:30
Cullen Rhodes	9db7e8e3ce	[AArch64][SVE] NFC: Remove unused name from cvt builtins Identified in D109359. Reviewed By: paulwalker-arm, dmgreen Differential Revision: https://reviews.llvm.org/D109589	2021-09-13 10:09:08 +00:00
Cullen Rhodes	6bd26329bf	[NVPTX] NFC: Remove unused template arg '_' Identified in D109359. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D109602	2021-09-13 10:09:08 +00:00
Esme-Yi	909f3d7380	[yaml2obj][XCOFF] customize the string table Summary: The patch adds support for yaml2obj customizing the string table. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D107421	2021-09-13 09:24:38 +00:00
luxufan	ff6069b891	[JITLink] Add initial native TLS support to ELFNix platform This patch use the same way as the https://reviews.llvm.org/rGfe1fa43f16beac1506a2e73a9f7b3c81179744eb to handle the thread local variable. It allocates 2 * pointerSize space in GOT to represent the thread key and data address. Instead of using the _tls_get_addr function, I customed a function __orc_rt_elfnix_tls_get_addr to get the address of thread local varible. Currently, this is a wip patch, only one TLS relocation R_X86_64_TLSGD is supported and I need to add the corresponding test cases. To allocate the TLS descriptor in GOT, I need to get the edge kind information in PerGraphGOTAndPLTStubBuilder, So I add a `Edge::Kind K` argument in some functions in PerGraphGOTAndPLTStubBuilder.h. If it is not suitable, I can think further to solve this problem. Differential Revision: https://reviews.llvm.org/D109293	2021-09-13 14:35:49 +08:00
Kuter Dinel	66a0b3464c	[Attributor] AAFunctionReachability, Handle CallBase Reachability. This patch makes it possible to query callbase reachability (Can a callbase reach a function Fn transitively). The patch moves the reachability query handling logic to a member class, this class will have more users within the AA once we add other function reachability queries. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106402	2021-09-13 01:35:44 +03:00
Sam Clegg	b78c85a44a	[WebAssembly] Convert to new "dylink.0" section format This format is based on sub-sections (like the "linking" and "name" sections) and is therefore easier to extend going forward. spec change: https://github.com/WebAssembly/tool-conventions/pull/170 binaryen change: https://github.com/WebAssembly/binaryen/pull/4141 wabt change: https://github.com/WebAssembly/wabt/pull/1707 emscripten change: https://github.com/emscripten-core/emscripten/pull/15019 Differential Revision: https://reviews.llvm.org/D109595	2021-09-12 05:30:38 -07:00
Lang Hames	b64fc0af9a	[ORC] Add bootstrap symbols to ExecutorProcessControl. Bootstrap symbols are symbols whose addresses may be required to bootstrap the rest of the JIT. The bootstrap symbols map generalizes the existing JITDispatchInfo class provide an arbitrary map of symbol names to addresses. The JITDispatchInfo class will be replaced by bootstrap symbols with reserved names in upcoming commits.	2021-09-12 18:49:43 +10:00
Lang Hames	d11a0c5d91	[ORC] Fix out-of-range comparison errors.	2021-09-12 14:48:05 +10:00
Lang Hames	fcf4e25be5	[ORC] Fix missing '&' in definition of deleted move-assignment.	2021-09-12 14:48:04 +10:00
Lang Hames	bb72f07380	Re-apply `bb27e45643` and `5629afea91` with fixes. This reapplies `bb27e45643` (SimpleRemoteEPC support) and `2269a941a4` (#include <mutex> fix) with further fixes to support building with LLVM_ENABLE_THREADS=Off.	2021-09-12 14:23:22 +10:00
Nikita Popov	45c467346a	[LAA] Pass access type to getPtrStride() Pass the access type to getPtrStride(), so it is not determined from the pointer element type. Many cases still fetch the element type at a higher level though, so this only partially addresses the issue.	2021-09-11 19:16:49 +02:00
Lang Hames	2269a941a4	Revert `5629afea91` and `bb27e45643` while I look into bot failures. This reverts commit `5629afea91` ("[ORC] Add missing include."), and `bb27e45643` ("[ORC] Add SimpleRemoteEPC: ExecutorProcessControl over SPS + abstract transport."). The SimpleRemoteEPC patch currently assumes availability of threads, and needs to be rewritten with LLVM_ENABLE_THREADS guards.	2021-09-11 19:02:11 +10:00
Lang Hames	5629afea91	[ORC] Add missing include.	2021-09-11 18:51:16 +10:00
Lang Hames	bb27e45643	[ORC] Add SimpleRemoteEPC: ExecutorProcessControl over SPS + abstract transport. SimpleRemoteEPC is an ExecutorProcessControl implementation (with corresponding new server class) that uses ORC SimplePackedSerialization (SPS) to serialize and deserialize EPC-messages to/from byte-buffers. The byte-buffers are sent and received via a new SimpleRemoteEPCTransport interface that can be implemented to run SimpleRemoteEPC over whatever underlying transport system (IPC, RPC, network sockets, etc.) best suits your use case. The SimpleRemoteEPCServer class provides executor-side support. It uses a customizable SimpleRemoteEPCServer::Dispatcher object to dispatch wrapper function calls to prevent the RPC thread from being blocked (a problem in some earlier remote-JIT server implementations). Almost all functionality (beyond the bare basics needed to bootstrap) is implemented as wrapper functions to keep the implementation simple and uniform. Compared to previous remote JIT utilities (OrcRemoteTarget, OrcRPCExecutorProcessControl), more consideration has been given to disconnection and error handling behavior: Graceful disconnection is now always initiated by the ORC side of the connection, and failure at either end (or in the transport) will result in Errors being delivered to both ends to enable controlled tear-down of the JIT and Executor (in the Executor's case this means "as controlled as the JIT'd code allows"). The introduction of SimpleRemoteEPC will allow us to remove other remote-JIT support from ORC (including the legacy OrcRemoteTarget code used by lli, and the OrcRPCExecutorProcessControl and OrcRPCEPCServer classes), and then remove ORC RPC itself. The llvm-jitlink and llvm-jitlink-executor tools have been updated to use SimpleRemoteEPC over file descriptors. Future commits will move lli and other tools and example code to this system, and remove ORC RPC.	2021-09-11 18:16:38 +10:00
Lang Hames	a0a91ed3dd	[ORC] Merge LLVMSPSSerializers.h into SimplePackedSerialization.h. Since the ORC runtime and LLVM are no longer sharing SPS code (the ORC runtime has its own copy) there is no reason to keep these separate.	2021-09-11 11:24:00 +10:00
Lang Hames	22641f5853	[ORC] Use EPC for EPCGeneric MemoryAccess / JITLinkMemoryManager construction. This allows these classes to be created during EPC construction, before an ExecutionSession is available.	2021-09-11 11:24:00 +10:00
Joseph Huber	29b44ca896	[OpenMP] Add flag for setting debug in the offloading device This patch introduces the flags `-fopenmp-target-debug` and `-fopenmp-target-debug=` to set the value of a global in the device. This will be used to enable or disable debugging features statically in the device runtime library. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D109544	2021-09-10 18:19:19 -04:00
Johannes Doerfert	c09fbbdcfb	Reapply "[GlobalOpt][FIX] Do not embed initializers into AS!=0 globals"" This reapplies commit `7dbba3376f`, or, put differently, this reverts commit `d9a8d20827`. The test now requires the amdgpu and nvptx backend explicitly as it won't work without properly.	2021-09-10 15:22:56 -05:00

1 2 3 4 5 ...

46070 Commits