llvm-project

Commit Graph

Author	SHA1	Message	Date
Austin Kerbow	2cbb8c946a	[AMDGPU] Reuse register during frame index elimination If there were no free VGPRs we would need two emergency spill slots for register scavenging during PEI/frame index elimination. Reuse 'ResultReg' for scale calculation so that only one spill is needed. Differential Revision: https://reviews.llvm.org/D76387	2020-03-20 00:19:15 -07:00
cdevadas	728b878de6	[AMDGPU] Set the CostPerUse value for vgpr registers. Apart from the argument registers, set the CostPerUse value as per the ratio reg_index/allocation_granularity. It is a pre-commit for introducing the scratch registers in the ABI. This change should help in a balanced register allocation. Differential Revision: https://reviews.llvm.org/D76417	2020-03-20 11:49:35 +05:30
Wei Mi	a035726e5a	Revert "Generate Callee Saved Register (CSR) related cfi directives like .cfi_restore." This reverts commit `3c96d01d2e`. Got report that it caused test failures in libc++.	2020-03-19 22:45:27 -07:00
Jun Ma	032251e34d	[Coroutines] Fix PR45130 For now, when final suspend can be simplified by simplifySuspendPoint, handleFinalSuspend is executed as well to remove last case in switch instruction. This patch fixes it. Differential Revision: https://reviews.llvm.org/D76345	2020-03-20 11:27:08 +08:00
Shiva Chen	fc3752665f	[RISCV] Passing small data limitation value to RISCV backend Pass the small data limit to RISCVELFTargetObjectFile by module flag, so the backend can set the small data section threshold from that value. Data is put into the small data section if it is smaller than the threshold. Differential Revision: https://reviews.llvm.org/D57497	2020-03-20 11:03:51 +08:00
Uday Bondhugula	0ddd04391d	[MLIR] Fix op folding to not run pre-replace when not constant folding OperationFolder::tryToFold was running the pre-replacement action even when there was no constant folding, i.e., when the operation was just being updated in place but was not going to be replaced. This led to nested ops being unnecessarily removed from the worklist and only being processed in the next outer iteration of the greedy pattern rewriter, which is also why this didn't affect the final output IR but only the convergence rate. It also led to an op's results' users to be unnecessarily added to the worklist. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D76268	2020-03-20 07:49:49 +05:30
Petr Hosek	6ef1f3718f	[sanitizer_coverage][Fuchsia] Set ZX_PROP_VMO_CONTENT_SIZE The VMO size is always page-rounded, but Zircon now provides a way to publish the precise intended size. Patch By: mcgrathr Differential Revision: https://reviews.llvm.org/D76437	2020-03-19 19:12:06 -07:00
Fangrui Song	011b785505	[ELF] Create readonly PT_LOAD in the presence of a SECTIONS command This essentially drops the change by r288021 (discussed with Georgii Rymar and Peter Smith and noted down in the release note of lld 10). GNU ld>=2.31 enables -z separate-code by default for Linux x86. By default (in the absence of a PHDRS command) a readonly PT_LOAD is created, which is different from its traditional behavior. Not emulating GNU ld's traditional behavior is good for us because it improves code consistency (we create a readonly PT_LOAD in the absence of a SECTIONS command). Users can add --no-rosegment to restore the previous behavior (combined readonly and read-executable sections in a single RX PT_LOAD).	2020-03-19 19:11:11 -07:00
Petr Hosek	4e6c778eca	[XRay] Record the XRay data size as a property of the VMO While the VMO size is always page aligned, we can record the content size as a property and then use this metadata when writing the data to a file. Differential Revision: https://reviews.llvm.org/D76462	2020-03-19 19:01:05 -07:00
Stephen Neuendorffer	f7d4bd8144	[MLIR] Fix for out-of-tree builds from install area. Because MLIR_HAS_EXPORTS is not set, MLIRTarget.cmake is not delivered to the install area. When this happens, the delivered MLIRConfig.cmake should not reference it. Independently, we need to determine under what conditions MLIR_HAS_EXPORTS should be set. Probably we are not exporting all the libraries correctly.	2020-03-19 18:43:19 -07:00
David Blaikie	1c15377496	Recommit: CFGDiff: Simplify/common the begin/end implementations to use a common range helper (would be nice to revisit the CFG traits and change them to use ranges rather than begin/end - if anyone wants to do that refactor) Also use more auto because writing the names of range utility iterators isn't helping readability here - they're sort of implementation details for the most part, especially once you nest a few different filtering and adapting iterators. The fix (shooting from the hip since I couldn't reproduce this locally) was to capture by value in a lambda used in a filtering iterator - because the iterator would persist beyond the lifetime of the function (as the iterators are returned to callers). Originally committed in `79a7ed92a9`. This was reverted in `4a7f2032a3`.	2020-03-19 18:21:14 -07:00
Fangrui Song	09ac859c13	[ELF][test] Make tests less address sensitive and delete redundant tests	2020-03-19 18:04:47 -07:00
Yuta Saito	08670d435b	[WebAssembly] Support swiftself and swifterror for WebAssembly target Summary: Swift ABI is based on basic C ABI described here https://github.com/WebAssembly/tool-conventions/blob/master/BasicCABI.md Swift Calling Convention on WebAssembly differs a little from swiftcc on other architectures. On non-WebAssembly architectures, swiftcc accepts extra parameters that are attributed with swifterror or swiftself by the caller. Even if the callee doesn't have these parameters, the invocation succeeds, ignoring the extra parameters. But WebAssembly strictly checks that callee and caller signatures are the same. https://github.com/WebAssembly/design/blob/master/Semantics.md#calls So at the WebAssembly level, all swiftcc functions end up with extra arguments, and all function definitions and invocations explicitly have additional parameters to fill swifterror and swiftself. This patch supports the signature difference for swiftself and swifterror when the cc is swiftcc. e.g. ``` declare swiftcc void @foo(i32, i32) @data = global i8* bitcast (void (i32, i32)* @foo to i8*) define swiftcc void @bar() { %1 = load i8*, i8** @data %2 = bitcast i8* %1 to void (i32, i32, i32)* call swiftcc void %2(i32 1, i32 2, i32 swiftself 3) ret void } ``` For swiftcc, emit additional swiftself and swifterror parameters if they are absent while lowering. These additional parameters are added for both callee and caller. They are necessary to match callee and caller signatures for direct and indirect function calls. Differential Revision: https://reviews.llvm.org/D76049	2020-03-19 17:39:52 -07:00
Thomas Lively	34db3c3a18	[WebAssembly] SIMD integer abs instructions Summary: These were merged to the SIMD proposal in https://github.com/WebAssembly/simd/pull/128. Depends on D76397 to avoid merge conflicts. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76399	2020-03-19 17:25:58 -07:00
Sterling Augustine	6343526d64	Revert "Cleanup the plumbing for DILineInfoSpecifier. [NFC]" This broke lldb. Will fix and resubmit. This reverts commit `98ff6eb679`.	2020-03-19 17:25:05 -07:00
Thomas Lively	a3f974f3c3	[WebAssembly] SIMD bitmask intrinsics and builtin functions Summary: These experimental new instructions are proposed in https://github.com/WebAssembly/simd/pull/201. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76397	2020-03-19 17:15:37 -07:00
Matt Arsenault	678da7b109	AMDGPU/GlobalISel: Remove leftover #if 0 The subtarget feature used to be missing from subtargets, but that was fixed.	2020-03-19 20:07:05 -04:00
Sterling Augustine	98ff6eb679	Cleanup the plumbing for DILineInfoSpecifier. [NFC] Summary: 1. FileLineInfoSpecifier::Default isn't the default for anything. Rename to RawValue, which accurately reflects its role. 2. Most functions that take a part of a FileLineInfoSpecifier end up constructing a full one later or plumb two values through. Make them all just take a complete FileLineInfoSpecifier. 3. Printing basenames only was handled differently from all other variants, make it parallel to all the other variants. Reviewers: jhenderson Subscribers: hiraditya, MaskRay, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76394	2020-03-19 16:56:43 -07:00
Jessica Paquette	c999084619	[GlobalISel] Port some basic shufflevector undef combines from the DAGCombiner Port over the following: - shuffle undef, undef, any_mask -> undef - shuffle anything, anything, undef_mask -> undef This sort of thing shows up a lot when you try to bugpoint code containing shufflevector. Differential Revision: https://reviews.llvm.org/D76382	2020-03-19 16:46:06 -07:00
Stephen Neuendorffer	6bc775a1fc	[MLIR] Interfaces need to use add_mlir_library The Interface libraries were moved from Analysis, but declared in cmake using add_llvm_library(). This breaks LLVM_BUILD_LLVM_DYLIB builds. Differential Revision: https://reviews.llvm.org/D76463	2020-03-19 16:44:24 -07:00
Lang Hames	39253a50f0	[ORC] Re-apply `98f2bb4461`, enable JITEventListeners in OrcV2, with fixes. Updates the object buffer ownership scheme in jitLinkForOrc and related functions: Ownership of both the object::ObjectFile and underlying MemoryBuffer is passed into jitLinkForOrc and passed back to the onEmit callback once linking is complete. This avoids the use-after-free errors that were seen in `98f2bb4461`.	2020-03-19 16:30:08 -07:00
Petr Hosek	d6fc61b7e8	[profile] Record the profile size as a property of the VMO While the VMO size is always page aligned, we can record the content size as a property and then use this metadata when writing the profile to a file. Differential Revision: https://reviews.llvm.org/D76402	2020-03-19 16:22:19 -07:00
Petr Hosek	98223f7931	[Fuchsia] Use -ffile-prefix-map This makes toolchain independent of the path it was built in by rewriting all absolute paths embedded in sources and debug info into relative ones. Differential Revision: https://reviews.llvm.org/D76189	2020-03-19 15:14:15 -07:00
Petr Hosek	8a8778f25f	[CMake] Enable the use of -ffile-prefix-map This handles not only paths embedded in debug info, but also in sources. Since the use of this flag is controlled by an option, rather than replacing the new option, we add a new option. Differential Revision: https://reviews.llvm.org/D76018	2020-03-19 15:14:15 -07:00
Simon Pilgrim	95b6f62efb	[InstSimplify] Add some vector shift tests to show lack of DemandedElts support	2020-03-19 22:09:51 +00:00
Stefan Agner	f87563661d	[MC][ARM] add implicit immediate form for ldrsbt/ldrht/ldrsht Add pseudo instructions for ldrsbt/ldrht/ldrsht with implicit immediate and add fall back C++ code to transform the instruction to the equivalent LDRSBTi/LDRHTi/LDRSHTi form. This is similar to how it has been done in commit `fb3950ec63` This fixes: https://bugs.llvm.org/show_bug.cgi?id=45070	2020-03-19 22:36:42 +01:00
Nathan Ridge	b89202e842	[clangd] Do not trigger go-to-def textual fallback inside string literals Reviewers: sammccall Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76098	2020-03-19 17:24:45 -04:00
Sam McCall	b4f02d89e5	[AST] Make Expr::setDependence protected and remove add/removeDependence. NFC Summary: The expected pattern is for subclasses to initialize through computeDependence, which needs only setDependence. The few places that still use addDependence can be simulated with get+set. Reviewers: hokein Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76392	2020-03-19 21:54:40 +01:00
Benjamin Kramer	1db8b341a6	[Matrix] Fold single-use variable into assert Avoids -Wunused-variable warnings in Release builds.	2020-03-19 21:42:22 +01:00
Ilya Leoshkevich	c985b244ee	[MSan] Simulate OOM in mmap_interceptor() Summary: Some kernels can provide 16EiB worth of mappings to each process, which causes mmap test to run for a very long time. In order to make it stop after a few seconds, make mmap_interceptor() fail when the original mmap() returns an address which is outside of the application range. Reviewers: eugenis Reviewed By: eugenis Subscribers: #sanitizers, Andreas-Krebbel, stefansf, jonpa, uweigand Tags: #sanitizers Differential Revision: https://reviews.llvm.org/D76426	2020-03-19 13:33:45 -07:00
Florian Hahn	796fb2e474	[Matrix] Move multiply-add code generation into separate function (NFC). This logic can be shared with the tiled code generation. Reviewers: anemet, Gerolf, hfinkel, andrew.w.kaylor, LuoYuanke Reviewed By: anemet Differential Revision: https://reviews.llvm.org/D75565	2020-03-19 20:26:19 +00:00
Kazu Hirata	e23d786526	[JumpThreading] Fix infinite loop (PR44611) Summary: This patch fixes https://bugs.llvm.org/show_bug.cgi?id=44611 by preventing an infinite loop in the jump threading pass when -jump-threading-across-loop-headers is on. Specifically, without this patch, jump threading through two basic blocks would trigger on the same area of the CFG over and over, resulting in an infinite loop. Consider testcase PR44611-across-header-hang.ll in this patch. The first opportunity to thread through two basic blocks is: from bb_body2 through bb_header and bb_body1 to bb_body2. The pass duplicates bb_header and bb_body1 as, say, bb_header.thread1 and bb_body1.thread1. Since bb_header contains a successor edge back to itself, bb_header.thread1 also contains a successor edge to bb_header, immediately giving rise to the next jump threading opportunity: from bb_header.thread1 through bb_header and bb_body1 to bb_body2. After that, we repeatedly thread an incoming edge into bb_header through bb_header and bb_body1 to bb_body2. In other words, we keep peeling one iteration from bb_header's self loop. The patch fixes the problem by preventing the pass from duplicating a basic block containing a self loop. Reviewers: wmi, junparser, efriedma Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76390	2020-03-19 12:49:36 -07:00
Richard Smith	b20ab412bf	Teach TreeTransform to substitute into resolved TemplateArguments. This comes up when substituting into an already-substituted template argument during constraint satisfaction checking.	2020-03-19 12:43:11 -07:00
Scott Linder	0e9368cc8c	[AMDGPU] Move frame pointer from s34 to s33 Remove the gap left between the stack pointer (s32) and frame pointer (s34) now that the scratch wave offset is no longer a part of the calling convention ABI. Update llvm/docs/AMDGPUUsage.rst to reflect the change. Tags: #llvm Differential Revision: https://reviews.llvm.org/D75657	2020-03-19 15:35:16 -04:00
Scott Linder	60b1967c39	[AMDGPU] Add Scratch Wave Offset to Scratch Buffer Descriptor in entry functions Add the scratch wave offset to the scratch buffer descriptor (SRSrc) in the entry function prologue. This allows us to remove the scratch wave offset register from the calling convention ABI. As part of this change, allow the use of an inline constant zero for the SOffset of MUBUF instructions accessing the stack in entry functions when a frame pointer is not requested/required. Entry functions with calls still need to set up the calling convention ABI stack pointer register, and reference it in order to address arguments of called functions. The ABI stack pointer register remains unswizzled, but is now wave-relative instead of queue-relative. Non-entry functions also use an inline constant zero SOffset for wave-relative scratch access, but continue to use the stack and frame pointers as before. When the stack or frame pointer is converted to a swizzled offset it is now scaled directly, as the scratch wave offset no longer needs to be subtracted first. Update llvm/docs/AMDGPUUsage.rst to reflect these changes to the calling convention. Tags: #llvm Differential Revision: https://reviews.llvm.org/D75138	2020-03-19 15:35:16 -04:00
Scott Linder	db099f994b	[AMDGPU][NFC] Refactor some uses of unsigned to Register Tags: #llvm Differential Revision: https://reviews.llvm.org/D76035	2020-03-19 15:35:16 -04:00
Scott Linder	30bb113beb	[AMDGPU][NFC] Refactor emitEntryFunctionPrologue Remove dead code and factor repeated conditions out into a single check. Rename and move code to make it more obvious what is running only for entry functions. Simplify function arguments to make it clearer what the relevant inputs are. Make flat scratch init accept an MBB iterator and move it to where it was logically being emitted within the prologue. These changes will make a future update to the calling convention simpler. Tags: #llvm Differential Revision: https://reviews.llvm.org/D75092	2020-03-19 15:35:16 -04:00
Sid Manning	430c9a80c1	[Hexagon] Enable linux #defines Enable standard linux defines when the triple is Linux and the environment is musl. Differential Revision: https://reviews.llvm.org/D76310	2020-03-19 14:33:49 -05:00
Florian Hahn	0cc2d23751	[Matrix] Hoist load/store generation logic, add helpers for tiled access. This patch slightly generalizes the code to emit loads and stores of a matrix and adds helpers to load/store a tile of a larger matrix. This will be used in a follow-up patch introducing initial tiling. Reviewers: anemet, Gerolf, hfinkel, andrew.w.kaylor, LuoYuanke Reviewed By: anemet Differential Revision: https://reviews.llvm.org/D75564	2020-03-19 19:28:21 +00:00
Simon Pilgrim	c2586cab89	[InstCombine][X86] Tests for variable but in-range vector-by-scalar shift amounts (PR40391) These shifts are masked to be inrange so we should be able to replace them with generic shifts.	2020-03-19 19:24:55 +00:00
Erich Keane	a983562b23	Precommit test for clang::CallGraph declared functions. https://reviews.llvm.org/D76435 fixes this problem, functions that are being declared but ARE called aren't entered into the callgraph.	2020-03-19 12:00:30 -07:00
Jonas Devlieghere	90308a4da1	[debugserver] Implement hardware breakpoints for ARM64 Add support for hardware breakpoints on ARM64. Differential revision: https://reviews.llvm.org/D76411	2020-03-19 11:55:48 -07:00
Sean Silva	c31ee83abb	Add Builder::get{I32,I64}TensorAttr. Builder::get{I32,I64}VectorAttr are actually of limited applicability since vector types can't have zero elements, whereas many uses of this kind of attribute (such as dimension lists for "transpose"-like and other tensor ops) often can result in empty lists. Differential Revision: https://reviews.llvm.org/D76403	2020-03-19 11:37:59 -07:00
Simon Pilgrim	a11e5b32df	[InstCombine][X86] simplifyX86immShift - handle variable out-of-range vector shift by immediate amounts (PR40391) If we know the SSE shift amount is out of range then we can simplify to zero value (logical) or a 'signsplat' bitwidth-1 shift (arithmetic). This allows us to remove the equivalent ConstantInt constant folding path from simplifyX86immShift.	2020-03-19 18:27:31 +00:00
zoecarver	9e2207a00b	[libc++] fix non-builtin is_void implementation Add the missing closing angle bracket to the call to remove_cv. This is only used when we can't use the builtin implementation. Fixes: `5ade17e0ca`	2020-03-19 11:25:41 -07:00
Cameron McInally	018dde4ce5	[AArch64][SVE] Add support for DestructiveBinaryImm DestructiveInstType Support prefixing destructive operations, with the MOVPRFX instruction, to build constructive operations. Differential Revision: https://reviews.llvm.org/D75064	2020-03-19 13:11:46 -05:00
Lang Hames	54aec178da	[ORC] Don't use a platform mutex for LLJIT's GenericLLVMIRPlatformSupport class. Along the same lines as eb918d8daf1: This code also had to acquire the session mutex, and this could cause a deadlock under the wrong circumstances. This patch updates GenericLLVMIRPlatformSupport to just use the session lock for everything.	2020-03-19 11:03:34 -07:00
Lang Hames	ad2da631bf	[ORC] Fix indentation in debugging output.	2020-03-19 11:02:56 -07:00
Lang Hames	eb918d8daf	[ORC] Use finer-grained and session locking in MachOPlatform to avoid deadlock. In MachOPlatform, obtaining the link-order for a JITDylib requires locking the session, but also needs to be part of a larger atomic operation that collates initializer symbols tracked by the platform. Trying to do this under a separate platform mutex leads to potential locking order issues, e.g. T1 locks session then tries to lock platform to register a new init symbol meanwhile T2 locks platform then tries to lock session to obtain link order. Removing the platform lock and performing all these operations under the session lock eliminates this possibility. At the same time we also need to collate init pointers from the MachOPlatform::InitScraperPlugin, and we don't need or want to lock the session for that. The new InitSeqMutex has been added to guard these init pointers, and the session mutex is never obtained while the InitSeqMutex is held.	2020-03-19 11:02:56 -07:00
Lang Hames	a7b8393ffe	[ORC] Don't waste time building empty replacement MaterializationUnits.	2020-03-19 11:02:56 -07:00

346048 Commits