llvm-project

Commit Graph

Author	SHA1	Message	Date
Mikhail Maltsev	7a85e3585e	[ARM,CDE] Implement GPR CDE intrinsics Summary: This change implements ACLE CDE intrinsics that translate to instructions working with general-purpose registers. The specification is available at https://static.docs.arm.com/101028/0010/ACLE_2019Q4_release-0010.pdf Each ACLE intrinsic gets a corresponding LLVM IR intrinsic (because they have distinct function prototypes). Dual-register operands are represented as pairs of i32 values. Because of this the instruction selection for these intrinsics cannot be represented as TableGen patterns and requires custom C++ code. Reviewers: simon_tatham, MarkMurrayARM, dmgreen, ostannard Reviewed By: MarkMurrayARM Subscribers: kristof.beyls, hiraditya, danielkiss, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76296	2020-03-20 14:01:51 +00:00
Florian Hahn	ece6cf0fa5	[DSE,MSSA] Precommit additional tests for D73763.	2020-03-20 13:39:46 +00:00
Michael Liao	a4edea29be	Fix `-Wunused-variable` warning. NFC.	2020-03-20 09:31:58 -04:00
Simon Pilgrim	7f764fa18f	[ValueTracking] Add some initial isKnownNonZero DemandedElts support (PR36319)	2020-03-20 13:29:00 +00:00
Alexey Bataev	fcba7c3534	[OPENMP50]Initial support for scan directive. Added basic parsing/sema/serialization support for scan directive.	2020-03-20 07:58:15 -04:00
Nikita Popov	ce6c95aaca	[InstCombine] Move test to instcombine; NFC This test uses -instcombine, so move it into the appropriate directory. Also fork it for expensive checks enabled/disabled.	2020-03-20 12:41:19 +01:00
Dmitri Gribenko	9967352a03	Revert "[Syntax] Test both the default and windows target platforms in unittests" This reverts commit `fd7300f717`. The fix in this patch didn't help and the Windows buildbot broke: http://45.33.8.238/win/10881/step_7.txt	2020-03-20 12:13:49 +01:00
Simon Pilgrim	c1efdbcbe0	[ValueTracking] Add computeKnownBits DemandedElts support to shift instructions (PR36319)	2020-03-20 11:08:08 +00:00
Nikita Popov	a09ff56b5b	[Tests] Regenerate some test checks; NFC	2020-03-20 12:06:53 +01:00
James Henderson	86b093d1a1	[llvm-readobj] Allow syms from all sections to match stack size entries Prior to this change, for non-relocatable objects llvm-readobj would assume that all symbols that corresponded to a stack size section's entries were in the section specified by the section's sh_link field. In the presence of an output section description combining SHF_LINK_ORDER sections linking different output sections, this cannot be respected, since linker script section patterns are "by name" by nature. Consequently, the sh_link value would not be correct for all section entries. This patch changes llvm-readobj to ignore the section of symbols in a non-relocatable object. Fixes https://bugs.llvm.org/show_bug.cgi?id=45228. Reviewed by: grimar, MaskRay Differential Revision: https://reviews.llvm.org/D76425	2020-03-20 10:54:18 +00:00
Marcel Hlopko	fd7300f717	[Syntax] Test both the default and windows target platforms in unittests Summary: This increases the coverage for things that differ between Linux and Windows, such as `-fdelayed-template-parsing`. This would have prevented the rollback of https://reviews.llvm.org/D76346. While at it, update -std=c++11 to c++17 for the test. Reviewers: gribozavr2 Reviewed By: gribozavr2 Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76433	2020-03-20 11:42:18 +01:00
Jaroslav Sevcik	089cfe113d	Improve step over performance Summary: This patch improves step over performance for the case when we are stepping over a call with a next-branch-breakpoint (see https://reviews.llvm.org/D58678), and we encounter a stop during the call. Currently, this causes the thread plan to step-out //each frame// until it reaches the step-over range. This is a regression introduced by https://reviews.llvm.org/D58678 (which did improve other things!). Prior to that change, the step-over plan would always step-out just once. With this patch, if we find ourselves stopped in a deeper stack frame and we already have a next branch breakpoint, we simply return from the step-over plan's ShouldStop handler without pushing the step out plan. In my experiments this improved the time of stepping over a call that loads 12 dlls from 14s to 5s. This was in remote debugging scenario with 10ms RTT, the call in question was Vulkan initialization (vkCreateInstance), which loads various driver dlls. Loading those dlls must stop on the rendezvous breakpoint, causing the perf problem described above. Reviewers: clayborg, labath, jingham Reviewed By: jingham Subscribers: lldb-commits Tags: #lldb Differential Revision: https://reviews.llvm.org/D76216	2020-03-20 11:41:56 +01:00
Georgii Rymar	63778bc653	[llvm-readobj][llvm-readelf][test] - Add a test to check how we dump relocation addends. Seems we do not test how we print relocation addends well. And the behavior of dumpers does not seem to be ideal here (and llvm-readelf does not match GNU as the test case shows). This patch adds a test case to document the current behavior. Differential revision: https://reviews.llvm.org/D75671	2020-03-20 13:41:32 +03:00
Raphael Isemann	467c4902a1	[lldb] Enable now passing part of TestDataFormatterStdString.py This was fixed by `7b2442584e` .	2020-03-20 11:35:15 +01:00
Tyker	180581cfcf	[clang] Add support for consteval constructors Summary: Changes: - handle immediate invocations for constructors. - add tests. After this patch, I believe the implementation of consteval is nearly standard compliant, but IR-gen still needs to be taught not to emit consteval declarations. Reviewers: rsmith Reviewed By: rsmith Subscribers: wchilders Differential Revision: https://reviews.llvm.org/D74007	2020-03-20 11:33:54 +01:00
Adrian Kuegel	baa6f6a782	Revert "[TableGen][GlobalISel] Account for HwMode in RegisterBank register sizes" This reverts commit `e9f22fd429`. When building with -DLLVM_USE_SANITIZER="Thread", check-llvm has 70 failing tests with this revision, and 29 without this revision.	2020-03-20 11:02:50 +01:00
David Green	b3499f572d	[ARM] Change VDUP type to i32 for MVE The MVE VDUP instruction takes a GPR and splats into every lane of a vector register. Unlike NEON we do not have a VDUPLANE equivalent instruction, doing the same splat from a fp register. Previously a VDUP to a v4f32/v8f16 would be represented as a (v4f32 VDUP f32), which would mean the instruction pattern needs to add a COPY_TO_REGCLASS to the GPR. Instead this now converts that earlier during an ISel DAG combine, converting (VDUP x) to (VDUP (bitcast x)). This can allow instruction selection to tell that the input needs to be an i32, which in one of the testcases allows it to use ldr (or specifically ldm) over (vldr;vmov). Whilst being simple enough for floats, as the type sizes are the same, there is no BITCAST equivalent for getting a half into an i32. This uses a VMOVrh ARMISD node, which doesn't know the same tricks yet. Differential Revision: https://reviews.llvm.org/D76292	2020-03-20 09:48:45 +00:00
Roger Ferrer Ibanez	3c24aee7ee	[RISCV] Select +0.0 immediate using fmv.{w,d}.x / fcvt.d.w Floating point positive zero can be selected using fmv.w.x / fmv.d.x / fcvt.d.w and the zero source register. Differential Revision: https://reviews.llvm.org/D75729	2020-03-20 09:42:24 +00:00
Roger Ferrer Ibanez	ebb04e9ca9	[NFC][RISCV] Test for 0.0 fp immediate To show a later change that impacts 0.0 fp constant generation. Differential Revision: https://reviews.llvm.org/D75728	2020-03-20 09:42:24 +00:00
Nikita Popov	0372768776	[InstCombine] Simplify calls with "returned" attribute If a call argument has the "returned" attribute, we can simplify the call to the value of that argument. This was already partially handled by InstSimplify/InstCombine for the case where the argument is an integer constant, and the result is thus known via known bits. The non-constant (or non-int) argument cases weren't handled though. This previously landed as an InstSimplify transform, but was reverted due to assertion failures when compiling the Linux kernel. The reason is that simplifying a call to another call breaks assumptions in call graph updating during inlining. As the code is not easy to fix, and there is no particularly strong motivation for having this in InstSimplify, the transform is only performed in InstCombine instead. Differential Revision: https://reviews.llvm.org/D75815	2020-03-20 10:23:39 +01:00
David Green	9cf920e64d	[ARM] Extra MVE float loop tests. NFC	2020-03-20 09:21:45 +00:00
Nikita Popov	5c10967157	[InstCombine] Don't replace musttail result based on known bits This is the same change as D75824, but for two cases where InstCombine performs the same optimization: Replacing an instruction whose bits are fully known with a constant. This is not (generally) legal for musttail calls. Differential Revision: https://reviews.llvm.org/D76457	2020-03-20 10:17:09 +01:00
Marcel Hlopko	e9630630ff	[Syntax] Split syntax tests Summary: This patch splits the Basic test into multiple individual tests to allow simpler filtering and a clearer signal about what's broken when it's broken. Reviewers: gribozavr2 Reviewed By: gribozavr2 Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76366	2020-03-20 09:53:43 +01:00
Florian Hahn	be86bc76f0	[Matrix] Generalize ColumnMatrixTy to MatrixTy (NFC). This patch sets the stage for supporting both row and column major layouts for matrixes. It renames ColumnMatrixTy to MatrixTy, adds booleans indicating the underlying layout to both MatrixTy and ShapeInfo and generalizes the methods of MatrixTy to support both row and column major layouts. Reviewers: Gerolf, anemet, andrew.w.kaylor, LuoYuanke Reviewed By: anemet Differential Revision: https://reviews.llvm.org/D76324	2020-03-20 08:32:13 +00:00
Florian Hahn	3a8372ed02	[DSE] Support traversing MemoryPhis. For MemoryPhis, we have to avoid that the MemoryPhi may be executed before the access we are currently looking at. To do this we do a post-order numbering of the basic blocks in the function and bail out once we reach a MemoryPhi with a larger (or equal) post-order block number than the current MemoryAccess. This changes the order in which we visit stores for elimination. This patch also adds support for exploring multiple paths. We keep a worklist (ToCheck) of memory accesses that might be eliminated by our starting MemoryDef or MemoryPhis for further exploration. For MemoryPhis, we add the incoming values to the worklist, for MemoryDefs we add the defining access. Reviewers: dmgreen, rnk, efriedma, bryant, asbirlea Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D72148	2020-03-20 07:51:42 +00:00
Austin Kerbow	2cbb8c946a	[AMDGPU] Reuse register during frame index elimination If there were no free VGPRs we would need two emergency spill slots for register scavenging during PEI/frame index elimination. Reuse 'ResultReg' for scale calculation so that only one spill is needed. Differential Revision: https://reviews.llvm.org/D76387	2020-03-20 00:19:15 -07:00
cdevadas	728b878de6	[AMDGPU] Set the CostPerUse value for vgpr registers. Apart from the argument registers, set the CostPerUse value as per the ratio reg_index/allocation_granularity. It is a pre-commit for introducing the scratch registers in the ABI. This change should help in a balanced register allocation. Differential Revision: https://reviews.llvm.org/D76417	2020-03-20 11:49:35 +05:30
Wei Mi	a035726e5a	Revert "Generate Callee Saved Register (CSR) related cfi directives like .cfi_restore." This reverts commit `3c96d01d2e`. Got report that it caused test failures in libc++.	2020-03-19 22:45:27 -07:00
Jun Ma	032251e34d	[Coroutines] Fix PR45130 For now, when final suspend can be simplified by simplifySuspendPoint, handleFinalSuspend is executed as well to remove last case in switch instruction. This patch fixes it. Differential Revision: https://reviews.llvm.org/D76345	2020-03-20 11:27:08 +08:00
Shiva Chen	fc3752665f	[RISCV] Passing small data limitation value to RISCV backend Pass the small data limit to RISCVELFTargetObjectFile via a module flag, so the backend can set the small data section threshold from that value. Data is put into the small data section if it is smaller than the threshold. Differential Revision: https://reviews.llvm.org/D57497	2020-03-20 11:03:51 +08:00
Uday Bondhugula	0ddd04391d	[MLIR] Fix op folding to not run pre-replace when not constant folding OperationFolder::tryToFold was running the pre-replacement action even when there was no constant folding, i.e., when the operation was just being updated in place but was not going to be replaced. This led to nested ops being unnecessarily removed from the worklist and only being processed in the next outer iteration of the greedy pattern rewriter, which is also why this didn't affect the final output IR but only the convergence rate. It also led to an op's results' users to be unnecessarily added to the worklist. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Differential Revision: https://reviews.llvm.org/D76268	2020-03-20 07:49:49 +05:30
Petr Hosek	6ef1f3718f	[sanitizer_coverage][Fuchsia] Set ZX_PROP_VMO_CONTENT_SIZE The VMO size is always page-rounded, but Zircon now provides a way to publish the precise intended size. Patch By: mcgrathr Differential Revision: https://reviews.llvm.org/D76437	2020-03-19 19:12:06 -07:00
Fangrui Song	011b785505	[ELF] Create readonly PT_LOAD in the presence of a SECTIONS command This essentially drops the change by r288021 (discussed with Georgii Rymar and Peter Smith and noted down in the release note of lld 10). GNU ld>=2.31 enables -z separate-code by default for Linux x86. By default (in the absence of a PHDRS command) a readonly PT_LOAD is created, which is different from its traditional behavior. Not emulating GNU ld's traditional behavior is good for us because it improves code consistency (we create a readonly PT_LOAD in the absence of a SECTIONS command). Users can add --no-rosegment to restore the previous behavior (combined readonly and read-executable sections in a single RX PT_LOAD).	2020-03-19 19:11:11 -07:00
Petr Hosek	4e6c778eca	[XRay] Record the XRay data size as a property of the VMO While the VMO size is always page aligned, we can record the content size as a property and then use this metadata when writing the data to a file. Differential Revision: https://reviews.llvm.org/D76462	2020-03-19 19:01:05 -07:00
Stephen Neuendorffer	f7d4bd8144	[MLIR] Fix for out-of-tree builds from install area. Because MLIR_HAS_EXPORTS is not set, MLIRTarget.cmake is not delivered to the install area. When this happens, the delivered MLIRConfig.cmake should not reference it. Independently, we need to determine under what conditions MLIR_HAS_EXPORTS should be set. Probably we are not exporting all the libraries correctly.	2020-03-19 18:43:19 -07:00
David Blaikie	1c15377496	Recommit: CFGDiff: Simplify/common the begin/end implementations to use a common range helper"" (would be nice to revisit the CFG traits and change them to use ranges rather than begin/end - if anyone wants to do that refactor) Also use more auto because writing the names of range utility iterators isn't helping readability here - they're sort of implementation details for the most part, especially once you nest a few different filtering and adapting iterators. The fix (shooting from the hip since I couldn't reproduce this locally) was to capture by value in a lambda used in a filtering iterator - because the iterator would persist beyond the lifetime of the function (as the iterators are returned to callers). Originally committed in `79a7ed92a9`. This was reverted in `4a7f2032a3`.	2020-03-19 18:21:14 -07:00
Fangrui Song	09ac859c13	[ELF][test] Make tests less address sensitive and delete redundant tests	2020-03-19 18:04:47 -07:00
Yuta Saito	08670d435b	[WebAssembly] Support swiftself and swifterror for WebAssembly target Summary: Swift ABI is based on basic C ABI described here https://github.com/WebAssembly/tool-conventions/blob/master/BasicCABI.md Swift Calling Convention on WebAssembly differs slightly from swiftcc on other architectures. On non-WebAssembly architectures, swiftcc accepts extra parameters that are attributed with swifterror or swiftself by the caller. Even if the callee doesn't have these parameters, the invocation succeeds, ignoring the extra parameters. But WebAssembly strictly checks that callee and caller signatures are the same. https://github.com/WebAssembly/design/blob/master/Semantics.md#calls So at the WebAssembly level, all swiftcc functions end up with extra arguments, and all function definitions and invocations explicitly have additional parameters to fill swifterror and swiftself. This patch supports the signature difference for swiftself and swifterror when the calling convention is swiftcc. e.g. ``` declare swiftcc void @foo(i32, i32) @data = global i8* bitcast (void (i32, i32)* @foo to i8*) define swiftcc void @bar() { %1 = load i8*, i8** @data %2 = bitcast i8* %1 to void (i32, i32, i32)* call swiftcc void %2(i32 1, i32 2, i32 swiftself 3) ret void } ``` For swiftcc, emit additional swiftself and swifterror parameters if they are not already present while lowering. These additional parameters are added for both callee and caller. They are necessary to match callee and caller signatures for direct and indirect function calls. Differential Revision: https://reviews.llvm.org/D76049	2020-03-19 17:39:52 -07:00
Thomas Lively	34db3c3a18	[WebAssembly] SIMD integer abs instructions Summary: These were merged to the SIMD proposal in https://github.com/WebAssembly/simd/pull/128. Depends on D76397 to avoid merge conflicts. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76399	2020-03-19 17:25:58 -07:00
Sterling Augustine	6343526d64	Revert "Cleanup the plumbing for DILineInfoSpecifier. [NFC]" This broke lldb. Will fix and resubmit. This reverts commit `98ff6eb679`.	2020-03-19 17:25:05 -07:00
Thomas Lively	a3f974f3c3	[WebAssembly] SIMD bitmask intrinsics and builtin functions Summary: These experimental new instructions are proposed in https://github.com/WebAssembly/simd/pull/201. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76397	2020-03-19 17:15:37 -07:00
Matt Arsenault	678da7b109	AMDGPU/GlobalISel: Remove leftover #if 0 The subtarget feature used to be missing from subtargets, but that was fixed.	2020-03-19 20:07:05 -04:00
Sterling Augustine	98ff6eb679	Cleanup the plumbing for DILineInfoSpecifier. [NFC] Summary: 1. FileLineInfoSpecifier::Default isn't the default for anything. Rename to RawValue, which accurately reflects its role. 2. Most functions that take a part of a FileLineInfoSpecifier end up constructing a full one later or plumb two values through. Make them all just take a complete FileLineInfoSpecifier. 3. Printing basenames only was handled differently from all other variants, make it parallel to all the other variants. Reviewers: jhenderson Subscribers: hiraditya, MaskRay, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76394	2020-03-19 16:56:43 -07:00
Jessica Paquette	c999084619	[GlobalISel] Port some basic shufflevector undef combines from the DAGCombiner Port over the following: - shuffle undef, undef, any_mask -> undef - shuffle anything, anything, undef_mask -> undef This sort of thing shows up a lot when you try to bugpoint code containing shufflevector. Differential Revision: https://reviews.llvm.org/D76382	2020-03-19 16:46:06 -07:00
Stephen Neuendorffer	6bc775a1fc	[MLIR] Interfaces need to use add_mlir_library The Interface libraries were moved from Analysis, but declared in cmake using add_llvm_library(). This breaks LLVM_BUILD_LLVM_DYLIB builds. Differential Revision: https://reviews.llvm.org/D76463	2020-03-19 16:44:24 -07:00
Lang Hames	39253a50f0	[ORC] Re-apply `98f2bb4461`, enable JITEventListeners in OrcV2, with fixes. Updates the object buffer ownership scheme in jitLinkForOrc and related functions: Ownership of both the object::ObjectFile and underlying MemoryBuffer is passed into jitLinkForOrc and passed back to the onEmit callback once linking is complete. This avoids the use-after-free errors that were seen in `98f2bb4461`.	2020-03-19 16:30:08 -07:00
Petr Hosek	d6fc61b7e8	[profile] Record the profile size as a property of the VMO While the VMO size is always page aligned, we can record the content size as a property and then use this metadata when writing the profile to a file. Differential Revision: https://reviews.llvm.org/D76402	2020-03-19 16:22:19 -07:00
Petr Hosek	98223f7931	[Fuchsia] Use -ffile-prefix-map This makes toolchain independent of the path it was built in by rewriting all absolute paths embedded in sources and debug info into relative ones. Differential Revision: https://reviews.llvm.org/D76189	2020-03-19 15:14:15 -07:00
Petr Hosek	8a8778f25f	[CMake] Enable the use of -ffile-prefix-map This handles not only paths embedded in debug info, but also in sources. Since the use of this flag is controlled by an option, rather than replacing the new option, we add a new option. Differential Revision: https://reviews.llvm.org/D76018	2020-03-19 15:14:15 -07:00
Simon Pilgrim	95b6f62efb	[InstSimplify] Add some vector shift tests to show lack of DemandedElts support	2020-03-19 22:09:51 +00:00
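
The two shufflevector folds listed in commit c999084619 (`shuffle undef, undef, any_mask -> undef` and `shuffle anything, anything, undef_mask -> undef`) can be sketched roughly as below. This is a standalone Python illustration, not the GlobalISel code: the names `UNDEF`, `UNDEF_LANE` and `fold_shuffle` are hypothetical, and representing an undefined mask lane as -1 is an assumption made for the sketch.

```python
# Hypothetical sentinel standing in for an undef vector operand.
UNDEF = object()
# Assumption for this sketch: -1 marks an undefined mask lane.
UNDEF_LANE = -1


def fold_shuffle(src1, src2, mask):
    """Return UNDEF if the shuffle result is known to be undef, else None."""
    # shuffle undef, undef, any_mask -> undef
    if src1 is UNDEF and src2 is UNDEF:
        return UNDEF
    # shuffle anything, anything, undef_mask -> undef
    if mask and all(lane == UNDEF_LANE for lane in mask):
        return UNDEF
    # No fold applies; the caller keeps the original shuffle.
    return None
```

A shuffle with only one undef source, or with a mask that is only partly undefined, is left alone by this sketch.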

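Commit 3c24aee7ee selects +0.0 from the zero source register. For the `fmv.w.x` / `fmv.d.x` forms this works because IEEE 754 positive zero is the all-zero bit pattern, while `fcvt.d.w` converts the integer value 0, which also gives +0.0. The Python sketch below only checks the bit-pattern fact; `float_bits` is a hypothetical helper written for this illustration and is not part of LLVM.

```python
import struct


def float_bits(value, fmt):
    """Return the raw IEEE 754 bit pattern of value as an unsigned integer.

    fmt is "f" for single precision or "d" for double precision.
    """
    int_fmt = {"f": "I", "d": "Q"}[fmt]
    return struct.unpack("<" + int_fmt, struct.pack("<" + fmt, value))[0]


# +0.0 is all zero bits in both widths, so copying an integer zero works.
positive_zero_single = float_bits(0.0, "f")
positive_zero_double = float_bits(0.0, "d")
# -0.0 has the sign bit set, so it cannot be produced the same way.
negative_zero_single = float_bits(-0.0, "f")
```

Negative zero differs only in the sign bit, which is why the commit title is specific to +0.0.
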

345973 Commits