llvm-project

Commit Graph

Author	SHA1	Message	Date
Hans Wennborg	8a197e0b16	Require shell for lld/test/ELF/arm-exidx-range.s The test fails in 32-bit Windows builds for unclear reasons: ld.lld: error: failed to open C:\src\llvm_package_1100-rc1\build32_stage0\tools\lld\test\ELF\Output\arm-exidx-range.s.tmp: The parameter is incorrect.	2020-07-20 17:49:10 +02:00
David Goldman	dde98c82c0	Fix issue in typo handling which could lead clang to hang Summary: We need to detect when certain TypoExprs are not being transformed due to invalid trees, otherwise we risk endlessly trying to fix it. Reviewers: rsmith Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D84067	2020-07-20 11:42:11 -04:00
Matt Arsenault	21ef01b7e3	AMDGPU: Remove outdated fixme	2020-07-20 11:41:41 -04:00
Matt Arsenault	84704d989b	AMDGPU: Fix not accounting for constantexpr uses of LDS globals This was failing to add the size of LDS globals that weren't directly used by an instruction. They could be used by constant expressions which are transitively used by the function. This requires a better search, but just abort on this for now for correctness.	2020-07-20 11:41:41 -04:00
Benjamin Kramer	177e5acbe4	[Sema] Promote SmallSet of enum to bitset Even with 300 elements, this still consumes less stack space than the SmallSet. NFCI.	2020-07-20 17:35:43 +02:00
Matt Arsenault	61f1f2a204	AMDGPU/GlobalISel: Initial Implementation of calls Return values, and tail calls are not yet handled.	2020-07-20 11:13:22 -04:00
Matt Arsenault	780cef1f34	Verifier: Check byref address space for AMDGPU calling conventions	2020-07-20 11:13:11 -04:00
Erich Keane	66aff32398	Issue error on invalid arithemtic conversions in C ternary As reported in PR46774, an invalid arithemetic conversion used in a C ternary operator resulted in an assertion. This patch replaces that assertion with a diagnostic stating that the conversion failed. At the moment, I believe the only case of this happening is _ExtInt types.	2020-07-20 08:02:37 -07:00
Matt Arsenault	ad8e900cb3	Verifier: Disallow byval and similar for AMDGPU calling conventions These imply stack-like semantics, which doesn't make any sense for entry points.	2020-07-20 10:58:57 -04:00
Benjamin Kramer	f3f1ce4fa9	[Driver] Promote SmallSet of enum to a bitset. NFCI.	2020-07-20 16:54:30 +02:00
Benjamin Kramer	33c9d0320e	Upgrade SmallSets of pointer-like types to SmallPtrSet This is slightly more efficient. NFC.	2020-07-20 16:54:29 +02:00
Frederik Gossen	71e7a37e7e	[MLIR][Shape] Allow `shape.rank` to accept extent tensors `tensor?xindex>` Differential Revision: https://reviews.llvm.org/D84156	2020-07-20 14:47:19 +00:00
Frederik Gossen	ccb40c84c5	[MLIR][Shape] Allow `cstr_broadcastable` to accept extent tensors Differential Revision: https://reviews.llvm.org/D84155	2020-07-20 14:39:44 +00:00
Alok Kumar Sharma	2d10258a31	[DebugInfo] Support for DW_AT_associated and DW_AT_allocated. Summary: This support is needed for the Fortran array variables with pointer/allocatable attribute. This support enables debugger to identify the status of variable whether that is currently allocated/associated. for pointer array (before allocation/association) without DW_AT_associated (gdb) pt ptr type = integer (140737345375288:140737354129776) (gdb) p ptr value requires 35017956 bytes, which is more than max-value-size with DW_AT_associated (gdb) pt ptr type = integer (:) (gdb) p ptr $1 = <not associated> for allocatable array (before allocation) without DW_AT_allocated (gdb) pt arr type = integer (140737345375288:140737354129776) (gdb) p arr value requires 35017956 bytes, which is more than max-value-size with DW_AT_allocated (gdb) pt arr type = integer, allocatable (:) (gdb) p arr $1 = <not allocated> Testing - unit test cases added - check-llvm - check-debuginfo Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D83544	2020-07-20 19:54:35 +05:30
Matt Arsenault	5e999cbe8d	IR: Define byref parameter attribute This allows tracking the in-memory type of a pointer argument to a function for ABI purposes. This is essentially a stripped down version of byval to remove some of the stack-copy implications in its definition. This includes the base IR changes, and some tests for places where it should be treated similarly to byval. Codegen support will be in a future patch. My original attempt at solving some of these problems was to repurpose byval with a different address space from the stack. However, it is technically permitted for the callee to introduce a write to the argument, although nothing does this in reality. There is also talk of removing and replacing the byval attribute, so a new attribute would need to take its place anyway. This is intended avoid some optimization issues with the current handling of aggregate arguments, as well as fixes inflexibilty in how frontends can specify the kernel ABI. The most honest representation of the amdgpu_kernel convention is to expose all kernel arguments as loads from constant memory. Today, these are raw, SSA Argument values and codegen is responsible for turning these into loads. Background: There currently isn't a satisfactory way to represent how arguments for the amdgpu_kernel calling convention are passed. In reality, arguments are passed in a single, flat, constant memory buffer implicitly passed to the function. It is also illegal to call this function in the IR, and this is only ever invoked by a driver of some kind. It does not make sense to have a stack passed parameter in this context as is implied by byval. It is never valid to write to the kernel arguments, as this would corrupt the inputs seen by other dispatches of the kernel. These argumets are also not in the same address space as the stack, so a copy is needed to an alloca. From a source C-like language, the kernel parameters are invisible. Semantically, a copy is always required from the constant argument memory to a mutable variable. The current clang calling convention lowering emits raw values, including aggregates into the function argument list, since using byval would not make sense. This has some unfortunate consequences for the optimizer. In the aggregate case, we end up with an aggregate store to alloca, which both SROA and instcombine turn into a store of each aggregate field. The optimizer never pieces this back together to see that this is really just a copy from constant memory, so we end up stuck with expensive stack usage. This also means the backend dictates the alignment of arguments, and arbitrarily picks the LLVM IR ABI type alignment. By allowing an explicit alignment, frontends can make better decisions. For example, there's real no advantage to an aligment higher than 4, so a frontend could choose to compact the argument layout. Similarly, there is a high penalty to using an alignment lower than 4, so a frontend could opt into more padding for small arguments. Another design consideration is when it is appropriate to expose the fact that these arguments are all really passed in adjacent memory. Currently we have a late IR optimization pass in codegen to rewrite the kernel argument values into explicit loads to enable vectorization. In most programs, unrelated argument loads can be merged together. However, exposing this property directly from the frontend has some disadvantages. We still need a way to track the original argument sizes and alignments to report to the driver. I find using some side-channel, metadata mechanism to track this unappealing. If the kernel arguments were exposed as a single buffer to begin with, alias analysis would be unaware that the padding bits betewen arguments are meaningless. Another family of problems is there are still some gaps in replacing all of the available parameter attributes with metadata equivalents once lowered to loads. The immediate plan is to start using this new attribute to handle all aggregate argumets for kernels. Long term, it makes sense to migrate all kernel arguments, including scalars, to be passed indirectly in the same manner. Additional context is in D79744.	2020-07-20 10:23:09 -04:00
Simon Pilgrim	017e5c949b	MCFixup.h - remove unnecessary MCExpr.h include. NFCI. Move the include down to files that actually depend on MCExpr definitions. Also exposes an implicit dependency on MCContext in AVRAsmBackend.h	2020-07-20 15:17:19 +01:00
Simon Pilgrim	a0ed0e3fac	CodeGenDAGPatterns.h - remove unnecessary ComplexPattern forward declaration. NFCI. This is defined in CodeGenTarget.h which we have to explicitly include already.	2020-07-20 15:17:19 +01:00
Simon Pilgrim	93c338fd0f	CodeGenDAGPatterns.h - remove unused CodeGenHwModes.h include. NFCI.	2020-07-20 15:17:18 +01:00
Petar Avramovic	6a1030aa0e	AMDGPU/GlobalISel: Legalize s16->s64 G_FPEXT Legalize using narrowScalar as s16->s32 G_FPEXT followed by s32->s64 G_FPEXT. Differential Revision: https://reviews.llvm.org/D84030	2020-07-20 16:12:19 +02:00
Matt Arsenault	100564bdf8	AMDGPU/GlobalISel: Remove outdated comment	2020-07-20 10:06:18 -04:00
Matt Arsenault	5cbd4e415e	GlobalISel: Don't handle widenScalar for vector G_INSERT This handling didn't make any sense for vectors.	2020-07-20 10:06:18 -04:00
Matt Arsenault	93311a9812	AMDGPU/GlobalISel: Fix custom lowering of llvm.trunc.f64 for SI This was missing an operand from BFE and not erasing the original instruction.	2020-07-20 10:06:18 -04:00
Matt Arsenault	57aae47056	AArch64/GlobalISel: Fix hardcoded registers in error message checks	2020-07-20 10:06:18 -04:00
Matt Arsenault	a679f27e98	GlobalISel: Consistently get TII from MIRBuilder	2020-07-20 10:06:18 -04:00
Pavel Labath	7fadd70069	[lldb/Utility] Simplify Scalar::SetValueFromData The function was fairly complicated and didn't support new bigger integer sizes. Use llvm function for loading an APInt from memory to write a unified implementation for all sizes.	2020-07-20 15:56:56 +02:00
Pavel Labath	9decf0405f	[lldb/test] Simplify Makefile rules for .d files The sed line in the rules was adding the .d file as a target to the dependency rules -- to ensure the file gets rebuild when the sources change. The same thing can be achieved more elegantly with some -M flags.	2020-07-20 15:53:19 +02:00
Benjamin Kramer	e88b6ed748	[LLE] std::inserter doesn't work with SmallSet, so don't use it.	2020-07-20 15:47:42 +02:00
Haojian Wu	70e2c7ad2e	[AST][RecoveryExpr] Add recovery-ast tests for C language, NFC. some examples are working already. Differential Revision: https://reviews.llvm.org/D84146	2020-07-20 15:33:59 +02:00
Haojian Wu	4b5b7c7541	[AST][RecoveryExpr] Fix a crash on opencl C++. Differential Revision: https://reviews.llvm.org/D84145	2020-07-20 15:15:30 +02:00
Haojian Wu	61d664c938	Fix clangd build, NFC	2020-07-20 15:13:20 +02:00
Benjamin Kramer	44ab60f74d	[LoopSimplify] Use SmallPtrSet and range for loops more. NFCI.	2020-07-20 15:00:59 +02:00
Haojian Wu	684e416ef1	[AST][RecoveryExpr] Preserve the AST for invalid conditions. Adjust an existing diagnostic test, which is an improvement of secondary diagnostic. Differential Revision: https://reviews.llvm.org/D81163	2020-07-20 14:58:36 +02:00
Pavel Labath	9199457bfb	[LLDB/test] Simplify result formatter code Now that the main test results are reported through lit, and we only have one formatter class, this code is unnecessarily baroque.	2020-07-20 14:56:49 +02:00
Sam McCall	72f2fb1db4	[clangd] Exclude preprocessed-to-nothing tokens from selection This prevents selection of empty preprocessor entities (like #define directives, or text in disabled sections) creating a selection in the parent element. Summary: Based on D83508 by Aleksandr Platonov. Reviewers: ArcsinX, kadircet Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D84012	2020-07-20 14:50:12 +02:00
Sam McCall	f0ab336e74	[Syntax] expose API for expansions overlapping a spelled token range. Summary: This allows efficiently accessing all expansions (without iterating over each token and searching), and also identifying tokens within a range that are affected by the preprocessor (which is how clangd will use it). Subscribers: ilya-biryukov, kadircet, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D84009	2020-07-20 14:48:12 +02:00
Simon Pilgrim	68a1cbe11a	SubtargetFeatureInfo.h - remove unnecessary include and forward declarations. NFCI. Move necessary include down to SubtargetFeatureInfo.cpp.	2020-07-20 13:39:24 +01:00
Muhammad Omair Javaid	d9920e0199	Revert "AArch64 SVE register infos and core file support" This reverts commit `7e017de0ad`.	2020-07-20 17:37:17 +05:00
Frederik Gossen	f9595857b9	[MLIR][Shape] Fold `shape.shape_eq` Fold `shape.shape_eq`. Differential Revision: https://reviews.llvm.org/D82533	2020-07-20 12:25:53 +00:00
Nicolas Vasilache	47cbd9f922	[mlir][Vector] NFC - Improve VectorInterfaces This revision improves and makes better use of OpInterfaces for the Vector dialect. Differential Revision: https://reviews.llvm.org/D84053	2020-07-20 08:24:22 -04:00
Muhammad Omair Javaid	7e017de0ad	AArch64 SVE register infos and core file support Summary: This patch adds support for AArch64 SVE register infos description and core file register access. AArch64 SVE is a an optional extension of Arm v8.3-a architecture. It has introduced 32 new vector registers Z, 16 predicate P registers and FFR predicate register. These registers have fixed names but can dynamically be configured to different size based on underlying OS configuration. This patch adds register info struct that describes SVE register infos and also provides RegisterContextPOSIXCore_arm64 routines to access SVE registers. This patch also introduces a mechanism to configure SVE register sizes and offsets at startup before exchanging register information across gdb-remote. TestLinuxCore.py has been updated to include testing of SVE core files. Reviewers: labath, clayborg, jankratochvil, jasonmolenda, rengolin Reviewed By: labath Subscribers: tschuett, kristof.beyls, danielkiss, lldb-commits Differential Revision: https://reviews.llvm.org/D77047	2020-07-20 17:21:16 +05:00
Alex Zinenko	ebbdecdd57	[mlir] Support translating function linkage between MLIR and LLVM IR Linkage support is already present in the LLVM dialect, and is being translated for globals other than functions. Translation support has been missing for functions because their conversion goes through a different code path than other globals. Differential Revision: https://reviews.llvm.org/D84149	2020-07-20 14:04:31 +02:00
Paul Walker	6384ec4099	[SVE] Add lowering for fixed length vector fdiv, fma, fmul and fsub operations. Differential Revision: https://reviews.llvm.org/D84034	2020-07-20 11:57:34 +00:00
Joachim Protze	f226171429	[OpenMP][Tests][NFC] Mark compatibility with older versions of clang	2020-07-20 13:53:29 +02:00
Haojian Wu	17ef788df5	[AST][RecoveryExpr] Preserve the AST for invalid class constructions. Differential Revision: https://reviews.llvm.org/D81090	2020-07-20 13:11:15 +02:00
Paul Walker	ab7abd8bf4	[Driver] Add support for -msve-vector-bits=scalable. No real action is taken for a value of scalable but it provides a route to disable an earlier specification and is effectively its default value when omitted. Patch also removes an "unused variable" warning. Differential Revision: https://reviews.llvm.org/D84021	2020-07-20 10:46:22 +00:00
George Mitenkov	b74ab49f47	[MLIR][SPIRVToLLVM] Documentation for SPIR-V to LLVM conversion This patch adds documentation for SPIR-V to LLVM conversion. It describes the approaches taken and what is currently supported by this conversion framework. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D83322	2020-07-20 13:42:57 +03:00
James Henderson	908804b093	[test][llvm-readobj] Fix build bot failure after `df952cb9` The test wasn't updated due to being an unusual target not included in my test run.	2020-07-20 11:23:13 +01:00
Florian Hahn	dc1087d408	[Matrix] Add minimal lowering pass that only requires TTI. This patch adds a new variant of the matrix lowering pass that only does a minimal lowering and only depends on TTI. The main purpose of this pass is to have a pass with minimal dependencies to run as part of the backend pipeline. At the moment, the only difference to the regular lowering pass is that it does not support remarks. But in subsequent patches add support for tiling to the lowering pass which will require more analysis, which we do not want to run in the backend, as the lowering should happen in the middle-end in practice and running it in the backend is mostly for convenience when running llc. Reviewers: anemet, Gerolf, efriedma, hfinkel Reviewed By: anemet Differential Revision: https://reviews.llvm.org/D76867	2020-07-20 11:16:11 +01:00
Hans Wennborg	8513a681f7	[clang-cl] Allow a colon after the /Fe option (PR46720)	2020-07-20 12:06:41 +02:00
Muhammad Omair Javaid	7ca9b589c4	Remove Linux sysroot dependencies of SVE PT macros Summary: SVE elf note data requires SVE PT macros for reading writing data. Same macros are used by Linux ptrace SVE register access. This patch makes necessary changes to lldb/source/Plugins/Process/Linux/LinuxPTraceDefines_arm64sve.h in order to make them sysroot independent. Reviewers: labath, rengolin Reviewed By: labath Subscribers: tschuett, lldb-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D83541	2020-07-20 14:54:51 +05:00

1 2 3 4 5 ...

360821 Commits All Branches Search

360821 Commits

All Branches