llvm-project

Commit Graph

Author	SHA1	Message	Date
Rong Xu	54e03d03a7	[PGO] Verify BFI counts after loading profile data This patch adds the functionality to compare BFI counts with real profile counts right after reading the profile. It will print remarks under -Rpass-analysis=pgo, or the internal option -pass-remarks-analysis=pgo. Differential Revision: https://reviews.llvm.org/D91813	2020-12-14 15:56:10 -08:00
Harald van Dijk	9eac818370	[X86] Fix variadic argument handling for x32 The X86-64 ABI defines va_list as typedef struct { unsigned int gp_offset; unsigned int fp_offset; void overflow_arg_area; void reg_save_area; } va_list[1]; This means the size, alignment, and reg_save_area offset will depend on whether we are in LP64 or in ILP32 mode, so this commit adds the checks. Additionally, the VAARG_64 pseudo-instruction assumed 64-bit pointers, so this commit adds a VAARG_X32 pseudo-instruction that behaves just like VAARG_64, except for assuming 32-bit pointers. Some of these changes were originally done by Michael Liao <michael.hliao@gmail.com>. Fixes https://bugs.llvm.org/show_bug.cgi?id=48428. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D93160	2020-12-14 23:47:27 +00:00
Nico Weber	d058b69b1c	[lld/mac] implement -compatibility_version, -current_version Differential Revision: https://reviews.llvm.org/D93237	2020-12-14 18:41:36 -05:00
Peter Collingbourne	f21f3339ba	scudo: Remove positional template arguments for secondary cache. NFCI. Make these arguments named constants in the Config class instead of being positional arguments to MapAllocatorCache. This makes the configuration easier to follow. Eventually we should follow suit with the other classes but this is a start. Differential Revision: https://reviews.llvm.org/D93251	2020-12-14 15:40:07 -08:00
Sanjay Patel	8593e197bc	[VectorCombine] add alignment test for gep load; NFC	2020-12-14 18:31:19 -05:00
Nico Weber	2733a5a5b4	[gn build] (semi-manually) port `19d57b5c42`	2020-12-14 18:23:15 -05:00
Nico Weber	9412932bb5	[gn build] (semi-manually) port `7ad49aec12`	2020-12-14 18:22:54 -05:00
Eugene Zhulenev	0b510e79ce	[mlir] Fix opaque struct typedef in AsyncRuntime header Differential Revision: https://reviews.llvm.org/D93250	2020-12-14 15:04:59 -08:00
Richard Uhler	ee43dcaad7	[mlir] Add section page for Rationale docs. With a brief overview and summary of each of the Rationale docs. Differential Revision: https://reviews.llvm.org/D93245	2020-12-14 14:49:30 -08:00
Gulfem Savrun Yeniceri	7c0e3a77bc	[clang][IR] Add support for leaf attribute This patch adds support for leaf attribute as an optimization hint in Clang/LLVM. Differential Revision: https://reviews.llvm.org/D90275	2020-12-14 14:48:17 -08:00
Louis Dionne	b3d1d1f4ff	[libc++] Remove unnecessary static assertion in allocate_shared Checking that `T` is constructible from `Args...` is technically not required by the Standard, although any implementation will obviously error out if that's not satisfied. However, this check is incompatible with using Allocator construction in the control block (upcoming change as part of implementing P0674), so I'm removing it now to reduce the upcoming diff as much as possible. Differential Revision: https://reviews.llvm.org/D93246	2020-12-14 17:47:43 -05:00
Louis Dionne	3b7280f5e4	[libc++] NFCI: Return pointer instead of reference from __shared_ptr_emplace helper method This makes __get_alloc consistent with __get_elem, and will reduce the diff required to implement P0674R1.	2020-12-14 17:46:09 -05:00
Sanjay Patel	d399f870b5	[VectorCombine] make load transform poison-safe As noted in D93229, the transform from scalar load to vector load potentially leaks poison from the extra vector elements that are being loaded. We could use freeze here (and x86 codegen at least appears to be the same either way), but we already have a shuffle in this logic to optionally change the vector size, so let's allow that instruction to serve both purposes. Differential Revision: https://reviews.llvm.org/D93238	2020-12-14 17:42:01 -05:00
Duncan P. N. Exon Smith	d636b881bb	Adapt lldb to `a40db5502b` The bots just told me about a place in LLDB I missed in `a40db5502b` when changing `HeaderSearch::LoadedModuleMaps`, but I think this will fix it.	2020-12-14 14:41:15 -08:00
Duncan P. N. Exon Smith	b61f288a58	Add comment to closing brace of anonymous namespace, NFC	2020-12-14 14:38:12 -08:00
Duncan P. N. Exon Smith	90d056ceb9	AST: Silence an instance of -Wsign-compare, NFC Looks this this was added by `68f53960e1`.	2020-12-14 14:36:59 -08:00
Duncan P. N. Exon Smith	a40db5502b	Lex: Migrate HeaderSearch::LoadedModuleMaps to FileEntryRef Migrate `HeaderSearch::LoadedModuleMaps` and a number of APIs over to `FileEntryRef`. This should have no functionality change. Note that two `FileEntryRef`s hash the same if they point at the same `FileEntry`. Differential Revision: https://reviews.llvm.org/D92975	2020-12-14 14:35:11 -08:00
Craig Topper	25067f179f	[LoopIdiomRecognize] Teach detectShiftUntilZeroIdiom to recognize loops where the counter is decrementing. This adds support for loops like unsigned clz(unsigned x) { unsigned w = sizeof (x) * CHAR_BIT; while (x) { w--; x >>= 1; } return w; } and unsigned clz(unsigned x) { unsigned w = sizeof (x) * CHAR_BIT - 1; while (x >>= 1) { w--; } return w; } To support these we look for add x, -1 as well as add x, 1 that we already matched. If the value was -1 we need to subtract from the initial counter value instead of adding to it. Fixes PR48404. Differential Revision: https://reviews.llvm.org/D92745	2020-12-14 14:25:05 -08:00
River Riddle	b3ee7f1f31	[mlir][OpDefGen] Add support for generating local functions for shared utilities This revision adds a new `StaticVerifierFunctionEmitter` class that emits local static functions in the .cpp file for shared operation verification. This class deduplicates shared operation verification code by emitting static functions alongside the op definitions. These methods are local to the definition file, and are invoked within the operation verify methods. The first bit of shared verification is for the type constraints used when verifying operands and results. An example is shown below: ``` static LogicalResult localVerify(...) { ... } LogicalResult OpA::verify(...) { if (failed(localVerify(...))) return failure(); ... } LogicalResult OpB::verify(...) { if (failed(localVerify(...))) return failure(); ... } ``` This allowed for saving >400kb of code size from a downstream TensorFlow project (~15% of MLIR code size). Differential Revision: https://reviews.llvm.org/D91381	2020-12-14 14:21:30 -08:00
Stanislav Mekhanoshin	cf5845d6c4	[AMDGPU] Use multi-dword flat scratch for spilling Differential Revision: https://reviews.llvm.org/D93067	2020-12-14 14:19:29 -08:00
Louis Dionne	19d57b5c42	[libc++] Refactor allocate_shared to use an allocation guard This commit is a step towards making it easier to add support for arrays in allocate_shared. Adding support for arrays will require writing multiple functions, and the current complexity of writing allocate_shared is prohibitive for understanding. Differential Revision: https://reviews.llvm.org/D93130	2020-12-14 17:10:05 -05:00
Bardia Mahjour	a29ecca781	Revert "[DDG] Data Dependence Graph - DOT printer" This reverts commit `fd4a10732c`, to investigate the failure on windows: http://lab.llvm.org:8011/#/builders/127/builds/3274	2020-12-14 16:54:20 -05:00
Christian Sigg	0cf7e4b252	Revert "[mlir] Remove methods from mlir::OpState that just forward to mlir::Operation." This reverts commit `6f271e921b`. Differential Revision: https://reviews.llvm.org/D93242	2020-12-14 22:47:17 +01:00
Philip Reames	3b3eb7f07f	Speculative fix for build bot failures (The clang build fails for me locally, so this is based on built bot output and a guess as to root cause.) `f5fe849` made the execution of LAA conditional, so I'm guessing that's the root cause.	2020-12-14 13:44:40 -08:00
Bardia Mahjour	fd4a10732c	[DDG] Data Dependence Graph - DOT printer This patch implements a DDG printer pass that generates a graph in the DOT description language, providing a more visually appealing representation of the DDG. Similar to the CFG DOT printer, this functionality is provided under an option called -dot-ddg and can be generated in a less verbose mode under -dot-ddg-only option. Differential Revision: https://reviews.llvm.org/D90159	2020-12-14 16:41:14 -05:00
Javier Setoain	aece4e2793	[mlir][ArmSVE][RFC] Add an ArmSVE dialect This revision starts an Arm-specific ArmSVE dialect discussed in the discourse RFC thread: https://llvm.discourse.group/t/rfc-vector-dialects-neon-and-sve/2284 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D92172	2020-12-14 21:35:01 +00:00
Matt Arsenault	2e0e03c6a0	OpaquePtr: Require byval on x86_intrcc parameter 0 Currently the backend special cases x86_intrcc and treats the first parameter as byval. Make the IR require byval for this parameter to remove this special case, and avoid the dependence on the pointee element type. Fixes bug 46672. I'm not sure the IR is enforcing all the calling convention constraints. clang seems to ignore the attribute for empty parameter lists, but the IR tolerates it.	2020-12-14 16:34:37 -05:00
Matt Arsenault	ef4da3c2ba	clang: Add byval on x86_intrcc parameter 0 This will allow removing the special case treatment of the parameter and avoid depending on the pointer's element type.	2020-12-14 16:34:37 -05:00
Louis Dionne	7ad49aec12	[libc++] Split allocator_traits and pointer_traits out of <memory> In addition to making the code a lot easier to grasp by localizing many helper functions to the only file where they are actually needed, this will allow creating helper functions that depend on allocator_traits outside of <memory>. This is done as part of implementing array support in allocate_shared, which requires non-trivial array initialization algorithms that would be better to keep out of <memory> for sanity. It's also a first step towards splitting up our monolithic headers into finer grained ones, which will make it easier to reuse functionality across the library. For example, it's just weird that we had to define `addressof` inside <type_traits> to avoid circular dependencies -- instead it's better to implement those in true helper headers. Differential Revision: https://reviews.llvm.org/D93074	2020-12-14 16:13:57 -05:00
Zequan Wu	b6b522c4db	[NFC] cleanup cg-profile emission on TargetLowerinng Differential Revision: https://reviews.llvm.org/D93150	2020-12-14 13:07:44 -08:00
Hafiz Abid Qadeer	670686ad8e	Add initial support for multilibs in Baremetal toolchain. This patch add support of riscv multilibs in the Baremetal toolchain. It is a bit different to what is done in GNU.cpp as we are not iterating a GNU sysroot to find the multilibs. This is intended for an llvm only toolchain. We are not checking for the presence of any runtime bits to enable a specific multilib. I have structured the patch so that other targets for which there is no multilibs support yet in Baremetal.cpp (e.g. arm-none-eabi) will not be affected. Patch also allows some multilibs reuse. Long term, I would like to go in the direction of data-driven specification of multilib directories and flags. Reviewed By: jroelofs Differential Revision: https://reviews.llvm.org/D93138	2020-12-14 20:49:45 +00:00
Guozhi Wei	d50d7c37a1	[MBP] Prevent rotating a chain contains entry block The entry block should always be the first BB in a function. So we should not rotate a chain contains the entry block. Differential Revision: https://reviews.llvm.org/D92882	2020-12-14 12:48:55 -08:00
Philip Reames	f5fe8493e5	[LAA] Relax restrictions on early exits in loop structure his is a preparation patch for supporting multiple exits in the loop vectorizer, by itself it should be mostly NFC. This patch moves the loop structure checks from LAA to their respective consumers (where duplicates don't already exist). Moving the checks does end up changing some of the optimization warnings and debug output slightly, but nothing that appears to be a regression. Why do this? Well, after auditing the code, I can't actually find anything in LAA itself which relies on having all instructions within a loop execute an equal number of times. This patch simply makes this explicit so that if one consumer - say LV in the near future (hopefully) - wants to handle a broader class of loops, it can do so. Differential Revision: https://reviews.llvm.org/D92066	2020-12-14 12:44:01 -08:00
River Riddle	6af2c4ca9b	[mlir] Change the internal representation of FrozenRewritePatternList to use shared_ptr This will allow for caching pattern lists across multiple pass instances, such as when multithreading. This is an extremely important invariant for PDL patterns, which are compiled at runtime when the FrozenRewritePatternList is built. Differential Revision: https://reviews.llvm.org/D93146	2020-12-14 12:32:44 -08:00
Christian Sigg	6f271e921b	[mlir] Remove methods from mlir::OpState that just forward to mlir::Operation. All call sites have been converted in previous changes. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D93176	2020-12-14 21:26:14 +01:00
Michael Kruse	2aa4335806	[flang] Fix copy elision assumption. Before this patch, the Restorer depended on copy elision to happen. Without copy elision, the function ScopedSet calls the move constructor before its dtor. The dtor will prematurely restore the reference to the original value. Instead of relying the compiler to not use the Restorer's copy constructor, delete its copy and assign operators. Hence, callers cannot move or copy a Restorer object anymore, and have to explicitly provide the reset state. ScopedSet avoids calling move/copy operations by relying on unnamed return value optimization, which is mandatory in C++17. Reviewed By: klausler Differential Revision: https://reviews.llvm.org/D88797	2020-12-14 14:07:05 -06:00
River Riddle	6bc9439f59	[mlir][OpAsmParser] Add support for parsing integer literals without going through IntegerAttr Some operations use integer literals as part of their custom format that don't necessarily map to an internal IntegerAttr. This revision exposes the same `parseInteger` functions as the DialectAsmParser to allow for these operations to parse integer literals without incurring the otherwise unnecessary roundtrip through IntegerAttr. Differential Revision: https://reviews.llvm.org/D93152	2020-12-14 12:00:43 -08:00
River Riddle	c234b65cef	[mlir][OpFormat] Add support for emitting newlines from the custom format of an operation This revision adds a new `printNewline` hook to OpAsmPrinter that allows for printing a newline within the custom format of an operation, that is then indented to the start of the operation. Support for the declarative assembly format is also added, in the form of a `\n` literal. Differential Revision: https://reviews.llvm.org/D93151	2020-12-14 12:00:43 -08:00
Artem Belevich	0936655bac	[CUDA] Do not diagnose host/device variable access in dependent types. `isCUDADeviceBuiltinSurfaceType()`/`isCUDADeviceBuiltinTextureType()` do not work on dependent types as they rely on specific type attributes. Differential Revision: https://reviews.llvm.org/D92893	2020-12-14 11:53:18 -08:00
Sanjay Patel	9c1765acab	[VectorCombine] add test for load with offset; NFC	2020-12-14 14:40:06 -05:00
Reid Kleckner	55fc64bce0	[Hexagon] Tweak _MSC_VER workaround version My bot runs VS 2019, but it could not compile this code. Message: [55/2465] Building CXX object lib\Target\Hexagon\CMakeFiles\LLVMHexagonCodeGen.dir\HexagonVectorCombine.cpp.obj FAILED: lib/Target/Hexagon/CMakeFiles/LLVMHexagonCodeGen.dir/HexagonVectorCombine.cpp.obj ... C:\Program Files (x86)\Microsoft Visual Studio\2019\Professional\VC\Tools\MSVC\14.23.28105\include\map(71): error C2976: 'std::map': too few template arguments C:\Program Files (x86)\Microsoft Visual Studio\2019\Professional\VC\Tools\MSVC\14.23.28105\include\map(71): note: see declaration of 'std::map' The version in the path, 14.23, corresponds to _MSC_VER 1923, so raise the version floor to 1924. I have not tested with versions between 1924 and 1928 (latest), but the latest works with the variadic version.	2020-12-14 11:26:36 -08:00
Alina Sbirlea	5a2d954671	[NFC] Remove stray comment.	2020-12-14 11:19:17 -08:00
Christian Sigg	a1eb154421	[flang] Use mlir::OpState::operator->() to get to methods of mlir::Operation. This is a preparation step to remove those methods from OpState. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D93194	2020-12-14 20:04:53 +01:00
Craig Topper	045304701b	[RISCV] Move vtype decoding and printing from RISCVInstPrinter to RISCVBaseInfo. Share with the assembly parser's debug output This moves the vtype decoding and printing to RISCVBaseInfo. This keeps all of the decoding code in the same area as the encoding code. This will make it easier to change the decoding for the 1.0 spec in the future. We're now sharing the printing with the debug output for operands in the assembler. This also fixes that debug output to include the tail and mask agnostic bits. Since the printing code works on the vtype immediate value, we now encode the immediate during parsing and store just the immediate in the operand.	2020-12-14 10:50:26 -08:00
Kuba Mracek	f276c00898	[sanitizer] Restrict querying VM size on Darwin only to iOS devices We currently do this for SANITIZER_IOS, which includes devices and simulators. This change opts out the check for simulators to unify the behavior with macOS, because VM size is really a property of the host OS, and not the simulator. <rdar://problem/72129387> Differential Revision: https://reviews.llvm.org/D93140	2020-12-14 10:48:48 -08:00
Thomas Raoux	8955e9f6b7	[mlir][linalg] Fix bug in elementwise vectorization Fix a bug causing to pick the wrong vector size to broadcast to when the source vectors have different ranks. Differential Revision: https://reviews.llvm.org/D93118	2020-12-14 10:44:36 -08:00
Jonas Paulsson	653b97690f	[SystemZ] Improve handling of backchain offset. - New function SDValue getBackchainAddress() used by lowerDYNAMIC_STACKALLOC() and lowerSTACKRESTORE() to properly handle the backchain offset also with packed-stack. - Make a common function getBackchainOffset() for the computation of the backchain offset and use in some places (NFC). Review: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D93171	2020-12-14 12:39:38 -06:00
Sylvain Audi	5f53d28fa6	Revert "[clang-scan-deps] Support clang-cl" Reverting, as it breaks build on mac. This reverts commit `640ad76911`.	2020-12-14 13:32:38 -05:00
Arthur Eubanks	e814013932	[Wasm][LTO][NPM] Use NPM for LTO with ENABLE_EXPERIMENTAL_NEW_PASS_MANAGER Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D92867	2020-12-14 10:15:13 -08:00
Michael Liao	1fd1f638b6	[amdgpu] Fix a crash case when `V_CNDMASK` could be simplified. - Once an instruction is simplified, foldable candidates from it should be invalidated or skipped as the operand index is no longer valid. Differential Revision: https://reviews.llvm.org/D93174	2020-12-14 13:08:13 -05:00

1 2 3 4 5 ...

374892 Commits All Branches Search

374892 Commits

All Branches