llvm-project

Commit Graph

Author	SHA1	Message	Date
Sheng	aab5bd180a	[ADT] Adopt the new casting infrastructure for PointerUnion Reviewed By: lattner, bzcheeseman Differential Revision: https://reviews.llvm.org/D125609	2022-05-16 18:40:05 +08:00
Abinav Puthan Purayil	485dd0b752	[GlobalISel] Handle constant splat in funnel shift combine This change adds the constant splat versions of m_ICst() (by using getBuildVectorConstantSplat()) and uses it in matchOrShiftToFunnelShift(). The getBuildVectorConstantSplat() name is shortened to getIConstantSplatVal() so that the *SExtVal() version would have a more compact name. Differential Revision: https://reviews.llvm.org/D125516	2022-05-16 16:03:30 +05:30
bzcheeseman	0809f63826	[LLVM][Casting.h] Add trivial self-cast Casting from a type to itself should always be possible. Make this simple for all users, and add tests to ensure we keep being able to do this. Ref: https://reviews.llvm.org/D125543 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D125590	2022-05-15 22:22:16 -07:00
Alex Brachet	a74d9e74e5	[ifs] Add --strip-size flag st_size may not be of importance to the abi if you are not using copy relocations. This is helpful when you want to check the abi of a shared object both when instrumented and not because asan will increase the size of objects to include the redzone. Differential revision: https://reviews.llvm.org/D124792	2022-05-14 18:50:20 +00:00
Alex Brachet	1f61260847	Revert "[ifs] Add --strip-size flag" This reverts commit `b6b0fd6a94`.	2022-05-14 17:33:27 +00:00
Alex Brachet	b6b0fd6a94	[ifs] Add --strip-size flag st_size may not be of importance to the abi if you are not using copy relocations. This is helpful when you want to check the abi of a shared object both when instrumented and not because asan will increase the size of objects to include the redzone. Differential revision: https://reviews.llvm.org/D124792	2022-05-14 17:25:50 +00:00
Jay Foad	169ae6db69	[APInt] Allow extending and truncating to the same width Allow zext, sext, trunc, truncUSat and truncSSat to extend or truncate to the same bit width, which is a no-op. Disallowing this forced clients to use workarounds like using zextOrTrunc (even though they never wanted truncation) or zextOrSelf (even though they did not want its strange behaviour of allowing a smaller bit width, which is also treated as a no-op). Differential Revision: https://reviews.llvm.org/D125556	2022-05-14 09:54:24 +01:00
Simon Pilgrim	345ed58ed5	Fix implicit double -> float truncation warnings. NFCI.	2022-05-13 19:07:00 +01:00
bzcheeseman	0be41ed5bb	[LLVM][Casting.h] Don't create a temporary while casting. C-style casting can create a temporary when compiled by a C++ compiler, which was emitting a warning casting a reference to another reference. We can't use C++-style casting directly because it doesn't always work with incomplete types. In order to support the current use-cases, for references we switch to pointer space to perform the cast. Reviewed By: qiongsiwu1 Differential Revision: https://reviews.llvm.org/D125482	2022-05-12 23:11:02 -04:00
Krasimir Georgiev	52328dafda	silence new -Wunused-result warnings in test No functional changes intended. After `f156b51aec`, new -Wunused-result warnings popped up in this test: https://buildkite.com/llvm-project/upstream-bazel/builds/28320#bc3ec049-af39-4114-b7b8-4cbc180bc09b	2022-05-12 08:30:36 +02:00
bzcheeseman	f156b51aec	[LLVM][Casting.h] Update dyn_cast machinery to provide more control over how the casting is performed. This patch expands the expressive capability of the casting utilities in LLVM by introducing several levels of configurability. By creating modular CastInfo classes we can enable projects like MLIR that need more fine-grained control over how a cast is actually performed to retain that control, while making it easy to express the easy cases (like a checked pointer to pointer cast). The current implementation of Casting.h doesn't make it clear where the entry points for customizing the cast behavior are, so part of the motivation for this patch is adding that documentation. Another part of the motivation is to support using LLVM RTTI with a wider set of use cases, such as nullable value to value casts, or pointer to value casts (as in MLIR). Reviewed By: lattner, rriddle Differential Revision: https://reviews.llvm.org/D123901	2022-05-12 00:15:09 -04:00
River Riddle	5a9a438a54	[TableGen] Refactor TableGenParseFile to no longer use a callback Now that TableGen no longer relies on global Record state, we can allow for the client to own the RecordKeeper and SourceMgr. Given that TableGen internally still relies on the global llvm::SrcMgr, this method unfortunately still isn't thread-safe. Differential Revision: https://reviews.llvm.org/D125277	2022-05-11 11:55:33 -07:00
Arthur Eubanks	7e0802aeb5	[BasicAA] Fix order in which we pass MemoryLocations to alias() D98718 caused the order of Values/MemoryLocations we pass to alias() to be significant due to storing the offset in the PartialAlias case. But some callers weren't audited and were still passing swapped arguments, causing the returned PartialAlias offset to be negative in some cases. For example, the newly added unittests would return -1 instead of 1. Fixes #55343, a miscompile. Reviewed By: asbirlea, nikic Differential Revision: https://reviews.llvm.org/D125328	2022-05-10 12:05:38 -07:00
Andrew Litteken	96345f773c	[IRSim] Remove early check from similarity matching such that commutative instructions are checked correctly when using the same value. When the first commutative instruction in a region using the same value in both positions was compared to a corresponding instruction with two different values, there was an early check that determined that since the values were new, it was true that these values acted in the same way structurally. If this was not contradicted later in the program, the regions were marked as similar. This removes that check, so that it is clear that the same value cannot be mapped to two different values. Reviewer: paquette Differential Revision: https://reviews.llvm.org/D124775	2022-05-09 22:59:09 -05:00
Mircea Trofin	c35ad9ee4f	[mlgo] Support exposing more features than those supported by models This allows the compiler to support more features than those supported by a model. The only requirement (development mode only) is that the new features must be appended at the end of the list of features requested from the model. The support is transparent to compiler code: for unsupported features, we provide a valid buffer to copy their values; it's just that this buffer is disconnected from the model, so insofar as the model is concerned (AOT or development mode), these features don't exist. The buffers are allocated at setup - meaning, at steady state, there is no extra allocation (maintaining the current invariant). These buffers has 2 roles: one, keep the compiler code simple. Second, allow logging their values in development mode. The latter allows retraining a model supporting the larger feature set starting from traces produced with the old model. For release mode (AOT-ed models), this decouples compiler evolution from model evolution, which we want in scenarios where the toolchain is frequently rebuilt and redeployed: we can first deploy the new features, and continue working with the older model, until a new model is made available, which can then be picked up the next time the compiler is built. Differential Revision: https://reviews.llvm.org/D124565	2022-05-09 18:01:21 -07:00
Nathan Sidwell	bc150a07f1	[demangler] No need to space adjacent template closings With the demangler parenthesizing 'a >> b' inside template parameters, because C++11 parsing of >> there, we don't really need to add spaces between adjacent template arg closing '>' chars. In 2022, that just looks odd. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D123134	2022-05-09 06:14:44 -07:00
Philipp Tomsich	91b24b0180	[AArch64] Ampere1 does not support MTE The initial support for the Ampere1 mistakenly signalled support for the MTE feature. However, the core does not include the optional MTE functionality. Update the target parser to not include MTE for Ampere1. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D125191	2022-05-09 11:29:42 +02:00
Stella Laurenzo	6dedbcd5e9	Make BinaryStreamWriter::padToAlignment write blocks vs bytes. While I think this is a performance improvement over the original, this actually fixes a correctness issue: For an appendable underlying stream, padToAlignment would fail if the additional padding would have caused the stream to grow since it was doing its own check on bounds. By deferring to the regular writeArray method this takes the same path as everything else, which does the correct bounds check in WritableBinaryStreamRef::checkOffsetForWrite (i.e. skips the extension check if BSF_Append is set). I had started to fix the existing bounds check in BinaryStreamWriter but deferred to this because it layered better and is more efficient/consistent. It didn't look like this method was tested at all, so I added a unit test. Differential Revision: https://reviews.llvm.org/D124746	2022-05-07 17:37:18 -07:00
Sam McCall	56ee5d9337	[Support] Fix asan AllocatorTest after `ba0d50ad7e` We were counting the number of bytes allocated, but under asan there's extra redzone bytes by default. Disable this.	2022-05-06 15:51:37 +02:00
Sam McCall	ba0d50ad7e	[Support] Fix UB in BumpPtrAllocator when first allocation is zero. BumpPtrAllocator::Allocate() is marked __attribute__((returns_nonnull)) when the compiler supports it, which makes it UB to return null. When there have been no allocations yet, the current slab is [nullptr, nullptr). A zero-sized allocation fits in this range, and so Allocate(0, 1) returns null. There's no explicit docs whether Allocate(0) is valid. I think we have to assume that it is: - the implementation tries to support it (e.g. >= tests instead of >) - malloc(0) is allowed - requiring each callsite to do a check is bug-prone - I found real LLVM code that makes zero-sized allocations Differential Revision: https://reviews.llvm.org/D125040	2022-05-06 08:57:27 +02:00
Lang Hames	98616cfc02	[ORC] Add an ExecutorAddr::toPtr overload for function types. In the common case of converting an ExecutorAddr to a function pointer type, this eliminates the need for the '()' boilerplate to explicitly specify a function pointer. E.g.: auto F = A.toPtr<int()()>(); can now be written as auto F = A.toPtr<int()>();	2022-05-05 12:37:23 -07:00
Teresa Johnson	655294866c	[memprof] Use unknown_function error type for missing functions Switch the error type when a function is not found in the memprof profile to unknown_function. This gives compatibility with normal PGO function matching, and also prevents issuing large numbers of additional matching errors since pgo-warn-missing-function is off by default. Differential Revision: https://reviews.llvm.org/D124953	2022-05-04 13:02:30 -07:00
Luboš Luňák	8ef5710e63	[ThreadPool] add ability to group tasks into separate groups This is needed for parallelizing of loading modules symbols in LLDB (D122975). Currently LLDB can parallelize indexing symbols when loading a module, but modules are loaded sequentially. If LLDB index cache is enabled, this means that the cache loading is not parallelized, even though it could. However doing that creates a threadpool-within-threadpool situation, so the number of threads would not be properly limited. This change adds ThreadPoolTaskGroup as a simple type that can be used with ThreadPool calls to put tasks into groups that can be independently waited for (even recursively from within a task) but still run in the same thread pool. Differential Revision: https://reviews.llvm.org/D123225	2022-05-04 06:16:55 +02:00
Chris Bieneman	15d20b9764	Fix DXBC magic parsing This gets identify_magic working correctly for DXContainer files	2022-05-03 14:41:48 -07:00
Philipp Tomsich	7e02bc5237	[AArch64] Add native CPU detection for Ampere1 Map the IMPLEMENTOR ID 0xc0 (Ampere Computing) and CPU ID 0xac3 (Ampere1) to ampere1. Differential Revision: https://reviews.llvm.org/D117111	2022-05-03 16:10:02 +01:00
Philipp Tomsich	64816e68f4	[AArch64] Support for Ampere1 core Add support for the Ampere Computing Ampere1 core. Ampere1 implements the AArch64 state and is compatible with ARMv8.6-A. Differential Revision: https://reviews.llvm.org/D117112	2022-05-03 15:54:02 +01:00
Simon Tatham	32814df442	[Windows] Fix handling of \" in program name on cmd line. Bugzilla #47579: if you invoke clang on Windows via a pathname in which a quoted section closes just after a backslash, e.g. "C:\Program Files\Whatever\"clang.exe then cmd.exe and CreateProcess will correctly find the binary, because when they parse the program name at the start of the command line, they don't regard the \ before the " as having any kind of escaping effect. This is different from the behaviour of the Windows standard C library when it parses the rest of the command line, which would consider that \" not to close the quoted string. But this confuses windows::GetCommandLineArguments, because the Windows API function GetCommandLineW() will return a command line containing that \" sequence, and cl::TokenizeWindowsCommandLine will tokenize the whole string according to the C library's rules. So it will misidentify where the program name stops and the arguments start. To fix this, I've introduced a new variant function cl::TokenizeWindowsCommandLineFull(), intended to be applied to the string returned from GetCommandLineW(). It parses the first word of the command line according to CreateProcess's rules, considering \ to never be an escaping character; thereafter, it switches over to the C library rules for the rest of the command line. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D122914	2022-05-03 11:57:50 +01:00
Simon Tatham	1be024ee45	[Windows] Fix cmd line tokenization of unclosed quotes. When cl::TokenizeWindowsCommandLine received a command line with an unterminated double-quoted string at the end, it would discard the text within that string. That doesn't match the behavior of the standard Windows C library, which will return the text in the unclosed quoted string as an argv word. Fixed, and added extra unit tests in that area. In some cases (specifically the one in Bugzilla #47579) this could cause TokenizeWindowsCommandLine to return a zero-length list of arguments, leading to an array overrun at the call site in windows::GetCommandLineArguments. Added a check there, for extra safety: now windows::GetCommandLineArguments will return an error code instead of failing an assertion. (This change was written as part of https://reviews.llvm.org/D122914, but split into a separate commit at the last minute at the code reviewer's suggestion, because it's fixing an unrelated bug in the same area. The rest of D122914 will follow in the next commit.)	2022-05-03 11:57:49 +01:00
Chris Bieneman	966c40aea6	[Object][DX] Identify DXBC file magic This adds support to llvm::identify_magic to detect DXBC and classify it as the dxcontainer format.	2022-05-02 16:24:36 -05:00
Chris Bieneman	55e13a6bc0	[NFC] Fix warning reported on bots	2022-05-02 15:02:44 -05:00
Chris Bieneman	4070aa0156	[Object][DX] Initial DXContainer parsing support This patch begins adding DXContainer parsing support to libObject. Following the pattern used by ELFFile my goal here is to write a standalone DXContainer parser and later write an adapter interface to support a subset of the ObjectFile interfaces so that we can add limited objdump support. I will also be adding ObjectYAML support to help drive testing of the object tools and MC-level object writers as those come together. DXContainer is a slightly odd format. It is arranged in "parts" that are semantically similar to sections, but it doesn't support symbol listing. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D124643	2022-05-02 13:56:33 -05:00
Jack Andersen	09325d3606	[CAPI] Expose CastInst::getCastOpcode in C API Reviewed By: deadalnix Differential Revision: https://reviews.llvm.org/D91514	2022-04-30 18:40:04 -04:00
Ties Stuij	051deb2d9d	[ARM] add Armv9 build attribute The build attribute number can be found in the Arm ABI addenda32 document: https://github.com/ARM-software/abi-aa/blob/main/addenda32/addenda32.rst#335target-related-attributes Reviewed By: tmatheson Differential Revision: https://reviews.llvm.org/D124090	2022-04-28 10:48:26 +01:00
Michael Kruse	ff289feeba	[OpenMPIRBuilder] Remove ContinuationBB argument from Body callback. The callback is expected to create a branch to the ContinuationBB (sometimes called FiniBB in some lambdas) argument when finishing. This creates problems: 1. The InsertPoint used for CodeGenIP does not need to be the end of a block. If it is not, a naive callback will insert a branch instruction into the middle of the block. 2. The BasicBlock the CodeGenIP is pointing to may or may not have a terminator. There is an conflict where to branch to if the block already has a terminator. 3. Some API functions work only with block having a terminator. Some workarounds have been used to insert a temporary terminator that is removed again. 4. Some callbacks are sensitive to whether the BasicBlock has a terminator or not. This creates a callback ordering problem where different callback may have different behaviour depending on whether a previous callback created a terminator or not. The problem also exists for FinalizeCallbackTy where some callbacks do create branch to another "continue" block, but unlike BodyGenCallbackTy does not receive the target as argument. This is not addressed in this patch. With this patch, the callback receives an CodeGenIP into a BasicBlock where to insert instructions. If it has to insert control flow, it can split the block at that position as needed but otherwise no separate ContinuationBB is needed. In particular, a callback can be empty without breaking the emitted IR. If the caller needs the control flow to branch to a specific target, it can insert the branch instruction itself and pass an InsertPoint before the terminator to the callback. Certain frontends such as Clang may expect the current IRBuilder position to be at the end of a basic block. In this case its callbacks must split the block at CodeGenIP before setting the IRBuilder position such that the instructions after CodeGenIP are moved to another basic block and before returning create a new branch instruction to the split block. Some utility functions such as `splitBB` are supporting correct splitting of BasicBlocks, independent of whether they have a terminator or not, returning/setting the InsertPoint of an IRBuilder to the end of split predecessor block, and optionally omitting creating a branch to the split successor block to be added later. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D118409	2022-04-26 16:35:01 -05:00
Jeremy Morse	65d5beca13	Reapply D124184, [DebugInfo][InstrRef] Add a size operand to DBG_PHI This was reverted twice, in `987cd7c3ed` and `13815e8cbf`. The latter stemed from not accounting for rare register classes in a pre-allocated array, and the former from an array not being completely initialized, leading to asan complaining.	2022-04-26 15:49:22 +01:00
Alexey Lapshin	854c33946f	[llvm-gsymutil][NFC] refactor AddressRange&AddresRanges structures. llvm-gsymutil has an implementation of AddressRange and AddressRanges classes. That implementation might be reused in other parts of llvm. This patch moves AddressRange and AddressRanges classes into llvm/ADT. Differential Revision: https://reviews.llvm.org/D124350	2022-04-26 12:00:43 +03:00
Mircea Trofin	b1fa5ac3ba	[mlgo] Factor out TensorSpec This is a simple datatype with a few JSON utilities, and is independent of the underlying executor. The main motivation is to allow taking a dependency on it on the AOT side, and allow us build a correctly-sized buffer in the cases when the requested feature isn't supported by the model. This, in turn, allows us to grow the feature set supported by the compiler in a backward-compatible way; and also collect traces exposing the new features, but starting off the older model, and continue training from those new traces. Differential Revision: https://reviews.llvm.org/D124417	2022-04-25 18:35:46 -07:00
Chris Bieneman	e6f44a3cd2	Add PointerType analysis for DirectX backend As implemented this patch assumes that Typed pointer support remains in the llvm::PointerType class, however this could be modified to use a different subclass of llvm::Type that could be disallowed from use in other contexts. This does not rely on inserting typed pointers into the Module, it just uses the llvm::PointerType class to track and unique types. Fixes #54918 Reviewed By: kuhar Differential Revision: https://reviews.llvm.org/D122268	2022-04-25 17:49:43 -05:00
Jeremy Morse	987cd7c3ed	Revert "Reapply D124184, [DebugInfo][InstrRef] Add a size operand to DBG_PHI" This reverts commit `5db9250231`. Further to the early revert, the sanitizers have found something wrong with this.	2022-04-25 23:30:15 +01:00
Frederik Gossen	8fbf9acc8c	Add missing comparison operators to SmallVector Differential Revision: https://reviews.llvm.org/D124407	2022-04-25 18:18:14 -04:00
David Green	9727c77d58	[NFC] Rename Instrinsic to Intrinsic	2022-04-25 18:13:23 +01:00
Nathan Sidwell	c47bcf9af6	[demangler][NFC] OperatorInfo table unit test Placing a run-once test inside the operator lookup function caused problems with the thread sanitizer. See D122975. Break out the operator table into a member variable, and move the test to the unit test machinery. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D123390	2022-04-25 10:02:08 -07:00
Jeremy Morse	5db9250231	Reapply D124184, [DebugInfo][InstrRef] Add a size operand to DBG_PHI This was applied in `fda4305e53`, reverted in `13815e8cbf`, the problem was that fp80 X86 registers that were spilt to the stack aren't expected by LiveDebugValues. It pre-allocates a position number for all register sizes that can be spilt, and 80 bits isn't exactly common. The solution is to scan the register classes to find any unrecognised register sizes, adn pre-allocate those position numbers, avoiding a later assertion.	2022-04-25 15:50:15 +01:00
Shraiysh Vaishay	a5c52ff0d4	[OpenMP][IRBuilder] Handle unexcuted EXPECT_FALSE This patch addresses the comment about unexecuted test in D122371. Reviewed By: probinson Differential Revision: https://reviews.llvm.org/D123920	2022-04-25 09:08:29 +05:30
Alexander Yermolovich	c87d405b22	[DWARF] Add API to get data from MCDwarfLineStr This API will be used in D121876, to get finalized string data for .debug_line_str. Reviewed By: dblaikie, rafauler Differential Revision: https://reviews.llvm.org/D124052	2022-04-21 14:08:20 -07:00
Ulrich Weigand	1283ccb610	Support z16 processor name The recently announced IBM z16 processor implements the architecture already supported as "arch14" in LLVM. This patch adds support for "z16" as an alternate architecture name for arch14.	2022-04-21 19:58:22 +02:00
Matt Arsenault	507259820a	GlobalISel: Add LegalizeMutations to help use More/FewerElements	2022-04-19 21:04:32 -04:00
Matt Arsenault	12d79b1514	GlobalISel: Add LLT helper to multiply vector sizes	2022-04-19 21:04:32 -04:00
Ilia Diachkov	6c69427e88	[SPIR-V](3/6) Add MC layer, object file support, and InstPrinter The patch adds SPIRV-specific MC layer implementation, SPIRV object file support and SPIRVInstPrinter. Differential Revision: https://reviews.llvm.org/D116462 Authors: Aleksandr Bezzubikov, Lewis Crawford, Ilia Diachkov, Michal Paszkowski, Andrey Tretyakov, Konrad Trifunovic Co-authored-by: Aleksandr Bezzubikov <zuban32s@gmail.com> Co-authored-by: Ilia Diachkov <iliya.diyachkov@intel.com> Co-authored-by: Michal Paszkowski <michal.paszkowski@outlook.com> Co-authored-by: Andrey Tretyakov <andrey1.tretyakov@intel.com> Co-authored-by: Konrad Trifunovic <konrad.trifunovic@intel.com>	2022-04-20 01:10:25 +02:00
Michael Kruse	2d92ee97f1	Reapply "[OpenMP] Refactor OMPScheduleType enum." This reverts commit `af0285122f`. The test "libomp::loop_dispatch.c" on builder openmp-gcc-x86_64-linux-debian fails from time-to-time. See #54969. This patch is unrelated.	2022-04-18 21:56:47 -05:00

1 2 3 4 5 ...

7665 Commits