llvm-project

Commit Graph

Author	SHA1	Message	Date
Keith Smiley	066243057f	[Object] Fix updating darwin archives When creating an archive, llvm-ar looks at the host to determine the archive format to use, on Apple platforms this means it uses the K_DARWIN format. K_DARWIN is _virtually_ equivalent to K_BSD, expect for some very slight differences around padding, timestamps in deterministic mode, and 64 bit formats. When updating an archive using llvm-ar, or llvm-objcopy, Archive would try to determine the kind, but it was not possible to get K_DARWIN in the initialization of the archive, because they're virtually inciting usable from K_BSD, especially since the slight differences only apply in very specific cases. This leads to linker failures when the alignment workaround is not applied to an archive copied with llvm-objcopy. This change teaches Archive to infer the K_DARWIN type in the cases where it's possible and the first object in the archive is a macho object. This avoids using the host triple to determine this to not affect cross compiling. Ideally we would eliminate the separate K_DARWIN type entirely since it's not a truly separate archive type, but then we'd have to force the macho workarounds on the BSD format generally. This might be acceptable but then it would be unclear how to handle this case without forcing the K_DARWIN64 format on all BSD users: ``` if (LastOffset >= Sym64Threshold) { if (Kind == object::Archive::K_DARWIN) Kind = object::Archive::K_DARWIN64; else Kind = object::Archive::K_GNU64; } ``` The logic used to determine if the object is macho is derived from the logic llvm-ar uses. Previous context: - `111cd669e9` - `23a76be5ad` Differential Revision: https://reviews.llvm.org/D124895	2022-05-19 10:56:26 -07:00
Lang Hames	4bb18a89c4	[ORC] Add missing std::moves, pass SymbolLookupSet by value. Avoids some unnecessary SymbolStringPtr copies.	2022-05-19 10:51:20 -07:00
Paul Walker	d640442518	[NFC] Fix a couple of whitespace issues.	2022-05-19 17:27:09 +00:00
Sotiris Apostolakis	ca7c307d18	[SelectOpti][1/5] Setup new select-optimize pass This is the first commit for the cmov-vs-branch optimization pass. The goal is to develop a new profile-guided and target-independent cost/benefit analysis for selecting conditional moves over branches when optimizing for performance. Initially, this new pass is expected to be enabled only for instrumentation-based PGO. RFC: https://discourse.llvm.org/t/rfc-cmov-vs-branch-optimization/6040 Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D120230	2022-05-19 16:31:10 +00:00
Joseph Huber	dbffa4073c	[NVVM] Update intrinsic defintions to include the `nocallback` attribute This patch adds the `nocallback` attribute to the NVVM intrinsics that did not use the `DefaultAttrsIntrinsic` method that includes it already. The `nocallback` attribute states that the intrinsic function cannot enter back into the caller's translation-unit. This allows as to determine that a function calling a `nocallback` function can have the `norecurse` attribute. This should be safe for all the NVVM intrinsics because they do not call other functions within the translation unit. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D125937	2022-05-19 12:30:35 -04:00
Amy Kwan	c35ca3a1c7	[PowerPC] Implement XL compat __fnabs and __fnabss builtins. This patch implements the following floating point negative absolute value builtins that required for compatibility with the XL compiler: ``` double __fnabs(double); float __fnabss(float); ``` These builtins will emit : - fnabs on PWR6 and below, or if VSX is disabled. - xsnabsdp on PWR7 and above, if VSX is enabled. Differential Revision: https://reviews.llvm.org/D125506	2022-05-19 11:28:40 -05:00
Jay Foad	4e432f1b7c	[APInt] Deprecate truncOrSelf, zextOrSelf and sextOrSelf Differential Revision: https://reviews.llvm.org/D125558	2022-05-19 11:23:13 +01:00
Michael Kruse	797fabaab2	[Analysis] Avoid virtual dtor. NFC. Replace virtual destructor by a protected non-virtual one. Additionally also making derived structs as virtual avoids the warning from reappearing. Also see the mailing list discussion: https://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20220516/1038290.html Reviewed By: dblaikie, YangKeao Differential Revision: https://reviews.llvm.org/D125830	2022-05-18 17:41:17 -05:00
Yusra Syeda	1e14b1a797	[SystemZ][z/OS] Add missing include to llvm/include/llvm/BinaryFormat/GOFF.h Differential Revision: https://reviews.llvm.org/D125921	2022-05-18 15:57:05 -04:00
Michael Kitzan	29bebb0237	[GISel] Add new combines for G_FMINNUM/MAXNUM and G_FMINIMUM/MAXIMUM I noticed https://reviews.llvm.org/D87415 added SDAG combines to fold FMIN/MAX instrs with NaNs. The patch implements the same NaN combines for GISel GMIR FMIN/MAX opcodes: G_FMINNUM(X, NaN) -> X G_FMAXNUM(X, NaN) -> X G_FMINIMUM(X, NaN) -> NaN G_FMAXIMUM(X, NaN) -> NaN The patch adds AArch64 tests for these combines as well. Reviewed by: arsenm Differential revision: https://reviews.llvm.org/D125819	2022-05-18 12:08:53 -07:00
Martin Storsjö	d4257fbbba	[llvm-readobj] Improve printing of Windows ARM packed unwind info Fix a couple minor details in the existing logic for calculating saved registers and stack adjustment. Synthesize the corresponding prologues and epilogues and print them. (This supersedes the previous printout of one single list of stored registers; as there's lots of minor nuance differences in how registers are pushed/popped in various corner cases, it's better to print the full prologue/epilogue instead of trying to condense it into one single list.) Print the raw values of the fields Reg, R, L (LinkRegister) and C (Chaining) instead of only printing the derived values. Differential Revision: https://reviews.llvm.org/D125644	2022-05-18 21:33:08 +03:00
Yusra Syeda	5ac411aea8	[SystemZ][z/OS] Add the PPA1 to SystemZAsmPrinter Differential Revision: https://reviews.llvm.org/D125725	2022-05-18 14:13:17 -04:00
Nikita Popov	e1d47d86d8	[IR] Report whether replaceUsesOfWith() changed something (NFC) With change reporting in transformation passes in mind.	2022-05-18 11:46:28 +02:00
Nikita Popov	e9a1c82d69	[SCEVExpander] Expand umin_seq using freeze %x umin_seq %y is currently expanded to %x == 0 ? 0 : umin(%x, %y). This patch changes the expansion to umin(%x, freeze %y) instead (https://alive2.llvm.org/ce/z/wujUhp). The motivation for this change are the test cases affected by D124910, where the freeze expansion ultimately produces better optimization results. This is largely because `(%x umin_seq %y) == %x` is a common expansion pattern, which reliably optimizes in freeze representation, but only sometimes with the zero comparison (in particular, if %x == 0 can fold to something else, we generally won't be able to cover reasonable code from this.) Differential Revision: https://reviews.llvm.org/D125372	2022-05-18 09:53:07 +02:00
Stanislav Mekhanoshin	dee3190293	[AMDGPU] Add llvm.amdgcn.global.load.lds intrinsic Differential Revision: https://reviews.llvm.org/D125279	2022-05-17 12:35:27 -07:00
Stanislav Mekhanoshin	791ec1c68e	[AMDGPU] Add intrinsics llvm.amdgcn.{raw\|struct}.buffer.load.lds Differential Revision: https://reviews.llvm.org/D124884	2022-05-17 10:32:13 -07:00
Ruobing Han	e1cf702a02	fix typo error in DivergenceAnalysis.h Fix a typo error in the comment in DivergenceAnalysis.h Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D125808	2022-05-17 17:20:17 +00:00
Michael Kruse	bd93df937a	[Polly] Mark classes as final by default. NFC. This make is obivious that a class was not intended to be derived from. NPM analysis pass can unfortunately not marked as final because they are derived from a llvm::Checker<T> template internally by the NPM. Also normalize the use of classes/structs * NPM passes are structs * Legacy passes are classes * structs that have methods and are not a visitor pattern are classes * structs have public inheritance by default, remove "public" keyword * Use typedef'ed type instead of inline forward declaration	2022-05-17 12:05:39 -05:00
Nikita Popov	5df22e507b	[IRBuilder] Move insertvalue/extractvalue to fold infrastructure Move from the old CreateXYZ() to the new FoldXYZ() mechanism. This change is likely NFC in practice, because I don't think that the places using InstSimplifyFolder use insertvalue/extractvalue.	2022-05-17 16:04:55 +02:00
Jay Foad	77480556c4	[RegAllocGreedy] New hook regClassPriorityTrumpsGlobalness Add a new TargetRegisterInfo hook to allow targets to tweak the priority of live ranges, so that AllocationPriority of the register class will be treated as more important than whether the range is local to a basic block or global. This is determined per-MachineFunction. Differential Revision: https://reviews.llvm.org/D125102	2022-05-17 12:35:21 +01:00
Alexey Lapshin	4d9c083437	[DWARFLinker][NFC] Add None value to the DwarfLinkerAccelTableKind enum. this review is extracted from D86539. 1. Rename AccelTableKind to DwarfLinkerAccelTableKind (to differentiate from AccelTableKind from CodeGen/AsmPrinter/DwarfDebug.h) 2. Add None value to the DwarfLinkerAccelTableKind. 3. added 'None' value for 'accelerator' option of dsymutil. Differential Revision: https://reviews.llvm.org/D125474	2022-05-17 12:32:32 +03:00
Nikita Popov	a694546f7c	[KnownBits] Add operator== Checking whether two KnownBits are the same is somewhat common, mainly in test code. I don't think there is a lot of room for confusion with "determine what the KnownBits for an icmp eq would be", as that has a different result type (this is what the eq() method implements, which returns Optional<bool>). Differential Revision: https://reviews.llvm.org/D125692	2022-05-17 09:38:13 +02:00
luxufan	63c81b23be	[RISCV] Support getHostCpuName for sifive-u74 Reviewed By: kito-cheng Differential Revision: https://reviews.llvm.org/D123978	2022-05-17 14:06:59 +08:00
Yang Keao	7dce9eb6e5	[DomPrinter] Migrate -dot-dom to the new pass manager. In D123677, @YangKeao provided an implementation of `DOTGraphTraits{Viewer,Printer}` in the new pass manager. This commit migrates the `DomPrinter` and `DomViewer` to the new pass manager. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D124904	2022-05-16 15:07:16 -05:00
Paul Walker	7dd05ba9ed	[SelectionDAG] Remove duplicate "is scaled" information from gather/scatter SDNodes. During early gather/scatter enablement two different approaches were taken to represent scaled indices: * A Scale operand whereby byte_offsets = Index * Scale * An IndexType whereby byte_offsets = Index * sizeof(MemVT.ElementType) Having multiple representations is bad as shown by this patch which fixes instances where the two are out of sync. The dedicated scale operand is more flexible and pervasive so this patch removes the UNSCALED values from IndexType. This means all indices are scaled but the scale can be one, hence unscaled. SDNodes now use the scale operand to answer the "isScaledIndex" question. I toyed with the idea of keeping the UNSCALED enums and helper functions but because they will have no uses and force SDNodes to validate the set of supported values I figured it's best to remove them. We can re-add them if there's a real need. For similar reasons I've kept the IndexType enum when a bool could be used as I think being explicitly looks better. Depends On D123347 Differential Revision: https://reviews.llvm.org/D123381	2022-05-16 20:47:52 +01:00
zhijian	52c615553c	[AIX] fixed llvm-ar can not read empty big archive correctly. Summary: llvm-ar can not read empty big archive correctly. it output error as error: unable to load 'empty.a': truncated or malformed archive (characters in size field in archive member header are not all decimal numbers: '<bigaf>' Reviewers: James Henderson Differential Revision: https://reviews.llvm.org/D124017	2022-05-16 14:29:37 -04:00
Rahman Lavaee	5f7ef65245	[llvm-objdump] Let --symbolize-operands symbolize basic block addresses based on the SHT_LLVM_BB_ADDR_MAP section. `--symbolize-operands` already symbolizes branch targets based on the disassembly. When the object file is created with `-fbasic-block-sections=labels` (ELF-only) it will include a SHT_LLVM_BB_ADDR_MAP section which maps basic blocks to their addresses. In such case `llvm-objdump` can annotate the disassembly based on labels inferred on this section. In contrast to the current labels, SHT_LLVM_BB_ADDR_MAP-based labels are created for every machine basic block including empty blocks and those which are not branched into (fallthrough blocks). The old logic is still executed even when the SHT_LLVM_BB_ADDR_MAP section is present to handle functions which have not been received an entry in this section. Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D124560	2022-05-16 10:11:11 -07:00
Adrian Prantl	9c7c8be4a3	Remove stale file from modulemap	2022-05-16 10:05:38 -07:00
Hongtao Yu	acfd0a3456	[llvm-profgen] Update callsite body samples by summing up all call target samples. Current profile generation caculcates callsite body samples and call target samples separately. The former is done based on LBR range samples while the latter is done based on branch samples. Note that there's a subtle difference. LBR ranges is formed from two consecutive branch samples. Therefore the last entry in a LBR record will not be counted towards body samples while there's still a chance for it to be counted towards call targets if it is a function call. I'm making sense of the call body samples by updating it to the aggregation of call targets. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D122609	2022-05-16 09:13:37 -07:00
Ellis Hoag	6e23cd2bf0	[InstrProf][NFC] Save profile bias to function map Add a map from functions to load instructions that compute the profile bias. Previously we assumed that if the first instruction in the function was a load instruction, then it must be computing the bias. This was likely to work out because functions usually start with the `llvm.instrprof.increment` instruction, but optimizations could change this. For example, inlining into a non-profiled function. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D114319	2022-05-16 08:32:31 -07:00
Sanjay Patel	be7f09f7b2	[IR] create and use helper functions that test the signbit; NFCI	2022-05-16 11:26:23 -04:00
Philip Reames	55e2df7285	[LiveIntervals] Add range accessors for value numbers [nfc]	2022-05-16 08:23:12 -07:00
Florian Hahn	b7315ffc3c	[LAA,LV] Add initial support for pointer-diff memory checks. This patch adds initial support for a pointer diff based runtime check scheme for vectorization. This scheme requires fewer computations and checks than the existing full overlap checking, if it is applicable. The main idea is to only check if source and sink of a dependency are far enough apart so the accesses won't overlap in the vector loop. To do so, it is sufficient to compute the difference and compare it to the `VF * UF * AccessSize`. It is sufficient to check `(Sink - Src) <u VF * UF * AccessSize` to rule out a backwards dependence in the vector loop with the given VF and UF. If Src >=u Sink, there is not dependence preventing vectorization, hence the overflow should not matter and using the ULT should be sufficient. Note that the initial version is restricted in multiple ways: 1. Pointers must only either be read or written, by a single instruction (this allows re-constructing source/sink for dependences with the available information) 2. Source and sink pointers must be add-recs, with matching steps 3. The step must be a constant. 3. abs(step) == AccessSize. Most of those restrictions can be relaxed in the future. See https://github.com/llvm/llvm-project/issues/53590. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D119078	2022-05-16 15:27:22 +01:00
Nikita Popov	8ab819ad90	[ConstantRange] Add toKnownBits() method Add toKnownBits() method to mirror fromKnownBits(). We know the top bits that are constant between min and max. The return value for an empty range is chosen to be conservative.	2022-05-16 16:12:25 +02:00
Liqin.Weng	ff3f4988ed	[CodeGen] Use ArrayRef in TargetLowering functions Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D123656	2022-05-16 13:30:58 +00:00
Sheng	aab5bd180a	[ADT] Adopt the new casting infrastructure for PointerUnion Reviewed By: lattner, bzcheeseman Differential Revision: https://reviews.llvm.org/D125609	2022-05-16 18:40:05 +08:00
Abinav Puthan Purayil	485dd0b752	[GlobalISel] Handle constant splat in funnel shift combine This change adds the constant splat versions of m_ICst() (by using getBuildVectorConstantSplat()) and uses it in matchOrShiftToFunnelShift(). The getBuildVectorConstantSplat() name is shortened to getIConstantSplatVal() so that the *SExtVal() version would have a more compact name. Differential Revision: https://reviews.llvm.org/D125516	2022-05-16 16:03:30 +05:30
Nicolas Abram Lujan	436bbce765	[llvm-c] Add functions for enabling and creating opaque pointers This is based on https://reviews.llvm.org/D125168 which adds a wrapper to allow use of opaque pointers from the C API. I added an opaque pointer mode test to echo.ll, and to fix assertions that forbid the use of mixed typed and opaque pointers that were triggering in it I had to also add wrappers for setOpaquePointers() and isOpaquePointer(). I also changed echo.ll to remove a bitcast i32* %x to i8*, because passing it through llvm-as and llvm-dis was generating a %0 = bitcast ptr %x to ptr, but when building that same bitcast in echo.cpp it was getting elided by IRBuilderBase::CreateCast (`08ac661248/llvm/include/llvm/IR/IRBuilder.h (L1998-L1999)`). Differential Revision: https://reviews.llvm.org/D125183	2022-05-16 10:53:46 +02:00
stk	9902a0945d	Add ThreadPriority::Low, and use QoS class Utility on Mac On Apple Silicon Macs, using a Darwin thread priority of PRIO_DARWIN_BG seems to map directly to the QoS class Background. With this priority, the thread is confined to efficiency cores only, which makes background indexing take forever. Introduce a new ThreadPriority "Low" that sits in the middle between Background and Default, and maps to QoS class "Utility" on Mac. Make this new priority the default for indexing. This makes the thread run on all cores, but still lowers priority enough to keep the machine responsive, and not interfere with user-initiated actions. I didn't change the implementations for Windows and Linux; on these systems, both ThreadPriority::Background and ThreadPriority::Low map to the same thread priority. This could be changed as a followup (e.g. by using SCHED_BATCH for Low on Linux). See also https://github.com/clangd/clangd/issues/1119. Reviewed By: sammccall, dgoldman Differential Revision: https://reviews.llvm.org/D124715	2022-05-16 10:01:49 +02:00
bzcheeseman	0809f63826	[LLVM][Casting.h] Add trivial self-cast Casting from a type to itself should always be possible. Make this simple for all users, and add tests to ensure we keep being able to do this. Ref: https://reviews.llvm.org/D125543 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D125590	2022-05-15 22:22:16 -07:00
Alexey Lapshin	fdae8641ad	[DWARFLinker][NFC] cleanup AddressManager interface. this review is extracted from D86539 1. delete areRelocationsResolved() method. 2. rename hasLiveMemoryLocation() -> isLiveVariable() hasLiveAddressRange() -> isLiveSubprogram(). Differential Revision: https://reviews.llvm.org/D125492	2022-05-15 22:47:04 +03:00
Sheng	c644488a8b	Rename `MCFixedLenDisassembler.h` as `MCDecoderOps.h` The name `MCFixedLenDisassembler.h` is out of date after D120958. Rename it as `MCDecoderOps.h` to reflect the change. Reviewed By: myhsu Differential Revision: https://reviews.llvm.org/D124987	2022-05-15 08:44:58 +08:00
Alex Brachet	a74d9e74e5	[ifs] Add --strip-size flag st_size may not be of importance to the abi if you are not using copy relocations. This is helpful when you want to check the abi of a shared object both when instrumented and not because asan will increase the size of objects to include the redzone. Differential revision: https://reviews.llvm.org/D124792	2022-05-14 18:50:20 +00:00
Alex Brachet	1f61260847	Revert "[ifs] Add --strip-size flag" This reverts commit `b6b0fd6a94`.	2022-05-14 17:33:27 +00:00
Alex Brachet	b6b0fd6a94	[ifs] Add --strip-size flag st_size may not be of importance to the abi if you are not using copy relocations. This is helpful when you want to check the abi of a shared object both when instrumented and not because asan will increase the size of objects to include the redzone. Differential revision: https://reviews.llvm.org/D124792	2022-05-14 17:25:50 +00:00
Xiaodong Liu	7ff7001ba9	[llvm] Fix comment nits in Module class, NFC. There is no member called "GlobalValRefMap" in Module class. It has been changed to "GlobalList". Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D125187	2022-05-14 17:41:37 +08:00
Jay Foad	169ae6db69	[APInt] Allow extending and truncating to the same width Allow zext, sext, trunc, truncUSat and truncSSat to extend or truncate to the same bit width, which is a no-op. Disallowing this forced clients to use workarounds like using zextOrTrunc (even though they never wanted truncation) or zextOrSelf (even though they did not want its strange behaviour of allowing a smaller bit width, which is also treated as a no-op). Differential Revision: https://reviews.llvm.org/D125556	2022-05-14 09:54:24 +01:00
Wolfgang Pieb	2740c1875d	[NFC][Metadata] Refactor allocation, initalization and deletion of MDNodes. This patch is refactoring the allocation, initialization and deletion of MDNodes. It is intended as a preparatory patch for the upcoming addition of dynamic resizability of MDNodes. It is fundamentally NFC, but removes the necessity for suppressing the memory sanitizer for MDNode's operator delete. Reviewers: dexonsmith Differential Revision: https://reviews.llvm.org/D125489	2022-05-13 16:05:29 -07:00
bzcheeseman	c758708018	[LLVM][Casting.h] Add ForwardToPointerCast trait Addresses use cases in Clang/MLIR that need pointer-to-pointer, reference-to-reference, and value-to-value casts from/to the same types. This should reduce boilerplate by allowing the user to simply specify the pointer cast and forward the reference cast directly to the pointer cast. This cast trait DOES NOT implement `castFailed` and `doCastIfPossible` because in the general case doing so could result in a nullptr dereference. Users can use `NullableValueCastFailed` and `DefaultDoCastIfPossible` as desired for those cases where `nullptr` is acceptable. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D125576	2022-05-13 18:48:50 -04:00
bzcheeseman	bc65fc8bb3	[LLVM][Casting.h] Remove CastInfo pointer partial specialization. Since cast_convert_val now has pointer specializations, we don't need the pointer partial specialization for CastInfo. We want to trim these down when possible to avoid future ambiguous partial specialization errors. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D125578	2022-05-13 18:31:10 -04:00
Alexander Shaposhnikov	badd088c57	[GlobalOpt] Enable optimization of constructors with different priorities Adjust `optimizeGlobalCtorsList` to handle the case of different priorities. This addresses the issue https://github.com/llvm/llvm-project/issues/55083. Test plan: ninja check-all Differential revision: https://reviews.llvm.org/D125278	2022-05-13 22:19:29 +00:00
Amara Emerson	41fef10449	[GlobalISel] Combine G_SHL, G_ASHR, G_SHL of undef shifts to undef. Differential Revision: https://reviews.llvm.org/D125041	2022-05-13 12:20:34 -07:00
Hongtao Yu	1662cfa4be	[CSSPGO][CSProfileConverter] Remove call target samples when including callee samples into caller. When a flat CS profile is converted to a nested profile, the call target samples for inlined callee contexts are left over in the callsite target map. This could cause indirect call promotion to function improperly. One issue is that the inlined callsites are treated with double amount of samples. The other is the inlined callsites are reconsidered for subsequent PGO ICP. I'm fixing this by excluding call targets from the callsite for inlined targets. While fixing this I found that callsite target sum and the number of body samples for that callsite could be mismatched. {D122609} has an explanation and a fix for that on llvm-profgen side. For now I'm tolerating it in this change. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D125266	2022-05-13 09:19:32 -07:00
zhijian	fe3b621f05	[AIX] support write operation of big archive. SUMMARY 1. Enable supporting the write operation of big archive. 2. the first commit come from https://reviews.llvm.org/D104367 3. refactor the first commit and implement writing symbol table. 4. fixed the bugs and add more test cases in the second commit. Reviewers: James Henderson Differential Revision: https://reviews.llvm.org/D123949	2022-05-13 10:40:15 -04:00
Jay Foad	fcbf617dcc	[APInt] Fix documentation of *OrSelf methods Document that truncOrSelf, zextOrSelf and sextOrSelf only enforce an upper or lower bound on the bitwidth of the result.	2022-05-13 15:26:41 +01:00
Jonas Paulsson	eaa78035c6	[SystemZ] Patchset for expanding memcpy/memset using at most two stores. * Set MaxStoresPerMemcpy and MaxStoresPerMemset to 2. * Optimize stores of replicated values in SystemZ::combineSTORE(). This handles the now expanded memory operations and as well some other pre-existing cases. * Reject a big displacement in isLegalAddressingMode() for a vector type. * Return true from shouldConsiderGEPOffsetSplit(). Reviewed By: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D122105	2022-05-13 15:31:09 +02:00
Nikita Popov	ed1cb01baf	[IRBuilder] Add IsInBounds parameter to CreateGEP() We commonly want to create either an inbounds or non-inbounds GEP based on a boolean value, e.g. when preserving inbounds from existing GEPs. Directly accept such a boolean in the API, rather than requiring a ternary between CreateGEP and CreateInBoundsGEP. This change is not entirely NFC, because we now preserve an inbounds flag in a constant expression edge-case in InstCombine.	2022-05-13 14:30:55 +02:00
Nathan Sidwell	562ce15924	[demangler] Avoid special-subst code duplication We need to expand special substitutions in four different ways. This refactors to only have one conversion from enum to string, and derive the other 3 needs off that. The SpecialSubstitution node is derived from the ExpandedSpecialSubstitution. While this may seem unintuitive, it works out quite well, as SpecialSubstitution can then use the former's getBaseName and remove an unneeded 'basic_' prefix, for those substitutions that are instantiations (to known typedef). Similarly all those instantiations use the same set of template arguments (with 'basic_string', getting an additional 'allocator' arg). Expansion tests were added in D123134, and remain unchanged. Reviewed By: MaskRay, dblaikie Differential Revision: https://reviews.llvm.org/D125257	2022-05-13 04:35:29 -07:00
Nikita Popov	0485211dd0	[IRBuilder] Remove redundant createGEP() overloads (NFC) ArrayRef<Value > also accepts a single Value , there's no need to create separate overloads for this.	2022-05-13 12:43:21 +02:00
Zakk Chen	7dfc56c107	[RISCV] Add the passthru operand for RVV unmasked segment load IR intrinsics. The goal is support tail and mask policy in RVV builtins. We focus on IR part first. If the passthru operand is undef, we use tail agnostic, otherwise use tail undisturbed. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D125323	2022-05-13 02:16:40 -07:00
Jay Foad	26e1ebd3ea	[GlobalISel] Change ConstantFoldVectorBinop to return vector of APInt Previously it built MIR for the results and returned a Register. This avoids building constants for earlier elements of the vector if later elements will fail to fold, and allows CSEMIRBuilder::buildInstr to avoid unconditionally building a copy from the result. Use a new helper function MachineIRBuilder::buildBuildVectorConstant to build a G_BUILD_VECTOR of G_CONSTANTs. Differential Revision: https://reviews.llvm.org/D117758	2022-05-13 09:33:07 +01:00
Alex Bradbury	cb778e9328	[WebAssembly] Implement ref.is_null MC layer support and codegen Custom type-checking (in WebAssemblyAsmTypeCheck.cpp) is used to workaround the fact that separate variants of the instruction are defined for externref and funcref. Based on an initial patch by Paulo Matos <pmatos@igalia.com>. Differential Revision: https://reviews.llvm.org/D123484	2022-05-13 07:08:10 +01:00
bzcheeseman	0be41ed5bb	[LLVM][Casting.h] Don't create a temporary while casting. C-style casting can create a temporary when compiled by a C++ compiler, which was emitting a warning casting a reference to another reference. We can't use C++-style casting directly because it doesn't always work with incomplete types. In order to support the current use-cases, for references we switch to pointer space to perform the cast. Reviewed By: qiongsiwu1 Differential Revision: https://reviews.llvm.org/D125482	2022-05-12 23:11:02 -04:00
Chen Zheng	0ca2b93cc2	[NFC] add the missing //@} address code review comments for D123995	2022-05-12 22:43:35 -04:00
Florian Hahn	5890b30105	[LAA] Initial support for runtime checks with pointer selects. Scaffolding support for generating runtime checks for multiple SCEV expressions per pointer. The initial version just adds support for looking through a single pointer select. The more sophisticated logic for analyzing forks is in D108699 Reviewed By: huntergr Differential Revision: https://reviews.llvm.org/D114487	2022-05-12 19:33:48 +01:00
Pedro Olsen Ferreira	28a0b94d22	Rename and fix ValueMap::resize to reserve The underlying map type (DenseMap) has had its resize() function renamed to reserve() as part of `c04fc7a60f` (SVN 264026). This is only visible when the member function is called, as it is template type name dependent. Differential Revision: https://reviews.llvm.org/D125387	2022-05-12 13:44:47 +01:00
Dmitry Vassiliev	7b22cf12ef	[Intrinsics] Fix `nvvm_prmt` intrinsic attributes `nvvm_prmt` doesn't seem to be `commutative`. nvvm also sets `IntrSpeculatable` for it. Here is the doc https://docs.nvidia.com/cuda/parallel-thread-execution/index.html#data-movement-and-conversion-instructions-prmt Reviewed By: tra, jchlanda Differential Revision: https://reviews.llvm.org/D125423	2022-05-12 10:46:03 +02:00
bzcheeseman	f156b51aec	[LLVM][Casting.h] Update dyn_cast machinery to provide more control over how the casting is performed. This patch expands the expressive capability of the casting utilities in LLVM by introducing several levels of configurability. By creating modular CastInfo classes we can enable projects like MLIR that need more fine-grained control over how a cast is actually performed to retain that control, while making it easy to express the easy cases (like a checked pointer to pointer cast). The current implementation of Casting.h doesn't make it clear where the entry points for customizing the cast behavior are, so part of the motivation for this patch is adding that documentation. Another part of the motivation is to support using LLVM RTTI with a wider set of use cases, such as nullable value to value casts, or pointer to value casts (as in MLIR). Reviewed By: lattner, rriddle Differential Revision: https://reviews.llvm.org/D123901	2022-05-12 00:15:09 -04:00
Austin Kerbow	2db700215a	[AMDGPU] Add llvm.amdgcn.sched.barrier intrinsic Adds an intrinsic/builtin that can be used to fine tune scheduler behavior. If there is a need to have highly optimized codegen and kernel developers have knowledge of inter-wave runtime behavior which is unknown to the compiler this builtin can be used to tune scheduling. This intrinsic creates a barrier between scheduling regions. The immediate parameter is a mask to determine the types of instructions that should be prevented from crossing the sched_barrier. In this initial patch, there are only two variations. A mask of 0 means that no instructions may be scheduled across the sched_barrier. A mask of 1 means that non-memory, non-side-effect inducing instructions may cross the sched_barrier. Note that this intrinsic is only meant to work with the scheduling passes. Any other transformations that may move code will not be impacted in the ways described above. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D124700	2022-05-11 13:22:51 -07:00
River Riddle	5a9a438a54	[TableGen] Refactor TableGenParseFile to no longer use a callback Now that TableGen no longer relies on global Record state, we can allow for the client to own the RecordKeeper and SourceMgr. Given that TableGen internally still relies on the global llvm::SrcMgr, this method unfortunately still isn't thread-safe. Differential Revision: https://reviews.llvm.org/D125277	2022-05-11 11:55:33 -07:00
River Riddle	2ac3cd20ca	[TableGen] Remove the use of global Record state This commits removes TableGens reliance on managed static global record state by moving the RecordContext into the RecordKeeper. The RecordKeeper is now treated similarly to a (LLVM\|MLIR\|etc)Context object and is passed to static construction functions. This is an important step forward in removing TableGens reliance on global state, and in a followup will allow for users that parse tablegen to parse multiple tablegen files without worrying about Record lifetime. Differential Revision: https://reviews.llvm.org/D125276	2022-05-11 11:55:33 -07:00
Joseph Huber	e7858a9fab	[Cuda] Add initial support for wrapping CUDA images in the new driver. This patch adds the initial support for wrapping CUDA images. This requires changing some of the logic for how we bundle images. We now need to copy the image for all kinds that are active for the architecture. Then we need to run a separate wrapping job if the Kind is Cuda. For cuda wrapping we need to use the `fatbinary` program from the CUDA SDK to bundle all the binaries together. This is then passed to a new function to perfom the actual module code generation that will be implemented in a later patch. Depends on D120273 D123471 Reviewed By: tra Differential Revision: https://reviews.llvm.org/D123810	2022-05-11 07:30:23 -04:00
Nikita Popov	c1bb4a881e	[SCEVExpander] Deduplicate min/max expansion code (NFC)	2022-05-11 12:11:11 +02:00
Fraser Cormack	c1d48b35d8	[SelectionDAG][VP] Rename VP sext/zext/trunc ISD opcodes Rather than VP_SEXT/VP_ZEXT/VP_TRUNC, having VP_SIGN_EXTEND/VP_ZERO_EXTEND/VP_TRUNCATE better matches their non-VP counterparts. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D125298	2022-05-11 10:25:51 +01:00
Arthur Eubanks	7e0802aeb5	[BasicAA] Fix order in which we pass MemoryLocations to alias() D98718 caused the order of Values/MemoryLocations we pass to alias() to be significant due to storing the offset in the PartialAlias case. But some callers weren't audited and were still passing swapped arguments, causing the returned PartialAlias offset to be negative in some cases. For example, the newly added unittests would return -1 instead of 1. Fixes #55343, a miscompile. Reviewed By: asbirlea, nikic Differential Revision: https://reviews.llvm.org/D125328	2022-05-10 12:05:38 -07:00
Matthias Braun	cd19af74c0	Avoid 8 and 16bit switch conditions on x86 This adds a `TargetLoweringBase::getSwitchConditionType` callback to give targets a chance to control the type used in `CodeGenPrepare::optimizeSwitchInst`. Implement callback for X86 to avoid i8 and i16 types where possible as they often incur extra zero-extensions. This is NFC for non-X86 targets. Differential Revision: https://reviews.llvm.org/D124894	2022-05-10 10:00:10 -07:00
Nicolai Hähnle	38bb46523f	GlobalISel: Trivial documentation and comment fixes Differential Revision: https://reviews.llvm.org/D124808	2022-05-10 07:48:56 -05:00
Craig Topper	39e63bd2d8	[IR][CostModel] A scalable vector shuffle can't be an identity or reverse shuffle. Even if the minimum number of elements is 1 and the length doesn't change, we don't know what vscale is so we can't classify it as identity mask. Instead it is a zero element splat. For reverse, we shouldn't classify it as a reverse unless there are at least 2 elements in the mask. This applies to both fixed and scalable vectors. For fixed vectors, a single element would be an identity shuffle. For scalable vector it's a zero elt splat. Reviewed By: sdesmalen, liaolucy Differential Revision: https://reviews.llvm.org/D124655	2022-05-09 21:37:25 -07:00
Mircea Trofin	c35ad9ee4f	[mlgo] Support exposing more features than those supported by models This allows the compiler to support more features than those supported by a model. The only requirement (development mode only) is that the new features must be appended at the end of the list of features requested from the model. The support is transparent to compiler code: for unsupported features, we provide a valid buffer to copy their values; it's just that this buffer is disconnected from the model, so insofar as the model is concerned (AOT or development mode), these features don't exist. The buffers are allocated at setup - meaning, at steady state, there is no extra allocation (maintaining the current invariant). These buffers has 2 roles: one, keep the compiler code simple. Second, allow logging their values in development mode. The latter allows retraining a model supporting the larger feature set starting from traces produced with the old model. For release mode (AOT-ed models), this decouples compiler evolution from model evolution, which we want in scenarios where the toolchain is frequently rebuilt and redeployed: we can first deploy the new features, and continue working with the older model, until a new model is made available, which can then be picked up the next time the compiler is built. Differential Revision: https://reviews.llvm.org/D124565	2022-05-09 18:01:21 -07:00
Michael Kruse	588ffdaf37	[polly] Fix compiler warning. NFC. Fix the warning warning: 'polly::ScopViewer' has virtual functions but non-virtual destructor [-Wnon-virtual-dtor] and for several other classes by inserting virtual destructors.	2022-05-09 14:04:40 -05:00
Michael Kruse	6b3b87376b	[polly] migrate -polly-show to the new pass manager Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D123678	2022-05-09 14:04:29 -05:00
Michael Kruse	a6b399ad79	[PassManager] Implement DOTGraphTraitsViewer under NPM Rename the legacy `DOTGraphTraits{Module,}{Viewer,Printer}` to the corresponding `DOTGraphTraits...WrapperPass`, and implement a new `DOTGraphTraitsViewer` with new pass manager. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D123677	2022-05-09 14:04:28 -05:00
Kazu Hirata	3f64f03289	[CodeGen] Clarify the semantics of ADDCARRY/SUBCARRY This patch clarifies the semantics of ADDCARRY/SUBCARRY, specifically stating that both the incoming and outgoing carries are active high. Differential Revision: https://reviews.llvm.org/D125130	2022-05-09 10:17:00 -07:00
Alexey Bataev	9dc4ced204	[SLP]Try partial store vectorization if supported by target. We can try to vectorize number of stores less than MinVecRegSize / scalar_value_size, if it is allowed by target. Gives an extra opportunity for the vectorization. Fixes PR54985. Differential Revision: https://reviews.llvm.org/D124284	2022-05-09 09:48:15 -07:00
Nathan Sidwell	bc150a07f1	[demangler] No need to space adjacent template closings With the demangler parenthesizing 'a >> b' inside template parameters, because C++11 parsing of >> there, we don't really need to add spaces between adjacent template arg closing '>' chars. In 2022, that just looks odd. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D123134	2022-05-09 06:14:44 -07:00
Nathan Sidwell	e48cd7088b	[demangler] Buffer peeking needs buffer The output buffer has a 'back' member, which returns NUL when you try it with an empty buffer. But there are no use cases that need that additional functionality. This makes the 'back' member behave more like STL containers' back members. (It still returns a value, not a reference.) Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D123201	2022-05-09 04:17:22 -07:00
Philipp Tomsich	91b24b0180	[AArch64] Ampere1 does not support MTE The initial support for the Ampere1 mistakenly signalled support for the MTE feature. However, the core does not include the optional MTE functionality. Update the target parser to not include MTE for Ampere1. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D125191	2022-05-09 11:29:42 +02:00
Sam McCall	d44ffd631c	[Bitstream] Only consider flushing to file on block boundaries The goal of flushing to disk is to keep a reasonable bound on peak memory usage. With a a default threshold of 512MB (and most BitstreamWriters having no backing file at all), checking after every byte whether to flush seems excessive. This change makes clangd's unittests run 5% faster (in opt), so it's not actually free even in the case with no backing file. Likely there are more important workloads where it makes some difference. Differential Revision: https://reviews.llvm.org/D125145	2022-05-07 16:57:03 +02:00
Amaury Séchet	c2c259224b	const char* for LLVMTargetMachineEmitToFile's argument The `LLVMTargetMachineEmitToFile` takes a `char* Filename` right now, but it doesn't modify it. This is annoying to use in the case where you want to pass a const string, because you either have to remove the const, or copy it somewhere else and pass that. Either way, it's not very nice. I added a const and clang formatted it. This shouldn't break any ABI in my opinion. I'm sorry but I didn't know whom to put as reviewer for this, so I chose someone with a lot of commits from the .cpp file. Reviewed By: deadalnix Differential Revision: https://reviews.llvm.org/D124453	2022-05-07 14:40:55 +00:00
Serge Pavlov	eb28da89a6	[InstCombine] Remove side effect of replaced constrained intrinsics If a constrained intrinsic call was replaced by some value, it was not removed in some cases. The dangling instruction resulted in useless instructions executed in runtime. It happened because constrained intrinsics usually have side effect, it is used to model the interaction with floating-point environment. In some cases side effect is actually absent or can be ignored. This change adds specific treatment of constrained intrinsics so that their side effect can be removed if it actually absents. Differential Revision: https://reviews.llvm.org/D118426	2022-05-07 19:04:11 +07:00
Paul Walker	702c4ade22	[ISD::IndexType] Helper functions for common queries. Add helper functions to query the signed and scaled properties of ISD::IndexType along with functions to change them. Remove setIndexType from MaskedGatherSDNode because it only has one usage and typically should only be changed alongside its index operand. Minimise the direct use of the enum values to lay the groundwork for more refactoring. Differential Revision: https://reviews.llvm.org/D123347	2022-05-07 11:23:42 +01:00
Sam McCall	0a83ff83af	[FuzzMutate] Move LLVM module (de)serialization from FuzzerCLI -> IRMutator. NFC These are not directly related to the CLI, and are mostly (always?) used when mutating the modules as part of fuzzing. Motivation: split FuzzerCLI into its own library that does not depend on IR. Subprojects that don't use IR should be be fuzzed without the dependency. Differential Revision: https://reviews.llvm.org/D125080	2022-05-07 12:09:49 +02:00
Peter S. Housel	981523b2e4	[ORC-RT][ORC] Handle dynamic unwind registration for libunwind This changes the ELFNix platform Orc runtime to use, when available, the __unw_add_dynamic_eh_frame_section interface provided by libunwind for registering .eh_frame sections loaded by JITLink. When libunwind is not being used for unwinding, the ELFNix platform detects this and defaults to the __register_frame interface provided by libgcc_s. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D114961	2022-05-06 14:00:29 -07:00
Craig Topper	2ca78d2bdf	[SelectionDAG] Improve asserts in SelectionDAG::getSelect. The VT passed in must match the type of LHS and RHS. Previously we only checked that the vectorness matched.	2022-05-06 09:41:11 -07:00
Marco Elver	9ae87b5973	[Instrumentation] Share InstrumentationIRBuilder between TSan and SanCov Factor our InstrumentationIRBuilder and share it between ThreadSanitizer and SanitizerCoverage. Simplify its usage at the same time (use function of passed Instruction or BasicBlock). This class may be used in other instrumentation passes in future. NFCI. Reviewed By: nickdesaulniers Differential Revision: https://reviews.llvm.org/D125038	2022-05-06 09:15:17 +02:00
Sam McCall	ba0d50ad7e	[Support] Fix UB in BumpPtrAllocator when first allocation is zero. BumpPtrAllocator::Allocate() is marked __attribute__((returns_nonnull)) when the compiler supports it, which makes it UB to return null. When there have been no allocations yet, the current slab is [nullptr, nullptr). A zero-sized allocation fits in this range, and so Allocate(0, 1) returns null. There's no explicit docs whether Allocate(0) is valid. I think we have to assume that it is: - the implementation tries to support it (e.g. >= tests instead of >) - malloc(0) is allowed - requiring each callsite to do a check is bug-prone - I found real LLVM code that makes zero-sized allocations Differential Revision: https://reviews.llvm.org/D125040	2022-05-06 08:57:27 +02:00
Ilia Diachkov	0098f2aebb	[SPIRV] Add SPIR-V specific intrinsics, two passes and tests The patch adds SPIR-V specific intrinsics required to keep information critical to SPIR-V consistency (types, constants, etc.) during translation from IR to MIR. Two related passes (SPIRVEmitIntrinsics and SPIRVPreLegalizer) and several LIT tests (passed with this change) have also been added. It also fixes the issue with opaque pointers in SPIRVGlobalRegistry.cpp and the mismatch of the data layout between the SPIR-V backend and clang (Issue #55122). Differential Revision: https://reviews.llvm.org/D124416 Co-authored-by: Aleksandr Bezzubikov <zuban32s@gmail.com> Co-authored-by: Michal Paszkowski <michal.paszkowski@outlook.com> Co-authored-by: Andrey Tretyakov <andrey1.tretyakov@intel.com> Co-authored-by: Konrad Trifunovic <konrad.trifunovic@intel.com>	2022-05-06 03:02:00 +03:00
Lang Hames	16dcbb53dc	[ORC] Return ExecutorAddrs rather than JITEvaluatedSymbols from LLJIT::lookup. Clients don't care about linkage, and ExecutorAddr is much more ergonomic.	2022-05-05 13:56:00 -07:00
Lang Hames	98616cfc02	[ORC] Add an ExecutorAddr::toPtr overload for function types. In the common case of converting an ExecutorAddr to a function pointer type, this eliminates the need for the '()' boilerplate to explicitly specify a function pointer. E.g.: auto F = A.toPtr<int()()>(); can now be written as auto F = A.toPtr<int()>();	2022-05-05 12:37:23 -07:00
David Blaikie	0d8cb8b399	DWARFVerifier: Verify CU/TU index overlap issues Discovered in a large object that would need a 64 bit index (but the cu/tu index format doesn't include a 64 bit offset/length mode in DWARF64 - a spec bug) but instead binutils dwp overflowed the offsets causing overlapping regions.	2022-05-05 18:18:53 +00:00

1 2 3 4 5 ...

48195 Commits