llvm-project

Commit Graph

Author	SHA1	Message	Date
Nemanja Ivanovic	cdead4f89c	[PowerPC][NFC] Fix an assert that cannot trip from `7d076e19e3` I mixed up the precedence of operators in the assert and thought I had it right since there was no compiler warning. This just adds the parentheses in the expression as needed.	2020-07-25 20:28:52 -04:00
Philip Reames	55dae9c20c	[Statepoints] Style cleanup after `3da1a963` [NFC] Just fixing a few minor stylistic issues.	2020-07-25 16:40:39 -07:00
Craig Topper	c5b2371436	[X86] Add masked versions of the VPTERNLOG test cases added for D83630. NFC We don't handle these yet and D83630 won't improve that, but at least we'll have the tests.	2020-07-25 16:37:17 -07:00
Roman Lebedev	96d74530c0	[Reduce] Argument reduction: do deal with function declarations We can happily turn function definitions into declarations, thus obscuring their argument from being elided by this pass. I don't believe there is a good reason to just ignore declarations. likely even proper llvm intrinsics ones, at worst the input becomes uninteresting. The other question here is that all these transforms are all-or-nothing. In some cases, should we be treating each use separately? The main blocker here seemed to be that llvm::CloneFunctionInto() does `&OldFunc->front()`, which inserts a nullptr into a densemap, which is not happy about it and asserts.	2020-07-26 01:31:56 +03:00
Roman Lebedev	9932d74740	[Reduce] Argument reduction: do properly handle invoke insts (PR46819) replaceFunctionCalls() is very non-exhaustive, it only handles CallInst's. Which means, by the time we drop old function, there may still be uses of it lurking around. Let's instead whack-a-mole them by all by replacing with undef. I'm not sure this is the best handling, especially for calls, but IMO poorly reduced input is much better than crashing reduction tool. A (previously-crashing!) test added. Fixes https://bugs.llvm.org/show_bug.cgi?id=46819	2020-07-26 01:29:00 +03:00
Roman Lebedev	af1dd0b1ad	[Reduce] Basic block reduction: do properly handle invoke insts (PR46818) Terminator may have returned value, so we need to replace uses, and in general handle invoke as a branch inst. I'm not sure this is the best handling, but IMO poorly reduced input is much better than crashing reduction tool. A (previously-crashing!) test added. Fixes https://bugs.llvm.org/show_bug.cgi?id=46818	2020-07-26 01:28:59 +03:00
Lang Hames	a01c4ee71c	[ORC] Rename TargetProcessControl DynamicLibraryHandle and loadLibrary. The new names, DylibHandle and loadDylib, are more concise and make clear that these utilities are for loading dynamic libraries, not static ones.	2020-07-25 15:21:43 -07:00
Lang Hames	11d5316afd	[ORC] Don't require PageSize or Triple during TargetProcessControl construction Subclasses will commonly gather that information from a remote during construction, in which case they won't have meaningful values to pass to TargetProcessControl's constructor.	2020-07-25 15:21:43 -07:00
Frederik Gossen	07f227c0eb	[MLIR][Shape] Allow `num_elements` to operate on extent tensors Re-landing with dependent change landed and error condition relaxed. Beyond the change to error condition exactly https://reviews.llvm.org/D84445.	2020-07-25 15:02:29 -07:00
Jacques Pienaar	5142448a5e	[MLIR][Shape] Refactor verification Based on https://reviews.llvm.org/D84439 but less restrictive, else we don't allow shape_of to be able to produce a ranked output and doesn't allow for iterative refinement here. We can consider making it more restrictive later.	2020-07-25 14:55:19 -07:00
Jacques Pienaar	7bfecd7739	Revert "[MLIR][Shape] Allow `num_elements` to operate on extent tensors" This reverts commit `55ced04d6b`. Forgot to submit depend change first.	2020-07-25 14:47:57 -07:00
Frederik Gossen	55ced04d6b	[MLIR][Shape] Allow `num_elements` to operate on extent tensors Differential Revision: https://reviews.llvm.org/D84445	2020-07-25 14:41:05 -07:00
Philip Reames	3da1a9634e	[Statepoints] Support lowering gc relocations to virtual registers (Disabled under flag for the moment) This is part of a larger project wherein we are finally integrating lowering of gc live operands with the register allocator. Today, we force spill all operands in SelectionDAG. The code to do so is distinctly non-optimal. The approach this patch is working towards is to instead lower the relocations directly into the MI form, and let the register allocator pick which ones get spilled and which stack slots they get spilled to. In terms of performance, the later part is actually more important as it avoids redundant shuffling of values between stack slots. This particular change adds ISEL support to produce the variadic def STATEPOINT form required by the above. In particular, the first N are lowered to variadic tied def/use pairs. So new statepoint looks like this: reloc1,reloc2,... = STATEPOINT ..., base1, derived1<tied-def0>, base2, derived2<tied-def1>, ... N is limited by the maximal number of tied registers machine instruction can have (15 at the moment). The current patch is restricted to handling relocations within a single basic block. Cross block relocations (e.g. invokes) are handled via the legacy mechanism. This restriction will be relaxed in future patches. Patch By: dantrushin Differential Revision: https://reviews.llvm.org/D81648	2020-07-25 14:26:05 -07:00
Craig Topper	9182dc7814	[X86] Add llvm.roundeven test cases. Add f80 tests cases for constrained intrinsics that lower to libcalls. NFC	2020-07-25 13:29:47 -07:00
Craig Topper	60a5799e6e	[X86] Fix intrinsic names in strict fp80 tests to use f80 in their names instead of x86_fp80. The type is called x86_fp80, but when it is printed in the intrinsic name it should be f80. The parser doesn't seem to care that the name was wrong.	2020-07-25 13:12:49 -07:00
Fangrui Song	6a75496836	[Driver] Define LinkOption and fix forwarded options to GCC for linking Many driver options are neither 'DriverOption' nor 'LinkerInput'. When gcc is used for linking, these options get forwarded even if they don't have anything to do with linking. Among these options, clang-specific ones can cause gcc to error. Just use 'OPT_Link_Group' and a new flag 'LinkOption' for options which already have a group. gfortran support apparently bit rots (which does not seem to make much sense). XFAIL the test.	2020-07-25 12:33:18 -07:00
LLVM GN Syncbot	48c3228c5c	[gn build] Port `136c8f50e9`	2020-07-25 18:51:58 +00:00
Roman Lebedev	136c8f50e9	[Reduce] Try turning function definitions into declarations first, NFCI-ish ReduceFunctions could do it, but it also replaces all calls with undef, so if any of undef replacements makes reduction uninteresting, it won't work. ReduceBasicBlocks also could do it, but well, it may take many guesses for all the blocks of a function to happen to be out-of-chunk, which is not a very efficient way to go about it. So let's just do this first.	2020-07-25 21:43:36 +03:00
Adrian Prantl	1d9b860fb6	Unify the return value of GetByteSize to an llvm::Optional<uint64_t> (NFC-ish) This cleanup patch unifies all methods called GetByteSize() in the ValueObject hierarchy to return an optional, like the methods in CompilerType do. This means fewer magic 0 values, which could fix bugs down the road in languages where types can have a size of zero, such as Swift and C (but not C++). Differential Revision: https://reviews.llvm.org/D84285	2020-07-25 08:27:21 -07:00
Florian Hahn	c09a10845b	[X86] Remove stress-scheduledagrrlist.ll. This test seems to take quite a long time with EXPENSIVE_CHECKS. Remove it.	2020-07-25 15:45:24 +01:00
Nikita Popov	bc79ed7e16	[LVI] Don't require operand number for range (NFC) Pass the Value* instead of the operand number, rename I to CxtI. This makes the function a bit more generally useful.	2020-07-25 16:33:45 +02:00
Matt Arsenault	392b969c32	AMDGPU/GlobalISel: Don't assert on G_INSERT > 128-bits Just fallback for now. Really tablegen needs to generate all of the subregister index handling we need.	2020-07-25 10:05:44 -04:00
Nikita Popov	f4199b8f0b	[SCCP] Add assume non null test (NFC)	2020-07-25 16:02:15 +02:00
Nikita Popov	632a89e866	[SCCP] Restore the change reporting as well Reapply `5db5b4bc43`.	2020-07-25 15:11:30 +02:00
Nikita Popov	ad16e71c95	Reapply [SCCP] Directly remove non-feasible edges Reapply with DTU update moved after CFG update, which is a requirement of the API. ----- Non-feasible control-flow edges are currently removed by replacing the branch condition with a constant and then calling ConstantFoldTerminator. This happens in a rather roundabout manner, by inspecting the users (effectively: predecessors) of unreachable blocks, and further complicated by the need to explicitly materialize the condition for "forced" edges. I would like to extend SCCP to discard switch conditions that are non-feasible based on range information, but this is incompatible with the current approach (as there is no single constant we could use.) Instead, this patch explicitly removes non-feasible edges. It currently only needs to handle the case where there is a single feasible edge. The llvm_unreachable() branch will need to be implemented for the aforementioned switch improvement. Differential Revision: https://reviews.llvm.org/D84264	2020-07-25 14:52:35 +02:00
Simon Pilgrim	b5e14d78f1	SimplifyLibCalls - remove unnecessary header and forward declaration. NFC. We include TargetLibraryInfo.h so don't need to forward declare it, and we don't need to include TargetLibraryInfo.h in SimplifyLibCalls.cpp as well.	2020-07-25 12:58:39 +01:00
Simon Pilgrim	3b21823e4a	[X86][SSE] combineX86ShufflesRecursively - move all Root node asserts to the same location. NFCI. Minor tidyup for some upcoming shuffle combine improvements.	2020-07-25 12:48:14 +01:00
Simon Pilgrim	18d481cdf9	SymbolRemappingReader.h - pass Twine by reference not value. NFCI.	2020-07-25 12:48:14 +01:00
Florian Hahn	3c1476d26c	[IPSCCP] Drop argmemonly after replacing pointer argument. This patch updates IPSCCP to drop argmemonly and inaccessiblemem_or_argmemonly if it replaces a pointer argument. Fixes PR46717. Reviewers: efriedma, davide, nikic, jdoerfert Reviewed By: efriedma, jdoerfert Differential Revision: https://reviews.llvm.org/D84432	2020-07-25 11:52:14 +01:00
Nathan James	4363ea6105	Fix C2975 error under MSVC Apparantly a constexpr value isn't a compile time constant under certain versions of MSVC.	2020-07-25 11:03:59 +01:00
Simon Pilgrim	66998ae59f	[X86][SSE] getFauxShuffle - ignore undemanded sources for PACKSS/PACKUS faux shuffles If we don't care about an entire LHS/RHS of the PACK op, then can just treat it the same as undef (we don't care if it saturates) and is safe to treat as a shuffle. This can happen if we attempt to decode as a faux shuffle before SimplifyDemandedVectorElts has been called on the PACK which should replace the source with UNDEF entirely.	2020-07-25 10:51:14 +01:00
Nathan James	6c25fc35e0	[ADT] Add a range-based version of std::move Adds a range-based version of `std::move`, the version that moves a range, not the one that creates r-value references. Reviewed By: dblaikie, gamesh411 Differential Revision: https://reviews.llvm.org/D83902	2020-07-25 10:37:34 +01:00
Jessica Paquette	604e33e83a	[AArch64][GlobalISel] Look through constants when selection stores of 0 Very minor code size improvements (hits 8 times in Bullet at -O3), but still something. Also very minor NFC change to make sure we only search for a 0 constant when selecting a store. Before, we'd do this for loads as well. Differential Revision: https://reviews.llvm.org/D84573	2020-07-24 22:46:14 -07:00
Kuba Mracek	33d9c4109a	[tsan] Allow TSan in the Clang driver for Apple Silicon Macs Differential Revision: https://reviews.llvm.org/D84082	2020-07-24 20:14:00 -07:00
Amy Kwan	739cd2638b	[PowerPC] Exploit the High Order Vector Multiply Instructions on Power10 This patch aims to exploit the following vector multiply high instructions on Power10. vmulhsw VRT, VRA, VRB vmulhsd VRT, VRA, VRB vmulhuw VRT, VRA, VRB vmulhud VRT, VRA, VRB Differential Revision: https://reviews.llvm.org/D82584	2020-07-24 20:57:57 -05:00
Adrian Prantl	e937840dbd	Upstream macCatalyst support in ArchSpec and associated unit tests.	2020-07-24 18:01:41 -07:00
Rong Xu	1dd39b1133	[PGO] Fix incorrect function entry count Function entry count might be zero after the profile counts reset and before reentry to the function. Zero profile entry count is very bad as the profile count from BFI will be wrong. A simple fix is to set the profile entry count to 1 if there are non-zero profile counts in this function. Differential Revision: https://reviews.llvm.org/D84378	2020-07-24 17:39:55 -07:00
Rong Xu	31bd15c562	[PGO][InstrProf] Do not promote count if the exit blocks contains ret instruction Skip profile count promotion if any of the ExitBlocks contains a ret instruction. This is to prevent dumping of incomplete profile -- if the the loop is a long running loop and dump is called in the middle of the loop, the result profile is incomplete. ExitBlocks containing a ret instruction is an indication of a long running loop -- early exit to error handling code. Differential Revision: https://reviews.llvm.org/D84379	2020-07-24 17:38:31 -07:00
Rong Xu	5546c2ab42	Revert "[PGO][InstrProf] Do not promote count if the exit blocks contains ret instruction" This reverts commit `6fdc6f6c7d`.	2020-07-24 17:35:44 -07:00
Rong Xu	dcf1bca0de	Revert "[PGO][InstrProf] Do not promote count if the exit blocks contains ret instruction" This reverts commit `867ef4472d`.	2020-07-24 17:33:49 -07:00
Rong Xu	867ef4472d	[PGO][InstrProf] Do not promote count if the exit blocks contains ret instruction Forgot including the tests in the commit `6fdc6f6c7d`.	2020-07-24 17:23:33 -07:00
Amy Kwan	74790a5dde	[PowerPC] Implement Truncate and Store VSX Vector Builtins This patch implements the `vec_xst_trunc` function in altivec.h in order to utilize the Store VSX Vector Rightmost [byte \| half \| word \| doubleword] Indexed instructions introduced in Power10. Differential Revision: https://reviews.llvm.org/D82467	2020-07-24 19:22:39 -05:00
Jessica Paquette	fcc55c0952	[AArch64][GlobalISel] Use wzr/xzr for 16 and 32 bit stores of zero We weren't performing this optimization on 16 and 32 bit stores. SDAG happily does this though. e.g. https://godbolt.org/z/cWocKr This saves about 0.2% in code size on CTMark at -O3. Differential Revision: https://reviews.llvm.org/D84568	2020-07-24 17:15:20 -07:00
Rong Xu	6fdc6f6c7d	[PGO][InstrProf] Do not promote count if the exit blocks contains ret instruction Skip profile count promotion if any of the ExitBlocks contains a ret instruction. This is to prevent dumping of incomplete profile -- if the the loop is a long running loop and dump is called in the middle of the loop, the result profile is incomplete. ExitBlocks containing a ret instruction is an indication of a long running loop -- early exit to error handling code. Differential Revision: https://reviews.llvm.org/D84379	2020-07-24 17:13:58 -07:00
Matt Arsenault	4b53072ee5	GlobalISel: Define mulfix/divfix opcodes The full expansion involves the funnel shifts, which depend on another patch to expand those.	2020-07-24 20:02:20 -04:00
Amara Emerson	f320f83f3a	[AArch64][GlobalISel] Promote G_UITOFP vector operands to same elt size as result. Fixes legalization failures.	2020-07-24 17:00:50 -07:00
Jonas Devlieghere	34d4c8a53e	[lldb] Have LanguageRuntime and SystemRuntime share a base class (NFC) LangaugeRuntime and SystemRuntime now both inherit from Runtime.	2020-07-24 16:28:34 -07:00
Jonas Devlieghere	99996213eb	[lldb] Don't wrap and release raw pointer in unique_ptr (NFC)	2020-07-24 16:28:34 -07:00
Jez Ng	06a0dd2467	[lld-macho] Ignore -dependency_info and its argument XCode passes in this flag, which we do not yet implement. Skip over the argument for now so we can at least successfully parse the linker invocation. Reviewed By: #lld-macho, compnerd Differential Revision: https://reviews.llvm.org/D84485	2020-07-24 15:55:27 -07:00
Jez Ng	31d5885842	[lld-macho] Partial support for weak definitions This diff adds support for weak definitions, though it doesn't handle weak symbols in dylibs quite correctly -- we need to emit binding opcodes for them in the weak binding section rather than the lazy binding section. What is covered in this diff: 1. Reading the weak flag from symbol table / export trie, and writing it to the export trie 2. Refining the symbol table's rules for choosing one symbol definition over another. Wrote a few dozen test cases to make sure we were matching ld64's behavior. We can now link basic C++ programs. Reviewed By: #lld-macho, compnerd Differential Revision: https://reviews.llvm.org/D83532	2020-07-24 15:55:25 -07:00

1 2 3 4 5 ...

361422 Commits All Branches Search

361422 Commits

All Branches