llvm-project

Commit Graph

Author	SHA1	Message	Date
Thomas Raoux	6e557bc405	[mlir][spirv] Add Vector to SPIR-V conversion pass Add conversion pass for Vector dialect to SPIR-V dialect and add some simple conversion pattern for vector.broadcast, vector.insert, vector.extract. Differential Revision: https://reviews.llvm.org/D88761	2020-10-06 11:53:23 -07:00
Scott Linder	bf5c1d92d9	[AMDGPU] Fix remaining kernel descriptor test Follow up on `e4a9e4ef55` to fix a test I missed in the original patch. Committed as obvious.	2020-10-06 18:45:04 +00:00
Eric Schweitz	0f8294072f	[NFC][flang] Add the header file Todo.h. This file is being upstreamed to satisfy dependencies and enable continued progress on lowering of OpenMP, OpenACC, etc. Differential Revision: https://reviews.llvm.org/D88909	2020-10-06 11:31:46 -07:00
Fanbo Meng	43cd0a98d1	[SystemZ][z/OS] Set default alignment rules for z/OS target Update RUN line to fix lit failure Differential Revision: https://reviews.llvm.org/D88845	2020-10-06 14:21:21 -04:00
Nicolas Vasilache	a3adcba645	[mlir][Linalg] Implement tiling on tensors This revision implements tiling on tensors as described in: https://llvm.discourse.group/t/an-update-on-linalg-on-tensors/1878/4 Differential revision: https://reviews.llvm.org/D88733	2020-10-06 17:51:11 +00:00
Konrad Dobros	c9f1c50fc0	[mlir][spirv] Fix extended insts deserialization generation This change replaces container used for storing temporary strings for generated code to std::list. SmallVector may reallocate internal data, which will invalidate references when more than one extended instruction set is generated. Reviewed By: mravishankar, antiagainst Differential Revision: https://reviews.llvm.org/D88626	2020-10-06 13:34:58 -04:00
Scott Linder	e4a9e4ef55	[AMDGPU] Emit correct kernel descriptor on big-endian hosts Previously we wrote multi-byte values out as-is from host memory. Use the `emitIntN` helpers in `MCStreamer` to produce a valid descriptor irrespective of the host endianness. Reviewed By: arsenm, rochauha Differential Revision: https://reviews.llvm.org/D88858	2020-10-06 17:29:38 +00:00
Thomas Raoux	92e83afe44	[mlir][vector] Fold extractOp coming from broadcastOp Combine ExtractOp with scalar result with BroadcastOp source. This is useful to be able to incrementally convert degenerated vector of one element into scalar. Differential Revision: https://reviews.llvm.org/D88751	2020-10-06 10:27:39 -07:00
Stanislav Mekhanoshin	acce6b6082	[AMDGPU] Create isGFX9Plus utility function Introduce a utility function to make it more convenient to write code that is the same on the GFX9 and GFX10 subtargets. Use isGFX9Plus in the AsmParser for AMDGPU. Authored By: Joe_Nash Differential Revision: https://reviews.llvm.org/D88908	2020-10-06 10:18:43 -07:00
Fanbo Meng	c781dc74a8	[SystemZ][z/OS] Set default alignment rules for z/OS target Set the default alignment control variables for z/OS target and add test case for alignment rules on z/OS. Reviewed By: abhina.sreeskantharajan Differential Revision: https://reviews.llvm.org/D88845	2020-10-06 13:16:15 -04:00
Simon Pilgrim	6c7d713cf5	[X86][SSE] combineX86ShuffleChain add 'CanonicalizeShuffleInput' helper. NFCI. As part of PR45974, we're getting closer to not creating 'padded' vectors on-the-fly in combineX86ShufflesRecursively, and only pad the source inputs if we have a definite match inside combineX86ShuffleChain. At the moment combineX86ShuffleChain just has to bitcast an input to the correct shuffle type, but eventually we'll need to pad them as well. So, move the bitcast into a 'CanonicalizeShuffleInput helper for now, making the diff for future padding support a lot smaller.	2020-10-06 17:47:24 +01:00
Sebastian Neubauer	b4264210f2	[AMDGPU] Remove SIInstrInfo::calculateLDSSpillAddress This function does not seem to be used anymore. Differential Revision: https://reviews.llvm.org/D88904	2020-10-06 18:45:22 +02:00
Nikita Popov	616f545048	[MemCpyOpt] Use dereferenceable pointer helper The call slot optimization has some home-grown code for checking whether the destination is dereferenceable. Replace this with the generic isDereferenceableAndAlignedPointer() helper. I'm not checking alignment here, because that is currently handled separately and may be an enforced alignment for allocas. The clean way of integrating that part would probably be to accept a callback in isDereferenceableAndAlignedPointer() for the actual isAligned check, which would then have a chance to use an enforced alignment instead. This allows the destination to be a GEP (among other things), though the two open TODOs may prevent it from working in practice. Differential Revision: https://reviews.llvm.org/D88805	2020-10-06 18:41:19 +02:00
Nikita Popov	6b441ca523	[MemCpyOpt] Check for throwing calls during call slot optimization When performing call slot optimization for a non-local destination, we need to check whether there may be throwing calls between the call and the copy. Otherwise, the early write to the destination may be observable by the caller. This was already done for call slot optimization of load/store, but not for memcpys. For the sake of clarity, I'm moving this check into the common optimization function, even if that does need an additional instruction scan for the load/store case. As efriedma pointed out, this check is not sufficient due to potential accesses from another thread. This case is left as a TODO. Differential Revision: https://reviews.llvm.org/D88799	2020-10-06 18:24:40 +02:00
Nikita Popov	80cde02e85	[MemCpyOpt] Add separate statistic for call slot optimization (NFC)	2020-10-06 18:14:10 +02:00
Hafiz Abid Qadeer	f78bb4d84e	[libc++] Check _LIBCPP_USE_CLOCK_GETTIME before using clock_gettime The clock_gettime function is available when _POSIX_TIMERS is defined. We check for this and set _LIBCPP_USE_CLOCK_GETTIME accordingly since `59b3102739`. But check for _LIBCPP_USE_CLOCK_GETTIME was removed in `babd3aefc9`. As a result, code is now trying to use clock_gettime even on platforms where it is not available and it is causing build failure with newlib. This patch restores the checks to fix this. Differential Revision: https://reviews.llvm.org/D88825	2020-10-06 11:56:54 -04:00
peter klausler	53bf28b80c	[flang] Track CHARACTER length better in TypeAndShape CHARACTER length expressions were not always being captured or computed as part of procedure "characteristics", leading to test failures due to an inability to compute memory size expressions accurately. Differential revision: https://reviews.llvm.org/D88689	2020-10-06 08:45:46 -07:00
Simon Pilgrim	3cb8347c94	[APIntTest] Extend extractBits to check 'lshr+trunc' pattern for each case as well. Noticed while triaging PR47731 that we don't have great coverage for such patterns.	2020-10-06 16:32:40 +01:00
Louis Dionne	281de8f361	[libc++] Allow retries in two flaky tests	2020-10-06 11:32:19 -04:00
Fangrui Song	43c7dc52f1	[X86] .code16: temporarily set Mode32Bit when matching an instruction with the data32 prefix PR47632 This allows MC to match `data32 ...` as one instruction instead of two (data32 without insn + insn). The compatibility with GNU as improves: `data32 ljmp` will be matched as ljmpl. `data32 lgdt 4(%eax)` will be matched as `lgdtl` (prefixes: 0x67 0x66, instead of 0x66 0x67). GNU as supports many other `data32 w` as `l`. We currently just hard code `data32 callw` and `data32 ljmpw`. Generalizing the suffix replacement is tricky and requires a think about the "bwlq" appending suffix rules in MatchAndEmitATTInstruction. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D88772	2020-10-06 08:32:03 -07:00
Aaron En Ye Shi	8d2a0c115e	[HIP] NFC Add comments to cmath functions Add missing comments to cmath functions. Differential Revision: https://reviews.llvm.org/D88837	2020-10-06 15:26:56 +00:00
Aaron En Ye Shi	42093562a7	[HIP] NFC properly reference Differential Revision Committed [HIP] Restructure hip headers to add cmath with typo in commit message. Should be Differential Revision instead of Review. Using this to close the diff. Differential Revision: https://reviews.llvm.org/D88837	2020-10-06 15:19:00 +00:00
Dávid Bolvanský	86429c4eaf	[SimplifyLibCalls] Optimize mempcpy_chk to mempcpy	2020-10-06 17:08:46 +02:00
LLVM GN Syncbot	260892dff0	[gn build] Port `aa2b593f14`	2020-10-06 14:49:44 +00:00
Aaron En Ye Shi	aa2b593f14	[HIP] Restructure hip headers to add cmath Separate __clang_hip_math.h header into __clang_hip_cmath.h and __clang_hip_math.h. Improve the math function definition, and add missing definitions or declarations. Add missing overloads. Reviewed By: tra, JonChesterfield Differential Review: https://reviews.llvm.org/D88837	2020-10-06 14:48:53 +00:00
Arthur Eubanks	40251fee00	[BPF][NewPM] Make BPFTargetMachine properly adjust NPM optimizer pipeline This involves porting BPFAbstractMemberAccess and BPFPreserveDIType to NPM, then adding them BPFTargetMachine::registerPassBuilderCallbacks (the NPM equivalent of adjustPassManager()). Reviewed By: yonghong-song, asbirlea Differential Revision: https://reviews.llvm.org/D88855	2020-10-06 07:42:32 -07:00
Arthur Eubanks	8df17b4dc1	[test][InstCombine][NewPM] Fix InstCombine tests under NPM Some of these depended on analyses being present that aren't provided automatically in NPM. early_dce_clobbers_callgraph.ll was previously inlining a noinline function? cast-call-combine.ll relied on the legacy always-inline pass being a CGSCC pass and getting rerun. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D88187	2020-10-06 07:39:00 -07:00
Arthur Eubanks	61d4b342d1	[test][NewPM] Make dead-uses.ll work under NPM This one is weird... globals-aa needs to be already computed at licm, or else a function pass can't run a module analysis and won't have access to globals-aa. But the globals-aa result is impacted by instcombine in a way that affects what the test is expecting. If globals-aa is computed before instcombine, it is cached and globals-aa used in licm won't contain the necessary info provided by instcombine. Another catch is that if we don't invalidate AAManager, it will use the cached AAManager that instcombine requested, which may not contain globals-aa. So we have to invalidate<aa> so that licm can recompute an AAManager with the globals-aa created by the require<globals-aa>. This is essentially the problem described in https://reviews.llvm.org/D84259. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D88118	2020-10-06 07:33:02 -07:00
Johannes Doerfert	4a7a988442	[Attributor][FIX] Move assertion to make it not trivially fail The idea of this assertion was to check the simplified value before we assign it, not after, which caused this to trivially fail all the time.	2020-10-06 09:32:18 -05:00
Johannes Doerfert	04f6951397	[Attributor][FIX] Dead return values are not `noundef` When we assume a return value is dead we might still visit return instructions via `Attributor::checkForAllReturnedValuesAndReturnInsts(..)`. When we do so the "returned value" is potentially simplified to `undef` as it is the assumed "returned value". This is a problem if there was a preexisting `noundef` attribute that will only be removed as we manifest the `undef` return value. We should not use this combination to derive `unreachable` though. Two test cases fixed.	2020-10-06 09:32:18 -05:00
Johannes Doerfert	957094e31b	[Attributor][NFC] Ignore benign uses in AAMemoryBehaviorFloating In AAMemoryBehaviorFloating we used to track benign uses in a SetVector. With this change we look through benign uses eagerly to reduce the number of elements (=Uses) we look at during an update. The test does actually not fail prior to this commit but I already wrote it so I kept it.	2020-10-06 09:32:18 -05:00
Shivanshu Goyal	66e4f07198	Add ability to turn off -fpch-instantiate-templates in clang-cl A lot of our code building with clang-cl.exe using Clang 11 was failing with the following 2 type of errors: 1. explicit specialization of 'foo' after instantiation 2. no matching function for call to 'bar' Note that we also use -fdelayed-template-parsing in our builds. I tried pretty hard to get a small repro for these failures, but couldn't. So there is some subtle edge case in the -fpch-instantiate-templates feature introduced by this change: https://reviews.llvm.org/D69585 When I tried turning this off using -fno-pch-instantiate-templates, builds would silently fail with the same error without any indication that -fno-pch-instantiate-templates was being ignored by the compiler. Then I realized this "no" option wasn't actually working when I ran Clang under a debugger. Differential revision: https://reviews.llvm.org/D88680	2020-10-06 16:23:23 +02:00
Dmitri Gribenko	b3876ef490	Silence -Wunused-variable in NDEBUG mode	2020-10-06 16:02:17 +02:00
Dmitri Gribenko	37c74dfe72	Revert "[c++17] Implement P0145R3 during constant evaluation." This reverts commit `ded79be635`. It causes a crash (I sent the crash reproducer directly to the author).	2020-10-06 15:49:44 +02:00
Simon Pilgrim	17b9a91ec2	[InstCombine] canRewriteGEPAsOffset - don't dereference a dyn_cast<>. NFCI. We know V is a IntToPtrInst or PtrToIntInst type so we know its a CastInst - so use cast<> directly. Prevents clang static analyzer warning that we could deference a null pointer.	2020-10-06 14:48:34 +01:00
Simon Pilgrim	75d33a3a97	[InstCombine] FoldShiftByConstant - consistently use ConstantExpr in logicalshift(trunc(shift(x,c1)),c2) fold. NFCI. This still only gets used for scalar types but now always uses ConstantExpr in preparation for vector support - it was using APInt methods in some places.	2020-10-06 14:48:34 +01:00
Haojian Wu	8a3cbb1535	[clangd] Add basic keyword-name-validation in rename. Differential Revision: https://reviews.llvm.org/D88875	2020-10-06 15:47:57 +02:00
Sam Tebbs	68e002e181	[ARM] Fold select_cc(vecreduce_[u\|s][min\|max], x) into VMINV or VMAXV This folds a select_cc or select(set_cc) of a max or min vector reduction with a scalar value into a VMAXV or VMINV. Differential Revision: https://reviews.llvm.org/D87836	2020-10-06 14:44:58 +01:00
Dmitry Preobrazhensky	e2452f57fa	[AMDGPU][MC] Added detection of unsupported instructions Implemented identification of unsupported instructions; improved errors reporting. See bug 42590. Reviewers: rampitec Differential Revision: https://reviews.llvm.org/D88211	2020-10-06 16:44:27 +03:00
Nicolas Vasilache	d8ee28b96e	[mlir][Linalg] Extend buffer allocation to support Linalg init tensors This revision adds init_tensors support to buffer allocation for Linalg on tensors. Currently makes the assumption that the init_tensors fold onto the first output tensors. This assumption is not currently enforced or cast in stone and requires experimenting with tiling linalg on tensors for ops without reductions. Still this allows progress towards the end-to-end goal.	2020-10-06 13:24:27 +00:00
Chuyang Chen	8fa45e1fd5	Convert diagnostics about multi-character literals from extension to warning This addresses PR46797.	2020-10-06 08:47:17 -04:00
Jonas Paulsson	5588dbce73	[SystemZAsmParser] Treat VR128 separately in ParseDirectiveInsn(). This patch makes the parser - reject higher vector registers (>=16) in operands where they should not be accepted. - accept higher integers (>=16) in vector register operands. Review: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D88888	2020-10-06 14:42:40 +02:00
Michał Górny	a825eaa90e	[lldb] [Platform] Move common ::DebugProcess() to PlatformPOSIX Move common ::DebugProcess() implementation shared by Linux and NetBSD (and to be shared by FreeBSD shortly) into PlatformPOSIX, and move the old base implementation used only by Darwin to PlatformDarwin. Differential Revision: https://reviews.llvm.org/D88852	2020-10-06 14:38:54 +02:00
Simon Pilgrim	21100f885d	[InstCombine] FoldShiftByConstant - use PatternMatch for logicalshift(trunc(shift(x,c1)),c2) fold. NFCI.	2020-10-06 13:13:08 +01:00
Simon Pilgrim	0b402e985e	[InstCombine] FoldShiftByConstant - remove unnecessary cast<>. NFC. Op1 is already a Constant*	2020-10-06 13:13:08 +01:00
Alexey Lapshin	7bbb65b0a4	[llvm-objcopy][NFC] fix style issues reported by clang-format.	2020-10-06 15:06:25 +03:00
LLVM GN Syncbot	95429b88a4	[gn build] Port `d6c9dc3c17`	2020-10-06 12:02:07 +00:00
Adam Balogh	d6c9dc3c17	[clang-tidy] Remove obsolete checker google-runtime-references The rules which is the base of this checker is removed from the //Google C++ Style Guide// in May: [[ https://github.com/google/styleguide/pull/553 \| Update C++ styleguide ]]. Now this checker became obsolete. Differential Revision: https://reviews.llvm.org/D88831	2020-10-06 14:03:55 +02:00
Alexander Shaposhnikov	315970de1d	[llvm-objcopy][MachO] Add support for universal binaries This diff adds support for universal binaries to llvm-objcopy. This is a recommit of `32c8435ef7` with the asan issue fixed. Test plan: make check-all Differential revision: https://reviews.llvm.org/D88400	2020-10-06 04:01:40 -07:00
Denis Antrushin	c08d48fc2d	[Statepoints] Change statepoint machine instr format to better suit VReg lowering. Current Statepoint MI format is this: STATEPOINT <id>, <num patch bytes >, <num call arguments>, <call target>, [call arguments...], <StackMaps::ConstantOp>, <calling convention>, <StackMaps::ConstantOp>, <statepoint flags>, <StackMaps::ConstantOp>, <num deopt args>, [deopt args...], <gc base/derived pairs...> <gc allocas...> Note that GC pointers are listed in pairs <base,derived>. This causes base pointers to appear many times (at least twice) in instruction, which is bad for us when VReg lowering is ON. The problem is that machine operand tiedness is 1-1 relation, so it might look like this: %vr2 = STATEPOINT ... %vr1, %vr1(tied-def0) Since only one instance of %vr1 is tied, that may lead to incorrect codegen (see PR46917 for more details), so we have to always spill base pointers. This mostly defeats new VReg lowering scheme. This patch changes statepoint instruction format so that every gc pointer appears only once in operand list. That way they all can be tied. Additional set of operands is added to preserve base-derived relation required to build stackmap. New statepoint has following format: STATEPOINT <id>, <num patch bytes>, <num call arguments>, <call target>, [call arguments...], <StackMaps::ConstantOp>, <calling convention>, <StackMaps::ConstantOp>, <statepoint flags>, <StackMaps::ConstantOp>, <num deopt args>, [deopt args...], <StackMaps::ConstantOp>, <num gc pointers>, [gc pointers...], <StackMaps::ConstantOp>, <num gc allocas>, [gc allocas...] <StackMaps::ConstantOp>, <num entries in gc map>, [base/derived indices...] Changes are: - every gc pointer is listed only once in a flat length-prefixed list; - alloca list is prefixed with its length too; - following alloca list is length-prefixed list of base-derived indices of pointers from gc pointer list. Note that indices are logical (number of pointer), not absolute (index of machine operand). Differential Revision: https://reviews.llvm.org/D87154	2020-10-06 17:40:29 +07:00

... 3 4 5 6 7 ...

368449 Commits All Branches Search

368449 Commits

All Branches