llvm-project

Commit Graph

Author	SHA1	Message	Date
Yonghong Song	edd71db38b	BPF: avoid duplicated globals for CORE relocations This patch fixed two issues related with relocation globals. In LLVM, if a global, e.g. with name "g", is created and conflict with another global with the same name, LLVM will rename the global, e.g., with a new name "g.2". Since relocation global name has special meaning, we do not want llvm to change it, so internally we have logic to check whether duplication happens or not. If happens, just reuse the previous global. The first bug is related to non-btf-id relocation (BPFAbstractMemberAccess.cpp). Commit `54d9f743c8` ("BPF: move AbstractMemberAccess and PreserveDIType passes to EP_EarlyAsPossible") changed ModulePass to FunctionPass, i.e., handling each function at a time. But still just one BPFAbstractMemberAccess object is created so module level de-duplication still possible. Commit `40251fee00` ("[BPF][NewPM] Make BPFTargetMachine properly adjust NPM optimizer pipeline") made a change to create a BPFAbstractMemberAccess object per function so module level de-duplication is not possible any more without going through all module globals. This patch simply changed the map which holds reloc globals as class static, so it will be available to all BPFAbstractMemberAccess objects for different functions. The second bug is related to btf-id relocation (BPFPreserveDIType.cpp). Before Commit `54d9f743c8`, the pass is a ModulePass, so we have a local variable, incremented for each instance, and works fine. But after Commit `54d9f743c8`, the pass becomes a FunctionPass. Local variable won't work properly since different functions will start with the same initial value. Fix the issue by change the local count variable as static, so it will be truely unique across the whole module compilation. Differential Revision: https://reviews.llvm.org/D88942	2020-10-06 22:37:49 -07:00
Max Kazantsev	0c009e092e	[Test] Add test showing that we can avoid inserting trunc/zext	2020-10-07 12:19:01 +07:00
Johannes Doerfert	5a3f6bfe8a	Reapply "[OpenMP][FIX] Verify compatible types for declare variant calls" D88384 This reapplies D88384 with the minor modification that an assertion was changed to a regular conditional and graceful exit from ASTContext::mergeTypes.	2020-10-07 00:06:51 -05:00
Chen Zheng	ed46e84c7a	[MachineInstr] exclude call instruction in mayAlias we now get noAlias result for a call instruction and other load/store/call instructions if we query mayAlias. This is not right as call instruction is not with mayloadorstore, but it may alter the memory. This patch fixes this wrong alias query. Differential Revision: https://reviews.llvm.org/D87490	2020-10-07 00:12:21 -04:00
Chen Zheng	f05608707c	[PowerPC] implement target hook getTgtMemIntrinsic This patch can make pass recognize Powerpc related memory intrinsics. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D88373	2020-10-07 00:02:44 -04:00
Chen Zheng	0492dd91c4	[PowerPC] add more builtins for PPCTargetLowering::getTgtMemIntrinsic Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D88374	2020-10-06 23:48:33 -04:00
Bill Wendling	d2c61d2bf9	[CodeGen][TailDuplicator] Don't duplicate blocks with INLINEASM_BR Tail duplication of a block with an INLINEASM_BR may result in a PHI node on the indirect branch. This is okay, but it also introduces a copy for that PHI node after the INLINEASM_BR, which is not okay. See: https://github.com/ClangBuiltLinux/linux/issues/1125 Differential Revision: https://reviews.llvm.org/D88823	2020-10-06 18:44:59 -07:00
Valentin Clement	2f40e20613	[flang][openacc] Fix device_num and device_type clauses for init directive This patch fix the device_num and device_type clauses used in the init clause. device_num was not spelled correctly in the parser and was to restrictive with scalarIntConstantExpr instead of scalarIntExpr. device_type is now taking a list of ScalarIntExpr. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D88571	2020-10-06 21:27:01 -04:00
Johannes Doerfert	7993d61177	[Attributor] Use smarter way to determine alignment of GEPs Use same logic existing in other places to deal with base case GEPs. Add the original Attributor talk example.	2020-10-06 19:31:08 -05:00
Johannes Doerfert	c4cfe7a435	[Attributor] Ignore read accesses to constant memory The old function attribute deduction pass ignores reads of constant memory and we need to copy this behavior to replace the pass completely. First step are constant globals. TBAA can also describe constant accesses and there are other possibilities. We might want to consider asking the alias analyses that are available but for now this is simpler and cheaper.	2020-10-06 19:31:07 -05:00
Johannes Doerfert	3f540c05df	[Attributor] Give up early on AANoReturn::initialize If the function is not assumed `noreturn` we should not wait for an update to mark the call site as "may-return". This has two kinds of consequences: - We have less iterations in many tests. - We have less deductions based on "known information" (since we ask earlier, point 1, and therefore assumed information is not "known" yet). The latter is an artifact that we might want to tackle properly at some point but which is not easily fixable right now.	2020-10-06 19:31:07 -05:00
Jonas Devlieghere	e3b0414b0e	[lldb] Change the xcrun (fallback) logic in GetXcodeSDK This changes the logic in GetXcodeSDK to find an SDK with xcrun. The code now executes the following steps: 1. If DEVELOPER_DIR is set in the environment, it invokes xcrun with the given developer dir. If this fails we stop and don't fall back. 2. If the shlib dir is set and exists,it invokes xcrun with the developer dir corresponding to the shlib dir. If this fails we fall back to 3. 3. We run xcrun without a developer dir. The new behavior introduced in this patch is that we fall back to running xcrun without a developer dir if running it based on the shlib dir failed. A situation where this matters is when you're running lldb from an Xcode that has no SDKs and that is not xcode-selected. Based on lldb's shlib dir pointing into this Xcode installation, it will do an xcrun with the developer set to the Xcode without any SDKs which will fail. With this patch, when that happens, we'll fall back to trying the xcode-selected Xcode by running xcrun without a developer dir. Differential revision: https://reviews.llvm.org/D88866	2020-10-06 15:55:06 -07:00
Nico Weber	dfa70a483a	[gn build] manually port `5e4409f308`	2020-10-06 18:43:49 -04:00
Ahmed S. Taei	7060920bd1	Relax FuseTensorReshapeOpAsproducer identity mapping constraint Differential Revision: https://reviews.llvm.org/D88869	2020-10-06 22:31:39 +00:00
Dave Airlie	5e4409f308	Fix out-of-tree clang build due to sysexits change The sysexists change broke clang building out of tree against llvm. https://reviews.llvm.org/D88467	2020-10-06 18:21:17 -04:00
Lang Hames	b45b5166f8	[RuntimeDyld][COFF] Report fatal error on error, rather than emiting diagnostic. Report a fatal error if an IMAGE_REL_AMD64_ADDR32NB cannot be applied due to an out-of-range target. Previously we emitted a diagnostic to llvm::errs and continued. Patch by Dale Martin. Thanks Dale!	2020-10-06 15:16:29 -07:00
Duncan P. N. Exon Smith	7193f72798	docs: Emphasize ArrayRef over SmallVectorImpl The section on SmallVector has a note about preferring SmallVectorImpl for APIs but doesn't mention ArrayRef. Although ArrayRef is discussed elsewhere, let's re-emphasize here. Differential Revision: https://reviews.llvm.org/D49881	2020-10-06 18:13:52 -04:00
Jianzhou Zhao	4d1d8ae710	Replace shadow space zero-out by madvise at mmap After D88686, munmap uses MADV_DONTNEED to ensure zero-out before the next access. Because the entire shadow space is created by MAP_PRIVATE and MAP_ANONYMOUS, the first access is also on zero-filled values. So it is fine to not zero-out data, but use madvise(MADV_DONTNEED) at mmap. This reduces runtime overhead. Reviewed-by: morehouse Differential Revision: https://reviews.llvm.org/D88755	2020-10-06 21:29:50 +00:00
Petr Hosek	4540d66248	[CMake] Track TSan's dependency on C++ headers TSan relies on C++ headers, so when libc++ is being built as part of the runtimes build, include an explicit dependency on cxx-headers which is the same approach that's already used for other sanitizers. Differential Revision: https://reviews.llvm.org/D88912	2020-10-06 13:58:35 -07:00
Chris Palmer	9eff07a746	[libc++] Add assert to check bounds in `constexpr string_view::operator[]` Differential Revision: https://reviews.llvm.org/D88864	2020-10-06 16:57:41 -04:00
Aart Bik	c6c67f643d	[mlir] [sparse] convenience runtime support to read Matrix Market format Setting up input data for benchmarks and integration tests can be tedious in pure MLIR. With more sparse tensor work planned, this convenience library simplifies reading sparse matrices in the popular Matrix Market Exchange Format (see https://math.nist.gov/MatrixMarket). Note that this library is not part of core MLIR. It is merely intended as a convenience library for benchmarking and integration testing. Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D88856	2020-10-06 13:17:05 -07:00
Mehdi Amini	5a305f81bf	Remove unneeded "allow-unregistered-dialect" from shape-type-conversion.mlir test (NFC)	2020-10-06 20:11:39 +00:00
Alexandre Ganea	d3d790fc98	Revert [lit] Support running tests on Windows without GnuWin32 This reverts `b3418cb4eb` and `d12ae042e1` This breaks some external bots, see discussion in https://reviews.llvm.org/D84380 In the meanwhile, please use `cmake -DLLVM_LIT_TOOLS_DIR="C:/Program Files/Git/usr/bin"` or add it to %PATH%.	2020-10-06 15:38:18 -04:00
Louis Dionne	370b7887e5	[libc++] Add a script to setup CI on macOS nodes	2020-10-06 15:34:09 -04:00
Richard Smith	00d3e6c1b4	[c++17] Implement P0145R3 during constant evaluation. Ensure that we evaluate assignment and compound-assignment right-to-left, and array subscripting left-to-right. Fixes PR47724. This is a re-commit of `ded79be`, reverted in `37c74df`, with a fix and test for the crasher bug previously introduced.	2020-10-06 12:30:26 -07:00
Mircea Trofin	d85b845cb2	[NFC][MC] Type uses of MCRegUnitIterator as MCRegister This is one of many subsequent similar changes. Note that we're ok with the parameter being typed as MCPhysReg, as MCPhysReg -> MCRegister is a correct conversion; Register -> MCRegister assumes the former is indeed physical, so we stop relying on the implicit conversion and use the explicit, value-asserting asMCReg(). Differential Revision: https://reviews.llvm.org/D88862	2020-10-06 12:09:56 -07:00
Thomas Raoux	6e557bc405	[mlir][spirv] Add Vector to SPIR-V conversion pass Add conversion pass for Vector dialect to SPIR-V dialect and add some simple conversion pattern for vector.broadcast, vector.insert, vector.extract. Differential Revision: https://reviews.llvm.org/D88761	2020-10-06 11:53:23 -07:00
Scott Linder	bf5c1d92d9	[AMDGPU] Fix remaining kernel descriptor test Follow up on `e4a9e4ef55` to fix a test I missed in the original patch. Committed as obvious.	2020-10-06 18:45:04 +00:00
Eric Schweitz	0f8294072f	[NFC][flang] Add the header file Todo.h. This file is being upstreamed to satisfy dependencies and enable continued progress on lowering of OpenMP, OpenACC, etc. Differential Revision: https://reviews.llvm.org/D88909	2020-10-06 11:31:46 -07:00
Fanbo Meng	43cd0a98d1	[SystemZ][z/OS] Set default alignment rules for z/OS target Update RUN line to fix lit failure Differential Revision: https://reviews.llvm.org/D88845	2020-10-06 14:21:21 -04:00
Nicolas Vasilache	a3adcba645	[mlir][Linalg] Implement tiling on tensors This revision implements tiling on tensors as described in: https://llvm.discourse.group/t/an-update-on-linalg-on-tensors/1878/4 Differential revision: https://reviews.llvm.org/D88733	2020-10-06 17:51:11 +00:00
Konrad Dobros	c9f1c50fc0	[mlir][spirv] Fix extended insts deserialization generation This change replaces container used for storing temporary strings for generated code to std::list. SmallVector may reallocate internal data, which will invalidate references when more than one extended instruction set is generated. Reviewed By: mravishankar, antiagainst Differential Revision: https://reviews.llvm.org/D88626	2020-10-06 13:34:58 -04:00
Scott Linder	e4a9e4ef55	[AMDGPU] Emit correct kernel descriptor on big-endian hosts Previously we wrote multi-byte values out as-is from host memory. Use the `emitIntN` helpers in `MCStreamer` to produce a valid descriptor irrespective of the host endianness. Reviewed By: arsenm, rochauha Differential Revision: https://reviews.llvm.org/D88858	2020-10-06 17:29:38 +00:00
Thomas Raoux	92e83afe44	[mlir][vector] Fold extractOp coming from broadcastOp Combine ExtractOp with scalar result with BroadcastOp source. This is useful to be able to incrementally convert degenerated vector of one element into scalar. Differential Revision: https://reviews.llvm.org/D88751	2020-10-06 10:27:39 -07:00
Stanislav Mekhanoshin	acce6b6082	[AMDGPU] Create isGFX9Plus utility function Introduce a utility function to make it more convenient to write code that is the same on the GFX9 and GFX10 subtargets. Use isGFX9Plus in the AsmParser for AMDGPU. Authored By: Joe_Nash Differential Revision: https://reviews.llvm.org/D88908	2020-10-06 10:18:43 -07:00
Fanbo Meng	c781dc74a8	[SystemZ][z/OS] Set default alignment rules for z/OS target Set the default alignment control variables for z/OS target and add test case for alignment rules on z/OS. Reviewed By: abhina.sreeskantharajan Differential Revision: https://reviews.llvm.org/D88845	2020-10-06 13:16:15 -04:00
Simon Pilgrim	6c7d713cf5	[X86][SSE] combineX86ShuffleChain add 'CanonicalizeShuffleInput' helper. NFCI. As part of PR45974, we're getting closer to not creating 'padded' vectors on-the-fly in combineX86ShufflesRecursively, and only pad the source inputs if we have a definite match inside combineX86ShuffleChain. At the moment combineX86ShuffleChain just has to bitcast an input to the correct shuffle type, but eventually we'll need to pad them as well. So, move the bitcast into a 'CanonicalizeShuffleInput helper for now, making the diff for future padding support a lot smaller.	2020-10-06 17:47:24 +01:00
Sebastian Neubauer	b4264210f2	[AMDGPU] Remove SIInstrInfo::calculateLDSSpillAddress This function does not seem to be used anymore. Differential Revision: https://reviews.llvm.org/D88904	2020-10-06 18:45:22 +02:00
Nikita Popov	616f545048	[MemCpyOpt] Use dereferenceable pointer helper The call slot optimization has some home-grown code for checking whether the destination is dereferenceable. Replace this with the generic isDereferenceableAndAlignedPointer() helper. I'm not checking alignment here, because that is currently handled separately and may be an enforced alignment for allocas. The clean way of integrating that part would probably be to accept a callback in isDereferenceableAndAlignedPointer() for the actual isAligned check, which would then have a chance to use an enforced alignment instead. This allows the destination to be a GEP (among other things), though the two open TODOs may prevent it from working in practice. Differential Revision: https://reviews.llvm.org/D88805	2020-10-06 18:41:19 +02:00
Nikita Popov	6b441ca523	[MemCpyOpt] Check for throwing calls during call slot optimization When performing call slot optimization for a non-local destination, we need to check whether there may be throwing calls between the call and the copy. Otherwise, the early write to the destination may be observable by the caller. This was already done for call slot optimization of load/store, but not for memcpys. For the sake of clarity, I'm moving this check into the common optimization function, even if that does need an additional instruction scan for the load/store case. As efriedma pointed out, this check is not sufficient due to potential accesses from another thread. This case is left as a TODO. Differential Revision: https://reviews.llvm.org/D88799	2020-10-06 18:24:40 +02:00
Nikita Popov	80cde02e85	[MemCpyOpt] Add separate statistic for call slot optimization (NFC)	2020-10-06 18:14:10 +02:00
Hafiz Abid Qadeer	f78bb4d84e	[libc++] Check _LIBCPP_USE_CLOCK_GETTIME before using clock_gettime The clock_gettime function is available when _POSIX_TIMERS is defined. We check for this and set _LIBCPP_USE_CLOCK_GETTIME accordingly since `59b3102739`. But check for _LIBCPP_USE_CLOCK_GETTIME was removed in `babd3aefc9`. As a result, code is now trying to use clock_gettime even on platforms where it is not available and it is causing build failure with newlib. This patch restores the checks to fix this. Differential Revision: https://reviews.llvm.org/D88825	2020-10-06 11:56:54 -04:00
peter klausler	53bf28b80c	[flang] Track CHARACTER length better in TypeAndShape CHARACTER length expressions were not always being captured or computed as part of procedure "characteristics", leading to test failures due to an inability to compute memory size expressions accurately. Differential revision: https://reviews.llvm.org/D88689	2020-10-06 08:45:46 -07:00
Simon Pilgrim	3cb8347c94	[APIntTest] Extend extractBits to check 'lshr+trunc' pattern for each case as well. Noticed while triaging PR47731 that we don't have great coverage for such patterns.	2020-10-06 16:32:40 +01:00
Louis Dionne	281de8f361	[libc++] Allow retries in two flaky tests	2020-10-06 11:32:19 -04:00
Fangrui Song	43c7dc52f1	[X86] .code16: temporarily set Mode32Bit when matching an instruction with the data32 prefix PR47632 This allows MC to match `data32 ...` as one instruction instead of two (data32 without insn + insn). The compatibility with GNU as improves: `data32 ljmp` will be matched as ljmpl. `data32 lgdt 4(%eax)` will be matched as `lgdtl` (prefixes: 0x67 0x66, instead of 0x66 0x67). GNU as supports many other `data32 w` as `l`. We currently just hard code `data32 callw` and `data32 ljmpw`. Generalizing the suffix replacement is tricky and requires a think about the "bwlq" appending suffix rules in MatchAndEmitATTInstruction. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D88772	2020-10-06 08:32:03 -07:00
Aaron En Ye Shi	8d2a0c115e	[HIP] NFC Add comments to cmath functions Add missing comments to cmath functions. Differential Revision: https://reviews.llvm.org/D88837	2020-10-06 15:26:56 +00:00
Aaron En Ye Shi	42093562a7	[HIP] NFC properly reference Differential Revision Committed [HIP] Restructure hip headers to add cmath with typo in commit message. Should be Differential Revision instead of Review. Using this to close the diff. Differential Revision: https://reviews.llvm.org/D88837	2020-10-06 15:19:00 +00:00
Dávid Bolvanský	86429c4eaf	[SimplifyLibCalls] Optimize mempcpy_chk to mempcpy	2020-10-06 17:08:46 +02:00
LLVM GN Syncbot	260892dff0	[gn build] Port `aa2b593f14`	2020-10-06 14:49:44 +00:00

1 2 3 4 5 ...

368375 Commits All Branches Search

368375 Commits

All Branches