llvm-project

Commit Graph

Author	SHA1	Message	Date
Siva Chandra Reddy	4e8830830e	[libc] Add a missing deps to the linux syscalls target. Submitted as obvious.	2020-03-18 12:48:53 -07:00
Eli Friedman	ebec984e14	[AliasAnalysis] Misc fixes for checking aliasing with scalable types. This is fixing up various places that use the implicit TypeSize->uint64_t conversion. The new overloads in MemoryLocation.h are already used in various places that construct a MemoryLocation from a TypeSize, including MemorySSA. (They were using the implicit conversion before.) Differential Revision: https://reviews.llvm.org/D76249	2020-03-18 12:28:47 -07:00
Alexey Bataev	2f8894a5b8	[OPENMP50]Add support for extended device clause in target directives. Added parsing/sema/serialization support for extended device clause in executable target directives.	2020-03-18 15:02:37 -04:00
Simon Pilgrim	1010c44b4c	[ValueTracking] Add computeKnownBits DemandedElts support to EXTRACTELEMENT/OR/BSWAP/BITREVERSE instructions (PR36319) These are all covered by the bswap/bitreverse vector tests.	2020-03-18 18:49:58 +00:00
Yaxun (Sam) Liu	6f79f80e6e	[HIP] Fix duplicate clang -cc1 options on MSVC toolchain HIPToolChain::TranslateArgs call TranslateArgs of host toolchain with the input args to get a list of derived args called DAL, then go through the input args by itself and append them to DAL. This assumes that the host toolchain should not append any unchanged args to DAL, otherwise there will be duplicates since HIPToolChain will append it again. This works for GNU toolchain since it returns an empty list for DAL. However, MSVC toolchain will append unchanged args to DAL, which causes duplicate args. This patch let MSVC toolchain not append unchanged args for HIP offloading kind, which fixes this issue. Differential Revision: https://reviews.llvm.org/D76032	2020-03-18 14:48:04 -04:00
Julian Lettner	f8e8f0a603	[TSan] Support pointer authentication in setjmp/longjmp interceptors arm64e adds support for pointer authentication, which was adopted by libplatform to harden setjmp/longjmp and friends. We need to teach the TSan interceptors for those functions about this. Reviewed By: kubamracek Differential Revision: https://reviews.llvm.org/D76257	2020-03-18 11:46:23 -07:00
Nemanja Ivanovic	e009fad342	[PowerPC] Remove UB from PPCInstrInfo when handling rotates fed by constants As pointed out in https://bugs.llvm.org/show_bug.cgi?id=45232 this code can end up shifting a 64-bit unsigned value left by 64 bits. Althought this works as expected on some platforms it is definitely UB. This patch removes the UB and adds the associated test case. Fixes: https://bugs.llvm.org/show_bug.cgi?id=45232	2020-03-18 13:40:39 -05:00
Simon Pilgrim	746bd860c9	Replace getAlignment() methods with getAlign() equivalents. Fixes deprecation warning in EXPENSIVE_CHECKS builds.	2020-03-18 18:25:07 +00:00
Simon Pilgrim	9c6458ecf8	[InstSimplify] Add bitreverse/bswap vector tests Shows missing DemandedElts support (PR36319)	2020-03-18 18:17:10 +00:00
Jessica Paquette	dc5f982639	[GlobalISel] Port some basic undef combines from DAGCombiner.cpp This ports some combines from DAGCombiner.cpp which perform some trivial transformations on instructions with undef operands. Not having these can make it extremely annoying to find out where we differ from SelectionDAG by looking at existing lit tests. Without them, we tend to produce pretty bad code generation when we run into instructions which use undef operands. Also remove the nonpow2_store_narrowing testcase from arm64-fallback.ll, since we no longer fall back on the add. Differential Revision: https://reviews.llvm.org/D76339	2020-03-18 11:05:44 -07:00
Jakub Kuderski	1e4ee0bfc5	[Dominators] Fixup comments in GenericDominatorTreeConstruction. NFC. Reviewers: asbirlea, brzycki, NutshellySima, grosser Reviewed By: asbirlea, NutshellySima Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76340	2020-03-18 13:59:58 -04:00
Adrian Prantl	1cc09dcefc	Add missing module map entry.	2020-03-18 10:50:45 -07:00
Jin Lin	0d896278c8	Support repeated machine outlining Summary: The following change is to allow the machine outlining can be applied for Nth times, where N is specified by the compiler option. By default the value of N is 1. The motivation is that the repeated machine outlining can further reduce code size. Please refer to the presentation "Improving Swift Binary Size via Link Time Optimization" in LLVM Developers' Meeting in 2019. Reviewers: aschwaighofer, tellenbach, paquette Reviewed By: paquette Subscribers: tellenbach, hiraditya, llvm-commits, jinlin Tags: #llvm Differential Revision: https://reviews.llvm.org/D71027	2020-03-18 10:48:52 -07:00
Florian Hahn	e6a74803d4	[VPlan] Use underlying value for printing, if available. When the an underlying value is available, we can use its name for printing, as discussed in D73078. Reviewers: rengolin, hsaito, Ayal, gilr Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D76200	2020-03-18 17:46:57 +00:00
Simon Tatham	e13d153c1b	[ARM,MVE] Add intrinsics for the VQDMLAD family. Summary: This is another set of instructions too complicated to be sensibly expressed in IR by anything short of a target-specific intrinsic. Given input vectors a,b, the instruction generates intermediate values 2(a[0]b[0]+a[1]+b[1]), 2(a[2]b[2]+a[3]+b[3]), etc; takes the high half of each double-width values, and overwrites half the lanes in the output vector c, which you therefore have to provide the input value of. Optionally you can swap the elements of b so that the are things like a[0]b[1]+a[1]b[0]; optionally you can round to nearest when taking the high half; and optionally you can take the difference rather than sum of the two products. Finally, saturation is applied when converting back to a single-width vector lane. Reviewers: dmgreen, MarkMurrayARM, miyuki, ostannard Reviewed By: miyuki Subscribers: kristof.beyls, hiraditya, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76359	2020-03-18 17:11:22 +00:00
Nico Weber	881f5b5a7b	Revert "[Syntax] Build template declaration nodes" This reverts commit `dd12826808`. Breaks tests on Windows, see https://reviews.llvm.org/D76346#1929208	2020-03-18 12:57:55 -04:00
Guillaume Chatelet	04a309dd0b	[libc] Adding memcpy implementation for x86_64 Summary: The patch is not ready yet and is here to discuss a few options: - How do we customize the implementation? (i.e. how to define `kRepMovsBSize`), - How do we specify custom compilation flags? (We'd need `-fno-builtin-memcpy` to be passed in), - How do we build? We may want to test in debug but build the libc with `-march=native` for instance, - Clang has a brand new builtin `__builtin_memcpy_inline` which makes the implementation easy and efficient, but: - If we compile with `gcc` or `msvc` we can't use it, resorting on less efficient code generation, - With gcc we can use `__builtin_memcpy` but then we'd need a postprocess step to check that the final assembly do not contain call to `memcpy` (unlikely but allowed), - For msvc we'd need to resort on the compiler optimization passes. Reviewers: sivachandra, abrachet Subscribers: mgorny, MaskRay, tschuett, libc-commits, courbet Tags: #libc-project Differential Revision: https://reviews.llvm.org/D74397	2020-03-18 17:43:21 +01:00
Nico Weber	642a424bc4	[gn build] remove a workaround that is no longer needed	2020-03-18 12:37:15 -04:00
Sam Parker	fc2a5ef9c8	[NFC][PowerPC] Update test Run the update script on one of the loop unroll tests.	2020-03-18 16:21:37 +00:00
Matt Arsenault	4ea1baf6a0	AMDGPU: Initial, crude support for indirect calls This isn't really usable, and requires using the -amdgpu-fixed-function-abi flag to work. Assumes a uniform call target, and will hit a verifier error if the call target ends up in a VGPR. Also doesn't attempt to do anything sensible for the reported register/stack usage.	2020-03-18 12:03:48 -04:00
Matt Arsenault	ea4597eef1	Reapply "AMDGPU/GlobalISel: Fully handle 0 dmask case during legalize" This reverts commit `9bca8fc4cf`. Rearrange handling to avoid changing the instruction in the case where it's going to be erased and replaced with undef.	2020-03-18 12:01:22 -04:00
Piotr Sobczak	d1a7bfca74	[AMDGPU] Fix AMDGPUUnifyDivergentExitNodes Summary: For the case where "done" bits on existing exports are removed by unifyReturnBlockSet(), unify all return blocks - even the uniformly reached ones. We do not want to end up with a non-unified, uniformly reached block containing a normal export with the "done" bit cleared. That case is believed to be rare - possible with infinite loops in pixel shaders. This is a fix for D71192. Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76364	2020-03-18 16:49:30 +01:00
Nico Weber	f57290ec57	[gn build] add rebase changes that should have been in `9f981e9adf`	2020-03-18 11:38:37 -04:00
Simon Pilgrim	06150e8356	[ValueTracking] Add computeKnownBits DemandedElts support to AND instructions (PR36319)	2020-03-18 15:38:15 +00:00
Nico Weber	9f981e9adf	Reland "[gn build] (manually) port 8b409eaba" This reverts commit `4060016fce` and re-merges `c5b81466c`.	2020-03-18 11:31:18 -04:00
Marcel Hlopko	dd12826808	[Syntax] Build template declaration nodes Summary: Copy of https://reviews.llvm.org/D72334, submitting with Ilya's permission. Handles template declaration of all kinds. Also builds template declaration nodes for specializations and explicit instantiations of classes. Some missing things will be addressed in the follow-up patches: specializations of functions and variables, template parameters. Reviewers: gribozavr2 Reviewed By: gribozavr2 Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76346	2020-03-18 16:16:59 +01:00
Sander de Smalen	ef64ba8311	[InstCombine] GEPOperator::accumulateConstantOffset does not support scalable vectors Avoid transforming: %0 = bitcast i8* %base to <vscale x 16 x i8>* %1 = getelementptr <vscale x 16 x i8>, <vscale x 16 x i8>* %0, i64 1 into: %0 = getelementptr i8, i8* %base, i64 16 %1 = bitcast i8* %0 to <vscale x 16 x i8>* Reviewers: efriedma, ctetreau Reviewed By: efriedma Tags: #llvm Differential Revision: https://reviews.llvm.org/D76236	2020-03-18 14:58:46 +00:00
Chris Bowler	c21866476e	[PowerPC][AIX] Implement by-val caller arguments in a single register. This is the first of a series of patches that adds caller support for by-value arguments. This patch add support for arguments that are passed in a single GPR. There are 3 limitation cases: -The by-value argument is larger than a single register. -There are no remaining GPRs even though the by-value argument would otherwise fit in a single GPR. -The by-value argument requires alignment greater than register width. Future patches will be required to add support for these cases as well as for the callee handling (in LowerFormalArguments_AIX) that corresponds to this work. Differential Revision: https://reviews.llvm.org/D75863	2020-03-18 10:57:28 -04:00
Jan Kratochvil	3481062bc6	[lldb] [testsuite] Enable forgotten -gsplit-dwarf for 2 testfiles D63643 added these testfiles but some of the %t4dwo and %t5dwo builds are the same as corresponding %t4 and %t5 builds. Fortunately the testcases do PASS. After just adding -gsplit-dwarf these both skeleton files: tools/lldb/test/SymbolFile/DWARF/Output/debug-types-expressions.test.tmp4dwo tools/lldb/test/SymbolFile/DWARF/Output/debug-types-expressions.test.tmp5dwo were referencing to this one non-skeleton file: tools/lldb/test/SymbolFile/DWARF/debug-types-expressions.dwo Surprisingly it does not affect the other test debug-types-basic.test probably because it compiles to .o and then links it. While debug-types-expressions.test compiles directly to an executable. So fixed that while keeping the direct executable compilation. Differential Revision: https://reviews.llvm.org/D76316	2020-03-18 15:49:24 +01:00
Simon Pilgrim	24c2e61362	[InstCombine][X86] Add additional demandedelts style test for in-range variable per-element shift amounts (PR40391) If we've shuffled the shift amount some of the (undemanded) elements may have become undef - this should be handled by the missing support in PR36319.	2020-03-18 14:36:34 +00:00
Mehdi Amini	4d506da91c	Fix `warning: extra ‘;’` (NFC)	2020-03-18 14:22:10 +00:00
Mehdi Amini	f3e297d90f	Fix build with gcc 7.5 by adding a "redundant move" The constructor of Expected<T> expects as T&&, but gcc-7.5 does not infer an rvalue in this context apparently.	2020-03-18 14:22:10 +00:00
Roman Lebedev	85334b030a	[NFCI][SCEV] Avoid recursion in SCEVExpander::isHighCostExpansion() Summary: As noted in [[ https://bugs.llvm.org/show_bug.cgi?id=45201 \| PR45201 ]], [[ https://bugs.llvm.org/show_bug.cgi?id=10090 \| PR10090 ]] SCEV doesn't always avoid recursive algorithms, and that causes issues with large expression depths and/or smaller stack sizes. In `SCEVExpander::isHighCostExpansion()` case, the refactoring to avoid recursion is rather idiomatic. We simply need to place the root expr into a vector, and iterate over vector elements accounting for the cost of each one, adding new exprs at the end of the vector, thus achieving recursion-less traversal. The order in which we will visit exprs doesn't matter here, so we will be fine with the most basic approach of using SmallVector and inserting/extracting from the back, which accidentally is the same depth-first traversal that we were doing previously recursively. Reviewers: mkazantsev, reames, wmi, ekatz Reviewed By: mkazantsev Subscribers: hiraditya, javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76273	2020-03-18 17:10:54 +03:00
Oliver Stannard	73cea83a6f	[IPRA][ARM] Spill extra registers at -Oz When optimising for code size at the expense of performance, it is often worth saving and restoring some of r0-r3, if IPRA will be able to take advantage of them. This doesn't cost any extra code size if we already have a PUSH/POP pair, and increases the number of available registers across any calls to the function. We already have an optimisation which tries fold the subtract/add of the SP into the PUSH/POP by using extra registers, which somewhat conflicts with this. I've made the new optimisation less aggressive in cases where the existing one is likely to trigger, which gives better results than either of these optimisations by themselves. Differential revision: https://reviews.llvm.org/D69936	2020-03-18 13:51:16 +00:00
Guillaume Chatelet	d000655a8c	[Alignment][NFC] Deprecate getMaxAlignment Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: jholewinski, arsenm, dschuff, jyknight, sdardis, nemanjai, jvesely, nhaehnle, sbc100, jgravelle-google, hiraditya, aheejin, kbarton, fedor.sergeev, asb, rbar, johnrusso, simoncook, sabuasal, niosHD, jrtc27, MaskRay, zzheng, edward-jones, atanasyan, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, Jim, lenary, s.egerton, pzheng, sameer.abuasal, apazos, luismarques, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76348	2020-03-18 14:48:45 +01:00
Kang Zhang	96b70809d9	[NFC][PowerPC] Add a new MIR file to test if-converter pass	2020-03-18 13:39:49 +00:00
Danila Malyutin	2aaafaf500	[NFC] Add missing REQUIRES clause to a test	2020-03-18 16:35:10 +03:00
Michael Liao	4cf01ed75e	[hip] Revise `GlobalDecl` constructors. NFC. Summary: - https://reviews.llvm.org/D68578 revises the `GlobalDecl` constructors to ensure all GPU kernels have `ReferenceKenelKind` initialized properly with an explicit constructor and static one. But, there are lots of places using the implicit constructor triggering the assertion on non-GPU kernels. That's found in compilation of many tests and workloads. - Fixing all of them may change more code and, more importantly, all of them assumes the default kernel reference kind. This patch changes that constructor to tell `CUDAGlobalAttr` and construct `GlobalDecl` properly. Reviewers: yaxunl Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76344	2020-03-18 09:33:39 -04:00
Oliver Stannard	6739805e24	[ARM] Track epilogue instructions with FrameDestroy flag (NFC) Rather than trying to work out which instructions are part of the epilogue by examining them, we can just mark them with the FrameDestroy flag, like we do in the AArch64 backend.	2020-03-18 13:32:59 +00:00
Kazuaki Ishizaki	a8901a0354	[mlir] NFC: Fix trivial typos in documents Fix trivial typos Reviewers: mravishankar, antiagainst, ftynse Reviewed By: ftynse Subscribers: ftynse, mehdi_amini, rriddle, jpienaar, burmako, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, Joonsoo, bader, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76347	2020-03-18 22:20:17 +09:00
Med Ismail Bennani	db31e2e1e6	[lldb/Target] Support more than 2 symbols in StackFrameRecognizer This patch changes the way the StackFrame Recognizers match a certain frame. Until now, recognizers could be registered with a function name but also an alternate symbol. This change is motivated by a test failure for the Assert frame recognizer on Linux. Depending the version of the libc, the abort function (triggered by an assertion), could have more than two signatures (i.e. `raise`, `__GI_raise` and `gsignal`). Instead of only checking the default symbol name and the alternate one, lldb will iterate over a list of symbols to match against. rdar://60386577 Differential Revision: https://reviews.llvm.org/D76188 Signed-off-by: Med Ismail Bennani <medismail.bennani@gmail.com>	2020-03-18 14:15:58 +01:00
Alexey Bataev	b09cce07c7	[OPENMP50]Codegen for detach clause. Implemented codegen for detach clause in task directives.	2020-03-18 09:01:17 -04:00
Francesco Petrogalli	9bdcd9bf44	[llvm][SVE] Addressing mode for FF/NF loads. Summary: This patch adds addressing mode computation for the following SVE instructions: * ldff1{s}<T1> { <Zt>.<T2> }, <Pg>/Z, [<Xn\|SP>{, <Xm>{, lsl #imm}}] * ldnf1{s}<T1> { <Zt>.<T2> }, <Pg>/Z, [<Xn\|SP>{, #<imm>, mul vl}] Reviewers: andwar, sdesmalen, rengolin, efriedma Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76209	2020-03-18 12:46:07 +00:00
Sander de Smalen	4788ca450f	[AArch64][SVE] Change pointer type of nontemporal load/store intrinsics Summary: This fixes a discrepancy between the non-temporal loads/store intrinsics and other SVE load intrinsics (such as nf/ff), so that Clang can use the same code to generate these intrinsics. Reviewers: andwar, kmclaughlin, rengolin, efriedma Reviewed By: efriedma Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76237	2020-03-18 12:44:51 +00:00
Danila Malyutin	940ba1465b	Fix possible assertion when using PBQP with debug info Skip debug instructions before calling functions not expecting them. In particular, LIS.getInstructionIndex(*mi) would fail if mi was a debg instr. Differential Revision: https://reviews.llvm.org/D76129	2020-03-18 15:29:42 +03:00
David Stenberg	a0a3a9c5a8	[DebugInfo] Fix multi-byte entry values in call site values Summary: In D67768/D67492 I added support for entry values having blocks larger than one byte, but I now noticed that the DIE implementation I added there was broken. The takeNodes() function, that moves the entry value block from a temporary buffer to the output buffer, would destroy the input iterator when transferring the first node, meaning that only that node was moved. In practice, this meant that when emitting a call site value using a DW_OP_entry_value operation with a DWARF register number larger than 31, that multi-byte DW_OP_regx expression would be truncated. Reviewers: djtodoro, aprantl, vsk Reviewed By: djtodoro Subscribers: llvm-commits Tags: #debug-info, #llvm Differential Revision: https://reviews.llvm.org/D76279	2020-03-18 13:23:17 +01:00
Florian Hahn	0db7244295	[SCCP] Precommit some additional tests for integer ranges.	2020-03-18 11:34:04 +00:00
Simon Pilgrim	f4e495a18e	[InstCombine][X86] simplifyX86varShift - convert variable in-range per-element shift amounts to generic shifts (PR40391) AVX2/AVX512 per-element shifts can be replaced with generic shifts if the shift amounts are guaranteed to be in-range (upper bits are known zero).	2020-03-18 11:26:54 +00:00
Sander de Smalen	c5b81466c2	Reland D75470 [SVE] Auto-generate builtins and header for svld1. Reworked the patch to avoid sharing a header (SVETypeFlags.h) between include/clang/Basic and utils/TableGen/SveEmitter.cpp. Now the patch generates the enum/flags which is included in TargetBuiltins.h. Also renamed one of the SveEmitter options to be in line with MVE. Summary: This is a first patch in a series for the SveEmitter to generate the arm_sve.h header file and builtins. I've tried my best to strip down this patch as best as I could, but there are still a few changes that are not necessarily exercised by the load intrinsics in this patch, mostly around the SVEType class which has some common logic to represent types from a type and prototype string. I thought it didn't make much sense to remove that from this patch and split it up.	2020-03-18 11:16:28 +00:00
Simon Tatham	928776de92	[ARM,MVE] Add intrinsics for the VQDMLAH family. Summary: These are complicated integer multiply+add instructions with extra saturation, taking the high half of a double-width product, and optional rounding. There's no sensible way to represent that in standard IR, so I've converted the clang builtins directly to target-specific intrinsics. Reviewers: dmgreen, MarkMurrayARM, miyuki, ostannard Reviewed By: miyuki Subscribers: kristof.beyls, hiraditya, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76123	2020-03-18 10:55:04 +00:00

... 2 3 4 5 6 ...

345727 Commits All Branches Search

345727 Commits

All Branches