llvm-project

Commit Graph

Author	SHA1	Message	Date
Sam McCall	4f2e7f6fb1	[clangd] Try to fix windows buildbot. NFC http://45.33.8.238/win/19116/step_9.txt	2020-07-04 12:03:46 +02:00
Uday Bondhugula	6d6d5db251	[MLIR][Linalg] Generate the right type of load/store when lowering max/min pooling ops While lowering min/max pooling ops to loops, generate the right kind of load/stores (std or affine) instead of always generating std load/stores. Differential Revision: https://reviews.llvm.org/D83080	2020-07-04 14:55:02 +05:30
Paul Walker	7356b4243a	[SVE] Fix invalid assert in expand_DestructiveOp. AArch64ExpandPseudo::expand_DestructiveOp contains an assert to ensure the destructive operand's register is unique. However, this is only required when psuedo expansion emits a movprfx. A simple example when a movprfx is not required is Z0 = FADD_ZPZZ_UNDEF_S P0, Z0, Z0 which expands to an unprefixed FADD_ZPmZ_S instruction. This patch moves the assert to the places where a movprfx is emitted. Differential Revision: https://reviews.llvm.org/D83029	2020-07-04 09:21:40 +00:00
Sam McCall	15a60fe09f	[clangd] Config: compute config in TUScheduler and BackgroundIndex Summary: ClangdServer owns the question of exactly which config to create, but TUScheduler/BackgroundIndex control threads and so decide at which point to inject it. Reviewers: kadircet Subscribers: ilya-biryukov, javed.absar, MaskRay, jkorous, arphaman, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D83095	2020-07-04 11:18:14 +02:00
Nikita Popov	3b671022e4	[InstSimplify] Simplify comparison between zext(x) and sext(x) This is picking up a loose thread from D69006: We can simplify (zext x) ule (sext x) and (zext x) sge (sext x) to true, with various permutations. Oddly, SCEV knows about this identity, but nothing on the IR level does. Differential Revision: https://reviews.llvm.org/D83081	2020-07-04 11:03:00 +02:00
Nikita Popov	93ccb8eb52	[InstSimplify] Add additional zext/sext comparison tests (NFC) Add vector variants, and negative tests where the operand does not match.	2020-07-04 11:03:00 +02:00
LLVM GN Syncbot	2ac9c45910	[gn build] Port `8bd000a65f`	2020-07-04 08:53:11 +00:00
Sam McCall	8bd000a65f	[clangd] Config: loading and caching config from disk. Summary: The Provider extension point is designed to also be implemented by ClangdLSPServer (to inject config-over-lsp) and likely by embedders. Reviewers: kadircet Subscribers: mgorny, ilya-biryukov, MaskRay, jkorous, arphaman, jfb, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D82964	2020-07-04 10:48:31 +02:00
Craig Topper	fed432523e	[X86] Directly emit VPTERNLOG from canonicalizeBitSelect when possible. Seems to produce better results on some rotate tests. And is neutral for other tests.	2020-07-03 22:08:28 -07:00
Kai Luo	c352e0885a	[PowerPC] Implement probing for prologue This patch is part of supporting `-fstack-clash-protection`. Implemented probing when emitting prologue. Differential Revision: https://reviews.llvm.org/D81460	2020-07-04 03:07:08 +00:00
Craig Topper	e75f2d5a8c	[X86] Add matching support for X86ISD::ANDNP to X86DAGToDAGISel::tryVPTERNLOG.	2020-07-03 17:50:35 -07:00
peter klausler	0006354c3b	[flang] Further implementation of external I/O unit operations (part 6) Rework initial implementation of external I/O unit operations to fix problems exposed in unit tests (in a later patch). Add flushing. Reviewed By: sscalpone Differential Revision: https://reviews.llvm.org/D83147	2020-07-03 17:31:01 -07:00
Thomas Lively	8df30d988e	[WebAssembly] Do not omit range checks for i64 switches Summary: Since the br_table instruction takes an i32, switches over i64s (and larger integers) must use the i32.wrap_i64 instruction to truncate the table index. This truncation makes numbers just over 2^32 indistinguishable from small numbers, so it was a miscompilation to omit the range check preceding these br_tables. This change fixes the problem by skipping the "fixing" of the br_table when the range check is an i64 instruction. Fixes PR46447. Reviewers: aheejin, dschuff, kripken Reviewed By: kripken Subscribers: sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83017	2020-07-03 17:15:39 -07:00
Fangrui Song	1c6e2eceeb	[gcov][test] Add `UNSUPPORTED: host-byteorder-big-endian` to gcov-fork.c This test strangely failed on ppc64be http://lab.llvm.org:8011/builders/clang-ppc64be-linux/builds/50913	2020-07-03 17:06:54 -07:00
Fangrui Song	fba8523fb5	[gcov][test] Reorganize some compiler-rt/test/profile tests	2020-07-03 16:17:16 -07:00
Francis Visoiu Mistrih	aa5ec34e31	[LoopDeletion] Emit a remark when a dead loop is deleted This emits a remark when LoopDeletion deletes a dead loop, using the source location of the loop's header. There are currently two reasons for removing the loop: invariant loop or loop that never executes. Differential Revision: https://reviews.llvm.org/D83113	2020-07-03 15:20:23 -07:00
Lei Huang	e359ab1eca	[PowerPC][NFC] Fix indentation	2020-07-03 16:47:24 -05:00
Roman Lebedev	17a15c32af	[NFCI][LoopUnroll] s/%tmp/%i/ in one test to silence update script warning	2020-07-04 00:39:36 +03:00
Roman Lebedev	341ab51149	[NFCI][InstCombine] shift.ll: s/%tmp/%i/ to silence update script warning	2020-07-04 00:39:35 +03:00
Sanjay Patel	26543f1c0c	[x86] improve codegen for bit-masked vector compare and select (PR46531) We canonicalize patterns like: %s = lshr i32 %a0, 1 %t = trunc i32 %s to i1 to: %a = and i32 %a0, 2 %c = icmp ne i32 %a, 0 ...in IR, but the bit-shifting original sequence may be better for x86 vector codegen. I tried several variants of the transform, and it's tricky to not induce regressions. In particular, I did not find a way to cleanly handle non-splat constants, so I've left that as a TODO item here (currently negative tests for those are included). AVX512 resulted in some diffs, but didn't look meaningful, so I left that out too. Some of the 256-bit AVX1 diffs are questionable, but close enough that they are probably insignificant. Differential Revision: https://reviews.llvm.org/D83073.	2020-07-03 17:31:57 -04:00
Sanjay Patel	7fd8af1de0	[InstCombine] fold mul of sext bools to 'and' Alive2: define i32 @src(i1 %x, i1 %y) { %0: %zx = sext i1 %x to i32 %zy = sext i1 %y to i32 %r = mul i32 %zx, %zy ret i32 %r } => define i32 @tgt(i1 %x, i1 %y) { %0: %a = and i1 %x, %y %r = zext i1 %a to i32 ret i32 %r } Transformation seems to be correct! https://alive2.llvm.org/ce/z/gaPQxA	2020-07-03 17:28:40 -04:00
Sanjay Patel	5504d8b04a	[InstCombine] add more tests for mul of bools; NFC	2020-07-03 17:28:22 -04:00
Vy Nguyen	5cde3c9633	[libcxx] Put clang::trivial_abi on std::unique_ptr, std::shared_ptr, and std::weak_ptr Reviewers: jyknight, EricWF, #libc! Subscribers: arphaman, libcxx-commits Tags: #libc Differential Revision: https://reviews.llvm.org/D82490	2020-07-03 17:23:13 -04:00
Kadir Cetinkaya	50ba9f994c	[clangd] Fix hover crash on invalid decls Summary: This also changes the way we display Size and Offset to be independent. Reviewers: sammccall Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, usaxena95, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D83143	2020-07-03 22:51:04 +02:00
Biplob Mishra	0939e04e41	[PowerPC] Implement Vector Insert Builtins in LLVM/Clang Implements vec_insertl() and vec_inserth(). Differential Revision: https://reviews.llvm.org/D82365	2020-07-03 15:30:41 -05:00
Stephen Kelly	551092bc3d	Revert AST Matchers default to AsIs mode Reviewers: aaron.ballman, klimek Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D83076	2020-07-03 21:19:46 +01:00
peter klausler	7926969afc	[flang] Track known file size, add IsATerminal (ext. I/O work part 5) Add a data member knownSize_ and an accessor to allow the size of an external file to be tracked when known. Also add a wrapper for ::isatty() here in the filesystem encapsulation module. These features are needed for the external I/O rework changes still to come. Reviewed By: sscalpone Differential Revision: https://reviews.llvm.org/D83141	2020-07-03 13:04:18 -07:00
peter klausler	c7cabf9d60	[flang] Define new runtime error IOSTAT values (I/O runtime work part 4) Add more IOSTAT= values for errors that can arise in external I/O. Reviewed By: sscalpone Differential Revision: https://reviews.llvm.org/D83140	2020-07-03 12:41:33 -07:00
Florian Hahn	31971ca1c6	[InstCombine] Try to narrow expr if trunc cannot be removed. Narrowing an input expression of a truncate to a type larger than the result of the truncate won't allow removing the truncate, but it may enable further optimizations, e.g. allowing for larger vectorization factors. For now this is intentionally limited to integer types only, to avoid producing new vector ops that might not be suitable for the target. If we know that the only user is a trunc, we can also be allow more cases, e.g. also shortening expressions with some additional shifts. I would appreciate feedback on the best place to do such a narrowing. This fixes PR43580. Reviewers: spatel, RKSimon, lebedev.ri, xbolva00 Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D82973	2020-07-03 20:22:51 +01:00
Louis Dionne	71d88cebfb	[libc++/libc++abi] Automatically detect whether exceptions are enabled Instead of detecting it automatically (in libc++) and relying on _LIBCXXABI_NO_EXCEPTIONS being set explicitly (in libc++abi), always detect whether exceptions are enabled automatically. This commit also removes support for specifying -D_LIBCPP_NO_EXCEPTIONS and -D_LIBCXXABI_NO_EXCEPTIONS explicitly -- those should just be inferred from using -fno-exceptions (or an equivalent flag). Allowing both -D_FOO_NO_EXCEPTIONS to be provided explicitly and trying to detect it automatically is just confusing, especially since we did specify it explicitly when building libc++abi. We should have only one way to detect whether exceptions are enabled, but it should be robust.	2020-07-03 14:58:09 -04:00
Eric Schweitz	35808ab8e1	[flang] Add FIRBuilder.cpp The FIR builder is a helper class that manages the creation of MLIR operations from the bridge. The focus of the builder is the creation of Operations, Types, etc. Differential revision: htps://reviews.llvm.org/D83107	2020-07-03 11:52:00 -07:00
jasonliu	572dde55ee	[XCOFF][AIX] Use 'L..' instead of '.L' for getPrivateGlobalPrefix in DataLayout Summary: D80831 changed part of the prefix usage for AIX. But there are other places getting prefix from DataLayout. This patch intends to make prefix usage consistent on AIX. Reviewed by: hubert.reinterpretcast, daltenty Differential Revision: https://reviews.llvm.org/D81270	2020-07-03 18:25:14 +00:00
sameerarora101	fc81f48fde	[llvm-ar][test] Unsupport error-opening-directory.test on FreeBSD Differential Revision: https://reviews.llvm.org/D82786	2020-07-03 10:57:32 -07:00
Sanjay Patel	40fcc42498	[InstCombine] fold mul of zext bools to 'and' The base case only works because we are relying on a poison-unsafe select transform; if that is fixed, we would regress on patterns like this. The extra use tests show that the select transform can't be applied consistently. So it may be a regression to have an extra instruction on 1 test, but that result was not created safely and does not happen reliably.	2020-07-03 13:14:18 -04:00
Sanjay Patel	5d60377864	[InstCombine] add tests for mul of bools; NFC	2020-07-03 13:14:18 -04:00
Roman Lebedev	4dd784000e	[NFC][InstCombine] Add some more tests for select based on non-canonical bit-test	2020-07-03 20:12:46 +03:00
Nikita Popov	cf1d9f9f49	[InstSimplify] Fold icmp with dominating assume If we assume(x > y), then we should be able to fold the basic implications of that, like x >= y. This already happens if either one of the operands is constant (LVI) or if the conditions are exactly the same (GVN), but not if we have an implication with non-constant operands. Support this by querying AssumptionCache. Fixes https://bugs.llvm.org/show_bug.cgi?id=40149. Differential Revision: https://reviews.llvm.org/D82717	2020-07-03 18:53:58 +02:00
Fangrui Song	6fa1343bb3	[ELF] Resolve R_DTPREL in .debug_* referencing discarded symbols to -1 The location of a TLS variable is encoded as a DW_OP_const4u/DW_OP_const8u followed by a DW_OP_push_tls_address (or DW_OP_GNU_push_tls_address https://sourceware.org/bugzilla/show_bug.cgi?id=11616 ). This change follows up to D81784 and makes relocations types generalized as R_DTPREL (e.g. R_X86_64_DTPOFF{32,64}, R_PPC64_DTPREL64) use -1 as the tombstone value as well. This works for both TLS Variant I and Variant II architectures. * arm: .long tls(tlsldo) # not working currently (R_ARM_TLS_LDO32 is R_ABS) * mips64: .dtpreldword tls+32768 * ppc64: .quad tls@DTPREL+0x8000 * riscv: neither GCC nor clang has implemented DW_AT_location. It is likely .long/.quad tls@dtprel+0x800 * x86-32: .long tls@DTPOFF * x86-64: .long tls@DTPOFF; .quad tls@DTPOFF tls has a non-negative st_value, so such relocations (st_value+addend) never resolve to -1 in a normal (not discarded) case. ``` // clang -fuse-ld=lld -g -ffunction-sections a.c -Wl,--gc-sections // foo and tls will be discarded by --gc-sections. // DW_AT_location [DW_FORM_exprloc] (DW_OP_const8u 0xffffffffffffffff, DW_OP_GNU_push_tls_address) thread_local int tls; int foo() { return ++tls; } int main() {} ``` Also, drop logic added in D26201 intended to address PR30793. It added a test (gc-debuginfo-tls.s) using a non-SHF_ALLOC section and a local symbol, which does not reflect the intended scenario: a relocation in a SHF_ALLOC section referencing a discarded non-local symbol. For such a non .debug_* section, just emit an error. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D82899	2020-07-03 09:50:30 -07:00
Florian Hahn	eb46137daa	[SLP] Make sure instructions are ordered when computing spill cost. The entries in VectorizableTree are not necessarily ordered by their position in basic blocks. Collect them and order them by dominance so later instructions are guaranteed to be visited first. For instructions in different basic blocks, we only scan to the beginning of the block, so their order does not matter, as long as all instructions in a basic block are grouped together. Using dominance ensures a deterministic order. The modified test case contains an example where we compute a wrong spill cost (2) without this patch, even though there is no call between any instruction in the bundle. This seems to have limited practical impact, .e.g on X86 with a recent Intel Xeon CPU with -O3 -march=native -flto on MultiSource,SPEC2000,SPEC2006 there are no binary changes. Reviewers: craig.topper, RKSimon, xbolva00, ABataev, spatel Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D82444	2020-07-03 17:30:17 +01:00
David Green	9e03547cab	[ARM][HWLoops] Create hardware loops for sibling loops Given a loop with two subloops, it should be possible for both to be converted to hardware loops. That's what this patch does, simply enough. It slightly alters the loop iterating order to try and convert all subloops. If one (or more) succeeds, it stops as before. Differential Revision: https://reviews.llvm.org/D78502	2020-07-03 17:20:02 +01:00
Florian Hahn	039145c72b	[SLP] Precommit test for which spill cost is computed incorrectly. Test for D82444.	2020-07-03 17:15:52 +01:00
Florian Hahn	7a1161767b	[InstCombine] Precommit tests for PR43580.	2020-07-03 17:14:02 +01:00
Sean Fertile	484a36b97d	Enable basepointer for AIX. Differential Revision: https://reviews.llvm.org/D82030	2020-07-03 11:55:49 -04:00
Sanjay Patel	63774642af	[InstCombine] add one-use check to cast+select narrowing transform Prevent increasing the instruction count.	2020-07-03 11:54:09 -04:00
Sanjay Patel	0cd0ae1f29	[InstCombine] add tests to show missing one-use checks; NFC	2020-07-03 11:54:09 -04:00
Xing GUO	3b4a0adec2	[DWARFYAML][test] Use --ignore-case to suppress errors. This patch is to fix build bot failure (http://lab.llvm.org:8011/builders/llvm-clang-win-x-aarch64/builds/553).	2020-07-03 23:46:37 +08:00
peter klausler	98d576c78f	[flang] Improve API for runtime allocator (I/O runtime work part 3) New<A> used to return an A&; now it returns an OwningPtr<A> to force better ownership tracking of allocations. Its API has also been split into New<A> and SizedNew<A> to allow allocations with a size override. Reviewed By: tskeith Differential Revision: https://reviews.llvm.org/D83108	2020-07-03 08:37:40 -07:00
Andrzej Warzynski	ef875c228a	[clang][NFC] Removed unused parameters in InitializeSourceManager	2020-07-03 16:16:20 +01:00
Simon Pilgrim	eb0e7acbd4	[InstCombine] canEvaluateTruncated - use KnownBits to check for inrange shift amounts Currently canEvaluateTruncated can only attempt to truncate shifts if they are scalar/uniform constant amounts that are in range. This patch replaces the constant extraction code with KnownBits handling, using the KnownBits::getMaxValue to check that the amounts are inrange. This enables support for nonuniform constant cases, and also variable shift amounts that have been masked somehow. Annoyingly, this still won't work for vectors with (demanded) undefs as KnownBits returns nothing in those cases, but its a definite improvement on what we currently have. Differential Revision: https://reviews.llvm.org/D83127	2020-07-03 16:02:10 +01:00
Dmitry Preobrazhensky	53422e8b4f	[AMDGPU] Added support of new inline assembler constraints Added support for constraints 'I', 'J', 'L', 'B', 'C', 'Kf', 'DA', 'DB'. See https://gcc.gnu.org/onlinedocs/gcc/Machine-Constraints.html#Machine-Constraints. Reviewers: arsenm, rampitec Differential Revision: https://reviews.llvm.org/D81657	2020-07-03 18:01:12 +03:00

1 2 3 4 5 ...

359301 Commits All Branches Search

359301 Commits

All Branches