llvm-project

Commit Graph

Author	SHA1	Message	Date
Louis Dionne	e1ce3dabf0	[libc++] Fix some tests that were broken in the single-threaded configuration We never noticed it because our CI doesn't actually build against a C library that doesn't have threading functionality, however building against a truly thread-free platform surfaces these issues. Differential Revision: https://reviews.llvm.org/D114242	2021-11-19 14:24:15 -05:00
Louis Dionne	1b4c0cb391	[libc++] Avoid potential truncation warnings in std::abs test One some platforms, -Wimplicit-int-conversion is enabled by default, which can lead to additional warnings being triggered in this test. Since we're only trying to test errors related to calling abs(), the assignment is superfluous. As a fly-by fix, correct one instance of ::abs to std::abs and made the test a .verify.cpp test instead. Differential Revision: https://reviews.llvm.org/D114244	2021-11-19 14:22:26 -05:00
Krzysztof Drewniak	bd22554af0	[MLIR][GPU] Run generic LLVM optimizations when serializing (on AMD) - Adds hooks that allow SerializeTo* passes to arbitrarily transform the produced LLVM Module before it is passed to the code generation passes. - Uses these hooks within the SerializeToHsaco pass in order to run LLVM optimizations and to set the optimization level on the TargetMachine. - Adds an optLevel parameter to SerializeToHsaco Future work may include moving much of what's been added to SerializeToHsaco to SerializeToBlob, but that would require confirmation from the NVVM backend maintainers that it would be appropriate to do so. Depends on D114107 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D114113	2021-11-19 19:21:24 +00:00
Thomas Raoux	47555d73f6	[mlir][gpu] Extend shuffle op modes and add nvvm lowering Add up, down and idx modes to gpu shuffle ops, also change the mode from string to enum Differential Revision: https://reviews.llvm.org/D114188	2021-11-19 11:14:31 -08:00
Jay Foad	ff7f2cfa95	[AMDGPU] Add an implicit use of M0 to all V_MOV_B32_indirect_read/write NFCI. Previously the implicit use was added to V_MOV_B32_indirect_read when building the instruction. V_MOV_B32_indirect_write didn't have an implicit use of M0 at all, but apparently it did not cause any problems. Differential Revision: https://reviews.llvm.org/D114239	2021-11-19 19:00:17 +00:00
Fangrui Song	2997441b85	[ELF] Support discarding .got.plt Fix a null pointer dereference when .got.plt is discarded. This also adds a test for discarding `.plt`. Reviewed By: ikudrin Differential Revision: https://reviews.llvm.org/D114180	2021-11-19 10:50:53 -08:00
Jon Chesterfield	9cdaf0b01b	[openmp][amdgpu][nfc] Inline interop_hsa_get_kernel_info into only caller	2021-11-19 18:45:17 +00:00
Thomas Raoux	7cde516513	[mlir][vector] NFC, move some vector patterns in a separate file Move patterns related to dropping lead unit dim into their own file. Differential Revision: https://reviews.llvm.org/D114265	2021-11-19 10:39:29 -08:00
Thomas Raoux	06dbb28569	[mlir][vector] Remove usage of shapecast to remove unit dim Instead of using shape_cast op in the pattern removing leading unit dimensions we use extract/broadcast ops. This is part of the effort to restrict ShapeCastOp fuirther in the future and only allow them to convert to or from 1D vector. This also adds extra canonicalization to fill the gaps in simplifying broadcast/extract ops. Differential Revision: https://reviews.llvm.org/D114205	2021-11-19 10:25:21 -08:00
Mingming Liu	ffdace4892	[SROA] Add new test cases to cover existing SROA behavior that structs will be scalarized. Add an IR in unit test directory, which demonstrate the scalarization for struct allocations. This is added to pave the way for an SROA change to skip scalarization for some cases. Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D114128	2021-11-19 18:16:49 +00:00
Fabian Wolff	7eec832def	[DSE] Improve handling of `strncpy` in Dead Store Elimination Fixes PR#52062 and one of the remaining cases of PR#47644. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D114035	2021-11-19 17:46:29 +00:00
Balazs Benics	d5de568cc7	[analyzer][NFC] MaybeUInt -> MaybeCount I forgot to include this in D113594 Differential Revision: https://reviews.llvm.org/D113594	2021-11-19 18:36:55 +01:00
Balazs Benics	e6ef134f3c	[analyzer][NFC] Use enum for CallDescription flags Yeah, let's prefer a slightly stronger type representing this. Reviewed By: martong, xazax.hun Differential Revision: https://reviews.llvm.org/D113595	2021-11-19 18:32:13 +01:00
Balazs Benics	97f1bf15b1	[analyzer][NFC] Consolidate the inner representation of CallDescriptions `CallDescriptions` have a `RequiredArgs` and `RequiredParams` members, but they are of different types, `unsigned` and `size_t` respectively. In the patch I use only `unsigned` for both, that should be large enough anyway. I also introduce the `MaybeUInt` type alias for `Optional<unsigned>`. Additionally, I also avoid the use of the //smart// less-than operator. template <typename T> constexpr bool operator<=(const Optional<T> &X, const T &Y); Which would check if the optional has a value and compare the data only after. I found it surprising, thus I think we are better off without it. Reviewed By: martong, xazax.hun Differential Revision: https://reviews.llvm.org/D113594	2021-11-19 18:32:13 +01:00
Balazs Benics	de9d7e42ac	[analyzer][NFC] CallDescription should own the qualified name parts Previously, CallDescription simply referred to the qualified name parts by `const char` pointers. In the future we might want to dynamically load and populate `CallDescriptionMaps`, hence we will need the `CallDescriptions` to actually own* their qualified name parts. Reviewed By: martong, xazax.hun Differential Revision: https://reviews.llvm.org/D113593	2021-11-19 18:32:13 +01:00
Balazs Benics	9ad0a90baa	[analyzer][NFC] Demonstrate the use of CallDescriptionSet Reviewed By: martong, xazax.hun Differential Revision: https://reviews.llvm.org/D113592	2021-11-19 18:32:13 +01:00
Balazs Benics	f18da190b0	[analyzer][NFC] Switch to using CallDescription::matches() instead of isCalled() This patch replaces each use of the previous API with the new one. In variadic cases, it will use the ADL `matchesAny(Call, CDs...)` variadic function. Also simplifies some code involving such operations. Reviewed By: martong, xazax.hun Differential Revision: https://reviews.llvm.org/D113591	2021-11-19 18:32:13 +01:00
Balazs Benics	6c512703a9	[analyzer][NFC] Introduce CallDescription::matches() in addition to isCalled() This patch introduces `CallDescription::matches()` member function, accepting a `CallEvent`. Semantically, `Call.isCalled(CD)` is the same as `CD.matches(Call)`. The patch also introduces the `matchesAny()` variadic free function template. It accepts a `CallEvent` and at least one `CallDescription` to match against. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D113590	2021-11-19 18:32:13 +01:00
Balazs Benics	d448fcd9b2	[analyzer][NFC] Introduce CallDescriptionSets Sometimes we only want to decide if some function is called, and we don't care which of the set. This `CallDescriptionSet` will have the same behavior, except instead of `lookup()` returning a pointer to the mapped value, the `contains()` returns `bool`. Internally, it uses the `CallDescriptionMap<bool>` for implementing the behavior. It is preferred, to reuse the generic `CallDescriptionMap::lookup()` logic, instead of duplicating it. The generic version might be improved by implementing a hash lookup or something along those lines. Reviewed By: martong, Szelethus Differential Revision: https://reviews.llvm.org/D113589	2021-11-19 18:32:13 +01:00
Florian Hahn	76effb001d	[LV] Remove obsolete comment about creating a dummy block (NFC) No dummy pre-entry block is created since `a6c4969f5f`. The comment is stale now and can be removed. Mentioned by @Ayal in D113182.	2021-11-19 17:17:04 +00:00
Krzysztof Drewniak	f849640a0c	[MLIR] Make the ROCM integration tests runnable - Move the #define s to the GPU Transform library from GPU Ops so that SerializeToHsaco is non-trivially compiled - Add required includes to SerializeToHsaco - Move MCSubtargetInfo creation to the correct point in the compilation process - Change mlir in ROCM tests to account for renamed/moved ops Differential Revision: https://reviews.llvm.org/D114184	2021-11-19 17:09:53 +00:00
Adrian Prantl	587a397917	Skip tests when compiler with older versions of clang	2021-11-19 09:05:48 -08:00
Siva Chandra Reddy	d9bbad277c	[libc][Obvious][NFC] A bunch of cosmetic cleanup. * Added missing header guards. * Fixed license header format in a few files. * Renamed files to more suitable names.	2021-11-19 17:02:55 +00:00
Pavel Labath	f3b7cc8bb2	[lldb/test] Add ability to terminate connection from a gdb-client handler We were using the client socket close as a way to terminate the handler thread. But this kind of concurrent access to the same socket is not safe. It also complicates running the handler without a dedicated thread (next patch). Instead, here I add an explicit way for a packet handler to request termination. Waiting for lldb to terminate the connection would almost be sufficient, but in the pty test we want to keep the pty open so we can examine its state. Ability to disconnect at an arbitrary point may be useful for testing other aspects of lldb functionality as well. The way this works is that now each packet handler can optionally return a list of responses (instead of just one). One of those responses (it only makes sense for it to be the last one) can be a special RESPONSE_DISCONNECT object, which triggers a disconnection (via a new TerminateConnectionException). As the mock server now cleans up the connection whenever it disconnects, the pty test needs to explicitly dup(2) the descriptors in order to inspect the post-disconnect state. Differential Revision: https://reviews.llvm.org/D114156	2021-11-19 18:00:14 +01:00
Philip Reames	28000587e1	[SCEV] Revert two speculative compile time optimizations which made no difference Revert "[SCEV] Defer all work from `ea12c2cb` as late as possible" Revert "[SCEV] Defer loop property checks from `ea12c2cb` as late as possible" This reverts commit `734abbad79` and `1a5666acb2`. Both of these changes were speculative attempts to address a compile time regression. Neither worked, and both complicated the code in undesirable ways.	2021-11-19 08:45:56 -08:00
Philipp Tomsich	af57a71d18	[RISCV] Don't call setHasMultipleConditionRegisters(), so icmp is sunk On RISC-V, icmp is not sunk (as the following snippet shows) which generates the following suboptimal branch pattern: ``` core_list_find: lh a2, 2(a1) seqz a3, a0 << bltz a2, .LBB0_5 bnez a3, .LBB0_9 << should sink the seqz [...] j .LBB0_9 .LBB0_5: bnez a3, .LBB0_9 << should sink the seqz lh a1, 0(a1) [...] ``` due to an icmp not being sunk. The blocks after `codegenprepare` look as follows: ``` define dso_local %struct.list_head_s* @core_list_find(%struct.list_head_s* readonly %list, %struct.list_data_s* nocapture readonly %info) local_unnamed_addr #0 { entry: %idx = getelementptr inbounds %struct.list_data_s, %struct.list_data_s* %info, i64 0, i32 1 %0 = load i16, i16* %idx, align 2, !tbaa !4 %cmp = icmp sgt i16 %0, -1 %tobool.not37 = icmp eq %struct.list_head_s* %list, null br i1 %cmp, label %while.cond.preheader, label %while.cond9.preheader while.cond9.preheader: ; preds = %entry br i1 %tobool.not37, label %return, label %land.rhs11.lr.ph ``` where the `%tobool.not37` is the result of the icmp that is not sunk. Note that it is computed in the basic-block up until what becomes the `bltz` instruction and the `bnez` is a basic-block of its own. Compare this to what happens on AArch64 (where the icmp is correctly sunk): ``` define dso_local %struct.list_head_s* @core_list_find(%struct.list_head_s* readonly %list, %struct.list_data_s* nocapture readonly %info) local_unnamed_addr #0 { entry: %idx = getelementptr inbounds %struct.list_data_s, %struct.list_data_s* %info, i64 0, i32 1 %0 = load i16, i16* %idx, align 2, !tbaa !6 %cmp = icmp sgt i16 %0, -1 br i1 %cmp, label %while.cond.preheader, label %while.cond9.preheader while.cond9.preheader: ; preds = %entry %1 = icmp eq %struct.list_head_s* %list, null br i1 %1, label %return, label %land.rhs11.lr.ph ``` This is caused by sinkCmpExpression() being skipped, if multiple condition registers are supported. Given that the check for multiple condition registers affect only sinkCmpExpression() and shouldNormalizeToSelectSequence(), this change adjusts the RISC-V target as follows: * we no longer signal multiple condition registers (thus changing the behaviour of sinkCmpExpression() back to sinking the icmp) * we override shouldNormalizeToSelectSequence() to let always select the preferred normalisation strategy for our backend With both changes, the test results remain unchanged. Note that without the target-specific override to shouldNormalizeToSelectSequence(), there is worse code (more branches) generated for select-and.ll and select-or.ll. The original test case changes as expected: ``` core_list_find: lh a2, 2(a1) bltz a2, .LBB0_5 beqz a0, .LBB0_9 << [...] j .LBB0_9 .LBB0_5: beqz a0, .LBB0_9 << lh a1, 0(a1) [...] ``` Differential Revision: https://reviews.llvm.org/D98932	2021-11-19 08:32:59 -08:00
Craig Topper	4b3518d50b	[RISCV] Pre-commit test for D98932. NFC	2021-11-19 08:32:58 -08:00
Victor Huang	86e77cdb08	[PowerPC] Add a flag for conditional trap optimization This patch adds a flag to enable/disable conditional trap optimization. Optimization disabled by default. Peer reviewed by: nemanjai	2021-11-19 10:24:54 -06:00
Fabian Wolff	ffe1741b5c	[DSE] Add additional strncpy tests. Test for PR#52062 and one of the remaining cases of PR#47644.	2021-11-19 16:18:54 +00:00
Quinn Pham	6774cc33f7	[NFC][llvm] Inclusive language: remove instance of master in IntrinsicsNVVM.td [NFC] As part of using inclusive language within the llvm project, this patch replaces master with main in `IntrinsicsNVVM.td`. Reviewed By: steffenlarsen Differential Revision: https://reviews.llvm.org/D114193	2021-11-19 09:53:59 -06:00
Mark de Wever	ed86610c7b	[libc++][nfc] Move functions to a generic place. This allows the floating-point formatter to use the same functions as the integral formatter. This was tested in D114001.	2021-11-19 16:38:35 +01:00
Mark de Wever	3624c4d845	[libc++] Adds (to\|from)_chars_result operator==. Implements part of P1614 The Mothership has Landed. Reviewed By: #libc, Quuxplusone, Mordante Differential Revision: https://reviews.llvm.org/D112366	2021-11-19 16:29:33 +01:00
Ben Langmuir	4c94760f36	[ORC] Fix materialization of weak local symbols We were adding all defined weak symbols to the materialization responsibility, but local symbols will not be in the symbol table, so it failed to materialize due to the "missing" symbol. Local weak symbols come up in practice when using `ld -r` with a hidden weak symbol. rdar://85574696	2021-11-19 07:25:56 -08:00
Matt Morehouse	671f0930fe	[X86] Selective relocation relaxation for +tagged-globals For tagged-globals, we only need to disable relaxation for globals that we actually tag. With this patch function pointer relocations, which we do not instrument, can be relaxed. This patch also makes tagged-globals work properly with LTO, as -Wa,-mrelax-relocations=no doesn't work with LTO. Reviewed By: pcc Differential Revision: https://reviews.llvm.org/D113220	2021-11-19 07:18:27 -08:00
Alexey Bataev	d1fdf867b1	[SLP][NFC]Introduce TreeEntry::getVectorFactor member function, NFC. Added TreeEntry::getVectorFactor to get the final vectotization factor to simplify the code. Differential Revision: https://reviews.llvm.org/D114190	2021-11-19 06:32:19 -08:00
Alexey Bataev	80256605f8	[OpenMP] support depend clause for taskwait directive, by Deepak Eachempati. This patch adds clang (parsing, sema, serialization, codegen) support for the 'depend' clause on the 'taskwait' directive. Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D113540	2021-11-19 06:30:17 -08:00
Nico Weber	8b76d33c59	[asm] Allow block address operands in `asm inteldialect` This makes the following program build with -masm=intel: int foo(int count) { asm goto ("dec %0; jb %l[stop]" : "+r" (count) : : : stop); return count; stop: return 0; } It's also is another step towards merging EmitGCCInlineAsmStr() and EmitMSInlineAsmStr(). Differential Revision: https://reviews.llvm.org/D114167	2021-11-19 09:27:30 -05:00
Nico Weber	bc20bcb39e	[lld/mac] Crash even less on undefined symbols with --icf=all Follow-up to https://reviews.llvm.org/D112643. Even after that change, we were still asserting if two separate functions that are eligible for ICF (same size, same data, same number of relocs, same reloc types, ...) referred to Undefineds. This fixes that oversight. Differential Revision: https://reviews.llvm.org/D114195	2021-11-19 09:23:19 -05:00
Nico Weber	4f9a5c2a14	[asm] Remove explicit branch for modifier 'l' No intended behavior change. EmitGCCInlineAsmStr() used to explicitly check for modifier 'l' after handling block address and machine basic block operands. This prevented passing a MachineOperand with 'l' modifier to PrintAsmMemoryOperand(). Conceptually that seems kind of nice, but in practice the overrides of PrintAsmMemoryOperand() in all () AsmPrinter subclasses already reject modifiers they don't know about, and none of them don't know about 'l'. So removing this doesn't have a behavior difference, is less code, and it makes EmitGCCInlineAsmStr() and EmitMSInlineAsmStr() more similar, to prepare for merging them later. (Why not _add_ the branch to EmitMSInlineAsmStr() instead? Because that always works with X86AsmPrinter I think, and X86AsmPrinter::PrintAsmMemoryOperand() very decisively rejects the 'l' modifier, so it's hard to motivate adding that branch.) : The one exception was AVRAsmPrinter, which had an llvm_unreachable instead of returning true. So this commit changes that, so that the AVR target keeps emitting an error instead of crashing when passing a mem operand with a :l modifier to it. All the other targets already don't crash on this. Differential Revision: https://reviews.llvm.org/D114216	2021-11-19 09:19:53 -05:00
Zahira Ammarguellat	6623c02d70	The _Float16 type is supported on x86 systems with SSE2 enabled. Operations are emulated by software emulation and “float” instructions. This patch is allowing the support of _Float16 type without the use of -max512fp16 flag. The final goal being, perform _Float16 emulation for all arithmetic expressions.	2021-11-19 08:59:50 -05:00
Manuel Klimek	c2271926a4	Make clang-format fuzz through Lexing with asserts enabled. Makes clang-format bail out if an in-memory source file with an unsupported BOM is handed in instead of creating source locations that are violating clang's assumptions. In the future, we should add support to better transport error messages like this through clang-format instead of printing to stderr and not creating any changes.	2021-11-19 14:44:06 +01:00
Jay Foad	30b27ecfc2	[AMDGPU] Use new opcode for indexed vgpr reads Introduce V_MOV_B32_indirect_read for indexed vgpr reads (and rename the old V_MOV_B32_indirect to V_MOV_B32_indirect_write) so they can be unambiguously distinguished from regular V_MOV_B32_e32. Previously they were distinguished by looking for extra implicit operands but this is fragile because regular moves sometimes have extra implicit operands too: - either by accident, when instructions end up with duplicate implicit operands (see e.g. D100939) - or by design, when SIInstrInfo::copyPhysReg breaks a multi-dword copy into individual subreg mov instructions and adds implicit operands for the super-register. The effect of this is that SIInstrInfo::isFoldableCopy can be simplified and identifies more foldable copies. The test diffs show that more immediate 0 values have been folded as inline operands. SIInstrInfo::isReallyTriviallyReMaterializable could probably be simplified too but that is not part of this patch. Differential Revision: https://reviews.llvm.org/D114230	2021-11-19 13:08:11 +00:00
Roman Lebedev	049799c311	[X86][Costmodel] `getReplicationShuffleCost()`: promote 1 bit-wide elements to 8 bit when have AVX512BW+AVX512VBMI If in addition to AVX512BW (that provides `{k}<->{i8,i16}` casts and i16 shuffles), we have AVX512VBMI, which provides i8 shuffles, we are in an optimal situation. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D114071	2021-11-19 15:58:10 +03:00
Roman Lebedev	a751084bb4	[X86][Costmodel] `trunc v16i8 to v8i1` can appear after legalization, cost is same as for `trunc v8i8 to v8i1` Note that there are many other missing costs, i'm only adding the ones that are queried from `getReplicationShuffleCost()` for the existing (quite exhaustive) test coverage. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D114070	2021-11-19 15:57:32 +03:00
Roman Lebedev	a50fdd3fc9	[X86][Costmodel] `getReplicationShuffleCost()`: promote 1 bit-wide elements to 16 bit when have AVX512BW Here we get pretty lucky. AVX512F does not provide any instructions to convert between a `k` vector mask and a vector, but AVX512BW adds `{k}<->nX{i8,i16}`conversions, and just as it happens, with AVX512BW we have a i16 shuffle. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D113915	2021-11-19 15:55:41 +03:00
Fraser Cormack	92d279fd6d	[LangRef][VP] Correct operands' types in vp.select documentation The types of llvm.vp.select's operands much match the return type.	2021-11-19 12:08:34 +00:00
Simon Pilgrim	0f652d8f52	[X86] LowerRotate - recognise hidden ROTR patterns for better vXi8 codegen Check for a hidden ISD::ROTR (rotl(sub(0,x))) - vXi8 lowering can handle both (its always beneficial for splats, but otherwise only if we have VPTERNLOG). We currently hit infinite loops in TargetLowering::expandROT if we set ISD::ROTR to custom, which needs addressing before we extend this much further.	2021-11-19 11:49:15 +00:00
Andrew Ng	47eb3f155f	[ELF] Ensure output section is not discarded in addStartEndSymbols() Fixes https://bugs.llvm.org/show_bug.cgi?id=52534. Differential Revision: https://reviews.llvm.org/D114179	2021-11-19 11:45:58 +00:00
Simon Pilgrim	812e64ef0c	[DAG] MatchRotate - support rotate-by-constant of illegal types Patch to fix some of the regressions in D77804. By folding to rotate/funnel-shift by constant amounts for illegal types, we prevent SimplifyDemandedBits from destroying the patterns prematurely, allowing us to use the rotate/funnel-shift legalization that was added in D112443. Differential Revision: https://reviews.llvm.org/D113192	2021-11-19 11:12:04 +00:00
Balazs Benics	bf55b9f0d0	[analyzer][docs] Ellaborate the docs of cplusplus.StringChecker Let's describe accurately what the users can expect from the checker in a direct way. Also, add an example warning message. Reviewed By: martong, Szelethus Differential Revision: https://reviews.llvm.org/D113401	2021-11-19 11:59:46 +01:00

1 2 3 4 5 ...

405254 Commits All Branches Search

405254 Commits

All Branches