llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	1b47a3de48	[RISCV] Enable cross basic block aware vsetvli insertion This patch extends D102737 to allow VL/VTYPE changes to be taken into account before adding an explicit vsetvli. We do this by using a data flow analysis to propagate VL/VTYPE information from predecessors until we've determined a value for every value in the function. We use this information to determine if a vsetvli needs to be inserted before the first vector instruction the block. Differential Revision: https://reviews.llvm.org/D102739	2021-05-26 09:25:42 -07:00
Sebastian Neubauer	ea91a8cbab	[AMDGPU][NFC] Remove non-existing function header	2021-05-26 18:20:33 +02:00
Jon Chesterfield	07f59baad6	[libomptarget][nfc][amdgpu] Remove atmi_status_t type ATMI_STATUS_UNKNOWN was unused, deleted references to it. Replaced ATMI_STATUS_{SUCCESS,ERROR} with HSA_STATUS_{SUCCESS,ERROR} Replaced atmi_status_t with hsa_status_t Otherwise no change. In particular, conversions between atmi_status_t and hsa_status_t will now be conversions between hsa_status_t and itself. Reviewed By: pdhaliwal Differential Revision: https://reviews.llvm.org/D103115	2021-05-26 17:02:19 +01:00
LLVM GN Syncbot	e47311d888	[gn build] Port `de9df3f5b9`	2021-05-26 15:57:01 +00:00
Mark de Wever	963495f0d4	[libc++][format] Adds availability macros for std::format. This prevents std::format to be available until there's an ABI stable version. (This only impacts the Apple platform.) Depends on D102703 Reviewed By: ldionne, #libc Differential Revision: https://reviews.llvm.org/D102705	2021-05-26 17:54:33 +02:00
Alexander Belyaev	74a89cba8c	[mlir] Add `distributionTypes` to LinalgTilingOptions. Differential Revision: https://reviews.llvm.org/D103161	2021-05-26 17:51:38 +02:00
Mark de Wever	de9df3f5b9	[libc++][NFC] Move basic_format_parse_context to its own header. This is a preparation to split the format header in smaller parts for the upcoming patches. Depends on D101723 Reviewed By: #libc, ldionne Differential Revision: https://reviews.llvm.org/D102703	2021-05-26 17:50:09 +02:00
LLVM GN Syncbot	deb6a0f94a	[gn build] Port `16342e3994`	2021-05-26 15:45:57 +00:00
Mark de Wever	16342e3994	[libc++][NFC] Move format_error to its own header. Reviewed By: #libc, ldionne Differential Revision: https://reviews.llvm.org/D101723	2021-05-26 17:43:23 +02:00
Valentin Clement	1005ef445d	[mlir][openacc] Translate UpdateOp to LLVM IR Add translation to LLVM IR for the UpdateOp with host and device operands. Translation is done with call using the runtime. This is done in a similar way as D101504 and D102381. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D102382	2021-05-26 11:42:15 -04:00
Philip Reames	9cc2181ec3	[unroll] Use value domain for symbolic execution based cost model The current full unroll cost model does a symbolic evaluation of the loop up to a fixed limit. That symbolic evaluation currently simplifies to constants, but we can generalize to arbitrary Values using the InstructionSimplify infrastructure at very low cost. By itself, this enables some simplifications, but it's mainly useful when combined with the branch simplification over in D102928. Differential Revision: https://reviews.llvm.org/D102934	2021-05-26 08:41:25 -07:00
Louis Dionne	31191e15b6	[libc++] Fix concepts tests with GCC	2021-05-26 11:21:55 -04:00
Jonas Paulsson	d058262b14	[SystemZ] Support i128 inline asm operands. Support virtual, physical and tied i128 register operands in inline assembly. i128 is on SystemZ not really supported and is not a legal type and generally such a value will be split into two i64 parts. There are however some instructions that require a pair of two GPR64 registers contained in the GR128 bit reg class, which is untyped. For inline assmebly operands, it proved to be very cumbersome to first follow the general behavior of splitting an i128 operand into two parts and then later rebuild the INLINEASM MI to have one GR128 register. Instead, some minor common code changes were made to SelectionDAGBUilder to only create one GR128 register part to begin with. In particular: - getNumRegisters() now has an optional parameter "RegisterVT" which is passed by AddInlineAsmOperands() and GetRegistersForValue(). - The bitcasting in GetRegistersForValue is not performed if RegVT is Untyped. - The RC for a tied use in AddInlineAsmOperands() is now computed either from the tied def (virtual register), or by getMinimalPhysRegClass() (physical register). - InstrEmitter.cpp:EmitCopyFromReg() has been fixed so that the register class (DstRC) can also be computed for an illegal type. In the SystemZ backend getNumRegisters(), splitValueIntoRegisterParts() and joinRegisterPartsIntoValue() have been implemented to handle i128 operands. Differential Revision: https://reviews.llvm.org/D100788 Review: Ulrich Weigand	2021-05-26 10:08:32 -05:00
Kadir Cetinkaya	8f79203a22	[clangd] New ParsingCallback for semantics changes Previously notification of the Server about semantic happened strictly before notification of the AST thread. Hence a racy Server could make a request (like semantic tokens) after the notification, with the assumption that it'll be served fresh content. But it wasn't true if AST thread wasn't notified about the change yet. This change reverses the order of those notifications to prevent racy interactions. Differential Revision: https://reviews.llvm.org/D102761	2021-05-26 16:57:30 +02:00
Andrea Di Biagio	5f500d73cd	[MCA] Add a test for PR50483. NFC	2021-05-26 15:52:11 +01:00
Anirudh Prasad	1bc0e857bf	[SystemZ][z/OS] Enable the AllowAtInName attribute for the HLASM dialect - Currently, LLVM supports symbols of the name "token1@token2". - "token2" is used to identify whether an appropriate symbol reference can be used for the symbol. - Now, if the symbol reference couldn't be found, the AsmParser usually emits an error, unless the backend is configured to accept the "@" in a symbol name - Thus, this patch aims to do that. It sets the `AllowAtInName` attribute in the SystemZ backend for the HLASM dialect. - Setting this attribute ensures that, if a particular symbol reference is found, it uses that. If it doesn't, and there exists an "@" in the symbol name, it will use that instead of explicitly erroring out. Reviewed By: uweigand Differential Revision: https://reviews.llvm.org/D103111	2021-05-26 10:49:57 -04:00
jweightma	fcd32d62c0	[AMDGPU] Fix function pointer argument bug in AMDGPU Propagate Attributes pass. This patch fixes a bug in the AMDGPU Propagate Attributes pass where a call instruction with a function pointer argument is identified as a user of the passed function, and illegally replaces the called function of the instruction with the function argument. For example, given functions f and g with appropriate types, the following illegal transformation could occur without this fix: call void @f(void ()* @g) --> call void @g(void ()* @g.1) The solution introduced in this patch is to prevent the cloning and substitution if the instruction's called function and the function which might be cloned do not match. Reviewed By: arsenm, madhur13490 Differential Revision: https://reviews.llvm.org/D101847	2021-05-26 16:40:15 +02:00
Anirudh Prasad	b37a2fcd8d	[SystemZ][z/OS] Validate symbol names for z/OS for printing without quotes - Currently, before printing a label in MCSymbol.cpp (MCSymbol::print), the current code "validates" the label that is to be printed. - If it fails the validation step, then it prints the label within double quotes. - However, the validation is provided as a virtual function in MCAsmInfo.h (i.e. isAcceptableChar() function). So we can override this for the AD_HLASM dialect in SystemZMCAsmInfo.cpp. Reviewed By: uweigand Differential Revision: https://reviews.llvm.org/D103091	2021-05-26 10:37:09 -04:00
Hans Wennborg	a8f75d497d	[clang-cl] Add driver support for /std:c++20 and bump /std:c++latest (PR50465) VS 2019 16.11 (just released in Preview) is adding support for the /std:c++20 option and bumping /std:c++latest to "post-c++20". This updates clang-cl to match. Differential revision: https://reviews.llvm.org/D103155	2021-05-26 16:05:52 +02:00
Luo, Yuanke	4ed2b6cccd	[X86][AMX] Fix a bug on tile config. The previous code detect if a MBB is bottom block to determine if it is a backedge of a loop. We should check latch block instead of bottom block and we should check the header and the bottom block are in the same loop. Differential Revision: https://reviews.llvm.org/D103145	2021-05-26 21:57:49 +08:00
Sjoerd Meijer	b6f6501b24	[CostModel][AArch64] Add tests for bitreverse. NFC.	2021-05-26 14:56:58 +01:00
David Green	a409fcddae	[ARM] Extra test for reverted WLS memset. NFC	2021-05-26 14:54:36 +01:00
Simon Pilgrim	629e2b3442	[X86][SSE] Regenerate some tests to expose the rip relative vector/broadcast loads	2021-05-26 14:50:47 +01:00
Andrea Di Biagio	63cc9fd579	[MCA][InOrderIssueStage] Fix LastWriteBackCycle computation. Conservatively use the instruction latency to compute the last write-back cycle. Before this patch, the last write cycle computation was incorrect for store instructions that didn't declare any register writes.	2021-05-26 14:17:43 +01:00
Alexey Bataev	8be23ed3f0	[SLP][NFC]Add a test for multiple uses of insertelement instruction, NFC.	2021-05-26 06:17:03 -07:00
Kerry McLaughlin	9f76a85260	[LoopVectorize] Enable strict reductions when allowReordering() returns false When loop hints are passed via metadata, the allowReordering function in LoopVectorizationLegality will allow the order of floating point operations to be changed: bool allowReordering() const { // When enabling loop hints are provided we allow the vectorizer to change // the order of operations that is given by the scalar loop. This is not // enabled by default because can be unsafe or inefficient. The -enable-strict-reductions flag introduced in D98435 will currently only vectorize reductions in-loop if hints are used, since canVectorizeFPMath() will return false if reordering is not allowed. This patch changes canVectorizeFPMath() to query whether it is safe to vectorize the loop with ordered reductions if no hints are used. For testing purposes, an additional flag (-hints-allow-reordering) has been added to disable the reordering behaviour described above. Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D101836	2021-05-26 13:59:12 +01:00
Max Kazantsev	be1a23203b	Return "[LoopDeletion] Break backedge if we can prove that the loop is exited on 1st iteration" (try 2) The patch was reverted due to compile time impact of contextual SCEV queries. It also appeared that it introduced a miscompile on irreducible CFG. Changes made: 1. isKnownPredicateAt is replaced with more lightweight isKnownPredicate; 2. Irreducible CFG in live code is now detected and excluded from processing. Differential Revision: https://reviews.llvm.org/D102615	2021-05-26 19:47:14 +07:00
Sanjay Patel	01120fe5b3	[InstCombine] add fmul tests with shared operand; NFC Baseline tests for: D102698	2021-05-26 08:32:08 -04:00
Sanjay Patel	9e43b1e9a1	[InstCombine] avoid 'tmp' usage in test files; NFC The update script ( utils/update_test_checks.py ) warns against this.	2021-05-26 08:32:07 -04:00
Sanjay Patel	b70fe92f08	[InstCombine] avoid 'tmp' usage in test file; NFC The update script ( utils/update_test_checks.py ) warns against this.	2021-05-26 08:32:07 -04:00
Max Kazantsev	0de553dce0	Revert "Return "[LoopDeletion] Break backedge if we can prove that the loop is exited on 1st iteration"" This reverts commit `43d2e51c2e`. Commited wrong version.	2021-05-26 19:29:07 +07:00
Max Kazantsev	43d2e51c2e	Return "[LoopDeletion] Break backedge if we can prove that the loop is exited on 1st iteration" The patch was reverted due to compile time impact of contextual SCEV queries. It also appeared that it introduced a miscompile on irreducible CFG. Changes made: 1. isKnownPredicateAt is replaced with more lightweight isKnownPredicate; 2. Irreducible CFG in live code is now detected and excluded from processing. Differential Revision: https://reviews.llvm.org/D102615	2021-05-26 19:23:21 +07:00
Adrian Kuegel	dee46d0829	[mlir] Fold complex.create(complex.re(op), complex.im(op)) Differential Revision: https://reviews.llvm.org/D103148	2021-05-26 14:02:53 +02:00
Andrew Savonichev	8ac66d61ea	[AArch64] Generate LD1 for anyext i8 or i16 vector load The existing LD1 patterns do not cover cases where result type does not match the memory type. This happens when illegal vector types are extended and scalarized, for example: load <2 x i16>* %v2i16 is lowered into: // first element (v4i32 (insert_subvector (v2i32 (scalar_to_vector (load anyext from i16))))) // other elements (v4i32 (insert_vector_elt (i32 (load anyext from i16)) idx)) Before this patch these patterns were compiled into LDR + INS. Now they are compiled into LD1. The problem was reported in PR24820: LLVM Generates abysmal code in simple situation. Differential Revision: https://reviews.llvm.org/D102938	2021-05-26 14:44:21 +03:00
Max Kazantsev	5fb58d4598	[Test] Add Loop Deletion test with irreducible CFG Authored by Mikael Holmén. It demonstrated miscompile on irreducible CFG with patch "[LoopDeletion] Break backedge if we can prove that the loop is exited on 1st iteration". The patch is reverted. Checking in the test to make sure this bug does not return.	2021-05-26 18:40:14 +07:00
Sven van Haastregt	ba0fe85ec0	[OpenCL] Include header for atomic-ops test Avoid duplicating the memory_order and memory_scope enum definitions.	2021-05-26 12:32:07 +01:00
Tomas Matheson	ab8c44112c	[MC] Move elf-unique-sections-by-flags.ll to X86/	2021-05-26 12:28:17 +01:00
pooja2299	cebdf5d846	[Docs] Updated the content of getting started documentation under llvm/lib/MC Wrote about llvm/lib/MC subproject on https://llvm.org/docs/GettingStarted.html page. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D101047	2021-05-26 16:25:26 +05:30
Tomas Matheson	165321b3d2	[MC][ELF] Emit unique sections for different flags Global values imply flags such as readable, writable, executable for the sections that they will be placed in. Currently MC places all such entries into the same section, using the first set of flags seen. This can lead to situations in LTO where a writable global is placed in the same named section as a readable global from another file, and the section may not be marked writable. D72194 ensures that mergeable globals with explicit sections are placed in separate sections with compatible entry size, by emitting the `unique` assembly syntax where appropriate. This change extends that approach to include section flags, so that globals with different section flags are emitted in separate unique sections. Differential revision: https://reviews.llvm.org/D100944	2021-05-26 11:51:29 +01:00
Tomas Matheson	e79e8041c5	[MC][NFCI] Factor out ELF section unique ID calculation Precursor to D100944. The logic for determining the unique ID had become quite difficult to reason about, so I have factored this out into a separate function. Differential Revision: https://reviews.llvm.org/D102336	2021-05-26 11:51:29 +01:00
Pushpinder Singh	a2d6ef5876	[AMDGPU][Libomptarget] Inline atmi_init/atmi_finalize After D102847, these functions can be inlined. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D103075	2021-05-26 10:50:08 +00:00
Pushpinder Singh	cc8661ac4a	[AMDGPU][Libomptarget] Delete g_atmi_initialized This patch drops g_atmi_initialized and inlines the Initialize & Finalize methods from Runtime class. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D102847	2021-05-26 10:46:54 +00:00
Raphael Isemann	76e47d4887	[lldb][NFC] Use C++ versions of the deprecated C standard library headers The C headers are deprecated so as requested in D102845, this is replacing them all with their (not deprecated) C++ equivalent. Reviewed By: shafik Differential Revision: https://reviews.llvm.org/D103084	2021-05-26 12:46:12 +02:00
Simon Pilgrim	21aec4fdc5	[X86][SLM] Fix vector PSHUFB + variable shift resource/throughputs Match whats documented in the Intel AOM (+Agner) - PSHUFB xmm is really slow, and mmx/xmm vector shifts are half rate. Noticed while working to get the cost tables to more closely match llvm-mca analysis, in this case for shifts and truncations.	2021-05-26 11:14:21 +01:00
Florian Hahn	2a41d702be	[SCEV] Add tests with signed predicates for applyLoopGuards.	2021-05-26 11:10:11 +01:00
Pushpinder Singh	7648b6978e	[AMDGPU][Libomptarget] Move Kernel/Symbol info tables to RTLDeviceInfoTy Two globals KernelInfoTable & SymbolInfoTable are moved into RTLDeviceInfoTy class. This builds on the top of D102691. [2/2] Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D102692	2021-05-26 10:02:28 +00:00
Kerry McLaughlin	6b0fe3c63b	[NFC] Add CHECK lines for unordered FP reductions An additional RUN line has been added to both strict-fadd.ll & scalable-strict-fadd.ll to ensure the correct behaviour of these tests where `-enable-strict-reductions` is false. Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D103015	2021-05-26 11:00:20 +01:00
Mirko Brkusanin	9601849984	[AMDGPU][GlobalISel] Stop foldInsertEltToCmpSelect from changing reg banks This function can change regbank for registers which already have a selected bank. Depending on the instruction where these registers were used it can cause instruction selection to fail. Differential Revision: https://reviews.llvm.org/D98515	2021-05-26 11:57:41 +02:00
Mirko Brkusanin	7386ad4e9e	Revert "[AMDGPU][GlobalISel] Stop foldInsertEltToCmpSelect from changing reg banks" This reverts commit `18c5444702`.	2021-05-26 11:57:41 +02:00
Fraser Cormack	7e27e4273d	[RISCV] Pre-commit fixed-length mask vselect tests These are default-expanded but later unrolled due to RISC-V's vector boolean content policy. A patch to improve this codegen will follow shortly.	2021-05-26 10:44:45 +01:00

1 2 3 4 5 ...

389523 Commits All Branches Search

389523 Commits

All Branches