llvm-project

Commit Graph

Author	SHA1	Message	Date
Sergei Grechanik	d80b04ab00	[mlir][Affine][Vector] Support vectorizing reduction loops This patch adds support for vectorizing loops with 'iter_args' implementing known reductions along the vector dimension. Comparing to the non-vector-dimension case, two additional things are done during vectorization of such loops: - The resulting vector returned from the loop is reduced to a scalar using `vector.reduce`. - In some cases a mask is applied to the vector yielded at the end of the loop to prevent garbage values from being written to the accumulator. Vectorization of reduction loops is disabled by default. To enable it, a map from loops to array of reduction descriptors should be explicitly passed to `vectorizeAffineLoops`, or `vectorize-reductions=true` should be passed to the SuperVectorize pass. Current limitations: - Loops with a non-unit step size are not supported. - n-D vectorization with n > 1 is not supported. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D100694	2021-05-05 09:03:59 -07:00
Jinsong Ji	20d0aca430	[clang][Driver] Add -fintegrate-as to debug-pass-structure test CGProfilePass is not always on, it will be disabled when using non-intergrated assemblers. // Only enable CGProfilePass when using integrated assembler, since // non-integrated assemblers don't recognize .cgprofile section. PMBuilder.CallGraphProfile = !CodeGenOpts.DisableIntegratedAS; Add -fintegrate-as to make sure the output don't rely on the platform default. Reviewed By: evgeny777 Differential Revision: https://reviews.llvm.org/D101918	2021-05-05 16:10:57 +00:00
Sushma Unnibhavi	67ee2f870d	Added a faster method to clone llvm project [DOCS] Reviewed By: xgupta, amccarth Differential Revision: https://reviews.llvm.org/D101433	2021-05-05 21:37:53 +05:30
Pooja Yadav	0b9447157b	[docs] Update the llvm/example section Added details about the llvm/example section. Reviewed By: xgupta Differential Revision: https://reviews.llvm.org/D101284	2021-05-05 21:33:14 +05:30
Jessica Clarke	897d7bceb9	Revert "[SelectionDAG][Mips][PowerPC][RISCV][WebAssembly] Teach computeKnownBits/ComputeNumSignBits about atomics" This seems to have broken sanitizers, giving lots of Assertion `NumBits <= MAX_INT_BITS && "bitwidth too large"' failed. failures across multiple targets (currently X86 and PowerPC). Reverting until I have a chance to reproduce and debug. This reverts commit `6e876f9ded`.	2021-05-05 17:02:05 +01:00
Guillaume Chatelet	7c2ece523d	[libc] Normalize LIBC_TARGET_MACHINE Current implementation defines LIBC_TARGET_MACHINE with the use of CMAKE_SYSTEM_PROCESSOR. Unfortunately CMAKE_SYSTEM_PROCESSOR is OS dependent and can produce different results. An evidence of this is the various matchers used to detect whether the architecture is x86. This patch normalizes LIBC_TARGET_MACHINE and renames it LIBC_TARGET_ARCHITECTURE. I've added many architectures but we may want to limit ourselves to x86 and ARM. Differential Revision: https://reviews.llvm.org/D101524	2021-05-05 15:52:42 +00:00
Fraser Cormack	efc31be7f8	[RISCV][NFC] Fix up pseudoinstruction name in comment	2021-05-05 16:40:28 +01:00
Jessica Clarke	6e876f9ded	[SelectionDAG][Mips][PowerPC][RISCV][WebAssembly] Teach computeKnownBits/ComputeNumSignBits about atomics Unlike normal loads these don't have an extension field, but we know from TargetLowering whether these are sign-extending or zero-extending, and so can optimise away unnecessary extensions. This was noticed on RISC-V, where sign extensions in the calling convention would result in unnecessary explicit extension instructions, but this also fixes some Mips inefficiencies. PowerPC sees churn in the tests as all the zero extensions are only for promoting 32-bit to 64-bit, but these zero extensions are still not optimised away as they should be, likely due to i32 being a legal type. This also simplifies the WebAssembly code somewhat, which currently works around the lack of target-independent combines with some ugly patterns that break once they're optimised away. Reviewed By: RKSimon, atanasyan Differential Revision: https://reviews.llvm.org/D101342	2021-05-05 16:34:45 +01:00
Vang Thao	a3d273c9ff	[GlobalISel] Fix buildZExtInReg creating new register. Fix a bug where buildZExtInReg will create and use a new register instead of using the register from parameter DstOp Res. Reviewed By: arsenm, foad Differential Revision: https://reviews.llvm.org/D101871	2021-05-05 08:19:52 -07:00
Sanjay Patel	0034197874	[InstCombine] improve readability; NFC	2021-05-05 11:05:47 -04:00
Simon Pilgrim	0f97afe320	[MIPS][MSA] Regenerate immediates tests. NFCI. Simplifies an upcoming patch diff	2021-05-05 16:03:19 +01:00
Simon Pilgrim	679e30dc3f	[MIPS][MSA] Regenerate i5-b tests. NFCI. Simplifies an upcoming patch diff	2021-05-05 16:03:19 +01:00
Simon Pilgrim	c673a95cb4	[MIPS][MSA] Regenerate bitwise tests. NFCI. Simplifies an upcoming patch diff	2021-05-05 16:03:19 +01:00
Baptiste Saleil	83646f60a8	[AMDGPU] Fix llc pipeline lit test for bots enabling expensive checks	2021-05-05 10:57:58 -04:00
Tobias Gysi	4a6ee23d83	[mlir][linalg] Fix bug in the fusion on tensors index op handling. The old index op handling let the new index operations point back to the producer block. As a result, after fusion some index operations in the fused block had back references to the old producer block resulting in illegal IR. The patch now relies on a block and value mapping to avoid such back references. Differential Revision: https://reviews.llvm.org/D101887	2021-05-05 14:46:08 +00:00
Pushpinder Singh	1f5cacfcb8	[AMDGPU][OpenMP] Fix clang driver crash when provided -c The offload action is used in four different ways as explained in Driver.cpp:4495. When -c is present, the final phase will be assemble (linker when -c is not present). However, this phase is skipped according to D96769 for amdgcn. So, offload action arrives into following situation, compile (device) ---> offload ---> offload without -c the chain looks like, compile (device) ---> offload ---> linker (device) ---> offload The former situation creates an unhandled case which causes problem. The solution presented in this patch delays the D96769 logic until job creation time. This keeps the offload action in the 1 of the 4 specified situations. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D101901	2021-05-05 14:26:58 +00:00
Anirudh Prasad	ae2aef1361	[AsmParser][SystemZ][z/OS] Reject character and string literals for HLASM - As per the HLASM support we are providing, i.e. support only for the first parameter of the inline asm block, only pertaining to Z machine instructions defined in LLVM, character literals and string literals are not supported (see Figure 4 - https://www-01.ibm.com/servers/resourcelink/svc00100.nsf/pages/zOSV2R3sc264940/$file/asmr1023.pdf for more information) - This patch explicitly rejects the usage of char literals and string literals (for example "abc 'a'") when the relevant field is set - This is achieved by introducing a field called `LexHLASMStrings` in MCAsmLexer similar to `LexMasmStrings` Reviewed By: abhina.sreeskantharajan, Kai Differential Revision: https://reviews.llvm.org/D101660	2021-05-05 10:21:55 -04:00
Stelios Ioannou	3f4bad5ead	[AArch64] Fix for the pre-indexed paired load/store optimization. This patch fixes an issue where a pre-indexed store e.g., STR x1, [x0, #24]! with a store like STR x0, [x0, #8] are merged into a single store: STP x1, x0, [x0, #24]! . They shouldn’t be merged because the second store uses x0 as both the stored value and the address and so it needs to be using the updated x0. Therefore, it should not be folded into a STP <>pre. Additionally a new test case is added to verify this fix. Differential Revision: https://reviews.llvm.org/D101888 Change-Id: I26f1985ac84e970961e2cdca23c590fa6773851a	2021-05-05 15:15:07 +01:00
Anastasia Stulova	e994e74bca	[OpenCL] Add clang extension for non-portable kernel parameters. Added __cl_clang_non_portable_kernel_param_types extension that allows using non-portable types as kernel parameters. This allows bypassing the portability guarantees from the restrictions specified in C++ for OpenCL v1.0 s2.4. Currently this only disables the restrictions related to the data layout. The programmer should ensure the compiler generates the same layout for host and device or otherwise the argument should only be accessed on the device side. This extension could be extended to other case (e.g. permitting size_t) if desired in the future. Patch by olestrohm (Ole Strohm)! https://reviews.llvm.org/D101168	2021-05-05 14:58:23 +01:00
Jinsong Ji	f6ef409406	[DebugInfo][test][MIPS] Use mtriple in tests Mips tests are using -march in RUN lines, this will fail on AIX OS , when we get the mips-ibm-aix triple. This is caused/exposed recently due to https://reviews.llvm.org/D101194 changed the default getMultiarchTriple in toolchain. Update the tests to use -mtriple instead to avoid unintended failures. Reviewed By: atanasyan Differential Revision: https://reviews.llvm.org/D101863	2021-05-05 13:51:27 +00:00
Abhina Sreeskantharajan	6a12875046	[SystemZ][z/OS] Fix return values in AutoConversion functions My previous patch https://reviews.llvm.org/rG1527a5e4b4834e65678f9c30f786a2f4c17932bf incorrectly set int return values instead of std::error_code. This patch correctly returns and std::error_code value. Reviewed By: fanbo-meng, Jonathan.Crowther Differential Revision: https://reviews.llvm.org/D101904	2021-05-05 09:43:14 -04:00
Andrew Savonichev	1ee50b4731	[AArch64] Fix scalar imm variants of SIMD shift left instructions This issue was reported in PR50057: Cannot select: t10: i64 = AArch64ISD::VSHL t2, Constant:i32<2> Shift intrinsics (llvm.aarch64.neon.ushl.i64 and sshl) with a constant shift operand are lowered into AArch64ISD::VSHL in tryCombineShiftImm. VSHL has i64 and v1i64 patterns for a right shift, but only v1i64 for a left shift. This patch adds the missing i64 pattern for AArch64ISD::VSHL, and LIT tests to cover scalar variants (i64 and v1i64) of all shift intrinsics (only ushl and sshl cases fail without the patch, others were just not covered). Differential Revision: https://reviews.llvm.org/D101580	2021-05-05 16:26:29 +03:00
Bjorn Pettersson	3ee826594a	Make dependency between certain analysis passes transitive (reapply) LazyBlockFrequenceInfoPass, LazyBranchProbabilityInfoPass and LoopAccessLegacyAnalysis all cache pointers to their nestled required analysis passes. One need to use addRequiredTransitive to describe that the nestled passes can't be freed until those analysis passes no longer are used themselves. There is still a bit of a mess considering the getLazyBPIAnalysisUsage and getLazyBFIAnalysisUsage functions. Those functions are used from both Transform, CodeGen and Analysis passes. I figure it is OK to use addRequiredTransitive also when being used from Transform and CodeGen passes. On the other hand, I figure we must do it when used from other Analysis passes. So using addRequiredTransitive should be more correct here. An alternative solution would be to add a bool option in those functions to let the user tell if it is a analysis pass or not. Since those lazy passes will be obsolete when new PM has conquered the world I figure we can leave it like this right now. Intention with the patch is to fix PR49950. It at least solves the problem for the reproducer in PR49950. However, that reproducer need five passes in a specific order, so there are lots of various "solutions" that could avoid the crash without actually fixing the root cause. This is a reapply of commit `3655f0757f`, that was reverted in `33ff3c2049` due to problems with assertions in the polly lit tests. That problem is supposed to be solved by also adjusting ScopPass to explicitly preserve LazyBlockFrequencyInfo and LazyBranchProbabilityInfo (it already preserved OptimizationRemarkEmitter which depends on those lazy passes). Differential Revision: https://reviews.llvm.org/D100958	2021-05-05 15:17:55 +02:00
Simon Pilgrim	85460a2f5b	[X86][SSE] Move unpack(hop,hop) fold from foldShuffleOfHorizOp to combineTargetShuffle By moving this after more of the shuffle canonicalization we reduce the demanded vector elts, avoiding a few unnecessary copies/moves etc.	2021-05-05 13:36:09 +01:00
Martin Storsjö	6f5670a4c3	Revert "[Passes] Enable the relative lookup table converter pass on aarch64" This reverts commit `57b259a852`. The relative lookup table converter pass seems to cause problems for chromium on Windows/ARM64, see https://crbug.com/1204788.	2021-05-05 15:23:14 +03:00
Fraser Cormack	61a46375a2	[RISCV][VP][NFC] Add tests for VP_SREM and VP_UREM As agreed in D101826, these are follow-up tests for the RISC-V VP support.	2021-05-05 13:13:34 +01:00
Jay Foad	f106fe5f23	[AMDGPU] Autogenerate checks for a clustering test and add GFX10	2021-05-05 13:18:17 +01:00
Fraser Cormack	437468f319	[RISCV][VP][NFC] Add tests for VP_MUL and VP_[US]DIV As agreed in D101826, these are follow-up tests for the RISC-V VP support.	2021-05-05 13:08:57 +01:00
Alexey Bataev	13a51e017c	[X86]Fix a crash trying to convert indices to proper type. Need to perfortm a bitcast on IndicesVec rather than subvector extract if the original size of the IndicesVec is the same as the size of the destination type. Differential Revision: https://reviews.llvm.org/D101838	2021-05-05 05:14:42 -07:00
Uday Bondhugula	62851ea7ea	[MLIR] Rename free function `verify` on OffsetSizeAndStrideOpInterface Using a free function verify(<Op>) is error prone. Rename it. Differential Revision: https://reviews.llvm.org/D101886	2021-05-05 17:44:15 +05:30
Fraser Cormack	491a3d1359	[RISCV][VP][NFC] Add tests for VP_SHL and VP_LSHR As agreed in D101826, these are follow-up tests for the RISC-V VP support. Tests for VP_ASHR were landed as part of D101826.	2021-05-05 13:01:04 +01:00
Fraser Cormack	3fbcf07a99	[RISCV][VP][NFC] Add tests for VP_AND, VP_XOR, VP_OR As agreed in D101826, these are follow-up tests for the RISC-V VP support.	2021-05-05 12:58:08 +01:00
Fraser Cormack	6f17613bfb	[RISCV][VP] Lower VP ISD nodes to RVV instructions This patch supports all of the current set of VP integer binary intrinsics by lowering them to to RVV instructions. It does so by using the existing RISCVISD *_VL custom nodes as an intermediate layer. Both scalable and fixed-length vectors are supported by using this method. One notable change to the existing vector codegen strategy is that scalable all-ones and all-zeros mask SPLAT_VECTORs are now lowered to RISCVISD VMSET_VL and VMCLR_VL nodes to match their fixed-length BUILD_VECTOR counterparts. This allows them to reuse the existing "all-ones" VL patterns. To reduce the size of the phabricator diff, some tests are intentionally left out and will be added later if the patch is accepted. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D101826	2021-05-05 12:32:24 +01:00
Alexander Belyaev	2865d114f9	[mlir] Use ReassociationIndices instead of affine maps in linalg.reshape. Differential Revision: https://reviews.llvm.org/D101861	2021-05-05 12:59:57 +02:00
Sushma Unnibhavi	e4eec51937	[DOCS] Added example for G_EXTRACT and G_INSERT Reviewed By: xgupta, gargaroff Differential Revision: https://reviews.llvm.org/D101227	2021-05-05 15:47:35 +05:30
Hans Wennborg	4f4aa7b78d	Require asserts for clang/test/Headers/wasm.c The test doesn't pass in no-asserts builds, see comment on https://reviews.llvm.org/D101805	2021-05-05 11:42:18 +02:00
Fraser Cormack	cd6a52fede	[RISCV] Cap legal fixed-length vectors to 256-element types Previously, RISC-V would make legal all fixed-length vectors types whose size are less than or equal to some function of the minimum value of VLEN and the maximum-permissible LMUL grouping. Due to vector legalization issues, this patch instead caps the legal fixed-length vector types to those with 256 elements. This value was chosen because it is the longest vector length which has corresponding MVTs across all supported element types. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D101839	2021-05-05 09:51:08 +01:00
Julien Pagès	a1ed39df96	[AMDGPU] Select V_CVT_*16_F16 more often Improve the code generation of fp_to_sint and fp_to_uint for integer on 16-bits. Differential Revision: https://reviews.llvm.org/D101481 Patch by Julien Pagès!	2021-05-05 08:57:51 +01:00
Javier Setoain	001d601ac4	[mlir][ArmSVE] Add basic arithmetic operations While we figure out how to best add Standard support for scalable vectors, these instructions provide a workaround for basic arithmetic between scalable vectors. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D100837	2021-05-05 09:50:18 +02:00
Fangrui Song	e510860656	[llvm-objdump] Add -M {att,intel} & deprecate --x86-asm-syntax={att,intel} The internal `cl::opt` option --x86-asm-syntax sets the AsmParser and AsmWriter dialect. The option is used by llc and llvm-mc tests to set the AsmWriter dialect. This patch adds -M {att,intel} as GNU objdump compatible aliases (PR43413). Note: the dialect is initialized when the MCAsmInfo is constructed. `MCInstPrinter::applyTargetSpecificCLOption` is called too late and its MCAsmInfo reference is const, so changing the `cl::opt` in `MCInstPrinter::applyTargetSpecificCLOption` is not an option, at least without large amount of refactoring. Reviewed By: hoy, jhenderson, thakis Differential Revision: https://reviews.llvm.org/D101695	2021-05-05 00:20:41 -07:00
Yang Fan	cab3c6c6c4	[clang][TargetCXXABI] Fix -Wreturn-type warning (NFC) GCC warning: ``` In file included from /llvm-project/clang/include/clang/Basic/LangOptions.h:22, from /llvm-project/clang/include/clang/Frontend/CompilerInvocation.h:16, from /llvm-project/clang/lib/Frontend/CompilerInvocation.cpp:9: /llvm-project/clang/include/clang/Basic/TargetCXXABI.h: In static member function ‘static bool clang::TargetCXXABI::isSupportedCXXABI(const llvm::Triple&, clang::TargetCXXABI::Kind)’: /llvm-project/clang/include/clang/Basic/TargetCXXABI.h:114:3: warning: control reaches end of non-void function [-Wreturn-type] 114 \| }; \| ^ ```	2021-05-05 14:44:48 +08:00
Med Ismail Bennani	9775582e34	[lldb/Test] Disable testBreakpointByLineAndColumnNearestCode on Windows Signed-off-by: Med Ismail Bennani <medismail.bennani@gmail.com>	2021-05-05 06:04:08 +00:00
Jianzhou Zhao	79debe8d7b	[dfsan] Turn off all dfsan test cases on non x86_64 OSs https://reviews.llvm.org/D101666 enables sanitizer allocator. This broke all test cases on non x86-64.	2021-05-05 05:30:53 +00:00
Med Ismail Bennani	35ecfda01c	[lldb/Symbol] Fix column breakpoint `move_to_nearest_code` match This patch fixes the column symbol resolution when creating a breakpoint with the `move_to_nearest_code` flag set. In order to achieve this, the patch adds column information handling in the `LineTable`'s `LineEntry` finder. After experimenting a little, it turns out the most natural approach in case of an inaccurate column match, is to move backward and match the previous `LineEntry` rather than going forward like we do with simple line breakpoints. The patch also reflows the function to reduce code duplication. Finally, it updates the `BreakpointResolver` heuristic to align it with the `LineTable` method. rdar://73218201 Differential Revision: https://reviews.llvm.org/D101221 Signed-off-by: Med Ismail Bennani <medismail.bennani@gmail.com>	2021-05-05 05:07:50 +00:00
LLVM GN Syncbot	72cefd50e5	[gn build] Port `600686d75f`	2021-05-05 04:57:55 +00:00
Brad Smith	3a62d4fde8	Fix typo, arvm7 -> armv7	2021-05-05 00:56:44 -04:00
zoecarver	6f1b10df91	[libcxx][ranges] Add ranges::ssize CPO. Based on D101079. Differential Revision: https://reviews.llvm.org/D101189	2021-05-04 21:50:00 -07:00
zoecarver	600686d75f	[libcxx][ranges] Add ranges::size CPO. The begining of [range.prim]. Differential Revision: https://reviews.llvm.org/D101079	2021-05-04 21:50:00 -07:00
Juneyoung Lee	1fef5c88a6	[InstCombine] Fold more select of selects using isImpliedCondition This is a simple folding that does these: ``` select x_inv, true, (select y, x, false) => select x_inv, true, y ``` https://alive2.llvm.org/ce/z/-STJ2d ``` select (select y, x, false), true, x_inv => select y, true, x_inv ``` https://alive2.llvm.org/ce/z/6ruYt6 Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D101807	2021-05-05 13:44:58 +09:00
Juneyoung Lee	a71d666d18	[InstCombine] Precommit tests for D101807 (NFC)	2021-05-05 13:44:57 +09:00

1 2 3 4 5 ...

387572 Commits All Branches Search

387572 Commits

All Branches