llvm-project

Commit Graph

Author	SHA1	Message	Date
Thomas Lively	602f318cfd	[WebAssembly] Fix constness of pointer params to load intrinsics Update the SIMD builtin load functions to take pointers to const data and update the intrinsics themselves to not cast away constness. Differential Revision: https://reviews.llvm.org/D101884	2021-05-05 13:16:56 -07:00
Thomas Lively	627a526955	[WebAssembly] Update narrowing builtin function operand types Make the inputs to all narrowing builtins signed, which is how they are interpreted by the underlying instructions (only the result changes sign between instructions). Differential Revision: https://reviews.llvm.org/D101883	2021-05-05 13:04:04 -07:00
Tomasz Miąsko	0e7c2aeaa8	Add fuzzer for Rust demangler Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D101823	2021-05-05 12:50:50 -07:00
Jez Ng	75ba351300	[lld-macho] Try to unbreak build Looks like the PointerUnion casting cares about const-ness...	2021-05-05 15:47:14 -04:00
Martin Storsjö	9b24ff9cd2	[libcxx] [ci] Add a Windows CI configuration for a statically linked libc++ On Windows, static vs DLL linking affects details in quite a few cases, so it's good to have coverage for both cases. Testing with static linking also increases coverage for a number of cases and individual checks that have had to be waived for the DLL case, and allows testing libc++experimental, increasing the number of test cases actually executed by 180 (176 new tests from libc++experimental and 4 ones that are XFAIL windows-dll). Also drop the "generic-" prefix from these configuration names, as they're perhaps not what the "generic" prefix intended originally in the other generic-posix configurations. Differential Revision: https://reviews.llvm.org/D101565	2021-05-05 22:28:00 +03:00
Louis Dionne	7fbc7bfdfd	[libc++] NFC: Remove stray semicolon in from-scratch config files	2021-05-05 15:06:12 -04:00
Thomas Lively	89333b35a7	[WebAssembly] Set alignment to 1 for SIMD memory intrinsics The WebAssembly SIMD intrinsics in wasm_simd128.h generally try not to require any particular alignment for memory operations to be maximally flexible. For builtin memory access functions and their corresponding LLVM IR intrinsics, there's no way to set the expected alignment, so the best we can do is set the alignment to 1 in the backend. This change means that the alignment hints in the emitted code will no longer be incorrect when users use the intrinsics to access unaligned data. Differential Revision: https://reviews.llvm.org/D101850	2021-05-05 11:59:33 -07:00
Jon Chesterfield	25fe17d3c1	[libomptarget] Initial documentation on amdgpu offload [libomptarget] Initial documentation on amdgpu offload Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D101927	2021-05-05 19:58:52 +01:00
Evgenii Stepanov	18959a6a09	[hwasan] Fix missing synchronization in AllocThread. The problem was introduced in D100348. It's really hard to trigger the bug in a stress test - the race is just too narrow - but the new checks in Thread::Init should at least provide usable diagnostic if the problem ever returns. Differential Revision: https://reviews.llvm.org/D101881	2021-05-05 11:57:18 -07:00
Jez Ng	8806df4778	[lld-macho] Preliminary support for ARM_RELOC_BR24 ARM_RELOC_BR24 is used for BL/BLX instructions from within ARM (i.e. not Thumb) code. This diff just handles the basic case: branches from ARM to ARM, or from ARM to Thumb where no shimming is required. (See comments in ARM.cpp for why shims are required.) Note: I will likely be deprioritizing ARM work for the near future to focus on other parts of LLD. Apologies for the half-done state of this; I'm just trying to wrap up what I've already worked on. Reviewed By: #lld-macho, alexshap Differential Revision: https://reviews.llvm.org/D101814	2021-05-05 14:41:01 -04:00
Jez Ng	20f51ffe67	[lld-macho] Have --reproduce account for path rerooting We need to account for path rerooting when generating the response file. We could either reroot the paths before generating the file, or pass through the original filenames and change just the syslibroot. I've opted for the latter, in order that the reproduction run more closely mirrors the original. We must also be careful not to make an absolute path relative if it is shadowed by a rerooted path. See repro6.tar in reroot-path.s for details. I've moved the call to `createResponseFile()` after the initialization of `config->systemLibraryRoots`, since it now needs to know what those roots are. Reviewed By: #lld-macho, oontvoo Differential Revision: https://reviews.llvm.org/D101224	2021-05-05 14:41:01 -04:00
Harald van Dijk	7907c46fe6	Make clangd CompletionModel not depend on directory layout. The current code accounts for two possible layouts, but there is at least a third supported layout: clang-tools-extra may also be checked out as clang/tools/extra with the releases, which was not yet handled. Rather than treating that as a special case, use the location of CompletionModel.cmake to handle all three cases. This should address the problems that prompted D96787 and the problems that prompted the proposed revert D100625. Reviewed By: usaxena95 Differential Revision: https://reviews.llvm.org/D101851	2021-05-05 19:25:34 +01:00
Nick Desaulniers	aefbfbcbd7	[Clang] remove text extension from diag::err_drv_invalid_value_with_suggestion This hinders translations, as per: https://clang.llvm.org/docs/InternalsManual.html#the-format-string Reviewed By: MaskRay, xbolva00 Differential Revision: https://reviews.llvm.org/D101387	2021-05-05 11:01:43 -07:00
Roman Lebedev	8048005739	[NFC][SimplifyCFG] Update documentation comments for SinkCommonCodeFromPredecessors() after `1886aad`	2021-05-05 20:34:59 +03:00
Fangrui Song	b3336bfa2e	[llvm-objcopy][ELF] --only-keep-debug: set offset/size of segments with no sections to zero PR50160: we currently ignore non-PT_PHDR segments with no sections, not accounting for its p_offset and p_filesz: this can cause an out-of-bounds write in `writeSegmentData` if the p_offset+p_filesz is larger than the total file size. This can be fixed by setting p_offset=p_filesz=0. The logic nicely unifies with the logic added in D90897. Reviewed By: jhenderson, rupprecht Differential Revision: https://reviews.llvm.org/D101560	2021-05-05 10:26:57 -07:00
Saleem Abdulrasool	ba5c122647	RISSCV: clang-format RISC-V AsmParser (NFC) This corrects a few issues identified by `clang-format`. This is meant to be preparation for a subsequent change.	2021-05-05 10:16:41 -07:00
Roman Lebedev	833b33a7f4	[NFC][X86][CostModel] Add tests for byteswap intrinsic	2021-05-05 20:11:46 +03:00
Philipp Krones	632ebc4ab4	[MC] Untangle MCContext and MCObjectFileInfo This untangles the MCContext and the MCObjectFileInfo. There is a circular dependency between MCContext and MCObjectFileInfo. Currently this dependency also exists during construction: You can't contruct a MOFI without a MCContext without constructing the MCContext with a dummy version of that MOFI first. This removes this dependency during construction. In a perfect world, MCObjectFileInfo wouldn't depend on MCContext at all, but only be stored in the MCContext, like other MC information. This is future work. This also shifts/adds more information to the MCContext making it more available to the different targets. Namely: - TargetTriple - ObjectFileType - SubtargetInfo Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101462	2021-05-05 10:03:02 -07:00
Philip Reames	80e8025083	[LV] Workaround PR49900 (a crash due to analyzing partially mutated IR) LoopVectorize has a fairly deeply baked in design problem where it will try to query analysis (primarily SCEV, but also ValueTracking) in the midst of mutating IR. In particular, the intermediate IR state does not represent the semantics of the original (or final) program. Fixing this for real is hard, but all of the cases seen so far share a common symptom. In cases seen to date, the analysis being queried is the computation of the original loop's trip count. We can fix this particular instance of the issue by simply computing the trip count early, and caching it. I want to be really clear that this is nothing but a workaround. It does nothing to fix the root issue, and at best, delays the time until we have to fix this for real. Florian and I have discussed an eventual solution in the review comments for https://reviews.llvm.org/D100663, but it's a lot of work. Test taken from https://reviews.llvm.org/D100663. Differential Revision: https://reviews.llvm.org/D101487	2021-05-05 09:56:28 -07:00
Javier Setoain	95861216ac	[mlir][ArmSVE] Add masked arithmetic operations These instructions map to SVE-specific instrinsics that accept a predicate operand to support control flow in vector code. Differential Revision: https://reviews.llvm.org/D100982	2021-05-05 17:41:58 +01:00
Nico Weber	f16afcd9b5	[clang] remove an incremental build workaround This cleaned up an oversight over a year ago. Should no longer be needed.	2021-05-05 12:21:56 -04:00
Stanislav Mekhanoshin	4c178d809b	[AMDGPU] Pre-commit 2 new saddr load tests. NFC.	2021-05-05 09:16:11 -07:00
Sergei Grechanik	d80b04ab00	[mlir][Affine][Vector] Support vectorizing reduction loops This patch adds support for vectorizing loops with 'iter_args' implementing known reductions along the vector dimension. Comparing to the non-vector-dimension case, two additional things are done during vectorization of such loops: - The resulting vector returned from the loop is reduced to a scalar using `vector.reduce`. - In some cases a mask is applied to the vector yielded at the end of the loop to prevent garbage values from being written to the accumulator. Vectorization of reduction loops is disabled by default. To enable it, a map from loops to array of reduction descriptors should be explicitly passed to `vectorizeAffineLoops`, or `vectorize-reductions=true` should be passed to the SuperVectorize pass. Current limitations: - Loops with a non-unit step size are not supported. - n-D vectorization with n > 1 is not supported. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D100694	2021-05-05 09:03:59 -07:00
Jinsong Ji	20d0aca430	[clang][Driver] Add -fintegrate-as to debug-pass-structure test CGProfilePass is not always on, it will be disabled when using non-intergrated assemblers. // Only enable CGProfilePass when using integrated assembler, since // non-integrated assemblers don't recognize .cgprofile section. PMBuilder.CallGraphProfile = !CodeGenOpts.DisableIntegratedAS; Add -fintegrate-as to make sure the output don't rely on the platform default. Reviewed By: evgeny777 Differential Revision: https://reviews.llvm.org/D101918	2021-05-05 16:10:57 +00:00
Sushma Unnibhavi	67ee2f870d	Added a faster method to clone llvm project [DOCS] Reviewed By: xgupta, amccarth Differential Revision: https://reviews.llvm.org/D101433	2021-05-05 21:37:53 +05:30
Pooja Yadav	0b9447157b	[docs] Update the llvm/example section Added details about the llvm/example section. Reviewed By: xgupta Differential Revision: https://reviews.llvm.org/D101284	2021-05-05 21:33:14 +05:30
Jessica Clarke	897d7bceb9	Revert "[SelectionDAG][Mips][PowerPC][RISCV][WebAssembly] Teach computeKnownBits/ComputeNumSignBits about atomics" This seems to have broken sanitizers, giving lots of Assertion `NumBits <= MAX_INT_BITS && "bitwidth too large"' failed. failures across multiple targets (currently X86 and PowerPC). Reverting until I have a chance to reproduce and debug. This reverts commit `6e876f9ded`.	2021-05-05 17:02:05 +01:00
Guillaume Chatelet	7c2ece523d	[libc] Normalize LIBC_TARGET_MACHINE Current implementation defines LIBC_TARGET_MACHINE with the use of CMAKE_SYSTEM_PROCESSOR. Unfortunately CMAKE_SYSTEM_PROCESSOR is OS dependent and can produce different results. An evidence of this is the various matchers used to detect whether the architecture is x86. This patch normalizes LIBC_TARGET_MACHINE and renames it LIBC_TARGET_ARCHITECTURE. I've added many architectures but we may want to limit ourselves to x86 and ARM. Differential Revision: https://reviews.llvm.org/D101524	2021-05-05 15:52:42 +00:00
Fraser Cormack	efc31be7f8	[RISCV][NFC] Fix up pseudoinstruction name in comment	2021-05-05 16:40:28 +01:00
Jessica Clarke	6e876f9ded	[SelectionDAG][Mips][PowerPC][RISCV][WebAssembly] Teach computeKnownBits/ComputeNumSignBits about atomics Unlike normal loads these don't have an extension field, but we know from TargetLowering whether these are sign-extending or zero-extending, and so can optimise away unnecessary extensions. This was noticed on RISC-V, where sign extensions in the calling convention would result in unnecessary explicit extension instructions, but this also fixes some Mips inefficiencies. PowerPC sees churn in the tests as all the zero extensions are only for promoting 32-bit to 64-bit, but these zero extensions are still not optimised away as they should be, likely due to i32 being a legal type. This also simplifies the WebAssembly code somewhat, which currently works around the lack of target-independent combines with some ugly patterns that break once they're optimised away. Reviewed By: RKSimon, atanasyan Differential Revision: https://reviews.llvm.org/D101342	2021-05-05 16:34:45 +01:00
Vang Thao	a3d273c9ff	[GlobalISel] Fix buildZExtInReg creating new register. Fix a bug where buildZExtInReg will create and use a new register instead of using the register from parameter DstOp Res. Reviewed By: arsenm, foad Differential Revision: https://reviews.llvm.org/D101871	2021-05-05 08:19:52 -07:00
Sanjay Patel	0034197874	[InstCombine] improve readability; NFC	2021-05-05 11:05:47 -04:00
Simon Pilgrim	0f97afe320	[MIPS][MSA] Regenerate immediates tests. NFCI. Simplifies an upcoming patch diff	2021-05-05 16:03:19 +01:00
Simon Pilgrim	679e30dc3f	[MIPS][MSA] Regenerate i5-b tests. NFCI. Simplifies an upcoming patch diff	2021-05-05 16:03:19 +01:00
Simon Pilgrim	c673a95cb4	[MIPS][MSA] Regenerate bitwise tests. NFCI. Simplifies an upcoming patch diff	2021-05-05 16:03:19 +01:00
Baptiste Saleil	83646f60a8	[AMDGPU] Fix llc pipeline lit test for bots enabling expensive checks	2021-05-05 10:57:58 -04:00
Tobias Gysi	4a6ee23d83	[mlir][linalg] Fix bug in the fusion on tensors index op handling. The old index op handling let the new index operations point back to the producer block. As a result, after fusion some index operations in the fused block had back references to the old producer block resulting in illegal IR. The patch now relies on a block and value mapping to avoid such back references. Differential Revision: https://reviews.llvm.org/D101887	2021-05-05 14:46:08 +00:00
Pushpinder Singh	1f5cacfcb8	[AMDGPU][OpenMP] Fix clang driver crash when provided -c The offload action is used in four different ways as explained in Driver.cpp:4495. When -c is present, the final phase will be assemble (linker when -c is not present). However, this phase is skipped according to D96769 for amdgcn. So, offload action arrives into following situation, compile (device) ---> offload ---> offload without -c the chain looks like, compile (device) ---> offload ---> linker (device) ---> offload The former situation creates an unhandled case which causes problem. The solution presented in this patch delays the D96769 logic until job creation time. This keeps the offload action in the 1 of the 4 specified situations. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D101901	2021-05-05 14:26:58 +00:00
Anirudh Prasad	ae2aef1361	[AsmParser][SystemZ][z/OS] Reject character and string literals for HLASM - As per the HLASM support we are providing, i.e. support only for the first parameter of the inline asm block, only pertaining to Z machine instructions defined in LLVM, character literals and string literals are not supported (see Figure 4 - https://www-01.ibm.com/servers/resourcelink/svc00100.nsf/pages/zOSV2R3sc264940/$file/asmr1023.pdf for more information) - This patch explicitly rejects the usage of char literals and string literals (for example "abc 'a'") when the relevant field is set - This is achieved by introducing a field called `LexHLASMStrings` in MCAsmLexer similar to `LexMasmStrings` Reviewed By: abhina.sreeskantharajan, Kai Differential Revision: https://reviews.llvm.org/D101660	2021-05-05 10:21:55 -04:00
Stelios Ioannou	3f4bad5ead	[AArch64] Fix for the pre-indexed paired load/store optimization. This patch fixes an issue where a pre-indexed store e.g., STR x1, [x0, #24]! with a store like STR x0, [x0, #8] are merged into a single store: STP x1, x0, [x0, #24]! . They shouldn’t be merged because the second store uses x0 as both the stored value and the address and so it needs to be using the updated x0. Therefore, it should not be folded into a STP <>pre. Additionally a new test case is added to verify this fix. Differential Revision: https://reviews.llvm.org/D101888 Change-Id: I26f1985ac84e970961e2cdca23c590fa6773851a	2021-05-05 15:15:07 +01:00
Anastasia Stulova	e994e74bca	[OpenCL] Add clang extension for non-portable kernel parameters. Added __cl_clang_non_portable_kernel_param_types extension that allows using non-portable types as kernel parameters. This allows bypassing the portability guarantees from the restrictions specified in C++ for OpenCL v1.0 s2.4. Currently this only disables the restrictions related to the data layout. The programmer should ensure the compiler generates the same layout for host and device or otherwise the argument should only be accessed on the device side. This extension could be extended to other case (e.g. permitting size_t) if desired in the future. Patch by olestrohm (Ole Strohm)! https://reviews.llvm.org/D101168	2021-05-05 14:58:23 +01:00
Jinsong Ji	f6ef409406	[DebugInfo][test][MIPS] Use mtriple in tests Mips tests are using -march in RUN lines, this will fail on AIX OS , when we get the mips-ibm-aix triple. This is caused/exposed recently due to https://reviews.llvm.org/D101194 changed the default getMultiarchTriple in toolchain. Update the tests to use -mtriple instead to avoid unintended failures. Reviewed By: atanasyan Differential Revision: https://reviews.llvm.org/D101863	2021-05-05 13:51:27 +00:00
Abhina Sreeskantharajan	6a12875046	[SystemZ][z/OS] Fix return values in AutoConversion functions My previous patch https://reviews.llvm.org/rG1527a5e4b4834e65678f9c30f786a2f4c17932bf incorrectly set int return values instead of std::error_code. This patch correctly returns and std::error_code value. Reviewed By: fanbo-meng, Jonathan.Crowther Differential Revision: https://reviews.llvm.org/D101904	2021-05-05 09:43:14 -04:00
Andrew Savonichev	1ee50b4731	[AArch64] Fix scalar imm variants of SIMD shift left instructions This issue was reported in PR50057: Cannot select: t10: i64 = AArch64ISD::VSHL t2, Constant:i32<2> Shift intrinsics (llvm.aarch64.neon.ushl.i64 and sshl) with a constant shift operand are lowered into AArch64ISD::VSHL in tryCombineShiftImm. VSHL has i64 and v1i64 patterns for a right shift, but only v1i64 for a left shift. This patch adds the missing i64 pattern for AArch64ISD::VSHL, and LIT tests to cover scalar variants (i64 and v1i64) of all shift intrinsics (only ushl and sshl cases fail without the patch, others were just not covered). Differential Revision: https://reviews.llvm.org/D101580	2021-05-05 16:26:29 +03:00
Bjorn Pettersson	3ee826594a	Make dependency between certain analysis passes transitive (reapply) LazyBlockFrequenceInfoPass, LazyBranchProbabilityInfoPass and LoopAccessLegacyAnalysis all cache pointers to their nestled required analysis passes. One need to use addRequiredTransitive to describe that the nestled passes can't be freed until those analysis passes no longer are used themselves. There is still a bit of a mess considering the getLazyBPIAnalysisUsage and getLazyBFIAnalysisUsage functions. Those functions are used from both Transform, CodeGen and Analysis passes. I figure it is OK to use addRequiredTransitive also when being used from Transform and CodeGen passes. On the other hand, I figure we must do it when used from other Analysis passes. So using addRequiredTransitive should be more correct here. An alternative solution would be to add a bool option in those functions to let the user tell if it is a analysis pass or not. Since those lazy passes will be obsolete when new PM has conquered the world I figure we can leave it like this right now. Intention with the patch is to fix PR49950. It at least solves the problem for the reproducer in PR49950. However, that reproducer need five passes in a specific order, so there are lots of various "solutions" that could avoid the crash without actually fixing the root cause. This is a reapply of commit `3655f0757f`, that was reverted in `33ff3c2049` due to problems with assertions in the polly lit tests. That problem is supposed to be solved by also adjusting ScopPass to explicitly preserve LazyBlockFrequencyInfo and LazyBranchProbabilityInfo (it already preserved OptimizationRemarkEmitter which depends on those lazy passes). Differential Revision: https://reviews.llvm.org/D100958	2021-05-05 15:17:55 +02:00
Simon Pilgrim	85460a2f5b	[X86][SSE] Move unpack(hop,hop) fold from foldShuffleOfHorizOp to combineTargetShuffle By moving this after more of the shuffle canonicalization we reduce the demanded vector elts, avoiding a few unnecessary copies/moves etc.	2021-05-05 13:36:09 +01:00
Martin Storsjö	6f5670a4c3	Revert "[Passes] Enable the relative lookup table converter pass on aarch64" This reverts commit `57b259a852`. The relative lookup table converter pass seems to cause problems for chromium on Windows/ARM64, see https://crbug.com/1204788.	2021-05-05 15:23:14 +03:00
Fraser Cormack	61a46375a2	[RISCV][VP][NFC] Add tests for VP_SREM and VP_UREM As agreed in D101826, these are follow-up tests for the RISC-V VP support.	2021-05-05 13:13:34 +01:00
Jay Foad	f106fe5f23	[AMDGPU] Autogenerate checks for a clustering test and add GFX10	2021-05-05 13:18:17 +01:00
Fraser Cormack	437468f319	[RISCV][VP][NFC] Add tests for VP_MUL and VP_[US]DIV As agreed in D101826, these are follow-up tests for the RISC-V VP support.	2021-05-05 13:08:57 +01:00

... 4 5 6 7 8 ...

387794 Commits All Branches Search

387794 Commits

All Branches