llvm-project

Commit Graph

Author	SHA1	Message	Date
Stella Laurenzo	3bb7999339	[mlir] Add global_load and global_store ops to ml_program. * Adds simple, non-atomic, non-volatile, non-synchronized direct load/store ops. Differential Revision: https://reviews.llvm.org/D126230	2022-06-01 11:32:15 -07:00
Aaron Ballman	aaf04c7215	Fix an incorrect bit-width for storing attribute syntax information This field corresponds to the Syntax enumeration, and that gained another entry in `1fdf952dee`. However, the bit-field for storing the syntax used was not adjusted to handle the extra field. This turns out to be unobservable for HLSL attributes at the moment, so there is no test coverage. But it's also not really an NFC change either.	2022-06-01 14:24:51 -04:00
Alexander Belyaev	f711785e61	[mlir] Add conversion and tests for complex.[sqrt\|atan2] to Arith. Differential Revision: https://reviews.llvm.org/D126799	2022-06-01 20:21:51 +02:00
Anders Waldenborg	86f9cf88cb	[clang] Add tests for (const) weak variables This adds tests checking the behavior of const variables declared with weak attribute. Both checking that they can not be used in places where a constant expression is required and that a dynamic initializer is emitted when used as an initializer expression. Differential Revision: https://reviews.llvm.org/D126578	2022-06-01 20:18:54 +02:00
bixia1	548f0841cd	[mlir][sparse] Enable the test for operator expm1. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D126732	2022-06-01 11:18:17 -07:00
Stanislav Mekhanoshin	c9e242f6dd	[AMDGPU] Change GISel error handling for TFE on GFX90A Differential Revision: https://reviews.llvm.org/D126797	2022-06-01 11:07:25 -07:00
Alexey Bataev	fd5a6ce9dc	[SLP]Improve shuffles cost estimation where possible. Improved/fixed cost modeling for shuffles by providing masks, improved cost model for non-identity insertelements. Differential Revision: https://reviews.llvm.org/D115462	2022-06-01 11:01:37 -07:00
Andrew Browne	31d12df3b9	[DFSan] Remove deprecated flag from build-libc-list.py Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D126429	2022-06-01 11:00:13 -07:00
Siva Chandra Reddy	ad89cf4e2d	[libc] Keep all thread state information separate from the thread structure. The state is now stored on the thread's stack memory. This enables implementing pthread API like pthread_detach which takes the pthread_t structure argument by value. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D126716	2022-06-01 17:36:58 +00:00
Adrian Prantl	62b4482175	Revert "Adapt LLDB for D120540." This reverts commit `ca73de4374`. That patch was just hiding the problem, instead of fixing it.	2022-06-01 10:33:53 -07:00
Joseph Huber	f4f23de1a4	[Libomptarget] Add basic support for dynamic shared memory on AMDGPU This patchs adds the arguments necessary to allocate the size of the dynamic shared memory via the `LIBOMPTARGET_SHARED_MEMORY_SIZE` environment variable. This patch only allocates the memory, AMDGPU has a limitation that shared memory can only be accessed from the kernel directly. So this will currently only work with optimizations to inline the accessor function. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D125252	2022-06-01 13:32:50 -04:00
Yi Kong	716d428ab5	[BOLT] Add `-o` option to merge-fdata Differential Revision: https://reviews.llvm.org/D126788	2022-06-02 01:29:04 +08:00
Fangrui Song	241e645036	ar_to_bc.sh: Ignore non-bitcode files in archives The script uses llvm-link to link LLVM bitcode files. `5426da8ffa` used -DLLVM_DISABLE_ASSEMBLY_FILES=ON to ignore object files compiled from lib/Support/BLAKE3/*.S. A better approach (which fits Bazel better) is to ignore non-bitcode files. Reviewed By: akyrtzi Differential Revision: https://reviews.llvm.org/D126728	2022-06-01 10:28:14 -07:00
Nico Weber	be223eb541	sanitizers: Do not include crypt.h if SANITIZER_INTERCEPT_CRYPT_R is undef sanitizer_intercept_overriders.h might override SANITIZER_INTERCEPT_CRYPT_R to be undefined. There's no need to require crypt.h in that case. (The motivation is that crypt() moved from glibc into its own package at some point, which makes intercepting it and building with a single sysroot that supports both pre-bullseye and post-bullseye a bit hairy.) Differential Revision: https://reviews.llvm.org/D126696	2022-06-01 13:27:06 -04:00
Aart Bik	d668218946	[mlir][python][ctypes] fix ctype python binding complication for complex There is no direct ctypes for MLIR's complex (and thus np.complex128 and np.complex64) yet, causing the mlir python binding methods for memrefs to crash. This revision fixes this by passing complex arrays as tuples of floats, correcting at the boundaries for the proper view. NOTE: some of these changes (4 -> 2) were forced by the new "linting" Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D126422	2022-06-01 10:15:24 -07:00
Brooks Davis	18efa420da	compiler-rt: Allow build without __c11_atomic_fetch_nand Don't build atomic fetch nand libcall functions when the required compiler builtin isn't available. Without this compiler-rt can't be built with LLVM 13 or earlier. Not building the libcall functions isn't optimal, but aligns with the usecase in FreeBSD where compiler-rt from LLVM 14 is built with an LLVM 13 clang and no LLVM 14 clang is built. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D126710	2022-06-01 12:58:30 -04:00
Balazs Benics	e5ece11e76	[analyzer][NFC] Add test for `3a07280290` I'm adding a test demonstraing that the issue reported by @mikaelholmen is fixed. Differential Revision: https://reviews.llvm.org/D126198	2022-06-01 18:53:19 +02:00
Alexey Bataev	fe4949942d	[SLP]Fix PR55796: insert point for extractelements from different basic blocks. Extractelement instructions may come from different basic blocks, need to take it into account when looking for a last instruction in the bundle to prevent compiler crash. Differential Revision: https://reviews.llvm.org/D126777	2022-06-01 09:44:53 -07:00
Stephen Long	a5b056fe49	[MSVC] Fix pragma alloc_text failing for C files `isExternCContext()` is returning false for functions in C files Reviewed By: rnk, aaron.ballman Differential Revision: https://reviews.llvm.org/D126559	2022-06-01 09:39:46 -07:00
Simon Pilgrim	4565f7e747	[Hexagon] Regenerate store-imm-amode.ll	2022-06-01 17:39:07 +01:00
Simon Pilgrim	0f7bd78483	[AMDGPU] Regenerate fabs.f16.ll tests	2022-06-01 17:36:13 +01:00
Scott Linder	2d43955cec	[AMDGPU][NFC] Refactor AMDGPUCallingConv.td Rename CalleeSavedRegs defs to avoid being overly specific: * CSR_AMDGPU_AGPRs_32_255 => CSR_AMDGPU_AGPRs * CSR_AMDGPU_SGPRs_30_31 + CSR_AMDGPU_SGPRs_32_105 => CSR_AMDGPU_SGPRs * CSR_AMDGPU_SI_Gfx_SGPRs_4_29 + CSR_AMDGPU_SI_Gfx_SGPRs_64_105 => CSR_AMDGPU_SI_Gfx_SGPRs * CSR_AMDGPU_HighRegs => CSR_AMDGPU * CSR_AMDGPU_HighRegs_With_AGPRs => CSR_AMDGPU_GFX90AInsts * CSR_AMDGPU_SI_Gfx_With_AGPRs => CSR_AMDGPU_SI_Gfx_GFX90AInsts Introduce a class RegMask to mark the cases where we use the CalleeSavedRegs class purely as an expedient way to produce a mask. Update the names of these masks to not mention "CSR". Other targets also seem to do this, so a reasonable alternative is to actually update table-gen to include a new class to do this explicitly, but the current approach seems harmless so I opted to just make it more explicit. Reviewed By: arsenm, sebastian-ne Differential Revision: https://reviews.llvm.org/D109008	2022-06-01 16:24:09 +00:00
Mats Petersson	dc4bf2c33c	[flang][OpenMP]Make omp.wsloop arguments appear in memory (#1277 ) As per issue #1196, the loop induction variable, which is an argument in the omp.wsloop operation, does not have a memory location, so when passed to a function or subroutine, the reference to the value is not a memory location, but the value of the induction variable. The callee function/subroutine is then trying to dereference memory at address 1 or some other "not a good memory location". This is fixed by creating a temporary memory location and storing the value of the induction variable in that. Test fixes as a consequence of the changed code generated. Add checking for some of the omp-unstructured.f90 to check for alloca, store and load operations, to ensure the correct flow. Add a test for CYCLE inside a omp-do loop. Also convert to use -emit-fir in the omp-unstructrued, and make the symbol matching consistent in the omp-wsloop-variable test. Reviewed By: peixin Differential Revision: https://reviews.llvm.org/D126711	2022-06-01 17:20:06 +01:00
Matthias Braun	53753531bc	TensorFlowCompile: Add object file to list of sources rather than LINK_LIBS Differential Revision: https://reviews.llvm.org/D126736	2022-06-01 09:04:48 -07:00
Arjun P	8f99cdd27c	[MLIR][Presburger] Simplex: remove redundant zeroing out of row This fillRow(..., 0) is redundant because when the size of the tableau is consistent, the resize always creates a new row, which is zero-initialized. Also added asserts throughout to ensure the dimensions of the tableau remain consistent. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D126709	2022-06-01 16:59:37 +01:00
Arjun P	ec145ba2a3	[MLIR][Presburger] Matrix: inline trivial accessors This resolves a comment from https://reviews.llvm.org/D126708 that was previously missed.	2022-06-01 16:56:46 +01:00
Arjun P	d5e31cf38a	[MLIR][Presburger] Move Matrix accessors inline This gives a 1.5x speedup on the Presburger unittests. Reviewed By: Groverkss Differential Revision: https://reviews.llvm.org/D126708	2022-06-01 16:51:42 +01:00
Mark de Wever	04a3146caa	[libc++][format] Fixes string-literal formatting. Formatting a string-literal had an off-by-one issue where the NUL terminator became part of the formatted output. Reviewed By: #libc, ldionne Differential Revision: https://reviews.llvm.org/D126665	2022-06-01 17:49:09 +02:00
PeixinQiao	fe2cc16035	[NFC][MLIR] Fix -Wtype-limits warning Fix the warning: comparison of unsigned expression in ‘>= 0’ is always true. Reviewed By: kiranchandramohan, shraiysh Differential Revision: https://reviews.llvm.org/D126784	2022-06-01 23:42:07 +08:00
Mital Ashok	872f74440f	Fix std::has_unique_object_representations for _BitInt types with padding bits "std::has_unique_object_representations<_BitInt(N)>" was always true, even if the type has padding bits (since the trait assumes all integer types have no padding bits). The standard has an explicit note that this should not hold for types with padding bits. Differential Revision: https://reviews.llvm.org/D125802	2022-06-01 11:34:40 -04:00
Luke Nihlen	1f6ea2a37c	Expand definition deprecation warning to include constexpr statements. Clang currently warns on definitions downgraded to declarations with a const modifier, but not for a constexpr modifier. This patch updates the warning logic to warn on both inputs, and adds a test to check the additional case as well. See also: https://bugs.chromium.org/p/chromium/issues/detail?id=1284718 Differential Revision: https://reviews.llvm.org/D126664	2022-06-01 11:31:07 -04:00
Craig Topper	aeb27f133a	[RISCV] Fix i64<->f64 and i32<->f32 bitcasts with VLS vectors enabled. We enable a custom handler to optimize conversions between scalars and fixed vectors. Unfortunately, the custom handler picks up scalar to scalar conversions as well. If the scalar types are both legal, we wouldn't match any of the fixed vector cases and would return SDValue() causing the LegalizeDAG to expand the bitcast through memory. This patch fixes this by checking if it's a scalar to scalar conversion and returns `Op` if both types are legal. Differential Revision: https://reviews.llvm.org/D126739	2022-06-01 08:13:49 -07:00
PeixinQiao	0a90b72c43	[flang] Add semantic checks for threadprivate and declare target directives This patch supports the following checks: ``` [5.1] 2.21.2 THREADPRIVATE Directive The threadprivate directive must appear in the declaration section of a scoping unit in which the common block or variable is declared. [5.1] 2.14.7 Declare Target Directive The directive must appear in the declaration section of a scoping unit in which the common block or variable is declared. ``` Reviewed By: kiranchandramohan, shraiysh, NimishMishra Differential Revision: https://reviews.llvm.org/D125767	2022-06-01 22:40:51 +08:00
Denis Antrushin	7047d79fde	[TwoAddressInstructionPass] Relax assert in statepoint processing. D124631 added special processing for STATEPOINT instructions. It appears that assertion added there is too strong. We can get two tied operands with the same register tied to different defs. If we hit such case, do not process it in statepoint-specific code and delegate it to common case.	2022-06-01 21:34:52 +07:00
Simon Pilgrim	0a96885940	[ARM] uxtb.ll - adjust armv6 triple so the update_llc_test_checks.py script can be used to regenerate the tests No need to specify armv6-apple-darwin in these UXTB codegen tests	2022-06-01 15:28:19 +01:00
Simon Pilgrim	e1d02f6c37	[ARM][Thumb2] Refresh UXTB16 tests to match optimized IR from instcombine As discussed on D77804, instcombine will have already performed a similar SimplifyMultipleUseDemandedBits call which will break the UXTB16 pattern that was being match in these DAG tests I've updated the existing tests so that it match the instcombine IR (with a suitable FIXME) and added an equivalent test pattern suggested by @dmgreen	2022-06-01 15:28:19 +01:00
Balazs Benics	3a07280290	[analyzer] Fix wrong annotation of PointerToMemberData Unfortunately I don't have a reproducer for this. Reported by @mikaelholmen! Differential Revision: https://reviews.llvm.org/D126198	2022-06-01 16:12:54 +02:00
Simon Pilgrim	de2b543505	[X86] LowerVSETCC - merge getConstant() calls with flipped/unflipped sign masks. NFCI.	2022-06-01 15:09:48 +01:00
Sanjay Patel	3a503a4a9c	[x86] fix miscompile from wrongly identified fneg We may need to peek through a bitcast when identifying an fneg idiom via its pool constant, but we can't allow a different-sized constant in that match. This is noted in issue #55758 with an example that needs fast-math, but as the test here shows, this has potential to miscompile more generally (no fast-math required). Differential Revision: https://reviews.llvm.org/D126775	2022-06-01 09:56:33 -04:00
Guillaume Chatelet	ffa479a452	[libc] fix typo in BUILD.bazel feature	2022-06-01 13:53:36 +00:00
Matt Arsenault	0e1c71e4a4	CodeGen: Move getAddressSpaceForPseudoSourceKind into TargetMachine Avoid the dependency on TargetInstrInfo, which depends on the subtarget and therefore the individual function. Currently AMDGPU is constructing PseudoSourceValue instances in MachineFunctionInfo. In order to facilitate copying MachineFunctionInfo, we need to stop allocating these there. Alternatively we could allow targets to subclass PseudoSourceValueManager, and allocate them similarly to MachineFunctionInfo.	2022-06-01 09:45:40 -04:00
Sanjay Patel	3c3f2f99c4	[x86] add test for mismatched fneg; NFC issue #55758	2022-06-01 09:45:33 -04:00
Guillaume Chatelet	b2a9ea4420	[libc] Apply no-builtin everywhere, remove unnecessary flags Note, this is a re-submission of D125894 with `features = ["-header_modules"]` added to the main BUILD.bazel file. Some functions like `stpncpy` are implemented in terms of `memset` but are not currently using `-fno-builtin-memset`. This is somewhat hidden by the fact that we use `-ffreestanding` globally and that `-ffreestanding` implies `-fno-builtin` for Clang. This patch also removes `-mllvm -combiner-global-alias-analysis` that is Clang specific and that does not bring substantial gains on modern processors. Also we keep `-mllvm --tail-merge-threshold=0` for aarch64 in CMakeLists.txt but we omit it in the Bazel config. This is because Bazel consumes the source files directly and so it can use PGO to take optimal decisions locally. Differential Revision: https://reviews.llvm.org/D126773	2022-06-01 13:34:36 +00:00
Mikhail Goncharov	f951a6b2f3	Fix potentially uninitialized memory For `7d76d60958`	2022-06-01 15:31:37 +02:00
Alexander Kornienko	7aa8a67882	Revert "[LAA] Initial support for runtime checks with pointer selects." This reverts commit `5890b30105` as per discussion on the review thread: https://reviews.llvm.org/D114487#3547560.	2022-06-01 15:24:27 +02:00
LLVM GN Syncbot	b0f868f007	[gn build] Port `a0dcbe45bd`	2022-06-01 13:19:42 +00:00
LLVM GN Syncbot	b9b13a5645	[gn build] Port `2011052150`	2022-06-01 13:19:41 +00:00
Matt Arsenault	a0dcbe45bd	llvm-reduce: Add reduction pass to remove regalloc hints I'm a bit confused by what's actually stored for the allocation hints. The MIR parser only handles the "simple" case where there's a single hint. I don't really understand the assertion in clearSimpleHint, or under what circumstances there are multiple hint registers.	2022-06-01 09:15:41 -04:00
Matt Arsenault	2011052150	llvm-reduce: Add pass to reduce MIR instruction flags	2022-06-01 08:58:34 -04:00
Florian Hahn	f68c547158	[LAA] Remove unused RuntimeCheckingPtrGroup constructor (NFC). The constructor is not used. Remove it.	2022-06-01 13:30:33 +01:00

1 2 3 4 5 ...

425353 Commits All Branches Search

425353 Commits

All Branches