llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	00c1cc867f	[RISCV] Add more i32 srem/sdiv with power of 2 constant tests. NFC Add a small power 2 srem test to match existing sdiv test. Add larger power of 2 test to both. The larger constant test shows materialization of a constant for an AND in the RV64 code. We should be using W shift instructions to match the RV32 code.	2021-07-18 00:21:14 -07:00
David Blaikie	dac582ad3a	DebugInfo: Name class templates with default arguments consistently (both direct naming, and as a template argument for a function template) It's noteworthy that GCC has the same bug here, which is a bit surprising. Both Clang and GCC's bug is only for function template arguments that are themselves templates with default template arguments (f1<t1<int[, missing_default_here]>>). Probably because function name matching isn't generally necessary - whereas type matching is necessary for DWARF consumers to associate declarations and definitions across translation units, so the bug's been addressed there already - but continued to exist for function templates since it's fairly benign there. I came across this while working on a change that could reconstitute these pretty printed names based on the rest of the DWARF, reducing the size of the DWARF by not having to encode all the template parameters in the name string. That reconstitution code can't tell the difference between a defaulted argument or not, so couldn't create the current buggy-ish output. Making the names more consistent between direct and indirect references, and between function and class templates seems all to the good. (I fixed the function template version of this a few years back in `9fdd09a4cc` - clearly I should've looked more closely and generalized the code better so it only had to be fixed once - well, doing that here now)	2021-07-17 23:58:15 -07:00
Amara Emerson	4c55cdb00a	[GlobalISel] Fix known bits for G_BSWAP and G_BITREVERSE not doing anything. llvm::KnownBits::byteSwap() and reverse() don't modify in-place, so we weren't actually computing anything. This was causing a miscompile on an arm64 stage2 bootstrap clang build.	2021-07-17 23:07:16 -07:00
David Carlier	657eb94324	[Sanitizers] FutexWake fix typo for FreeBSD code path.	2021-07-18 07:02:21 +01:00
Jon Roelofs	5cd63e9ec2	[AArch64][GlobalISel] Legalize bswap <2 x i16> Differential revision: https://reviews.llvm.org/D105935	2021-07-17 15:31:15 -07:00
Nikita Popov	ffe94738ed	[ExecutionEngine] Fix GEP type Fix bug introduced in `2c68ecccc9`, the GEP type was off-by-ptr. Apparently I didn't run the MLIR tests.	2021-07-17 23:45:00 +02:00
David Green	5acddf5b09	[ARM] Lower non-extended small gathers via truncated gathers. Corollary to `1113e06821`, this allows us to match gathers that don't produce a full vector width result. They use an extended gather which is truncated back to the original type.	2021-07-17 22:38:31 +01:00
Eli Friedman	e41e865b15	[AArch64] Prepare for changes to STEP_VECTOR. Rewrite patterns to assume that the operand of STEP_VECTOR is a constant. The old patterns will stop working when the operand is changed from a Constant to a TargetConstant. (See D105673.) Add test coverage for certain patterns that weren't exercised by existing regression tests. Differential Revision: https://reviews.llvm.org/D105847	2021-07-17 14:13:41 -07:00
Nikita Popov	f164bc52b6	[IRBuilder] Deprecate CreateGEP() without element type This API is incompatible with opaque pointers and deprecated in favor of the version that accepts an explicit element type. Also remove the separate overload for a single index, as this is already covered by the ArrayRef overload.	2021-07-17 22:57:51 +02:00
Nikita Popov	2c68ecccc9	[OpaquePtr] Remove uses of CreateGEP() without element type Remove uses of to-be-deprecated API. In cases where the correct element type was not immediately obvious to me, fall back to explicit getPointerElementType().	2021-07-17 22:56:27 +02:00
Nikita Popov	f95d26006e	[IRBuilder] Deprecate CreateInBoundsGEP() without element type This API is incompatible with opaque pointers and deprecated in favor of the version that accepts an explicit element type.	2021-07-17 21:27:16 +02:00
Nikita Popov	6225d0cc6e	[OpaquePtr] Remove uses of CreateInBoundsGEP() without element type Remove uses of to-be-deprecated API. Unfortunately this one mostly just makes the use of getPointerElementType() explicit, as the correct type to use wasn't immediately available (deriving it from QualType is left as an exercise to the reader).	2021-07-17 21:27:16 +02:00
Craig Topper	d0f8047d37	[RISCV] Teach computeKnownBitsForTargetNode that VLENB will never be more than 65536/8.	2021-07-17 11:24:20 -07:00
Vy Nguyen	f44fc35149	[libcxx] Updated test and seemingly incorrect comment from it. Background: https://reviews.llvm.org/D82490#inline-1007741 Differential Revision: https://reviews.llvm.org/D106092	2021-07-17 13:46:28 -04:00
Jez Ng	428a7c1b38	[lld-macho] Have ICF operate on all sections at once ICF previously operated only within a given OutputSection. We would merge all CFStrings first, then merge all regular code sections in a second phase. This worked fine since CFStrings would never reference regular `__text` sections. However, I would like to expand ICF to merge functions that reference unwind info. Unwind info references the LSDA section, which can in turn reference the `__text` section, so we cannot perform ICF in phases. In order to have ICF operate on InputSections spanning multiple OutputSections, we need a way to distinguish InputSections that are destined for different OutputSections, so that we don't fold across section boundaries. We achieve this by creating OutputSections early, and setting `InputSection::parent` to point to them. This is what LLD-ELF does. (This change should also make it easier to implement the `section$start$` symbols.) This diff also folds InputSections w/o checking their flags, which I think is the right behavior -- if they are destined for the same OutputSection, they will have the same flags in the output (even if their input flags differ). I.e. the `parent` pointer check subsumes the `flags` check. In practice this has nearly no effect (ICF did not become any more effective on chromium_framework). I've also updated ICF.cpp's block comment to better reflect its current status. Reviewed By: #lld-macho, smeenai Differential Revision: https://reviews.llvm.org/D105641	2021-07-17 13:42:51 -04:00
Christopher Di Bella	182ba8ab1b	[libcxx][ranges] makes `ranges::subrange` a borrowed range Differential Revision: https://reviews.llvm.org/D106207	2021-07-17 17:25:56 +00:00
Shilei Tian	d3454ee8d2	[AbstractAttributor] Fix two issues in folding __kmpc_is_spmd_exec_mode This patch fixed two issues found when folding `__kmpc_is_spmd_exec_mode`: 1. When the reaching kernels are empty, it should not fold to generic mode. 2. When creating AA for the caller when updating information, the dependency should be required. Reviewed By: ye-luo Differential Revision: https://reviews.llvm.org/D106209	2021-07-17 13:13:44 -04:00
Nikita Popov	ca161e0c35	[IRBuilder] Deprecate CreateStructGEP() without element type This API is incompatible with opaque pointers and deprecated in favor of the version that accepts an explicit element type.	2021-07-17 18:48:22 +02:00
Nikita Popov	4ace6008f2	[OpaquePtr] Remove uses of CreateStructGEP() without element type Remove uses of to-be-deprecated API.	2021-07-17 18:48:21 +02:00
ShihPo Hung	be8159bfa5	[RISCV][RVV] Precommit a test case for D105684 Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D105685	2021-07-18 00:43:17 +08:00
Nikita Popov	03e4351013	[IRBuilder] Deprecate CreateConstGEP1_32() without element type This API is incompatible with opaque pointers and deprecated in favor of the version that accepts an explicit element type.	2021-07-17 18:32:36 +02:00
Nikita Popov	6d3e7c783b	[OpaquePtr] Remove uses of CreateConstGEP1_32() without element type Remove uses of to-be-deprecated API. I've fallen back to calling getPointerElementType() in some cases where the correct type wasn't immediately obvious to me.	2021-07-17 18:32:36 +02:00
Simon Pilgrim	9277ce7932	[DebugInfo] Remove unnecessary <string> include dependency from DebugInfo headers. NFC. At most these use the StringRef/Twine wrappers and don't have any implicit uses of std::string. Move the include down to any cpp implementation where std::string is actually used.	2021-07-17 16:56:06 +01:00
Nikita Popov	5df48493f0	[IRBuilder] Deprecate CreateConstInBoundsGEP1_64() without element type This API is incompatible with opaque pointers and deprecated in favor of the version that accepts an explicit element type.	2021-07-17 17:07:48 +02:00
Nikita Popov	5071360eb1	[OpaquePtr] Remove uses of CGF.Builder.CreateConstInBoundsGEP1_64() without type Remove uses of to-be-deprecated API.	2021-07-17 17:07:46 +02:00
Nikita Popov	32e2729e33	[IRBuilder] Deprecate CreateConstGEP1_64() without element type This API is incompatible with opaque pointers and deprecated in favor of the version that accepts an explicit element type.	2021-07-17 16:43:42 +02:00
Nikita Popov	357756ecf6	[OpaquePtr] Remove uses of CreateConstGEP1_64() without element type Remove uses of to-be-deprecated API.	2021-07-17 16:43:20 +02:00
Nikita Popov	251a11fdcf	[IRBuilder] Deprecate CreateConstInBoundsGEP2_64() without element type This API is incompatible with opaque pointers and deprecated in favor of the version that accepts an explicit element type.	2021-07-17 16:42:39 +02:00
Nikita Popov	4737eebc0d	[OpaquePtr] Remove uses of CreateConstInBoundsGEP2_64() without type Remove uses of to-be-deprecated API.	2021-07-17 16:42:10 +02:00
Nikita Popov	7db463ced5	[IRBuilder] Deprecate CreateConstGEP2_64() without element type This API is incompatible with opaque pointers and deprecated in favor of the version that accepts an explicit element type.	2021-07-17 16:41:51 +02:00
Kazu Hirata	1993b73755	[Analysis, CodeGen] Remove getHotSucc (NFC) These functions seem to be unused for at least 5 years.	2021-07-17 07:31:36 -07:00
Nikita Popov	7e21ded88d	[IR] Don't accept null type in ConstantExpr::getGetElementPtr() This is the same change as D105653, but for the constant expression version of the API.	2021-07-17 15:59:31 +02:00
Nikita Popov	be5af50e7d	[BPF] Use elementtype attribute for preserve.array/struct.index intrinsics Use the elementtype attribute introduced in D105407 for the llvm.preserve.array/struct.index intrinsics. It carries the element type of the GEP these intrinsics effectively encode. This patch: * Adds a verifier check that the attribute is required. * Adds it in the IRBuilder methods for these intrinsics. * Autoupgrades old bitcode without the attribute. * Updates the lowering code to use the attribute rather than the pointer element type. * Updates lots of tests to specify the attribute. * Adds -force-opaque-pointers to the intrinsic-array.ll test to demonstrate they work now. https://reviews.llvm.org/D106184	2021-07-17 11:09:18 +02:00
Craig Topper	173332d175	[RISCV] Manually emit the best shift for VSCALE lowering to improve codegen. We assume VLENB is a multiple of 8 and previously relied on shift pairs being optimized to an AND+SHL/SHR and computeKnownBits removing the AND. This doesn't happen if (vlenb >> 3) gets CSEd to have multiple uses. This patch manually emits the best shift to work around this.	2021-07-17 00:52:07 -07:00
Martin Storsjö	1f1369e476	[sanitizers] Fix building on case sensitive mingw platforms Make synchronization.lib all lowercase name for mingw, where casing matters. This fixes building after 6d160abd7eba73031a2af500981f8ef44bd75ee4.	2021-07-17 09:34:16 +03:00
Giorgis Georgakoudis	e9c7291cb2	[OpenMP] Codegen aggregate for outlined function captures Parallel regions are outlined as functions with capture variables explicitly generated as distinct parameters in the function's argument list. That complicates the fork_call interface in the OpenMP runtime: (1) the fork_call is variadic since there is a variable number of arguments to forward to the outlined function, (2) wrapping/unwrapping arguments happens in the OpenMP runtime, which is sub-optimal, has been a source of ABI bugs, and has a hardcoded limit (16) in the number of arguments, (3) forwarded arguments must cast to pointer types, which complicates debugging. This patch avoids those issues by aggregating captured arguments in a struct to pass to the fork_call. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D102107	2021-07-16 23:27:44 -07:00
Lang Hames	92430b4937	[ORC] Fix typo in declaration	2021-07-17 16:10:15 +10:00
Matthias Springer	d1a9e9a7cb	[mlir][vector] Remove vector.transfer_read/write to LLVM lowering This simplifies the vector to LLVM lowering. Previously, both vector.load/store and vector.transfer_read/write lowered directly to LLVM. With this commit, there is a single path to LLVM vector load/store instructions and vector.transfer_read/write ops must first be lowered to vector.load/store ops. * Remove vector.transfer_read/write to LLVM lowering. * Allow non-unit memref strides on all but the most minor dimension for vector.load/store ops. * Add maxTransferRank option to populateVectorTransferLoweringPatterns. * vector.transfer_reads with changing element type can no longer be lowered to LLVM. (This functionality is needed only for SPIRV.) Differential Revision: https://reviews.llvm.org/D106118	2021-07-17 14:07:27 +09:00
Matthias Springer	4a3defa629	[mlir][vector] Refactor TransferReadToVectorLoadLowering * TransferReadToVectorLoadLowering no longer generates memref.load ops. * Add new pattern VectorLoadToMemrefLoadLowering that lowers scalar vector.loads to memref.loads. * Add vector::BroadcastOp canonicalization pattern that folds broadcast chains. Differential Revision: https://reviews.llvm.org/D106117	2021-07-17 13:53:09 +09:00
jacquesguan	f4ec30d808	[RISCV] Make VLEN no greater than 65536 Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D106134	2021-07-17 12:47:46 +08:00
Lang Hames	89aa11ed28	[ORC] Remove LLVM-side MachO Platform runtime support. Support for this functionality is moving to the ORC runtime.	2021-07-17 14:25:31 +10:00
Carl Ritson	c7f2f81f5e	[AMDGPU] Tidy SReg/SGPR definitions using template class Use a multiclass to consistently define SReg/SGPR/TTMP register classes. Add missing TTMP registers for 96b, 160b, 192b, 224b. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D105800	2021-07-17 11:26:46 +09:00
Kazu Hirata	6545fdc6d7	[Analysis] Remove isJoinDivergent (NFC) The last use was removed on Sep 30, 2020 in commit `05ae04c396`.	2021-07-16 18:23:17 -07:00
Wenlei He	f9f3c34e0f	[CSSPGO] Turn on iterative-BFI for CSSPGO Iterative-BFI produces better count quality and performance when evaluated on internal benchmarks. Turning it on by default now for CSSPGO. We can consider turning it on by default for AutoFDO as well in the future. Differential Revision: https://reviews.llvm.org/D106202	2021-07-16 17:35:49 -07:00
Matt Arsenault	71de6e9b4a	Mips/GlobalISel: Remove leftover dead code	2021-07-16 20:20:55 -04:00
Matt Arsenault	51f115b078	AMDGPU/GlobalISel: Add a few tests for struct arguments Test structs with pointers and vectors of pointers since this stresses a future patch.	2021-07-16 20:20:55 -04:00
Matt Arsenault	27addb85a6	AMDGPU/GlobalISel: Fix some incorrect memory types in tests	2021-07-16 20:20:55 -04:00
Emily Shi	b316c30269	[NFC][compiler-rt][test] when using ptrauth, strip before checking if poisoned ptrauth stores info in the address of functions, so it's not the right address we should check if poisoned rdar://75246928 Differential Revision: https://reviews.llvm.org/D106199	2021-07-16 17:13:19 -07:00
Walter Erquinigo	b0aa70761b	[trace][intel pt] Implement the Intel PT cursor D104422 added the interface for TraceCursor, which is the main way to traverse instructions in a trace. This diff implements the corresponding cursor class for Intel PT and deletes the now obsolete code. Besides that, the logic for the "thread trace dump instructions" command was adapted to use this cursor (I pretty much ended up moving code from Trace.cpp to TraceCursor.cpp). The command by default traverses the instructions backwards, and if the user passes --forwards, then it traverses them forwards. More information about that is in the Options.td file. Regarding the Intel PT cursor: all Intel PT cursors for the same thread share the same DecodedThread instance. I'm not yet implementing lazy decoding because we don't need it. That'll be for later. For the time being, the entire thread trace is decoded when the first cursor for that thread is requested. Differential Revision: https://reviews.llvm.org/D105531	2021-07-16 16:47:43 -07:00
Hongtao Yu	77aec978a9	[CSSPGO] Turn on unique linkage name by default for pseudo probe. Turning on -funique-internal-linkage-names when -fpseudo-probe-for-profiling is on, unless -fno-unique-internal-linkage-names is specified. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D106193	2021-07-16 16:43:23 -07:00

394083 Commits All Branches Search