llvm-project

Commit Graph

Author	SHA1	Message	Date
Martin Storsjö	1f1369e476	[sanitizers] Fix building on case sensitive mingw platforms Make synchronization.lib all lowercase name for mingw, where casing matters. This fixes building after 6d160abd7eba73031a2af500981f8ef44bd75ee4.	2021-07-17 09:34:16 +03:00
Giorgis Georgakoudis	e9c7291cb2	[OpenMP] Codegen aggregate for outlined function captures Parallel regions are outlined as functions with capture variables explicitly generated as distinct parameters in the function's argument list. That complicates the fork_call interface in the OpenMP runtime: (1) the fork_call is variadic since there is a variable number of arguments to forward to the outlined function, (2) wrapping/unwrapping arguments happens in the OpenMP runtime, which is sub-optimal, has been a source of ABI bugs, and has a hardcoded limit (16) in the number of arguments, (3) forwarded arguments must cast to pointer types, which complicates debugging. This patch avoids those issues by aggregating captured arguments in a struct to pass to the fork_call. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D102107	2021-07-16 23:27:44 -07:00
Lang Hames	92430b4937	[ORC] Fix typo in declaration	2021-07-17 16:10:15 +10:00
Matthias Springer	d1a9e9a7cb	[mlir][vector] Remove vector.transfer_read/write to LLVM lowering This simplifies the vector to LLVM lowering. Previously, both vector.load/store and vector.transfer_read/write lowered directly to LLVM. With this commit, there is a single path to LLVM vector load/store instructions and vector.transfer_read/write ops must first be lowered to vector.load/store ops. * Remove vector.transfer_read/write to LLVM lowering. * Allow non-unit memref strides on all but the most minor dimension for vector.load/store ops. * Add maxTransferRank option to populateVectorTransferLoweringPatterns. * vector.transfer_reads with changing element type can no longer be lowered to LLVM. (This functionality is needed only for SPIRV.) Differential Revision: https://reviews.llvm.org/D106118	2021-07-17 14:07:27 +09:00
Matthias Springer	4a3defa629	[mlir][vector] Refactor TransferReadToVectorLoadLowering * TransferReadToVectorLoadLowering no longer generates memref.load ops. * Add new pattern VectorLoadToMemrefLoadLowering that lowers scalar vector.loads to memref.loads. * Add vector::BroadcastOp canonicalization pattern that folds broadcast chains. Differential Revision: https://reviews.llvm.org/D106117	2021-07-17 13:53:09 +09:00
jacquesguan	f4ec30d808	[RISCV] Make VLEN no greater than 65536 Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D106134	2021-07-17 12:47:46 +08:00
Lang Hames	89aa11ed28	[ORC] Remove LLVM-side MachO Platform runtime support. Support for this functionality is moving to the ORC runtime.	2021-07-17 14:25:31 +10:00
Carl Ritson	c7f2f81f5e	[AMDGPU] Tidy SReg/SGPR definitions using template class Use a multiclass to consistently define SReg/SGPR/TTMP register classes. Add missing TTMP registers for 96b, 160b, 192b, 224b. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D105800	2021-07-17 11:26:46 +09:00
Kazu Hirata	6545fdc6d7	[Analysis] Remove isJoinDivergent (NFC) The last use was removed on Sep 30, 2020 in commit `05ae04c396`.	2021-07-16 18:23:17 -07:00
Wenlei He	f9f3c34e0f	[CSSPGO] Turn on iterative-BFI for CSSPGO Iterative-BFI produces better count quality and performance when evaluated on internal benchmarks. Turning it on by default now for CSSPGO. We can consider turn it on by default for AutoFDO as well in the future. Differential Revision: https://reviews.llvm.org/D106202	2021-07-16 17:35:49 -07:00
Matt Arsenault	71de6e9b4a	Mips/GlobalISel: Remove leftover dead code	2021-07-16 20:20:55 -04:00
Matt Arsenault	51f115b078	AMDGPU/GlobalISel: Add a few tests for struct arguments Test structs with pointers and vectors of pointers since this stresses a future patch.	2021-07-16 20:20:55 -04:00
Matt Arsenault	27addb85a6	AMDGPU/GlobalISel: Fix some incorrect memory types in tests	2021-07-16 20:20:55 -04:00
Emily Shi	b316c30269	[NFC][compiler-rt][test] when using ptrauth, strip before checking if poisoned ptrauth stores info in the address of functions, so it's not the right address we should check if poisoned rdar://75246928 Differential Revision: https://reviews.llvm.org/D106199	2021-07-16 17:13:19 -07:00
Walter Erquinigo	b0aa70761b	[trace][intel pt] Implement the Intel PT cursor D104422 added the interface for TraceCursor, which is the main way to traverse instructions in a trace. This diff implements the corresponding cursor class for Intel PT and deletes the now obsolete code. Besides that, the logic for the "thread trace dump instructions" was adapted to use this cursor (pretty much I ended up moving code from Trace.cpp to TraceCursor.cpp). The command by default traverses the instructions backwards, and if the user passes --forwards, then it's not forwards. More information about that is in the Options.td file. Regarding the Intel PT cursor. All Intel PT cursors for the same thread share the same DecodedThread instance. I'm not yet implementing lazy decoding because we don't need it. That'll be for later. For the time being, the entire thread trace is decoded when the first cursor for that thread is requested. Differential Revision: https://reviews.llvm.org/D105531	2021-07-16 16:47:43 -07:00
Hongtao Yu	77aec978a9	[CSSPGO] Turn on unique linkage name by default for pseudo probe. Turning on -funique-internal-linkage-names when -fpseudo-probe-for-profiling is on, unless -fno-unique-internal-linkage-names is specified. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D106193	2021-07-16 16:43:23 -07:00
Eli Friedman	cbba71bfb5	[ScalarEvolution] Fix overflow in computeBECount. The current implementation of computeBECount doesn't account for the possibility that adding "Stride - 1" to Delta might overflow. For almost all loops, it doesn't, but it's not actually proven anywhere. To deal with this, use a variety of tricks to try to prove that the addition doesn't overflow. If the proof is impossible, use an alternate sequence which never overflows. Differential Revision: https://reviews.llvm.org/D105216	2021-07-16 16:15:18 -07:00
Joel E. Denny	b8355b7126	[lit] Add --xfail-not/LIT_XFAIL_NOT For example, I need this lately in my CI config: LIT_XFAIL_NOT='libomptarget :: nvptx64-nvidia-cuda :: unified_shared_memory/api.c' That test specifies an XFAIL directive, but I get an XPASS result. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D106022	2021-07-16 19:13:34 -04:00
Mehdi Amini	2bad8bd089	Revert "Build libSupport with -Werror=global-constructors (NFC)" This reverts commit `1f71bcabb7`. Some platform have global destructors for std::mutex that still needs to be fixed.	2021-07-16 22:47:04 +00:00
Jim Ingham	6eb576dcff	Add a mutex to guard access to the ThreadPlanStack class We've seen reports of crashes (none we've been able to reproduce locally) that look like they are caused by concurrent access to a thread plan stack. It looks like there are error paths when an interrupt request to debugserver times out that cause this problem. The thread plan stack access is never in a hot loop, and there aren't enough of them for the extra data member to matter, so there's really no good reason not to protect the access. Adding the mutex revealed a couple of places where we were using "auto" in an iteration when we should have been using "auto &" - we didn't intend to copy the stack - and I fixed those as well. Except for preventing crashes this should be NFC. Differential Revision: https\://reviews.llvm.org/D106122	2021-07-16 15:40:58 -07:00
Christopher Di Bella	e37bbfe59c	[libcxx][modules] protects users from relying on libc++ detail headers (1/n) libc++ has started splicing standard library headers into much more fine-grained content for maintainability. It's very likely that outdated and naive tooling (some of which is outside of LLVM's scope) will suggest users include things such as `<__algorithm/find.h>` instead of `<algorithm>`, and Hyrum's law suggests that users will eventually begin to rely on this without the help of tooling. As such, this commit intends to protect users from themselves, by making it a hard error for anyone outside of the standard library to include libc++ detail headers. This is the first of four patches. Patch #2 will solve the problem for pre-processor `#include`s; patches #3 and #4 will solve the problem for `<__tree>` and `<__hash_table>` (since I've never touched the test cases that are failing for these two, I want to split them out into their own commits to be extra careful). Patch #5 will concern itself with `<__threading_support>`, which intersects with libcxxabi (which I know even less about). Differential Revision: https://reviews.llvm.org/D105932	2021-07-16 22:39:18 +00:00
Fangrui Song	263fe133ed	[Bazel] Delete blank line to make buildifier happy	2021-07-16 15:33:42 -07:00
Vitaly Buka	c14f26846e	[sanitizer] Fix test build on Windows	2021-07-16 15:28:51 -07:00
Mehdi Amini	1f71bcabb7	Build libSupport with -Werror=global-constructors (NFC) Ensure that libSupport does not carry any static global initializer. libSupport can be embedded in use cases where we don't want to load all cl::opt unless we want to parse the command line. ManagedStatic can be used to enable lazy-initialization of globals.	2021-07-16 22:25:03 +00:00
David Green	ad8e75caa2	[ARM] Fix for matching reductions that are both sext and zext. Fix a silly mistake that was not making sure that _both_ operands were the correct extend code.	2021-07-16 23:11:42 +01:00
Sami Tolvanen	0ad1d9fdf2	Revert "ThinLTO: Fix inline assembly references to static functions with CFI" This reverts commit `8e3b5cb39e`. Reverting to investigate test failures.	2021-07-16 14:47:33 -07:00
Sami Tolvanen	8e3b5cb39e	ThinLTO: Fix inline assembly references to static functions with CFI Create an internal alias with the original name for static functions that are renamed in promoteInternals to avoid breaking inline assembly references to them. This version uses module inline assembly to avoid issues with LowerTypeTestsModule. Link: https://github.com/ClangBuiltLinux/linux/issues/1354 Reviewed By: nickdesaulniers, pcc Differential Revision: https://reviews.llvm.org/D104058	2021-07-16 14:33:34 -07:00
Nemanja Ivanovic	35a18a981f	[PowerPC] Implement intrinsics for mtfsf[i] This provides intrinsics for emitting instructions that set the FPSCR (`mtfsf/mtfsfi`). The patch also conservatively marks the rounding mode as an implicit def for both since they both may set the rounding mode depending on the operands. Reviewed By: #powerpc, qiucf Differential Revision: https://reviews.llvm.org/D105957	2021-07-16 16:26:11 -05:00
Mehdi Amini	68ec4aa63b	Add `lli` as dependency of MLIR integration tests This fixes running `ninja check-mlir` from a clean build when the integration tests are enabled.	2021-07-16 21:10:28 +00:00
Geoffrey Martin-Noble	b92f28cc43	[Bazel] Add examples to bazelignore This avoids Bazel recursing into these directories when overlayed, which will break if someone has run Bazel in these dirs (which would only be successful with the http_archive example) because of the Bazel directory symlinks (already gitignored). Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D105357	2021-07-16 13:58:21 -07:00
Geoffrey Martin-Noble	9c5b73fef4	[Bazel] Make gentbl_test compatible with coverage Add explicit coverage provider. Also remove output_to_genfiles which isn't necessary for this test (it's just copy-pasta from gentbl_rule, which needs it for output C++ header files). Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D106115	2021-07-16 13:57:15 -07:00
Geoffrey Martin-Noble	8b86b3a846	[Bazel] Use bazel_skylib paths for paths munging We do a lot of path munging and bazel_sklyib is a pretty reasonable dep. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D106175	2021-07-16 13:52:07 -07:00
Alexey Bataev	da3dbfcacf	[SLP]Improve calculations of the cost for reused/reordered scalars. Part of D105020. Also, fixed FIXMEs that need to use wider vector type when trying to calculate the cost of reused scalars. This may cause regressions unless D100486 is landed to improve the cost estimations for long vectors shuffling. Differential Revision: https://reviews.llvm.org/D106060	2021-07-16 13:40:15 -07:00
LLVM GN Syncbot	1daaf1df56	[gn build] Port `0bf4b81d57`	2021-07-16 20:32:47 +00:00
Amara Emerson	9637848f51	[GlobalISel] Fix non-pow-2 legalization of s56 stores. s56 stores are broken down into s32 + s24 stores. During this step both of those new stores use an anyextended s64 value, resulting in truncating stores. With s56, the s24 requires another lower step to make it legal, and we were crashing because we didn't expect non-pow-2 stores to also be truncating as well. Differential Revision: https://reviews.llvm.org/D106183	2021-07-16 13:29:49 -07:00
Fangrui Song	fa3231eb18	[COFF][test] Fix llvm-readobj tests	2021-07-16 13:28:46 -07:00
Emily Shi	df1c3aaa17	[NFC][compiler-rt][test] pass through MallocNanoZone to iossim env This is required for no-fd.cpp test rdar://79354597 Differential Revision: https://reviews.llvm.org/D106174	2021-07-16 13:16:40 -07:00
Martin Storsjö	f23f299c04	[libcxx] [test] Fix experimental/memory.resource.adaptor.mem/db_deallocate on Windows The checks within the libc++experimental memory_resource class uses this limit: _MaxAlign = _LIBCPP_ALIGNOF(max_align_t); Therefore, only use max_align_t for this limit instead of using `__STDCPP_DEFAULT_NEW_ALIGNMENT__` if available. Differential Revision: https://reviews.llvm.org/D105905	2021-07-16 23:02:47 +03:00
Alexey Bataev	1b18e9ab67	[PATCH] D105827: [SLP]Workaround for InsertSubVector cost. The cost of the InsertSubvector shuffle kind cost is not complete and may end up with just extracts + inserts costs in many cases. Added a workaround to represent it as a generic PermuteSingleSrc, which is still pessimistic but better than InsertSubvector. Differential Revision: https://reviews.llvm.org/D105827	2021-07-16 12:59:08 -07:00
Nico Weber	304293d85d	[gn build] (semi-manually) port `6a4054ef06`	2021-07-16 15:54:13 -04:00
Jon Roelofs	15267595fd	[RISCV] Compose vector subregs hierarchically This fixes the test I broke in: https://reviews.llvm.org/D105953#2883579 Differential revision: https://reviews.llvm.org/D106168	2021-07-16 12:32:13 -07:00
Mark Leair	d0f1ae6eb6	Add a scalar argument case for the Fortran spread intrinsic unit test.	2021-07-16 12:19:08 -07:00
Fangrui Song	3f9004c19c	[llvm-readelf/llvm-readobj] Remove one-dash long options llvm-readelf is a user-facing tool which emulates GNU readelf. Remove one-dash long options which are not recognized by GNU style `getopt_long`. This ensures long options cannot collide with grouped short options. Note: the documentation (D63719)/help messages have recommended the double-dash forms since LLVM 9.0.0. llvm-readobj is intended as an internal tool which has some flexibility. llvm-readelf/llvm-readobj use the same option parsing code and llvm-readobj's one-dash long options aren't used after test migration. Differential Revision: https://reviews.llvm.org/D106037	2021-07-16 12:03:08 -07:00
Fangrui Song	8f806d5f52	[test] Avoid llvm-readelf/llvm-readobj one-dash long options	2021-07-16 12:03:07 -07:00
David Green	58e5e504f5	[ARM] Extra MLA vecreduce tests. NFC	2021-07-16 20:01:52 +01:00
Simon Pilgrim	d2458bcdc6	[X86][SSE] combineX86ShufflesRecursively - bail if constant folding fails due to oneuse limits. Fixes issue reported on D105827 where a single shuffle of a constant (with multiple uses) was caught in an infinite loop where one shuffle (UNPCKL) used an undef arg but then that got recombined to SHUFPS as the constant value had its own undef that confused matching.....	2021-07-16 19:21:46 +01:00
Lei Huang	c8937b6cb9	[PowerPC] Implement XL compact math builtins Implement a subset of builtins required for compatiblilty with AIX XL compiler. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D105930	2021-07-16 13:21:13 -05:00
Joseph Huber	b910a109f8	[OpenMP][NFC] Update the comment header for optimizations.	2021-07-16 14:13:13 -04:00
Joseph Huber	1616407921	[OpenMP] Add remark documentation to the OpenMP webpage This patch begins adding documentation for each remark emitted by `openmp-opt`. This builds on the IDs introduced in D105939 so that users can more easily identify each remark in the webpage. Depends on D105939. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106018	2021-07-16 14:09:43 -04:00
Joseph Huber	2c31d5ebfb	[OpenMP] Add IDs to OpenMP remarks This patch adds unique idenfitiers to the existing OpenMP remarks. This makes it easier to identify the corresponding documentation for each remark that will be hosted in the OpenMP webpage. Depends on D105898 Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D105939	2021-07-16 14:07:03 -04:00

1 2 3 4 5 ...

393949 Commits All Branches Search

393949 Commits

All Branches