llvm-project

Commit Graph

Author	SHA1	Message	Date
Jacques Pienaar	ee7242c662	[mlir] Update to use ValueShapeRange (NFC) Update to use alias in preparation for changing it to not just be a pure alias.	2021-07-22 12:24:49 -07:00
Jon Chesterfield	9e05c084e5	[libomptarget][amdgpu][nfc] Normalise license headers Reviewed By: gregrodgers, jdoerfert Differential Revision: https://reviews.llvm.org/D106581	2021-07-22 20:23:41 +01:00
Roman Lebedev	7ef6f01909	[SimplifyCFG] FoldTwoEntryPHINode(): bailout on inverted logical and/or (PR51149) The logical (select) form of and/or will now be a source of problems. We don't really account for its inverted form, yet it exists, and presumably we should treat it just like non-inverted form: https://alive2.llvm.org/ce/z/BU9AXk https://bugs.llvm.org/show_bug.cgi?id=51149 reports a reportedly-serious perf regression that will hopefully be mitigated by this.	2021-07-22 22:19:34 +03:00
Roman Lebedev	952dc2e561	[NFC][SimplifyCFG] Add some more tests w/ two-entry PHI nodes and	2021-07-22 22:19:34 +03:00
Felix Berger	00edae9203	[clang-tidy] performance-unnecessary-copy-initialization: Disable check when variable and initializer have different replaced template param types. This can happen when a template with two parameter types is instantiated with a single type. The fix would only be valid for this instantiation but fail for others that rely on an implicit type conversion. The test cases illustrate when the check should trigger and when not. Differential Revision: https://reviews.llvm.org/D106011	2021-07-22 15:17:24 -04:00
Jon Chesterfield	e8da963922	[nfc] Fix typo in comment, s/node/note	2021-07-22 20:16:53 +01:00
Simon Pilgrim	4185c5502c	[CostModel][X86] Adjust shift SSE4 legalized costs based on llvm-mca reports. Update shl/lshr/ashr costs based on the worst case costs from the script in D103695 - many of the 128-bit shifts (usually where integer multiplies aren't used) have similar behaviour to AVX1 so we can merge them.	2021-07-22 20:07:32 +01:00
Simon Pilgrim	2657fe1721	[CostModel][X86] Fix funnel shift check prefixes We'd lost AVX1 test coverage due to bulldozer (XOP) trying to use the same check prefixes - we really need to fix the update script to avoid this!	2021-07-22 20:07:31 +01:00
Jon Chesterfield	14e34a83b0	[libomptarget][amdgpu][nfc] Replace use of gelf.h with libelf.h AMDGPU can assume Elf64 so doesn't need to abstract over Elf32 Drop a few other unused headers at the same time. Now only llvm elf and libelf are used by the plugin. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106579	2021-07-22 20:04:13 +01:00
Aaron Ballman	178c2b4c1e	Correctly diagnose taking the address of a register variable in C We caught the cases where the user would explicitly use the & operator, but we were missing implicit conversions such as array decay. Fixes PR26336. Thanks to Samuel Neves for inspiration for the patch.	2021-07-22 14:53:23 -04:00
LLVM GN Syncbot	4e0cefc640	[gn build] Port `3959c95deb`	2021-07-22 18:41:45 +00:00
Simon Pilgrim	d073b19dbf	[X86] Fix SLM FP<->INT throughputs. Noticed while trying to clean up the shift costs model for SSE4 targets using the script in D10369 - SLM double-pumps all the 128-bit vector conversion ops and only use FP0 pipe - numbers taken from Intel AOM + Agner.	2021-07-22 19:39:04 +01:00
Thomas Johnson	1cda1e6186	[ARC] Add disassembly for the conditioned RSUB immediate instruction Differential Revision: https://reviews.llvm.org/D106497	2021-07-22 11:34:39 -07:00
Fangrui Song	3b181568db	[Matrix] Fix -Wunused-variable in -DLLVM_ENABLE_ASSERTIONS=off build after D106457. NFC	2021-07-22 11:33:02 -07:00
Louis Dionne	3959c95deb	[libc++] Add helper type non-propagating-cache Differential Revision: https://reviews.llvm.org/D102121	2021-07-22 14:30:16 -04:00
zoecarver	6f5064cd0c	[libc++][docs] Take lock for range.single.view. Mark this item as in progress and assigned to me.	2021-07-22 11:15:24 -07:00
Alex Lorenz	40d2d0c412	[clang][test] Add -fuse-ld= to test case added in `2542c1a5a1` to resolve test failure with CLANG_DEFAULT_LINKER=lld	2021-07-22 11:12:38 -07:00
Adam Nemet	ce5b1320a7	[Matrix] Fix miscompile for NT matmul if the transpose has other use We should only add the fake lowering entry for the matrix remark if the transpose is not lowered on its own. `MapVector::insert` is used to insert the entry during proper lowering which does not overwrite the fake entry in the map. We actually had test coverage for this but the reference output code was wrong; it was storing undef rather than the transposed column. Also add an assert that would have caught this. Differential Revision: https://reviews.llvm.org/D106457	2021-07-22 10:45:56 -07:00
Marius Brehler	49d840c35c	[mlir] Improve description of interface options Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D106539	2021-07-22 19:35:56 +02:00
Krishna Kariya	ae4e8f7d52	[InstCombine][test] add coverage for possible fabs folds; NFC This goes with D101727 (adds FMF to the select). Differential Revision: https://reviews.llvm.org/D106563	2021-07-22 13:22:56 -04:00
Alex Lorenz	2542c1a5a1	[clang][driver][darwin] Add driver support for Mac Catalyst This commit adds driver support for the Mac Catalyst target, as supported by the Apple clang compile Differential Revision: https://reviews.llvm.org/D105960	2021-07-22 10:20:19 -07:00
David Green	c9cebda772	[AArch64] Adjust the cost of integer sum reductions This changes the cost to (LT.first-1) * cost(add) + 2, where the cost of an add is assumed to be 1. This brings it inline with the other reductions. Differential Revision: https://reviews.llvm.org/D106240	2021-07-22 18:19:54 +01:00
Simon Pilgrim	e1bdb57958	[CostModel][X86] Adjust shift SSE legalized costs based on llvm-mca reports. Update shl/lshr/ashr costs based on the worst case costs from the script in D103695.	2021-07-22 18:12:49 +01:00
Shilei Tian	1a7f779022	[OpenMPOpt] Add support for BooleanStateWithSetVector D101977 added `BooleanStateWithPtrSetVector` to store pointers in a set while tracking boolean state. One limitation is that it can only store pointers. We might want it to store other types of values, such as an integer for the parallel level. This patch generalizes the idea and creates `BooleanStateWithSetVector`. `BooleanStateWithPtrSetVector` therefore becomes a type alias of `BooleanStateWithSetVector`. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106149	2021-07-22 13:12:29 -04:00
Fangrui Song	db6217a3f7	[test] Add llvm-stress to LLVM_TEST_DEPENDS and lit substitutions D106430 added a test which needs LLVM_TEST_DEPENDS and lit substitution.	2021-07-22 09:37:01 -07:00
Rahul Joshi	f8d3755f00	[MLIR][memref] Fix findDealloc() to handle > 1 dealloc for the given alloc. - Change findDealloc() to return Optional<Operation *> and return None if > 1 dealloc is associated with the given alloc. - Add findDeallocs() to return all deallocs associated with the given alloc. - Fix current uses of findDealloc() to bail out if > 1 dealloc is found. Differential Revision: https://reviews.llvm.org/D106456	2021-07-22 09:34:19 -07:00
Jon Chesterfield	1a96570621	[libomptarget][amdgpu] Implement dlopen of libhsa AMDGPU plugin equivalent of D95155, build without HSA installed locally Compiles a new file, plugins/amdgpu/dynamic_hsa/hsa.cpp, to an object file that exposes the same symbols that the plugin presently uses from hsa. The object file contains dlopen of hsa and cached dlsym calls. Also provides header files corresponding to the subset that is used. This is behind a feature flag, LIBOMPTARGET_FORCE_DLOPEN_LIBHSA, default off. That allows developers to build against the dlopen/dlsym implementation, e.g. while testing this mode. Enabling by default will cause this plugin to build on a wider variety of machines than it does at present so may break some CI builds. That risk can be minimised by reviewing the header dependencies of the library and ensuring it doesn't use any libraries that are not already used by libomptarget. Separating the implementation from enabling by default in case the latter needs to be rolled back after wider CI results. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106559	2021-07-22 16:54:10 +01:00
Victor Huang	26ea4a4432	[PowerPC] Add PowerPC "__stbcx" builtin and intrinsic for XL compatibility This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch adds the builtin and intrinsic for "__stbcx". Reviewed By: nemanjai, #powerpc Differential revision: https://reviews.llvm.org/D106484	2021-07-22 10:48:46 -05:00
Anastasia Stulova	b510e0127d	[OpenCL][NFC] Refactors lang version check in test. Fixed test to use predefined version macro instead of passing extra macro in the command line. Patch by Topotuna (Justas Janickas)! Differential Revision: https://reviews.llvm.org/D106254	2021-07-22 16:47:38 +01:00
Alexey Bataev	b88a68c45e	[OPENMP]Fix PR49787: Codegen for calling __tgt_target_teams_nowait_mapper has too few arguments. Added missed arguments in __tgt_target_teams_nowait_mapper/__tgt_target_nowait_mapper runtime functions calls. Differential Revision: https://reviews.llvm.org/D106542	2021-07-22 08:44:37 -07:00
Nico Weber	9d43c000e1	[lld/mac] Move handling of special undefineds later treatUndefinedSymbol() was previously called before gatherInputSections() and markLive() for these special symbols, but after them for normal undefineds. For PR50760, treatUndefinedSymbol() will have to potentially create sections, so it's good to move treatUndefinedSymbol() for special undefineds later, so that it can assume that gatherInputSections() and markLive() has already been called always. No intended behavior change, but part of PR50760 (and covered in tests in the patch for the full feature). Differential Revision: https://reviews.llvm.org/D106552	2021-07-22 11:43:49 -04:00
Alexey Bataev	f828f0a90f	Revert "[OPENMP]Fix PR49787: Codegen for calling __tgt_target_teams_nowait_mapper has too few arguments." This reverts commit `b455f7f225` to fix buildbots.	2021-07-22 08:06:29 -07:00
Raphael Isemann	3d9a9fa691	[lldb] Remove a wrong assert in TestStructTypes that checks that empty structs in C always have size 0 D105471 fixes the way we assign sizes to empty structs in C mode. Instead of just giving them a size 0, we instead use the size we get from DWARF if possible. After landing D105471 the TestStructTypes test started failing on Windows. The tests checked that the size of an empty C struct is 0 while the size LLDB now reports is 4 bytes. It turns out that 4 bytes are the actual size Clang is using for C structs with the MicrosoftRecordLayoutBuilder. The commit that introduced that behaviour is `00a061dccc`. This patch removes that specific check from TestStructTypes. Note that D105471 added a series of tests that already cover this case (and the added checks automatically adjust to whatever size the target compiler chooses for empty structs).	2021-07-22 16:56:50 +02:00
Alexey Bataev	b455f7f225	[OPENMP]Fix PR49787: Codegen for calling __tgt_target_teams_nowait_mapper has too few arguments. Added missed arguments in __tgt_target_teams_nowait_mapper/__tgt_target_nowait_mapper runtime functions calls. Differential Revision: https://reviews.llvm.org/D106542	2021-07-22 07:53:37 -07:00
Aaron En Ye Shi	9ce931bd71	[HIP] Fix no matching constructor for init of shared_ptr and malloc Allow standard header versions of malloc and free to be defined before introducing the device versions. Fixes: SWDEV-295901 Reviewed By: yaxunl Differential Revision: https://reviews.llvm.org/D106463	2021-07-22 14:32:41 +00:00
Jon Chesterfield	6e9cd3e9f1	[libomptarget][nfc] Improve static assert message in dlwrap Revision of D102858. Raise dlwrap arity argument to template argument so the correct value is given in the error message. E.g. '2 == 1' instead of '2 == trait<>::nargs'. Arity higher than it should be: Before diff ``` $/plugins/cuda/dynamic_cuda/cuda.cpp:23:1: error: static_assert failed due to requirement '2 == trait<cudaError_enum (*)(unsigned int)>::nargs' "Arity Error" DLWRAP_INTERNAL(cuInit, 2); ^~~~~~~~~~~~~~~~~~~~~~~~~~ ... $/include/dlwrap.h:166:3: note: expanded from macro 'DLWRAP_COMMON' static_assert(ARITY == trait<decltype(&SYMBOL)>::nargs, "Arity Error"); \ ``` After diff In file included from $/plugins/cuda/dynamic_cuda/cuda.cpp:16: ``` $/include/dlwrap.h:131:3: error: static_assert failed due to requirement '2UL == 1UL' "Arity Error" static_assert(Requested == Required, "Arity Error"); ^ ~~~~~~~~~~~~~~~~~~~~~ $/plugins/cuda/dynamic_cuda/cuda.cpp:23:1: note: in instantiation of function template specialization 'dlwrap::verboseAssert<2UL, 1UL>' requested here DLWRAP_INTERNAL(cuInit, 2); ``` Arity lower than it should be: Before diff ``` $/plugins/cuda/dynamic_cuda/cuda.cpp:131:10: error: no matching function for call to 'dlwrap_cuInit' return dlwrap_cuInit(X); ^~~~~~~~~~~~~ $/plugins/cuda/dynamic_cuda/cuda.cpp:23:1: note: candidate function not viable: requires 0 arguments, but 1 was provided DLWRAP_INTERNAL(cuInit, 0); ``` After diff In file included from $/plugins/cuda/dynamic_cuda/cuda.cpp:16: ``` $/include/dlwrap.h:131:3: error: static_assert failed due to requirement '0UL == 1UL' "Arity Error" static_assert(Requested == Required, "Arity Error"); ^ ~~~~~~~~~~~~~~~~~~~~~ $/plugins/cuda/dynamic_cuda/cuda.cpp:23:1: note: in instantiation of function template specialization 'dlwrap::verboseAssert<0UL, 1UL>' requested here DLWRAP_INTERNAL(cuInit, 0); ``` Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106543	2021-07-22 15:24:20 +01:00
Cullen Rhodes	00e87e1c5b	[AArch64][SME] Improve diagnostic for vector select register Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D106540	2021-07-22 13:46:40 +00:00
Melanie Blower	4296d633b0	Revert "[clang][fpenv][patch] Change clang option -ffp-model=precise to select ffp-contract=on" This reverts commit `b9b696bba6`. Buildbot failures see https://lab.llvm.org/buildbot#builders/118/builds/4138 and https://lab.llvm.org/buildbot#builders/110/builds/5112	2021-07-22 09:40:54 -04:00
Raphael Isemann	eb61ffbcb2	[lldb] Fix TestCompletion by using SIGPIPE instead of SIGINT as test signal The test I added in commit `078003482e` was using SIGINT for testing the tab completion. The idea is to have a signal that only has one possible completion and I ended up picking SIGIN -> SIGINT for the test. However on non-Linux systems there is SIGINFO which is a valid completion for `SIGIN' and so the test fails there. This replaces SIGIN -> SIGINT with SIGPIP -> SIGPIPE completion which according to LLDB's signal list in Host.cpp is the only valid completion.	2021-07-22 15:35:28 +02:00
Kazu Hirata	f6413d8aaa	[Transforms] Remove getOrCreateInitFunction (NFC) The last use was removed on Jan 16, 2019 in commit `81101de585`.	2021-07-22 06:30:39 -07:00
Joseph Huber	a158d3663f	[OpenMP] Fix warnings for uninitialized block counts Summary: Fixes some warnings given for uninitialized block counts if the execution mode is not recognized. This shouldn't happen in practice because the execution mode is checked when it's read from the device.	2021-07-22 09:24:07 -04:00
Nico Weber	fd3823cc82	[gn build] (manually) port `78bda89412` from 2012 because `924d62ca4a` added it to check-llvm	2021-07-22 09:11:54 -04:00
Aaron Ballman	6bb042e700	Implement _ExtInt conversion rules Clang implemented the _ExtInt datatype as a bit-precise integer type, which was then proposed to WG14. WG14 has accepted the proposal (http://www.open-std.org/jtc1/sc22/wg14/www/docs/n2709.pdf), but Clang requires some additional work as a result. In the original Clang implementation, we elected to disallow implicit conversions involving these types until after WG14 finalized the rules. This patch implements the rules decided by WG14: no integer promotion for bit-precise types, conversions prefer the larger of the two types and in the event of a tie (say _ExtInt(32) and a 32-bit int), the standard type wins. There are more changes still needed to conform to N2709, but those will be handled in follow-up patches.	2021-07-22 09:10:36 -04:00
Raphael Isemann	77440d644b	[lldb][NFC] Allow range-based for loops over DWARFDIE's children This patch adds the ability to get a DWARFDIE's children as an LLVM range. This way we can use for range loops to iterate over them and we can use LLVM's algorithms like `llvm::all_of` to query all children. The implementation has to do some small shenanigans as the iterator needs to store a DWARFDIE, but a DWARFDIE container is also a DWARFDIE so it can't return the iterator by value. I just made the `children` getter a templated function to avoid the cyclic dependency. Reviewed By: #lldb, werat, JDevlieghere Differential Revision: https://reviews.llvm.org/D103172	2021-07-22 15:03:30 +02:00
Med Ismail Bennani	312b43da05	[lldb/Plugins] Add ScriptedProcess Process Plugin This patch introduces Scripted Processes to lldb. The goal, here, is to be able to attach in the debugger to fake processes that are backed by script files (in Python, Lua, Swift, etc ...) and inspect them statically. Scripted Processes can be used in cooperative multithreading environments like the XNU Kernel or other real-time operating systems, but it can also help us improve the debugger testing infrastructure by writing synthetic tests that simulate hard-to-reproduce process/thread states. Although ScriptedProcess is not feature-complete at the moment, it has basic execution capabilities and will improve in the following patches. rdar://65508855 Differential Revision: https://reviews.llvm.org/D100384 Signed-off-by: Med Ismail Bennani <medismail.bennani@gmail.com>	2021-07-22 14:47:33 +02:00
Melanie Blower	b9b696bba6	[clang][fpenv][patch] Change clang option -ffp-model=precise to select ffp-contract=on Change the ffp-model=precise to enables -ffp-contract=on (previously -ffp-model=precise enabled -ffp-contract=fast). This is a follow-up to Andy Kaylor's comments in the llvm-dev discussion "Floating Point semantic modes". From the same email thread, I put Andy's distillation of floating point options and floating point modes into UsersManual.rst Also fixes bugs.llvm.org/show_bug.cgi?id=50222 Reviewed By: rjmccall, andrew.kaylor Differential Revision: https://reviews.llvm.org/D74436	2021-07-22 07:59:18 -04:00
Raphael Isemann	078003482e	[lldb] Fix that `process signal` completion always returns all signals `CompletionRequest::AddCompletion` adds the given string as completion of the current command token. `CompletionRequest::TryCompleteCurrentArg` only adds it if the current token is a prefix of the given string. We're using `AddCompletion` for the `process signal` handler which means that `process signal SIGIN` doesn't get uniquely completed to `process signal SIGINT` as we unconditionally add all other signals (such as `SIGABRT`) as possible completions. By using `TryCompleteCurrentArg` we actually do the proper filtering which will only add `SIGINT` (as that's the only signal with the prefix 'SIGIN' in the example above). Reviewed By: mib Differential Revision: https://reviews.llvm.org/D105028	2021-07-22 13:51:21 +02:00
Caroline Concatto	5a4de84d55	[LoopVectorize] Fix crash for predicated instruction with scalable VF This patch avoids computing discounts for predicated instructions when the VF is scalable. There is no support for vectorization of loops with division because the vectorizer cannot guarantee that zero divisions will not happen. This loop now does not use VF scalable ``` for (long long i = 0; i < n; i++) if (cond[i]) a[i] /= b[i]; ``` Differential Revision: https://reviews.llvm.org/D101916	2021-07-22 12:48:27 +01:00
Paulo Matos	842e718b66	Add support for zero-sized Scalars as a LowLevelType Opaque values (of zero size) can be stored in memory with the implementation of reference types in the WebAssembly backend. Since MachineMemOperand uses LLTs we need to be able to support zero-sized scalar types in LLTs. Differential Revision: https://reviews.llvm.org/D105423	2021-07-22 13:47:19 +02:00
Raphael Isemann	12a89e14b8	[lldb][NFCI] Remove redundant accessibility heuristic in the DWARF parser LLDB's DWARF parser has some heuristics for guessing and fixing up the accessibility of C++ class/struct members after they were already created in the internal Clang AST. The heuristic is that if a struct/class has a base class, then it's actually a class and its members are private unless otherwise specified. From what I can see this heuristic isn't sound and is also unnecessary. The idea that inheritance implies that the `class` keyword was used and the default visibility is `private` is incorrect. Also both GCC and Clang use `DW_TAG_structure_type` and `DW_TAG_class_type` for `struct` and `class` types respectively, so the default visibility we infer from that information is always correct and there is no need to fix it up. And finally, the access specifiers we set in the Clang AST are anyway unused within LLDB. The expression parser explicitly ignores them to give users access to private members and there is no SBAPI functionality that exposes this information. This patch removes all the heuristic code for the reasons above and instead just relies on the access values we infer from the tag kind and explicit annotations in DWARF. This patch is NFCI. Reviewed By: werat Differential Revision: https://reviews.llvm.org/D105463	2021-07-22 13:36:23 +02:00
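
The two lldb completion commits above (`078003482e` and `eb61ffbcb2`) both turn on prefix filtering: a candidate should only be offered when the current token is a prefix of it. The following is a minimal C++ sketch of that idea, not LLDB's code. The function `completeByPrefix` and both signal lists are hypothetical names and data introduced for illustration only; the real logic lives in `CompletionRequest::TryCompleteCurrentArg`.

```cpp
#include <string>
#include <string_view>
#include <vector>

// Hypothetical signal tables for illustration; these are NOT LLDB's lists.
// The second one models a platform that also defines SIGINFO.
const std::vector<std::string> kSignalsWithoutInfo = {"SIGABRT", "SIGINT", "SIGPIPE"};
const std::vector<std::string> kSignalsWithInfo = {"SIGABRT", "SIGINFO", "SIGINT", "SIGPIPE"};

// Hypothetical helper: keep only the candidates that start with `token`.
// This mirrors the filtering behaviour the commit message attributes to
// TryCompleteCurrentArg, as opposed to unconditionally adding everything.
std::vector<std::string> completeByPrefix(std::string_view token,
                                          const std::vector<std::string>& candidates) {
  std::vector<std::string> matches;
  for (const std::string& candidate : candidates) {
    // substr clamps its length, so candidates shorter than the token are safe.
    if (std::string_view(candidate).substr(0, token.size()) == token)
      matches.push_back(candidate);
  }
  return matches;
}
```

Under these assumed tables, `completeByPrefix("SIGIN", kSignalsWithInfo)` yields two matches (`SIGINFO` and `SIGINT`), which is the ambiguity the test fix describes, while `completeByPrefix("SIGPIP", ...)` yields only `SIGPIPE` for either table.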

394477 Commits

All Branches