llvm-project

Commit Graph

Author	SHA1	Message	Date
Praveen Velliengiri	e90b512c4d	[AMDGPU] Change ASAN init/fini kernels linkage to external. HSA runtime fails to find the symbols for Init and Fini kernels as they mark with internal linkage, changing the linkage to external to fix those errors. Differential Revision: https://reviews.llvm.org/D110054	2021-09-27 11:50:37 -06:00
Sumesh Udayakumaran	b2af2aeea6	[mlir] Mode for explicitly controlling the fusion kind New mode option that allows for either running the default fusion kind that happens today or doing either of producer-consumer or sibling fusion. This will also be helpful to minimize the compile-time of the fusion tests. Reviewed By: bondhugula, dcaballe Differential Revision: https://reviews.llvm.org/D110102	2021-09-27 20:37:42 +03:00
Quinn Pham	682e15f371	[PowerPC] Fix td pattern for P10 VSLDBI and VSRDBI This patch fixes the pattern for the P10 instructions Vector Shift Left Double by Bit Immediate VN-form and Vector Shift Right Double by Bit Immediate VN-form. The third argument should be a target constant (`timm`) instead of an `i32` because an immediate is expected. Reviewed By: lei Differential Revision: https://reviews.llvm.org/D109920	2021-09-27 12:36:18 -05:00
Yaxun (Sam) Liu	c4afb5f81b	[HIP] Fix linking of asanrt.bc HIP currently uses -mlink-builtin-bitcode to link all bitcode libraries, which changes the linkage of functions to be internal once they are linked in. This works for common bitcode libraries since these functions are not intended to be exposed for external callers. However, the functions in the sanitizer bitcode library is intended to be called by instructions generated by the sanitizer pass. If their linkage is changed to internal, their parameters may be altered by optimizations before the sanitizer pass, which renders them unusable by the sanitizer pass. To fix this issue, HIP toolchain links the sanitizer bitcode library with -mlink-bitcode-file, which does not change the linkage. A struct BitCodeLibraryInfo is introduced in ToolChain as a generic approach to pass the bitcode library information between ToolChain and Tool. Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D110304	2021-09-27 13:25:46 -04:00
William S. Moses	6dd5b1e33e	[MLIR][LLVM] Add error if using incorrect attribute type for specifying LLVM linkage Address post-commit review in https://reviews.llvm.org/D108524 to add appropriate diagnostics. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D110566	2021-09-27 13:24:05 -04:00
peter klausler	1c2e5fd66e	[flang] Enforce constraint: defined ass't in WHERE must be elemental A defined assignment subroutine invoked in the context of a WHERE statement or construct must necessarily be elemental (C1032). Differential Revision: https://reviews.llvm.org/D109932	2021-09-27 10:12:53 -07:00
Craig Topper	a2a07e8db3	[RISCV] Fold store of vmv.x.s to a vse with VL=1. This can avoid a loss of decoupling with the scalar unit on cores with decoupled scalar and vector units. We should support FP too, but those use extract_element and not a custom ISD node so it is a little different. I also left a FIXME in the test for i64 extract and store on RV32. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D109482	2021-09-27 09:54:46 -07:00
Fangrui Song	2bf06d9345	[ELF] Support symbol names with space in linker script expressions Fix PR51961 Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D110490	2021-09-27 09:50:42 -07:00
Kazu Hirata	59540b29f8	[InstCombine] Fix an "unused variable" warning	2021-09-27 09:49:32 -07:00
Bixia Zheng	fbd5821c6f	Implement the conversion from sparse constant to sparse tensors. The sparse constant provides a constant tensor in coordinate format. We first split the sparse constant into a constant tensor for indices and a constant tensor for values. We then generate a loop to fill a sparse tensor in coordinate format using the tensors for the indices and the values. Finally, we convert the sparse tensor in coordinate format to the destination sparse tensor format. Add tests. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D110373	2021-09-27 09:47:29 -07:00
@vladaindjic	5357a98c82	[OpenMP] libomp: Usage of TASK_TIED constant inside kmp_gsupport.cpp The minor code refactorization introduces the TASK_TIED constant inside kmp_gsupprot.cpp as a replacement for the literal value 1. The mentioned constant is now used in both kmp_tasking.cpp and kmp_gsupport.cpp files. Differential Revision: https://reviews.llvm.org/D110441	2021-09-27 19:45:56 +03:00
Craig Topper	933182e948	[RISCV] Improve support for forming widening multiplies when one input is a scalar splat. If one input of a fixed vector multiply is a sign/zero extend and the other operand is a splat of a scalar, we can use a widening multiply if the scalar value has sufficient sign/zero bits. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D110028	2021-09-27 09:37:07 -07:00
Daniil Fukalov	1f73f0c19d	[NFC][AMDGPU] Update cost model tests: 1. Convert to generated tests. 2. Added code-size case in few places.	2021-09-27 19:26:02 +03:00
Sanjay Patel	9075edc89b	[InstCombine] move shl-only folds out from under commonShiftTransforms(); NFCI This is no-functional-change-intended, but it hopefully makes things slightly clearer and more efficient to have transforms that require 'shl' be called only from visitShl(). Further cleanup is possible.	2021-09-27 12:09:47 -04:00
Pavel Labath	3dbf27e762	[lldb] A different fix for Domain Socket tests we need to drop nuls from the end of the string.	2021-09-27 18:00:27 +02:00
Kazu Hirata	b68a62b3a9	[Lanai] Remove redundant declaration getTheLanaiTarget (NFC) Note that getTheLanaiTarget is declared in TargetInfo/LanaiTargetInfo.h, which LanaiDisassembler.cpp includes. Identified with readability-redundant-declaration.	2021-09-27 08:58:27 -07:00
Kirill Bobyrev	0b1eff1bc5	[clangd] Refactor IncludeStructure: use File (unsigned) for most computations Preparation for D108194. Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D110386	2021-09-27 17:50:53 +02:00
Joseph Huber	74d622dea4	[OpenMP] Add new worksharing definitions into device RTL This path defines the newly added `__kmpc_disitrute_static_init` functions in the device runtime library. These functions are currently exact copies of the current worksharing method but can be tuned later. Depends on D110429 Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D110430	2021-09-27 11:36:41 -04:00
Joseph Huber	b4a5543624	[OpenMP] Introduce a new worksharing RTL function for distribute This patch adds a new RTL function for worksharing. Currently we use `__kmpc_for_static_init` for both the `distribute` and `parallel` portion of the loop clause. This patch replaces the `distribute` portion with a new runtime call `__kmpc_distribute_static_init`. Currently this will be used exactly the same way, but will make it easier in the future to fine-tune the distribute and parallel portion of the loop. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D110429	2021-09-27 11:36:37 -04:00
Raphael Isemann	be2a4216fc	[lldb] Fix SocketTest.DomainGetConnectURI on macOS by stripping more zeroes from getpeername result Apparently macOS is padding the name result with several padding zeroes at the end. Just strip them all to pretend it's a C-string. Thanks to Pavel for suggesting this fix.	2021-09-27 17:34:45 +02:00
Nico Weber	7664508910	[llvm/OptTable] Add named param comment for GroupedShortOption	2021-09-27 11:33:29 -04:00
Jake Egan	56049b7129	Fix tests defaulting to incorrect triples on AIX The tests only specify -march, so when the tests are run on AIX the target OS defaults to AIX, which causes the tests to misbehave. This patch constrains the tests by specifying -mtriple instead of -march. Reviewed By: daltenty, jsji, MaskRay Differential Revision: https://reviews.llvm.org/D110186	2021-09-27 11:30:45 -04:00
Nico Weber	730bbc6f72	[llvm/OptTable] Drop "The" prefix on fields	2021-09-27 11:24:51 -04:00
Nico Weber	6ffd8e3902	[llvm] Convert OptTable::ParseOneArg() to std::unique_ptr<>	2021-09-27 11:19:21 -04:00
Nico Weber	7789a68e5a	[llvm] Convert OptTable::parseOneArgGrouped() to std::unique_ptr<>	2021-09-27 11:19:15 -04:00
Nico Weber	2f955424c4	[llvm] ConvertOption::accept(), acceptInternal() to std::unique_ptr<> These functions transfer ownership to the caller. Make this clear in the type system. No behavior change.	2021-09-27 11:05:02 -04:00
Sanjay Patel	21429cf43a	[InstCombine] generalize fold for (trunc (X u>> C1)) u>> C This is another step towards trying to re-apply D110170 by eliminating conflicting transforms that cause infinite loops. `a47c8e40c7` was a previous patch in this direction. The diffs here are mostly cosmetic, but intentional: 1. The existing code that would handle this pattern in FoldShiftByConstant() is limited to 'shl' only now. The formatting change to IsLeftShift shows that we could move several transforms into visitShl() directly for efficiency because they are not common shift transforms. 2. The tests are regenerated to show new instruction names to prove that we are getting (almost) identical logic results. 3. The one case where we differ ("trunc_sandwich_small_shift1") shows that we now use a narrow 'and' instruction. Previously, we relied on another transform to do that, but it is limited to legal types. That seems to be a legacy constraint from when IR analysis and codegen were less robust. https://alive2.llvm.org/ce/z/JxyGA4 declare void @llvm.assume(i1) define i8 @src(i32 %x, i32 %c0, i8 %c1) { ; The sum of the shifts must not overflow the source width. %z1 = zext i8 %c1 to i32 %sum = add i32 %c0, %z1 %ov = icmp ult i32 %sum, 32 call void @llvm.assume(i1 %ov) %sh1 = lshr i32 %x, %c0 %tr = trunc i32 %sh1 to i8 %sh2 = lshr i8 %tr, %c1 ret i8 %sh2 } define i8 @tgt(i32 %x, i32 %c0, i8 %c1) { %z1 = zext i8 %c1 to i32 %sum = add i32 %c0, %z1 %maskc = lshr i8 -1, %c1 %s = lshr i32 %x, %sum %t = trunc i32 %s to i8 %a = and i8 %t, %maskc ret i8 %a }	2021-09-27 10:57:31 -04:00
Sanjay Patel	025a805d7c	[InstCombine] match variable names and code comments; NFC Similar to: `29c09c7` Planned follow-up is to add a transform here to allow removing a common shift fold that is conflicting with D110170.	2021-09-27 10:57:31 -04:00
Amy Kwan	1f5b60ad47	Explicitly specify -fintegrated-as to clang/test/Driver/compilation_database.c test case. It appears that this test assumes that the toolchain utilizes the integrated assembler by default, since the expected output in the CHECKs are compilation_database.o. However, this test fails on AIX as AIX does not utilize the integrated assembler. On AIX, the output instead is of the form /tmp/compilation_database-*.s. Thus, this patch explicitly adds the -fintegrated-as option to match the assumption that the integrated assembler is used by default. Differential Revision: https://reviews.llvm.org/D110431	2021-09-27 09:56:18 -05:00
Eugene Zhulenev	92db09cde0	[mlir] AsyncRuntime: use int64_t for ref counting operations Workaround for SystemZ ABI problem: https://bugs.llvm.org/show_bug.cgi?id=51898 Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D110550	2021-09-27 07:55:01 -07:00
Dmitry Vyukov	94ea36649e	tsan: fix trace tests on darwin The trace tests crashed on darwin because of some thread initialization issues (thread initialization is somewhat different on darwin). Instead of starting real threads, create a new ThreadState in the main thread. This makes the tests more unit-testy and hopefully won't crash on darwin (there is almost no platform-specific code involved now). This will also help with future trace tests that will need more than 1 thread. Creating more than 1 real thread and dispatching test actions across multiple threads in the required deterministic order is painful. Depends on D110539. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D110546	2021-09-27 16:40:57 +02:00
Dmitry Vyukov	b72176b9bc	tsan: add a test for stack init race Depends on D110538. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D110539	2021-09-27 16:40:17 +02:00
Dmitry Vyukov	b4c1e5cb73	tsan: fix and test detection of TLS races Currently detection of races with TLS/stack initialization is broken because we imitate the write before thread initialization, so it's modelled with a wrong thread/epoch. Fix that and add a test. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D110538	2021-09-27 16:40:08 +02:00
Sebastian Neubauer	bf980930e5	[AMDGPU] Ignore KILLs when forming clauses KILL instructions are sometimes present and prevented hard clauses from being formed. Fix this by ignoring all meta instructions in clauses. Differential Revision: https://reviews.llvm.org/D106042	2021-09-27 16:33:52 +02:00
Nico Weber	63bb2d585e	[clang] Put original flags on 'Driver args:' crash report line We used to put the canonical spelling of flags after alias processing on that line. For clang-cl in particular, that meant that we put flags on that line that the clang-cl driver doesn't even accept, and the "Driver args:" line wasn't usable. Differential Revision: https://reviews.llvm.org/D110458	2021-09-27 10:24:46 -04:00
Dmitry Vyukov	1455b552b7	tsan: de-hardcode MemCount const Use MemCount instead of hard-coded value 7. Reviewed By: melver Differential Revision: https://reviews.llvm.org/D110532	2021-09-27 16:11:49 +02:00
Michał Górny	33031545bf	[lldb] [DynamicRegisterInfo] Add a convenience method to add suppl. registers Add a convenience method to add supplementary registers that takes care of adding invalidate_regs to all (potentially) overlapping registers. Differential Revision: https://reviews.llvm.org/D110023	2021-09-27 16:01:30 +02:00
Sjoerd Meijer	eba76056a3	[FuncSpec] Don't specialise (or crash) on poison or constexpr values Function specialization was crashing on poison values and constexpr values. The problem is that these values are not added to the solver, so it crashes when a lookup is performed for these values. This fixes that by not specialising on these values. For poison that is obvious, but for constexpr this is a change in behaviour. Thus, in one way this is a bit of a stopgap, but specialising on constexpr values wasn't done very intentionally, and need some more work and tests if we wanted to support this. As a follow up, we need to look if the solver should exit more gracefully and return a "don't know", or that it should really support these constexprs. This should fix PR51600 (https://bugs.llvm.org/show_bug.cgi?id=51600). Differential Revision: https://reviews.llvm.org/D110529	2021-09-27 14:58:53 +01:00
David Green	ebee606e38	[AArch64] Fix neon-reverseshuffle test extension. NFC Apparently I gave a ll file a .patch extension. Oops.	2021-09-27 14:43:26 +01:00
Aaron Ballman	38d09080c9	Removing a default constructor argument; NFC The argument is always used with its default value, so remove the argument entirely.	2021-09-27 09:41:28 -04:00
Sjoerd Meijer	a588ae482b	[LoopFlatten] Precommit new test widen-iv2.ll for D110234.	2021-09-27 14:37:44 +01:00
gbreynoo	05b1c7aebf	[llvm-dwarfdump][docs] Add missing options to the help output and the command guide This change is to add some missing details to the help text and command guide: - Added a note to the command guide that --debug-macro also dumps .debug_macinfo. - Added a note to the command guide that --debug-frame and --eh_frame are aliases, and in cases where both sections are present one command outputs both. - Changed the wording in the help output for --ignore-case and --regex to closer match the command guide.	2021-09-27 14:28:31 +01:00
Jun Ma	3a998c06a8	Revert "Recommit "Revert "[CVP] processSwitch: Remove default case when switch cover all possible values.""" This reverts commit `8ba2adcf9e`.	2021-09-27 20:39:05 +08:00
LLVM GN Syncbot	e2eb651cfc	[gn build] Port `9da2fa277e`	2021-09-27 12:33:13 +00:00
Michał Górny	9da2fa277e	[lldb] Move StringConvert inside debugserver The StringConvert API is no longer used anywhere but in debugserver. Since debugserver does not use LLVM API, we cannot replace it with llvm::to_integer() and llvm::to_float() there. Let's just move the sources into debugserver. Differential Revision: https://reviews.llvm.org/D110478	2021-09-27 14:32:42 +02:00
Pushpinder Singh	b1695c2eb8	[AMDGPU][OpenMP] Add memory pool size check to isValidMemoryPool Keeping all the checks in one place for future simplification. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D110513	2021-09-27 12:29:00 +00:00
Michał Górny	93b82f45bc	[lldb] [Host] Refactor XML converting getters Refactor the XML converting attribute and text getters to use LLVM API. While at it, remove some redundant error and missing XML support handling, as the called base functions do that anyway. Add tests for these methods. Note that this patch changes the getter behavior to be IMHO more correct. In particular: - negative and overflowing integers are now reported as failures to convert, rather than being wrapped over or capped - digits followed by text are now reported as failures to convert to double, rather than their numeric part being converted Differential Revision: https://reviews.llvm.org/D110410	2021-09-27 14:26:33 +02:00
Michael Kruse	1b242dccff	[OpenMP][CMake] Use in-project clang as CUDA->IR compiler for new DeviceRTL. Use the in-project clang, llvm-link and opt if available and unless CMake cache variables specify to use a different compiler. This applies D101265 to the new DeviceRTL's CMakeLists.txt which was copied before D101265 was applied. Fixes the openmp-offloading-cuda-runtime builder which was failing since D110006. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D110251	2021-09-27 07:14:19 -05:00
Tobias Gysi	e158b5634a	[mlir][linalg] Make fusion on tensor rewriter friendly (NFC). Let the calling pass or pattern replace the uses of the original root operation. Internally, the tileAndFuse still replaces uses and updates operands but only of newly created operations. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D110169	2021-09-27 11:28:25 +00:00
Emre Kultursay	d5629b5d4d	Fix rendezvous for rebase_exec=true case When rebase_exec=true in DidAttach(), all modules are loaded before the rendezvous breakpoint is set, which means the LoadInterpreterModule() method is not called and m_interpreter_module is not initialized. This causes the very first rendezvous breakpoint hit with m_initial_modules_added=false to accidentally unload the module_sp that corresponds to the dynamic loader. This bug (introduced in D92187) was causing the rendezvous mechanism to not work in Android 28. The mechanism works fine on older/newer versions of Android. Test: Verified rendezvous on Android 28 and 29 Test: Added dlopen test Reviewed By: labath Differential Revision: https://reviews.llvm.org/D109797	2021-09-27 13:27:27 +02:00

1 2 3 4 5 ...

400160 Commits All Branches Search

400160 Commits

All Branches