llvm-project

Commit Graph

Author	SHA1	Message	Date
Saleem Abdulrasool	7a20670d16	AST: correct name decoration for swift async functions on Windows The name decoration scheme on Windows does not have a vendor namespace, and the decoration scheme is not shared ownership - it is controlled by Microsoft. `T` is a reserved identifier for an unknown calling convention. The `W` identifier has been discussed with Microsoft offline and is reserved as `Swift_3` as the identifier for the swift async calling convention. Adjust the name decoration accordingly.	2021-07-13 10:04:11 -07:00
Philip Reames	e4b43973fb	[ScalarEvolution] Fix overflow when computing max trip counts This is split from D105216 to reduce patch complexity. Original code by Eli with very minor modification by me. The primary point of this patch is to add the getUDivCeilSCEV routine. I included the two callers with constant arguments as we know those must constant fold even without any of the fancy inference logic.	2021-07-13 10:01:10 -07:00
Arthur Eubanks	489742991f	[NFC] Inline variable to prevent unused variable warning	2021-07-13 09:57:59 -07:00
thomasraoux	ae4cea38f1	[mlir] Add support for tensor.extract to comprehensive bufferization Differential Revision: https://reviews.llvm.org/D105870	2021-07-13 09:54:46 -07:00
Craig Topper	46e8970817	[RISCV] Prevent use of t0(aka x5) as rs1 for jalr instructions. Some microarchitectures treat rs1=x1/x5 on jalr as a hint to pop the return-address stack. We should avoid using x5 on jalr instructions since we aren't using x5 as an alternate link register. Differential Revision: https://reviews.llvm.org/D105875	2021-07-13 09:46:21 -07:00
Guillaume Chatelet	2c47b8847e	Revert "[llvm] Add enum iteration to Sequence" This reverts commit `a006af5d6e`.	2021-07-13 16:44:42 +00:00
Arthur Eubanks	ab5693aa4a	[OpaquePtr] Use byval type more	2021-07-13 09:34:34 -07:00
Arthur Eubanks	113a807977	[OpaquePtr] Get load/store type without PointerType::getElementType()	2021-07-13 09:34:34 -07:00
Arthur Eubanks	693bc04bf6	[OpaquePtr] Use GlobalValue::getValueType() more	2021-07-13 09:34:34 -07:00
Arthur Eubanks	b25aca503d	[OpaquePtr] Use AllocaInst::getAllocatedType()	2021-07-13 09:34:33 -07:00
Julian Lettner	1893b630fe	Avoid triggering assert when program calls OSAtomicCompareAndSwapLong A previous change brought the new, relaxed implementation of "on failure memory ordering" for synchronization primitives in LLVM over to TSan land [1]. It included the following assert: ``` // 31.7.2.18: "The failure argument shall not be memory_order_release // nor memory_order_acq_rel". LLVM (2021-05) fallbacks to Monotonic // (mo_relaxed) when those are used. CHECK(IsLoadOrder(fmo)); static bool IsLoadOrder(morder mo) { return mo == mo_relaxed \|\| mo == mo_consume \|\| mo == mo_acquire \|\| mo == mo_seq_cst; } ``` A previous workaround for a false positive when using an old Darwin synchronization API assumed this failure mode to be unused and passed a dummy value [2]. We update this value to `mo_relaxed` which is also the value used by the actual implementation to avoid triggering the assert. [1] https://reviews.llvm.org/D99434 [2] https://reviews.llvm.org/D21733 rdar://78122243 Differential Revision: https://reviews.llvm.org/D105844	2021-07-13 09:33:50 -07:00
Nicolas Vasilache	68ae8bacfc	[mlir][Linalg] Properly specify Linalg attribute. This fixes undefined reference introduced by https://reviews.llvm.org/D105859 Differential Revision: https://reviews.llvm.org/D105897	2021-07-13 16:33:33 +00:00
Fangrui Song	3d89fb4d13	[RISCV] Support machine constraint "S" Similar to D46745, "S" represents an absolute symbolic operand, which can be used to specify the access models, e.g. extern int var; void addr_via_asm() { void ret; asm("lui %0, %%hi(%1)\naddi %0,%0,%%lo(%1)" : "=r"(ret) : "S"(&var)); return ret; } 'S' is documented in trunk GCC: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101275 Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D105254	2021-07-13 09:30:09 -07:00
Guillaume Chatelet	a006af5d6e	[llvm] Add enum iteration to Sequence This patch allows iterating typed enum via the ADT/Sequence utility. Differential Revision: https://reviews.llvm.org/D103900	2021-07-13 16:22:19 +00:00
Aart Bik	7039dfc6dd	[mlir][memref] adjust integration tests to new lowering passes these tests run under the emulator and thus were overlooked Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D105855	2021-07-13 09:14:41 -07:00
Albion Fung	f1aca5ac96	[PowerPC] Fix L[D\|W]ARX Implementation LDARX and LWARX sometimes gets optimized out by the compiler when it is critical to the correctness of the code. This inline asm generation ensures that it preserved. Differential Revision: https://reviews.llvm.org/D105754	2021-07-13 11:02:07 -05:00
Simon Pilgrim	4975837f14	[InstCombine] Add basic (select C, (gep Ptr, Idx), Ptr) tests from PR50183	2021-07-13 16:57:40 +01:00
Simon Pilgrim	1bfec34ac3	[InstCombine] Regenerate select-gep.ll tests	2021-07-13 16:54:18 +01:00
Victor Huang	10e0cdfc65	[PowerPC][NFC] Power ISA features for Semachecking [NFC] This patch adds features for pwr7, pwr8, and pwr9 that can be used for semachecking builtin functions that are only valid for certain versions of ppc. Reviewed By: nemanjai, #powerpc Authored By: Quinn Pham <Quinn.Pham@ibm.com> Differential revision: https://reviews.llvm.org/D105501	2021-07-13 10:51:25 -05:00
Anton Zabaznov	03d8fed349	[OpenCL] Add verbosity when checking support of read_write images Parenthesis were fixed incorrectly by D105890 Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D105892	2021-07-13 18:47:29 +03:00
Tres Popp	32627f4ab4	[mlir] Handle unused variable when assertions are disabled.	2021-07-13 17:31:12 +02:00
John Ericson	1e03c37b97	Prepare Compiler-RT for GnuInstallDirs, matching libcxx, document all This is a second attempt at D101497, which landed as `9a9bc76c0e` but had to be reverted in `8cf7ddbdd4`. This issue was that in the case that `COMPILER_RT_INSTALL_PATH` is empty, expressions like "${COMPILER_RT_INSTALL_PATH}/bin" evaluated to "/bin" not "bin" as intended and as was originally. One solution is to make `COMPILER_RT_INSTALL_PATH` always non-empty, defaulting it to `CMAKE_INSTALL_PREFIX`. D99636 adopted that approach. But, I think it is more ergonomic to allow those project-specific paths to be relative the global ones. Also, making install paths absolute by default inhibits the proper behavior of functions like `GNUInstallDirs_get_absolute_install_dir` which make relative install paths absolute in a more complicated way. Given all this, I will define a function like the one asked for in https://gitlab.kitware.com/cmake/cmake/-/issues/19568 (and needed for a similar use-case). --- Original message: Instead of using `COMPILER_RT_INSTALL_PATH` through the CMake for complier-rt, just use it to define variables for the subdirs which themselves are used. This preserves compatibility, but later on we might consider getting rid of `COMPILER_RT_INSTALL_PATH` and just changing the defaults for the subdir variables directly. --- There was a seaming bug where the (non-Apple) per-target libdir was `${target}` not `lib/${target}`. I suspect that has to do with the docs on `COMPILER_RT_INSTALL_PATH` saying was the library dir when that's no longer true, so I just went ahead and fixed it, allowing me to define fewer and more sensible variables. That last part should be the only behavior changes; everything else should be a pure refactoring. --- I added some documentation of these variables too. In particular, I wanted to highlight the gotcha where `-DSomeCachePath=...` without the `:PATH` will lead CMake to make the path absolute. See [1] for discussion of the problem, and [2] for the brief official documentation they added as a result. [1]: https://cmake.org/pipermail/cmake/2015-March/060204.html [2]: https://cmake.org/cmake/help/latest/manual/cmake.1.html#options In `38b2dec37e` the problem was somewhat misidentified and so `:STRING` was used, but `:PATH` is better as it sets the correct type from the get-go. --- D99484 is the main thrust of the `GnuInstallDirs` work. Once this lands, it should be feasible to follow both of these up with a simple patch for compiler-rt analogous to the one for libcxx. Reviewed By: phosek, #libc_abi, #libunwind Differential Revision: https://reviews.llvm.org/D105765	2021-07-13 15:21:41 +00:00
Matt Arsenault	fb44c3223e	AMDGPU: Promote signext/zeroext i16 shader returns This makes them consistent with all the other return convention handling. If we don't do this, we lose the sext/zext flag if treated as a full assignment, which complicates a future GlobalISel patch.	2021-07-13 11:04:51 -04:00
Matt Arsenault	222fde1eec	GlobalISel: Use extension instead of merge with undef in common case This fixes not respecting signext/zeroext in these cases. In the anyext case, this avoids a larger merge with undef and should be a better canonical form. This should also handle this if a merge is needed, but I'm not aware of a case where that can happen. In a future change this will also allow AMDGPU to drop some custom code without introducing regressions.	2021-07-13 11:04:47 -04:00
Matt Arsenault	77a608d9de	GlobalISel: Remove getIntrinsicID utility function This is redundant with a method directly on MachineInstr	2021-07-13 11:04:10 -04:00
Matt Arsenault	121541fdcd	Mips/GlobalISel: Use more standard call lowering infrastructure This also fixes some missing implicit uses on call instructions, adds missing G_ASSERT_SEXT/ZEXT annotations, and some missing outgoing sext/zexts. This also fixes not respecting tablegen requested type promotions. This starts treating f64 passed in i32 GPRs as a type of custom assignment, which restores some previously XFAILed tests. This is due to getNumRegistersForCallingConv returns a static value, but in this case it is context dependent on other arguments. Most of the ugliness is reproducing a hack CC_MipsO32 uses in SelectionDAG. CC_MipsO32 depends on a bunch of vectors populated from the original IR argument types in MipsCCState. The way this ends up working in GlobalISel is it only ends up inspecting the most recently added vector element. I'm pretty sure there are cleaner ways to do this, but this seemed easier than fixing up the current DAG handling. This is another case where it would be easier of the CCAssignFns were passed the original type instead of only the pre-legalized ones. There's still a lot of junk here that shouldn't be necessary. This also likely breaks big endian handling, but it wasn't complete/tested anyway since the IRTranslator gives up on big endian targets.	2021-07-13 11:04:10 -04:00
Matt Arsenault	6a3904f16e	Mips: Mark special case calling convention handling as custom The number of registers used for passing f64 in some cases is context dependent, and thus getNumRegistersForCallingConv is sometimes inaccurate. For f64, it reports 1 but is sometimes split into 2 32-bit registers. For GlobalISel, the generic argument assignment code expects getNumRegistersForCallingConv to return an accurate answer. Switch to marking these arguments as custom so we can deal with this case as a custom assignment rather. This temporarily breaks a few globalisel tests which are fixed by a future change to use more of the generic infrastructure.	2021-07-13 11:04:10 -04:00
Louis Dionne	0da95a5cf2	[libc++] Workaround non-constexpr std::exchange pre C++20 std::exchange is only constexpr in C++20 and later. We were using it in a constructor marked unconditionally constexpr, which caused issues when building with -std=c++17. The weird part is that the issue only showed up when building on the arm64 macs, but that must be caused by the specific version of Clang used on those. Since the code is clearly wrong and the fix is obvious, I'm not going to investigate this further.	2021-07-13 10:51:03 -04:00
Louis Dionne	c5ad8bb8d4	[libc++] Target x86_64 only for the backdeployment jobs Differential Revision: https://reviews.llvm.org/D105846	2021-07-13 10:29:08 -04:00
Louis Dionne	2a9366c0e5	[libc++] Generate ABI list for macOS arm64	2021-07-13 10:16:50 -04:00
Hansang Bae	db635a28e6	[OpenMP] Minor improvement in task allocation This patch includes a few changes to improve task allocation performance slightly. These changes are enough to restore performance drop observed after introducing hidden helper. Differential Revision: https://reviews.llvm.org/D105715	2021-07-13 09:07:14 -05:00
Bogdan Graur	e9533b8492	[NFC] Add paranthesis around logical expression to silence -Wlogical-op-parentheses warning. Reviewed By: alexfh Differential Revision: https://reviews.llvm.org/D105890	2021-07-13 15:54:31 +02:00
Simon Pilgrim	b2f6cf1479	[InstCombine] Fold lshr/ashr(or(neg(x),x),bw-1) --> zext/sext(icmp_ne(x,0)) (PR50816) Handle the missing fold reported in PR50816, which is a variant of the existing ashr(sub_nsw(X,Y),bw-1) --> sext(icmp_sgt(X,Y)) fold. We also handle the lshr(or(neg(x),x),bw-1) --> zext(icmp_ne(x,0)) equivalent - https://alive2.llvm.org/ce/z/SnZmSj We still allow multi uses of the neg(x) - as this is likely to let us further simplify other uses of the neg - but not multi uses of the or() which would increase instruction count. Differential Revision: https://reviews.llvm.org/D105764	2021-07-13 14:44:54 +01:00
Dave MacLachlan	45ffe6341d	[clang/objc] Optimize getters for non-atomic, copied properties Properties that were declared `@property(copy, nonatomic) id foo` make an unnecessary call to objc_get_property(). This call can be replaced with a direct access to the backing variable identical to how a `@property(nonatomic) id foo` would do it. This reduces codegen by 4 bytes (x86_64/arm64) and removes a cross linkage unit function call per property declared as copy/nonatomic. Differential Revision: https://reviews.llvm.org/D105311	2021-07-13 09:22:13 -04:00
Simon Pilgrim	c99e17fef5	[InstCombine] Pre-commit ashr(or(neg(x),x),bw-1) --> sext(icmp_ne(x,0)) tests from D105764 Added 'thwart complexity-based canonicalization' hacks and the lshr(or(neg(x),x),bw-1) --> zext(icmp_ne(x,0)) variants suggested by Sanjay.	2021-07-13 13:48:17 +01:00
Anton Zabaznov	ab76101f40	[OpenCL] Add support of __opencl_c_read_write_images feature macro This feature requires support of __opencl_c_images, so diagnostics for that is provided as well Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D104915	2021-07-13 15:38:23 +03:00
Roman Lebedev	4709d9d5be	[libomp] ompd_init(): fix heap-buffer-overflow when constructing libompd.so path There is no guarantee that the space allocated in `libname` is enough to accomodate the whole `dl_info.dli_fname`, because it could e.g. have an suffix - `.5`, and that highlights another problem - what it should do about suffxies, and should it do anything to resolve the symlinks before changing the filename? ``` $ LD_LIBRARY_PATH="$LD_LIBRARY_PATH:/usr/local/lib" ./src/utilities/rstest/rstest -c /tmp/f49137920.NEF dl_info.dli_fname "/usr/local/lib/libomp.so.5" strlen(dl_info.dli_fname) 26 lib_path_length 14 lib_path_length + 12 26 ================================================================= ==30949==ERROR: AddressSanitizer: heap-buffer-overflow on address 0x60300000002a at pc 0x000000548648 bp 0x7ffdfa0aa780 sp 0x7ffdfa0a9f40 WRITE of size 27 at 0x60300000002a thread T0 #0 0x548647 in strcpy (/home/lebedevri/rawspeed/build-Clang-SANITIZE/src/utilities/rstest/rstest+0x548647) #1 0x7fb9e3e3d234 in ompd_init() /repositories/llvm-project/openmp/runtime/src/ompd-specific.cpp:102:5 #2 0x7fb9e3dcb446 in __kmp_do_serial_initialize() /repositories/llvm-project/openmp/runtime/src/kmp_runtime.cpp:6742:3 #3 0x7fb9e3dcb40b in __kmp_get_global_thread_id_reg /repositories/llvm-project/openmp/runtime/src/kmp_runtime.cpp:251:7 #4 0x59e035 in main /home/lebedevri/rawspeed/build-Clang-SANITIZE/../src/utilities/rstest/rstest.cpp:491 #5 0x7fb9e3762d09 in __libc_start_main csu/../csu/libc-start.c:308:16 #6 0x4df449 in _start (/home/lebedevri/rawspeed/build-Clang-SANITIZE/src/utilities/rstest/rstest+0x4df449) 0x60300000002a is located 0 bytes to the right of 26-byte region [0x603000000010,0x60300000002a) allocated by thread T0 here: #0 0x55cc5d in malloc (/home/lebedevri/rawspeed/build-Clang-SANITIZE/src/utilities/rstest/rstest+0x55cc5d) #1 0x7fb9e3e3d224 in ompd_init() /repositories/llvm-project/openmp/runtime/src/ompd-specific.cpp:101:17 #2 0x7fb9e3762d09 in __libc_start_main csu/../csu/libc-start.c:308:16 SUMMARY: AddressSanitizer: heap-buffer-overflow (/home/lebedevri/rawspeed/build-Clang-SANITIZE/src/utilities/rstest/rstest+0x548647) in strcpy Shadow bytes around the buggy address: 0x0c067fff7fb0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 0x0c067fff7fc0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 0x0c067fff7fd0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 0x0c067fff7fe0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 0x0c067fff7ff0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 =>0x0c067fff8000: fa fa 00 00 00[02]fa fa fa fa fa fa fa fa fa fa 0x0c067fff8010: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa 0x0c067fff8020: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa 0x0c067fff8030: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa 0x0c067fff8040: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa 0x0c067fff8050: fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa fa Shadow byte legend (one shadow byte represents 8 application bytes): Addressable: 00 Partially addressable: 01 02 03 04 05 06 07 Heap left redzone: fa Freed heap region: fd Stack left redzone: f1 Stack mid redzone: f2 Stack right redzone: f3 Stack after return: f5 Stack use after scope: f8 Global redzone: f9 Global init order: f6 Poisoned by user: f7 Container overflow: fc Array cookie: ac Intra object redzone: bb ASan internal: fe Left alloca redzone: ca Right alloca redzone: cb ==30949==ABORTING Aborted ```	2021-07-13 15:36:46 +03:00
Simon Pilgrim	3cee36c5ac	[X86][SSE] X86ISD::FSETCC nodes (cmpss/cmpsd) return a 0/-1 allbits signbits result (REAPPLIED) Annoyingly, i686 cmpsd handling still fails to remove the unnecessary neg(and(x,1)) Reapplied rGe4aa6ad13216 with fix for intrinsic variants of the opcode which uses a vector return type	2021-07-13 12:31:09 +01:00
Frederik Gossen	9c90725eae	[MLIR] Fix documentation of the `ExecutionEngine` in the toy tutorial example Differential Revision: https://reviews.llvm.org/D105813	2021-07-13 13:23:43 +02:00
George Rokos	bb0166dc72	[libomptarget] Update device pointer only if needed Currently, libomptarget will always perform a host-to-device memory transfer in order to update the device pointer of a PTR_AND_OBJ entry. This is not always necessary because the device pointer may have been set to the correct pointee address already, so we can eliminate the redundant memory transfer.	2021-07-13 04:18:55 -07:00
Hafiz Abid Qadeer	b205f2bb89	[AMDGPU] Handle s_branch to another section. Currently, if target of s_branch instruction is in another section, it will fail with the error of undefined label. Although in this case, the label is not undefined but present in another section. This patch tries to handle this issue. So while handling fixup_si_sopp_br fixup in getRelocType, if the target label is undefined we issue an error as before. If it is defined, a new relocation type R_AMDGPU_REL16 is returned. This issue has been reported in https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100181 and https://bugs.llvm.org/show_bug.cgi?id=45887. Before https://reviews.llvm.org/D79943, we used to get an crash for this scenario. The crash is fixed now but the we still get an undefined label error. Jumps to other section can arise with hold/cold splitting. A patch to handle the relocation in lld will follow shortly. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D105760	2021-07-13 12:17:47 +01:00
Jon Chesterfield	b6b53ffef4	[libomptarget][devicertl] Remove branches around setting parallelLevel Simplifies control flow to allow store/load forwarding This change folds two basic blocks into one, leaving a single store to parallelLevel. This is a step towards spmd kernels with sufficiently aggressive inlining folding the loads from parallelLevel and thus discarding the nested parallel handling when it is unused. Transform: ``` int threadId = GetThreadIdInBlock(); if (threadId == 0) { parallelLevel[0] = expr; } else if (GetLaneId() == 0) { parallelLevel[GetWarpId()] = expr; } // => if (GetLaneId() == 0) { parallelLevel[GetWarpId()] = expr; } // because unsigned GetLaneId() { return GetThreadIdInBlock() & (WARPSIZE - 1);} // so whenever threadId == 0, GetLaneId() is also 0. ``` That replaces a store in two distinct basic blocks with as single store. A more aggressive follow up is possible if the threads in the warp/wave race to write the same value to the same address. This is not done as part of this change. ``` if (GetLaneId() == 0) { parallelLevel[GetWarpId()] = expr; } // => parallelLevel[GetWarpId()] = expr; // because unsigned GetWarpId() { return GetThreadIdInBlock() / WARPSIZE; } // so GetWarpId will index the same element for every thread in the warp // and, because expr is lane-invariant in this case, every lane stores the // same value to this unique address ``` Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D105699	2021-07-13 12:06:57 +01:00
Jan Kratochvil	72748488ad	[lldb] Fix editline unicode on Linux Based on: [lldb-dev] proposed change to remove conditional WCHAR support in libedit wrapper https://lists.llvm.org/pipermail/lldb-dev/2021-July/016961.html There is already setlocale in lldb/source/Core/IOHandlerCursesGUI.cpp but that does not apply for Editline GUI editing. Unaware how to make automated test for this, it requires pty. Reviewed By: teemperor Differential Revision: https://reviews.llvm.org/D105779	2021-07-13 12:37:53 +02:00
Nicolas Vasilache	af55335924	[mlir][Linalg] Better support for bufferizing non-tensor results. Clean up corner cases related to elemental tensor / buffer type return values that would previously fail. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D105857	2021-07-13 10:27:40 +00:00
Simon Pilgrim	afdae7c5d7	[X86][SSE] Add signbit tests to show cmpss/cmpsd intrinsics not recognised as 'allbits' results. This adds test coverage for the crash reported on rGe4aa6ad13216	2021-07-13 11:25:52 +01:00
Tim Northover	85cb4f9904	Support: reduce stack used in default size test. When the sanitizers aren't enabled they can use more than 1KB of stack, causing an overflow where there shouldn't be. Should fix Green Dragon test.	2021-07-13 11:24:12 +01:00
Nicolas Vasilache	e312fc49ae	[mlir][Linalg] Add layout specification support to bufferization. Previously, linalg bufferization always had to be conservative at function boundaries and assume the most dynamic strided memref layout. This revision introduce the mechanism to specify a linalg.buffer_layout function argument attribute that carries an affine map used to set a less pessimistic layout. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D105859	2021-07-13 10:22:18 +00:00
Sebastian Neubauer	ad2c66ec5d	[AMDGPU] Optimize VGPR LiveRange in waterfall loops The loops are run exactly once per lane, so VGPRs do not need to be saved. Use the SIOptimizeVGPRLiveRange pass to add phi nodes that take undef when coming from the loop. There is still a shortcoming: Return values from a function call in the loop are copied because their live range conflicts with the live range of arguments, even if arguments are only IMPLICIT_DEF after the phi insertion. Differential Revision: https://reviews.llvm.org/D105192	2021-07-13 12:15:08 +02:00
Sebastian Neubauer	9d72c0ad43	[AMDGPU] Mark waterfall loops as SI_WATERFALL_LOOP This way, they can be detected later, e.g. by the SIOptimizeVGPRLiveRange pass. Differential Revision: https://reviews.llvm.org/D105467	2021-07-13 12:15:08 +02:00
Anton Zabaznov	78463ebde2	[OpenCL] Add support of __opencl_c_generic_address_space feature macro Reviewed By: Anastasia Differential Revision: https://reviews.llvm.org/D103401	2021-07-13 13:14:10 +03:00

1 2 3 4 5 ...

393508 Commits All Branches Search

393508 Commits

All Branches