llvm-project

Commit Graph

Author	SHA1	Message	Date
Johannes Doerfert	095cecbe0d	[OpenMP] `omp begin/end declare variant` - part 1, parsing This is the first part extracted from D71179 and cleaned up. This patch provides parsing support for `omp begin/end declare variant`, as defined in OpenMP technical report 8 (TR8) [0]. A major purpose of this patch is to provide proper math.h/cmath support for OpenMP target offloading. See PR42061, PR42798, PR42799. The current code was developed with this feature in mind, see [1]. [0] https://www.openmp.org/wp-content/uploads/openmp-TR8.pdf [1] https://reviews.llvm.org/D61399#change-496lQkg0mhRN Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D74941	2020-03-27 02:30:58 -05:00
Johannes Doerfert	b29322435a	[OpenMP][NFC] Open `llvm` and `llvm::omp` namespace in OpenMPClause.cpp	2020-03-27 02:30:57 -05:00
Johannes Doerfert	56d1553dd2	[OpenMP][NFC] Outline common functionality (skipUntilPragmaOpenMPEnd) The same code was repeated multiple times, we put it in a function now.	2020-03-27 02:30:57 -05:00
Fangrui Song	b3cc5dcef0	[MCInstPrinter] Add parameter `Address` to MCInstPrinter::printAliasInstr. NFC Follow-up of D72172.	2020-03-27 00:03:32 -07:00
Shengchen Kan	1fb4f99a21	[X86][MC] Fix the bug for prefix padding support Summary: There is a tiny logic error of D75300, making branch is not correctly aligned with option -x86-pad-max-prefix-size Reviewers: reames, MaskRay, craig.topper, LuoYuanke, jyknight Reviewed By: reames Subscribers: hiraditya, llvm-commits, annita.zhang Tags: #llvm Differential Revision: https://reviews.llvm.org/D76285	2020-03-27 14:16:09 +08:00
Kai Luo	26b46b67d8	[PowerPC] Fix test for PR45297 to adapt build without asserts. NFC.	2020-03-27 05:28:34 +00:00
Kai Luo	351b192315	[PowerPC] Enhance test for PR45297. NFC.	2020-03-27 04:45:21 +00:00
Uday Bondhugula	ad4b4acbb0	[MLIR][NFC] drop some unnecessary includes Drop unnecessary includes Differential Revision: https://reviews.llvm.org/D76898	2020-03-27 09:17:27 +05:30
Juneyoung Lee	1bcc500b48	[DAGCombine] Add basic optimizations for FREEZE in SelDag Summary: This patch is the first effort to adding basic optimizations for FREEZE in SelDag. Reviewers: spatel, lebedev.ri Reviewed By: spatel Subscribers: xbolva00, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76707	2020-03-27 12:20:39 +09:00
David Blaikie	819e540208	Use llvm_unreachable after a fully covered/always-returning switch	2020-03-26 20:09:57 -07:00
David Blaikie	324f5a14d7	Make llvm::function_ref's operator bool explicit This can avoid all sorts of mistakes with implicit conversion (indirectly) to int, etc. I'm quite surprise there aren't any things to fixup with this - but I guess most uses of function_ref aren't optional/nullable.	2020-03-26 20:09:57 -07:00
Zakk Chen	64fe841856	Fix typo, targetFeature should be lowercase. this fixing also enable llc -mattr=+cpuhelp Reviewers: ziangwan, kongyi Reviewed By: kongyi Tags: #llvm Differential Revision: https://reviews.llvm.org/D76757	2020-03-26 19:40:04 -07:00
Kai Wang	1a6b7318dd	[NFC] Clang format for the ELF header and ARM build attributes. Differential Revision: https://reviews.llvm.org/D76819	2020-03-27 09:53:12 +08:00
Leonard Chan	5d929e6646	Move setBugReportMsg() out from under a conditional Fixes a build break with LLVM_ENABLE_BACKTRACES=OFF. Differential Revision: https://reviews.llvm.org/D76893	2020-03-26 16:39:03 -07:00
Cyndy Ishida	d26e0bcfba	[llvm][TextAPI/MachO] silence clang-tidy warnings, NFC * applies only to tests	2020-03-26 16:32:04 -07:00
Dan Gohman	66bfbedbdf	[WebAssembly] Support wasm exports with zero-length names. Zero-length strings are valid export names in WebAssembly, so allow users to specify them. Differential Revision: https://reviews.llvm.org/D71793	2020-03-26 16:20:43 -07:00
Dan Gohman	d865437d9c	[WebAssembly] Fix the order of destructors in the LowerGlobalDtors pass. Fix the LowerGlobalDtors pass to run destructors in the same order as the regular LLVM destructor lowering -- in reverse order. Adjacent destructors with the same associated object are grouped, but destructors are not reordered based on associated objects. Differential Revision: https://reviews.llvm.org/D70685	2020-03-26 16:19:02 -07:00
Douglas Yung	5db37f3bca	Make PS4 use -fno-use-init-array only as the ABI does not support .init_array. Reviewed by Paul Robinson	2020-03-26 15:45:40 -07:00
Sid Manning	b0da094983	[Hexagon] Add support for Linux/Musl ABI (part 2) A continuation of https://reviews.llvm.org/D72701. This adds support needed in clang. Differential Revision: https://reviews.llvm.org/D75638	2020-03-26 17:19:46 -05:00
Stanislav Mekhanoshin	4c4b71843b	[AMDGPU] Propagate amdgpu-waves-per-eu to callees Differential Revision: https://reviews.llvm.org/D76868	2020-03-26 14:43:44 -07:00
Alexey Bataev	2a43a1610d	[OPENMP50]Fix the checks for the nesting of scan directives. Fixed the check for the orhaned scan directives and improved checks for parallel for and parallel for simd directives.	2020-03-26 17:30:02 -04:00
Paolo Severini	aff75e1a1f	[lld][Wasm] Wasm-ld emits invalid .debug_ranges entries for non-live symbols When the debug info contains a relocation against a dead symbol, wasm-ld may emit spurious range-list terminator entries (entries with Start==0 and End==0). This change fixes this by emitting the WasmRelocation Addend as End value for a non-live symbol. Reviewed by: sbc100, dblaikie Differential Revision: https://reviews.llvm.org/D74781	2020-03-26 14:26:31 -07:00
LLVM GN Syncbot	196286434d	[gn build] Port `9f7d4150b9`	2020-03-26 21:10:45 +00:00
Craig Topper	9f7d4150b9	[X86] Move combineLoopMAddPattern and combineLoopSADPattern to an IR pass before SelecitonDAG. These transforms rely on a vector reduction flag on the SDNode set by SelectionDAGBuilder. This flag exists because SelectionDAG can't see across basic blocks so SelectionDAGBuilder is looking across and saving the info. X86 is the only target that uses this flag currently. By removing the X86 code we can remove the flag and the SelectionDAGBuilder code. This pass adds a dedicated IR pass for X86 that looks across the blocks and transforms the IR into a form that the X86 SelectionDAG can finish. An advantage of this new approach is that we can enhance it to shrink the phi nodes and final reduction tree based on the zeroes that we need to concatenate to bring the partially reduced reduction back up to the original width. Differential Revision: https://reviews.llvm.org/D76649	2020-03-26 14:10:20 -07:00
Sam Clegg	0731372ee2	[clang] Allow -DDEFAULT_SYSROOT to be a relative path In this case we interpret the path as relative the clang driver binary. This allows SDKs to be built that include clang along with a custom sysroot without requiring users to specify --sysroot to point to the directory where they installed the SDK. See https://github.com/WebAssembly/wasi-sdk/issues/58 Differential Revision: https://reviews.llvm.org/D76653	2020-03-26 13:48:57 -07:00
Simon Pilgrim	ad36491ebb	[X86] Prefer PACKUS(AND(),AND()) to SHUFFLE(PSHUFB(),PSHUFB()) on all targets Extends rG9d1721ce3926 to support AVX2+ targets.	2020-03-26 20:46:24 +00:00
Jay Foad	0fe096c4e9	[AMDGPU] Rename overloaded getMaxWavesPerEU to getWavesPerEUForWorkGroup Summary: I think Max in the name was misleading. NFC. Reviewers: arsenm, rampitec Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76860	2020-03-26 20:21:04 +00:00
Jay Foad	bb9c4fd7ea	[AMDGPU] Remove getMaxWavesPerCU in favour of getWavesPerWorkGroup. Summary: These methods were identical. I chose to remove getMaxWavesPerCU because I think Max in the name was misleading. NFC. Reviewers: arsenm, rampitec Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76859	2020-03-26 20:21:04 +00:00
Derek Schuff	e110897e28	[WEbAssembly] Clear frame base vreg in explicit-locals when stack pointer is dead Having an alloca in a function causes the stack pointer to be generated in the prolog, but if it's unused other than for debug info, explicit-locals will drop it and not allocate a local. In this case we need to reset the FrameBaseVreg. Differential Revision: https://reviews.llvm.org/D76784	2020-03-26 13:07:32 -07:00
Simon Pilgrim	39a52a19ed	[X86] lowerV16I8Shuffle - create v8i16 mask for PACKUS(AND(),AND()) patterns. We can improve computeKnownBits results by avoiding excess bitcasts. For this pattern we were doing: (v16i8 PACKUS(v8i16 BITCAST(v16i8 AND(V1, MASK)), v8i16 BITCAST(v16i8 AND(V2, MASK)))) By performing the MASK/AND with a v8i16 type and bitcasting V1/V2 directly we can help computeKnownBits see that the mask is clearing the upper bits and allows shuffle combining to peek through later on. This will be necessary to extend rG9d1721ce3926 to AVX2+ targets in a future patch.	2020-03-26 19:59:57 +00:00
Alexey Bataev	f9e71f4d9d	Revert "[OPENMP50]Add basic support for inscan reduction modifier." This reverts commit `8099e0fe82` to fix the problems with the Windows-based buildbots.	2020-03-26 15:57:19 -04:00
Luís Marques	ad1466f8cb	[sanitizer][RISCV] Implement SignalContext::GetWriteFlag for RISC-V This patch follows the approach also used for MIPS, where we decode the offending instruction to determine if the fault was caused by a read or write operation, as that seems to be the only relevant information we have in the signal context structure to determine that. Differential Revision: https://reviews.llvm.org/D75168	2020-03-26 19:55:35 +00:00
diggerlin	fdfe411e7c	[AIX] discard the label in the csect of function description and use qualname for linkage SUMMARY: SUMMARY for a source file "test.c" void foo() {}; llc will generate assembly code as (assembly patch) .globl foo .globl .foo .csect foo[DS] foo: .long .foo .long TOC[TC0] .long 0 and symbol table as (xcoff object file) [4] m 0x00000004 .data 1 unamex foo [5] a4 0x0000000c 0 0 SD DS 0 0 [6] m 0x00000004 .data 1 extern foo [7] a4 0x00000004 0 0 LD DS 0 0 After first patch, the assembly will be as .globl foo[DS] # -- Begin function foo .globl .foo .align 2 .csect foo[DS] .long .foo .long TOC[TC0] .long 0 and symbol table will as [6] m 0x00000004 .data 1 extern foo [7] a4 0x00000004 0 0 DS DS 0 0 Change the code for the assembly path and xcoff objectfile patch for llc. Reviewers: Jason Liu Subscribers: wuzish, nemanjai, hiraditya Differential Revision: https://reviews.llvm.org/D76162	2020-03-26 15:46:52 -04:00
Jon Chesterfield	856c995436	[libomptarget] Add missing elf_end call in elf_common.c Summary: [libomptarget] Add missing elf_end call in elf_common.c Noticed when reviewing D76843. Reviewers: simoll, jdoerfert, efocht, AndreyChurbanov, grokos, manorom Reviewed By: grokos Subscribers: openmp-commits Tags: #openmp Differential Revision: https://reviews.llvm.org/D76874	2020-03-26 19:07:33 +00:00
Alexey Bataev	8099e0fe82	[OPENMP50]Add basic support for inscan reduction modifier. Added basic support (parsing/sema checks) for the inscan modifier in the reduction clauses.	2020-03-26 14:51:09 -04:00
Michael Liao	6a9ad5f3f4	[cuda][hip] Add CUDA builtin surface/texture reference support. Summary: - Even though the bindless surface/texture interfaces are promoted, there are still code using surface/texture references. For example, [PR#26400](https://bugs.llvm.org/show_bug.cgi?id=26400) reports the compilation issue for code using `tex2D` with texture references. For better compatibility, this patch proposes the support of surface/texture references. - Due to the absent documentation and magic headers, it's believed that `nvcc` does use builtins for texture support. From the limited NVVM documentation[^nvvm] and NVPTX backend texture/surface related tests[^test], it's believed that surface/texture references are supported by replacing their reference types, which are annotated with `device_builtin_surface_type`/`device_builtin_texture_type`, with the corresponding handle-like object types, `cudaSurfaceObject_t` or `cudaTextureObject_t`, in the device-side compilation. On the host side, that global handle variables are registered and will be established and updated later when corresponding binding/unbinding APIs are called[^bind]. Surface/texture references are most like device global variables but represented in different types on the host and device sides. - In this patch, the following changes are proposed to support that behavior: + Refine `device_builtin_surface_type` and `device_builtin_texture_type` attributes to be applied on `Type` decl only to check whether a variable is of the surface/texture reference type. + Add hooks in code generation to replace that reference types with the correponding object types as well as all accesses to them. In particular, `nvvm.texsurf.handle.internal` should be used to load object handles from global reference variables[^texsurf] as well as metadata annotations. + Generate host-side registration with proper template argument parsing. --- [^nvvm]: https://docs.nvidia.com/cuda/pdf/NVVM_IR_Specification.pdf [^test]: https://raw.githubusercontent.com/llvm/llvm-project/master/llvm/test/CodeGen/NVPTX/tex-read-cuda.ll [^bind]: See section 3.2.11.1.2 ``Texture reference API` in [CUDA C Programming Guide](https://docs.nvidia.com/cuda/pdf/CUDA_C_Programming_Guide.pdf). [^texsurf]: According to NVVM IR, `nvvm.texsurf.handle` should be used. But, the current backend doesn't have that supported. We may revise that later. Reviewers: tra, rjmccall, yaxunl, a.sidorin Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76365	2020-03-26 14:44:52 -04:00
Scott Linder	bd12ecb88f	[AMDGPU] Fix PC register mapping in wave32 mode Summary: The PC_32 DWARF register is for a 32-bit process address space which we don't implement in AMDGCN; another way of putting this is that the size of the PC register is not a function of the wavefront size. If we ever implement a 32-bit process address space we will need to add two more DwarfFlavours i.e. we will need to represent the product of (wave32, wave64) x (64-bit address space, 32-bit address space). Tags: #llvm Differential Revision: https://reviews.llvm.org/D76732	2020-03-26 14:43:25 -04:00
David Blaikie	9002db05a2	Roll otherwise unused subexpressions into an assertion	2020-03-26 11:32:33 -07:00
Sanjay Patel	5237262feb	[InstCombine] add shuffle-with-bitcast-operand tests; NFC	2020-03-26 14:28:47 -04:00
Sterling Augustine	6c6fba8808	Correctly handle using foo = std::foo inside namespaces. Summary: The gdb pretty printer misprints variables declared via using declarations of the form: namespace foo { using string_view = std::string_view; string_view bar; } This change fixes that, by deferring the decision to ignore types not inside std until after desugaring. Reviewers: #libc! Subscribers: broadwaylamb, libcxx-commits Tags: #libc Differential Revision: https://reviews.llvm.org/D76816	2020-03-26 11:20:09 -07:00
Guillaume Chatelet	b727aabcb8	[Alignment][NFC] Use llvmTargetFrameLowering::getStackAlign Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Reviewed By: courbet Subscribers: wuzish, arsenm, jyknight, nemanjai, jvesely, nhaehnle, hiraditya, kbarton, fedor.sergeev, jrtc27, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76613	2020-03-26 18:15:53 +00:00
Jonathan Roelofs	7a89a5d81b	[InstCombine] Fix Incorrect fold of ashr+xor -> lshr w/ vectors Fixes https://bugs.llvm.org/show_bug.cgi?id=43665	2020-03-26 12:09:36 -06:00
Jinsong Ji	fe025a3490	[docs][Phabricator] git migration related update 1.Add instructions to update author when committing other's patch We have updated DeveloperPolicy to show how to change author in https://reviews.llvm.org/D72468 We should also update Phabricator page to include such infomation, in case people follow the steps here and forget to update author info. 2. Replace `git llvm push` with `git push` Reviewed By: probinson Differential Revision: https://reviews.llvm.org/D76718	2020-03-26 18:08:06 +00:00
Heejin Ahn	f033f201a7	[WebAssembly] Add test for event section order change Summary: This adds a test for D76752. Now the global section comes after the event section, and this change makes sure it is satisfied. Reviewers: sbc100, tlively Reviewed By: tlively Subscribers: dschuff, jgravelle-google, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76823	2020-03-26 11:05:24 -07:00
Marius Brehler	6946ca4b4c	[mlir] Rename CMake target MLIRQuantOps to MLIRQuant With commit `4d60f47` VectorOps was renamed to Vector and the naming of the CMake target was adjusted. With commit `363dd3f` QuantOps was renamed to Quant, but the naming of the CMake target is left untouched. This renames the CMake target.	2020-03-26 11:03:29 -07:00
Dan Liew	445b810fbd	[ASan] Fix issue where system log buffer was not cleared after reporting an issue. Summary: When ASan reports an issue the contents of the system log buffer (`error_message_buffer`) get flushed to the system log (via `LogFullErrorReport()`). After this happens the buffer is not cleared but this is usually fine because the process usually exits soon after reporting the issue. However, when ASan runs in `halt_on_error=0` mode execution continues without clearing the buffer. This leads to problems if more ASan issues are found and reported. 1. Duplicate ASan reports in the system log. The Nth (start counting from 1) ASan report will be duplicated (M - N) times in the system log if M is the number of ASan issues reported. 2. Lost ASan reports. Given a sufficient number of reports the buffer will fill up and consequently cannot be appended to. This means reports can be lost. The fix here is to reset `error_message_buffer_pos` to 0 which effectively clears the system log buffer. A test case is included but unfortunately it is Darwin specific because querying the system log is an OS specific activity. rdar://problem/55986279 Reviewers: kubamracek, yln, vitalybuka, kcc, filcab Subscribers: #sanitizers, llvm-commits Tags: #sanitizers Differential Revision: https://reviews.llvm.org/D76749	2020-03-26 10:56:17 -07:00
Sean Silva	3dceb6d246	Allow IndexType inside tensors. It's common in many dialects to use tensors to themselves hold tensor shapes (for example, the shape is itself the result of some non-trivial calculation). Currently, such dialects have to use `tensor<?xi64>` or worse (like allowing either i32 or i64 tensors to represent shapes). `tensor<?xindex>` is the natural type to represent this, but is currently disallowed. This patch allows it. Differential Revision: https://reviews.llvm.org/D76726	2020-03-26 10:52:48 -07:00
Sam McCall	47e7bdb107	Test that would have caught recovery-expr crashes in `0788acbccb`. NFC	2020-03-26 18:43:29 +01:00
Alex Zinenko	04ed07bc17	[mlir] StandardToLLVM: clean up conversion patterns for vector operations Summary: Provide a public VectorConvertToLLVMPattern utility class to implement conversions with automatic unrolling of operation on multidimensional vectors to lists of operations on single-dimensional vectors when lowering to the LLVM dialect. Drop the template-based check on the number of operands since the actual implementation does not depend on the operand number anymore. This check only creates spurious concepts (UnaryOpLowering, BinaryOpLowering, etc). Differential Revision: https://reviews.llvm.org/D76865	2020-03-26 18:24:10 +01:00
Alex Zinenko	987fbae0ad	[mlir] StandardToLLVM: make one-to-one convresion pattern publicly available Summary: The Standard-to-LLVM dialect convresion has a set of utility classes that simplify conversions, including patterns that provide one-to-one conversion operation conversion with optional result packing. Expose these classes in a public header so that conversions other than Standard-to-LLVM (e.g. vectors, or LLVM-based intrinsics) could also use them. Since the patterns are implemented as class templates and in order to keep the code size limited, keep the implementation private by resorting to op identifiers instead of template-based builders. Differential Revision: https://reviews.llvm.org/D76864	2020-03-26 18:24:07 +01:00

1 2 3 4 5 ...

346357 Commits All Branches Search

346357 Commits

All Branches