llvm-project

Commit Graph

Author	SHA1	Message	Date
Fangrui Song	512c03bac4	[DebugInfo] Add a DWARFDataExtractor constructor that takes ArrayRef<uint8_t> Similar to D67797 (DataExtractor).	2020-02-09 17:45:32 -08:00
Matt Arsenault	312a9d1b83	GlobalISel: Fix narrowScalar for G_{CTLZ\|CTTZ}_ZERO_UNDEF Narrow these for 64-bit VALU for AMDGPU.	2020-02-09 19:02:38 -05:00
Matt Arsenault	c437f6c687	AMDGPU/GlobalISel: Split 64-bit G_CTPOP in RegBankSelect	2020-02-09 18:39:33 -05:00
Matt Arsenault	6135f5eda4	GlobalISel: Fix narrowing of G_CTLZ/G_CTTZ The result type is separate from the source type.	2020-02-09 18:11:43 -05:00
Matt Arsenault	2126c70e3a	AMDGPU/GlobalISel: Don't mis-select vector index on a constant Vector indexing with a constant index should be folded out in the legalizer, but this was accidentally falling through. This would produce the indexing operation with $noreg. Handle this case as a dynamic index just in case a bug like this happens again in the future.	2020-02-09 18:02:37 -05:00
Matt Arsenault	f4a38c114e	AMDGPU/GlobalISel: Look through casts when legalizing vector indexing We were failing to find constants that were casted. I feel like the artifact combiner should have folded the constant in the trunc before the custom lowering, but that doesn't happen.	2020-02-09 18:02:10 -05:00
Matt Arsenault	00115d767f	AMDGPU: Remove dead kill handling At one point a custom node was used for kill handling, but now the intrinsic is directly selected. Remove leftover pattern machinery.	2020-02-09 17:59:24 -05:00
Matt Arsenault	6e1770821f	AMDGPU: Fix SI_IF lowering when the save exec reg has terminator uses Reverts part of `6524a7a2b9`. Since that commit, the expansion was ignoring the actual save exec register produced by the instruction, and looking at other instructions. I do not understand why it was looking at other instructions, but relying on this scan was wrong. Fixes verifier errors after SI_IF is tail duplicated, which should be correct to do. The results were fed into a phi, which was lowered to the S_MOV_B64_term instructions.	2020-02-09 17:59:19 -05:00
Simon Pilgrim	29e646fe65	[X86] combineConcatVectorOps - combine VROTLI/VROTRI ops Fix issue mentioned on rGe82e17d4d4ca - non-AVX512BW targets failed to concatenate 256-bit rotations back to 512-bits (split during shuffle lowering as they don't have v32i16/v64i8 types).	2020-02-09 21:50:10 +00:00
Craig Topper	656d66f5fc	[X86] Use custom isel for (X86sbb_flag 0, 0) so we can use 32-bit SBB for i8/i16. We were using MOV32r0 and an extract_subreg as an input. By using custom isel we can move the extract_subreg to after the SBB instead of on the input.	2020-02-09 13:19:35 -08:00
Craig Topper	e1cbfecdb8	[X86] Add flag result VT to a MOV32r0 created in X86DAGToDAGISel::Select The flag isn't used, but I believe this matches the MOV32r0 that would be created by the table emitter. This should allow this node to be CSEed with any others created by the table.	2020-02-09 13:19:21 -08:00
Simon Pilgrim	e82e17d4d4	[X86] Add lowerShuffleAsBitRotate (PR44379) As noted on PR44379, we didn't attempt to lower vector shuffles using bit rotations on XOP/AVX512F targets. This patch lowers to uniform ISD:ROTL nodes - ROTR isn't supported by XOP and they are interchangeable for constant values anyway. There might be cases where targets without ISD:ROTL support would benefit from this (expanding to SRL+SHL+OR), which I'll investigate in a future patch. Also, non-AVX512BW targets fail to concatenate 256-bit rotations back to 512-bits (split during shuffle lowering as they don't have v32i16/v64i8 types).	2020-02-09 21:15:03 +00:00
Craig Topper	dd262222b4	[X86] Use MVT::i32 for the type of a MOV32r0 created in X86DAGToDAGISel::Select. Not sure if this really matters. The VT isn't really used after this point. At best it might affect CSE.	2020-02-09 11:57:42 -08:00
Craig Topper	dbcc1392b3	[X86] Remove isel patterns that include a vselect/X86selects and a strict FP node. A vselect+strictfp node is not equivalent to a masked operation. The exceptions of the strictfp node are not masked by a vselect after it so we can't match it to a masked operation. We already had a hack in IsLegalToFold to prevent these patterns from matching. This patch removes that hack and removes the patterns.	2020-02-09 11:45:54 -08:00
Jan Vesely	85e2fa44c6	libclc/r600: Use target specific builtins to implement rsqrt and native_rsqrt Fixes OCL CTS rsqrt and half_rsqrt (1 thread, scalaer) tests on AMD Turks. Reviewer: awatry Differential Revision: https://reviews.llvm.org/D74016	2020-02-09 14:42:15 -05:00
Jan Vesely	4b23a2e8e9	libclc: Move rsqrt implementation to a .cl file Reviewer: awatry Differential Revision: https://reviews.llvm.org/D74013	2020-02-09 14:42:09 -05:00
Simon Pilgrim	0ae119f835	[X86][XOP] Add XOP target to vXi16/vXi8 shuffle tests Helps with bit rotation test coverage for PR44379	2020-02-09 18:35:51 +00:00
Simon Pilgrim	2278073125	[X86][SSE] Add more tests showing failure to lower shuffles as bit rotations	2020-02-09 18:35:51 +00:00
Simon Pilgrim	29621b2534	[X86] Rename matchShuffleAsRotate - matchShuffleAsByteRotate. NFCI. A matchShuffleAsBitRotate variant will be added soon and we need to make the difference more obvious.	2020-02-09 18:35:50 +00:00
Jan Kratochvil	9d223a0106	[lldb] [doc] Status: Linux: Update the paragraph	2020-02-09 18:13:04 +01:00
Kamil Rytarowski	273f638384	[LLDB] [doc] Document NetBSD status and sort OSs alphabetically	2020-02-09 18:02:07 +01:00
LLVM GN Syncbot	628462e30a	[gn build] Port `a17f03bd93`	2020-02-09 15:41:05 +00:00
Sanjay Patel	a17f03bd93	[VectorCombine] new IR transform pass for partial vector ops We have several bug reports that could be characterized as "reducing scalarization", and this topic was also raised on llvm-dev recently: http://lists.llvm.org/pipermail/llvm-dev/2020-January/138157.html ...so I'm proposing that we deal with these patterns in a new, lightweight IR vector pass that runs before/after other vectorization passes. There are 4 alternate options that I can think of to deal with this kind of problem (and we've seen various attempts at all of these), but they all have flaws: InstCombine - can't happen without TTI, but we don't want target-specific folds there. SDAG - too late to assist other vectorization passes; TLI is not equipped for these kind of cost queries; limited to a single basic block. CGP - too late to assist other vectorization passes; would need to re-implement basic cleanups like CSE/instcombine. SLP - doesn't fit with existing transforms; limited to a single basic block. This initial patch/transform is based on existing code in AggressiveInstCombine: we walk backwards through the function looking for a pattern match. But we diverge from that cost-independent IR canonicalization pass by using TTI to decide if the vector alternative is profitable. We probably have at least 10 similar bug reports/patterns (binops, constants, inserts, cheap shuffles, etc) that would fit in this pass as follow-up enhancements. It's possible that we could iterate on a worklist to fix-point like InstCombine does, but it's safer to start with a most basic case and evolve from there, so I didn't try to do anything fancy with this initial implementation. Differential Revision: https://reviews.llvm.org/D73480	2020-02-09 10:04:41 -05:00
Jan Kratochvil	74857b4260	[lldb] [doc] Status: Debugserver (remote debugging) is OK now	2020-02-09 15:22:36 +01:00
Jan Kratochvil	8b37e1e5ac	[lldb] [doc] Testing: Fix typos	2020-02-09 15:11:38 +01:00
Kamil Rytarowski	5a285f207e	[LLDB] [doc] Remove note about libpanel(3) and NetBSD libpanel(3) is now supported in all supported versions of NetBSD.	2020-02-09 15:01:17 +01:00
Kamil Rytarowski	0ea4d18a28	[LLDB] [doc] Update the current status of pkgsrc (NetBSD) building	2020-02-09 15:01:17 +01:00
Jan Kratochvil	420a518068	[lldb] [testsuite] TestGdbRemoteLibrariesSvr4Support: Fix symlinked builddir When I have symlinked builddir on Fedora 31 x86_64 I get: FAIL: test_libraries_svr4_libs_present (TestGdbRemoteLibrariesSvr4Support.TestGdbRemoteLibrariesSvr4Support) ---------------------------------------------------------------------- ... File "lldb/packages/Python/lldbsuite/test/tools/lldb-server/libraries-svr4/TestGdbRemoteLibrariesSvr4Support.py", line 106, in libraries_svr4_libs_present self.assertIn(self.getBuildDir() + "/" + lib, libraries_svr4_names) AssertionError: '/home/jkratoch/redhat/llvm-monorepo-clangassertsymlink/lldb-test-build.noindex/tools/lldb-server/libraries-svr4/TestGdbRemoteLibrariesSvr4Support.test_libraries_svr4_libs_present/libsvr4lib_a.so' not found in ['/home/jkratoch/redhat/llvm-monorepo/lldb/packages/Python/lldbsuite/test/tools/lldb-server/libraries-svr4/linux-vdso.so.1', '/quad/home/jkratoch/redhat/llvm-monorepo-clangassertsymlink/lldb-test-build.noindex/tools/lldb-server/libraries-svr4/TestGdbRemoteLibrariesSvr4Support.test_libraries_svr4_libs_present/libsvr4lib_a.so', '/quad/home/jkratoch/redhat/llvm-monorepo-clangassertsymlink/lldb-test-build.noindex/tools/lldb-server/libraries-svr4/TestGdbRemoteLibrariesSvr4Support.test_libraries_svr4_libs_present/libsvr4lib_b".so', '/usr/lib64/libdl-2.30.so', '/usr/lib64/libstdc++.so.6.0.27', '/usr/lib64/libm-2.30.so', '/usr/lib64/libgcc_s-9-20190827.so.1', '/usr/lib64/libc-2.30.so', '/usr/lib64/ld-2.30.so'] Config=x86_64-/quad/home/jkratoch/redhat/llvm-monorepo-clangassertsymlink/bin/clang-11 ---------------------------------------------------------------------- Differential Revision: https://reviews.llvm.org/D74295	2020-02-09 14:49:38 +01:00
Simon Pilgrim	3ec6de07e9	Fix signed/unsigned warning.	2020-02-09 13:35:03 +00:00
Simon Pilgrim	644d56b432	[X86] Recognise ROTLI/ROTRI rotations as faux shuffles Allows us to combine rotations with shuffles. One of many things necessary to fix PR44379 (lowering shuffles to rotations)	2020-02-09 12:25:49 +00:00
Ehud Katz	3b70ee27a5	[LoopExtractor] Convert LoopExtractor from LoopPass to ModulePass The LoopExtractor created new functions (by definition), which violates the restrictions of a LoopPass. The correct implementation of this pass should be as a ModulePass. Includes reverting rL82990 implications on the LoopExtractor. Fixes PR3082 and PR8929. Differential Revision: https://reviews.llvm.org/D69069	2020-02-09 12:25:21 +02:00
Ayman Musa	10c7b7708b	[AggressiveInstCombine] Add test with baseline CHECKs for aggressive inst combine for SELECT.	2020-02-09 12:07:25 +02:00
serge_sans_paille	e67cbac812	Support -fstack-clash-protection for x86 Implement protection against the stack clash attack [0] through inline stack probing. Probe stack allocation every PAGE_SIZE during frame lowering or dynamic allocation to make sure the page guard, if any, is touched when touching the stack, in a similar manner to GCC[1]. This extends the existing `probe-stack' mechanism with a special value `inline-asm'. Technically the former uses function call before stack allocation while this patch provides inlined stack probes and chunk allocation. Only implemented for x86. [0] https://www.qualys.com/2017/06/19/stack-clash/stack-clash.txt [1] https://gcc.gnu.org/ml/gcc-patches/2017-07/msg00556.html This a recommit of `39f50da2a3` with proper LiveIn declaration, better option handling and more portable testing. Differential Revision: https://reviews.llvm.org/D68720	2020-02-09 10:42:45 +01:00
serge-sans-paille	4546211600	Revert "Support -fstack-clash-protection for x86" This reverts commit `0fd51a4554`. Failures: http://lab.llvm.org:8011/builders/llvm-clang-win-x-armv7l/builds/4354	2020-02-09 10:06:31 +01:00
serge_sans_paille	0fd51a4554	Support -fstack-clash-protection for x86 Implement protection against the stack clash attack [0] through inline stack probing. Probe stack allocation every PAGE_SIZE during frame lowering or dynamic allocation to make sure the page guard, if any, is touched when touching the stack, in a similar manner to GCC[1]. This extends the existing `probe-stack' mechanism with a special value `inline-asm'. Technically the former uses function call before stack allocation while this patch provides inlined stack probes and chunk allocation. Only implemented for x86. [0] https://www.qualys.com/2017/06/19/stack-clash/stack-clash.txt [1] https://gcc.gnu.org/ml/gcc-patches/2017-07/msg00556.html This a recommit of `39f50da2a3` with proper LiveIn declaration, better option handling and more portable testing. Differential Revision: https://reviews.llvm.org/D68720	2020-02-09 09:35:42 +01:00
Fangrui Song	1732f50ee0	[ELF][test] Use llvm-readelf -l instead of llvm-readobj -l for some memory region tests	2020-02-08 22:45:00 -08:00
MaheshRavishankar	aaddca1efd	[mlir][GPUToSPIRV] Modify the lowering of gpu.block_dim to be consistent with Vulkan SPEC The existing lowering of gpu.block_dim added a global variable with the WorkGroupSize decoration. This raises an error within Vulkan/SPIR-V validation since Vulkan requires this to have a constant initializer. This is not yet supported in SPIR-V dialect. Changing the lowering to return the workgroup size as a constant value instead, obtained from spv.entry_point_abi attribute gets around the issue for now. The validation goes through since the workgroup size is specified using spv.execution_mode operation.	2020-02-08 22:30:03 -08:00
Craig Topper	e629674176	[X86] Add more scalar intrinsic instructions to isNonFoldablePartialRegisterLoad. I think this covers most if not all of the scalar intrinsic instructions.	2020-02-08 20:41:36 -08:00
Johannes Doerfert	b0c77c36d2	[Attributor] Add an Attributor CGSCC pass and run it In addition to the module pass, this patch introduces a CGSCC pass that runs the Attributor on a strongly connected component of the call graph (both old and new PM). The Attributor was always design to be used on a subset of functions which makes this patch mostly mechanical. The one change is that we give up `norecurse` deduction in the module pass in favor of doing it during the CGSCC pass. This makes the interfaces simpler but can be revisited if needed. Reviewed By: hfinkel Differential Revision: https://reviews.llvm.org/D70767	2020-02-08 21:27:34 -06:00
Fangrui Song	ee3f13b81d	Fix -Wunused-lambda-capture for -DLLVM_ENABLE_ASSERTIONS=off builds after `6556c615f3`	2020-02-08 19:03:58 -08:00
Johannes Doerfert	08c0a06d8f	[FIX] Ordering problem accidentally introduced with D72304	2020-02-08 20:14:41 -06:00
Craig Topper	0152b106ae	[X86] Add the recently added (V)CVTSS2SI/CVTSD2SI instructions used for LRINT/LLRINT to the load folding tables.	2020-02-08 17:54:48 -08:00
Johannes Doerfert	c057d1d3af	[FIX] Fix warning in LazyCallGraphTest caused by D70927	2020-02-08 18:58:16 -06:00
fady	e8a436c5ea	[OpenMP][OMPIRBuilder] Add Directives (master and critical) to OMPBuilder. Add support for Master and Critical directive in the OMPIRBuilder. Both make use of a new common interface for emitting inlined OMP regions called `emitInlinedRegion` which was added in this patch as well. Also this patch modifies clang to use the new directives when `-fopenmp-enable-irbuilder` commandline option is passed. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D72304	2020-02-08 18:55:48 -06:00
Johannes Doerfert	e565db49c6	[OpenMP][Opt] Delete terminating and read-only parallel regions Parallel regions known to be read-only, e.g., after we removed all dead write accesses, and terminating (`willreturn`) can be removed. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D69954	2020-02-08 18:52:04 -06:00
Johannes Doerfert	e28936f613	[OpenMP][Opt] Annotate known runtime functions and deduplicate more This adds ~27 more runtime calls to the OpenMPKinds.def file, all with attributes. We deduplicate 16 of those automatically in function = thread scope. And we annotate all of them automatically during the OpenMPOpt discovery step. A test with all omp_XXXX runtime calls to track annotation coverage is included. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D69984	2020-02-08 18:35:39 -06:00
Nico Weber	8df173f399	[gn build] (manually) port `72277ecd62` and the LLVMBuild bit of `9548b74a83`	2020-02-08 19:01:55 -05:00
Craig Topper	d643a39aba	[X86] Use any_fadd/sub/mul/div/sqrt with the AVX512 scalar_*_patterns. Making sure not to use them with patterns for masked instructions. Also fix FMA patterns that were matching strict_fma+x86selects to masked instructions.	2020-02-08 15:54:40 -08:00
River Riddle	2f94ce0dcf	[mlir][DeclarativeParser] Move several missed parsers over to the declarative form. Differential Revision: https://reviews.llvm.org/D74283	2020-02-08 15:47:55 -08:00
River Riddle	1b2c16f2ae	[mlir][DeclarativeParser] Add support for attributes with buildable types. This revision adds support in the declarative assembly form for printing attributes with buildable types without the type, and moves several more parsers over to the declarative form. Differential Revision: https://reviews.llvm.org/D74276	2020-02-08 15:46:46 -08:00

1 2 3 4 5 ...

342142 Commits All Branches Search

342142 Commits

All Branches