llvm-project

Commit Graph

Author	SHA1	Message	Date
Petr Hosek	082b89b25f	[builtins] Reformat builtins with clang-format Update formatting to use the LLVM style. This is part of the cleanup proposed in "[RFC] compiler-rt builtins cleanup and refactoring". Differential Revision: https://reviews.llvm.org/D60351 llvm-svn: 359410	2019-04-28 21:53:32 +00:00
Simon Pilgrim	65f12f66f6	[X86] Add PR39921 HADD pairwise reduction test and AVX2 test coverage llvm-svn: 359409	2019-04-28 21:04:47 +00:00
Simon Pilgrim	85bacd0f95	[X86][AVX] Add fast-hops target for add/fadd reduction tests llvm-svn: 359408	2019-04-28 20:04:08 +00:00
Simon Pilgrim	e375257e95	[X86] Add PR39936 HADD Tests llvm-svn: 359407	2019-04-28 20:03:11 +00:00
Simon Pilgrim	d5cc753b6d	[X86][SSE] combineExtractVectorElt - add early-out to return zero/undef for out-of-range extraction indices. llvm-svn: 359406	2019-04-28 19:12:58 +00:00
Eric Fiselier	a1f420de8c	Fix permission error while running bots llvm-svn: 359405	2019-04-28 17:00:46 +00:00
Eric Fiselier	290a3a3648	attempt to unbreak build bots llvm-svn: 359404	2019-04-28 16:42:49 +00:00
Eric Fiselier	2f5f9a159b	Attempt to switch to auto-scaling bots llvm-svn: 359403	2019-04-28 15:54:50 +00:00
Nikita Popov	7a94795b2b	[ConstantRange] Add makeExactNoWrapRegion() I got confused on the terminology, and the change in D60598 was not correct. I was thinking of "exact" in terms of the result being non-approximate. However, the relevant distinction here is whether the result is * Largest range such that: Forall Y in Other: Forall X in Result: X BinOp Y does not wrap. (makeGuaranteedNoWrapRegion) * Smallest range such that: Forall Y in Other: Forall X not in Result: X BinOp Y wraps. (A hypothetical makeAllowedNoWrapRegion) * Both. (makeExactNoWrapRegion) I'm adding a separate makeExactNoWrapRegion method accepting a single APInt (same as makeExactICmpRegion) and using it in the places where the guarantee is relevant. Differential Revision: https://reviews.llvm.org/D60960 llvm-svn: 359402	2019-04-28 15:40:56 +00:00
Simon Pilgrim	d394195221	[X86][AVX] Enabled AVX512F tests and add PR40815 test case llvm-svn: 359401	2019-04-28 15:04:30 +00:00
Simon Pilgrim	22d1476bfa	[X86][AVX] Combine non-lane crossing binary shuffles using X86ISD::VPERMV3 Some of the combines might be further improved if we lower more shuffles with X86ISD::VPERMV3 directly, instead of waiting to combine the results. llvm-svn: 359400	2019-04-28 14:31:01 +00:00
Sanjay Patel	ce8cfe96f7	[SelectionDAG] include FP min/max variants as binary operators The x86 test diffs don't look great because of extra move ops, but FP min/max should clearly be included in the list. llvm-svn: 359399	2019-04-28 13:19:29 +00:00
Sanjay Patel	fb9a5307a9	[DAGCombiner] try repeated fdiv divisor transform before building estimate This was originally part of D61028, but it's an independent diff. If we try the repeated divisor reciprocal transform before producing an estimate sequence, then we have an opportunity to use scalar fdiv. On x86, the trade-off is 1 divss vs. 5 vector FP ops in the default estimate sequence. On recent chips (Skylake, Ryzen), the full-precision division is only 3 cycle throughput, so that's probably the better perf default option and avoids problems from x86's inaccurate estimates. The last 2 tests show that users still have the option to override the defaults by using the function attributes for reciprocal estimates, but those patterns are potentially made faster by converting the vector ops (including ymm ops) to scalar math. Differential Revision: https://reviews.llvm.org/D61149 llvm-svn: 359398	2019-04-28 12:23:43 +00:00
Andrea Di Biagio	43003f0fec	[MCA] Fix typo in AVX2 gather tests. NFC llvm-svn: 359397	2019-04-28 10:54:45 +00:00
Simon Pilgrim	93ad48210c	[X86][SSE] Optimize llvm.experimental.vector.reduce.xor.vXi1 parity reduction (PR38840) An xor reduction of a bool vector can be optimized to a parity check of the MOVMSK/BITCAST'd integer - if the population count is odd return 1, else return 0. Differential Revision: https://reviews.llvm.org/D61230 llvm-svn: 359396	2019-04-28 10:46:17 +00:00
Simon Pilgrim	fed302ae37	[X86][AVX] Add AVX512DQ coverage for masked memory ops tests (PR34584) llvm-svn: 359395	2019-04-28 10:02:34 +00:00
Dan Liew	8651edf898	[CMake] Don't modify `FUZZER_SUPPORTED_ARCH` in place. On a Darwin host we were modifying the `FUZZER_SUPPORTED_ARCH` in place which would strip out non-x86 architectures. This is unhelpful if we want to use `FUZZER_SUPPORTED_ARCH` later. To fix this we introduce `FUZZER_TEST_ARCH` which is similar to what we have for the other sanitizers. For non-Darwin host platforms `FUZZER_TEST_ARCH` is the same as `FUZZER_SUPPORTED_ARCH` but for Darwin host platforms we use `darwin_filter_host_archs(...)` as the previous code did. llvm-svn: 359394	2019-04-28 09:44:53 +00:00
Qiu Chaofan	8eeb33497c	[PowerPC][Clang] Add tests for PowerPC MMX intrinsics Add the rest of test cases covering functions defined in mmintrin.h on PowerPC. Reviewed By: Jinsong Ji llvm-svn: 359393	2019-04-28 06:27:33 +00:00
Craig Topper	bd35a30940	[X86] Remove (V)MOV64toSDrr/m and (V)MOVDI2SSrr/m. Use 128-bit result MOVD/MOVQ and COPY_TO_REGCLASS instead Summary: The register forms of these instructions are CodeGenOnly instructions that cover GR32->FR32 and GR64->FR64 bitcasts. There is a similar set of instructions for the opposite bitcast. Due to the patterns using bitcasts these instructions get marked as "bitcast" machine instructions as well. The peephole pass is able to look through these as well as other copies to try to avoid register bank copies. Because FR32/FR64/VR128 are all coalescable to each other we can end up in a situation where a GR32->FR32->VR128->FR64->GR64 sequence can be reduced to GR32->GR64 which the copyPhysReg code can't handle. To prevent this, this patch removes one set of the 'bitcast' instructions. So now we can only go GR32->VR128->FR32 or GR64->VR128->FR64. The instruction that converts from GR32/GR64->VR128 has no special significance to the peephole pass and won't be looked through. I guess the other option would be to add support to copyPhysReg to just promote the GR32->GR64 to a GR64->GR64 copy. The upper bits were basically undefined anyway. But removing the CodeGenOnly instruction in favor of one that won't be optimized seemed safer. I deleted the peephole test because it couldn't be made to work with the bitcast instructions removed. The load versions of the instructions were unnecessary as the pattern that selects them contains a bitcasted load which should never happen. Fixes PR41619. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D61223 llvm-svn: 359392	2019-04-28 06:25:33 +00:00
Simon Pilgrim	03c4e2663c	Revert rL359389: [X86][SSE] Add support for <64 x i1> bool reduction Minor generalization of the existing <32 x i1> pre-AVX2 split code. ........ Causing irregular buildbot failures. llvm-svn: 359391	2019-04-27 20:44:08 +00:00
Simon Pilgrim	1a4a43250e	[X86][AVX] Add additional SSE/AVX expandload and compressstore targets llvm-svn: 359390	2019-04-27 20:20:02 +00:00
Simon Pilgrim	4118be3af6	[X86][SSE] Add support for <64 x i1> bool reduction Minor generalization of the existing <32 x i1> pre-AVX2 split code. llvm-svn: 359389	2019-04-27 20:04:44 +00:00
Simon Pilgrim	399746eaf6	[X86][AVX] Cleanup and add additional expandload and compressstore tests sort order by types and add vXi32/vXi16/vXi8 test coverage llvm-svn: 359388	2019-04-27 19:57:34 +00:00
Raphael Isemann	e2849a031c	Fix UNPREDICTABLE check in EmulateInstructionARM::EmulateADDRegShift Summary: As reported in LLVM bug 41487, the check in this function is wrong and should be the same as the described check in the comment (which is correctly copied from the ARM ISA reference). Reviewers: #lldb, davide, JDevlieghere Reviewed By: #lldb, davide, JDevlieghere Subscribers: davide, javed.absar, kristof.beyls, lldb-commits Tags: #lldb Differential Revision: https://reviews.llvm.org/D60654 llvm-svn: 359387	2019-04-27 18:15:35 +00:00
Simon Pilgrim	2a2d422400	[X86][AVX512] Improve vector bool reductions As predicate masks are legal on AVX512 targets, we avoid MOVMSK in these cases, but we can just bitcast the bool vector to the integer equivalent directly - avoiding expansion of the reduction to a shuffle pattern. llvm-svn: 359386	2019-04-27 17:32:46 +00:00
Simon Pilgrim	913bfd3363	[X86] Add vector boolean reduction tests (PR38840) AND/OR/XOR tests for the @llvm.experimental.vector.reduce intrinsics AND/OR are pretty good (pre-AVX512), XOR (not so common but used for parity reduction) is still pretty bad. llvm-svn: 359385	2019-04-27 16:49:54 +00:00
Lang Hames	15c072a161	[lli] Fix a typo in a cl::opt description. Patch by Wasiher. Thanks Wasiher! Differential Revision: https://reviews.llvm.org/D61135 llvm-svn: 359384	2019-04-27 16:13:53 +00:00
Fangrui Song	763a2e1f36	[llvm-nm][llvm-readelf] Avoid single-dash -long-option in tests llvm-svn: 359383	2019-04-27 16:12:14 +00:00
Simon Pilgrim	5cf616530a	Fix check-prefixes typo llvm-svn: 359382	2019-04-27 15:41:14 +00:00
Fangrui Song	795c00b21f	[DJB] Fix variable case after D61178 llvm-svn: 359381	2019-04-27 15:33:22 +00:00
Fangrui Song	0969af6962	[llvm-nm] Support section type 'u': STB_GNU_UNIQUE llvm-svn: 359380	2019-04-27 15:32:53 +00:00
Simon Pilgrim	3879b2cd45	[X86][SSE] Add initial test case for subvector insert/extract of illegal types Suggested by @nikic on D59188 llvm-svn: 359379	2019-04-27 15:30:06 +00:00
Simon Pilgrim	acc1e6d1c6	[X86][AVX] Merge mask select with shuffles across extract_subvector (PR40332) Fixes PR40332 in the limited case where we're selecting between a target shuffle and a zero vector. We can extend this in the future to handle more opcodes and non-zero selections. llvm-svn: 359378	2019-04-27 13:35:32 +00:00
Andrea Di Biagio	d77dc9ada2	[MCA] Add field `IsEliminated` to class Instruction. NFCI llvm-svn: 359377	2019-04-27 11:59:11 +00:00
Simon Atanasyan	b82144b6e4	[cmake] Disable a GCC optimization when building LLVM for MIPS GCC when compiling LLVM for MIPS can introduce a jump to an uninitialized value when shrink wrapping is enabled. As shrink wrapping is enabled in GCC at all optimization levels, it must be disabled. This bug exists for all versions of GCC since 4.9.2. This partially resolves PR37701 / GCC PR target/86069. Patch by Simon Dardis. Differential Revision: https://reviews.llvm.org/D48069 llvm-svn: 359376	2019-04-27 09:28:54 +00:00
Michal Gorny	b907c44265	[lldb] [lit] Cleanly terminate Register tests Continue the Register test processes, and let them terminate cleanly rather than implicitly terminating them along with lldb. llvm-svn: 359375	2019-04-27 07:43:29 +00:00
Vitaly Buka	0f21545a3c	[sanitizer] Calculate SizeClassAllocator32::ByteMap type from Params::kSpaceSize and Params::kRegionSizeLog Reviewers: eugenis Subscribers: kubamracek, cryptoad, #sanitizers, llvm-commits Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D61206 llvm-svn: 359374	2019-04-27 06:30:52 +00:00
Jonas Devlieghere	57179860a0	[CommandObject] Use GetDebugger() helper method (NFC) In r359354 a GetDebugger() method was added to the CommandObject class, so that we didn't have to go through the command interpreter to obtain the script interpreter. This patch simplifies other call sites where m_interpreter.GetDebugger() was used, and replaces them with a shorter call to the new method. llvm-svn: 359373	2019-04-27 06:19:42 +00:00
Jonas Devlieghere	40dfc3920e	[FormatEntity] Remove unused format type (NFC) The FormatType enum and corresponding field are unused. This patch removes the type, field and simplifies the macros that initialize them. llvm-svn: 359372	2019-04-27 05:36:57 +00:00
Craig Topper	063b471ff7	[X86] Use MOVQ for i64 atomic_stores when SSE2 is enabled Summary: If we have SSE2 we can use a MOVQ to store 64-bits and avoid falling back to a cmpxchg8b loop. If it's a seq_cst store we need to insert an mfence after the store. Reviewers: spatel, RKSimon, reames, jfb, efriedma Reviewed By: RKSimon Subscribers: hiraditya, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60546 llvm-svn: 359368	2019-04-27 03:38:15 +00:00
Richard Smith	31cfb311c5	Reinstate r359059, reverted in r359361, with a fix to properly prevent us emitting the operand of __builtin_constant_p if it has side-effects. Original commit message: Fix interactions between __builtin_constant_p and constexpr to match current trunk GCC. GCC permits information from outside the operand of __builtin_constant_p (but in the same constant evaluation context) to be used within that operand; clang now does so too. A few other minor deviations from GCC's behavior showed up in my testing and are also fixed (matching GCC): * Clang now supports nullptr_t as the argument type for __builtin_constant_p * Clang now returns true from __builtin_constant_p if called with a null pointer * Clang now returns true from __builtin_constant_p if called with an integer cast to pointer type llvm-svn: 359367	2019-04-27 02:58:17 +00:00
Vitaly Buka	ae01981d03	[AArch64] Initialize HasMTE llvm-svn: 359366	2019-04-27 02:40:01 +00:00
Vitaly Buka	83c6d10b62	[sanitizer] NFC: add static_assert to confirm that we use reasonable ByteMap type Summary: If bots work we can replace #ifs with template specialization by TwoLevelByteMapSize1. There are known users of TwoLevelByteMap with TwoLevelByteMapSize1 equal to 8, and users of FlatByteMap with TwoLevelByteMapSize1 equal to 2. Reviewers: eugenis Subscribers: kubamracek, #sanitizers, llvm-commits Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D61200 llvm-svn: 359364	2019-04-27 02:13:26 +00:00
Mark Searles	76c5b62988	Revert "AMDGPU: Split block for si_end_cf" This reverts commit 7a6ef3004655dd86d722199c471ae78c28e31bb4. We discovered some internal test failures, so reverting for now. Differential Revision: https://reviews.llvm.org/D61213 llvm-svn: 359363	2019-04-27 00:51:18 +00:00
Julian Lettner	c1008e4d3d	[NFC][Sanitizer] Remove GetRealFunctionAddress and replace usages Reviewers: vitalybuka Differential Revision: https://reviews.llvm.org/D61205 llvm-svn: 359362	2019-04-27 00:49:14 +00:00
Jorge Gorbe Moya	1dbd42ab5b	Revert Fix interactions between __builtin_constant_p and constexpr to match current trunk GCC. This reverts r359059 (git commit `0b098754b7`) llvm-svn: 359361	2019-04-27 00:32:04 +00:00
Petr Hosek	b58561baa5	[Fuchsia] Support multilib for -fsanitize=address and -fno-exceptions This introduces support for multilibs to the Fuchsia driver. Unlike the existing multilibs that are used primarily for handling different architecture variants, we use multilibs to handle different variants of Clang runtime libraries: -fsanitize=address and -fno-exceptions are the two we support initially. This replaces the existing support for sanitized runtime libraries that was only used by the Fuchsia driver and it also refactors some of the logic to allow sharing between GNU and Fuchsia drivers. Differential Revision: https://reviews.llvm.org/D61040 llvm-svn: 359360	2019-04-27 00:25:13 +00:00
Petr Hosek	0f9f021d05	[Driver] Support priority for multilibs When more than one multilib flag matches, try to select the best possible match based on priority. When two different multilibs with the same priority match, we still throw an error, matching the existing behavior. Differential Revision: https://reviews.llvm.org/D60990 llvm-svn: 359359	2019-04-27 00:25:11 +00:00
Stanislav Mekhanoshin	4f331cb1f3	[AMDGPU] gfx1010 VOPC implementation Differential Revision: https://reviews.llvm.org/D61208 llvm-svn: 359358	2019-04-26 23:16:16 +00:00
Lang Hames	a9fdf375b3	[ORC] Add a 'plugin' interface to ObjectLinkingLayer for events/configuration. ObjectLinkingLayer::Plugin provides event notifications when objects are loaded, emitted, and removed. It also provides a modifyPassConfig callback that allows plugins to modify the JITLink pass configuration. This patch moves eh-frame registration into its own plugin, and teaches llvm-jitlink to only add that plugin when performing execution runs on non-Windows platforms. This should allow us to re-enable the test case that was removed in r359198. llvm-svn: 359357	2019-04-26 22:58:39 +00:00

315167 Commits