llvm-project

Commit Graph

Author	SHA1	Message	Date
Min-Yih Hsu	175139b6fd	[M68k][NFC] Tidy up the just-migrated MC tests Cleanup the formats of the MC tests that were just migrated. NFC	2021-08-22 22:43:02 -07:00
Min-Yih Hsu	da253d5690	[M68k][test] Migrate some MOVE instruction MC tests Migrate some MOVE instruction MC tests from test/CodeGen/M68k. Unfortunately the tests touched in this commit were failed due to lacking of the `abs.W` operand, which forces any memory address parsed from assembly being represented in 32-bits. We're temporarily allowing these unwanted widening in the tests until the support for `abs.W` is there.	2021-08-22 22:28:40 -07:00
Siva Chandra Reddy	ca6b354229	[libc] Add range reduction functions based on Paine and Hanek algorithm. These functions will be used in a future patch to implement trigonometric functions. Unit tests have been added but to the libc-long-running-tests suite. The unit tests long running because we compare against MPFR computations performed at 1280 bits of precision. Some cleanups or elimination of repeated patterns can be done as follow up changes. Differential Revision: https://reviews.llvm.org/D104817	2021-08-23 05:18:41 +00:00
Shilei Tian	2c6ffb4eb2	[NFC] clang-format -i clang/lib/CodeGen/CGStmtOpenMP.cpp	2021-08-22 22:57:05 -04:00
Kai Luo	7165e6713f	[PowerPC] Use int64_t to represent stack object offset and frame size This is the first step to enable PPC64 support huge frame size(>2G). Also fix an assertion error for frame size, i.e.,`int x; !isInt<32>(x);` should be always evaluated false, so the guard code for frame size is impossible to hit. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D107435	2021-08-23 02:13:21 +00:00
Michael Kruse	9cfab5e249	[Polly] Add support for -polly-dump-before/after with NPM. The new pass manager does not allow adding module passes at the -polly-position=before-vectorizer extension point. Introduce a DumpFunctionPass that dumps only current function. In contrast to the legacy pass manager's -polly-dump-before, each function will be dumped into its own file. -polly-dump-before-file is still not supported. The DumpFunctionPass uses llvm::CloneModule to copy the current function into a new module and then write it into a file.	2021-08-22 20:43:35 -05:00
Stella Laurenzo	a8de667af0	[mlir] Add op for NCHW conv2d. * This is the native data layout for PyTorch and npcomp was using the prior version before cleanup. Differential Revision: https://reviews.llvm.org/D108527	2021-08-22 17:27:33 -07:00
Stella Laurenzo	64e74e9d7c	[mlir][linalg] Add script to update the LinalgNamedStructuredOps.yaml. nfc Also adds banners to the files with update instructions. Differential Revision: https://reviews.llvm.org/D108529	2021-08-22 16:54:51 -07:00
Stella Laurenzo	e78b745cf2	[mlir][python] Makes C++ extension code relocatable by way of a macro. * Resolves a TODO by making this configurable by downstreams. * This seems to be the last thing allowing full use of the Python bindings as a library within another project (i.e. be embedding them). Differential Revision: https://reviews.llvm.org/D108523	2021-08-22 13:46:14 -07:00
Nikita Popov	79b55e5038	[GVN] Fix test for loop load PRE on alloca (NFC) This test was not modifying the pointer in the loop, so the loads just ended up as undef, without relation to loop load PRE. Pass the alloca to the called function, so the memory is potentially modified.	2021-08-22 22:28:53 +02:00
Nikita Popov	2b70b68efb	[GVN] Don't short-circuit load PRE `4ad41902e8` changed this code to propagate Changed if scalar GEP PRE is performed. However, as implemented this would skip the load PRE entirely if GEP indices were PREd. Make sure load PRE runs even if Changed is already true. This likely has no functional effect as load PRE would then occur on a later GVN iteration.	2021-08-22 21:12:58 +02:00
Amy Kwan	4cd8dd3fe0	[scudo][standalone] Link tests against libatomic if libatomic exists It is possible that libatomic does not exist on some systems. This patch updates the scudo standalone tests to link against libatomic if the library exists. This is an update to the original patch: https://reviews.llvm.org/D64134 and aims to resolve https://bugs.llvm.org/show_bug.cgi?id=51431. Differential Revision: https://reviews.llvm.org/D108503	2021-08-22 13:47:04 -05:00
Philip Reames	d8d84c9df8	[runtimeunroll] Use early return to reduce nesting [nfc]	2021-08-22 11:34:50 -07:00
Philip Reames	aec08e8600	Special case common branch patterns in breakLoopBackedge This special cases an unconditional latch and a conditional branch latch exit to improve codegen and test readability. I am hoping to reuse this function in the runtime unroll code, but without this change, the test diffs are far too complex to assess.	2021-08-22 10:42:23 -07:00
Simon Pilgrim	805fb1f6c1	[X86] combineMul - move MUL_IMM comment inside function. NFC. combineMul is now used for other things as well as the mul-with-constant expansion - move the comment to where its actually relevant.	2021-08-22 18:27:03 +01:00
Alexey Lapshin	07d44cc0b1	[DWARF][Verifier] Do not add child DieRangeInfo with empty address range to the parent. verifyDieRanges function checks for the intersected address ranges. It adds child DieRangeInfo into parent DieRangeInfo to check whether children have overlapping address ranges. It is safe to not add DieRangeInfo with empty address range into parent's children list. This decreases the number of children which should be navigated and as a result decreases execution time(parents having a lot of children with empty ranges spend much time navigating them). For this command: "llvm-dwarfdump --verify clang-repl" execution time decreased from 220 sec till 75 sec. Differential Revision: https://reviews.llvm.org/D107554	2021-08-22 19:39:21 +03:00
Kazu Hirata	40fd2d93c0	[Transforms] Remove unused declaration emitStrNLen (NFC) The corresponding definition has been missing for at least 5 years.	2021-08-22 09:08:22 -07:00
Arthur O'Dwyer	ca7926bd79	[libc++] Eliminate needless `add_lvalue_reference` from <algorithm> helpers. NFCI. When `_Compare` is a function parameter already (so it's not `void` and it's not an abominable function type), `add_lvalue_reference_t<_Compare>` is simply a synonym for `_Compare&`. We don't need to pull in `<type_traits>` and instantiate a template trait to figure that out. Differential Revision: https://reviews.llvm.org/D108400	2021-08-22 11:43:12 -04:00
Nikita Popov	fafe5a6f44	[InstCombine] Perform "eq of parts" fold with logical ops The pattern matched here is too complex for the general logical and/or to bitwise and/or conversion to trigger. However, the fold is poison-safe, so match it with a select root as well: https://alive2.llvm.org/ce/z/vNzzSg https://alive2.llvm.org/ce/z/Beyumt	2021-08-22 16:55:53 +02:00
Nikita Popov	be4b8366fb	[InstCombine] Add tests for "eq of parts" with logical op (NFC) We currently only handle this with a bitwise and/or instruction, but not a logical.	2021-08-22 16:52:44 +02:00
Simon Pilgrim	352df10a23	[X86][AVX] matchShuffleAsBlend - use isElementEquivalent to help match broadcast/repeated elements Extend matchShuffleAsBlend to not only match against known in-place elements for BLEND shuffles, but use isElementEquivalent to determine if the shuffle mask's referenced element is the same as the in-place element. This allows us to replace a number of insertps instructions with more general blendps instructions (better opportunities for commutation, concatenation etc.).	2021-08-22 15:26:17 +01:00
Simon Pilgrim	96fb3eef66	Fix signed/unsigned comparison warning. NFCI.	2021-08-22 15:02:19 +01:00
Simon Pilgrim	7b7ac4b16a	[X86] Expose memory codegen in element insert load tests to improve accuracy of checks Also replace X32 with X86 check prefixes for i686 tests (we tend to try to use X32 for gnux32 targets)	2021-08-22 14:54:48 +01:00
Simon Pilgrim	a1c892b439	[X86][SSE] lowerVECTOR_SHUFFLE - canonicalize with horizontal ops. Before lowering shuffles, see if we can merge horizontal ops or canonicalize the shuffle mask to point to the same LHS/RHS of the HOps when an HOp's args are repeated.	2021-08-22 14:54:48 +01:00
Sanjay Patel	dcf659e821	[InstSimplify] fold rotate of -1 to -1 This is part of solving more general rotate patterns seen in bugs related to: https://llvm.org/PR51575 https://alive2.llvm.org/ce/z/GpkFCt	2021-08-22 09:15:48 -04:00
Sanjay Patel	d41e308f10	[InstSimplify] fold rotate of zero to zero This is part of solving more general rotate patterns seen in bugs related to: https://llvm.org/PR51575 https://alive2.llvm.org/ce/z/fjKwqv	2021-08-22 09:15:48 -04:00
Sanjay Patel	a0ebac4466	[InstSimplify] add tests for rotates of 0/-1; NFC	2021-08-22 09:15:48 -04:00
Simon Pilgrim	8533e782ef	[X86] Try to sync HSW + BDW model class defs to simplify comparisons. NFC. Broadwell is mainly a die shrink of Haswell, but the model had many of the scheduling classes in different orders, making side-by-side comparisons very difficult. The InstRW overrides are still quite different, but at least that part of the side-by-side diff is now in the same position. This was noticed while I was trying to investigate diffs between llvm-mca and other perf analyzers in https://uica.uops.info/ - we used to be able to do diffs between most of the models very easily, but we seem to have lost that simplicity as classes have been altered, models have been refined and other models have rotted.	2021-08-22 13:02:51 +01:00
Sanjay Patel	3aa009cc87	[InstCombine] generalize subtract with 'not' operands The motivation was to get min/max intrinsics to parity with cmp+select idioms, but this unlocks a few more folds because isFreeToInvert recognizes add/sub with constants too. In the min/max example, we have too many extra uses for smaller folds to improve things, but this fold is able to eliminate uses even though we can't reduce the number of instructions.	2021-08-22 07:18:31 -04:00
Simon Pilgrim	7f48bd3bed	CGBuiltin.cpp - pass SVETypeFlags by const reference. NFC. Don't pass the struct by value.	2021-08-22 12:13:17 +01:00
Florian Hahn	9baed023b4	[LV] Adjust reduction recipes before recurrence handling. Adjusting the reduction recipes still relies on references to the original IR, which can become outdated by the first-order recurrence handling. Until reduction recipe construction does not require IR references, move it before first-order recurrence handling, to prevent a crash as exposed by D106653.	2021-08-22 11:02:33 +01:00
Ben Shi	f69fb7ac72	[DAGCombiner] Add target hook function to decide folding (mul (add x, c1), c2) Reviewed by: lebedev.ri, spatel, craig.topper, luismarques, jrtc27 Differential Revision: https://reviews.llvm.org/D107711	2021-08-22 16:53:32 +08:00
luxufan	dda116bc3d	[JITLink] Add support of R_X86_64_32S relocation This patch supported the R_X86_64_32S relocation and add the Pointer32Signed generic edge kind. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D108446	2021-08-22 16:45:25 +08:00
Lang Hames	1e5e1bee49	[ORC] Add std::tuple support to SimplePackedSerialization.	2021-08-22 11:17:04 +10:00
Lang Hames	76d6a8df20	[ORC] Rename blobSerializationRoundTrip, drop explicit arg types on calls. Renames the blobSerializationRoundTrip test helper function to spsSerializationRoundTrip ('blob' was the placeholder name for the serialization scheme during prototyping, this function was missed when renaming everything for the mainline). Also drops explicit template arguments at call sites where they can be inferred (and are obvious) from the call argument type.	2021-08-22 11:17:04 +10:00
Wang, Pengfei	b088536ce9	[X86] AVX512FP16 instructions enabling 4/6 Enable FP16 unary operator instructions. Ref.: https://software.intel.com/content/www/us/en/develop/download/intel-avx512-fp16-architecture-specification.html Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D105267	2021-08-22 08:59:35 +08:00
Lang Hames	75e5f35aea	[ORC] Add missing header. Should fix bot failure at https://green.lab.llvm.org/green/job/clang-stage2-Rthinlto/4367	2021-08-22 10:36:52 +10:00
Fangrui Song	1dfb30e54c	[TargetCallingConv] Change OutputArg ctor to match its members This avoids unneeded MVT->EVT conversion.	2021-08-21 16:41:48 -07:00
Fangrui Song	0473e9f41a	[AArch64] Replace unneeded CCAssignToRegWithShadow with CCAssignToReg CCState::AllocateReg handles aliased registers.	2021-08-21 16:33:29 -07:00
Fangrui Song	a83d99c55e	[TargetMachine] Drop special case for *-win32-macho clang CodeGenModule shouldAssumeDSOLocal has set dso_local.	2021-08-21 13:59:17 -07:00
Fangrui Song	c5ee312368	[TargetMachine] Simplify shouldAssumeDSOLocal. NFC	2021-08-21 12:37:29 -07:00
Kazu Hirata	612048aec1	[clang] Fix typos in documentation (NFC)	2021-08-21 12:17:58 -07:00
Sanjay Patel	41af8f0ad5	[InstCombine] combine constants by reassociating add/sub/add This may overlap partially with the reassociate pass, but it seems simple enough that we should try it here in InstCombine to enable other folds. This shows up as an opportunity and potential regression if we improve a subtract fold with 'not' ops to be more general.	2021-08-21 11:45:43 -04:00
Sanjay Patel	c0844de7a2	[InstCombine] add tests for add/sub/add combines; NFC	2021-08-21 11:45:43 -04:00
Sanjay Patel	0751347bc3	[InstCombine] add tests for min/max with nots and sub; NFC	2021-08-21 11:45:43 -04:00
David Green	605489d593	[ARM] Fix VQDMULH fold for scalar smin Add a variant of mve-vqdmulh tests that uses min/max intrinsics directly, including a scalar test that shows it misbehaving for min intrinsics and a fix for the combine to prevent it from misbehaving.	2021-08-21 16:33:18 +01:00
Andrzej Warzynski	787c443a8d	[flang] Refine output file generation This patch cleans-up the file generation code in Flang's frontend driver. It improves the layering between `CompilerInstance::CreateDefaultOutputFile`, `CompilerInstance::CreateOutputFile` and their various clients. * Rename `CreateOutputFile` as `CreateOutputFileImpl` and make it private. This method is an implementation detail. * Instead of passing an `std::error_code` out parameter into `CreateOutputFileImpl`, have it return Expected<>. This is a bit shorter and idiomatic LLVM. * Make `CreateDefaultOutputFile` (which calls `CreateOutputFileImpl`) issue an error when file creation fails. The error code from `CreateOutputFileImpl` is used to generate a meaningful diagnostic message. * Remove error reporting from `PrintPreprocessedAction::ExecuteAction`. This is only for cases when output file generation fails. This is handled in `CreateDefaultOutputFile` instead (see the previous point). * Inline `AddOutputFile` into its only caller, `CreateDefaultOutputFile`. * Switch from `lvm::buffer_ostream` to `llvm::buffer_unique_ostream>` for non-seekable output streams. This simplifies the logic in the driver and was introduced for this very reason in [1] * Moke sure that the diagnostics from the prescanner when running `-E` (`PrintPreprocessedAction::ExecuteAction`) are printed before the actual output is generated. * Update comments, add test. NOTE: This patch relands [2]. As suggested by Michael Kruse in the post-commit/post-revert review, I've added the following: ``` config.errc_messages = "@LLVM_LIT_ERRC_MESSAGES@" ``` in Flang's `lit.site.cfg.py.in`. This way, `%errc_ENOENT` in output-paths.f90 gets the correct value on Windows as well as on Linux. [1] https://reviews.llvm.org/D93260 [2] `fd21d1e198` Reviewed By: ashermancinelli Differential Revision: https://reviews.llvm.org/D108390	2021-08-21 15:18:48 +00:00
Kirill Shmakov	2cc1198e36	[lldb] Fix typo in the description of breakpoint options	2021-08-21 12:24:29 +02:00
LLVM GN Syncbot	93de779d63	[gn build] Port `7f99337f9b`	2021-08-21 09:44:22 +00:00
Lang Hames	7f99337f9b	[ORC] Add EPCGenericMemoryAccess: generic executor memory access via EPC calls. All ExecutorProcessControl subclasses must provide an ExecutorProcessControl::MemoryAccess object that can be used to access executor memory from the JIT process. The EPCGenericMemoryAccess class provides an off-the-shelf MemoryAccess implementation for JITs that do not need (or cannot provide) a specialized MemoryAccess implementation. This simplifies the process of creating new ExecutorProcessControl implementations.	2021-08-21 19:33:39 +10:00

... 5 6 7 8 9 ...

397473 Commits All Branches Search

397473 Commits

All Branches