llvm-project

Commit Graph

Author	SHA1	Message	Date
Haojian Wu	facea4a2d4	[clangd] Fix a missing override keyword, NFC.	2021-01-21 11:06:43 +01:00
Florian Hahn	bee486851c	[LoopUnswitch] Implement first version of partial unswitching. This patch applies the idea from D93734 to LoopUnswitch. It adds support for unswitching on conditions that are only invariant along certain paths through a loop. In particular, it targets conditions in the loop header that depend on values loaded from memory. If either path from the true or false successor through the loop does not modify memory, perform partial loop unswitching. That is, duplicate the instructions feeding the condition in the pre-header. Then unswitch on the duplicated condition. The condition is now known in the unswitched version for the 'invariant' path through the original loop. On caveat of this approach is that one of the loops created can be partially unswitched again. To avoid this behavior, `llvm.loop.unswitch.partial.disable` metadata is added to the unswitched loops, to avoid subsequent partial unswitching. If that's the approach to go, I can move the code handling the metadata kind into separate functions. This increases the cases we unswitch quite a bit in SPEC2006/SPEC2000 & MultiSource. It also allows us to eliminate a dead loop in SPEC2017's omnetpp ``` Tests: 236 Same hash: 170 (filtered out) Remaining: 66 Metric: loop-unswitch.NumBranches Program base patch diff test-suite...000/255.vortex/255.vortex.test 2.00 23.00 1050.0% test-suite...T2006/401.bzip2/401.bzip2.test 7.00 55.00 685.7% test-suite :: External/Nurbs/nurbs.test 5.00 26.00 420.0% test-suite...s-C/unix-smail/unix-smail.test 1.00 3.00 200.0% test-suite.../Prolangs-C++/ocean/ocean.test 1.00 3.00 200.0% test-suite...tions/lambda-0.1.3/lambda.test 1.00 3.00 200.0% test-suite...yApps-C++/PENNANT/PENNANT.test 2.00 5.00 150.0% test-suite...marks/Ptrdist/yacr2/yacr2.test 1.00 2.00 100.0% test-suite...lications/viterbi/viterbi.test 1.00 2.00 100.0% test-suite...plications/d/make_dparser.test 12.00 24.00 100.0% test-suite...CFP2006/433.milc/433.milc.test 14.00 27.00 92.9% test-suite.../Applications/lemon/lemon.test 7.00 12.00 71.4% test-suite...ce/Applications/Burg/burg.test 6.00 10.00 66.7% test-suite...T2006/473.astar/473.astar.test 16.00 26.00 62.5% test-suite...marks/7zip/7zip-benchmark.test 78.00 121.00 55.1% ``` Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D93764	2021-01-21 09:46:41 +00:00
Alexander Belyaev	fc58bfd02f	[mlir] Remove complex ops from Standard dialect. `complex` dialect should be used instead. https://llvm.discourse.group/t/rfc-split-the-complex-dialect-from-std/2496/2 Differential Revision: https://reviews.llvm.org/D95077	2021-01-21 10:34:26 +01:00
Fangrui Song	71635ea5ff	MCDwarf: Delete uneeded parameter And change signature	2021-01-21 00:55:07 -08:00
Georgii Rymar	dd5c982804	[llvm-nm][ELF] - Make -D display symbol versions. This fixes https://bugs.llvm.org/show_bug.cgi?id=48670. Since binutils 2.35, nm -D displays symbol versions by default. This patch teaches llvm-nm to do the same. Differential revision: https://reviews.llvm.org/D94907	2021-01-21 11:23:45 +03:00
Luo, Yuanke	20013d02f3	[X86][AMX] Fix tile config register spill issue. Previous code build the model that tile config register is the user of each AMX instruction. There is a problem for the tile config register spill. When across function, the ldtilecfg instruction may be inserted on each AMX instruction which use tile config register. This cause all tile data register clobber. To fix this issue, we remove the model of tile config register. We analyze the regmask of call instruction and insert ldtilecfg if there is any tile data register live across the call. Inserting the sttilecfg before the call is unneccessary, because the tile config doesn't change and we can just reload the config. Besides we also need check tile config register interference. Since we don't model the config register we should check interference from the ldtilecfg to each tile data register def. ldtilecfg / \ BB1 BB2 / \ call BB3 / \ %1=tileload %2=tilezero We can start from the instruction of each tile def, and backward to ldtilecfg. If there is any call instruction, and tile data register is not preserved, we should insert ldtilecfg after the call instruction. Differential Revision: https://reviews.llvm.org/D94155	2021-01-21 16:01:50 +08:00
Georgii Rymar	51f4958057	[yaml2obj/obj2yaml] - Improve dumping/creating of ELF versioning sections. This makes the following improvements. For `SHT_GNU_versym`: * yaml2obj: set `sh_link` to index of `.dynsym` section automatically. For `SHT_GNU_verdef`: * yaml2obj: set `sh_link` to index of `.dynstr` section automatically. * yaml2obj: set `sh_info` field automatically. * obj2yaml: don't dump the `Info` field when its value matches the number of version definitions. For `SHT_GNU_verneed`: * yaml2obj: set `sh_link` to index of `.dynstr` section automatically. * yaml2obj: set `sh_info` field automatically. * obj2yaml: don't dump the `Info` field when its value matches the number of version dependencies. Also, simplifies few test cases. Differential revision: https://reviews.llvm.org/D94956	2021-01-21 10:36:48 +03:00
madhur13490	dd8ae42674	[IndirectFunctions] Skip propagating attributes to address taken functions In case of indirect calls or address taken functions, skip propagating any attributes to them. We just propagate features to such functions. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D94585	2021-01-21 07:04:28 +00:00
Kazu Hirata	6de4865545	[llvm] Use hasSingleElement (NFC)	2021-01-20 21:35:55 -08:00
Kazu Hirata	e53472de68	[Transforms] Use llvm::append_range (NFC)	2021-01-20 21:35:54 -08:00
Kazu Hirata	8f5da41c4d	[llvm] Construct SmallVector with iterator ranges (NFC)	2021-01-20 21:35:52 -08:00
Max Kazantsev	d6bb96e677	[X86] Add experimental option to separately tune alignment of innermost loops We already have an experimental option to tune loop alignment. Its impact is very wide (and there is a suspicion that it's not always profitable). We want to have something more narrow to play with. This patch adds similar option that overrides preferred alignment for innermost loops. This is for experimental purposes, default values do not change the existing behavior. Differential Revision: https://reviews.llvm.org/D94895 Reviewed By: pengfei	2021-01-21 11:15:16 +07:00
Hsiangkai Wang	a8b96eadfd	[RISCV] Implement vssseg intrinsics. Define vlsseg intrinsics and pseudo instructions. Lower vlsseg intrinsics to pseudo instructions in RISCVDAGToDAGISel. Differential Revision: https://reviews.llvm.org/D94863	2021-01-21 11:51:35 +08:00
Hsiangkai Wang	e5e329023b	[RISCV] Implement vlsseg intrinsics. Define vlsseg intrinsics and pseudo instructions. Lower vlsseg intrinsics to pseudo instructions in RISCVDAGToDAGISel. Differential Revision: https://reviews.llvm.org/D94763	2021-01-21 11:51:35 +08:00
Hsiangkai Wang	47228f7854	[RISCV] Implement vsseg intrinsics. Define vsseg intrinsics and pseudo instructions. Lower vsseg intrinsics to pseudo instructions in RISCVDAGToDAGISel. Differential Revision: https://reviews.llvm.org/D94688	2021-01-21 11:51:35 +08:00
Jonas Devlieghere	baf6c2987e	[lldb] Upstream eCore_arm_arm64e enum value in ArchSpec Upstream the eCore_arm_arm64e enum value in ArchSpec. All the other arm64e triple changes already landed in LLVM. Differential revision: https://reviews.llvm.org/D95110	2021-01-20 19:39:47 -08:00
Craig Topper	6cab3f88ee	[RISCV] Use update_llc_test_checks.py to regenerate check lines in vleff-rv32.ll and vleff-rv64.ll. This should minimize change in a future patch.	2021-01-20 18:51:02 -08:00
Jonas Devlieghere	f354b87df2	[dsymutil] Compare object modification times using second precision The modification time in the debug map is expressed using second precision, while the modification time returned by the filesystem could be more precise. Avoid spurious warnings about timestamp mismatches by truncating the modification time reported by the system to seconds.	2021-01-20 18:45:30 -08:00
Jim Ingham	98feb08e44	Use CXX_SOURCES and point to the right source file. Copy paste error, but the test still built on macOS. Weird. It failed on debian linux with an error about -fno-limit-debug-info not being a supported flag??? Not sure how this goof would cause that error, but let's see if it did...	2021-01-20 18:38:07 -08:00
Jianzhou Zhao	f86db34def	[MSan] Move origins for overlapped memory transfer Reviewed-by: eugenis Differential Revision: https://reviews.llvm.org/D94572	2021-01-21 02:11:26 +00:00
Jim Ingham	bff389120f	Fix a bug with setting breakpoints on C++11 inline initialization statements. If they occurred before the constructor that used them, we would refuse to set the breakpoint because we thought they were crossing function boundaries. Differential Revision: https://reviews.llvm.org/D94846	2021-01-20 17:58:34 -08:00
Jez Ng	34e8fcf63f	[lld-macho] Add dependency on ObjCARC to fix shared build	2021-01-20 20:41:51 -05:00
Shilei Tian	3809e5dac9	[Clang][OpenMP] Use `clang_cc1` test for `declare_target_device_only_compilation.cpp` Use `clang_cc1` test for `declare_target_device_only_compilation.cpp` Reviewed By: echristo Differential Revision: https://reviews.llvm.org/D95089	2021-01-20 20:34:10 -05:00
Guozhi Wei	96296d9220	[DAGCombiner] Precommit test case for D95086 This is the test case for D95086 with worse result. Differential Revision: https://reviews.llvm.org/D95103	2021-01-20 17:15:47 -08:00
River Riddle	825c2b4a41	[mlir][OpFormatGen] Fix incorrect kind used for RegionsDirective I attempted to write a test case for this, but the situations in which the kind is used for RegionDirective and ResultsDirective have zero overlap; meaning that there isn't a situation in which sharing the kind creates a conflict. Differential Revision: https://reviews.llvm.org/D94988	2021-01-20 16:50:59 -08:00
mfehr	8a7ff7301a	[mlir] Make MLIRContext::getOrLoadDialect(StringRef, TypeID, ...) public Having this function in a public scope is helpful to register dialects that are defined at runtime, and thus that need a runtime-defined TypeID. Also, a similar function in DialectRegistry, insert(TypeID, StringRef, ...), has a public scope. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D95091	2021-01-21 00:29:58 +00:00
River Riddle	c78219f644	[mlir] Add a new builtin `unrealized_conversion_cast` operation An `unrealized_conversion_cast` operation represents an unrealized conversion from one set of types to another, that is used to enable the inter-mixing of different type systems. This operation should not be attributed any special representational or execution semantics, and is generally only intended to be used to satisfy the temporary intermixing of type systems during the conversion of one type system to another. This operation was discussed in the following RFC(and ODM): https://llvm.discourse.group/t/open-meeting-1-14-dialect-conversion-and-type-conversion-the-question-of-cast-operations/ Differential Revision: https://reviews.llvm.org/D94832	2021-01-20 16:28:18 -08:00
River Riddle	6ccf2d62b4	[mlir] Add an interface for Cast-Like operations A cast-like operation is one that converts from a set of input types to a set of output types. The arity of the inputs may be from 0-N, whereas the arity of the outputs may be anything from 1-N. Cast-like operations are removable in cases where they produce a "no-op", i.e when the input types and output types match 1-1. Differential Revision: https://reviews.llvm.org/D94831	2021-01-20 16:28:17 -08:00
Varun Gandhi	87a89549c4	[NFC] Minor cleanup for ValueHandle code. Based on feedback in https://reviews.llvm.org/D93433. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D94238	2021-01-20 16:27:55 -08:00
Michael Jones	689de5841c	[libc][NFC][obvious] fix the names of MPFR tests I missed the MPFR tests in my previous commit. They have now been fixed to not fail the prefix check in the test macro.	2021-01-20 23:42:01 +00:00
Michael Jones	1df0dbfcb5	[libc][NFC] add "LlvmLibc" as a prefix to all test names Summary: Having a consistent prefix makes selecting all of the llvm libc tests easier on any platform that is also using the gtest framework. This also modifies the TEST and TEST_F macros to enforce this change moving forward. Reviewers: sivachandra Subscribers:	2021-01-20 23:15:36 +00:00
Dávid Bolvanský	bb3f169b59	[BuildLibcalls, Attrs] Support more variants of C++'s new, add attributes for C++'s delete Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D95095	2021-01-21 00:12:37 +01:00
Craig Topper	e996f1d419	[RISCV] Add another isel pattern for slliu.w. Previously we only matched (and (shl X, C1), 0xffffffff << C1) which matches the InstCombine canonicalization order. But its possible to see (shl (and X, 0xffffffff), C1) if the pattern is introduced in SelectionDAG. For example, through expansion of a GEP.	2021-01-20 14:54:40 -08:00
Craig Topper	0f8386c4f6	[RISCV] Add addu.w and slliu.w test that uses getelementptr with zero extended indices. This is closer to the kind of code that these intrinsics are targeted at. Note we fail to match slliu.w here because our pattern looks for (and (shl X, C1), 0xffffffff << C1) rather than (shl (and X, 0xffffffff), C1). I'll fix this in a follow up commit.	2021-01-20 14:54:40 -08:00
Diego Caballero	735a07f047	Revert "[mlir][Affine] Add support for multi-store producer fusion" This reverts commit `7dd198852b`. ASAN issue.	2021-01-21 00:37:23 +02:00
Aart Bik	5959c28f24	[mlir][sparse] add asserts on reading in tensor data Rationale: Since I made the argument that metadata helps with extra verification checks, I better actually do that ;-) Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D95072	2021-01-20 14:30:13 -08:00
Ryan Houdek	045d84f4e6	D94954: Fixes Snapdragon Kryo CPU core detection All of these families were claiming to be a73 based, which was causing -mcpu/mtune=native to never use the newer features available to these cores. Goes through each and bumps the individual cores to their respective Big counterparts. Since this code path doesn't support big.little detection, there was already a precedent set with the Qualcomm line to choose the big cores only. Adds a comment on each line for the product's name that the part number refers to. Confirmed on-device and through Linux header naming convections. Additionally newer SoCs mix CPU implementer parts from multiple implementers. Both 0x41 (ARM) and 0x51 (Qualcomm) in the Snapdragon case This was causing a desync in information where the scan at the start to find the implementer would mismatch the part scan later on. Now scan for both implementer and part at the start so these stay in sync. Differential Revision: https://reviews.llvm.org/D94954	2021-01-20 22:23:43 +00:00
Fangrui Song	6afdf13ae4	Makefile.rules: Avoid redundant .d generation (make restart) and inline archive rule to the only test Take an example when `CXX_SOURCES` is main.cpp. main.d is an included file. make will rebuild main.d, re-executes itself [1] to read in the new main.d file, then rebuild main.o, finally link main.o into a.out. main.cpp is parsed twice in this process. This patch merges .d generation into .o generation [2], writes explicit rules for .c/.m and deletes suffix rules for %.m and %.o. Since a target can be satisfied by either of .c/.cpp/.m/.mm, we use multiple pattern rules. The rule with the prerequisite (with VPATH considered) satisfied is used [3]. Since suffix rules are disabled, the implicit rule for archive member targets is no long available [4]. Rewrite, simplify the archive rule and inline it into the only test `test/API/functionalities/archives/Makefile`. [1]: https://www.gnu.org/software/make/manual/html_node/Remaking-Makefiles.html [2]: http://make.mad-scientist.net/papers/advanced-auto-dependency-generation/ [3]: https://www.gnu.org/software/make/manual/html_node/Pattern-Match.html [4]: https://www.gnu.org/software/make/manual/html_node/Archive-Update.html ObjC/ObjCXX tests only run on macOS. I don't have testing environment. Hope someone can do it for me. Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D94890	2021-01-20 14:22:33 -08:00
Nicolas Vasilache	555a395f2c	[mlir] NFC - Fix unused variable in non-debug mode	2021-01-20 22:20:38 +00:00
Tony Tye	079967cdf9	[NFC][AMDGPU] Document target ID syntax for code object V2 to V3 Differential Revision: https://reviews.llvm.org/D95018	2021-01-20 21:48:52 +00:00
Michael Liao	7b5d7c7b0a	[hip] Fix `<complex>` compilation on Windows with VS2019. Differential Revision: https://reviews.llvm.org/D95075	2021-01-20 16:43:44 -05:00
Mircea Trofin	ccec2cf1d9	Reland "[NPM][Inliner] Factor ImportedFunctionStats in the InlineAdvisor" This reverts commit `d97f776be5`. The original problem was due to build failures in shared lib builds. D95079 moved ImportedFunctionsInliningStatistics under Analysis, unblocking this.	2021-01-20 13:33:43 -08:00
Eugene Zhulenev	a2223b09b1	[mlir:async] Fix data races in AsyncRuntime Resumed coroutine potentially can deallocate the token/value/group and destroy the mutex before the std::unique_ptr destructor. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D95037	2021-01-20 13:23:39 -08:00
LLVM GN Syncbot	36b05d2e9f	[gn build] Port `95ce32c787`	2021-01-20 21:18:20 +00:00
Mircea Trofin	95ce32c787	[NFC] Move ImportedFunctionsInliningStatistics to Analysis This is related to D94982. We want to call these APIs from the Analysis component, so we can't leave them under Transforms. Differential Revision: https://reviews.llvm.org/D95079	2021-01-20 13:18:03 -08:00
Peter Steinfeld	bebbe64075	[flang] Fix creation of deferred shape arrays by POINTER statement It's possible to declare deferred shape array using the POINTER statement, for example: POINTER :: var(:) When analyzing POINTER declarations, we were not capturing the array specification information, if present. I fixed this by changing the "Post" function for "parser::PointerDecl" to check to see if the declaration contained a "DeferredShapeSpecList". In such cases, I analyzed the shape and used to information to declare an "ObjectEntity" that contains the shape information rather than an "UnknownEntity". I also added a couple of small tests that fail to compile without these changes. Differential Revision: https://reviews.llvm.org/D95080	2021-01-20 13:08:11 -08:00
Shilei Tian	33a5d212c6	[OpenMP][NVPTX] Added forward declaration to pave the way for building deviceRTLs with OpenMP Once we switch to build deviceRTLs with OpenMP, primitives and CUDA intrinsics cannot be used directly anymore because `__device__` is not recognized by OpenMP compiler. To avoid involving all CUDA internal headers we had in `clang`, we forward declared these functions. Eventually they will be transformed into right LLVM instrinsics. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D95058	2021-01-20 15:56:02 -05:00
Amy Huang	a3d7cee7f9	[CodeView] Emit function types in -gline-tables-only. This change adds function types to further differentiate between FUNC_IDs in -gline-tables-only. Size increase of object files in clang are Before: 917990 kb After: 999312 kb Bug: https://bugs.llvm.org/show_bug.cgi?id=48432 Differential Revision: https://reviews.llvm.org/D95001	2021-01-20 12:47:35 -08:00
peter klausler	0996b590aa	[flang] Infrastructure improvements in utility routines * IsArrayElement() needs another option to control whether it should ignore trailing component references. * Add IsObjectPointer(). * Add const Scope& variants of IsFunction() and IsProcedure(). * Make TypeAndShape::Characterize() work with procedure bindings. * Handle CHARACTER length in MeasureSizeInBytes(). * Fine-tune FindExternallyVisibleObject()'s handling of dummy arguments to conform with Fortran 2018: only INTENT(IN) and dummy pointers in pure functions signify; update two tests accordingly. Also: resolve some stylistic inconsistencies and add a missing "const" in the expression traversal template framework. Differential Revision: https://reviews.llvm.org/D95011	2021-01-20 12:40:25 -08:00
Nicolas Vasilache	8dd58a509c	[mlir][Linalg] NFC - Fully compose map and operands when creating AffineMin in tiling. This may simplify the composition of patterns but is otherwise NFC.	2021-01-20 20:36:18 +00:00

1 2 3 4 5 ...

377605 Commits All Branches Search

377605 Commits

All Branches