llvm-project

Commit Graph

Author	SHA1	Message	Date
River Riddle	c8c45985fb	[mlir][Type] Remove usages of Type::getKind This is in preparation for removing the use of "kinds" within attributes and types in MLIR. Differential Revision: https://reviews.llvm.org/D85475	2020-08-07 13:43:25 -07:00
River Riddle	fff39b62bb	[mlir][Attribute] Remove usages of Attribute::getKind This is in preparation for removing the use of "kinds" within attributes and types in MLIR. Differential Revision: https://reviews.llvm.org/D85370	2020-08-07 13:43:25 -07:00
River Riddle	1d6a8deb41	[mlir] Remove the need to define `kindof` on attribute and type classes. This revision refactors the default definition of the attribute and type `classof` methods to use the TypeID of the concrete class instead of invoking the `kindof` method. The TypeID is already used as part of uniquing, and this allows for removing the need for users to define any of the type casting utilities themselves. Differential Revision: https://reviews.llvm.org/D85356	2020-08-07 13:43:25 -07:00
River Riddle	dd48773396	[mlir][Types] Remove the subclass data from Type Subclass data is useful when a certain amount of memory is allocated, but not all of it is used. In the case of Type, that hasn't been the case for a while and the subclass is just taking up a full `unsigned`. Removing this frees up ~8 bytes for almost every type instance. Differential Revision: https://reviews.llvm.org/D85348	2020-08-07 13:43:25 -07:00
River Riddle	9f24640b7e	[mlir] Add a utility class, ThreadLocalCache, for storing non static thread local objects. This class allows for defining thread local objects that have a set non-static lifetime. This internals of the cache use a static thread_local map between the various different non-static objects and the desired value type. When a non-static object destructs, it simply nulls out the entry in the static map. This will leave an entry in the map, but erase any of the data for the associated value. The current use cases for this are in the MLIRContext, meaning that the number of items in the static map is ~1-2 which aren't particularly costly enough to warrant the complexity of pruning. If a use case arises that requires pruning of the map, the functionality can be added. This is especially useful in the context of MLIR for implementing thread-local caching of context level objects that would otherwise have very high lock contention. This revision adds a thread local cache in the MLIRContext for attributes, identifiers, and types to reduce some of the locking burden. This led to a speedup of several hundred miliseconds when compiling a conversion pass on a very large mlir module(>300K operations). Differential Revision: https://reviews.llvm.org/D82597	2020-08-07 13:43:25 -07:00
River Riddle	86646be315	[mlir] Refactor StorageUniquer to require registration of possible storage types This allows for bucketing the different possible storage types, with each bucket having its own allocator/mutex/instance map. This greatly reduces the amount of lock contention when multi-threading is enabled. On some non-trivial .mlir modules (>300K operations), this led to a compile time decrease of a single conversion pass by around half a second(>25%). Differential Revision: https://reviews.llvm.org/D82596	2020-08-07 13:43:24 -07:00
Fangrui Song	164a02d0fa	[ELF]: --icf: don't fold sections referencing sections with LCDA after D84610	2020-08-07 13:42:25 -07:00
Matt Arsenault	5a0b1472c0	GlobalISel: Handle zext(sext x) in artifact combiner This eliminates the illegal intermediate s8 value in the added test.	2020-08-07 16:37:46 -04:00
Adrian Prantl	968cba8e89	lldbutil: add a retry mechanism for the ios simulator We've been seeing this failure on green dragon when the system is under high load. Unfortunately this is outside of LLDB's control. Differential Revision: https://reviews.llvm.org/D85542	2020-08-07 13:28:46 -07:00
Sameer Arora	d6c00edf2e	[FileCheck] Add docs for --allow-empty This diff adds documentation for `allow-empty` flag under FileCheck docs. Reviewed by jhenderson, smeenai, thopre Differential Revision: https://reviews.llvm.org/D83682	2020-08-07 13:27:57 -07:00
peter klausler	43b304b09f	[flang] Support DATA statement initialization of numeric with Hollerith/CHARACTER This is a common Fortran language extension. Differential Revision: https://reviews.llvm.org/D85492	2020-08-07 13:17:36 -07:00
cgyurgyik	dc13a9a781	[libc] Add strcpsn and strpbrk implementation. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D85386	2020-08-07 16:14:32 -04:00
peter klausler	cc01194c2f	[flang] Descriptor-based I/O data item transfers Add support for OutputDescriptor() and InputDescriptor() in the I/O runtime. Change existing scalar formatted I/O functions to drive descriptor-based I/O routines internally. Differential Revision: https://reviews.llvm.org/D85491	2020-08-07 13:09:09 -07:00
Jonas Devlieghere	e3eb3cf550	[lldb] Only check for --apple-sdk argument on Darwin	2020-08-07 13:05:42 -07:00
Sameer Arora	bb4b70f792	[llvm-install-name-tool] Adds docs for llvm-install-name-tool Adding documentation for llvm-install-name-tool. Reviewed by smeenai, Ktwu Differential Revision: https://reviews.llvm.org/D81944	2020-08-07 12:51:58 -07:00
Gui Andrade	17ff170e3a	Revert "[MSAN] Instrument libatomic load/store calls" Problems with instrumenting atomic_load when the call has no successor, blocking compiler roll This reverts commit `33d239513c`.	2020-08-07 19:45:51 +00:00
Tim Shen	b53fd9cdba	[MLIR] Add getSizeInBits() for tensor of complex Differential Revision: https://reviews.llvm.org/D85382	2020-08-07 12:38:49 -07:00
Konrad Dobros	9414a71aaa	[mlir][spirv] Add correct handling of Kernel and Addresses capabilities This change adds initial support needed to generate OpenCL compliant SPIRV. If Kernel capability is declared then memory model becomes OpenCL. If Addresses capability is declared then addressing model becomes Physical64. Additionally for Kernel capability interface variable ABI attributes are not generated as entry point function is expected to have normal arguments. Differential Revision: https://reviews.llvm.org/D85196	2020-08-07 12:29:21 -07:00
peter klausler	0e9e06a6d4	[flang][NFC] Reformat files with current clang-format Differential Revision: https://reviews.llvm.org/D85489	2020-08-07 12:10:26 -07:00
LLVM GN Syncbot	7764b52cbd	[gn build] Port `320eab2d55`	2020-08-07 19:01:40 +00:00
Yuanfang Chen	320eab2d55	Revert "[NewPM][CodeGen] Introduce machine pass and machine pass manager" This reverts commit `911565d108`. Broke some non-Linux bots.	2020-08-07 11:59:58 -07:00
Nicolas Vasilache	2a01d7f7b6	[mlir][SCF] Add utility to outline the then and else branches of an scf.IfOp Differential Revision: https://reviews.llvm.org/D85449	2020-08-07 14:49:49 -04:00
Jianzhou Zhao	aedaa077f5	Reduce dropTriviallyDeadConstantArrays cumulative time percentage from 17% to 4% The history of dropTriviallyDeadConstantArrays is like this. Because the appending linkage uses too much memory (http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20150105/251381.html), dropTriviallyDeadConstantArrays was introduced (https://reviews.llvm.org/rG81f385b0c6ea37dd7195a65be162c75bbdef29d2) to release unused constant arrays. Recently, dropTriviallyDeadConstantArrays was improved (https://reviews.llvm.org/rG81f385b0c6ea37dd7195a65be162c75bbdef29d2) to reduce its quadratic cost. Our recent LTO profiling shows that when a target is large, 15-20% of time cost is from the SetVector::insert called by dropTriviallyDeadConstantArrays. A large application has hundreds or thousands of modules; each module calls dropTriviallyDeadConstantArrays once for cleaning up tens of thousands of ConstantArrays a module has. In those ConstantArrays, usually around 5 can be deleted; a very very few deleted ConstantArrays reference other ConstantArrays: less than 10 out of millions. Given this, the cost of SetVector::insert is mainly from the construction of WorkList from ArrayConstants. This motivated the fix that iterates ArrayConstants directly, and uses WorkList only when necessary. Our evaluation shows that 1) The cumulative time percentage of dropTriviallyDeadConstantArrays is reduced from 15-17% to 4-6%. 2) For targets with LTO time > 20min, the time reduction is about 20%. 3) No observable performance impact for build without using LTO. {F12506218} {F12506221} Reviewed By: mehdi_amini, tejohnson, jdoerfert Differential Revision: https://reviews.llvm.org/D85379	2020-08-07 11:36:30 -07:00
Nicolas Vasilache	3110e7b077	[mlir] Introduce AffineMinSCF folding as a pattern This revision adds a folding pattern to replace affine.min ops by the actual min value, when it can be determined statically from the strides and bounds of enclosing scf loop . This matches the type of expressions that Linalg produces during tiling and simplifies boundary checks. For now Linalg depends both on Affine and SCF but they do not depend on each other, so the pattern is added there. In the future this will move to a more appropriate place when it is determined. The canonicalization of AffineMinOp operations in the context of enclosing scf.for and scf.parallel proceeds by: 1. building an affine map where uses of the induction variable of a loop are replaced by `%lb + %step * floordiv(%iv - %lb, %step)` expressions. 2. checking if any of the results of this affine map divides all the other results (in which case it is also guaranteed to be the min). 3. replacing the AffineMinOp by the result of (2). The algorithm is functional in simple parametric tiling cases by using semi-affine maps. However simplifications of such semi-affine maps are not yet available and the canonicalization does not succeed yet. Differential Revision: https://reviews.llvm.org/D82009	2020-08-07 14:30:38 -04:00
Arthur Eubanks	1bf4629f11	[PPC] Rename bool-ret-to-int -> ppc-bool-ret-to-int Reviewed By: #powerpc, nemanjai Differential Revision: https://reviews.llvm.org/D85391	2020-08-07 11:27:05 -07:00
LLVM GN Syncbot	cc5f6252c7	[gn build] Port `911565d108`	2020-08-07 18:22:24 +00:00
Arthur Eubanks	2b5502c350	[NFC] Use value initializer for OVERLAPPED To fix ../llvm/lib/Support/Windows/Path.inc(1265,21): warning: missing field 'InternalHigh' initializer [-Wmissing-field-initializers] OVERLAPPED OV = {0}; Differential Revision: https://reviews.llvm.org/D85480	2020-08-07 11:18:33 -07:00
Vang Thao	04bd5b5286	[AMDGPU] Fix not rescheduling without clustering Regions are sometimes skipped which should be rescheduled without memory op clustering. RegionIdx is not incremented when iterating over regions that are flagged to be skipped, causing the index to be incorrect. Thanks to Vang Thao for discovering this bug! Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D85498	2020-08-07 11:15:58 -07:00
Jonas Devlieghere	f1d525734f	[lldb] Store the Apple SDK in dotest's configuration. This patch stores the --apple-sdk argument in the dotest configuration. When it's set, use it instead of the triple to determine the current platform. Differential revision: https://reviews.llvm.org/D85537	2020-08-07 11:13:18 -07:00
Zequan Wu	c354b2e3bf	[Clang] Add note for bad conversion when expression is pointer to forward-declared type Differential Revision: https://reviews.llvm.org/D85390	2020-08-07 11:06:08 -07:00
Eduardo Caldas	8abb5fb68f	[SyntaxTree] Use simplified grammar rule for `NestedNameSpecifier` grammar nodes This is our grammar rule for nested-name-specifiers: globalbal-specifier: /empty/ simple-template-specifier: template_opt simple-template-id name-specifier: global-specifier decltype-specifier identifier simple-template-specifier nested-name-specifier: list(name-specifier, ::, non-empty, terminated) It is a relaxed version of C++ [expr.prim.id] and quite simpler to map to our API. TODO: refine name specifiers, `simple-template-name-specifier` and decltype-name-specifier` are token soup for now.	2020-08-07 18:05:47 +00:00
Jez Ng	25367dfefb	[lld-macho] Add .tbd support for frameworks Required for e.g. linking iOS apps since they don't have a platform-native SDK Reviewed By: #lld-macho, compnerd, smeenai Differential Revision: https://reviews.llvm.org/D85153	2020-08-07 11:04:54 -07:00
Jez Ng	ca85e37338	[lld-macho] Support static linking of thread-locals Note: What ELF refers to as "TLS", Mach-O seems to refer to as "TLV", i.e. thread-local variables. This diff implements support for TLV relocations that reference defined symbols. On x86_64, TLV relocations are always used with movq opcodes, so for defined TLVs, we don't need to create a synthetic section to store the addresses of the symbols -- we can just convert the `movq` to a `leaq`. One notable quirk of Mach-O's TLVs is that absolute-address relocations inside TLV-defining sections behave differently -- their addresses are no longer absolute, but relative to the start of the target section. (AFAICT, RIP-relative relocations are not allowed in these sections.) Reviewed By: #lld-macho, compnerd, smeenai Differential Revision: https://reviews.llvm.org/D85080	2020-08-07 11:04:52 -07:00
Jez Ng	4e43f18048	[lld-macho] Ensure .tbss sections are also considered as ZeroFilled This diff makes the behavior in {D80859} and {D81888} apply to thread-local ZeroFill sections too. I realized this was necessary whie trying to implement thread-local variables. Reviewed By: #lld-macho, compnerd, MaskRay Differential Revision: https://reviews.llvm.org/D85079	2020-08-07 11:04:41 -07:00
Yuanfang Chen	911565d108	[NewPM][CodeGen] Introduce machine pass and machine pass manager machine pass could define four methods: - `PreservedAnalyses run(MachineFunction &, MachineFunctionAnalysisManager &)` - `Error doInitialization(Module &, MachineFunctionAnalysisManager &)` - `Error doFinalization(Module &, MachineFunctionAnalysisManager &)` - `Error run(Module &, MachineFunctionAnalysisManager &)` machine pass manger: - MachineFunctionAnalysisManager: Basically an AnalysisManager<MachineFunction> augmented with the ability to register and query IR analyses - MachineFunctionPassManager: support only two methods, `addPass` and `run` Reviewed By: arsenm, asbirlea, aeubanks Differential Revision: https://reviews.llvm.org/D67687	2020-08-07 11:00:31 -07:00
Yuanfang Chen	954bd9c861	[NewPM] Only verify loop for nonskipped user loop pass No verification for pass mangers since it is not needed. No verification for skipped loop pass since the asserted condition is not used. Add a BeforeNonSkippedPass callback for this. The callback needs more inputs than its parameters to work so the callback is added on-the-fly. Reviewed By: aeubanks, asbirlea Differential Revision: https://reviews.llvm.org/D84977	2020-08-07 11:00:31 -07:00
Mitch Phillips	382df1c674	Revert "Reland D64327 [MC][ELF] Allow STT_SECTION referencing SHF_MERGE on REL targets" This reverts commit `b497665d98`. Spent some time trying to reproduce this locally, reverting in a desparate attempt to fix the sanitizer buildbot: - http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux/builds/28828 I don't know exactly why or how this patch breaks the bots, but it seems pretty concrete that it's the culprit.	2020-08-07 10:56:33 -07:00
Yaxun (Sam) Liu	ac3e720dc1	Make clang HIP headers compatible with C++98 Automation to detect compiler features, such as CMake's target_compile_features, would attempt to detect compiler features by explicitly using langugage flags. This change ensures that the HIP headers would still work with C++98. Patch by Siu Chi Chan Differential Revision: https://reviews.llvm.org/D85471 Change-Id: I304e964b18a525b0fde55efd841da74b6c4dc8ed	2020-08-07 13:50:22 -04:00
Artem Dergachev	47cadd6106	[analyzer] pr47030: MoveChecker: Unforget a comma in the suppression list.	2020-08-07 10:39:28 -07:00
Tim Keith	cf03bcf929	[flang] Remove extra CMAKE_CXX_FLAGS in Lower and Optimizer `-Wno-error` and `-Wno-unused-parameter` appear to no longer be needed for Lower and Optimizer. Differential Revision: https://reviews.llvm.org/D85465	2020-08-07 10:21:54 -07:00
Tyker	7d0f69118e	[NFC] Add utility to sum/merge stats files Add a small script to sum .stats file given as input and output the totals usage example: merge-stats.py $(find ./builddir/ -name ".stats") > total.stats Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D83505	2020-08-07 19:02:42 +02:00
aartbik	c3c95b9c80	[mlir] [VectorOps] Improve lowering of extract_strided_slice (and friends like shape_cast) Using a shuffle for the last recursive step in progressive lowering not only results in much more compact IR, but also more efficient code (since the backend is no longer confused on subvector aliasing for longer vectors). E.g. the following %f = vector.shape_cast %v0: vector<1024xf32> to vector<32x32xf32> yields much better x86-64 code that runs 3x faster than the original. Reviewed By: bkramer, nicolasvasilache Differential Revision: https://reviews.llvm.org/D85482	2020-08-07 09:21:05 -07:00
David Green	25e38c3f3c	[ARM] Extra reduction plus tailpredication tests. NFC	2020-08-07 17:16:56 +01:00
Amy Kwan	98eccec3ae	[PowerPC] Add Vector Extract/Expand/Count with Mask, Move to VSR Mask Instruction Definitions and MC Tests This patch adds the instruction definitions and assembly/disassembly tests for the following set of instructions: Vector Extract [byte \| half \| word \| doubleword \| quad] with mask Vector Expand [byte \| half \| word \| doubleword \| quad] with mask Move to VSR [byte \| byte immediate \| half \| word \| doubleword \| quad] with mask Vector Count Mask Bits [byte \| half \| word \| doubleword] Differential Revision: https://reviews.llvm.org/D83724	2020-08-07 11:02:08 -05:00
Mehdi Amini	575b22b5d1	Revisit Dialect registration: require and store a TypeID on dialects This patch moves the registration to a method in the MLIRContext: getOrCreateDialect<ConcreteDialect>() This method requires dialect to provide a static getDialectNamespace() and store a TypeID on the Dialect itself, which allows to lazyily create a dialect when not yet loaded in the context. As a side effect, it means that duplicated registration of the same dialect is not an issue anymore. To limit the boilerplate, TableGen dialect generation is modified to emit the constructor entirely and invoke separately a "init()" method that the user implements. Differential Revision: https://reviews.llvm.org/D85495	2020-08-07 15:57:08 +00:00
Kamau Bridgeman	d8c6d083c9	[PowerPC][PCRelative] Set TLS unsupported with PC relative memops Introduce a fatal error if any thread local storage code is compiled using pc relative memory operations as well as a hidden override option `-enable-ppc-pcrel-tls` so that this support can be incrementally added if possible. Reviewed By: #powerpc, nemanjai Differential Revision: https://reviews.llvm.org/D85448	2020-08-07 10:56:24 -05:00
Alexey Bataev	4a7aedb843	[OPENMP]Simplify representation for atomic, critical, master and section constrcut. Several constructs may be represented wityout relying on CapturedStmt. It saves memory and improves compilation speed.	2020-08-07 09:58:23 -04:00
Victor Huang	6c64f05b90	[PowerPC] Add compatibility check for PPC PLT stubs Compatibility checks for PPC64PltCallStub and PPC64PCRelPLTStub are added in this patch to prevent the usage of incompatible thunk/stub. Reviewed By: sfertile, nemanjai, stefanp Differential Revision: https://reviews.llvm.org/D85459	2020-08-07 13:45:18 +00:00
Jay Foad	ffe1edfc53	[NFC][GVN] Fix "avaliable" typos Differential Revision: https://reviews.llvm.org/D85520	2020-08-07 14:22:24 +01:00
Bevin Hansson	aa0d19a0c8	[Fixed Point] Add fixed-point shift operations and consteval. Reviewers: rjmccall, leonardchan, bjope Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D83212	2020-08-07 15:09:24 +02:00

1 2 3 4 5 ...

362951 Commits All Branches Search

362951 Commits

All Branches