llvm-project

Commit Graph

Author	SHA1	Message	Date
Aart Bik	779dcd2ecc	[mlir][sparse] move sparse tensor rewriting into its own pass Makes individual testing and debugging easier. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D135319	2022-10-05 14:52:55 -07:00
Murali Vijayaraghavan	617ca92bf1	Revert "Added canonicalization for vector.multi_reduction" This reverts commit `c16f3260a9`. There's a bug in the commit creates a scalar result with `ShapeCastOp`. Reverting till that fix is done.	2022-10-05 21:43:51 +00:00
Quentin Colombet	6e440ee2aa	[RISCV][ISel] Finally fix the UBSan error Forgot another SDValue check and a boolean initialization.	2022-10-05 21:43:09 +00:00
Valentin Clement	91dd872ea6	[flang] Keep current polymorphic implementation under a flag It is useful for couple of test suite like NAG to keep failing with a TODO until the polymorphic entities is implemented all the way done to codegen. This pass adds a flag to LoweringOptions for experimental development. This flag is off by default and can be enable in `bbc` with `-polymorphic-type`. Options can be added in the driver and tco when needed. Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D135283	2022-10-05 23:05:28 +02:00
Sam McCall	60b4b39f5a	[clangd] Avoid lexicographic compare when sorting SymbolIDs. NFC These are 8 bytes and we don't care about the actual ordering, so use integer compare. The array generated code has some extra byte swaps (clang), calls memcmp (gcc) or inlines a big chain of comparisons (MSVC): https://godbolt.org/z/e79r6jM6K	2022-10-05 23:02:28 +02:00
Quentin Colombet	6bbe7d376e	[RISCV][ISel] Attempt to fix UBSan error Explicitly check an SDValue with the invalid SDValue. UBSan reports: runtime error: load of value 36, which is not a valid value for type 'bool' https://lab.llvm.org/buildbot/#/builders/85/builds/11231	2022-10-05 20:59:28 +00:00
Quentin Colombet	c5c2de287e	[RISCV][ISel] Fold extensions when all the users can consume them This patch allows the combines that fold extensions in binary operations to have more than one use. The approach here is pretty conservative: if all the users of an extension can fold the extension, then the folding is done, otherwise we don't fold. This is the first step towards avoiding the one-use limitation. As a result, we make a decision to fold/don't fold for a web of instructions. An instruction is part of the web of instructions as soon as it consumes an extension that needs to be folded for all its users. Because of how SDISel works a web of instructions can be visited over and over. More precisely, if the folding happens, it happens for the whole web and that's the end of it, but if the folding fails, the whole web may be revisited when another member of the web is visited. To avoid a compile time explosion in pathological cases, we bail out earlier for webs that are bigger than a given threshold (arbitrarily set at 18 for now.) This size can be changed using `--riscv-lower-ext-max-web-size=<maxWebSize>`. At the current time, I didn't see a better scheme for that. Assuming we want to stick with doing that in SDISel. Differential Revision: https://reviews.llvm.org/D133739	2022-10-05 20:49:21 +00:00
David Green	03b145480d	[AArch64] Add tablegen patterns for bf16 trn/zip/uzp. This adds some missing tablegen patterns to handle trn1/trn2/zip1/zip2/uzp1/uzp2, similar to the Arm handling in `5e1a9d319d`, but via tablegen patterns for the AArch64 backend.	2022-10-05 21:47:36 +01:00
oToToT	0da59bb865	[libc++] Fix wrong implementation of CityHash As PR56606 stated, the current implementation of CityHash in libc++ would drop some bits unintentionally. Cast the 32bit int to the 64bit int to avoid this happened. Reviewed By: ldionne, #libc Differential Revision: https://reviews.llvm.org/D134124	2022-10-06 04:32:04 +08:00
David Blaikie	b61860e63e	Use inheriting ctors for OSTargetInfo (& remove PSPTargetInfo because it's unused - it had the wrong ctor in it anyway, so wouldn't've been able to be instantiated - must've happened due to bitrot over the years)	2022-10-05 20:22:19 +00:00
Mark de Wever	a1beb0a382	[libc++][format] Implements formattable concept. This concept is introduced in P2286, but was implemented in libc++ before. This implementation was used in the library internally. This implementation lacked the resolution of LWG3636. The original formatter had a non-const member function that wasn't trivial to make a const member. The recent parser improvements made this member a const member in preparation of LWG3636. Note LWG3636 isn't voted in. Its status is Ready. P2286's concept has been written as-if LWG3636 is accepted and refers to that LWG issue. Updates some tests make format a const member function and removes a tests that's mainly a duplicate of the formattable concept test. Implements - LWG3636 formatter<T>::format should be const-qualified Implements parts of - P2286R8 Formatting Ranges Reviewed By: ldionne, #libc Differential Revision: https://reviews.llvm.org/D134110	2022-10-05 22:21:00 +02:00
TatWai Chong	ff23599a0d	[mlir][tosa] Update TOSA resize to match specification Attribute stride and shift are removed, and has new scale and border. Signed-off-by: TatWai Chong <tatwai.chong@arm.com> Change-Id: I6cdbeb3978f5ee540bc6cf59eb7c273eb0131430 Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D131629	2022-10-05 13:18:00 -07:00
Ben Langmuir	5ea78c4113	[clang] Update ModuleMap::getModuleMapFile* to use FileEntryRef Update SourceManager::ContentCache::OrigEntry to keep the original FileEntryRef, and use that to enable ModuleMap::getModuleMapFile* to return the original FileEntryRef. This change should be NFC for most users of SourceManager::ContentCache, but it could affect behaviour for users of getNameAsRequested such as in compileModuleImpl. I have not found a way to detect that difference without additional functional changes, other than incidental cases like changes from / to \ on Windows so there is no new test. Differential Revision: https://reviews.llvm.org/D135220	2022-10-05 13:12:43 -07:00
Filipp Zhinkin	1888dc91ac	[AArch64] Add tests for i128 comparison; NFC Baseline tests for D135302.	2022-10-05 23:11:56 +03:00
River Riddle	54cdc03dfa	[mlir:Parser] Always splice parsed operations to the end of the parsed block The current splicing behavior dates back to when all blocks had terminators, so we would "helpfully" splice before the terminator. This doesn't make sense anymore, and leads to somewhat unexpected results when parsing multiple pieces of IR into the same block. Differential Revision: https://reviews.llvm.org/D135096	2022-10-05 13:11:38 -07:00
Zixu Wang	5301826fa8	[clang][ExtractAPI] Don't print locations for anonymous tags ExtractAPI doesn't care about locations of anonymous TagDecls. Set the printing policy to exclude that from anonymous decl names. Differential Revision: https://reviews.llvm.org/D135295	2022-10-05 13:11:21 -07:00
Ivan Butygin	a93ec06ae6	[mlir][gpu] Introduce `host_shared` flag to `gpu.alloc` Motivation: we have lowering pipeline based on upstream gpu and spirv dialects and and we are using host shared gpu memory to transfer data between host and device. Add `host_shared` flag to `gpu.alloc` to distinguish between shared and device-only gpu memory allocations. Differential Revision: https://reviews.llvm.org/D133533	2022-10-05 22:01:30 +02:00
Slava Zakharin	606245ad54	[flang] Fixed build issue after `88f07a736b` nullptr matches both against ::mlir::UnitAttr and ::mlir::TypeRange, so the following two candidates fit: static void mlir::omp::OrderedRegionOp::build(::mlir::OpBuilder &odsBuilder, ::mlir::OperationState &odsState, /optional/::mlir::UnitAttr simd) static void mlir::omp::OrderedRegionOp::build(::mlir::OpBuilder &odsBuilder, ::mlir::OperationState &odsState, ::mlir::TypeRange resultTypes, /optional/bool simd = false)	2022-10-05 12:59:31 -07:00
Argyrios Kyrtzidis	371883f46d	[clang/Sema] Fix non-deterministic order for certain kind of diagnostics In the context of caching clang invocations it is important to emit diagnostics in deterministic order; the same clang invocation should result in the same diagnostic output. rdar://100336989 Differential Revision: https://reviews.llvm.org/D135118	2022-10-05 12:58:01 -07:00
Joseph Huber	9223315903	[DeviceRTL] Allow IsSPMDMode to be optimized out in LTO mode A previous patch merged the static and bitcode versions of the deviceRTL. We previously used the static library's separate compilation to set a special flag that prevented `IsSPMDMode` from being put in the used list and preventing it from being optimized out. When they were merged we could no longer do this separate compilation that allowed users of LTO to get more optimal code. This patch rearranges the code. The `IsSPMDMode` global is now transitively used by its inclusion in the changed `__keep_alive` function. This allows us to then manually delete the `__keep_alive` function from the module when building the static library via `llvm-extract`. The result is that the bitcode library correctly will maintain the needed shared state, while the static library will be able to internalize it and optimize it out. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D135280	2022-10-05 14:40:01 -05:00
Joseph Huber	a8ec170e01	[OpenMP] Make the exec_mode global have protected visibility We use protected visibility for almost everything with offloading. This is because it provides us with the ability to read things from the host without the expectation that it will be preempted by a shared library load, bugs related to this have happened when offloading to the host. This patch just makes the `exec_mode` global generated for each plugin have protected visibility. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D135285	2022-10-05 14:39:22 -05:00
Sam McCall	5f7f4846e8	[clangd] Fix non-idempotent cases of canonicalRenameDecl() The simplest way to ensure full canonicalization is to canonicalize recursively in most cases. This fixes an assertion failure and presumably correctness bugs. It does show up that D132797's index-based virtual method renames doesn't handle templates well (the AST behavior is different and IMO better). We could choose to disable in this case or change the index behavior, but this patch doesn't do either. Differential Revision: https://reviews.llvm.org/D133415	2022-10-05 21:22:08 +02:00
Jakub Kuderski	e99e8ad24d	[mlir][arith] Add shli support to WIE Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D135234	2022-10-05 15:09:58 -04:00
Diego Caballero	2cdd246a39	[mlir][NFC] Make 'printOp' public in AsmPrinter This patch moves the 'printOp' functionality to the public API of AsmPrinter and rename it to 'printCustomOrGenericOp'. No 'parseOp' is needed at this time as existing APIs are able to parse operations producing results where results are omitted in the textual form (the LHS of an operation is redundant when it comes to building the operation itself as it only contains the result names). Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D135006	2022-10-05 19:00:53 +00:00
Sam McCall	6ea83fc98f	[clangd] Don't clone SymbolSlab::Builder arenas when finalizing. SymbolSlab::Builder has an arena to store strings of owned symbols, and deduplicates them. build() copies all the strings and deduplicates them again! This is potentially useful: we may have overwritten a symbol and rendered some strings unreachable. However in practice this is not the case. When testing on a variety of files in LLVM (e.g. SemaExpr.cpp), the strings for the full preamble index are 3MB and shrink by 0.4% (12KB). For comparison the serializde preamble is >50MB. There are also hundreds of smaller slabs (file sharding) that do not shrink at all. CPU time spent on this is significant (something like 3-5% of buildPreamble). We're better off not bothering. Differential Revision: https://reviews.llvm.org/D135231	2022-10-05 20:55:53 +02:00
Leonard Chan	34004d2d03	Fix build failures from `4f6477a615`.	2022-10-05 18:54:28 +00:00
Murali Vijayaraghavan	c16f3260a9	Added canonicalization for vector.multi_reduction If there are reductions only along unit dimensions, then they are folded Reviewed By: dcaballe Differential Revision: https://reviews.llvm.org/D134996	2022-10-05 18:43:33 +00:00
Mats Petersson	4d1460c77d	Revert "[flang] Add -fpass-plugin option to Flang frontend" This reverts commit `43fe6f7cc3`. Reverting this as CI breaks. To reproduce, run check-flang, and it will fail with an error saying .../lib/Bye.so not found in pass-plugin.f90	2022-10-05 19:43:02 +01:00
Leonard Chan	4f6477a615	[llvm][NFC] Consolidate equivalent function type parsing code into single function Differential Revision: https://reviews.llvm.org/D135296	2022-10-05 18:41:19 +00:00
Jakub Kuderski	40126e66b6	[mlir][arith] Add andi, ori, and xori support to WIE Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D135204	2022-10-05 14:34:42 -04:00
Ellis Hoag	7ca3fd72c7	[Dwarf] Remove unnecessary module flags from test These extra module flags are not needed for this test, so remove them. In fact, leaving them in produces the following error message: > invalid behavior operand in module flag (unexpected constant) Differential Revision: https://reviews.llvm.org/D135294	2022-10-05 11:18:44 -07:00
Siva Chandra Reddy	3e1986682f	[libc][Obvious] Add "__" prefix to sched_getcpucount in the spec and elsewhere. Without this fix, the declaration in sched.h will not have the "__" prefix and will cause a compile failure. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D135286	2022-10-05 18:01:13 +00:00
Rob Suderman	684e8bfabb	[mlir][mlprogram] Add CAPI project for MLProgram Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D135291	2022-10-05 10:55:29 -07:00
Quentin Colombet	4852f26acd	[RISCV][ISel] Refactor the formation of VW operations This patch centralizes all the combines of add\|sub\|mul with extended operands in one "framework". The rationale for this change is to offer a one-stop-shop for all these transformations so that, in the future, it is easier to make combine decisions for a web of instructions (i.e., instructions connected through s\|zext operands). Technically this patch is not NFC because the new version is more powerful than the previous version. In particular, it diverges in two cases: - VWMULSU can now also be produced from `mul(splat, zext)`, whereas previously only `mul(sext, splat)` were supported when `splat`s were involved. (As demonstrated in rvv/fixed-vectors-vwmulsu.ll) - VWSUB(U) can now also be produced from `sub(splat, ext)`, whereas previously only `sub(ext, splat)` were supported when `splat`s were involved. (As demonstrated in rvv/fixed-vectors-vwsub.ll) If we wanted, we could block these transformations to make this patch really NFC. For instance, we could do something similar to `AllowSplatInVW_W`, which prevents the combines to form vw(add\|sub)(u)_w when the RHS is a splat. Regarding the "framework" itself, the bulk of the patch is some boilderplate code that abstracts away the actual extensions that are present in the DAG. This allows us to handle `vwadd_w(ext a, b)` as if it was a regular `add(ext a, ext b)`. Since the node `ext b` doesn't actually exist in the DAG, we have a bunch of methods (all in the NodeExtensionHelper class) that fake all that for us. The other half of the change is around `CombineToTry` and `CombineResult`. These helper structures respectively: - Represent the kind of combines that can be applied to a node, and - Store what needs to happen to do that combine. This can be viewed as a two step approach: - First, check if a pattern applies, and - Second apply it. The checks and the materialization of the combines are decoupled so that in the future we can perform several checks and do all the related applies in one go. Differential Revision: https://reviews.llvm.org/D134703	2022-10-05 17:43:48 +00:00
Jacques Pienaar	88f07a736b	[mlir] Make UnitAttr's default val in unwrapped builder UnitAttr is optional but unwrapped builders require it. Make Change onstructing from bool as required for when not set at moment (for UnitAttr nothing needs to be constructed, this is true for others here too and can be addressed together). Differential Revision: https://reviews.llvm.org/D135058	2022-10-05 10:40:58 -07:00
Akira Hatanaka	65a961f719	[Sema][ObjC] Fix assertion failure in getCommonNonSugarTypeNode Instead of checking that the protocols of both types are all equal, check that the canonical decls are equal.	2022-10-05 10:38:25 -07:00
Nico Weber	79d87740b7	[aarch64] add missing run line to a test The CHECK-IOS lines were added in `1c353419ab`, but without a matching FileCheck invocation. Add it. The dead CHECK-IOS lines were found by Daniel Bertalan. Differential Revision: https://reviews.llvm.org/D133772	2022-10-05 13:36:59 -04:00
Mahesh Ravishankar	827ab049dc	Add `const` to `dump` method of `OpFoldResult`. While most `dump` methods are marked `const`, some arent marked as `const`. Adding `const` to `OpFoldResult` here since this was encountered as an issue while debugging (doing `dump` within a debug console threw an error indicating the method should be marked `const`). Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D135241	2022-10-05 17:26:34 +00:00
Mahesh Ravishankar	a0ef8af8d5	[mlir][Linalg] Expose vectorization precondition check as a utility function. This patch exposes the method to check if an op can be vectorized or not for downstream uses. Also adds a check to mark elementwise operations that have non-vectorizable ops (like `tensor.extract`) as non vectorizable. Reviewed By: nicolasvasilache, dcaballe, ThomasRaoux Differential Revision: https://reviews.llvm.org/D135201	2022-10-05 17:24:52 +00:00
Aiden Grossman	f741815ddb	[lld][ELF] Fix lazy ThinLTO index writing in thin archives Currently when the --thinlto-emit-index-files is used with LLD and a thin archive is passed containing references to object files to link against where the object files are in a different folder than the thin archive and some of the archives aren't linked against (ie stay lazy), the empty index file writer ends up trying to write to a path that doesn't exist. This patch changes the behavior of that function to use the path of the obj member of the BitcodeFile object rather than just the path of the BitcodeFile object itself, which matches the behavior of the default (non-lazy) case. Fixes #57963 Regression test added. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D135014	2022-10-05 17:12:15 +00:00
Vitaly Buka	68f4ceaf9b	Revert "[compiler-rt][test] Heed COMPILER_RT_DEBUG when compiling unittests" Breaks some bots, details in https://reviews.llvm.org/D91620 This reverts commit `93b1256e38`.	2022-10-05 09:59:25 -07:00
Vitaly Buka	0df37528eb	Revert "[mlir][sparse] Restore case coverage warning fix" Breaks https://lab.llvm.org/buildbot/#/builders/168/builds/9288 This reverts commit `83839700c3`.	2022-10-05 09:53:58 -07:00
Aart Bik	c48e90877f	[mlir][sparse] introduce a higher-order tensor mapping This extension to the sparse tensor type system in MLIR opens up a whole new set of sparse storage schemes, such as block sparse storage (e.g. BCSR) and ELL (aka jagged diagonals). This revision merely introduces the type extension and initial documentation. The actual interpretation of the type (reading in tensors, lowering to code, etc.) will follow. Reviewed By: Peiming Differential Revision: https://reviews.llvm.org/D135206	2022-10-05 09:40:51 -07:00
Mark de Wever	1522f190eb	[libc++][chrono] Implements formatter month. Partially implements: - P1361 Integration of chrono with text formatting Reviewed By: ldionne, #libc Differential Revision: https://reviews.llvm.org/D134138	2022-10-05 18:35:50 +02:00
Sam McCall	41b51007e6	Fix SourceManager::isBeforeInTranslationUnit bug with token-pasting isBeforeInTranslationUnit compares SourceLocations across FileIDs by mapping them onto a common ancestor file, following include/expansion edges. It is possible to get a tie in the common ancestor, because multiple "chunks" of a macro arg will expand to the same macro param token in the body: #define ID(X) X #define TWO 2 ID(1 TWO) Here two FileIDs both expand into `X` in ID's expansion: - one containing `1` and spelled on line 3 - one containing `2` and spelled by the macro expansion of TWO isBeforeInTranslationUnit breaks this tie by comparing the two FileIDs: the one "on the left" is always created first and is numerically smaller. This seems correct so far. Prior to this patch it also takes a shortcut (unclear if intentionally). Instead of comparing the two FileIDs that directly expand to the same location, it compares the original FileIDs being compared. These may not be the same if there are multiple macro expansions in between. This almost always yields the right answer, because macro expansion yields "trees" of FileIDs allocated in a contiguous range: when comparing tree A to tree B, it doesn't matter what representative you pick. However, the splitting of >> tokens is modeled as macro expansion (as if the first '>' was a macro that expands to a '>' spelled a scratch buffer). This splitting occurs retroactively when parsing, so the FileID allocated is larger than expected if it were a real macro expansion performed during lexing. As a result, macro tree A can be on the left of tree B, and yet contain a token-split FileID whose numeric value is greator than those in B. In this case the tiebreak gives the wrong answer. Concretely: #define ID(X) X template <typename> class S{}; ID( ID(S<S<int>> x); int y; ) Given Greater = (typeloc of S<int>).getEndLoc(); Y = (decl of y).getLocation(); isBeforeInTranslationUnit(Greater, Y) should return true, but returns false. Here the common FileID of (Greater, Y) is the body of the outer ID expansion, and they both expand to X within it. With the current tiebreak rules, we compare the FileID of Greater (a split) to the FileID of Y (a macro arg expansion into X of the outer ID). The former is larger because the token split occurred relatively late. This patch fixes the issue by removing the shortcut. It tracks the immediate FileIDs used to reach the common file, and uses these IDs to break ties. In the example, we now compare the macro arg expansion of the inner ID() to the macro arg expansion of Y, and find that it is smaller. This requires some changes to the InBeforeInTUCacheEntry (sic). We store a little more data so it's probably slightly slower. It was difficult to resist more invasive changes: - performance: the sizing is very suspicious, and once the cache "fills up" we're thrashing a single entry - API: the class seems to be needlessly complicated However I tried to avoid mixing these with subtle behavior changes, and will send a followup instead. Differential Revision: https://reviews.llvm.org/D134685	2022-10-05 18:29:01 +02:00
Xiang Li	15aa64301a	[HLSL] Support register binding attribute on global variable Allow register binding attribute on variables. Report warning when register binding attribute applies to local variable or static variable. It will be ignored in this case. Type check for register binding is tracked with https://github.com/llvm/llvm-project/issues/57886. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D134617	2022-10-05 09:23:35 -07:00
Ellis Hoag	549773f9e9	[Dwarf] Reference the correct CU when inlining Sometimes when a function is inlined into a different CU, `llvm-dwarfdump --verify` would find an inlined subroutine with an invalid abstract origin. This is because `DwarfUnit::addDIEEntry()` will incorrectly assume the inlined subroutine and the abstract origin are from the same CU if it can't find the CU for the inlined subroutine. In the added test, the inlined subroutine for `bar()` is created before the CU for `B.swift` is created, so it tries to point to `goo()` in the wrong CU. Interestingly, if we swap the order of the two functions then we don't see a crash since the module for `goo()` is created first. The fix is to give a parent DIE to `ScopeDIE` before calling `addDIEEntry()` so that its CU can be found. Luckily, `constructInlinedScopeDIE()` is only called once so we can pass it the DIE of the scope's parent and give it a child just after it's created. `constructInlinedScopeDIE()` should always return a DIE, so assert that it is not null. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D135114	2022-10-05 09:19:12 -07:00
Alexandre Ganea	083617afc2	[mlir][unittest] Fix crash when building with MSVC 2022 The test Dialect/Affine/ops.mlir was failing when building with Visual Studio 2022 version 17.3.5. This was caused by a bad MSVC codegen, when capturing a `constexpr` in a lambda. The bug was reported to Microsoft, see differential for more information. Differential revision: https://reviews.llvm.org/D134227	2022-10-05 12:16:54 -04:00
Alexandre Ganea	941f71adf6	[mlir] Fix ambiguity when building with Clang 14.0.6 Differential revision: https://reviews.llvm.org/D134219	2022-10-05 12:16:54 -04:00
Alexandre Ganea	1c25ce1738	[Orc] Fix the SharedMemoryMapper dtor As briefly discussed on https://reviews.llvm.org/rG1134d3a03facccd75efc5385ba46918bef94fcb6, fix the unintended copy while iterating on Reservations and add a mutex guard, to be symmetric with other usages of Reservations. Differential revision: https://reviews.llvm.org/D134212	2022-10-05 12:16:54 -04:00

1 2 3 4 5 ...

437982 Commits All Branches Search

437982 Commits

All Branches