llvm-project

Commit Graph

Author	SHA1	Message	Date
Zain Jaffal	966411790e	[AArch64] Add support to loop vectorization for non temporal loads Currently, AArch64 doesn't support vectorization for non temporal loads because `isLegalNTLoad` is not implemented for the target. This patch applies similar functionality as `D73158` but for non temporal loads Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D131964	2022-10-03 17:06:47 +01:00
Alex Richardson	3890a456d8	[SimplifyLibCalls] Reduce code duplication. NFC Reviewed By: nikic, nickdesaulniers, xbolva00 Differential Revision: https://reviews.llvm.org/D135073	2022-10-03 15:44:00 +00:00
Joseph Huber	11adae5089	[Clang] Make offloading flags accept '-' and '--' Currently all of the flags beginning with `--offload` such as `--offload-arch` or `--offload-device-only` require the double-dash form. However, if a user uses a single dash it will instead name a file 'ffload-arch' for example. This has been the cause of a lot of user confusion. This patch changes these options to also accept a single dash. This is similar to many other driver arguments beginning with `-o`. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D135076	2022-10-03 10:35:51 -05:00
Sam Clegg	664a5c6d03	[WebAssembly] Fix return type of __builtin_return_address under wasm64 Differential Revision: https://reviews.llvm.org/D135005	2022-10-03 08:31:52 -07:00
Jonathan Peyton	f8d081c1a5	[OpenMP][libomp] Allow unused-but-set warnings Only a few remaining which are taken care of by this patch. Differential Revision: https://reviews.llvm.org/D133528	2022-10-03 10:24:33 -05:00
Sanjay Patel	ba7da14d83	Revert "[InstSimplify] reduce code duplication for fmul folds; NFC" This reverts commit `7b7940f9da`. This missed a test update.	2022-10-03 11:21:23 -04:00
Florian Hahn	22fe40068c	[ConstraintElimination] Add tests for chained GEPs with const offsets. Add extra tests for chained GEPs where the second GEP has a constant offset. Inspired by missed optimizations from #51358.	2022-10-03 15:55:36 +01:00
Yevgeny Rouban	e351821088	Fix compilation of CodeLayout.cpp for MacOS llvm/lib/Transforms/Utils/CodeLayout.cpp uses std::abs() with double argument, which is provided by cmath header, which is not explicitly included into CodeLayout.cpp. The implicit include in llvm/include/llvm/Support/MathExtras.h was removed in commit `16544cbe64` Inserting explicit include of cmath into CodeLayout.cpp in order to fix build on MacOS. Committed on behalf of alsemenov (Aleksei Semenov) Reviewed By: thieta Differential Revision: https://reviews.llvm.org/D135072	2022-10-03 21:47:43 +07:00
Sanjay Patel	7b7940f9da	[InstSimplify] reduce code duplication for fmul folds; NFC The constant is already commuted for an fmul opcode, but this code can be called more directly for fma, so we have to swap for that caller. There are tests in InstSimplify and InstCombine to verify that this works as expected.	2022-10-03 10:36:02 -04:00
David Green	4987ae8462	[ARM][AArch64] Dont use macros for half instrinsics in NeonEmitter We don't require arm_neon.h fp16 intrinsics to be treated as macros any more. Differential Revision: https://reviews.llvm.org/D131504	2022-10-03 15:27:23 +01:00
David Green	1441d49bb6	[AArch64] Add pr58109 tests, one showing incorrect lowering of SUBS. NFC Also added global-isel coverage for the same file.	2022-10-03 15:08:46 +01:00
Igor Kirillov	a94a85552c	[LoopVectorize] Add missing test for D133687	2022-10-03 14:54:17 +01:00
Christian Sigg	5faebb5624	[mlir][shape] fix test added in `9f77909`. The stderr to stdout piping results in the two streams being interleaved on Windows. Write stderr to a temp-file instead and run separate FileCheck on it.	2022-10-03 15:48:23 +02:00
Tomasz Kamiński	73716baa30	[analyzer][NFC] Add tests for D132236 D132236 would have introduced regressions in the symbol lifetime handling. However, the testsuite did not catch this, so here we have some tests, which would have break if D132236 had landed. This patch addresses the comment https://reviews.llvm.org/D132236#3753238 Co-authored-by: Balazs Benics <balazs.benics@sonarsource.com> Reviewed By: martong Differential Revision: https://reviews.llvm.org/D134941	2022-10-03 15:42:38 +02:00
Hansang Bae	772fb97c0b	[OpenMP] Ignore schedule modifier in static scheduling The modifier bits in the schedule type is not used/supported in the static scheduler, so it should be ignored. Differential Revision: https://reviews.llvm.org/D134983	2022-10-03 08:29:57 -05:00
chenglin.bi	b0fff3db6a	[ARM64EC][clang-cl] Add /arm64EC flag Reviewed By: DavidSpickett Differential Revision: https://reviews.llvm.org/D134788	2022-10-03 21:20:51 +08:00
Nico Weber	248237364b	[lld/mac] Extract a reportUndefinedSymbol function Makes things look more similar to the ELF port, and removes some slightly deep nesting. No behavior change. Differential Revision: https://reviews.llvm.org/D135032	2022-10-03 09:17:57 -04:00
Alex Zinenko	8291fa0817	[mlir] fix markdown headers in structured ops doc	2022-10-03 13:17:10 +00:00
Markus Böck	36af4c8418	[SelectionDAG] Fix use-after-free introduced in D130881 The code introduced in https://reviews.llvm.org/D130881 has a bug as it may cause a use-after-free error that can be caught by ASAN. The bug essentially boils down to iterator invalidation of `DenseMap`. The expression `SDEI[To] = I->second;` may cause `SDEI` to grow if `To` is inserted for the very first time. When that happens, all existing iterators to the map are invalidated as their backing storage has been freed. Accessing `I->second` is then invalid and attempts to access freed memory (as `I` is an iterator of `SDEI`). This patch fixes that quite simply by first making a copy of `I->second`, and then moving into the possibly newly inserted KV of the ` DenseMap`. No test attached as I am not sure it is practible to test. Differential revision: https://reviews.llvm.org/D135019	2022-10-03 15:09:14 +02:00
Hans Wennborg	20a269cf77	Revert "[CMake] Use libcxx-abi-* targets for in-tree sanitizer C++ ABI" It casued some runtimes builds to fail with cmake error No target "libcxx-abi-static" see code review. > When in-tree libcxx is selected as the sanitizer C++ ABI, use > libcxx-abi-* targets rather than libcxxabi and libunwind directly. > > Differential Revision: https://reviews.llvm.org/D134855 This reverts commit `414f9b7d2f`.	2022-10-03 14:56:07 +02:00
Alex Richardson	0a893cfb44	[libc++] Avoid relying on non-portable behaviour in std::align Round-tripping pointers via size_t is not portable, the C/C++ standards only require this to be valid when using (u)intptr_t. Originally committed to the CHERI fork of LLVM as `dd01245185`, but I forgot to upstream the change. I rediscovered this issue due to a compiler warning when building libc++ on a Arm Morello system. Reviewed By: #libc, ldionne, philnik Differential Revision: https://reviews.llvm.org/D134363	2022-10-03 12:45:21 +00:00
Tobias Gysi	12b68ad886	[mlir][llvmir] Add filename debug info when converting from LLVMIR. The revision enriches the debug locations generated during LLVMIR to MLIR translation with file name information and adds a separate test to exercise the debug location translation. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D135069	2022-10-03 15:14:37 +03:00
Rainer Orth	1774a8a763	[flang] Add Sparc support to Optimizer/CodeGen/Target.cpp As described in Issue #57642, `flang` currently lacks SPARC support in `Optimizer/CodeGen/Target.cpp`, which causes a considerable number of tests to `FAIL` with error: flang/lib/Optimizer/CodeGen/Target.cpp:310: not yet implemented: target not implemented This patch fixes this by following GCC`s documentation of the ABI described in the Issue. Tested on `sparcv9-sun-solaris2.11`. Differential Revision: https://reviews.llvm.org/D133561	2022-10-03 14:03:35 +02:00
Petar Avramovic	1fa2019828	[SelectionDAG] Add check for BUILD_VECTOR in isKnownNeverNaN Includes handling of constants with vector type in isKnownNeverNaN. For AMDGPU results in not making fcanonicalize during legalization for vector inputs to fmaxnum_ieee and fminnum_ieee. Does not affect end result since there is a combine that eliminates fcanonicalize. Differential Revision: https://reviews.llvm.org/D88573	2022-10-03 12:47:07 +02:00
Bjorn Pettersson	66fcdfca4d	[Analysis][SimplifyLibCalls] Refactor code related to size_t in lib func signatures. NFC Added a helper in TargetLibraryInfo to get size of "size_t" in bits, given a Module reference. The new getSizeTSize helper is using the same strategy as for example isValidProtoForLibFunc has been using in the past, assuming that the size can be derived by asking DataLayout about the size/type of a pointer to int. FortifiedLibCallSimplifier::optimizeStrpCpyChk was changed to use the new getSizeTSize helper instead of assuming that sizeof(size_t) is equal to sizeof(int*) by itself (that is the assumption used in TargetLibraryInfoImpl::getSizeTSize so the result will be the same). Having a common helper for this ensure that we use the same strategy when deriving the size of "size_t" in different parts of the code. One bonus with this refactoring (basing it on Module instead of just DataLayout) is that it makes it easier to override this for a specific target triple, in case the assumption of using getPointerSizeInBits wouldn't hold. Differential Revision: https://reviews.llvm.org/D110585	2022-10-03 12:02:50 +02:00
Javier Setoain	8199a43a89	[mlir][Affine] Add pass options to supervectorizer The only current options to create a supervectorization pass from an external dialect is to use `createSuperVectorizePass` with the virtual vector dimensions as a parameter, but the pass accepts other parameters. This patch enables external users to create a supervectorizer pass exposing all available option. Differential Revision: https://reviews.llvm.org/D134632	2022-10-03 10:50:31 +01:00
Jean Perier	eb1eb7a144	[flang] Do not resolve TRIM/REPEAT length to its argument length For TRIM and REPEAT calls, semantics was creating ProcedureDesignators using the length parameter of the arguments. This caused bugs when folding LEN(TRIM(char_explicit_constant_length)). The same did not appeared in folding for REPEAT because it is rewritten at a higher level to LEN(c)*N. This is not only a folding issue since any place (like lowering) may try to use the bad length parameter from the created ProcedureDesignator. Update intrinsic resolution to not copy the length parameter for TRIM and REPEAT. Differential Revision: https://reviews.llvm.org/D134970	2022-10-03 11:42:41 +02:00
Max Kazantsev	a34b5212ec	[Test] Add test showing that Constraint Elimination can deal with case where SCEV fails	2022-10-03 16:41:22 +07:00
Weining Lu	6faac7b822	[DebugInfo][test] XFAIL DebugInfo/Generic/missing-abstract-variable.ll on LoongArch The same as SPARC and RISCV. See D119122. Differential Revision: https://reviews.llvm.org/D134932	2022-10-03 17:11:54 +08:00
Valentin Clement	a89b04805a	[flang][NFC] Use prefixed accessors for fircg dialect The raw accessor is going away soon so switch to prefixed accessors in the fircg dialect. The main dialect was switched some months ago. https://github.com/llvm/llvm-project/issues/58090 Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D135061	2022-10-03 11:02:43 +02:00
Alvin Wong	d90633a74b	[Clang][MinGW][cygwin] Fix __declspec with -fdeclspec enabled Fixes https://github.com/llvm/llvm-project/issues/49958 Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D135027	2022-10-03 10:58:45 +03:00
Alvin Wong	0ce4f57052	[LLD][COFF] Reduce chance of symbol name collision with delay-load Delay-loaded imports creats a load thunk with a symbol name. Before this change, the name uses a `__imp_load_` prefix. On the other hand, normal import uses the `__imp_` prefix for the import address pointer. If an import symbol named `load_func` is imported normally and another named `func` is imported using delay-load, this can cause a symbol name collision. This patch changes delay-load imports to use `__imp___load_` prefix. Because it is less likely for normal imports to have a name starting in `__load_` this should reduce the chance of a name collision. Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D134464	2022-10-03 10:58:45 +03:00
Alvin Wong	e2e132c5d9	[LLD][COFF] Set OrdinalBase to 1 for export table Before this, LLD sets OrdinalBase to 0, which deviates from usual practices. This technically would allow LLD to export a symbol using ordinal 0, however LLD never use export ordinal 0, which results in binaries with export tables always having an empty export at ordinal 0. This change makes LLD set OrdinalBase to 1 and not create the empty export with ordinal 0, which makes its behaviour more in line with both the MSVC linker and the GNU linker. Reviewed By: mstorsjo Differential Revision: https://reviews.llvm.org/D134140	2022-10-03 10:58:44 +03:00
Vitaly Buka	dd9dfb57da	[libc++] Remove a part of reverted D131898 or D130695	2022-10-03 00:51:21 -07:00
Vitaly Buka	724af35849	Revert "[libc++] Updates generated transitve includes." Looks like a part of reverted D131898. This reverts commit `cfd5b8f111`.	2022-10-03 00:42:55 -07:00
Peixin Qiao	c4f04a126a	[flang] Make real type of kind 10 target dependent The real(10) is supported on x86_64. On aarch64, the value of selected_real_kind(16) should be 16 rather than 10 since real(10) is not supported on x86_64. Previously, the real type support check is not target dependent. Support it now through the target triple information. Reviewed By: clementval Differential Revision: https://reviews.llvm.org/D134021	2022-10-03 15:24:39 +08:00
Christian Sigg	2ddbe56b34	[Bazel] fixes for `9f77909`.	2022-10-03 09:12:21 +02:00
Matthias Springer	90dac71a9a	[mlir][bufferize][NFC] Fix FileCheck capture One of the test cases matched IR from a subsequent test case. For this reason, the test case appeared to pass while it is actually broken. This change does not fix the test case itself. It will be fixed when we overhaul the buffer deallocation implementation. (The memory leak in this test case is an edge case.) Differential Revision: https://reviews.llvm.org/D135046	2022-10-03 16:06:10 +09:00
Amara Emerson	3daf7ddaef	[GlobalISel] Allow prelegalizer combiners to have access to LegalizerInfo. Before, the isPreLegalize() query in CombinerHelper only checked for the presence of a LegalizerInfo object. This is problematic when we want to have a combine actually check for legality in a pre-legalizer combine pass, since if we pass a LegalizerInfo object to the constructor it causes the combines to think that we're running post legalizer, which isn't true. This change fixes it to instead check an explicit bool that passes to signal whether the pass will be run before or after legalization. Doing so exposed a bug in the extending loads combine, which tried to check for legality of candidate extending loads if LegalizerInfo was present. Since we only ran it pre-legalizer and therefore with a null LegalizerInfo, it never actually ran. Also fixes the legality checks to keep the tests passing. Differential Revision: https://reviews.llvm.org/D135044	2022-10-03 07:36:18 +01:00
Matthias Springer	598f5275c1	[mlir][interfaces] Add ShapedDimOpInterface This interface is implemented by memref.dim and tensor.dim. This change makes it possible to remove a build dependency of the Affine dialect on the Tensor dialect (and maybe also the MemRef dialect in the future). Differential Revision: https://reviews.llvm.org/D133595	2022-10-03 13:58:52 +09:00
Fangrui Song	9f9bab19e3	[ELF] Replace some config->ekind with file->ekind. NFC	2022-10-02 21:27:41 -07:00
Vitaly Buka	e68c7a9917	Revert "Add APFloat and MLIR type support for fp8 (e5m2)." Breaks bots https://lab.llvm.org/buildbot/#/builders/37/builds/17086 This reverts commit `2dc68b5398`.	2022-10-02 21:22:44 -07:00
Fangrui Song	d9dbf9e30a	[ELF] Move init from ELFFileBase constructor to a separate function. NFC	2022-10-02 21:10:28 -07:00
Yuanqiang Liu	9f77909a5e	[mlir][shape] add outline-shape-computation pass Add outline-shape-computation pass. This pass his pass outlines the shape computation part in high level IR by adding shape.func and populate corresponding mapping information into ShapeMappingAnalysis. Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D131810	2022-10-02 20:24:49 -07:00
Fangrui Song	8bcf22e318	[ELF] Remove redundant getELFKind call. NFC	2022-10-02 20:16:13 -07:00
Fangrui Song	c171250e38	[ELF] Simplify addFile. NFC	2022-10-02 19:49:17 -07:00
Matthias Springer	2d2737667e	[mlir][linalg][NFC] Drop emitAccessorPrefix from Linalg dialect Differential Revision: https://reviews.llvm.org/D135048	2022-10-03 11:35:41 +09:00
LLVM GN Syncbot	2d27b56be5	[gn build] Port `71410fd2c0`	2022-10-03 01:41:14 +00:00
Vitaly Buka	71410fd2c0	Revert "[libc++] Implement P0591R4 (Utility functions to implement uses-allocator construction)" Breaks ubsan tests https://lab.llvm.org/buildbot/#/builders/85/builds/11131 This reverts commit `099384dcea`.	2022-10-02 18:40:43 -07:00
Stella Laurenzo	2dc68b5398	Add APFloat and MLIR type support for fp8 (e5m2). This is a first step towards high level representation for fp8 types that have been built in to hardware with near term roadmaps. Like the BFLOAT16 type, the family of fp8 types are inspired by IEEE-754 binary floating point formats but, due to the size limits, have been tweaked in various ways in order to maximally use the range/precision in various scenarios. The list of variants is small/finite and bounded by real hardware. This patch introduces the E5M2 FP8 format as proposed by Nvidia, ARM, and Intel in the paper: https://arxiv.org/pdf/2209.05433.pdf As the more conformant of the two implemented datatypes, we are plumbing it through LLVM's APFloat type and MLIR's type system first as a template. It will be followed by the range optimized E4M3 FP8 format described in the paper. Since that format deviates further from the IEEE-754 norms, it may require more debate and implementation complexity. Given that we see two parts of the FP8 implementation space represented by these cases, we are recommending naming of: * `F8M<N>` : For FP8 types that can be conceived of as following the same rules as FP16 but with a smaller number of mantissa/exponent bits. Including the number of mantissa bits in the type name is enough to fully specify the type. This naming scheme is used to represent the E5M2 type described in the paper. * `F8M<N>F` : For FP8 types such as E4M3 which only support finite values. The first of these (this patch) seems fairly non-controversial. The second is previewed here to illustrate options for extending to the other known variant (but can be discussed in detail in the patch which implements it). Many conversations about these types focus on the Machine-Learning ecosystem where they are used to represent mixed-datatype computations at a high level. At that level (which is why we also expose them in MLIR), it is important to retain the actual type definition so that when lowering to actual kernels or target specific code, the correct promotions, casts and rescalings can be done as needed. We expect that most LLVM backends will only experience these types as opaque `I8` values that are applicable to some instructions. MLIR does not make it particularly easy to add new floating point types (i.e. the FloatType hierarchy is not open). Given the need to fully model FloatTypes and make them interop with tooling, such types will always be "heavy-weight" and it is not expected that a highly open type system will be particularly helpful. There are also a bounded number of floating point types in use for current and upcoming hardware, and we can just implement them like this (perhaps looking for some cosmetic ways to reduce the number of places that need to change). Creating a more generic mechanism for extending floating point types seems like it wouldn't be worth it and we should just deal with defining them one by one on an as-needed basis when real hardware implements a new scheme. Hopefully, with some additional production use and complete software stacks, hardware makers will converge on a set of such types that is not terribly divergent at the level that the compiler cares about. (I cleaned up some old formatting and sorted some items for this case: If we converge on landing this in some form, I will NFC commit format only changes as a separate commit) Differential Revision: https://reviews.llvm.org/D133823	2022-10-02 17:17:08 -07:00

1 2 3 4 5 ...

437662 Commits All Branches Search

437662 Commits

All Branches