llvm-project

Commit Graph

Author	SHA1	Message	Date
Thomas Raoux	9f6ba4be26	[mlir][vector] Extend transfer_write to read propagation Folding of transfer_write into transfer_read is already supported but this requires the read and write to have the same permuation map. After linalg vectorization it is common to have different ppermuation map for write followed by read even though the cases could be propagated. This canonicalization handle cases where the permuation maps are different but the data read and written match and replace the transfer ops with broadcast and permuation Differential Revision: https://reviews.llvm.org/D130135	2022-07-22 17:11:06 +00:00
Alex Brachet	3878973bd4	[llvm-driver] Fix build after `07b749800` The llvm-driver build is not enabled on any bots so this wasn't caught earlier.	2022-07-22 17:05:58 +00:00
Dylan Fleming	846439dd97	[Flang] Generate documentation for compiler flags This patch aims to create a webpage to document Flang's command line options on https://flang.llvm.org/docs/ in a similar way to Clang's https://clang.llvm.org/docs/ClangCommandLineReference.html This is done by using clang_tablegen to generate an .rst file from Options.td (which is current shared with Clang) For this to work, ClangOptionDocEmitter.cpp was updated to allow specific Flang flags to be included, rather than bulk excluding clang flags. Note: Some headings in the generated documentation will incorrectly contain references to Clang, e.g. "Flags controlling the behaviour of Clang during compilation" This is because Options.td (Which is shared between both Clang and Flang) contains hard-coded DocBrief sections. I couldn't find a non-intrusive way to make this target-dependant, as such I've left this as is, and it will need revisiting later. Reviewed By: awarzynski Differential Revision: https://reviews.llvm.org/D129864	2022-07-22 17:05:04 +00:00
Alex Brachet	5e2d5071ff	[libc] Don't call user comparator function for equal pointers The standard says two equal pointers must compare equal so there is no need to call the user comparator function in this case. Differential Revision: https://reviews.llvm.org/D130310	2022-07-22 17:03:16 +00:00
zhijian	066afe03c5	[NFC] Fixed build fail of https://lab.llvm.org/buildbot/#/builders/207/builds/8752 which caused by https://reviews.llvm.org/D127864	2022-07-22 13:02:09 -04:00
Konstantin Varlamov	14cf74d65d	[libc++][ranges] Implement `ranges::shuffle`. Differential Revision: https://reviews.llvm.org/D130321	2022-07-22 09:59:13 -07:00
tlattner	44f81dfba4	Remove references to old mailing lists that have moved to discourse. Replace with links to discourse. Reviewed By: #libc_abi, ldionne Differential Revision: https://reviews.llvm.org/D129675	2022-07-22 09:59:03 -07:00
Jacques Pienaar	13448db06a	[mlir][tosa] Flip accessors used to prefixed form (NFC) Follow up from dialect flip, just flipping accessors. Both forms still generated.	2022-07-22 09:56:08 -07:00
Stefan Pintilie	475a39fbc3	[PowerPC][NFC] Convert the MMA test cases to use opaque pointers. This patch modifies only test cases. Converted the MMA test cases to use opaque pointers. Reviewed By: lei, amyk Differential Revision: https://reviews.llvm.org/D130090	2022-07-22 11:43:40 -05:00
Simon Pilgrim	939cf9b1be	[AArch64] Use neon instructions for i64/i128 ISD::PARITY calculation As noticed on D129765 and reported on Issue #56531 - aarch64 targets can use the neon ctpop + add-reduce instructions to speed up scalar ctpop instructions, but we fail to do this for parity calculations. I'm not sure where the cutoff should be for specific CPUs, but i64 (+ i128 special case) shows a definite reduction in instruction count. i32 is about the same (but scalar <-> neon transfers are probably more costly?), and sub-i32 promotion looks to be a definite regression compared to parity expansion optimized for those widths. Differential Revision: https://reviews.llvm.org/D130246	2022-07-22 17:24:17 +01:00
Simon Pilgrim	8f0ba6c405	[X86] Add X64 test coverage to smul-with-overflow.ll	2022-07-22 17:24:16 +01:00
Mircea Trofin	7b81a81d5f	[NFC] FunctionSamples::getEntrySamples -> getHeadSamplesEstimate The name `getEntrySamples` was misleading for 2 reasons. One, it's close in name to `Function::getEntryCount`, but the equivalent here is `getHeadSamples`; second, as opposed to the other get* APIs in `FunctionSamples`, it performs an estimate/heuristic rather than just retrieving raw data (or a non-heuristic derivate off that data, like `getMaxCountInside`) The new name should more clearly communicate its intent; and, being close (in name) to `getHeadSamples`, it should allow the reader discover the relation between them. Also updated the doc comments for both `getHeadSamples[Estimate]` so a reader may better understand the relation between them. Differential Revision: https://reviews.llvm.org/D130281	2022-07-22 09:17:59 -07:00
Slava Zakharin	f5759add70	[flang] Try to lower math intrinsics to math operations first. This commit changes how math intrinsics are lowered: we, first, try to lower them into MLIR operations or libm calls via mathOperations table and only then fallback to pgmath runtime calls. The pgmath fallback is needed, because mathOperations does not support all intrinsics that pgmath supports. The main purpose of this change is to get rid of llvmIntrinsics table so that we do not have to update both llvmIntrinsics and mathOperations when adding new intrinsic support. mathOperations lowering should phase out pgmath lowering, when more operations are available (e.g. power operations being added in D129809 and D129811; complex type operations from Complex dialect). Differential Revision: https://reviews.llvm.org/D130129	2022-07-22 09:04:44 -07:00
Slava Zakharin	fa3c770438	[flang] Reduced CHECKs for transpose_opt.f90 This commit addresses concerns raised in D129497. Differential Revision: https://reviews.llvm.org/D130300	2022-07-22 08:50:42 -07:00
Shilei Tian	77cb30e3a6	Revert "[OpenMP][DeviceRTL] Fix the issue that multiple calls to `omp_get_wtime` is optimized out by mistake" This reverts commit `ad34f1dba8`.	2022-07-22 11:45:13 -04:00
Benjamin Kramer	5a445395e4	[LV] Remove unused variable. NFC.	2022-07-22 17:43:58 +02:00
Shilei Tian	ad34f1dba8	[OpenMP][DeviceRTL] Fix the issue that multiple calls to `omp_get_wtime` is optimized out by mistake Multiple calls to `omp_get_wtime` could be optimized out due to the function is mistakenly marked as `readnone`. This patch fixes the issue, and also add the support to run optimization on `libomptarget` tests. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D130179	2022-07-22 11:43:30 -04:00
Craig Topper	be208b40c1	[DAGCombiner] Simplify code around call to reduceLoadWidth in visitAND. NFC We were looking for loads or any_extend+load. reduceLoadWidth hasn't known how to look through such an any_extend to find the load since D40667 almost 5 years ago. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D130333	2022-07-22 08:36:56 -07:00
Philip Reames	d7bf81fd51	[LV] Rework widening cost of uniform memory ops for clarity [nfc] Reorganize the code to make it clear what is and isn't handle, and why. Restructure bailout to remove (false and confusing) dependence on CM_Scalarize; just return invalid cost and propagate, that's what it is for.	2022-07-22 08:35:45 -07:00
Jeff Niu	edfc4bb9b9	[mlir][ods] Remove warning in `AttrOrTypeDef` This warning was added because using attribute or type assembly formats with `skipDefaultBuilders` set could cause compilation errors, since the required builder prototype may not necessarily be generated and would need to be checked by hand. This patch removes the warning because a warning that the generated C++ "might" not compile is not particularly useful. Attempting to address the TODO (i.e. detect whether a builder of the correct prototype is provided) would be fragile since it would not be possible to account for implicit conversions, etc. In general, ODS should not be emitting warnings in cases like these. Reviewed By: rriddle, wrengr Differential Revision: https://reviews.llvm.org/D130102	2022-07-22 08:29:23 -07:00
VitalyR	effe79993f	[CUDA] remove duplicate condition Reviewed by: Yaxun Liu Differential Revision: https://reviews.llvm.org/D130168 Change-Id: Ia00c3dfa9ea20e61235817fd4bb61d33c7c98a60	2022-07-22 11:27:19 -04:00
Sam Estep	aed1ab8cab	[clang][dataflow] Refactor ApplyBuiltinTransfer field out into DataflowAnalysisOptions struct Depends On D130304 This patch pulls the `ApplyBuiltinTransfer` from the `TypeErasedDataflowAnalysis` class into a new `DataflowAnalysisOptions` struct, to allow us to add additional options later without breaking existing code. Reviewed By: gribozavr2, sgatev Differential Revision: https://reviews.llvm.org/D130305	2022-07-22 15:16:29 +00:00
Maksim Panchenko	661577b5f4	[BOLT] Add support for the latest perf tool The latest perf tool can return non-empty buffer when executing buildid-list command, even when perf.data was recorded with -B flag. Some binaries will be listed without the ID, while others may have a recorded ID. Allow invalid entires on the input, while checking the valid ones for the match. Reviewed By: Amir Differential Revision: https://reviews.llvm.org/D130223	2022-07-22 07:56:15 -07:00
Malhar Jajoo	41958f76d8	[Costmodel] Add "type-based-intrinsic-cost" cli option This patch adds a command line flag to be able to test the type based cost-model analysis for Intrinsics. Differential Revision: https://reviews.llvm.org/D129109	2022-07-22 15:50:57 +01:00
Tue Ly	600172a72b	[libc] Temporarily disable arm32's sinf, cosf, sincosf entrypoints. With correctly rounded implementations, these functions will be tested for all rounding modes. Since fegetround and fesetround are not implemented for arm32, these tests will fail in all non-default rounding modes. We will re-enable these entrypoints and tests once fegetround and fesetround are implemented for arm32.	2022-07-22 10:46:23 -04:00
Shubham Narlawar	f55dbfbd9d	[AArch64] Move SeparateConstOffsetFromGEPPass before LSR and enable EnableGEPOpt by default. GEP's across basic blocks were not getting splitted due to EnableGEPOpt which was turned off by default. Hence, EarlyCSE missed the opportunity to eliminate common part of GEP's. This can be achieved by simply turning GEP pass on. - This patch moves SeparateConstOffsetFromGEPPass() just before LSR. - It enables EnableGEPOpt by default. Resolves - https://github.com/llvm/llvm-project/issues/50528 Added an unit test. Differential Revision: https://reviews.llvm.org/D128582	2022-07-22 15:20:53 +01:00
Jacques Pienaar	1b7feac2a6	[mlir][tosa] Split canonicalization and folders out of TosaOps. Scope ops file to ops. Used canonicalization as grouping for canonicalization patterns and folders (also considered OpTransforms but that felt too generic and the former two are used together). Reviewed By: silvas, rsuderman Differential Revision: https://reviews.llvm.org/D130297	2022-07-22 07:20:25 -07:00
Sam Estep	32dcb759c3	[clang][dataflow] Move NoopAnalysis from unittests to include This patch moves `Analysis/FlowSensitive/NoopAnalysis.h` from `clang/unittests/` to `clang/include/clang/`, so that we can use it for doing context-sensitive analysis. Reviewed By: ymandel, gribozavr2, sgatev Differential Revision: https://reviews.llvm.org/D130304	2022-07-22 14:11:32 +00:00
Nikita Popov	c2be703c6c	[AsmPrinter] Move lowerConstant() error code out of switch (NFC) Move this out of the switch, so that different branches can indicate an error by breaking out of the switch. This becomes important if there are more than the two current error cases.	2022-07-22 16:08:28 +02:00
Tue Ly	d883a4ad02	[libc] Implement sinf function that is correctly rounded to all rounding modes. Implement sinf function that is correctly rounded to all rounding modes. - We use a simple range reduction for `pi/16 < \|x\|` : Let `k = round(x / pi)` and `y = (x/pi) - k`. So `k` is an integer and `-0.5 <= y <= 0.5`. Then ``` sin(x) = sin(ypi + kpi) = (-1)^(k & 1) * sin(ypi) ~ (-1)^(k & 1) y * P(y^2) ``` where `yP(y^2)` is a degree-15 minimax polynomial generated by Sollya with: ``` > P = fpminimax(sin(xpi)/x, [\|0, 2, 4, 6, 8, 10, 12, 14\|], [\|D...\|], [0, 0.5]); ``` - Performance benchmark using perf tool from CORE-MATH project (https://gitlab.inria.fr/core-math/core-math/-/tree/master) on Ryzen 1700: Before this patch (not correctly rounded): ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh sinf CORE-MATH reciprocal throughput : 17.892 System LIBC reciprocal throughput : 25.559 LIBC reciprocal throughput : 29.381 ``` After this patch (correctly rounded): ``` $ CORE_MATH_PERF_MODE="rdtsc" ./perf.sh sinf CORE-MATH reciprocal throughput : 17.896 System LIBC reciprocal throughput : 25.740 LIBC reciprocal throughput : 27.872 LIBC reciprocal throughput : 20.012 (with `-msse4.2` flag) LIBC reciprocal throughput : 14.244 (with `-mfma` flag) ``` Reviewed By: zimmermann6 Differential Revision: https://reviews.llvm.org/D123154	2022-07-22 10:07:31 -04:00
zhijian	4f2cfbe531	[llvm-ar] Add object mode option -X for AIX Summary: 1. Added a new option object mode -X for llvm-ar. In AIX OS , there is a object mode option -X for ar command. please see the "-X mode" part of https://www.ibm.com/docs/ko/aix/7.1?topic=ar-command Specifies the type of object file ar should examine. The mode must be one of the following: 32 Processes only 32-bit object files 64 Processes only 64-bit object files 32_64 Processes both 32-bit and 64-bit object files any Processes all of the supported object files. The default is to process 32-bit object files (ignore 64-bit objects). The mode can also be set with the OBJECT_MODE environment variable. For example, OBJECT_MODE=64 causes ar to process any 64-bit objects and ignore 32-bit objects. The -X flag overrides the OBJECT_MODE variable. 2. Before adding the new option -X, the default behaviors of llvm-ar like -Xany, but after the adding the new option -X, the default behaviors of llvm-ar change to -X32 ,in order to let some test cases which has 32bit and 64bit object file in the same llvm-ar command, we need to add the "export OBJECT_MODE=any" into test case to change the default behaviors of llvm-ar's object mode. Reviewers: James Henderson, Owen Reynolds, Fangrui Song Differential Revision: https://reviews.llvm.org/D127864	2022-07-22 09:55:21 -04:00
Joseph Huber	a3804a3145	[Libomptarget] Make the plugins link as LLVM libraries Previously we made `libomptarget` link as an LLVM library so we have access to the LLVM core libraries. After the initial patch stuck we can now apply the same changes to the plugins. This will allow us to use LLVM in all of `libomptarget` when we have uses for them. In the future this should allow us to remove the dependencies on `libelf`, `libffi`, and `dl`. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D130262	2022-07-22 09:34:12 -04:00
Egor Zhdan	1d0cc51051	[Clang][Driver] Fix include paths for `--sysroot /` on OpenBSD/FreeBSD This is the same change as https://reviews.llvm.org/D126289, but applied for OpenBSD & FreeBSD. Differential Revision: https://reviews.llvm.org/D129654	2022-07-22 14:30:32 +01:00
Tue Ly	ed261e7106	[libc] Add float type and flag for nearest_integer to enable SSE4.2. Add float type and flag for nearest integer to automatically test with and without SSE4.2 flag. Reviewed By: sivachandra Differential Revision: https://reviews.llvm.org/D129916	2022-07-22 09:29:41 -04:00
Kiran Chandramohan	06dbcf7b2b	[MLIR][OpenMP] Add a constraint to the Threadprivate Op Add a constraint to ensure that the operand and result of the threadprivate operation are the same. Reviewed By: peixin Differential Revision: https://reviews.llvm.org/D128609	2022-07-22 13:12:24 +00:00
Kiran Chandramohan	4ee9f3d59e	[MLIR,OpenMP] : Add Conversion pattern for Critical Op The Conversion pattern enables conversion of Critical Op with block arguments. Fixes https://github.com/llvm/llvm-project/issues/56629 Reviewed By: shraiysh Differential Revision: https://reviews.llvm.org/D130343	2022-07-22 12:57:48 +00:00
Nikita Popov	5ab077f911	[LangRef] Update opaque pointers status (NFC) Opaque pointers support is complete and default. Specify ptr as the normal pointer type and i8* as something supported under non-default options. A larger update of examples in LangRef is still needed.	2022-07-22 14:47:31 +02:00
Kadir Cetinkaya	4839929bed	[clangd] Make forwarding parameter detection logic resilient This could crash when our heuristic picks the wrong function. Make sure there is enough parameters in the candidate to prevent those crashes. Also special case copy/move constructors to make the heuristic work in presence of those. Fixes https://github.com/llvm/llvm-project/issues/56620 Differential Revision: https://reviews.llvm.org/D130260	2022-07-22 14:37:13 +02:00
Louis Dionne	deb3b5552f	[libc++] Take advantage of -fexperimental-library in libc++ When -fexperimental-library is passed, libc++ will now pick up the appropriate __has_feature flag defined by Clang to enable the experimental library features. As a fly-by, also update the documentation for the various TSes. Differential Revision: https://reviews.llvm.org/D130176	2022-07-22 08:33:39 -04:00
Louis Dionne	07e984bc52	[libc++] Support int8_t and uint8_t in integer distributions as an extension In D125283, we ensured that integer distributions would not compile when used with arbitrary unsupported types. This effectively enforced what the Standard mentions here: http://eel.is/c++draft/rand#req.genl-1.5. However, this also had the effect of breaking some users that were using integer distributions with unsupported types like int8_t. Since we already support using __int128_t in those distributions, it is reasonable to also support smaller types like int8_t and its unsigned variant. This commit implements that, adds tests and documents the extension. Note that we voluntarily don't add support for instantiating these distributions with bool and char, since those are not integer types. However, it is trivial to replace uses of these random distributions on char using int8_t. It is also interesting to note that in the process of adding tests for smaller types, I discovered that our distributions sometimes don't provide as faithful a distribution when instantiated with smaller types, so I had to relax a couple of tests. In particular, we do a really bad job at implementing the negative binomial, geometric and poisson distributions for small types. I think this all boils down to the algorithm we use in std::poisson_distribution, however I am running out of time to investigate that and changing the algorithm would be an ABI break (which might be reasonable). As part of this patch, I also added a mitigation for a very likely integer overflow bug we were hitting in our tests in negative_binomial_distribution. I also filed http://llvm.org/PR56656 to track fixing the problematic distributions with int8_t and uint8_t. Supersedes D125283. Differential Revision: https://reviews.llvm.org/D126823	2022-07-22 08:33:01 -04:00
Joseph Huber	908054df4f	[Libomptarget] Only export needed definitions in the BC library This patch adds the use of the `-internalize-public-api-file` option in the internalization pass to internalize any definition that isn't explicitly needed for the interface. This will allow us to perform more optimizations on the file that normally would not have been possible with functions internal to the library not being internal. Depends on D130293 Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D130298	2022-07-22 08:24:35 -04:00
Joseph Huber	3d0ab8638b	[Internalize] Support glob patterns for API lists The internalize pass supports an option to provide a list of symbols that should not be internalized. THis is useful retaining certain defintions that should be kept alive. However, this interface is somewhat difficult to use as it requires knowing every single symbol's name and specifying it. Many APIs provide common prefixes for the symbols exported by the library, so it would make sense to be able to match these using a simple glob pattern. This patch changes the handling from a simple string comparison to a glob pattern match. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D130319	2022-07-22 08:24:32 -04:00
Joseph Huber	e82e07d74a	[Libomptarget] Build the DeviceRTL BC using clang directly Currently the bitcode library is build using the clang front-end manually. This was originally done because we did not support device only compilation. Now we support device only compilation, at least for a single offloading toolchain, so we can instead use clang directly rather than using the front-end. This saves us needing to define things like `aux_triple`. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D130293	2022-07-22 08:24:29 -04:00
Nikita Popov	5102084787	[Docs] Add release notes for opaque pointers (NFC)	2022-07-22 14:14:03 +02:00
Ron Lieberman	45a379ce2f	Revert "[Libomptarget] Stop testing CPU offloading with LTO" This reverts commit `3e8d46921f`.	2022-07-22 12:10:06 +00:00
Matthias Springer	0eb0dfb20b	[mlir][linalg] Add tile-and-fuse with transform dialect example Differential Revision: https://reviews.llvm.org/D130346	2022-07-22 13:55:18 +02:00
Matthias Springer	32c6e0815a	[mlir][linalg] Add attribute matcher to structured.match transform op This is useful for building small test cases and will be utilized in a subsequent commit that adds a fusion example. Differential Revision: https://reviews.llvm.org/D130344	2022-07-22 13:55:12 +02:00
Matthias Springer	bc882ed21f	[mlir][linalg][transform] Add fuse_into_containing op This op fuses a given payload op into a given container op. Inside the container, all uses of the producer are replaced (fused) with the newly inserted op. If the producer is tileable and accessed via a tensor.extract_slice, the new op computes only the requested slice ("tile and fuse"). Otherwise, the entire tensor value is computed inside the container ("clone and fuse"). Differential Revision: https://reviews.llvm.org/D130244	2022-07-22 13:55:04 +02:00
Zhouyi Zhou	934d603826	[clang-tidy][NFC] Add preposition "of" to code annotation of ElseAfterReturnCheck Reviewed By: njames93 Differential Revision: https://reviews.llvm.org/D129953	2022-07-22 12:40:08 +01:00
Jay Foad	798fa7e9d6	[AMDGPU] Add a test where regClassPriorityTrumpsGlobalness uses more vgprs	2022-07-22 12:08:47 +01:00

1 2 3 4 5 ...

430772 Commits All Branches Search

430772 Commits

All Branches