llvm-project

Commit Graph

Author	SHA1	Message	Date
Gabor Marton	12887a2024	[Analyzer][Core] Better simplification in SimpleSValBuilder::evalBinOpNN Make the SValBuilder capable of simplifying existing SVals based on newly added constraints when evaluating a BinOp. Before this patch, we called `simplify` only in some edge cases. However, we can and should investigate the constraints in all cases. Differential Revision: https://reviews.llvm.org/D113753	2021-11-23 16:38:01 +01:00
Yaxun (Sam) Liu	e13246a2ec	[HIP] Add HIP scope atomic operations Add an AtomicScopeModel for HIP and support for OpenCL builtins that are missing in HIP. Patch by: Michael Liao Revised by: Anshil Ghandi Reviewed by: Yaxun Liu Differential Revision: https://reviews.llvm.org/D113925	2021-11-23 10:13:37 -05:00
Jinsong Ji	b0784d1d14	[PowerPC] Remove FreeBSD test in mm-malloc.c due to cross-compilation limitation Fix failures on powerpc BE buildbots https://lab.llvm.org/buildbot/#/builders/93/builds/6031 https://lab.llvm.org/buildbot/#/builders/100/builds/10836 https://lab.llvm.org/buildbot/#/builders/52/builds/12719	2021-11-23 15:09:10 +00:00
Sanjay Patel	430ad9697d	[InstCombine] enhance bitwise select matching I noticed that adding a seemingly unrelated fold for xor caused regressions on similar patterns, and this is one of the underlying causes. This could also be a variation for code as seen in: https://llvm.org/PR34047 ...although that exact example should be fixed after: D113035 / `c36b7e21bd` The vector test shows that we are actually missing a potential canonicalization for bitcast-of-sext-of-not or the inverse. The scalar test shows that even if we had that canonicalization, it would still be possible to see this pattern due to extra uses. https://alive2.llvm.org/ce/z/y2BAgi	2021-11-23 09:57:44 -05:00
Sanjay Patel	e6cd157407	[InstCombine] add tests for logical select; NFC	2021-11-23 09:57:44 -05:00
Louis Dionne	13fa4fcfe7	[libc++] Tidy up how %T and %t are created during configuration checks Instead of having ad-hoc cleanup in various places, handle all creation and removal of temporary files and directories inside _makeConfigTest. As a fly-by, also remove testPrefix since we don't keep any source file around anymore. Setting a prefix for the files is hence not useful anymore. Differential Revision: https://reviews.llvm.org/D114390	2021-11-23 09:51:22 -05:00
David Green	871418c5b0	[ARM] Expand rev.ll test with more triples. NFC Useful in showing Thumb2 and Thumb1 rev instructions as well as the arm already tested, as well as testing the more canonical llvm.bswap.i16 form.	2021-11-23 14:24:58 +00:00
Zahira Ammarguellat	fd759d42c9	Revert "The _Float16 type is supported on x86 systems with SSE2 enabled." This reverts commit `6623c02d70`. The change seems to be breaking build of compiler-rt on Debian.	2021-11-23 08:00:57 -05:00
Nicolas Vasilache	3ff4e5f2a4	[mlir][Vector] Thread 0-d vectors through InsertElementOp. This revision makes concrete use of 0-d vectors to extend the semantics of InsertElementOp. Reviewed By: dcaballe, pifon2a Differential Revision: https://reviews.llvm.org/D114388	2021-11-23 12:55:11 +00:00
Nicolas Vasilache	e7026aba00	[mlir][Vector] Thread 0-d vectors through ExtractElementOp. This revision starts making concrete use of 0-d vectors to extend the semantics of ExtractElementOp. In the process a new VectorOfAnyRank Tablegen OpBase.td is added to allow progressive transition to supporting 0-d vectors by gradually opting in. Differential Revision: https://reviews.llvm.org/D114387	2021-11-23 12:39:44 +00:00
Matthias Springer	f24d9313cc	[mlir][linalg][bufferize][NFC] Specify bufferize traversal in `bufferize` The interface method `bufferize` controls how (and in what order) nested ops are traversed. This simplifies bufferization of scf::ForOps and scf::IfOps, which used to need special rules in scf::YieldOp. Differential Revision: https://reviews.llvm.org/D114057	2021-11-23 21:33:19 +09:00
Diana Picus	cdc476ab2f	[fir] Set !fir.len_param_index conversion to unimplemented This patch is part of the upstreaming effort from fir-dev. The conversion of len_param_index in fir-dev is incomplete, so for now we're marking this as unimplemented until we can settle on a design for the runtime support of LEN parameters. Differential Revision: https://reviews.llvm.org/D114241 Co-authored-by: Eric Schweitz <eschweitz@nvidia.com>	2021-11-23 12:14:28 +00:00
Tonko Sabolčec	f66b69a392	[lldb] Fix lookup for global constants in namespaces LLDB uses mangled name to construct a fully qualified name for global variables. Sometimes the DW_AT_linkage_name attribute is missing from debug info, so LLDB has to rely on parent entries to construct the fully qualified name. Currently, the fallback is handled when the parent DW_TAG is either DW_TAG_compile_unit or DW_TAG_partial_unit, which may not work well for global constants in namespaces. For example: namespace ns { const int x = 10; } may produce the following debug info: <1><2a>: Abbrev Number: 2 (DW_TAG_namespace) <2b> DW_AT_name : (indirect string, offset: 0x5e): ns <2><2f>: Abbrev Number: 3 (DW_TAG_variable) <30> DW_AT_name : (indirect string, offset: 0x61): x <34> DW_AT_type : <0x3c> <38> DW_AT_decl_file : 1 <39> DW_AT_decl_line : 2 <3a> DW_AT_const_value : 10 Since the fallback didn't handle the case when the parent tag is DW_TAG_namespace, LLDB wasn't able to match the variable by its fully qualified name "ns::x". This change fixes this by additionally checking whether the parent is a DW_TAG_namespace. Reviewed By: werat, clayborg Differential Revision: https://reviews.llvm.org/D112147	2021-11-23 12:53:03 +01:00
Jay Foad	5ee625bf6b	[AMDGPU] Fix the name of a test case	2021-11-23 11:33:21 +00:00
Dmitry Vyukov	ebd47b0fb7	tsan: new runtime (v3) This change switches tsan to the new runtime which features: - 2x smaller shadow memory (2x of app memory) - faster fully vectorized race detection - small fixed-size vector clocks (512b) - fast vectorized vector clock operations - unlimited number of alive threads/goroutines Differential Revision: https://reviews.llvm.org/D112603	2021-11-23 11:44:59 +01:00
mydeveloperday	1cb3cfd932	[clang-format] [NFC] build clang-format with -Wall When building clang-format with -Wall on Visual Studio 2019 we see the following warning. Prevent it, as it is the only -Wall error: ``` ..FormatTokenLexer.cpp(45) : warning C4868: compiler may not enforce left-to-right evaluation order in braced initializer list ``` Reviewed By: HazardyKnusperkeks Differential Revision: https://reviews.llvm.org/D113844	2021-11-23 10:43:27 +00:00
mydeveloperday	e7cb3283c8	[clang-format] [PR52527] can join * with /* to form an outside of comment error C4138 https://bugs.llvm.org/show_bug.cgi?id=52527 The following patch ensures there is always a space between * and /* to prevent transforming ``` void foo(* /* comment */)(int bar); ``` into ``` void foo(*/* comment */)(int bar); ``` Differential Revision: https://reviews.llvm.org/D114142	2021-11-23 10:36:06 +00:00
Evgeniy Brevnov	47e2644c89	[DSE][NFC] Introduce "doesn't overwrite" return code for isOverwrite Add OR_None code to indicate that there is no overwrite. This has no effect on current uses but will be used in one of the next patches building support for PHI translation. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D105098	2021-11-23 17:11:15 +07:00
Florian Hahn	a5fff58781	[ThreadPool] Do not return shared futures. The only user of returned futures from ThreadPool is llvm-reduce after D113857. There should be no cases where multiple threads wait on the same future, so there should be no need to return std::shared_future<>. Instead return plain std::future<>. If users need to share a future between multiple threads, they can share the futures themselves. Reviewed By: Meinersbur, mehdi_amini Differential Revision: https://reviews.llvm.org/D114363	2021-11-23 10:06:08 +00:00
Alexander Belyaev	c7cc70c8f8	Revert "Revert "[mlir] Move AllocationOpInterface to Bufferize/IR/AllocationOpInterface.td."" This reverts and fixes commit `de18b7dee6`.	2021-11-23 10:49:26 +01:00
David Green	32b6c17b29	[SDAG] Use UnknownSize for masked load/store MMO size A masked load or store will load a potentially unknown number of bytes from a memory location - that is not generally known at compile time. They do not necessarily load/store the entire vector width, and treating them as such can lead to incorrect aliasing information (for example, if the underlying object is smaller than the size of the vector). This makes sure that the MMO is given an unknown size to represent this, which is less accurate than "may load/store from up to 16 bytes", but less incorrect than "will load/store from 16 bytes". Differential Revision: https://reviews.llvm.org/D113888	2021-11-23 09:47:56 +00:00
Qiu Chaofan	59f4b3d308	[PowerPC] Implement more fusion types for Power10 This implements the rest of Power10 instruction fusion pairs, according to user manual, including 'wide immediate', 'load compare', 'zero move' and 'SHA3 assist'. Only 'SHA3 assist' is enabled by default. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D112912	2021-11-23 17:21:17 +08:00
David Green	8ea3e70fb0	[X86] Regenerate X86/vmaskmov-offset.ll check lines as per new mir format. NFC	2021-11-23 08:41:47 +00:00
David Green	dc79d73605	[ARM] Add a test showing the incorrect aliasing info around masked loads/stores. NFC	2021-11-23 08:41:47 +00:00
Martin Storsjö	d703b92296	[LLD] [COFF] Omit section symbols and IMAGE_SYM_CLASS_LABEL from the PE symbol table The section symbols aren't of much practical use when looking at a linked image. This shrinks one observed mingw style unstripped binary by 14%. IMAGE_SYM_CLASS_LABEL is in spirit the same as a temporary assembler label that isn't emitted on the object file level at all. Differential Revision: https://reviews.llvm.org/D113866	2021-11-23 10:17:04 +02:00
Martin Storsjö	4e5488afb2	[AArch64] [COFF] Move jump tables back to the readonly section This essentially reverts `f5884d255e` (D57277). That commit was made as a workaround since LLVM back then didn't support cross-section relative relocations (IMAGE_REL_ARM64_REL32) in COFF for ARM64. Support for this was implemented later, in `d5c5cf5ce8` (D99572) and `382c505d9c` (D102217). The commit that moved jump tables to the function section noted that it would be ideal to utilize IMAGE_REL_ARM64_REL32. Differential Revision: https://reviews.llvm.org/D113576	2021-11-23 10:13:48 +02:00
Martin Storsjö	7c15da6761	[LLD] [COFF] Interpret the immediate in ARM64 adr/adrp relocations as signed 21 bit This matches how MS link.exe interprets this relocation. Differential Revision: https://reviews.llvm.org/D114347	2021-11-23 10:13:01 +02:00
Martin Storsjö	06d0d449d8	[COFF] [ARM64] Create symbols with regular intervals for relocations against temporary symbols For relocations against temporary symbols (that don't persist in the object file), we normally adjust them to reference the start of the section. For adrp relocations, the immediate offset from the referenced symbol is stored in the opcode as the 21 bit signed immediate; this means that the symbol referenced must be within +/- 1 MB from the referenced symbol. Create label symbols with regular intervals (1 MB intervals). For relocations against temporary symbols, pick the preceding added offset symbol and make the relocation against that instead of against the start of the section. This should fix the root issue behind https://bugs.llvm.org/show_bug.cgi?id=52378. Differential Revision: https://reviews.llvm.org/D114340	2021-11-23 10:12:41 +02:00
Nicolas Vasilache	b2729fda60	[mlir][Vector] Add a vblendps-based impl for transpose8x8 (both intrin and inline_asm) This revision follows up on the conversation titled: ```[llvm-dev] Understanding and controlling some of the AVX shuffle emission paths``` The revision adds a vblendps-based implementation for transpose8x8 and further distinguishes between an intrinsics and an inline_asm implementation. This results in roughly 20% fewer cycles as reported by llvm-mca: After this revision (intrinsic version, resolves to virtually identical assembly as per the llvm-dev discussion, no vblendps instruction is emitted): ``` Iterations: 100 Instructions: 5900 Total Cycles: 2415 Total uOps: 7300 Dispatch Width: 6 uOps Per Cycle: 3.02 IPC: 2.44 Block RThroughput: 24.0 Cycles with backend pressure increase [ 89.90% ] Throughput Bottlenecks: Resource Pressure [ 89.65% ] - SKXPort1 [ 0.04% ] - SKXPort2 [ 12.42% ] - SKXPort3 [ 12.42% ] - SKXPort5 [ 89.52% ] Data Dependencies: [ 37.06% ] - Register Dependencies [ 37.06% ] - Memory Dependencies [ 0.00% ] ``` After this revision (inline_asm version, vblendps instructions are indeed emitted): ``` Iterations: 100 Instructions: 6300 Total Cycles: 2015 Total uOps: 7700 Dispatch Width: 6 uOps Per Cycle: 3.82 IPC: 3.13 Block RThroughput: 20.0 Cycles with backend pressure increase [ 83.47% ] Throughput Bottlenecks: Resource Pressure [ 83.18% ] - SKXPort0 [ 14.49% ] - SKXPort1 [ 14.54% ] - SKXPort2 [ 19.70% ] - SKXPort3 [ 19.70% ] - SKXPort5 [ 83.03% ] - SKXPort6 [ 14.49% ] Data Dependencies: [ 39.75% ] - Register Dependencies [ 39.75% ] - Memory Dependencies [ 0.00% ] ``` An accessible copy of the conversation is available [here](https://gist.github.com/nicolasvasilache/68c7f34012584b0e00f335bcb374ede0). Differential Revision: https://reviews.llvm.org/D114393	2021-11-23 07:31:22 +00:00
Sandeep Dasgupta	e5a8c8c883	[mlir] Refactoring a few Parser APIs Refactored two new parser APIs parseGenericOperationAfterOperands and parseCustomOperationName out of parseGenericOperation and parseCustomOperation. Motivation: Sometimes an op can be printed in a special way if certain criteria are met. While parsing, we need to handle all the versions. `parseGenericOperationAfterOperands` is handy in situations where we already parsed the operands and decide to fall back to default parsing. `parseCustomOperationName` is useful when we need to know details (dialect, operation name etc.) about a parsed token meant to be an mlir operation. Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D113719	2021-11-23 06:11:01 +00:00
Kazu Hirata	d5b73a70a0	[llvm] Use range-based for loops (NFC)	2021-11-22 20:33:28 -08:00
Matthias Springer	fb99686bfd	[mlir][linalg][bufferize] Limited support for scf.execute_region Add support for analysis only. Differential Revision: https://reviews.llvm.org/D114055	2021-11-23 12:20:39 +09:00
Matthias Springer	26c0dd83ab	[mlir][linalg][bufferize][NFC] Move helper function to op interface This is in preparation of changing the op traversal during bufferization. Differential Revision: https://reviews.llvm.org/D114040	2021-11-23 11:59:47 +09:00
Matthias Springer	8d0994ed21	[mlir][linalg][bufferize][NFC] Remove special casing of CallOps Differential Revision: https://reviews.llvm.org/D113966	2021-11-23 11:14:10 +09:00
Matthias Springer	b1083830d6	[mlir][linalg][bufferize][NFC] Clean up headers and function visibility Differential Revision: https://reviews.llvm.org/D113964	2021-11-23 10:29:26 +09:00
Walter Erquinigo	a2c76312ed	Attempt to fix `e3dea5cf0e` https://lab.llvm.org/buildbot/#/builders/17/builds/13728 found an issue in the optional formatter.	2021-11-22 16:33:40 -08:00
Peter Klausler	bb0d8e4bd9	[flang] Correct the argument keyword for AIMAG(Z=...) It was X= in the intrinsics table. Differential Revision: https://reviews.llvm.org/D114296	2021-11-22 16:13:21 -08:00
Walter Erquinigo	e3dea5cf0e	[formatters] Add a formatter for libstdc++ optional Besides adding the formatter and the summary, this makes the libcxx tests also work for this case. This is the polished version of https://reviews.llvm.org/D114266, authored by Danil Stefaniuc. Differential Revision: https://reviews.llvm.org/D114403	2021-11-22 15:36:46 -08:00
Huihui Zhang	9cd7c534e2	[InstCombine] Enable fold select into operand for FAdd, FMul, FSub and FDiv. For FAdd, FMul, FSub and FDiv, fold select into one of the operands to enable further optimizations, i.e., floating-point reduction detection. Turn code: %C = fadd %A, %B %D = select %cond, %C, %A into: %C = select %cond, %B, -0.000000e+00 %D = fadd %A, %C Alive2 verification (with --disable-undef-input), timed out otherwise. FAdd - https://alive2.llvm.org/ce/z/eUxN4Y FMul - https://alive2.llvm.org/ce/z/5SWZz4 FSub - https://alive2.llvm.org/ce/z/Dhj8dU FDiv - https://alive2.llvm.org/ce/z/Yj_NA2 Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D113442	2021-11-22 15:10:10 -08:00
Peter Klausler	d02b318af6	[flang] Remove typo that affected complex namelist input A recent patch to real/complex formatted input included what must have been an editing hiccup: "++ ++p" instead of "++p". This compiles, and it broke the consumption of the trailing ')' of a complex value in namelist input by skipping over the character. Extend existing test to cover this case. Differential Revision: https://reviews.llvm.org/D114297	2021-11-22 15:06:46 -08:00
Shoaib Meenai	2f5d6a0ea5	[MachO] Fix struct size assertion std::vector can have different sizes depending on the STL's debug level, so account for its size separately. (You could argue that we should be accounting for all the other members separately as well, but that would be very unergonomic, and std::vector is the only one that's caused problems so far.)	2021-11-22 15:02:30 -08:00
Jon Chesterfield	ae5348a38e	[openmp][amdgpu] Make plugin robust to presence of explicit implicit arguments OpenMP (compiler) does not currently request any implicit kernel arguments. OpenMP (runtime) allocates and initialises a reasonable guess at the implicit kernel arguments anyway. This change makes the plugin check the number of explicit arguments, instead of all arguments, and puts the pointer to hostcall buffer in both the current location and at the offset expected when implicit arguments are added to the metadata by D113538. This is intended to keep things running while fixing the oversight in the compiler (in D113538). Once that patch lands, and a following one marks openmp kernels that use printf such that the backend emits an args element with the right type (instead of hidden_node), the over-allocation can be removed and the hardcoded 8*e+3 offset replaced with one read from the .offset of the corresponding metadata element. Reviewed By: estewart08 Differential Revision: https://reviews.llvm.org/D114274	2021-11-22 23:00:20 +00:00
Fangrui Song	7aafe467d2	[ELF] Simplify a condition with config->copyRelocs. NFC	2021-11-22 13:59:23 -08:00
Benjamin Kramer	966b720983	[mlir][memref] Fix expanded shape ops memref.cast folding with changed type `memref.expand_shape` has verification logic to make sure result dim must be static if all the collapsing src dims are static. This can be relaxed once expand_shape supports more dynamism. Differential Revision: https://reviews.llvm.org/D114391	2021-11-22 22:56:15 +01:00
Jan Beich	2dec2aa3ad	[Driver] Default to libc++ on FreeBSD All supported FreeBSD releases use libc++, so default to it if the target's major version is not specified. Reviewed by: dim, emaste Differential Revision: https://reviews.llvm.org/D77776	2021-11-22 16:47:03 -05:00
Christian Ulmann	f6718fc6d3	[mlir] FlatAffineConstraint parsing for unit tests This patch adds functionality to parse FlatAffineConstraints from a StringRef with the intention to be used for unit tests. This should make the construction of FlatAffineConstraints easier for testing purposes. The patch contains an example usage of the functionality in a unit test that uses FlatAffineConstraints. Reviewed By: bondhugula, grosser Differential Revision: https://reviews.llvm.org/D113275	2021-11-23 03:04:30 +05:30
Snehasish Kumar	a4b92d6158	[memprof] Remove the "Live on exit:" print for text format. We dropped the printing of live on exit blocks in rG1243cef245f6 - the commit changed the insertOrMerge logic. Remove the message since it is no longer needed (all live blocks are inserted into the hashmap) before serializing/printing the profile. Furthermore, the original intent was to capture evicted blocks so it wasn't entirely correct. Also update the binary format test invocation to remove the redundant print_text directive now that it is the default. Differential Revision: https://reviews.llvm.org/D114285	2021-11-22 13:30:48 -08:00
Groverkss	98daa4e425	[MLIR] Fix incorrect removal of source loop in loop fusion This patch fixes a bug in the loop fusion pass where the source loop was removed even when the fused loop did not cover all iterations of the source loop. This was because the fast heuristic check for whether the source loop and the fused loop have the same iterations did not take loop steps into account. Reviewed By: dcaballe, bondhugula Differential Revision: https://reviews.llvm.org/D114164	2021-11-23 02:54:09 +05:30
Bill Wendling	2975f37d8d	[llvm-diff] Implement diff of PHI nodes Implement diff of PHI nodes Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D114211	2021-11-22 13:23:10 -08:00
Florian Hahn	6149e57dc1	[ThreadPool] Support returning futures with results. This patch adjusts ThreadPool::async to return futures that wrap the result type of the passed in callable. To do so, ThreadPool::asyncImpl first creates a shared promise. The result of the promise is set in a new callable that first executes the task. The callable is added to the task queue. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D114183	2021-11-22 21:20:55 +00:00
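
The mechanism described in 6149e57dc1 (create a promise, wrap the task in a new callable that sets the promise's result, enqueue that callable, hand the future back to the caller) can be sketched roughly as follows. This is only an illustration in Python, not LLVM's C++ code: `TinyPool`, `submit` and `run_pending` are hypothetical names, `concurrent.futures.Future` stands in for the `std::promise`/`std::future` pair, and forwarding exceptions is an assumption of this sketch that the commit message does not mention.

```python
import queue
from concurrent.futures import Future


class TinyPool:
    """Hypothetical pool: queued tasks run when run_pending() is called."""

    def __init__(self):
        self._tasks = queue.Queue()

    def submit(self, fn, *args):
        # The Future plays the role of the shared promise: the wrapper
        # below fulfils it, and the caller only ever reads from it.
        future = Future()

        def task():
            # New callable that first executes the task, then stores the result.
            try:
                future.set_result(fn(*args))
            except Exception as exc:  # assumption: forward failures to the caller
                future.set_exception(exc)

        self._tasks.put(task)  # the wrapper, not fn itself, goes on the queue
        return future

    def run_pending(self):
        # Stand-in for worker threads: drain the queue on the calling thread.
        while not self._tasks.empty():
            self._tasks.get()()


if __name__ == "__main__":
    pool = TinyPool()
    answer = pool.submit(lambda a, b: a * b, 6, 7)
    pool.run_pending()
    print(answer.result())
```

Because the caller gets a future typed by the callable's result, it can read the value directly instead of passing it back through captured state.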

405451 Commits
