llvm-project

Commit Graph

Author	SHA1	Message	Date
Florian Hahn	7c3c352d82	[VPlan] Separate ctors for VPWidenIntOrFpInduction. (NFC) VPWidenIntOrFpInductionRecipes can either be constructed with a PHI and an optional cast or a PHI and a trunc instruction. Reflect this in 2 separate constructors. This also simplifies a follow-up change.	2021-12-05 12:15:18 +00:00
Kristina Bessonova	75b622a795	Reland [DwarfDebug] Support emitting function-local declaration for a lexical block This is another attempt to make function-local declarations (like static variables, structs/classes and other) be correctly emitted within a lexical (bracketed) block. Fixes https://bugs.llvm.org/show_bug.cgi?id=19238. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D113741	2021-12-05 13:56:45 +02:00
Kristina Bessonova	0ac75e82ff	Reland [DwarfDebug] Move emission of global vars, types and imports to endModule() This patch proposes to move emission of global variables, types, imported entities, etc from DwarfDebug::beginModule() to DwarfDebug::endModule(). Effectively, this changes nothing but the order of debug entities which will be as follows: * subprograms (including related context, local variables/labels, local imported entities; related types can be created as a part of the emission of local entities of an abstract subprogram); * global variables (including related context and types); * retained types and enums; * non-local-scoped imported entities; * basic types; * other types left (as a part of local variables attributes emission). Note that the order of emitted compile units may also be changed as now we emit units that contain subprograms first and then all other non-empty units. The motivation behind this change is the following: (1) DwarfDebug::beginModule() is run at the very beginning of backend's pipeline, from this time IR can be significantly changed by target-specific passes. If it happens for debug metadata of global entities, those changes will not be reflected in the emitted DWARF. (2) imported subprogram names should refer to an abstract subprogram if it exists, but it isn't known in DwarfDebug::beginModule() (it's possible to make some guesses based on location info, but it's not quite reliable); (3) aforementioned entities if they are scoped within a bracketed block (subject of D113741) couldn't be emitted in DwarfDebug::beginModule() (they need parent emitted first). Another problem is if to try to gather some information about local entities and defer their emission (till subprogram's processing or DwarfDebug::endModule()) all the gathered details might be irrelevant / invalid by the time the entities are being emitted (because of (1)). Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D114705	2021-12-05 13:56:45 +02:00
Phoebe Wang	f37d9b4112	[X86][FP16] Replace vXi16 to vXf16 instead of v8f16 Fixes pr52561 Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D114304	2021-12-05 19:19:11 +08:00
Florian Hahn	203f29b40c	[MemoryLocation] Use getForArgument in getForSource/getForDest. (NFC) getForArgument already knows how to extract a memory location for all memory intrinsics. Use it instead of duplicating the logic.	2021-12-05 11:13:14 +00:00
Lang Hames	21562c03ed	[JITLink][ELF][x86-64] Adjust addends for R_X86_64_PLT32 relocations. R_X86_64_PLT32 explicitly represents the '-4' PC-adjustment in the relocation's addend, but JITLink's x86_64::Branch32PCRel includes the PC-adjustment implicitly. We have been zeroing the addend to account for the difference, but this breaks for branches to non-zero offsets past labels. This patch updates the relocation parsing code to unconditionally adjust the offset by '+4' instead. For branches directly to labels the result is still 0, for branches to offsets past labels the result is the correct addend for x86_64::Branch32PCRel.	2021-12-05 20:37:55 +11:00
David Green	57ff805a6d	[DAG] Create fptoui.sat from clamped fptosi As an extension to D111976, this converts an fptosi clamped between 0 and (2^n)-1 to a fptoui.sat. This can greatly help on targets with conversions that naturally saturate, such as Arm. X86 disables the transform as some of the test cases increase in size. A fptoui.sat necessitates a fp clamp without native support, so there is little use in converting if the instruction is just going to be expanded. Differential Revision: https://reviews.llvm.org/D112428	2021-12-05 09:25:52 +00:00
Lang Hames	c22b110612	[JITLink][ELF][x86-64] Use the right edge-naming function for debugging output. Graph edges use the generic x86-64 edge set (the ELF specific edges are only used during parsing).	2021-12-05 17:12:39 +11:00
Lang Hames	01353d81ea	[llvm-jitlink] Allow -entry option to find hidden symbols. This is useful when debugging failures in object files compiled with visibility=hidden.	2021-12-05 16:26:21 +11:00
Michael Liao	53fc971a4b	Fix `-Wunused-variable` warning. NFC.	2021-12-04 23:34:55 -05:00
Nico Weber	92ceba7d13	[gn build] port `f1585a4b47`	2021-12-04 22:29:05 -05:00
Leonard Grey	134275d994	[Support] Use final filename for Caching buffer identifier Mach-O LLD uses the buffer identifier of the memory buffer backing an object file to generate stabs which are used by `dsymutil` to find the object file for dSYM generation. When using thinLTO, these buffers are provided by the cache which initially saves them to disk as temporary files beginning with "Thin-" but renames them to persistent files beginning with "llvmcache-" before the buffer is provided to the cache user. However, the buffer is created before the file is renamed and is given the temp file's name as an identifier. This causes the generated stabs to point to nonexistent files. This change names the buffer with the eventual persistent filename. I think this is safe because failing to rename the temp file is a fatal error. Differential Revision: https://reviews.llvm.org/D115055	2021-12-04 22:25:49 -05:00
Kazu Hirata	ee4b462693	[lldb] Fix a warning This patch fixes: lldb/source/Plugins/Platform/Windows/PlatformWindows.cpp:386:13: error: comparison between NULL and non-pointer ('lldb::addr_t' (aka 'unsigned long') and NULL) [-Werror,-Wnull-arithmetic]	2021-12-04 18:34:29 -08:00
Peter Klausler	06ca9f24e7	[flang] OPEN(RECL=) handling for sequential formatted I/O RECL= is required for direct access I/O, but is permitted as well for sequential I/O, where it is defined by the standard to specify a maximum record (line) length. The standard does not say what should happen when a sequential formatted input record appears whose length is unequal to RECL= when it is specified. Precedents from other compilers are unclear: one raises an error, some honor RECL= as an effective truncation, and a few ignore the situation. On output, all other compilers tested raised an error when an attempt is made to emit a record longer than RECL=. This patch treats RECL= as effective truncation on input and as a hard limit with error on output, and also ensures that RECL= can be set longer than the actual input record lengths. Differential Revision: https://reviews.llvm.org/D115102	2021-12-04 16:02:48 -08:00
Zhihao Yuan	41a0e850fa	[PowerPC] Drop stdlib paths in freestanding tests When targeting FreeBSD on a Linux host with a copy of system libc++, Clang prepends /usr/include/c++/v1 to the search paths even with -ffreestanding, and fails to compile a program with a single #include <xmmintrin.h>. Drop the path with -nostdlibinc. Differential Revision: https://reviews.llvm.org/D114497	2021-12-04 16:51:13 -06:00
Florian Hahn	a9125792b3	[MemoryLocation] Support missing atomic intrinsics in getForArg. getForArgument is missing support for atomic memory transfer intrinsics. In terms of accessed locations they behave like regular memory transfer intrinsics and we already support them as such in getForSource/getForDest.	2021-12-04 22:18:39 +00:00
Butygin	91072b74f8	[mlir] Add InlinerInterface to bufferization dialect Differential Revision: https://reviews.llvm.org/D115080	2021-12-04 23:45:56 +03:00
Björn Schäpers	6e86789035	[clang-format][NFC] Use member directly Instead of passing it as argument to the member function. Differential Revision: https://reviews.llvm.org/D115072	2021-12-04 21:29:31 +01:00
Björn Schäpers	88fa4bfe1e	[clang-format][NFC] Use range based for for fake l parens Differential Revision: https://reviews.llvm.org/D115071	2021-12-04 21:29:30 +01:00
Björn Schäpers	4041f16bb4	[clang-format][NFC] Early return when nothing to do Do not compute SkipFirstExtraIndent just to see that there are no fake l parens. Differential Revision: https://reviews.llvm.org/D115070	2021-12-04 21:29:30 +01:00
Björn Schäpers	8d1c85454d	[clang-format][NFC] Move static variable in scope Let only the JS/TS users pay for the initialisation. Differential Revision: https://reviews.llvm.org/D115068	2021-12-04 21:29:30 +01:00
Björn Schäpers	c25536e4fe	[clang-format][NFC] Use range based for That's much easier to read. Differential Revision: https://reviews.llvm.org/D115067	2021-12-04 21:29:30 +01:00
Björn Schäpers	4483e9b527	[clang-format][NFC] Reorder conditions Prefer to check the local variables first before dereferencing the pointer. Differential Revision: https://reviews.llvm.org/D115066	2021-12-04 21:29:29 +01:00
Björn Schäpers	5878ac7d2d	[clang-format][NFC] Merge two calls of isOneOf Differential Revision: https://reviews.llvm.org/D115065	2021-12-04 21:29:29 +01:00
Björn Schäpers	e7fdeda2c9	[clang-format][NFC] Rename variable so no shadowing happens In the loop there is also a Node. Differential Revision: https://reviews.llvm.org/D115063	2021-12-04 21:29:29 +01:00
Björn Schäpers	25f637913f	[clang-format][NFC] Prefer pass by reference Differential Revision: https://reviews.llvm.org/D115061	2021-12-04 21:29:29 +01:00
Mehrnoosh Heidarpour	e94134052f	[InstSimplify] Add logic 'or' fold to -1 Adding the following folding opportunity: (~A | B) | (A ^ B) --> -1 https://alive2.llvm.org/ce/z/PMtdYB Differential revision: https://reviews.llvm.org/D114996	2021-12-04 15:04:18 -05:00
Peter Klausler	e337dc8bfe	[flang] Fix folding of EXPONENT() intrinsic function The definition of the EXPONENT() intrinsic function differs by one from the real arithmetic folding templates concept of an unbiased exponent, and also needs special handling for zero. Fix, and add more tests. Differential Revision: https://reviews.llvm.org/D115084	2021-12-04 11:23:09 -08:00
Saleem Abdulrasool	f1585a4b47	Windows: support `DoLoadImage` This implements `DoLoadImage` and `UnloadImage` in the Windows platform plugin modelled after the POSIX platform plugin. This was previously unimplemented and resulted in a difficult to decipher error without any logging. This implementation is intended to enable the use of LLDB's Swift REPL on Windows. Paths which are added to the library search path are persistent and applied to all subsequent loads. This can be adjusted in the future by storing all the cookies and restoring the path prior to returning from the helper. However, the dynamic path count makes this a bit more challenging. Reviewed By: @JDevlieghere Differential Revision: https://reviews.llvm.org/D77287	2021-12-04 11:11:47 -08:00
Dimitry Andric	bbba9d8c1b	[XRay] fix more -Wformat warnings Building xray with recent clang on a 64-bit system results in a number of -Wformat warnings: compiler-rt/lib/xray/xray_allocator.h:70:11: warning: format specifies type 'int' but the argument has type '__sanitizer::uptr' (aka 'unsigned long') [-Wformat] RoundedSize, B); ^~~~~~~~~~~ compiler-rt/lib/xray/xray_allocator.h:119:11: warning: format specifies type 'int' but the argument has type '__sanitizer::uptr' (aka 'unsigned long') [-Wformat] RoundedSize, B); ^~~~~~~~~~~ Since `__sanitizer::uptr` has the same size as `size_t`, these can be fixed by using the printf specifier `%zu`. compiler-rt/lib/xray/xray_basic_logging.cpp:348:46: warning: format specifies type 'int' but the argument has type '__sanitizer::tid_t' (aka 'unsigned long long') [-Wformat] Report("Cleaned up log for TID: %d\n", GetTid()); ~~ ^~~~~~~~ %llu compiler-rt/lib/xray/xray_basic_logging.cpp:353:62: warning: format specifies type 'int' but the argument has type '__sanitizer::tid_t' (aka 'unsigned long long') [-Wformat] Report("Skipping buffer for TID: %d; Offset = %llu\n", GetTid(), ~~ ^~~~~~~~ %llu Since `__sanitizer::tid_t` is effectively declared as `unsigned long long`, these can be fixed by using the printf specifier `%llu`. compiler-rt/lib/xray/xray_basic_logging.cpp:354:14: warning: format specifies type 'unsigned long long' but the argument has type 'size_t' (aka 'unsigned long') [-Wformat] TLD.BufferOffset); ^~~~~~~~~~~~~~~~ Since `BufferOffset` is declared as `size_t`, this one can be fixed by using `%zu` as a printf specifier. compiler-rt/lib/xray/xray_interface.cpp:172:50: warning: format specifies type 'int' but the argument has type 'uint64_t' (aka 'unsigned long') [-Wformat] Report("Unsupported sled kind '%d' @%04x\n", Sled.Address, int(Sled.Kind)); ~~ ^~~~~~~~~~~~ %lu Since `xray::SledEntry::Address` is declared as `uint64_t`, this one can be fixed by using `PRIu64`, and adding `<cinttypes>`. compiler-rt/lib/xray/xray_interface.cpp:308:62: warning: format specifies type 'long long' but the argument has type 'size_t' (aka 'unsigned long') [-Wformat] Report("System page size is not a power of two: %lld\n", PageSize); ~~~~ ^~~~~~~~ %zu compiler-rt/lib/xray/xray_interface.cpp:359:64: warning: format specifies type 'long long' but the argument has type 'size_t' (aka 'unsigned long') [-Wformat] Report("Provided page size is not a power of two: %lld\n", PageSize); ~~~~ ^~~~~~~~ %zu Since `PageSize` is declared as `size_t`, these can be fixed by using `%zu` as a printf specifier. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D114469	2021-12-04 20:01:20 +01:00
Nikita Popov	573a9bc4ad	[llvm-c] Avoid deprecated APIs in tests Avoid the use of deprecated (opaque pointer incompatible) APIs in C API tests, in preparation for header deprecation. Add a LLVMGetGEPSourceElementType() to cover a bit of functionality that is necessary for the echo test. This change is split out from https://reviews.llvm.org/D114936.	2021-12-04 18:58:08 +01:00
Kazu Hirata	ca2f53897a	[CodeGen] Use range-based for loops (NFC)	2021-12-04 08:48:05 -08:00
Nikita Popov	8bd62119f9	[NewPM] Test more options in pipeline test (NFC) As suggested on D115098, this tests the positioning of HotColdSplitting, IROutliner and MergeFunctions in the optimization pipeline.	2021-12-04 17:30:30 +01:00
Nikita Popov	ae7f468073	[NewPM] Fix MergeFunctions scheduling MergeFunctions (as well as HotColdSplitting and IROutliner) are incorrectly scheduled under the new pass manager. The code makes it look like they run towards the end of the module optimization pipeline (as they should), while in reality they run at the start. This is because the OptimizePM populated around them is only scheduled later. I'm fixing this by moving these three passes until after OptimizePM to avoid splitting the function pass pipeline. It doesn't seem important to me that some of the function passes run after these late module passes. Differential Revision: https://reviews.llvm.org/D115098	2021-12-04 17:30:30 +01:00
Matt Arsenault	a25111c9e2	Attributor: Fix typo in function name	2021-12-04 11:25:22 -05:00
Matt Arsenault	90f914c870	OpenMP: Un-xfail tests that pass now `729bf9b26b` should have fixed these	2021-12-04 11:25:22 -05:00
Kristina Bessonova	a961604819	Revert "[DwarfDebug] Support emitting function-local declaration for a lexical block" This reverts commits * `ee691970a9` (D113741), * `79d3132998` (D114705) due to lldb and dexter test failures.	2021-12-04 18:06:57 +02:00
Matt Arsenault	729bf9b26b	AMDGPU: Enable fixed function ABI by default Code using indirect calls is broken without this, and there isn't really much value in supporting the old attempt to vary the argument placement based on uses. This resulted in more argument shuffling code anyway. Also have the option stop implying all inputs need to be passed. This will now rely on the amdgpu-no-* attributes to avoid passing unnecessary values.	2021-12-04 10:49:18 -05:00
Florian Hahn	89f0f2771a	[BasicAA] Add atomic mem intrinsic tests.	2021-12-04 15:44:33 +00:00
Matt Arsenault	2959e082e1	AMDGPU: Assume all amdhsa kernarg passed implicit arguments by default Previously we would require adding an attribute to kernels to enable the inputs passed in the kernarg segment, accessed by llvm.amdgcn.implicitarg.ptr. This violates the principle of being correct by default. Some OpenMP testcases were broken recently since it wasn't correctly setting this attribute, and no known frontends are setting this to anything other than the maximum. Most of the test changes are from load widening of argument loads since there are now more implied dereferenceable bytes.	2021-12-04 10:38:25 -05:00
Matt Arsenault	ae0ba7dedd	AMDGPU: Optimize out implicit kernarg argument allocation if unused We already annotate whether llvm.amdgcn.implicitarg.ptr is known to be unused. Start using it to avoid allocating the implicit arguments if unneeded.	2021-12-04 10:38:25 -05:00
Kristina Bessonova	ee691970a9	[DwarfDebug] Support emitting function-local declaration for a lexical block This is another attempt to make function-local declarations (like static variables, structs/classes and other) be correctly emitted within a lexical (bracketed) block. Fixes https://bugs.llvm.org/show_bug.cgi?id=19238. Differential Revision: https://reviews.llvm.org/D113741	2021-12-04 17:12:47 +02:00
Hugo Pompougnac	5d49511b30	Apply the permutation map on each affine nest When using -test-loop-permutation="permutation-map=...", applies the permutation map on each affine nest in the function (and not only the first one). If the size of the permutation map and the size of a nest are not consistent, do nothing on this particular nest (instead of making MLIR crash). Differential Revision: https://reviews.llvm.org/D112947	2021-12-04 17:48:34 +05:30
Kristina Bessonova	79d3132998	[DwarfDebug] Move emission of global vars, types and imports to endModule() This patch proposes to move emission of global variables, types, imported entities, etc from DwarfDebug::beginModule() to DwarfDebug::endModule(). Effectively, this changes nothing but the order of debug entities which will be as follows: * subprograms (including related context, local variables/labels, local imported entities; related types can be created as a part of the emission of local entities of an abstract subprogram); * global variables (including related context and types); * retained types and enums; * non-local-scoped imported entities; * basic types; * other types left (as a part of local variables attributes emission). Note that the order of emitted compile units may also be changed as now we emit units that contain subprograms first and then all other non-empty units. The motivation behind this change is the following: (1) DwarfDebug::beginModule() is run at the very beginning of backend's pipeline, from this time IR can be significantly changed by target-specific passes. If it happens for debug metadata of global entities, those changes will not be reflected in the emitted DWARF. (2) imported subprogram names should refer to an abstract subprogram if it exists, but it isn't known in DwarfDebug::beginModule() (it's possible to make some guesses based on location info, but it's not quite reliable); (3) aforementioned entities if they are scoped within a bracketed block (subject of D113741) couldn't be emitted in DwarfDebug::beginModule() (they need parent emitted first). Another problem is if to try to gather some information about local entities and defer their emission (till subprogram's processing or DwarfDebug::endModule()) all the gathered details might be irrelevant / invalid by the time the entities are being emitted (because of (1)). Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D114705	2021-12-04 14:10:01 +02:00
Dmitry Vyukov	fd26417a74	tsan: disable dlopen_static_tls.cpp test on aarch64 Fails on bots: https://lab.llvm.org/buildbot#builders/184/builds/1580 Differential Revision: https://reviews.llvm.org/D115095	2021-12-04 13:01:47 +01:00
Anton Afanasyev	c34d157fc7	[Passes] Move AggressiveInstCombine after InstCombine Swap the neighbouring AIC and IC passes in the pipeline. This looks more natural and has almost no effect for now (three slightly touched tests in test-suite). This could also be the first step towards merging AIC (or part of it) into the -O2 pipeline. After several changes in AIC (like D108091, D108201, D107766, D109515, D109236), several regressions were observed (like PR52078, PR52253, PR52289) that were fixed in different passes (see D111330, D112721) by extending their functionality, but these regressions were exposed because the changed AIC prevents IC from making some early optimizations. This is a common problem and it should be fixed by just moving AIC after IC, which is more logical by itself: do aggressive instruction combining only after ordinary combining has failed. Fixes PR52289 Reviewed By: spatel, RKSimon Differential Revision: https://reviews.llvm.org/D113179	2021-12-04 14:22:43 +03:00
Jay Foad	2774bad112	[AMDGPU] Change llvm.amdgcn.image.bvh.intersect.ray to take vec3 args The ray_origin, ray_dir and ray_inv_dir arguments should all be vec3 to match how the hardware instruction works. Don't change the API of the corresponding OpenCL builtins. Differential Revision: https://reviews.llvm.org/D115032	2021-12-04 10:32:11 +00:00
Jay Foad	c8e84c7a5f	[IR,TableGen] Add support for vec3 intrinsic arguments Add generic support for vec3 types, and in particular define llvm_v3f32_ty which will be used by AMDGPU's llvm.amdgcn.image.bvh.intersect.ray intrinsic. Differential Revision: https://reviews.llvm.org/D114956	2021-12-04 10:32:11 +00:00
Jay Foad	bc7dacf589	[AMDGPU] Generate checks for llvm.amdgcn.image.bvh.intersect.ray Differential Revision: https://reviews.llvm.org/D114955	2021-12-04 10:32:11 +00:00
Nikita Popov	5b94037a30	[PhaseOrdering] Add test for incorrect merge function scheduling Add an -enable-merge-functions option to allow testing of function merging as it will actually happen in the optimization pipeline. Based on that add a test where we currently produce two identical functions without merging them due to incorrect pass scheduling under the new pass manager.	2021-12-04 10:12:04 +01:00
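
Note on commit e94134052f ([InstSimplify] Add logic 'or' fold to -1): the fold (~A | B) | (A ^ B) --> -1 can be sanity-checked by brute force at a small bit width. The sketch below is only an illustration and is not part of the patch; the function name `or_fold_holds` and the choice of 8-bit values are assumptions made here, and the commit itself links an Alive2 proof instead.

```python
def or_fold_holds(bits: int = 8) -> bool:
    """Exhaustively check (~A | B) | (A ^ B) == -1 for all values of the given width.

    Hypothetical helper for illustration; not LLVM code.
    """
    mask = (1 << bits) - 1  # all-ones value, i.e. -1 at this bit width
    for a in range(mask + 1):
        for b in range(mask + 1):
            # Python integers are unbounded, so mask the result to the width.
            if ((~a | b) | (a ^ b)) & mask != mask:
                return False
    return True


if __name__ == "__main__":
    print(or_fold_holds(8))
```

The identity holds per bit: (~A | B) is 0 only where A is 1 and B is 0, and in exactly that case (A ^ B) is 1, so every bit of the result is set.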

406487 Commits

All Branches