llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	f72d350bfb	[ValueTracking] Update computeKnownBitsFromShiftOperator callbacks to take KnownBits shift amount. NFCI. We were creating this internally, but will need to support general KnownBits amounts as part of D90479.	2020-11-12 16:56:55 +00:00
Fangrui Song	40a42f9f3f	[ELF] Make SORT_INIT_PRIORITY support .ctors.N Input sections `.ctors/.ctors.N` may go to either the output section `.init_array` or the output section `.ctors`: * output `.ctors`: currently we sort them by name. This patch changes to sort by priority from high to low. If N in `.ctors.N` is in the form of %05u, there is no semantic difference. Actually GCC and Clang do use %05u. (In the test `ctors_dtors_priority.s` and Gold's test `gold/testsuite/script_test_14.s`, we can see %03u, but they are not really produced by compilers.) * output `.init_array`: users can provide an input section description `SORT_BY_INIT_PRIORITY(.init_array.* .ctors.*)` to mix `.init_array.*` and `.ctors.*`. This can make .init_array.N and .ctors.(65535-N) interchangeable. With this change, users can mix `.ctors.N` and `.init_array.N` in `.init_array` (PR44698 and PR48096) with linker scripts. As an example: ``` SECTIONS { .init_array : { *(SORT_BY_INIT_PRIORITY(.init_array.* .ctors.*)) *(.init_array EXCLUDE_FILE (*crtbegin.o *crtbegin?.o *crtend.o *crtend?.o ) .ctors) } } INSERT AFTER .fini_array; SECTIONS { .fini_array : { *(SORT_BY_INIT_PRIORITY(.fini_array.* .dtors.*)) *(.fini_array EXCLUDE_FILE (*crtbegin.o *crtbegin?.o *crtend.o *crtend?.o ) .dtors) } } INSERT BEFORE .init_array; ``` Reviewed By: psmith Differential Revision: https://reviews.llvm.org/D91187	2020-11-12 08:56:12 -08:00
Fangrui Song	73d01a80ce	[ELF] Sort by input order within an input section description According to https://sourceware.org/binutils/docs/ld/Input-Section-Basics.html#Input-Section-Basics for `*(.a .b)`, the order should match the input order: * for `ld 1.o 2.o`, sections from 1.o precede sections from 2.o * within a file, `.a` and `.b` appear in the section header table order This patch implements the behavior. The interaction with `SORT` and --sort-section is: Matched sections are ordered by radix sort with the keys being `(SORT, --sort-section, input order)`, where `SORT` (if present) is most significant. > Note, multiple `SORT` within an input section description has undocumented and > confusing behaviors in GNU ld: > https://sourceware.org/pipermail/binutils/2020-November/114083.html > Therefore multiple `SORT` is not the focus for this patch but > this patch still strives to have an explainable behavior. As an example, we partition `SORT(a.*) b.* c.* SORT(d.*)`, into `SORT(a.*) | b.* c.* | SORT(d.*)` and perform sorting within groups. Sections matched by patterns between two `SORT` are sorted by input order. If --sort-alignment is given, they are sorted by --sort-alignment, breaking tie by input order. This patch also allows a section to be matched by multiple patterns; previously, duplicated sections could occupy more space in the output and had erroneous zero bytes. The patch is in preparation for support for `*(SORT_BY_INIT_PRIORITY(.init_array.* .ctors.*)) *(.init_array .ctors)`, which will allow LLD to mix .ctors/.init_array like GNU ld (gold's --ctors-in-init-array) PR44698 and PR48096 Reviewed By: grimar, psmith Differential Revision: https://reviews.llvm.org/D91127	2020-11-12 08:53:11 -08:00
Fangrui Song	2a9aed0e8b	[ELF] Support multiple SORT in an input section description The second `SORT` in `*(SORT(...) SORT(...))` is incorrectly parsed as a file pattern. Fix the bug by stopping at `SORT` in `readInputSectionsList`. Reviewed By: grimar Differential Revision: https://reviews.llvm.org/D91180	2020-11-12 08:46:53 -08:00
Baptiste Saleil	170e45ae18	[PowerPC] Prevent the use of MMA with P9 and earlier We want to allow using MMA on P10 CPU only. This patch prevents the use of MMA with the -mmma option on P9 CPUs and earlier. Differential Revision: https://reviews.llvm.org/D91200	2020-11-12 10:36:50 -06:00
Raphael Isemann	d4b08ccb87	[lldb] Replace TestAbortExitCode with a debugserver specific test When I added TestAbortExitCode I actually planned this to be a generic test for the exit code functionality on POSIX systems. However due to all the different test setups we can have I don't think this worked out. Right now the test had to be made so permissive that it pretty much can't fail. Just to summarize, we would need to support the following situations: 1. ToT debugserver (on macOS) 2. lldb-server (on other platforms) 3. Any old debugserver version when using the system debugserver (on macOS) This patch is removing TestAbortExitCode and adds a ToT debugserver specific test that checks the patch that motivated the whole exit code testing. There is already an exit-code test for lldb-server from what I can see and 3) is pretty much untestable as we don't know anything about the system debugserver. Reviewed By: kastiglione Differential Revision: https://reviews.llvm.org/D89305	2020-11-12 17:33:21 +01:00
Zbigniew Sarbinowski	173b51169b	[SystemZ][ZOS] Porting the time functions within libc++ to z/OS This patch is one part of many steps required to build libc++ and libc++abi libraries on z/OS. This particular patch deals with time related functions and consists of the following 3 parts. 1) Initialization of ::timeval within the libc++ library needs to be adjusted to work on z/OS. The following is the z/OS definition from time.h, which includes an additional aggregate member. typedef signed int suseconds_t; struct timeval { time_t tv_sec; char tv_usec_pad[4]; suseconds_t tv_usec; }; In contrast, the following is the definition from time.h on Linux. typedef long int __suseconds_t; struct timeval { __time_t tv_sec; __suseconds_t tv_usec; }; 2) In addition, retrieving ::timespec within the libc++ library needs to be adjusted to compensate for the difference in some of the members of ::stat depending on the target host. Here are the 2 members in conflict on z/OS extracted from stat.h. struct stat { ... time_t st_atime; time_t st_mtime; ... }; In contrast, here is the Linux equivalent from stat.h. struct stat { ... struct timespec st_atim; struct timespec st_mtim; ... }; 3) On Linux both members are of type timespec whereas on z/OS an object of type timespec needs to be constructed first before retrieving it within the libc++ library. The libc++ header file __threading_support calls nanosleep, which is not available on z/OS. The equivalent functionality will be implemented by using both sleep() and usleep(). Reviewed By: ldionne, #libc Differential Revision: https://reviews.llvm.org/D87940	2020-11-12 11:29:13 -05:00
Simon Pilgrim	8996742741	[KnownBits] Add KnownBits::makeConstant helper. NFCI. Helper for cases where we need to create a KnownBits from a (fully known) constant value.	2020-11-12 16:16:04 +00:00
Anh Tuyen Tran	a20b3620bb	Revert "Introduce -dot-cfg-mssa option which creates dot-cfg style file with mssa comments included in source" This reverts commit `45d459e752` due to build issue in Poly.	2020-11-12 15:48:14 +00:00
Craig Topper	0add5f9122	[RISCV] Don't include CodeGen layer files in MC layer -Use MCRegister instead of Register in MC layer. -Move some enums from RISCVInstrInfo.h to RISCVBaseInfo.h to be with other TSFlags bits. Differential Revision: https://reviews.llvm.org/D91114	2020-11-12 07:45:38 -08:00
Jamie Schmeiser	45d459e752	Introduce -dot-cfg-mssa option which creates dot-cfg style file with mssa comments included in source Summary: Expand the print-memoryssa and print<memoryssa> passes with a new hidden option -cfg-dot-mssa that names a file. When set, a dot-cfg style file will be generated into the named file with the memoryssa comments retained and those blocks containing them shown in light pink. The option does nothing in isolation. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: asbirlea (Alina Sbirlea), dblaikie (David Blaikie) Differential Revision: https://reviews.llvm.org/D90638	2020-11-12 15:41:16 +00:00
Craig Topper	9ca02d6fe1	[RISCV] Add an ANDI to shift amount of FSL/FSR instructions The fshl and fshr intrinsics are defined to modulo their shift amount by the bitwidth of one of their inputs. The FSR/FSL instructions read one extra bit from the shift amount. If that bit is set the inputs are swapped. In order to preserve the semantics of the llvm intrinsics we need to make sure that the extra bit isn't set. DAG combine or instcombine may have removed any mask that was originally present. We could be smarter here and try to use computeKnownBits to check if the bit is known zero, but wanted to start with correctness. Differential Revision: https://reviews.llvm.org/D90905	2020-11-12 07:33:40 -08:00
Simon Pilgrim	11c106544b	[ValueTracking] Update computeKnownBitsFromShiftOperator callbacks to use KnownBits shift handling. NFCI.	2020-11-12 15:31:26 +00:00
Jamie Schmeiser	782d6a6963	Introduce -print-before-changed, making -print-changed also print before passes that modify IR Summary: Add an option -print-before-changed that modifies the print-changed behaviour so that it prints the IR before a pass that changed it in addition to printing the IR after the pass. Note that the option does nothing in isolation. The filtering options work as expected. Lit tests are included. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: aeubanks (Arthur Eubanks) Differential Revision: https://reviews.llvm.org/D88757	2020-11-12 15:20:50 +00:00
Raphael Isemann	d85cc03c9c	[lldb] Add expect_var_path to test variable path results This adds `expect_var_path` to test variable paths so we no longer have to use `frame var` and find substrs in the command output. The behaviour is identical with `expect_expr` (and it also uses the same checking backend), but it instead calls `GetValueForVariablePath` to evaluate the string as a variable path. Also rewrites a few of the tests that previously used `frame variable` to use `expect_var_path`. Reviewed By: labath Differential Revision: https://reviews.llvm.org/D90450	2020-11-12 16:14:48 +01:00
Zhuojia Shen	0c0eeb78eb	[builtins] Add support for single-precision-only-FPU ARM targets. This patch enables building compiler-rt builtins for ARM targets that only support single-precision floating point instructions (e.g., those with -mfpu=fpv4-sp-d16). This fixes PR42838 Differential Revision: https://reviews.llvm.org/D90698	2020-11-12 15:10:48 +00:00
Jamie Schmeiser	f79b483385	[NFC intended] Refactor SinkAndHoistLICMFlags to allow others to construct without exposing internals Summary: Refactor SinkAndHoistLICMFlags from a struct to a class with accessors and constructors to allow other classes to construct flags with meaningful defaults while not exposing LICM internal details. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: asbirlea (Alina Sbirlea) Differential Revision: https://reviews.llvm.org/D90482	2020-11-12 15:06:59 +00:00
Paul C. Anagnostopoulos	ba906eb16c	[CODE_OWNERS.TXT] Update to include yours truly as the TableGen owner	2020-11-12 09:49:00 -05:00
Jean-Michel Gorius	62ed69b01d	[clang][docs] Remove wrongly spaced \brief in Doxygen comment (NFC)	2020-11-12 15:44:43 +01:00
Raphael Isemann	b4b836563a	[lldb][NFC] Move OptionDefinition from lldb-private-types.h to its own Utility header Also moves the curious isprint8 function (which was used to check whether we have a valid short option) into the struct and documents it.	2020-11-12 15:30:26 +01:00
Sylvain Audi	79105e4644	[clang-scan-deps] Fix for input file given as relative path in compilation database "command" entry. Differential Revision: https://reviews.llvm.org/D91204	2020-11-12 08:48:17 -05:00
David Green	11dee2eae2	[ARM] Ensure CountReg definition dominates InsertPt when creating t2DoLoopStartTP Of course there was something missing, in this case a check that the def of the count register we are adding to a t2DoLoopStartTP would dominate the insertion point. In the future, when we remove some of these COPY's in between, the t2DoLoopStartTP will always become the last instruction in the block, preventing this from happening. In the meantime we need to check they are created in a sensible order. Differential Revision: https://reviews.llvm.org/D91287	2020-11-12 13:47:46 +00:00
Alexandre Ganea	ec63dfe368	[LLD] Fix include following `45b8a741fb`	2020-11-12 08:32:16 -05:00
Alexander Kornienko	a196e8092a	[lld] Use temporary directory to create test outputs	2020-11-12 14:24:05 +01:00
Alexandre Ganea	45b8a741fb	[LLD][COFF] When using LLD-as-a-library, always prevent re-entrance on failures This is a follow-up for D70378 (Cover usage of LLD as a library). While debugging an intermittent failure on a bot, I recalled this scenario which causes the issue: 1. When executing lld/test/ELF/invalid/symtab-sh-info.s L45, we reach lld::elf::ObjFile::ObjFile() which goes straight into its base ELFFileBase(), then ELFFileBase::init(). 2. At that point fatal() is thrown in lld/ELF/InputFiles.cpp L381, leaving a half-initialized ObjFile instance. 3. We then end up in lld::exitLld() and since we are running with LLD_IN_TEST, we happily restore the control flow to CrashRecoveryContext::RunSafely() then back in lld::safeLldMain(). 4. Before this patch, we called errorHandler().reset() just after, and this attempted to reset the associated SpecificAlloc<ObjFile<ELF64LE>>. That tried to free the half-initialized ObjFile instance, and more precisely its ObjFile::dwarf member. Sometimes that worked, sometimes it failed and was caught by the CrashRecoveryContext. This scenario was the reason we called errorHandler().reset() through a CrashRecoveryContext. But in some rare cases, the above repro somehow corrupted the heap, creating a stack overflow. When the CrashRecoveryContext's filter (that is, __except (ExceptionFilter(GetExceptionInformation()))) tried to handle the exception, it crashed again since the stack was exhausted -- and that took the whole application down. That is the issue seen on the bot. Locally it happens about 1 time out of 15. Now this situation can happen anywhere in LLD. Since catching stack overflows is not a reliable scenario ATM when using CrashRecoveryContext, we're now preventing further re-entrance when such failures occur, by signaling lld::SafeReturn::canRunAgain=false. When running with LLD_IN_TEST=2 (or above), only one iteration will be executed, instead of two. Differential Revision: https://reviews.llvm.org/D88348	2020-11-12 08:14:43 -05:00
Michał Górny	f37834c7dc	[lldb] [test] Add a minimal test for x86 dbreg reading Add a test verifying that after the 'watchpoint' command, new values of x86 debug registers can be read back correctly. The primary purpose of this test is to catch broken DRn reading and help debugging it. Differential Revision: https://reviews.llvm.org/D91264	2020-11-12 14:09:03 +01:00
Michał Górny	a8bfee2a35	[lldb] [Process/Utility] Fix DR offsets for FreeBSD Fix Debug Register offsets to be specified relatively to UserArea on FreeBSD/amd64 and FreeBSD/i386, and add them to UserArea on i386. This fixes overlapping GPRs and DRs in gdb-remote protocol, making it impossible to correctly get and set debug registers from the LLDB client. Differential Revision: https://reviews.llvm.org/D91254	2020-11-12 14:09:03 +01:00
Raphael Isemann	1115d1d083	Revert "Generalize regex matching std::string variants to compensate for recent" This reverts commit `856fd98a17`. The type formatters use inline namespaces to find the formatter that fits the type ABI, so they can't just ignore the inline namespaces. The failing tests should be fixed by `da121fff11`.	2020-11-12 14:01:22 +01:00
Raphael Isemann	da121fff11	[lldb] Introduce a LLDB printing policy for Clang type names that suppressed inline namespaces Commit `5f12f4ff90` made suppressing inline namespaces when printing typenames default to true. As we're using the inline namespaces in LLDB to construct internal type names (which need internal namespaces in them to, for example, differentiate libc++'s std::__1::string from the std::string from libstdc++), this broke most of the type formatting logic.	2020-11-12 14:00:33 +01:00
Hans Wennborg	a088766508	[dllexport] Instantiate default ctor default args for explicit specializations (PR45811) For dllexported default constructors with default arguments, we export default constructor closures which pass in the default args. (See D8331 for a good explanation.) For templates, that means those default args must be instantiated even if the function isn't called. That is done by the InstantiateDefaultCtorDefaultArgs() function, but it wasn't done for explicit specializations, causing asserts (see bug). Differential revision: https://reviews.llvm.org/D91089	2020-11-12 13:29:34 +01:00
Jean-Michel Gorius	e47805c995	[mlir] Add plus, star and optional less/greater parsing The tokens are already handled by the lexer. This revision exposes them through the parser interface. This revision also adds missing functions for question mark parsing and completes the list of valid punctuation tokens in the documentation. Differential Revision: https://reviews.llvm.org/D90907	2020-11-12 13:28:31 +01:00
Hans Wennborg	b9d36540a8	[dllexport] Avoid assert for explicitly defaulted methods in explicit instantiation definitions (PR47683) Clang was asserting due to attempting to codegen such methods twice. Differential revision: https://reviews.llvm.org/D90849	2020-11-12 13:19:29 +01:00
Alex Zinenko	f9265de8c6	[mlir] Generate Op builders for Python bindings Add an ODS-backed generator of default builders. This currently does not support operation with attribute arguments, for which the builder is just ignored. Attribute support will be introduced separately for builders and accessors. Default builders are always generated with the same number of result and operand groups as the ODS specification, i.e. one group per each operand or result. Optional elements accept None but cannot be omitted. Variadic groups accept iterable objects and cannot be replaced with a single object. For some operations, it is possible to infer the result type given the traits, but most traits rely on inline pieces of C++ that we cannot (yet) forward to Python bindings. Since the Ops where the inference is possible (having the `SameOperandAndResultTypes` trait or `TypeMatchesWith` without transform field) are a small minority, they also require the result type to make the builder syntax more consistent. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D91190	2020-11-12 11:29:23 +01:00
Kazushi (Jam) Marukawa	a72d384249	[VE] Change the default type of v64 register class Change the default type of v64 register class from v512i32 to v256f64. Add a regression test also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91301	2020-11-12 19:07:07 +09:00
Julian Gross	0313e3bfe6	[MLIR] Added documentation and manual to use bufferization features. Added documentation about the bufferization features. Furthermore, the usage of pre- and post-processing is described. This also includes information about optimization functionalities. Differential Revision: https://reviews.llvm.org/D90675	2020-11-12 10:43:05 +01:00
Kadir Cetinkaya	6484aa1add	[clangd] Simplify relations deserialization loop, NFC.	2020-11-12 10:33:39 +01:00
David Sherwood	3225fcf11e	[SVE] Deal with SVE tuple call arguments correctly when running out of registers When passing SVE types as arguments to function calls we can run out of hardware SVE registers. This is normally fine, since we switch to an indirect mode where we pass a pointer to a SVE stack object in a GPR. However, if we switch over part-way through processing a SVE tuple then part of it will be in registers and the other part will be on the stack. I've fixed this by ensuring that: 1. When we don't have enough registers to allocate the whole block we mark any remaining SVE registers temporarily as allocated. 2. We temporarily remove the InConsecutiveRegs flags from the last tuple part argument and reinvoke the autogenerated calling convention handler. Doing this prevents the code from entering an infinite recursion and, in combination with 1), ensures we switch over to the Indirect mode. 3. After allocating a GPR register for the pointer to the tuple we then deallocate any SVE registers we marked as allocated in 1). We also set the InConsecutiveRegs flags back how they were before. 4. I've changed the AArch64ISelLowering LowerCALL and LowerFormalArguments functions to detect the start of a tuple, which involves allocating a single stack object and doing the correct numbers of legal loads and stores. Differential Revision: https://reviews.llvm.org/D90219	2020-11-12 08:41:50 +00:00
David Green	1551d8dd48	[ARM] Remove unused check labels. NFC	2020-11-12 08:37:46 +00:00
Marek Kurdej	e331dfea70	[libc++] [P0340] [C++20] Update status page. NFC. This was implemented in 410b650e674496e61506fa88f3026759b8759d0f: "Implement P0340R3: Make 'underlying_type' SFINAE-friendly. Reviewed as https://reviews.llvm.org/D63574 llvm-svn: 364094"	2020-11-12 09:32:29 +01:00
MaheshRavishankar	5ca20851e4	[mlir][Linalg] Improve the logic to perform tile and fuse with better dependence tracking. This change does two main things 1) An operation might have multiple dependences to the same producer. Not tracking them correctly can result in incorrect code generation with fusion. To rectify this the dependence tracking needs to also have the operand number in the consumer. 2) Improve the logic used to find the fused loops making it easier to follow. The only constraint for fusion is that linalg ops (on buffers) have update semantics for the result. Fusion should be such that only one iteration of the fused loop (which is also a tiled loop) must touch only one (disjoint) tile of the output. This could be relaxed by allowing for recomputation that is the default when operands are tensors, or can be made legal with promotion of the fused view (in future). Differential Revision: https://reviews.llvm.org/D90579	2020-11-12 00:25:24 -08:00
Amara Emerson	ad376657c1	[AArch64][GlobalISel] Optimize G_PTR_ADD with a negated offset to be a G_SUB.	2020-11-11 22:46:53 -08:00
Max Kazantsev	2734a9ebf4	[NFC][SCEV] Generalize monotonicity check for full and limited iteration space A piece of logic of `isLoopInvariantExitCondDuringFirstIterations` is actually a generalized predicate monotonicity check. This patch moves it into the corresponding method and generalizes it a bit. Differential Revision: https://reviews.llvm.org/D90395 Reviewed By: apilipenko	2020-11-12 12:37:07 +07:00
Chuanqi Xu	cd89c4dbdd	[NFC][coroutines] remove unused argument in SemaCoroutine Test plan: check-llvm, check-clang Reviewers: lxfind, junparser Differential Revision: https://reviews.llvm.org/D91243	2020-11-12 13:22:20 +08:00
Xun Li	94a45a8098	Revert "[Coroutine] Allocas used by StoreInst does not always escape" This reverts commit `8bc7b9278e`, which landed by accident.	2020-11-11 21:09:39 -08:00
Aart Bik	0846659648	[mlir][sparse] export sparse tensor runtime support through header file Exposing the C versions of the methods of the sparse runtime support lib through header files will enable using the same methods in an MLIR program as well as a C++ program, which will simplify future benchmarking comparisons (e.g. comparing MLIR generated code with eigen for Matrix Market sparse matrices). Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D91316	2020-11-11 21:03:39 -08:00
Max Kazantsev	d6dd938589	[IndVars] IV user should not prevent use widening Sometimes an instruction we are trying to widen is used by the IV (which means the instruction is the IV increment). Currently this may prevent its widening. We should ignore such user because it will be dead once the transform is done anyways. Differential Revision: https://reviews.llvm.org/D90920 Reviewed By: fhahn	2020-11-12 12:02:01 +07:00
Xun Li	8bc7b9278e	[Coroutine] Allocas used by StoreInst does not always escape In the existing logic, for a given alloca, as long as its pointer value is stored into another location, it's considered as escaped. This is a bit too conservative. Specifically, in non-optimized build mode, it's common to have patterns of code that first store an alloca somewhere and then load it right away. These uses should be handled without conservatively marking them escaped. This patch tracks how the memory location where an alloca pointer is stored into is being used. As long as we only try to load from that location and nothing else, we can still consider the original alloca not escaping and keep it on the stack instead of putting it on the frame. Differential Revision: https://reviews.llvm.org/D91305	2020-11-11 20:53:51 -08:00
Max Kazantsev	2e01ceafaa	[IndVars] Recognize 'sub nuw' expressed as 'add' for widening InstCombine canonicalizes 'sub nuw' instructions to 'add' without the `nuw` flag. The typical case where we see it is decrementing induction variables. For them, IndVars fails to prove that it's legal to widen them, and inserts unprofitable `zext`'s. This patch adds recognition of such pattern using SCEV. Differential Revision: https://reviews.llvm.org/D89550 Reviewed By: fhahn, skatkov	2020-11-12 10:51:29 +07:00
Max Kazantsev	813781a923	[Test] Add Check statement	2020-11-12 10:47:34 +07:00
Richard Smith	2d4035e493	Fix structural comparison of template template arguments to compare the right union member. Should fix the armv8 buildbot.	2020-11-11 19:15:21 -08:00
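
The ordering rule described in commit 73d01a80ce (keys `(SORT, --sort-section, input order)`, with `SORT` most significant) can be illustrated with a minimal Python sketch. This is not LLD's implementation: the `Section` type, the `order_sections` function and its two boolean parameters are hypothetical stand-ins, and the sketch assumes that `SORT` means ascending by name and that the alignment key puts larger alignments first. It applies stable sorts from the least significant key to the most significant one, which is how a radix-style multi-key ordering is usually built.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Section:
    """Hypothetical model of a matched input section."""
    name: str       # input section name, e.g. ".a.2"
    alignment: int  # section alignment in bytes
    order: int      # position in input order (file order, then header order)


def order_sections(sections, sort_by_name=False, sort_by_alignment=False):
    """Order sections by the keys (SORT, --sort-section, input order).

    Keys are applied from least to most significant. Python's sorted() is
    stable, so each later pass keeps the order set by earlier passes for ties.
    """
    # Least significant key: input order.
    result = sorted(sections, key=lambda s: s.order)
    if sort_by_alignment:
        # Assumption: stand-in for an alignment sort, larger alignment first.
        result = sorted(result, key=lambda s: -s.alignment)
    if sort_by_name:
        # Assumption: stand-in for SORT(...), ascending by section name.
        result = sorted(result, key=lambda s: s.name)
    return result


sections = [
    Section(".a.2", 4, 0),
    Section(".a.1", 16, 1),
    Section(".a.1", 4, 2),
    Section(".a.2", 16, 3),
]

# Input order only, then alignment with input order as tie-break,
# then name as the most significant key.
plain = [s.order for s in order_sections(sections)]
aligned = [s.order for s in order_sections(sections, sort_by_alignment=True)]
named = [s.order for s in order_sections(sections, sort_by_name=True,
                                         sort_by_alignment=True)]
```

With no sort requested the sections keep their input order; each additional key only reorders sections that differ on that key and leaves ties in the order set by the less significant keys.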

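
Commit 9ca02d6fe1 relies on the `fshl`/`fshr` intrinsics taking their shift amount modulo the bit width. The Python sketch below models only that intrinsic semantics for `fshl`, not the RISC-V FSL/FSR instructions themselves; the function names `fshl` and `mask_shift_amount` are hypothetical helpers for illustration. For a power-of-two width, an AND with `bits - 1` gives the same result as the modulo and clears the extra bit that the commit message says the instructions would otherwise read.

```python
def fshl(a: int, b: int, shamt: int, bits: int = 32) -> int:
    """Reference model of a funnel shift left on `bits`-wide values.

    Concatenates a (high half) and b (low half), shifts the double-width
    value left by shamt modulo bits, and returns the high half.
    """
    mask = (1 << bits) - 1
    s = shamt % bits
    wide = ((a & mask) << bits) | (b & mask)
    return ((wide << s) >> bits) & mask


def mask_shift_amount(shamt: int, bits: int = 32) -> int:
    """Clear every shift-amount bit at or above log2(bits).

    Assumes bits is a power of two, so the AND equals shamt % bits.
    """
    return shamt & (bits - 1)


a, b = 0x12345678, 0x9ABCDEF0
# 40 has bit 5 (value 32) set; masking reduces it to 8.
masked = mask_shift_amount(40)
same = fshl(a, b, 40) == fshl(a, b, masked)
```

Masking before the shift keeps the result identical to the intrinsic's definition while guaranteeing that the bit above the valid range is zero.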

372138 Commits
