llvm-project

Commit Graph

Author	SHA1	Message	Date
Carl Ritson	2ef9d21e1a	[AMDGPU] SIInsertSkips: Tidy block splitting to use splitAt Convert to use new MachineBasicBlock splitAt function. Place code in splitBlock function for reuse in future changes. Should yield no functional change. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D88537	2020-10-02 11:10:55 +09:00
Jason Molenda	a1e97923a0	Have kernel binary scanner load dSYMs as binary+dSYM if best thing found lldb's PlatforDarwinKernel scans the local filesystem (well known locations, plus user-specified directories) for kernels and kexts when doing kernel debugging, and loads them automatically. Sometimes kernel developers want to debug with only a dSYM, in which case they give lldb the DWARF binary + the dSYM as a binary and symbol file. This patch adds code to lldb to do this automatically if that's the best thing lldb can find. A few other bits of cleanup in PlatformDarwinKernel that I undertook at the same time: 1. Remove the 'platform.plugin.darwin-kernel.search-locally-for-kexts' setting. When I added the local filesystem index at start of kernel debugging, I thought people might object to the cost of the search and want a way to disable it. No one has. 2. Change the behavior of 'plugin.dynamic-loader.darwin-kernel.load-kexts' setting so it does not disable the local filesystem scan, or use of the local filesystem binaries. 3. PlatformDarwinKernel::GetSharedModule into GetSharedModuleKext and GetSharedModuleKernel for easier readability & maintenance. 4. Added accounting of .dSYM.yaa files (an archive format akin to tar) that I come across during the scan. I'm not using these for now; it would be very expensive to expand the archives & see if the UUID matches what I'm searching for. <rdar://problem/69774993> Differential Revision: https://reviews.llvm.org/D88632	2020-10-01 18:55:37 -07:00
Carl Ritson	5136f4748a	CodeGen: Fix livein calculation in MachineBasicBlock splitAt Fix and simplify computation of liveins for new block. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D88535	2020-10-02 10:45:04 +09:00
Esme-Yi	c4690b0077	[PowerPC] Put the CR field in low bits of GRC during copying CRRC to GRC. Summary: How we copying the CRRC to GRC is using a single MFOCRF to copy the contents of CR field n (CR bits 4×n+32:4×n+35) into bits 4×n+32:4×n+35 of register GRC. That’s not correct because we expect the value of destination register equals to source so we have to put the the contents of CR field in the lowest 4 bits. This patch adds a RLWINM after MFOCRF to achieve that. The problem came up when adding builtins for xvtdivdp, xvtdivsp, xvtsqrtdp, xvtsqrtsp, as posted in D88278. We need to move the outputs (in CR register) to GRC. However outputs of these instructions may not in a fixed CR# register, so we can’t directly add a rotation instruction in the .td patterns, but need to wait until the CR register is determined. Then we confirmed this should be a bug in POST-RA PSEUDO PASS. Reviewed By: nemanjai, shchenz Differential Revision: https://reviews.llvm.org/D88274	2020-10-02 01:26:18 +00:00
Joseph Huber	82453e759c	[OpenMP] Add Missing Runtime Call for Globalization Remarks Summary: Add a missing runtime call to perform data globalization checks. Reviewers: jdoerfert Subscribers: guansong hiraditya llvm-commits sstefan1 yaxunl Tags: #LLVM #OpenMP Differential Revision: https://reviews.llvm.org/D88621	2020-10-01 21:19:53 -04:00
Valentin Clement	c1dcb573a8	[flang][openacc] Update loop construct lowering Update the loop construct lowering to support multiple occurences of the same clauses such as private. Add some utility functions used by other constructs. Upstreaming part of https://github.com/flang-compiler/f18-llvm-project/pull/438/ Reviewed By: schweitz Differential Revision: https://reviews.llvm.org/D88253	2020-10-01 20:39:04 -04:00
peter klausler	3261aefc72	[flang] Extend runtime API for PAUSE to allow a stop code Support integer and default character stop codes on PAUSE statements. Add length argument to STOP statement with a character stop code. Differential revision: https://reviews.llvm.org/D88692	2020-10-01 17:20:11 -07:00
peter klausler	a94d943f1a	[flang] Fix actions at end of output record It turns out that unformatted fixed-size output records do need to be padded out if short, in order to avoid a spurious EOF crash on a short record at the end of the file. While here in AdvanceRecord(), move the unformatted variable-length record header/footer writing code to here from EndIoStatement(). Differential revision: https://reviews.llvm.org/D88685	2020-10-01 17:18:20 -07:00
jasonliu	78a9e62aa6	[XCOFF] Enable -fdata-sections on AIX Summary: Some design decision worth noting about: I've noticed a recent mailing discussing about why string literal is not affected by -fdata-sections for ELF target: http://lists.llvm.org/pipermail/llvm-dev/2020-September/145121.html But on AIX, our linker could not split the mergeable string like other target. So I think it would make more sense for us to emit separate csect for every mergeable string in -fdata-sections mode, as there might not be other ways for linker to do garbage collection on unused mergeable string. Reviewed By: daltenty, hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D88339	2020-10-02 00:16:24 +00:00
peter klausler	61687f3a48	[flang] Fix buffering read->write transition The buffer needs to be Reset() after a Flush(), since the Flush() can be a no-op after a read->write transition. And record numbers are 1-based, not 0-based. This fixes a bug with rewrites of records that have been recently read. Differential revision: https://reviews.llvm.org/D88612	2020-10-01 16:57:38 -07:00
peter klausler	75a5ec1bad	[flang][msvc] Rework a MSVC work-around to avoid clang warning A recent MSVC work-around patch is eliciting unused variable warnings from clang; package the lambda reference arguments into a struct to avoid the warning. Differential revision: https://reviews.llvm.org/D88695	2020-10-01 16:52:30 -07:00
Philip Reames	bb0344644a	[memcpyopt] Conservatively handle non-integral pointers If we allow the non-integral pointers to become memset and memcpy, we loose the ability to reason about pointer propagation. This patch is modeled on changes we've carried downstream for a long time, figured it was worth being equally conservative for other users. There is room to refine the semantics and handling here if anyone is motivated.	2020-10-01 16:46:56 -07:00
Muhammad Asif Manzoor	aab6f7db47	[AArch64][SVE] Add lowering for llvm fabs Add the functionality to lower fabs for passthru variant Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D88679	2020-10-01 19:41:25 -04:00
Philip Reames	de3cb9548d	Fix a bug in memset formation with vectors of non-integral pointers We were converting the non-integral store into a integer store which is not legal.	2020-10-01 16:11:11 -07:00
Stanislav Mekhanoshin	caeb13aba8	[AMDGPU] Allow SOP asm mnemonic to differ Allows the creation of real SOP1 instructions with assembler mnemonics that differ from their pseudo-instruction mnemonics. The default behavior keeps the mnemonics matching. Corrects a subtarget label typo in a comment. Authored By: Joe_Nash Differential Revision: https://reviews.llvm.org/D88708	2020-10-01 16:00:04 -07:00
peter klausler	e99d184d54	[flang] Readability improvement in binary->decimal conversion Tweak binary->decimal conversions to avoid an integer multiplication in a hot loop to improve readability and get a minor (~5%) speed-up. Use native integer division by constants for more readability, too, since current build compilers seem to optimize it correctly now. Delete the now needless temporary work-around facility in Common/unsigned-const-division.h. Differential revision: https://reviews.llvm.org/D88604	2020-10-01 15:49:27 -07:00
Jessica Paquette	5402d11b1d	[GlobalISel][AArch64] Don't emit cset for G_FCMPs feeding into G_BRCONDs Similar to the FP case in `AArch64TargetLowering::LowerBR_CC`. Instead of emitting the csets + a tbnz, just emit a compare + bcc (or two bccs, depending on the condition code) This improves cases like this: https://godbolt.org/z/v8hebx This is a 0.1% geomean code size improvement for CTMark at -O3. Differential Revision: https://reviews.llvm.org/D88624	2020-10-01 15:34:16 -07:00
Jessica Paquette	8e8664e55e	[AArch64][GlobalISel] Use emitTestBit in selection for G_BRCOND Partially refactoring, partially fixing a bug. - We shouldn't use TB(N)ZX unless the bit number is >= 32 - We can fold more than xor using emitTestBit Also remove a check which isn't relevant anymore + update tests. Rename select-brcond-of-not.mir to select-brcond-of-binop.mir, since it now tests more than just G_XOR. Differential Revision: https://reviews.llvm.org/D88702	2020-10-01 15:33:34 -07:00
Amara Emerson	017b871502	[AArch64][GlobalISel] Alias rules for G_FCMP to G_ICMP. No need to be different here for the vast majority of rules.	2020-10-01 15:20:09 -07:00
Amara Emerson	e28c5899a2	[AArch64][GlobalISel] Make <8 x s8> integer arithmetic ops legal.	2020-10-01 14:35:21 -07:00
Amara Emerson	a97e97faed	[AArch64][GlobalISel] Make <8 x s8> shifts legal and add selection support.	2020-10-01 14:21:18 -07:00
Amara Emerson	9a2b3bbc59	Revert "[AArch64][GlobalISel] Make <8 x s8> shifts legal." Accidentally pushed this.	2020-10-01 14:15:57 -07:00
Amara Emerson	8071c2f5c6	[AArch64][GlobalISel] Make <8 x s8> shifts legal.	2020-10-01 14:10:10 -07:00
Alexandre Ganea	4140f0744f	[LLD][COFF] Fix crash with /summary and PCH input files Before this patch /summary was crashing with some .PCH.OBJ files, because tpiMap[srcIdx++] was reading at the wrong location. When the TpiSource depends on a .PCH.OBJ file, the types should be offset by the previously merged PCH.OBJ set of indices. Differential Revision: https://reviews.llvm.org/D88678	2020-10-01 17:08:35 -04:00
Raphael Isemann	15ea45f16b	[lldb] Skip unique_ptr import-std-module tests on Linux This seems to fail on ubuntu 18.04.5 with Clang 9 due to: Error output: error: Couldn't lookup symbols: std::__1::default_delete<int>::operator()(int) const	2020-10-01 23:04:36 +02:00
Amara Emerson	9f6acb1358	[AArch64][GlobalISel] Merge G_SHL, G_ASHR and G_LSHR legalizer rules together. There's no need for any difference between these.	2020-10-01 14:02:45 -07:00
Arthur Eubanks	b29573b672	[gn build] Support building with ThinLTO Differential Revision: https://reviews.llvm.org/D88584	2020-10-01 13:48:31 -07:00
Aaron Puchert	1c1a810558	libclc: Use find_package to find Python 3 and require it The script's shebang wants Python 3, so we use FindPython3. The original code didn't work when an unversioned python was not available. This is explicitly allowed in PEP 394. ("Distributors may choose to set the behavior of the python command as follows: python2, python3, not provide python command, allow python to be configurable by an end user or a system administrator.") Also I think it's actually required, so let the configuration fail if we can't find it. Lastly remove the shebang, since the script is only run via interpreter and doesn't have the executable bit set anyway. Reviewed By: jvesely Differential Revision: https://reviews.llvm.org/D88366	2020-10-01 22:31:33 +02:00
Amara Emerson	73457536ff	[AArch64][GlobalISel] Use custom legalization for G_TRUNC for v8i8 vectors. Truncating to v8i8 is a case where we want to split the source but also generate intermediate truncates to reduce the size of the source vector before truncating down to v8i8. This implements the same strategy as what SelectionDAG does, but I'm not certain where if anywhere in generic code it should live. Use it for legalization of v8s8 = G_ICMP v8s32. Differential Revision: https://reviews.llvm.org/D88191	2020-10-01 13:22:00 -07:00
Amara Emerson	4c265ce665	[AArch64][GlobalISel] Camp oversize v4s64 G_FPEXT operations.	2020-10-01 13:08:31 -07:00
Hubert Tong	35ecc7fe49	[clang][Sema] Fix PR47676: Handle dependent AltiVec C-style cast Fix premature decision in the presence of type-dependent expression operands on whether AltiVec vector initializations from single expressions are "splat" operations. Verify that the instantiation is able to determine the correct cast semantics for both the scalar type and the vector type case. Note that, because the change only affects the single-expression case (and the target type is an AltiVec-style vector type), the replacement of a parenthesized list with a parenthesized expression does not change the semantics of the program in a program-observable manner. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D88526	2020-10-01 15:57:01 -04:00
Fangrui Song	88f2fe5cad	Raland D87318 [LLD][PowerPC] Add support for R_PPC64_GOT_TLSGD_PCREL34 used in TLS General Dynamic Add Thread Local Storage support for the 34 bit relocation R_PPC64_GOT_TLSGD_PCREL34 used in General Dynamic. The compiler will produce code that looks like: ``` pla r3, x@got@tlsgd@pcrel R_PPC64_GOT_TLSGD_PCREL34 bl __tls_get_addr@notoc(x@tlsgd) R_PPC64_TLSGD R_PPC64_REL24_NOTOC ``` LLD should be able to correctly compute the relocation for R_PPC64_GOT_TLSGD_PCREL34 as well as do the following two relaxations where possible: General Dynamic to Local Exec: ``` paddi r3, r13, x@tprel nop ``` and General Dynamic to Initial Exec: ``` pld r3, x@got@tprel@pcrel add r3, r3, r13 ``` Note: This patch adds support for the PC Relative (no TOC) version of General Dynamic on top of the existing support for the TOC version of General Dynamic. The ABI does not provide any way to tell by looking only at the relocation `R_PPC64_TLSGD` when it is being used in a TOC instruction sequence or and when it is being used in a no TOC sequence. The TOC sequence should always be 4 byte aligned. This patch adds one to the offset of the relocation when it is being used in a no TOC sequence. In this way LLD can tell by looking at the alignment of the offset of `R_PPC64_TLSGD` whether or not it is being used as part of a TOC or no TOC sequence. Reviewed By: NeHuang, sfertile, MaskRay Differential Revision: https://reviews.llvm.org/D87318	2020-10-01 12:36:33 -07:00
Petr Hosek	de47e7122f	[CMake][Fuchsia] Don't set WIN32 API, rely on autodetection We prefer autodetection here to avoid persisting this configuration in the generated __config header which is shared across targets. Differential Revision: https://reviews.llvm.org/D88694	2020-10-01 12:35:52 -07:00
Reid Kleckner	d12ae042e1	[lit] Fix Python 2/3 compat in new winreg search code This should fix the test failures on the clang win64 bot: http://lab.llvm.org:8011/builders/clang-x64-windows-msvc/builds/18830 It has been red since Sept 23-ish. This was subtle to debug. Windows has 'find' and 'sort' utilities in C:\Windows\system32, but they don't support all the same flags as the coreutils programs. I configured the buildbot above with Python 2.7 64-bit (hey, it was set up in 2016). When I installed git for Windows, I opted to add all the Unix utilities that come with git to the system PATH. This is almost enough to make the LLVM tests pass, but not quite, because if you use the system PATH, the Windows version of find and sort come first, but the tests that use diff, cmp, etc, will all pass. So only a handful of tests will fail, and with cryptic error messages. The code changed in this CL doesn't work with Python 2. Before Python 3.2, the winreg.OpenKey function did not accept the `access=` keyword argument, the caller was required to pass an unused `reserved` positional argument of 0. The try/except/pass around the OpenKey operation masked this usage error in Python 2. Further, the result of the registry operation has to be converted from unicode to add it to the environment, but that was incidental.	2020-10-01 12:22:28 -07:00
Reid Kleckner	5d46d7e8b2	[PDB] Use one func id DenseMap instead of per-source maps, NFC This avoids some DenseMap copies when /Zi is in use, and results in fewer data structures. Differential Revision: https://reviews.llvm.org/D88617	2020-10-01 12:22:27 -07:00
Nikita Popov	9d1c8c0ba9	[InstCombine] Fix select operand simplification with undef (PR47696) When replacing X == Y ? f(X) : Z with X == Y ? f(Y) : Z, make sure that Y cannot be undef. If it may be undef, we might end up picking a different value for undef in the comparison and the select operand.	2020-10-01 21:15:48 +02:00
Petr Hosek	8d26760a95	[CMake] Use -isystem flag to access libc++ headers This is a partial revert of D62155. Rather than copying libc++ headers into the build directory to be later overwritten by the final headers, use -isystem flag to access libc++ headers during CMake checks. This should address the occasional flake we've seen, especially on Windows builders where CMake fails to overwrite __config with the final version. Differential Revision: https://reviews.llvm.org/D88454	2020-10-01 12:09:27 -07:00
Sanjay Patel	149f5b573c	[APFloat] convert SNaN to QNaN in convert() and raise Invalid signal This is an alternate fix (see D87835) for a bug where a NaN constant gets wrongly transformed into Infinity via truncation. In this patch, we uniformly convert any SNaN to QNaN while raising 'invalid op'. But we don't have a way to directly specify a 32-bit SNaN value in LLVM IR, so those are always encoded/decoded by calling convert from/to 64-bit hex. See D88664 for a clang fix needed to allow this change. Differential Revision: https://reviews.llvm.org/D88238	2020-10-01 14:37:38 -04:00
Arthur Eubanks	499260c03b	Revert "[CFGuard] Add address-taken IAT tables and delay-load support" This reverts commit `ef4e971e5e`.	2020-10-01 11:29:54 -07:00
Stefan Pintilie	5f3e565f59	Revert "[LLD][PowerPC] Add support for R_PPC64_GOT_TLSGD_PCREL34 used in TLS General Dynamic" This reverts commit `79122868f9`.	2020-10-01 13:28:35 -05:00
Stefan Pintilie	79122868f9	[LLD][PowerPC] Add support for R_PPC64_GOT_TLSGD_PCREL34 used in TLS General Dynamic Add Thread Local Storage support for the 34 bit relocation R_PPC64_GOT_TLSGD_PCREL34 used in General Dynamic. The compiler will produce code that looks like: ``` pla r3, x@got@tlsgd@pcrel R_PPC64_GOT_TLSGD_PCREL34 bl __tls_get_addr@notoc(x@tlsgd) R_PPC64_TLSGD R_PPC64_REL24_NOTOC ``` LLD should be able to correctly compute the relocation for R_PPC64_GOT_TLSGD_PCREL34 as well as do the following two relaxations where possible: General Dynamic to Local Exec: ``` paddi r3, r13, x@tprel nop ``` and General Dynamic to Initial Exec: ``` pld r3, x@got@tprel@pcrel add r3, r3, r13 ``` Note: This patch adds support for the PC Relative (no TOC) version of General Dynamic on top of the existing support for the TOC version of General Dynamic. The ABI does not provide any way to tell by looking only at the relocation `R_PPC64_TLSGD` when it is being used in a TOC instruction sequence or and when it is being used in a no TOC sequence. The TOC sequence should always be 4 byte aligned. This patch adds one to the offset of the relocation when it is being used in a no TOC sequence. In this way LLD can tell by looking at the alignment of the offset of `R_PPC64_TLSGD` whether or not it is being used as part of a TOC or no TOC sequence. Reviewed By: NeHuang, sfertile, MaskRay Differential Revision: https://reviews.llvm.org/D87318	2020-10-01 13:00:37 -05:00
Haojian Wu	c1b209cc61	[Format] Don't treat compound extension headers (foo.proto.h) as foo.cc main-file header. We receive internal bugs about this false positives after D86597. Differential Revision: https://reviews.llvm.org/D88640.	2020-10-01 19:57:57 +02:00
Louis Dionne	ba9b15072c	[libc++][ci] Add a job to run the vanilla configuration on Apple Previously, we'd only have jobs testing the Apple cache on Apple platforms, but libc++ should also work out-of-the-box.	2020-10-01 13:56:58 -04:00
Sanjay Patel	686eb0d8de	[AST] do not error on APFloat invalidOp in default mode If FP exceptions are ignored, we should not error out of compilation just because APFloat indicated an exception. This is required as a preliminary step for D88238 which changes APFloat behavior for signaling NaN convert() to set the opInvalidOp exception status. Currently, there is no way to trigger this error because convert() never sets opInvalidOp. FP binops that set opInvalidOp also create a NaN, so the path to checkFloatingPointResult() is blocked by a different diagnostic: // [expr.pre]p4: // If during the evaluation of an expression, the result is not // mathematically defined [...], the behavior is undefined. // FIXME: C++ rules require us to not conform to IEEE 754 here. if (LHS.isNaN()) { Info.CCEDiag(E, diag::note_constexpr_float_arithmetic) << LHS.isNaN(); return Info.noteUndefinedBehavior(); } return checkFloatingPointResult(Info, E, St); Differential Revision: https://reviews.llvm.org/D88664	2020-10-01 13:46:45 -04:00
Sanjay Patel	114e964dce	[InstCombine] auto-generate complete test checks; NFC	2020-10-01 13:44:31 -04:00
zoecarver	6c25816d7b	[DSE] Look through memory PHI arguments when removing noop stores in MSSA. Summary: Adds support for "following" memory through MSSA PHI arguments. This will help catch more noop stores that exist between blocks. Originally part of D79391. Reviewers: fhahn, jfb, asbirlea Differential Revision: https://reviews.llvm.org/D82588	2020-10-01 10:42:02 -07:00
Louis Dionne	a0119e5675	[libc++] NFC: Add missing SHA to ABI Changelog	2020-10-01 13:40:13 -04:00
Jamie Schmeiser	71124a9dbd	Reland No.3: Add new hidden option -print-changed which only reports changes to IR A new hidden option -print-changed is added along with code to support printing the IR as it passes through the opt pipeline in the new pass manager. Only those passes that change the IR are reported, with others only having the banner reported, indicating that they did not change the IR, were filtered out or ignored. Filtering of output via the -filter-print-funcs is supported and a new supporting hidden option -filter-passes is added. The latter takes a comma separated list of pass names and filters the output to only show those passes in the list that change the IR. The output can also be modified via the -print-module-scope function. The code introduces an abstract template base class that generalizes the comparison of IRs that takes an IR representation as template parameter. Derived classes provide overrides that provide an event based API for generalized reporting of IRs as they are changed in the opt pipeline through the new pass manager. The first of several instantiations is provided that prints the IR in a form similar to that produced by -print-after-all with the above mentioned filtering capabilities. This version, and the others to follow will be introduced at the upcoming developer's conference. Reviewed By: aeubanks (Arthur Eubanks), yrouban (Yevgeny Rouban), ychen (Yuanfang Chen), MaskRay (Fangrui Song) Differential Revision: https://reviews.llvm.org/D86360	2020-10-01 17:39:13 +00:00
peter klausler	e29c9d77f1	[flang] Fix WRITE after BACKSPACE A WRITE to an unformatted sequential variable-length unit after a BACKSPACE needs to forget its previous knowledge of the length of the record that's about to be overwritten, and a BACKSPACE after an ENDFILE or at the start of the file needs to be a no-op. Differential revision: https://reviews.llvm.org/D88675	2020-10-01 10:33:31 -07:00
Louis Dionne	8654a0f8bb	[libc++] Don't re-export new/delete from libc++abi when they are defined in libc++ This is a temporary workaround until the new/delete situation is made better (i.e. we don't include new/delete in both libc++ and libc++abi by default).	2020-10-01 13:31:55 -04:00

1 2 3 4 5 ...

367887 Commits All Branches Search

367887 Commits

All Branches