llvm-project

Commit Graph

Author	SHA1	Message	Date
Andrea Di Biagio	5ad52e35a8	[MCA][LSUnit] Return the ID of the dependent memory operation from method isReady(). NFCI This is yet another change in preparation for a fix for PR37494. llvm-svn: 354150	2019-02-15 18:05:59 +00:00
Philip Reames	d9c4646d05	[Tests] Demonstrate more missing atomicrmw transforms llvm-svn: 354146	2019-02-15 17:11:30 +00:00
Sanjay Patel	8a2b543a13	[InstCombine] fix crash while trying to narrow a binop of shuffles (PR40734) https://bugs.llvm.org/show_bug.cgi?id=40734 llvm-svn: 354144	2019-02-15 16:31:55 +00:00
Serge Guelton	b5d00c9b73	Revert r354137 - OptionalStorage implementation for trivial type, take III This still fails on some random platform, and I fail to reproduce the issue. llvm-svn: 354142	2019-02-15 16:12:46 +00:00
Matt Arsenault	59ecdb0d8b	GlobalISel: Fix inadequate verification of g_build_vector Testing based on the total size of the elements failed to catch a few invalid scenarios, so explicitly check the number of elements/operands and types. This failed to catch situations like <4 x s16> = G_BUILD_VECTOR s32, s32 since the total size added up. This also would fail to catch an implicit conversion between pointers and scalars. llvm-svn: 354139	2019-02-15 15:24:34 +00:00
Matt Arsenault	4673fdc531	Try to organize MachineVerifier tests The Verifier is separate from the MachineVerifier, so move it to a different directory. Some other verifier tests were scattered in target codegen tests as well (although I'm sure I missed some). Work towards using a more consistent naming scheme to make it clearer where the gaps still are for generic instructions. llvm-svn: 354138	2019-02-15 15:24:31 +00:00
Serge Guelton	fb4df68f48	OptionalStorage implementation for trivial type, take III This is another attempt at implementating optional storage for trivially copyable type, using an union instead of a raw buffer to hold the actual storage. This make it possible to get rid of the reinterpret_cast, and hopefully to fix the UB of the previous attempts. This validates fine on my laptop for gcc 8.2 and gcc 4.8, I'll revert if it breaks the validation. llvm-svn: 354137	2019-02-15 15:17:29 +00:00
Clement Courbet	f7e84a2ccc	[MergeICmps] Make base ordering really deterministic. Summary: The idea is that we now manipulate bases through a `unsigned BaseID` based on order of appearance in the comparison chain rather than through the `Value*`. Fixes 40714. Reviewers: gchatelet Subscribers: mgrang, jfb, jdoerfert, llvm-commits, hans Tags: #llvm Differential Revision: https://reviews.llvm.org/D58274 llvm-svn: 354131	2019-02-15 14:17:17 +00:00
Clement Courbet	cc004df7eb	[MergeICmps][NFC] Improve doc. llvm-svn: 354128	2019-02-15 12:58:06 +00:00
Hans Wennborg	cc980bfa8e	Speculatively revert r354051 "Recommit Optional specialization for trivially copyable types" and r354055 "Optional specialization for trivially copyable types, part2" These are suspected to cause Clang to get miscompiled on Ubuntu 14.04 (Trusty) which uses GCC 4.8.4. Reverting for an hour to see if this helps. See llvm-commits thread. > Recommit Optional specialization for trivially copyable types > > Unfortunately the original code gets misscompiled by GCC (at least 8.1), > this is a tentative workaround using std::memcpy instead of inplace new > for trivially copyable types. I'll revert if it breaks. > > Original revision: https://reviews.llvm.org/D57097 llvm-svn: 354126	2019-02-15 12:20:33 +00:00
Max Kazantsev	c065b025a6	[NFCI] Factor out block removal from stack of nested loops llvm-svn: 354124	2019-02-15 12:18:10 +00:00
Simon Pilgrim	623c38d6cd	Fix "field 'DFS' will be initialized after field 'DTU'" warning. NFCI. llvm-svn: 354123	2019-02-15 12:13:16 +00:00
Sam Parker	0b53e8454b	[BPI] Look through bitcasts in calcZeroHeuristic Constant hoisting may have hidden a constant behind a bitcast so that it isn't folded into its users. However, this prevents BPI from calculating some of its heuristics that are based upon constant values. So, I've added a simple helper function to look through these casts. Differential Revision: https://reviews.llvm.org/D58166 llvm-svn: 354119	2019-02-15 11:50:21 +00:00
Max Kazantsev	136f09bea1	[NFC] Promote DFS to field for further use llvm-svn: 354118	2019-02-15 11:39:35 +00:00
Simon Pilgrim	6ce08672fb	[X86][AVX] lowerShuffleAsLanePermuteAndPermute - fully populate the lane shuffle mask (PR40730) As detailed on PR40730, we are not correctly filling in the lane shuffle mask (D53148/rL344446) - we fill in for the correct src lane but don't add it to the correct mask element, so any reference to the correct element is likely to see an UNDEF mask index. This allows constant folding to propagate UNDEFs prior to the lane mask being (correctly) lowered to vperm2f128. This patch fixes the issue by fully populating the lane shuffle mask - this is more than is necessary (if we only filled in the required mask elements we might be able to match other shuffle instructions - broadcasts etc.), but its the most cautious approach as this needs to be cherrypicked into the 8.0.0 release branch. Differential Revision: https://reviews.llvm.org/D58237 llvm-svn: 354117	2019-02-15 11:39:21 +00:00
Diana Picus	c0f964eb2f	[ARM GlobalISel] Style fix. NFCI Add the opcode for ADDrr / t2ADDrr to the Opcode cache, as we did for all other opcodes where the handling is otherwise the same between arm mode and thumb2. llvm-svn: 354115	2019-02-15 10:50:02 +00:00
Diana Picus	a00425ff0d	[ARM GlobalISel] Support branches for Thumb2 Just like arm mode, but with different opcodes. llvm-svn: 354113	2019-02-15 10:24:03 +00:00
Alex Bradbury	22531c4a14	[RISCV] Add assembler support for LA pseudo-instruction This patch also introduces the emitAuipcInstPair helper, which is then used for both emitLoadAddress and emitLoadLocalAddress. Differential Revision: https://reviews.llvm.org/D55325 Patch by James Clarke. llvm-svn: 354111	2019-02-15 09:53:32 +00:00
Alex Bradbury	8eb87e59a6	[RISCV] Support assembling %got_pcrel_hi operator Differential Revision: https://reviews.llvm.org/D55279 Patch by James Clarke. llvm-svn: 354110	2019-02-15 09:43:46 +00:00
Sam Parker	3c17cb7bc4	[ARM CGP] Fix ConvertTruncs ConvertTruncs is used to replace a trunc for an AND mask, however this function wasn't working as expected. By performing the change later, we can create a wide type integer mask instead of a narrow -1 value, which could then be simply removed (incorrectly). Because we now perform this action later, it's necessary to cache the trunc type before we perform the promotion. Differential Revision: https://reviews.llvm.org/D57686 llvm-svn: 354108	2019-02-15 09:04:39 +00:00
Max Kazantsev	73db5c137a	[NFC] Tweak SplitBlockAndInsertIfThen to use existing ThenBlock llvm-svn: 354107	2019-02-15 08:18:00 +00:00
Max Kazantsev	184bd7a0d8	[TEST] Update test comments, refactor checks with update_test_checks.py This patch changes messages in guards-related tests to adequately reflect reality. llvm-svn: 354101	2019-02-15 07:06:52 +00:00
Matt Arsenault	2a5488b877	X86: Replace isSafeToClobberEFLAGS implementation Also use modifiesRegister instead of looping over operands. llvm-svn: 354098	2019-02-15 04:01:39 +00:00
Francis Visoiu Mistrih	3fb7d4f55f	Revert "[SystemZ] Do not emit VEXTEND or VROUND nodes without vector support." This reverts commit `aa0b77d339`. This fails to pass the machine verifier: http://green.lab.llvm.org/green/job/clang-stage1-cmake-RA-expensive/13579/ llvm-svn: 354096	2019-02-15 03:01:09 +00:00
Julian Lettner	bd40ecf7d6	[lit][NFC] Cleanup copy&paste naming mistake llvm-svn: 354095	2019-02-15 02:44:52 +00:00
Matt Davis	582274329e	[llvm-cxxfilt] Fix a comment typo. NFC. llvm-svn: 354094	2019-02-15 02:43:14 +00:00
Aditya Nandakumar	0e362ec19a	[GISel][NFC]: Add methods to speed up insertion into GISelWorklist https://reviews.llvm.org/D58073 Speed up insertion during the initial populating phase into the GISelWorkList by deferring repeatedly resizing the DenseMap. This results in ~10% improvement in the combiner passes, and ~3% speedup in the Legalizer. reviewed by: aemerson. llvm-svn: 354093	2019-02-15 01:37:54 +00:00
Konstantin Zhuravlyov	1e126c503b	AMDGPU: Set ABI version to 1 for code object v3 Differential Revision: https://reviews.llvm.org/D57811 llvm-svn: 354085	2019-02-14 23:56:04 +00:00
Matt Davis	123be5d4c0	[symbolizer] Avoid collecting symbols belonging to invalid sections. Summary: llvm-symbolizer would originally report symbols that belonged to an invalid object file section. Specifically the case where: `*Symbol.getSection() == ObjFile.section_end()` This patch prevents the Symbolizer from collecting symbols that belong to invalid sections. The test (from PR40591) introduces a case where two symbols have address 0, one symbol is defined, 'foo', and the other is not defined, 'bar'. This patch will cause the Symbolizer to keep 'foo' and ignore 'bar'. As a side note, the logic for adding symbols to the Symbolizer's store (`SymbolizableObjectFile::addSymbol`) replaces symbols with the same <address, size> pair. At some point that logic should be revisited as in the aforementioned case, 'bar' was overwriting 'foo' in the Symbolizer's store, and 'foo' was forgotten. This fixes PR40591 Reviewers: jhenderson, rupprecht Reviewed By: rupprecht Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58146 llvm-svn: 354083	2019-02-14 23:50:35 +00:00
Nick Desaulniers	6a84cd3b8e	Revert "[INLINER] allow inlining of address taken blocks" This reverts commit `19e95fe611`. llvm-svn: 354082	2019-02-14 23:42:21 +00:00
Nick Desaulniers	19e95fe611	[INLINER] allow inlining of address taken blocks as long as their uses does not contain calls to functions that capture the argument (potentially allowing the blockaddress to "escape" the lifetime of the caller). TODO: - add more tests - fix crash in llvm::updateCGAndAnalysisManagerForFunctionPass when invoking Transforms/Inline/blockaddress.ll llvm-svn: 354079	2019-02-14 23:35:53 +00:00
Sanjay Patel	0b2dca9f83	[x86] add tests for extractelement of FP; NFC llvm-svn: 354077	2019-02-14 23:17:22 +00:00
Julian Lettner	3c76c09ebf	[lit] Remove --single-process option (use -j1 instead) Remove `--single-process` command line option. Use `-j1` instead. Also see commit: `96adb78b12` llvm-svn: 354073	2019-02-14 22:46:56 +00:00
Konstantin Zhuravlyov	e0484eb2f2	MC/ELF: Allow targets to set ABI version Tests are in the follow up change Differential Revision: https://reviews.llvm.org/D57810 llvm-svn: 354072	2019-02-14 22:42:09 +00:00
Matt Arsenault	530d05e94a	GlobalISel: Add alignment to LegalityQuery MMOs This allows targets to specify the minimum alignment required for the load/store. llvm-svn: 354071	2019-02-14 22:41:09 +00:00
Matt Arsenault	294483f1c0	Replace gcroot verifier tests These haven't been checking anything useful and have been testing the wrong failure reason for many years. Replace them with something which stresses what is actually implemented in the verifier now. llvm-svn: 354070	2019-02-14 22:41:01 +00:00
Julian Lettner	96adb78b12	[lit] Set --single-process for single tests and --threads=1 Summary: Automatically upgrade debugging experience (single process, no thread pool) when: 1) we only run a single test 2) user specifies `-j1` Details: Fix `--max-failures` in single process mode. Option did not have an effect in single process mode. Add display feedback for single process mode. Adapted test. Improve argument checking (require positive integers). `--single-process` is now essentially an alias for `-j1`. Should we remove it? Reviewers: rnk Differential Revision: https://reviews.llvm.org/D58249 llvm-svn: 354068	2019-02-14 22:30:07 +00:00
Matt Arsenault	9e5e868d95	AMDGPU/GlobalISel: Fix RegBankSelect for GEP. This is basically a pointer typed add, so shouldn't be any different. This was assuming everything was an SGPR, which is not true. Also cleanup legality for GEP. I don't seem to be seeing the problem the hack marking s64 as a legal pointer type the comment mentions. llvm-svn: 354067	2019-02-14 22:24:28 +00:00
Stanislav Mekhanoshin	871821f786	[AMDGPU] Ressociate 'add (add x, y), z' to use SALU Reassociate adds to collect scalar operands in a single instruction when possible. That will result in a scalar add followed by vector instead of two vector adds, thus better utilizing SALU. Differential Revision: https://reviews.llvm.org/D58220 llvm-svn: 354066	2019-02-14 22:11:25 +00:00
Matt Arsenault	d3d496338e	AMDGPU/GlobalISel: Handle split for 64-bit VALU select llvm-svn: 354065	2019-02-14 21:58:12 +00:00
Teresa Johnson	d0b1f30b32	[ThinLTO] Detect partially split modules during the thin link Summary: The changes to disable LTO unit splitting by default (r350949) and detect inconsistently split LTO units (r350948) are causing some crashes when the inconsistency is detected in multiple threads simultaneously. Fix that by having the code always look for the inconsistently split LTO units during the thin link, by checking for the presence of type tests recorded in the summaries. Modify test added in r350948 to remove single threading required to fix a bot failure due to this issue (and some debugging options added in the process of diagnosing it). Reviewers: pcc Subscribers: mehdi_amini, inglorion, eraman, steven_wu, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57561 llvm-svn: 354062	2019-02-14 21:22:50 +00:00
Chris Bieneman	e8d95ad9ae	[CMake] Fix ability to use LLVM_ENABLE_PROJECTS with LLVM_EXTERNAL_PROJECTS LLVM r353148, changed the circumstances in which the project source directory variables are created to only create them for LLVM projects. This patch initializes the directory variables for projects specified in `LLVM_EXTERNAL_PROJECTS` as well. llvm-svn: 354060	2019-02-14 20:57:17 +00:00
Philip Reames	db57ef6238	[InstCombine] Add todos for possible atomicrmw transforms llvm-svn: 354059	2019-02-14 20:48:36 +00:00
Philip Reames	485474208e	Canonicalize all integer "idempotent" atomicrmw ops For "idempotent" atomicrmw instructions which we can't simply turn into load, canonicalize the operation and constant. This reduces the matching needed elsewhere in the optimizer, but doesn't directly impact codegen. For any architecture where OR/Zero is not a good default choice, you can extend the AtomicExpand lowerIdempotentRMWIntoFencedLoad mechanism. I reviewed X86 to make sure this works well, haven't audited other backends. Differential Revision: https://reviews.llvm.org/D58244 llvm-svn: 354058	2019-02-14 20:41:17 +00:00
Nico Weber	04a1ee4660	Stop enabling clang-tools-extra automatically when clang is in LLVM_ENABLE_PROJECTS If you want to build clang-tools-extra with monorepo, just add it to LLVM_ENABLE_PROJECTS like with other projects. See also "Separating clang-tools-extra from clang in LLVM_ENABLE_PROJECTS" on cfe-dev. Differential Revision: https://reviews.llvm.org/D58157 llvm-svn: 354057	2019-02-14 20:26:35 +00:00
Serge Guelton	794b536736	Optional specialization for trivially copyable types, part2 llvm-svn: 354055	2019-02-14 19:41:44 +00:00
Serge Guelton	4d0934c48f	Recommit Optional specialization for trivially copyable types Unfortunately the original code gets misscompiled by GCC (at least 8.1), this is a tentative workaround using std::memcpy instead of inplace new for trivially copyable types. I'll revert if it breaks. Original revision: https://reviews.llvm.org/D57097 llvm-svn: 354051	2019-02-14 19:17:00 +00:00
Philip Reames	97067d3c73	Teach instcombine about remaining "idempotent" atomicrmw types Expand on Quentin's r353471 patch which converts some atomicrmws into loads. Handle remaining operation types, and fix a slight bug. Atomic loads are required to have alignment. Since this was within the InstCombine fixed point, somewhere else in InstCombine was adding alignment before the verifier saw it, but still, we should fix. Terminology wise, I'm using the "idempotent" naming that is used for the same operations in AtomicExpand and X86ISelLoweringInfo. Once this lands, I'll add similar tests for AtomicExpand, and move the pattern match function to a common location. In the review, there was seemingly consensus that "idempotent" was slightly incorrect for this context. Once we setle on a better name, I'll update all uses at once. Differential Revision: https://reviews.llvm.org/D58242 llvm-svn: 354046	2019-02-14 18:39:14 +00:00
Saleem Abdulrasool	cb914cf683	Support: use internal `call_once` on PPC64le Always use the internal `call_once` for PPC64le. This is needed to support the Swift toolchain on PPC64le. Patch by Sarvesh Tamba! llvm-svn: 354045	2019-02-14 18:36:52 +00:00
Jordan Rupprecht	b7ae7297b9	[llvm-ar] Implement the P modifier. Summary: GNU ar has a `P` modifier that changes filename comparisons to use full paths instead of the basename. As noted in the GNU docs, regular archives are not created with full path names, so P is used to deal with archives created by other archive programs (e.g. see the updated `absolute-paths.test` test case). Since thin archives use full path names -- paths are relative to the archive -- it seems very error prone to not imply P when dealing with thin archives, so P is implied in those cases. (I think this is a deviation from GNU ar that makes sense). This fixes PR37436 via https://github.com/ClangBuiltLinux/linux/issues/33. Reviewers: mstorsjo, pcc, ruiu, davide, david2050, rnk Subscribers: tpimh, llvm-commits, nickdesaulniers Tags: #llvm Differential Revision: https://reviews.llvm.org/D57927 llvm-svn: 354044	2019-02-14 18:35:13 +00:00
Nirav Dave	5ffdc43dc9	[X86] cleanup inline asm register generation. NFCI. llvm-svn: 354042	2019-02-14 18:06:21 +00:00
Jonas Paulsson	aa0b77d339	[SystemZ] Do not emit VEXTEND or VROUND nodes without vector support. Review: Ulrich Weigand https://reviews.llvm.org/D58240 llvm-svn: 354039	2019-02-14 17:58:48 +00:00
Philip Reames	617cd10bf7	[Tests] Add tests for all idemptotent atomicrmws Base for a followup patch to strengthen the InstCombine transform, and then integrate the ExpandAtomics logic. llvm-svn: 354036	2019-02-14 17:01:15 +00:00
Simon Pilgrim	362fe56034	[X86][AVX] Add PR40730 test case llvm-svn: 354034	2019-02-14 14:45:32 +00:00
Teresa Johnson	c374a800e7	Refine ArgPromotion metadata handling Summary: In r353537 we now copy all metadata to the new function, with the old being removed when the old function is eliminated. In some cases the old function is dropped to a declaration (seems to only occur with the old PM). Go ahead and clear all metadata from the old function to handle that case, since verification will complain otherwise. This is consistent with what was being done for debug metadata before r353537. Reviewers: davidxl, uabelho Subscribers: jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58215 llvm-svn: 354032	2019-02-14 14:14:24 +00:00
Florian Hahn	6ab83b7db6	[LoopUnrollPeel] Add case where we should forget the peeled loop from SCEV. The test case requires the peeled loop to be forgotten after peeling, even though it does not have a parent. When called via the unroller, SE->forgetTopmostLoop is also called, so the test case would also pass without any SCEV invalidation, but peelLoop is exposed as utility function. Also, in the test case, simplifyLoop will make changes, removing the loop from SCEV, but it is better to not rely on this behavior. Reviewers: sanjoy, mkazantsev Reviewed By: mkazantsev Tags: #llvm Differential Revision: https://reviews.llvm.org/D58192 llvm-svn: 354031	2019-02-14 13:59:39 +00:00
Alex Bradbury	4efa0b674d	[RISCV][NFC] Add RV64I CHECK lines to inline-asm.ll test llvm-svn: 354028	2019-02-14 13:09:54 +00:00
Sam McCall	15e475e222	Reapply [VFS] Allow multiple RealFileSystem instances with independent CWDs. This reverts commit r351091. The original mac breakages are addressed by ensuring the root directory we're working from is fully symlink-resolved before starting. Differential Revision: https://reviews.llvm.org/D58169 llvm-svn: 354026	2019-02-14 12:57:01 +00:00
Petar Avramovic	14c7ecfe84	[MIPS GlobalISel] Select phi instruction for integers Select G_PHI for integers for MIPS32. Differential Revision: https://reviews.llvm.org/D58183 llvm-svn: 354025	2019-02-14 12:36:19 +00:00
Clement Courbet	c6e768f0ee	[Instrumentation][NFC] Fix warning. lib/Transforms/Instrumentation/AddressSanitizer.cpp:1173:29: warning: extra ‘;’ [-Wpedantic] llvm-svn: 354024	2019-02-14 12:10:49 +00:00
Petar Avramovic	5d9b8eed85	[MIPS GlobalISel] Select branch instructions Select G_BR and G_BRCOND for MIPS32. Unconditional branch G_BR does not have register operand, for that reason we only add tests. Since conditional branch G_BRCOND compares register to zero on MIPS32, explicit extension must be performed on i1 condition in order to set high bits to appropriate value. Differential Revision: https://reviews.llvm.org/D58182 llvm-svn: 354022	2019-02-14 11:39:53 +00:00
Max Kazantsev	24383cd7bb	Make widenable condition transparent for MemoryWriteTracking Side effects of widenable condition intrinsic are modelled via InaccessibleMemOnly, and there is no way to say that it isn't really writing any memory. This patch teaches MemoryWriteTracking ignore this intrinsic. llvm-svn: 354021	2019-02-14 11:10:29 +00:00
Max Kazantsev	b3168a400f	Teach isGuaranteedToTransferExecutionToSuccessor about widenable conditions Widenable condition intrinsic is guaranteed to return value, notify the isGuaranteedToTransferExecutionToSuccessor function about it. llvm-svn: 354020	2019-02-14 11:10:21 +00:00
Jeremy Morse	897a9f8d00	Fix an accidentally flipped pair of arguments, NFCI While rebasing a refactor in r353950 I accidentally swapped two function arguments; one is SelectionDAGBuilders "current" DebugLoc, the other is the one from the "current" debug intrinsic. They're probably always identical, but I haven't proved that yet. llvm-svn: 354019	2019-02-14 11:09:24 +00:00
David Green	743abf2bd9	[ARM] Ensure we update the correct flags in the peephole optimiser The Arm peephole optimiser code keeps track of both an MI and a SubAdd that can be used to optimise away a CMP. In the rare case that both are found and not ruled-out as valid, we could end up setting the flags on the wrong one. Instead make sure we are using SubAdd if it exists, as it will be closer to the CMP. The testcase here is a little theoretical, with a dead def of cpsr. It should hopefully show the point. Differential Revision: https://reviews.llvm.org/D58176 llvm-svn: 354018	2019-02-14 11:09:24 +00:00
Andrew Ng	d27cf27eb1	[Support] Fix TempFile::discard to not leave behind temporary files Moved the remove of the temporary file to after the close to avoid remove failures caused by ETXTBSY errors. This issue was seen when FileOutputBuffer falls back to an in memory buffer due to the inability to mmap the on disk file. This occurred when running LLD on an Ubuntu VM in VirtualBox on a Windows host attempting to write the output to a VirtualBox shared folder. Differential Revision: https://reviews.llvm.org/D57960 llvm-svn: 354017	2019-02-14 11:08:49 +00:00
Max Kazantsev	deaf2ba280	[NFC] Refactor LICM code for better readability llvm-svn: 354013	2019-02-14 09:04:12 +00:00
Fangrui Song	91c4fa9d32	[llvm-readobj][test] Add all GNU_PROPERTY_X86_FEATURE_2_{NEEDED,USED} bits And delete trailing whitespace llvm-svn: 354011	2019-02-14 07:52:51 +00:00
Douglas Yung	27343bd8f2	Relax test to check for a valid number instead of a specific number to be like every other check in this test. llvm-svn: 354007	2019-02-14 04:11:09 +00:00
Craig Topper	1d158dd930	[X86] Make (f80 (sint_to_fp (i16))) use fistps/fisttps instead of fistpl/fisttpl when SSE is enabled. When SSE is enabled sint_to_fp with i16 is blindly promoted to i32, but that changes the behavior of f80 conversion. Move the promotion to i16 to LowerFP_TO_INT so we can limit it based on the floating point type. llvm-svn: 354003	2019-02-14 01:41:43 +00:00
Matthew Voss	9cb76856d8	Revert "[llvm-objdump] Allow short options without arguments to be grouped" Reverted due to failures on the llvm-hexagon-elf. This reverts commit `77e1f27476`. llvm-svn: 354002	2019-02-14 01:39:43 +00:00
Matthew Voss	77e1f27476	[llvm-objdump] Allow short options without arguments to be grouped Summary: https://bugs.llvm.org/show_bug.cgi?id=31679 Reviewers: kristina, jhenderson, grimar, jakehehrlich, rupprecht Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57904 llvm-svn: 353998	2019-02-14 00:39:40 +00:00
Daniel Sanders	dfa0f556bf	[globalisel][combine] Split existing rules into a match and apply step Summary: The declarative tablegen definitions split rules into match and apply steps. Prepare for that by doing the same in the C++ implementations. This aids some of the migration effort while the tablegen version is incomplete. Reviewers: bogner, volkan, aditya_nandakumar, paquette, aemerson Reviewed By: aditya_nandakumar Subscribers: rovka, kristof.beyls, Petar.Avramovic, jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58150 llvm-svn: 353996	2019-02-14 00:15:28 +00:00
Jordan Rupprecht	451c2ef199	[llvm-ar][libObject] Fix relative paths when nesting thin archives. Summary: When adding one thin archive to another, we currently chop off the relative path to the flattened members. For instance, when adding `foo/child.a` (which contains `x.txt`) to `parent.a`, when flattening it we should add it as `foo/x.txt` (which exists) instead of `x.txt` (which does not exist). As a note, this also undoes the `IsNew` parameter of handling relative paths in r288280. The unit test there still passes. This was reported as part of testing the kernel build with llvm-ar: https://patchwork.kernel.org/patch/10767545/ (see the second point). Reviewers: mstorsjo, pcc, ruiu, davide, david2050, inglorion Reviewed By: ruiu Subscribers: void, jdoerfert, tpimh, mgorny, hans, nickdesaulniers, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57842 llvm-svn: 353995	2019-02-13 23:39:41 +00:00
Stefan Pintilie	1113940df2	[PowerPC][NFC] Added tests for prologue and epilogue code gen. Added four test files to check the existing behaviour of prologue and epilogue code generation. This patch was done as a setup for the upcoming patch listed on Phabricator that will change how the prologue and epilogue work. The upcoming patch is: https://reviews.llvm.org/D42590 llvm-svn: 353994	2019-02-13 23:37:23 +00:00
Sanjay Patel	d05ba496bc	[ConstProp] add IR tests to show miscompiles; NFC A fix for these is proposed in D51216. llvm-svn: 353992	2019-02-13 23:27:31 +00:00
Fangrui Song	91ab9bf32c	[llvm-readobj] Dump GNU_PROPERTY_X86_ISA_1_{NEEDED,USED} notes in .note.gnu.property Reviewers: grimar, rupprecht Reviewed By: rupprecht Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58175 llvm-svn: 353991	2019-02-13 23:18:05 +00:00
Philip Reames	e4cfb7dae8	[SelectionDAG] Inline a single use helper function, and remove last non-MMO interface [NFC] For D57601, we need to know whether the instruction is volatile. We'd either have to pass yet another parameter, or just standardize on the MMO interface. I chose the second. llvm-svn: 353989	2019-02-13 23:01:11 +00:00
Mark Lacey	c18b8a8bc5	[RegAllocGreedy] Take last chance recoloring into account in evicting. Last chance recoloring inserts into FixedRegisters those virtual registers it is attempting to assign a physical register to. We must consider these when we consider candidates for eviction so that we do not end up evicting something while we are attempting to recolor to assign it. This is hitting in an out-of-tree target and no longer reproduces on trunk. That does not appear to be a result of it having been fixed, but rather, it appears that optimization changes and/or other changes to register allocation mask the problem. I haven't found a way to come up with a reasonable test case for this (i.e. one that I can actually commit to open source, is reasonable in size, and actually reproduces the issue). rdar://problem/45708741 llvm-svn: 353988	2019-02-13 22:56:43 +00:00
Dylan McKay	8a56d10a2f	[AVR] Fix a typo - 's/analisys/analysis' llvm-svn: 353987	2019-02-13 22:31:37 +00:00
Thomas Lively	bba3f06d05	[WebAssembly] memory.fill Summary: memset lowering, fix argument types in memcpy lowering, and test encodings. Depends on D57736. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57791 llvm-svn: 353986	2019-02-13 22:25:18 +00:00
Leonard Chan	436fb2bd82	[NewPM] Second attempt at porting ASan This is the second attempt to port ASan to new PM after D52739. This takes the initialization requried by ASan from the Module by moving it into a separate class with it's own analysis that the new PM ASan can use. Changes: - Split AddressSanitizer into 2 passes: 1 for the instrumentation on the function, and 1 for the pass itself which creates an instance of the first during it's run. The same is done for AddressSanitizerModule. - Add new PM AddressSanitizer and AddressSanitizerModule. - Add legacy and new PM analyses for reading data needed to initialize ASan with. - Removed DominatorTree dependency from ASan since it was unused. - Move GlobalsMetadata and ShadowMapping out of anonymous namespace since the new PM analysis holds these 2 classes and will need to expose them. Differential Revision: https://reviews.llvm.org/D56470 llvm-svn: 353985	2019-02-13 22:22:48 +00:00
Thomas Lively	de7a0a1526	[WebAssembly] Bulk memory intrinsics and builtins Summary: implements llvm intrinsics and clang intrinsics for memory.init and data.drop. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D57736 llvm-svn: 353983	2019-02-13 22:11:16 +00:00
Serge Guelton	221c39165d	Revert r353962 Specialization of Optional for trivially copyable types yields failure on the buildbots I fail to reproduce locally. Better safe than sorry, reverting. llvm-svn: 353982	2019-02-13 22:11:09 +00:00
Peter Collingbourne	7761790881	gn build: Merge r353957. llvm-svn: 353980	2019-02-13 21:03:23 +00:00
Philip Reames	41f400c948	[SelectionDAG] Kill last uses of getAtomic w/o a MMO operand [NFC] The helper function was used by only two callers, and largely ended up providing distinct functionality based on optional arguments and opcode. Inline and simply to make the functionality much more clear. llvm-svn: 353977	2019-02-13 20:42:59 +00:00
Craig Topper	fa533f2152	[X86] Add 'mpx' to getHostCPUFeatures. llvm-svn: 353974	2019-02-13 20:12:41 +00:00
Vedant Kumar	4b0cc9a7c8	[CodeExtractor] Only lift lifetime markers present in the extraction region When CodeExtractor finds liftime markers referencing inputs to the extraction region, it lifts these markers out of the region and inserts them around the call to the extracted function (see r350420, PR39671). However, it should only lift lifetime markers that are actually present in the extraction region. I.e., if a start marker is present in the extraction region but a corresponding end marker isn't (or vice versa), only the start marker (or end marker, resp.) should be lifted. Differential Revision: https://reviews.llvm.org/D57834 llvm-svn: 353973	2019-02-13 19:53:38 +00:00
Philip Reames	9fc51bae73	[Tests] More unordered atomic lowering tests This time, focused around narrowing and widening transformations. Also, include a few simple memory optimization tests to highlight missed oppurtunities. This is part of building up the test base for D57601. llvm-svn: 353972	2019-02-13 19:49:17 +00:00
Philip Reames	9239b9a0e2	[Tests] RMW folding tests w/unordered atomic operations We get a suprising number of these today actually, but some are missed. The main point of this is strengthen the test set for D57601. llvm-svn: 353966	2019-02-13 18:41:54 +00:00
Philip Reames	430d294f0b	[Tests] Add a bunch of tests for load folding w/unordered atomics llvm-svn: 353964	2019-02-13 18:26:01 +00:00
Craig Topper	6829ca975d	[X86] Add 'fxsr' to the getHostCPUFeatures detection code. We implicitly mark this feature as enabled when the target is 64-bits, but our detection code for -march=native didn't support it so you can't detect it on 32-bit targets. llvm-svn: 353963	2019-02-13 18:21:36 +00:00
Serge Guelton	f688293393	Re-commit rL353927, patch included Make llvm::Optional<T> trivially copyable when T is trivially copyable This is an ever-recurring issue (see https://bugs.llvm.org/show_bug.cgi?id=39427 and https://bugs.llvm.org/show_bug.cgi?id=35978) but I believe that thanks to https://reviews.llvm.org/D54472 we can now ship a decent implementation of this. Basically the fact that llvm::is_trivially_copyable has a consistent behavior across compilers should prevent any ABI issue, and using in-place new instead of memcpy should keep compiler bugs away. This patch is slightly different from the original revision https://reviews.llvm.org/rL353927 but achieves the same goal. It just avoids going through std::conditional which may the code more explicit. llvm-svn: 353962	2019-02-13 18:12:04 +00:00
Philip Reames	5dddedee93	[Tests] First batch of cornercase tests for unordered atomics Mixture of things we legally can't do, and things we're missing. Once D57601 is in, the later will serve as a punch list. llvm-svn: 353959	2019-02-13 18:00:58 +00:00
Philip Reames	80e40959a0	[Tests] Auto update a test llvm-svn: 353958	2019-02-13 17:30:03 +00:00
Petr Hosek	fcbec02ea6	[AArch64] Support reserving arbitrary general purpose registers This is a follow up to D48580 and D48581 which allows reserving arbitrary general purpose registers with the exception of registers with special purpose (X8, X16-X18, X29, X30) and registers used by LLVM (X0, X19). This change also generalizes some of the existing logic to rely entirely on values generated from tablegen. Differential Revision: https://reviews.llvm.org/D56305 llvm-svn: 353957	2019-02-13 17:28:47 +00:00
Philip Reames	122e8132b4	[Tests] Rename some test files for consistency Most are named "atomic-something" so rename the few which were "atomic_something". I keep typing the wrong name due to the inconsistency. :) llvm-svn: 353956	2019-02-13 17:23:11 +00:00
Jeremy Morse	291713a596	[DebugInfo][DAG] Either salvage dangling debug info or emit Undef DBG_VALUEs In this patch SelectionDAG tries to salvage any dbg.values that are going to be dropped, in case they can be recovered from Values in the current BB. It also strengthens SelectionDAGs handling of dangling debug data, so that dbg.values are always emitted (as Undef or otherwise) instead of dangling forever. The motivation behind this patch exists in the new test case: a memory address (here a bitcast and GEP) exist in one basic block, and a dbg.value referring to the address is left in the 'next' block. The base pointer is live across all basic blocks. In current llvm trunk the dbg.value cannot be encoded, and it isn't even emitted as an Undef DBG_VALUE. The change is simply: if we're definitely going to drop a dbg.value, repeatedly apply salvageDebugInfo to its operand until either we find something that can be encoded, or we can't salvage any further in which case we produce an Undef DBG_VALUE. To know when we're "definitely going to drop a dbg.value", SelectionDAG signals SelectionDAGBuilder when all IR instructions have been encoded to force salvaging. This ensures that any dbg.value that's dangling after DAG creation will have a corresponding DBG_VALUE encoded. Differential Revision: https://reviews.llvm.org/D57694 llvm-svn: 353954	2019-02-13 16:33:05 +00:00
Simon Pilgrim	48d27e8393	[X86][AVX] Add shuffle_v8i32_0dcd3f14 shuffle test case llvm-svn: 353953	2019-02-13 16:12:36 +00:00
Fangrui Song	6a03b93224	[llvm-readobj] Rename pr_data to PrData As requested by grimar in D58112. llvm-svn: 353951	2019-02-13 15:58:23 +00:00
Jeremy Morse	6d3cd3b4ec	[DebugInfo][DAG] Refactor dbg.value lowering into its own method This is a pure copy-and-paste job, moving the logic for lowering dbg.value intrinsics to SDDbgValues into its own function. This is ahead of adding some more users of this logic. Differential Revision: https://reviews.llvm.org/D57697 llvm-svn: 353950	2019-02-13 15:53:10 +00:00
Andrea Di Biagio	245163ffd0	[MCA] Store a bitmask of used groups in the instruction descriptor. This is to speedup 'checkAvailability' queries in class ResourceManager. No functional change intended. llvm-svn: 353949	2019-02-13 14:56:06 +00:00
Jeremy Morse	a9a11aac0f	[DebugInfo][DAG] Limit special-casing of dbg.values for Arguments SelectionDAGBuilder has special handling for dbg.value intrinsics that are understood to define the location of function parameters on entry to the function. To enable this, we avoid recording a dbg.value as a virtual register reference if it might be such a parameter, so that it later hits EmitFuncArgumentDbgValue. This patch reduces the set of circumstances where we avoid recording a dbg.value as a virtual register reference, to allow more "normal" variables to be recorded that way. We now only bypass for potential parameters if: * The dbg.value operand is an Argument, * The Variable is a parameter, and * The Variable is not inlined. meaning it's very likely that the dbg.value is a function-entry parameter location. Differential Revision: https://reviews.llvm.org/D57584 llvm-svn: 353948	2019-02-13 13:37:33 +00:00
Max Kazantsev	3fe9ad7a9f	[NFC] Add const qualifiers where possible llvm-svn: 353941	2019-02-13 11:54:45 +00:00
Serge Guelton	699c22839a	Revert r353927 llvm-svn: 353940	2019-02-13 11:35:45 +00:00
Diana Picus	aa4118a873	[ARM GlobalISel] Support G_SELECT for Thumb2 Same as arm mode, but slightly different opcodes. llvm-svn: 353938	2019-02-13 11:25:32 +00:00
Andrea Di Biagio	318f990aee	[MCA][Scheduler] Use latency information to further classify busy instructions. This patch introduces a new instruction stage named 'IS_PENDING'. An instruction transitions from the IS_DISPATCHED to the IS_PENDING stage if input registers are not available, but their latency is known. This patch also adds a new set of instructions named 'PendingSet' to class Scheduler. The idea is that the PendingSet will only contain instructions that have reached the IS_PENDING stage. By construction, an instruction in the PendingSet is only dependent on instructions that have already reached the execution stage. The plan is to use this knowledge to identify bottlenecks caused by data dependencies (see PR37494). Differential Revision: https://reviews.llvm.org/D58066 llvm-svn: 353937	2019-02-13 11:02:42 +00:00
Jeremy Morse	f10af3f134	[DebugInfo][InstCombine] Prefer to salvage debuginfo over sinking it When instcombine sinks an instruction between two basic blocks, it sinks any dbg.value users in the source block with it, to prevent debug use-before-free. However we can do better by attempting to salvage the debug users, which would avoid moving where the variable location changes. If we successfully salvage, still sink a (cloned) dbg.value with the sunk instruction, as the sunk instruction is more likely to be "live" later in the compilation process. If we can't salvage dbg.value users of a sunk instruction, mark the dbg.values in the original block as being undef. This terminates any earlier variable location range, and represents the fact that we've optimized out the variable location for a portion of the program. Differential Revision: https://reviews.llvm.org/D56788 llvm-svn: 353936	2019-02-13 10:54:53 +00:00
Serge Guelton	f8ffb926e2	Missing header llvm-svn: 353933	2019-02-13 10:19:06 +00:00
Max Kazantsev	2bb95e7c76	[GuardWidening] Support widening of explicitly expressed guards This patch adds support of guards expressed in explicit form via `widenable_condition` in Guard Widening pass. Differential Revision: https://reviews.llvm.org/D56075 Reviewed By: reames llvm-svn: 353932	2019-02-13 09:56:30 +00:00
David Stenberg	9dbeca3d77	[DebugInfo] Stop changing labels for register-described parameter DBG_VALUEs Summary: This is a follow-up to D57510. This patch stops DebugHandlerBase from changing the starting label for the first non-overlapping, register-described parameter DBG_VALUEs to the beginning of the function. That code did not consider what defined the registers, which could result in the ranges for the debug values starting before their defining instructions. We currently do not emit debug values for constant values directly at the start of the function, so this code is still useful for such values, but my intention is to remove the code from DebugHandlerBase completely when we get there. One reason for removing it is that the code violates the history map's ranges, which I think can make it quite confusing when troubleshooting. In D57510, PrologEpilogInserter was amended so that parameter DBG_VALUEs now are kept at the start of the entry block, even after emission of prologue code. That was done to reduce the degradation of debug completeness from this patch. PR40638 is another example, where the lexical-scope trimming that LDV does, in combination with scheduling, results in instructions after the prologue being left without locations. There might be other cases where the DBG_VALUEs are pushed further down, for which the DebugHandlerBase code may be helpful, but as it now quite often result in incorrect locations, even after the prologue, it seems better to remove that code, and try to work our way up with accurate locations. In the long run we should maybe not aim to provide accurate locations inside the prologue. Some single location descriptions, at least those referring to stack values, generate inaccurate values inside the epilogue, so we maybe should not aim to achieve accuracy for location lists. However, it seems that we now emit line number programs that can result in GDB and LLDB stopping inside the prologue when doing line number stepping into functions. See PR40188 for more information. A summary of some of the changed test cases is available in PR40188#c2. Reviewers: aprantl, dblaikie, rnk, jmorse Reviewed By: aprantl Subscribers: jdoerfert, jholewinski, jvesely, javed.absar, llvm-commits Tags: #debug-info, #llvm Differential Revision: https://reviews.llvm.org/D57511 llvm-svn: 353928	2019-02-13 09:34:07 +00:00
Serge Guelton	ab061d351e	Make llvm::Optional<T> trivially copyable when T is trivially copyable This is an ever-recurring issue (see https://bugs.llvm.org/show_bug.cgi?id=39427 and https://bugs.llvm.org/show_bug.cgi?id=35978) but I believe that thanks to https://reviews.llvm.org/D54472 we can now ship a decent implementation of this. Basically the fact that llvm::is_trivially_copyable has a consistent behavior across compilers should prevent any ABI issue, and using in-place new instead of memcpy should keep compiler bugs away. Differential Revision: https://reviews.llvm.org/D57097 llvm-svn: 353927	2019-02-13 09:31:22 +00:00
Michal Gorny	5590548f6b	[llvm] [cmake] Provide split include paths in LLVMConfig Modify LLVMConfig to provide split variables for in-source and generated include paths. Currently, it uses a single value for both LLVM_INCLUDE_DIRS and LLVM_INCLUDE_DIR which works for install tree but fails hard at build tree (where LLVM_INCLUDE_DIR incorrectly contains multiple values). Instead, put the generated directory in LLVM_INCLUDE_DIR, and the source tree in LLVM_MAIN_INCLUDE_DIR which is consistent with in-LLVM builds. For install tree, both variables will have the same value. Differential Revision: https://reviews.llvm.org/D58109 llvm-svn: 353924	2019-02-13 08:34:40 +00:00
Anton Afanasyev	ca9aff9353	[X86][SLP] Enable SLP vectorization for 128-bit horizontal X86 instructions (add, sub) Try to use 64-bit SLP vectorization. In addition to horizontal instrs this change triggers optimizations for partial vector operations (for instance, using low halfs of 128-bit registers xmm0 and xmm1 to multiply <2 x float> by <2 x float>). Fixes llvm.org/PR32433 llvm-svn: 353923	2019-02-13 08:26:43 +00:00
Craig Topper	9b61f48e4b	[X86] Use default expansion for (i64 fp_to_uint f80) when avx512 is enabled on 64-bit targets to match what happens without avx512. In 64-bit mode prior to avx512 we use Expand, but with avx512 we need to make f32/f64 conversions Legal so we use Custom and then do our own expansion for f80. But this seems to produce codegen differences relative to avx2. This patch corrects this. llvm-svn: 353921	2019-02-13 07:42:34 +00:00
Craig Topper	3099e442a6	[X86] Refactor the FP_TO_INTHelper interface. NFCI -Pull the final stack load creation from the two callers into the helper. -Return a single SDValue instead of a std::pair. -Remove the Replace flag which isn't really needed. llvm-svn: 353920	2019-02-13 07:42:31 +00:00
Eugene Leviant	2db1062906	[llvm-objcopy] Add --strip-unneeded-symbol(s) Differential revision: https://reviews.llvm.org/D58027 llvm-svn: 353919	2019-02-13 07:34:54 +00:00
Max Kazantsev	5cf777e413	[LoopSimplifyCFG] Re-enable const branch folding by default Known underlying bugs have been fixed, intensive fuzz testing did not find any new problems. Re-enabling by default. Feel free to revert if it causes any functional failures. llvm-svn: 353911	2019-02-13 06:12:48 +00:00
Fangrui Song	12d5599000	[llvm-readobj] Dump GNU_PROPERTY_X86_FEATURE_2_{NEEDED,USED} notes in .note.gnu.property Summary: And change the output ("X86 features" -> "x86 feature") a bit. Reviewers: grimar, xiangzhangllvm, hjl.tools, rupprecht Reviewed By: rupprecht Subscribers: rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58112 llvm-svn: 353908	2019-02-13 01:51:45 +00:00
Reid Kleckner	afe1e3e669	[MC] Make symbol version errors non-fatal We stil don't have a source location, which is pretty lame, but at least we won't tell the user to file a clang bug report anymore. Fixes PR40712 llvm-svn: 353907	2019-02-13 01:39:32 +00:00
Jonas Devlieghere	9ea90acfeb	[dsymutil] Improve readability of cloneAllCompileUnits (NFC) Add some newlines and improve consistency between the two loops. llvm-svn: 353904	2019-02-13 00:32:21 +00:00
Jonas Devlieghere	1bf1b9857f	[dsymutil] Don't clone empty CUs The DWARF standard says that an empty compile unit is not valid: > Each such contribution consists of a compilation unit header (see > Section 7.5.1.1 on page 200) followed by a single DW_TAG_compile_unit or > DW_TAG_partial_unit debugging information entry, together with its > children. Therefore we shouldn't clone them in dsymutil. Differential revision: https://reviews.llvm.org/D57979 llvm-svn: 353903	2019-02-13 00:32:06 +00:00
Alina Sbirlea	0a8bc14ad7	[MemorySSA & LoopPassManager] Add remaining book keeping [NFCI]. Add plumbing to get MemorySSA in the remaining loop passes. Also update unit test to add the dependency. [EnableMSSALoopDependency remains disabled]. llvm-svn: 353901	2019-02-12 23:48:02 +00:00
Matt Arsenault	4cd9509e1d	AMDGPU: Try to use function specific ST Subtargets are a function level property, so ideally we would eliminate everywhere that needs to check the global one. Rename the function to try avoiding confusion. llvm-svn: 353900	2019-02-12 23:44:13 +00:00
Matt Arsenault	d24296e282	AMDGPU: Ignore CodeObjectV3 when inlining This was inhibiting inlining of library functions when clang was invoking the inliner directly. This is covering a bit of a mess with subtarget feature handling, and this shouldn't be a subtarget feature. The behavior is different depending on whether you are using a -mattr flag in clang, or llc, opt. llvm-svn: 353899	2019-02-12 23:30:11 +00:00
Jonas Paulsson	749dc51e45	[SystemZ] Remember to cast value to void to disable warning. Hopefully fixes buildbot problems. llvm-svn: 353898	2019-02-12 23:13:18 +00:00
Alina Sbirlea	8567ff0c34	[LICM] Cap the clobbering calls in LICM. Summary: Unlimitted number of calls to getClobberingAccess can lead to high compile times in pathological cases. Switching EnableLicmCap flag from bool to int, and enabling to default 100. (tested to be appropriate for current bechmarks) We can revisit this value when enabling MemorySSA. Reviewers: sanjoy, chandlerc, george.burgess.iv Subscribers: jlebar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57968 llvm-svn: 353897	2019-02-12 23:05:40 +00:00
Philip Reames	3908221356	[Tests] A few more live-in deopt lowering tests Nothing super interesting, just making sure obvious cases work. llvm-svn: 353895	2019-02-12 23:00:07 +00:00
Konstantin Zhuravlyov	6220d62e5c	AMDGPU/NFC: Remove SubtargetFeatureISAVersion since it is not used anywhere llvm-svn: 353892	2019-02-12 22:49:49 +00:00
Konstantin Zhuravlyov	acb231c8d8	AMDGPU: Remove duplicate processor (gfx900) llvm-svn: 353889	2019-02-12 22:29:25 +00:00
David Major	5b07e30408	[gn build] Separate debug and optimization settings This patch adds an `is_optimized` variable, orthogonal to `is_debug`, to allow for a gn analogue to `RelWithDebInfo` builds. As part of this we'll want to explicitly enable GC+ICF, for the sake of `is_debug && is_optimized` builds. The flags normally default to true except that if you pass `/DEBUG` they default to false. Differential Revision: https://reviews.llvm.org/D58075 llvm-svn: 353888	2019-02-12 22:24:45 +00:00
Bjorn Pettersson	ecd0960718	[SelectionDAG] Clean up comments in SelectionDAGBuilder.h. NFC Remove redundant function/variable names from doxygen comments (as suggested in https://reviews.llvm.org/D57697). llvm-svn: 353886	2019-02-12 22:11:20 +00:00
Erik Pilkington	4ecd7a90a6	Fix auto-upgrade for the new parameter to llvm.objectsize r352664 added a 'dynamic' parameter to objectsize, but the AutoUpgrade changes were incomplete. Also, fix an off-by-one error I made in the upgrade logic that is now no longer unreachable. Differential revision: https://reviews.llvm.org/D58071 llvm-svn: 353884	2019-02-12 21:55:38 +00:00
Sanjay Patel	cf3a906fb4	[ConstProp] add test for miscompile from bitcast transform; NFC This problem goes with the fix in D51215. llvm-svn: 353883	2019-02-12 21:49:56 +00:00
Jordan Rupprecht	08c3841b21	[llvm-dwp] Use color-formatted error reporting llvm-svn: 353876	2019-02-12 20:37:33 +00:00
Sean Fertile	9850a48275	Fix undefined behaviour in PPCInstPrinter::printBranchOperand. Fix the undefined behaviour introduced by my previous patch r353865 (left shifting a potentially negative value), which was caught by the bots that run UBSan. llvm-svn: 353874	2019-02-12 20:03:04 +00:00
Jordan Rupprecht	706a965295	[llvm-dwp] Avoid writing the output dwp file when there is an error Summary: Use ToolOutputFile to clean up the output file unless dwp actually finishes successfully. Reviewers: dblaikie Reviewed By: dblaikie Subscribers: jdoerfert, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58130 llvm-svn: 353873	2019-02-12 20:00:51 +00:00
Nikita Popov	a3be17ea1c	[AArch64] Expand v8i8 cttz (PR39729) Fix for https://bugs.llvm.org/show_bug.cgi?id=39729. Rather than adding just a case for v8i8 I'm setting cttz to expand for all vector types. Differential Revision: https://reviews.llvm.org/D58008 llvm-svn: 353872	2019-02-12 18:55:53 +00:00
Philip Reames	7403fac3a8	[InlineSpiller] Fix a crash due to lack of forward progress from remat (try 2) This is a recommit of r335091 Add more test cases for deopt-operands via regalloc, and r335077 [InlineSpiller] Fix a crash due to lack of forward progress from remat specifically for STATEPOINT. They were reverted due to a crash. This change includes the text of both original changes, but also includes three aditional pieces: 1) A bug fix for the observed crash. I had failed to record the failed remat value as live which resulted in an instruction being deleted which still had uses. With the machine verifier, this is caught quickly. Without it, we fail in StackSlotColoring due to an empty live interval from LiveStack. 2) A test case which demonstrates the fix for (1). See @test11. 3) A control flag which defaults to disabling this for the moment. Once I've run more extensive validaton, I will switch the default and then remove this flag. llvm-svn: 353871	2019-02-12 18:33:01 +00:00
Jonas Paulsson	34bead750c	[SystemZ] Use VGM whenever possible to load FP immediates. isFPImmLegal() has been extended to recognize certain FP immediates that can be built with VGM (Vector Generate Mask). These scalar FP immediates (that were previously loaded from the constant pool) are now selected as VGMF/VGMG in Select(). Review: Ulrich Weigand https://reviews.llvm.org/D58003 llvm-svn: 353867	2019-02-12 18:06:06 +00:00
Sean Fertile	c069452027	[PowerPC] Fix printing of negative offsets in call instruction dissasembly. llvm-svn: 353865	2019-02-12 17:48:22 +00:00
Jessica Paquette	acbb7ca26c	[GlobalISel][NFC] Gardening: Make translateSimpleUnaryIntrinsic general Instead of only having this code work for unary intrinsics, have it work for an arbitrary number of parameters. Factor out the cases that fall under this (fma, pow). This makes it a bit easier to add more intrinsics which don't require any special work. Differential Revision: https://reviews.llvm.org/D58079 llvm-svn: 353863	2019-02-12 17:38:34 +00:00
Daniel Sanders	dff673bb52	[tablegen] Add locations to many PrintFatalError() calls Summary: While working on the GISel Combiner, I noticed I was producing location-less error messages fairly often and set about fixing this. In the process, I noticed quite a few places elsewhere in TableGen that also neglected to include a relevant location. This patch adds locations to errors that relate to a specific record (or a field within it) and also have easy access to the relevant location. This is particularly useful when multiclasses are involved as many of these errors refer to the full name of a record and it's difficult to guess which substring is grep-able. Unfortunately, tablegen currently only supports Record granularity so it's not currently possible to point at a specific Init so these sometimes point at the record that caused the error rather than the precise origin of the error. Reviewers: bogner, aditya_nandakumar, volkan, aemerson, paquette, nhaehnle Reviewed By: nhaehnle Subscribers: jdoerfert, nhaehnle, asb, rbar, johnrusso, simoncook, apazos, sabuasal, niosHD, jrtc27, zzheng, edward-jones, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58077 llvm-svn: 353862	2019-02-12 17:36:57 +00:00
Jessica Paquette	0e71e73faa	[GlobalISel][AArch64] Select llvm.bswap* for non-vector types This teaches the IRTranslator to emit G_BSWAP when it runs into Intrinsic::bswap. This allows us to select G_BSWAP for non-vector types in AArch64. Add a select-bswap.mir test, and add global isel checks to a couple existing tests in test/CodeGen/AArch64. This doesn't handle every bswap case, since some of these rely on known bits stuff. This just lets us handle the naive case. Differential Revision: https://reviews.llvm.org/D58081 llvm-svn: 353861	2019-02-12 17:28:17 +00:00
Simon Pilgrim	5338f41ced	[X86][AVX] Enable shuffle combining support for zero_extend A more limited version of rL352997 that had to be disabled in rL353198 - allow extension of any 128/256/512 bit vector that at least uses byte sized scalars. llvm-svn: 353860	2019-02-12 17:22:35 +00:00
Sanjay Patel	86fac11d5a	[DAGCombiner] convert logic-of-setcc into bit magic (PR40611) If we're comparing some value for equality against 2 constants and those constants have an absolute difference of just 1 bit, then we can offset and mask off that 1 bit and reduce to a single compare against zero: and/or (setcc X, C0, ne), (setcc X, C1, ne/eq) --> setcc ((add X, -C1), ~(C0 - C1)), 0, ne/eq https://rise4fun.com/Alive/XslKj This transform is disabled by default using a TLI hook ("convertSetCCLogicToBitwiseLogic()"). That should be overridden for AArch64, MIPS, Sparc and possibly others based on the asm shown in: https://bugs.llvm.org/show_bug.cgi?id=40611 llvm-svn: 353859	2019-02-12 17:07:47 +00:00
Sanjay Patel	ab7e26a2de	[x86] add negative tests for setcc folds; NFC llvm-svn: 353855	2019-02-12 16:44:37 +00:00
whitequark	77ccc2eba4	[SelectionDAG] Fix return calling convention in expansion of ?MULO Summary: The SMULO/UMULO DAG nodes, when not directly supported by the target, expand to a multiplication twice as wide. In case that the resulting type is not legal, the legalizer cannot directly call the intrinsic with the wide arguments; instead, it "pre-lowers" them by splitting them in halves. rL283203 made sure that on big endian targets, the legalizer passes the argument halves in the correct order. It did not do the same for the return value halves because the existing code used a hack; it put an illegal type into DAG and hoped that nothing would break and it would be correctly lowered elsewhere. rL307207 fixed this, handling return value halves similar to how argument handles are handled, but did not take big-endian targets into account. This commit fixes the expansion on big-endian targets, such as the out-of-tree OR1K target. Reviewers: eli.friedman, vadimcn Subscribers: george-hopkins, efriedma, llvm-commits Differential Revision: https://reviews.llvm.org/D45355 llvm-svn: 353854	2019-02-12 16:41:50 +00:00
Andrea Di Biagio	d30fff9a90	[MCA] Improved debug prints. NFC llvm-svn: 353852	2019-02-12 16:18:57 +00:00
Simon Pilgrim	015cc0f0fa	[PowerPC] Regenerate test llvm-svn: 353851	2019-02-12 16:10:50 +00:00
Matt Arsenault	a180554020	AMDGPU/GlobalISel: Add more insert/extract testcases llvm-svn: 353848	2019-02-12 15:04:03 +00:00
David Green	c93c6f3274	[Codegen] Make sure kill flags are not incorrect from removed machine phi's We need to clear the kill flags on both SingleValReg and OldReg, to ensure they remain conservatively correct. Differential Revision: https://reviews.llvm.org/D58114 llvm-svn: 353847	2019-02-12 15:02:57 +00:00
Jordan Rupprecht	4b78d4f347	[llvm-dwp] Abort when dwo_id is unset Summary: An empty dwo_id indicates a degenerate .dwo file that should not have been generated in the first place. Instead of discovering this error later when merging with another degenerate .dwo file, print an error immediately when noticing an unset dwo_id, including the filename of the offending file. Test case created by compiling a trivial file w/ `-fno-split-dwarf-inlining -gmlt -gsplit-dwarf -c` prior to r353771 Reviewers: dblaikie Reviewed By: dblaikie Subscribers: jdoerfert, aprantl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58085 llvm-svn: 353846	2019-02-12 15:01:07 +00:00
Matt Arsenault	00ccd13c73	AMDGPU/GlobalISel: Only make f16 constants legal on f16 targets We could deal with it, but there's no real point. llvm-svn: 353845	2019-02-12 14:54:55 +00:00
Matt Arsenault	996c66620e	GlobalISel: Use default rounding mode when extending fconstant I don't think this matters since the values should all be exactly representable. llvm-svn: 353844	2019-02-12 14:54:54 +00:00
Matt Arsenault	1cf713664d	GlobalISel: Move some more legalize cases into functions llvm-svn: 353843	2019-02-12 14:54:52 +00:00
Sam McCall	a860219c5e	[LoopSimplifyCFG] Fix test broken in release mode in r353813 llvm-svn: 353842	2019-02-12 14:43:30 +00:00
Max Kazantsev	4a1c02987e	[NFC] Simplify code & reduce nest slightly llvm-svn: 353832	2019-02-12 11:31:46 +00:00
Jeremy Morse	b33a5c7347	[DebugInfo] Don't salvage load operations (PR40628). Salvaging a redundant load instruction into a debug expression hides a memory read from optimisation passes. Passes that alter memory behaviour (such as LICM promoting memory to a register) aren't aware of these debug memory reads and leave them unaltered, making the debug variable location point somewhere unsafe. Teaching passes to know about these debug memory reads would be challenging and probably incomplete. Finding dbg.value instructions that need to be fixed would likely be computationally expensive too, as more analysis would be required. It's better to not generate debug-memory-reads instead, alas. Changed tests: * DeadStoreElim: test for salvaging of intermediate operations contributing to the dead store, instead of salvaging of the redundant load, * GVN: remove debuginfo behaviour checks completely, this behaviour is still covered by other tests, * InstCombine: don't test for salvaged loads, we're removing that behaviour. Differential Revision: https://reviews.llvm.org/D57962 llvm-svn: 353824	2019-02-12 10:54:30 +00:00
David Stenberg	bbd2f97293	[DebugInfo] Keep parameter DBG_VALUEs before prologue code Summary: This is a preparatory change for removing the code from DebugHandlerBase::beginFunction() which changes the starting label for the first non-overlapping DBG_VALUEs of parameters to the beginning of the function. It does that to be able to show parameters when entering a function. However, that code does not consider what defines the values, which can result in the ranges for the debug values starting before their defining instructions. That code is removed in a follow-up patch. When prologue code is inserted, it leads to DBG_VALUEs that start directly in the entry block being moved down after the prologue instructions. This patch fixes that by stashing away DBG_VALUEs for parameters before emitting the prologue, and then reinserts them at the start of the block. This assumes that there is no target that somehow clobbers parameter registers in the frame setup; there is no such case in the lit tests at least. See PR40188 for more information. Reviewers: aprantl, dblaikie, rnk, jmorse Reviewed By: aprantl Subscribers: bjope, llvm-commits Tags: #debug-info Differential Revision: https://reviews.llvm.org/D57510 llvm-svn: 353823	2019-02-12 10:51:27 +00:00
Max Kazantsev	2a184af221	[IndVars] Fix corner case with unreachable Phi inputs. PR40454 Logic in `getInsertPointForUses` doesn't account for a corner case when `Def` only comes to a Phi user from unreachable blocks. In this case, the incoming value may be arbitrary (and not even available in the input block) and break the loop-related invariants that are asserted below. In fact, if we encounter this situation, no IR modification is needed. This Phi will be simplified away with nearest cleanup. Differential Revision: https://reviews.llvm.org/D58045 Reviewed By: spatel llvm-svn: 353816	2019-02-12 09:59:44 +00:00
Fangrui Song	8e0d5ac715	[llvm-readobj] Only allow 4-byte pr_data Summary: AMD64 psABI says: "The pr_data field of each property contains a 4-byte unsigned integer." Thus we don't need to handle 8-byte pr_data. Reviewers: mike.dvoretsky, grimar, craig.topper, xiangzhangllvm, hjl.tools Reviewed By: grimar Subscribers: rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D58103 llvm-svn: 353815	2019-02-12 09:56:01 +00:00
George Rimar	b1d6f52005	[llvm-readobj] - Simplify .gnu.version_r dumping a bit. Current implementation takes "Number of needed versions" from DT_VERNEEDNUM dynamic tag entry. Though it would be a bit simpler to take it from sh_info section header field directly: https://docs.oracle.com/cd/E19683-01/816-1386/chapter6-94076/index.html Differential revision: https://reviews.llvm.org/D58048 llvm-svn: 353814	2019-02-12 09:50:04 +00:00
Max Kazantsev	bf6af8fbf0	[LoopSimplifyCFG] Change logic of dead loops removal to avoid hitting asserts The function `LI.erase` has some invariants that need to be preserved when it tries to remove a loop which is not the top-level loop. In particular, it requires loop's preheader to be strictly in loop's parent. Our current logic of deletion of dead blocks may erase the information about preheader before we handle the loop, and therefore we may hit this assertion. This patch changes the logic of loop deletion: we make them top-level loops before we actually erase them. This allows us to trigger the simple branch of `erase` logic which just detatches blocks from the loop and does not try to do some complex stuff that need this invariant. Thanks to @uabelho for reporting this! Differential Revision: https://reviews.llvm.org/D57221 Reviewed By: fedor.sergeev llvm-svn: 353813	2019-02-12 09:37:00 +00:00
George Rimar	b87ea73706	[yaml2obj/obj2yaml] - Move `Info` field out from `Section` class. ELFYAML.h contains a `Section` class which is a base for a few other sections classes that are used for mapping different section types. `Section` has a `StringRef Info` field used for storing sh_info. At the same time, sh_info has very different meanings for sections and cannot be processed in a similar way generally, for example ELFDumper does not handle it in `dumpCommonSection` but do that in `dumpGroup` and `dumpCommonRelocationSection` respectively. At this moment, we have and handle it as a string, because that was possible for the current use case. But also it can simply be a number: For SHT_GNU_verdef is "The number of version definitions within the section." The patch moves `Info` field out to be able to have it as a number. With that change, each class will be able to decide what type and purpose of the sh_info field it wants to use. I also had to edit 2 test cases. This is because patch fixes a bug. Previously we accepted yaml files with Info fields for all sections (for example, for SHT_DYNSYM too). But we do not handle it and the resulting objects had zero sh_info fields set for such sections. Now it is accepted only for sections that supports it. Differential revision: https://reviews.llvm.org/D58054 llvm-svn: 353810	2019-02-12 09:08:59 +00:00
Hans Wennborg	be4c0ff00a	LibFuzzer.rst: double backticks llvm-svn: 353809	2019-02-12 09:08:52 +00:00
Max Kazantsev	9aae9da947	Delete blocks from DTU to avoid dangling pointers llvm-svn: 353804	2019-02-12 08:10:29 +00:00
Max Kazantsev	6bf861597c	[LoopSimplifyCFG] Pay respect to LCSSA when removing dead blocks Utility function that we use for blocks deletion always unconditionally removes one-input Phis. In LoopSimplifyCFG, it can lead to breach of LCSSA form. This patch alters this function to keep them if needed. Differential Revision: https://reviews.llvm.org/D57231 Reviewed By: fedor.sergeev llvm-svn: 353803	2019-02-12 07:48:07 +00:00
Max Kazantsev	20b9189975	[NFC] Rename DontDeleteUselessPHIs --> KeepOneInputPHIs llvm-svn: 353801	2019-02-12 07:09:29 +00:00
Philip Reames	b6dc6eb8bb	[Statepoint Lowering] Update misleading comments about chains llvm-svn: 353800	2019-02-12 06:25:58 +00:00
Max Kazantsev	0686d1ae41	[NFC] Add parameter for keeping one-input Phis in DeleteDeadBlock(s) llvm-svn: 353799	2019-02-12 06:14:27 +00:00
Craig Topper	7670ede434	[X86] Collapse FP_TO_INT16_IN_MEM/FP_TO_INT32_IN_MEM/FP_TO_INT64_IN_MEM into a single opcode using memory VT to distinquish. NFC llvm-svn: 353798	2019-02-12 06:14:18 +00:00
Craig Topper	d7303ecd0b	[X86] Remove the value type operand from the floating point load/store MemIntrinsicSDNodes. Use the MemoryVT instead. NFCI We already have the memory VT, we can just match from that during isel. llvm-svn: 353797	2019-02-12 06:14:16 +00:00
Shoaib Meenai	9e624d5410	[build] Remove a stray comment. NFC The CMake change associated with this comment was removed but the comment got left behind. Add a newline instead. llvm-svn: 353793	2019-02-12 02:25:27 +00:00
Petr Hosek	d3ebe7126b	[CMake] Don't override required compiler flags in the runtimes build Ensure that HandleLLVMOptions adds all necessary required flags, including -Wno-error when building with LLVM_ENABLE_WERROR enabled. Differential Revision: https://reviews.llvm.org/D58092 llvm-svn: 353790	2019-02-12 02:11:25 +00:00
Sanjay Patel	093b896dcb	[x86] add tests for logic of setcc (PR40611); NFC llvm-svn: 353789	2019-02-12 01:46:30 +00:00
Sanjay Patel	14fb86310f	[PowerPC] add tests for logic of setcc (PR40611); NFC llvm-svn: 353788	2019-02-12 01:46:26 +00:00
David Blaikie	43d6122f73	Fix r353771 to target linux only (split-dwarf isn't supported on macho) llvm-svn: 353785	2019-02-12 01:19:00 +00:00
Eli Friedman	806136f8ef	[LoopReroll] Fix reroll root legality checking. The code checked that the first root was an appropriate distance from the base value, but skipped checking the other roots. This could lead to rerolling a loop that can't be legally rerolled (at least, not without rewriting the loop in a non-trivial way). Differential Revision: https://reviews.llvm.org/D56812 llvm-svn: 353779	2019-02-12 00:33:25 +00:00
Philip Reames	5292a3b6aa	[Test] Use autogenerated checks for more statepoint tests llvm-svn: 353776	2019-02-12 00:12:46 +00:00
Philip Reames	8663b00ce1	[Tests] Fill out a few tests around gc relocation uniquing llvm-svn: 353773	2019-02-12 00:01:39 +00:00
David Blaikie	104dcb348f	DebugInfo: Split DWARF + gmlt + no-split-dwarf-inlining shouldn't emit anything to the .dwo file This configuration (due to r349207) was intended not to emit any DWO CU, but a degenerate CU was still being emitted - containing a header and a DW_TAG_compile_unit with no attributes. Under that situation, emit nothing to the .dwo file. (since this is a dynamic property of the input the .dwo file is still emitted, just with nothing in it (so a valid, but empty, ELF file) - if some other CU didn't satisfy this criteria, its DWO CU would still go there, etc) llvm-svn: 353771	2019-02-12 00:00:38 +00:00
Philip Reames	6a3862e3c2	[Test] Autogenerate a statepoint test and actual show the reload llvm-svn: 353770	2019-02-11 23:55:24 +00:00
Philip Reames	5906a6591c	Be conservative about unordered accesses for the moment Background: As described in https://reviews.llvm.org/D57601, I'm working towards separating volatile and atomic in the MMO uses for atomic instructions. In https://reviews.llvm.org/D57593, I fixed a bug where isUnordered was returning the wrong result, but didn't account for the fact I was getting slightly ahead of myself. While both uses of isUnordered are correct (as far as I can tell), we don't have tests to demonstrate this and being aggressive gets in the way of having the removal of volatile truly be non-functional. Once D57601 lands, I will return to these call sites, revert this patch, and add the appropriate tests to show the expected behaviour. Differential Revision: https://reviews.llvm.org/D57802 llvm-svn: 353766	2019-02-11 23:34:33 +00:00
Daniel Sanders	6cbc92915a	[tblgen] Add a timer covering the time spent reading the Instruction defs This patch adds a -time-regions option to tablegen that can enable timers (currently only one) that assess the performance of tablegen itself. This can be useful for identifying scaling problems with tablegen backends. This particular timer has allowed me to ignore time that is not attributed the GISel combiner pass. It's useful by itself but it is particularly useful in combination with https://reviews.llvm.org/D52954 which causes this period of time to be annotated within Xcode Instruments which in turn allows profile samples and recorded allocations attributed to reading instructions to be filtered out. llvm-svn: 353763	2019-02-11 23:02:02 +00:00
Matt Arsenault	b2d245771f	GlobalISel: Verify G_EXTRACT llvm-svn: 353759	2019-02-11 22:12:43 +00:00
Evandro Menezes	f4a369596f	[TargetLibraryInfo] Update run time support for Windows It seems that, since VC19, the `float` C99 math functions are supported for all targets, unlike the C89 ones. According to the discussion at https://reviews.llvm.org/D57625. llvm-svn: 353758	2019-02-11 22:12:01 +00:00
Ana Pazos	9a3dc3e60b	[LegalizeTypes] Expand FNEG to bitwise op for IEEE FP types Summary: Except for custom floating point types x86_fp80 and ppc_fp128, expand Y = FNEG(X) to Y = X ^ sign mask to avoid library call. Using bitwise operation can improve code size and performance. Reviewers: efriedma Reviewed By: efriedma Subscribers: efriedma, kpn, arsenm, eli.friedman, javed.absar, rbar, johnrusso, simoncook, sabuasal, niosHD, zzheng, edward-jones, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX, jocewei, asb, llvm-commits Differential Revision: https://reviews.llvm.org/D57875 llvm-svn: 353757	2019-02-11 22:10:08 +00:00
Scott Linder	72a0f4e8db	[IRReader] Expose getLazyIRModule Currently there is no way to lazy-load an in-memory IR module without first writing it to disk. This patch just exposes the existing implementation of getLazyIRModule. This is effectively a revert of rL212364 Differential Revision: https://reviews.llvm.org/D56203 llvm-svn: 353755	2019-02-11 22:01:13 +00:00
Matt Arsenault	18ec382698	GlobalISel: Implement moreElementsVector for implicit_def llvm-svn: 353754	2019-02-11 22:00:39 +00:00
Matt Arsenault	68fc38ce80	GlobalISel: Fix not calling the observer when legalizing G_EXTRACT llvm-svn: 353750	2019-02-11 21:33:54 +00:00
Daniel Sanders	24e0af6906	[globalisel] Correct string emitted by GISelChangeObserver::erasingInstr() The API indicates that the MI is about to be erased rather than it has been erased. llvm-svn: 353746	2019-02-11 20:45:19 +00:00
Craig Topper	75eb0af874	[X86] Correct the memory operand for the FLD emitted in FP_TO_INTHelper for 32-bit SSE targets. We were using DstTy, but that represents the integer type we are converting to which is i64 in this case. The FLD is part of an intermediate step to get from the SSE registers to the x87 registers. If the floating point type is f32, the memory operand should reflect a 4 byte access not an 8 byte access. The store we used to get from SSE to the stack is using the corect size. While there, consistenly use TheVT in place of Op.getOperand(0).getValueType() throughout the function. llvm-svn: 353745	2019-02-11 20:38:10 +00:00
Matt Davis	22c21934ce	[llvm-cxxfilt] Split and demangle stdin input Summary: Originally, llvm-cxxfilt would treat a line as a single mangled item to be demangled. If a mangled name appears in the middle of that string, that name would not be demangled. GNU c++filt splits and demangles every word in a string that is piped to it via stdin. Prior to this patch llvm-cxxfilt would never split strings piped to it. This patch replicates the GNU behavior and splits strings that are piped to it via stdin. This fixes PR39990 Reviewers: compnerd, jhenderson, davide Reviewed By: compnerd, jhenderson Subscribers: erik.pilkington, jhenderson, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57350 llvm-svn: 353743	2019-02-11 20:30:53 +00:00
Daniel Sanders	b31180d0de	[globalisel] Restore comment explaining the nits of GISelChangeObserver::createdInstr() llvm-svn: 353741	2019-02-11 20:05:49 +00:00
Alina Sbirlea	d77edc00a8	[MemorySSA] Remove verifyClobberSanity. Summary: This verification may fail after certain transformations due to BasicAA's fragility. Added a small explanation and a testcase that triggers the assert in checkClobberSanity (before its removal). Addresses PR40509. Reviewers: george.burgess.iv Subscribers: sanjoy, jlebar, llvm-commits, Prazek Tags: #llvm Differential Revision: https://reviews.llvm.org/D57973 llvm-svn: 353739	2019-02-11 19:51:21 +00:00
Michael Kruse	77a614a6e1	Refactor setAlreadyUnrolled() and setAlreadyVectorized(). Loop::setAlreadyUnrolled() and LoopVectorizeHints::setLoopAlreadyUnrolled() both add loop metadata that stops the same loop from being transformed multiple times. This patch merges both implementations. In doing so we fix 3 potential issues: * setLoopAlreadyUnrolled() kept the llvm.loop.vectorize/interleave.* metadata even though it will not be used anymore. This already caused problems such as http://llvm.org/PR40546. Change the behavior to the one of setAlreadyUnrolled which deletes this loop metadata. * setAlreadyUnrolled() used to create a new LoopID by calling MDNode::get with nullptr as the first operand, then replacing it by the returned references using replaceOperandWith. It is possible that MDNode::get would instead return an existing node (due to de-duplication) that then gets modified. To avoid, use a fresh TempMDNode that does not get uniqued with anything else before replacing it with replaceOperandWith. * LoopVectorizeHints::matchesHintMetadataName() only compares the suffix of the attribute to set the new value for. That is, when called with "enable", would erase attributes such as "llvm.loop.unroll.enable", "llvm.loop.vectorize.enable" and "llvm.loop.distribute.enable" instead of the one to replace. Fortunately, function was only called with "isvectorized". Differential Revision: https://reviews.llvm.org/D57566 llvm-svn: 353738	2019-02-11 19:45:44 +00:00
Sanjay Patel	587fd849f0	[InstCombine] Fix matchRotate bug when one operand is a ConstantExpr shift This bug seems to be harmless in release builds, but will cause an error in UBSAN builds or an assertion failure in debug builds. When it gets to this opcode comparison, it assumes both of the operands are BinaryOperators, but the prior m_LogicalShift will also match a ConstantExpr. The cast<BinaryOperator> will assert in a debug build, or reading an invalid value for BinaryOp from memory with ((BinaryOperator*)constantExpr)->getOpcode() will cause an error in a UBSAN build. The test I added will fail without this change in debug/UBSAN builds, but not in release. Patch by: @AndrewScheidecker (Andrew Scheidecker) Differential Revision: https://reviews.llvm.org/D58049 llvm-svn: 353736	2019-02-11 19:26:27 +00:00
Bjorn Pettersson	4892f06e06	[SelectionDAGBuilder] Add restrictions to EmitFuncArgumentDbgValue Summary: This patch fixes PR40587. When a dbg.value instrinsic is emitted to the DAG by using EmitFuncArgumentDbgValue the resulting DBG_VALUE is hoisted to the beginning of the entry block. I think the idea is to be able to locate a formal argument already from the start of the function. However, EmitFuncArgumentDbgValue only checked that the value that was used to describe a variable was originating from a function parameter, not that the variable itself actually was an argument to the function. So when for example assigning a local variable "local" the value from an argument "a", the assocated DBG_VALUE instruction would be hoisted to the beginning of the function, even if the scope for "local" started somewhere else (or if "local" was mapped to other values earlier in the function). This patch adds some logic to EmitFuncArgumentDbgValue to check that the variable being described actually is an argument to the function. And that the dbg.value being lowered already is in the entry block. Otherwise we bail out, and the dbg.value will be handled as an ordinary dbg.value (not as a "FuncArgumentDbgValue"). A tricky situation is when both the variable and the value is related to function arguments, but not neccessarily the same argument. We make sure that we do not describe the same argument more than once as a "FuncArgumentDbgValue". This solution works as long as opt has injected a "first" dbg.value that corresponds to the formal argument at the function entry. Reviewers: jmorse, aprantl Subscribers: jyknight, hiraditya, fedor.sergeev, dstenb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57702 llvm-svn: 353735	2019-02-11 19:23:30 +00:00
Alina Sbirlea	605b21739d	[LICM&MSSA] Limit store hoisting. Summary: If there is no clobbering access for a store inside the loop, that store can only be hoisted if there are no interfearing loads. A more general verification introduced here: there are no loads that are not optimized to an access outside the loop. Addresses PR40586. Reviewers: george.burgess.iv Subscribers: sanjoy, jlebar, Prazek, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57967 llvm-svn: 353734	2019-02-11 19:07:15 +00:00

... 2 3 4 5 6 ...

175390 Commits