llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Arsenault	a6f32f4015	DAG: Factor out helper function for odd vector sizes llvm-svn: 341392	2018-09-04 18:47:43 +00:00
Hiroshi Yamauchi	72ee6d6000	Fix build failures after rL341386. Reviewers: davidxl Reviewed By: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D51647 llvm-svn: 341391	2018-09-04 18:10:54 +00:00
Dan Gohman	045a217bee	[WebAssembly] Fix operand rewriting in inline asm lowering. Use MachineOperand::ChangeToImmediate rather than reassigning MachineOperands to new values created from MachineOperand::CreateImm, so that their parent pointers are preserved. This fixes "Instruction has operand with wrong parent set" errors reported by the MachineVerifier. llvm-svn: 341389	2018-09-04 17:46:12 +00:00
Hiroshi Yamauchi	9775a620b0	[PGO] Control Height Reduction Summary: Control height reduction merges conditional blocks of code and reduces the number of conditional branches in the hot path based on profiles. if (hot_cond1) { // Likely true. do_stg_hot1(); } if (hot_cond2) { // Likely true. do_stg_hot2(); } -> if (hot_cond1 && hot_cond2) { // Hot path. do_stg_hot1(); do_stg_hot2(); } else { // Cold path. if (hot_cond1) { do_stg_hot1(); } if (hot_cond2) { do_stg_hot2(); } } This speeds up some internal benchmarks up to ~30%. Reviewers: davidxl Reviewed By: davidxl Subscribers: xbolva00, dmgreen, mehdi_amini, llvm-commits, mgorny Differential Revision: https://reviews.llvm.org/D50591 llvm-svn: 341386	2018-09-04 17:19:13 +00:00
Scott Linder	cab029f474	[CodeGen] Fix remaining zext() assertions in SelectionDAG Fix remaining cases not committed in https://reviews.llvm.org/D49574 Differential Revision: https://reviews.llvm.org/D50659 llvm-svn: 341380	2018-09-04 16:33:34 +00:00
Francis Visoiu Mistrih	2d3f01c5dc	[MachO] Fix inconsistency between error messages when validating LC_DYSYMTAB llvm-svn: 341379	2018-09-04 16:31:53 +00:00
Francis Visoiu Mistrih	7690af4da9	[MachO] Fix LC_DYSYMTAB validation for external symbols We were validating the same index (ilocalsym) twice, while iextdefsym was never validated. llvm-svn: 341378	2018-09-04 16:31:48 +00:00
Jonas Devlieghere	881452384a	[dwarfdump] Improve -diff option by hiding more data. The -diff option makes it easy to diff dwarf by hiding addresses and offsets. However not all of them were hidden, which should be fixed by this patch. Differential revision: https://reviews.llvm.org/D51593 llvm-svn: 341377	2018-09-04 16:21:37 +00:00
Chandler Carruth	6cb12444cc	Revert r341269: [Constant Hoisting] Hoisting Constant GEP Expressions One of the tests is failing 50% of the time when expensive checks are enabled. Not sure how deep the problem is so just reverting while the author can investigate so that the bots stop repeatedly failing and blaming things incorrectly. Will respond with details on the original commit. llvm-svn: 341365	2018-09-04 13:36:44 +00:00
Sven van Haastregt	9a5cd78e7e	Fix some Wundef warnings in Compiler.h Check for definedness of the __cpp_sized_deallocation and __cpp_aligned_new feature test macros. These will not be defined when the feature is not available, and that prevents any code that includes this header from compiling with -Wundef -Werror. Differential Revision: https://reviews.llvm.org/D51171 llvm-svn: 341364	2018-09-04 12:46:21 +00:00
Chandler Carruth	664aa868f5	[x86/SLH] Add a real Clang flag and LLVM IR attribute for Speculative Load Hardening. Wires up the existing pass to work with a proper IR attribute rather than just a hidden/internal flag. The internal flag continues to work for now, but I'll likely remove it soon. Most of the churn here is adding the IR attribute. I talked about this Kristof Beyls and he seemed at least initially OK with this direction. The idea of using a full attribute here is that we do expect at least some forms of this for other architectures. There isn't anything inherently x86-specific about this technique, just that we only have an implementation for x86 at the moment. While we could potentially expose this as a Clang-level attribute as well, that seems like a good question to defer for the moment as it isn't 100% clear whether that or some other programmer interface (or both?) would be best. We'll defer the programmer interface side of this for now, but at least get to the point where the feature can be enabled without relying on implementation details. This also allows us to do something that was really hard before: we can enable just the indirect call retpolines when using SLH. For x86, we don't have any other way to mitigate indirect calls. Other architectures may take a different approach of course, and none of this is surfaced to user-level flags. Differential Revision: https://reviews.llvm.org/D51157 llvm-svn: 341363	2018-09-04 12:38:00 +00:00
Aaron Ballman	89225ed5bc	Disable -Wnoexcept-type due to false positives with GCC. GCC triggers false positives if a nothrow function is called through a template argument. See https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80985 for details. The LLVM libraries have no stable C++ API, so the warning is not useful. llvm-svn: 341361	2018-09-04 12:03:49 +00:00
Chandler Carruth	163222f569	Revert r341342: Dwarf .debug section compression support (zlib, zlib-gnu). Also reverts follow-up commits r341343 and r341344. The primary commit continues to break some build bots even after the fixes in r341343 for UBSan issues: http://lab.llvm.org:8011/builders/clang-cmake-aarch64-full/builds/5823 It is also failing for me locally (linux, x86-64). llvm-svn: 341360	2018-09-04 11:55:57 +00:00
Chandler Carruth	219888d1b2	[x86/SLH] Teach SLH to harden against the "ret2spec" attack by implementing the proposed mitigation technique described in the original design document. The idea is to check after calls that the return address used to arrive at that location is in fact the correct address. In the event of a mis-predicted return which reaches a valid return but not the correct return, this will detect the mismatch much like it would a mispredicted conditional branch. This is the last published attack vector that I am aware of in the Spectre v1 space which is not mitigated by SLH+retpolines. However, don't read too much into that: this is an area of ongoing research where we expect more issues to be discovered in the future, and it also makes no attempt to mitigate Spectre v4. Still, this is an important completeness bar for SLH. The change here is of course delightfully simple. It was predicated on cutting support for post-instruction symbols into LLVM which was not at all simple. Many thanks to Hal Finkel, Reid Kleckner, and Justin Bogner who helped me figure out how to do a bunch of the complex changes involved there. Differential Revision: https://reviews.llvm.org/D50837 llvm-svn: 341358	2018-09-04 10:59:10 +00:00
Kristina Brooks	51ae9346db	Do not leak the Mach host port in sys::getHostCPUName() Patch by rsesek (Robert Sesek) llvm-svn: 341357	2018-09-04 10:54:09 +00:00
Chandler Carruth	8d8489f513	[x86/SLH] Teach SLH to harden indirect branches and switches without retpolines. This implements the core design of tracing the intended target into the target, checking it, and using that to update the predicate state. It takes advantage of a few interesting aspects of SLH to make it a bit easier to implement: - We already split critical edges with conditional branches, so we can assume those are gone. - We already unfolded any memory access in the indirect branch instruction itself. I've left hard errors in place to catch if any of these somewhat subtle invariants get violated. There is some code that I can factor out and share with D50837 when it lands, but I didn't want to couple landing the two patches, so I'll do that in a follow-up cleanup commit if alright. Factoring out the code to handle different scenarios of materializing an address remains frustratingly hard. In a bunch of cases you want to fold one of the cases into an immediate operand of some other instruction, and you also have both symbols and basic blocks being used which require different methods on the MI builder (and different operand kinds). Still, I'll take a stab at sharing at least some of this code in a follow-up if I can figure out how. Differential Revision: https://reviews.llvm.org/D51083 llvm-svn: 341356	2018-09-04 10:44:21 +00:00
Nicola Zaghen	9588ad9611	[InstCombine] Fold icmp ugt/ult (add nuw X, C2), C --> icmp ugt/ult X, (C - C2) Support for sgt/slt was added in rL294898, this adds the same cases also for unsigned compares. This is the Alive proof: https://rise4fun.com/Alive/nyY Differential Revision: https://reviews.llvm.org/D50972 llvm-svn: 341353	2018-09-04 10:29:48 +00:00
Fedor Sergeev	961811f3e1	[NFC] correcting patterns in time-passes test to fix buildbot llvm-svn: 341348	2018-09-04 08:21:37 +00:00
Max Kazantsev	f34115c627	[NFC] Add assert to detect LCSSA breaches early llvm-svn: 341347	2018-09-04 06:34:40 +00:00
Fedor Sergeev	f2d4372e0e	[PassTiming] reporting time-passes separately for multiple pass instances of the same pass Summary: Refactoring done by rL340872 accidentally appeared to be non-NFC, changing the way how multiple instances of the same pass are handled - aggregation of results by PassName forced data for multiple instances to be merged together and reported as one line. Getting back to creating/reporting timers per pass instance. Reporting was a bit enhanced by counting pass instances and adding #<num> suffix to the pass description. Note that it is instances that are being counted, not invocations of them. time-passes test updated to account for multiple passes being run. Reviewers: paquette, jhenderson, MatzeB, skatkov Reviewed By: skatkov Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D51535 llvm-svn: 341346	2018-09-04 06:12:28 +00:00
Max Kazantsev	2cbba56337	[IndVars] Fix usage of SCEVExpander to not mess with SCEVConstant. PR38674 This patch removes the function `expandSCEVIfNeeded` which behaves not as it was intended. This function tries to make a lookup for exact existing expansion and only goes to normal expansion via `expandCodeFor` if this lookup hasn't found anything. As a result of this, if some instruction above the loop has a `SCEVConstant` SCEV, this logic will return this instruction when asked for this `SCEVConstant` rather than return a constant value. This is both non-profitable and in some cases leads to breach of LCSSA form (as in PR38674). Whether or not it is possible to break LCSSA with this algorithm and with some non-constant SCEVs is still in question, this is still being investigated. I wasn't able to construct such a test so far, so maybe this situation is impossible. If it is, it will go as a separate fix. Rather than do it, it is always correct to just invoke `expandCodeFor` unconditionally: it behaves smarter about insertion points, and as side effect of this it will choose a constant value for SCEVConstants. For other SCEVs it may end up finding a better insertion point. So it should not be worse in any case. NOTE: So far the only known case for which this transform may break LCSSA is mapping of SCEVConstant to an instruction. However there is a suspicion that the entire algorithm can compromise LCSSA form for other cases as well (yet not proved). Differential Revision: https://reviews.llvm.org/D51286 Reviewed By: etherzhhb llvm-svn: 341345	2018-09-04 05:01:35 +00:00
Puyan Lotfi	bd203e03f8	[NFC][llvm-objcopy] clang-formating Object.cpp llvm-svn: 341344	2018-09-04 01:58:32 +00:00
Puyan Lotfi	a7a5816b96	[NFC][llvm-objcopy] Fixing a ubi-san problem with unaligned memory writes. llvm-svn: 341343	2018-09-04 01:57:30 +00:00
Puyan Lotfi	5a40cd5b50	[llvm-objcopy] Dwarf .debug section compression support (zlib, zlib-gnu). Usage: llvm-objcopy --compress-debug-sections=zlib foo.o llvm-objcopy --compress-debug-sections=zlib-gnu foo.o In both cases the debug section contents is compressed with zlib. In the GNU style case the header is the "ZLIB" magic string followed by the uint64 big- endian decompressed size. In the non-GNU mode the header is the Elf(32\|64)_Chdr. Decompression support is coming soon. Differential Revision: https://reviews.llvm.org/D49678 llvm-svn: 341342	2018-09-03 22:25:56 +00:00
Sanjay Patel	0945959869	[AArch64][x86] add tests for pow(x, 0.25); NFC Folds for this were proposed in D49306, but we decided the transform is better suited for the backend. llvm-svn: 341341	2018-09-03 22:11:47 +00:00
Simon Atanasyan	4d13cb0a8a	[mips] Disable the selection of mixed microMIPS/MIPS code This patch modifies hasStandardEncoding() / inMicroMipsMode() / inMips16Mode() methods of the MipsSubtarget class so only one can be true at any one time. That prevents the selection of microMIPS and MIPS instructions and patterns that are defined in TableGen files at the same time. A few new patterns and instruction definitions hae been added to keep test cases passed. Differential revision: https://reviews.llvm.org/D51483 llvm-svn: 341338	2018-09-03 20:48:55 +00:00
Sanjay Patel	2fe1f62c88	[InstCombine] simplify xor/not folds; NFCI llvm-svn: 341336	2018-09-03 18:40:56 +00:00
Sanjay Patel	d75064e6d5	[InstCombine] allow add+not --> sub for arbitrary vector constants. llvm-svn: 341335	2018-09-03 18:21:59 +00:00
Brian Gesiak	dfe9957418	Revert r341329 due to MSAN error Pushing https://reviews.llvm.org/rL341329 revealed an MSAN error. Revert it so that we can fix the error. llvm-svn: 341333	2018-09-03 18:13:46 +00:00
Sanjay Patel	faa02b1abb	[InstCombine] consolidate tests for ~(X+C); NFC llvm-svn: 341332	2018-09-03 18:04:21 +00:00
Sid Manning	220f288720	Revert [Hexagon] Add support for getRegisterByName. Support required to build the Hexagon Linux kernel. llvm-svn: 341331	2018-09-03 17:59:10 +00:00
Florian Hahn	cc9dc599ba	[SLC] Support expanding pow(x, n+0.5) to x * x * ... * sqrt(x) Reviewers: evandro, efriedma, spatel Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D51435 llvm-svn: 341330	2018-09-03 17:37:39 +00:00
Brian Gesiak	f534485387	Re-push "[Option] Fix PR37006 prefix choice in findNearest" Summary: Original changeset (https://reviews.llvm.org/D46776) by @modocache. It was reverted after the PS4 bot failed. The issue has been determined to be with the way the PS4 SDK handles this particular option. https://reviews.llvm.org/D50410 removes this test, so we can push this again. Patch by Arnaud Coomans! Reviewers: cfe-commits, modocache Reviewed By: modocache Differential Revision: https://reviews.llvm.org/D50515 llvm-svn: 341329	2018-09-03 17:30:57 +00:00
Andrea Di Biagio	fb3d9e1449	[X86] Remove wrong ReadAdvance from multiclass sse_fp_unop_s. A ReadAdvance was incorrectly added to the SchedReadWrite list associated with the following SSE instructions: sqrtss sqrtsd rsqrtss rcpss As a consequence, a wrong operand latency was computed for the register operand used as the base address of the folded load operand. This patch removes the wrong ReadAdvance, and updates the llvm-mca test cases. There is still a problem with correctly modeling partial register writes on XMM registers This other problem is currently tracked here: https://bugs.llvm.org/show_bug.cgi?id=38813 Differential Revision: https://reviews.llvm.org/D51542 llvm-svn: 341326	2018-09-03 16:47:34 +00:00
Argyrios Kyrtzidis	c30340b207	Add header guards to some headers that are missing them Also adjust some of dsymutil's headers to put the header guards at the top, otherwise the compiler will not recognize them as header guards. llvm-svn: 341323	2018-09-03 16:22:05 +00:00
Matt Arsenault	ca25b58957	DAG: Handle extract_vector_elt in isKnownNeverNaN llvm-svn: 341317	2018-09-03 14:01:03 +00:00
Nico Weber	8267b333ee	Rename a few unittests/.../Foo.cpp files to FooTest.cpp The convention for unit test sources is that they're called FooTest.cpp. No behavior change. https://reviews.llvm.org/D51579 llvm-svn: 341313	2018-09-03 12:43:26 +00:00
Jonas Devlieghere	6e5c7e6037	[DebugInfo] Have the verifier accept missing linkage names. According to the standard, for the .debug_names (the "dwarf accelerator tables"): > If a subprogram or inlined subroutine is included, and has a > DW_AT_linkage_name attribute, there will be an additional index entry > for the linkage name. For Swift we generate DW_structure_types with a linkage name and the verifier was incorrectly rejecting this. This patch fixes that by only considering the linkage name in those particular cases. The test is the "reduced" debug info of the failing swift test on swift.org. Differential revision: https://reviews.llvm.org/D51420 llvm-svn: 341311	2018-09-03 12:12:17 +00:00
Martin Storsjo	5c984fb16d	[AArch64] Simplify code in LowerGlobalAddress. NFCI. When initial support for dllimport was added for aarch64 in SVN r316555, ClassifyGlobalReference didn't set the MO_DLLIMPORT flag - that was only completed in SVN r323810. Reuse the return value from ClassifyGlobalReference for this purpose as well. llvm-svn: 341310	2018-09-03 11:59:23 +00:00
Daniel Cederman	e9e38c207e	[Sparc] allow tls_add/tls_call syntax in assembler parser Summary: Removing unneeded isCodeGenOnly from tls-specific instructions - TLS_ADD/TLS_LD/TLS_LDX/TLS_CALL. Author: fedor.sergeev Reviewers: jyknight, fedor.sergeev Reviewed By: jyknight Subscribers: dcederman, brad, llvm-commits Differential Revision: https://reviews.llvm.org/D36463 llvm-svn: 341308	2018-09-03 10:38:12 +00:00
Sander de Smalen	0c78da5132	Fix issue introduced by r341301 that broke buildbot. A condition in isSpillInstruction() updates a small vector rather than the 'FI' by-ref parameter, which was used in a subsequent call to 'isSpillSlotObjectIndex()'. This patch fixes the condition to check the FIs in the vector instead. llvm-svn: 341305	2018-09-03 10:23:34 +00:00
Simon Pilgrim	2e35c1e399	Remove unnecessary semicolon to silence -Wpedantic warning. NFCI. llvm-svn: 341303	2018-09-03 10:17:25 +00:00
Carlos Alberto Enciso	eaf2c1f449	Test commit. Revert change done in r341297. NFC. Differential Revision: https://reviews.llvm.org/D51583 llvm-svn: 341302	2018-09-03 09:41:43 +00:00
Sander de Smalen	6cab60fa06	Extend hasStoreToStackSlot with list of FI accesses. For instructions that spill/fill to and from multiple frame-indices in a single instruction, hasStoreToStackSlot and hasLoadFromStackSlot should return an array of accesses, rather than just the first encounter of such an access. This better describes FI accesses for AArch64 (paired) LDP/STP instructions. Reviewers: t.p.northover, gberry, thegameg, rengolin, javed.absar, MatzeB Reviewed By: MatzeB Differential Revision: https://reviews.llvm.org/D51537 llvm-svn: 341301	2018-09-03 09:15:58 +00:00
Carlos Alberto Enciso	f03e049234	Test commit - adding a new line. llvm-svn: 341297	2018-09-03 08:26:37 +00:00
Kristina Brooks	12aaf964f8	[MC] - ConstantPools.cpp: Style consistency, remove redundant braces. NFC. Remove braces around two, single statement "if" blocks in line with rest of the file and the general LLVM code style. NFC, testing commit access. llvm-svn: 341294	2018-09-03 03:48:39 +00:00
QingShan Zhang	c2b6c547dc	[PowerPC] Add Itineraries of IIC_IntRotateDI for P7/P8 When doing some instruction scheduling work, we noticed some missing itineraries. Before we switch to machine scheduler, those missing itineraries might not have impact to actually scheduling, because we can still get same latency due to default values. With machine scheduler, however, itineraries will have impact to scheduling. eg: NumMicroOps will default to be 0 if there is NO itineraries for specific instruction class. And most of the instruction class with itineraries will have NumMicroOps default to 1. This will has impact on the count of RetiredMOps, affects the Pending/Available Queue, then causing different scheduling or suboptimal scheduling further. Patch by jsji (Jinsong Ji) Differential Revision: https://reviews.llvm.org/D51506 llvm-svn: 341293	2018-09-03 03:14:29 +00:00
Sanjay Patel	17e709b66a	[InstCombine] allow not+sub fold for arbitrary vector constants The fold was implemented for the general case but use-limitation, but the later constant version which didn't check uses was only matching splat constants. llvm-svn: 341292	2018-09-02 19:31:45 +00:00
Sanjay Patel	04ab22b3f4	[InstCombine] move/add tests for not+sub; NFC llvm-svn: 341291	2018-09-02 19:18:13 +00:00
Hsiangkai Wang	e0dcc28a4d	Revert "[DebugInfo] Fix bug in LiveDebugVariables." This reverts commit 8f548ff2a1819e1bc051e8218584f1a3d2cf178a. buildbot failure in LLVM on clang-ppc64be-linux http://lab.llvm.org:8011/builders/clang-ppc64le-linux/builds/19765 llvm-svn: 341290	2018-09-02 16:35:42 +00:00
Hsiangkai Wang	1368434b49	[DebugInfo] Fix bug in LiveDebugVariables. In lib/CodeGen/LiveDebugVariables.cpp, it uses std::prev(MBBI) to get DebugValue's SlotIndex. However, the previous instruction may be also a debug instruction. It could not use a debug instruction to query SlotIndex in mi2iMap. Scan all debug instructions and use the first debug instruction to query SlotIndex for following debug instructions. Only handle DBG_VALUE in handleDebugValue(). Differential Revision: https://reviews.llvm.org/D50621 llvm-svn: 341289	2018-09-02 15:57:22 +00:00
Sanjay Patel	ca36eb4e33	[Reassociate] swap binop operands to increase factoring potential If we have a pair of binops feeding another pair of binops, rearrange the operands so the matching pair are together because that allows easy factorization folds to happen in instcombine: ((X << S) & Y) & (Z << S) --> ((X << S) & (Z << S)) & Y (reassociation) --> ((X & Z) << S) & Y (factorize shift from 'and' ops optimization) This is part of solving PR37098: https://bugs.llvm.org/show_bug.cgi?id=37098 Note that there's an instcombine version of this patch attached there, but we're trying to make instcombine have less responsibility to improve compile-time efficiency. For reasons I still don't completely understand, reassociate does this kind of transform sometimes, but misses everything in my motivating cases. This patch on its own is gluing an independent cleanup chunk to the end of the existing RewriteExprTree() loop. We can build on it and do something stronger to better order the full expression tree like D40049. That might be an alternative to the proposal to add a separate reassociation pass like D41574. Differential Revision: https://reviews.llvm.org/D45842 llvm-svn: 341288	2018-09-02 14:22:54 +00:00
Roman Lebedev	d7a6244475	[DAGCombine] optimizeSetCCOfSignedTruncationCheck(): handle inverted pattern Summary: A follow-up for D49266 / rL337166 + D49497 / rL338044. This is still the same pattern to check for the [lack of] signed truncation, but in this case the constants and the predicate are negated. https://rise4fun.com/Alive/BDV https://rise4fun.com/Alive/n7Z Reviewers: spatel, craig.topper, RKSimon, javed.absar, efriedma, dmgreen Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D51532 llvm-svn: 341287	2018-09-02 13:56:22 +00:00
Lang Hames	b993cf50d6	clang-format r341282. llvm-svn: 341283	2018-09-02 01:29:29 +00:00
Lang Hames	6a2a889b8a	[ORC] Tidy up JITSymbolFlags to remove the need for some explicit static_casts. Removes the implicit conversion to the underlying type for JITSymbolFlags::FlagNames and replaces it with some bitwise and comparison operators. llvm-svn: 341282	2018-09-02 01:28:26 +00:00
Matt Davis	e0d03e9665	[llvm-mca] Fix typo in debug output. NFC. llvm-svn: 341281	2018-09-01 18:32:33 +00:00
Sanjay Patel	099b1a4b0c	[InstCombine] simplify code for 'or' fold This is no-outwardly-visible-change intended, so no test. But the code is smaller and more efficient. The check for a 'not' op is intended to avoid the expensive value tracking call when it should not be necessary, and it might prevent infinite looping when we resurrect: rL300977 llvm-svn: 341280	2018-09-01 15:08:59 +00:00
Dylan McKay	454258671d	[AVR] Redefine the 'LSL' instruction as an alias of 'ADD' The 'LSL Rd' instruction is equivalent to 'ADD Rd, Rd'. llvm-svn: 341278	2018-09-01 12:23:00 +00:00
Dylan McKay	97daa142f4	[AVR] Redefine the 'SBR' instruction as an alias This fixes a TableGen warning about duplicate bit patterns. SBR === This is an alias of 'ORI Rd, K'. llvm-svn: 341277	2018-09-01 12:22:54 +00:00
Dylan McKay	d118024387	[AVR] Define the TST instruction as an alias of AND The 'tst Rd' instruction is equivalent to 'and Rd, Rd'. llvm-svn: 341276	2018-09-01 12:22:50 +00:00
Dylan McKay	8b0f9d2e58	[AVR] Define the ROL instruction as an alias of ADC The 'rol Rd' instruction is equivalent to 'adc Rd'. This caused compile warnings from tablegen because of conflicting bits shared between each instruction. llvm-svn: 341275	2018-09-01 12:22:07 +00:00
Tom Stellard	ffc6bd6f3d	AMDGPU/GlobalISel: Define instruction mapping for G_SELECT Reviewers: arsenm Reviewed By: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, rovka, kristof.beyls, dstuttard, tpr, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D49737 llvm-svn: 341271	2018-09-01 02:41:19 +00:00
Sanjin Sijaric	61ddb7df82	Make HasWinCFI a plain bool instead of Optional<bool> Summary: Reid suggested making HasWinCFI a plain bool defaulting to false in D50288. It's needed in order to add HasWinCFI to MIRPrinter. Otherwise, we'll get the assertion: HasWinCFI.hasValue() && "HasWinCFI not set yet!"' Also, a few ARM64 Windows test cases will fail with the same assert if the ARM64 MCLayer part of EH work (D50166) goes in before the frame lowering part that sets HasWinCFI (D50288 as of now). Reviewers: rnk, mstorsjo, hans, javed.absar Reviewed By: rnk Subscribers: kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D51560 llvm-svn: 341270	2018-09-01 00:33:43 +00:00
Zhaoshi Zheng	f5297fb24b	[Constant Hoisting] Hoisting Constant GEP Expressions Leverage existing logic in constant hoisting pass to transform constant GEP expressions sharing the same base global variable. Multi-dimensional GEPs are rewritten into single-dimensional GEPs. Differential Revision: https://reviews.llvm.org/D51396 llvm-svn: 341269	2018-09-01 00:04:56 +00:00
Jessica Paquette	a69696dca6	Fix typo in size remarks for module passes ModuleCount = InstrCount was incorrect. It should have been InstrCount = ModuleCount. This was making it emit an extra, incorrect remark for Print Module IR. The test didn't catch this, because it didn't ensure that the only remark output was from the desired pass. So, it was possible to have an extra remark come through and not fail. Updated the test so that we ensure that the last remark that's output comes from the desired pass. This is done by ensuring that whatever is being read after the last remark is YAML output rather than some incorrect garbage. llvm-svn: 341267	2018-08-31 22:43:41 +00:00
Stanislav Mekhanoshin	44451b3344	[AMDGPU] Split v32i32 loads Differential Revision: https://reviews.llvm.org/D51555 llvm-svn: 341266	2018-08-31 22:43:36 +00:00
Krzysztof Parzyszek	4cef462922	[Hexagon] Don't access non-existent instructions llvm-svn: 341264	2018-08-31 22:10:04 +00:00
Matthias Braun	4f340e975e	Revamp test-suite documentation - Remove duplication: Both TestingGuide and TestSuiteMakefileGuide would give a similar overview over the test-suite. - Present cmake/lit as the default/normal way of running the test-suite: - Move information about the cmake/lit testsuite into the new TestSuiteGuide.rst file. Mark the remaining information in TestSuiteMakefilesGuide.rst as deprecated. - General simplification and shorting of language. - Remove paragraphs about tests known to fail as everything should pass nowadays. - Remove paragraph about zlib requirement; it's not required anymore since we copied a zlib source snapshot into the test-suite. - Remove paragraph about comparison with "native compiler". Correctness is always checked against reference outputs nowadays. - Change cmake/lit quickstart section to recommend `pip` for installing lit and use `CMAKE_C_COMPILER` and a cache file in the example as that is what most people will end up doing anyway. Also a section about compare.py to quickstart. - Document `Bitcode` and `MicroBenchmarks` directories. - Add section with commonly used cmake configuration options. - Add section about showing and comparing result files via compare.py. - Add section about using external benchmark suites. - Add section about using custom benchmark suites. - Add section about profile guided optimization. - Add section about cross-compilation and running on external devices. Differential Revision: https://reviews.llvm.org/D51465 llvm-svn: 341260	2018-08-31 21:47:01 +00:00
Craig Topper	caf6672779	[X86] Add intrinsics for KTEST instructions. These intrinsics use the same implementation as PTEST intrinsics, but use vXi1 vectors. New clang builtins will be accompanying them shortly. llvm-svn: 341259	2018-08-31 21:31:53 +00:00
Jessica Paquette	71e9778006	[NFC] Optionally pass a function to emitInstrCountChangedRemark In basic block, loop, and function passes, we already have a function that we can use to emit optimization remarks. We can use that instead of searching the module for the first suitable function (that is, one that contains at least one basic block.) llvm-svn: 341253	2018-08-31 20:54:37 +00:00
Jessica Paquette	397c05dd7d	[NFC] Check if P is a pass manager on entry to emitInstrCountChangedRemark There's no point in finding a function to use for remark output when we're not going to emit anything. llvm-svn: 341252	2018-08-31 20:51:54 +00:00
Jessica Paquette	9a23c55920	[NFC] Pass the instruction delta to emitInstrCountChangedRemark Instead of counting the size of the entire module every time we run a pass, pass along a delta instead and use that to emit the remark. This means we only have to use (on average) smaller IR units to calculate instruction counts. E.g, in a BB pass, we only need to look at the delta of the BB instead of the delta of the entire module. 6/6 (This improved compile time for size remarks on sqlite3 + O2 significantly) llvm-svn: 341250	2018-08-31 20:20:57 +00:00
Jessica Paquette	1fc443b887	[NFC] Pre-calculate SCC IR counts in size remarks. Same vein as the previous commits. Pre-calculate the size of the module and use that to decide if we're going to emit a remark. This one comes with a FIXME and TODO. First off, CallGraphSCC and CallGraphNode don't have a getInstructionCount function. So, for now, we do the same thing as in a module pass. Second off, we're not really saving anything here yet, because as before, I need to change emitInstrCountChangedRemark to take in a delta. Keeping the patches small though, so that's coming up next. 5/6 llvm-svn: 341249	2018-08-31 20:20:56 +00:00
Jessica Paquette	454d1032e9	[NFC] Pre-calculate module IR counts in size remarks. Same as the previous NFC commits in the same vein. This one introduces a TODO. I'm going to change emitInstrCountChangedRemark so that it takes in a delta. Since the delta isn't necessary yet, it's not there. For now, this means that we're calculating the size of the module twice. Just done separately to keep the patches small. 4/6 llvm-svn: 341248	2018-08-31 20:20:55 +00:00
Jessica Paquette	872a4c92b2	[NFC] Pre-calculate loop IR counts in size remarks. Another commit reducing compile time in size remarks. Cache the size of the module and loop, and update values based off of deltas instead. Avoid recalculating the size of the whole module whenever possible. 3/6 llvm-svn: 341247	2018-08-31 20:20:54 +00:00
Jessica Paquette	9eda13e976	[NFC] Pre-calculate basic block IR counts in size remarks. Size remarks are slow due to lots of recalculation of the module. This is similar to the previous commit. Cache the size of the module and update counts in basic block passes based off a less-expensive delta. 2/6 llvm-svn: 341246	2018-08-31 20:20:53 +00:00
Jessica Paquette	f2a202ce7a	[NFC] Pre-calculate function IR counts in size remarks. Size remarks are slow due to lots of recalculation of the module. Pre-calculate the module size and initial function size for a remark. Use deltas calculated using the less-expensive function IR count to update the module counts for Function passes. 1/6 llvm-svn: 341245	2018-08-31 20:19:41 +00:00
Tom Stellard	04cbe721da	lit: Use sys.executable for executing builtin commands Summary: The python executable may not exist on all systems so use sys.executable instead. Reviewers: ddunbar, stella.stamenova Subscribers: delcypher, llvm-commits Differential Revision: https://reviews.llvm.org/D51511 llvm-svn: 341244	2018-08-31 20:15:31 +00:00
Dean Michael Berris	f135ac4bbc	[XRay] Update RecordInitializer for PIDRecord Since we changed the storage for the PID in PIDRecord instances, we need to also update the way we load the data from a DataExtractor through the RecordInitializer. llvm-svn: 341243	2018-08-31 20:02:55 +00:00
Dean Michael Berris	4cae04873b	[XRay] Use correct type for PID records Previously we've been reading and writing the wrong types which only worked in little endian implementations. This time we're writing the same typed values the runtime is using, and reading them appropriately as well. llvm-svn: 341241	2018-08-31 19:32:46 +00:00
Tim Northover	cc8f593d29	Tests: fix tests encoding specific hash values for 32-bit systems. I changed the seed slightly, but forgot to run the tests on a 32-bit system, so tests which hard-code a specific hash value started breaking. llvm-svn: 341240	2018-08-31 19:24:37 +00:00
Dean Michael Berris	250c56d127	[XRay] Use correct type for thread ID parsing Previously we were reading only a uint16_t when we really needed to read an int32_t from the log. llvm-svn: 341239	2018-08-31 19:11:19 +00:00
Sid Manning	b1c9813042	[Hexagon] Add support for getRegisterByName. Support required to build the Hexagon Linux kernel. Differential Revision: https://reviews.llvm.org/D51363 llvm-svn: 341238	2018-08-31 19:08:23 +00:00
Dean Michael Berris	7975e274da	[XRay] Improve test matching granularity (NFC) Simplify matchers for unittest to better isolate which differences there are that we're finding in failures. llvm-svn: 341237	2018-08-31 18:56:42 +00:00
Dean Michael Berris	3fc4cbfe10	[XRay] Change function record reader to be endian-aware This change allows us to let the compiler do the right thing for when handling big-endian and little-endian records for FDR mode function records. Previously, we assumed that the encoding was little-endian that reading the first byte to look for the function id and function record types was ordered in a little-endian manner. This change allows us to better handle function records where the first four bytes may actually be encoded in big-endian thus giving us the wrong bytes where we're seeking the function information from. This is a follow-up to D51210 and D51289. llvm-svn: 341236	2018-08-31 18:36:58 +00:00
Dean Michael Berris	c1dceee50b	[XRay] Fix FunctionRecord serialization This change makes the writer implementation more consistent with the way fields are written down to avoid assumptions on bitfield order and padding. We also fix an inconsistency between the type returned by the `delta()` accessor to match the data member it's returning. This is a follow-up to D51289 and D51210. llvm-svn: 341230	2018-08-31 17:49:59 +00:00
Alexandre Ganea	6a7efef4af	[DebugInfo] Common behavior for error types Following D50807, and heading towards D50664, this intermediary change does the following: 1. Upgrade all custom Error types in llvm/trunk/lib/DebugInfo/ to use the new StringError behavior (D50807). 2. Implement std::is_error_code_enum and make_error_code() for DebugInfo error enumerations. 3. Rename GenericError -> PDBError (the file will be renamed in a subsequent commit) 4. Update custom error messages to follow the same formatting: (\w\s*)+\. 5. Keep generic "file not found" (ENOENT) errors as they are in PDB code. Previously, there used to be a custom enumeration for that purpose. 6. Remove a few extraneous LF in log() implementations. Printing LF is a responsability at a higher level, not at the error level. Differential Revision: https://reviews.llvm.org/D51499 llvm-svn: 341228	2018-08-31 17:41:58 +00:00
Craig Topper	b7bb9f0078	[X86] Add support for turning vXi1 shuffles into KSHIFTL/KSHIFTR. This patch recognizes shuffles that shift elements and fill with zeros. I've copied and modified the shift matching code we use for normal vector registers to do this. I'm not sure if there's a good way to share more of this code without making the existing function more complex than it already is. This will be used to enable kshift intrinsics in clang. Differential Revision: https://reviews.llvm.org/D51401 llvm-svn: 341227	2018-08-31 17:17:21 +00:00
Dean Michael Berris	5b7548c653	[XRay] Make Trace loading endian-aware This change makes the XRay Trace loading functions first use a little-endian data extractor, then on failures try a big-endian data extractor. Without this change, the trace loading facility will not work with data written from a big-endian machine. Follow-up to D51210 and D51289. llvm-svn: 341226	2018-08-31 17:06:28 +00:00
Dean Michael Berris	98717978c9	[XRay] Make the FDRTraceWriter Endian-aware Before this patch, the FDRTraceWriter would not take endianness into account when writing data into the output stream. This is a follow-up to D51289 and D51210. llvm-svn: 341223	2018-08-31 16:08:38 +00:00
Andrea Di Biagio	a59ec4efa0	[X86][BtVer2] Remove wrong ReadAdvance from AVX vbroadcast(ss\|sd\|f128) instructions. The presence of a ReadAdvance for input operand #0 is problematic because it changes the input latency of the register used as the base address for the folded load. A broadcast cannot start executing if the load address hasn't been computed yet. In the llvm-mca example, the VBROADCASTSS is dependent on the address generated by the LEAQ. That means, it cannot start until LEAQ reaches the write-back stage. If we apply ReadAdvance, then we wrongly assume that the load can start 3 cycles in advance. Differential Revision: https://reviews.llvm.org/D51534 llvm-svn: 341222	2018-08-31 16:05:48 +00:00
Simon Atanasyan	3785e84cf2	[mips] Fix `mtc1` and `mfc1` definitions for microMIPS R6 The `mtc1` and `mfc1` definitions in the MipsInstrFPU.td have MMRel, but do not have StdMMR6Rel tags. When these instructions are emitted for microMIPS R6 targets, `Mips::MipsR62MicroMipsR6` nor `Mips::Std2MicroMipsR6` cannot find correct op-codes and as a result the backend uses mips32 variant of the instructions encoding. The patch fixes this problem by adding the StdMMR6Rel tag and check instructions encoding in the test case. Differential revision: https://reviews.llvm.org/D51482 llvm-svn: 341221	2018-08-31 15:57:17 +00:00
Matt Arsenault	bf07a50a98	AMDGPU: Restrict extract_vector_elt combine to loads The intention is to enable the extract_vector_elt load combine, and doing this for other operations interferes with more useful optimizations on vectors. Handle any type of load since in principle we should do the same combine for the various load intrinsics. llvm-svn: 341219	2018-08-31 15:39:52 +00:00
Matt Arsenault	6f35f0c212	AMDGPU: Actually commit re-run of update_llc_test_checks llvm-svn: 341218	2018-08-31 15:05:06 +00:00
Jonas Devlieghere	e3d6b9786e	[Wasm] Add missing EOF checks for floats Adds the same checks we already do for ints to floats. Fixes: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=8698 llvm-svn: 341216	2018-08-31 14:54:01 +00:00
Matt Arsenault	c807ce0ee4	SLPVectorizer: Fix assert with different sized address spaces llvm-svn: 341215	2018-08-31 14:34:53 +00:00
Matt Arsenault	28c16bd534	AMDGPU: Fix broken generated check lines This was incorrectly using the same check prefix for multiple lines llvm-svn: 341214	2018-08-31 14:34:22 +00:00
Andrea Di Biagio	69da3f3df6	[X86] Add llvm-mca tests that show how operand latency is wrongly computed for SSE sqrtss/sd and rcpss. According to the timeline view, sqrtss/sd/rcpss start executing before the load address for the memory operand is available. This problem is caused by the presence of a ReadAfterLd (a ReadAdvance). Those unary operations should not specify a ReadAdvance at all. llvm-svn: 341213	2018-08-31 14:12:13 +00:00
Francis Visoiu Mistrih	8e864be70a	[llvm-objdump] Keep the memory buffer from the dSYM alive when using -g -dsym When using -g and -dsym, llvm-objdump opens the dsym file and keeps the MachOObjectFile alive, while the memory buffer that the MachOObjectFile was based on gets destroyed. Differential Revision: https://reviews.llvm.org/D51365 llvm-svn: 341209	2018-08-31 13:10:54 +00:00
Dean Michael Berris	fc774e29d2	[XRay] Remove array for Metadata Record Types This simplifies the implementation of the metadata lookup by using scoped enums, rather than using enum classes. This way we can get the number-name mapping without having to resort to comments. Follow-up to D51289. llvm-svn: 341205	2018-08-31 11:41:08 +00:00
Alexander Ivchenko	9d053074a1	[GlobalISel][X86] Add the support for G_FPTRUNC Differential Revision: https://reviews.llvm.org/D49855 llvm-svn: 341202	2018-08-31 11:26:51 +00:00
Alexander Ivchenko	9b0b492653	[GlobalISel][X86_64] Support for G_FPTOSI Differential Revision: https://reviews.llvm.org/D49183 llvm-svn: 341200	2018-08-31 11:16:58 +00:00
Alexander Ivchenko	58a5d6fde7	[GlobalIsel][X86] Support for llvm.trap intrinsic Differential Revision: https://reviews.llvm.org/D49180 llvm-svn: 341199	2018-08-31 11:05:13 +00:00
Alexander Ivchenko	5b8418983c	[NFC] Fix unused variable warning in X86RegisterBankInfo.cpp llvm-svn: 341198	2018-08-31 10:39:54 +00:00
Andrea Di Biagio	0e21ca1278	[X86][BtVer2] Add an llvm-mca test that shows how the read latency of AVX broadcastss on ymm registers is incorrectly set. llvm-svn: 341197	2018-08-31 10:39:33 +00:00
Dean Michael Berris	7a07a41cbb	[XRay] Attempt to fix failure on Windows Original version of the code relied on implementation-defined order of bitfields. Follow-up on D51210. llvm-svn: 341194	2018-08-31 10:03:52 +00:00
Alexander Ivchenko	a26a364e75	[GlobalIsel][X86] Support for G_FCMP Differential Revision: https://reviews.llvm.org/D49172 llvm-svn: 341193	2018-08-31 09:38:27 +00:00
Simon Pilgrim	95f4120f09	Fix MSVC "not all control paths return a value" warning. NFCI. llvm-svn: 341191	2018-08-31 09:24:09 +00:00
Roman Lebedev	a8c22c0894	[XRay] FDRProducerConsumerTest: unbreak (gcc?) build /build/llvm/unittests/XRay/FDRProducerConsumerTest.cpp:90:27: error: declaration of ‘std::unique_ptr<llvm::xray::Record> llvm::xray::{anonymous}::RoundTripTest<T>::Record’ [-fpermissive] std::unique_ptr<Record> Record; ^~~~~~ In file included from /build/llvm/include/llvm/XRay/FDRLogBuilder.h:12, from /build/llvm/unittests/XRay/FDRProducerConsumerTest.cpp:15: /build/llvm/include/llvm/XRay/FDRRecords.h:28:7: error: changes meaning of ‘Record’ from ‘class llvm::xray::Record’ [-fpermissive] class Record { ^~~~~~ llvm-svn: 341189	2018-08-31 08:59:15 +00:00
Roman Lebedev	75c2961b76	[NFC][X86][AArch64] A few more patterns for [lack of] signed truncation check pattern.[NFC][X86][AArch64] A few more patterns for [lack of] signed truncation check pattern. llvm-svn: 341188	2018-08-31 08:52:03 +00:00
Andrea Di Biagio	b998eae2f2	[X86][BtVer2] Fix WriteFShuffle256 schedule write info. This patch fixes the number of micro opcodes, and processor resource cycles for the following AVX instructions: vinsertf128rr/rm vperm2f128rr/rm vbroadcastf128 Tests have been regenerated using the usual scripts in the llvm/utils directory. Differential Revision: https://reviews.llvm.org/D51492 llvm-svn: 341185	2018-08-31 08:30:47 +00:00
Dean Michael Berris	146d5791d9	[XRay] FDR Record Producer/Consumer Implementation Summary: This patch defines two new base types called `RecordProducer` and `RecordConsumer` which have default implementations for convenience (particularly for testing). A `RecordProducer` implementation has one member function called `produce()` which serves as a factory constructor for `Record` instances. This code exercises the `RecordInitializer` code path in the implementation for `FileBasedRecordProducer`. A `RecordConsumer` has a single member function called `consume(...)` which, as the name implies, consumes instances of `std::unique_ptr<Record>`. We have two implementations, one of which is used in the test to generate a vector of `std::unique_ptr<Record>` similar to how the `LogBuilder` implementation works. We introduce a test in `FDRProducerConsumerTest` which ensures that records we write through the `FDRTraceWriter` can be loaded by the `FileBasedRecordProducer`. The record(s) loaded this way are written again through the `FDRTraceWriter` into a separate string, which we then compare. This ensures that the read-in bytes to create the `Record` instances in memory can be replicated when written out through the `FDRTraceWriter`. This change depends on D51210 and is part of the refactoring of D50441 into smaller, more focused changes. Reviewers: eizan, kpw Subscribers: mgorny, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D51289 llvm-svn: 341180	2018-08-31 08:04:56 +00:00
Martin Storsjo	9e4d5f9b7b	[AArch64] Hook up the missed machine operand flag name for MO_DLLIMPORT llvm-svn: 341178	2018-08-31 08:00:34 +00:00
Martin Storsjo	f010872b5c	[MinGW] [X86] Pass true for the second parameter to StubValueTy for MO_COFFSTUB. NFC. These stubs should never be emitted for internal symbols, and nothing in AsmPrinter ever actually use this value when producing the stubs for COFF anyway. llvm-svn: 341177	2018-08-31 08:00:31 +00:00
Martin Storsjo	2dcaa41e1e	[MinGW] [ARM] Add stubs for potential automatic dllimported variables The runtime pseudo relocations can't handle the ARM format embedded addresses in movw/movt pairs. By using stubs, the potentially dllimported addresses can be touched up by the runtime pseudo relocation framework. Differential Revision: https://reviews.llvm.org/D51450 llvm-svn: 341176	2018-08-31 08:00:25 +00:00
Craig Topper	83e9f928ba	[X86] Don't do anything in ReplaceNodeResults for (v2i32 (fptoui/fptosi v2f32)) when -x86-experimental-vector-widening-legalization is on. We don't need to do our own widening, the generic legalizer can do it. llvm-svn: 341174	2018-08-31 07:05:39 +00:00
Craig Topper	7073f03f70	[X86] Add a -x86-experimental-vector-widening command line to vec_fp_to_int.ll. llvm-svn: 341173	2018-08-31 07:05:38 +00:00
Craig Topper	e9af89a78b	[X86] Don't custom widen (v2i32 (setcc v2f32)) when -x86-experimental-vector-widening-legalization is in effect. We aren't doing anything than what the generic legalizer will do so just let it do it. llvm-svn: 341172	2018-08-31 07:05:37 +00:00
Craig Topper	2140a8e307	[X86] Add -x86-experimental-vector-widening-legalization run line to avx512-cvt.ll This will cover the (v2i32 (setcc v2f32)) case in replaceNodeResults. That code shouldn't be needed at all in this mode. A future patch will skip it. llvm-svn: 341171	2018-08-31 07:05:36 +00:00
Matt Arsenault	65e43cade8	AMDGPU: Remove obsolete tests llvm-svn: 341169	2018-08-31 06:07:45 +00:00
Matt Arsenault	988df63525	AMDGPU: Stop forcing internalize at -O0 This doesn't really matter if clang is always emitting the visibility as hidden by default. llvm-svn: 341168	2018-08-31 06:02:36 +00:00
Matt Arsenault	0da6350dc8	AMDGPU: Remove remnants of old address space mapping llvm-svn: 341165	2018-08-31 05:49:54 +00:00
Lang Hames	b8b8de423d	[ORC] Remove a stray debugging output line left in a unit test. llvm-svn: 341155	2018-08-31 00:53:53 +00:00
Lang Hames	6d32002e2b	[ORC] Add utilities to RTDyldObjectLinkingLayer2 to simplify symbol flag management and materialization responsibility registration. The setOverrideObjectFlagsWithResponsibilityFlags method instructs RTDyldObjectlinkingLayer2 to override the symbol flags produced by RuntimeDyld with the flags provided by the MaterializationResponsibility instance. This can be used to enable symbol visibility (hidden/exported) for COFF object files, which do not currently support the SF_Exported flag. The setAutoClaimResponsibilityForObjectSymbols method instructs RTDyldObjectLinkingLayer2 to claim responsibility for any symbols provided by a given object file that were not already in the MaterializationResponsibility instance. Setting this flag allows higher-level program representations (e.g. LLVM IR) to be added based on only a subset of the symbols they provide, without having to write intervening layers to scan and add the additional symbols. This trades diagnostic quality for convenience however: If all symbols are enumerated up-front then clashes can be detected and reported early. If this option is set, clashes for the additional symbols may not be detected until late, and detection may depend on the flow of control through JIT'd code. llvm-svn: 341154	2018-08-31 00:53:17 +00:00
Fangrui Song	780dfe11fc	Import lit.llvm after rL341135 llvm-svn: 341149	2018-08-31 00:22:20 +00:00
Max Kazantsev	c683d643c9	Revert "[NFC] Add severe validation of InstructionPrecedenceTracking" for discussion llvm-svn: 341147	2018-08-31 00:01:54 +00:00
Michael Berg	7b9e86445c	[NFC] adding initial intersect test for Node to Instruction association llvm-svn: 341138	2018-08-30 22:43:34 +00:00
Krzysztof Parzyszek	d51f7b3b43	[Hexagon] Check validity of register class when generating bitsplit llvm-svn: 341137	2018-08-30 22:26:43 +00:00
Eli Friedman	d5d0a4d27f	[ARM] Enable GEP offset splitting for 32-bit ARM. It has essentially the same benefit it has on 64-bit ARM: it substantially reduces the number of constants used by large GEP operations. Seems to be generally helpful across a few different codebases I've tried. Differential Revision: https://reviews.llvm.org/D51462 llvm-svn: 341136	2018-08-30 22:18:27 +00:00
Nico Weber	f5415179bc	Remove LIT_SITE_CFG_IN_FOOTER, llvm It's always replaced with the same (short) static string, so just put that there directly. No intended behavior change. https://reviews.llvm.org/D51357 llvm-svn: 341135	2018-08-30 22:13:34 +00:00
Thomas Lively	abf6bdcb59	[WebAssembly] Update utility functions with SIMD types Reviewers: aheejin, dschuff Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D51516 llvm-svn: 341131	2018-08-30 22:10:43 +00:00
Thomas Lively	80725808a3	[WebAssembly] Vector conversions Summary: Lowers away bitconverts between vector types. This CL depends on D51383. Reviewers: aheejin, dschuff Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D51498 llvm-svn: 341128	2018-08-30 21:43:51 +00:00
Thomas Lively	d183d8c772	[WebAssembly] SIMD loads and stores Summary: Reuse the patterns from WebAssemblyInstrMemory.td. Reviewers: aheejin, dschuff Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D51383 llvm-svn: 341127	2018-08-30 21:36:48 +00:00
Adrian Prantl	bdffea12d0	dsymutil: Avoid pruning non-type forward declarations inside DW_TAG_module forward declarations. Especially with template instantiations, there are legitimate reasons why for declarations might be emitted into a DW_TAG_module skeleton / forward-declaration sub-tree, that are not forward declarations in the sense of that there is a more complete definition over in a .pcm file. The example in the testcase is a constant DW_TAG_member of a DW_TAG_class template instatiation. rdar://problem/43623196 llvm-svn: 341123	2018-08-30 21:21:16 +00:00
Zachary Turner	a1f57030c6	Remove some debugging code that was accidentally left in. llvm-svn: 341122	2018-08-30 21:00:57 +00:00
Zachary Turner	40d05cc11e	Add a utility script to stress test the demangler. llvm-svn: 341120	2018-08-30 20:53:48 +00:00
Zachary Turner	78ab3cb238	[MS Demangler] Add support for $$Z parameter pack separator. $$Z appears between adjacent expanded parameter packs in the same template instantiation. We don't need to print it, it's only there to disambiguate between manglings that would otherwise be ambiguous. So we just need to parse it and throw it away. llvm-svn: 341119	2018-08-30 20:53:29 +00:00
Vlad Tsyrklevich	2499aeead9	SafeStack: Prevent OOB reads with mem intrinsics Summary: Currently, the SafeStack analysis disallows out-of-bounds writes but not out-of-bounds reads for mem intrinsics like llvm.memcpy. This could cause leaks of pointers to the safe stack by leaking spilled registers/ frame pointers. Check for allocas used as source or destination pointers to mem intrinsics. Reviewers: eugenis Reviewed By: eugenis Subscribers: pcc, llvm-commits, kcc Differential Revision: https://reviews.llvm.org/D51334 llvm-svn: 341116	2018-08-30 20:44:51 +00:00
Tim Northover	c044509851	Hashing: use 64-bit seed for hashing on all platforms. get_execution_seed returns a size_t which varies across platforms, but its users actually always feed it into a uint64_t role so it makes sense to be consistent. Mostly this is just a tidy-up, but it also apparently allows PCH files to be shared between Clang compilers built for 32-bit and 64-bit hosts. llvm-svn: 341113	2018-08-30 20:28:32 +00:00
Craig Topper	b5de35a5ba	[X86] Add -x86-experimental-vector-widening-legalization command lines to vector-idiv-v2i32.ll If we're legalizing via widening already, then the type legalizer will scalarize the divs/rems as i32. llvm-svn: 341108	2018-08-30 20:10:10 +00:00
Ana Pazos	6b34051b33	[RISCV] Fixed SmallVector.h Assertion `idx < size()' Summary: RISCVAsmParser needs to handle the case the error message is of specific type, other than the generic Match_InvalidOperand, and the corresponding operand is missing. This bug was uncovered by a LLVM MC Assembler Protocol Buffer Fuzzer for the RISC-V assembly language. Reviewers: asb Reviewed By: asb Subscribers: llvm-commits, jocewei, asb, rbar, johnrusso, simoncook, sabuasal, niosHD, kito-cheng, shiva0217, zzheng, edward-jones, mgrang, rogfer01, MartinMosbeck, brucehoult, the_o, PkmX Differential Revision: https://reviews.llvm.org/D50790 llvm-svn: 341104	2018-08-30 19:43:19 +00:00
Craig Topper	6666861158	[DAGCombiner] Fix bad identation. NFC llvm-svn: 341103	2018-08-30 19:35:40 +00:00
Craig Topper	1a8c99e670	[X86] Weaken an overly aggressive assert. This assert tried to check that AND constants are only on the RHS. But its possible for both operands to be constants if one is opaque which will prevent the AND from being constant folded. Fixes PR38771 llvm-svn: 341102	2018-08-30 19:35:38 +00:00
Evandro Menezes	64ed9a7fe8	[ARM] Adjust the feature set for Exynos Enable `FeatureUseAA` for all Exynos processors. llvm-svn: 341101	2018-08-30 19:22:00 +00:00
Evandro Menezes	2123ea7d5c	[InstCombine] Expand the simplification of pow() into exp2() Generalize the simplification of `pow(2.0, y)` to `pow(2.0 ** n, y)` for all scalar and vector types. This improvement helps some benchmarks in SPEC CPU2000 and CPU2006, such as 252.eon, 447.dealII, 453.povray. Otherwise, no significant regressions on x86-64 or A64. Differential revision: https://reviews.llvm.org/D49273 llvm-svn: 341095	2018-08-30 19:04:51 +00:00
Eli Friedman	94d3e4dd77	[SROA] Fix alignment for uses of PHI nodes. Splitting an alloca can decrease the alignment of GEPs into the partition. Normally, rewriting accounts for this, but the code was missing for uses of PHI nodes and select instructions. Fixes https://bugs.llvm.org/show_bug.cgi?id=38707 . Differential Revision: https://reviews.llvm.org/D51335 llvm-svn: 341094	2018-08-30 18:59:24 +00:00
Andrew Kaylor	d9b6b81d08	Reverting r340807. This patch restores the old behavior of getAllocationDataForFunction in MemoryBuiltins.cpp. llvm-svn: 341091	2018-08-30 18:37:18 +00:00
Craig Topper	b7e14332ea	[X86] Add kshift test cases for D51401. NFC llvm-svn: 341088	2018-08-30 17:51:02 +00:00
Vladimir Stefanovic	7e58ebf6b8	Allow inconsistent offsets for 'noreturn' basic blocks when '-verify-cfiinstrs' With r295105, some 'noreturn' blocks (those that don't return and have no successors) may be merged. If such blocks' predecessors have different outgoing offset or register, don't report an error in CFIInstrInserter verify(). Thanks to Vlad Tsyrklevich for reporting the issue. Differential Revision: https://reviews.llvm.org/D51161 llvm-svn: 341087	2018-08-30 17:31:38 +00:00
Robert Widmann	0a35b7668b	[LLVM-C] Add Bindings For Named Metadata Summary: Add a new type for named metadata nodes. Use this to implement iterators and accessors for NamedMDNodes and extend the echo test to use them to copy module-level debug information. Reviewers: whitequark, deadalnix, aprantl, dexonsmith Reviewed By: whitequark Subscribers: Wallbraker, JDevlieghere, llvm-commits, harlanhaskins Differential Revision: https://reviews.llvm.org/D47179 llvm-svn: 341085	2018-08-30 17:09:43 +00:00
Sanjay Patel	8d39ed895f	[IR] fix declaration of shuffle mask An address sanitizer bot flagged this as a potential bug. llvm-svn: 341084	2018-08-30 16:44:07 +00:00
Matt Morehouse	7e042bb1d1	[libFuzzer] Port to Windows Summary: Port libFuzzer to windows-msvc. This patch allows libFuzzer targets to be built and run on Windows, using -fsanitize=fuzzer and/or fsanitize=fuzzer-no-link. It allows these forms of coverage instrumentation to work on Windows as well. It does not fix all issues, such as those with -fsanitize-coverage=stack-depth, which is not usable on Windows as of this patch. It also does not fix any libFuzzer integration tests. Nearly all of them fail to compile, fixing them will come in a later patch, so libFuzzer tests are disabled on Windows until them. Patch By: metzman Reviewers: morehouse, rnk Reviewed By: morehouse, rnk Subscribers: #sanitizers, delcypher, morehouse, kcc, eraman Differential Revision: https://reviews.llvm.org/D51022 llvm-svn: 341082	2018-08-30 15:54:44 +00:00
Wouter van Oortmerssen	a733d08db2	[WebAssembly] Made disassembler only use stack instructions. Summary: Now uses the StackBased bit from the tablegen defs to identify stack instructions (and ignore register based or non-wasm instructions). Also changed how we store operands, since we now have up to 16 of them per instruction. To not cause static data bloat, these are compressed into a tiny table. + a few other cleanups. Tested: - MCTest - llvm-lit -v `find test -name WebAssembly` Reviewers: dschuff, jgravelle-google, sunfish, tlively Subscribers: sbc100, aheejin, llvm-commits Differential Revision: https://reviews.llvm.org/D51320 llvm-svn: 341081	2018-08-30 15:40:53 +00:00
Nicolai Haehnle	65c026259e	Move test/Analysis/DivergenceAnalysis/AMDGPU/loads.ll Should fix failures of buildbots that don't build the AMDGPU backend. Change-Id: I01cb84b4b47803b10c5b21ea0353546239860a51 llvm-svn: 341079	2018-08-30 15:24:00 +00:00
Sanjay Patel	ac619a09ec	[IR] add shuffle queries for identity extend/extract This was one of the potential follow-ups suggested in D48236, and these will be used to make matching the patterns in PR38691 cleaner: https://bugs.llvm.org/show_bug.cgi?id=38691 About the vocabulary: in the DAG, these would be concat_vector with an undef operand or extract_subvector. Alternate names are discussed in the review, but I think these are familiar/good enough to proceed. Once we have uses of them in code, we might adjust if there are better options. https://reviews.llvm.org/D51392 llvm-svn: 341075	2018-08-30 15:05:38 +00:00
Alexander Ivchenko	af96112ec6	Make TargetInstrInfo::isCopyInstr return true for regular COPY-instructions ..Move all target-dependent checks into new isCopyInstrImpl method. This change allows us to treat MoveReg-type instructions and generic COPY instruction in the same way Differential Revision: https://reviews.llvm.org/D49913 llvm-svn: 341072	2018-08-30 14:32:47 +00:00
Nicolai Haehnle	35617ed4cb	[NFC] Rename the DivergenceAnalysis to LegacyDivergenceAnalysis Summary: This is patch 1 of the new DivergenceAnalysis (https://reviews.llvm.org/D50433). The purpose of this patch is to free up the name DivergenceAnalysis for the new generic implementation. The generic implementation class will be shared by specialized divergence analysis classes. Patch by: Simon Moll Reviewed By: nhaehnle Subscribers: jvesely, jholewinski, arsenm, nhaehnle, mgorny, jfb, llvm-commits Differential Revision: https://reviews.llvm.org/D50434 Change-Id: Ie8146b11be2c50d5312f30e11c7a3036a15b48cb llvm-svn: 341071	2018-08-30 14:21:36 +00:00
Alexandre Ganea	607a7be532	More build fix for r341064. llvm-svn: 341070	2018-08-30 14:05:49 +00:00
Daniel Cederman	8f0bf6c19a	[Sparc] Use ANDN instead of AND if constant can be encoded more efficiently Summary: In the case of (and reg, constant) or (or reg, constant), it can be beneficial to use a ANDNrr/ORNrr instruction instead of ANDrr/ORrr, if the complement of the constant can be encoded using a single SETHI instruction instead of a SETHI/ORri pair. If the constant has more than one use, it is probably better to keep it in its original form. Reviewers: jyknight, venkatra Reviewed By: jyknight Subscribers: fedor.sergeev, jrtc27, llvm-commits Differential Revision: https://reviews.llvm.org/D50964 llvm-svn: 341069	2018-08-30 14:05:26 +00:00
Alexander Timofeev	201f892b3b	[AMDGPU] Preliminary patch for divergence driven instruction selection. Operands Folding 1. Reviewers: rampitec Differential revision: https://reviews/llvm/org/D51316 llvm-svn: 341068	2018-08-30 13:55:04 +00:00
Alexandre Ganea	ca49e391b3	Build fix for r341064. Temporarily disable compile-time validation for createFileError(). llvm-svn: 341067	2018-08-30 13:36:07 +00:00
Alexandre Ganea	e11f221786	[Error] Add FileError helper; upgrade StringError behavior FileError is meant to encapsulate both an Error and a file name/path. It should be used in cases where an Error occurs deep down the call chain, and we want to return it to the caller along with the file name. StringError was updated to display the error messages in different ways. These can be: 1. display the error_code message, and convert to the same error_code (ECError behavior) 2. display an arbitrary string, and convert to a provided error_code (current StringError behavior) 3. display both an error_code message and a string, in this order; and convert to the same error_code These behaviors can be triggered depending on the constructor. The goal is to use StringError as a base class, when a library needs to provide a explicit Error type. Differential Revision: https://reviews.llvm.org/D50807 llvm-svn: 341064	2018-08-30 13:10:42 +00:00
Ties Stuij	9c16d809d2	[CodeGen] emit inline asm clobber list warnings for reserved (cont) Summary: This is a continuation of https://reviews.llvm.org/D49727 Below the original text, current changes in the comments: Currently, in line with GCC, when specifying reserved registers like sp or pc on an inline asm() clobber list, we don't always preserve the original value across the statement. And in general, overwriting reserved registers can have surprising results. For example: extern int bar(int[]); int foo(int i) { int a[i]; // VLA asm volatile( "mov r7, #1" : : : "r7" ); return 1 + bar(a); } Compiled for thumb, this gives: $ clang --target=arm-arm-none-eabi -march=armv7a -c test.c -o - -S -O1 -mthumb ... foo: .fnstart @ %bb.0: @ %entry .save {r4, r5, r6, r7, lr} push {r4, r5, r6, r7, lr} .setfp r7, sp, #12 add r7, sp, #12 .pad #4 sub sp, #4 movs r1, #7 add.w r0, r1, r0, lsl #2 bic r0, r0, #7 sub.w r0, sp, r0 mov sp, r0 @APP mov.w r7, #1 @NO_APP bl bar adds r0, #1 sub.w r4, r7, #12 mov sp, r4 pop {r4, r5, r6, r7, pc} ... r7 is used as the frame pointer for thumb targets, and this function needs to restore the SP from the FP because of the variable-length stack allocation a. r7 is clobbered by the inline assembly (and r7 is included in the clobber list), but LLVM does not preserve the value of the frame pointer across the assembly block. This type of behavior is similar to GCC's and has been discussed on the bugtracker: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=11807 . No consensus seemed to have been reached on the way forward. Clang behavior has briefly been discussed on the CFE mailing (starting here: http://lists.llvm.org/pipermail/cfe-dev/2018-July/058392.html). I've opted for following Eli Friedman's advice to print warnings when there are reserved registers on the clobber list so as not to diverge from GCC behavior for now. The patch uses MachineRegisterInfo's target-specific knowledge of reserved registers, just before we convert the inline asm string in the AsmPrinter. If we find a reserved register, we print a warning: repro.c:6:7: warning: inline asm clobber list contains reserved registers: R7 [-Winline-asm] "mov r7, #1" ^ Reviewers: efriedma, olista01, javed.absar Reviewed By: efriedma Subscribers: eraman, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D51165 llvm-svn: 341062	2018-08-30 12:52:35 +00:00
David Green	1f203bcd75	[AArch64] Optimise load(adr address) to ldr address Providing that the load is known to be 4 byte aligned, we can optimise a ldr(adr address) to just ldr address. Differential Revision: https://reviews.llvm.org/D51030 llvm-svn: 341058	2018-08-30 11:55:16 +00:00
Andrea Di Biagio	7f2230ff16	[llvm-mca] correctly initialize field 'CycleRetired' in the TimelineView. This fixes a [-Wmissing-field-initializers] warning reported by buildbot lld-x86_64-darwin13, build #25152. llvm-svn: 341056	2018-08-30 11:17:58 +00:00
Andrea Di Biagio	8b647dcf4b	[llvm-mca] Report the number of dispatched micro opcodes in the DispatchStatistics view. This patch introduces the following changes to the DispatchStatistics view: * DispatchStatistics now reports the number of dispatched opcodes instead of the number of dispatched instructions. * The "Dynamic Dispatch Stall Cycles" table now also reports the percentage of stall cycles against the total simulated cycles. This change allows users to easily compare dispatch group sizes with the processor DispatchWidth. Before this change, it was difficult to correlate the two numbers, since DispatchStatistics view reported numbers of instructions (instead of opcodes). DispatchWidth defines the maximum size of a dispatch group in terms of number of micro opcodes. The other change introduced by this patch is related to how DispatchStage generates "instruction dispatch" events. In particular: * There can be multiple dispatch events associated with a same instruction * Each dispatch event now encapsulates the number of dispatched micro opcodes. The number of micro opcodes declared by an instruction may exceed the processor DispatchWidth. Therefore, we cannot assume that instructions are always fully dispatched in a single cycle. DispatchStage knows already how to handle instructions declaring a number of opcodes bigger that DispatchWidth. However, DispatchStage always emitted a single instruction dispatch event (during the first simulated dispatch cycle) for instructions dispatched. With this patch, DispatchStage now correctly notifies multiple dispatch events for instructions that cannot be dispatched in a single cycle. A few views had to be modified. Views can no longer assume that there can only be one dispatch event per instruction. Tests (and docs) have been updated. Differential Revision: https://reviews.llvm.org/D51430 llvm-svn: 341055	2018-08-30 10:50:20 +00:00
Max Kazantsev	b167e3ae1b	[NFC] Whitespace fix llvm-svn: 341054	2018-08-30 10:42:08 +00:00
Alex Bradbury	d4e2c785a5	[RISCV] Fix r341050 A few stray lines were accidentally committed. Remove these. llvm-svn: 341053	2018-08-30 10:39:30 +00:00
Florian Hahn	521dc4dda4	Fix "Q" and "R" inline assembly template modifiers for big-endian Arm Consider the endianness of the target when printing register names. This is in line with the documentation at http://llvm.org/docs/LangRef.html#asm-template-argument-modifiers Patch by Jackson Woodruff <jackson.woodruff@arm.com> Reviewers: t.p.northover, echristo, javed.absar, efriedma Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D49778 llvm-svn: 341052	2018-08-30 10:28:23 +00:00
Max Kazantsev	ec9b386820	[NFC] Add severe validation of InstructionPrecedenceTracking llvm-svn: 341051	2018-08-30 10:26:06 +00:00
Alex Bradbury	f56837f70f	[RISCV][NFC] Rework CHECK lines in rvi-aliases-valid.s Previously CHECK prefixes weren't defined that can be used to check _only_ the InstPrinter output when generating .s from llvm-mc, or that check _only_ the output after passing the generated object through objdump. This means we can't write useful checks for instructions that reference symbols. Instead, use: CHECK-S Match the .s output with aliases enabled CHECK-S-NOALIAS Match the .s output with aliases disabled CHECK-OBJ Match the objdumped object output with aliases enabled CHECK-OBJ-NOALIAS Match the objdumped object output with aliases enabled CHECK-S-OBJ Match both the .s and objdumped object output with aliases enabled CHECK-S-OBJ-NOALIAS Match both the .s and objdumped object output with aliases disabled While we're at it, use whitespace consistently within this file. llvm-svn: 341050	2018-08-30 10:25:27 +00:00
Roman Lebedev	cf57af1a5c	Revert "[Hexagon][Test] Remove undef and infinite loop from test" Bots are unhappy: /Users/buildslave/jenkins/workspace/clang-stage1-cmake-RA-incremental/llvm/test/CodeGen/Hexagon/swp-const-tc2.ll:10:14: error: CHECK-NOT: excluded string found in input ; CHECK-NOT: = mpy ^ <stdin>:22:6: note: found here r5 += mpyi(r2,r3) ^~~~~ This reverts commit r341046. llvm-svn: 341049	2018-08-30 10:01:03 +00:00
Roman Lebedev	26a1836757	[NFC][CodeGen][SelectionDAG] Tests for X % C == 0 codegen improvement. Hacker's Delight 10-17: when C is constant, the result of X % C == 0 can be computed more cheaply without actually calculating the remainder. The motivation is discussed here: https://bugs.llvm.org/show_bug.cgi?id=35479. Patch by: hermord (Dmytro Shynkevych)! For https://reviews.llvm.org/D50222 llvm-svn: 341047	2018-08-30 09:32:21 +00:00
Roman Lebedev	c0b8022891	[Hexagon][Test] Remove undef and infinite loop from test Summary: As suggested in D50222, this has been refactored into a separate patch. The undef and the infinite loop at the end cause this test to be translated unpredictably. In particular, the checked-for `mpy` disappears under certain legal optimizations (e.g. the one in D50222). Since the use of these constructs is not relevant to the behavior tested, according to the header comment, this change, suggested by @kparzysz, eliminates them. Patch by: hermord (Dmytro Shynkevych)! Reviewers: kparzysz Reviewed By: kparzysz Subscribers: llvm-commits, kparzysz Differential Revision: https://reviews.llvm.org/D50944 llvm-svn: 341046	2018-08-30 09:32:15 +00:00
Roman Lebedev	f1ec7f83b6	Revert "[CMake] Use LLVM_ENABLE_IDE instead of CMAKE_CONFIGURATION_TYPES" That resulted in the check-llvm-* targets not being avaliable in the QtCreator-configured build directories. Moreover, that was a clearly non-NFC change, and i can't find any review for it. This reverts commit rL340435. llvm-svn: 341045	2018-08-30 09:32:09 +00:00
Max Kazantsev	d3487bdb61	[NFC] Rename map to make the naming consistent llvm-svn: 341043	2018-08-30 09:24:33 +00:00
Dean Michael Berris	17045975da	[XRay] Help gcc disambiguate names Follow-up to D51210. llvm-svn: 341042	2018-08-30 09:04:12 +00:00
Dean Michael Berris	d859668c76	[XRay] Move out template and use perfect forwarding Follow up to D51210. llvm-svn: 341032	2018-08-30 08:15:42 +00:00
Martin Storsjo	22dcddf651	Revert "[SimplifyCFG] Common debug handling [NFC]" This reverts commit r340997. This change turned out not to be NFC after all, but e.g. causes clang to crash when building the linux kernel for aarch64. llvm-svn: 341031	2018-08-30 08:06:50 +00:00
Dean Michael Berris	edf11fd450	[XRay] Remove attribute packed Followup to D51210. llvm-svn: 341030	2018-08-30 07:57:32 +00:00
Dean Michael Berris	a6c6343a78	[XRay] FDRTraceWriter and FDR Trace Loading Summary: This is the first step in the larger refactoring and reduction of D50441. This step in the process does the following: - Introduces more granular types of `Record`s representing the many kinds of records written/read by the Flight Data Recorder (FDR) mode `Trace` loading function(s). - Introduces an abstract `RecordVisitor` type meant to handle the processing of the various `Record` derived types. This `RecordVisitor` has two implementations in this patch: `RecordInitializer` and `FDRTraceWriter`. - We also introduce a convenience interface for building a collection of `Record` instances called a `LogBuilder`. This allows us to generate sequences of `Record` instances manually (used in unit tests but useful otherwise). - The`FDRTraceWriter` class implements the `RecordVisitor` interface and handles the writing of metadata records to a `raw_ostream`. We demonstrate that in the unit test, we can generate in-memory FDR mode traces using the specific `Record` derived types, which we load through the `loadTrace(...)` function yielding valid `Trace` objects. This patch introduces the required types and concepts for us to start replacing the logic implemented in the `loadFDRLog` function to use the more granular types. In subsequent patches, we will introduce more visitor implementations which isolate the verification, printing, indexing, production/consumption, and finally the conversion of the FDR mode logs. The overarching goal of these changes is to make handling FDR mode logs better tested, more understandable, more extensible, and more systematic. This will also allow us to better represent the execution trace, as we improve the fidelity of the events we represent in an XRay `Trace` object, which we intend to do after FDR mode log processing is in better shape. Reviewers: eizan Reviewed By: eizan Subscribers: mgorny, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D51210 llvm-svn: 341029	2018-08-30 07:22:21 +00:00
Matt Arsenault	f2edba8e43	Don't count debug instructions towards neighborhood count In computeRegisterLiveness, the max instructions to search was counting dbg_value instructions, which could potentially cause an observable codegen change from the presence of debug info. llvm-svn: 341028	2018-08-30 07:18:19 +00:00
Matt Arsenault	015a147c9f	CodeGen: Make computeRegisterLiveness search forward first If there is an unused def, this would previously report that the register was live. Check for uses first so that it is reported as dead if never used. llvm-svn: 341027	2018-08-30 07:18:10 +00:00
Matt Arsenault	eba9e9a266	CodeGen: Make computeRegisterLiveness consider successors If the end of the block is reached during the scan, check the live ins of the successors. This was already done in the other direction if the block entry was reached. llvm-svn: 341026	2018-08-30 07:17:51 +00:00
Carlos Alberto Enciso	06adfa1718	[DWARF] Missing location debug information with -O2. Check that Machine CSE correctly handles during the transformation, the debug location information for local variables. Differential Revision: https://reviews.llvm.org/D50887 llvm-svn: 341025	2018-08-30 07:17:41 +00:00
Andrew V. Tischenko	62f7a3207b	[X86] Improved sched model for X86 CMPXCHG* instructions. Differential Revision: https://reviews.llvm.org/D50070 llvm-svn: 341024	2018-08-30 06:26:00 +00:00
Craig Topper	f0531da109	[InstCombine] Add test cases for D51398 These tests contain the pattern (neg (max ~X, C)) which we should transform to ((min X, ~C) + 1) llvm-svn: 341023	2018-08-30 06:14:54 +00:00
Craig Topper	b7b353be60	[X86] Make Feature64Bit useful We now only add +64bit to the CPU string for "generic" CPU. All other CPU names are assumed to have the feature flag already set if they support 64-bit. I've remove the implies from CMPXCHG8 so that Feature64Bit only comes in via CPUs or user passing -mattr=+64bit. I've changed the assert to a report_fatal_error so it's not lost in Release builds. The test updates are to fix things that tripped the new error. Differential Revision: https://reviews.llvm.org/D51231 llvm-svn: 341022	2018-08-30 06:01:05 +00:00
Craig Topper	987ef2ddfd	[X86] Update test command line to not use 64-bit mode on a 32-bit only athlon cpu. llvm-svn: 341021	2018-08-30 06:01:03 +00:00
Craig Topper	2b3edb902d	[X86] Remove powerpc cpu name and features from uwtables.ll llvm-svn: 341020	2018-08-30 06:01:01 +00:00
Matt Arsenault	167601e629	DAG: Don't use ABI copies in some contexts If an ABI-like value is used in a different block, the type split used is not necessarily the same as the call's ABI. The value is used through an intermediate copy virtual registers from the other block. This resulted in copies with inconsistent sizes later. Fixes regressions since r338197 when AMDGPU started splitting vector types for calls. llvm-svn: 341018	2018-08-30 05:49:28 +00:00
Max Kazantsev	d3a4cbe153	[NFC] Move OrderedInstructions and InstructionPrecedenceTracking to Analysis These classes don't make any changes to IR and have no reason to be in Transform/Utils. This patch moves them to Analysis folder. This will allow us reusing these classes in some analyzes, like MustExecute. llvm-svn: 341015	2018-08-30 04:49:03 +00:00
Max Kazantsev	3c284bde3f	Re-enable "[NFC] Unify guards detection" rL340921 has been reverted by rL340923 due to linkage dependency from Transform/Utils to Analysis which is not allowed. In this patch this has been fixed, a new utility function moved to Analysis. Differential Revision: https://reviews.llvm.org/D51152 llvm-svn: 341014	2018-08-30 03:39:16 +00:00
Dean Michael Berris	f6c87eb965	[XRay][llvm] Load XRay Profiles Summary: This change implements the profile loading functionality in LLVM to support XRay's profiling mode in compiler-rt. We introduce a type named `llvm::xray::Profile` which allows building a profile representation. We can load an XRay profile from a file to build Profile instances, or do it manually through the Profile type's API. The intent is to get the `llvm-xray` tool to generate `Profile` instances and use that as the common abstraction through which all conversion and analysis can be done. In the future we can generate `Profile` instances from `Trace` instances as well, through conversion functions. Some of the key operations supported by the `Profile` API are: - Path interning (`Profile::internPath(...)`) which returns a unique path identifier. - Block appending (`Profile::addBlock(...)`) to add thread-associated profile information. - Path ID to Path lookup (`Profile::expandPath(...)`) to look up a PathID and return the original interned path. - Block iteration. A 'Path' in this context represents the function call stack in leaf-to-root order. This is represented as a path in an internally managed prefix tree in the `Profile` instance. Having a handle (PathID) to identify the unique Paths we encounter for a particular Profile allows us to reduce the amount of memory required to associate profile data to a particular Path. This is the first of a series of patches to migrate the `llvm-stacks` tool towards using a single profile representation. Depends on D48653. Reviewers: kpw, eizan Reviewed By: kpw Subscribers: kpw, thakis, mgorny, llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D48370 llvm-svn: 341012	2018-08-30 01:43:22 +00:00
Sam Clegg	88599bf6f4	[WebAssembly] Be a little more conservative in WebAssemblyFixFunctionBitcasts We don't have enough information to know if struct types being bitcast will cause validation failures or not, so be conservative and allow such cases to persist (fot now). Fixes: https://bugs.llvm.org/show_bug.cgi?id=38711 Subscribers: dschuff, jgravelle-google, aheejin, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D51460 llvm-svn: 341010	2018-08-30 01:01:30 +00:00
Huihui Zhang	2f4106592d	[GlobalMerge] Fix GlobalMerge on bss external global variables. Summary: Global variables that are external and zero initialized are supposed to be merged with global variables in the bss section rather than the data section. Reviewers: efriedma, rengolin, t.p.northover, javed.absar, asl, john.brawn, pcc Reviewed By: efriedma Subscribers: dmgreen, llvm-commits Differential Revision: https://reviews.llvm.org/D51379 llvm-svn: 341008	2018-08-30 00:49:50 +00:00
Philip Reames	bed556133d	[SimplifyCFG] Rename a variable for readibility of a future change [NFC] llvm-svn: 341004	2018-08-30 00:12:29 +00:00
Philip Reames	6bd16b5850	[SimplifyCFG] Fix a cost modeling oversight in branch commoning The cost modeling was not accounting for the fact we were duplicating the instruction once per predecessor. With a default threshold of 1, this meant we were actually creating #pred copies. Adding to the fun, there is absolutely no test coverage for this. Simply bailing for more than one predecessor passes all checked in tests. llvm-svn: 341001	2018-08-30 00:03:02 +00:00
Zachary Turner	32a8a2028c	[MS Demangler] Fix several crashes and demangling bugs. These bugs were found by writing a Python script which spidered the entire Chromium build directory tree demangling every symbol in every object file. At the start, the tool printed: Processed 27443 object files. 2926377/2936108 symbols successfully demangled (99.6686%) 9731 symbols could not be demangled (0.3314%) 14589 files crashed while demangling (53.1611%) After this patch, it prints: Processed 27443 object files. 41295518/41295617 symbols successfully demangled (99.9998%) 99 symbols could not be demangled (0.0002%) 0 files crashed while demangling (0.0000%) The issues fixed in this patch are: * Ignore empty parameter packs. Previously we would encounter a mangling for an empty parameter pack and add a null node to the AST. Since we don't print these anyway, we now just don't add anything to the AST and ignore it entirely. This fixes some of the crashes. * Account for "incorrect" string literal demanglings. Apparently an older version of clang would not truncate mangled string literals to 32 bytes of encoded character data. The demangling code however would allocate a 32 byte buffer thinking that it would not encounter more than this, and overrun the buffer. We now demangle up to 128 bytes of data, since the buggy clang would encode up to 32 characters of data. * Extended support for demangling init-fini stubs. If you had something like struct Foo { static vector<string> S; }; this would generate a dynamic atexit initializer for the variable. We didn't handle this, but now we print something nice. This is actually an improvement over undname, which will fail to demangle this at all. * Fixed one case of static this adjustment. We weren't handling several thunk codes so we didn't recognize the mangling. These are now handled. * Fixed a back-referencing problem. Member pointer templates should have their components considered for back-referencing The remaining 99 symbols which can't be demangled are all symbols which are compiler-generated and undname can't demangle either. llvm-svn: 341000	2018-08-29 23:56:09 +00:00
Eli Friedman	3769639335	[NFC] Make getPreferredAlignment honor section markings. This should more accurately reflect what the AsmPrinter will actually do. This is NFC, as far as I can tell; all the places that might be affected already have an extra check to avoid using the result of getPreferredAlignment in this situation. Differential Revision: https://reviews.llvm.org/D51377 llvm-svn: 340999	2018-08-29 23:46:26 +00:00
Philip Reames	7c57dac955	[SimplifyCFG] Common debug handling [NFC] llvm-svn: 340997	2018-08-29 23:22:07 +00:00
Jordan Rupprecht	7481540fd9	[llvm-strip] Fix -p\|--preserve-dates to not truncate output when used in-place. The restoreDateOnFile() method used to preserve dates uses sys::fs::openFileForWrite(). That method defaults to opening files with CD_CreateAlways, which truncates the output file if it exists. Use CD_OpenExisting instead to open it and not truncate it, which also has the side benefit of erroring if the file does not exist (it should always exist, because we just wrote it out). Also, fix the test case to make sure the output is a valid output file, and not empty. The extra test assertions are enough to catch this regression. llvm-svn: 340996	2018-08-29 23:21:56 +00:00
Alina Sbirlea	6edcc9ee86	[MemorySSA] Silence warning. llvm-svn: 340995	2018-08-29 23:20:29 +00:00
Matthias Braun	b7b5860657	Reverse subregister saved loops in register usage info collector; NFC On AMDGPU we have 70 register classes, so iterating over all 70 each time and exiting is costly on the CPU, this flips the loop around so that it loops over the 70 register classes first, and exits without doing the inner loop if needed. On my test just starting radv this takes RegUsageInfoCollector::runOnMachineFunction from 6.0% of total time to 2.7% of total time, and reduces the startup from 2.24s to 2.19s Patch by David Airlie! Differential Revision: https://reviews.llvm.org/D48582 llvm-svn: 340993	2018-08-29 23:12:42 +00:00
Reid Kleckner	9397c2a23b	Revert r340947 "[InstCombine] Expand the simplification of pow() into exp2()" It broke the clang-cl self-host. llvm-svn: 340991	2018-08-29 22:58:33 +00:00
Alina Sbirlea	5bce4d5a85	[MemorySSA] Fix checkClobberSanity to skip Start only for Defs and Uses. llvm-svn: 340981	2018-08-29 22:38:51 +00:00
Philip Reames	1887c40b22	Add a todo and tests to Address a review commnt from D50925 [NFC] llvm-svn: 340978	2018-08-29 22:09:21 +00:00
Philip Reames	f562fc8dbf	[LICM] Hoist stores of invariant values to invariant addresses out of loops Teach LICM to hoist stores out of loops when the store writes to a location otherwise unused in the loop, writes a value which is invariant, and is guaranteed to execute if the loop is entered. Worth noting is that this transformation is partially overlapping with the existing promotion transformation. Reasons this is worthwhile anyway include: * For multi-exit loops, this doesn't require duplication of the store. * It kicks in for case where we can't prove we exit through a normal exit (i.e. we may throw), but can prove the store executes before that possible side exit. Differential Revision: https://reviews.llvm.org/D50925 llvm-svn: 340974	2018-08-29 21:49:30 +00:00
Marek Olsak	3fc2079cf4	AMDGPU: Handle 32-bit address wraparounds for SMRD opcodes Summary: This fixes GPU hangs with OpenGL bindless handle arithmetic. Reviewers: arsenm, nhaehnle Subscribers: kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D51203 llvm-svn: 340959	2018-08-29 20:03:00 +00:00
Fedor Sergeev	7b49aa03af	[SimpleLoopUnswitch] After unswitch delete dead blocks in parent loops Summary: Assert from PR38737 happens on the dead block inside the parent loop after unswitching nontrivial switch in the inner loop. deleteDeadBlocksFromLoop now takes extra care to detect/remove dead blocks in all the parent loops in addition to the blocks from original loop being unswitched. Reviewers: asbirlea, chandlerc Reviewed By: asbirlea Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D51415 llvm-svn: 340955	2018-08-29 19:10:44 +00:00
Matt Morehouse	cf311cfc20	Revert "[libFuzzer] Port to Windows" This reverts r340949 due to bot breakage again. llvm-svn: 340954	2018-08-29 18:40:41 +00:00
Sanjay Patel	0f29e953b7	[InstCombine] canonicalize fneg with llvm.sin This is a follow-up to rL339604 which did the same transform for a sin libcall. The handling of intrinsics vs. libcalls is unfortunately scattered, so I'm just adding this next to the existing transform for llvm.cos for now. This should resolve PR38458: https://bugs.llvm.org/show_bug.cgi?id=38458 If the call was already negated, the negates will cancel each other out. llvm-svn: 340952	2018-08-29 18:27:49 +00:00
Alina Sbirlea	f5403d83e7	[MemorySSA] Add expesive check for validating clobber accesses. Summary: Add validation of clobber accesses as expensive check. Reviewers: george.burgess.iv Subscribers: sanjoy, jlebar, Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D51327 llvm-svn: 340951	2018-08-29 18:26:04 +00:00
Sanjay Patel	12a7ea44ed	[InstCombine] add tests for llvm.sin(-x); NFC Also add a corresponding test for llvm.cos with FMF to make sure that was handled correctly. llvm-svn: 340950	2018-08-29 18:11:42 +00:00
Matt Morehouse	245ebd71ef	[libFuzzer] Port to Windows Summary: Port libFuzzer to windows-msvc. This patch allows libFuzzer targets to be built and run on Windows, using -fsanitize=fuzzer and/or fsanitize=fuzzer-no-link. It allows these forms of coverage instrumentation to work on Windows as well. It does not fix all issues, such as those with -fsanitize-coverage=stack-depth, which is not usable on Windows as of this patch. It also does not fix any libFuzzer integration tests. Nearly all of them fail to compile, fixing them will come in a later patch, so libFuzzer tests are disabled on Windows until them. Reviewers: morehouse, rnk Reviewed By: morehouse, rnk Subscribers: #sanitizers, delcypher, morehouse, kcc, eraman Differential Revision: https://reviews.llvm.org/D51022 llvm-svn: 340949	2018-08-29 18:08:34 +00:00
Evandro Menezes	22e0bdf4ed	[InstCombine] Expand the simplification of pow() with nested exp{,2}() Expand the simplification of `pow(exp{,2}(x), y)` to all FP types. This improvement helps some benchmarks in SPEC CPU2000 and CPU2006, such as 252.eon, 447.dealII, 453.povray. Otherwise, no significant regressions on x86-64 or A64. Differential revision: https://reviews.llvm.org/D51195 llvm-svn: 340948	2018-08-29 17:59:48 +00:00
Evandro Menezes	a3a7b53571	[InstCombine] Expand the simplification of pow() into exp2() Generalize the simplification of `pow(2.0, y)` to `pow(2.0 ** n, y)` for all scalar and vector types. This improvement helps some benchmarks in SPEC CPU2000 and CPU2006, such as 252.eon, 447.dealII, 453.povray. Otherwise, no significant regressions on x86-64 or A64. Differential revision: https://reviews.llvm.org/D49273 llvm-svn: 340947	2018-08-29 17:59:34 +00:00
Andrea Di Biagio	a2eee47450	[llvm-mca] Add fields "Total uOps" and "uOps Per Cycle" to the report generated by the SummaryView. This patch adds two new fields to the perf report generated by the SummaryView. Fields are now logically organized into two small groups; only the second group contains throughput indicators. Example: ``` Iterations: 100 Instructions: 300 Total Cycles: 414 Total uOps: 700 Dispatch Width: 4 uOps Per Cycle: 1.69 IPC: 0.72 Block RThroughput: 4.0 ``` This patch also updates the docs for llvm-mca. Due to the nature of this change, several tests in the tools/llvm-mca directory were affected, and had to be updated using script `update_mca_test_checks.py`. llvm-svn: 340946	2018-08-29 17:56:39 +00:00
Andrea Di Biagio	5221e17fd6	[llvm-mca] Don't disable the SummaryView if flag `-all-stats` is false. llvm-svn: 340945	2018-08-29 17:40:04 +00:00
Martin Storsjo	489993db94	[MinGW] [X86] Add stubs for references to data variables that might end up imported from a dll Variables declared with the dllimport attribute are accessed via a stub variable named __imp_<var>. In MinGW configurations, variables that aren't declared with a dllimport attribute might still end up imported from another DLL with runtime pseudo relocs. For x86_64, this avoids the risk that the target is out of range for a 32 bit PC relative reference, in case the target DLL is loaded further than 4 GB from the reference. It also avoids having to make the text section writable at runtime when doing the runtime fixups, which makes it worthwhile to do for i386 as well. Add stub variables for all dso local data references where a definition of the variable isn't visible within the module, since the DLL data autoimporting might make them imported even though they are marked as dso local within LLVM. Don't do this for variables that actually are defined within the same module, since we then know for sure that it actually is dso local. Don't do this for references to functions, since there's no need for runtime pseudo relocations for autoimporting them; if a function from a different DLL is called without the appropriate dllimport attribute, the call just gets routed via a thunk instead. GCC does something similar since 4.9 (when compiling with -mcmodel=medium or large; from that version, medium is the default code model for x86_64 mingw), but only for x86_64. Differential Revision: https://reviews.llvm.org/D51288 llvm-svn: 340942	2018-08-29 17:28:34 +00:00
Craig Topper	2bcb1eeee1	[InstCombine] Replace two calls to getNumUses() with !hasNUsesOrMore We were calling getNumUses to check for 1 or 2 uses. But getNumUses is linear in the number of uses. We can instead use !hasNUsesOrMore(3) which will stop the linear scan as soon as it determines there are at least 3 uses even if there are more. llvm-svn: 340939	2018-08-29 17:09:21 +00:00
Zachary Turner	68a46d8966	Update Visual Studio Integration version number. This updates the version number in the manifest file to match the SVN revision at which it was committed. llvm-svn: 340938	2018-08-29 16:57:37 +00:00
Farhana Aleen	9250c92d0e	[AMDGPU] Match udot4 pattern. Summary: D.u32 = S0.u8[0] * S1.u8[0] + S0.u8[1] * S1.u8[1] + S0.u8[2] * S1.u8[2] + S0.u8[3] * S1.u8[3] + S2.u32 Author: FarhanaAleen Reviewed By: arsenm Subscribers: llvm-commits, AMDGPU Differential Revision: https://reviews.llvm.org/D50921 llvm-svn: 340936	2018-08-29 16:31:18 +00:00
Alexandre Ganea	bc2f06c9ef	[DebugCounters] Fix DebugCounterTest when running all SupportTests Previously, the DebugCounterTest was failing because CommandLineTest.GetCommandLineArguments was clearing all the global singletons. Differential Revision: https://reviews.llvm.org/D51423 llvm-svn: 340935	2018-08-29 16:11:48 +00:00
Sanjay Patel	3abd9f6bdc	[InstCombine] add test for vector demanded elements + shrinking; NFC llvm-svn: 340933	2018-08-29 15:34:19 +00:00
Simon Atanasyan	dc7f04bcea	[mips] Fix microMIPS unconditional branch offset handling MipsSEInstrInfo class defines for internal purpose unconditional branches as Mips::B nad Mips:J even in case of microMIPS code generation. Under some conditions that leads to the bug - for rather long branch which fits to Mips jump instruction offset size, but does not fit to microMIPS jump offset size, we generate 'short' branch and later show an error 'out of range PC16 fixup' after check in the isBranchOffsetInRange routine. Differential revision: https://reviews.llvm.org/D50615 llvm-svn: 340932	2018-08-29 14:54:01 +00:00
Simon Atanasyan	a7999216f8	[mips] Involves microMIPS's jump in the analyzable branch set Involves microMIPS's jump in the analyzable branch set to reduce some code patterns. Differential revision: https://reviews.llvm.org/D50613 llvm-svn: 340931	2018-08-29 14:53:55 +00:00
Sanjay Patel	d4e19d272a	[InstCombine] move declarations closer to uses; NFC llvm-svn: 340930	2018-08-29 14:42:12 +00:00
Vladimir Stefanovic	0ef60da858	[mips] Prevent shrink-wrap for BuildPairF64, ExtractElementF64 when they use $sp For a certain combination of options, BuildPairF64_{64}, ExtractElementF64{_64} may be expanded into instructions using stack. Add implicit operand $sp for such cases so that ShrinkWrapping doesn't move prologue setup below them. Fixes MultiSource/Benchmarks/MallocBench/cfrac for '--target=mips-img-linux-gnu -mcpu=mips32r6 -mfpxx -mnan=2008' and '--target=mips-img-linux-gnu -mcpu=mips32r6 -mfp64 -mnan=2008 -mno-odd-spreg'. Differential Revision: https://reviews.llvm.org/D50986 llvm-svn: 340927	2018-08-29 14:07:14 +00:00
Sanjay Patel	7a05641fa8	[InstCombine] remove unnecessary shuffle undef folding Add a test for constant folding to show that (shuffle undef, undef, mask) should already be handled via instsimplify. llvm-svn: 340926	2018-08-29 13:24:34 +00:00
Alexandros Lamprineas	f6db5bcd38	Revert r340922 "[GVNHoist] Re-enable GVNHoist by default" Another sanitizer buildbot failed this time at bootstrap when compiling SemaTemplateInstantiate.cpp with this assertion: `dominates(MD, U) && "Memory Def does not dominate it's uses"'. http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux/builds/15047 llvm-svn: 340925	2018-08-29 13:00:55 +00:00
Hans Wennborg	2c390c54f6	Revert r340921 "[NFC] Unify guards detection" This broke the build, see e.g. http://lab.llvm.org:8011/builders/clang-cmake-armv8-lnt/builds/4626/ http://lab.llvm.org:8011/builders/clang-ppc64be-linux-lnt/builds/18647/ http://lab.llvm.org:8011/builders/clang-cmake-x86_64-avx2-linux/builds/5856/ http://lab.llvm.org:8011/builders/lld-x86_64-freebsd/builds/22800/ > We have multiple places in code where we try to identify whether or not > some instruction is a guard. This patch factors out this logic into a separate > utility function which works uniformly in all places. > > Differential Revision: https://reviews.llvm.org/D51152 > Reviewed By: fedor.sergeev llvm-svn: 340923	2018-08-29 12:21:32 +00:00
Alexandros Lamprineas	c03b9b8854	[GVNHoist] Re-enable GVNHoist by default Rebase rL338240 since the excessive memory usage observed when using GVNHoist with UBSan has been fixed by rL340818. Differential Revision: https://reviews.llvm.org/D49858 llvm-svn: 340922	2018-08-29 11:58:34 +00:00
Max Kazantsev	1dafaa87d9	[NFC] Unify guards detection We have multiple places in code where we try to identify whether or not some instruction is a guard. This patch factors out this logic into a separate utility function which works uniformly in all places. Differential Revision: https://reviews.llvm.org/D51152 Reviewed By: fedor.sergeev llvm-svn: 340921	2018-08-29 11:37:34 +00:00
Aleksandar Beserminji	f8f00e5065	[mips] Add missing instructions Add pll.ps, plu.ps, cvt.s.pu, cvt.s.pl, cvt.ps instructions for FP64. Differential Revision: https://reviews.llvm.org/D50437 llvm-svn: 340920	2018-08-29 11:35:03 +00:00
Simon Pilgrim	b49d5f3b53	[DAGCombiner] Add X / X -> 1 & X % X -> 0 folds Adds more divrem folds to try and get in sync with InstructionSimplify Differential Revision: https://reviews.llvm.org/D50636 llvm-svn: 340919	2018-08-29 11:30:16 +00:00
Simon Pilgrim	09cc7af85a	[DAGCombiner] Add X / X -> 1 & X % X -> 0 folds (test tweaks) Adjust missed test to avoid the X / X -> 1 & X % X -> 0 folds while keeping their original purposes. Differential Revision: https://reviews.llvm.org/D50636 llvm-svn: 340917	2018-08-29 11:23:59 +00:00
Simon Pilgrim	6d71c4cfe3	[DAGCombiner] Add X / X -> 1 & X % X -> 0 folds (test tweaks) Adjust tests to avoid the X / X -> 1 & X % X -> 0 folds while keeping their original purposes. Differential Revision: https://reviews.llvm.org/D50636 llvm-svn: 340916	2018-08-29 11:18:14 +00:00
Max Kazantsev	8b4ffe66d6	[NFC] Factor out guard utility methods into a separate file This patch creates file GuardUtils which will contain logic for work with guards that can be shared across different passes. Differential Revision: https://reviews.llvm.org/D51151 Reviewed By: fedor.sergeev llvm-svn: 340914	2018-08-29 10:51:59 +00:00
Simon Pilgrim	6b9bf7ecbc	[X86][AVX] Prefer VPBLENDW+VPBLENDD to VPBLENDVB for v16i16 blend shuffles Noticed while looking at D49562 codegen - we can avoid a large constant mask load and a slow VPBLENDVB select op by using VPBLENDW+VPBLENDD instead. TODO: As discussed on the patch, we should investigate adding VPBLENDVB handling to target shuffle combining as well, that will allow us to extend this to VPBLENDW+VPBLENDW+VPBLENDD. Differential Revision: https://reviews.llvm.org/D50074 llvm-svn: 340913	2018-08-29 10:51:08 +00:00
Krasimir Georgiev	edc318166f	[MC] fix a clang-tidy warning, NFC Summary: Per clang-tidy: function 'llvm::MCStreamer::checkCVLocSection' has a definition with different parameter names .../llvm/lib/MC/MCStreamer.cpp:275:18: the definition seen here .../llvm/include/llvm/MC/MCStreamer.h:235:8: differing parameters are named here: ('FuncId'), in definition: ('FunctionId') Reviewers: bkramer Reviewed By: bkramer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D51406 llvm-svn: 340912	2018-08-29 10:40:51 +00:00
Simon Pilgrim	39715e3a66	Remove debug code accidently committed in rL340837. NFCI. llvm-svn: 340908	2018-08-29 10:10:58 +00:00
George Rimar	9fbecc97ae	Revert r340904 "[llvm-mc] - Allow to set custom flags for debug sections." It broke PPC64 BB: http://lab.llvm.org:8011/builders/clang-ppc64be-linux/builds/23252 llvm-svn: 340906	2018-08-29 09:04:52 +00:00
Kirill Bobyrev	b3caadf5f4	[benchmark] NFC: Turn benchmark ON on all non-Windows buildbots The problems with benchmark build should be fixed now, but Windows buildbots still run into errors seemingly because of the bug in clang-cl. Because of that, benchmark shouldn't be built on Windows at this point. llvm-svn: 340905	2018-08-29 08:59:36 +00:00
George Rimar	999d1ce517	[llvm-mc] - Allow to set custom flags for debug sections. I am experimenting with a single split dwarf (.dwo sections in .o files). I want to make linker to ignore .dwo sections in .o, for that I am trying to add SHF_EXCLUDE flag ("E") for them in my asm sample. I found that currently, it is impossible to add any flag for debug sections using llvm-mc. That happens because we have a set of predefined unique sections created early with default flags: https://github.com/llvm-mirror/llvm/blob/master/lib/MC/MCObjectFileInfo.cpp#L391 This patch allows a user to add any flags he wants. I had to edit TargetLoweringObjectFileImpl.cpp to set MetaData type for debug sections. Their kind was Data by default (so they were allocatable) and so after changes introduced by this patch the SHF_ALLOC flag was applied for them, what does not make sense for debug sections. One of OrcJITTests tests failed because of that. Differential revision: https://reviews.llvm.org/D51361 llvm-svn: 340904	2018-08-29 08:42:02 +00:00
Nicolai Haehnle	283b995097	AMDGPU: Fix getInstSizeInBytes Summary: Add some optional code to validate getInstSizeInBytes for emitted instructions. This flushed out some issues which are fixed by this patch: - Streamline getInstSizeInBytes - Properly define the VI readlane/writelane instruction as VOP3 - Fix the inline constant determination. Specifically, this change fixes an issue where a 32-bit value of 0xffffffff was recorded as unsigned. This is equal to -1 when restricting to a 32-bit comparison, and an inline constant can be used. Reviewers: arsenm, rampitec Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D50629 Change-Id: Id87c3b7975839da0de8156a124b0ce98c5fb47f2 llvm-svn: 340903	2018-08-29 07:46:09 +00:00
Hans Wennborg	e0f3e9283f	LoopSink: Don't sink into blocks without an insertion point (PR38462) In the PR, LoopSink was trying to sink into a catchswitch block, which doesn't have a valid insertion point. Differential Revision: https://reviews.llvm.org/D51307 llvm-svn: 340900	2018-08-29 06:55:27 +00:00
Craig Topper	01235a4033	[SelectionDAG] Remove masked_gather/scatter from TargetSelectionDAG.td. These aren't used in tree and the number of operands in the type profile is wrong. X86 uses its own ISD opcode and type profile after op legalization. llvm-svn: 340899	2018-08-29 04:45:33 +00:00
Craig Topper	6b03f267b0	[SelectionDAG] Add some comments to ISDOpcodes.h about the operands of MLOAD, MSTORE, MGATHER, MSCATTER. NFC llvm-svn: 340898	2018-08-29 04:45:32 +00:00
Zachary Turner	b2fef1a0b0	Add support for various C++14 demanglings. Mostly this includes <auto> and <decltype-auto> return values. Additionally, this fixes a fairly obscure back-referencing bug that was encountered in one of the C++14 tests, which is that if you have something like Foo<&bar, &bar> then the `bar` forms a backreference. llvm-svn: 340896	2018-08-29 04:12:44 +00:00

... 3 4 5 6 7 ...

168937 Commits