llvm-project

Commit Graph

Author	SHA1	Message	Date
Eric Christopher	04c7db31e8	Temporarily Revert "[MC] Defer asm errors to post-statement failure" as it's causing errors on the sanitizer bots. This reverts commit r281249. llvm-svn: 281280	2016-09-13 00:19:29 +00:00
Adam Nemet	fd84f48f9f	[OptDiag] Add getHotness accessor llvm-svn: 281274	2016-09-12 23:46:34 +00:00
Philip Reames	9db7948e90	[LVI] Complete the abstract of the cache layer [NFCI] Convert the previous introduced is-a relationship between the LVICache and LVIImple clases into a has-a relationship and hide all the implementation details of the cache from the lazy query layer. The only slightly concerning change here is removing the addition of a queried block into the SeenBlock set in LVIImpl::getBlockValue. As far as I can tell, this was effectively dead code. I think it used to be the case that getCachedValueInfo wasn't const and might end up inserting elements in the cache during lookup. That's no longer true and hasn't been for a while. I did fixup the const usage to make that more obvious. llvm-svn: 281272	2016-09-12 22:38:44 +00:00
Sanjay Patel	ff00fae8e6	add more tests for PR30273 llvm-svn: 281270	2016-09-12 22:28:29 +00:00
Lang Hames	a506d50744	[ORC] Clang-format RPCSerialization.h. llvm-svn: 281269	2016-09-12 22:05:14 +00:00
Lang Hames	c83e39b9e2	[ORC] Add some more documentation to RPCSerialization.h. llvm-svn: 281268	2016-09-12 22:05:12 +00:00
Philip Reames	b627aec407	[LVI] Sink a couple more cache manipulation routines into the cache itself [NFCI] The only interesting bit here is the refactor of the handle callback and even that's pretty straight-forward. llvm-svn: 281267	2016-09-12 22:03:36 +00:00
Philip Reames	92e5e1b92d	[LVI] Abstract out the actual cache logic [NFCI] Seperate the caching logic from the implementation of the lazy analysis. For the moment, the lazy analysis impl has a is-a relationship with the cache; this will change to a has-a relationship shortly. This was done as two steps merely to keep the changes simple and the diff understandable. llvm-svn: 281266	2016-09-12 21:46:58 +00:00
Nico Weber	7c31d0ebc0	Revert r281215, it caused PR30358. llvm-svn: 281263	2016-09-12 21:40:50 +00:00
Nico Weber	8ad01c303e	attempt to unbreak build after r281254 llvm-svn: 281262	2016-09-12 21:15:44 +00:00
Lang Hames	2f98f31c77	[ORC] Add missing <thread> header to RPCSerialization.h. llvm-svn: 281257	2016-09-12 20:45:01 +00:00
Lang Hames	27ce3c1b5e	[ORC] Replace the serialize/deserialize function pair with a SerializationTraits class. SerializationTraits provides serialize and deserialize methods corresponding to the earlier functions, but also provides a name for the type. In future, this name will be used to render function signatures as strings, which will in turn be used to negotiate and verify API support between RPC clients and servers. llvm-svn: 281254	2016-09-12 20:34:41 +00:00
Dehao Chen	c32d71253c	Fix the bug introduced in r281252. llvm-svn: 281253	2016-09-12 20:29:54 +00:00
Dehao Chen	9bbb941acf	Lower consecutive select instructions correctly. Summary: If consecutive select instructions are lowered separately in CGP, it will introduce redundant condition check and branches that cannot be removed by later optimization phases. This patch lowers all consecutive select instructions at the same to to avoid inefficent code as demonstrated in https://llvm.org/bugs/show_bug.cgi?id=29095 Reviewers: davidxl Subscribers: vsk, llvm-commits Differential Revision: https://reviews.llvm.org/D24147 llvm-svn: 281252	2016-09-12 20:23:28 +00:00
Nirav Dave	c0c0f7a196	[MC] Defer asm errors to post-statement failure Allow errors to be deferred and emitted as part of clean up to simplify and shorten Assembly parser code. This will allow error messages to be emitted in helper functions and be modified by the caller which has better context. As part of this many minor cleanups to the Parser: * Unify parser cleanup on error * Add Workaround for incorrect return values in ParseDirective instances * Tighten checks on error-signifying return values for parser functions and fix in-tree TargetParsers to be more consistent with the changes. * Fix AArch64 test cases checking for spurious error messages that are now fixed. These changes should be backwards compatible with current Target Parsers so long as the error status are correctly returned in appropriate functions. Reviewers: rnk, majnemer Subscribers: aemerson, jyknight, llvm-commits Differential Revision: https://reviews.llvm.org/D24047 llvm-svn: 281249	2016-09-12 20:03:02 +00:00
Sanjay Patel	dea26950a0	[InstCombine] add test for PR30327 llvm-svn: 281248	2016-09-12 19:50:08 +00:00
Sanjay Patel	2eea9a1d58	[InstCombine] regenerate checks llvm-svn: 281247	2016-09-12 19:29:26 +00:00
Elena Demikhovsky	57c602aad3	AVX-512: Added a test for -O0 mode. NFC. llvm-svn: 281246	2016-09-12 19:03:21 +00:00
Elena Demikhovsky	5730bf0429	AVX-512: Simplified masked_gather_scatter test. NFC. llvm-svn: 281244	2016-09-12 18:50:47 +00:00
Hemant Kulkarni	3a705b570b	Fix test failure in r281232 llvm-svn: 281240	2016-09-12 17:40:10 +00:00
Lang Hames	8d4be3aacf	[MCJIT] Fix some inconsistent handling of name mangling inside MCJIT. This patch moves symbol mangling from findSymbol to getSymbolAddress. The findSymbol, findExistingSymbol and findModuleForSymbol methods now always take a mangled name, allowing the 'demangle-and-retry' cruft to be removed from findSymbol. See http://llvm.org/PR28699 for details. Patch by James Holderness. Thanks very much James! llvm-svn: 281238	2016-09-12 17:19:24 +00:00
Hemant Kulkarni	5f4ca2f371	llvm-size: Add --totals option Differential Revision: https://reviews.llvm.org/D24308 llvm-svn: 281233	2016-09-12 17:08:28 +00:00
Hemant Kulkarni	aecf9d0c86	llvm-objdump: Add --start-address and --stop-address options Differential Revision: https://reviews.llvm.org/D24160 llvm-svn: 281232	2016-09-12 17:08:22 +00:00
Sanjay Patel	f5887f1fbd	[InstCombine] use m_APInt to allow icmp X, C folds for splat constant vectors isSignBitCheck could be changed to take a pointer param to avoid the 'UnusedBit' ugliness. llvm-svn: 281231	2016-09-12 16:25:41 +00:00
Nicolai Haehnle	e58e0e3fe3	AMDGPU: Do not clobber SCC in SIWholeQuadMode Reviewers: arsenm, tstellarAMD, mareko Subscribers: arsenm, llvm-commits, kzhuravl Differential Revision: http://reviews.llvm.org/D22198 llvm-svn: 281230	2016-09-12 16:25:20 +00:00
Ahmed Bougacha	925961b20c	[GlobalISel] Fix mismatched "<..)" in intrinsic MO printing. NFC. llvm-svn: 281229	2016-09-12 16:21:49 +00:00
James Molloy	3d06ff22b7	Revert "[ARM] Promote small global constants to constant pools" This reverts commit r281213. It made a bot go bang: http://lab.llvm.org:8011/builders/clang-cmake-armv7-a15-full/builds/14625 llvm-svn: 281228	2016-09-12 16:18:23 +00:00
Ahmed Bougacha	b678219aa6	[BranchFolding] Unique added live-ins after hoisting code. We're not supposed to have duplicate live-ins. llvm-svn: 281224	2016-09-12 16:05:31 +00:00
Ahmed Bougacha	45bfa8772f	[X86] Copy imp-uses when folding tailcall into conditional branch. r280832 added 32-bit support for emitting conditional tail-calls, but dropped imp-used parameter registers. This went unnoticed until r281113, which added 64-bit support, as this is only exposed with parameter passing via registers. Don't drop the imp-used parameters. llvm-svn: 281223	2016-09-12 16:05:27 +00:00
David Majnemer	c83044d9bb	[FunctionAttrs] Don't try to infer returned if it is already on an argument Trying to infer the 'returned' attribute if an argument is already 'returned' can lead to verification failure: inference might determine that a different argument is passed through which would result in two different arguments marked as 'returned'. This fixes PR30350. llvm-svn: 281221	2016-09-12 16:04:59 +00:00
Sanjay Patel	0531f0a5bb	fix formatting; NFC llvm-svn: 281220	2016-09-12 15:52:28 +00:00
Sanjay Patel	db400baa80	[InstCombine] add tests to show missing vector folds llvm-svn: 281219	2016-09-12 15:51:42 +00:00
Igor Breger	a3e36da6f2	add select i1 test, reproduser pr30249. llvm-svn: 281218	2016-09-12 15:27:02 +00:00
Sanjay Patel	3151dec7f1	[InstCombine] add helper function for foldICmpUsingKnownBits; NFCI llvm-svn: 281217	2016-09-12 15:24:31 +00:00
Sam Kolton	fb0d9d9c13	[AMDGPU] Assembler: Move disabled SDWA and DPP instruction into Disable asm variant Summary: This removes disabled instructions from match tables so we will not match them at all. Reviewers: tstellarAMD, vpykhtin, artem.tamazov Subscribers: wdng, nhaehnle, arsenm Differential Revision: https://reviews.llvm.org/D24452 llvm-svn: 281216	2016-09-12 14:42:43 +00:00
James Molloy	1e1b56bd48	[Thumb] Teach ISel how to lower compares of AND bitmasks efficiently For the common pattern (CMPZ (AND x, #bitmask), #0), we can do some more efficient instruction selection if the bitmask is one consecutive sequence of set bits (32 - clz(bm) - ctz(bm) == popcount(bm)). 1) If the bitmask touches the LSB, then we can remove all the upper bits and set the flags by doing one LSLS. 2) If the bitmask touches the MSB, then we can remove all the lower bits and set the flags with one LSRS. 3) If the bitmask has popcount == 1 (only one set bit), we can shift that bit into the sign bit with one LSLS and change the condition query from NE/EQ to MI/PL (we could also implement this by shifting into the carry bit and branching on BCC/BCS). 4) Otherwise, we can emit a sequence of LSLS+LSRS to remove the upper and lower zero bits of the mask. 1-3 require only one 16-bit instruction and can elide the CMP. 4 requires two 16-bit instructions but can elide the CMP and doesn't require materializing a complex immediate, so is also a win. llvm-svn: 281215	2016-09-12 14:30:48 +00:00
Sanjay Patel	5352331716	fix formatting/typos; NFC llvm-svn: 281214	2016-09-12 14:25:46 +00:00
James Molloy	8f82d45ff4	[ARM] Promote small global constants to constant pools If a constant is unamed_addr and is only used within one function, we can save on the code size and runtime cost of an indirection by changing the global's storage to inside the constant pool. For example, instead of: ldr r0, .CPI0 bl printf bx lr .CPI0: &format_string format_string: .asciz "hello, world!\n" We can emit: adr r0, .CPI0 bl printf bx lr .CPI0: .asciz "hello, world!\n" This can cause significant code size savings when many small strings are used in one function (4 bytes per string). llvm-svn: 281213	2016-09-12 13:42:16 +00:00
Chad Rosier	a4c424654e	[LoopInterchange] Improve debug output. NFC. llvm-svn: 281212	2016-09-12 13:24:47 +00:00
Pablo Barrio	0bebc38abb	Fix the Thumb test for vfloat intrinsics Summary: This test was not testing the intrinsics. A function like this: define %v4f32 @test_v4f32.floor(%v4f32 %a){ ... %1 = call %v4f32 @llvm.floor.v4f32(%v4f32 %a) ... } is transformed into the following assembly: _test_v4f32.floor: @ @test_v4f32.floor ... bl _floorf ... In each function tested, there are two CHECK: one that checked for the label and another one for the intrinsic that should be used inside the function (in our case, "floor"). However, although the first CHECK was matching the label, the second was not matching the intrinsic, but the second "floor" in the same line as the label. This is fixed by making the first CHECK match the entire line. Reviewers: jmolloy, rengolin Subscribers: rengolin, llvm-commits Differential Revision: https://reviews.llvm.org/D24398 llvm-svn: 281211	2016-09-12 13:14:14 +00:00
Rafael Espindola	74941239d8	Define a dummy zlib::uncompress when zlib is not available. Should fix link errors in some bots when it is used. llvm-svn: 281208	2016-09-12 13:00:51 +00:00
Tim Northover	032548fc5e	GlobalISel: support translation of global addresses. llvm-svn: 281207	2016-09-12 12:10:41 +00:00
Tim Northover	a7653b3919	GlobalISel: translate GEP instructions. Unlike SDag, we use a separate G_GEP instruction (much simplified, only taking a single byte offset) to preserve the pointer type information through selection. llvm-svn: 281205	2016-09-12 11:20:22 +00:00
Tim Northover	d28d3cc079	GlobalISel: disambiguate types when printing MIR Some generic instructions have multiple types. While in theory these always be discovered by inspecting the single definition of each generic vreg, in practice those definitions won't always be local and traipsing through a big function to find them will not be fun. So this changes MIRPrinter to print out the type of uses as well as defs, if they're known to be different or not known to be the same. On the parsing side, we're a little more flexible: provided each register is given a type in at least one place it's mentioned (and all types are consistent) we accept the MIR. This doesn't introduce ambiguity but makes writing tests manually a bit less painful. llvm-svn: 281204	2016-09-12 11:20:10 +00:00
Eric Liu	c7e5a9ce17	Fix WebAssembly broken build related to interface change in r281172. Reviewers: bkramer Subscribers: jfb, llvm-commits, dschuff Differential Revision: https://reviews.llvm.org/D24449 llvm-svn: 281201	2016-09-12 09:35:59 +00:00
Elena Demikhovsky	de1b494555	AVX-512: Added a test case that should be optimized in the future. NFC. llvm-svn: 281196	2016-09-12 06:26:03 +00:00
NAKAMURA Takumi	cf6aaa9e1a	llvm/test/CodeGen/AMDGPU/infinite-loop-evergreen.ll REQUIRES +Asserts. This might not crash with -Asserts. I saw it caused infinite loop in the codegen. llvm-svn: 281190	2016-09-12 04:27:28 +00:00
Duncan P. N. Exon Smith	cd0fffb6e1	MC: Move MCSection::begin/end to header, NFC llvm-svn: 281188	2016-09-12 00:17:09 +00:00
Sanjay Patel	60312bc45f	[InstCombine] add helper function for folding {and,or,xor} (cast X), C ; NFCI llvm-svn: 281187	2016-09-12 00:16:23 +00:00
Sanjay Patel	f9ca770225	[InstCombine] regenerate checks llvm-svn: 281186	2016-09-12 00:12:56 +00:00
Sanjay Patel	a2aabfcc17	[InstCombine] regenerate checks llvm-svn: 281185	2016-09-12 00:08:33 +00:00
Duncan P. N. Exon Smith	b5da005335	ADT: Never allocate nodes in iplist<> and ilist<> Remove createNode() and any API that depending on it, and add HasCreateNode to the list of checks for HasObsoleteCustomizations. Now an ilist never allocates (this was already true for iplist). This factors out all the differences between iplist and ilist. I'll aim to rename both to "owning_ilist" eventually, to call out the interesting (not exactly intrusive) ownership semantics. In the meantime, I've left both names around to reduce code churn. One of the deleted APIs is the ilist copy constructor. I've lifted up and tested iplist::cloneFrom (ala simple_ilist::cloneFrom) as a replacement. Users of ilist<> and iplist<> that want the list to allocate nodes have a few options: - use std::list; - use AllocatorList or BumpPtrList (or build a similarly trivial list); - use cloneFrom (which is explicit at the call site); or - allocate at the call site. See r280573, r281177, r281181, and r281182 for examples of what to do if you're updating out-of-tree code. llvm-svn: 281184	2016-09-11 23:43:43 +00:00
Duncan P. N. Exon Smith	eaf2827695	ADT: Fix build after r281182 Fix this bot: http://lab.llvm.org:8011/builders/clang-x86_64-linux-abi-test/builds/20680 which complained that a typedef of 'iterator_adaptor_base' changed meaning in AllocatorList::IteratorImpl. Use 'base_type' there instead. llvm-svn: 281183	2016-09-11 22:55:46 +00:00
Duncan P. N. Exon Smith	23d8306d13	ADT: Add AllocatorList, and use it for yaml::Token - Add AllocatorList, a non-intrusive list that owns an LLVM-style allocator and provides a std::list-like interface (trivially built on top of simple_ilist), - add a typedef (and unit tests) for BumpPtrList, and - use BumpPtrList for the list of llvm::yaml::Token (i.e., TokenQueueT). TokenQueueT has no need for the complexity of an intrusive list. The only reason to inherit from ilist was to customize the allocator. TokenQueueT was the only example in-tree of using ilist<> in a truly non-intrusive way. Moreover, this removes the final use of the non-intrusive ilist_traits<>::createNode (after r280573, r281177, and r281181). I have a WIP patch that removes this customization point (and the API that relies on it) that I plan to commit soon. Note: AllocatorList owns the allocator, which limits the viable API (e.g., splicing must be on the same list). For now I've left out any problematic API. It wouldn't be hard to split AllocatorList into two layers: an Impl class that calls DerivedT::getAlloc (via CRTP), and derived classes that handle Allocator ownership/reference/etc semantics; and then implement splice with appropriate assertions; but TBH we should probably just customize the std::list allocators at that point. llvm-svn: 281182	2016-09-11 22:40:40 +00:00
Duncan P. N. Exon Smith	3013ea1b63	Analysis: Only allow the move-constructor for IVUsers Force IVUsers to be moved instead of copied, properly update Parent pointers in IVStrideUse when IVUsers is moved, and make sure we have move constructors available in iplist and ilist. I came across this in a WIP patch that deleted the copy constructors from ilist. I was surprised to find that IVUsersAnalysis couldn't be registered in the new pass manager. It's not clear to me whether IVUsers was getting moved only when empty, but if it was being moved when it was non-empty then this fixes a pointer invalidation bug and should give some sort of speedup. Note that the bugfix would be necessary even for a copy constructor. llvm-svn: 281181	2016-09-11 22:11:37 +00:00
Craig Topper	7600794dde	[TwoAddressInstruction] When commuting an instruction don't assume that the destination register is operand 0. Pass it from the caller. In practice it probably is 0 so this may not be a functional change. llvm-svn: 281180	2016-09-11 22:10:42 +00:00
Lang Hames	958b699883	[ORC] Update examples for header changes in r281171. llvm-svn: 281178	2016-09-11 21:34:13 +00:00
Duncan P. N. Exon Smith	8b4e4af5ed	ScalarOpts: Use std::list for Candidates, NFC There is nothing intrusive about the Candidate list; use std::list over llvm::ilist for simplicity. llvm-svn: 281177	2016-09-11 21:29:34 +00:00
Duncan P. N. Exon Smith	077f5b41e4	ScalarOpts: Sort includes, NFC llvm-svn: 281176	2016-09-11 21:04:36 +00:00
Duncan P. N. Exon Smith	0e63013f27	ADT: Remove ilist_iterator::reset(), NFC ilist_iterator::reset was unnecessary API, and wasn't any clearer (or safer) at the call site than constructing a temporary and assigning it to the iterator. llvm-svn: 281175	2016-09-11 20:47:27 +00:00
Lang Hames	3e718e0818	[ORC] Fix the RPC unit test for header changes in r281171. llvm-svn: 281173	2016-09-11 19:12:19 +00:00
Duncan P. N. Exon Smith	1872096f1e	CodeGen: Give MachineBasicBlock::reverse_iterator a handle to the current MI Now that MachineBasicBlock::reverse_instr_iterator knows when it's at the end (since r281168 and r281170), implement MachineBasicBlock::reverse_iterator directly on top of an ilist::reverse_iterator by adding an IsReverse template parameter to MachineInstrBundleIterator. This replaces another hard-to-reason-about use of std::reverse_iterator on list iterators, matching the changes for ilist::reverse_iterator from r280032 (see the "out of scope" section at the end of that commit message). MachineBasicBlock::reverse_iterator now has a handle to the current node and has obvious invalidation semantics. r280032 has a more detailed explanation of how list-style reverse iterators (invalidated when the pointed-at node is deleted) are different from vector-style reverse iterators like std::reverse_iterator (invalidated on every operation). A great motivating example is this commit's changes to lib/CodeGen/DeadMachineInstructionElim.cpp. Note: If your out-of-tree backend deletes instructions while iterating on a MachineBasicBlock::reverse_iterator or converts between MachineBasicBlock::iterator and MachineBasicBlock::reverse_iterator, you'll need to update your code in similar ways to r280032. The following table might help: [Old] ==> [New] delete &RI, RE = end() delete &RI++ RI->erase(), RE = end() RI++->erase() reverse_iterator(I) std::prev(I).getReverse() reverse_iterator(I) ++I.getReverse() --reverse_iterator(I) I.getReverse() reverse_iterator(std::next(I)) I.getReverse() RI.base() std::prev(RI).getReverse() RI.base() ++RI.getReverse() --RI.base() RI.getReverse() std::next(RI).base() RI.getReverse() (For more details, have a look at r280032.) llvm-svn: 281172	2016-09-11 18:51:28 +00:00
Lang Hames	e98bc7af8b	[ORC] Rename RPCChannel to RPCByteChannel. NFC. llvm-svn: 281171	2016-09-11 18:41:05 +00:00
Duncan P. N. Exon Smith	3b22b18154	CodeGen: Assert that bundle iterators are valid Add an assertion to the MachineInstrBundleIterator from instr_iterator that the underlying iterator is valid. This is possible know that we can check ilist_node::isSentinel (since r281168), and is consistent with the constructors from MachineInstr* and MachineInstr&. Avoiding the new assertion in operator== and operator!= requires four (!!!!) new overloads each. (As an aside, I'm strongly in favour of: - making the conversion from instr_iterator explicit; - making the conversion from pointer explicit; - making the conversion from reference explicit; and - removing all the extra overloads of operator== and operator!= except const_instr_iterator. I'm not signing up for that at this point, but being clear about when something is an MachineInstr-iterator (possibly instr_end()) vs MachineInstr-bundle-iterator (possibly end()) vs MachineInstr* (possibly nullptr) vs MachineInstr& (known valid) would surely make code cleaner... and it would remove a ton of boilerplate from MachineInstrBundleIterator operators.) llvm-svn: 281170	2016-09-11 17:12:28 +00:00
Duncan P. N. Exon Smith	f887596ef8	Fix the modules build after r281167 Add an #include for <type_traits> to llvm/ADT/ilist_node_options.h to make it standalone. llvm-svn: 281169	2016-09-11 17:00:12 +00:00
Duncan P. N. Exon Smith	cc9edace0c	CodeGen: Turn on sentinel tracking for MachineInstr iterators This is a prep commit before fixing MachineBasicBlock::reverse_iterator invalidation semantics, ala r281167 for ilist::reverse_iterator. This changes MachineBasicBlock::Instructions to track which node is the sentinel regardless of LLVM_ENABLE_ABI_BREAKING_CHECKS. There's almost no functionality change (aside from ABI). However, in the rare configuration: #if !defined(NDEBUG) && !defined(LLVM_ENABLE_ABI_BREAKING_CHECKS) the isKnownSentinel() assertions in ilist_iterator<>::operator* suddenly have teeth for MachineInstr. If these assertions start firing for your out-of-tree backend, have a look at the suggestions in the commit message for r279314, and at some of the commits leading up to it that avoid dereferencing the end() iterator. llvm-svn: 281168	2016-09-11 16:38:18 +00:00
Duncan P. N. Exon Smith	085bbf1e2f	ADT: Add sentinel tracking and custom tags to ilists This adds two declarative configuration options for intrusive lists (available for simple_ilist, iplist, and ilist). Both of these options affect ilist_node interoperability and need to be passed both to the node and the list. Instead of adding a new traits class, they're specified as optional template parameters (in any order). The two options: 1. Pass ilist_sentinel_tracking<true> or ilist_sentinel_tracking<false> to control whether there's a bit on ilist_node "prev" pointer indicating whether it's the sentinel. The default behaviour is to use a bit if and only if LLVM_ENABLE_ABI_BREAKING_CHECKS. 2. Pass ilist_tag<TagA> and ilist_tag<TagB> to allow insertion of a single node into two different lists (simultaneously). I have an immediate use-case for (1) ilist_sentinel_tracking: fixing the validation semantics of MachineBasicBlock::reverse_iterator to match ilist::reverse_iterator (ala r280032: see the comments at the end of the commit message there). I'm adding (2) ilist_tag in the same commit to validate that the options framework supports expansion. Justin Bogner mentioned this might enable a possible cleanup in SelectionDAG, but I'll leave this to others to explore. In the meantime, the unit tests and the comments for simple_ilist and ilist_node have usage examples. Note that there's a layer of indirection to support optional, out-of-order, template paramaters. Internal classes are templated on an instantiation of the non-variadic ilist_detail::node_options. User-facing classes use ilist_detail::compute_node_options to compute the correct instantiation of ilist_detail::node_options. The comments for ilist_detail::is_valid_option describe how to add new options (e.g., ilist_packed_int<int NumBits>). llvm-svn: 281167	2016-09-11 16:20:53 +00:00
Igor Breger	e73ef85c6f	[AVX512] Fix pattern for vgetmantsd and all other instructions that use same class. Fix memory operand size, remove unnecessary pattern. Differential Revision: http://reviews.llvm.org/D24443 llvm-svn: 281164	2016-09-11 12:38:46 +00:00
James Molloy	a33571ccb4	Fixup failing debuginfo test for change in SimplifyCFG. This reverts this test back to its original pre-r280364 behaviour as we don't sink allocas any more. llvm-svn: 281163	2016-09-11 09:13:32 +00:00
James Molloy	104370ab37	[SimplifyCFG] Be even more conservative in SinkThenElseCodeToEnd This should actually fix PR30244. This cranks up the workaround for PR30188 so that we never sink loads or stores of allocas. The idea is that these should be removed by SROA/Mem2Reg, and any movement of them may well confuse SROA or just cause unwanted code churn. It's not ideal that the midend should be crippled like this, but that unwanted churn can really cause significant regressions in important workloads (tsan). llvm-svn: 281162	2016-09-11 09:00:03 +00:00
James Molloy	3e1ce05752	[AArch64] Fixup test after r281160 How I missed this locally is beyond me. I suspect llc didn't recompile. This is just changing the CHECK line back to what it was before r280364. llvm-svn: 281161	2016-09-11 08:24:04 +00:00
James Molloy	18d96e8fa5	[SimplifyCFG] Harden up the profitability heuristic for block splitting during sinking Exposed by PR30244, we will split a block currently if we think we can sink at least one instruction. However this isn't right - the reason we split predecessors is so that we can sink instructions that otherwise couldn't be sunk because it isn't safe to do so - stores, for example. So, change the heuristic to only split if it thinks it can sink at least one non-speculatable instruction. Should fix PR30244. llvm-svn: 281160	2016-09-11 08:07:30 +00:00
Craig Topper	1f81deee1f	[CodeGen] Make the TwoAddressInstructionPass check if the instruction is commutable before calling findCommutedOpIndices for every operand. Also make sure the operand is a register before each call to save some work on commutable instructions that might have an operand. llvm-svn: 281158	2016-09-11 06:00:15 +00:00
Craig Topper	3639cda748	[AVX-512] Add test cases to demonstrate opportunities for commuting vpternlog. Commuting will be added in a future commit. llvm-svn: 281157	2016-09-11 05:33:43 +00:00
Craig Topper	fb4564cf21	[AVX-512] Add VPTERNLOG to load folding tables. llvm-svn: 281156	2016-09-11 05:33:40 +00:00
Craig Topper	2c86705755	[X86] Side effecting asm in AVX512 integer stack folding test should return 2 x i64 not 8 x i64. llvm-svn: 281155	2016-09-11 05:33:38 +00:00
Craig Topper	69be1bd352	[X86] Make a helper method into a static function local to the cpp file. llvm-svn: 281154	2016-09-11 05:33:35 +00:00
Justin Lebar	11a3204355	Add handling of !invariant.load to PropagateMetadata. Summary: This will let e.g. the load/store vectorizer propagate this metadata appropriately. Reviewers: arsenm Subscribers: tra, jholewinski, hfinkel, mzolotukhin Differential Revision: https://reviews.llvm.org/D23479 llvm-svn: 281153	2016-09-11 01:39:08 +00:00
Justin Lebar	6d6b11a4a6	[NVPTX] Use ldg for explicitly invariant loads. Summary: With this change (plus some changes to prevent !invariant from being clobbered within llvm), clang will be able to model the __ldg CUDA builtin as an invariant load, rather than as a target-specific llvm intrinsic. This will let the optimizer play with these loads -- specifically, we should be able to vectorize them in the load-store vectorizer. Reviewers: tra Subscribers: jholewinski, hfinkel, llvm-commits, chandlerc Differential Revision: https://reviews.llvm.org/D23477 llvm-svn: 281152	2016-09-11 01:39:04 +00:00
Justin Lebar	adbf09e8cf	[CodeGen] Split out the notions of MI invariance and MI dereferenceability. Summary: An IR load can be invariant, dereferenceable, neither, or both. But currently, MI's notion of invariance is IR-invariant && IR-dereferenceable. This patch splits up the notions of invariance and dereferenceability at the MI level. It's NFC, so adds some probably-unnecessary "is-dereferenceable" checks, which we can remove later if desired. Reviewers: chandlerc, tstellarAMD Subscribers: jholewinski, arsenm, nemanjai, llvm-commits Differential Revision: https://reviews.llvm.org/D23371 llvm-svn: 281151	2016-09-11 01:38:58 +00:00
Sjoerd Meijer	e45b9a8c99	MCInstrDesc: Flags (uint64_t) was checked using (1 << MCID::XYZ), but this should have been (1ULL << MCID::XYZ). Currently this works because enum Flag has 31 values, but extending it will result in a compile warnings/errors. This was part of the accepted patch in https://reviews.llvm.org/D23601, but it was suggested to apply this first as a separate patch. llvm-svn: 281149	2016-09-10 20:54:19 +00:00
Arnold Schwaighofer	6c57f4f56d	It should also be legal to pass a swifterror parameter to a call as a swifterror argument. rdar://28233388 llvm-svn: 281147	2016-09-10 19:42:53 +00:00
Vedant Kumar	b1c174aa1c	[llvm-cov] Move the 'jump to first unexecuted line' link Having it in the same row as the source name is jarring. Move it next to the "Source" column label. llvm-svn: 281146	2016-09-10 19:37:26 +00:00
Vedant Kumar	7b9e9bb491	[llvm-cov] Minor visual tweaks for html reports - Change the location of the 'Region Coverage' column. - Use less css and text for some labels. llvm-svn: 281145	2016-09-10 19:37:20 +00:00
Arnold Schwaighofer	5d335559b9	InstCombine: Don't combine loads/stores from swifterror to a new type This generates invalid IR: the only users of swifterror can be call arguments, loads, and stores. rdar://28242257 llvm-svn: 281144	2016-09-10 18:14:57 +00:00
Arnold Schwaighofer	ad83002243	Add an isSwiftError predicate to Value llvm-svn: 281143	2016-09-10 18:14:54 +00:00
Duncan P. N. Exon Smith	34c4d2abfd	ADT: Move ilist_node_access to ilist_detail::NodeAccess... ... and make a few ilist-internal API changes, in preparation for changing how ilist_node is templated. The only effect for ilist users should be changing the friend target from llvm::ilist_node_access to llvm::ilist_detail::NodeAccess (which is only necessary when they inherit privately from ilist_node). - Split out SpecificNodeAccess, which has overloads of getNodePtr and getValuePtr that are untemplated. - Use more typedefs to prevent more changes later. - Force inheritance to use NodeAccess (to emphasize that ilist users* shouldn't be doing this). There should be no functionality change here. llvm-svn: 281142	2016-09-10 16:55:06 +00:00
Duncan P. N. Exon Smith	41aceac37f	ADT: Use typedefs for ilist_base and ilist_node_base, NFC This is a prep commit to minimize changes in a follow-up that is adding a template parameter to ilist_node_base and ilist_base. llvm-svn: 281141	2016-09-10 16:28:52 +00:00
Sanjay Patel	0a3d72bb93	[InstCombine] clean up foldICmpBinOpEqualityWithConstant / foldICmpIntrinsicWithConstant ; NFC 1. Rename variables to be consistent with related/preceding code (may want to reorganize). 2. Fix comments/formatting. llvm-svn: 281140	2016-09-10 15:33:39 +00:00
Sanjay Patel	f58f68c891	[InstCombine] rename and reorganize some icmp folding functions; NFC Everything under foldICmpInstWithConstant() should now be working for splat vectors via m_APInt matchers. Ie, I've removed all of the FIXMEs that I added while cleaning that section up. Note that not all of the associated FIXMEs in the regression tests are gone though, because some of the tests require earlier folds that are still scalar-only. llvm-svn: 281139	2016-09-10 15:03:44 +00:00
Arnold Schwaighofer	112ff66505	We also need to pass swifterror in R12 under swiftcc not only under ccc rdar://28190687 llvm-svn: 281138	2016-09-10 14:16:55 +00:00
Valery Pykhtin	b66e5eb612	[AMDGPU] Refactor MUBUF/MTBUF instructions Differential revision: https://reviews.llvm.org/D24295 llvm-svn: 281137	2016-09-10 13:09:16 +00:00
Teresa Johnson	027d7a0994	[gold/LTO] Add test case for r281134 Add test case that was supposed to go in with r281134. llvm-svn: 281135	2016-09-10 06:18:17 +00:00
Teresa Johnson	03c3511211	[LTO] Handle null GV in Symbol object Similar to other Symbol methods, have Symbol::getComdat handle a null GV gracefully. Fixes PR30326. llvm-svn: 281134	2016-09-10 06:15:26 +00:00
Heejin Ahn	99bd16b34b	[WebAssembly] Fix typos in comments llvm-svn: 281131	2016-09-10 02:33:47 +00:00
Kostya Serebryany	8c537c556a	[libFuzzer] print a failed-merge warning only in the merge mode llvm-svn: 281130	2016-09-10 02:17:22 +00:00
Matt Arsenault	3354f42ae7	AMDGPU: Implement is{LoadFrom\|StoreTo}FrameIndex llvm-svn: 281128	2016-09-10 01:20:33 +00:00
Matt Arsenault	7348a7eadd	AMDGPU: Fix scheduling info for spill pseudos These defaulted to Write32Bit. I don't think this actually matters since these don't exist during scheduling. llvm-svn: 281127	2016-09-10 01:20:28 +00:00
Vitaly Buka	3ac3aa50f6	[asan] Add flag to allow lifetime analysis of problematic allocas Summary: Could be useful for comparison when we suspect that alloca was skipped because of this. Reviewers: eugenis Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D24437 llvm-svn: 281126	2016-09-10 01:06:11 +00:00
Justin Lebar	d98cf00c95	[CodeGen] Rename MachineInstr::isInvariantLoad to isDereferenceableInvariantLoad. NFC Summary: I want to separate out the notions of invariance and dereferenceability at the MI level, so that they correspond to the equivalent concepts at the IR level. (Currently an MI load is MI-invariant iff it's IR-invariant and IR-dereferenceable.) First step is renaming this function. Reviewers: chandlerc Subscribers: MatzeB, jfb, llvm-commits Differential Revision: https://reviews.llvm.org/D23370 llvm-svn: 281125	2016-09-10 01:03:20 +00:00
Kostya Serebryany	4529960a3b	[libFuzzer] don't print help for internal flags llvm-svn: 281124	2016-09-10 00:35:30 +00:00
Kostya Serebryany	b991cc1f0e	[libFuzzer] print a visible message if merge fails due to a crash llvm-svn: 281122	2016-09-10 00:15:41 +00:00
Vedant Kumar	99b555709e	Remove dead code in the SelectionDAG headers (NFC) I tested this with "ninja check-llvm-codegen" on a Release build with all architectures enabled, and again with a Debug build on x86. Found with llvm-cov. Differential Revision: https://reviews.llvm.org/D24433 llvm-svn: 281120	2016-09-10 00:05:19 +00:00
Matt Arsenault	124384f08d	AMDGPU: Fix immediate folding logic when shrinking instructions If the literal is being folded into src0, it doesn't matter if it's an SGPR because it's being replaced with the literal. Also fixes initially selecting 32-bit versions of some instructions which also confused commuting. llvm-svn: 281117	2016-09-09 23:32:53 +00:00
Arnold Schwaighofer	c9277f40fd	Inliner: Don't mark swifterror allocas with lifetime markers This would create a bitcast use which fails the verifier: swifterror values may only be used by loads, stores, and as function arguments. rdar://28233244 llvm-svn: 281114	2016-09-09 22:40:27 +00:00
Hans Wennborg	6ecf619be9	X86: Fold tail calls into conditional branches also for 64-bit (PR26302) This extends the optimization in r280832 to also work for 64-bit. The only quirk is that we can't do this for 64-bit Windows (yet). Differential Revision: https://reviews.llvm.org/D24423 llvm-svn: 281113	2016-09-09 22:37:27 +00:00
Matt Arsenault	0efdd06b22	AMDGPU: Run LoadStoreVectorizer pass by default llvm-svn: 281112	2016-09-09 22:29:28 +00:00
Kostya Serebryany	1837152a34	[libFuzzer] use sizeof() in tests instead of 4 and 8 llvm-svn: 281111	2016-09-09 22:21:16 +00:00
Matt Arsenault	950a82047b	LSV: Fix incorrectly increasing alignment If the unaligned access has a dynamic offset, it may be odd which would make the adjusted alignment incorrect to use. llvm-svn: 281110	2016-09-09 22:20:14 +00:00
Davide Italiano	b4d0521c92	[gold] Test that we handle invalid directory correctly. I had this test sitting around for a while but always forgot to commit. Rafael reviewed it a while ago. Differential Revision: https://reviews.llvm.org/D19207 llvm-svn: 281109	2016-09-09 22:14:57 +00:00
Sanjay Patel	58109abe91	[InstCombine] use m_APInt to allow icmp ult X, C folds for splat constant vectors llvm-svn: 281107	2016-09-09 21:59:37 +00:00
Kostya Serebryany	4b17a331ae	[libFuzzer] one more puzzle for value profile llvm-svn: 281106	2016-09-09 21:58:42 +00:00
Simon Pilgrim	a3d1e03cd7	[X86][XOP] Fix VPERMIL2PD mask creation on 32-bit targets Use getConstVector helper to correctly create v2i64/v4i64 constants on 32-bit targets llvm-svn: 281105	2016-09-09 21:47:21 +00:00
Krzysztof Parzyszek	73e0ad8220	[Hexagon] Fix disassembler crash after r279255 When p0 was added as an explicit operand to the duplex subinstructions, the disassembler was not updated to reflect this. llvm-svn: 281104	2016-09-09 21:45:00 +00:00
Michael Kuperstein	b1153848ff	[X86] Regenerate test. NFC. llvm-svn: 281099	2016-09-09 21:36:17 +00:00
Justin Lebar	b1ad4e2cf4	Enable in-tree builds of parallel-libs. Summary: parallel-libs needs its own changes to make this work; these are just the LLVM changes. Reviewers: jhen Subscribers: llvm-commits, beanz, jprice Differential Revision: https://reviews.llvm.org/D24402 llvm-svn: 281097	2016-09-09 21:34:12 +00:00
Arnold Schwaighofer	7d7b4b4014	Create phi nodes for swifterror values at the end of the phi instructions list ISel makes assumption about the order of phi nodes. rdar://28190150 llvm-svn: 281095	2016-09-09 21:18:47 +00:00
Justin Lebar	b5e884976b	[NVPTX] Implement llvm.fabs.f32, llvm.max.f32, etc. Summary: Previously these only worked via NVPTX-specific intrinsics. This change will allow us to convert these target-specific intrinsics into the general LLVM versions, allowing existing LLVM passes to reason about their behavior. It also gets us some minor codegen improvements as-is, from situations where we canonicalize code into one of these llvm intrinsics. Reviewers: majnemer Subscribers: llvm-commits, jholewinski, tra Differential Revision: https://reviews.llvm.org/D24300 llvm-svn: 281092	2016-09-09 21:07:26 +00:00
Saleem Abdulrasool	92e33a3ebc	ARM: move the builtins libcall CC setup Move the target specific setup into the target specific lowering setup. As pointed out by Anton, the initial change was moving this too high up the stack resulting in a violation of the layering (the target generic code path setup target specific bits). Sink this into the ARM specific setup. NFC. llvm-svn: 281088	2016-09-09 20:11:31 +00:00
Michael Gottesman	f27ef41434	[cmake] Fix a stale comment from an earlier version of r281085. NFC. llvm-svn: 281086	2016-09-09 19:48:22 +00:00
Michael Gottesman	31e9363a3e	[cmake] Export gtest/gtest_main and its dependencies via a special build tree only cmake exports file. Previously, gtest/gtest_main were not exported via cmake. The intention here was to ensure that users whom are linking against the LLVM install tree would not get the gtest/gtest_main targets. This prevents downstream projects that link against the LLVM build tree (i.e. Swift) from getting this dependency information in their cmake builds. Without such dependency information, linker issues can result on linux due to LLVMSupport being put before gtest on the linker command line. This commit preserves behavior that we want for the install tree, while adding support for the build tree by: 1. The special casing for gtest/gtest_main in the add_llvm_library code is removed in favor of a flag called "BUILDTREE_ONLY". If this is set, then the library is communicating that it is only meant to be exported into the build tree and is not meant to be installed or exported via the install tree. This part is just a tweak to remove the special case, the underlying code is the same. 2. The cmake code that exports cmake targets for the build tree has special code to import an additional targets file called LLVMBuildTreeOnlyExports.cmake. Additionally the extra targets are added to the LLVMConfig.cmake's LLVM_EXPORTED_TARGETS variable. In contrast, the "installation" cmake file uses the normal LLVM_EXPORTS_TARGETS as before and does not include the extra exports file. This is implemented by defining/undefining variables when performing a configure of the build/install tree LLVMConfig.cmake files. llvm-svn: 281085	2016-09-09 19:45:34 +00:00
Rafael Espindola	46cdcb03ed	Add a lower level zlib::uncompress. SmallVectors are convenient, but they don't cover every use case. In particular, they are fairly large (3 pointers + one element) and there is no way to take ownership of the buffer to put it somewhere else. This patch then adds a lower lever interface that works with any buffer. llvm-svn: 281082	2016-09-09 19:32:36 +00:00
Wei Ding	06f8d39424	AMDGPU : Fix mqsad_u32_u8 instruction incorrect data type. Differential Revision: http://reviews.llvm.org/D23700 llvm-svn: 281081	2016-09-09 19:31:51 +00:00
Tom Stellard	b2869eb6e9	AMDGPU/SI: Make sure llvm.amdgcn.implicitarg.ptr() is 8-byte aligned for HSA Reviewers: arsenm Subscribers: arsenm, wdng, nhaehnle, llvm-commits Differential Revision: https://reviews.llvm.org/D24405 llvm-svn: 281080	2016-09-09 19:28:00 +00:00
Zachary Turner	36efbfa6d8	[pdb] Print out some more info when dumping a raw stream. We have various command line options that print the type of a stream, the size of a stream, etc but nowhere that it can all be viewed together. Since a previous patch introduced the ability to dump the bytes of a stream, this seems like a good place to present a full view of the stream's properties including its size, what kind of data it represents, and the blocks it occupies. So I added the ability to print that information to the -stream-data command line option. llvm-svn: 281077	2016-09-09 19:00:49 +00:00
Vedant Kumar	c236e5a0f1	[llvm-cov] Remove some asserts in the html renderer (NFC) These asserts are making tests fragile. The renderer does not enter an invalid state when they fail, however, it may spit out a garbled coverage report because the source text no longer matches the provided coverage mapping. Another follow-up to r281072. llvm-svn: 281076	2016-09-09 18:44:40 +00:00
Dehao Chen	22ce5eb051	Do not widen load for different variable in GVN. Summary: Widening load in GVN is too early because it will block other optimizations like PRE, LICM. https://llvm.org/bugs/show_bug.cgi?id=29110 The SPECCPU2006 benchmark impact of this patch: Reference: o2_nopatch (1): o2_patched Benchmark Base:Reference (1) ------------------------------------------------------- spec/2006/fp/C++/444.namd 25.2 -0.08% spec/2006/fp/C++/447.dealII 45.92 +1.05% spec/2006/fp/C++/450.soplex 41.7 -0.26% spec/2006/fp/C++/453.povray 35.65 +1.68% spec/2006/fp/C/433.milc 23.79 +0.42% spec/2006/fp/C/470.lbm 41.88 -1.12% spec/2006/fp/C/482.sphinx3 47.94 +1.67% spec/2006/int/C++/471.omnetpp 22.46 -0.36% spec/2006/int/C++/473.astar 21.19 +0.24% spec/2006/int/C++/483.xalancbmk 36.09 -0.11% spec/2006/int/C/400.perlbench 33.28 +1.35% spec/2006/int/C/401.bzip2 22.76 -0.04% spec/2006/int/C/403.gcc 32.36 +0.12% spec/2006/int/C/429.mcf 41.04 -0.41% spec/2006/int/C/445.gobmk 26.94 +0.04% spec/2006/int/C/456.hmmer 24.5 -0.20% spec/2006/int/C/458.sjeng 28 -0.46% spec/2006/int/C/462.libquantum 55.25 +0.27% spec/2006/int/C/464.h264ref 45.87 +0.72% geometric mean +0.23% For most benchmarks, it's a wash, but we do see stable improvements on some benchmarks, e.g. 447,453,482,400. Reviewers: davidxl, hfinkel, dberlin, sanjoy, reames Subscribers: gberry, junbuml Differential Revision: https://reviews.llvm.org/D24096 llvm-svn: 281074	2016-09-09 18:42:35 +00:00
Rui Ueyama	a5edf655af	Fix another -Wunused-variable for non-assert build. llvm-svn: 281073	2016-09-09 18:37:08 +00:00
Vedant Kumar	db7f924c03	[llvm-cov] Try to fix the native_separators.c test some more It's still breaking this bot (though, it looks like it always had been): http://lab.llvm.org:8011/builders/clang-x86-windows-msvc2015 This time, add quotes around llvm-{cov,config} so that lit won't expand them. Thanks to Reid for suggesting the patch! llvm-svn: 281072	2016-09-09 18:34:43 +00:00
Zachary Turner	72c5b6451f	[pdb] Add command line options for dumping individual streams and blocks I ran into a situation where I wanted to print out the contents of page 6 of a PDB as a binary blob, and there was no straightforward way to do that. In addition to adding that, this patch also adds the ability to dump a stream by index as a binary blob, and it will stitch together all the blocks and dump the whole thing as one seemingly contiguous sequence of bytes. llvm-svn: 281070	2016-09-09 18:17:52 +00:00
Rui Ueyama	47320da1ee	Fix -Wunused-variable for non-assert build. llvm-svn: 281069	2016-09-09 18:07:33 +00:00
Zachary Turner	9ba31a5efe	[pdb] Pass CVRecord's through the visitor as non-const references. This simplifies a lot of code, and will actually be necessary for an upcoming patch to serialize TPI record hash values. The idea before was that visitors should be examining records, not modifying them. But this is no longer true with a visitor that constructs a CVRecord from Yaml. To handle this until now, we were doing some fixups on CVRecord objects at a higher level, but the code is really awkward, and it makes sense to just have the visitor write the bytes into the CVRecord. In doing so I uncovered a few bugs related to `Data` and `RawData` and fixed those. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D24362 llvm-svn: 281067	2016-09-09 18:03:39 +00:00
Kostya Serebryany	00ef27112e	[libFuzzer] one more puzzle, value_profile cracks it in a second llvm-svn: 281066	2016-09-09 18:00:04 +00:00
Zachary Turner	c6d54da891	[pdb] Write PDB TPI Stream from Yaml. This writes the full sequence of type records described in Yaml to the TPI stream of the PDB file. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D24316 llvm-svn: 281063	2016-09-09 17:46:17 +00:00
Vedant Kumar	d938dfb308	[llvm-cov] Handle native paths correctly in the text index Treat filenames the same way in the text index as we do in the html index. This is a follow-up to r281008 (an attempt to unbreak the native_separators.c test on Windows). Patch by Maggie Yi! llvm-svn: 281062	2016-09-09 17:37:11 +00:00
Aaron Ballman	6a6e459800	Move the ocaml_all target into the Misc folder for cleanliness; NFC. llvm-svn: 281061	2016-09-09 17:30:59 +00:00
Reid Kleckner	1076288e22	[codeview] Don't assert if the array element type is incomplete This can happen when the frontend knows the debug info will be emitted somewhere else. Usually this happens for dynamic classes with out of line constructors or key functions, but it can also happen when modules are enabled. llvm-svn: 281060	2016-09-09 17:29:36 +00:00
Vedant Kumar	e32490933e	[Bitcode] Add compatibility test for the 3.9 release Fork off compatibility.ll for the 3.9 release. The *.bc file in this commit was produced using a Release build of the release_39 branch. llvm-svn: 281059	2016-09-09 17:24:31 +00:00
Sanjay Patel	6da0fb8c74	[InstCombine] add tests to show pattern matching failures due to commutation I was looking to fix a bug in getComplexity(), and these cases showed up as obvious failures. I'm not sure how to find these in general though. llvm-svn: 281055	2016-09-09 16:35:20 +00:00
Justin Lebar	aa4adccc4d	Attempt to fix static_asserts in SelectionDAGNodes.h that MSVC chokes on. These were added in r281051, which, I am embarrassed to admit, has an incomplete commit message that I forgot to update before pushing. You can ignore element (2) in that list. llvm-svn: 281054	2016-09-09 16:29:13 +00:00
Justin Lebar	8b1460250e	[SelectionDAG] Fix two issues with SDNode::getRawSubclassData(). 1) On some platforms, sizeof(SDNodeBits) == 1, so we were only copying one byte out of the bitfield when we wanted to copy two, and we were leaving half of the return value of getRawSubclassData() undefined. 2) Something something bitfields, not sure exactly what the issue or fix is, yet. (TODO) Summary: Previously we were assuming that SDNodeBits covered all of SDNode's anonymous subclass data bitfield union. But that's not right; it might have size 1, in which it clearly doesn't. This patch adds a field that does cover the whole union and adds static_asserts to ensure it stays correct. Reviewers: ahatanak, chandlerc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D24223 llvm-svn: 281051	2016-09-09 16:07:06 +00:00
Sam Kolton	1eeb11bfd4	AMDGPU] Assembler: better support for immediate literals in assembler. Summary: Prevously assembler parsed all literals as either 32-bit integers or 32-bit floating-point values. Because of this we couldn't support f64 literals. E.g. in instruction "v_fract_f64 v[0:1], 0.5", literal 0.5 was encoded as 32-bit literal 0x3f000000, which is incorrect and will be interpreted as 3.0517578125E-5 instead of 0.5. Correct encoding is inline constant 240 (optimal) or 32-bit literal 0x3FE00000 at least. With this change the way immediate literals are parsed is changed. All literals are always parsed as 64-bit values either integer or floating-point. Then we convert parsed literals to correct form based on information about type of operand parsed (was literal floating or binary) and type of expected instruction operands (is this f32/64 or b32/64 instruction). Here are rules how we convert literals: - We parsed fp literal: - Instruction expects 64-bit operand: - If parsed literal is inlinable (e.g. v_fract_f64_e32 v[0:1], 0.5) - then we do nothing this literal - Else if literal is not-inlinable but instruction requires to inline it (e.g. this is e64 encoding, v_fract_f64_e64 v[0:1], 1.5) - report error - Else literal is not-inlinable but we can encode it as additional 32-bit literal constant - If instruction expect fp operand type (f64) - Check if low 32 bits of literal are zeroes (e.g. v_fract_f64 v[0:1], 1.5) - If so then do nothing - Else (e.g. v_fract_f64 v[0:1], 3.1415) - report warning that low 32 bits will be set to zeroes and precision will be lost - set low 32 bits of literal to zeroes - Instruction expects integer operand type (e.g. s_mov_b64_e32 s[0:1], 1.5) - report error as it is unclear how to encode this literal - Instruction expects 32-bit operand: - Convert parsed 64 bit fp literal to 32 bit fp. Allow lose of precision but not overflow or underflow - Is this literal inlinable and are we required to inline literal (e.g. v_trunc_f32_e64 v0, 0.5) - do nothing - Else report error - Do nothing. We can encode any other 32-bit fp literal (e.g. v_trunc_f32 v0, 10000000.0) - Parsed binary literal: - Is this literal inlinable (e.g. v_trunc_f32_e32 v0, 35) - do nothing - Else, are we required to inline this literal (e.g. v_trunc_f32_e64 v0, 35) - report error - Else, literal is not-inlinable and we are not required to inline it - Are high 32 bit of literal zeroes or same as sign bit (32 bit) - do nothing (e.g. v_trunc_f32 v0, 0xdeadbeef) - Else - report error (e.g. v_trunc_f32 v0, 0x123456789abcdef0) For this change it is required that we know operand types of instruction (are they f32/64 or b32/64). I added several new register operands (they extend previous register operands) and set operand types to corresponding types: ''' enum OperandType { OPERAND_REG_IMM32_INT, OPERAND_REG_IMM32_FP, OPERAND_REG_INLINE_C_INT, OPERAND_REG_INLINE_C_FP, } ''' This is not working yet: - Several tests are failing - Problems with predicate methods for inline immediates - LLVM generated assembler parts try to select e64 encoding before e32. More changes are required for several AsmOperands. Reviewers: vpykhtin, tstellarAMD Subscribers: arsenm, kzhuravl, artem.tamazov Differential Revision: https://reviews.llvm.org/D22922 llvm-svn: 281050	2016-09-09 14:44:04 +00:00
Chris Dewhurst	c59f7c745b	[Sparc][LEON] Removed the parts of the errata fixes implemented using inline assembly as this is not the desired behaviour for end-users. Small change to a unit test to implement this without requiring the inline assembly. llvm-svn: 281047	2016-09-09 14:16:51 +00:00
James Molloy	57d9dfa9ac	[ARM] ADD with a negative offset can become SUB for free So model that directly in TTI::getIntImmCost(). llvm-svn: 281044	2016-09-09 13:35:36 +00:00
James Molloy	1454e90f86	[ARM] icmp %x, -C can be lowered to a simple ADDS or CMN Tell TargetTransformInfo about this so ConstantHoisting is informed. llvm-svn: 281043	2016-09-09 13:35:28 +00:00
Simon Pilgrim	153b408433	[SelectionDAG] Ensure DAG::getZeroExtendInReg is called with a scalar type Fixes issue with rL280927 identified by Mikael Holmén llvm-svn: 281042	2016-09-09 13:31:52 +00:00
James Molloy	4d86bed0bb	[Thumb] Select (CMPZ X, -C) -> (CMPZ (ADDS X, C), 0) The CMPZ #0 disappears during peepholing, leaving just a tADDi3, tADDi8 or t2ADDri. This avoids having to materialize the expensive negative constant in Thumb-1, and allows a shrinking from a 32-bit CMN to a 16-bit ADDS in Thumb-2. llvm-svn: 281040	2016-09-09 12:52:24 +00:00
Tim Northover	25d1286e5a	GlobalISel: remove G_TYPE and G_PHI These instructions were only necessary when type information was stored in the MachineInstr (because only generic MachineInstrs possessed a type). Now that it's in MachineRegisterInfo, COPY and PHI work fine. llvm-svn: 281037	2016-09-09 11:47:31 +00:00
Tim Northover	1f8b1db93e	GlobalISel: fix comments and add assertions for valid instructions. llvm-svn: 281036	2016-09-09 11:46:58 +00:00
Tim Northover	0f140c769a	GlobalISel: move type information to MachineRegisterInfo. We want each register to have a canonical type, which means the best place to store this is in MachineRegisterInfo rather than on every MachineInstr that happens to use or define that register. Most changes following from this are pretty simple (you need an MRI anyway if you're going to be doing any transformations, so just check the type there). But legalization doesn't really want to check redundant operands (when, for example, a G_ADD only ever has one type) so I've made use of MCInstrDesc's operand type field to encode these constraints and limit legalization's work. As an added bonus, more validation is possible, both in MachineVerifier and MachineIRBuilder (coming soon). llvm-svn: 281035	2016-09-09 11:46:34 +00:00
Simon Dardis	ba92b034bf	Revert "[mips] Fix c.<cc>.<fmt> instruction definition." This reverts commit r281022. Mips buildbot broke, due to unhandled register class FCC. llvm-svn: 281033	2016-09-09 11:06:01 +00:00
Sam Kolton	a2e5c88baf	[AMDGPU] Assembler: rename amd_kernel_code_t asm names according to spec Summary: Also removed duplicate code from AMDGPUTargetAsmStreamer. This change only change how amd_kernel_code_t is parsed and printed. No variable names are changed. Reviewers: vpykhtin, tstellarAMD Subscribers: arsenm, wdng, nhaehnle Differential Revision: https://reviews.llvm.org/D24296 llvm-svn: 281028	2016-09-09 10:08:02 +00:00
James Molloy	0f41227b21	[Thumb1] Teach optimizeCompareInstr about thumb1 compares This avoids us doing a completely unneeded "cmp r0, #0" after a flag-setting instruction if we only care about the Z or C flags. Add LSL/LSR to the whitelist while we're here and add testing. This code could really do with a spring clean. llvm-svn: 281027	2016-09-09 09:51:06 +00:00
Sam Kolton	d63d8a7c05	[AMDGPU] Assembler: match e32 VOP instructions before e64. Summary: Split assembler match table in 4 tables with assembler variants: Default - all instructions except VOP3, SDWA and DPP - VOP3 - SDWA - DPP First match Default table then VOP3, SDWA and DPP. Reviewers: tstellarAMD, artem.tamazov, vpykhtin Subscribers: arsenm, wdng, nhaehnle, AMDGPU Differential Revision: https://reviews.llvm.org/D24252 llvm-svn: 281023	2016-09-09 09:37:51 +00:00
Simon Dardis	8efa979029	[mips] Fix c.<cc>.<fmt> instruction definition. As part of this effort, remove MipsFCmp nodes and use tablegen patterns rather than custom lowering through C++. Unexpectedly, this improves codesize for microMIPS as previous floating point setcc expansions would materialize 0 and 1 into GPRs before using the relevant mov[tf].[sd] instruction. Now $zero is used directly. Reviewers: dsanders, vkalintiris, zoran.jovanovic Differential Review: https://reviews.llvm.org/D23118 llvm-svn: 281022	2016-09-09 09:22:52 +00:00
Chris Dewhurst	ddad6e028e	[Sparc][LEON] Unit test for CASA instruction supported by some LEON processors added. llvm-svn: 281021	2016-09-09 09:08:13 +00:00
Gor Nishanov	faf36c2e0b	[Coroutines] Part13: Handle single edge PHINodes across suspends Summary: If one of the uses of the value is a single edge PHINode, handle it. Original: %val = something <suspend> %p = PHINode [%val] After Spill + Part13: %val = something %slot = gep val.spill.slot store %val, %slot <suspend> %p = load %slot Plus tiny fixes/changes: * use correct index for coro.free in CoroCleanup * fixup id parameter in coro.free to allow authoring coroutine in plain C with __builtins Reviewers: majnemer Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D24242 llvm-svn: 281020	2016-09-09 05:39:00 +00:00
Amaury Sechet	5f04d819a5	Rationalise the attribute getter/setter methods on Function and CallSite. Summary: While woring on mapping attributes in the C API, it clearly appeared that the recent changes in the API on the C++ side left Function and Call/Invoke with an attribute API that grew in an ad hoc manner. This makes it difficult to work with it, because one doesn't know which overloads exists and which do not. Make sure that getter/setter function exists for both enum and string version. Remove inconsistent getter/setter, unless they have many callsites. This should make it easier to work with attributes in the future. This doesn't change how attribute works. Reviewers: bkramer, whitequark, mehdi_amini, void Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D21514 llvm-svn: 281019	2016-09-09 04:50:38 +00:00
Kostya Serebryany	b76a2a5503	[libFuzzer] improve -print_pcs to not print new PCs coming from libFuzzer itself llvm-svn: 281016	2016-09-09 02:38:28 +00:00
Kostya Serebryany	8ea4f9873b	[libFuzzer] remove unneeded call llvm-svn: 281014	2016-09-09 01:57:38 +00:00
Craig Topper	149e6bdc16	[AVX-512] Add VPCMP instructions to the load folding tables and make them commutable. llvm-svn: 281013	2016-09-09 01:36:10 +00:00
Craig Topper	1a1ac11625	[AVX-512] Add more integer vector comparison tests with loads. Some of these show opportunities where we can commute to fold loads. Commutes will be added in a followup commit. llvm-svn: 281012	2016-09-09 01:36:04 +00:00
Vedant Kumar	a59334da6b	[llvm-cov] Emit a summary in the report directory's index llvm-cov writes out an index file in '-output-dir' mode, albeit not a very informative one. Try to fix that by using the CoverageReport API to include some basic summary information in the index file. llvm-svn: 281011	2016-09-09 01:32:55 +00:00
Vedant Kumar	aae0ba7060	[llvm-cov] Constify some methods (NFC) llvm-svn: 281010	2016-09-09 01:32:51 +00:00
Vedant Kumar	627887b650	[llvm-cov] Add an API to prepare file reports (NFC) It would be nice to prepare file reports (using the CoverageReport API) without actually rendering them to the console. I plan on using this to flesh out the 'index' files in the coverage views. llvm-svn: 281009	2016-09-09 01:32:49 +00:00
Vedant Kumar	935bf9a890	[llvm-cov] Speculate fix for a Windows-only test (NFC) This test should have broken after r280896. Fix up the test case speculatively, since I don't have a way to test it. I wonder why I didn't get any angry bot emails about this. Maybe none of the win32 bots test llvm-cov? That could explain it, since the test says it 'REQUIRES: system-windows', which is restricted to win32 hosts. Also: why is 'system-windows' not defined for non-win32 Windows bots? llvm-svn: 281008	2016-09-09 01:32:47 +00:00
Kostya Serebryany	5c04bd250e	[libFuzzer] remove use_traces=1 since use_value_profile seems to be strictly better llvm-svn: 281007	2016-09-09 01:17:03 +00:00
David Majnemer	2c3ea55498	[X86] Tighten up a comment which confused x64 ABI terminology. The x64 ABI has two major function types: - frame functions - leaf functions A frame function is one which requires a stack frame. A leaf function is one which does not. A frame function may or may not have a frame pointer. A leaf function does not require a stack frame and may never modify SP except via a return (RET, tail call via JMP). A frame function which has a frame pointer is permitted to use the LEA instruction in the epilogue, a frame function without which doesn't establish a frame pointer must use ADD to adjust the stack pointer epilogue. Fun fact: Leaf functions don't require a function table entry (associated PDATA/XDATA). llvm-svn: 281006	2016-09-09 01:07:01 +00:00
Michael Kuperstein	ceff022800	[X86] Add more baseline tests for "irregular" shuffles. NFC. This adds more tests for shuffles where the output width does not match the input width and/or the output is generated from more than two inputs. llvm-svn: 281005	2016-09-09 00:49:29 +00:00
Hans Wennborg	c39ef776fc	Win64: Don't use REX prefix for direct tail calls The REX prefix should be used on indirect jmps, but not direct ones. For direct jumps, the unwinder looks at the offset to determine if it's inside the current function. Differential Revision: https://reviews.llvm.org/D24359 llvm-svn: 281003	2016-09-08 23:35:10 +00:00
Mike Aizatsky	f205a274c4	.clang-tidy: parameters and members should be CamelCased. Summary: Not sure why is it missing. Differential Revision: https://reviews.llvm.org/D24364 llvm-svn: 281002	2016-09-08 23:29:37 +00:00
Saleem Abdulrasool	ec6a774ed2	llvm-objdump: add missing ) in help output, NFC Add a missing ')' in the help output. NFC. llvm-svn: 281000	2016-09-08 23:17:34 +00:00
Kostya Serebryany	dec394935f	[libFuzzer] add missing docs llvm-svn: 280996	2016-09-08 22:21:13 +00:00
Dehao Chen	87823f8e4d	Remove debug info when hoisting instruction from then/else branch. Summary: The hoisted instruction is executed speculatively. It could affect the debugging experience as user would see gdb go into code that may not be expected to execute. It will also affect sample profile accuracy by assigning incorrect frequency to source within then/else branch. Reviewers: davidxl, dblaikie, chandlerc, kcc, echristo Subscribers: mehdi_amini, probinson, eric_niebler, andreadb, llvm-commits Differential Revision: https://reviews.llvm.org/D24164 llvm-svn: 280995	2016-09-08 21:53:33 +00:00
Sanjay Patel	a4c6223319	[InstCombine] regenerate checks llvm-svn: 280993	2016-09-08 21:40:21 +00:00
Matthew Simpson	bfe5e1817b	[LV] Ensure proper handling of multi-use case when collecting uniforms The test case included in r280979 wasn't checking what it was supposed to be checking for the predicated store case. Fixing the test revealed that the multi-use case (when a pointer is used by both vectorized and scalarized memory accesses) wasn't being handled properly. We can't skip over non-consecutive-like pointers since they may have looked consecutive-like with a different memory access. llvm-svn: 280992	2016-09-08 21:38:26 +00:00
Sanjay Patel	ed9fda01a3	[InstCombine] regenerate checks llvm-svn: 280991	2016-09-08 21:32:21 +00:00
Krzysztof Parzyszek	a1218728d3	[RDF] Further improve handling of multiple phis reached from shadows llvm-svn: 280987	2016-09-08 20:48:42 +00:00
Vedant Kumar	0b33f2c003	[llvm-cov] Fix issues with segment highlighting in the html view The text and html coverage views take different approaches to emitting highlighted regions. That's because this problem is easier in the text view: there's no need to worry about escaping text or adding tooltip content to a highlighted snippet. Unfortunately, the html view didn't get region highlighting quite right. This patch fixes the situation, bringing parity between the two views. llvm-svn: 280981	2016-09-08 19:18:23 +00:00
Matthew Simpson	408a3abcfe	[LV] Don't mark pointers used by scalarized memory accesses uniform Previously, all consecutive pointers were marked uniform after vectorization. However, if a consecutive pointer is used by a memory access that is eventually scalarized, the pointer won't remain uniform after all. An example is predicated stores. Even though a predicated store may be consecutive, it will still be scalarized, making it's pointer operand non-uniform. This patch updates the logic in collectLoopUniforms to consider the cases where a memory access may be scalarized. If a memory access may be scalarized, its pointer operand is not marked uniform. The determination of whether a given memory instruction will be scalarized or not has been moved into a common function that is used by the vectorizer, cost model, and legality analysis. Differential Revision: https://reviews.llvm.org/D24271 llvm-svn: 280979	2016-09-08 19:11:07 +00:00
Zachary Turner	22b652dbf8	[pdb] Make YamlTypeDumperCallbacks reuse *this. Previously we were making new instances of YamlTypeDumperCallbacks in order to recurse down and serialize / deserialize nested records such as field lists. This meant you could not pass context from a higher operation to a lower operation because it would be using a new instance of the visitor callback delegate. YAMLIO library was updated to support context-sensitive mappings, so now we can reuse the same instance of the visitor callback delegate even for nested operations. llvm-svn: 280978	2016-09-08 18:36:55 +00:00
Zachary Turner	35377f88f5	[YAMLIO] Add the ability to map with context. mapping a yaml field to an object in code has always been a stateless operation. You could still pass state by using the `setContext` function of the YAMLIO object, but this represented global state for the entire yaml input. In order to have context-sensitive state, it is necessary to pass this state in at the granularity of an individual mapping. This patch adds support for this type of context-sensitive state. You simply pass an additional argument of type T to the `mapRequired` or `mapOptional` functions, and provided you have specialized a `MappingContextTraits<U, T>` class with the appropriate mapping function, you can pass this context into the mapping function. Reviewed By: chandlerc Differential Revision: https://reviews.llvm.org/D24162 llvm-svn: 280977	2016-09-08 18:22:44 +00:00
Matt Arsenault	d745c28945	AMDGPU: Sign extend constants when splitting them This will confuse later passes which try to look at the immediate value and don't truncate first. llvm-svn: 280974	2016-09-08 17:44:36 +00:00
Krzysztof Parzyszek	a696b1b641	[Hexagon] Expand sext- and zextloads of vector types, not just extloads Recent change exposed this issue, breaking the Hexagon buildbots. llvm-svn: 280973	2016-09-08 17:42:14 +00:00
Matt Arsenault	be90f70d3a	AMDGPU: Try to commute when selecting s_addk_i32/s_mulk_i32 llvm-svn: 280972	2016-09-08 17:35:41 +00:00
Eric Christopher	98ddbdb563	AArch64 .arch directive - Include default arch attributes with extensions. Fix the .arch asm parser to use the full set of features for the architecture and any extensions on the command line. Add and update testcases accordingly as well as add an extension that was used but not supported. llvm-svn: 280971	2016-09-08 17:27:03 +00:00
Matt Arsenault	bbb47da8a1	AMDGPU: Support commuting with immediate in src0 llvm-svn: 280970	2016-09-08 17:19:29 +00:00
Renato Golin	049f387112	Revert "[XRay] ARM 32-bit no-Thumb support in LLVM" And associated commits, as they broke the Thumb bots. This reverts commit r280935. This reverts commit r280891. This reverts commit r280888. llvm-svn: 280967	2016-09-08 17:10:39 +00:00
Balaram Makam	c6cebf727c	[LoopDataPrefetch] Use range based for loop; NFCI Switch to range based for loop. No functional change, but more readable code. llvm-svn: 280966	2016-09-08 17:08:20 +00:00
Sanjay Patel	1c608f4323	[InstCombine] return a vector-safe true/false constant I introduced this potential bug by missing this diff in: https://reviews.llvm.org/rL280873 ...however, I'm not sure how to reach this code path with a regression test. We may be able to remove this code and assume that the transform to a constant is always handled by InstSimplify? llvm-svn: 280964	2016-09-08 16:54:02 +00:00
Dehao Chen	ebb715b119	Add unittest for r280760 llvm-svn: 280963	2016-09-08 16:53:40 +00:00
Simon Pilgrim	073472a2bf	[InstCombine][X86] Regenerate masked memory op combine tests llvm-svn: 280960	2016-09-08 16:32:37 +00:00
Simon Pilgrim	cd7b2830b9	[InstCombine][X86] Regenerate vperm2f128/vperm2i128 combine tests llvm-svn: 280959	2016-09-08 16:30:46 +00:00
Simon Pilgrim	3b1ecbe66c	[InstCombine][X86] Regenerate insertps combine tests llvm-svn: 280957	2016-09-08 16:15:21 +00:00
Vedant Kumar	2c5c878312	[Coverage] Delete some dead code (NFC) llvm-svn: 280956	2016-09-08 16:14:14 +00:00
Sam Kolton	1b746d1b9d	[TableGen] AsmMatcher: Add AsmVariantName to Instruction class. Summary: This allows specifying instructions that are available only in specific assembler variant. If AsmVariantName is specified then instruction will be presented only in MatchTable for this variant. If not specified then assembler variants will be determined based on AsmString. Also this allows splitting assembler match tables in same way as it is done in dissasembler. Reviewers: ab, tstellarAMD, craig.topper, vpykhtin Subscribers: wdng Differential Revision: https://reviews.llvm.org/D24249 llvm-svn: 280952	2016-09-08 15:50:52 +00:00
Reid Kleckner	1361c0c6ef	Remove restriction that 'sret' must be on the first parameter On Windows, it is often applied to the second parameter, and the x86 backend is prepared to deal with sret appearing on any parameter. Other backends assume it only appears on parameter zero, but those are target-specific requirements, and not an IR-level rule. llvm-svn: 280951	2016-09-08 15:45:27 +00:00
Reid Kleckner	cb5c98b14a	Give an x86 assembler test a triple llvm-svn: 280950	2016-09-08 15:40:43 +00:00
Dehao Chen	db3810771e	revert r280427 Refactor replaceDominatedUsesWith to have a flag to control whether to replace uses in BB itself. Summary: This is in preparation for LoopSink pass which calls replaceDominatedUsesWith to update after sinking. llvm-svn: 280949	2016-09-08 15:25:12 +00:00
Renato Golin	d257373887	[ARM XRay] Try to fix Thumb-only failure I mised the check that it had to support ARM to work. This commit tries to fix that, to make sure we don't emit ARM code in Thumb-only mode. llvm-svn: 280935	2016-09-08 13:45:10 +00:00

... 2 3 4 5 6 ...

138095 Commits