llvm-project

Commit Graph

Author	SHA1	Message	Date
Hemant Kulkarni	5f4ca2f371	llvm-size: Add --totals option Differential Revision: https://reviews.llvm.org/D24308 llvm-svn: 281233	2016-09-12 17:08:28 +00:00
Hemant Kulkarni	aecf9d0c86	llvm-objdump: Add --start-address and --stop-address options Differential Revision: https://reviews.llvm.org/D24160 llvm-svn: 281232	2016-09-12 17:08:22 +00:00
Sanjay Patel	f5887f1fbd	[InstCombine] use m_APInt to allow icmp X, C folds for splat constant vectors isSignBitCheck could be changed to take a pointer param to avoid the 'UnusedBit' ugliness. llvm-svn: 281231	2016-09-12 16:25:41 +00:00
Nicolai Haehnle	e58e0e3fe3	AMDGPU: Do not clobber SCC in SIWholeQuadMode Reviewers: arsenm, tstellarAMD, mareko Subscribers: arsenm, llvm-commits, kzhuravl Differential Revision: http://reviews.llvm.org/D22198 llvm-svn: 281230	2016-09-12 16:25:20 +00:00
Ahmed Bougacha	925961b20c	[GlobalISel] Fix mismatched "<..)" in intrinsic MO printing. NFC. llvm-svn: 281229	2016-09-12 16:21:49 +00:00
James Molloy	3d06ff22b7	Revert "[ARM] Promote small global constants to constant pools" This reverts commit r281213. It made a bot go bang: http://lab.llvm.org:8011/builders/clang-cmake-armv7-a15-full/builds/14625 llvm-svn: 281228	2016-09-12 16:18:23 +00:00
Jonathan Roelofs	96cb94b2a9	Trivial documentation fix regarding Obj-C ARC objc_arc_weak_reference_unavailable Fixed incorrect docs that referred to: objc_arc_weak_unavailable when it should be: objc_arc_weak_reference_unavailable Patch by: Sean McBride! llvm-svn: 281227	2016-09-12 16:14:52 +00:00
Pavel Labath	72090c2162	Move StdStringExtractor to tools/debugserver The class is only used in the debugserver. The rest of lldb has the StringExtractor class. Xcode project will need to be updated after this. llvm-svn: 281226	2016-09-12 16:13:05 +00:00
Jason Henline	57ea481945	[SE] RegisteredHostMemory for async device copies Summary: Improve the error-prone interface that allows users to pass host pointers that haven't been registered to asynchronous copy methods. In CUDA, this is an extremely easy error to make, and instead of failing at runtime, it succeeds and gives the right answers by turning the async copy into a sync copy. So, you silently get a huge performance degradation if you misuse the old interface. This new interface should prevent that. Reviewers: jlebar Subscribers: jprice, beanz, parallel_libs-commits Differential Revision: https://reviews.llvm.org/D24353 llvm-svn: 281225	2016-09-12 16:09:41 +00:00
Ahmed Bougacha	b678219aa6	[BranchFolding] Unique added live-ins after hoisting code. We're not supposed to have duplicate live-ins. llvm-svn: 281224	2016-09-12 16:05:31 +00:00
Ahmed Bougacha	45bfa8772f	[X86] Copy imp-uses when folding tailcall into conditional branch. r280832 added 32-bit support for emitting conditional tail-calls, but dropped imp-used parameter registers. This went unnoticed until r281113, which added 64-bit support, as this is only exposed with parameter passing via registers. Don't drop the imp-used parameters. llvm-svn: 281223	2016-09-12 16:05:27 +00:00
Rafael Espindola	7bd37870bc	Simplify handling of /DISCARD/. NFC. llvm-svn: 281222	2016-09-12 16:05:16 +00:00
David Majnemer	c83044d9bb	[FunctionAttrs] Don't try to infer returned if it is already on an argument Trying to infer the 'returned' attribute if an argument is already 'returned' can lead to verification failure: inference might determine that a different argument is passed through which would result in two different arguments marked as 'returned'. This fixes PR30350. llvm-svn: 281221	2016-09-12 16:04:59 +00:00
Sanjay Patel	0531f0a5bb	fix formatting; NFC llvm-svn: 281220	2016-09-12 15:52:28 +00:00
Sanjay Patel	db400baa80	[InstCombine] add tests to show missing vector folds llvm-svn: 281219	2016-09-12 15:51:42 +00:00
Igor Breger	a3e36da6f2	add select i1 test, reproduser pr30249. llvm-svn: 281218	2016-09-12 15:27:02 +00:00
Sanjay Patel	3151dec7f1	[InstCombine] add helper function for foldICmpUsingKnownBits; NFCI llvm-svn: 281217	2016-09-12 15:24:31 +00:00
Sam Kolton	fb0d9d9c13	[AMDGPU] Assembler: Move disabled SDWA and DPP instruction into Disable asm variant Summary: This removes disabled instructions from match tables so we will not match them at all. Reviewers: tstellarAMD, vpykhtin, artem.tamazov Subscribers: wdng, nhaehnle, arsenm Differential Revision: https://reviews.llvm.org/D24452 llvm-svn: 281216	2016-09-12 14:42:43 +00:00
James Molloy	1e1b56bd48	[Thumb] Teach ISel how to lower compares of AND bitmasks efficiently For the common pattern (CMPZ (AND x, #bitmask), #0), we can do some more efficient instruction selection if the bitmask is one consecutive sequence of set bits (32 - clz(bm) - ctz(bm) == popcount(bm)). 1) If the bitmask touches the LSB, then we can remove all the upper bits and set the flags by doing one LSLS. 2) If the bitmask touches the MSB, then we can remove all the lower bits and set the flags with one LSRS. 3) If the bitmask has popcount == 1 (only one set bit), we can shift that bit into the sign bit with one LSLS and change the condition query from NE/EQ to MI/PL (we could also implement this by shifting into the carry bit and branching on BCC/BCS). 4) Otherwise, we can emit a sequence of LSLS+LSRS to remove the upper and lower zero bits of the mask. 1-3 require only one 16-bit instruction and can elide the CMP. 4 requires two 16-bit instructions but can elide the CMP and doesn't require materializing a complex immediate, so is also a win. llvm-svn: 281215	2016-09-12 14:30:48 +00:00
Sanjay Patel	5352331716	fix formatting/typos; NFC llvm-svn: 281214	2016-09-12 14:25:46 +00:00
James Molloy	8f82d45ff4	[ARM] Promote small global constants to constant pools If a constant is unamed_addr and is only used within one function, we can save on the code size and runtime cost of an indirection by changing the global's storage to inside the constant pool. For example, instead of: ldr r0, .CPI0 bl printf bx lr .CPI0: &format_string format_string: .asciz "hello, world!\n" We can emit: adr r0, .CPI0 bl printf bx lr .CPI0: .asciz "hello, world!\n" This can cause significant code size savings when many small strings are used in one function (4 bytes per string). llvm-svn: 281213	2016-09-12 13:42:16 +00:00
Chad Rosier	a4c424654e	[LoopInterchange] Improve debug output. NFC. llvm-svn: 281212	2016-09-12 13:24:47 +00:00
Pablo Barrio	0bebc38abb	Fix the Thumb test for vfloat intrinsics Summary: This test was not testing the intrinsics. A function like this: define %v4f32 @test_v4f32.floor(%v4f32 %a){ ... %1 = call %v4f32 @llvm.floor.v4f32(%v4f32 %a) ... } is transformed into the following assembly: _test_v4f32.floor: @ @test_v4f32.floor ... bl _floorf ... In each function tested, there are two CHECK: one that checked for the label and another one for the intrinsic that should be used inside the function (in our case, "floor"). However, although the first CHECK was matching the label, the second was not matching the intrinsic, but the second "floor" in the same line as the label. This is fixed by making the first CHECK match the entire line. Reviewers: jmolloy, rengolin Subscribers: rengolin, llvm-commits Differential Revision: https://reviews.llvm.org/D24398 llvm-svn: 281211	2016-09-12 13:14:14 +00:00
Rafael Espindola	c7e1e03498	Store an ArrayRef for Data in InputSectionData. llvm-svn: 281210	2016-09-12 13:13:53 +00:00
Rafael Espindola	54f1614ec1	Revert "Revert "Compact InputSectionData from 64 to 48 bytes. NFC."" This reverts commit r281096. The previous link errors should be fixed by r281208. llvm-svn: 281209	2016-09-12 13:06:10 +00:00
Rafael Espindola	74941239d8	Define a dummy zlib::uncompress when zlib is not available. Should fix link errors in some bots when it is used. llvm-svn: 281208	2016-09-12 13:00:51 +00:00
Tim Northover	032548fc5e	GlobalISel: support translation of global addresses. llvm-svn: 281207	2016-09-12 12:10:41 +00:00
Daniel Marjamaki	03ea468a1c	[clang-tidy] readability-misplaced-array-index: add new check that warns when array index is misplaced. Reviewers: alexfh Differential Revision: https://reviews.llvm.org/D21134 llvm-svn: 281206	2016-09-12 12:04:13 +00:00
Tim Northover	a7653b3919	GlobalISel: translate GEP instructions. Unlike SDag, we use a separate G_GEP instruction (much simplified, only taking a single byte offset) to preserve the pointer type information through selection. llvm-svn: 281205	2016-09-12 11:20:22 +00:00
Tim Northover	d28d3cc079	GlobalISel: disambiguate types when printing MIR Some generic instructions have multiple types. While in theory these always be discovered by inspecting the single definition of each generic vreg, in practice those definitions won't always be local and traipsing through a big function to find them will not be fun. So this changes MIRPrinter to print out the type of uses as well as defs, if they're known to be different or not known to be the same. On the parsing side, we're a little more flexible: provided each register is given a type in at least one place it's mentioned (and all types are consistent) we accept the MIR. This doesn't introduce ambiguity but makes writing tests manually a bit less painful. llvm-svn: 281204	2016-09-12 11:20:10 +00:00
Daniel Jasper	c6a123111a	clang-format: Make emacs integration work with narrowed buffers. Use (call-process region nil ...) instead of (point-min) so that the call works in narrowed buffers. Patch by Philipp Stephani, thank you! llvm-svn: 281203	2016-09-12 10:02:46 +00:00
Eugene Leviant	99da752980	[ELF/AArch64] Implement some UABS relocs Differential revision: https://reviews.llvm.org/D24403 llvm-svn: 281202	2016-09-12 10:02:41 +00:00
Eric Liu	c7e5a9ce17	Fix WebAssembly broken build related to interface change in r281172. Reviewers: bkramer Subscribers: jfb, llvm-commits, dschuff Differential Revision: https://reviews.llvm.org/D24449 llvm-svn: 281201	2016-09-12 09:35:59 +00:00
Martin Bohme	0eb4403f24	[CFG] Add iterator_ranges to CFG and CFGBlock. Summary: (Needed for D23353.) Reviewers: alexfh Subscribers: cfe-commits Differential Revision: https://reviews.llvm.org/D23842 llvm-svn: 281200	2016-09-12 08:28:21 +00:00
Ilia K	94df34f72d	Add MiSyntaxTestCase.test_lldbmi_output_grammar test (MI) Summary: This patch adds a new test and fixes extra new-line before exit Reviewers: abidh Subscribers: ki.stfu, dawn, lldb-commits, abidh Differential Revision: https://reviews.llvm.org/D9740 llvm-svn: 281199	2016-09-12 07:14:51 +00:00
Richard Smith	67462ffce9	Add virtual destructor (necessary due to the switch to shared_ptr). llvm-svn: 281198	2016-09-12 06:51:11 +00:00
Richard Smith	94a2fe5c8d	Attempt #3 to placate MSVC. llvm-svn: 281197	2016-09-12 06:38:31 +00:00
Elena Demikhovsky	de1b494555	AVX-512: Added a test case that should be optimized in the future. NFC. llvm-svn: 281196	2016-09-12 06:26:03 +00:00
Richard Smith	c14994f290	Attempt #2 to placate MSVC llvm-svn: 281195	2016-09-12 06:23:26 +00:00
Richard Smith	cd608d1a20	Attempt to placate MSVC. llvm-svn: 281194	2016-09-12 06:13:44 +00:00
Tobias Grosser	5857b701a3	GPGPU: Bail out gracefully in case of invalid IR Instead of aborting, we now bail out gracefully in case the kernel IR we generate is invalid. This can currently happen in case the SCoP stores pointer values, which we model as arrays, as data values into other arrays. In this case, the original pointer value is not available on the device and can consequently not be stored. As detecting this ahead of time is not so easy, we detect these situations after the invalid IR has been generated and bail out. llvm-svn: 281193	2016-09-12 06:06:31 +00:00
Richard Smith	b6a3b4ba61	Add a mode to clang-tblgen to generate reference documentation for warning and remark flags. For now I'm checking in a copy of the built documentation, but we can replace this with a placeholder (as we do for the attributes reference documentation) once we enable building this server-side. llvm-svn: 281192	2016-09-12 05:58:29 +00:00
Ilia K	4f730dc750	Fix about a dozen compile warnings Summary: It fixes the following compile warnings: 1. '0' flag ignored with precision and ‘%d’ gnu_printf format 2. enumeral and non-enumeral type in conditional expression 3. format ‘%d’ expects argument of type ‘int’, but argument 4 has type ... 4. enumeration value ‘...’ not handled in switch 5. cast from type ‘const uint64_t* {aka ...}’ to type ‘int64_t* {aka ...}’ casts away qualifiers 6. extra ‘;’ 7. comparison between signed and unsigned integer expressions 8. variable ‘register_operand’ set but not used 9. control reaches end of non-void function Reviewers: jingham, emaste, zturner, clayborg Subscribers: lldb-commits Differential Revision: https://reviews.llvm.org/D24331 llvm-svn: 281191	2016-09-12 05:25:33 +00:00
NAKAMURA Takumi	cf6aaa9e1a	llvm/test/CodeGen/AMDGPU/infinite-loop-evergreen.ll REQUIRES +Asserts. This might not crash with -Asserts. I saw it caused infinite loop in the codegen. llvm-svn: 281190	2016-09-12 04:27:28 +00:00
David Majnemer	cb60a4305b	[MS ABI] Add /include directives for dynamic TLS MSVC emits /include directives in the .drective section for the __dyn_tls_init function (decorated as ___dyn_tls_init@12 for 32-bit). This fixes PR30347. llvm-svn: 281189	2016-09-12 02:51:43 +00:00
Duncan P. N. Exon Smith	cd0fffb6e1	MC: Move MCSection::begin/end to header, NFC llvm-svn: 281188	2016-09-12 00:17:09 +00:00
Sanjay Patel	60312bc45f	[InstCombine] add helper function for folding {and,or,xor} (cast X), C ; NFCI llvm-svn: 281187	2016-09-12 00:16:23 +00:00
Sanjay Patel	f9ca770225	[InstCombine] regenerate checks llvm-svn: 281186	2016-09-12 00:12:56 +00:00
Sanjay Patel	a2aabfcc17	[InstCombine] regenerate checks llvm-svn: 281185	2016-09-12 00:08:33 +00:00
Duncan P. N. Exon Smith	b5da005335	ADT: Never allocate nodes in iplist<> and ilist<> Remove createNode() and any API that depending on it, and add HasCreateNode to the list of checks for HasObsoleteCustomizations. Now an ilist never allocates (this was already true for iplist). This factors out all the differences between iplist and ilist. I'll aim to rename both to "owning_ilist" eventually, to call out the interesting (not exactly intrusive) ownership semantics. In the meantime, I've left both names around to reduce code churn. One of the deleted APIs is the ilist copy constructor. I've lifted up and tested iplist::cloneFrom (ala simple_ilist::cloneFrom) as a replacement. Users of ilist<> and iplist<> that want the list to allocate nodes have a few options: - use std::list; - use AllocatorList or BumpPtrList (or build a similarly trivial list); - use cloneFrom (which is explicit at the call site); or - allocate at the call site. See r280573, r281177, r281181, and r281182 for examples of what to do if you're updating out-of-tree code. llvm-svn: 281184	2016-09-11 23:43:43 +00:00

... 2 3 4 5 6 ...

241955 Commits All Branches Search

241955 Commits

All Branches