llvm-project

Commit Graph

Author	SHA1	Message	Date
Mehdi Amini	06a47806ee	[ThinLTO] Add an option to llvm-lto to print some basic statistics for the index Differential Revision: https://reviews.llvm.org/D24290 llvm-svn: 281537	2016-09-14 21:04:59 +00:00
Matt Arsenault	1b9fc8ed65	Finish renaming remaining analyzeBranch functions llvm-svn: 281535	2016-09-14 20:43:16 +00:00
Sanjoy Das	23f06e53d8	[Stackmap] Added callsite counts to emitted function information. Summary: It was previously not possible for tools to use solely the stackmap information emitted to reconstruct the return addresses of callsites in the map, which is necessary to use the information to walk a stack. This patch adds per-function callsite counts when emitting the stackmap section in order to resolve the problem. Note that this slightly alters the stackmap format, so external tools parsing these maps will need to be updated. Problem Details: Records only store their offset from the beginning of the function they belong to. While these records and the functions are output in program order, it is not possible to determine where the end of one function's records are without the callsite count when processing the records to compute return addresses. Patch by Kavon Farvardin! Reviewers: atrick, ributzka, sanjoy Subscribers: nemanjai Differential Revision: https://reviews.llvm.org/D23487 llvm-svn: 281532	2016-09-14 20:22:03 +00:00
Sanjay Patel	3caf155bd3	[x86] regenerate checks llvm-svn: 281531	2016-09-14 20:21:28 +00:00
Sanjay Patel	c0899b961a	[x86] regenerate checks llvm-svn: 281529	2016-09-14 20:16:24 +00:00
Evgeniy Stepanov	e97d3b90b9	Revert "[ARM] Promote small global constants to constant pools" Breaks Android tests by introducing text relocations to ARM binaries. http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux/builds/25362/steps/run%20asan%20lit%20tests%20%5Barm%2Fbullhead-userdebug%2FMTC20F%5D/logs/stdio llvm-svn: 281526	2016-09-14 20:02:30 +00:00
Sanjay Patel	d56a27e242	[x86] regenerate checks llvm-svn: 281523	2016-09-14 19:42:03 +00:00
Davide Italiano	63e8f444c7	[lib/LTO] Fix a typo. NFC. llvm-svn: 281517	2016-09-14 18:48:43 +00:00
Matt Arsenault	f40b70fa75	Revert "AMDGPU: Use SOPK compare instructions" Accidentally committed llvm-svn: 281514	2016-09-14 18:04:42 +00:00
Matt Arsenault	f757c87959	AMDGPU: Use SOPK compare instructions llvm-svn: 281513	2016-09-14 18:03:53 +00:00
Adrian Prantl	a2ef047bd9	Verifier: Mark orphaned DICompileUnits as a debug info failure. This is a follow-up to r268778 that adds a couple of missing cases, most notably orphaned compile units. rdar://problem/28193346 llvm-svn: 281508	2016-09-14 17:30:37 +00:00
Matt Arsenault	e8e0f5cac6	Make analyzeBranch family of instruction names consistent analyzeBranch was renamed to use lowercase first, rename the related set to match. llvm-svn: 281506	2016-09-14 17:24:15 +00:00
Matt Arsenault	a2b036e88b	AArch64: Use TTI branch functions in branch relaxation The main change is to return the code size from InsertBranch/RemoveBranch. Patch mostly by Tim Northover llvm-svn: 281505	2016-09-14 17:23:48 +00:00
Sanjay Patel	c531c9ebf5	[x86] fix formatting; NFC llvm-svn: 281504	2016-09-14 17:23:18 +00:00
Etienne Bergeron	752f8839a4	[compiler-rt] Avoid instrumenting sanitizer functions Summary: Function __asan_default_options is called by __asan_init before the shadow memory got initialized. Instrumenting that function may lead to flaky execution. As the __asan_default_options is provided by users, we cannot expect them to add the appropriate function atttributes to avoid instrumentation. Reviewers: kcc, rnk Subscribers: dberris, chrisha, llvm-commits Differential Revision: https://reviews.llvm.org/D24566 llvm-svn: 281503	2016-09-14 17:18:37 +00:00
Simon Pilgrim	a369219ce6	[X86][SSE] Improve recognition of i64 sitofp conversions that can be performed as i32 (PR29078) Until AVX512DQ we only support i64/vXi64 sitofp conversion as scalars. This patch sees if the sign bit extends far enough that we can truncate to a i32 type and then perform sitofp without loss of precision. Differential Revision: https://reviews.llvm.org/D24345 llvm-svn: 281502	2016-09-14 17:15:26 +00:00
Chad Rosier	e6b3a63a3d	[LoopInterchange] Typo. NFC. llvm-svn: 281501	2016-09-14 17:12:30 +00:00
Chad Rosier	72431890b1	[LoopInterchange] Add CL option to override cost threshold. Mostly useful for getting consistent lit testing. llvm-svn: 281500	2016-09-14 17:07:13 +00:00
Simon Pilgrim	fbbb28ebb3	[X86][SSE] Don't use PSHUFD directly - lower with generic shuffle Remove the last user of the old getTargetShuffleNode helpers llvm-svn: 281499	2016-09-14 17:04:22 +00:00
Sanjay Patel	284582b6d4	getValueType().getScalarSizeInBits() -> getScalarValueSizeInBits(), round 2 ; NFCI llvm-svn: 281498	2016-09-14 16:54:10 +00:00
Chad Rosier	58ede270a7	[LoopInterchange] Cleanup debug whitespace. NFC. llvm-svn: 281497	2016-09-14 16:43:19 +00:00
Sanjay Patel	1ed771f5d7	getVectorElementType().getSizeInBits() -> getScalarSizeInBits() ; NFCI llvm-svn: 281495	2016-09-14 16:37:15 +00:00
Sanjay Patel	b1f0a0f4a8	getValueType().getSizeInBits() -> getValueSizeInBits() ; NFCI llvm-svn: 281493	2016-09-14 16:05:51 +00:00
Etienne Bergeron	9bd4281006	Fix typo in comment [NFC] llvm-svn: 281492	2016-09-14 15:59:32 +00:00
Matt Arsenault	2bc198a333	AMDGPU: Support folding FrameIndex operands This avoids test regressions in a future commit. llvm-svn: 281491	2016-09-14 15:51:33 +00:00
Sanjay Patel	5f6bb6cd24	getValueType().getScalarSizeInBits() -> getScalarValueSizeInBits() ; NFCI llvm-svn: 281490	2016-09-14 15:43:44 +00:00
Sanjay Patel	bd6fca1419	getScalarType().getSizeInBits() -> getScalarSizeInBits() ; NFCI llvm-svn: 281489	2016-09-14 15:21:00 +00:00
Matt Arsenault	fa5f767a38	AMDGPU: Improve splitting 64-bit bit ops by constants This addresses a TODO to handle operations besides and. This also starts eliminating no-op operations with a constant that can emerge later. llvm-svn: 281488	2016-09-14 15:19:03 +00:00
Matthew Simpson	b25e87fca5	[LV] Process pointer IVs with PHINodes in collectLoopUniforms This patch moves the processing of pointer induction variables in collectLoopUniforms from the consecutive pointer phase of the analysis to the phi node phase. Previously, if a pointer induction variable was used by both a scalarized non-memory instruction as well as a vectorized memory instruction, we would incorrectly identify the pointer as uniform. Pointer induction variables should be treated the same as other phi nodes. That is, they are uniform if all users of the induction variable and induction variable update are uniform. Differential Revision: https://reviews.llvm.org/D24511 llvm-svn: 281485	2016-09-14 14:47:40 +00:00
James Molloy	13065b00ba	[ARM] Promote small global constants to constant pools If a constant is unamed_addr and is only used within one function, we can save on the code size and runtime cost of an indirection by changing the global's storage to inside the constant pool. For example, instead of: ldr r0, .CPI0 bl printf bx lr .CPI0: &format_string format_string: .asciz "hello, world!\n" We can emit: adr r0, .CPI0 bl printf bx lr .CPI0: .asciz "hello, world!\n" This can cause significant code size savings when many small strings are used in one function (4 bytes per string). llvm-svn: 281484	2016-09-14 14:47:27 +00:00
Sjoerd Meijer	d3b2321f78	MCInstrDesc: this fixes an issue setting/getting member Flags, which is an uint64_t. However, getter function getFlags returned an unsigned, and in function hasProperty (1 << MCFlag) was used instead of (1ULL << MCFlag). llvm-svn: 281483	2016-09-14 14:32:17 +00:00
Simon Pilgrim	ec2d206669	[X86][SSE] Removed unused getTargetShuffleNode function llvm-svn: 281481	2016-09-14 14:30:00 +00:00
Simon Pilgrim	67fdd15cf9	[X86] Added i128 lshr+shl -> mask combine test llvm-svn: 281480	2016-09-14 14:29:16 +00:00
Nemanja Ivanovic	d5deb4896c	Fix code-gen crash on Power9 for insert_vector_elt with variable index (PR30189) This patch corresponds to review: https://reviews.llvm.org/D24021 In the initial implementation of this instruction, I forgot to account for variable indices. This patch fixes PR30189 and should probably be merged into 3.9.1 (I'll open a bug according to the new instructions). llvm-svn: 281479	2016-09-14 14:19:09 +00:00
Andrea Di Biagio	e8e1af3649	[InstCombine] Merged two test files and regenerated checks using update_test_checks.py. NFC. llvm-svn: 281478	2016-09-14 14:18:21 +00:00
Silviu Baranga	0a020f0fb0	[StackProtector] Use INITIALIZE_TM_PASS instead of INITIALIZE_PASS in order to make sure that its TargetMachine constructor is registered. This allows us to run the PEI machine pass with MIR input (see PR30324). llvm-svn: 281474	2016-09-14 14:09:43 +00:00
Nemanja Ivanovic	a103d104e1	Adding missing directive for Power9. There is currently no codegen for Power9 that depends on the directive so this is NFC for now but will be important in the future. This was missed in r268950 so I'm adding it now. llvm-svn: 281473	2016-09-14 14:09:39 +00:00
Simon Pilgrim	ba325e3a73	[X86][SSE] Don't blend vector shifts with MOVSS/MOVSD directly, lower from generic shuffle Shuffle lowering will correctly lower to MOVSS/MOVSD/PBLEND, improving commutation opportunities llvm-svn: 281471	2016-09-14 14:08:18 +00:00
Kuba Brecka	a1ea64a044	[asan] Enable -asan-use-private-alias on Darwin/Mach-O, add test for ODR false positive with LTO (llvm part) The '-asan-use-private-alias’ option (disabled by default) option is currently only enabled for Linux and ELF, but it also works on Darwin and Mach-O. This option also fixes a known problem with LTO on Darwin (https://github.com/google/sanitizers/issues/647). This patch enables the support for Darwin (but still keeps it off by default) and adds the LTO test case. Differential Revision: https://reviews.llvm.org/D24292 llvm-svn: 281470	2016-09-14 14:06:33 +00:00
James Molloy	9790d8f81d	Revert "[Thumb] Teach ISel how to lower compares of AND bitmasks efficiently" This reverts commit r281323. It caused chromium test failures and a selfhost failure. llvm-svn: 281451	2016-09-14 09:45:28 +00:00
Vassil Vassilev	2ec8b1506a	Missing includes. llvm-svn: 281450	2016-09-14 08:55:18 +00:00
Tim Northover	1c7825fd79	GlobalISel: mark pointer stores as legal on AArch64. llvm-svn: 281448	2016-09-14 08:28:54 +00:00
Sjoerd Meijer	724023a1ec	This reapplies r281304. The issue was that I had missed to copy the new isAdd field in the tablegen data structure. llvm-svn: 281447	2016-09-14 08:20:03 +00:00
Elena Demikhovsky	0569d9d588	AVX-512: Fixed a bug in kortest.z intrinsic Lowering was wrong - X86ISD::SETCC node should return i8 type. llvm-svn: 281446	2016-09-14 08:06:54 +00:00
Igor Breger	74813fc19c	[AVX512BW] Change truncStore action (v16i16->v16i18). It can be legal only with AVX512VL. Differential Revision: http://reviews.llvm.org/D24547 llvm-svn: 281445	2016-09-14 08:04:28 +00:00
Craig Topper	4e2d5a43cf	[X86] Remove the VCVTSI2SD32 with rounding intrinsic. It's not used by clang and not needed since 32-bit integer to double is always exact. llvm-svn: 281442	2016-09-14 06:27:46 +00:00
Wei Mi	24662395df	Create a getelementptr instead of sub expr for ValueOffsetPair if the value is a pointer. This patch is to fix PR30213. When expanding an expr based on ValueOffsetPair, if the value is of pointer type, we can only create a getelementptr instead of sub expr. Differential Revision: https://reviews.llvm.org/D24088 llvm-svn: 281439	2016-09-14 04:39:50 +00:00
Tobias Grosser	12cc2b80b6	Ensure Polly linking works without BUILD_SHARED_LIBS This change ensures all necessary symbols are resolved correctly. Before this change on some systems, the linker may have eliminated some symbols not directly used in bugpoint, but used in Polly. Suggested-by: Michael Kruse <lvm@meinersbur.de> llvm-svn: 281438	2016-09-14 03:09:48 +00:00
Peter Collingbourne	0758644461	gold: Simplify. Do not unnecessarily enumerate Obj's symbols. llvm-svn: 281437	2016-09-14 02:55:16 +00:00
Kostya Serebryany	a00b243c75	[libFuzzer] start using trace-pc-guard as an alternative source of coverage llvm-svn: 281435	2016-09-14 02:13:06 +00:00
Kostya Serebryany	da718e55cf	[sanitizer-coverage] add yet another flavour of coverage instrumentation: trace-pc-guard. The intent is to eventually replace all of {bool coverage, 8bit-counters, trace-pc} with just this one. LLVM part llvm-svn: 281431	2016-09-14 01:39:35 +00:00
Akira Hatanaka	6d5a29489a	Address Pete's review comment and define OrigArg on its own line. This is a follow-up to r281419. llvm-svn: 281421	2016-09-13 23:53:43 +00:00
Akira Hatanaka	dea090e6b2	[ObjCARC] Traverse chain downwards to replace uses of argument passed to ObjC library call with call return. ARC contraction tries to replace uses of an argument passed to an objective-c library call with the call return value. For example, in the following IR, it replaces uses of argument %9 and uses of the values discovered traversing the chain upwards (%7 and %8) with the call return %10, if they are dominated by the call to @objc_autoreleaseReturnValue. This transformation enables code-gen to tail-call the call to @objc_autoreleaseReturnValue, which is necessary to enable auto release return value optimization. %7 = tail call i8* @objc_loadWeakRetained(i8** %6) %8 = bitcast i8* %7 to %0* %9 = bitcast %0* %8 to i8* %10 = tail call i8* @objc_autoreleaseReturnValue(i8* %9) ret %0* %8 Since r276727, llvm started removing redundant bitcasts and as a result started feeding the following IR to ARC contraction: %7 = tail call i8* @objc_loadWeakRetained(i8** %6) %8 = bitcast i8* %7 to %0* %9 = tail call i8* @objc_autoreleaseReturnValue(i8* %7) ret %0* %8 ARC contraction no longer does the optimization described above since it only traverses the chain upwards and fails to recognize that the function return can be replaced by the call return. This commit changes ARC contraction to traverse the chain downwards too and replace uses of bitcasts with the call return. rdar://problem/28011339 Differential Revision: https://reviews.llvm.org/D24523 llvm-svn: 281419	2016-09-13 23:43:11 +00:00
Vedant Kumar	84a280ad6a	[llvm-cov] Just emit the version number in the index file Having the version information in every view is distracting, especially if there are several sub-views. llvm-svn: 281414	2016-09-13 23:00:13 +00:00
Ahmed Bougacha	7398b178f1	[AArch64] Simplify patchpoint/stackmap size test (r281301). NFC. llvm-svn: 281407	2016-09-13 22:16:40 +00:00
Pawel Bylica	c397f0b272	[CodeGen] Fix invalid shift in mul expansion Summary: When expanding mul in type legalization make sure the type for shift amount can actually fit the value. This fixes PR30354 https://llvm.org/bugs/show_bug.cgi?id=30354. Reviewers: hfinkel, majnemer, RKSimon Subscribers: RKSimon, llvm-commits Differential Revision: https://reviews.llvm.org/D24478 llvm-svn: 281403	2016-09-13 21:55:41 +00:00
Michael Kuperstein	59f8305305	[DAG] Allow build-to-shuffle combine to combine builds from two wide vectors. This allows us to, in some cases, create a vector_shuffle out of a build_vector, when the inputs to the build are extract_elements from two different vectors, at least one of which is wider than the output. (E.g. a <8 x i16> being constructed out of elements from a <16 x i16> and a <8 x i16>). Differential Revision: https://reviews.llvm.org/D24491 llvm-svn: 281402	2016-09-13 21:53:32 +00:00
Kevin Enderby	f76b56cb9c	Next set of additional error checks for invalid Mach-O files for bad load commands that use the Mach::dyld_info_command type for the load commands that are currently use in the MachOObjectFile constructor. This contains the missing checks for LC_DYLD_INFO and LC_DYLD_INFO_ONLY load commands and the fields for the Mach::dyld_info_command type. llvm-svn: 281400	2016-09-13 21:42:28 +00:00
Krzysztof Parzyszek	d19d0507c8	[Hexagon] Better handling of HVX vector lowering - Expand SELECT_CC and BR_CC for vector types. - Implement TLI::isShuffleMaskLegal. llvm-svn: 281397	2016-09-13 21:16:07 +00:00
Sanjay Patel	ab40f9d0ef	add tests for PR28672 I'm not sure if we actually want to transform all of these in InstCombine yet, so I'm not labeling these with FIXME. llvm-svn: 281386	2016-09-13 20:36:13 +00:00
Matt Arsenault	e2e6cfee61	Reapply "InstCombine: Reduce trunc (shl x, K) width." This reapplies r272987 with a fix for infinitely looping when the truncated value is another shift of a constant. llvm-svn: 281379	2016-09-13 19:43:57 +00:00
Matthias Braun	1af1414d4d	AArch64: Cleanup tailcall CC check, enable swiftcc. Cleanup/change the code that checks for possible tailcall conventions to look the same as the one in the X86 target. This makes the distinction between calling conventions that can guarnatee tailcalls and the ones that may tailcall more obvious. - Add Swift to the mayTailCall list - PreserveMost seemed to be incorrectly part of the guarnteed tail call list, move it to the mayTailCall list. llvm-svn: 281376	2016-09-13 19:27:38 +00:00
Matt Arsenault	a992f71bef	AMDGPU: Remove code I think is dead As far as I can tell, resolveFrameIndex is supposed to be called with a legal offset, so inserting an add shouldn't be necessary. llvm-svn: 281372	2016-09-13 19:15:25 +00:00
Mike Aizatsky	481ddc3793	.clang-tidy: correct style name is 'camelBack' not 'lowerCase'. Summary: clang-tidy doesn't like to complain. Differential Revision: https://reviews.llvm.org/D24413 llvm-svn: 281370	2016-09-13 19:04:26 +00:00
Matt Arsenault	25dba30017	AMDGPU: Support commuting a FrameIndex operand llvm-svn: 281369	2016-09-13 19:03:12 +00:00
Matthew Simpson	81335bec96	[LV] Clean up uniform induction variable analysis (NFC) llvm-svn: 281368	2016-09-13 19:01:45 +00:00
Davide Italiano	39ccd24126	[LTO] Don't pass SF_Undefined symbols to the IRmover. This should fix PR 30363. llvm-svn: 281366	2016-09-13 18:45:13 +00:00
Reid Kleckner	9bf66f70cb	Fix MSVC 2013 build by using our <thread> wrapper header llvm-svn: 281365	2016-09-13 18:40:04 +00:00
Simon Pilgrim	4a8eba3e96	[DAGCombiner] Use APInt directly in (shl (zext (srl x, C)), C) combine range test To avoid assertion, we must ensure that the inner shift constant is within range before calling ConstantSDNode::getZExtValue(). We already know that the outer shift constant is in range. Followup to D23007 llvm-svn: 281362	2016-09-13 18:33:29 +00:00
Nico Weber	e204c48d16	Revert r281336 (and r281337), it caused PR30372. llvm-svn: 281361	2016-09-13 18:17:00 +00:00
Douglas Katzman	8ea02f4e1c	[Myriad]: set LeonCASA processor feature llvm-svn: 281359	2016-09-13 17:51:41 +00:00
Simon Pilgrim	1f9ddf48a6	[X86][SSE] Added AVX512F and additional vector truncate test cases trunc16i16_16i8 is currently commented out due to PR25684 llvm-svn: 281356	2016-09-13 17:34:56 +00:00
Simon Pilgrim	bd28a85d14	[DAGCombiner] Use APInt directly in (shl (ext (shl x, c1)), c2) combine Fix failure to detect out of range shift constants leading to assert in ConstantSDNode::getZExtValue() Followup to D23007 llvm-svn: 281354	2016-09-13 17:15:28 +00:00
Matt Arsenault	30bccade0b	Fix misleading comment for getOrEnforceKnownAlignment It does not return 0 to indicate failure, and returns the known alignment. llvm-svn: 281350	2016-09-13 16:39:43 +00:00
Andrea Di Biagio	7277afeec1	[ConstantFold] Improve the bitcast folding logic for constant vectors. The constant folder didn't know how to always fold bitcasts of constant integer vectors. In particular, it was unable to handle the case where a constant vector had some undef elements, and the resulting (i.e. bitcasted) vector type had more elements than the original vector type. Example: %cast = bitcast <2 x i64><i64 undef, i64 2> to <4 x i32> On a little endian target, %cast could have been folded to: <4 x i32><i32 undef, i32 undef, i32 2, i32 0> This patch improves the folding logic by teaching how to correctly propagate undef elements in the folded vector. Differential Revision: https://reviews.llvm.org/D24301 llvm-svn: 281343	2016-09-13 14:50:47 +00:00
Simon Pilgrim	2e99e0ff63	[X86] Regenerated shift combine tests. Added x86_64 tests llvm-svn: 281341	2016-09-13 14:41:39 +00:00
Vassil Vassilev	42a8457bcb	[modules] Re-enable some previously excluded files. Our modules support seems to be able to handle them nowadays. Patch by Cristina Cristescu! llvm-svn: 281340	2016-09-13 14:41:35 +00:00
Krzysztof Parzyszek	b558ae2125	[Hexagon] Clear the flow queue after visiting a single instruction llvm-svn: 281339	2016-09-13 14:36:55 +00:00
Nirav Dave	fbd38cadf1	Apply Clang-format to MCAsmParser.cpp NFC. llvm-svn: 281337	2016-09-13 13:57:16 +00:00
Nirav Dave	9fa8af2180	Defer asm errors to post-statement failure Recommitting after fixing AsmParser Initialization. Allow errors to be deferred and emitted as part of clean up to simplify and shorten Assembly parser code. This will allow error messages to be emitted in helper functions and be modified by the caller which has better context. As part of this many minor cleanups to the Parser: * Unify parser cleanup on error * Add Workaround for incorrect return values in ParseDirective instances * Tighten checks on error-signifying return values for parser functions and fix in-tree TargetParsers to be more consistent with the changes. * Fix AArch64 test cases checking for spurious error messages that are now fixed. These changes should be backwards compatible with current Target Parsers so long as the error status are correctly returned in appropriate functions. Reviewers: rnk, majnemer Subscribers: aemerson, jyknight, llvm-commits Differential Revision: https://reviews.llvm.org/D24047 llvm-svn: 281336	2016-09-13 13:55:06 +00:00
Chad Rosier	7ea0d3947a	[LoopInterchange] Minor refactor. NFC. llvm-svn: 281334	2016-09-13 13:30:30 +00:00
Andrea Di Biagio	3647a96a44	[InstSimplify] Add tests to show missed bitcast folding opportunities. InstSimplify doesn't always know how to fold a bitcast of a constant vector. In particular, the logic in InstSimplify doesn't know how to handle the case where the constant vector in input contains some undef elements, and the number of elements is smaller than the number of elements of the bitcast vector type. llvm-svn: 281332	2016-09-13 13:17:42 +00:00
Chad Rosier	61683a22cb	Don't use else if after return. Tidy comments. NFC. llvm-svn: 281331	2016-09-13 13:08:53 +00:00
Chad Rosier	d18ea0654b	Typo. NFC. llvm-svn: 281330	2016-09-13 13:00:29 +00:00
Chad Rosier	09c1109b12	[LoopInterchange] Tidy up and remove unnecessary dyn_casts. NFC. llvm-svn: 281328	2016-09-13 12:56:04 +00:00
James Molloy	043d613791	Revert "[ARM] Promote small global constants to constant pools" This reverts commit r281314. Speculatively revert as it's possible this caused linker errors: http://lab.llvm.org:8011/builders/clang-native-arm-lnt/builds/19656 llvm-svn: 281327	2016-09-13 12:45:51 +00:00
Sam Parker	64781ed4bb	Remove InstCombine test file My previous commit should of removed a test file but I missed it. llvm-svn: 281326	2016-09-13 12:33:06 +00:00
Pablo Barrio	bb6984d401	[ARM] Add ".code 32" to functions in the ARM instruction set Before, only Thumb functions were marked as ".code 16". These ".code x" directives are effective until the next directive of its kind is encountered. Therefore, in code with interleaved ARM and Thumb functions, it was possible to declare a function as ARM and end up with a Thumb function after assembly. A test has been added. An existing test has also been fixed to take this change into account. Reviewers: aschwaighofer, t.p.northover, jmolloy, rengolin Subscribers: aemerson, rengolin, llvm-commits Differential Revision: https://reviews.llvm.org/D24337 llvm-svn: 281324	2016-09-13 12:18:15 +00:00
James Molloy	d246c598de	[Thumb] Teach ISel how to lower compares of AND bitmasks efficiently For the common pattern (CMPZ (AND x, #bitmask), #0), we can do some more efficient instruction selection if the bitmask is one consecutive sequence of set bits (32 - clz(bm) - ctz(bm) == popcount(bm)). 1) If the bitmask touches the LSB, then we can remove all the upper bits and set the flags by doing one LSLS. 2) If the bitmask touches the MSB, then we can remove all the lower bits and set the flags with one LSRS. 3) If the bitmask has popcount == 1 (only one set bit), we can shift that bit into the sign bit with one LSLS and change the condition query from NE/EQ to MI/PL (we could also implement this by shifting into the carry bit and branching on BCC/BCS). 4) Otherwise, we can emit a sequence of LSLS+LSRS to remove the upper and lower zero bits of the mask. 1-3 require only one 16-bit instruction and can elide the CMP. 4 requires two 16-bit instructions but can elide the CMP and doesn't require materializing a complex immediate, so is also a win. llvm-svn: 281323	2016-09-13 12:12:32 +00:00
Sam Parker	214f7bf5cc	Enable simplify libcalls for ARM PCS Teach SimplifyLibcalls that in can treat functions annotated with apcs, aapcs or aapcs_vfp like normal C functions if they only take and return integer or pointer values, and the target is not iOS. Differential Revision: https://reviews.llvm.org/D24453 llvm-svn: 281322	2016-09-13 12:10:14 +00:00
Ying Yi	544b1df64f	[llvm-cov] - Included footer "Generated by llvm-cov -- llvm version <version number>" in the coverage report. The llvm-cov version information will be useful to the user when comparing the code coverage across different versions of llvm-cov. This patch provides the llvm-cov version information in the generated coverage report. Differential Revision: https://reviews.llvm.org/D24457 llvm-svn: 281321	2016-09-13 11:28:31 +00:00
Peter Smith	85bbda191d	[ARM] Support ldr.w in pseudo instruction ldr rd,=immediate The changes made in r269352, r269353 and r269354 to support the transformation of the ldr rd,=immediate to mov introduced a regression from 3.8 (ldr.w rd, =immediate) not supported. This change puts support back in for ldr.w by means of a t2InstAlias for the .w form. The .w is ignored in ARM state and propagated to the ldr in Thumb2. llvm-svn: 281319	2016-09-13 11:15:51 +00:00
James Molloy	3e4bc66134	[ARM] Promote small global constants to constant pools If a constant is unamed_addr and is only used within one function, we can save on the code size and runtime cost of an indirection by changing the global's storage to inside the constant pool. For example, instead of: ldr r0, .CPI0 bl printf bx lr .CPI0: &format_string format_string: .asciz "hello, world!\n" We can emit: adr r0, .CPI0 bl printf bx lr .CPI0: .asciz "hello, world!\n" This can cause significant code size savings when many small strings are used in one function (4 bytes per string). llvm-svn: 281314	2016-09-13 10:28:11 +00:00
Eric Liu	882dc72b38	[WebAssembly] Trying to fix broken tests in CodeGen/WebAssembly caused by r281285. Reviewers: bkramer, ddcc, dschuff, sunfish Subscribers: jfb, llvm-commits, dschuff Differential Revision: https://reviews.llvm.org/D24497 llvm-svn: 281312	2016-09-13 10:05:44 +00:00
Ayman Musa	0c2da88f82	Remove MVT:i1 xor instruction before SELECT. (Performance improvement). Differential Revision: https://reviews.llvm.org/D23764 llvm-svn: 281308	2016-09-13 09:12:45 +00:00
Sjoerd Meijer	520a18df9c	Revert of r281304 as it is causing build bot failures in hexagon hwloop regression tests. These tests pass locally; will be investigating where these differences come from. llvm-svn: 281306	2016-09-13 08:51:59 +00:00
Sjoerd Meijer	05453991fe	This adds a new field isAdd to MCInstrDesc. The ARM and Hexagon instruction descriptions now tag add instructions, and the Hexagon backend is using this to identify loop induction statements. Patch by Sam Parker and Sjoerd Meijer. Differential Revision: https://reviews.llvm.org/D23601 llvm-svn: 281304	2016-09-13 08:08:06 +00:00
Elena Demikhovsky	b906df9fe5	AVX-512: Fix for PR28175 - Scalar code optimization. Optimized (truncate (assertzext x) to i1) and anyext i1 to i8/16/32. Optimization of this patterns is a one more step towards i1 optimization on AVX-512. Differential Revision: https://reviews.llvm.org/D24456 llvm-svn: 281302	2016-09-13 07:57:00 +00:00
Diana Picus	4b97288184	[AArch64] Support stackmap/patchpoint in getInstSizeInBytes We currently return 4 for stackmaps and patchpoints, which is very optimistic and can in rare cases cause the branch relaxation pass to fail to relax certain branches. This patch causes getInstSizeInBytes to return a pessimistic estimate of the size as the number of bytes requested in the stackmap/patchpoint. In the future, we could provide a more accurate estimate by sharing some of the logic in AArch64::LowerSTACKMAP/PATCHPOINT. Fixes part of https://llvm.org/bugs/show_bug.cgi?id=28750 Differential Revision: https://reviews.llvm.org/D24073 llvm-svn: 281301	2016-09-13 07:45:17 +00:00
Craig Topper	4619c9e6a8	[X86] Remove masked shufpd/shufps intrinsics and autoupgrade to native vector shuffles. They were removed from clang previously but accidentally left in the backend. llvm-svn: 281300	2016-09-13 07:40:53 +00:00
Craig Topper	3f029ae6e1	[X86] Remove some dead intrinsics. They aren't implemented and clang doesn't reference them. llvm-svn: 281299	2016-09-13 07:40:48 +00:00
Davide Italiano	11cfa45bec	[Docs] Fix a broken link in the Kaleidoscope tutorial. Patch by: Alfred Perlstein <alfred@FreeBSD.org> llvm-svn: 281297	2016-09-13 06:31:37 +00:00
Davide Italiano	d0f70eb557	[LTO] Only expose the dataLayout string instead of the whole module. Differential Revision: https://reviews.llvm.org/D24494 llvm-svn: 281296	2016-09-13 06:29:17 +00:00
Zachary Turner	d97d5a2cee	Revert "[Support][CommandLine] Add cl::getRegisteredSubcommands()" This reverts r281290, as it breaks unit tests. http://lab.llvm.org:8011/builders/clang-x86-windows-msvc2015/builds/303 llvm-svn: 281292	2016-09-13 04:11:57 +00:00
Dean Michael Berris	d9d290c0c6	[Support][CommandLine] Add cl::getRegisteredSubcommands() This should allow users of the library to get a range to iterate through all the subcommands that are registered to the global parser. This allows users to define subcommands in libraries that self-register to have dispatch done at a different stage (like main). It allows for writing code like the following: for (auto S : cl::getRegisteredSubcommands()) { if (S) { // Dispatch on S->getName(). } } This change also contains tests that show this usage pattern. Reviewers: zturner, dblaikie, echristo Subscribers: llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D24489 llvm-svn: 281290	2016-09-13 02:35:00 +00:00
Davide Italiano	b69efb9e8d	[lib/LTO] Expose getModule() in lto::InputFile. lld will use this to get the datalayout string and emit a diagnostic if empty. llvm-svn: 281289	2016-09-13 02:22:02 +00:00
Peter Collingbourne	d4135bbc30	DebugInfo: New metadata representation for global variables. This patch reverses the edge from DIGlobalVariable to GlobalVariable. This will allow us to more easily preserve debug info metadata when manipulating global variables. Fixes PR30362. A program for upgrading test cases is attached to that bug. Differential Revision: http://reviews.llvm.org/D20147 llvm-svn: 281284	2016-09-13 01:12:59 +00:00
Michael Kuperstein	efc0667583	[DAG] Refactor BUILD_VECTOR combine to make it easier to extend. NFCI. This should make it easier to add cases that we currently don't cover, like supporting more kinds of type mismatches and more than 2 input vectors. llvm-svn: 281283	2016-09-13 00:57:43 +00:00
Hans Wennborg	8a42d4b9cc	X86: Conditional tail calls should not have isBarrier = 1 That confuses e.g. machine basic block placement, which then doesn't realize that control can fall through a block that ends with a conditional tail call. Instead, isBranch=1 should be set. Also, mark EFLAGS as used by these instructions. llvm-svn: 281281	2016-09-13 00:21:32 +00:00
Eric Christopher	04c7db31e8	Temporarily Revert "[MC] Defer asm errors to post-statement failure" as it's causing errors on the sanitizer bots. This reverts commit r281249. llvm-svn: 281280	2016-09-13 00:19:29 +00:00
Adam Nemet	fd84f48f9f	[OptDiag] Add getHotness accessor llvm-svn: 281274	2016-09-12 23:46:34 +00:00
Philip Reames	9db7948e90	[LVI] Complete the abstract of the cache layer [NFCI] Convert the previous introduced is-a relationship between the LVICache and LVIImple clases into a has-a relationship and hide all the implementation details of the cache from the lazy query layer. The only slightly concerning change here is removing the addition of a queried block into the SeenBlock set in LVIImpl::getBlockValue. As far as I can tell, this was effectively dead code. I think it used to be the case that getCachedValueInfo wasn't const and might end up inserting elements in the cache during lookup. That's no longer true and hasn't been for a while. I did fixup the const usage to make that more obvious. llvm-svn: 281272	2016-09-12 22:38:44 +00:00
Sanjay Patel	ff00fae8e6	add more tests for PR30273 llvm-svn: 281270	2016-09-12 22:28:29 +00:00
Lang Hames	a506d50744	[ORC] Clang-format RPCSerialization.h. llvm-svn: 281269	2016-09-12 22:05:14 +00:00
Lang Hames	c83e39b9e2	[ORC] Add some more documentation to RPCSerialization.h. llvm-svn: 281268	2016-09-12 22:05:12 +00:00
Philip Reames	b627aec407	[LVI] Sink a couple more cache manipulation routines into the cache itself [NFCI] The only interesting bit here is the refactor of the handle callback and even that's pretty straight-forward. llvm-svn: 281267	2016-09-12 22:03:36 +00:00
Philip Reames	92e5e1b92d	[LVI] Abstract out the actual cache logic [NFCI] Seperate the caching logic from the implementation of the lazy analysis. For the moment, the lazy analysis impl has a is-a relationship with the cache; this will change to a has-a relationship shortly. This was done as two steps merely to keep the changes simple and the diff understandable. llvm-svn: 281266	2016-09-12 21:46:58 +00:00
Nico Weber	7c31d0ebc0	Revert r281215, it caused PR30358. llvm-svn: 281263	2016-09-12 21:40:50 +00:00
Nico Weber	8ad01c303e	attempt to unbreak build after r281254 llvm-svn: 281262	2016-09-12 21:15:44 +00:00
Lang Hames	2f98f31c77	[ORC] Add missing <thread> header to RPCSerialization.h. llvm-svn: 281257	2016-09-12 20:45:01 +00:00
Lang Hames	27ce3c1b5e	[ORC] Replace the serialize/deserialize function pair with a SerializationTraits class. SerializationTraits provides serialize and deserialize methods corresponding to the earlier functions, but also provides a name for the type. In future, this name will be used to render function signatures as strings, which will in turn be used to negotiate and verify API support between RPC clients and servers. llvm-svn: 281254	2016-09-12 20:34:41 +00:00
Dehao Chen	c32d71253c	Fix the bug introduced in r281252. llvm-svn: 281253	2016-09-12 20:29:54 +00:00
Dehao Chen	9bbb941acf	Lower consecutive select instructions correctly. Summary: If consecutive select instructions are lowered separately in CGP, it will introduce redundant condition check and branches that cannot be removed by later optimization phases. This patch lowers all consecutive select instructions at the same to to avoid inefficent code as demonstrated in https://llvm.org/bugs/show_bug.cgi?id=29095 Reviewers: davidxl Subscribers: vsk, llvm-commits Differential Revision: https://reviews.llvm.org/D24147 llvm-svn: 281252	2016-09-12 20:23:28 +00:00
Nirav Dave	c0c0f7a196	[MC] Defer asm errors to post-statement failure Allow errors to be deferred and emitted as part of clean up to simplify and shorten Assembly parser code. This will allow error messages to be emitted in helper functions and be modified by the caller which has better context. As part of this many minor cleanups to the Parser: * Unify parser cleanup on error * Add Workaround for incorrect return values in ParseDirective instances * Tighten checks on error-signifying return values for parser functions and fix in-tree TargetParsers to be more consistent with the changes. * Fix AArch64 test cases checking for spurious error messages that are now fixed. These changes should be backwards compatible with current Target Parsers so long as the error status are correctly returned in appropriate functions. Reviewers: rnk, majnemer Subscribers: aemerson, jyknight, llvm-commits Differential Revision: https://reviews.llvm.org/D24047 llvm-svn: 281249	2016-09-12 20:03:02 +00:00
Sanjay Patel	dea26950a0	[InstCombine] add test for PR30327 llvm-svn: 281248	2016-09-12 19:50:08 +00:00
Sanjay Patel	2eea9a1d58	[InstCombine] regenerate checks llvm-svn: 281247	2016-09-12 19:29:26 +00:00
Elena Demikhovsky	57c602aad3	AVX-512: Added a test for -O0 mode. NFC. llvm-svn: 281246	2016-09-12 19:03:21 +00:00
Elena Demikhovsky	5730bf0429	AVX-512: Simplified masked_gather_scatter test. NFC. llvm-svn: 281244	2016-09-12 18:50:47 +00:00
Hemant Kulkarni	3a705b570b	Fix test failure in r281232 llvm-svn: 281240	2016-09-12 17:40:10 +00:00
Lang Hames	8d4be3aacf	[MCJIT] Fix some inconsistent handling of name mangling inside MCJIT. This patch moves symbol mangling from findSymbol to getSymbolAddress. The findSymbol, findExistingSymbol and findModuleForSymbol methods now always take a mangled name, allowing the 'demangle-and-retry' cruft to be removed from findSymbol. See http://llvm.org/PR28699 for details. Patch by James Holderness. Thanks very much James! llvm-svn: 281238	2016-09-12 17:19:24 +00:00
Hemant Kulkarni	5f4ca2f371	llvm-size: Add --totals option Differential Revision: https://reviews.llvm.org/D24308 llvm-svn: 281233	2016-09-12 17:08:28 +00:00
Hemant Kulkarni	aecf9d0c86	llvm-objdump: Add --start-address and --stop-address options Differential Revision: https://reviews.llvm.org/D24160 llvm-svn: 281232	2016-09-12 17:08:22 +00:00
Sanjay Patel	f5887f1fbd	[InstCombine] use m_APInt to allow icmp X, C folds for splat constant vectors isSignBitCheck could be changed to take a pointer param to avoid the 'UnusedBit' ugliness. llvm-svn: 281231	2016-09-12 16:25:41 +00:00
Nicolai Haehnle	e58e0e3fe3	AMDGPU: Do not clobber SCC in SIWholeQuadMode Reviewers: arsenm, tstellarAMD, mareko Subscribers: arsenm, llvm-commits, kzhuravl Differential Revision: http://reviews.llvm.org/D22198 llvm-svn: 281230	2016-09-12 16:25:20 +00:00
Ahmed Bougacha	925961b20c	[GlobalISel] Fix mismatched "<..)" in intrinsic MO printing. NFC. llvm-svn: 281229	2016-09-12 16:21:49 +00:00
James Molloy	3d06ff22b7	Revert "[ARM] Promote small global constants to constant pools" This reverts commit r281213. It made a bot go bang: http://lab.llvm.org:8011/builders/clang-cmake-armv7-a15-full/builds/14625 llvm-svn: 281228	2016-09-12 16:18:23 +00:00
Ahmed Bougacha	b678219aa6	[BranchFolding] Unique added live-ins after hoisting code. We're not supposed to have duplicate live-ins. llvm-svn: 281224	2016-09-12 16:05:31 +00:00
Ahmed Bougacha	45bfa8772f	[X86] Copy imp-uses when folding tailcall into conditional branch. r280832 added 32-bit support for emitting conditional tail-calls, but dropped imp-used parameter registers. This went unnoticed until r281113, which added 64-bit support, as this is only exposed with parameter passing via registers. Don't drop the imp-used parameters. llvm-svn: 281223	2016-09-12 16:05:27 +00:00
David Majnemer	c83044d9bb	[FunctionAttrs] Don't try to infer returned if it is already on an argument Trying to infer the 'returned' attribute if an argument is already 'returned' can lead to verification failure: inference might determine that a different argument is passed through which would result in two different arguments marked as 'returned'. This fixes PR30350. llvm-svn: 281221	2016-09-12 16:04:59 +00:00
Sanjay Patel	0531f0a5bb	fix formatting; NFC llvm-svn: 281220	2016-09-12 15:52:28 +00:00
Sanjay Patel	db400baa80	[InstCombine] add tests to show missing vector folds llvm-svn: 281219	2016-09-12 15:51:42 +00:00
Igor Breger	a3e36da6f2	add select i1 test, reproduser pr30249. llvm-svn: 281218	2016-09-12 15:27:02 +00:00
Sanjay Patel	3151dec7f1	[InstCombine] add helper function for foldICmpUsingKnownBits; NFCI llvm-svn: 281217	2016-09-12 15:24:31 +00:00
Sam Kolton	fb0d9d9c13	[AMDGPU] Assembler: Move disabled SDWA and DPP instruction into Disable asm variant Summary: This removes disabled instructions from match tables so we will not match them at all. Reviewers: tstellarAMD, vpykhtin, artem.tamazov Subscribers: wdng, nhaehnle, arsenm Differential Revision: https://reviews.llvm.org/D24452 llvm-svn: 281216	2016-09-12 14:42:43 +00:00
James Molloy	1e1b56bd48	[Thumb] Teach ISel how to lower compares of AND bitmasks efficiently For the common pattern (CMPZ (AND x, #bitmask), #0), we can do some more efficient instruction selection if the bitmask is one consecutive sequence of set bits (32 - clz(bm) - ctz(bm) == popcount(bm)). 1) If the bitmask touches the LSB, then we can remove all the upper bits and set the flags by doing one LSLS. 2) If the bitmask touches the MSB, then we can remove all the lower bits and set the flags with one LSRS. 3) If the bitmask has popcount == 1 (only one set bit), we can shift that bit into the sign bit with one LSLS and change the condition query from NE/EQ to MI/PL (we could also implement this by shifting into the carry bit and branching on BCC/BCS). 4) Otherwise, we can emit a sequence of LSLS+LSRS to remove the upper and lower zero bits of the mask. 1-3 require only one 16-bit instruction and can elide the CMP. 4 requires two 16-bit instructions but can elide the CMP and doesn't require materializing a complex immediate, so is also a win. llvm-svn: 281215	2016-09-12 14:30:48 +00:00
Sanjay Patel	5352331716	fix formatting/typos; NFC llvm-svn: 281214	2016-09-12 14:25:46 +00:00
James Molloy	8f82d45ff4	[ARM] Promote small global constants to constant pools If a constant is unamed_addr and is only used within one function, we can save on the code size and runtime cost of an indirection by changing the global's storage to inside the constant pool. For example, instead of: ldr r0, .CPI0 bl printf bx lr .CPI0: &format_string format_string: .asciz "hello, world!\n" We can emit: adr r0, .CPI0 bl printf bx lr .CPI0: .asciz "hello, world!\n" This can cause significant code size savings when many small strings are used in one function (4 bytes per string). llvm-svn: 281213	2016-09-12 13:42:16 +00:00
Chad Rosier	a4c424654e	[LoopInterchange] Improve debug output. NFC. llvm-svn: 281212	2016-09-12 13:24:47 +00:00
Pablo Barrio	0bebc38abb	Fix the Thumb test for vfloat intrinsics Summary: This test was not testing the intrinsics. A function like this: define %v4f32 @test_v4f32.floor(%v4f32 %a){ ... %1 = call %v4f32 @llvm.floor.v4f32(%v4f32 %a) ... } is transformed into the following assembly: _test_v4f32.floor: @ @test_v4f32.floor ... bl _floorf ... In each function tested, there are two CHECK: one that checked for the label and another one for the intrinsic that should be used inside the function (in our case, "floor"). However, although the first CHECK was matching the label, the second was not matching the intrinsic, but the second "floor" in the same line as the label. This is fixed by making the first CHECK match the entire line. Reviewers: jmolloy, rengolin Subscribers: rengolin, llvm-commits Differential Revision: https://reviews.llvm.org/D24398 llvm-svn: 281211	2016-09-12 13:14:14 +00:00
Rafael Espindola	74941239d8	Define a dummy zlib::uncompress when zlib is not available. Should fix link errors in some bots when it is used. llvm-svn: 281208	2016-09-12 13:00:51 +00:00

1 2 3 4 5 ...

138154 Commits