llvm-project

Commit Graph

Author	SHA1	Message	Date
Zoran Jovanovic	c74e3eb9a6	[mips][microMIPS] Implement JRADDIUSP instruction Differential Revision: http://reviews.llvm.org/D5046 llvm-svn: 217681	2014-09-12 14:29:54 +00:00
Bill Schmidt	b73b370809	Address comments on r217622 llvm-svn: 217680	2014-09-12 14:26:36 +00:00
Zoran Jovanovic	ed6dd6bd39	[mips][microMIPS] Implement BGEZALS and BLTZALS instructions Differential Revision: http://reviews.llvm.org/D5004 llvm-svn: 217678	2014-09-12 13:51:58 +00:00
Zoran Jovanovic	ac9ef12fc5	[mips][microMIPS] Implement JALS and JALRS instructions. Differential Revision: http://reviews.llvm.org/D5003 llvm-svn: 217676	2014-09-12 13:43:41 +00:00
Zoran Jovanovic	4e7ac4ad2a	[mips][microMIPS] Implement TLBP, TLBR, TLBWI and TLBWR instructions Differential Revision: http://reviews.llvm.org/D5211 llvm-svn: 217675	2014-09-12 13:33:33 +00:00
James Molloy	a9f47b6bae	[ARM] Teach the cost model that cross-class copies are costly. Cross-class copies being expensive is actually a trait of the microarchitecture, but as I haven't yet seen an example of a microarchitecture where they're cheap it seems best to just enable this by default, covering the non-mcpu build case. llvm-svn: 217674	2014-09-12 13:29:40 +00:00
Benjamin Kramer	6d527ef9d6	Legalizer: Use the scalar bit width when promoting bit counting instrs on vectors. e.g. when promoting ctlz from <2 x i32> to <2 x i64> we have to fixup the result by 32 bits, not 64. PR20917. llvm-svn: 217671	2014-09-12 12:50:27 +00:00
Patrik Hagglund	c287f4a358	Fix gcc -Wpedantic. llvm-svn: 217669	2014-09-12 12:32:08 +00:00
Evgeniy Stepanov	986460de07	Add CMake check for libatomic. llvm-svn: 217666	2014-09-12 11:08:59 +00:00
Benjamin Kramer	cfd8d90969	Fix an ODR violation consisting of two 'struct Query' in the global namespace. Put them in their own anonymous namespaces. Found by GCC's new -Wodr (PR20915). llvm-svn: 217662	2014-09-12 08:56:53 +00:00
Joerg Sonnenberger	313f5e2f29	Add Tom Stellard's role as 3.5 release manager. llvm-svn: 217659	2014-09-12 08:07:31 +00:00
Justin Bogner	15562d00c1	llvm-cov: Move FunctionCoverageMapping into CoverageMapping.h (NFC) llvm-svn: 217657	2014-09-12 06:52:44 +00:00
Craig Topper	fec61ef391	Remove a temporary variable and just construct a unique_ptr directly using make_unique. llvm-svn: 217655	2014-09-12 05:17:20 +00:00
David Majnemer	6ac8b9ba8b	Object: Small cleanup in COFFObjectFile::getSymbol NFC. llvm-svn: 217648	2014-09-11 23:42:10 +00:00
Justin Bogner	3d7260e7b2	Revert "llvm-cov: Remove an overly system specific test" This fixes a call to sys::fs::equivalent that should've been to CodeCoverageTool::equivalentFiles, which lets us restore the test of r217476 that was removed in r217478. This reverts r217478, but the test works this time. llvm-svn: 217646	2014-09-11 23:20:48 +00:00
Lang Hames	6f8525d8a6	[MCJIT] Improve the "stub not found" diagnostic in RuntimeDyldChecker. A "stub found found" diagnostic is emitted when RuntimeDyldChecker's stub lookup logic fails to find the requested stub. The obvious reason for the failure is that no such stub has been created, but it can also fail for internal symbols if the symbol offset is not computed correctly (E.g. due to a mangled relocation addend). This patch adds a comment about the latter case so that it's not overlooked. Inspired by confusion experienced during test case construction for r217635. llvm-svn: 217643	2014-09-11 23:09:22 +00:00
Rui Ueyama	7b6cff8f7b	[Support][Endian] Overload += and -= This patch is to overload operator+= and operator-= for {u}{little}{big}{16,32,64}_t. llvm-svn: 217637	2014-09-11 22:55:25 +00:00
Matt Arsenault	362f345bab	R600/SI: Fix off by 1 error in used register count The register numbers start at 0, so if only 1 register was used, this was reported as 0. llvm-svn: 217636	2014-09-11 22:51:37 +00:00
Lang Hames	691a21ce5a	[MCJIT] Make sure we test ARM BR24 relocations with both internal and external symbols. Previously we have only been testing these relocations with external symbols. <rdar://problem/18308413> llvm-svn: 217635	2014-09-11 22:43:36 +00:00
Rui Ueyama	5c69ff5cf5	Support: Use llvm::COFF::BigObjMagic Use llvm::COFF::BigObjMagic insetad of the string literal. Also checks the version number. llvm-svn: 217633	2014-09-11 22:34:32 +00:00
Rui Ueyama	062c406a85	Support: Delete {aligned_,}{u,}{little,big}8_t The byte has no endianness, so these types don't make sense. uint8_t should be used instead. llvm-svn: 217631	2014-09-11 21:46:33 +00:00
Juergen Ributzka	14ae60407d	[C API] Make the 'lower switch' pass available via the C API. llvm-svn: 217630	2014-09-11 21:32:32 +00:00
Quentin Colombet	b2c5c6dde3	[CodeGenPrepare] Teach the addressing mode matcher how to promote zext. I.e., teach it about 'sext (zext a to ty) to ty2' => zext a to ty2. llvm-svn: 217629	2014-09-11 21:22:14 +00:00
David Blaikie	6741bb09bb	Remove the unused string section symbol parameter from DwarfFile::emitStrings And since it /looked/ like the DwarfStrSectionSym was unused, I tried removing it - but then it turned out that DwarfStringPool was reconstructing the same label (and expecting it to have already been emitted) and uses that. So I kept it around, but wanted to pass it in to users - since it seemed a bit silly for DwarfStringPool to have it passed in and returned but itself have no use for it. The only two users don't handle strings in both .dwo and .o files so they only ever need the one symbol - no need to keep it (and have an unused symbol) in the DwarfStringPool used for fission/.dwo. Refactor a bunch of accelerator table usage to remove duplication so I didn't have to touch 4-5 callers. llvm-svn: 217628	2014-09-11 21:12:48 +00:00
Rui Ueyama	2acb05843b	Support: improve identify_magic to recognize COFF bigobj identify_magic recognized a COFF bigobj as an import library file. This patch fixes that. llvm-svn: 217627	2014-09-11 21:09:57 +00:00
Rafael Espindola	281f23adc1	Misc cleanups to the FileSytem api. The main difference is the removal of std::error_code exists(const Twine &path, bool &result); It was an horribly redundant interface since a file not existing is also a valid error_code. Now we have an access function that returns just an error_code. This is the only function that has to be implemented for Unix and Windows. The functions can_write, exists and can_execute an now just wrappers. One still has to be very careful using these function to avoid introducing race conditions (Time of check to time of use). llvm-svn: 217625	2014-09-11 20:30:02 +00:00
Bill Schmidt	3ae268076b	Add missing colon to RUN line... llvm-svn: 217623	2014-09-11 20:13:52 +00:00
Bill Schmidt	be95fd5357	[PATCH, PowerPC] Accept 'U' and 'X' constraints in inline asm Inline asm may specify 'U' and 'X' constraints to print a 'u' for an update-form memory reference, or an 'x' for an indexed-form memory reference. However, these are really only useful in GCC internal code generation. In inline asm the operand of the memory constraint is typically just a register containing the address, so 'U' and 'X' make no sense. This patch quietly accepts 'U' and 'X' in inline asm patterns, but otherwise does nothing. If we ever unexpectedly see a non-register, we'll assert and sort it out afterwards. I've added a new test for these constraints; the test case should be used for other asm-constraints changes down the road. llvm-svn: 217622	2014-09-11 20:10:03 +00:00
Lang Hames	6f1048f94e	[MCJIT] Add support for ARM HALF_DIFF relocations to MCJIT. Fixes <rdar://problem/18297804>. llvm-svn: 217620	2014-09-11 19:21:14 +00:00
Rafael Espindola	c159915790	Use simpler version of sys::fs::exists. NFC. llvm-svn: 217618	2014-09-11 19:11:02 +00:00
Rafael Espindola	3e5cc65efa	Use the simpler sys::fs:;exists. NFC. llvm-svn: 217617	2014-09-11 18:44:26 +00:00
Matt Arsenault	d40e1c3fbc	Add triple to test to fix bots llvm-svn: 217612	2014-09-11 17:50:20 +00:00
Brad Smith	2ce0d91bde	Provide an implementation of getNoopForMachoTarget for SPARC. llvm-svn: 217611	2014-09-11 17:40:51 +00:00
Matt Arsenault	8239eaab99	Add DAG combine for shl + add of constants. Do (shl (add x, c1), c2) -> (add (shl x, c2), c1 << c2) This is already done for multiplies, but since multiplies by powers of two are turned into shifts, we also need to handle it here. This might want checks for isLegalAddImmediate to avoid transforming an add of a legal immediate with one that isn't. llvm-svn: 217610	2014-09-11 17:34:19 +00:00
Lang Hames	4669cd08a7	[MCJIT] Take the relocation addend into account when applying ARM MachO VANILLA and BR24 relocations. <rdar://problem/18296496> llvm-svn: 217605	2014-09-11 17:27:01 +00:00
Ismail Pazarbasi	1379b96d4c	Added missing LLVM_NOEXCEPT to the definition of _obj2yaml_error_category::name LLVM_NOEXCEPT was added in r210591. llvm-svn: 217603	2014-09-11 17:19:54 +00:00
Adam Nemet	053c4e825c	[AVX512] Fix miscompile for unpack r189189 implemented AVX512 unpack by essentially performing a 256-bit unpack between the low and the high 256 bits of src1 into the low part of the destination and another unpack of the low and high 256 bits of src2 into the high part of the destination. I don't think that's how unpack works. AVX512 unpack simply has more 128-bit lanes but other than it works the same way as AVX. So in each 128-bit lane, we're always interleaving certain parts of both operands rather different parts of one of the operands. E.g. for this: __v16sf a = { 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 }; __v16sf b = { 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31 }; __v16sf c = __builtin_shufflevector(a, b, 0, 8, 1, 9, 4, 12, 5, 13, 16, 24, 17, 25, 20, 28, 21, 29); we generated punpcklps (notice how the elements of a and b are not interleaved in the shuffle). In turn, c was set to this: 0 16 1 17 4 20 5 21 8 24 9 25 12 28 13 29 Obviously this should have just returned the mask vector of the shuffle vector. I mostly reverted this change and made sure the original AVX code worked for 512-bit vectors as well. Also updated the tests because they matched the logic from the code. llvm-svn: 217602	2014-09-11 16:51:10 +00:00
Sanjay Patel	1eb5047ddb	Add triple and remove hashes to account for buildbot differences in comment strings. llvm-svn: 217601	2014-09-11 16:08:44 +00:00
Benjamin Kramer	9e5b4a5827	Move constant-sized bitvector to the stack. llvm-svn: 217600	2014-09-11 15:58:39 +00:00
Sanjay Patel	7bd228a82e	Combine fmul vector FP constants when unsafe math is allowed. This is an extension of the change made with r215820: http://llvm.org/viewvc/llvm-project?view=revision&revision=215820 That patch allowed combining of splatted vector FP constants that are multiplied. This patch allows combining non-uniform vector FP constants too by relaxing the check on the type of vector. Also, canonicalize a vector fmul in the same way that we already do for scalars - if only one operand of the fmul is a constant, make it operand 1. Otherwise, we miss potential folds. This fold is also done by -instcombine, but it's possible that extra fmuls may have been generated during lowering. Differential Revision: http://reviews.llvm.org/D5254 llvm-svn: 217599	2014-09-11 15:45:27 +00:00
Sanjay Patel	4cb54e0a78	typo llvm-svn: 217597	2014-09-11 15:41:01 +00:00
Aaron Watry	1885e53a75	R600: Add cmpxchg instruction for evergreen Refactored the R600_LDS_1A2D class a bit to get it to actually work. It seemed to be previously unused and broken. We also have to disable the conversion to the noret variant for now in R600ISelLowering because the getLDSNoRetOp method only handles 1A1D LDS ops. Someone can feel free to modify the AMDGPU::getLDSNoRetOp method to work for more than 1A1D variants of LDS operations. It's being left as a future TODO for now. Signed-off-by: Aaron Watry <awatry at gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217596	2014-09-11 15:02:54 +00:00
Aaron Watry	3ffc560094	R600: Test local atomics for evergreen Now that the operations are all implemented, we can test this sub-arch here. Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217595	2014-09-11 15:02:52 +00:00
Aaron Watry	21591670c9	R600: Add LDS_WRXCHG[_RET] instructions for Evergreen. Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217594	2014-09-11 15:02:49 +00:00
Aaron Watry	564a22e995	R600: Add LDS_MIN_[U]INT[_RET] instructions for Evergreen Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217593	2014-09-11 15:02:47 +00:00
Aaron Watry	e51794f2fa	R600: Add LDS_XOR[_RET] instructions for Evergreen Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217592	2014-09-11 15:02:46 +00:00
Aaron Watry	cffa0114c7	R600: Add LDS_OR[_RET] instructions for Evergreen Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217591	2014-09-11 15:02:44 +00:00
Aaron Watry	a7f122da60	R600: Add LDS_AND[_RET] instructions for Evergreen Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217590	2014-09-11 15:02:43 +00:00
Aaron Watry	62a0af4a0d	R600: Add LDS_MAX_[U]INT[_RET] instructions for Evergreen This was only present for SI before. Cayman may still be missing, but I am unable to test that currently. v2: Don't create atomicrmw max tests in separate file Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> CC: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 217589	2014-09-11 15:02:41 +00:00
Daniel Sanders	f605184180	[docs] Mention character array constants in docs/LangRef.rst Summary: They were used in the 'Module Structure' example but weren't otherwise documented. Credit to Reed Kotler for noticing. Reviewers: hans Reviewed By: hans Subscribers: hans, llvm-commits Differential Revision: http://reviews.llvm.org/D5191 llvm-svn: 217583	2014-09-11 12:02:59 +00:00
Tilmann Scheller	ee0e49398c	[ARM] Add Thumb-2 code size optimization regression test for LSR (register). llvm-svn: 217582	2014-09-11 10:45:50 +00:00
Tilmann Scheller	579379a6f4	[ARM] Add Thumb-2 code size optimization regression test for LSR (immediate). llvm-svn: 217581	2014-09-11 10:42:17 +00:00
Arnaud A. de Grandmaison	3690266739	[AArch64] Reenable the PBQP test now that the leak issue has been fixed. David Blaikie's commits r217563 & r217564, which added shared_ptr to the CostPool have fixed some memory leak issues exposed by the PBQP with coalescing constraints. The sanitizer bot was failing because of those leaks. Now that the leaks are gone, we can reenable the aarch64/pbqp test. llvm-svn: 217580	2014-09-11 10:39:52 +00:00
Tilmann Scheller	0c1249ac60	[ARM] Add Thumb-2 code size optimization regression test for LSL (register). llvm-svn: 217579	2014-09-11 10:33:39 +00:00
Tilmann Scheller	7430df486e	[ARM] Add Thumb2 code size optimization regression test for LSL (immediate). llvm-svn: 217576	2014-09-11 10:29:42 +00:00
Chandler Carruth	1ec3e4e4bd	[x86] Fixup r217565 which baked in an assumption about the function name that breaks on some platforms. This part of the test just doesn't matter... llvm-svn: 217575	2014-09-11 10:21:25 +00:00
Hal Finkel	f83e1f7f66	[AlignmentFromAssumptions] Don't crash just because the target is 32-bit We used to crash processing any relevant @llvm.assume on a 32-bit target (because we'd ask SE to subtract expressions of differing types). I've copied our 'simple.ll' test, but with the data layout from arm-linux-gnueabihf to get some meaningful test coverage here. llvm-svn: 217574	2014-09-11 08:40:17 +00:00
David Xu	f7aff68fe3	Build correct vector filled with undef nodes llvm-svn: 217570	2014-09-11 05:10:28 +00:00
Justin Bogner	8e5f548b81	utils: Teach lldbDataFormatters how to format ArrayRefs llvm-svn: 217567	2014-09-11 01:47:38 +00:00
Chandler Carruth	292303dd47	[x86] FileCheck-ize this test. llvm-svn: 217565	2014-09-11 00:13:35 +00:00
David Blaikie	792e8f3c02	Use CostPool::PoolRef typedef some more Cleanup to 217563 suggested by Lang Hames in post-commit review. llvm-svn: 217564	2014-09-11 00:08:54 +00:00
David Blaikie	ebd7f671df	shared_ptrify ownershp of PoolEntries in PBQP's CostPool Leveraging both intrusive shared_ptr-ing (std::enable_shared_from_this) and shared_ptr<T>-owning-U (to allow external users to hold std::shared_ptr<CostT> while keeping the underlying PoolEntry alive). The intrusiveness could be removed if we had a weak_set that implicitly removed items from the set when their underlying data went away. This /might/ fix an existing memory leak reported by LeakSanitizer in r217504. llvm-svn: 217563	2014-09-10 23:54:45 +00:00
Matt Arsenault	61a528adc7	R600/SI: Fix losing chain when fixing reg class of loads. The lost chain resulting in earlier side effecting nodes being deleted. llvm-svn: 217561	2014-09-10 23:26:19 +00:00
Matt Arsenault	2e9911205f	R600/SI: Report offset in correct units for st64 DS instructions Need to convert the 64 element offset into bytes, not just the element size like the normal case instructions. Noticed by inspection. This can't be hit now because st64 instructions aren't emitted during instruction selection, and the post-RA scheduler isn't enabled. llvm-svn: 217560	2014-09-10 23:26:16 +00:00
Peter Collingbourne	d0ec5ab948	Add LLVMgold target to test dependencies. llvm-svn: 217557	2014-09-10 22:20:49 +00:00
Matt Arsenault	16e313343d	R600: Custom lower frem llvm-svn: 217553	2014-09-10 21:44:27 +00:00
Rafael Espindola	c435adcde0	Add doInitialization/doFinalization to DataLayoutPass. With this a DataLayoutPass can be reused for multiple modules. Once we have doInitialization/doFinalization, it doesn't seem necessary to pass a Module to the constructor. Overall this change seems in line with the idea of making DataLayout a required part of Module. With it the only way of having a DataLayout used is to add it to the Module. llvm-svn: 217548	2014-09-10 21:27:43 +00:00
Hal Finkel	8123630a21	Enable use of __builtin_assume_aligned when self-hosting Clang/LLVM trunk now have support for __builtin_assume_aligned, turn this && into an \|\| so we can use it ourselves. llvm-svn: 217545	2014-09-10 21:06:11 +00:00
Hal Finkel	71b7084112	[AlignmentFromAssumptions] Don't divide by zero for unknown starting alignment The routine that determines an alignment given some SCEV returns zero if the answer is unknown. In a case where we could determine the increment of an AddRec but not the starting alignment, we would compute the integer modulus by zero (which is illegal and traps). Prevent this by returning early if either the start or increment alignment is unknown (zero). llvm-svn: 217544	2014-09-10 21:05:52 +00:00
Dan Liew	4773d0b4bf	[sphinx cleanup] Fix sphinx warning introduced by r217537 llvm-svn: 217541	2014-09-10 20:43:03 +00:00
Gerolf Hoflehner	7b0abb89c2	[AArch64] Revert r216141 for cyclone The increase of the interleave factor to 4 has side-effects like performance losses eg. due to reminder loops being executed more frequently and may increase code size. It requires more analysis and careful heuristic tuning. Expect double digit gains in small benchmarks like lowercase.c and losses in puzzle.c. llvm-svn: 217540	2014-09-10 20:31:57 +00:00
Gerolf Hoflehner	008e5cdcba	[PassManager] Adding Hidden attribute to EnableMLSM option llvm-svn: 217539	2014-09-10 20:24:03 +00:00
Gerolf Hoflehner	24815d9b8f	[MergedLoadStoreMotion] Move pass enabling option to PassManagerBuilder llvm-svn: 217538	2014-09-10 19:55:29 +00:00
Nico Weber	4b916b21b4	Fix docs reference to inexistent class. Patch sent via telegraph by TNorthover. Thanks! llvm-svn: 217537	2014-09-10 19:50:55 +00:00
Rafael Espindola	71143ed24b	Remember to eraseFromParent after replaceAllUsesWith. llvm-svn: 217536	2014-09-10 19:39:41 +00:00
Adrian Prantl	1383d6f808	Cleanup: Use the appropriate API for accessing the DIVariable of a DBG_VALUE intrinsic. llvm-svn: 217533	2014-09-10 18:52:29 +00:00
Arnaud A. de Grandmaison	d17f96c9ad	[AArch64] Temporarily desactivate the PBQP test, while I investigate some leaks in the allocator llvm-svn: 217531	2014-09-10 18:40:18 +00:00
Alexey Samsonov	17a9cff55c	Make CallingConv::ID an alias of "unsigned". Summary: Make CallingConv::ID a plain unsigned instead of enum with a fixed set of valus. LLVM IR allows arbitraty calling conventions (you are free to write cc12345), and loading them as enum is an undefined behavior. This was reported by UBSan. Test Plan: llvm regression test suite Reviewers: nicholas Reviewed By: nicholas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5248 llvm-svn: 217529	2014-09-10 18:00:17 +00:00
Sanjay Patel	b653de1ada	Rename getMaximumUnrollFactor -> getMaxInterleaveFactor; also rename option names controlling this variable. "Unroll" is not the appropriate name for this variable. Clang already uses the term "interleave" in pragmas and metadata for this. Differential Revision: http://reviews.llvm.org/D5066 llvm-svn: 217528	2014-09-10 17:58:16 +00:00
Gerolf Hoflehner	e4f6684d1b	Removed misleading comment. llvm-svn: 217527	2014-09-10 17:54:50 +00:00
Gerolf Hoflehner	68570c63ca	Added missing blank llvm-svn: 217526	2014-09-10 17:52:27 +00:00
Hans Wennborg	0def0668e4	LangRef: @baz should be @bar in the COMDAT example llvm-svn: 217520	2014-09-10 17:05:08 +00:00
Arnaud A. de Grandmaison	0dbcfba659	[AArch64] Address Chad's post commit review comments for r217504 (PBQP experimental support) llvm-svn: 217518	2014-09-10 17:03:25 +00:00
Sanjay Patel	1893a25a5f	typo llvm-svn: 217516	2014-09-10 16:58:40 +00:00
Frederic Riss	a873414f87	Fix comments of createReplaceableForwardDecl() and createForwardDecl(). Noticed while trying to understand how the merge of forward decalred types and defintions work. Reviewers: echristo, dblaikie, aprantl Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5291 llvm-svn: 217514	2014-09-10 16:03:14 +00:00
Rafael Espindola	d8bd91ccfc	Replace a few virtual with override. llvm-svn: 217513	2014-09-10 15:50:08 +00:00
Arnaud A. de Grandmaison	cfb28f77a4	[AArch64] Pacify lld buildbot complaining about an unused static function in release build. llvm-svn: 217505	2014-09-10 14:24:02 +00:00
Arnaud A. de Grandmaison	c75dbbbdd6	[AArch64] Add experimental PBQP support This adds target specific support for using the PBQP register allocator on the AArch64, for the A57 cpu. By default, the PBQP allocator is not used, unless explicitely required on the command line with "-aarch64-pbqp". llvm-svn: 217504	2014-09-10 14:06:10 +00:00
Asiri Rathnayake	369c030633	[AArch 64] Use a constant pool load for weak symbol references when using static relocation model and small code model. Summary: currently we generate GOT based relocations for weak symbol references regardless of the underlying relocation model. This should be change so that in static relocation model we use a constant pool load instead. Patch from: Keith Walker Reviewers: Renato Golin, Tim Northover llvm-svn: 217503	2014-09-10 13:54:38 +00:00
Sid Manning	e7b92f0e81	Add missing HWEncoding to base register class. This change gives tblgen the information needed to fill in the HexagonRegEncodingTable. llvm-svn: 217500	2014-09-10 13:09:25 +00:00
David Majnemer	4015ea6b24	Attempt to pacify buildbots. llvm-svn: 217499	2014-09-10 13:01:03 +00:00
Tim Northover	ba1d704229	ARM: don't size-reduce STMs using the LR register. The only Thumb-1 multi-store capable of using LR is the PUSH instruction, which translates to STMDB, so we shouldn't convert STMIAs. Patch by Sergey Dmitrouk. llvm-svn: 217498	2014-09-10 12:53:28 +00:00
David Majnemer	44f51e5113	Object: Add support for bigobj This adds support for reading the "bigobj" variant of COFF produced by cl's /bigobj and mingw's -mbig-obj. The most significant difference that bigobj brings is more than 2**16 sections to COFF. bigobj brings a few interesting differences with it: - It doesn't have a Characteristics field in the file header. - It doesn't have a SizeOfOptionalHeader field in the file header (it's only used in executable files). - Auxiliary symbol records have the same width as a symbol table entry. Since symbol table entries are bigger, so are auxiliary symbol records. Write support will come soon. Differential Revision: http://reviews.llvm.org/D5259 llvm-svn: 217496	2014-09-10 12:51:52 +00:00
Daniel Sanders	24b6572645	[mips] Remove inverted predicates from MipsSubtarget that were only used by MipsCallingConv.td Summary: No functional change Reviewers: echristo, vmedic Reviewed By: echristo, vmedic Subscribers: echristo, llvm-commits Differential Revision: http://reviews.llvm.org/D5266 llvm-svn: 217494	2014-09-10 12:02:27 +00:00
Dan Liew	d4abb8d308	Don't attempt to run llvm-config in cmake/modules/Makefile when doing ``make clean`` because it won't be available. This is an attempt to unbreak buildbots broken by r217484. llvm-svn: 217490	2014-09-10 11:09:23 +00:00
Yaron Keren	b6bbf341cb	FunctionPassManager isn't used by the JIT anymore, it is used in clang EmitAssemblyHelper::CreatePasses() in BackendUtil.cpp. llvm-svn: 217488	2014-09-10 10:48:06 +00:00
Daniel Sanders	75ee6b4302	[mips] Return an ArrayRef from MipsCC::intArgRegs() and remove MipsCC::numIntArgRegs() Summary: No functional change. Reviewers: vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5265 llvm-svn: 217485	2014-09-10 10:37:03 +00:00
Dan Liew	f5c90062ca	Attempt to fix PR20884 This fixes the generation of broken LLVMExports.cmake file by the Autoconf/Makefile build system when --enable-shared is passed to configure. When --enable_shared is passed the Makefile.rules does not set the LLVMConfigLibs variable which cmake/modules/Makefile previously relied on. Now it runs the llvm-config command itself to get the library names. This still isn't perfect because the generated LLVM targets refer to the static libraries and not the shared library but that is much larger problem to fix. llvm-svn: 217484	2014-09-10 10:18:59 +00:00
Stepan Dyatkovskiy	fe134cdfa7	MergeFunctions: FunctionPtr has been renamed to FunctionNode. It's supposed to store additional pass information for current function here. That was the reason for name change. llvm-svn: 217483	2014-09-10 10:08:25 +00:00
Yuri Gorshenin	3939dec1f7	[asan-assembly-instrumentation] Added CFI directives to the generated instrumentation code. Summary: [asan-assembly-instrumentation] Added CFI directives to the generated instrumentation code. Reviewers: eugenis Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5189 llvm-svn: 217482	2014-09-10 09:45:49 +00:00
Job Noorman	eb19aea4f9	Drop the W postfix on the 16-bit registers. This ensures the inline assembly register constraints are properly recognised in TargetLowering::getRegForInlineAsmConstraint. llvm-svn: 217479	2014-09-10 06:58:14 +00:00
Justin Bogner	32cc7abdb6	llvm-cov: Remove an overly system specific test It appears that the -filename-equivalence option for testing llvm-cov doesn't work correctly with -show-expansions. I'm reverting this test to get the bots green while I look into fixing that. This partially reverts r217476 llvm-svn: 217478	2014-09-10 06:35:38 +00:00
Kai Nacke	d287094566	[MIPS] Add aliases for sync instruction used by Octeon CPU This commit adds aliases for the sync instruction (synciobdma, syncs, syncw, syncws) which are used by the Octeon CPU. Reviewed by D. Sanders llvm-svn: 217477	2014-09-10 06:10:24 +00:00
Justin Bogner	3f81d4953a	llvm-cov: Fix a misuse of ArrayRef::slice I introduced in r217430 It appears this code was completely untested, so using ArrayRef wrong didn't break anything obvious. llvm-svn: 217476	2014-09-10 06:06:07 +00:00
Craig Topper	7ff1592960	Use cast to MVT instead of EVT on a couple calls to getSizeInBits. llvm-svn: 217473	2014-09-10 04:51:36 +00:00
Lang Hames	0ae7980c90	[MCJIT] Remove redundant architecture check from RuntimeDyldMachOI386. llvm-svn: 217470	2014-09-10 00:13:42 +00:00
David Blaikie	d0f103775a	Sink PrevCU updating into DwarfUnit::addRange to ensure consistency So that the two operations in DwarfDebug couldn't get separated (because I accidentally separated them in some work in progress), put them together. While we're here, move DwarfUnit::addRange to DwarfCompileUnit, since it's not relevant to type units. llvm-svn: 217468	2014-09-09 23:13:01 +00:00
David Blaikie	deb174fed5	Remove DwarfDebug::PrevSection, PrevCU is sufficient for handling address range holes. PrevSection/PrevCU are used to detect holes in the address range of a CU to ensure the DW_AT_ranges does not include those holes. When we see a function with no debug info, though it may be in the same range as the prior and subsequent functions, there should be a gap in the CU's ranges. By setting PrevCU to null in that case, the range would not be extended to cover the gap. llvm-svn: 217466	2014-09-09 22:56:36 +00:00
NAKAMURA Takumi	1ab0cf0e28	SampleProfile.cpp: Prune a stray \param added in r217437. [-Wdocumentation] llvm-svn: 217465	2014-09-09 22:44:30 +00:00
Rafael Espindola	890db27b67	Handle common linkage correctly in the gold plugin. This is the plugin version of pr20882. This handles the case of every common symbol being in the IR. We will need some support from gold to handle the case where some symbols are in ELF and some in the IR. llvm-svn: 217458	2014-09-09 20:08:22 +00:00
Sanjay Patel	1191adf4df	Add a scheduling model for AMD 16H Jaguar (btver2). This is a first pass at a scheduling model for Jaguar. It's structured largely on the existing SandyBridge and SLM sched models. Using this model, in addition to turning on the PostRA scheduler, results in some perf wins on internal and 3rd party benchmarks. There's not much difference in LLVM's test-suite benchmarking subset of tests. Differential Revision: http://reviews.llvm.org/D5229 llvm-svn: 217457	2014-09-09 20:07:07 +00:00
Rafael Espindola	fe3842cda7	Merge alignment of common GlobalValue. Fixes pr20882. llvm-svn: 217455	2014-09-09 17:48:18 +00:00
Ed Maste	ea0257a9d9	Use armv6k default for FreeBSD/ARM Patch by Andrew Turner. llvm-svn: 217454	2014-09-09 17:47:24 +00:00
Bjorn Steinbrink	3c33150801	Add a test for hoisting instructions with metadata out of then/else blocks Test for the bug fixed in r215723. llvm-svn: 217453	2014-09-09 17:10:21 +00:00
Rafael Espindola	0910605af6	When merging two common GlobalValues, keep the largest. llvm-svn: 217451	2014-09-09 15:59:12 +00:00
Rafael Espindola	14a41ce802	Make this input file pass the verifier. This was not noticed before because llvm-link only runs the verifier on the result and these globals were not present in the result. llvm-svn: 217450	2014-09-09 15:40:12 +00:00
Rafael Espindola	dbb0bd1a42	Move some logic to ModuleLinker::shouldLinkFromSource. NFC. llvm-svn: 217449	2014-09-09 15:21:00 +00:00
NAKAMURA Takumi	bb4fac9050	ScalarOpts/LLVMBuild.txt: Prune unused dependency to IPA. llvm-svn: 217448	2014-09-09 15:00:38 +00:00
NAKAMURA Takumi	37ffecf06b	ScalarOpts/LLVMBuild.txt: Reorder. llvm-svn: 217447	2014-09-09 15:00:26 +00:00
NAKAMURA Takumi	06eb60a365	LLVMProfileData: Update LLVMBuild.txt corresponding to r217437. llvm-svn: 217446	2014-09-09 15:00:13 +00:00
Rafael Espindola	c83c8d4e74	Fix a use of an undefined value (the linkage). llvm-svn: 217445	2014-09-09 14:52:27 +00:00
Rafael Espindola	7fc29546f9	Prefer common over weak linkage when linking. This matches the behavior of ELF linkers. llvm-svn: 217443	2014-09-09 14:27:09 +00:00
Rafael Espindola	83a7ddbc9a	Simplify ModuleLinker::getLinkageResult. NFC. llvm-svn: 217441	2014-09-09 14:07:40 +00:00
Toma Tabacu	2664779b27	[mips] Add assembler support for .set mips0 directive. Summary: This directive is used to reset the assembler options to their initial values. Assembly programmers use it in conjunction with the ".set mipsX" directives. This patch depends on the .set push/pop directive (http://reviews.llvm.org/D4821). Contains work done by Matheus Almeida. Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D4957 llvm-svn: 217438	2014-09-09 12:52:14 +00:00
Diego Novillo	de1ab26f52	Re-factor sample profile reader into lib/ProfileData. Summary: This patch moves the profile reading logic out of the Sample Profile transformation into a generic profile reader facility in lib/ProfileData. The intent is to use this new reader to implement a sample profile reader/writer that can be used to convert sample profiles from external sources into LLVM. This first patch introduces no functional changes. It moves the profile reading code from lib/Transforms/SampleProfile.cpp into lib/ProfileData/SampleProfReader.cpp. In subsequent patches I will: - Add a bitcode format for sample profiles to allow for more efficient encoding of the profile. - Add a writer for both text and bitcode format profiles. - Add a 'convert' command to llvm-profdata to be able to convert between the two (and serve as entry point for other sample profile formats). Reviewers: bogner, echristo Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5250 llvm-svn: 217437	2014-09-09 12:40:50 +00:00
Daniel Sanders	2b746bc4ae	[mips] Move MipsTargetLowering::MipsCC::regSize() to MipsSubtarget::getGPRSizeInBytes() Summary: The GPR size is more a property of the subtarget than that of the ABI so move this information to the MipsSubtarget. No functional change. Reviewers: vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5009 llvm-svn: 217436	2014-09-09 12:11:16 +00:00
Pavel Chupin	e6617fc6d4	[x32] Emit callq for CALLpcrel32 Summary: In AT&T annotation for both x86_64 and x32 calls should be printed as callq in assembly. It's only a matter of correct mnemonic, object output is ok. Test Plan: trivial test added Reviewers: nadav, dschuff, craig.topper Subscribers: llvm-commits, zinovy.nis Differential Revision: http://reviews.llvm.org/D5213 llvm-svn: 217435	2014-09-09 11:54:12 +00:00
Daniel Sanders	4abcfe2cda	[mips] Don't cache IsO32 and IsFP64 in MipsTargetLowering::MipsCC Summary: Use a MipsSubtarget reference instead. No functional change. Reviewers: vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5008 llvm-svn: 217434	2014-09-09 10:46:48 +00:00
Tim Northover	0b0add517b	llvm-objdump: don't crash when __compact_unwind has no relocs. llvm-svn: 217433	2014-09-09 10:45:06 +00:00
Toma Tabacu	9db22db963	[mips] Add assembler support for .set push/pop directive. Summary: These directives are used to save the current assembler options (in the case of ".set push") and restore the previously saved options (in the case of ".set pop"). Contains work done by Matheus Almeida. Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D4821 llvm-svn: 217432	2014-09-09 10:15:38 +00:00
Renato Golin	63e27980da	ARM: Negative offset support problem This patch is to permit a negative offset usage for a non frame access. Patch by Igor Oblakov. llvm-svn: 217431	2014-09-09 09:57:59 +00:00
Justin Bogner	37aeb2d7c2	llvm-cov: Use ArrayRef::slice (NFC) llvm-svn: 217430	2014-09-09 09:15:52 +00:00
Patrik Hagglund	57d315b7c1	[MachineSinking] Conservatively clear kill flags after coalescing. This solves the problem of having a kill flag inside a loop with a definition of the register prior to the loop: %vreg368<def> ... Inside loop: %vreg520<def> = COPY %vreg368 %vreg568<def,tied1> = add %vreg341<tied0>, %vreg520<kill> => was coalesced into => %vreg568<def,tied1> = add %vreg341<tied0>, %vreg368<kill> MachineVerifier then complained: * Bad machine code: Virtual register killed in block, but needed live out. * The kill flag for %vreg368 is incorrect, and is cleared by this patch. This is similar to the clearing done at the end of MachineSinking::SinkInstruction(). Patch provided by Jonas Paulsson. Reviewed by Quentin Colombet and Juergen Ributzka. llvm-svn: 217427	2014-09-09 07:47:00 +00:00
Justin Bogner	e53be0662a	llvm-cov: Combine two types that were nearly identical (NFC) llvm-cov had a SourceRange type that was nearly identical to a CountedRegion except that it shaved off a couple of fields. There aren't likely to be enough of these for the minor memory savings to be worth the extra complexity here. llvm-svn: 217417	2014-09-09 05:32:18 +00:00
Justin Bogner	8e8aa3ff90	llvm-cov: Rename MappingRegion to coverage::CountedRegion (NFC) This name was too similar to CoverageMappingRegion, and the type really belongs in the coverage library anyway. llvm-svn: 217416	2014-09-09 05:32:14 +00:00
Bob Wilson	b3482af341	Set trunc store action to Expand for all X86 targets. When compiling without SSE2, isTruncStoreLegal(F64, F32) would return Legal, whereas with SSE2 it would return Expand. And since the Target doesn't seem to actually handle a truncstore for double -> float, it would just output a store of a full double in the space for a float hence overwriting other bits on the stack. Patch by Luqman Aden! llvm-svn: 217410	2014-09-09 01:13:36 +00:00
Justin Bogner	3f18834b4e	llvm-cov: Try to appease MSVC after r217404 llvm-svn: 217406	2014-09-08 21:31:43 +00:00
Dan Liew	ffcfe7fc53	Fix type error in insertvalue example in LangRef. %agg1 is of type {i32, float} and thus cannot be used where a type {i32, {float}} is expected. llvm-svn: 217405	2014-09-08 21:19:46 +00:00
Justin Bogner	85b0a03a69	llvm-cov: Use ErrorOr rather than an error_code* (NFC) llvm-svn: 217404	2014-09-08 21:04:00 +00:00
Hans Wennborg	18f0a986c1	Fast-ISel: Remove dead code after falling back from selecting call instructions (PR20863) Previously, fast-isel would not clean up after failing to select a call instruction, because it would have called flushLocalValueMap() which moves the insertion point, making SavedInsertPt in selectInstruction() invalid. Fixing this by making SavedInsertPt a member variable, and having flushLocalValueMap() update it. This removes some redundant code at -O0, and more importantly fixes PR20863. Differential Revision: http://reviews.llvm.org/D5249 llvm-svn: 217401	2014-09-08 20:24:10 +00:00
Sanjay Patel	394c333e3e	Group unsafe fmul math folds together for easier reading. No functional change. llvm-svn: 217399	2014-09-08 20:16:42 +00:00
Justin Bogner	84ae925f7a	llvm-cov: Remove dead code FunctionCoverageMapping::PrettyName was from a version of the tool during review, and isn't actually used currently. llvm-svn: 217398	2014-09-08 19:51:21 +00:00
Hal Finkel	337fa6e645	Don't static_cast invalid pointers UBSan complained about using static_cast on the invalid (tombstone, etc.) pointers used by DenseMap. Use a reinterpret_cast instead. llvm-svn: 217397	2014-09-08 19:31:25 +00:00
Alexey Samsonov	af023adb86	Be more careful in parsing Module::ModFlagBehavior value to make sure we don't do invalid load of an enum. Share the conversion code between llvm::Module implementation and the verifier. This bug was reported by UBSan. llvm-svn: 217395	2014-09-08 19:16:28 +00:00
Sanjay Patel	f4b7a6b030	Fix the FIXME that was just added in r217390 - remove a bunch of redundant fold permutations. The testcases for these folds already exist in test/CodeGen/X86/fp-fast.ll. llvm-svn: 217393	2014-09-08 18:22:51 +00:00
Sanjay Patel	8170dea280	group unsafe math folds together for easier reading Also added a FIXME regarding redundant folds for non-canonicalized constants. llvm-svn: 217390	2014-09-08 17:32:19 +00:00
Chad Rosier	bdbca15ccd	[AArch64] Enabled AA support for Cortex-A57. llvm-svn: 217381	2014-09-08 15:34:16 +00:00
Matt Arsenault	69bfb90419	R600/SI: Fix assertion from copying a TargetGlobalAddress Assert in scheduler from an inserted copy_to_regclass from a constant. This only seems to break sometimes when a constant initializer address is forced into VGPRs in a non-entry block. No test since the only case I've managed to hit only happens with a future patch, and that case will also not be a problem once scalar instructions are used in non-entry blocks. llvm-svn: 217380	2014-09-08 15:07:33 +00:00
Matt Arsenault	7ac9c4a074	R600/SI: Replace LDS atomics with no return versions llvm-svn: 217379	2014-09-08 15:07:31 +00:00
Matt Arsenault	9903ccf7ee	R600/SI: Add InstrMapping for noret atomics. Only handles LDS atomics for now, and will be used to replace atomics with no uses with the no return versions. llvm-svn: 217378	2014-09-08 15:07:27 +00:00
Chad Rosier	3528c1e4c6	[AArch64] Improve AA to remove unneeded edges in the AA MI scheduling graph. Patch by Sanjin Sijaric <ssijaric@codeaurora.org>! Phabricator Review: http://reviews.llvm.org/D5103 llvm-svn: 217371	2014-09-08 14:43:48 +00:00
Chad Rosier	c9f947744d	[AArch64] Enabled AA support for Cortex-A53. Patch by Sanjin Sijaric <ssijaric@codeaurora.org>! Phabricator Review: http://reviews.llvm.org/D5103 llvm-svn: 217370	2014-09-08 14:31:49 +00:00
Alexander Kornienko	0ed151a1ca	Add .clang-tidy configuration file to provide LLVM-optimized defaults for clang-tidy. Reviewers: chandlerc, djasper Reviewed By: djasper Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5236 llvm-svn: 217365	2014-09-08 13:30:00 +00:00
Sid Manning	ac3e325d67	Spelling correction Another trivial spelling change. llvm-svn: 217364	2014-09-08 13:05:23 +00:00
Andrew Trick	8fc3c6c093	Add a comment to getNewAlignmentDiff. llvm-svn: 217350	2014-09-07 23:16:24 +00:00
Hal Finkel	cebf0cc210	Make use @llvm.assume for loop guards in ScalarEvolution This adds a basic (but important) use of @llvm.assume calls in ScalarEvolution. When SE is attempting to validate a condition guarding a loop (such as whether or not the loop count can be zero), this check should also include dominating assumptions. llvm-svn: 217348	2014-09-07 21:37:59 +00:00
Hal Finkel	93873cc10e	Check for all known bits on ret in InstCombine From a combination of @llvm.assume calls (and perhaps through other means, such as range metadata), it is possible that all bits of a return value might be known. Previously, InstCombine did not check for this (which is understandable given assumptions of constant propagation), but means that we'd miss simple cases where assumptions are involved. llvm-svn: 217346	2014-09-07 21:28:34 +00:00
Hal Finkel	7e1844940e	Make use of @llvm.assume from LazyValueInfo This change teaches LazyValueInfo to use the @llvm.assume intrinsic. Like with the known-bits change (r217342), this requires feeding a "context" instruction pointer through many functions. Aside from a little refactoring to reuse the logic that turns predicates into constant ranges in LVI, the only new code is that which can 'merge' the range from an assumption into that otherwise computed. There is also a small addition to JumpThreading so that it can have LVI use assumptions in the same block as the comparison feeding a conditional branch. With this patch, we can now simplify this as expected: int foo(int a) { __builtin_assume(a > 5); if (a > 3) { bar(); return 1; } return 0; } llvm-svn: 217345	2014-09-07 20:29:59 +00:00
Hal Finkel	d67e463901	Add an AlignmentFromAssumptions Pass This adds a ScalarEvolution-powered transformation that updates load, store and memory intrinsic pointer alignments based on invariant((a+q) & b == 0) expressions. Many of the simple cases we can get with ValueTracking, but we still need something like this for the more complicated cases (such as those with an offset) that require some algebra. Note that gcc's __builtin_assume_aligned's optional third argument provides exactly for this kind of 'misalignment' offset for which this kind of logic is necessary. The primary motivation is to fixup alignments for vector loads/stores after vectorization (and unrolling). This pass is added to the optimization pipeline just after the SLP vectorizer runs (which, admittedly, does not preserve SE, although I imagine it could). Regardless, I actually don't think that the preservation matters too much in this case: SE computes lazily, and this pass won't issue any SE queries unless there are any assume intrinsics, so there should be no real additional cost in the common case (SLP does preserve DT and LoopInfo). llvm-svn: 217344	2014-09-07 20:05:11 +00:00
Hal Finkel	15aeaaf24a	Add additional patterns for @llvm.assume in ValueTracking This builds on r217342, which added the infrastructure to compute known bits using assumptions (@llvm.assume calls). That original commit added only a few patterns (to catch common cases related to determining pointer alignment); this change adds several other patterns for simple cases. r217342 contained that, for assume(v & b = a), bits in the mask that are known to be one, we can propagate known bits from the a to v. It also had a known-bits transfer for assume(a = b). This patch adds: assume(~(v & b) = a) : For those bits in the mask that are known to be one, we can propagate inverted known bits from the a to v. assume(v \| b = a) : For those bits in b that are known to be zero, we can propagate known bits from the a to v. assume(~(v \| b) = a): For those bits in b that are known to be zero, we can propagate inverted known bits from the a to v. assume(v ^ b = a) : For those bits in b that are known to be zero, we can propagate known bits from the a to v. For those bits in b that are known to be one, we can propagate inverted known bits from the a to v. assume(~(v ^ b) = a) : For those bits in b that are known to be zero, we can propagate inverted known bits from the a to v. For those bits in b that are known to be one, we can propagate known bits from the a to v. assume(v << c = a) : For those bits in a that are known, we can propagate them to known bits in v shifted to the right by c. assume(~(v << c) = a) : For those bits in a that are known, we can propagate them inverted to known bits in v shifted to the right by c. assume(v >> c = a) : For those bits in a that are known, we can propagate them to known bits in v shifted to the right by c. assume(~(v >> c) = a) : For those bits in a that are known, we can propagate them inverted to known bits in v shifted to the right by c. assume(v >=_s c) where c is non-negative: The sign bit of v is zero assume(v >_s c) where c is at least -1: The sign bit of v is zero assume(v <=_s c) where c is negative: The sign bit of v is one assume(v <_s c) where c is non-positive: The sign bit of v is one assume(v <=_u c): Transfer the known high zero bits assume(v <_u c): Transfer the known high zero bits (if c is know to be a power of 2, transfer one more) A small addition to InstCombine was necessary for some of the test cases. The problem is that when InstCombine was simplifying and, or, etc. it would fail to check the 'do I know all of the bits' condition before checking less specific conditions and would not fully constant-fold the result. I'm not sure how to trigger this aside from using assumptions, so I've just included the change here. llvm-svn: 217343	2014-09-07 19:21:07 +00:00
Hal Finkel	60db05896a	Make use of @llvm.assume in ValueTracking (computeKnownBits, etc.) This change, which allows @llvm.assume to be used from within computeKnownBits (and other associated functions in ValueTracking), adds some (optional) parameters to computeKnownBits and friends. These functions now (optionally) take a "context" instruction pointer, an AssumptionTracker pointer, and also a DomTree pointer, and most of the changes are just to pass this new information when it is easily available from InstSimplify, InstCombine, etc. As explained below, the significant conceptual change is that known properties of a value might depend on the control-flow location of the use (because we care that the @llvm.assume dominates the use because assumptions have control-flow dependencies). This means that, when we ask if bits are known in a value, we might get different answers for different uses. The significant changes are all in ValueTracking. Two main changes: First, as with the rest of the code, new parameters need to be passed around. To make this easier, I grouped them into a structure, and I made internal static versions of the relevant functions that take this structure as a parameter. The new code does as you might expect, it looks for @llvm.assume calls that make use of the value we're trying to learn something about (often indirectly), attempts to pattern match that expression, and uses the result if successful. By making use of the AssumptionTracker, the process of finding @llvm.assume calls is not expensive. Part of the structure being passed around inside ValueTracking is a set of already-considered @llvm.assume calls. This is to prevent a query using, for example, the assume(a == b), to recurse on itself. The context and DT params are used to find applicable assumptions. An assumption needs to dominate the context instruction, or come after it deterministically. In this latter case we only handle the specific case where both the assumption and the context instruction are in the same block, and we need to exclude assumptions from being used to simplify their own ephemeral values (those which contribute only to the assumption) because otherwise the assumption would prove its feeding comparison trivial and would be removed. This commit adds the plumbing and the logic for a simple masked-bit propagation (just enough to write a regression test). Future commits add more patterns (and, correspondingly, more regression tests). llvm-svn: 217342	2014-09-07 18:57:58 +00:00
David Blaikie	c42f9ac01c	DebugInfo: Do not use DW_FORM_GNU_addr_index in skeleton CUs, GDB 7.8 errors on this. It's probably not a huge deal to not do this - if we could, maybe the address could be reused by a subprogram low_pc and avoid an extra relocation, but it's just one per CU at best. llvm-svn: 217338	2014-09-07 17:31:42 +00:00
Hal Finkel	57f03dda49	Add functions for finding ephemeral values This adds a set of utility functions for collecting 'ephemeral' values. These are LLVM IR values that are used only by @llvm.assume intrinsics (directly or indirectly), and thus will be removed prior to code generation, implying that they should be considered free for certain purposes (like inlining). The inliner's cost analysis, and a few other passes, have been updated to account for ephemeral values using the provided functionality. This functionality is important for the usability of @llvm.assume, because it limits the "non-local" side-effects of adding llvm.assume on inlining, loop unrolling, etc. (these are hints, and do not generate code, so they should not directly contribute to estimates of execution cost). llvm-svn: 217335	2014-09-07 13:49:57 +00:00
Hal Finkel	74c2f355d2	Add an Assumption-Tracking Pass This adds an immutable pass, AssumptionTracker, which keeps a cache of @llvm.assume call instructions within a module. It uses callback value handles to keep stale functions and intrinsics out of the map, and it relies on any code that creates new @llvm.assume calls to notify it of the new instructions. The benefit is that code needing to find @llvm.assume intrinsics can do so directly, without scanning the function, thus allowing the cost of @llvm.assume handling to be negligible when none are present. The current design is intended to be lightweight. We don't keep track of anything until we need a list of assumptions in some function. The first time this happens, we scan the function. After that, we add/remove @llvm.assume calls from the cache in response to registration calls and ValueHandle callbacks. There are no new direct test cases for this pass, but because it calls it validation function upon module finalization, we'll pick up detectable inconsistencies from the other tests that touch @llvm.assume calls. This pass will be used by follow-up commits that make use of @llvm.assume. llvm-svn: 217334	2014-09-07 12:44:26 +00:00
Chandler Carruth	0a8151e69a	[x86] Revert my over-eager commit in r217332. I hadn't actually run all the tests yet and these combines have somewhat surprisingly far reaching effects. llvm-svn: 217333	2014-09-07 12:37:11 +00:00
Chandler Carruth	8405e8fff9	[x86] Tweak the rules surrounding 0,0 and 1,1 v2f64 shuffles and add support for MOVDDUP which is really important for matrix multiply style operations that do lots of non-vector-aligned load and splats. The original motivation was to add support for MOVDDUP as the lack of it regresses matmul_f64_4x4 by 5% or so. However, all of the rules here were somewhat suspicious. First, we should always be using the floating point domain shuffles, regardless of how many copies we have to make as a movapd is crazy faster than the domain switching cost on some chips. (Mostly because movapd is crazy cheap.) Because SHUFPD can't do the copy-for-free trick of the PSHUF instructions, there is no need to avoid canonicalizing on UNPCK variants, so do that canonicalizing. This also ensures we have the chance to form MOVDDUP. =] Second, we assume SSE2 support when doing any vector lowering, and given that we should just use UNPCKLPD and UNPCKHPD as they can operate on registers or memory. If vectors get spilled or come from memory at all this is going to allow the load to be folded into the operation. If we want to optimize for encoding size (the only difference, and only a 2 byte difference) it should be done much later, likely after RA. llvm-svn: 217332	2014-09-07 12:02:14 +00:00
Hans Wennborg	e5a96a5c06	Try to unflake AllocatorTest.TestAlignmentPastSlab llvm-svn: 217331	2014-09-07 05:14:29 +00:00
Hans Wennborg	44e2746418	BumpPtrAllocator: do the size check without moving any pointers Instead of aligning and moving the CurPtr forward, and then comparing with End, simply calculate how much space is needed, and compare that to how much is available. Hopefully this avoids any doubts about comparing addresses possibly derived from past the end of the slab array, overflowing, etc. Also add a test where aligning CurPtr would move it past End. llvm-svn: 217330	2014-09-07 04:24:31 +00:00
Lang Hames	9a89105248	[MCJIT] Revert partial RuntimeDyldELF cleanup that was prematurely committed in r217328. llvm-svn: 217329	2014-09-07 04:13:13 +00:00
Lang Hames	ca279c229a	[MCJIT] Rewrite RuntimeDyldMachO and its derived classes to use the 'Offset' field of RelocationValueRef, rather than the 'Addend' field. This is consistent with RuntimeDyldELF's use of RelocationValueRef, and more consistent with the semantics of the data being stored (the offset from the start of a section or symbol). llvm-svn: 217328	2014-09-07 04:03:32 +00:00
Lang Hames	69abd72e49	[MCJIT] Fix a bug RuntimeDyldImpl's read/writeBytesUnaligned methods. The previous implementation was writing to the high-bytes of integers on BE targets (when run on LE hosts). http://llvm.org/PR20640 llvm-svn: 217325	2014-09-07 02:05:26 +00:00
Matt Arsenault	76803bd384	R600/SI: Fix register class for some 64-bit atomics llvm-svn: 217323	2014-09-07 00:46:20 +00:00
Matt Arsenault	7b46a59b5a	R600/SI: Relax a few tests to help enable scheduler llvm-svn: 217320	2014-09-06 20:44:41 +00:00
Matt Arsenault	a9fcf62a9c	R600/SI: Fix broken check lines. Fix missing check, and hardcoded register numbers. llvm-svn: 217318	2014-09-06 20:37:56 +00:00
Saleem Abdulrasool	fcefa21b08	MC: correct DWARF line info for PE/COFF DWARF address ranges contain a reference to the debug_info section. This offset is an absolute relocation except on non-PE/COFF targets where it is section relative. We would emit this incorrectly, and trying to map the debug info from the address would fail. llvm-svn: 217317	2014-09-06 19:57:48 +00:00
Chandler Carruth	373b2b1728	[x86] Fix a pretty horrible bug and inconsistency in the x86 asm parsing (and latent bug in the instruction definitions). This is effectively a revert of r136287 which tried to address a specific and narrow case of immediate operands failing to be accepted by x86 instructions with a pretty heavy hammer: it introduced a new kind of operand that behaved differently. All of that is removed with this commit, but the test cases are both preserved and enhanced. The core problem that r136287 and this commit are trying to handle is that gas accepts both of the following instructions: insertps $192, %xmm0, %xmm1 insertps $-64, %xmm0, %xmm1 These will encode to the same byte sequence, with the immediate occupying an 8-bit entry. The first form was fixed by r136287 but that broke the prior handling of the second form! =[ Ironically, we would still emit the second form in some cases and then be unable to re-assemble the output. The reason why the first instruction failed to be handled is because prior to r136287 the operands ere marked 'i32i8imm' which forces them to be sign-extenable. Clearly, that won't work for 192 in a single byte. However, making thim zero-extended or "unsigned" doesn't really address the core issue either because it breaks negative immediates. The correct fix is to make these operands 'i8imm' reflecting that they can be either signed or unsigned but must be 8-bit immediates. This patch backs out r136287 and then changes those places as well as some others to use 'i8imm' rather than one of the extended variants. Naturally, this broke something else. The custom DAG nodes had to be updated to have a much more accurate type constraint of an i8 node, and a bunch of Pat immediates needed to be specified as i8 values. The fallout didn't end there though. We also then ceased to be able to match the instruction-specific intrinsics to the instructions so modified. Digging, this is because they too used i32 rather than i8 in their signature. So I've also switched those intrinsics to i8 arguments in line with the instructions. In order to make the intrinsic adjustments of course, I also had to add auto upgrading for the intrinsics. I suspect that the intrinsic argument types may have led everything down this rabbit hole. Pretty happy with the result. llvm-svn: 217310	2014-09-06 10:00:01 +00:00
Nick Lewycky	095b92e51b	Check whether the iterator p == the end iterator before trying to dereference it. This is a speculative fix for a failure found on the valgrind buildbot triggered by a clang test. llvm-svn: 217295	2014-09-06 01:16:42 +00:00
Alexey Samsonov	ba1ecbc7d3	Fix right shift by 64 bits detected on CXX/lex/lex.literal/lex.ext/p4.cpp test case on UBSan bootstrap bot. This fixes the last failure of "check-clang" in UBSan bootstrap bot. llvm-svn: 217294	2014-09-06 00:41:19 +00:00
Sean Silva	5e44ffdb3e	[docs] Document what "NFC" means in a commit message. llvm-svn: 217292	2014-09-06 00:19:16 +00:00
Lang Hames	018452e6bc	[MCJIT] Fix an iterator invalidation bug in MCJIT::finalizeObject. The finalizeObject method calls generateCodeForModule on each of the currently 'added' objects, but generateCodeForModule moves objects out of the 'added' set as it's called. To avoid iterator invalidation issues, the added set is copied out before any calls to generateCodeForModule. This should fix http://llvm.org/PR20851 . llvm-svn: 217291	2014-09-05 23:38:35 +00:00
Chandler Carruth	21d27ee95b	[x86] Fix an embarressing bug in the INSERTPS formation code. The mask computation was totally wrong, but somehow it didn't really show up with llc. I've added an assert that triggers on multiple existing test cases and updated one of them to show the correct value. There appear to still be more bugs lurking around insertps's mask. =/ However, note that this only really impacts the new vector shuffle lowering. llvm-svn: 217289	2014-09-05 23:19:45 +00:00
Akira Hatanaka	489decec67	[inline asm] Add a check in InlineAsm::ConstraintInfo::Parse to make sure '{' follows '~' in a clobber constraint string. Previously llc would hit an llvm_unreachable when compiling an inline-asm instruction with malformed constraint string "~x{21}". This commit enables LLParser to catch the error earlier and print a more helpful diagnostic. rdar://problem/14206559 llvm-svn: 217288	2014-09-05 22:30:32 +00:00
Sanjay Patel	75cc90eddc	Allow vector fsub ops with constants to get the same optimizations as scalars. This problem is bigger than just fsub, but this is the minimum fix to solve fneg for PR20556 ( http://llvm.org/bugs/show_bug.cgi?id=20556 ), and we solve zero subtraction with the same change. llvm-svn: 217286	2014-09-05 22:26:22 +00:00
Rafael Espindola	d31dc048ca	Fix pr20078. When linking llvm.global_ctors with the optional third element we have to handle it specially and only copy the elements whose keys were also copied. llvm-svn: 217281	2014-09-05 21:27:52 +00:00
Bjorn Steinbrink	5a121b2ef5	Restore the ability to check if LLVMCreateObjectFile was successful Summary: Until r216870 LLVMCreateObjectFile returned nullptr in case of an error, so callers could check if the call was successful. Now, it always returns an OwningBinary wrapped as an LLVMObjectFileRef, so callers can't check if the call was successul. This results in a segfault running e.g. llvm-c-test --object-list-sections < /dev/null So the old behaviour should be restored. Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5143 llvm-svn: 217279	2014-09-05 21:22:09 +00:00
Sanjay Patel	f4b8debc1c	clean up; NFC llvm-svn: 217278	2014-09-05 20:55:46 +00:00
Alexey Samsonov	d3e121331b	[DWARF parser] Fix nasty memory corruption in .dwo files handling. Forge a test case where llvm-symbolizer has to use external .dwo file to produce the inlining information. llvm-svn: 217270	2014-09-05 19:29:45 +00:00
Rafael Espindola	262b90321a	The gold tests also require ppc to be compiled in. We could create a tools/gold/PowerPC and a tools/gold/X86, but it doesn't seem worth it. llvm-svn: 217267	2014-09-05 19:01:12 +00:00
Rafael Espindola	b582372e87	Revert "Disable the fix for pr20793 because of a gnu ld bug." This reverts commit r217211. Both the bfd ld and gold outputs were valid. They were using a Rela relocation, so the value present in the relocated location was not used, which caused me to misread the output. llvm-svn: 217264	2014-09-05 18:03:38 +00:00
Lang Hames	3e930a3e83	[MCJIT] Const-ify the symbol lookup operations on RuntimeDyld. llvm-svn: 217263	2014-09-05 18:00:16 +00:00
Adrian Prantl	e5e8ce64de	Set the parent pointer of cloned DBG_VALUE instructions correctly. Fixes PR20523. When spilling variables onto the stack, spillVirtReg() is setting the parent pointer of the cloned DBG_VALUE intrinsic for the stack location to the parent pointer of the original intrinsic. MachineInstr parent pointers should however always point to the parent basic block. MBB is shadowing the MBB member variable. The instruction still ends up being inserted into the right basic block, because it's inserted after MI which serves as the iterator. I failed at constructing a reliable testcase for this, see http://llvm.org/bugs/show_bug.cgi?id=20523 for a large testcases. llvm-svn: 217260	2014-09-05 17:10:10 +00:00
Toma Tabacu	901ba6ea2e	[mips] Change Feature-related types from unsigned to uint64_t in MipsAsmParser. No functional changes. Summary: Found a couple of cases where unsigned was still being used. These two should be the last ones in the (entire) Mips backend. Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D5028 llvm-svn: 217257	2014-09-05 16:32:09 +00:00
Matt Arsenault	8ae5961065	R600/SI: Use same complex patterns for DS atomics This fixes hitting the same negative base offset problem that was already fixed for regular loads and stores. llvm-svn: 217256	2014-09-05 16:24:58 +00:00
Daniel Sanders	1fcea42e67	[mips] Marked the Trap-on-Condition instructions as Mips II Patch by Vasileios Kalintiris. Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D5173 llvm-svn: 217255	2014-09-05 15:50:13 +00:00
Toma Tabacu	3c24b0483a	[mips] Rename data members and member functions in MipsAssemblerOptions. Summary: Use the naming convention from the LLVM Coding Standards. Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D4972 llvm-svn: 217254	2014-09-05 15:43:21 +00:00
Jan Vesely	d1d1334064	R600: Fix FROUND round halfway cases away from zero Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <tom@stellard.net> llvm-svn: 217250	2014-09-05 14:26:54 +00:00
Tom Stellard	0c93c9ecee	R600/SI: Fix bug in SIInstrInfo::legalizeOpWithMove() We must constrain the destination register class of legalized operands to a VGPR class or else the illegal operand may be folded back into the instruction by the register coalescer. This fixes a bug in add.ll that will be uncovered by future commits. llvm-svn: 217249	2014-09-05 14:08:01 +00:00
Tom Stellard	80942a1b50	R600/SI: Use S_ADD_U32 and S_SUB_U32 for low half of 64-bit operations https://bugs.freedesktop.org/show_bug.cgi?id=83416 llvm-svn: 217248	2014-09-05 14:07:59 +00:00
Chandler Carruth	19cbf0e2c4	[x86] Factor out the zero vector insertion logic in the new vector shuffle lowering for integer vectors and share it from v4i32, v8i16, and v16i8 code paths. Ironically, the SSE2 v16i8 code for this is now better than the SSSE3! =] Will have to fix the SSSE3 code next to just using a single pshufb. llvm-svn: 217240	2014-09-05 10:36:31 +00:00
Tim Northover	c879d06a85	ARM: cover all sub-architecture enumerators to keep compiler happy. No change in behaviour (hopefully). llvm-svn: 217233	2014-09-05 07:56:46 +00:00

... 2 3 4 5 6 ...

107757 Commits