llvm-project

Commit Graph

Author	SHA1	Message	Date
Adam Nemet	053c4e825c	[AVX512] Fix miscompile for unpack r189189 implemented AVX512 unpack by essentially performing a 256-bit unpack between the low and the high 256 bits of src1 into the low part of the destination and another unpack of the low and high 256 bits of src2 into the high part of the destination. I don't think that's how unpack works. AVX512 unpack simply has more 128-bit lanes but other than it works the same way as AVX. So in each 128-bit lane, we're always interleaving certain parts of both operands rather different parts of one of the operands. E.g. for this: __v16sf a = { 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 }; __v16sf b = { 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31 }; __v16sf c = __builtin_shufflevector(a, b, 0, 8, 1, 9, 4, 12, 5, 13, 16, 24, 17, 25, 20, 28, 21, 29); we generated punpcklps (notice how the elements of a and b are not interleaved in the shuffle). In turn, c was set to this: 0 16 1 17 4 20 5 21 8 24 9 25 12 28 13 29 Obviously this should have just returned the mask vector of the shuffle vector. I mostly reverted this change and made sure the original AVX code worked for 512-bit vectors as well. Also updated the tests because they matched the logic from the code. llvm-svn: 217602	2014-09-11 16:51:10 +00:00
Sanjay Patel	1eb5047ddb	Add triple and remove hashes to account for buildbot differences in comment strings. llvm-svn: 217601	2014-09-11 16:08:44 +00:00
Benjamin Kramer	9e5b4a5827	Move constant-sized bitvector to the stack. llvm-svn: 217600	2014-09-11 15:58:39 +00:00
Sanjay Patel	7bd228a82e	Combine fmul vector FP constants when unsafe math is allowed. This is an extension of the change made with r215820: http://llvm.org/viewvc/llvm-project?view=revision&revision=215820 That patch allowed combining of splatted vector FP constants that are multiplied. This patch allows combining non-uniform vector FP constants too by relaxing the check on the type of vector. Also, canonicalize a vector fmul in the same way that we already do for scalars - if only one operand of the fmul is a constant, make it operand 1. Otherwise, we miss potential folds. This fold is also done by -instcombine, but it's possible that extra fmuls may have been generated during lowering. Differential Revision: http://reviews.llvm.org/D5254 llvm-svn: 217599	2014-09-11 15:45:27 +00:00
Rafael Espindola	1ac0ec86b7	Merge GetAddrOfCXXConstructor and GetAddrOfCXXDonstructor. NFC. llvm-svn: 217598	2014-09-11 15:42:06 +00:00
Sanjay Patel	4cb54e0a78	typo llvm-svn: 217597	2014-09-11 15:41:01 +00:00
Aaron Watry	1885e53a75	R600: Add cmpxchg instruction for evergreen Refactored the R600_LDS_1A2D class a bit to get it to actually work. It seemed to be previously unused and broken. We also have to disable the conversion to the noret variant for now in R600ISelLowering because the getLDSNoRetOp method only handles 1A1D LDS ops. Someone can feel free to modify the AMDGPU::getLDSNoRetOp method to work for more than 1A1D variants of LDS operations. It's being left as a future TODO for now. Signed-off-by: Aaron Watry <awatry at gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217596	2014-09-11 15:02:54 +00:00
Aaron Watry	3ffc560094	R600: Test local atomics for evergreen Now that the operations are all implemented, we can test this sub-arch here. Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217595	2014-09-11 15:02:52 +00:00
Aaron Watry	21591670c9	R600: Add LDS_WRXCHG[_RET] instructions for Evergreen. Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217594	2014-09-11 15:02:49 +00:00
Aaron Watry	564a22e995	R600: Add LDS_MIN_[U]INT[_RET] instructions for Evergreen Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217593	2014-09-11 15:02:47 +00:00
Aaron Watry	e51794f2fa	R600: Add LDS_XOR[_RET] instructions for Evergreen Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217592	2014-09-11 15:02:46 +00:00
Aaron Watry	cffa0114c7	R600: Add LDS_OR[_RET] instructions for Evergreen Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217591	2014-09-11 15:02:44 +00:00
Aaron Watry	a7f122da60	R600: Add LDS_AND[_RET] instructions for Evergreen Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> llvm-svn: 217590	2014-09-11 15:02:43 +00:00
Aaron Watry	62a0af4a0d	R600: Add LDS_MAX_[U]INT[_RET] instructions for Evergreen This was only present for SI before. Cayman may still be missing, but I am unable to test that currently. v2: Don't create atomicrmw max tests in separate file Signed-off-by: Aaron Watry <awatry@gmail.com> Reviewed-by: Matt Arsenault <matthew.arsenault@amd.com> CC: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 217589	2014-09-11 15:02:41 +00:00
Roman Kashitsyn	650ecb53ca	Fix bug 20892 - clang-format does not handle C-style comments Summary: http://llvm.org/bugs/show_bug.cgi?id=20892 Add support of C-style formatting enabling/disabling directives. Now the following two styles are supported: // clang-format on /* clang-format on */ The flexibility in comments (support of extra spaces and/or slashes, etc.) is deliberately avoided to simplify search in large code bases. Reviewers: djasper Reviewed By: djasper Subscribers: cfe-commits, curdeius, klimek Differential Revision: http://reviews.llvm.org/D5309 llvm-svn: 217588	2014-09-11 14:47:20 +00:00
Tobias Grosser	ee46b0c8be	Remove executable bit on all header files Some header files had been marked executable by accident. llvm-svn: 217587	2014-09-11 14:33:36 +00:00
Benjamin Kramer	22c68ef845	Avoid some unnecessary SmallVector copies. No functionality change. llvm-svn: 217586	2014-09-11 14:13:49 +00:00
Renato Golin	128485ba47	ARM Unwind syntax This patch fixes the bad argument that GAS accepted but the IAS didn't, ie. {#0x20}, moving it to {0x20} which both accept. It also makes the ARMv7+ save/restore correct by using VFP instructions rather than old co-processor ones. Fixes PR20529. llvm-svn: 217585	2014-09-11 12:57:02 +00:00
Evgeniy Stepanov	e579c76bd5	[asan] Preserve existing LD_PRELOAD setting on Android. llvm-svn: 217584	2014-09-11 12:20:29 +00:00
Daniel Sanders	f605184180	[docs] Mention character array constants in docs/LangRef.rst Summary: They were used in the 'Module Structure' example but weren't otherwise documented. Credit to Reed Kotler for noticing. Reviewers: hans Reviewed By: hans Subscribers: hans, llvm-commits Differential Revision: http://reviews.llvm.org/D5191 llvm-svn: 217583	2014-09-11 12:02:59 +00:00
Tilmann Scheller	ee0e49398c	[ARM] Add Thumb-2 code size optimization regression test for LSR (register). llvm-svn: 217582	2014-09-11 10:45:50 +00:00
Tilmann Scheller	579379a6f4	[ARM] Add Thumb-2 code size optimization regression test for LSR (immediate). llvm-svn: 217581	2014-09-11 10:42:17 +00:00
Arnaud A. de Grandmaison	3690266739	[AArch64] Reenable the PBQP test now that the leak issue has been fixed. David Blaikie's commits r217563 & r217564, which added shared_ptr to the CostPool have fixed some memory leak issues exposed by the PBQP with coalescing constraints. The sanitizer bot was failing because of those leaks. Now that the leaks are gone, we can reenable the aarch64/pbqp test. llvm-svn: 217580	2014-09-11 10:39:52 +00:00
Tilmann Scheller	0c1249ac60	[ARM] Add Thumb-2 code size optimization regression test for LSL (register). llvm-svn: 217579	2014-09-11 10:33:39 +00:00
Tim Northover	1684a614b3	[mach-o]: support optional "0x" prefix for -image_base llvm-svn: 217578	2014-09-11 10:31:46 +00:00
Tim Northover	5d95bd7037	[mach-o]: tighten up diagnostics for -image_base option The provided base must also be a multiple of the system's page size, which is a reasonable enough demand. Also check the other diagnostics more thoroughly. llvm-svn: 217577	2014-09-11 10:31:42 +00:00
Tilmann Scheller	7430df486e	[ARM] Add Thumb2 code size optimization regression test for LSL (immediate). llvm-svn: 217576	2014-09-11 10:29:42 +00:00
Chandler Carruth	1ec3e4e4bd	[x86] Fixup r217565 which baked in an assumption about the function name that breaks on some platforms. This part of the test just doesn't matter... llvm-svn: 217575	2014-09-11 10:21:25 +00:00
Hal Finkel	f83e1f7f66	[AlignmentFromAssumptions] Don't crash just because the target is 32-bit We used to crash processing any relevant @llvm.assume on a 32-bit target (because we'd ask SE to subtract expressions of differing types). I've copied our 'simple.ll' test, but with the data layout from arm-linux-gnueabihf to get some meaningful test coverage here. llvm-svn: 217574	2014-09-11 08:40:17 +00:00
Alexander Musman	fdfa8557c0	NULL->nullptr llvm-svn: 217573	2014-09-11 08:10:57 +00:00
Tim Northover	7b33f21f3d	[mach-o]: Support deprecated -seg1addr alias for -image_base Because NO LINKER MAY CHANGE. EVER. Even if it's a complete rewrite from scratch. llvm-svn: 217572	2014-09-11 07:56:20 +00:00
David Xu	f7aff68fe3	Build correct vector filled with undef nodes llvm-svn: 217570	2014-09-11 05:10:28 +00:00
Justin Bogner	560cbf506b	Fix a couple of -Wsign-compare warnings introduced in r217556 llvm-svn: 217569	2014-09-11 03:37:42 +00:00
Rui Ueyama	a726ef12a4	Make getFlavor function. The dangling "else" at the end of #if looked a bit error-prone. Make it a separate function. No functionality change. llvm-svn: 217568	2014-09-11 03:13:20 +00:00
Justin Bogner	8e5f548b81	utils: Teach lldbDataFormatters how to format ArrayRefs llvm-svn: 217567	2014-09-11 01:47:38 +00:00
Nick Kledzik	50bda292c8	If lld is renamed (or symlinked) to "ld" automatically pick the right flavor. The existing system linkers on Darwin and Linux are called "ld". We'd like to eventually drop in lld as "ld" and have it just work. But lld is a universal linker that requires the first option to be -flavor to know which command line mode to emulate (gnu or darwin). This change tests if argv[0] is "ld" and if so, if the tool was built on MacOSX then assume the darwin flavor otherwise the gnu flavor. There are two test cases which copy lld to "ld" and then run it. One for darwin and one for linux. llvm-svn: 217566	2014-09-11 00:52:05 +00:00
Chandler Carruth	292303dd47	[x86] FileCheck-ize this test. llvm-svn: 217565	2014-09-11 00:13:35 +00:00
David Blaikie	792e8f3c02	Use CostPool::PoolRef typedef some more Cleanup to 217563 suggested by Lang Hames in post-commit review. llvm-svn: 217564	2014-09-11 00:08:54 +00:00
David Blaikie	ebd7f671df	shared_ptrify ownershp of PoolEntries in PBQP's CostPool Leveraging both intrusive shared_ptr-ing (std::enable_shared_from_this) and shared_ptr<T>-owning-U (to allow external users to hold std::shared_ptr<CostT> while keeping the underlying PoolEntry alive). The intrusiveness could be removed if we had a weak_set that implicitly removed items from the set when their underlying data went away. This /might/ fix an existing memory leak reported by LeakSanitizer in r217504. llvm-svn: 217563	2014-09-10 23:54:45 +00:00
Matt Arsenault	61a528adc7	R600/SI: Fix losing chain when fixing reg class of loads. The lost chain resulting in earlier side effecting nodes being deleted. llvm-svn: 217561	2014-09-10 23:26:19 +00:00
Matt Arsenault	2e9911205f	R600/SI: Report offset in correct units for st64 DS instructions Need to convert the 64 element offset into bytes, not just the element size like the normal case instructions. Noticed by inspection. This can't be hit now because st64 instructions aren't emitted during instruction selection, and the post-RA scheduler isn't enabled. llvm-svn: 217560	2014-09-10 23:26:16 +00:00
Alexey Samsonov	5c825967ea	[TSan] Use common flags in the same way as all the other sanitizers llvm-svn: 217559	2014-09-10 23:08:06 +00:00
Alexey Samsonov	611c906cb3	[Sanitizer] Get rid of Symbolizer::Get() and Symbolizer::GetOrNull(). We may as well just use Symbolizer::GetOrInit() in all the cases. Don't call Symbolizer::Get() early in tools initialization: these days it doesn't do any important setup work, and we may as well create the symbolizer the first time it's actually needed. llvm-svn: 217558	2014-09-10 22:45:09 +00:00
Peter Collingbourne	d0ec5ab948	Add LLVMgold target to test dependencies. llvm-svn: 217557	2014-09-10 22:20:49 +00:00
DeLesley Hutchins	4e38f100b5	Thread Safety Analysis: major update to thread safety TIL. Numerous changes, including: * Changed the way variables and instructions are handled in basic blocks to be more efficient. * Eliminated SExprRef. * Simplified futures. * Fixed documentation. * Compute dominator and post dominator trees. llvm-svn: 217556	2014-09-10 22:12:52 +00:00
Fariborz Jahanian	a00a6526dc	More test for "void *" argument as index of a dictionary literal. llvm-svn: 217555	2014-09-10 22:12:13 +00:00
Matt Arsenault	16e313343d	R600: Custom lower frem llvm-svn: 217553	2014-09-10 21:44:27 +00:00
Ben Langmuir	4a78c9eec4	Remove a couple of fixed paths that snuck into my test from 217550 I forgot to fix these again the second time I copy-and-pasted. llvm-svn: 217552	2014-09-10 21:41:43 +00:00
Jim Ingham	77fd738f58	Rework how resetting breakpoints in changed modules works. Try to match up old locations with new ones if possible. Next up some test cases... llvm-svn: 217551	2014-09-10 21:40:47 +00:00
Ben Langmuir	5418f40127	Avoid a couple of assertions when preprocessing with modules 1. We were hitting the NextIsPrevious assertion because we were trying to merge decl chains that were independent of each other because we had no Sema object to allow them to find existing decls. This is fixed by delaying loading the "preloaded" decls until Sema is available. 2. We were trying to get identifier info from an annotation token, which asserts. The fix is to special-case the module annotations in the preprocessed output printer. Fixed in a single commit because when you hit 1 you almost invariably hit 2 as well. llvm-svn: 217550	2014-09-10 21:29:41 +00:00

1 2 3 4 5 ...

182500 Commits All Branches Search

182500 Commits

All Branches