llvm-project

Commit Graph

Author	SHA1	Message	Date
Hal Finkel	b6d0d6b263	[PowerPC] Generate unaligned vector loads using intrinsics instead of regular loads Altivec vector loads on PowerPC have an interesting property: They always load from an aligned address (by rounding down the address actually provided if necessary). In order to generate an actual unaligned load, you can generate two load instructions, one with the original address, one offset by one vector length, and use a special permutation to extract the bytes desired. When this was originally implemented, I generated these two loads using regular ISD::LOAD nodes, now marked as aligned. Unfortunately, there is a problem with this: The alignment of a load does not contribute to its identity, and SDNodes are uniqued. So, imagine that we have some unaligned load, L1, that is not aligned. The routine will create two loads, L1(aligned) and (L1+16)(aligned). Further imagine that there had already existed a load (L1+16)(unaligned) with the same chain operand as the load L1. When (L1+16)(aligned) is created as part of the lowering of L1, this load is also the (L1+16)(unaligned) node, just now marked as aligned (because the new alignment overwrites the old). But the original users of (L1+16)(unaligned) now get the data intended for the permutation yielding the data for L1, and (L1+16)(unaligned) no longer exists to get its own permutation-based expansion. This was PR19991. A second potential problem has to do with the MMOs on these loads, which can be used by AA during instruction scheduling to break chain-based dependencies. If the new "aligned" loads get the MMO from the original unaligned load, this does not represent the fact that it will load data from below the original address. Normally, this would not matter, but this load might be combined with another load pair for a previous vector, and then the dependency on the otherwise-ignored lower bytes can matter.
To fix both problems, instead of generating the necessary loads using regular ISD::LOAD instructions, ppc_altivec_lvx intrinsics are used. These are provided with MMOs with a conservative address range. Unfortunately, I no longer have a failing test case (since PR19991 was reported, other changes in CodeGen have forced this bug back into hiding again). Nevertheless, this should fix the underlying problem. llvm-svn: 214481	2014-08-01 05:20:41 +00:00
Matthew Gardiner	f39ebbe613	Change the encoding of the Triple string exchanged across GDB-RSP and update documentation to suit, as suggested by Jason Molenda and discussed in: http://lists.cs.uiuc.edu/pipermail/lldb-commits/Week-of-Mon-20140721/011978.html Differential Revision: http://reviews.llvm.org/D4704 llvm-svn: 214480	2014-08-01 05:12:23 +00:00
Suyog Sarda	56c9a87035	This patch implements transform for pattern "(A & ~B) ^ (~A) -> ~(A & B)". Differential Revision: http://reviews.llvm.org/D4653 llvm-svn: 214479	2014-08-01 05:07:20 +00:00
Suyog Sarda	1c6c2f69f7	This patch implements transform for pattern "(A | B) & ((~A) ^ B) -> (A & B)". Differential Revision: http://reviews.llvm.org/D4628 llvm-svn: 214478	2014-08-01 04:59:26 +00:00
Suyog Sarda	52324c82cc	This patch implements transform for pattern "( A & (~B)) | (A ^ B) -> (A ^ B)" Differential Revision: http://reviews.llvm.org/D4652 llvm-svn: 214477	2014-08-01 04:50:31 +00:00
Suyog Sarda	16d646594e	This patch implements transform for pattern "(A & B) | ((~A) ^ B) -> (~A ^ B)". Patch Credit to Ankit Jain ! Differential Revision: http://reviews.llvm.org/D4655 llvm-svn: 214476	2014-08-01 04:41:43 +00:00
Tom Stellard	aa9a1a813e	R600/SI: Fix build warning llvm-svn: 214475	2014-08-01 02:05:57 +00:00
Eric Fiselier	9cd8ed4e23	Update linux test results file llvm-svn: 214474	2014-08-01 01:59:09 +00:00
Richard Smith	46bb581a03	[modules] Remove IRGen special case for emitting implicit special members if they're somehow missing a body. Looks like this was left behind when the loop was generalized, and it's not been problematic before because without modules, a used, implicit special member function declaration must be a definition. This was resulting in us trying to emit a constructor declaration rather than a definition, and producing a constructor missing its member initializers. llvm-svn: 214473	2014-08-01 01:56:39 +00:00
Manman Ren	264da422b9	Add comments to debug info testing case. llvm-svn: 214472	2014-08-01 01:47:13 +00:00
Richard Trieu	428058fb9a	Remove this pointer that is converted to bool. In well-defined contexts, the this pointer is always non-null. If the this pointer is null, it is undefined and the compiler may optimize it away by assuming it is non-null. The null checks are pushed into the callers. llvm-svn: 214471	2014-08-01 01:42:01 +00:00
Juergen Ributzka	82ecc7ff2a	[FastISel][AArch64] Fix the immediate versions of the {s|u}{add|sub}.with.overflow intrinsics. ADDS and SUBS cannot encode negative immediates or immediates larger than 12 bits. This fix checks if the immediate version can be used under these constraints and if we can convert ADDS to SUBS or vice versa to support negative immediates. Also update the test cases to test the immediate versions. llvm-svn: 214470	2014-08-01 01:25:55 +00:00
Hal Finkel	3604bf7fe7	[PowerPC] Recognize consecutive memory accesses from intrinsics When generating unaligned vector loads, we need to search for other loads or stores nearby offset by one vector width. If we find one, then we know that we can safely generate another aligned load at that address. Otherwise, we must generate the next load using an offset of the vector width minus one byte (so we don't read off the end of the allocation if the base unaligned address happened to be aligned at runtime). We had previously done this using only other vector loads and stores, but did not consider the PowerPC-specific vector load/store intrinsics. Now we'll also consider vector intrinsics. By itself, this change is a feature enhancement, but is a necessary step toward fixing the underlying problem behind PR19991. llvm-svn: 214469	2014-08-01 01:02:01 +00:00
Reid Kleckner	71ff3f223f	MS inline asm: Fix null SMLoc when 'ptr' is missing after dword & co This improves the diagnostics from the regular assembler, but more importantly it fixes an assertion when parsing inline assembly. Test landing in Clang. llvm-svn: 214468	2014-08-01 00:59:22 +00:00
Tom Stellard	b4a313a76f	R600/SI: Do abs/neg folding with ComplexPatterns Abs/neg folding has moved out of foldOperands and into the instruction selection phase using complex patterns. As a consequence of this change, we now prefer to select the 64-bit encoding for most instructions and the modifier operands have been dropped from integer VOP3 instructions. llvm-svn: 214467	2014-08-01 00:32:39 +00:00
Tom Stellard	6655dd699f	TableGen: Allow AddedComplexity values to be negative This is useful for cases when stand-alone patterns are preferred to the patterns included in the instruction definitions. Instead of requiring that stand-alone patterns set a larger AddedComplexity value, which can be confusing to new developers, this allows us to reduce the complexity of the included patterns to achieve the same result. There will be test cases for this added to the R600 backend in a future commit. llvm-svn: 214466	2014-08-01 00:32:36 +00:00
Tom Stellard	0e975cf6e5	R600/SI: Simplify and fix handling of VOP2 in SIInstrInfo::legalizeOperands We were incorrectly assuming that all VOP2 instructions can read SGPRs in Src0, but this is not true for instructions that read carry-in from VCC. The old logic has been replaced with new logic which checks the defined register classes of the VOP2 instruction to determine whether or not to legalize the operands. llvm-svn: 214465	2014-08-01 00:32:35 +00:00
Tom Stellard	6407e1e632	R600/SI: Fold immediates when shrinking instructions This will prevent us from using extra MOV instructions once we prefer selecting 64-bit instructions. llvm-svn: 214464	2014-08-01 00:32:33 +00:00
Tom Stellard	86d12ebdbd	R600/SI: Fix incorrect commute operation in shrink instructions pass We were commuting the instruction but still shrinking it using the original opcode. NOTE: This is a candidate for the 3.5 branch. llvm-svn: 214463	2014-08-01 00:32:28 +00:00
Hans Wennborg	05fb383d2b	clang-format vs plugin: claim support for VS 14 CTP too llvm-svn: 214461	2014-08-01 00:02:24 +00:00
Kevin Enderby	0d928a142b	Add support for the X86 secure guard extensions instructions in assembler (SGX). This allows assembling the two new instructions, encls and enclu for the SKX processor model. Note the diffs are a bit bigger than one might think: to fit the new MRM_CF and MRM_D7 in the right places, things had to be renumbered and shuffled down, causing a bit more diffs. rdar://16228228 llvm-svn: 214460	2014-07-31 23:57:38 +00:00
Richard Smith	24d166ca77	Fix buildbot: work around missing GCC C++11 feature. llvm-svn: 214459	2014-07-31 23:52:38 +00:00
Richard Smith	6de7a24782	[modules] Maintain an AST invariant across module load/save: if any declaration of a function has a resolved exception specification, then all declarations of the function do. We should probably improve the AST representation to make this implicit (perhaps only store the exception specification on the canonical declaration), but this fixes things for now. The testcase for this (which used to assert) also exposes the actual bug I was trying to reduce here: we sometimes fail to emit the body of an imported special member function definition. Fix for that to follow. llvm-svn: 214458	2014-07-31 23:46:44 +00:00
Reid Kleckner	b7e2f6015a	X86 MC: Don't crash on empty memory operand parens Instead, create an absolute memory operand. Fixes PR20504. llvm-svn: 214457	2014-07-31 23:26:35 +00:00
Reid Kleckner	0c5da97dd0	X86 MC: Reject invalid segment registers before a memory operand colon Previously we would execute unreachable during object emission. llvm-svn: 214456	2014-07-31 23:03:22 +00:00
Louis Gerbarg	09b8cdee12	White space fix. llvm-svn: 214455	2014-07-31 22:57:46 +00:00
Eric Fiselier	993dfb1eef	Change lit.cfg to allow whitespace before comments llvm-svn: 214454	2014-07-31 22:56:52 +00:00
Rui Ueyama	140f6029ce	[PECOFF] Fix section header. The PE/COFF spec says that SizeOfRawData field in the section header must be a multiple of FileAlignment from the optional header. LLD emits 512 as FileAlignment, so it must have been a multiple of 512. LLD did not follow that. It emitted the actual section size without the last padding as the SizeOfRawData. Although it's not correct as per the spec, the Windows loader doesn't seem to actually bother to check that. Executables created by LLD worked fine. However, tools dealing with executable files may expect it to be the correct value, and one instance of it is the mt.exe tool distributed as a part of Windows SDK. If CMake is invoked with "-E vs_link_exe" option, it silently runs mt.exe to embed a resource file to the resulting file. And mt.exe sometimes breaks an input file if its section header does not follow the standard. That caused a mysterious error that CMake with Ninja occasionally produces a broken executable. This patch fixes the section header to make mt.exe and other tools happy. llvm-svn: 214453	2014-07-31 22:40:35 +00:00
Hal Finkel	9e5298549e	Make classof in MemSDNode consistent with MemIntrinsicSDNode If INTRINSIC_W_CHAIN and INTRINSIC_VOID are MemIntrinsicSDNodes, and a MemIntrinsicSDNode is a MemSDNode, then INTRINSIC_W_CHAIN and INTRINSIC_VOID must be MemSDNodes too. Noticed by inspection. llvm-svn: 214452	2014-07-31 22:31:33 +00:00
Jan Vesely	3047950964	R600: Modernize work item intrinsics test Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Matt Arsenault <Matthew.Arsenault@amd.com> llvm-svn: 214451	2014-07-31 22:11:03 +00:00
Richard Smith	8acb4280c5	Factor out exception specification information from FunctionProtoType::ExtProtoInfo. Most of the users of these fields don't care about the other ExtProtoInfo bits and just want to talk about the exception specification. llvm-svn: 214450	2014-07-31 21:57:55 +00:00
Louis Gerbarg	67474e3755	Make sure no loads resulting from load->switch DAGCombine are marked invariant Currently when DAGCombine converts loads feeding a switch into a switch of addresses feeding a load the new load inherits the isInvariant flag of the left side. This is incorrect since invariant loads can be reordered in cases where it is illegal to reorder normal loads. This patch adds an isInvariant parameter to getExtLoad() and updates all call sites to pass in the data if they have it or false if they don't. It also changes the DAGCombine to use that data to make the right decision when creating the new load. llvm-svn: 214449	2014-07-31 21:45:05 +00:00
Johannes Doerfert	99f6630c82	[Refactor] Remove unnecessary check and function + Perform the parallelism check on the innermost loop only once. + Inline the markOpenmpParallel function. + Rename all IslAstUserPayload * into Payload to make it consistent. llvm-svn: 214448	2014-07-31 21:34:32 +00:00
Johannes Doerfert	0eefb0258f	[Refactor] Use nicer print callback function in IslAst llvm-svn: 214447	2014-07-31 21:33:49 +00:00
Aaron Ballman	ef940aaf07	Loop hint pragmas sometimes do not contain an identifier option (such as #pragma unroll(4)). Check explicitly that the token we stored was an identifier. Amends r214432 llvm-svn: 214446	2014-07-31 21:24:32 +00:00
Tyler Nowicki	b5a65395cc	Improve the remark generated for -Rpass-missed. The current remark is ambiguous and makes it sound like explicitly specifying vectorization will allow the loop to be vectorized. This is not the case. The improved remark directs the user to -Rpass-analysis=loop-vectorize to determine the cause of the pass-miss. Reviewed by Arnold Schwaighofer llvm-svn: 214445	2014-07-31 21:22:22 +00:00
Eric Christopher	59265af9eb	Revert "Remove MCObjectDisassembler.cpp as it is untested and unused." as it is apparently used, but the build didn't return errors weirdly. This reverts commits 214437 and 214438. llvm-svn: 214444	2014-07-31 21:18:38 +00:00
Zachary Turner	7c1bc2b8ae	Make CMake choose the target architecture according to the build. Previously, CMake was invoking the test runner and not specifying what architecture to use when building test executables. The Makefiles for the test executables then had logic to choose x64 by default. This doesn't work on Windows because the test compiler would then try to link against the 64-bit MSVCRT and not find them since only the 32-bit MSVCRT was in the path. This patch addresses this by figuring out, at CMake time, whether or not you are building LLDB with a 64 or 32-bit toolchain. Then, it explicitly passes this value to the test runner, causing the test runner to build tests whose architecture matches that of LLDB itself. This can still be overridden by setting the CMake variable LLDB_TEST_EXECUTABLE_ARCH=(x64|x86) llvm-svn: 214443	2014-07-31 21:07:41 +00:00
Dan Albert	ea32c105a6	Make Android's ctype_base::mask unsigned. Keeping the regex code sane is much easier if we match the other platforms and use an unsigned mask. llvm-svn: 214442	2014-07-31 21:04:08 +00:00
Zachary Turner	7d186c0152	Remove shell-globbing from all test makefiles. llvm-svn: 214441	2014-07-31 21:03:11 +00:00
Tyler Nowicki	9fe497fcac	Improve the remark generated when a variable that is used outside the loop is not a reduction or induction variable. Reviewed by Arnold Schwaighofer llvm-svn: 214440	2014-07-31 21:02:40 +00:00
Rafael Espindola	ceb23381ec	Replaces a few pointers with references in llvm-nm.cpp. This opens the way for a few std::unique_ptr cleanups. llvm-svn: 214439	2014-07-31 21:00:10 +00:00
Aaron Ballman	3866a8f2ca	Fixing CMake problems with MCObjectDisassembler.cpp not existing. llvm-svn: 214438	2014-07-31 20:48:54 +00:00
Eric Christopher	90a06fa97a	Remove MCObjectDisassembler.cpp as it is untested and unused. llvm-svn: 214437	2014-07-31 20:44:46 +00:00
Aaron Ballman	ef7aef8fe5	Implemented a diagnostic to handle multiple, distinct ownership_return attributes on the same declaration. This removes a FIXME from the code. llvm-svn: 214436	2014-07-31 20:44:26 +00:00
Hans Wennborg	914efc7239	msbuild integration: remove duplicated lines and BOM from 2014 integration (PR20341) llvm-svn: 214435	2014-07-31 20:33:22 +00:00
Rafael Espindola	cf8dd265c5	DWOHolder takes ownership of the constructor argument, so use std::unique_ptr. Thanks to David Blaikie for noticing it. llvm-svn: 214434	2014-07-31 20:26:42 +00:00
Rafael Espindola	a04bb5b1e1	Use a reference instead of a pointer. This makes using a std::unique_ptr in the caller more convenient. llvm-svn: 214433	2014-07-31 20:19:36 +00:00
Tyler Nowicki	0c9b34b3ec	Add a state variable to the loop hint attribute. This patch is necessary to support constant expressions which replaces the integer value in the loop hint attribute with an expression. The integer value was also storing the pragma’s state for options like vectorize(enable/disable) and the pragma unroll and nounroll directive. The state variable is introduced to hold the state of those options/pragmas. This moves the validation of the state (keywords) from SemaStmtAttr handler to the loop hint annotation token handler. Resubmit with changes to try to fix the build-bot issue. Reviewed by Aaron Ballman llvm-svn: 214432	2014-07-31 20:15:14 +00:00
Eric Fiselier	18fab4684d	Add documentation for lit's --show-unsupported flag llvm-svn: 214431	2014-07-31 20:11:13 +00:00
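
The four bitwise rewrites listed above (r214476 through r214479) can be sanity-checked by brute force. The following is a minimal sketch, not the InstCombine implementation: it models values as 8-bit integers (an assumption chosen so the search is exhaustive), and the names `MASK`, `inv`, `TRANSFORMS`, and `find_counterexamples` are hypothetical helpers introduced only for this illustration.

```python
from itertools import product

MASK = 0xFF  # assumption: model 8-bit integers so every (A, B) pair can be tried


def inv(x: int) -> int:
    """Bitwise NOT truncated to the width of MASK."""
    return ~x & MASK


# (pattern as written in the commit message, left-hand side, right-hand side)
TRANSFORMS = [
    ("(A & ~B) ^ (~A) -> ~(A & B)",
     lambda a, b: (a & inv(b)) ^ inv(a),
     lambda a, b: inv(a & b)),
    ("(A | B) & ((~A) ^ B) -> (A & B)",
     lambda a, b: (a | b) & (inv(a) ^ b),
     lambda a, b: a & b),
    ("(A & (~B)) | (A ^ B) -> (A ^ B)",
     lambda a, b: (a & inv(b)) | (a ^ b),
     lambda a, b: a ^ b),
    ("(A & B) | ((~A) ^ B) -> (~A ^ B)",
     lambda a, b: (a & b) | (inv(a) ^ b),
     lambda a, b: inv(a) ^ b),
]


def find_counterexamples():
    """Return (pattern, a, b) for the first mismatch of each pattern, if any."""
    failures = []
    for name, lhs, rhs in TRANSFORMS:
        for a, b in product(range(MASK + 1), repeat=2):
            if lhs(a, b) != rhs(a, b):
                failures.append((name, a, b))
                break
    return failures


if __name__ == "__main__":
    print(find_counterexamples() or "no counterexamples found for 8-bit values")
```

Because each identity is purely bitwise, agreement on all 8-bit inputs implies agreement at any width; the exhaustive loop is only a convenient way to cover every per-bit combination.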

179853 Commits All Branches Search