llvm-project

Author	SHA1	Message	Date
Reid Kleckner	a73c7781bd	[Support] Beef up and expose the response file parsing in llvm::cl The plan is to use it for clang and lld. Major behavior changes: - We can now parse UTF-16 files that have a byte order mark. - PR16209: Don't drop backslashes on the floor if they don't escape anything. The actual parsing loop was based on code from Clang's driver.cpp, although it's been rewritten to track its state with control flow rather than state variables. Reviewers: hans Differential Revision: http://llvm-reviews.chandlerc.com/D1170 llvm-svn: 186587	2013-07-18 16:52:05 +00:00
Joey Gouly	e25a86b082	Change 'n' to 'N' to keep consistent with other instructions. llvm-svn: 186576	2013-07-18 12:00:25 +00:00
Joey Gouly	943dd59ed5	[ARMv8] Add NEON instructions VCVT{A, N, P, M}. llvm-svn: 186574	2013-07-18 11:53:22 +00:00
Richard Sandiford	5109321042	[SystemZ] Use RNSBG This should be the last of the R.SBG patches for now. llvm-svn: 186573	2013-07-18 10:40:35 +00:00
Joey Gouly	dce01792f2	Add Thumb tests for the ARMv8 FP instructions that I recently added. Also, fix the namespace for two instructions that I missed previously. llvm-svn: 186572	2013-07-18 10:20:25 +00:00
Richard Sandiford	297f7d2724	[SystemZ] Generalize RxSBG SRA case The original code only folded SRA into ROTATE ... SELECTED BITS if there was no outer shift. This patch splits out that check and generalises it slightly. The extra cases aren't really that interesting, but this is paving the way for RNSBG support. llvm-svn: 186571	2013-07-18 10:14:55 +00:00
Richard Sandiford	7878b852e6	[SystemZ] Use RXSBG Extend the previous R.SBG patches to handle XORs. llvm-svn: 186570	2013-07-18 10:06:15 +00:00
Richard Sandiford	5cbac96730	[SystemZ] Rename and formatting fixes In hindsight, using "RISBG" for something that can be any type of R.SBG instruction was a bit confusing, so this renames it to RxSBG. That might not be the best choice either, since there is an instruction called RXSBG, but hopefully the lower-case letter stands out enough. While there I fixed a couple of GNUisms that had crept in -- sorry about that! llvm-svn: 186569	2013-07-18 09:45:08 +00:00
Joey Gouly	923d593fb5	Remove the extra leading 0 from VMAXNMND. The N3VDIntnp pattern takes bits<5> and I gave it 6 bits. Thanks to Jiangning Liu for spotting it! llvm-svn: 186568	2013-07-18 09:34:35 +00:00
Vladimir Medic	3467b90786	This patch extends mips register parsing methods to allow indexed register parsing. The corresponding test cases are added to the patch. llvm-svn: 186567	2013-07-18 09:28:35 +00:00
Craig Topper	ad1fff9be7	Fix copy and paste bug from r186491 to make v2f64 use MOVAPD/MOVUPD as it should. llvm-svn: 186566	2013-07-18 07:16:44 +00:00
Chandler Carruth	f0546402af	Reapply r186316 with a fix for one bug where the code could walk off the end of a vector. This was found with ASan. I've had one other report of a crasher, but thus far been unable to reproduce the crash. It may well be fixed with this version, and if not I'd like to get more information from the build bots about what is happening. See r186316 for the full commit log for the new implementation of the SROA algorithm. llvm-svn: 186565	2013-07-18 07:15:00 +00:00
Nadav Rotem	7d7036b8c6	SLPVectorizer: Speedup isConsecutive (that checks if two addresses are consecutive in memory) by checking for additional patterns that don't need to go through SCEV. llvm-svn: 186563	2013-07-18 04:33:20 +00:00
Hal Finkel	1860763c76	PPC: Support dynamic allocas with large alignment Support for dynamic stack alignments in the PPC backend has been unfinished, in part because it depends on dynamic stack realignment (which I only just recently implemented fully). Now we can also support dynamic allocas with higher than the default target stack alignment (16 bytes). In order to round-up the requested size to the maximum requested alignment, we need an additional register to hold the rounded-up size. We're already using one scavenged register to hold the previous stack-pointer value (which needs to be stored with the signal-safe stdux update), and so when we have dynamic allocas and a large alignment, we allocate two emergency spill slots for the scavenger. llvm-svn: 186562	2013-07-18 04:28:21 +00:00
Rafael Espindola	213c4cb18c	Remove dead code. llvm-svn: 186561	2013-07-18 03:29:51 +00:00
Rafael Espindola	4d10587ff3	Convert two uses of fstat to sys::fs::status. llvm-svn: 186560	2013-07-18 03:04:20 +00:00
Nick Lewycky	0dcefdfcab	Give 'hasPath' a longer but clearer name 'isPotentiallyReachable'. Also expand the comment. No functionality change. This change broken out of http://llvm-reviews.chandlerc.com/D996 . llvm-svn: 186558	2013-07-18 02:34:51 +00:00
Hal Finkel	f05d6c7843	PPC: Add base-pointer support to builtin setjmp/longjmp First, this changes the base-pointer implementation to remove an unnecessary complication (and one that is incompatible with how builtin SjLj is implemented): instead of using r31 as the base pointer when it is not needed as a frame pointer, now the base pointer will always be r30 when needed. Second, we introduce another pseudo register, BP, which is used just like the FP pseudo register to refer to the base register before we know for certain what register it will be. Third, we now save BP into the jmp_buf, and restore r30 from that slot in longjmp. If the function that called setjmp did not use a base pointer, then r30 will be overwritten by the setjmp-calling-function's restore code. FP restoration (which is restored into r31) works the same way. llvm-svn: 186545	2013-07-17 23:50:51 +00:00
Eric Christopher	7ab2c3ecb2	Add comparison operators for DIDescriptors to fix c++98 fallout of operator bool change. Also convert a variable in DebugIR. llvm-svn: 186544	2013-07-17 23:25:22 +00:00
Nadav Rotem	43639e8492	Fix a comment. llvm-svn: 186541	2013-07-17 22:41:16 +00:00
Eli Friedman	d2eb07acae	Handle '.' correctly in hex float literal parsing. There were a couple of different loops that were not handling '.' correctly in APFloat::convertFromHexadecimalString; these mistakes could lead to assertion failures and incorrect rounding for overlong hex float literals. Fixes PR16643. llvm-svn: 186539	2013-07-17 22:17:29 +00:00
Stephen Lin	03f9fbbcd7	Restore r181216, which was partially reverted in r182499. llvm-svn: 186533	2013-07-17 20:06:03 +00:00
Rafael Espindola	331aebae92	Fix a funny typo. Thanks to Aaron Ballman for noticing. llvm-svn: 186532	2013-07-17 19:58:28 +00:00
Nadav Rotem	3072baeb9c	Add a micro optimization to catch cases where the PtrA equals PtrB. llvm-svn: 186531	2013-07-17 19:52:25 +00:00
Rafael Espindola	16431fe7a7	Add FILE_SHARE_WRITE to openFileForRead. This should fix the windows bots. It looks like the failing tests are of the form prog1 > file prog2 file and prog2 fails trying to read the file. The best fix would probably be to close stdout/stderr in prog1, but it was not the intention of 186511 to change this, so just restore the old behavior for now. llvm-svn: 186530	2013-07-17 19:44:07 +00:00
Aaron Ballman	fbb104513b	Silencing an MSVC warning about signed vs unsigned comparison mismatches. llvm-svn: 186529	2013-07-17 19:43:13 +00:00
Akira Hatanaka	365d16e345	[mips] Use "foreach" loop to make register definitions more concise. llvm-svn: 186528	2013-07-17 19:09:27 +00:00
Michael Gottesman	f87a6ae65f	Add -*- C++ -*- to InstrEmitter.h. llvm-svn: 186527	2013-07-17 18:53:29 +00:00
Vladimir Medic	74593e6577	This patch checks for valid mnemonics at the beginning of parseInstruction method, thus giving the user the right error message for non-existing instructions. llvm-svn: 186512	2013-07-17 15:00:42 +00:00
Rafael Espindola	a0d9b6b693	Split openFileForRead into Windows and Unix versions. This has some advantages: * Lets us use native, utf16 windows functions. * Easy to produce good errors on windows about trying to use a directory when we want a file. * Simplifies the unix version a bit. llvm-svn: 186511	2013-07-17 14:58:25 +00:00
Hal Finkel	ec7cd26968	Fix comparisons of alloca alignment in inliner merging Duncan pointed out a mistake in my fix in r186425 when only one of the allocas being compared had the target-default alignment. This is essentially his suggested solution. Thanks! llvm-svn: 186510	2013-07-17 14:32:41 +00:00
Vladimir Medic	29410f9c91	Implement eret and deret (return from exception) instructions for Mips. Test examples are given. llvm-svn: 186507	2013-07-17 14:05:19 +00:00
Joey Gouly	df68600f44	[ARMv8] Add support for the NEON instructions vmaxnm/vminnm. This adds a new class for non-predicable NEON instructions and a new DecoderNamespace for v8 NEON instructions. llvm-svn: 186504	2013-07-17 13:59:38 +00:00
Duncan Sands	e2cd13906e	Ensure sys::getProcessTriple always uses a normalized triple. Patch by Thomas B. Jablin, from PR16636. llvm-svn: 186501	2013-07-17 11:01:05 +00:00
Richard Osborne	9ff96e6f9b	[XCore] Ensure implicit operands aren't lost on the return instruction. Patch by Robert Lytton. llvm-svn: 186500	2013-07-17 10:58:37 +00:00
Craig Topper	55475d448b	Teach x86 fast-isel to use AVX opcodes for vector stores when AVX is enabled. llvm-svn: 186496	2013-07-17 06:58:23 +00:00
Craig Topper	4f55b0efd2	Make x86 fast-isel correctly choose between aligned and unaligned operations for vector stores. Fixes PR16640. llvm-svn: 186491	2013-07-17 05:57:45 +00:00
JF Bastien	cd4c64d234	Fix ARMFastISel::ARMEmitIntExt shift emission My patch 'r183551 - ARM FastISel integer sext/zext improvements' was incorrect when emitting ARM register-immediate ASR, LSL, LSR instructions: they are pseudo-instructions in ARMInstrInfo.td and I should have used MOVsi instead. This is not an issue when code is generated through a .s file, but is an issue when generated straight to a .o (-filetype=obj). llvm-svn: 186489	2013-07-17 05:46:46 +00:00
Hal Finkel	40f76d5830	PPC: Add CTR-register clobber to builtin setjmp Because the builtin longjmp implementation uses a CTR-based indirect jump, when the control flow arrives at the builtin setjmp call, the CTR register has necessarily been clobbered. Correspondingly, this adds CTR to the list of implicit definitions of the builtin setjmp pseudo instruction. We don't need to add CTR to the implicit definitions of builtin longjmp because, even though it does clobber the CTR register, the control flow cannot return to inside the loop unless there is also a builtin setjmp call. llvm-svn: 186488	2013-07-17 05:35:44 +00:00
Craig Topper	24048c9440	Mark a method 'const' and another 'static'. llvm-svn: 186485	2013-07-17 03:54:53 +00:00
Craig Topper	1c4d667ca5	Make a few more static string pointers constant. llvm-svn: 186484	2013-07-17 03:43:10 +00:00
Rafael Espindola	b6fea4c618	Don't fall back to copy + delete in rename. Rename's documentation says "Files are renamed as if by POSIX rename()", and it is used for atomically updating output files from a temporary. Having rename fall back to a non-atomic copy has the potential to hide bugs, like using a temporary file in /tmp instead of a unique name next to the final destination. llvm-svn: 186483	2013-07-17 03:33:41 +00:00
Craig Topper	9fdc70e846	Make constant string pointer into an array to remove a pointer lookup for every access. llvm-svn: 186482	2013-07-17 03:11:32 +00:00
NAKAMURA Takumi	212c80ac5d	raw_ostream.cpp: Include <fcntl.h> so that O_BINARY is provided. Otherwise, llvm::outs() would be set to O_TEXT by default. llvm/test/Object/check_binary_output.ll is expected to pass on win32. llvm-svn: 186480	2013-07-17 02:21:10 +00:00
Nadav Rotem	2202317fce	SLPVectorizer: Accelerate the isConsecutive check by replacing the subtraction of the two values with a simple SCEV expression that adds the offset to one of the pointers that we compare. llvm-svn: 186479	2013-07-17 00:48:31 +00:00
Hal Finkel	a7c54e8cf4	PPC: Implement base pointer and stack realignment This builds on some frame-lowering code that has existed since 2005 (r24224) but was disabled in 2008 (r48188) because it needed base pointer support to function correctly. This implementation follows the strategy suggested by Dale Johannesen in r48188 where the following comment was added: This does not currently work, because the delta between old and new stack pointers is added to offsets that reference incoming parameters after the prolog is generated, and the code that does that doesn't handle a variable delta. You don't want to do that anyway; a better approach is to reserve another register that retains to the incoming stack pointer, and reference parameters relative to that. And now we do exactly that. If we don't need a frame pointer, then we use r31 as a base pointer. If we do need a frame pointer, then we use r30 as a base pointer. The base pointer retains the value of the stack pointer before it was decremented in the prologue. We then use the base pointer to resolve all negative frame indices. The basic scheme follows that for base pointers in the X86 backend. We use a base pointer when we need to dynamically realign the incoming stack pointer. This currently applies only to static objects (dynamic allocas with large alignments, and base-pointer support in SjLj lowering will come in future commits). llvm-svn: 186478	2013-07-17 00:45:52 +00:00
Craig Topper	8fc4096fab	Move string pointer from being a static class member to just a static global in the one file it's needed in. llvm-svn: 186476	2013-07-17 00:31:35 +00:00
Manman Ren	8bfde8917e	Add getModuleFlag(StringRef Key) to query a module flag given Key. No functionality change. llvm-svn: 186470	2013-07-16 23:21:16 +00:00
Nadav Rotem	d2e8c4cdea	flip the scev minus direction to simplify the code. llvm-svn: 186466	2013-07-16 22:57:06 +00:00
Nadav Rotem	8f924f3891	SLPVectorizer: Improve the compile time of isConsecutive by adding a simple constant-gep check before using SCEV. This check does not always work because not all of the GEPs use a constant offset, but it happens often enough to reduce the number of times we use SCEV. llvm-svn: 186465	2013-07-16 22:51:07 +00:00
Lang Hames	57a113eb0d	Related to r181161 - Indirect branches may not be the last branch in a basic block. Blocks that have an indirect branch terminator, even if it's not the last terminator, should still be treated as unanalyzable. <rdar://problem/14437274> Reducing a useful regression test case is proving difficult - I hope to have one soon. llvm-svn: 186461	2013-07-16 22:01:40 +00:00
Tilmann Scheller	305bb90442	ARM: Add support for the Thumb2 PLI alternate literal form. This adds an instruction alias to make the assembler recognize the alternate literal form: pli [PC, #+/-<imm>] See A8.8.129 in the ARM ARM (DDI 0406C.b). Fixes <rdar://problem/14403733>. llvm-svn: 186459	2013-07-16 21:52:34 +00:00
Rafael Espindola	6d35481c94	Add a wrapper for open. This centralizes the handling of O_BINARY and opens the way for hiding more differences (like how open behaves with directories). llvm-svn: 186447	2013-07-16 19:44:17 +00:00
Jakob Stoklund Olesen	efeb3a1969	Remove floats from live range splitting costs. These floats all represented block frequencies anyway, so just use the BlockFrequency class directly. Some floating point computations remain in tryLocalSplit(). They are estimating spill weights which are still floats. llvm-svn: 186435	2013-07-16 18:26:18 +00:00
Jakob Stoklund Olesen	c5454ff046	Reapply r185393. Original commit message: Remove floating point computations from SpillPlacement.cpp. Patch by Benjamin Kramer! Use the BlockFrequency class instead of floats in the Hopfield network computations. This rescales the node Bias field from a [-2;2] float range to two block frequencies BiasN and BiasP pulling in opposite directions. This construct has a more predictable behavior when block frequencies saturate. The per-node scaling factors are no longer necessary, assuming the block frequencies around a bundle are consistent. This patch can cause the register allocator to make different spilling decisions. The differences should be small. llvm-svn: 186434	2013-07-16 18:26:15 +00:00
Juergen Ributzka	3d527d80b8	[X86] Use min/max to optimize unsigned vector comparison on X86 Use PMIN/PMAX for UGE/ULE vector comparisons to reduce the number of required instructions. This trick also works for UGT/ULT, but there is no advantage in doing so. It wouldn't reduce the number of instructions and it would actually reduce performance. Reviewer: Ben radar:5972691 llvm-svn: 186432	2013-07-16 18:20:45 +00:00
Peter Collingbourne	8b77f18da0	Make SpecialCaseList match full strings, as documented, using anchors. Differential Revision: http://llvm-reviews.chandlerc.com/D1149 llvm-svn: 186431	2013-07-16 17:56:07 +00:00
Juergen Ributzka	c16f86020c	Test commit to verify write access. llvm-svn: 186429	2013-07-16 17:44:23 +00:00
Reid Kleckner	7df03c2e30	[Support] Add a Unicode conversion wrapper from UTF16 to UTF8 This is to support parsing UTF16 response files in LLVM/lib/Option for lld and clang. Reviewers: hans Differential Revision: http://llvm-reviews.chandlerc.com/D1138 llvm-svn: 186426	2013-07-16 17:14:33 +00:00
Hal Finkel	9caa8f7ba7	When the inliner merges allocas, it must keep the larger alignment For safety, the inliner cannot decrease the alignment on an alloca when merging it with another. I've included two variants of the test case for this: one with DataLayout available, and one without. When DataLayout is not available, if only one of the allocas uses the default alignment (getAlignment() == 0), then they cannot be safely merged. llvm-svn: 186425	2013-07-16 17:10:55 +00:00
Nadav Rotem	26bf9a0c75	SLPVectorizer: Reduce the compile time of the consecutive store lookup. Process groups of stores in chunks of 16. llvm-svn: 186420	2013-07-16 15:25:17 +00:00
Rafael Espindola	e08b59f81d	Create files with mode 666. This matches the behavior of other unix tools. llvm-svn: 186414	2013-07-16 14:10:07 +00:00
Reid Kleckner	5f4535b974	[Support] Fix some warnings when self-hosting clang on Windows llvm-svn: 186413	2013-07-16 14:04:08 +00:00
Ulrich Weigand	1d4dbda5b9	[APFloat] PR16573: Avoid losing mantissa bits in ppc_fp128 to double truncation When truncating to a format with fewer mantissa bits, APFloat::convert will perform a right shift of the mantissa by the difference of the precision of the two formats. Usually, this will result in just the mantissa bits needed for the target format. One special situation is if the input number is denormal. In this case, the right shift may discard significant bits. This is usually not a problem, since truncating a denormal usually results in zero (underflow) after normalization anyway, since the result format's exponent range is usually smaller than the target format's. However, there is one case where the latter property does not hold: when truncating from ppc_fp128 to double. In particular, truncating a ppc_fp128 whose first double of the pair is denormal should result in just that first double, not zero. The current code however performs an excessive right shift, resulting in lost result bits. This is then caught in the APFloat::normalize call performed by APFloat::convert and causes an assertion failure. This patch checks for the scenario of truncating a denormal, and attempts to (possibly partially) replace the initial mantissa right shift by decrementing the exponent, if doing so will still result in a valid target format exponent. 
Index: test/CodeGen/PowerPC/pr16573.ll
===================================================================
--- test/CodeGen/PowerPC/pr16573.ll (revision 0)
+++ test/CodeGen/PowerPC/pr16573.ll (revision 0)
@@ -0,0 +1,11 @@
+; RUN: llc < %s | FileCheck %s
+
+target triple = "powerpc64-unknown-linux-gnu"
+
+define double @test() {
+  %1 = fptrunc ppc_fp128 0xM818F2887B9295809800000000032D000 to double
+  ret double %1
+}
+
+; CHECK: .quad -9111018957755033591
+
Index: lib/Support/APFloat.cpp
===================================================================
--- lib/Support/APFloat.cpp (revision 185817)
+++ lib/Support/APFloat.cpp (working copy)
@@ -1956,6 +1956,23 @@
     X86SpecialNan = true;
   }
 
+  // If this is a truncation of a denormal number, and the target semantics
+  // has larger exponent range than the source semantics (this can happen
+  // when truncating from PowerPC double-double to double format), the
+  // right shift could lose result mantissa bits. Adjust exponent instead
+  // of performing excessive shift.
+  if (shift < 0 && isFiniteNonZero()) {
+    int exponentChange = significandMSB() + 1 - fromSemantics.precision;
+    if (exponent + exponentChange < toSemantics.minExponent)
+      exponentChange = toSemantics.minExponent - exponent;
+    if (exponentChange < shift)
+      exponentChange = shift;
+    if (exponentChange < 0) {
+      shift -= exponentChange;
+      exponent += exponentChange;
+    }
+  }
+
   // If this is a truncation, perform the shift before we narrow the storage.
   if (shift < 0 && (isFiniteNonZero() || category==fcNaN))
     lostFraction = shiftRight(significandParts(), oldPartCount, -shift);
llvm-svn: 186409	2013-07-16 13:03:25 +00:00
Richard Osborne	ab29d19536	[XCore] Fix printing of inline asm operands. Previously an asm operand with no operand modifier would give the error "invalid operand in inline asm". llvm-svn: 186407	2013-07-16 12:48:34 +00:00
Tim Northover	069f95f926	ARM: allow printing of ARM atomic DAG nodes. We'd forgotten to provide string representations for the special ARMISD atomic nodes; this adds them in. No effect on CodeGen, just makes the output of "-view-whatever-dags" slightly more readable. llvm-svn: 186406	2013-07-16 12:15:36 +00:00
Richard Sandiford	885140c951	[SystemZ] Use ROSBG and non-zero form of RISBG for OR nodes llvm-svn: 186405	2013-07-16 11:55:57 +00:00
Vladimir Medic	a73970b662	Fixing a buildbot failure: unused function. llvm-svn: 186403	2013-07-16 11:43:20 +00:00
Richard Sandiford	35bb463fb1	[SystemZ] Add MC support for R[NOX]SBG CodeGen support will come later. llvm-svn: 186401	2013-07-16 11:28:08 +00:00
Richard Sandiford	82ec87dbdb	[SystemZ] Use RISBG for (shift (and ...)) Another patch in the series to make more use of R.SBG. This one extends r186072 and r186073 to handle cases where the AND is inside the shift. llvm-svn: 186399	2013-07-16 11:02:24 +00:00
Vladimir Medic	64828a1f73	This patch represents Mips utilization of r186388 code that allows asm matcher to emit mnemonics containing '.' characters. This makes asm parser code simpler and more efficient. llvm-svn: 186397	2013-07-16 10:07:14 +00:00
NAKAMURA Takumi	37ce985739	PPCJITInfo.cpp: Tweak r186252 with s/__ppc/__powerpc/ to work on powerpc-linux Fedora 12. g++ (GCC) 4.4.4 20100630 (Red Hat 4.4.4-10) llvm-svn: 186396	2013-07-16 09:59:51 +00:00
Tim Northover	a7ecd241d2	ARM: implement ldrex, strex and clrex intrinsics Intrinsics already existed for the 64-bit variants, so these support operations of size at most 32-bits. llvm-svn: 186392	2013-07-16 09:46:55 +00:00
Renato Golin	8761069e22	ARM EABI divmod support This patch enables calls to __aeabi_idivmod when in EABI mode, by using the remainder value returned on registers (R1), enabled by the ARM triple "none-eabi". Note that Darwin and GNUEABI triples will continue lowering on GNU style, that is, using the stack for the remainder. Still need to add SREM/UREM support fix for 64-bit lowering. llvm-svn: 186390	2013-07-16 09:32:17 +00:00
Rafael Espindola	77021c9487	Add a version of sys::fs::status that uses fstat. llvm-svn: 186378	2013-07-16 03:20:13 +00:00
Rafael Espindola	9da91a0e03	Instead of friending status, provide windows and posix constructors to file_status. This opens the way of having static helpers in the .inc files that can construct a file_status. llvm-svn: 186376	2013-07-16 02:55:33 +00:00
Craig Topper	d3a34f81f8	Add 'const' qualifiers to static const char* variables. llvm-svn: 186371	2013-07-16 01:17:10 +00:00
Manman Ren	b827123cf7	PEI: Support for non-zero SPAdj at beginning of a basic block. We can have a FrameSetup in one basic block and the matching FrameDestroy in a different basic block when we have struct byval. In that case, SPAdj is not zero at beginning of the basic block. Modify PEI to correctly set SPAdj at beginning of each basic block using DFS traversal. We used to assume SPAdj is 0 at beginning of each basic block. PEI had an assert SPAdjCount || SPAdj == 0. If we have a Destroy <n> followed by a Setup <m>, PEI will assert failure. We can add an extra condition to make sure the pairs are matched: The pairs start with a FrameSetup. But since we are doing a much better job in the verifier, this patch removes the check in PEI. PR16393 llvm-svn: 186364	2013-07-15 23:47:29 +00:00
Nadav Rotem	1c1d6c1666	PR16628: Fix a bug in the code that merges compares. Compares return i1 but they compare different types. llvm-svn: 186359	2013-07-15 22:52:48 +00:00
Hal Finkel	a0014a5a26	PPC: Refactoring to support subtarget feature changing This change mirrors the changes that were made to the X86 and ARM targets to support subtarget feature changing. As indicated in r182899, the mechanism is still undergoing revision, and so as with the X86 and ARM targets, there is no test case yet (there is no effective functionality change). llvm-svn: 186357	2013-07-15 22:29:40 +00:00
Manman Ren	aa6875b1f9	Machine Verifier: verify FrameSetup and FrameDestroy 1> on every path through the CFG, a FrameSetup <n> is always followed by a FrameDestroy <n> and a FrameDestroy is always followed by a FrameSetup. 2> stack adjustments are identical on all CFG edges to a merge point. 3> frame is destroyed at end of a return block. PR16393 llvm-svn: 186350	2013-07-15 21:26:31 +00:00
Rafael Espindola	8ea26d6a80	Remove an extra is_directory call. I checked that opening a directory on windows does fail, so this saves a "stat". llvm-svn: 186345	2013-07-15 20:52:01 +00:00
Hal Finkel	8e8618ae5c	Fix register subclass handling in PPCInstrInfo::insertSelect PPCInstrInfo::insertSelect and PPCInstrInfo::canInsertSelect were computing the common subclass of the true and false inputs, and then selecting either the 32-bit or the 64-bit isel variant based on the result of calling PPC::GPRCRegClass.hasSubClassEq(RC) and PPC::G8RCRegClass.hasSubClassEq(RC) (where RC is the common subclass). Unfortunately, this is not quite right: if we have something like this: %vreg8<def> = SELECT_CC_I8 %vreg4<kill>, %vreg7<kill>, %vreg6<kill>, 76; G8RC_and_G8RC_NOX0:%vreg8 CRRC:%vreg4 G8RC_NOX0:%vreg7,%vreg6 then the common subclass of G8RC_and_G8RC_NOX0 and G8RC_NOX0 is G8RC_NOX0, and G8RC_NOX0 is not a subclass of G8RC (because it also contains the ZERO8 pseudo-register). As a result, we also need to check the common subclass against GPRC_NOR0 and G8RC_NOX0 explicitly. This had not been a problem for clients of insertSelect that called canInsertSelect first (because it had a compensating mistake), but insertSelect is also used by the PPC pseudo-instruction expander, and this error was causing a problem in that context. This problem was found by csmith. llvm-svn: 186343	2013-07-15 20:22:58 +00:00
Reid Kleckner	dae7b4e4d1	[mc-coff] Resolve aliases when emitting COFF relocations This is consistent with the ELF object writer. Add some COFF tests that relocate against an alias. Reviewers: espindola Differential Revision: http://llvm-reviews.chandlerc.com/D1079 llvm-svn: 186341	2013-07-15 19:41:21 +00:00
Tom Stellard	31209cc8eb	R600/SI: Add support for 64-bit loads https://bugs.freedesktop.org/show_bug.cgi?id=65873 llvm-svn: 186339	2013-07-15 19:00:09 +00:00
Hal Finkel	2f5e8e3d95	Remove invalid assert in DAGTypeLegalizer::RemapValue There is a comment at the top of DAGTypeLegalizer::PerformExpensiveChecks which, in part, says: // Note that these invariants may not hold momentarily when processing a node: // the node being processed may be put in a map before being marked Processed. Unfortunately, this assert would be valid only if the above-mentioned invariant held unconditionally. This was causing llc to assert when, in fact, everything was fine. Thanks to Richard Sandiford for investigating this issue! Fixes PR16562. llvm-svn: 186338	2013-07-15 18:57:05 +00:00
Stephen Lin	837bba1c51	Remove trailing whitespace llvm-svn: 186333	2013-07-15 17:55:02 +00:00
Chandler Carruth	e3899f2c2c	Revert r186316 while I track down an ASan failure and an assert from a bot. This reverts the commit which introduced a new implementation of the fancy SROA pass designed to reduce its overhead. I'll skip the huge commit log here, refer to r186316 if you're looking for how this all works and why it works that way. llvm-svn: 186332	2013-07-15 17:36:21 +00:00
Reid Kleckner	a75eba9c05	Revert "[Option] Store arg strings in a set backed by a BumpPtrAllocator" This broke clang's crash-report.c test, and I haven't been able to figure it out yet. This reverts commit r186319. llvm-svn: 186329	2013-07-15 16:40:52 +00:00
Job Noorman	a928e1d7a1	Test commit to see if write access works. llvm-svn: 186321	2013-07-15 14:25:26 +00:00
Reid Kleckner	cacb40c6c7	[Option] Store arg strings in a set backed by a BumpPtrAllocator No functionality change. This is preparing to move response file parsing into lib/Option so it can be shared between clang and lld. This change isn't just a micro-optimization. Clang's driver uses a std::set<std::string> to unique arguments while parsing response files, so this matches that. llvm-svn: 186319	2013-07-15 13:46:24 +00:00
Chandler Carruth	e74ff4c643	Reimplement SROA yet again. Same fundamental principle, but a totally different core implementation strategy. Previously, SROA would build a relatively elaborate partitioning of an alloca, associate uses with each partition, and then rewrite the uses of each partition in an attempt to break apart the alloca into chunks that could be promoted. This was very wasteful in terms of memory and compile time because regardless of how complex the alloca or how much we're able to do in breaking it up, all of the datastructure work to analyze the partitioning was done up front. The new implementation attempts to form partitions of the alloca lazily and on the fly, rewriting the uses that make up that partition as it goes. This has a few significant effects: 1) Much simpler data structures are used throughout. 2) No more double walk of the recursive use graph of the alloca, only walk it once. 3) No more complex algorithms for associating a particular use with a particular partition. 4) PHI and Select speculation is simplified and happens lazily. 5) More precise information is available about a specific use of the alloca, removing the need for some side datastructures. Ultimately, I think this is a much better implementation. It removes about 300 lines of code, but arguably removes more like 500 considering that some code grew in the process of being factored apart and cleaned up for this all to work. I've re-used as much of the old implementation as possible, which includes the lion's share of code in the form of the rewriting logic. The interesting new logic centers around how the uses of a partition are sorted, and split into actual partitions. Each instruction using a pointer derived from the alloca gets a 'Partition' entry. This name is totally wrong, but I'll do a rename in a follow-up commit as there is already enough churn here. The entry describes the offset range accessed and the nature of the access. 
Once we have all of these entries we sort them in a very specific way: increasing order of begin offset, followed by whether they are splittable uses (memcpy, etc), followed by the end offset or whatever. Sorting by splittability is important as it simplifies the collection of uses into a partition. Once we have these uses sorted, we walk from the beginning to the end building up a range of uses that form a partition of the alloca. Overlapping unsplittable uses are merged into a single partition while splittable uses are broken apart and carried from one partition to the next. A partition is also introduced to bridge splittable uses between the unsplittable regions when necessary. I've looked at the performance PRs fairly closely. PR15471 no longer will even load (the module is invalid). Not sure what is up there. PR15412 improves by between 5% and 10%, however it is nearly impossible to know what is holding it up as SROA (the entire pass) takes less time than reading the IR for that test case. The analysis takes the same time as running mem2reg on the final allocas. I suspect (without much evidence) that the new implementation will scale much better however, and it is just the small nature of the test cases that makes the changes small and noisy. Either way, it is still simpler and cleaner I think. llvm-svn: 186316	2013-07-15 10:30:19 +00:00
Alexey Samsonov	1a98450469	DebugInfo: Factor out parsing compile unit DIEs to a separate function. Improve code style and comments. No functionality change. llvm-svn: 186315	2013-07-15 08:43:35 +00:00
Craig Topper	06b3b6651e	Add 'const' qualifier to some arrays. llvm-svn: 186312	2013-07-15 08:02:13 +00:00
Craig Topper	e952ad0bc1	Make some arrays 'static const' llvm-svn: 186311	2013-07-15 07:22:00 +00:00
Craig Topper	f18edae094	Add include to hopefully fix windows build. llvm-svn: 186310	2013-07-15 07:15:05 +00:00
Craig Topper	de1f151115	Add const qualifier to some static arrays. llvm-svn: 186309	2013-07-15 07:02:45 +00:00
Craig Topper	202fbc2c9b	Add 'static' keyword to some const arrays for consistency. llvm-svn: 186308	2013-07-15 06:54:12 +00:00
Craig Topper	0afd0ab749	Make some arrays 'static const' llvm-svn: 186307	2013-07-15 06:39:13 +00:00
Craig Topper	26b45c27f1	Revert part of 186302 to fix buildbots. llvm-svn: 186303	2013-07-15 04:37:54 +00:00
Craig Topper	5871321e49	Use llvm::array_lengthof to replace sizeof(array)/sizeof(array[0]). llvm-svn: 186301	2013-07-15 04:27:47 +00:00
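For context on the array_lengthof change above, the two idioms can be compared in a small sketch. The `sketch::array_lengthof` template below is a stand-in written for illustration (it is assumed to mirror the shape of the LLVM helper, not copied from it), and `Table` is a made-up array.

```cpp
#include <cassert>
#include <cstddef>

namespace sketch {
// Deduces N from a reference to a built-in array. Unlike the sizeof
// idiom, this fails to compile when handed a pointer by mistake.
template <typename T, std::size_t N>
constexpr std::size_t array_lengthof(T (&)[N]) {
  return N;
}
} // namespace sketch

// Hypothetical table used only to compare the two idioms.
static const int Table[] = {1, 2, 3, 5, 8};

static_assert(sketch::array_lengthof(Table) ==
                  sizeof(Table) / sizeof(Table[0]),
              "both idioms agree on a real array");
```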
Eric Christopher	7980b957cc	Clarify comments. llvm-svn: 186297	2013-07-14 22:23:54 +00:00
Eric Christopher	8e46e7f04b	Add DW_AT_GNU_odr_signature to the set of dwarf attributes. llvm-svn: 186296	2013-07-14 22:02:31 +00:00
Eric Christopher	666dc635c7	Collapse temporary variable into call. llvm-svn: 186295	2013-07-14 21:46:51 +00:00
Anton Korobeynikov	5714237ca5	Use conventional syntax for branches. Patch by Job! llvm-svn: 186291	2013-07-14 18:19:44 +00:00
Anton Korobeynikov	fee796d734	Properly lower jump tables on MSP430. Patch by Job Noorman! llvm-svn: 186283	2013-07-14 15:11:00 +00:00
Nadav Rotem	d9f3f4548e	SLPVectorizer: change the order in which we search for vectorization candidates. Do stores first and PHIs second. llvm-svn: 186277	2013-07-14 06:15:46 +00:00
Tobias Grosser	84f34be98e	Fix build by replacing '>>' with '> >' llvm-svn: 186276	2013-07-14 06:12:01 +00:00
Craig Topper	b94011fd28	Use SmallVectorImpl& instead of SmallVector to avoid repeating small vector size. llvm-svn: 186274	2013-07-14 04:42:23 +00:00
Andrew Trick	aa8ceba833	Remove a bunch of old SCEVExpander FIXME's for preserving NoWrap. The great thing about the SCEVAddRec No-Wrap flag (unlike nsw/nuw) is that it can be preserved while normalizing (reassociating and factoring). The bad thing is that it can't be transferred back to IR, which is one of the reasons I don't like the concept of SCEVExpander. Sorry, I can't think of a direct way to test this, which is why these were FIXMEs for so long. I just think it's a good time to finally clean it up. llvm-svn: 186273	2013-07-14 03:10:08 +00:00
Andrew Trick	8eaae28693	Teach indvars to generate nsw/nuw flags when widening an induction variable. Fixes PR16600. llvm-svn: 186272	2013-07-14 02:50:07 +00:00
Arnold Schwaighofer	a92eeebde8	LoopVectorizer: Disallow reductions whose header phi is used outside the loop If an outside loop user of the reduction value uses the header phi node we cannot just reduce the vectorized phi value in the vector code epilog because we would lose VF-1 reductions. lp: p = phi (0, lv) lv = lv + 1 ... brcond , lp, outside outside: usr = add 0, p (Say the loop iterates two times, the value of p coming out of the loop is one). We cannot just transform this to: vlp: p = phi (<0,0>, lv) lv = lv + <1,1> .. brcond , lp, outside outside: p_reduced = p[0] + p[1]; usr = add 0, p_reduced (Because the original loop iterated two times the vectorized loop would iterate one time, but p_reduced ends up being zero instead of one). We would have to execute VF-1 iterations in the scalar remainder loop in such cases. For now, just disable vectorization. PR16522 llvm-svn: 186256	2013-07-13 19:09:29 +00:00
Joerg Sonnenberger	8e01ae895d	Reduce large list of macros to the primary platform macros. Distinguish between ELF (Linux, FreeBSD, NetBSD) and OSX as platform for the assembler dialect. llvm-svn: 186252	2013-07-13 17:59:55 +00:00
Craig Topper	e0b711864c	Pass SmallVector by const reference instead of by value. llvm-svn: 186243	2013-07-13 07:43:40 +00:00
Andrew Trick	0ae8c94f8f	LoopVectorize fix: LoopInfo must be valid when invoking utils like SCEVExpander. In general, one should always complete CFG modifications first, update CFG-based analyses, like Dominators and LoopInfo, then generate instruction sequences. LoopVectorizer was creating a new loop, calling SCEVExpander to generate checks, then updating LoopInfo. I just changed the order. llvm-svn: 186241	2013-07-13 06:20:06 +00:00
Nick Lewycky	7459be6dc7	Add a microoptimization for urem. llvm-svn: 186235	2013-07-13 01:16:47 +00:00
Chandler Carruth	86e60a36b5	Revert commit r186217 -- this is breaking bots: http://lab.llvm.org:8013/builders/clang-x86_64-darwin11-nobootstrap-RAincremental/builds/4328 Original commit log: Use the function attributes to pass along the stack protector buffer size. llvm-svn: 186234	2013-07-13 01:00:17 +00:00
Nick Lewycky	35aeea993b	Fix logic error optimizing "icmp pred (urem X, Y), Y" where pred is signed. Fixes PR16605. llvm-svn: 186229	2013-07-12 23:42:57 +00:00
Akira Hatanaka	66bc419366	[mips] Implement MipsTargetMachine::getInstrItineraryData(). llvm-svn: 186227	2013-07-12 23:33:22 +00:00
JF Bastien	583db65031	Fix ARM paired GPR COPY lowering ARM paired GPR COPY was being lowered to two MOVr without CC. This patch puts the CC back. My test is a reduction of the case where I encountered the issue, 64-bit atomics use paired GPRs. The issue only occurs with selectionDAG, FastISel doesn't encounter it so I didn't bother calling it. llvm-svn: 186226	2013-07-12 23:33:03 +00:00
Joey Gouly	a3250f22c2	Fix a crash in EvaluateInDifferentElementOrder where it would generate an undef vector of the wrong type. LGTM'd by Nick Lewycky on IRC. llvm-svn: 186224	2013-07-12 23:08:06 +00:00
Akira Hatanaka	1baf2ea2d1	[mips] Add instruction itinerary classes for mult, seb and slt instructions. llvm-svn: 186222	2013-07-12 22:43:20 +00:00
Bill Wendling	4f73ff4711	Use the function attributes to pass along the stack protector buffer size. Now that we have robust function attributes, don't use a command line option to specify the stack protector buffer size. llvm-svn: 186217	2013-07-12 22:25:20 +00:00
Andrew Trick	a1e4118a46	LFTR improvement to avoid truncation. This is a reimplementation of the patch originally in r186107. llvm-svn: 186215	2013-07-12 22:08:48 +00:00
Andrew Trick	2b71848ffe	Cleanup LFTR logic. llvm-svn: 186214	2013-07-12 22:08:44 +00:00
Andrew Trick	466555e50d	Cleanup: rename a variable to make the logic easier to follow. llvm-svn: 186213	2013-07-12 22:08:41 +00:00
Eric Christopher	3931cf94ac	Remove extraneous braces. llvm-svn: 186212	2013-07-12 22:08:24 +00:00
Rafael Espindola	3e2b21cd4d	Change llvm-ar to use lib/Object. This fixes two bugs in lib/Object that the use in llvm-ar found: * In OS X created archives, the name can be padded with nulls. Strip them. * In the constructor, remember the first non special member and use that in begin_children. This makes sure we skip all special members, not just the first one. The change to llvm-ar itself consists of * Using lib/Object for reading archives instead of ArchiveReader.cpp. * Writing the modified archive directly, instead of creating an in memory representation. The old Archive library was way more general than what is needed, as can be seen by the diffstat of this patch. Having llvm-ar using lib/Object now opens the way for creating regular symbol tables for both native objects and bitcode files so that we can use those archives for LTO. llvm-svn: 186197	2013-07-12 20:21:39 +00:00
Benjamin Kramer	c22c790f89	R600: Remove unsafe type punning. No intended functionality change. llvm-svn: 186196	2013-07-12 20:18:05 +00:00
Arnold Schwaighofer	6042a261b8	X86 cost model: Add cost for vectorized gather/scatter radar://14351991 llvm-svn: 186189	2013-07-12 19:16:07 +00:00
Arnold Schwaighofer	da2b311865	ARM cost model: Add cost for gather/scatter Fixes a 35% degradation compared to unvectorized code in MiBench/automotive-susan and an equally serious regression on a private image processing benchmark. radar://14351991 llvm-svn: 186188	2013-07-12 19:16:04 +00:00
Arnold Schwaighofer	9da9a43af8	TargetTransformInfo: address calculation parameter for gather/scatter Address calculation for gather/scatter in vectorized code can incur a significant cost making vectorization unbeneficial. Add infrastructure to add cost. Tests and cost model for targets will be in follow-up commits. radar://14351991 llvm-svn: 186187	2013-07-12 19:16:02 +00:00
Tom Stellard	ccae60acc3	R600/SI: Add support for f64 kernel arguments Patch by: Niels Ole Salscheider Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186182	2013-07-12 18:15:26 +00:00
Tom Stellard	4e1100ab75	R600/SI: Implement select and compares for SI Patch by: Niels Ole Salscheider Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186181	2013-07-12 18:15:19 +00:00
Tom Stellard	8ed7b45da3	R600/SI: Add fsqrt pattern for SI Patch by: Niels Ole Salscheider Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186180	2013-07-12 18:15:13 +00:00
Tom Stellard	2a6a610516	R600/SI: Add double precision fsub pattern for SI Patch by: Niels Ole Salscheider Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186179	2013-07-12 18:15:08 +00:00
Tom Stellard	ab8a8c84d4	R600/SI: SI support for 64bit ConstantFP Patch by: Niels Ole Salscheider Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186178	2013-07-12 18:15:02 +00:00
Tom Stellard	7512c0803c	R600/SI: Add initial double precision support for SI Patch by: Niels Ole Salscheider Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186177	2013-07-12 18:14:56 +00:00
Benjamin Kramer	068a2253e9	X86: Shrink certain forms of movsx. In particular: movsbw %al, %ax --> cbtw movswl %ax, %eax --> cwtl movslq %eax, %rax --> cltq According to Intel's manual those have the same performance characteristics but come with a smaller encoding. llvm-svn: 186174	2013-07-12 18:06:44 +00:00
Stephen Lin	fda967fdea	X86: fold SSE2/AVX2 logical shift by immediate amount into zero vector when possible Patch by Andrea Di Biagio llvm-svn: 186165	2013-07-12 15:31:36 +00:00
Rafael Espindola	f0c617264a	Don't reject an empty archive. llvm-svn: 186159	2013-07-12 13:32:28 +00:00
Chandler Carruth	cf3715cadd	Revert "indvars: Improve LFTR by eliminating truncation when comparing against a constant." This reverts commit r186107. It didn't handle wrapping arithmetic in the loop correctly and thus caused the following C program to count from 0 to UINT64_MAX instead of from 0 to 255 as intended: #include <stdio.h> int main() { unsigned char first = 0, last = 255; do { printf("%d\n", first); } while (first++ != last); } Full test case and instructions to reproduce with just the -indvars pass sent to the original review thread rather than to r186107's commit. llvm-svn: 186152	2013-07-12 11:18:55 +00:00
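The loop quoted in the revert message above depends on 8-bit wrapping to terminate. A minimal sketch of the intended behaviour follows, with the printf replaced by a counter; `countIterations` is a hypothetical helper name, and the sketch shows only what the source program should do, not the miscompile itself.

```cpp
#include <cassert>

// Runs the do/while loop from the message and returns how many times the
// body executed. The exit test compares the pre-increment value of `first`
// against `last`, so the body runs for 0, 1, ..., 255 and then stops.
inline unsigned countIterations() {
  unsigned char first = 0, last = 255;
  unsigned iterations = 0;
  do {
    ++iterations; // stands in for printf("%d\n", first)
  } while (first++ != last);
  return iterations;
}
```

The final `first++` wraps the 8-bit counter from 255 back to 0, which is the wrapping arithmetic the message says the reverted transformation did not handle.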
Vladimir Medic	bcf1ca08e0	Add support for Mips break and syscall instructions. The corresponding test cases are added. llvm-svn: 186151	2013-07-12 09:25:35 +00:00
Richard Sandiford	6d4bd28322	[SystemZ] Optimize sign-extends of vector setccs Normal (sext (setcc ...)) sequences are optimised into (select_cc ..., -1, 0) by DAGCombiner::visitSIGN_EXTEND. However, this is deliberately not done for vectors, and after vector type legalization we have (sext_inreg (setcc ...)) instead. I wondered about trying to extend DAGCombiner to handle this case too, but it seemed to be a loss on some other targets I tried, even those for which SETCC isn't "legal" and SELECT_CC is. llvm-svn: 186149	2013-07-12 09:17:10 +00:00
Richard Sandiford	b820405b59	[SystemZ] Fix parsing of inline asm registers GPR and FPR constraints like "{r2}" and "{f2}" weren't handled correctly because the name-to-regno mapping depends on the value type and (because of that) the internal names in RegStrings are not the same as the AsmName. CC constraints like "{cc}" didn't work either because there was no associated register class. llvm-svn: 186148	2013-07-12 09:08:12 +00:00
Richard Sandiford	3f0edc2903	[SystemZ] Improve spilling of LGDR and LDGR If the source of these instructions is spilled we should load the destination. If the destination is spilled we should store the source. llvm-svn: 186147	2013-07-12 08:37:17 +00:00
Shuxin Yang	23773b34c6	Stylistic change. Thank Nick for figuring out these problems. llvm-svn: 186146	2013-07-12 07:25:38 +00:00
Nadav Rotem	89c41bf06a	SLPVectorizer: Sink and enable CSE for ExtractElements. llvm-svn: 186145	2013-07-12 06:09:24 +00:00
Charles Davis	e8f297ca94	Target/X86: Add explicit Win64 and System V/x86-64 calling conventions. Summary: This patch adds explicit calling convention types for the Win64 and System V/x86-64 ABIs. This allows code to override the default, and use the Win64 convention on a target that wants to use SysV (and vice-versa). This is needed to implement the `ms_abi` and `sysv_abi` GNU attributes. Reviewers: CC: llvm-svn: 186144	2013-07-12 06:02:35 +00:00
NAKAMURA Takumi	40bd28a7df	Windows/TimeValue.inc: Mute prefixed '0' on %d to emulate %e. It fixes compatibility in llvm/test/Object/archive-toc.test. llvm-svn: 186142	2013-07-12 02:13:03 +00:00
Manman Ren	30d6865a23	PEI: refactor replaceFrameIndices(MF) to call replaceFrameIndices(BB). replaceFrameIndices(MF) will iterate over the BBs and call replaceFrameIndices(BB). No functionality change. llvm-svn: 186141	2013-07-12 00:37:01 +00:00
Nadav Rotem	fa3c2db211	SLPVectorize: Replace the code that checks for vectorization candidates in successor blocks with code that scans PHINodes. Before we could vectorize PHINodes scanning successors was a good way of finding candidates. Now we can vectorize the phinodes which is simpler. llvm-svn: 186139	2013-07-12 00:04:18 +00:00
Nadav Rotem	db06b139fd	Remove an argument that we dont use anymore. llvm-svn: 186116	2013-07-11 20:56:13 +00:00
Hal Finkel	4715081787	PPC: Add some missing V_SET0 patterns We had patterns to match v4i32 immAllZerosV -> V_SET0, but not patterns for v8i16 (which occurs in the test case) or v16i8. The same was true for V_SETALLONES (so I added the associated patterns for those as well). Another bug found by llvm-stress. llvm-svn: 186108	2013-07-11 17:43:32 +00:00
Andrew Trick	3095993d6f	indvars: Improve LFTR by eliminating truncation when comparing against a constant. Patch by Michele Scandale! Adds a special handling of the case where, during the loop exit condition rewriting, the exit value is a constant of bitwidth lower than the type of the induction variable: instead of introducing a trunc operation in order to match correctly the operand types, it allows to convert the constant value to an equivalent constant, depending on the initial value of the induction variable and the trip count, in order have an equivalent comparison between the induction variable and the new constant. llvm-svn: 186107	2013-07-11 17:08:59 +00:00
Hal Finkel	ff3ea8060c	PPCDAGToDAGISel::isRunOfOnes should return false on zero This fixes a bug (found by csmith) at -O0 where we attempt to create a RLWIMI with an out-of-range operand. Most uses of the isRunOfOnes function are guarded by a condition that the value is not zero. This was not true in two places, and in both places a zero input would result in an out-of-range MB value (= 32). To fix this, isRunOfOnes returns false on a zero input (and I've removed one now-redundant guard). llvm-svn: 186101	2013-07-11 16:31:51 +00:00
Craig Topper	2cd5ff8003	Use SmallVectorImpl& instead of SmallVector to avoid repeating small vector size. llvm-svn: 186098	2013-07-11 16:22:38 +00:00
Rafael Espindola	1761824d72	Add back code for supporting old mingw versions. Should bring the bots back. llvm-svn: 186096	2013-07-11 16:11:21 +00:00
Benjamin Kramer	fc3ea6f4bc	Don't use a potentially expensive shift if all we want is one set bit. No functionality change. llvm-svn: 186095	2013-07-11 16:05:50 +00:00
Rafael Espindola	bce399216c	Looks like some versions of mingw don't have errno_t. Use int. llvm-svn: 186092	2013-07-11 15:47:04 +00:00
Benjamin Kramer	cce97be70b	Use move semantics if possible to construct ConstantRanges. Arithmetic on ConstantRanges creates a lot of large temporary APInts that benefit from move semantics. llvm-svn: 186091	2013-07-11 15:37:27 +00:00
Rafael Espindola	b1c1c5f377	Fix a FIXME about the format and add a test. While at it, use strftime on Unix too and use the thread safe versions of localtime. llvm-svn: 186090	2013-07-11 15:35:23 +00:00
Arnold Schwaighofer	e97c71b8fd	LoopVectorize: Vectorize all accesses in address space zero with unit stride We can vectorize them because in the case where we wrap in the address space the unvectorized code would have had to access a pointer value of zero which is undefined behavior in address space zero according to the LLVM IR semantics. (Thank you Duncan, for pointing this out to me). Fixes PR16592. llvm-svn: 186088	2013-07-11 15:21:55 +00:00
Benjamin Kramer	741146b825	Reduce the number of indirections in the attributes implementation. - Coallocate entries for AttributeSetImpls and Nodes after the class itself. - Remove mutable iterators from immutable classes. - Remove unused context field from AttributeImpl. - Derive Enum/Align/String attribute implementations from AttributeImpl instead of having a whole new inheritance tree for them. - Derive AlignAttributeImpl from EnumAttributeImpl. llvm-svn: 186075	2013-07-11 12:13:16 +00:00
Richard Sandiford	ea9b6aa20b	[SystemZ] Use zeroing form of RISBG for shift-and-AND sequences Extend r186072 to handle shifts and ANDs. llvm-svn: 186073	2013-07-11 09:10:09 +00:00
Richard Sandiford	84f54a3bc9	[SystemZ] Use zeroing form of RISBG for some AND sequences RISBG can handle some ANDs for which no AND IMMEDIATE exists. It also acts as a three-operand AND for some cases where an AND IMMEDIATE could be used instead. It might be worth adding a pass to replace RISBG with AND IMMEDIATE in cases where the register operands end up being the same and where AND IMMEDIATE is smaller. llvm-svn: 186072	2013-07-11 08:59:12 +00:00
Richard Sandiford	67ddcd6dd0	[SystemZ] Allow 8-bit operands to RISBG RISBG has three 8-bit operands (I3, I4 and I5). I'd originally restricted all three to 6 bits, since that's the only range we intended to use at the time. However, the top bit of I4 acts as a "zero" flag for RISBG, while the top bit of I3 acts as a "test" flag for RNSBG & co. This patch therefore allows them to have the full 8-bit range. I've left the fifth operand as a 6-bit value for now since the upper 2 bits have no defined meaning. llvm-svn: 186070	2013-07-11 08:37:13 +00:00
Duncan Sands	e773c08021	TryToSimplifyUncondBranchFromEmptyBlock was checking that any common predecessors of the two blocks it is attempting to merge supply the same incoming values to any phi in the successor block. This change allows merging in the case where there is one or more incoming values that are undef. The undef values are rewritten to match the non-undef value that flows from the other edge. Patch by Mark Lacey. llvm-svn: 186069	2013-07-11 08:28:20 +00:00
Hal Finkel	6161b9405f	Initialize AsmPrinter::MF in the constructor MF is normally initialized in AsmPrinter::SetupMachineFunction, but if the file contains only globals (no functions), then we need this to be initialized because, when encountering an error, lowerConstant() references it. This should fix the non-deterministic failures of test/CodeGen/X86/nonconst-static-iv.ll, etc. llvm-svn: 186068	2013-07-11 06:41:14 +00:00
Hal Finkel	743b194084	RegScavenger should not exclude undef uses When computing currently-live registers, the register scavenger excludes undef uses. As a result, undef uses are ignored when computing the restore points of registers spilled into the emergency slots. While the register scavenger normally excludes from consideration, when scavenging, registers used by the current instruction, we need to not exclude undef uses. Otherwise, we might end up requiring more emergency spill slots than we have (in the case where the undef use is the currently-spilled register). Another bug found by llvm-stress. llvm-svn: 186067	2013-07-11 05:55:57 +00:00
Craig Topper	37039640e3	Fix indentation. No functional change. llvm-svn: 186065	2013-07-11 05:39:44 +00:00
Nadav Rotem	08efb262a9	Fix a warning. llvm-svn: 186064	2013-07-11 05:39:02 +00:00
Nadav Rotem	b8dd66f655	SLPVectorizer: refactor the code that places extracts. Place the code that decides where to put extracts in the build-tree phase. This allows us to take the cost of the extracts into account. llvm-svn: 186058	2013-07-11 04:54:05 +00:00
Michael Gottesman	b40db26eae	Teach TailRecursionElimination to handle certain cases of nocapture escaping allocas. Without the changes introduced into this patch, if TRE saw any allocas at all, TRE would not perform TRE or mark callsites with the tail marker. Because TRE runs after mem2reg, this inadequacy is not a death sentence. But given a callsite A without escaping alloca argument, A may not be able to have the tail marker placed on it due to a separate callsite B having a write-back parameter passed in via an argument with the nocapture attribute. Assume that B is the only other callsite besides A and B only has nocapture escaping alloca arguments (NOTE B may have other arguments that are not passed allocas). In this case not marking A with the tail marker is unnecessarily conservative since: 1. By assumption A has no escaping alloca arguments itself so it can not access the caller's stack via its arguments. 2. Since all of B's escaping alloca arguments are passed as parameters with the nocapture attribute, we know that B does not stash said escaping allocas in a manner that outlives B itself and thus could be accessed indirectly by A. With the changes introduced by this patch: 1. If we see any escaping allocas passed as a capturing argument, we do nothing and bail early. 2. If we do not see any escaping allocas passed as captured arguments but we do see escaping allocas passed as nocapture arguments: i. We do not perform TRE to avoid PR962 since the code generator produces significantly worse code for the dynamic allocas that would be created by the TRE algorithm. ii. If we do not return twice, mark call sites without escaping allocas with the tail marker. NOTE This excludes functions with escaping nocapture allocas. 3. If we do not see any escaping allocas at all (whether captured or not): i. If we do not have usage of setjmp, mark all callsites with the tail marker. ii. 
If there are no dynamic/variable sized allocas in the function, attempt to perform TRE on all callsites in the function. Based off of a patch by Nick Lewycky. rdar://14324281. llvm-svn: 186057	2013-07-11 04:40:01 +00:00
Hal Finkel	b31366da82	Don't assert if we can't constant fold extract/insertvalue A non-constant-foldable static initializer expression containing insertvalue or extractvalue had been causing an assert: Constants.cpp:1971: Assertion `FC && "ExtractValue constant expr couldn't be folded!"' failed. Now we report a more-sensible "Unsupported expression in static initializer" error instead. Fixes PR15417. llvm-svn: 186044	2013-07-10 22:51:01 +00:00
Rafael Espindola	555099207b	Find the symbol table on archives created on OS X. llvm-svn: 186041	2013-07-10 22:07:59 +00:00
Tim Northover	a630fb0b67	Put ELF COMDAT relocations into the relevant COMDAT group. Patch from Игорь Пашев (I do hope we support utf-8 commit messages; I also hope he'll forgive me for transliterating it as Igor Pashev in case things go horribly wrong). llvm-svn: 186034	2013-07-10 20:58:17 +00:00
Stephen Lin	10947502e5	Remove trailing whitespace llvm-svn: 186032	2013-07-10 20:47:39 +00:00
Rafael Espindola	fbcafc0793	Don't crash in 'llvm -s' when an archive has no symtab. llvm-svn: 186029	2013-07-10 20:14:22 +00:00
Michael Gottesman	6eb95dc2f7	[objc-arc] Changed 'mode: c++' => 'C++' at Nick Lewycky's suggestion. Also removed unnecessary mode: c++ lines from .cpp files. llvm-svn: 186026	2013-07-10 18:49:00 +00:00
Rafael Espindola	fc3876118d	MemoryBuffer::getFile handles zero sized files, no need to duplicate the test. llvm-svn: 186018	2013-07-10 17:30:39 +00:00
Aaron Ballman	f04bbd8b7f	Replacing an empty switch with its moral equivalent. No functional changes intended. llvm-svn: 186017	2013-07-10 17:19:22 +00:00
Rafael Espindola	4d08d8bada	Use status to implement file_size. The status function is already using a syscall that returns the file size. Remember it and implement file_size as a simple wrapper. No functionality change, but clients that already use status now can avoid calling file_size. llvm-svn: 186016	2013-07-10 17:16:40 +00:00
Adrian Prantl	d3f6fe51ab	Use the appropriate unsigned int type for the offset. llvm-svn: 186015	2013-07-10 16:56:52 +00:00
Adrian Prantl	c31ec1c948	Safeguard DBG_VALUE handling. Unbreaks the ASAN buildbot. llvm-svn: 186014	2013-07-10 16:56:47 +00:00
Craig Topper	9ae4707868	Simplify code. llvm-svn: 186013	2013-07-10 16:38:35 +00:00
Michel Danzer	49812b5bbd	R600/SI: Initial local memory support Enough for the radeonsi driver to use it for calculating derivatives. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186012	2013-07-10 16:37:07 +00:00
Michel Danzer	1f87df365f	R600/SI: Add pattern for the AMDGPU.barrier.local intrinsic lit test coverage to follow in the next commit. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186011	2013-07-10 16:36:57 +00:00
Michel Danzer	8d69617b27	R600/SI: Add intrinsic for retrieving the current thread ID Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186010	2013-07-10 16:36:52 +00:00
Michel Danzer	1c45430e76	R600/SI: Initial support for LDS/GDS instructions Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186009	2013-07-10 16:36:43 +00:00
Michel Danzer	83f87c4c2e	R600/SI: Add intrinsics for texture sampling with user derivatives Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186008	2013-07-10 16:36:36 +00:00
Hal Finkel	7ab3db52d3	PPC: Add a better comment about the i64 FI fixup In discussing this change with Bill Schmidt, it was decided that the original comment about negative FIs was incorrect. We'll still exclude them for now, but now with a more-accurate explanation. llvm-svn: 186005	2013-07-10 15:29:01 +00:00
Vladimir Medic	524ad0e46e	Reverting commit r185999 due to buildbot failure. llvm-svn: 186000	2013-07-10 12:26:26 +00:00
Vladimir Medic	e84de1e101	Add support for Mips break and syscall instructions. The corresponding test cases are added. llvm-svn: 185999	2013-07-10 10:18:10 +00:00
Stephen Lin	2a6447320e	Fix typo llvm-svn: 185995	2013-07-10 01:57:39 +00:00
Stephen Lin	dd502028ec	Explicitly define ARMISelLowering::isFMAFasterThanFMulAndFAdd. No functionality change. Currently ARM is the only backend that supports FMA instructions (for at least some subtargets) but does not implement this virtual, so FMAs are never generated except from explicit fma intrinsic calls. Apparently this is due to the fact that it supports both fused (one rounding step) and unfused (two rounding step) multiply + add instructions. This patch clarifies that this is the case without changing behavior by implementing the virtual function to simply return false, as the default TargetLoweringBase version does. It is possible that some cpus perform the fused version faster than the unfused version and vice-versa, so the function implementation should be revisited if hard data is found. llvm-svn: 185994	2013-07-10 01:54:24 +00:00
Adrian Prantl	a1ffd1a450	Un-break the buildbot by tweaking the indirection flag. Pulled in a testcase from the debuginfo-test suite. llvm-svn: 185993	2013-07-10 01:53:37 +00:00
Adrian Prantl	facc9f4e3e	Document a known limitation of the status quo. llvm-svn: 185992	2013-07-10 01:53:30 +00:00
Eric Christopher	93ebdd727f	Fix comment. llvm-svn: 185984	2013-07-09 23:48:45 +00:00
Jim Grosbach	ebcad2e063	ARM: Fix incorrect pack pattern for thumb2 Propagate the fix from r185712 to Thumb2 codegen as well. Original commit message applies here as well: A "pkhtb x, x, y asr #num" uses the lower 16 bits of "y asr #num" and packs them in the bottom half of "x". An arithmetic and logic shift are only equivalent in this context if the shift amount is 16. We would be shifting in ones into the bottom 16bits instead of zeros if "y" is negative. rdar://14338767 llvm-svn: 185982	2013-07-09 22:59:22 +00:00
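The pkhtb message above turns on how the bottom 16 bits of a shifted value differ between arithmetic and logical right shifts. A small sketch of that difference follows, assuming the shift amounts of interest are 16 and above; `asr`, `lowHalfAsr`, and `lowHalfLsr` are hypothetical helpers, not code from the commit.

```cpp
#include <cassert>
#include <cstdint>

// Arithmetic shift right written with unsigned operations so the result
// does not rely on implementation-defined signed shifts.
// Requires 1 <= n <= 31.
inline uint32_t asr(uint32_t y, unsigned n) {
  uint32_t logical = y >> n;
  // Fill the vacated top n bits with ones when the sign bit is set.
  uint32_t signFill = (y & 0x80000000u) ? ~(0xFFFFFFFFu >> n) : 0u;
  return logical | signFill;
}

// Bottom halfword that would be packed after each kind of shift.
inline uint32_t lowHalfAsr(uint32_t y, unsigned n) {
  return asr(y, n) & 0xFFFFu;
}
inline uint32_t lowHalfLsr(uint32_t y, unsigned n) {
  return (y >> n) & 0xFFFFu;
}
```

For a negative input such as 0x80000000, the two low halves agree at a shift of 16 but diverge at 17, because the arithmetic shift starts moving sign bits into the bottom 16 bits while the logical shift moves in zeros.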
Peter Collingbourne	49062a97cf	Implement categories for special case lists. A special case list can now specify categories for specific globals, which can be used to instruct an instrumentation pass to treat certain functions or global variables in a specific way, such as by omitting certain aspects of instrumentation while keeping others, or informing the instrumentation pass that a specific uninstrumentable function has certain semantics, thus allowing the pass to instrument callers according to those semantics. For example, AddressSanitizer now uses the "init" category instead of global-init prefixes for globals whose initializers should not be instrumented, but which in all other respects should be instrumented. The motivating use case is DataFlowSanitizer, which will have a number of different categories for uninstrumentable functions, such as "functional" which specifies that a function has pure functional semantics, or "discard" which indicates that a function's return value should not be labelled. Differential Revision: http://llvm-reviews.chandlerc.com/D1092 llvm-svn: 185978	2013-07-09 22:03:17 +00:00
Peter Collingbourne	2eb048d230	Introduce a SpecialCaseList ctor which takes a MemoryBuffer to make it more unit testable, and fix memory leak in the other ctor. Differential Revision: http://llvm-reviews.chandlerc.com/D1090 llvm-svn: 185976	2013-07-09 22:03:09 +00:00
Peter Collingbourne	015370e23a	Rename BlackList class to SpecialCaseList and move it to Transforms/Utils. Differential Revision: http://llvm-reviews.chandlerc.com/D1089 llvm-svn: 185975	2013-07-09 22:02:49 +00:00
David Majnemer	a80fed7e58	InstSimplify: X >> X -> 0 llvm-svn: 185973	2013-07-09 22:01:22 +00:00
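The InstSimplify fold above can be sanity-checked for a fixed width. This is a sketch for 32-bit unsigned values only, with a hypothetical helper name; shift amounts of 32 or more are skipped because such a shift has no defined result, which is assumed here to be what leaves the fold free to choose 0 in those cases.

```cpp
#include <cassert>
#include <cstdint>

// Returns true if x >> x == 0 for every in-range shift amount of a 32-bit
// value. For x < 32 we have x < 2^x, so shifting x right by x bits clears
// every set bit.
inline bool shiftBySelfIsZero() {
  for (uint32_t x = 0; x < 32; ++x)
    if ((x >> x) != 0u)
      return false;
  return true;
}
```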
Adrian Prantl	19942885ba	Typo. llvm-svn: 185971	2013-07-09 21:44:06 +00:00
Nadav Rotem	d7b574e5b3	Fix PR16571, which is a bug in the code that checks that all of the types in the bundle are uniform. llvm-svn: 185970	2013-07-09 21:38:08 +00:00
Adrian Prantl	418d1d1ea9	Reapply an improved version of r180816/180817. Change the informal convention of DBG_VALUE machine instructions so that we can express a register-indirect address with an offset of 0. The old convention was that a DBG_VALUE is a register-indirect value if the offset (operand 1) is nonzero. The new convention is that a DBG_VALUE is register-indirect if the first operand is a register and the second operand is an immediate. For plain register values the combination reg, reg is used. MachineInstrBuilder::BuildMI knows how to build the new DBG_VALUES. rdar://problem/13658587 llvm-svn: 185966	2013-07-09 20:28:37 +00:00
Hal Finkel	e4dd5c29f0	WidenVecRes_BUILD_VECTOR must use the first operand's type Because integer BUILD_VECTOR operands may have a larger type than the result's vector element type, and all operands must have the same type, when widening a BUILD_VECTOR node by adding UNDEFs, we cannot use the vector element type, but rather must use the type of the existing operands. Another bug found by llvm-stress. llvm-svn: 185960	2013-07-09 18:55:10 +00:00
Bill Schmidt	4122169308	[PowerPC] Better fix for PR16556. A more complete example of the bug in PR16556 was recently provided, showing that the previous fix was not sufficient. The previous fix is reverted herein. The real problem is that ReplaceNodeResults() uses LowerFP_TO_INT as custom lowering for FP_TO_SINT during type legalization, without checking whether the input type is handled by that routine. LowerFP_TO_INT requires the input to be f32 or f64, so we fail when the input is ppcf128. I'm leaving the test case from the initial fix (r185821) in place, and adding the new test as another crash-only check. llvm-svn: 185959	2013-07-09 18:50:20 +00:00
Stephen Lin	73de7bf5de	AArch64/PowerPC/SystemZ/X86: This patch fixes the interface, usage, and all in-tree implementations of TargetLoweringBase::isFMAFasterThanMulAndAdd in order to resolve the following issues with fmuladd (i.e. optional FMA) intrinsics: 1. On X86(-64) targets, ISD::FMA nodes are formed when lowering fmuladd intrinsics even if the subtarget does not support FMA instructions, leading to laughably bad code generation in some situations. 2. On AArch64 targets, ISD::FMA nodes are formed for operations on fp128, resulting in a call to a software fp128 FMA implementation. 3. On PowerPC targets, FMAs are not generated from fmuladd intrinsics on types like v2f32, v8f32, v4f64, etc., even though they promote, split, scalarize, etc. to types that support hardware FMAs. The function has also been slightly renamed for consistency and to force a merge/build conflict for any out-of-tree target implementing it. To resolve, see comments and fixed in-tree examples. llvm-svn: 185956	2013-07-09 18:16:56 +00:00
Hal Finkel	ff666bd962	Don't crash in SE dealing with ashr x, -1 ScalarEvolution::getSignedRange uses ComputeNumSignBits from ValueTracking on ashr instructions. ComputeNumSignBits can return zero, but this case was not handled correctly by the code in getSignedRange which was calling: APInt::getSignedMinValue(BitWidth).ashr(NS - 1) with NS = 0, resulting in an assertion failure in APInt::ashr. Now, we just return the conservative result (as with NS == 1). Another bug found by llvm-stress. llvm-svn: 185955	2013-07-09 18:16:16 +00:00
David Majnemer	a92b3c914e	ValueTracking: Fix bugs in isKnownToBeAPowerOfTwo (add nsw x, (and x, y)) isn't a power of two if x is zero, it's zero (add nsw x, (xor x, y)) isn't a power of two if y has bits set that aren't set in x llvm-svn: 185954	2013-07-09 18:11:10 +00:00
Nadav Rotem	861bef7dd0	Set the default insert point to the first instruction, and not to end() llvm-svn: 185953	2013-07-09 17:55:36 +00:00
Hal Finkel	6c29bd9088	DAGCombine tryFoldToZero cannot create illegal types after type legalization When folding sub x, x (and other similar constructs), where x is a vector, the result is a vector of zeros. After type legalization, make sure that the input zero elements have a legal type. This type may be larger than the result's vector element type. This was another bug found by llvm-stress. llvm-svn: 185949	2013-07-09 17:02:45 +00:00
Ulrich Weigand	52cf8e4488	[PowerPC] Revert r185476 and fix up TLS variant kinds In the commit message to r185476 I wrote: >The PowerPC-specific modifiers VK_PPC_TLSGD and VK_PPC_TLSLD >correspond exactly to the generic modifiers VK_TLSGD and VK_TLSLD. >This causes some confusion with the asm parser, since VK_PPC_TLSGD >is output as @tlsgd, which is then read back in as VK_TLSGD. > >To avoid this confusion, this patch removes the PowerPC-specific >modifiers and uses the generic modifiers throughout. (The only >drawback is that the generic modifiers are printed in upper case >while the usual convention on PowerPC is to use lower-case modifiers. >But this is just a cosmetic issue.) This was unfortunately incorrect, there is in fact another, serious drawback to using the default VK_TLSLD/VK_TLSGD variant kinds: using these causes ELFObjectWriter::RelocNeedsGOT to return true, which in turn causes the ELFObjectWriter to emit an undefined reference to _GLOBAL_OFFSET_TABLE_. This is a problem on powerpc64, because it uses the TOC instead of the GOT, and the linker does not provide _GLOBAL_OFFSET_TABLE_, so the symbol remains undefined. This means shared libraries using TLS built with the integrated assembler are currently broken. While the whole RelocNeedsGOT / _GLOBAL_OFFSET_TABLE_ situation probably ought to be properly fixed at some point, for now I'm simply reverting the r185476 commit. Now this in turn exposes the breakage of handling @tlsgd/@tlsld in the asm parser that this check-in was originally intended to fix. To avoid this regression, I'm also adding a different fix for this problem: while common code now parses @tlsgd as VK_TLSGD, a special hack in the asm parser translates this code to the platform-specific VK_PPC_TLSGD that the back-end now expects. While this is not really pretty, it's self-contained and shouldn't hurt anything else for now. Once the underlying problem is fixed, this hack can be reverted again. llvm-svn: 185945	2013-07-09 16:41:09 +00:00
Vincent Lejeune	ce499744b3	R600: Do not predicate basic blocks with multiple ALU clauses Test is not included as it is several 1000 lines long. To test this functionality, a test case must generate at least 2 ALU clauses, where an ALU clause is ~110 instructions long. NOTE: This is a candidate for the stable branch. llvm-svn: 185943	2013-07-09 15:03:33 +00:00
Vincent Lejeune	b8aac8d720	R600: Fix a rare bug where swizzle optimization returns wrong values llvm-svn: 185942	2013-07-09 15:03:25 +00:00
Vincent Lejeune	a4d8d2ef2b	R600: Fix wrong export reswizzling llvm-svn: 185941	2013-07-09 15:03:19 +00:00
Vincent Lejeune	b55940cc7d	R600: Use DAG lowering pass to handle fcos/fsin NOTE: This is a candidate for the stable branch. llvm-svn: 185940	2013-07-09 15:03:11 +00:00
Vincent Lejeune	f10d1cd2a3	R600: Print Export Swizzle llvm-svn: 185939	2013-07-09 15:03:03 +00:00
Rafael Espindola	8115e1da91	Add missing getters. They will be used in llvm-ar. llvm-svn: 185937	2013-07-09 12:49:24 +00:00
Rafael Espindola	8e9385ec63	Archive members cannot be larger than 4GB. Return a uint32_t. llvm-svn: 185936	2013-07-09 12:45:11 +00:00
Rafael Espindola	97ee9de652	Add getHeader helper and move ToHeader to the cpp file. llvm-svn: 185933	2013-07-09 12:22:05 +00:00
Joey Gouly	0f12aa2b0f	Add MC assembly/disassembly support for VRINT{A, N, P, M} to V8FP. llvm-svn: 185929	2013-07-09 11:26:18 +00:00
Joey Gouly	3b693c42b5	Add MC assembly/disassembly support for VRINT{Z, X, R} to V8FP. llvm-svn: 185926	2013-07-09 11:03:21 +00:00
Ulrich Weigand	55daa77901	[PowerPC] Support ".machine any" The PowerPC assembler is supposed to provide a directive .machine that allows switching the supported CPU instruction set on the fly. Since we do not yet check CPU feature sets at all and always accept any available instruction, this is not really useful at this point. However, it makes sense to accept (and ignore) ".machine any" to avoid spuriously rejecting existing assembler files that use this. llvm-svn: 185924	2013-07-09 10:00:34 +00:00
Alexander Potapenko	8d2d79d05f	Revert r185872 - "Stop emitting weak symbols into the "coal" sections" This patch broke `make check-asan` on Mac, causing ld warnings like the following one: ld: warning: direct access in __GLOBAL__I_a to global weak symbol ___asan_mapping_scale means the weak symbol cannot be overridden at runtime. This was likely caused by different translation units being compiled with different visibility settings. The resulting test binaries crashed with incorrect ASan warnings. llvm-svn: 185923	2013-07-09 10:00:16 +00:00
Joey Gouly	2d0175e8fb	Add MC assembly/disassembly support for VCVT{A, N, P, M} to V8FP. llvm-svn: 185922	2013-07-09 09:59:04 +00:00
Richard Sandiford	9784649157	[SystemZ] Use MVC for simple load/store pairs Look for patterns of the form (store (load ...), ...) in which the two locations are known not to partially overlap. (Identical locations are OK.) These sequences are better implemented by MVC unless either the load or the store could use RELATIVE LONG instructions. The testcase showed that we weren't using LHRL and LGHRL for extload16, only sextloadi16. The patch fixes that too. llvm-svn: 185919	2013-07-09 09:46:39 +00:00
Richard Sandiford	47660c148c	[SystemZ] Use "STC;MVC" for memset Use "STC;MVC" for memsets that are too big for two STCs or MV...Is yet small enough for a single MVC. As with memcpy, I'm leaving longer cases till later. The number of tests might seem excessive, but f33 & f34 from memset-04.ll failed the first cut because I'd not added the "?:" on the calculation of Size1. llvm-svn: 185918	2013-07-09 09:32:42 +00:00
David Majnemer	eeed73b981	InstCombine: Fix typo in comment for visitICmpInstWithInstAndIntCst llvm-svn: 185916	2013-07-09 09:24:35 +00:00
David Majnemer	72d76275ac	InstCombine: variations on 0xffffffff - x >= 4 The following transforms are valid if -C is a power of 2: (icmp ugt (xor X, C), ~C) -> (icmp ult X, C) (icmp ult (xor X, C), -C) -> (icmp uge X, C) These are nice, they get rid of the xor. llvm-svn: 185915	2013-07-09 09:20:58 +00:00
David Majnemer	414d4e58aa	InstCombine: X & -C != -C -> X <= u ~C Tests were added in r185910 somehow. llvm-svn: 185912	2013-07-09 08:09:32 +00:00
Ulrich Weigand	78a5a116a0	[PowerPC] Support .llong and fix .word This adds support for the .llong PowerPC-specific assembler directive. In doing so, I noticed that .word is currently incorrect: it is supposed to define a 2-byte data element, not a 4-byte one. llvm-svn: 185911	2013-07-09 07:59:25 +00:00
David Majnemer	bafa537eb7	Commit r185909 was a misapplied patch, fix it llvm-svn: 185910	2013-07-09 07:58:32 +00:00
David Majnemer	f2a9a513c7	InstCombine: add more transforms C1-X <u C2 -> (X|(C2-1)) == C1 C1-X >u C2 -> (X|C2) == C1 X-C1 <u C2 -> (X & -C2) == C1 X-C1 >u C2 -> (X & ~C2) == C1 llvm-svn: 185909	2013-07-09 07:50:59 +00:00
Hal Finkel	dbbf09b28e	PPC: Allocate RS spill slot for unaligned i64 load/store This fixes another bug found by llvm-stress! If we happen to be doing an i64 load or store into a stack slot that has less than a 4-byte alignment, then the frame-index elimination may need to use an indexed load or store instruction (because the offset may not be a multiple of 4, a requirement of the STD/LD instructions). The extra register needed to hold the offset comes from the register scavenger, and it is possible that the scavenger will need to use an emergency spill slot. As a result, we need to make sure that a spill slot is allocated when doing an i64 load/store into a less-than-4-byte-aligned stack slot. Because test cases for things like this tend to be fairly fragile, I've concatenated a few small bugpoint-reduced test cases together to form the regression test. llvm-svn: 185907	2013-07-09 06:34:51 +00:00
Rafael Espindola	0f3de64ddf	Compute the size of an archive member in the constructor. It is always computed the same way (by parsing the header). Doing it in the constructor simplifies the callers a bit. llvm-svn: 185905	2013-07-09 05:26:25 +00:00
Rafael Espindola	747bc07bc3	Move some code out of line. No functionality change. llvm-svn: 185901	2013-07-09 03:39:35 +00:00
Jim Grosbach	340b6da4f2	X86: Add comment. llvm-svn: 185900	2013-07-09 02:07:28 +00:00
Jim Grosbach	c35388f103	X86 fast-isel: Avoid explicit AH subreg reference for [SU]Rem. Explicit references to %AH for an i8 remainder instruction can lead to references to %AH in a REX prefixed instruction, which causes things to blow up. Do the same thing in FastISel as we do for DAG isel and instead shift %AX right by 8 bits and then extract the 8-bit subreg from that result. rdar://14203849 http://llvm.org/bugs/show_bug.cgi?id=16105 llvm-svn: 185899	2013-07-09 02:07:25 +00:00
Sean Silva	2f672d610e	Make BinaryRef output correctly in case of empty data. Previously, it would simply output nothing, but it should output an empty string `""`. llvm-svn: 185894	2013-07-09 00:54:46 +00:00
Stephen Lin	8e8424eb17	Style fixes: remove unnecessary braces for one-statement if blocks, no else after return, etc. No functionality change. llvm-svn: 185893	2013-07-09 00:44:49 +00:00
Eric Christopher	215a77585d	Revert "DebugInfo: remove unused helper function getDICompositeType." This reverts commit r185876 as the functions appear to still be used by dragonegg. llvm-svn: 185890	2013-07-09 00:16:56 +00:00
Eli Bendersky	07b0e451ca	Fix comment llvm-svn: 185888	2013-07-08 23:57:07 +00:00
Nadav Rotem	c9c57518ab	This patch changes the saved IRBuilder insert point from BasicBlock::iterator to AssertingVH. Commit 185883 fixes a bug in the IRBuilder that should fix the ASan bot. AssertingVH can help in exposing some RAUW problems. Thanks Ben and Alexey! llvm-svn: 185886	2013-07-08 23:31:13 +00:00
Michael Gottesman	c1b648f6c0	[objc-arc] Fix assertion in EraseInstruction so that noop on null calls when passed null do not trigger the assert. The specific case of interest is when objc_retainBlock is passed null. llvm-svn: 185885	2013-07-08 23:30:23 +00:00
Manman Ren	8bad86c81b	DebugInfo: remove unused helper function getDICompositeType. llvm-svn: 185876	2013-07-08 21:55:46 +00:00
Bill Wendling	0176708e85	Stop emitting weak symbols into the "coal" sections. The Mach-O linker has been able to support the weak-def bit on any symbol for quite a while now. The compiler however continued to place these symbols into a "coal" section, which required the linker to map them back to the base section name. Replace the sections like this: __TEXT/__textcoal_nt instead use __TEXT/__text __TEXT/__const_coal instead use __TEXT/__const __DATA/__datacoal_nt instead use __DATA/__data <rdar://problem/14265330> llvm-svn: 185872	2013-07-08 21:34:52 +00:00
Eric Christopher	aba20dd603	Update comment to avoid mentioning DbgValues which is an instance variable later in the class. llvm-svn: 185866	2013-07-08 21:16:18 +00:00
Manman Ren	9c5e998043	Revert r185852. llvm-svn: 185861	2013-07-08 20:27:34 +00:00
Matt Arsenault	fe56cc67c5	Find xdot or xdot.py. Ubuntu installs this as xdot, so finding xdot.py would fail. llvm-svn: 185860	2013-07-08 20:24:54 +00:00
Ulrich Weigand	266db7fe04	[PowerPC] Always use "assembler dialect" 1 A setting in MCAsmInfo defines the "assembler dialect" to use. This is used by common code to choose between alternatives in a multi-alternative GNU inline asm statement like the following: __asm__ ("{sfe|subfe} %0,%1,%2" : "=r" (out) : "r" (in1), "r" (in2)); The meaning of these dialects is platform specific, and GCC defines those for PowerPC to use dialect 0 for old-style (POWER) mnemonics and 1 for new-style (PowerPC) mnemonics, like in the example above. To be compatible with inline asm used with GCC, LLVM ought to do the same. Specifically, this means we should always use assembler dialect 1 since old-style mnemonics really aren't supported on any current platform. However, the current LLVM back-end uses: AssemblerDialect = 1; // New-Style mnemonics. in PPCMCAsmInfoDarwin, and AssemblerDialect = 0; // Old-Style mnemonics. in PPCLinuxMCAsmInfo. The Linux setting really isn't correct, we should be using new-style mnemonics everywhere. This is changed by this commit. Unfortunately, the setting of this variable is overloaded in the back-end to decide whether or not we are on a Darwin target. This is done in PPCInstPrinter (the "SyntaxVariant" is initialized from the MCAsmInfo AssemblerDialect setting), and also in PPCMCExpr. Setting AssemblerDialect to 1 for both Darwin and Linux no longer allows us to make this distinction. Instead, this patch uses the MCSubtargetInfo passed to createPPCMCInstPrinter to distinguish Darwin targets, and ignores the SyntaxVariant parameter. As to PPCMCExpr, this patch adds an explicit isDarwin argument that needs to be passed in by the caller when creating a target MCExpr. (To do so this patch implicitly also reverts commit 184441.) llvm-svn: 185858	2013-07-08 20:20:51 +00:00
Hal Finkel	21ada79757	PPC: Mark vector CC action for SETO and SETONE as Expand Another bug found by llvm-stress! This fixes hitting llvm_unreachable("Invalid integer vector compare condition"); at the end of getVCmpInst in PPCISelDAGToDAG. llvm-svn: 185855	2013-07-08 20:00:03 +00:00
Joey Gouly	392cdad2b1	Add a comment to this change, requested by Eric Christopher. llvm-svn: 185853	2013-07-08 19:52:51 +00:00
Manman Ren	c6fe5bc77c	StringRef: add DenseMapInfo for StringRef. Remove the implementation in include/llvm/Support/YAMLTraits.h. Added a DenseMap type DITypeHashMap in DebugInfo.h: DenseMap<std::pair<StringRef, unsigned>, MDNode*> llvm-svn: 185852	2013-07-08 19:17:48 +00:00
Manman Ren	7504ed4255	Debug Info: clean up usage of Verify. No functionality change. It should suffice to check the type of a debug info metadata, instead of calling Verify. llvm-svn: 185847	2013-07-08 18:33:29 +00:00
Jim Grosbach	24e102a947	ARM: Improve codegen for generic vselect. Fall back to by-element insert rather than building it up on the stack. rdar://14351991 llvm-svn: 185846	2013-07-08 18:18:52 +00:00
David Blaikie	ce1960f936	DebugInfo: Correct comment & re-format a nearby loop llvm-svn: 185844	2013-07-08 17:51:28 +00:00
Shuxin Yang	efc4c01ed3	Fix a SCEV update problem. The symptom is seg-fault, and the root cause is that a SCEV contains a SCEVUnknown which has null-pointer to a llvm::Value. This is how the problem takes place: =================================== 1). In the pristine input IR, there are two relevant instructions Op1 and Op2, Op1's corresponding SCEV (denoted as SCEV(op1)) is a SCEVUnknown, and SCEV(Op2) contains SCEV(Op1). None of these instructions are dead. Op1 : V1 = ... ... Op2 : V2 = ... // directly or indirectly (data-flow) depends on Op1 2) Optimizer (LSR in my case) generates an instruction holding the equivalent value of Op1, making Op1 dead. Op1': V1' = ... Op1: V1 = ... ; now dead) Op2 : V2 = ... //Now deps on Op1', but the SCEV(Op2) still contains SCEV(Op1) 3) Op1 is deleted, and call-back function is called to reset SCEV(Op1) to indicate it is invalid. However, SCEV(Op2) is not invalidated as well. 4) Following pass get the cached, invalid SCEV(Op2), and try to manipulate it, and cause segfault. The fix: ======== It seems there is no clean yet inexpensive fix. I write to dev-list soliciting good solution, unfortunately no ack. So, I decide to fix this problem in a brute-force way: When ScalarEvolution::getSCEV is called, check if the cached SCEV contains an invalid SCEVUnknown, if yes, remove the cached SCEV, and re-evaluate the SCEV from scratch. I compile bunch of big .c and .cpp, fortunately, I don't see any increase in compile time. Misc: ===== The reduced test-case has 2357 lines of code+other-stuff, too big to commit. rdar://14283433 llvm-svn: 185843	2013-07-08 17:33:13 +00:00
David Blaikie	ac569a656f	DebugInfo: Simplify Address Pool index handling. Since the pool indexes are necessarily sequential and contiguous, just insert things in the right place rather than having to sort the sequence after the fact. No functionality change. llvm-svn: 185842	2013-07-08 17:33:10 +00:00
Hal Finkel	e39302258e	PPC: Mark vector FREM as Expand by default Another bug found by llvm-stress! This fixes crashing with: LLVM ERROR: Cannot select: v4f32 = frem ... llvm-svn: 185840	2013-07-08 17:30:25 +00:00
Rafael Espindola	a8a9f1baf0	We now always create files with the correct permissions. Simplify the interface. llvm-svn: 185834	2013-07-08 16:42:01 +00:00
Rafael Espindola	9a7801566f	Create files with the correct permission instead of changing it afterwards. Not intended functionality change. llvm-svn: 185830	2013-07-08 15:22:09 +00:00
Ulrich Weigand	e840ee2ca2	[PowerPC] Support time base instructions This adds support for the old-style time base instructions; while new programs are supposed to use mfspr, the mftb instructions are still supported and in use by existing assembler files. llvm-svn: 185829	2013-07-08 15:20:38 +00:00
Ulrich Weigand	c0944b50fe	[PowerPC] Support basic compare mnemonics This adds support for the basic mnemonics (with the L operand) for the fixed-point compare instructions. These are defined as aliases for the already existing CMPW/CMPD patterns, depending on the value of L. This requires use of InstAlias patterns with immediate literal operands. To make this work, we need two further changes: - define a RegisterPrefix, because otherwise literals 0 and 1 would be parsed as literal register names - provide a PPCAsmParser::validateTargetOperandClass routine to recognize immediate literals (like ARM does) llvm-svn: 185826	2013-07-08 14:49:37 +00:00
Hal Finkel	12493bb7d5	Improve the comment from r185794 (re: PromoteIntRes_BUILD_VECTOR) In response to Duncan's review, I believe that the original comment was not as clear as it could be. Hopefully, this is better. llvm-svn: 185824	2013-07-08 14:40:04 +00:00
Bill Schmidt	2db29ef467	[PowerPC] Fix PR16556 (handle undef ppcf128 in LowerFP_TO_INT). PPCTargetLowering::LowerFP_TO_INT() expects its source operand to be either an f32 or f64, but this is not checked. A long double (ppcf128) operand will normally be custom-lowered to a conversion to f64 in this context. However, this isn't the case for an UNDEF node. This patch recognizes a ppcf128 as a legal source operand for FP_TO_INT only if it's an undef, in which case it creates an undef of the target type. At some point we might want to do a wholesale custom lowering of ISD::UNDEF when the type is ppcf128, but it's not really clear that's a great idea, and probably more work than it's worth for a situation that only arises in the case of a programming error. At this point I think simple is best. The test case comes from PR16556, and is a crash-test only. llvm-svn: 185821	2013-07-08 14:22:45 +00:00
David Majnemer	fa90a0b325	InstCombine: Fold X-C1 <u 2 -> (X & -2) == C1 Back in r179493 we determined that two transforms collided with each other. The fix back then was to reorder the transforms so that the preferred transform would give it a try and then we would try the secondary transform. However, it was noted that the best approach would canonicalize one transform into the other, removing the collision and allowing us to optimize IR given to us in that form. llvm-svn: 185808	2013-07-08 11:53:08 +00:00
Nico Rieck	51969be724	Reuse %rax after calling __chkstk on win64 Reapply this as I reverted the wrong commit. llvm-svn: 185807	2013-07-08 11:20:11 +00:00
Nico Rieck	4801303ce1	Revert "Proper va_arg/va_copy lowering on win64" This reverts commit 2b52880592a525cfe04d8f9008a35da8c2ea94c3. Needs review. llvm-svn: 185806	2013-07-08 11:19:44 +00:00
Richard Sandiford	d6c78e8f9f	[SystemZ] Remove unwanted part from last commit I was originally going to use MVC for memmove too, but that's less of a clear win. Remove some accidental left-overs in the previous commit. llvm-svn: 185804	2013-07-08 09:55:36 +00:00
Richard Sandiford	d131ff8cf8	[SystemZ] Use MVC for memcpy Use MVC for memcpy in cases where a single MVC is enough. Using MVC is a win for longer copies too, but I'll leave that for later. llvm-svn: 185802	2013-07-08 09:35:23 +00:00
Hal Finkel	8cb9a0e1d3	Fix PromoteIntRes_BUILD_VECTOR crash with i1 vectors This fixes a bug (found by llvm-stress) in DAGTypeLegalizer::PromoteIntRes_BUILD_VECTOR where it assumed that the result type would always be larger than the original operands. This is not always true, however, with boolean vectors. For example, promoting a node of type v8i1 (where the operands will be of type i32, the type to which i1 is promoted) will yield a node with a result vector element type of i16 (and operands of type i32). As a result, we cannot blindly assume that we can ANY_EXTEND the operands to the result type. llvm-svn: 185794	2013-07-08 06:16:58 +00:00
Kai Nacke	c5cca5ab42	Revert: Fix wrong code offset for unwind code SET_FPREG. llvm-svn: 185793	2013-07-08 04:48:34 +00:00
Kai Nacke	939ecd7ea0	Revert: Generate IMAGE_REL_AMD64_ADDR32NB relocations for SEH data structures. llvm-svn: 185791	2013-07-08 04:46:55 +00:00
Kai Nacke	07bad44e9b	Revert: Fix alignment of unwind data. llvm-svn: 185790	2013-07-08 04:45:05 +00:00
Kai Nacke	42097301f6	Revert: Emit personality function and Dwarf EH data for Win64 SEH. llvm-svn: 185788	2013-07-08 04:43:23 +00:00
Hal Finkel	ec474f28e3	Add the nearbyint -> FNEARBYINT mapping to BasicTargetTransformInfo This fixes an oversight that Intrinsic::nearbyint was not being mapped to ISD::FNEARBYINT (thus fixing the over-optimistic cost we were assigning to nearbyint calls for some targets). llvm-svn: 185783	2013-07-08 03:24:07 +00:00
Nico Rieck	43b51056d6	Revert "Reuse %rax after calling __chkstk on win64" This reverts commit 01f8d579f7672872324208ac5bc4ac311e81b22e. llvm-svn: 185781	2013-07-08 01:30:57 +00:00
Stephen Lin	cfe7f352c7	Remove trailing whitespace from SelectionDAG/*.cpp llvm-svn: 185780	2013-07-08 00:37:03 +00:00
Nico Rieck	7adf6111a8	Reuse %rax after calling __chkstk on win64 llvm-svn: 185778	2013-07-07 16:48:39 +00:00
Nadav Rotem	2ee35771a8	Clear the builder insert point between tree-vectorization phases. llvm-svn: 185777	2013-07-07 14:57:18 +00:00
Nick Lewycky	c0514629c9	Eliminate trivial redundant loads across nocapture+readonly calls to uncaptured pointer arguments. llvm-svn: 185776	2013-07-07 10:15:16 +00:00
Nadav Rotem	2041b742d4	SLPVectorizer: Implement DCE as part of vectorization. This is a complete re-write of the bottom-up vectorization class. Before this commit we scanned the instruction tree 3 times. First in search of merge points for the trees. Second, for estimating the cost. And finally for vectorization. There was a lot of code duplication and adding the DCE exposed bugs. The new design is simpler and DCE was a part of the design. In this implementation we build the tree once. After that we estimate the cost by scanning the different entries in the constructed tree (in any order). The vectorization phase also works on the built tree. llvm-svn: 185774	2013-07-07 06:57:07 +00:00
Michael Gottesman	618df456e2	[objc-arc] Remove the alias analysis part of r185764. Upon further reflection, the alias analysis part of r185764 is not a safe change. llvm-svn: 185770	2013-07-07 04:18:03 +00:00
Michael Gottesman	a72630d453	[objc-arc] Teach the ARC optimizer that objc_sync_enter/objc_sync_exit do not modify the ref count of an objc object and additionally are inert for modref purposes. llvm-svn: 185769	2013-07-07 01:52:55 +00:00
Stephen Lin	6d715e8699	SelectionDAGBuilder: style fixes (add space between end parentheses and open brace) llvm-svn: 185768	2013-07-06 21:44:25 +00:00
Joey Gouly	2efaa733a2	Add MC support for the v8fp instructions: vmaxnm and vminnm. llvm-svn: 185767	2013-07-06 20:50:18 +00:00
Michael Gottesman	e557da26db	[objc-arc] When we initialize ARCRuntimeEntryPoints, make sure we reset all references to entrypoint declarations as well. llvm-svn: 185764	2013-07-06 18:43:05 +00:00
Nico Rieck	99ef2890c0	Proper va_arg/va_copy lowering on win64 llvm-svn: 185763	2013-07-06 18:08:19 +00:00
Kai Nacke	c947ad2a2d	Emit personality function and Dwarf EH data for Win64 SEH. Obviously the personality function should be emitted as language handler instead of the hard coded _GCC_specific_handler. The language specific data must be placed after the unwind information therefore it must not be emitted into a separate section. Reviewed by Charles Davis and Nico Rieck. llvm-svn: 185761	2013-07-06 17:17:31 +00:00
Kai Nacke	4417cccba3	Fix alignment of unwind data. For alignment purposes, the instruction array will always have an even number of entries, with the final entry potentially unused (in which case the array will be one longer than indicated by the count of unwind codes field). Reviewed by Charles Davis and Nico Rieck. llvm-svn: 185760	2013-07-06 17:16:50 +00:00
Kai Nacke	2a933a6549	Generate IMAGE_REL_AMD64_ADDR32NB relocations for SEH data structures. The Win64 EH data structures must be of type IMAGE_REL_AMD64_ADDR32NB instead of IMAGE_REL_AMD64_ADDR32. This is easily achieved by adding the VK_COFF_IMGREL32 modifier to the symbol reference. Change also references to start and end of the SEH range of a function as offsets to start of the function. Reviewed by Charles Davis and Nico Rieck. llvm-svn: 185759	2013-07-06 17:16:12 +00:00
Kai Nacke	66bfdb8354	Fix wrong code offset for unwind code SET_FPREG. The code offset for unwind code SET_FPREG is wrong because it is set to constant 0. The fix is to do the same as for the other unwind codes: emit a label and later the absolute difference between the label and the begin of the prologue. Also enables the failing test case MC/COFF/seh.s Reviewed by Charles Davis and Nico Rieck. llvm-svn: 185758	2013-07-06 17:15:36 +00:00
Benjamin Kramer	3d90a8f4f9	Reassociate: Remove unnecessary default operator=. llvm-svn: 185757	2013-07-06 15:10:13 +00:00
Benjamin Kramer	c7332b2796	DAGCombiner: Don't drop extension behavior when shrinking a load when unsafe. ReduceLoadWidth unconditionally drops extensions from loads. Limit it to the case when all of the bits the extension would otherwise produce are dropped by the shrink. It would be possible to shrink the load in more cases by merging the extensions, but this isn't trivial and a very rare case. I left a TODO for that case. Fixes PR16551. llvm-svn: 185755	2013-07-06 14:05:09 +00:00
Tim Northover	dab4db5372	Stop putting operations after a tail call. This prevents the emission of DAG-generated vreg definitions after a tail call by dropping them entirely (on the grounds that nothing could use them anyway, and they interfere with O0 CodeGen). llvm-svn: 185754	2013-07-06 12:58:45 +00:00
Nico Rieck	a37acf702d	MC: Implement COFF .linkonce directive llvm-svn: 185753	2013-07-06 12:13:10 +00:00
David Majnemer	c13678a24f	isKnownToBeAPowerOfTwo: Fix a typo in a comment llvm-svn: 185748	2013-07-06 02:24:59 +00:00

62944 Commits