llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	24048c9440	Mark a method 'const' and another 'static'. llvm-svn: 186485	2013-07-17 03:54:53 +00:00
Craig Topper	1c4d667ca5	Make a few more static string pointers constant. llvm-svn: 186484	2013-07-17 03:43:10 +00:00
Rafael Espindola	b6fea4c618	Don't fallback to copy + delete in rename. Rename's documentation says "Files are renamed as if by POSIX rename()". and it is used for atomically updating output files from a temporary. Having rename fallback to a non atomic copy has the potential to hide bugs, like using a temporary file in /tmp instead of a unique name next to the final destination. llvm-svn: 186483	2013-07-17 03:33:41 +00:00
Craig Topper	9fdc70e846	Make constant string pointer into an array to remove a pointer lookup for every access. llvm-svn: 186482	2013-07-17 03:11:32 +00:00
NAKAMURA Takumi	212c80ac5d	raw_ostream.cpp: Introduce <fcntl.h> to let O_BINARY provided. Or, llvm::outs() would be set to O_TEXT by default. llvm/test/Object/check_binary_output.ll is expected to pass on win32. llvm-svn: 186480	2013-07-17 02:21:10 +00:00
Nadav Rotem	2202317fce	SLPVectorizer: Accelerate the isConsecutive check by replacing the subtraction of the two values with a simple SCEV expression that adds the offset to one of the pointers that we compare. llvm-svn: 186479	2013-07-17 00:48:31 +00:00
Hal Finkel	a7c54e8cf4	PPC: Implement base pointer and stack realignment This builds on some frame-lowering code that has existed since 2005 (r24224) but was disabled in 2008 (r48188) because it needed base pointer support to function correctly. This implementation follows the strategy suggested by Dale Johannesen in r48188 where the following comment was added: This does not currently work, because the delta between old and new stack pointers is added to offsets that reference incoming parameters after the prolog is generated, and the code that does that doesn't handle a variable delta. You don't want to do that anyway; a better approach is to reserve another register that retains to the incoming stack pointer, and reference parameters relative to that. And now we do exactly that. If we don't need a frame pointer, then we use r31 as a base pointer. If we do need a frame pointer, then we use r30 as a base pointer. The base pointer retains the value of the stack pointer before it was decremented in the prologue. We then use the base pointer to resolve all negative frame indicies. The basic scheme follows that for base pointers in the X86 backend. We use a base pointer when we need to dynamically realign the incoming stack pointer. This currently applies only to static objects (dynamic allocas with large alignments, and base-pointer support in SjLj lowering will come in future commits). llvm-svn: 186478	2013-07-17 00:45:52 +00:00
NAKAMURA Takumi	4caf019a1a	llvm/test/CodeGen/X86/vec_setcc.ll: Add explicit -mtriple=x86_64-unknown-unknown to satisfy win32-targeted configuration. llvm-svn: 186477	2013-07-17 00:42:37 +00:00
Craig Topper	8fc4096fab	Move string pointer from being a static class member to just a static global in the one file its needed in. llvm-svn: 186476	2013-07-17 00:31:35 +00:00
Manman Ren	8bfde8917e	Add getModuleFlag(StringRef Key) to query a module flag given Key. No functionality change. llvm-svn: 186470	2013-07-16 23:21:16 +00:00
NAKAMURA Takumi	86bae2c54d	llvm/test/Object/ar-create.test: Relax a CHECK line to satisfy localized message catalogue. For example, 'No such file or directory' cannot be seen on Japanese version of msvcrt. llvm-svn: 186469	2013-07-16 23:17:22 +00:00
NAKAMURA Takumi	c398315043	llvm/test/Object/check_binary_output.ll: Mark it as XFAIL on Windows. Investigating. llvm-svn: 186468	2013-07-16 23:16:57 +00:00
Nadav Rotem	d2e8c4cdea	flip the scev minus direction to simplify the code. llvm-svn: 186466	2013-07-16 22:57:06 +00:00
Nadav Rotem	8f924f3891	SLPVectorizer: Improve the compile time of isConsecutive by adding a simple constant-gep check before using SCEV. This check does not always work because not all of the GEPs use a constant offset, but it happens often enough to reduce the number of times we use SCEV. llvm-svn: 186465	2013-07-16 22:51:07 +00:00
Lang Hames	57a113eb0d	Related to r181161 - Indirect branches may not be the last branch in a basic block. Blocks that have an indirect branch terminator, even if it's not the last terminator, should still be treated as unanalyzable. <rdar://problem/14437274> Reducing a useful regression test case is proving difficult - I hope to have one soon. llvm-svn: 186461	2013-07-16 22:01:40 +00:00
Tilmann Scheller	305bb90442	ARM: Add support for the Thumb2 PLI alternate literal form. This adds an instruction alias to make the assembler recognize the alternate literal form: pli [PC, #+/-<imm>] See A8.8.129 in the ARM ARM (DDI 0406C.b). Fixes <rdar://problem/14403733>. llvm-svn: 186459	2013-07-16 21:52:34 +00:00
Rafael Espindola	f9a2619930	Update the examples for an API change. llvm-svn: 186453	2013-07-16 20:22:35 +00:00
Rafael Espindola	6d35481c94	Add a wrapper for open. This centralizes the handling of O_BINARY and opens the way for hiding more differences (like how open behaves with directories). llvm-svn: 186447	2013-07-16 19:44:17 +00:00
Benjamin Kramer	d9508b704d	Finally, force the target for this test. Should unbreak non-x86 buildbots. llvm-svn: 186445	2013-07-16 19:22:07 +00:00
Rafael Espindola	b4f7831320	XFAIL this test on mingw. llvm-svn: 186444	2013-07-16 19:20:29 +00:00
Benjamin Kramer	0edeabfe43	Label names also differ between platforms. Use a relaxed regex. llvm-svn: 186442	2013-07-16 18:54:21 +00:00
Benjamin Kramer	cadc611e93	Fix test not to fail when the target doesn't use leading underscores on symbols. llvm-svn: 186439	2013-07-16 18:42:01 +00:00
Manman Ren	18ba5b2e0f	Cleanup testing case by using a shorter name for types. llvm-svn: 186436	2013-07-16 18:26:48 +00:00
Jakob Stoklund Olesen	efeb3a1969	Remove floats from live range splitting costs. These floats all represented block frequencies anyway, so just use the BlockFrequency class directly. Some floating point computations remain in tryLocalSplit(). They are estimating spill weights which are still floats. llvm-svn: 186435	2013-07-16 18:26:18 +00:00
Jakob Stoklund Olesen	c5454ff046	Reapply r185393. Original commit message: Remove floating point computations from SpillPlacement.cpp. Patch by Benjamin Kramer! Use the BlockFrequency class instead of floats in the Hopfield network computations. This rescales the node Bias field from a [-2;2] float range to two block frequencies BiasN and BiasP pulling in opposite directions. This construct has a more predictable behavior when block frequencies saturate. The per-node scaling factors are no longer necessary, assuming the block frequencies around a bundle are consistent. This patch can cause the register allocator to make different spilling decisions. The differences should be small. llvm-svn: 186434	2013-07-16 18:26:15 +00:00
Juergen Ributzka	3d527d80b8	[X86] Use min/max to optimze unsigend vector comparison on X86 Use PMIN/PMAX for UGE/ULE vector comparions to reduce the number of required instructions. This trick also works for UGT/ULT, but there is no advantage in doing so. It wouldn't reduce the number of instructions and it would actually reduce performance. Reviewer: Ben radar:5972691 llvm-svn: 186432	2013-07-16 18:20:45 +00:00
Peter Collingbourne	8b77f18da0	Make SpecialCaseList match full strings, as documented, using anchors. Differential Revision: http://llvm-reviews.chandlerc.com/D1149 llvm-svn: 186431	2013-07-16 17:56:07 +00:00
Juergen Ributzka	c16f86020c	Test commit to verify write access. llvm-svn: 186429	2013-07-16 17:44:23 +00:00
Reid Kleckner	7df03c2e30	[Support] Add a Unicode conversion wrapper from UTF16 to UTF8 This is to support parsing UTF16 response files in LLVM/lib/Option for lld and clang. Reviewers: hans Differential Revision: http://llvm-reviews.chandlerc.com/D1138 llvm-svn: 186426	2013-07-16 17:14:33 +00:00
Hal Finkel	9caa8f7ba7	When the inliner merges allocas, it must keep the larger alignment For safety, the inliner cannot decrease the allignment on an alloca when merging it with another. I've included two variants of the test case for this: one with DataLayout available, and one without. When DataLayout is not available, if only one of the allocas uses the default alignment (getAlignment() == 0), then they cannot be safely merged. llvm-svn: 186425	2013-07-16 17:10:55 +00:00
Rafael Espindola	7303af37b7	On error, close the temporary file descriptor. With this change llvm-ar can remove the temporary file on windows too. llvm-svn: 186423	2013-07-16 16:00:32 +00:00
Nadav Rotem	26bf9a0c75	SLPVectorizer: Reduce the compile time of the consecutive store lookup. Process groups of stores in chunks of 16. llvm-svn: 186420	2013-07-16 15:25:17 +00:00
Rafael Espindola	e08b59f81d	Create files with mode 666. This matches the behavior of other unix tools. llvm-svn: 186414	2013-07-16 14:10:07 +00:00
Reid Kleckner	5f4535b974	[Support] Fix some warnings when self-hosting clang on Windows llvm-svn: 186413	2013-07-16 14:04:08 +00:00
Ulrich Weigand	1d4dbda5b9	[APFloat] PR16573: Avoid losing mantissa bits in ppc_fp128 to double truncation When truncating to a format with fewer mantissa bits, APFloat::convert will perform a right shift of the mantissa by the difference of the precision of the two formats. Usually, this will result in just the mantissa bits needed for the target format. One special situation is if the input number is denormal. In this case, the right shift may discard significant bits. This is usually not a problem, since truncating a denormal usually results in zero (underflow) after normalization anyway, since the result format's exponent range is usually smaller than the target format's. However, there is one case where the latter property does not hold: when truncating from ppc_fp128 to double. In particular, truncating a ppc_fp128 whose first double of the pair is denormal should result in just that first double, not zero. The current code however performs an excessive right shift, resulting in lost result bits. This is then caught in the APFloat::normalize call performed by APFloat::convert and causes an assertion failure. This patch checks for the scenario of truncating a denormal, and attempts to (possibly partially) replace the initial mantissa right shift by decrementing the exponent, if doing so will still result in a valid target format exponent. Index: test/CodeGen/PowerPC/pr16573.ll =================================================================== --- test/CodeGen/PowerPC/pr16573.ll (revision 0) +++ test/CodeGen/PowerPC/pr16573.ll (revision 0) @@ -0,0 +1,11 @@ +; RUN: llc < %s \| FileCheck %s + +target triple = "powerpc64-unknown-linux-gnu" + +define double @test() { + %1 = fptrunc ppc_fp128 0xM818F2887B9295809800000000032D000 to double + ret double %1 +} + +; CHECK: .quad -9111018957755033591 + Index: lib/Support/APFloat.cpp =================================================================== --- lib/Support/APFloat.cpp (revision 185817) +++ lib/Support/APFloat.cpp (working copy) @@ -1956,6 +1956,23 @@ X86SpecialNan = true; } + // If this is a truncation of a denormal number, and the target semantics + // has larger exponent range than the source semantics (this can happen + // when truncating from PowerPC double-double to double format), the + // right shift could lose result mantissa bits. Adjust exponent instead + // of performing excessive shift. + if (shift < 0 && isFiniteNonZero()) { + int exponentChange = significandMSB() + 1 - fromSemantics.precision; + if (exponent + exponentChange < toSemantics.minExponent) + exponentChange = toSemantics.minExponent - exponent; + if (exponentChange < shift) + exponentChange = shift; + if (exponentChange < 0) { + shift -= exponentChange; + exponent += exponentChange; + } + } + // If this is a truncation, perform the shift before we narrow the storage. if (shift < 0 && (isFiniteNonZero() \|\| category==fcNaN)) lostFraction = shiftRight(significandParts(), oldPartCount, -shift); llvm-svn: 186409	2013-07-16 13:03:25 +00:00
Richard Osborne	ab29d19536	[XCore] Fix printing of inline asm operands. Previously an asm operand with no operand modifier would give the error "invalid operand in inline asm". llvm-svn: 186407	2013-07-16 12:48:34 +00:00
Tim Northover	069f95f926	ARM: allow printing of ARM atomic DAG nodes. We'd forgotten to provide string representations for the special ARMISD atomic nodes; this adds them in. No effect on CodeGen, just makes the output of "-view-whatever-dags" slightly more readable. llvm-svn: 186406	2013-07-16 12:15:36 +00:00
Richard Sandiford	885140c951	[SystemZ] Use ROSBG and non-zero form of RISBG for OR nodes llvm-svn: 186405	2013-07-16 11:55:57 +00:00
Vladimir Medic	a73970b662	Fixing a buildbot failure:unused function. llvm-svn: 186403	2013-07-16 11:43:20 +00:00
Richard Sandiford	35bb463fb1	[SystemZ] Add MC support for R[NOX]SBG CodeGen support will come later. llvm-svn: 186401	2013-07-16 11:28:08 +00:00
Richard Sandiford	82ec87dbdb	[SystemZ] Use RISBG for (shift (and ...)) Another patch in the series to make more use of R.SBG. This one extends r186072 and r186073 to handle cases where the AND is inside the shift. llvm-svn: 186399	2013-07-16 11:02:24 +00:00
Vladimir Medic	64828a1f73	This patch represents Mips utilization of r186388 code that alows asm matcher to emit mnemonics contain '.' characters. This makes asm parser code simpler and more efficient. llvm-svn: 186397	2013-07-16 10:07:14 +00:00
NAKAMURA Takumi	37ce985739	PPCJITInfo.cpp: Tweak r186252 with s/__ppc/__powerpc/ to work on powerpc-linux Fedora 12. g++ (GCC) 4.4.4 20100630 (Red Hat 4.4.4-10) llvm-svn: 186396	2013-07-16 09:59:51 +00:00
Tim Northover	a7ecd241d2	ARM: implement ldrex, strex and clrex intrinsics Intrinsics already existed for the 64-bit variants, so these support operations of size at most 32-bits. llvm-svn: 186392	2013-07-16 09:46:55 +00:00
Renato Golin	8761069e22	ARM EABI divmod support This patch enables calls to __aeabi_idivmod when in EABI mode, by using the remainder value returned on registers (R1), enabled by the ARM triple "none-eabi". Note that Darwin and GNUEABI triples will continue lowering on GNU style, that is, using the stack for the remainder. Still need to add SREM/UREM support fix for 64-bit lowering. llvm-svn: 186390	2013-07-16 09:32:17 +00:00
Vladimir Medic	75429adb4d	This patch allows targets to define weather the instruction mnemonics in asm matcher tables will contain '.' character. llvm-svn: 186388	2013-07-16 09:22:38 +00:00
NAKAMURA Takumi	07bc8e9b8e	llvm/test/Object/directory.ll: Mark it as XFAIL:cygwin. Directories can be opened on cygwin. llvm-svn: 186387	2013-07-16 09:06:47 +00:00
Rafael Espindola	eed7690155	Use open+fstat instead of stat+open. llvm-svn: 186381	2013-07-16 03:34:31 +00:00
Rafael Espindola	8c1ee47fb0	Remember that we have a null terminated string. This is a micro optimization. Instead of going char->StringRef->Twine->char, go char->Twine->char and avoid having to copy the filename on the stack. llvm-svn: 186380	2013-07-16 03:30:10 +00:00
Rui Ueyama	2c633e40ee	[Object/COFF] Add import_directory_table_entry. Summary: Add import_directory_table_entry to use for .idata section. Reviewers: Bigcheese CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1059 llvm-svn: 186379	2013-07-16 03:23:55 +00:00
Rafael Espindola	77021c9487	Add a version of sys::fs::status that uses fstat. llvm-svn: 186378	2013-07-16 03:20:13 +00:00
Rui Ueyama	77b4c81761	COFF: Add constants for optional data directory. llvm-svn: 186377	2013-07-16 03:11:55 +00:00
Rafael Espindola	9da91a0e03	Instead friending status, provide windows and posix constructors to file_status. This opens the way of having static helpers in the .inc files that can construct a file_status. llvm-svn: 186376	2013-07-16 02:55:33 +00:00
NAKAMURA Takumi	be2978b2e4	unittests/Support: Add TimeValue.Win32FILETIME, corresponding to r186374. llvm-svn: 186375	2013-07-16 02:44:23 +00:00
NAKAMURA Takumi	59f3ff8c41	Fix TimeValue::toWin32Time() to be symmetric to fromWin32Time() and compatible to Win32's FILETIME. llvm-ar is the only user of toWin32Time() (via setLastModificationAndAccessTime), and r186298 can be reverted. It had been buggy since the initial commit. FIXME: Could we rename {from\|to}Win32Time as {from\|to}Win32FILETIME in TimeValue? llvm-svn: 186374	2013-07-16 02:43:51 +00:00
NAKAMURA Takumi	ad187dd20e	Rename Support.TimeValue to TimeValue.time_t in unittests/Support. llvm-svn: 186372	2013-07-16 02:03:32 +00:00
Craig Topper	d3a34f81f8	Add 'const' qualifiers to static const char* variables. llvm-svn: 186371	2013-07-16 01:17:10 +00:00
Rafael Espindola	f85d3ab7f8	Add mingw32 to the XFAIL. I forgot about it when adding win32. llvm-svn: 186365	2013-07-15 23:51:47 +00:00
Manman Ren	b827123cf7	PEI: Support for non-zero SPAdj at beginning of a basic block. We can have a FrameSetup in one basic block and the matching FrameDestroy in a different basic block when we have struct byval. In that case, SPAdj is not zero at beginning of the basic block. Modify PEI to correctly set SPAdj at beginning of each basic block using DFS traversal. We used to assume SPAdj is 0 at beginning of each basic block. PEI had an assert SPAdjCount \|\| SPAdj == 0. If we have a Destroy <n> followed by a Setup <m>, PEI will assert failure. We can add an extra condition to make sure the pairs are matched: The pairs start with a FrameSetup. But since we are doing a much better job in the verifier, this patch removes the check in PEI. PR16393 llvm-svn: 186364	2013-07-15 23:47:29 +00:00
Nadav Rotem	1c1d6c1666	PR16628: Fix a bug in the code that merges compares. Compares return i1 but they compare different types. llvm-svn: 186359	2013-07-15 22:52:48 +00:00
Hal Finkel	a0014a5a26	PPC: Refactoring to support subtarget feature changing This change mirrors the changes that were made to the X86 and ARM targets to support subtarget feature changing. As indicated in r182899, the mechanism is still undergoing revision, and so as with the X86 and ARM targets, there is no test case yet (there is no effective functionality change). llvm-svn: 186357	2013-07-15 22:29:40 +00:00
David Blaikie	02559ebbd3	Further simplify test case from r186119/r186035. llvm-svn: 186356	2013-07-15 22:28:45 +00:00
Rafael Espindola	54b71fdee2	XFAIL on windows too and document the XFAILs. llvm-svn: 186354	2013-07-15 22:16:53 +00:00
Manman Ren	aa6875b1f9	Machine Verifier: verify FrameSetup and FrameDestroy 1> on every path through the CFG, a FrameSetup <n> is always followed by a FrameDestroy <n> and a FrameDestroy is always followed by a FrameSetup. 2> stack adjustments are identical on all CFG edges to a merge point. 3> frame is destroyed at end of a return block. PR16393 llvm-svn: 186350	2013-07-15 21:26:31 +00:00
Rafael Espindola	8ea26d6a80	Remove an extra is_directory call. I checked that opening a directory on windows does fail, so this saves a "stat". llvm-svn: 186345	2013-07-15 20:52:01 +00:00
Hal Finkel	8e8618ae5c	Fix register subclass handling in PPCInstrInfo::insertSelect PPCInstrInfo::insertSelect and PPCInstrInfo::canInsertSelect were computing the common subclass of the true and false inputs, and then selecting either the 32-bit or the 64-bit isel variant based on the result of calling PPC::GPRCRegClass.hasSubClassEq(RC) and PPC::G8RCRegClass.hasSubClassEq(RC) (where RC is the common subclass). Unfortunately, this is not quite right: if we have something like this: %vreg8<def> = SELECT_CC_I8 %vreg4<kill>, %vreg7<kill>, %vreg6<kill>, 76; G8RC_and_G8RC_NOX0:%vreg8 CRRC:%vreg4 G8RC_NOX0:%vreg7,%vreg6 then the common subclass of G8RC_and_G8RC_NOX0 and G8RC_NOX0 is G8RC_NOX0, and G8RC_NOX0 is not a subclass of G8RC (because it also contains the ZERO8 pseudo-register). As a result, we also need to check the common subclass against GPRC_NOR0 and G8RC_NOX0 explicitly. This had not been a problem for clients of insertSelect that called canInsertSelect first (because it had a compensating mistake), but insertSelect is also used by the PPC pseudo-instruction expander, and this error was causing a problem in that context. This problem was found by csmith. llvm-svn: 186343	2013-07-15 20:22:58 +00:00
Reid Kleckner	dae7b4e4d1	[mc-coff] Resolve aliases when emitting COFF relocations This is consistent with the ELF object writer. Add some COFF tests that relocate against an alias. Reviewers: espindola Differential Revision: http://llvm-reviews.chandlerc.com/D1079 llvm-svn: 186341	2013-07-15 19:41:21 +00:00
Tom Stellard	31209cc8eb	R600/SI: Add support for 64-bit loads https://bugs.freedesktop.org/show_bug.cgi?id=65873 llvm-svn: 186339	2013-07-15 19:00:09 +00:00
Hal Finkel	2f5e8e3d95	Remove invalid assert in DAGTypeLegalizer::RemapValue There is a comment at the top of DAGTypeLegalizer::PerformExpensiveChecks which, in part, says: // Note that these invariants may not hold momentarily when processing a node: // the node being processed may be put in a map before being marked Processed. Unfortunately, this assert would be valid only if the above-mentioned invariant held unconditionally. This was causing llc to assert when, in fact, everything was fine. Thanks to Richard Sandiford for investigating this issue! Fixes PR16562. llvm-svn: 186338	2013-07-15 18:57:05 +00:00
Stephen Lin	837bba1c51	Remove trailing whitespace llvm-svn: 186333	2013-07-15 17:55:02 +00:00
Chandler Carruth	e3899f2c2c	Revert r186316 while I track down an ASan failure and an assert from a bot. This reverts the commit which introduced a new implementation of the fancy SROA pass designed to reduce its overhead. I'll skip the huge commit log here, refer to r186316 if you're looking for how this all works and why it works that way. llvm-svn: 186332	2013-07-15 17:36:21 +00:00
Aaron Ballman	e59e358823	Teaching llvm-tblgen to not emit a switch statement when there are no case statements. llvm-svn: 186330	2013-07-15 16:53:32 +00:00
Reid Kleckner	a75eba9c05	Revert "[Option] Store arg strings in a set backed by a BumpPtrAllocator" This broke clang's crash-report.c test, and I haven't been able to figure it out yet. This reverts commit r186319. llvm-svn: 186329	2013-07-15 16:40:52 +00:00
Job Noorman	a928e1d7a1	Test commit to see if write access works. llvm-svn: 186321	2013-07-15 14:25:26 +00:00
Reid Kleckner	cacb40c6c7	[Option] Store arg strings in a set backed by a BumpPtrAllocator No functionality change. This is preparing to move response file parsing into lib/Option so it can be shared between clang and lld. This change isn't just a micro-optimization. Clang's driver uses a std::set<std::string> to unique arguments while parsing response files, so this matches that. llvm-svn: 186319	2013-07-15 13:46:24 +00:00
Rafael Espindola	69d2271871	XFAIL this on freebsd to bring the bot back. Joerg Sonnenberger tells me one can open a directory in freebsd. I will try to centralize our calls to open so that we can handle O_BINARY in one place, and will then handle this there too. llvm-svn: 186317	2013-07-15 12:18:30 +00:00
Chandler Carruth	e74ff4c643	Reimplement SROA yet again. Same fundamental principle, but a totally different core implementation strategy. Previously, SROA would build a relatively elaborate partitioning of an alloca, associate uses with each partition, and then rewrite the uses of each partition in an attempt to break apart the alloca into chunks that could be promoted. This was very wasteful in terms of memory and compile time because regardless of how complex the alloca or how much we're able to do in breaking it up, all of the datastructure work to analyze the partitioning was done up front. The new implementation attempts to form partitions of the alloca lazily and on the fly, rewriting the uses that make up that partition as it goes. This has a few significant effects: 1) Much simpler data structures are used throughout. 2) No more double walk of the recursive use graph of the alloca, only walk it once. 3) No more complex algorithms for associating a particular use with a particular partition. 4) PHI and Select speculation is simplified and happens lazily. 5) More precise information is available about a specific use of the alloca, removing the need for some side datastructures. Ultimately, I think this is a much better implementation. It removes about 300 lines of code, but arguably removes more like 500 considering that some code grew in the process of being factored apart and cleaned up for this all to work. I've re-used as much of the old implementation as possible, which includes the lion's share of code in the form of the rewriting logic. The interesting new logic centers around how the uses of a partition are sorted, and split into actual partitions. Each instruction using a pointer derived from the alloca gets a 'Partition' entry. This name is totally wrong, but I'll do a rename in a follow-up commit as there is already enough churn here. The entry describes the offset range accessed and the nature of the access. Once we have all of these entries we sort them in a very specific way: increasing order of begin offset, followed by whether they are splittable uses (memcpy, etc), followed by the end offset or whatever. Sorting by splittability is important as it simplifies the collection of uses into a partition. Once we have these uses sorted, we walk from the beginning to the end building up a range of uses that form a partition of the alloca. Overlapping unsplittable uses are merged into a single partition while splittable uses are broken apart and carried from one partition to the next. A partition is also introduced to bridge splittable uses between the unsplittable regions when necessary. I've looked at the performance PRs fairly closely. PR15471 no longer will even load (the module is invalid). Not sure what is up there. PR15412 improves by between 5% and 10%, however it is nearly impossible to know what is holding it up as SROA (the entire pass) takes less time than reading the IR for that test case. The analysis takes the same time as running mem2reg on the final allocas. I suspect (without much evidence) that the new implementation will scale much better however, and it is just the small nature of the test cases that makes the changes small and noisy. Either way, it is still simpler and cleaner I think. llvm-svn: 186316	2013-07-15 10:30:19 +00:00
Alexey Samsonov	1a98450469	DebugInfo: Factor out parsing compile unit DIEs to a separate function. Improve code style and comments. No functionality change. llvm-svn: 186315	2013-07-15 08:43:35 +00:00
Craig Topper	06b3b6651e	Add 'const' qualifier to some arrays. llvm-svn: 186312	2013-07-15 08:02:13 +00:00
Craig Topper	e952ad0bc1	Make some arrays 'static const' llvm-svn: 186311	2013-07-15 07:22:00 +00:00
Craig Topper	f18edae094	Add include to hopefully fix windows build. llvm-svn: 186310	2013-07-15 07:15:05 +00:00
Craig Topper	de1f151115	Add const qualifier to some static arrays. llvm-svn: 186309	2013-07-15 07:02:45 +00:00
Craig Topper	202fbc2c9b	Add 'static' keyword to some const arrays for consistency. llvm-svn: 186308	2013-07-15 06:54:12 +00:00
Craig Topper	0afd0ab749	Make some arrays 'static const' llvm-svn: 186307	2013-07-15 06:39:13 +00:00
Craig Topper	26b45c27f1	Revert part of 186302 to fix buildbots. llvm-svn: 186303	2013-07-15 04:37:54 +00:00
Craig Topper	5871321e49	Use llvm::array_lengthof to replace sizeof(array)/sizeof(array[0]). llvm-svn: 186301	2013-07-15 04:27:47 +00:00
NAKAMURA Takumi	7a12961bba	Mark llvm/test/Object/extract.ll as XFAIL:mingw32, for now. FIXME: Investigate Win32's TimeValue stuff! llvm-svn: 186298	2013-07-15 03:04:13 +00:00
Eric Christopher	7980b957cc	Clarify comments. llvm-svn: 186297	2013-07-14 22:23:54 +00:00
Eric Christopher	8e46e7f04b	Add DW_AT_GNU_odr_signature to the set of dwarf attributes. llvm-svn: 186296	2013-07-14 22:02:31 +00:00
Eric Christopher	666dc635c7	Collapse temporary variable into call. llvm-svn: 186295	2013-07-14 21:46:51 +00:00
Anton Korobeynikov	5714237ca5	Use conventional syntax for branches. Patch by Job! llvm-svn: 186291	2013-07-14 18:19:44 +00:00
Stephen Lin	a6e877fab4	Correct inaccurate statement in FileCheck docs. llvm-svn: 186290	2013-07-14 18:12:25 +00:00
Anton Korobeynikov	fee796d734	Properly lower jump tables on MSP430. Patch by Job Noorman! llvm-svn: 186283	2013-07-14 15:11:00 +00:00
Chandler Carruth	ba310c4bde	The archive update test has a subtle race condition in it: if the test is executed within the same second as the inputs for the test are checked out from the source tree, it will fail to update due to being below the resolution of the 'mtime' test used. Now, this may seem improbably to you... ok, maybe really improbable, but consider a system which does distributed execution of tests by shipping their inputs to another machine and runs them. That might cause the mtime to be quite recent during the test run. ;] Instead, create two files directly in the test (allowing all platforms to see the problem) and add either a use of the 'touch' command that forces one mtime to some time quite a bit in the past, or it sleeps for just over a second to be outside of the precision window. llvm-svn: 186282	2013-07-14 10:46:51 +00:00
Stephen Lin	d24ab20e9b	Mass update to CodeGen tests to use CHECK-LABEL for labels corresponding to function definitions for more informative error messages. No functionality change and all updated tests passed locally. This update was done with the following bash script: find test/CodeGen -name ".ll" \| \ while read NAME; do echo "$NAME" if ! grep -q "^; RUN: llc.debug" $NAME; then TEMP=`mktemp -t temp` cp $NAME $TEMP sed -n "s/^define [^@]@$[A-Za-z0-9_]$(.$/\1/p" < $NAME \| \ while read FUNC; do sed -i '' "s/;$.$$[A-Za-z0-9_-]$:$ $$FUNC: \$/;\1\2-LABEL:\3$FUNC:/g" $TEMP done sed -i '' "s/;$.$-LABEL-LABEL:/;\1-LABEL:/" $TEMP sed -i '' "s/;$.$-NEXT-LABEL:/;\1-NEXT:/" $TEMP sed -i '' "s/;$.$-NOT-LABEL:/;\1-NOT:/" $TEMP sed -i '' "s/;$.*$-DAG-LABEL:/;\1-DAG:/" $TEMP mv $TEMP $NAME fi done llvm-svn: 186280	2013-07-14 06:24:09 +00:00
Nadav Rotem	d9f3f4548e	SLPVectorizer: change the order in which we search for vectorization candidates. Do stores first and PHIs second. llvm-svn: 186277	2013-07-14 06:15:46 +00:00
Tobias Grosser	84f34be98e	Fix build by replacing '>>' with '> >' llvm-svn: 186276	2013-07-14 06:12:01 +00:00
Craig Topper	b94011fd28	Use SmallVectorImpl& instead of SmallVector to avoid repeating small vector size. llvm-svn: 186274	2013-07-14 04:42:23 +00:00
Andrew Trick	aa8ceba833	Remove a bunch of old SCEVExpander FIXME's for preserving NoWrap. The great thing about the SCEVAddRec No-Wrap flag (unlike nsw/nuw) is that is can be preserved while normalizing (reassociating and factoring). The bad thing is that is can't be tranfered back to IR, which is one of the reasons I don't like the concept of SCEVExpander. Sorry, I can't think of a direct way to test this, which is why these were FIXMEs for so long. I just think it's a good time to finally clean it up. llvm-svn: 186273	2013-07-14 03:10:08 +00:00
Andrew Trick	8eaae28693	Teach indvars to generate nsw/nuw flags when widening an induction variable. Fixes PR16600. llvm-svn: 186272	2013-07-14 02:50:07 +00:00
Stephen Lin	c89a0ececb	Fixup to r186268 and r186269: don't append -LABEL to CHECK-NOT. No functionality change. llvm-svn: 186271	2013-07-14 02:10:57 +00:00
Stephen Lin	a76289aa1b	Catch more CHECK that can be converted to CHECK-LABEL in Transforms for easier debugging. No functionality change. This conversion was done with the following bash script: find test/Transforms -name ".ll" \| \ while read NAME; do echo "$NAME" if ! grep -q "^; RUN: llc" $NAME; then TEMP=`mktemp -t temp` cp $NAME $TEMP sed -n "s/^define [^@]@$[A-Za-z0-9_]$(.$/\1/p" < $NAME \| \ while read FUNC; do sed -i '' "s/;$.$$[A-Za-z0-9_]$:$ $define$[^@]$@$FUNC$[( ]*$\$/;\1\2-LABEL:\3define\4@$FUNC(/g" $TEMP done mv $TEMP $NAME fi done llvm-svn: 186269	2013-07-14 01:50:49 +00:00
Stephen Lin	c1c7a1309c	Update Transforms tests to use CHECK-LABEL for easier debugging. No functionality change. This update was done with the following bash script: find test/Transforms -name ".ll" \| \ while read NAME; do echo "$NAME" if ! grep -q "^; RUN: llc" $NAME; then TEMP=`mktemp -t temp` cp $NAME $TEMP sed -n "s/^define [^@]@$[A-Za-z0-9_]$(.$/\1/p" < $NAME \| \ while read FUNC; do sed -i '' "s/;$.$$[A-Za-z0-9_]$:$ $@$FUNC$[( ]$\$/;\1\2-LABEL:\3@$FUNC(/g" $TEMP done mv $TEMP $NAME fi done llvm-svn: 186268	2013-07-14 01:42:54 +00:00
Stephen Lin	2e105ff8b7	Modify two Transforms tests to explicitly check for full function names in some cases, rather than just a common prefix. No functionality change. (This is to avoid confusing a scripted mass update of these tests to use CHECK-LABEL) llvm-svn: 186267	2013-07-14 01:38:19 +00:00
Stephen Lin	552c915e84	Convert Windows to Unix line endings, no functionality change. llvm-svn: 186264	2013-07-13 22:08:55 +00:00
Stephen Lin	6dd347b39f	Add newlines at end of test files, no functionality change llvm-svn: 186263	2013-07-13 22:00:58 +00:00
Stephen Lin	f799e3f944	Convert CodeGen//.ll tests to use the new CHECK-LABEL for easier debugging. No functionality change and all tests pass after conversion. This was done with the following sed invocation to catch label lines demarking function boundaries: sed -i '' "s/^;$ $$[A-Z0-9_]$:$ $test$[A-Za-z0-9_-]$:$ $$/;\1\2-LABEL:\3test\4:\5/g" test/CodeGen//*.ll which was written conservatively to avoid false positives rather than false negatives. I scanned through all the changes and everything looks correct. llvm-svn: 186258	2013-07-13 20:38:47 +00:00
Arnold Schwaighofer	a92eeebde8	LoopVectorizer: Disallow reductions whose header phi is used outside the loop If an outside loop user of the reduction value uses the header phi node we cannot just reduce the vectorized phi value in the vector code epilog because we would loose VF-1 reductions. lp: p = phi (0, lv) lv = lv + 1 ... brcond , lp, outside outside: usr = add 0, p (Say the loop iterates two times, the value of p coming out of the loop is one). We cannot just transform this to: vlp: p = phi (<0,0>, lv) lv = lv + <1,1> .. brcond , lp, outside outside: p_reduced = p[0] + [1]; usr = add 0, p_reduced (Because the original loop iterated two times the vectorized loop would iterate one time, but p_reduced ends up being zero instead of one). We would have to execute VF-1 iterations in the scalar remainder loop in such cases. For now, just disable vectorization. PR16522 llvm-svn: 186256	2013-07-13 19:09:29 +00:00
Joerg Sonnenberger	8e01ae895d	Reduce large list of macros to the primary platform macros. Distingiush between ELF (Linux, FreeBSD, NetBSD) and OSX as platform for the assembler dialect. llvm-svn: 186252	2013-07-13 17:59:55 +00:00
Benjamin Kramer	e7d26f9b49	Convert a couple of grep tests to FileCheck. llvm-svn: 186250	2013-07-13 17:30:25 +00:00
Benjamin Kramer	c74fcc9972	Only verify the length in archive test, we can't make assumptions on the spacing. And .* did just match about anything anyways. llvm-svn: 186246	2013-07-13 15:21:39 +00:00
Rafael Espindola	1a08ba0eb6	Attempt at fixing a mingw bot. It is failing with YAMLTest.cpp:38: instantiated from here YAMLTraits.h:226: error: 'llvm::yaml::MappingTraits<<unnamed>::BinaryHolder>::mapping' is not a valid template argument for type 'void (*)(llvm::yaml::IO&, <unnamed>::BinaryHolder&)' because function 'static void llvm::yaml::MappingTraits<<unnamed>::BinaryHolder>::mapping(llvm::yaml::IO&, <unnamed>::BinaryHolder&)' has not external linkage llvm-svn: 186245	2013-07-13 12:36:30 +00:00
Craig Topper	3964367ceb	Remove unneeded forward declarations. llvm-svn: 186244	2013-07-13 08:28:45 +00:00
Craig Topper	e0b711864c	Pass SmallVector by const reference instead of by value. llvm-svn: 186243	2013-07-13 07:43:40 +00:00
Andrew Trick	960dee381d	Make the new vectorizer test immune to TTI llvm-svn: 186242	2013-07-13 06:40:33 +00:00
Andrew Trick	0ae8c94f8f	LoopVectorize fix: LoopInfo must be valid when invoking utils like SCEVExpander. In general, one should always complete CFG modifications first, update CFG-based analyses, like Dominatores and LoopInfo, then generate instruction sequences. LoopVectorizer was creating a new loop, calling SCEVExpander to generate checks, then updating LoopInfo. I just changed the order. llvm-svn: 186241	2013-07-13 06:20:06 +00:00
Rafael Espindola	07025fe5e9	Try to open the file before use data from stat. Looks like on mingw we get bogus last modification times on directories. Should fix the mingw bots. llvm-svn: 186240	2013-07-13 05:07:22 +00:00
Rafael Espindola	a19899ac42	Remove unused file. Thanks to Sean Silva for noticing it. llvm-svn: 186239	2013-07-13 04:24:33 +00:00
Rafael Espindola	0aac01b2f6	Add r186216 back, but make the test tolerant of different uids and gids. original message: Fix a off by one error about which members need to use the string table. llvm-svn: 186238	2013-07-13 04:14:13 +00:00
Nick Lewycky	7459be6dc7	Add a microoptimization for urem. llvm-svn: 186235	2013-07-13 01:16:47 +00:00
Chandler Carruth	86e60a36b5	Revert commit r186217 -- this is breaking bots: http://lab.llvm.org:8013/builders/clang-x86_64-darwin11-nobootstrap-RAincremental/builds/4328 Original commit log: Use the function attributes to pass along the stack protector buffer size. llvm-svn: 186234	2013-07-13 01:00:17 +00:00
Chandler Carruth	fa74085f60	Revert commit r186216 -- it's breaking bots: http://lab.llvm.org:8011/builders/clang-x86_64-debian-fast/builds/6897/steps/check-all/logs/LLVM%3A%3Aarchive-format.test Original commit log: Fix a off by one error about which members need to use the string table. llvm-svn: 186232	2013-07-13 00:42:56 +00:00
Akira Hatanaka	f2826aacf9	[mips] Remove trailing whitespace. llvm-svn: 186230	2013-07-12 23:47:38 +00:00
Nick Lewycky	35aeea993b	Fix logic error optimizing "icmp pred (urem X, Y), Y" where pred is signed. Fixes PR16605. llvm-svn: 186229	2013-07-12 23:42:57 +00:00
Akira Hatanaka	66bc419366	[mips] Implement MipsTargetMachine::getInstrItineraryData(). llvm-svn: 186227	2013-07-12 23:33:22 +00:00
JF Bastien	583db65031	Fix ARM paired GPR COPY lowering ARM paired GPR COPY was being lowered to two MOVr without CC. This patch puts the CC back. My test is a reduction of the case where I encountered the issue, 64-bit atomics use paired GPRs. The issue only occurs with selectionDAG, FastISel doesn't encounter it so I didn't bother calling it. llvm-svn: 186226	2013-07-12 23:33:03 +00:00
Michael Gottesman	44ccf3ebd2	Fixed 80+ violation and added C++ to header. llvm-svn: 186225	2013-07-12 23:09:43 +00:00
Joey Gouly	a3250f22c2	Fix a crash in EvaluateInDifferentElementOrder where it would generate an undef vector of the wrong type. LGTM'd by Nick Lewycky on IRC. llvm-svn: 186224	2013-07-12 23:08:06 +00:00
Akira Hatanaka	1baf2ea2d1	[mips] Add instruction itinerary classes for mult, seb and slt instructions. llvm-svn: 186222	2013-07-12 22:43:20 +00:00
Bill Wendling	4f73ff4711	Use the function attributes to pass along the stack protector buffer size. Now that we have robust function attributes, don't use a command line option to specify the stack protecto buffer size. llvm-svn: 186217	2013-07-12 22:25:20 +00:00
Rafael Espindola	bc63134afd	Fix a off by one error about which members need to use the string table. llvm-svn: 186216	2013-07-12 22:22:34 +00:00
Andrew Trick	a1e4118a46	LFTR improvement to avoid truncation. This is a reimplemntation of the patch originally in r186107. llvm-svn: 186215	2013-07-12 22:08:48 +00:00
Andrew Trick	2b71848ffe	Cleanup LFTR logic. llvm-svn: 186214	2013-07-12 22:08:44 +00:00
Andrew Trick	466555e50d	Cleanup: rename a variable to make the logic easier to follow. llvm-svn: 186213	2013-07-12 22:08:41 +00:00
Eric Christopher	3931cf94ac	Remove extraneous braces. llvm-svn: 186212	2013-07-12 22:08:24 +00:00
Benjamin Kramer	e07b7a9f02	R600: Reapply testcase from r186178, the big endian issue should be fixed by r186196. llvm-svn: 186209	2013-07-12 21:54:43 +00:00
Rafael Espindola	04397c1e9c	Change archive-update.test to create a new file on the fly. llvm-svn: 186206	2013-07-12 21:17:17 +00:00
Rafael Espindola	554eaad04e	Rename directory to avoid problems on windows. llvm-svn: 186202	2013-07-12 20:53:23 +00:00
Rafael Espindola	c61cab8b02	fix autoconf build llvm-svn: 186200	2013-07-12 20:45:01 +00:00
Rafael Espindola	023e65611d	Fix the build with c++03. llvm-svn: 186198	2013-07-12 20:28:02 +00:00
Rafael Espindola	3e2b21cd4d	Change llvm-ar to use lib/Object. This fixes two bugs is lib/Object that the use in llvm-ar found: * In OS X created archives, the name can be padded with nulls. Strip them. * In the constructor, remember the first non special member and use that in begin_children. This makes sure we skip all special members, not just the first one. The change to llvm-ar itself consist of * Using lib/Object for reading archives instead of ArchiveReader.cpp. * Writing the modified archive directly, instead of creating an in memory representation. The old Archive library was way more general than what is needed, as can be seen by the diffstat of this patch. Having llvm-ar using lib/Object now opens the way for creating regular symbol tables for both native objects and bitcode files so that we can use those archives for LTO. llvm-svn: 186197	2013-07-12 20:21:39 +00:00
Benjamin Kramer	c22c790f89	R600: Remove unsafe type punning. No intended functionality change. llvm-svn: 186196	2013-07-12 20:18:05 +00:00
Rafael Espindola	9cef215724	Add a test for llvm-ar's u option. llvm-svn: 186192	2013-07-12 19:34:24 +00:00
Tom Stellard	6547fbee03	R600: Remove the fpconst64.ll test which was failing on non-x86 buildbots I'm guessing the failure had something to do with the double precision floating point constant used in the test. llvm-svn: 186191	2013-07-12 19:29:54 +00:00
Arnold Schwaighofer	6042a261b8	X86 cost model: Add cost for vectorized gather/scather radar://14351991 llvm-svn: 186189	2013-07-12 19:16:07 +00:00
Arnold Schwaighofer	da2b311865	ARM cost model: Add cost for gather/scather Fixes a 35% degradation compared to unvectorized code in MiBench/automotive-susan and an equally serious regression on a private image processing benchmark. radar://14351991 llvm-svn: 186188	2013-07-12 19:16:04 +00:00
Arnold Schwaighofer	9da9a43af8	TargetTransformInfo: address calculation parameter for gather/scather Address calculation for gather/scather in vectorized code can incur a significant cost making vectorization unbeneficial. Add infrastructure to add cost. Tests and cost model for targets will be in follow-up commits. radar://14351991 llvm-svn: 186187	2013-07-12 19:16:02 +00:00
Rafael Espindola	c5acd1ca84	Relax the test a bit more to handle different UIDs and GIDs. llvm-svn: 186186	2013-07-12 19:13:16 +00:00
Rafael Espindola	5f1ef3c8aa	Relax test a bit to handle umask differences. llvm-svn: 186184	2013-07-12 18:54:28 +00:00
Rafael Espindola	3e7249fb15	Add a test for the 'o' option in llvm-ar. llvm-svn: 186183	2013-07-12 18:51:25 +00:00
Tom Stellard	ccae60acc3	R600/SI: Add support for f64 kernel arguments Patch by: Niels Ole Salscheider Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186182	2013-07-12 18:15:26 +00:00
Tom Stellard	4e1100ab75	R600/SI: Implement select and compares for SI Patch by: Niels Ole Salscheider Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186181	2013-07-12 18:15:19 +00:00
Tom Stellard	8ed7b45da3	R600/SI: Add fsqrt pattern for SI Patch by: Niels Ole Salscheider Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186180	2013-07-12 18:15:13 +00:00
Tom Stellard	2a6a610516	R600/SI: Add double precision fsub pattern for SI Patch by: Niels Ole Salscheider Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186179	2013-07-12 18:15:08 +00:00
Tom Stellard	ab8a8c84d4	R600/SI: SI support for 64bit ConstantFP Patch by: Niels Ole Salscheider Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186178	2013-07-12 18:15:02 +00:00
Tom Stellard	7512c0803c	R600/SI: Add initial double precision support for SI Patch by: Niels Ole Salscheider Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186177	2013-07-12 18:14:56 +00:00
Tom Stellard	2d17f67651	R600: Add ISA documents to the CompilerWriterInfo page llvm-svn: 186176	2013-07-12 18:14:40 +00:00
Michael Gottesman	fd7f897426	Fixed comment in header of Block Frequency Impl and added text for C++ mode. This is a generic block implementation that works on more than machine blocks. The C++ mode addition is a bonus due to the extra space provided. llvm-svn: 186175	2013-07-12 18:11:14 +00:00
Benjamin Kramer	068a2253e9	X86: Shrink certain forms of movsx. In particular: movsbw %al, %ax --> cbtw movswl %ax, %eax --> cwtl movslq %eax, %rax --> cltq According to Intel's manual those have the same performance characteristics but come with a smaller encoding. llvm-svn: 186174	2013-07-12 18:06:44 +00:00
Rafael Espindola	0557153138	Add static. llvm-svn: 186170	2013-07-12 16:29:27 +00:00
Stephen Lin	fda967fdea	X86: fold SSE2/AVX2 logical shift by immediate amount into zero vector when possible Patch by Andrea Di Biagio llvm-svn: 186165	2013-07-12 15:31:36 +00:00
Stephen Lin	764d8d3d6f	Start using CHECK-LABEL in some tests. llvm-svn: 186163	2013-07-12 14:54:12 +00:00
Stephen Lin	f8bd2e5b86	Add new directive called CHECK-LABEL to FileCheck. CHECK-LABEL is meant to be used in place on CHECK on lines containing identifiers or other unique labels (they need not actually be labels in the source or output language, though.) This is used to break up the input stream into separate blocks delineated by CHECK-LABEL lines, each of which is checked independently. This greatly improves the accuracy of errors and fix-it hints in many cases, and allows for FileCheck to recover from errors in one block by continuing to subsequent blocks. Some tests will be converted to use this new directive in forthcoming patches. llvm-svn: 186162	2013-07-12 14:51:05 +00:00
Rafael Espindola	f0c617264a	Don't reject an empty archive. llvm-svn: 186159	2013-07-12 13:32:28 +00:00
Benjamin Kramer	5dc99f1f91	Mark MDNode::getOperand as readonly. We can't inline it but we can still CSE calls to it. llvm-svn: 186156	2013-07-12 12:05:13 +00:00
Chandler Carruth	cf3715cadd	Revert "indvars: Improve LFTR by eliminating truncation when comparing against a constant." This reverts commit r186107. It didn't handle wrapping arithmetic in the loop correctly and thus caused the following C program to count from 0 to UINT64_MAX instead of from 0 to 255 as intended: #include <stdio.h> int main() { unsigned char first = 0, last = 255; do { printf("%d\n", first); } while (first++ != last); } Full test case and instructions to reproduce with just the -indvars pass sent to the original review thread rather than to r186107's commit. llvm-svn: 186152	2013-07-12 11:18:55 +00:00
Vladimir Medic	bcf1ca08e0	Add support for Mips break and syscall insructions. The corresponding test cases are added. llvm-svn: 186151	2013-07-12 09:25:35 +00:00
Richard Sandiford	17276d3567	[SystemZ] Add test missing from r186148 Sigh, twice in two days sorry. One day I'll remember... llvm-svn: 186150	2013-07-12 09:20:14 +00:00
Richard Sandiford	6d4bd28322	[SystemZ] Optimize sign-extends of vector setccs Normal (sext (setcc ...)) sequences are optimised into (select_cc ..., -1, 0) by DAGCombiner::visitSIGN_EXTEND. However, this is deliberately not done for vectors, and after vector type legalization we have (sext_inreg (setcc ...)) instead. I wondered about trying to extend DAGCombiner to handle this case too, but it seemed to be a loss on some other targets I tried, even those for which SETCC isn't "legal" and SELECT_CC is. llvm-svn: 186149	2013-07-12 09:17:10 +00:00
Richard Sandiford	b820405b59	[SystemZ] Fix parsing of inline asm registers GPR and FPR constraints like "{r2}" and "{f2}" weren't handled correctly because the name-to-regno mapping depends on the value type and (because of that) the internal names in RegStrings are not the same as the AsmName. CC constraints like "{cc}" didn't work either because there was no associated register class. llvm-svn: 186148	2013-07-12 09:08:12 +00:00
Richard Sandiford	3f0edc2903	[SystemZ] Improve spilling of LGDR and LDGR If the source of these instructions is spilled we should load the destination. If the destination is spilled we should store the source. llvm-svn: 186147	2013-07-12 08:37:17 +00:00
Shuxin Yang	23773b34c6	Stylistic change. Thank Nick for figuring out these problems. llvm-svn: 186146	2013-07-12 07:25:38 +00:00
Nadav Rotem	89c41bf06a	SLPVectorizer: Sink and enable CSE for ExtractElements. llvm-svn: 186145	2013-07-12 06:09:24 +00:00
Charles Davis	e8f297ca94	Target/X86: Add explicit Win64 and System V/x86-64 calling conventions. Summary: This patch adds explicit calling convention types for the Win64 and System V/x86-64 ABIs. This allows code to override the default, and use the Win64 convention on a target that wants to use SysV (and vice-versa). This is needed to implement the `ms_abi` and `sysv_abi` GNU attributes. Reviewers: CC: llvm-svn: 186144	2013-07-12 06:02:35 +00:00
NAKAMURA Takumi	aaaec3db98	llvm/test/Object/archive-toc.test: Use env(1) to satisfy win32 hosts. llvm-svn: 186143	2013-07-12 02:34:45 +00:00
NAKAMURA Takumi	40bd28a7df	Windows/TimeValue.inc: Mute prefixed '0' on %d to emulate %e. It fixes compatibility in llvm/test/Object/archive-toc.test. llvm-svn: 186142	2013-07-12 02:13:03 +00:00
Manman Ren	30d6865a23	PEI: refactor replaceFrameIndices(MF) to call replaceFrameIndices(BB). replaceFrameIndices(MF) will iterate over the BBs and call replaceFrameIndices(BB). No functionality change. llvm-svn: 186141	2013-07-12 00:37:01 +00:00
Nadav Rotem	fa3c2db211	SLPVectorize: Replace the code that checks for vectorization candidates in successor blocks with code that scans PHINodes. Before we could vectorize PHINodes scanning successors was a good way of finding candidates. Now we can vectorize the phinodes which is simpler. llvm-svn: 186139	2013-07-12 00:04:18 +00:00
David Dean	f3ed656189	Add the ability to use guarded malloc when running llvm lit tests. llvm-svn: 186134	2013-07-11 23:36:57 +00:00
Benjamin Kramer	64caeb7cd4	llvm-ar: Clean up memory management with OwningPtr. llvm-svn: 186131	2013-07-11 23:15:05 +00:00
Benjamin Kramer	1c7fcccb49	Sync SmallBitVector with BitVector. Add unit tests for the missing methods. llvm-svn: 186123	2013-07-11 21:59:16 +00:00
Michael Gottesman	b1c7b58424	Fixed up comments in TargetLowering.h to conform to the LLVM Style Guide. llvm-svn: 186121	2013-07-11 21:38:33 +00:00
Adrian Prantl	29b3fdc8c2	In response to dblaikie's comment on r186035, replacing the (reduced LLVM IR) + (full source in comment) with the (full LLVM IR) + (reduced src in comment) llvm-svn: 186119	2013-07-11 21:16:14 +00:00
Rafael Espindola	dee53e76f6	Add tests for the before and after modifiers. llvm-svn: 186118	2013-07-11 21:11:55 +00:00
Nadav Rotem	db06b139fd	Remove an argument that we dont use anymore. llvm-svn: 186116	2013-07-11 20:56:13 +00:00
Rafael Espindola	ed0f6468b8	Use %llu to print a 64 bit number. Should fix the ARM bots. llvm-svn: 186113	2013-07-11 20:01:30 +00:00
Rafael Espindola	621ca94358	Add a test for llvm-ar's m operation. llvm-svn: 186110	2013-07-11 19:09:04 +00:00
Hal Finkel	4715081787	PPC: Add some missing V_SET0 patterns We had patterns to match v4i32 immAllZerosV -> V_SET0, but not patterns for v8i16 (which occurs in the test case) or v16i8. The same was true for V_SETALLONES (so I added the associated patterns for those as well). Another bug found by llvm-stress. llvm-svn: 186108	2013-07-11 17:43:32 +00:00
Andrew Trick	3095993d6f	indvars: Improve LFTR by eliminating truncation when comparing against a constant. Patch by Michele Scandale! Adds a special handling of the case where, during the loop exit condition rewriting, the exit value is a constant of bitwidth lower than the type of the induction variable: instead of introducing a trunc operation in order to match correctly the operand types, it allows to convert the constant value to an equivalent constant, depending on the initial value of the induction variable and the trip count, in order have an equivalent comparison between the induction variable and the new constant. llvm-svn: 186107	2013-07-11 17:08:59 +00:00
Hal Finkel	ff3ea8060c	PPCDAGToDAGISel::isRunOfOnes should return false on zero This fixes a bug (found by csmith) at -O0 where we attempt to create a RLWIMI with an out-of-range operand. Most uses of the isRunOfOnes function are guarded by a condition that the value is not zero. This was not true in two places, and in both places a zero input would result in an out-of-rage MB value (= 32). To fix this, isRunOfOnes returns false on a zero input (and I've remove one now-redundant guard). llvm-svn: 186101	2013-07-11 16:31:51 +00:00
Craig Topper	2cd5ff8003	Use SmallVectorImpl& instead of SmallVector to avoid repeating small vector size. llvm-svn: 186098	2013-07-11 16:22:38 +00:00
Rafael Espindola	1761824d72	Add back code for supporting old mingw versions. Should bring the bots back. llvm-svn: 186096	2013-07-11 16:11:21 +00:00
Benjamin Kramer	fc3ea6f4bc	Don't use a potentially expensive shift if all we want is one set bit. No functionality change. llvm-svn: 186095	2013-07-11 16:05:50 +00:00
Rafael Espindola	ce2c84e670	InsertBefore is the same as AddBefore. Delete it. llvm-svn: 186094	2013-07-11 15:54:53 +00:00
Rafael Espindola	bce399216c	Looks like some versions of mingw don't have errno_t. Use int. llvm-svn: 186092	2013-07-11 15:47:04 +00:00
Benjamin Kramer	cce97be70b	Use move semantics if possible to construct ConstantRanges. Arithmetic on ConstantRanges creates a lot of large temporary APInts that benefit from move semantics. llvm-svn: 186091	2013-07-11 15:37:27 +00:00
Rafael Espindola	b1c1c5f377	Fix a FIXME about the format and add a test. While at it, use strftime on Unix too and use the thread safe versions of localtime. llvm-svn: 186090	2013-07-11 15:35:23 +00:00
Arnold Schwaighofer	e97c71b8fd	LoopVectorize: Vectorize all accesses in address space zero with unit stride We can vectorize them because in the case where we wrap in the address space the unvectorized code would have had to access a pointer value of zero which is undefined behavior in address space zero according to the LLVM IR semantics. (Thank you Duncan, for pointing this out to me). Fixes PR16592. llvm-svn: 186088	2013-07-11 15:21:55 +00:00
Rafael Espindola	a7f6913c08	Merge these tests. llvm-svn: 186084	2013-07-11 13:44:10 +00:00
Rafael Espindola	70a765dc47	Use a more unique name to avoid conflicting with directory.ll tests when running in parallel. llvm-svn: 186083	2013-07-11 13:31:38 +00:00
Rafael Espindola	0ec47c801d	Add a test for llvm-ar's 'd' operation. llvm-svn: 186082	2013-07-11 13:24:27 +00:00
Rafael Espindola	54dbca5eeb	Add tests for the 'x' operation. llvm-svn: 186081	2013-07-11 13:13:09 +00:00
Rafael Espindola	b1d026890a	Remove the 'N' modifier from llvm-ar. * It is not present on OS X. * It is untested. * It is not needed for using ar in a build system. llvm-svn: 186080	2013-07-11 13:03:27 +00:00
Rafael Espindola	e132160e7e	Delete dead code. llvm-svn: 186079	2013-07-11 12:54:11 +00:00
Rafael Espindola	4d7b3be00b	Remove support for truncating names in archives. * All systems we support have some form of long name support. * The options has different names and semantics in different implementations ('f' on gnu, 'T' on OS X), which makes it unlikely it is normally used on build systems. * It was completely untested. llvm-svn: 186078	2013-07-11 12:38:02 +00:00
Rafael Espindola	69278e9ac8	Sync llvm-ar's help string with the options it supports. llvm-svn: 186076	2013-07-11 12:28:36 +00:00
Benjamin Kramer	741146b825	Reduce the number of indirections in the attributes implementation. - Coallocate entires for AttributeSetImpls and Nodes after the class itself. - Remove mutable iterators from immutable classes. - Remove unused context field from AttributeImpl. - Derive Enum/Align/String attribute implementations from AttributeImpl instead of having a whole new inheritance tree for them. - Derive AlignAttributeImpl from EnumAttributeImpl. llvm-svn: 186075	2013-07-11 12:13:16 +00:00
Richard Sandiford	4209e7f6c6	[SystemZ] Add testcase missing from r186073 llvm-svn: 186074	2013-07-11 09:10:38 +00:00
Richard Sandiford	ea9b6aa20b	[SystemZ] Use zeroing form of RISBG for shift-and-AND sequences Extend r186072 to handle shifts and ANDs. llvm-svn: 186073	2013-07-11 09:10:09 +00:00
Richard Sandiford	84f54a3bc9	[SystemZ] Use zeroing form of RISBG for some AND sequences RISBG can handle some ANDs for which no AND IMMEDIATE exists. It also acts as a three-operand AND for some cases where an AND IMMEDIATE could be used instead. It might be worth adding a pass to replace RISBG with AND IMMEDIATE in cases where the register operands end up being the same and where AND IMMEDIATE is smaller. llvm-svn: 186072	2013-07-11 08:59:12 +00:00
Richard Sandiford	67ddcd6dd0	[SystemZ] Allow 8-bit operands to RISBG RISBG has three 8-bit operands (I3, I4 and I5). I'd originally restricted all three to 6 bits, since that's the only range we intended to use at the time. However, the top bit of I4 acts as a "zero" flag for RISBG, while the top bit of I3 acts as a "test" flag for RNSBG & co. This patch therefore allows them to have the full 8-bit range. I've left the fifth operand as a 6-bit value for now since the upper 2 bits have no defined meaning. llvm-svn: 186070	2013-07-11 08:37:13 +00:00
Duncan Sands	e773c08021	TryToSimplifyUncondBranchFromEmptyBlock was checking that any common predecessors of the two blocks it is attempting to merge supply the same incoming values to any phi in the successor block. This change allows merging in the case where there is one or more incoming values that are undef. The undef values are rewritten to match the non-undef value that flows from the other edge. Patch by Mark Lacey. llvm-svn: 186069	2013-07-11 08:28:20 +00:00
Hal Finkel	6161b9405f	Initialize AsmPrinter::MF in the constructor MF is normally initialized in AsmPrinter::SetupMachineFunction, but if the file contains only globals (no functions), then we need this to be initialized because, when encountering an error, lowerConstant() references it. This should fix the non-deterministic failures of test/CodeGen/X86/nonconst-static-iv.ll, etc. llvm-svn: 186068	2013-07-11 06:41:14 +00:00
Hal Finkel	743b194084	RegScavenger should not exclude undef uses When computing currently-live registers, the register scavenger excludes undef uses. As a result, undef uses are ignored when computing the restore points of registers spilled into the emergency slots. While the register scavenger normally excludes from consideration, when scavenging, registers used by the current instruction, we need to not exclude undef uses. Otherwise, we might end up requiring more emergency spill slots than we have (in the case where the undef use is the currently-spilled register). Another bug found by llvm-stress. llvm-svn: 186067	2013-07-11 05:55:57 +00:00
Craig Topper	37039640e3	Fix indentation. No functional change. llvm-svn: 186065	2013-07-11 05:39:44 +00:00
Nadav Rotem	08efb262a9	Fix a warning. llvm-svn: 186064	2013-07-11 05:39:02 +00:00
Nadav Rotem	108ef760ff	Consolidate more lit tests. llvm-svn: 186063	2013-07-11 05:15:11 +00:00
Nadav Rotem	e0a49499fe	Consolidate some of the lit tests. llvm-svn: 186062	2013-07-11 05:11:33 +00:00
Nadav Rotem	c6b5e2499e	Consolidate some of the lit tests. llvm-svn: 186060	2013-07-11 05:01:50 +00:00
Nadav Rotem	b8dd66f655	SLPVectorizer: refactor the code that places extracts. Place the code that decides where to put extracts in the build-tree phase. This allows us to take the cost of the extracts into account. llvm-svn: 186058	2013-07-11 04:54:05 +00:00
Michael Gottesman	b40db26eae	Teach TailRecursionElimination to handle certain cases of nocapture escaping allocas. Without the changes introduced into this patch, if TRE saw any allocas at all, TRE would not perform TRE or mark callsites with the tail marker. Because TRE runs after mem2reg, this inadequacy is not a death sentence. But given a callsite A without escaping alloca argument, A may not be able to have the tail marker placed on it due to a separate callsite B having a write-back parameter passed in via an argument with the nocapture attribute. Assume that B is the only other callsite besides A and B only has nocapture escaping alloca arguments (NOTE B may have other arguments that are not passed allocas). In this case not marking A with the tail marker is unnecessarily conservative since: 1. By assumption A has no escaping alloca arguments itself so it can not access the caller's stack via its arguments. 2. Since all of B's escaping alloca arguments are passed as parameters with the nocapture attribute, we know that B does not stash said escaping allocas in a manner that outlives B itself and thus could be accessed indirectly by A. With the changes introduced by this patch: 1. If we see any escaping allocas passed as a capturing argument, we do nothing and bail early. 2. If we do not see any escaping allocas passed as captured arguments but we do see escaping allocas passed as nocapture arguments: i. We do not perform TRE to avoid PR962 since the code generator produces significantly worse code for the dynamic allocas that would be created by the TRE algorithm. ii. If we do not return twice, mark call sites without escaping allocas with the tail marker. NOTE This excludes functions with escaping nocapture allocas. 3. If we do not see any escaping allocas at all (whether captured or not): i. If we do not have usage of setjmp, mark all callsites with the tail marker. ii. If there are no dynamic/variable sized allocas in the function, attempt to perform TRE on all callsites in the function. Based off of a patch by Nick Lewycky. rdar://14324281. llvm-svn: 186057	2013-07-11 04:40:01 +00:00
Hal Finkel	94383e542b	Move r186044 tests into CodeGen/X86 I had thought that these tests could be target-neutral, but in practice this is not the case (on some targets, like Hexagon and Darwin), they trigger an assert (a different assert than the one that r186044 fixes). llvm-svn: 186051	2013-07-11 01:55:55 +00:00
Hal Finkel	a2aeb8e8e1	Set REQUIRES shell on the test cases for r186044 Trying to fix the i686-mingw32 build. llvm-svn: 186046	2013-07-10 23:25:03 +00:00
Hal Finkel	31ffcec999	XFAIL the test cases for r186044 on Hexagon For some reason, the Hexagon backend does not reject these invalid static initializer expressions, but instead crashes in AsmPrinter::EmitGlobalConstant. llvm-svn: 186045	2013-07-10 23:11:14 +00:00
Hal Finkel	b31366da82	Don't assert if we can't constant fold extract/insertvalue A non-constant-foldable static initializer expression containing insertvalue or extractvalue had been causing an assert: Constants.cpp:1971: Assertion `FC && "ExtractValue constant expr couldn't be folded!"' failed. Now we report a more-sensible "Unsupported expression in static initializer" error instead. Fixes PR15417. llvm-svn: 186044	2013-07-10 22:51:01 +00:00
Rafael Espindola	555aa899c6	Remove this test for now. It is not reliable to depend on the output of llvm_unreachable. The original change will have proper tests when llvm-ar moves to lib/Object (soon). llvm-svn: 186043	2013-07-10 22:15:29 +00:00
Hans Wennborg	0c14cf9a7c	CommandLine.rst: remove tiny bit of bad mark-up llvm-svn: 186042	2013-07-10 22:09:22 +00:00
Rafael Espindola	555099207b	Find the symbol table on archives created on OS X. llvm-svn: 186041	2013-07-10 22:07:59 +00:00
Rafael Espindola	3b5475c0f2	Move tests from test/Archive to test/Object. There is no lib/Archive anymore and some archive tests were in test/Archive and others in test/Object. Since archive is just one of the formats supported by lib/Object, test/Object is probably the best location. llvm-svn: 186038	2013-07-10 21:47:16 +00:00
Adrian Prantl	ef99752e69	Add a comment. llvm-svn: 186035	2013-07-10 21:08:02 +00:00
Tim Northover	a630fb0b67	Put ELF COMDAT relocations into the relevant COMDAT group. Patch from Игорь Пашев (I do hope we support utf-8 commit messages; I also hope he'll forgive me for transliterating it as Igor Pashev in case things go horribly wrong). llvm-svn: 186034	2013-07-10 20:58:17 +00:00
Stephen Lin	10947502e5	Remove trailing whitespac llvm-svn: 186032	2013-07-10 20:47:39 +00:00
Adrian Prantl	5a4c862a90	Add a testcase for r186014. llvm-svn: 186031	2013-07-10 20:43:29 +00:00
Rafael Espindola	fbcafc0793	Don't crash in 'llvm -s' when an archive has no symtab. llvm-svn: 186029	2013-07-10 20:14:22 +00:00
Reid Kleckner	755d324cd2	Fix %t typo in Ocaml bindings test. llvm-svn: 186027	2013-07-10 18:55:06 +00:00
Michael Gottesman	6eb95dc2f7	[objc-arc] Changed 'mode: c++' => 'C++' at Nick Lewycky's suggestion. Also removed unnecessary mode: c++ lines from .cpp files. llvm-svn: 186026	2013-07-10 18:49:00 +00:00
Michael Gottesman	e95df9fdb1	Changed "mode: c++" => "C++" at the suggestion of Nick Lewycky. llvm-svn: 186025	2013-07-10 18:40:49 +00:00
Benjamin Kramer	b34939aaf5	Update doxygen comment to match renamed parameters. Found by -Wdocumentation. llvm-svn: 186021	2013-07-10 18:01:16 +00:00
Rafael Espindola	fc3876118d	MemoryBuffer::getFile handles zero sized files, no need to duplicate the test. llvm-svn: 186018	2013-07-10 17:30:39 +00:00
Aaron Ballman	f04bbd8b7f	Replacing an empty switch with its moral equivalent. No functional changes intended. llvm-svn: 186017	2013-07-10 17:19:22 +00:00
Rafael Espindola	4d08d8bada	Use status to implement file_size. The status function is already using a syscall that returns the file size. Remember it and implement file_size as a simple wrapper. No functionally change, but clients that already use status now can avoid calling file_size. llvm-svn: 186016	2013-07-10 17:16:40 +00:00
Adrian Prantl	d3f6fe51ab	Use the appropriate unsigned int type for the offset. llvm-svn: 186015	2013-07-10 16:56:52 +00:00
Adrian Prantl	c31ec1c948	Safeguard DBG_VALUE handling. Unbreaks the ASAN buildbot. llvm-svn: 186014	2013-07-10 16:56:47 +00:00
Craig Topper	9ae4707868	Simplify code. llvm-svn: 186013	2013-07-10 16:38:35 +00:00
Michel Danzer	49812b5bbd	R600/SI: Initial local memory support Enough for the radeonsi driver to use it for calculating derivatives. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186012	2013-07-10 16:37:07 +00:00
Michel Danzer	1f87df365f	R600/SI: Add pattern for the AMDGPU.barrier.local intrinsic lit test coverage to follow in the next commit. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186011	2013-07-10 16:36:57 +00:00
Michel Danzer	8d69617b27	R600/SI: Add intrinsic for retrieving the current thread ID Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186010	2013-07-10 16:36:52 +00:00
Michel Danzer	1c45430e76	R600/SI: Initial support for LDS/GDS instructions Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186009	2013-07-10 16:36:43 +00:00
Michel Danzer	83f87c4c2e	R600/SI: Add intrinsics for texture sampling with user derivatives Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 186008	2013-07-10 16:36:36 +00:00
Argyrios Kyrtzidis	aafb84be9e	Remove llvm/ADT/NullablePtr.h, there are no uses of it in-tree. llvm-svn: 186006	2013-07-10 15:33:20 +00:00

... 3 4 5 6 7 ...

94120 Commits