llvm-project

Commit Graph

Author	SHA1	Message	Date
Nadav Rotem	cf0dcdc71c	When we vectorize across multiple basic blocks we may vectorize PHINodes that create a cycle. We already break the cycle on phi-nodes, but arithmetic operations are still uplicated. This patch adds code that checks if the operation that we are vectorizing was vectorized during the visit of the operands and uses this value if it can. llvm-svn: 186883	2013-07-22 22:18:07 +00:00
Jakub Staszak	cb132face0	OldPtr is llvm::Instruction. Remove unneeded cast<>. llvm-svn: 186880	2013-07-22 22:10:43 +00:00
Richard Trieu	46978d41e6	Silence gcc warning. llvm-svn: 186879	2013-07-22 21:29:28 +00:00
Kevin Enderby	285da02094	Fix the move to/from accumulator register instructions that use a full 64-bit absolute address encoded in the instruction. rdar://8612627 and rdar://14299221 llvm-svn: 186878	2013-07-22 21:25:31 +00:00
Jakub Staszak	6b36db08f3	Change tabs to spaces. llvm-svn: 186877	2013-07-22 21:11:30 +00:00
Michael Gottesman	c0659fad7f	[stackprotector] Changed isNoopBitcast/sameNoopInput to take TargetLoweringBase instead of TargetLowering. Both functions only use functionality from TargetLoweringBase. rdar://13935163 llvm-svn: 186874	2013-07-22 21:05:47 +00:00
Craig Topper	998bcf9534	Recommit r186813: More Intel syntax alias fixes. With the addition of suppressing some of the aliases from being emitted by the asm printer. llvm-svn: 186869	2013-07-22 20:46:37 +00:00
Michael Gottesman	a6188f9fcd	[stackprotector] Refactored ssp prologue creation code into its own helper function. No functionality change. rdar://13935163 llvm-svn: 186868	2013-07-22 20:44:11 +00:00
Manman Ren	14dd2a656e	Debug Info Finder: add processScope to actually handle the Scope. Instead of just adding the scope to the list, we actually handle the scope. llvm-svn: 186867	2013-07-22 20:28:53 +00:00
Bill Wendling	c02a0aabb5	Recommit r186217 with testcase fix: Use the function attributes to pass along the stack protector buffer size. Now that we have robust function attributes, don't use a command line option to specify the stack protecto buffer size. llvm-svn: 186863	2013-07-22 20:15:21 +00:00
Akira Hatanaka	4d2ea3c696	[mips] Fix MipsAsmParser::parseCCRRegs. Enable parsing all 32 floating point control registers $0-31 and stop trying to parse floating point condition code register $fcc0. Also, return ParseFail if the operand being parsed is not in the expected format. llvm-svn: 186861	2013-07-22 19:30:38 +00:00
Matt Arsenault	fb18323885	Fix spelling and grammar llvm-svn: 186858	2013-07-22 18:59:58 +00:00
Akira Hatanaka	44ff81d4e3	[mips] Use ADDu instead of OR to copy general purpose registers. Also, delete the InstAlias pattern which maps "move" to OR to resolve ambiguity in MatchTable. llvm-svn: 186855	2013-07-22 18:52:22 +00:00
Eric Christopher	19d153261f	Formatting. llvm-svn: 186851	2013-07-22 18:26:15 +00:00
Nadav Rotem	8c45d4b27f	Fix an obvious typo in the loop vectorizer where the cost model uses the wrong variable. The variable BlockCost is ignored. We don't have tests for the effect of if-conversion loops because it requires a big test (that includes if-converted loops) and it is difficult to find and balance a loop to do the right thing. llvm-svn: 186845	2013-07-22 17:10:48 +00:00
Justin Holewinski	0b76dd85ea	[NVPTX] Remove unused prototypes llvm-svn: 186844	2013-07-22 17:04:40 +00:00
Hans Wennborg	31d6fd84e6	Option parsing: allow aliases in groups Option aliases in option groups were previously disallowed by an assert. As far as I can tell, there was no technical reason for this, and I would like to be able to put cl.exe compatible options in their own group for Clang, so let's change the assert. llvm-svn: 186838	2013-07-22 16:18:13 +00:00
Mihai Popa	8a9da5b00c	This adds range checking for "ldr Rn, [pc, #imm]" Thumb instructions. With this patch: 1. ldr.n is recognized as mnemonic for the short encoding 2. ldr.w is recognized as menmonic for the long encoding 3. ldr will map to either short or long encodings depending on the size of the offset llvm-svn: 186831	2013-07-22 15:49:36 +00:00
Justin Holewinski	cd069e6dec	[NVPTX] Use approximate FP ops when unsafe-fp-math is used, and append .ftz to instructions if the nvptx-f32ftz attribute is set to "true" llvm-svn: 186820	2013-07-22 12:18:04 +00:00
Tim Northover	1a9dafcd6f	Revert "More Intel syntax alias fixes." This reverts commit r186813, which broke the bots. llvm-svn: 186818	2013-07-22 11:02:32 +00:00
Craig Topper	b095c09330	Fix typo. Change %cl to CL in Intel pattern. llvm-svn: 186815	2013-07-22 10:07:26 +00:00
Craig Topper	61da939a17	More Intel syntax alias fixes. llvm-svn: 186814	2013-07-22 09:58:07 +00:00
Craig Topper	1b9e4e7e9d	More Intel syntax alias fixes. llvm-svn: 186813	2013-07-22 09:42:31 +00:00
Craig Topper	03db790dc6	Change %xmm0 to XMM0 in Intel side of asm strings for PBLENDVB. llvm-svn: 186812	2013-07-22 09:22:49 +00:00
Craig Topper	8f9402a989	Add Intel variants to aliases for some FP instructions. llvm-svn: 186811	2013-07-22 09:18:43 +00:00
Tim Northover	eb5e4d532c	ARM: remove now unneeded custom Asm converters After Ulrich's r180677 (thanks!) TableGen is intelligent enough to handle tied constraints involving complex operands properly, so virtually all of the ARM custom converters are now unnecessary. llvm-svn: 186810	2013-07-22 09:06:12 +00:00
Craig Topper	d0ed3a417e	Reverse operands for Intel syntax form of 'bt' alias. llvm-svn: 186809	2013-07-22 07:47:51 +00:00
Nadav Rotem	d7ff88a8d9	Delete unused helper functions. llvm-svn: 186808	2013-07-22 05:19:22 +00:00
Michael Gottesman	da6365f4ed	Added missing - in the header of PrologEpilogInserter.h so that editors properly realize it is a c++ header and not a c header. llvm-svn: 186801	2013-07-22 00:52:55 +00:00
Richard Smith	70523c7926	Treat nothrow forms of ::operator delete and ::operator delete[] as deallocation functions. llvm-svn: 186798	2013-07-21 23:11:42 +00:00
Benjamin Kramer	2fdb758ca8	mem2reg: Minor STL usage cleanup. No functionality change. llvm-svn: 186790	2013-07-21 11:03:40 +00:00
Chandler Carruth	7aa9ebb546	Make the mem2reg interface use an ArrayRef as it keeps a copy of these to iterate over. llvm-svn: 186788	2013-07-21 08:37:58 +00:00
Craig Topper	8956fe0dbc	Mark that the _ftol2 function used by windows on x86 to handle fptoui modifies ECX. llvm-svn: 186787	2013-07-21 07:28:13 +00:00
Nadav Rotem	f6bb6a464c	Revert a part of r186420. Don't forbid multiple store chains that merge. llvm-svn: 186786	2013-07-21 06:12:57 +00:00
Chandler Carruth	b1ca98c4d0	Hoist the rest of the logic for promoting single-store allocas into the helper function. This leaves both trivial cases handled entirely in helper functions and merely manages the list of allocas to process in the run method. The next step will be to handle all of the trivial promotion work prior to even creating the core class and the subsequent simplifications that enables. llvm-svn: 186784	2013-07-21 01:52:33 +00:00
Chandler Carruth	f9e7e1dd87	Hoist the rest of the logic for fully promoting allocas with all uses in a single block into the helper routine. This takes advantage of the fact that we can directly replace uses prior to any store with undef to simplify matters and unconditionally promote allocas only used within one block. I've removed the special handling for the case of no stores existing. This has no semantic effect but might slow things down. I'll fix that in a later patch when I refactor this entire thing to be easier to manage the different cases. llvm-svn: 186783	2013-07-21 01:44:07 +00:00
Chandler Carruth	e99f931516	Remove a method made dead by the prior refactoring. llvm-svn: 186782	2013-07-21 00:01:34 +00:00
Chandler Carruth	420fafef93	Hoist the two trivial promotion routines out of the big class that handles the general cases. The hope is to refactor this so that we don't end up building the entire class for the trivial cases. I also want to lift a lot of the early pre-processing in the initial segment of run() into a separate routine, and really none of it needs to happen inside the primary promotion class. These routines in particular used none of the actual state in the promotion class, so they don't really make sense as members. llvm-svn: 186781	2013-07-20 23:59:51 +00:00
Chandler Carruth	48e11fd76d	Hoist the AllocaInfo struct to the top of the file. This struct is nicely independent of everything else, and we already needed a foward declaration here. It's simpler to just define it immediately. llvm-svn: 186780	2013-07-20 23:39:26 +00:00
Chandler Carruth	4711793e8a	Sink a typedef and comparator down to the function that actually uses them. llvm-svn: 186779	2013-07-20 23:36:19 +00:00
Rafael Espindola	c2bb73fc8d	Don't crash when llvm.compiler.used becomes empty. GlobalOpt simplifies llvm.compiler.used by removing any members that are also in the more strict llvm.used. Handle the special case where llvm.compiler.used becomes empty. llvm-svn: 186778	2013-07-20 23:33:15 +00:00
Chandler Carruth	f3878f46ce	Don't allocate the DIBuilder on the heap and remove all the complexity that ensued from that. llvm-svn: 186777	2013-07-20 23:33:06 +00:00
Chandler Carruth	e62f211b77	Rename constructor parameters to follow the common member-shadowing pattern and conform to the naming conventions. llvm-svn: 186776	2013-07-20 23:23:47 +00:00
Chandler Carruth	b3e8e6f10b	Reformat the implementation of mem2reg with clang-format so that my subsequent changes don't introduce inconsistencies. llvm-svn: 186775	2013-07-20 23:20:08 +00:00
Andrew Trick	b5f3c44c3a	Comment: try to clarify loop iteration order. llvm-svn: 186774	2013-07-20 23:10:31 +00:00
Chandler Carruth	985eb0b550	Remove a DenseMapInfo specialization for std::pair -- we have one of those baked into DenseMap now. llvm-svn: 186773	2013-07-20 23:09:05 +00:00
Chandler Carruth	019516109d	Update mem2reg's comments to conform to the new doxygen standards. No functionality changed. llvm-svn: 186772	2013-07-20 22:20:05 +00:00
Matt Arsenault	828b565c9c	Disallow global aliases to bitcast between address spaces llvm-svn: 186767	2013-07-20 17:46:05 +00:00
Matt Arsenault	c4c9226046	Remove trailing whitespace, fix file path in comment llvm-svn: 186766	2013-07-20 17:46:00 +00:00
Benjamin Kramer	08e5070bf5	SROA: Microoptimization: Remove dead entries first, then sort. While there replace an explicit struct with std::mem_fun. llvm-svn: 186761	2013-07-20 08:38:34 +00:00
Stephen Lin	a9b57f6bea	InstCombine: call FoldOpIntoSelect for all floating binops, not just fmul llvm-svn: 186759	2013-07-20 07:13:13 +00:00
Matt Arsenault	727aa349ad	Have InlineCost check constant fcmps llvm-svn: 186758	2013-07-20 04:09:00 +00:00
Manman Ren	19b4986b80	Debug Info Verifier: simplify DIxxx::Verify Simplify DIxxx:Verify to not call Verify on an operand. Instead, we use DebugInfoFinder to list all MDNodes that should be a DIScope and all MDNodes that should be a DIType and we will call Verify on those lists. llvm-svn: 186737	2013-07-20 00:38:46 +00:00
Matt Arsenault	fe8ff5ccda	Fix size_t -> uint warnings with MSVC 64-bit build llvm-svn: 186736	2013-07-20 00:20:10 +00:00
Lang Hames	24864fe150	Refactor AnalyzeBranch on ARM. The previous version did not always analyze indirect branches correctly. Under some circumstances, this led to the deletion of basic blocks that were the destination of indirect branches. In that case it left indirect branches to nowhere in the code. This patch replaces, and is more general than either of the previous fixes for indirect-branch-analysis issues, r181161 and r186461. For other branches (not indirect) this refactor should have almost identical behavior to the previous version. There are some corner cases where this refactor is able to analyze blocks that the previous version could not (e.g. this necessitated the update to thumb2-ifcvt2.ll). <rdar://problem/14464830> llvm-svn: 186735	2013-07-19 23:52:47 +00:00
Rui Ueyama	ed64342b67	Retry submitting r186623: COFFDumper: Dump data directory entries. The original change was rolled back in r186627 because of test failures on the big endian machine. I believe I fixed the issue so re-submitting. llvm-svn: 186734	2013-07-19 23:23:29 +00:00
Nadav Rotem	e210839f5b	fix an 80-col line. llvm-svn: 186733	2013-07-19 23:14:01 +00:00
Nadav Rotem	c069c25518	Use LLVMs ADTs that improve the compile time of this pass. llvm-svn: 186732	2013-07-19 23:12:19 +00:00
Nadav Rotem	5c9a193a65	SLPVectorizer: Improve the compile time of isConsecutive by reordering the conditions that check GEPs and eliminate two of the calls to accumulateConstantOffset. llvm-svn: 186731	2013-07-19 23:11:15 +00:00
Vincent Lejeune	8b8a7b5514	R600: Don't emit empty then clause and use alu_pop_after llvm-svn: 186725	2013-07-19 21:45:15 +00:00
Vincent Lejeune	960a622ca6	R600: Simplify AMDILCFGStructurize by removing templates and assuming single exit llvm-svn: 186724	2013-07-19 21:45:06 +00:00
Vincent Lejeune	a8c38fedd6	R600: Replace legacy debug code in AMDILCFGStructurizer.cpp llvm-svn: 186723	2013-07-19 21:44:56 +00:00
Rafael Espindola	9aadcc4c0e	s/compiler_used/compiler.used/. We were incorrectly using compiler_used instead of compiler.used. Unfortunately the passes using the broken name had tests also using the broken name. llvm-svn: 186705	2013-07-19 18:44:51 +00:00
Reid Kleckner	eadb765f42	[Option] Add inclusion and exclusion flags to option parsing Summary: This allows the clang driver to put MSVC compatible options in the same enumerator space as its normal options but exclude them from normal option parsing. Also changes the standard ParseArgs() method to consider unknown arguments with a leading slash as being inputs rather than flags. High level discussion for clang-cl is here: http://lists.cs.uiuc.edu/pipermail/cfe-dev/2013-June/030404.html CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1049 llvm-svn: 186703	2013-07-19 18:04:57 +00:00
Joey Gouly	f520d5e291	Add a line that got missed off somehow. Sorry about that! llvm-svn: 186692	2013-07-19 16:45:16 +00:00
Richard Sandiford	fac8b10a84	[SystemZ] Add ALRK, AGLRK, SLRK and SGLRK Follows the same lines as r186686, but much more limited, since we only use ADD LOGICAL for multi-i64 additions. llvm-svn: 186689	2013-07-19 16:37:00 +00:00
Joey Gouly	e860255850	[ARMv8] Implement the NEON instructions VRINT{N, X, A, Z, M, P}. llvm-svn: 186688	2013-07-19 16:34:16 +00:00
Richard Sandiford	7d6a453623	[SystemZ] Add AHIK and AGHIK I did these as a separate patch because it uses a slightly different form of RIE layout. llvm-svn: 186687	2013-07-19 16:32:12 +00:00
Richard Sandiford	c575df6dcc	[SystemZ] Add ARK, AGRK, SRK and SGRK The testsuite changes follow the same lines as for r186683. llvm-svn: 186686	2013-07-19 16:26:39 +00:00
Richard Sandiford	c57e586792	[SystemZ] Add NGRK, OGRK and XGRK Like r186683, but for 64 bits. llvm-svn: 186685	2013-07-19 16:24:22 +00:00
Serge Pavlov	4c2a09d6c4	Initialize TempFileHandle. llvm-svn: 186684	2013-07-19 16:23:54 +00:00
Richard Sandiford	0175b4a353	[SystemZ] Add NRK, ORK and XRK The atomic tests assume the two-operand forms, so I've restricted them to z10. Running and-01.ll, or-01.ll and xor-01.ll for z196 as well as z10 shows why using convertToThreeAddress() is better than exposing the three-operand forms first and then converting back to two operands where possible (which is what I'd originally tried). Using the three-operand form first stops us from taking advantage of NG, OG and XG for spills. llvm-svn: 186683	2013-07-19 16:21:55 +00:00
Tilmann Scheller	34869503cb	ARM: Add instruction aliases for the Thumb2 PLD/PLDW (literal) alternate form. See A8.8.127 in ARM DDI 0406C.b. Related to <rdar://problem/14403733>. llvm-svn: 186682	2013-07-19 16:18:56 +00:00
Richard Sandiford	ff6c5a5609	[SystemZ] Use SLLK, SRLK and SRAK for codegen This patch uses the instructions added in r186680 for codegen. llvm-svn: 186681	2013-07-19 16:12:08 +00:00
Richard Sandiford	27d1cfe3d4	[SystemZ] Start adding z196 and zEC12 support This first step just adds definitions for SLLK, SRLK and SRAK. The next patch will actually make use of them during codegen. insn-bad.s tests that some form of error is reported when using these instructions on z10. More work is needed to get the "instruction requires: distinct-ops" that we'd ideally like, so I've stubbed that part out for now. I'll come back and make it mandatory once the necessary changes are in. llvm-svn: 186680	2013-07-19 16:09:03 +00:00
Rafael Espindola	67080cec25	Split openFileForWrite into windows and unix versions. It is similar to 186511, but for creating files for writing. llvm-svn: 186679	2013-07-19 15:02:03 +00:00
Chandler Carruth	6c321c131b	Cleanup the stats counters for the new implementation. These actually count the right things and have the right names. llvm-svn: 186667	2013-07-19 10:57:36 +00:00
Chandler Carruth	1ed848d55c	Fix another assert failure very similar to PR16651's test case. This test case came from Benjamin and found the parallel bug in the vector promotion code. llvm-svn: 186666	2013-07-19 10:57:32 +00:00
Chandler Carruth	9f21fe1d65	Try to move to a more reasonable set of naming conventions given the new implementation of the SROA algorithm. We were using the term 'partition' in many places that no longer ever represented an actual partition, but rather just an arbitrary slice of an alloca. No functionality change intended here. Mostly just renaming of types, functions, variables, and rewording of comments. Several comments were rewritten to make a lot more sense in the new structure of things. The stats are still weird and not reflective of how this really works. I'll fix those up in a separate patch as it is a touch more semantic of a change... llvm-svn: 186659	2013-07-19 09:13:58 +00:00
Alexey Samsonov	64c391dbe4	Fix uninitialized memory read found by MemorySanitizer: always set output parameter of ConvergingScheduler::SchedBoundary::getOtherResourceCount llvm-svn: 186658	2013-07-19 08:55:18 +00:00
Chandler Carruth	90a735d606	A long overdue cleanup in SROA to use 'DL' instead of 'TD' for the DataLayout variables. llvm-svn: 186656	2013-07-19 07:21:28 +00:00
Chandler Carruth	5955c9e4da	Fix PR16651, an assert introduced in my recent re-work of the innards of SROA. The crux of the issue is that now we track uses of a partition of the alloca in two places: the iterators over the partitioning uses and the previously collected split uses vector. We weren't accounting for the fact that the split uses might invalidate integer widening in ways other than due to their width (in this case due to being volatile). Further reduced testcase added to the tests. llvm-svn: 186655	2013-07-19 07:12:23 +00:00
Akira Hatanaka	5bcb2407a5	[mips] Delete MFC1_FT_CCR, MTC1_FT_CCR and MOVCCRToCCR. No functionality change. llvm-svn: 186642	2013-07-19 01:19:52 +00:00
Eric Christopher	03b3e1118f	Remove DIBuilder cache of variable TheCU and change the few uses that wanted it. Also change the interface for createCompileUnit to compensate. Fix comments that refer to TheCU as well. llvm-svn: 186637	2013-07-19 00:51:47 +00:00
Manman Ren	74c61b9c80	Debug Info: enable verifying by default and disable testing cases that fail. 1> Use DebugInfoFinder to find debug info MDNodes. 2> Add disable-debug-info-verifier to disable verifying debug info. 3> Disable verifying for testing cases that fail (will update the testing cases later on). 4> MDNodes generated by clang can have empty filename for TAG_inheritance and TAG_friend, so DIType::Verify is modified accordingly. Note that DebugInfoFinder does not list all debug info MDNode. For example, clang can generate: metadata !{i32 786468}, which will fail to verify. This MDNode is used by debug info but not included in DebugInfoFinder. This MDNode is generated as a temporary node in DIBuilder::createFunction Value *TElts[] = { GetTagConstant(VMContext, DW_TAG_base_type) }; MDNode::getTemporary(VMContext, TElts) llvm-svn: 186634	2013-07-19 00:31:03 +00:00
Andrew Trick	b13ef17a14	MI Sched: Update the way resources are tracked so the current heuristics make more sense. llvm-svn: 186632	2013-07-19 00:20:07 +00:00
Rui Ueyama	f388243037	Revert "COFFDumper: Dump data directory entries." Because it broke s390x and ppc64-linux buildbots. This reverts commit r186623. llvm-svn: 186627	2013-07-18 23:15:50 +00:00
Rui Ueyama	a20b9f52d4	COFFDumper: Dump data directory entries. Summary: Dump optional data directory entries in the PE/COFF header, so that we can test the output of LLD linker. This patch updates the test binary file, but the source of the binary is the same. I just re-linked the file. I don't know how the previous file was linked, but the previous file did not have any data directory entries for some reason. Reviewers: rafael CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1148 llvm-svn: 186623	2013-07-18 22:44:20 +00:00
Nick Lewycky	03f3d34ffb	Clean up some of this code a tiny bit, no functionality change. llvm-svn: 186622	2013-07-18 22:32:32 +00:00
Tilmann Scheller	c8a06ff600	ARM: Make sure the instruction alias for PLI uses the right subtarget features. PLI requires both the Thumb2 and the ARMv7 feature. Related to <rdar://problem/14403733>. llvm-svn: 186620	2013-07-18 22:19:59 +00:00
Tom Stellard	8374720aad	R600/SI: Fix crash with VSELECT https://bugs.freedesktop.org/show_bug.cgi?id=66175 llvm-svn: 186616	2013-07-18 21:43:53 +00:00
Tom Stellard	adf732cfbc	R600/SI: Add support for v2f32 loads llvm-svn: 186615	2013-07-18 21:43:48 +00:00
Tom Stellard	ed2f6149f3	R600/SI: Add support for v2f32 stores llvm-svn: 186614	2013-07-18 21:43:42 +00:00
Tom Stellard	67ae4762ef	R600: Expand VSELECT for all types llvm-svn: 186613	2013-07-18 21:43:35 +00:00
Eric Christopher	a4b6cf14f6	Revert "Remove DIBuilder cache of variable TheCU and change the few" This reverts commit r186599 as I didn't want to commit this yet. llvm-svn: 186601	2013-07-18 19:13:06 +00:00
Eric Christopher	d0b2150f01	Remove DIBuilder cache of variable TheCU and change the few uses that wanted it. Also change the interface for createCompileUnit to compensate. Fix comments that refer to TheCU as well. llvm-svn: 186599	2013-07-18 19:11:29 +00:00
Rafael Espindola	81177c5dac	Small improvement to the use of GetFileType: * assert that the return value is one of the documented values on msdn. * on FILE_TYPE_UNKNOWN, check GetLastError. Unfortunately I can't think of a way to get a FILE_TYPE_UNKNOWN on a test. llvm-svn: 186595	2013-07-18 18:42:52 +00:00
Nadav Rotem	bb3398f000	Handle constants without going through SCEV. llvm-svn: 186593	2013-07-18 18:34:21 +00:00
Nadav Rotem	de2815a5f7	SLPVectorizer: Speedup isConsecutive by manually checking GEPs with multiple indices. This brings the compile time of the SLP-Vectorizer to about 2.5% of OPT for my testcase. llvm-svn: 186592	2013-07-18 18:20:45 +00:00
NAKAMURA Takumi	8b01da4bd8	Windows/Path.inc: Introduce file_type::character_file and file_type::fifo_file in sys::fs::getStatus(HANDLE). It fixes llvm/test/Other/close-stderr.ll on msys. FIXME: Provide unittests. llvm-svn: 186588	2013-07-18 17:00:54 +00:00
Reid Kleckner	a73c7781bd	[Support] Beef up and expose the response file parsing in llvm::cl The plan is to use it for clang and lld. Major behavior changes: - We can now parse UTF-16 files that have a byte order mark. - PR16209: Don't drop backslashes on the floor if they don't escape anything. The actual parsing loop was based on code from Clang's driver.cpp, although it's been rewritten to track its state with control flow rather than state variables. Reviewers: hans Differential Revision: http://llvm-reviews.chandlerc.com/D1170 llvm-svn: 186587	2013-07-18 16:52:05 +00:00
Joey Gouly	e25a86b082	Change 'n' to 'N' to keep consistent with other instructions. llvm-svn: 186576	2013-07-18 12:00:25 +00:00
Joey Gouly	943dd59ed5	[ARMv8] Add NEON instructions VCVT{A, N, P, M}. llvm-svn: 186574	2013-07-18 11:53:22 +00:00
Richard Sandiford	5109321042	[SystemZ] Use RNSBG This should be the last of the R.SBG patches for now. llvm-svn: 186573	2013-07-18 10:40:35 +00:00
Joey Gouly	dce01792f2	Add Thumb tests for the ARMv8 FP instructions that I recently added. Also, fix the namespace for two instructions that I missed previously. llvm-svn: 186572	2013-07-18 10:20:25 +00:00
Richard Sandiford	297f7d2724	[SystemZ] Generalize RxSBG SRA case The original code only folded SRA into ROTATE ... SELECTED BITS if there was no outer shift. This patch splits out that check and generalises it slightly. The extra cases aren't really that interesting, but this is paving the way for RNSBG support. llvm-svn: 186571	2013-07-18 10:14:55 +00:00
Richard Sandiford	7878b852e6	[SystemZ] Use RXSBG Extend the previous R.SBG patches to handle XORs. llvm-svn: 186570	2013-07-18 10:06:15 +00:00
Richard Sandiford	5cbac96730	[SystemZ] Rename and formatting fixes In hindsight, using "RISBG" for something that can be any type of R.SBG instruction was a bit confusing, so this renames it to RxSBG. That might not be the best choice either, since there is an instruction called RXSBG, but hopefully the lower-case letter stands out enough. While there I fixed a couple of GNUisms that had crept in -- sorry about that! llvm-svn: 186569	2013-07-18 09:45:08 +00:00
Joey Gouly	923d593fb5	Remove the extra leading 0 from VMAXNMND. The N3VDIntnp pattern takes bits<5> and I gave it 6 bits. Thanks to Jiangning Liu for spotting it! llvm-svn: 186568	2013-07-18 09:34:35 +00:00
Vladimir Medic	3467b90786	This patch extends mips register parsing methods to allow indexed register parsing. The corresponding test cases are added to the patch. llvm-svn: 186567	2013-07-18 09:28:35 +00:00
Craig Topper	ad1fff9be7	Fix copy and paste bug from r186491 to make v2f64 use MOVAPD/MOVUPD as it should. llvm-svn: 186566	2013-07-18 07:16:44 +00:00
Chandler Carruth	f0546402af	Reapply r186316 with a fix for one bug where the code could walk off the end of a vector. This was found with ASan. I've had one other report of a crasher, but thus far been unable to reproduce the crash. It may well be fixed with this version, and if not I'd like to get more information from the build bots about what is happening. See r186316 for the full commit log for the new implementation of the SROA algorithm. llvm-svn: 186565	2013-07-18 07:15:00 +00:00
Nadav Rotem	7d7036b8c6	SLPVectorizer: Speedup isConsecutive (that checks if two addresses are consecutive in memory) by checking for additional patterns that don't need to go through SCEV. llvm-svn: 186563	2013-07-18 04:33:20 +00:00
Hal Finkel	1860763c76	PPC: Support dynamic allocas with large alignment Support for dynamic stack alignments in the PPC backend has been unfinished, in part because it depends on dynamic stack realignment (which I only just recently implemented fully). Now we can also support dynamic allocas with higher than the default target stack alignment (16 bytes). In order to round-up the requested size to the maximum requested alignment, we need an additional register to hold the rounded-up size. We're already using one scavenged register to hold the previous stack-pointer value (which needs to be stored with the signal-safe stdux update), and so when we have dynamic allocas and a large alignment, we allocate two emergency spill slots for the scavenger. llvm-svn: 186562	2013-07-18 04:28:21 +00:00
Rafael Espindola	213c4cb18c	Remove dead code. llvm-svn: 186561	2013-07-18 03:29:51 +00:00
Rafael Espindola	4d10587ff3	Convert two uses if fstat with sys::fs::status. llvm-svn: 186560	2013-07-18 03:04:20 +00:00
Nick Lewycky	0dcefdfcab	Give 'hasPath' a longer but clearer name 'isPotentiallyReachable'. Also expand the comment. No functionality change. This change broken out of http://llvm-reviews.chandlerc.com/D996 . llvm-svn: 186558	2013-07-18 02:34:51 +00:00
Hal Finkel	f05d6c7843	PPC: Add base-pointer support to builtin setjmp/longjmp First, this changes the base-pointer implementation to remove an unnecessary complication (and one that is incompatible with how builtin SjLj is implemented): instead of using r31 as the base pointer when it is not needed as a frame pointer, now the base pointer will always be r30 when needed. Second, we introduce another pseudo register, BP, which is used just like the FP pseudo register to refer to the base register before we know for certain what register it will be. Third, we now save BP into the jmp_buf, and restore r30 from that slot in longjmp. If the function that called setjmp did not use a base pointer, then r30 will be overwritten by the setjmp-calling-function's restore code. FP restoration (which is restored into r31) works the same way. llvm-svn: 186545	2013-07-17 23:50:51 +00:00
Eric Christopher	7ab2c3ecb2	Add comparison operators for DIDescriptors to fix c++98 fallout of operator bool change. Also convert a variable in DebugIR. llvm-svn: 186544	2013-07-17 23:25:22 +00:00
Nadav Rotem	43639e8492	Fix a comment. llvm-svn: 186541	2013-07-17 22:41:16 +00:00
Eli Friedman	d2eb07acae	Handle '.' correctly in hex float literal parsing. There were a couple of different loops that were not handling '.' correctly in APFloat::convertFromHexadecimalString; these mistakes could lead to assertion failures and incorrect rounding for overlong hex float literals. Fixes PR16643. llvm-svn: 186539	2013-07-17 22:17:29 +00:00
Stephen Lin	03f9fbbcd7	Restore r181216, which was partially reverted in r182499. llvm-svn: 186533	2013-07-17 20:06:03 +00:00
Rafael Espindola	331aebae92	Fix a funny typo. Thanks to Aaron Ballman for noticing. llvm-svn: 186532	2013-07-17 19:58:28 +00:00
Nadav Rotem	3072baeb9c	Add a micro optimization to catch cases where the PtrA equals PtrB. llvm-svn: 186531	2013-07-17 19:52:25 +00:00
Rafael Espindola	16431fe7a7	Add FILE_SHARE_WRITE to openFileForRead. This should fix the windows bots. It looks like the failing tests are of the form prog1 > file prog2 file and prog2 fails trying to read the file. The best fix would probably be to close stdout/stderr in prog1, but it was not the intention of 186511 to change this, so just restore the old behavior for now. llvm-svn: 186530	2013-07-17 19:44:07 +00:00
Aaron Ballman	fbb104513b	Silencing an MSVC warning about signed vs unsigned comparison mismatches. llvm-svn: 186529	2013-07-17 19:43:13 +00:00
Akira Hatanaka	365d16e345	[mips] Use "foreach" loop to make register definitions more concise. llvm-svn: 186528	2013-07-17 19:09:27 +00:00
Michael Gottesman	f87a6ae65f	Add -- C++ -- to InstrEmitter.h. llvm-svn: 186527	2013-07-17 18:53:29 +00:00
Vladimir Medic	74593e6577	This patch checks for valid mnemonics at the beginning of parseInstruction method, thus giving the user the right error message for non-existing instructions. llvm-svn: 186512	2013-07-17 15:00:42 +00:00
Rafael Espindola	a0d9b6b693	Split openFileForRead into Windows and Unix versions. This has some advantages: * Lets us use native, utf16 windows functions. * Easy to produce good errors on windows about trying to use a directory when we want a file. * Simplifies the unix version a bit. llvm-svn: 186511	2013-07-17 14:58:25 +00:00
Hal Finkel	ec7cd26968	Fix comparisons of alloca alignment in inliner merging Duncan pointed out a mistake in my fix in r186425 when only one of the allocas being compared had the target-default alignment. This is essentially his suggested solution. Thanks! llvm-svn: 186510	2013-07-17 14:32:41 +00:00
Vladimir Medic	29410f9c91	Implement eret and deret(return from exception) instructions for Mips. Test examples are given. llvm-svn: 186507	2013-07-17 14:05:19 +00:00
Joey Gouly	df68600f44	[ARMv8] Add support for the NEON instructions vmaxnm/vminnm. This adds a new class for non-predicable NEON instructions and a new DecoderNamespace for v8 NEON instructions. llvm-svn: 186504	2013-07-17 13:59:38 +00:00
Duncan Sands	e2cd13906e	Ensure sys::getProcessTriple always uses a normalized triple. Patch by Thomas B. Jablin, from PR16636. llvm-svn: 186501	2013-07-17 11:01:05 +00:00
Richard Osborne	9ff96e6f9b	[XCore] Ensure implicit operands aren't lost on the return instruction. Patch by Robert Lytton. llvm-svn: 186500	2013-07-17 10:58:37 +00:00
Craig Topper	55475d448b	Teach x86 fast-isel to use AVX opcodes for vector stores when AVX is enabled. llvm-svn: 186496	2013-07-17 06:58:23 +00:00
Craig Topper	4f55b0efd2	Make x86 fast-isel correctly choose between aligned and unaligned operations for vector stores. Fixes PR16640. llvm-svn: 186491	2013-07-17 05:57:45 +00:00
JF Bastien	cd4c64d234	Fix ARMFastISel::ARMEmitIntExt shift emission My patch 'r183551 - ARM FastISel integer sext/zext improvements' was incorrect when emitting ARM register-immediate ASR, LSL, LSR instructions: they are pseudo-instructions in ARMInstrInfo.td and I should have used MOVsi instead. This is not an issue when code is generated through a .s file, but is an issue when generated straight to a .o (-filetype=obj). llvm-svn: 186489	2013-07-17 05:46:46 +00:00
Hal Finkel	40f76d5830	PPC: Add CTR-register clobber to builtin setjmp Because the builtin longjmp implementation uses a CTR-based indirect jump, when the control flow arrives at the builtin setjmp call, the CTR register has necessarily been clobbered. Correspondingly, this adds CTR to the list of implicit definitions of the builtin setjmp pseudo instruction. We don't need to add CTR to the implicit definitions of builtin longjmp because, even though it does clobber the CTR register, the control flow cannot return to inside the loop unless there is also a builtin setjmp call. llvm-svn: 186488	2013-07-17 05:35:44 +00:00
Craig Topper	24048c9440	Mark a method 'const' and another 'static'. llvm-svn: 186485	2013-07-17 03:54:53 +00:00
Craig Topper	1c4d667ca5	Make a few more static string pointers constant. llvm-svn: 186484	2013-07-17 03:43:10 +00:00
Rafael Espindola	b6fea4c618	Don't fallback to copy + delete in rename. Rename's documentation says "Files are renamed as if by POSIX rename()". and it is used for atomically updating output files from a temporary. Having rename fallback to a non atomic copy has the potential to hide bugs, like using a temporary file in /tmp instead of a unique name next to the final destination. llvm-svn: 186483	2013-07-17 03:33:41 +00:00
Craig Topper	9fdc70e846	Make constant string pointer into an array to remove a pointer lookup for every access. llvm-svn: 186482	2013-07-17 03:11:32 +00:00
NAKAMURA Takumi	212c80ac5d	raw_ostream.cpp: Introduce <fcntl.h> to let O_BINARY provided. Or, llvm::outs() would be set to O_TEXT by default. llvm/test/Object/check_binary_output.ll is expected to pass on win32. llvm-svn: 186480	2013-07-17 02:21:10 +00:00
Nadav Rotem	2202317fce	SLPVectorizer: Accelerate the isConsecutive check by replacing the subtraction of the two values with a simple SCEV expression that adds the offset to one of the pointers that we compare. llvm-svn: 186479	2013-07-17 00:48:31 +00:00
Hal Finkel	a7c54e8cf4	PPC: Implement base pointer and stack realignment This builds on some frame-lowering code that has existed since 2005 (r24224) but was disabled in 2008 (r48188) because it needed base pointer support to function correctly. This implementation follows the strategy suggested by Dale Johannesen in r48188 where the following comment was added: This does not currently work, because the delta between old and new stack pointers is added to offsets that reference incoming parameters after the prolog is generated, and the code that does that doesn't handle a variable delta. You don't want to do that anyway; a better approach is to reserve another register that retains to the incoming stack pointer, and reference parameters relative to that. And now we do exactly that. If we don't need a frame pointer, then we use r31 as a base pointer. If we do need a frame pointer, then we use r30 as a base pointer. The base pointer retains the value of the stack pointer before it was decremented in the prologue. We then use the base pointer to resolve all negative frame indicies. The basic scheme follows that for base pointers in the X86 backend. We use a base pointer when we need to dynamically realign the incoming stack pointer. This currently applies only to static objects (dynamic allocas with large alignments, and base-pointer support in SjLj lowering will come in future commits). llvm-svn: 186478	2013-07-17 00:45:52 +00:00
Craig Topper	8fc4096fab	Move string pointer from being a static class member to just a static global in the one file its needed in. llvm-svn: 186476	2013-07-17 00:31:35 +00:00
Manman Ren	8bfde8917e	Add getModuleFlag(StringRef Key) to query a module flag given Key. No functionality change. llvm-svn: 186470	2013-07-16 23:21:16 +00:00
Nadav Rotem	d2e8c4cdea	flip the scev minus direction to simplify the code. llvm-svn: 186466	2013-07-16 22:57:06 +00:00
Nadav Rotem	8f924f3891	SLPVectorizer: Improve the compile time of isConsecutive by adding a simple constant-gep check before using SCEV. This check does not always work because not all of the GEPs use a constant offset, but it happens often enough to reduce the number of times we use SCEV. llvm-svn: 186465	2013-07-16 22:51:07 +00:00
Lang Hames	57a113eb0d	Related to r181161 - Indirect branches may not be the last branch in a basic block. Blocks that have an indirect branch terminator, even if it's not the last terminator, should still be treated as unanalyzable. <rdar://problem/14437274> Reducing a useful regression test case is proving difficult - I hope to have one soon. llvm-svn: 186461	2013-07-16 22:01:40 +00:00
Tilmann Scheller	305bb90442	ARM: Add support for the Thumb2 PLI alternate literal form. This adds an instruction alias to make the assembler recognize the alternate literal form: pli [PC, #+/-<imm>] See A8.8.129 in the ARM ARM (DDI 0406C.b). Fixes <rdar://problem/14403733>. llvm-svn: 186459	2013-07-16 21:52:34 +00:00
Rafael Espindola	6d35481c94	Add a wrapper for open. This centralizes the handling of O_BINARY and opens the way for hiding more differences (like how open behaves with directories). llvm-svn: 186447	2013-07-16 19:44:17 +00:00
Jakob Stoklund Olesen	efeb3a1969	Remove floats from live range splitting costs. These floats all represented block frequencies anyway, so just use the BlockFrequency class directly. Some floating point computations remain in tryLocalSplit(). They are estimating spill weights which are still floats. llvm-svn: 186435	2013-07-16 18:26:18 +00:00
Jakob Stoklund Olesen	c5454ff046	Reapply r185393. Original commit message: Remove floating point computations from SpillPlacement.cpp. Patch by Benjamin Kramer! Use the BlockFrequency class instead of floats in the Hopfield network computations. This rescales the node Bias field from a [-2;2] float range to two block frequencies BiasN and BiasP pulling in opposite directions. This construct has a more predictable behavior when block frequencies saturate. The per-node scaling factors are no longer necessary, assuming the block frequencies around a bundle are consistent. This patch can cause the register allocator to make different spilling decisions. The differences should be small. llvm-svn: 186434	2013-07-16 18:26:15 +00:00
Juergen Ributzka	3d527d80b8	[X86] Use min/max to optimze unsigend vector comparison on X86 Use PMIN/PMAX for UGE/ULE vector comparions to reduce the number of required instructions. This trick also works for UGT/ULT, but there is no advantage in doing so. It wouldn't reduce the number of instructions and it would actually reduce performance. Reviewer: Ben radar:5972691 llvm-svn: 186432	2013-07-16 18:20:45 +00:00
Peter Collingbourne	8b77f18da0	Make SpecialCaseList match full strings, as documented, using anchors. Differential Revision: http://llvm-reviews.chandlerc.com/D1149 llvm-svn: 186431	2013-07-16 17:56:07 +00:00
Juergen Ributzka	c16f86020c	Test commit to verify write access. llvm-svn: 186429	2013-07-16 17:44:23 +00:00
Reid Kleckner	7df03c2e30	[Support] Add a Unicode conversion wrapper from UTF16 to UTF8 This is to support parsing UTF16 response files in LLVM/lib/Option for lld and clang. Reviewers: hans Differential Revision: http://llvm-reviews.chandlerc.com/D1138 llvm-svn: 186426	2013-07-16 17:14:33 +00:00
Hal Finkel	9caa8f7ba7	When the inliner merges allocas, it must keep the larger alignment For safety, the inliner cannot decrease the allignment on an alloca when merging it with another. I've included two variants of the test case for this: one with DataLayout available, and one without. When DataLayout is not available, if only one of the allocas uses the default alignment (getAlignment() == 0), then they cannot be safely merged. llvm-svn: 186425	2013-07-16 17:10:55 +00:00
Nadav Rotem	26bf9a0c75	SLPVectorizer: Reduce the compile time of the consecutive store lookup. Process groups of stores in chunks of 16. llvm-svn: 186420	2013-07-16 15:25:17 +00:00
Rafael Espindola	e08b59f81d	Create files with mode 666. This matches the behavior of other unix tools. llvm-svn: 186414	2013-07-16 14:10:07 +00:00
Reid Kleckner	5f4535b974	[Support] Fix some warnings when self-hosting clang on Windows llvm-svn: 186413	2013-07-16 14:04:08 +00:00
Ulrich Weigand	1d4dbda5b9	[APFloat] PR16573: Avoid losing mantissa bits in ppc_fp128 to double truncation When truncating to a format with fewer mantissa bits, APFloat::convert will perform a right shift of the mantissa by the difference of the precision of the two formats. Usually, this will result in just the mantissa bits needed for the target format. One special situation is if the input number is denormal. In this case, the right shift may discard significant bits. This is usually not a problem, since truncating a denormal usually results in zero (underflow) after normalization anyway, since the result format's exponent range is usually smaller than the target format's. However, there is one case where the latter property does not hold: when truncating from ppc_fp128 to double. In particular, truncating a ppc_fp128 whose first double of the pair is denormal should result in just that first double, not zero. The current code however performs an excessive right shift, resulting in lost result bits. This is then caught in the APFloat::normalize call performed by APFloat::convert and causes an assertion failure. This patch checks for the scenario of truncating a denormal, and attempts to (possibly partially) replace the initial mantissa right shift by decrementing the exponent, if doing so will still result in a valid target format exponent. Index: test/CodeGen/PowerPC/pr16573.ll =================================================================== --- test/CodeGen/PowerPC/pr16573.ll (revision 0) +++ test/CodeGen/PowerPC/pr16573.ll (revision 0) @@ -0,0 +1,11 @@ +; RUN: llc < %s \| FileCheck %s + +target triple = "powerpc64-unknown-linux-gnu" + +define double @test() { + %1 = fptrunc ppc_fp128 0xM818F2887B9295809800000000032D000 to double + ret double %1 +} + +; CHECK: .quad -9111018957755033591 + Index: lib/Support/APFloat.cpp =================================================================== --- lib/Support/APFloat.cpp (revision 185817) +++ lib/Support/APFloat.cpp (working copy) @@ -1956,6 +1956,23 @@ X86SpecialNan = true; } + // If this is a truncation of a denormal number, and the target semantics + // has larger exponent range than the source semantics (this can happen + // when truncating from PowerPC double-double to double format), the + // right shift could lose result mantissa bits. Adjust exponent instead + // of performing excessive shift. + if (shift < 0 && isFiniteNonZero()) { + int exponentChange = significandMSB() + 1 - fromSemantics.precision; + if (exponent + exponentChange < toSemantics.minExponent) + exponentChange = toSemantics.minExponent - exponent; + if (exponentChange < shift) + exponentChange = shift; + if (exponentChange < 0) { + shift -= exponentChange; + exponent += exponentChange; + } + } + // If this is a truncation, perform the shift before we narrow the storage. if (shift < 0 && (isFiniteNonZero() \|\| category==fcNaN)) lostFraction = shiftRight(significandParts(), oldPartCount, -shift); llvm-svn: 186409	2013-07-16 13:03:25 +00:00
Richard Osborne	ab29d19536	[XCore] Fix printing of inline asm operands. Previously an asm operand with no operand modifier would give the error "invalid operand in inline asm". llvm-svn: 186407	2013-07-16 12:48:34 +00:00
Tim Northover	069f95f926	ARM: allow printing of ARM atomic DAG nodes. We'd forgotten to provide string representations for the special ARMISD atomic nodes; this adds them in. No effect on CodeGen, just makes the output of "-view-whatever-dags" slightly more readable. llvm-svn: 186406	2013-07-16 12:15:36 +00:00
Richard Sandiford	885140c951	[SystemZ] Use ROSBG and non-zero form of RISBG for OR nodes llvm-svn: 186405	2013-07-16 11:55:57 +00:00
Vladimir Medic	a73970b662	Fixing a buildbot failure:unused function. llvm-svn: 186403	2013-07-16 11:43:20 +00:00
Richard Sandiford	35bb463fb1	[SystemZ] Add MC support for R[NOX]SBG CodeGen support will come later. llvm-svn: 186401	2013-07-16 11:28:08 +00:00
Richard Sandiford	82ec87dbdb	[SystemZ] Use RISBG for (shift (and ...)) Another patch in the series to make more use of R.SBG. This one extends r186072 and r186073 to handle cases where the AND is inside the shift. llvm-svn: 186399	2013-07-16 11:02:24 +00:00
Vladimir Medic	64828a1f73	This patch represents Mips utilization of r186388 code that alows asm matcher to emit mnemonics contain '.' characters. This makes asm parser code simpler and more efficient. llvm-svn: 186397	2013-07-16 10:07:14 +00:00
NAKAMURA Takumi	37ce985739	PPCJITInfo.cpp: Tweak r186252 with s/__ppc/__powerpc/ to work on powerpc-linux Fedora 12. g++ (GCC) 4.4.4 20100630 (Red Hat 4.4.4-10) llvm-svn: 186396	2013-07-16 09:59:51 +00:00
Tim Northover	a7ecd241d2	ARM: implement ldrex, strex and clrex intrinsics Intrinsics already existed for the 64-bit variants, so these support operations of size at most 32-bits. llvm-svn: 186392	2013-07-16 09:46:55 +00:00
Renato Golin	8761069e22	ARM EABI divmod support This patch enables calls to __aeabi_idivmod when in EABI mode, by using the remainder value returned on registers (R1), enabled by the ARM triple "none-eabi". Note that Darwin and GNUEABI triples will continue lowering on GNU style, that is, using the stack for the remainder. Still need to add SREM/UREM support fix for 64-bit lowering. llvm-svn: 186390	2013-07-16 09:32:17 +00:00
Rafael Espindola	77021c9487	Add a version of sys::fs::status that uses fstat. llvm-svn: 186378	2013-07-16 03:20:13 +00:00
Rafael Espindola	9da91a0e03	Instead friending status, provide windows and posix constructors to file_status. This opens the way of having static helpers in the .inc files that can construct a file_status. llvm-svn: 186376	2013-07-16 02:55:33 +00:00
Craig Topper	d3a34f81f8	Add 'const' qualifiers to static const char* variables. llvm-svn: 186371	2013-07-16 01:17:10 +00:00
Manman Ren	b827123cf7	PEI: Support for non-zero SPAdj at beginning of a basic block. We can have a FrameSetup in one basic block and the matching FrameDestroy in a different basic block when we have struct byval. In that case, SPAdj is not zero at beginning of the basic block. Modify PEI to correctly set SPAdj at beginning of each basic block using DFS traversal. We used to assume SPAdj is 0 at beginning of each basic block. PEI had an assert SPAdjCount \|\| SPAdj == 0. If we have a Destroy <n> followed by a Setup <m>, PEI will assert failure. We can add an extra condition to make sure the pairs are matched: The pairs start with a FrameSetup. But since we are doing a much better job in the verifier, this patch removes the check in PEI. PR16393 llvm-svn: 186364	2013-07-15 23:47:29 +00:00
Nadav Rotem	1c1d6c1666	PR16628: Fix a bug in the code that merges compares. Compares return i1 but they compare different types. llvm-svn: 186359	2013-07-15 22:52:48 +00:00
Hal Finkel	a0014a5a26	PPC: Refactoring to support subtarget feature changing This change mirrors the changes that were made to the X86 and ARM targets to support subtarget feature changing. As indicated in r182899, the mechanism is still undergoing revision, and so as with the X86 and ARM targets, there is no test case yet (there is no effective functionality change). llvm-svn: 186357	2013-07-15 22:29:40 +00:00
Manman Ren	aa6875b1f9	Machine Verifier: verify FrameSetup and FrameDestroy 1> on every path through the CFG, a FrameSetup <n> is always followed by a FrameDestroy <n> and a FrameDestroy is always followed by a FrameSetup. 2> stack adjustments are identical on all CFG edges to a merge point. 3> frame is destroyed at end of a return block. PR16393 llvm-svn: 186350	2013-07-15 21:26:31 +00:00
Rafael Espindola	8ea26d6a80	Remove an extra is_directory call. I checked that opening a directory on windows does fail, so this saves a "stat". llvm-svn: 186345	2013-07-15 20:52:01 +00:00
Hal Finkel	8e8618ae5c	Fix register subclass handling in PPCInstrInfo::insertSelect PPCInstrInfo::insertSelect and PPCInstrInfo::canInsertSelect were computing the common subclass of the true and false inputs, and then selecting either the 32-bit or the 64-bit isel variant based on the result of calling PPC::GPRCRegClass.hasSubClassEq(RC) and PPC::G8RCRegClass.hasSubClassEq(RC) (where RC is the common subclass). Unfortunately, this is not quite right: if we have something like this: %vreg8<def> = SELECT_CC_I8 %vreg4<kill>, %vreg7<kill>, %vreg6<kill>, 76; G8RC_and_G8RC_NOX0:%vreg8 CRRC:%vreg4 G8RC_NOX0:%vreg7,%vreg6 then the common subclass of G8RC_and_G8RC_NOX0 and G8RC_NOX0 is G8RC_NOX0, and G8RC_NOX0 is not a subclass of G8RC (because it also contains the ZERO8 pseudo-register). As a result, we also need to check the common subclass against GPRC_NOR0 and G8RC_NOX0 explicitly. This had not been a problem for clients of insertSelect that called canInsertSelect first (because it had a compensating mistake), but insertSelect is also used by the PPC pseudo-instruction expander, and this error was causing a problem in that context. This problem was found by csmith. llvm-svn: 186343	2013-07-15 20:22:58 +00:00
Reid Kleckner	dae7b4e4d1	[mc-coff] Resolve aliases when emitting COFF relocations This is consistent with the ELF object writer. Add some COFF tests that relocate against an alias. Reviewers: espindola Differential Revision: http://llvm-reviews.chandlerc.com/D1079 llvm-svn: 186341	2013-07-15 19:41:21 +00:00
Tom Stellard	31209cc8eb	R600/SI: Add support for 64-bit loads https://bugs.freedesktop.org/show_bug.cgi?id=65873 llvm-svn: 186339	2013-07-15 19:00:09 +00:00
Hal Finkel	2f5e8e3d95	Remove invalid assert in DAGTypeLegalizer::RemapValue There is a comment at the top of DAGTypeLegalizer::PerformExpensiveChecks which, in part, says: // Note that these invariants may not hold momentarily when processing a node: // the node being processed may be put in a map before being marked Processed. Unfortunately, this assert would be valid only if the above-mentioned invariant held unconditionally. This was causing llc to assert when, in fact, everything was fine. Thanks to Richard Sandiford for investigating this issue! Fixes PR16562. llvm-svn: 186338	2013-07-15 18:57:05 +00:00
Stephen Lin	837bba1c51	Remove trailing whitespace llvm-svn: 186333	2013-07-15 17:55:02 +00:00
Chandler Carruth	e3899f2c2c	Revert r186316 while I track down an ASan failure and an assert from a bot. This reverts the commit which introduced a new implementation of the fancy SROA pass designed to reduce its overhead. I'll skip the huge commit log here, refer to r186316 if you're looking for how this all works and why it works that way. llvm-svn: 186332	2013-07-15 17:36:21 +00:00
Reid Kleckner	a75eba9c05	Revert "[Option] Store arg strings in a set backed by a BumpPtrAllocator" This broke clang's crash-report.c test, and I haven't been able to figure it out yet. This reverts commit r186319. llvm-svn: 186329	2013-07-15 16:40:52 +00:00
Job Noorman	a928e1d7a1	Test commit to see if write access works. llvm-svn: 186321	2013-07-15 14:25:26 +00:00
Reid Kleckner	cacb40c6c7	[Option] Store arg strings in a set backed by a BumpPtrAllocator No functionality change. This is preparing to move response file parsing into lib/Option so it can be shared between clang and lld. This change isn't just a micro-optimization. Clang's driver uses a std::set<std::string> to unique arguments while parsing response files, so this matches that. llvm-svn: 186319	2013-07-15 13:46:24 +00:00
Chandler Carruth	e74ff4c643	Reimplement SROA yet again. Same fundamental principle, but a totally different core implementation strategy. Previously, SROA would build a relatively elaborate partitioning of an alloca, associate uses with each partition, and then rewrite the uses of each partition in an attempt to break apart the alloca into chunks that could be promoted. This was very wasteful in terms of memory and compile time because regardless of how complex the alloca or how much we're able to do in breaking it up, all of the datastructure work to analyze the partitioning was done up front. The new implementation attempts to form partitions of the alloca lazily and on the fly, rewriting the uses that make up that partition as it goes. This has a few significant effects: 1) Much simpler data structures are used throughout. 2) No more double walk of the recursive use graph of the alloca, only walk it once. 3) No more complex algorithms for associating a particular use with a particular partition. 4) PHI and Select speculation is simplified and happens lazily. 5) More precise information is available about a specific use of the alloca, removing the need for some side datastructures. Ultimately, I think this is a much better implementation. It removes about 300 lines of code, but arguably removes more like 500 considering that some code grew in the process of being factored apart and cleaned up for this all to work. I've re-used as much of the old implementation as possible, which includes the lion's share of code in the form of the rewriting logic. The interesting new logic centers around how the uses of a partition are sorted, and split into actual partitions. Each instruction using a pointer derived from the alloca gets a 'Partition' entry. This name is totally wrong, but I'll do a rename in a follow-up commit as there is already enough churn here. The entry describes the offset range accessed and the nature of the access. Once we have all of these entries we sort them in a very specific way: increasing order of begin offset, followed by whether they are splittable uses (memcpy, etc), followed by the end offset or whatever. Sorting by splittability is important as it simplifies the collection of uses into a partition. Once we have these uses sorted, we walk from the beginning to the end building up a range of uses that form a partition of the alloca. Overlapping unsplittable uses are merged into a single partition while splittable uses are broken apart and carried from one partition to the next. A partition is also introduced to bridge splittable uses between the unsplittable regions when necessary. I've looked at the performance PRs fairly closely. PR15471 no longer will even load (the module is invalid). Not sure what is up there. PR15412 improves by between 5% and 10%, however it is nearly impossible to know what is holding it up as SROA (the entire pass) takes less time than reading the IR for that test case. The analysis takes the same time as running mem2reg on the final allocas. I suspect (without much evidence) that the new implementation will scale much better however, and it is just the small nature of the test cases that makes the changes small and noisy. Either way, it is still simpler and cleaner I think. llvm-svn: 186316	2013-07-15 10:30:19 +00:00
Alexey Samsonov	1a98450469	DebugInfo: Factor out parsing compile unit DIEs to a separate function. Improve code style and comments. No functionality change. llvm-svn: 186315	2013-07-15 08:43:35 +00:00
Craig Topper	06b3b6651e	Add 'const' qualifier to some arrays. llvm-svn: 186312	2013-07-15 08:02:13 +00:00
Craig Topper	e952ad0bc1	Make some arrays 'static const' llvm-svn: 186311	2013-07-15 07:22:00 +00:00
Craig Topper	f18edae094	Add include to hopefully fix windows build. llvm-svn: 186310	2013-07-15 07:15:05 +00:00
Craig Topper	de1f151115	Add const qualifier to some static arrays. llvm-svn: 186309	2013-07-15 07:02:45 +00:00
Craig Topper	202fbc2c9b	Add 'static' keyword to some const arrays for consistency. llvm-svn: 186308	2013-07-15 06:54:12 +00:00
Craig Topper	0afd0ab749	Make some arrays 'static const' llvm-svn: 186307	2013-07-15 06:39:13 +00:00
Craig Topper	26b45c27f1	Revert part of 186302 to fix buildbots. llvm-svn: 186303	2013-07-15 04:37:54 +00:00

... 2 3 4 5 6 ...

62944 Commits