llvm-project

Commit Graph

Author	SHA1	Message	Date
Chandler Carruth	a7205b6154	[LCG] Special case the removal of self edges. These don't impact the SCC graph in any way because we don't track edges in the SCC graph, just nodes. This also lets us add a nice assert about the invariant that we're working on at least a certain number of nodes within the SCC. llvm-svn: 207305	2014-04-26 03:36:37 +00:00
Juergen Ributzka	a6bda8bae2	[DAG] During DAG legalization keep opaque constants even after expanding. The included test case would return incorrect results, because the expansion of a shift with a constant shift amount of 0 would generate undefined behavior. This is because ExpandShiftByConstant assumes that all shifts by constants with a value of 0 have already been optimized away. This doesn't happen for opaque constants and usually this isn't a problem, because opaque constants won't take this code path - they are not supposed to. In the case that the opaque constant has to be expanded by the legalizer, the legalizer would drop the opaque flag. In this case we hit the limitations of ExpandShiftByConstant and create incorrect code. This commit fixes the legalizer by not dropping the opaque flag when expanding opaque constants and adding an assertion to ExpandShiftByConstant to catch this unsupported case in the future. This fixes <rdar://problem/16718472> llvm-svn: 207304	2014-04-26 02:58:04 +00:00
Gerolf Hoflehner	c46e9b0423	Revert commit r207302 since build failures have been reported. llvm-svn: 207303	2014-04-26 02:03:17 +00:00
Gerolf Hoflehner	34210108b3	RecursivelyDeleteTriviallyDeadInstructions() could remove more than 1 instruction. The caller needs to be aware of this and adjust instruction iterators accordingly. rdar://16679376 llvm-svn: 207302	2014-04-26 01:19:16 +00:00
Quentin Colombet	ea18933d97	[X86] Implement TargetLowering::getScalingFactorCost hook. Scaling factors are not free on X86 because every "complex" addressing mode breaks the related instruction into 2 allocations instead of 1. <rdar://problem/16730541> llvm-svn: 207301	2014-04-26 01:11:26 +00:00
Chandler Carruth	8f92d6db22	[LCG] Refactor the duplicated code I added in my last commit here into a helper function. Also factor the other two places where we did the same thing into the helper function. =] Much cleaner this way. NFC. llvm-svn: 207300	2014-04-26 01:03:46 +00:00
Andrea Di Biagio	8cc9059ce8	[InstCombine][X86] Teach how to fold calls to SSE2/AVX2 packed logical shift right intrinsics. A packed logical shift right with a shift count bigger than or equal to the element size always produces a zero vector. In all other cases, it can be safely replaced by a 'lshr' instruction. llvm-svn: 207299	2014-04-26 01:03:22 +00:00
Richard Smith	8d039e4420	Add missing include guards and missing #include, found by modules build. llvm-svn: 207298	2014-04-26 00:53:26 +00:00
Filipe Cabecinhas	d71f110fe9	Appease the almighty buildbots. llvm-svn: 207295	2014-04-26 00:02:37 +00:00
Filipe Cabecinhas	363b570d2a	Optimization for certain shufflevector by using insertps. Summary: If we're doing a v4f32/v4i32 shuffle on x86 with SSE4.1, we can lower certain shufflevectors to an insertps instruction: When most of the shufflevector result's elements come from one vector (and keep their index), and one element comes from another vector or a memory operand. Added tests for insertps optimizations on shufflevector. Added support and tests for v4i32 vector optimization. Reviewers: nadav Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3475 llvm-svn: 207291	2014-04-25 23:51:17 +00:00
Duncan P. N. Exon Smith	42292ceaa9	Revert "blockfreq: Approximate irreducible control flow" This reverts commit r207286. It causes an ICE on the cmake-llvm-x86_64-linux buildbot [1]: llvm/lib/Analysis/BlockFrequencyInfo.cpp: In lambda function: llvm/lib/Analysis/BlockFrequencyInfo.cpp:182:1: internal compiler error: in get_expr_operands, at tree-ssa-operands.c:1035 [1]: http://bb.pgr.jp/builders/cmake-llvm-x86_64-linux/builds/12093/steps/build_llvm/logs/stdio llvm-svn: 207287	2014-04-25 23:16:58 +00:00
Duncan P. N. Exon Smith	384d0e8ad4	blockfreq: Approximate irreducible control flow Previously, irreducible backedges were ignored. With this commit, irreducible SCCs are discovered on the fly, and modelled as loops with multiple headers. This approximation specifies the headers of irreducible sub-SCCs as its entry blocks and all nodes that are targets of a backedge within it (excluding backedges within true sub-loops). Block frequency calculations act as if we insert a new block that intercepts all the edges to the headers. All backedges and entries to the irreducible SCC point to this imaginary block. This imaginary block has an edge (with even probability) to each header block. The result is now reasonable enough that I've added a number of testcases for irreducible control flow. I've outlined in `BlockFrequencyInfoImpl.h` ways to improve the approximation. <rdar://problem/14292693> llvm-svn: 207286	2014-04-25 23:08:57 +00:00
Adrian Prantl	232897feaa	Unbreak the gdb buildbot by not lowering dbg.declare intrinsics for arrays. llvm-svn: 207284	2014-04-25 23:00:25 +00:00
Eric Christopher	ece0e90e33	Make sure that rangelists are also relative to the compile unit low_pc similar to location lists. Fixes PR19563 llvm-svn: 207283	2014-04-25 22:23:54 +00:00
Matt Arsenault	de1c3410c3	R600: Fix function name printing in LowerCall v2: Check both ExternalSymbol and GlobalAddress Patch by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 207282	2014-04-25 22:22:01 +00:00
David Blaikie	772ab8ae5a	DwarfAccelTable: Store the string symbol in the accelerator table to avoid duplicate lookup. This also avoids the need for subtly side-effecting calls to manifest strings in the string table at the point where items are added to the accelerator tables. llvm-svn: 207281	2014-04-25 22:21:35 +00:00
Tom Roeder	fd1bc602b3	Add an -mattr option to the gold plugin to support subtarget features in LTO This adds support for an -mattr option to the gold plugin and to llvm-lto. This allows the caller to specify details of the subtarget architecture, like +aes, or +ssse3 on x86. Note that this requires a change to the include/llvm-c/lto.h interface: it adds a function lto_codegen_set_attr and it increments the version of the interface. llvm-svn: 207279	2014-04-25 21:46:51 +00:00
Alexey Samsonov	b54d0f4020	Fix missing include llvm-svn: 207278	2014-04-25 21:42:35 +00:00
David Blaikie	daefdbf3ad	Encapsulate the DWARF string pool in a separate type. Pulls out some more code from some of the rather monolithic DWARF classes. Unlike the address table, the string table won't move up into DwarfDebug - each DWARF file has its own string table (but there can be only one address table). llvm-svn: 207277	2014-04-25 21:34:35 +00:00
Alexey Samsonov	001ecd9aa9	[DWARF parser] Cleanup code in DWARFDebugAranges. No functionality change. llvm-svn: 207276	2014-04-25 21:30:03 +00:00
Alexey Samsonov	4316df5921	[DWARF parser] Cleanup code in DWARFDebugAbbrev. No functionality change. llvm-svn: 207274	2014-04-25 21:10:56 +00:00
Adam Nemet	03d91c51e4	[LoopStrengthReduce] Don't trim formula that uses a subset of required registers Consider this use from the new testcase: LSR Use: Kind=ICmpZero, Offsets={0}, widest fixup type: i32 reg({1000,+,-1}<nw><%for.body>) -3003 + reg({3,+,3}<nw><%for.body>) -1001 + reg({1,+,1}<nuw><nsw><%for.body>) -1000 + reg({0,+,1}<nw><%for.body>) -3000 + reg({0,+,3}<nuw><%for.body>) reg({-1000,+,1}<nw><%for.body>) reg({-3000,+,3}<nsw><%for.body>) This is the last use we consider for a solution in SolveRecurse, so CurRegs is a large set. (CurRegs is the set of registers that are needed by the previously visited uses in the in-progress solution.) ReqRegs is { {3,+,3}<nw><%for.body>, {1,+,1}<nuw><nsw><%for.body> } This is the intersection of the regs used by any of the formulas for the current use and CurRegs. Now, the code requires a formula to contain all these regs (the comment is simply wrong), otherwise the formula is immediately disqualified. Obviously, no formula for this use contains two regs so they will all get disqualified. The fix modifies the check to allow the formula in this case. The idea is that neither of these formulae is introducing any new registers which is the point of this early pruning as far as I understand. In terms of set arithmetic, we now allow formulas whose used regs are a subset of the required regs not just the other way around. There are a few more loops in the test-suite that are now successfully LSRed. I have benchmarked those and found very minimal change. Fixes <rdar://problem/13965777> llvm-svn: 207271	2014-04-25 21:02:21 +00:00
Duncan P. N. Exon Smith	9f35117956	SCC: Use the reference typedef Actually use the `reference` typedef, and remove the private redefinition of `pointer` since it has no users. Using `reference` exposes a problem with r207257, which specified the wrong `value_type` to `iterator_facade_base` (fixed that too). llvm-svn: 207270	2014-04-25 20:52:08 +00:00
Adrian Prantl	32da88923a	This reapplies r207235 with an additional bugfix caught by the msan buildbot - do not insert debug intrinsics before phi nodes. Debug info for optimized code: Support variables that are on the stack and described by DBG_VALUEs during their lifetime. Previously, when a variable was at a FrameIndex for any part of its lifetime, this would shadow all other DBG_VALUEs and only a single fbreg location would be emitted, which in fact is only valid for a small range and not the entire lexical scope of the variable. The included dbg-value-const-byref testcase demonstrates this. This patch fixes this by Local - emitting dbg.value intrinsics for allocas that are passed by reference - dropping all dbg.declares (they are now fully lowered to dbg.values) SelectionDAG - renamed constructors for SDDbgValue for better readability. - fix UserValue::match() to handle indirect values correctly - not inserting MMI table entries for dbg.values that describe allocas. - lowering dbg.values that describe allocas into indirect DBG_VALUEs. CodeGenPrepare - leaving dbg.values for an alloca where they are (see comment) Other - regenerated/updated instcombine.ll testcase and included source rdar://problem/16679879 http://reviews.llvm.org/D3374 llvm-svn: 207269	2014-04-25 20:49:25 +00:00
David Blaikie	0651d7650a	MCAssembler: Simplify implementation of const variants of getSymbolData by calling one implementation from the other. Code review feedback by Rafael Espindola on r207124. llvm-svn: 207266	2014-04-25 20:19:11 +00:00
David Blaikie	37436ed485	BugPoint: Fix some memory leaks. Patch by Kostya Serebryany. unique_ptr would be nice, but it's a bit too much work for an area I'm not familiar with, nor invested in, unfortunately. llvm-svn: 207265	2014-04-25 20:15:16 +00:00
David Blaikie	0eb13ce85a	DwarfUnit: Remove unused function llvm-svn: 207264	2014-04-25 20:02:24 +00:00
David Blaikie	914046e1e7	DIE: Pass ownership of children via std::unique_ptr rather than raw pointer. This should reduce the chance of memory leaks like those fixed in r207240. There's still some unclear ownership of DIEs happening in DwarfDebug. Pushing unique_ptr and references through more APIs should help expose the cases where ownership is a bit fuzzy. llvm-svn: 207263	2014-04-25 20:00:34 +00:00
David Blaikie	8dbcc3fe32	DIEEntry: Refer to the specified DIE via reference rather than pointer. Makes some more cases (the unit tests, specifically), lexically compatible with a change to unique_ptr. llvm-svn: 207261	2014-04-25 19:33:43 +00:00
David Blaikie	b0b3fcf6d3	DwarfUnit: return by reference from createAndAddDIE Since this doesn't return ownership (the DIE has been added to the specified parent already) nor return null, just return by reference. llvm-svn: 207259	2014-04-25 18:52:29 +00:00
Duncan P. N. Exon Smith	da5eaeda01	blockfreq: Further shift logic to LoopData Move a lot of the loop-related logic that was sprinkled around the code into `LoopData`. <rdar://problem/14292693> llvm-svn: 207258	2014-04-25 18:47:04 +00:00
Duncan P. N. Exon Smith	eb6a582d13	SCC: Provide operator->() through iterator_facade_base Use the fancy new `iterator_facade_base` to add `scc_iterator::operator->()`. Remove other definitions where `iterator_facade_base` does the right thing. <rdar://problem/14292693> llvm-svn: 207257	2014-04-25 18:43:41 +00:00
Reed Kotler	5c7f91e42f	enable fast isel tablegen files for Mips Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3498 llvm-svn: 207256	2014-04-25 18:36:38 +00:00
David Blaikie	adcde36ceb	Return DIE by reference instead of pointer from DwarfUnit::getUnitDie llvm-svn: 207255	2014-04-25 18:35:57 +00:00
Duncan P. N. Exon Smith	ef86928927	SCC: Remove non-const operator*() <rdar://problem/14292693> llvm-svn: 207254	2014-04-25 18:26:45 +00:00
David Blaikie	65a7466675	DwarfUnit: Suddenly, DIE references, everywhere. This'll make changing to unique_ptr ownership of DIEs easier since the usages will now have '*' on them making them textually compatible between unique_ptr and raw pointer. llvm-svn: 207253	2014-04-25 18:26:14 +00:00
Duncan P. N. Exon Smith	d2b2facb07	SCC: Change clients to use const, NFC It's fishy to be changing the `std::vector<>` owned by the iterator, and no one actually does it, so I'm going to remove the ability in a subsequent commit. First, update the users. <rdar://problem/14292693> llvm-svn: 207252	2014-04-25 18:24:50 +00:00
Duncan P. N. Exon Smith	f4e1d6fd06	SCC: Doxygen-ize comments, NFC <rdar://problem/14292693> llvm-svn: 207251	2014-04-25 18:18:46 +00:00
Adrian Prantl	d2d9b76e48	Revert "This reapplies r207130 with an additional testcase+and a missing check for" This reverts commit 207235 to investigate msan buildbot breakage. llvm-svn: 207250	2014-04-25 18:18:09 +00:00
Duncan P. N. Exon Smith	a16a629ef6	SCC: Un-inline long functions These are long functions that really shouldn't be inlined. Otherwise, no functionality change. <rdar://problem/14292693> llvm-svn: 207249	2014-04-25 18:15:50 +00:00
Duncan P. N. Exon Smith	5547afed78	SCC: Remove redundant inline keywords, NFC Functions declared in line in a class are inlined by default. There's no reason for the `inline` keyword. <rdar://problem/14292693> llvm-svn: 207248	2014-04-25 18:10:23 +00:00
Reed Kotler	c041669927	Make sure that DSUB does not duplicate the pattern of DSUBU Test Plan: Run test suite to make sure there is no regression. https://dmz-portal.mips.com/bb/builders/LLVM%20with%2064bit%20and%20delay%20slot%20optimizer%20and%20direct%20object%20emitter/builds/626 Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3497 llvm-svn: 207247	2014-04-25 18:05:00 +00:00
Saleem Abdulrasool	99f0d458c3	ARM: remove @llvm.arm.sevl This intrinsic is no longer needed with the new @llvm.arm.hint(i32) intrinsic which provides a generic, extensible manner for adding hint instructions. This functionality can now be represented as @llvm.arm.hint(i32 5). llvm-svn: 207246	2014-04-25 17:51:25 +00:00
Manman Ren	3c44067a30	[inline cold threshold] Command line argument for inline threshold will override the default cold threshold. When we use command line argument to set the inline threshold, the default cold threshold will not be used. This is in line with how we use OptSizeThreshold. When we want a higher threshold for all functions, we do not have to set both inline threshold and cold threshold. llvm-svn: 207245	2014-04-25 17:34:55 +00:00
David Blaikie	e071fc8082	Refactor some common logic in DwarfUnit::constructVariableDIE and pass non-null DIE by reference to DbgVariable::setDIE llvm-svn: 207244	2014-04-25 17:32:19 +00:00
Saleem Abdulrasool	7e7c2f9ca6	ARM: provide a new generic hint intrinsic Introduce the llvm.arm.hint(i32) intrinsic that can be used to inject hints into the instruction stream. This is particularly useful for generating IR from a compiler where the user may inject an intrinsic (e.g. __yield). These are then pattern substituted into the correct instruction which already existed. llvm-svn: 207242	2014-04-25 17:24:24 +00:00
David Blaikie	de519a2d82	PR19554: Fix some memory leaks in DIEHashTest.cpp llvm-svn: 207240	2014-04-25 17:07:55 +00:00
Adrian Prantl	0840a22452	Reapply r207135 without modifications. Debug info: Let dbg.values inserted by LowerDbgDeclare inherit the location of the dbg.value. This gets rid of tons of redundant variable DIEs in subscopes. rdar://problem/14874886, rdar://problem/16679936 llvm-svn: 207236	2014-04-25 17:01:04 +00:00
Adrian Prantl	f5834a4b49	This reapplies r207130 with an additional testcase+and a missing check for AllocaInst that was missing in one location. Debug info for optimized code: Support variables that are on the stack and described by DBG_VALUEs during their lifetime. Previously, when a variable was at a FrameIndex for any part of its lifetime, this would shadow all other DBG_VALUEs and only a single fbreg location would be emitted, which in fact is only valid for a small range and not the entire lexical scope of the variable. The included dbg-value-const-byref testcase demonstrates this. This patch fixes this by Local - emitting dbg.value intrinsics for allocas that are passed by reference - dropping all dbg.declares (they are now fully lowered to dbg.values) SelectionDAG - renamed constructors for SDDbgValue for better readability. - fix UserValue::match() to handle indirect values correctly - not inserting MMI table entries for dbg.values that describe allocas. - lowering dbg.values that describe allocas into indirect DBG_VALUEs. CodeGenPrepare - leaving dbg.values for an alloca where they are (see comment) Other - regenerated/updated instcombine.ll testcase and included source rdar://problem/16679879 http://reviews.llvm.org/D3374 llvm-svn: 207235	2014-04-25 17:01:00 +00:00
Tilmann Scheller	2c65bbddd8	[ARM64] When compiling for ELF in PIC mode, local symbols shouldn't go through the GOT There's no need for local symbols to go through the GOT, in fact it seems GNU ld is not even emitting GOT entries for local symbols and will error out when trying to resolve a GOT relocation for a local symbol. This bug triggers when bootstrapping clang on AArch64 Linux with -fPIC and the ARM64 backend. The AArch64 backend is not affected. With this commit it's now possible to bootstrap clang on AArch64 Linux with the ARM64 backend (-fPIC, -O3). llvm-svn: 207226	2014-04-25 13:43:18 +00:00
Jiangning Liu	533b560bc6	[ARM64] Handle fp128 for parameter passing on stack llvm-svn: 207222	2014-04-25 12:07:03 +00:00
Tim Northover	eb7354fd3b	ARM64: fix assertion in ISelDAGToDAG Also an unused variable, so double bonus! This should deal with PR19548. llvm-svn: 207221	2014-04-25 10:48:47 +00:00
Bradley Smith	672df15122	[ARM64] Print preferred aliases for SFBM/UBFM in InstPrinter llvm-svn: 207219	2014-04-25 10:25:29 +00:00
Chandler Carruth	9ba7762d7f	[LCG] During the incremental update of an SCC, switch to using the SCCMap to test for nodes that have been re-added to the root SCC rather than a set vector. We already have done the SCCMap lookup, we just need to test it in two different ways. In turn, do most of the processing of these nodes as they go into the root SCC rather than lazily. This simplifies the final loop to just stitch the root SCC into its children's parent sets. No functionality changed. However, this makes a few things painfully obvious, which was my intent. =] There is tons of repeated code introduced here and elsewhere. I'm splitting the refactoring of that code into helpers from this change so it's clear that this is the change which switches the datastructures used around, and the other is a pure factoring & deduplication of code change. llvm-svn: 207217	2014-04-25 09:52:44 +00:00
Kevin Qin	022d395c9c	[ARM64] Add RUN lines for "-target arm64 -mattr=-fp-armv8" on AArch64 no-fp test. This patch is a supplement of implementing predicate of FP, enabling aarch64 backend no-fp tests on arm64 target for verification. During this, one bug is exposed and fixed by this patch. llvm-svn: 207215	2014-04-25 09:44:20 +00:00
Kevin Qin	0e7b07704e	[ARM64] Support crc predicate on ARM64. According to the specification, CRC is an optional extension of the architecture. llvm-svn: 207214	2014-04-25 09:25:42 +00:00
Chandler Carruth	2e6ef0e80f	[LCG] During the incremental re-build of an SCC after removing an edge, remove the nodes in the SCC from the SCC map entirely prior to the DFS walk. This allows the SCC map to represent both the state of not-yet-re-added-to-an-SCC and added-back-to-this-SCC independently. The first is being missing from the SCC map, the second is mapping back to 'this'. In a subsequent commit, I'm going to use this property to simplify the new node list for this SCC. In theory, I think this also makes the contract for orphaning a node from the graph slightly less confusing. Now it is also orphaned from the SCC graph. Still, this isn't quite right either, and so I'm not adding test cases here. I'll add test cases for the behavior of orphaning nodes when the code actually supports it. The change here is mostly incidental, my goal is simplifying the algorithm. llvm-svn: 207213	2014-04-25 09:08:10 +00:00
Chandler Carruth	770060ddfa	[LCG] Rather than doing a linear time SmallSetVector removal of each child from the worklist, wait until we actually need to pop another element off of the worklist and skip over any that were already visited by the DFS. This also enables swapping the nodes of the SCC into the worklist. No functionality changed. llvm-svn: 207212	2014-04-25 09:08:05 +00:00
Chandler Carruth	6b88e3a545	[LCG] Remove a completely unnecessary loop. It wasn't even doing anything, just mucking up the code. I feel bad that I even wrote this loop. Very sorry. The diff is huge because of the indent change, but I promise all this is doing is realizing that the outer two loops were actually the exact same loops, and we didn't need two of them. llvm-svn: 207202	2014-04-25 06:45:06 +00:00
Chandler Carruth	774c9320c0	[LCG] Now that the loop structure of the core SCC finding routine is factored into a more reasonable form, replace the tail call with a simple outer-loop continuation. It's sad that C++ makes this so awkward to write, but it seems more direct and clear than the tail call at this point. llvm-svn: 207201	2014-04-25 06:38:58 +00:00
Saleem Abdulrasool	d4cae62fda	X86: convert object streamer selection to a switch Change the object streamer selection to a switch from a series of if conditions. Rather than defaulting to ELF, require that an ELF format is requested. The Windows/!ELF is maintained as MachO would have been selected first and will still provide a MachO format. Add an assertion that if COFF is requested that the target platform is Windows as only WinCOFF object emission is currently supported. llvm-svn: 207200	2014-04-25 06:29:36 +00:00
Anders Waldenborg	f3a1acfbf7	[python] Fix getting section contents. The return value was handled as c_char_p which meant that ctypes handled it as a NUL-terminated string making it cut the contents at the first NUL (or even worse - overrunning the buffer if it doesn't contain a NUL). Differential Revision: http://reviews.llvm.org/D3474 llvm-svn: 207199	2014-04-25 06:25:15 +00:00
David Blaikie	69d0cf06bc	Add missing cpp file header Code review feedback from Paul Robinson on r207022 llvm-svn: 207198	2014-04-25 06:22:32 +00:00
Craig Topper	062a2baef0	[C++] Use 'nullptr'. Target edition. llvm-svn: 207197	2014-04-25 05:30:21 +00:00
Craig Topper	f40110f4d8	[C++] Use 'nullptr'. Transforms edition. llvm-svn: 207196	2014-04-25 05:29:35 +00:00
Duncan P. N. Exon Smith	cb7d29d30c	blockfreq: Only one mass distribution per node Remove the concepts of "forward" and "general" mass distributions, which was wrong. The split might have made sense in an early version of the algorithm, but it's definitely wrong now. <rdar://problem/14292693> llvm-svn: 207195	2014-04-25 04:38:43 +00:00
Duncan P. N. Exon Smith	ebf7626988	blockfreq: Document assertion <rdar://problem/14292693> llvm-svn: 207194	2014-04-25 04:38:40 +00:00
Duncan P. N. Exon Smith	84408d1fda	blockfreq: Use better branch weights in multiexit test The branch weights were even before. Make them different. <rdar://problem/14292693> llvm-svn: 207193	2014-04-25 04:38:37 +00:00
Duncan P. N. Exon Smith	58c8948a0c	blockfreq: Clean up irreducible testcases Strip irreducible testcases to pure control flow. The function calls made the branch weights more believable but cluttered it up a lot. There isn't going to be any constant analysis here, so just use dumb branch logic to clarify the important parts. <rdar://problem/14292693> llvm-svn: 207192	2014-04-25 04:38:35 +00:00
Duncan P. N. Exon Smith	3f086789ff	blockfreq: Document high-level functions <rdar://problem/14292693> llvm-svn: 207191	2014-04-25 04:38:32 +00:00
Duncan P. N. Exon Smith	71f07451b6	blockfreq: Remove dead code <rdar://problem/14292693> llvm-svn: 207190	2014-04-25 04:38:30 +00:00
Duncan P. N. Exon Smith	5291d2a561	blockfreq: Scale LoopData::Scale on the way down Rather than scaling loop headers and then scaling all the loop members by the header frequency, scale `LoopData::Scale` itself, and scale the loop members by it. It's much more obvious what's going on this way, and doesn't cost any extra multiplies. <rdar://problem/14292693> llvm-svn: 207189	2014-04-25 04:38:27 +00:00
Duncan P. N. Exon Smith	0633f0ec29	blockfreq: unwrapLoopPackage() => unwrapLoop() <rdar://problem/14292693> llvm-svn: 207188	2014-04-25 04:38:25 +00:00
Duncan P. N. Exon Smith	da0b21cf96	blockfreq: Pass the Loop directly into unwrapLoopPackage() <rdar://problem/14292693> llvm-svn: 207187	2014-04-25 04:38:23 +00:00
Duncan P. N. Exon Smith	575bd8c81b	blockfreq: Unwrap from Loops When unwrapping loops, just visit the loops rather than all nodes. <rdar://problem/14292693> llvm-svn: 207186	2014-04-25 04:38:20 +00:00
Duncan P. N. Exon Smith	46d9a56ce6	blockfreq: Separate unwrapLoops() from finalizeMetrics() <rdar://problem/14292693> llvm-svn: 207185	2014-04-25 04:38:17 +00:00
Duncan P. N. Exon Smith	50a1bb85b8	blockfreq: LoopData::MemberList => NodeList <rdar://problem/14292693> llvm-svn: 207184	2014-04-25 04:38:15 +00:00
Duncan P. N. Exon Smith	c9b7cfea2f	blockfreq: Expose getPackagedNode() Make `getPackagedNode()` a member function of `BlockFrequencyInfoImplBase` so that it's available for templated code. <rdar://problem/14292693> llvm-svn: 207183	2014-04-25 04:38:12 +00:00
Duncan P. N. Exon Smith	1cab8a0708	blockfreq: Store the header with the members <rdar://problem/14292693> llvm-svn: 207182	2014-04-25 04:38:09 +00:00
Duncan P. N. Exon Smith	39cc64827e	blockfreq: Encapsulate LoopData::Header <rdar://problem/14292693> llvm-svn: 207181	2014-04-25 04:38:06 +00:00
Duncan P. N. Exon Smith	4bbaff75e0	blockfreq: Embed Loop hierarchy in LoopData Continue refactoring to make `LoopData` first-class. Here I'm making the `LoopData` hierarchy explicit, instead of bouncing back and forth with `WorkingData`. This simplifies the logic and better matches the `LoopInfo` design. (Eventually, `LoopInfo` should be restructured so that it supports this pass, and `LoopData` can be removed.) <rdar://problem/14292693> llvm-svn: 207180	2014-04-25 04:38:03 +00:00
Duncan P. N. Exon Smith	d132040ed6	blockfreq: Use LoopData directly Instead of passing around loop headers, pass around `LoopData` directly. <rdar://problem/14292693> llvm-svn: 207179	2014-04-25 04:38:01 +00:00
Duncan P. N. Exon Smith	e005c7c496	blockfreq: Stop using range-based for to traverse Loops A follow-up commit will need the actual iterators. <rdar://problem/14292693> llvm-svn: 207178	2014-04-25 04:37:58 +00:00
Duncan P. N. Exon Smith	fc7dc93031	blockfreq: Use a std::list for Loops As pointed out by David Blaikie in code review, a `std::list<T>` is simpler than a `std::vector<std::unique_ptr<T>>`. Another option is a `std::deque<T>` (which allocates in chunks), but I'd like to leave open the option of inserting in the middle of the sequence for handling irreducible control flow on the fly. <rdar://problem/14292693> llvm-svn: 207177	2014-04-25 04:30:06 +00:00
Craig Topper	e6cb63e471	[C++] Use 'nullptr'. Tools edition. llvm-svn: 207176	2014-04-25 04:24:47 +00:00
Karthik Bhat	6a48f7d66e	Allow vectorization of bit intrinsics in BB Vectorizer. This patch adds support for vectorization of bit intrinsics such as bswap,ctpop,ctlz,cttz. llvm-svn: 207174	2014-04-25 03:33:48 +00:00
Justin Bogner	b59d7c73b0	ProfileData: Treat missing function counts as malformed llvm-svn: 207172	2014-04-25 02:45:33 +00:00
Reid Kleckner	65fc0e2c00	Change llvm-config --ldflags to report ${CMAKE_CXX_LINK_FLAGS} Should fix PR19526. When Oscar added this code in the initial CMake build system port, he had a TODO saying that ${CMAKE_SHARED_LINKER_FLAGS} was probably wrong. I agree. I'm using ${CMAKE_CXX_LINK_FLAGS} to point LLVM at my custom installation of gcc 4.recent, so that seems more correct. With this change, I can build creduce against an installed clang, and it picks up the right flags from --ldflags. llvm-svn: 207171	2014-04-25 01:44:20 +00:00
David Blaikie	39fa6a285c	Fix quadratic performance during debug compression due to sections x symbols iteration. When fixing the symbols in each compressed section we were iterating over all symbols for each compressed section. In extreme cases this could snowball severely (5min uncompressed -> 35min compressed) due to iterating over all symbols for each compressed section (large numbers of compressed sections can be generated by DWARF type units). To address this, build a map of the symbols in each section ahead of time, and access that map if a section is being compressed. This brings compile time for the aforementioned example down to ~6 minutes. llvm-svn: 207167	2014-04-25 00:48:01 +00:00
Adrian Prantl	6e5de2ea06	Revert "This reapplies r207130 with an additional testcase+and a missing check for" Typo in testcase. llvm-svn: 207166	2014-04-25 00:42:50 +00:00
Adrian Prantl	3512190ab3	This reapplies r207130 with an additional testcase+and a missing check for AllocaInst that was missing in one location. Debug info for optimized code: Support variables that are on the stack and described by DBG_VALUEs during their lifetime. Previously, when a variable was at a FrameIndex for any part of its lifetime, this would shadow all other DBG_VALUEs and only a single fbreg location would be emitted, which in fact is only valid for a small range and not the entire lexical scope of the variable. The included dbg-value-const-byref testcase demonstrates this. This patch fixes this by Local - emitting dbg.value intrinsics for allocas that are passed by reference - dropping all dbg.declares (they are now fully lowered to dbg.values) SelectionDAG - renamed constructors for SDDbgValue for better readability. - fix UserValue::match() to handle indirect values correctly - not inserting MMI table entries for dbg.values that describe allocas. - lowering dbg.values that describe allocas into indirect DBG_VALUEs. CodeGenPrepare - leaving dbg.values for an alloca where they are (see comment) Other - regenerated/updated instcombine.ll testcase and included source rdar://problem/16679879 http://reviews.llvm.org/D3374 llvm-svn: 207165	2014-04-25 00:38:40 +00:00
Adrian Prantl	ff4282a204	Revert "Debug info for optimized code: Support variables that are on the stack and" This reverts commit 207130 for buildbot breakage. llvm-svn: 207162	2014-04-25 00:04:49 +00:00
Adrian Prantl	5ad11841f7	Revert "Debug info: Let dbg.values inserted by LowerDbgDeclare inherit the location" This reverts commit 207130 for buildbot breakage. llvm-svn: 207159	2014-04-24 23:53:29 +00:00
Richard Smith	ab1cb0990d	Add missing include, found by modules build. llvm-svn: 207158	2014-04-24 23:29:25 +00:00
Richard Smith	80429c42ab	Function defined in a header should be inline. Found by modules build. llvm-svn: 207157	2014-04-24 23:14:32 +00:00
Alexey Samsonov	19f76f25e9	[DWARF parser] Make a few methods non-public llvm-svn: 207156	2014-04-24 23:08:56 +00:00
Alexey Samsonov	7682f81266	[DWARF parser] DWARFUnit ctor doesn't need both parsed and raw .debug_abbrev section. Remove the former. llvm-svn: 207153	2014-04-24 22:51:03 +00:00
Alexey Samsonov	9a5c95ad3a	[DWARF parser] Simplify and re-format a method llvm-svn: 207151	2014-04-24 22:41:09 +00:00
Chandler Carruth	91dcf0f977	[LCG] Switch a weird do/while loop that actually couldn't fail its condition into an obviously infinite loop with an assert about the degenerate condition. No functionality changed. llvm-svn: 207147	2014-04-24 21:19:30 +00:00
Chandler Carruth	d5835ee368	[ADT] Generalize pointee_iterator to smart pointers by using decltype. Based on review feedback from Dave on the original patch. llvm-svn: 207146	2014-04-24 21:10:35 +00:00
Benjamin Kramer	76f753e9a9	X86: Don't transform shifts into ands when the sign bit is tested. Should unbreak MultiSource/Benchmarks/mediabench/g721/g721encode/encode. llvm-svn: 207145	2014-04-24 20:51:37 +00:00
Reid Kleckner	3981faecbd	Remove dead inline function that doesn't compile. MSVC doesn't diagnose this, interestingly. llvm-svn: 207144	2014-04-24 20:19:22 +00:00
Reid Kleckner	5772b77789	Add 'musttail' marker to call instructions. This is similar to the 'tail' marker, except that it guarantees that tail call optimization will occur. It also comes with conservative IR verification rules that ensure that tail call optimization is possible. Reviewers: nicholas Differential Revision: http://llvm-reviews.chandlerc.com/D3240 llvm-svn: 207143	2014-04-24 20:14:34 +00:00
Reid Kleckner	0fbb1e91e5	Fix rdtsc.ll test to match r8 on win64 llvm-svn: 207142	2014-04-24 20:14:08 +00:00
Richard Smith	a4b7cfd64f	Remove C++11ism (specializing a template in a surrounding namespace) to appease the buildbots. llvm-svn: 207136	2014-04-24 18:49:15 +00:00
Adrian Prantl	f4a701092e	Debug info: Let dbg.values inserted by LowerDbgDeclare inherit the location of the dbg.value. This gets rid of tons of redundant variable DIEs in subscopes. rdar://problem/14874886, rdar://problem/16679936 llvm-svn: 207135	2014-04-24 18:44:15 +00:00
Richard Smith	0d9ec713e7	[modules] "Specialize" a function by actually specializing a function template rather than by adding an overload and hoping that it's declared before the code that calls it. (In a modules build, it isn't.) llvm-svn: 207133	2014-04-24 18:27:29 +00:00
Adrian Prantl	f4223918de	Debug info for optimized code: Support variables that are on the stack and described by DBG_VALUEs during their lifetime. Previously, when a variable was at a FrameIndex for any part of its lifetime, this would shadow all other DBG_VALUEs and only a single fbreg location would be emitted, which in fact is only valid for a small range and not the entire lexical scope of the variable. The included dbg-value-const-byref testcase demonstrates this. This patch fixes this by Local - emitting dbg.value intrinsics for allocas that are passed by reference - dropping all dbg.declares (they are now fully lowered to dbg.values) SelectionDAG - renamed constructors for SDDbgValue for better readability. - fix UserValue::match() to handle indirect values correctly - not inserting MMI table entries for dbg.values that describe allocas. - lowering dbg.values that describe allocas into indirect DBG_VALUEs. CodeGenPrepare - leaving dbg.values for an alloca where they are (see comment) Other - regenerated/updated instcombine-intrinsics testcase and included source rdar://problem/16679879 http://reviews.llvm.org/D3374 llvm-svn: 207130	2014-04-24 17:41:45 +00:00
Andrea Di Biagio	d1ab866868	[X86] Add support for Read Time Stamp Counter x86 builtin intrinsics. This patch: - Adds two new X86 builtin intrinsics ('int_x86_rdtsc' and 'int_x86_rdtscp') as GCCBuiltin intrinsics; - Teaches the backend how to lower the two new builtins; - Introduces a common function to lower READCYCLECOUNTER dag nodes and the two new rdtsc/rdtscp intrinsics; - Improves (and extends) the existing x86 test 'rdtsc.ll'; now test 'rdtsc.ll' correctly verifies that both READCYCLECOUNTER and the two new intrinsics work fine for both 64bit and 32bit Subtargets. llvm-svn: 207127	2014-04-24 17:18:27 +00:00
Matt Arsenault	1018c897f6	R600/SI: Use address space in allowsUnalignedMemoryAccesses llvm-svn: 207126	2014-04-24 17:08:26 +00:00
David Blaikie	908f4d4bf5	Spread some const around for non-mutating uses of MCSymbolData. I discovered this const-hole while attempting to coalesce the Symbol and SymbolMap data structures. There are some pending issues with that, but I figured this change was easy to flush early. llvm-svn: 207124	2014-04-24 16:59:40 +00:00
Matheus Almeida	583a13cf36	[mips] Remove non-ascii character. llvm-svn: 207123	2014-04-24 16:31:10 +00:00
Tim Northover	11a9b1b45a	AArch64/ARM64: add ARM64 runs to more MC tests. llvm-svn: 207120	2014-04-24 15:04:26 +00:00
Tim Northover	d3c3f9f3ca	AArch64/ARM64: run AArch64 NEON MC tests through ARM64 too. This skips a couple of compare ones due to the different syntax for floating-point 0.0. AArch64 does it more canonically, and we'll need to fiddle ARM64 to make it work. llvm-svn: 207119	2014-04-24 15:04:20 +00:00
David Blaikie	293a2a3ada	Fix memory leak of MCSymbolData in MCAsmStreamer. Leak identified by LSan and reported by Kostya Serebryany. Let's get a bit experimental here... in theory our minimum compiler versions support unordered_map. llvm-svn: 207118	2014-04-24 14:33:36 +00:00
Tim Northover	6331d4b975	AArch64: print NEON lists with a space. This matches ARM64 behaviour, which I think is clearer. It also puts all the churn from that difference into one easily ignored commit. llvm-svn: 207116	2014-04-24 14:06:20 +00:00
Evgeniy Stepanov	f4a36999ad	[asan] Use MCInstrInfo in inline asm instrumentation. Patch by Yuri Gorshenin. llvm-svn: 207115	2014-04-24 13:29:34 +00:00
Tim Northover	11b935f282	AArch64/ARM64: enable remaining MC elf tests. llvm-svn: 207112	2014-04-24 12:56:41 +00:00
Tim Northover	d702d6ac6f	AArch64/ARM64: allow negative addends, at least on ELF. llvm-svn: 207111	2014-04-24 12:56:38 +00:00
Tim Northover	624928134f	ARM64: support relocated "TBZ/TBNZ" instructions. llvm-svn: 207110	2014-04-24 12:56:34 +00:00
Tim Northover	0815a43e7c	AArch64/ARM64: support relocated ADR instruction llvm-svn: 207109	2014-04-24 12:56:30 +00:00
Tim Northover	597ccb200c	AArch64/ARM64: add support for :abs_gN_s: MOVZ modifiers We only need assembly support, so it's fairly easy. llvm-svn: 207108	2014-04-24 12:56:27 +00:00
Tim Northover	49153037d4	ARM64: shut up warning about variable only used in assert. llvm-svn: 207106	2014-04-24 12:22:12 +00:00
Tim Northover	79ec019261	AArch64/ARM64: disentangle the "B.CC" and "LDR lit" operands. These can have different relocations in ELF. In particular both "b.eq global" and "ldr x0, global" are valid, giving different relocations. The only possible way to distinguish them is via a different fixup, so the operands had to be separated throughout the backend. llvm-svn: 207105	2014-04-24 12:12:10 +00:00
Tim Northover	cf16ec238e	AArch64/ARM64: enable some MC tests on ARM64 This will also (as with CodeGen) disable testing when the ARM64 backend is not present. llvm-svn: 207104	2014-04-24 12:12:01 +00:00
Tim Northover	9b594d1163	AArch64/ARM64: port bitfield test to ARM64. llvm-svn: 207103	2014-04-24 12:11:56 +00:00
Tim Northover	eb6611e727	AArch64/ARM64: implement BFI optimisation ARM64 was not producing pure BFI instructions for bitfield insertion operations, unlike AArch64. The approach had to be a little different (in ISelDAGToDAG rather than ISelLowering), and the outcomes aren't identical but hopefully this gives it similar power. This should address PR19424. llvm-svn: 207102	2014-04-24 12:11:53 +00:00
Tim Northover	1cb984fbcf	AArch64/ARM64: port more tests llvm-svn: 207101	2014-04-24 12:11:46 +00:00
Chandler Carruth	24553934f8	[LCG] Incorporate the core trick of improvements on the naive Tarjan's algorithm here: http://dl.acm.org/citation.cfm?id=177301. The idea of isolating the roots has even more relevance when using the stack not just to implement the DFS but also to implement the recursive step. Because we use it for the recursive step, to isolate the roots we need to maintain two stacks: one for our recursive DFS walk, and another of the nodes that have been walked. The nice thing is that the latter will be half the size. It also fixes a complete hack where we scanned backwards over the stack to find the next potential-root to continue processing. Now that is always the top of the DFS stack. While this is a really nice improvement already (IMO) it further opens the door for two important simplifications: 1) De-duplicating some of the code across the two different walks. I've actually made the duplication a bit worse in some senses with this patch because the two are starting to converge. 2) Dramatically simplifying the loop structures of both walks. I wanted to do those separately as they'll be essentially just CFG restructuring. This patch on the other hand actually uses different datastructures to implement the algorithm itself. llvm-svn: 207098	2014-04-24 11:05:20 +00:00
Chandler Carruth	09751bf173	[LCG] Rotate logic applied to the top of the DFSStack to instead be applied prior to pushing a node onto the DFSStack. This is the first step toward avoiding the stack entirely for leaf nodes. It also simplifies things a bit and I think is pointing the way toward factoring some more of the shared logic out of the two implementations. It is also making it more obvious how to restructure the loops themselves to be a bit easier to read (although no different in terms of functionality). llvm-svn: 207095	2014-04-24 09:59:59 +00:00
Chandler Carruth	ead50d39bc	[LCG] Re-order expectations to provide more useful output when debugging an issue. This way you see that the number of nodes was wrong before a crash due to accessing too many nodes. llvm-svn: 207094	2014-04-24 09:59:56 +00:00
Evgeniy Stepanov	b6c47a5bd2	[asan] Fix instrumentation of x86 intel syntax inline assembly. Patch by Yuri Gorshenin. llvm-svn: 207092	2014-04-24 09:56:15 +00:00
Chandler Carruth	493e0a6ad0	[LCG] Switch the parent SCC tracking from a SmallSetVector to a SmallPtrSet. Currently, there is no need for stable iteration in this dimension, and I now think there won't need to be going forward. If this is ever re-introduced in any form, it needs to not be a SetVector based solution because removal cannot be linear. There will be many SCCs with large numbers of parents. When encountering these, the incremental SCC update for intra-SCC edge removal was quadratic due to linear removal (kind of). I'm really hoping we can avoid having an ordering property here at all though... llvm-svn: 207091	2014-04-24 09:22:31 +00:00
Chandler Carruth	d52f8e0e4d	[LCG] We don't actually need a set in each SCC to track the nodes. We can use the node -> SCC mapping in the top-level graph to test this on the rare occasions we need it. llvm-svn: 207090	2014-04-24 08:55:36 +00:00
Zinovy Nis	27c486ffe1	[CLNUP] Test commit. Remove newline. llvm-svn: 207089	2014-04-24 08:42:58 +00:00
Benjamin Kramer	f4575db2fd	X86: Emit test instead of constant shift + compare if the shift result is unused. This allows us to compile "return (mask & 0x8 ? a : b);" into "testb $8, %dil; cmovnel %edx, %esi" instead of "andl $8, %edi; shrl $3, %edi; cmovnel %edx, %esi", which we formed previously because dag combiner canonicalizes setcc of and into shift. llvm-svn: 207088	2014-04-24 08:15:31 +00:00
Chandler Carruth	944b9acddd	[LCG] Switch the SCC's parent iterators to be value iterators rather than pointer iterators. llvm-svn: 207086	2014-04-24 07:48:18 +00:00
Karthik Bhat	81e6bf0a41	Allow vectorization of few missed llvm intrinsic calls in BBVectorizor by handling them in isVectorizableIntrinsic function. llvm-svn: 207085	2014-04-24 07:29:55 +00:00
Chandler Carruth	3478d4b164	[ADT] Attempt to appease another MSVC oddity by moving the injected class name usage into a context we can put typename on it. llvm-svn: 207084	2014-04-24 06:59:50 +00:00
Craig Topper	353eda484c	[C++] Use 'nullptr'. llvm-svn: 207083	2014-04-24 06:44:33 +00:00
Chandler Carruth	150a5f1dd3	[ADT] Try to appease MSVC by sinking the enable_if from a default template argument to a default argument to the constructor. llvm-svn: 207082	2014-04-24 06:16:12 +00:00
Stepan Dyatkovskiy	00dcc0f53c	Fix for PR18921, "vmov" part. Added support for bytes replication feature, so it could be GAS compatible. E.g. instructions below: "vmov.i32 d0, 0xffffffff" "vmvn.i32 d0, 0xabababab" "vmov.i32 d0, 0xabababab" "vmov.i16 d0, 0xabab" are incorrect, but we could deal with such cases. For first one we should emit: "vmov.i8 d0, 0xff" For second one ("vmvn"): "vmov.i8 d0, 0x54" For last two instructions it should emit: "vmov.i8 d0, 0xab" P.S.: In ARMAsmParser.cpp I have also fixed few nearby style issues in old code. Just for keeping method bodies in harmony with themselves. llvm-svn: 207080	2014-04-24 06:03:01 +00:00
Chandler Carruth	a3211b5dca	Use the shiny new iterator adaptor tool to implement the value_op_iterator. llvm-svn: 207078	2014-04-24 05:33:53 +00:00
Chandler Carruth	2803df5ae6	[ADT] Factor out the facade aspect of the iterator_adaptor_base into its own CRTP base class for more general purpose use. Add some clarifying comments for the exact way in which the adaptor uses it. Hopefully this will help us write increasingly full featured iterators. This is becoming important as they start to be used heavily inside of ranges. llvm-svn: 207072	2014-04-24 04:07:06 +00:00
Chandler Carruth	9a6be8b3b1	[ADT] Add a generic iterator utility for adapting iterators much like Boost's iterator_adaptor, and a specific adaptor which iterates over pointees when wrapped around an iterator over pointers. This is the result of a long discussion on IRC with Duncan Smith, Dave Blaikie, Richard Smith, and myself. Essentially, I could use some subset of the iterator facade facilities often used from Boost, and everyone seemed interested in having the functionality in a reasonably generic form. I've tried to strike a balance between the pragmatism and the established Boost design. The primary differences are: 1) Delegating to the standard iterator interface names rather than special names that then make up a second iterator-like API. 2) Using the name 'pointee_iterator' which seems more clear than 'indirect_iterator'. The whole business of calling the '*p' operation 'pointer indirection' in the standard is ... quite confusing. And 'dereference' is no better of a term for moving from a pointer to a reference. Hoping Duncan, and others continue to provide comments on this until we've got a nice, minimal abstraction. llvm-svn: 207069	2014-04-24 03:31:23 +00:00
David Blaikie	31f2900ae6	Remove unused parameter llvm-svn: 207061	2014-04-24 01:25:10 +00:00
David Blaikie	18d337508c	Remove the intermediate AccelTypes maps in DWARF units. llvm-svn: 207060	2014-04-24 01:23:49 +00:00
David Blaikie	ecf0415245	Remove the intermediate AccelNamespace maps in DWARF units. llvm-svn: 207059	2014-04-24 01:02:42 +00:00
Michael J. Spencer	dee4b2c379	[InstCombine][x86] Constant fold psll intrinsics. This excludes avx512 as I don't have hardware to verify. It excludes _dq variants because they are represented in the IR as <{2,4} x i64> when it's actually a byte shift of the entire i{128,256}. This also excludes _dq_bs as they aren't at all supported by the backend. There are also no corresponding instructions in the ISA. I have no idea why they exist... llvm-svn: 207058	2014-04-24 00:58:18 +00:00
David Blaikie	0ee82b95cb	Remove the intermediate AccelObjC maps in DWARF units llvm-svn: 207057	2014-04-24 00:53:32 +00:00
Filipe Cabecinhas	1a80595a2b	Optimize some special cases for SSE4a insertqi Summary: Since the upper 64 bits of the destination register are undefined when performing this operation, we can substitute it and let the optimizer figure out that only a copy is needed. Also added range merging, if an instruction copies a range that can be merged with a previous copied range. Added test cases for both optimizations. Reviewers: grosbach, nadav CC: llvm-commits Differential Revision: http://reviews.llvm.org/D3357 llvm-svn: 207055	2014-04-24 00:38:14 +00:00
Matt Arsenault	60728177fb	Handle addrspacecast when looking at memcpys from globals llvm-svn: 207054	2014-04-24 00:01:09 +00:00
Chandler Carruth	6a4fee87bc	[LCG] Normalize the post-order SCC iterator to just iterate over the SCC values rather than having pointers in weird places. llvm-svn: 207053	2014-04-23 23:51:07 +00:00
Chandler Carruth	a800e28818	[LCG] Remove two unused typedefs from the iterators. llvm-svn: 207052	2014-04-23 23:51:02 +00:00
David Blaikie	27931a41e4	And actually use the DwarfDebug::AccelNames to emit the names. Fix for r207049 which would've emitted no accelerated names at all... llvm-svn: 207051	2014-04-23 23:46:25 +00:00
David Blaikie	f2505d6995	More formatting... llvm-svn: 207050	2014-04-23 23:38:39 +00:00
David Blaikie	2406a0627c	Remove intermediate accelerator table for names. (similar changes coming for the other accelerator tables) llvm-svn: 207049	2014-04-23 23:37:35 +00:00
Chandler Carruth	bd5d3082c4	[LCG] Switch the primary node iterator to be a much more normal C++ iterator, returning a Node by reference on dereference. llvm-svn: 207048	2014-04-23 23:34:48 +00:00
Chandler Carruth	2a898e0df6	[LCG] Make the insertion and query paths into the LCG which cannot fail return references to better model this property. No functionality changed. llvm-svn: 207047	2014-04-23 23:20:36 +00:00
Chandler Carruth	a10e240377	[LCG] Switch the SCC lookup to be in terms of call graph nodes rather than functions. So far, this access pattern is much more common. It seems likely that any user of this interface is going to have nodes at the point that they are querying the SCCs. No functionality changed. llvm-svn: 207045	2014-04-23 23:12:06 +00:00
David Blaikie	2c0f4ef241	DwarfAccelTable: Remove trivial dtor and simplify construction with an array. llvm-svn: 207044	2014-04-23 23:03:45 +00:00
Jordan Rose	001080b375	Use std::less instead of < in array_pod_sort's default comparator. This makes array_pod_sort portably safe to use with pointers. llvm-svn: 207043	2014-04-23 22:44:11 +00:00
Chandler Carruth	b4a04da0b9	[LCG] Switch the primary SCC building code to use the negative low-link values rather than an expensive dense map query to test whether children have already been popped into an SCC. This matches the incremental SCC building code. I've also included the assert that I put there but updated both of their text. No functionality changed here. I still don't have any great ideas for sharing the code between the two implementations, but I may try a brute-force approach to factoring it at some point. llvm-svn: 207042	2014-04-23 22:28:13 +00:00
Saleem Abdulrasool	b6d051c4f0	MC: disable test on thumbv7-windows This is dependent on changes that are not fully ready to be merged yet (WoA object file emission). The test can be re-enabled for that target later. llvm-svn: 207038	2014-04-23 21:55:18 +00:00
Justin Bogner	c67f0250ef	llvm-cov: Add support for gcov's --long-file-names option GCOV provides an option to prepend output file names with the source file name, to disambiguate between covered data that's included from multiple sources. Add a flag to llvm-cov that does the same. llvm-svn: 207035	2014-04-23 21:44:55 +00:00
Justin Bogner	bac905c684	llvm-cov: Allow short options to be grouped llvm-svn: 207034	2014-04-23 21:44:48 +00:00
Saleem Abdulrasool	9e6a524551	MC: move test from Generic to COFF This is a COFF specific test, move it to COFF to fix the Hexagon buildbots. llvm-svn: 207030	2014-04-23 21:41:07 +00:00
Saleem Abdulrasool	33ebff07a9	MC: move ARM64 test from AArch64 directory The test was changed from aarch64 to arm64 but not moved. The test would fail if the backend was not built. llvm-svn: 207029	2014-04-23 21:29:40 +00:00
Saleem Abdulrasool	11049a0fef	MC: honour IMAGE_SCN_CNT_INITIALIZED_DATA Emit the flag to indicate to the assembler that a section contains data if there is pre-populated data present. llvm-svn: 207028	2014-04-23 21:29:34 +00:00
David Blaikie	d75fb28ae7	Move the AddressPool from DwarfFile to DwarfDebug. There's only ever one address pool, not one per DWARF output file, so let's just have one. (similar refactoring of the string pool to come soon) llvm-svn: 207026	2014-04-23 21:20:10 +00:00
David Blaikie	8fb87eee17	clang-format for my previous commit (I keep forgetting... ) llvm-svn: 207025	2014-04-23 21:20:07 +00:00
Matt Arsenault	4dbd4891c7	Use pointer size function where only a pointer is expected llvm-svn: 207023	2014-04-23 21:10:15 +00:00
David Blaikie	e226b08ee9	Separate out the DWARF address pool into its own type/files. llvm-svn: 207022	2014-04-23 21:04:59 +00:00
Matt Arsenault	be55888849	Remove more default address space argument usage. These places are inconsequential in practice. llvm-svn: 207021	2014-04-23 20:58:57 +00:00
Quentin Colombet	ef86b4067c	[ARM64] Fix the information we give to the peephole optimizer for comparison. ANDS does not use the same encoding scheme as other xxxS instructions (e.g., ADDS). Take that into account to avoid wrong peephole optimization. <rdar://problem/16693089> llvm-svn: 207020	2014-04-23 20:43:38 +00:00
Matt Arsenault	fcd7401bbf	Don't use default address space arguments in GlobalOpt llvm-svn: 207019	2014-04-23 20:36:10 +00:00
Anders Waldenborg	614dda1ef3	[python] Fix python bindings tests Broke after the changes related to the LLVMGetSymbolFileOffset removal in r206750 llvm-svn: 207018	2014-04-23 20:32:03 +00:00
Matt Arsenault	4c6ab696e2	R600: Add a test that used to be broken that I forgot to add llvm-svn: 207017	2014-04-23 19:45:05 +00:00
David Blaikie	05e736fb8a	clang-format r207010 llvm-svn: 207016	2014-04-23 19:44:08 +00:00
Matt Arsenault	fed895c9c6	Convert test to FileCheck llvm-svn: 207015	2014-04-23 19:32:37 +00:00
Quentin Colombet	04f7b74c39	[X86] Fix missing/wrong scheduling model found by code inspection. llvm-svn: 207014	2014-04-23 19:30:26 +00:00
Anders Waldenborg	91527efdec	llvm-build: Get rid of 'import *' This allows pyflakes catching more errors in the script. Differential Revision: http://reviews.llvm.org/D3334 llvm-svn: 207012	2014-04-23 19:17:42 +00:00
David Blaikie	85f80d7122	Split out DwarfFile from DwarfDebug into its own .h/.cpp files. Some of these types (DwarfDebug in particular) are quite large to begin with (and I keep forgetting whether DwarfFile is in DwarfDebug or DwarfUnit... ) so having a few smaller files seems like goodness. llvm-svn: 207010	2014-04-23 18:54:00 +00:00
Justin Bogner	fa5b013d48	ProfileData: Avoid unnecessary copies of CounterData We're currently copying CounterData from InstrProfWriter into the OnDiskHashTable, even though we don't need to, and then carelessly leaking those copies. A const pointer is much better here. llvm-svn: 207009	2014-04-23 18:50:16 +00:00
Simon Atanasyan	eb08c4f038	[yaml2obj][ELF] Remove unnecessary space between namespace name and colons. llvm-svn: 207003	2014-04-23 17:30:29 +00:00
Alexander Potapenko	a51e483846	[ASan] Move the shadow range on 32-bit iOS (and iOS Simulator) to 0x40000000-0x60000000 to avoid address space clash with system libraries. The solution has been proposed by tahabekireren@gmail.com in https://code.google.com/p/address-sanitizer/issues/detail?id=210 This is also known to fix some Chromium iOS tests. llvm-svn: 207002	2014-04-23 17:14:45 +00:00
Matt Arsenault	6b4bed4b83	Remove dead code in instcombine. Don't replace shifts greater than the type with the maximum shift. This isn't hit anywhere in the tests, and somewhere else is replacing these with undef. llvm-svn: 207000	2014-04-23 16:48:40 +00:00
NAKAMURA Takumi	d5696915d4	X86AsmParser.cpp: Fix memory leak at replacing movsd to movsl. llvm-svn: 206991	2014-04-23 14:51:35 +00:00
NAKAMURA Takumi	c2c6649d61	cl::ParseCommandLineOptions(): Use StringRef to receive sys::path::filename() instead of std::string. llvm-svn: 206990	2014-04-23 14:51:23 +00:00
NAKAMURA Takumi	fe16a620a7	Mark llvm/test/BugPoint/compile-custom.ll as XFAIL:vg_leak. llvm-svn: 206989	2014-04-23 14:51:12 +00:00
Rafael Espindola	6a4a0799a5	Centralize handling of ELF_Other_ThumbFunc. No functionality change. llvm-svn: 206988	2014-04-23 14:42:32 +00:00
Evgeniy Stepanov	119cb2eed5	Fix handling of missing DataLayout in sanitizers. Pass::doInitialization is supposed to return False when it did not change the program, not when a fatal error occurs. llvm-svn: 206975	2014-04-23 12:51:32 +00:00
Rafael Espindola	6992778176	Remove AssemblyAnnotationWriter from NamedMDNode::print. No functionality change, this parameter was always set to nullptr. Patch by Robert Matusewicz! llvm-svn: 206972	2014-04-23 12:23:05 +00:00
Evgeniy Stepanov	0a951b775e	Create MCTargetOptions. For now it contains a single flag, SanitizeAddress, which enables AddressSanitizer instrumentation of inline assembly. Patch by Yuri Gorshenin. llvm-svn: 206971	2014-04-23 11:16:03 +00:00
Simon Atanasyan	62fce0a975	[yaml2obj][ELF] Add a virtual destructor to the ELFYAML::Section class to prevent memory leaks. llvm-svn: 206969	2014-04-23 11:10:55 +00:00
Chandler Carruth	9302fbf0ae	[LCG] Add the first round of mutation support to the lazy call graph. This implements the core functionality necessary to remove an edge from the call graph and correctly update both the basic graph and the SCC structure. As part of that it has to run a tiny (in number of nodes) Tarjan-style DFS walk of an SCC being mutated to compute newly formed SCCs, etc. This is very rough and a WIP. I have a bunch of FIXMEs for code cleanup that will reduce the boilerplate in this change substantially. I also have a bunch of simplifications to various parts of both algorithms that I want to make, but first I'd like to have a more holistic picture. Ideally, I'd also like more testing. I'll probably add quite a few more unit tests as I go here to cover the various different aspects and corner cases of removing edges from the graph. Still, this is, so far, successfully updating the SCC graph in-place without disrupting the identity established for the existing SCCs even when we do challenging things like delete the critical edge that made an SCC cycle at all and have to reform things as a tree of smaller SCCs. Getting this to work is really critical for the new pass manager as it is going to associate significant state with the SCC instance and needs it to be stable. That is also the motivation behind the return of the newly formed SCCs. Eventually, I'll wire this all the way up to the public API so that the pass manager can use it to correctly re-enqueue newly formed SCCs into a fresh postorder traversal. llvm-svn: 206968	2014-04-23 11:03:03 +00:00
James Molloy	029de8b769	[ARM64] Fix formatting. llvm-svn: 206967	2014-04-23 10:50:32 +00:00
Chandler Carruth	cace6623c4	[LCG] Implement Tarjan's algorithm correctly this time. We have to walk up the stack finishing the exploration of each entry's children before we're finished in addition to accounting for their low-links. Added a unittest that really hammers home the need for this with interlocking cycles that would each appear distinct otherwise and crash or compute the wrong result. As part of this, nuke a stale fixme and bring the rest of the implementation still more closely in line with the original algorithm. llvm-svn: 206966	2014-04-23 10:31:17 +00:00
James Molloy	650cb57067	[ARM64] Add a big endian version of the ARM64 target machine, and update all users. This completes the porting of r202024 (cpirker "Add AArch64 big endian Target (aarch64_be)") to ARM64. llvm-svn: 206965	2014-04-23 10:26:40 +00:00
Alexey Volkov	9511327db8	Fixing typos in commit r206957 Differential Revision: http://reviews.llvm.org/D3451 llvm-svn: 206960	2014-04-23 10:20:31 +00:00
Chandler Carruth	d27fc468a7	[LCG] Add some accessor methods to the SCC to allow iterating over the parents of an SCC, and add a lookup method for finding the SCC for a given function. These aren't used yet, but will be used shortly in some unit tests I'm adding and are really part of the broader intended interface for the analysis. llvm-svn: 206959	2014-04-23 09:57:18 +00:00
Alexey Volkov	0e55a99c0f	[X86] Silvermont new scheduler model This model is not final and work is still in progress. However there are substantial improvements on integer tests mainly because of better RAL with new scheduler. Differential Revision: http://reviews.llvm.org/D3451 llvm-svn: 206957	2014-04-23 08:57:09 +00:00
Alexander Musman	f0785f4db4	[LV] Statistics numbers for LoopVectorize introduced: a number of analyzed loops & a number of vectorized loops. Use -stats to see how many loops were analyzed for possible vectorization and how many of them were actually vectorized. Patch by Zinovy Nis Differential Revision: http://reviews.llvm.org/D3438 llvm-svn: 206956	2014-04-23 08:40:37 +00:00
Chandler Carruth	c7bad9a5a0	[LCG] Add a unittest for the LazyCallGraph. I had a weak moment and resisted this for too long. Just with the basic testing here I was able to exercise the analysis in more detail and sift out both type signature bugs in the API and a bug in the DFS numbering. All of these are fixed here as well. The unittests will be much more important for the mutation support where it is necessary to craft minimal mutations and then inspect the state of the graph. There is just no way to do that with a standard FileCheck test. However, unittesting these kinds of analyses is really quite easy, especially as they're designed with the new pass manager where there is essentially no infrastructure required to rig up the core logic and exercise it at an API level. As a minor aside about the DFS numbering bug, the DFS numbering used in LCG is a bit unusual. Rather than numbering from 0, we number from 1, and use 0 as the sentinel "unvisited" state. Other implementations often use '-1' for this, but I find it easier to deal with 0 and it shouldn't make any real difference provided someone doesn't write silly bugs like forgetting to actually initialize the DFS numbering. Oops. ;] llvm-svn: 206954	2014-04-23 08:08:49 +00:00
Elena Demikhovsky	8ac0bf96f0	X86Disassembler - fixed a bug in immediate print llvm-svn: 206953	2014-04-23 07:21:04 +00:00
Stepan Dyatkovskiy	afc364bd51	Integrated assembler, macros: added 'vararg' argument qualifier support. Note, currently we have no 'vararg' support for darwin macros. llvm-svn: 206951	2014-04-23 06:56:28 +00:00
Kevin Qin	a4ee178762	[ARM64] Enable feature predicates for NEON / FP / CRYPTO. AArch64 has feature predicates for NEON, FP and CRYPTO instructions. This allows the compiler to generate code without using FP, NEON or CRYPTO instructions. llvm-svn: 206949	2014-04-23 06:22:48 +00:00
Chandler Carruth	3f9869a8e2	[LCG] Hoist the logic for forming a new SCC from the top of the DFSStack into a helper function. I plan to re-use it for doing incremental DFS-based updates to the SCCs when we mutate the call graph. llvm-svn: 206948	2014-04-23 06:09:03 +00:00
Chandler Carruth	0b623baeb3	[LCG] Switch the Callee sets to be DenseMaps pointing to the index into the Callee list. This is going to be quite important to prevent removal from going quadratic. No functionality changed at this point, this is one of the refactoring patches I've broken out of my initial work toward mutation updates of the call graph. llvm-svn: 206938	2014-04-23 04:00:17 +00:00
Reid Kleckner	feb1148ed6	Fix test/CodeGen/arm.ll The 'CHECK: add' line was occasionally matching against the filename, breaking the subsequent CHECK-NOT. Also use CHECK-LABEL. llvm-svn: 206936	2014-04-23 01:09:29 +00:00
David Blaikie	637cac42ed	Requisite reformatting for previous commit. llvm-svn: 206927	2014-04-22 23:09:36 +00:00
David Blaikie	f9b6a558c8	Push memory ownership of DwarfUnits into clients of DwarfFile. This prompted me to push references through most of DwarfDebug. Sorry for the churn. Honestly it's a bit silly that we're passing around units all over the place like that anyway and I think it's mostly due to the DIE attribute adding utility functions being utilities in DwarfUnit. I should have another go at moving them out of DwarfUnit... llvm-svn: 206925	2014-04-22 22:39:41 +00:00
Sean Silva	3feb690f76	[docs] Add a note to docs/README.txt Added note to docs/README.txt on how to check the reachability of external links in the documentation. Patch by Dan Liew! llvm-svn: 206924	2014-04-22 21:47:53 +00:00
Kevin Enderby	7ee97cebfc	Change the prototype for MCContext::FatalError() so it can be called from places like MCCodeEmitter() in the MC backend when the MCContext is const. I was going to use this in my change for r206669 but Jim convinced me to use an assert there. But this still is a good tweak. llvm-svn: 206923	2014-04-22 21:42:18 +00:00
David Blaikie	c33b3cdb0c	Use std::unique_ptr to handle ownership of DwarfUnits in DwarfFile. So Chandler - how about those range algorithms? (would really love a dereferencing range adapter for this sort of stuff) llvm-svn: 206921	2014-04-22 21:27:37 +00:00
Rui Ueyama	71a26346d3	Whitespace llvm-svn: 206919	2014-04-22 19:52:05 +00:00
Rui Ueyama	17a9a84f5c	No need to check condition after grow() r206916 was not logically the same as the previous code because the goto statements did not create a loop. This should be the same as the previous code. llvm-svn: 206918	2014-04-22 19:47:26 +00:00
Rafael Espindola	3e993d0f42	Follow aliases when determining if a symbol is thumb. This fixes pr19484. llvm-svn: 206917	2014-04-22 19:11:07 +00:00
Rui Ueyama	70bcf4222e	Replace loops using goto with plain while loops Goto statements jumping into previous inner blocks are pretty confusing to read even though in this case they are valid. No reason to not use while loops there. llvm-svn: 206916	2014-04-22 19:07:14 +00:00
Juergen Ributzka	575bcb770a	[Constant Hoisting] Materialize the constant before the cloned cast instruction. In the case where the constant comes from a cloned cast instruction, the materialization code has to go before the cloned cast instruction. This commit fixes the method that finds the materialization insertion point by making it aware of this case. This fixes <rdar://problem/15532441> llvm-svn: 206913	2014-04-22 18:06:58 +00:00
Juergen Ributzka	a1444b39fb	[Constant Hoisting] Print the instructions in the correct order for debugging. No functional change. llvm-svn: 206912	2014-04-22 18:06:51 +00:00
Rafael Espindola	89992b0d6b	Fix DataLayout::operator==(). Patch by Maks Naumov! llvm-svn: 206911	2014-04-22 17:47:03 +00:00
Kevin Enderby	96918bc406	Fix the assembler to print a better relocatable expression error diagnostic that includes location information. Currently if one has this assembly: .quad (0x1234 + (4 * SOME_VALUE)) where SOME_VALUE is undefined one gets the less than useful error message with no location information: % clang -c x.s clang -cc1as: fatal error: error in backend: expected relocatable expression With this fix one now gets a more useful error message with location information: % clang -c x.s x.s:5:8: error: expected relocatable expression .quad (0x1234 + (4 * SOME_VALUE)) ^ To do this I plumbed the SMLoc through the MCObjectStreamer EmitValue() and EmitValueImpl() interfaces so it could be used when creating the MCFixup. rdar://12391022 llvm-svn: 206906	2014-04-22 17:27:29 +00:00
David Blaikie	5f1a001071	Simplify address pool index assignment. llvm-svn: 206905	2014-04-22 17:21:40 +00:00
Matt Arsenault	16353871c3	R600: Emit error instead of unreachable on function call llvm-svn: 206904	2014-04-22 16:42:00 +00:00
Tom Stellard	8d6d449756	R600/SI: Reorganize SIInstructions.td llvm-svn: 206902	2014-04-22 16:33:57 +00:00
Elena Demikhovsky	acc5c9e83e	AVX-512: store and truncstore for i1 values llvm-svn: 206897	2014-04-22 14:13:10 +00:00
NAKAMURA Takumi	7a25ca63ba	Remove DOS CRLF. llvm-svn: 206894	2014-04-22 13:35:50 +00:00
Tim Northover	52d3283026	AArch64/ARM64: more testing from AArch64 to ARM64 llvm-svn: 206889	2014-04-22 12:45:47 +00:00
Tim Northover	a962398a3f	AArch64/ARM64: make use of ANDS and BICS instructions for comparisons. llvm-svn: 206888	2014-04-22 12:45:42 +00:00
Tim Northover	31ebef86b8	AArch64/ARM64: add extra testing from AArch64 to ARM64 llvm-svn: 206887	2014-04-22 12:45:32 +00:00
Lang Hames	64f6ebb8a9	[X86] Require HasBMI2 for the new BZHI tablegen patterns. Evidently tablegen doesn't infer this from the HasBMI2 predicate on the BZHI instructions. This should fix the recent bot failures. llvm-svn: 206885	2014-04-22 12:04:53 +00:00
Robert Khasanov	189e7fdcfb	[AVX512] Implemented integer conversions up/down with masking. Added encoding tests. llvm-svn: 206884	2014-04-22 11:36:19 +00:00
Kostya Serebryany	c9a2c17ad3	[asan] Support outline instrumentation for wide types and delete dead code, patch by Yuri Gribov llvm-svn: 206883	2014-04-22 11:19:45 +00:00
Lang Hames	70fa72d340	[X86] Remove Tablegen def of X86bzhi SDNode: It's not needed as of r206879. llvm-svn: 206880	2014-04-22 10:50:46 +00:00
Lang Hames	3067ab2344	[X86] Use tablegen instead of DAG combines to match BZHI instructions, as suggested by Ben Kramer in review of r206738. Thanks again Ben! llvm-svn: 206879	2014-04-22 10:41:56 +00:00
Matheus Almeida	2852af8a00	[mips] Clang-format MipsAsmParser. No functional changes. llvm-svn: 206878	2014-04-22 10:15:54 +00:00
Tim Northover	2b73e74238	AArch64/ARM64: enable various AArch64 tests on ARM64. llvm-svn: 206877	2014-04-22 10:10:26 +00:00
Tim Northover	00b4ee848f	AArch64/ARM64: add patterns for scalar_to_vector/extract pairs llvm-svn: 206876	2014-04-22 10:10:18 +00:00
Tim Northover	e74fb0d7b9	AArch64/ARM64: mark fmul intrinsic as commutative. This gives DAG patterns matching indexed patterns where either side is an indexed vector. llvm-svn: 206875	2014-04-22 10:10:14 +00:00
Tim Northover	978d25f391	ARM: disable emission of __XYZvfp in soft-float environment. The point of these calls is to allow Thumb-1 code to make use of the VFP unit to perform its operations. This is not desirable with -msoft-float, since most of the reasons you'd want that apply equally to the runtime library. rdar://problem/13766161 llvm-svn: 206874	2014-04-22 10:10:09 +00:00
Hao Liu	c636d15284	Fix an infinite loop bug in DAG Combine about keeping transferring between ANY_EXTEND and SIGN_EXTEND. llvm-svn: 206873	2014-04-22 09:57:06 +00:00
Lang Hames	f6f42cac3f	[X86] Don't use BZHI for short masks (>=32 bits). Thanks to Ben Kramer for the review. llvm-svn: 206869	2014-04-22 07:40:34 +00:00
David Blaikie	afd2c6be0e	Revert "Use value semantics to manage DbgVariables rather than dynamic allocation/pointers." This reverts commit r206780. This commit was regressing gdb.opt/inline-locals.exp in the GDB 7.5 test suite. Reverting until I can fix the issue. llvm-svn: 206867	2014-04-22 05:41:06 +00:00
David Blaikie	2f2021ad31	Use unique_ptr to manage ParsedBinariesAndObjects in LLVMSymbolizer llvm-svn: 206866	2014-04-22 05:26:14 +00:00
Matt Arsenault	a3c8cde77b	R600: Change how vector truncating stores are packed. Don't introduce new operations on an illegal sub 32-bit type. Do the operations on a 32-bit value, and then use a truncating store. llvm-svn: 206864	2014-04-22 04:11:14 +00:00
Matt Arsenault	5dbd5db518	R600: Make sign_extend_inreg legal. Don't know why I didn't just do this in the first place. llvm-svn: 206862	2014-04-22 03:49:30 +00:00
Jiangning Liu	87486e0bac	[AArch64] Enable global merge pass. llvm-svn: 206861	2014-04-22 03:33:26 +00:00
Duncan P. N. Exon Smith	b3380ea60a	blockfreq: Skip irreducible backedges inside functions The branch that skips irreducible backedges was only active when propagating mass at the top-level. In particular, when propagating mass through a loop recognized by `LoopInfo` with irreducible control flow inside, irreducible backedges would not be skipped. Not sure where that idea came from, but the result was that mass was lost until after loop exit. Added a testcase that covers this case. llvm-svn: 206860	2014-04-22 03:31:53 +00:00
Duncan P. N. Exon Smith	d1aec79d7a	blockfreq: Rename PackagedLoops => Loops llvm-svn: 206859	2014-04-22 03:31:50 +00:00

102954 Commits