llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	64941d9786	Convert SelectionDAG::getMergeValues to use ArrayRef. llvm-svn: 207374	2014-04-27 19:20:57 +00:00
Craig Topper	2d7d6052c6	Const-correct SelectionDAG::getAtomic. llvm-svn: 207373	2014-04-27 19:20:47 +00:00
Adrian Prantl	42a0d8c6ef	Clarify the doxygen comment for AsmPrinter::EmitDwarfRegOpPiece and add default arguments to the function. No functional change. llvm-svn: 207372	2014-04-27 18:50:45 +00:00
Adrian Prantl	d34db65c84	Debug info: Refactor EmitDwarfRegOpPiece to be a member function of AsmPrinter. No functional change. http://reviews.llvm.org/D3373 rdar://problem/15928306 llvm-svn: 207369	2014-04-27 18:25:45 +00:00
Rafael Espindola	aa0242723e	Make getOrCreateSymbolData non virtual. llvm-svn: 207367	2014-04-27 17:23:37 +00:00
Saleem Abdulrasool	a8b1f7204b	MC: create X86WinCOFFStreamer for target specific behaviour This introduces a target specific streamer, X86WinCOFFStreamer, which handles the target specific behaviour (e.g. WinEH). This is mostly to ensure that differences between ARM and X86 remain disjoint and do not accidentally cross boundaries. This is the final staging change for enabling object emission for Windows on ARM. llvm-svn: 207344	2014-04-27 03:48:12 +00:00
Saleem Abdulrasool	cf1a29ffee	MC: rename WinCOFFStreamer and move declaration out-of-line This is in preparation for promoting WinCOFFStreamer to a base class which will be shared by the X86 and ARM specific target COFF streamers. Also add a new getOrCreateSymbolData interface (like MCELFStreamer) for the ARM COFF Streamer. This makes the COFFStreamer more similar to the ELFStreamer. llvm-svn: 207343	2014-04-27 03:48:05 +00:00
Chandler Carruth	aa839b22c9	[LCG] Re-organize the methods for mutating a call graph to make their API requirements much more obvious. The key here is that there are two totally different use cases for mutating the graph. Prior to doing any SCC formation, it is very easy to mutate the graph. There may be users that want to do small tweaks here, and then use the already-built graph for their SCC-based operations. This method remains on the graph itself and is documented carefully as being cheap but unavailable once SCCs are formed. Once SCCs are formed, and there is some in-flight DFS building them, we have to be much more careful in how we mutate the graph. These mutation operations are sunk onto the SCCs themselves, which both simplifies things (the code was already there!) and helps make it obvious that these interfaces are only applicable within that context. The other primary constraint is that the edge being mutated is actually related to the SCC on which we call the method. This helps make it obvious that you cannot arbitrarily mutate some other SCC. I've tried to write much more complete documentation for the interesting mutation API -- intra-SCC edge removal. Currently one aspect of this documentation is a lie (the result list of SCCs) but we also don't even have tests for that API. =[ I'm going to add tests and fix it to match the documentation next. llvm-svn: 207339	2014-04-27 01:59:50 +00:00
Chandler Carruth	1129e9cec1	[LCG] Add some pedantry to the use of ptrdiff_t to appease build bots. llvm-svn: 207337	2014-04-26 22:59:28 +00:00
Chandler Carruth	27a5c6713b	[LCG] Eliminate more boiler plate by using the iterator facade base class. llvm-svn: 207336	2014-04-26 22:51:31 +00:00
Chandler Carruth	68ba2085d7	[LCG] Switch the node iterator to use the new fancy adaptor base. This is much cleaner, makes the iterator a full random access iterator, etc. llvm-svn: 207335	2014-04-26 22:43:56 +00:00
Benjamin Kramer	ccf45ebc24	Mark the growing path in SmallVector::push_back as cold. It's vital for performance that the cold path of push_back isn't inlined. llvm-svn: 207331	2014-04-26 20:10:49 +00:00
Craig Topper	206fcd450a	Convert getMemIntrinsicNode to take ArrayRef of SDValue instead of pointer and size. llvm-svn: 207329	2014-04-26 19:29:41 +00:00
Craig Topper	48d114bed1	Convert SelectionDAG::getNode methods to use ArrayRef<SDValue>. llvm-svn: 207327	2014-04-26 18:35:24 +00:00
Craig Topper	963c5d5ef8	Remove an unused version of getMemIntrinsicNode and getNode. Additionally, these were calling makeVTList with the pointers passed in which would were unlikely to belong to SelectionDAG and likely would have just been stack pointers. llvm-svn: 207326	2014-04-26 18:35:13 +00:00
Benjamin Kramer	4dae598bc8	DAGCombiner: Turn divs of vector splats into vectorized multiplications. Otherwise the legalizer would just scalarize everything. Support for mulhi in the targets isn't that great yet so on most targets we get exactly the same scalarized output. Add a test for x86 vector udiv. I had to disable the mulhi nodes on ARM because there aren't any patterns for it. As far as I know ARM has instructions for getting the high part of a multiply so this should be fixed. llvm-svn: 207315	2014-04-26 12:06:28 +00:00
Michael Zolotukhin	1a97a7bcbf	Revert r206749 till a final decision about the intrinsics is made. llvm-svn: 207313	2014-04-26 09:56:41 +00:00
Chandler Carruth	90821c2a93	[LCG] Rather than removing nodes from the SCC entry set when we process them, just skip over any DFS-numbered nodes when finding the next root of a DFS. This allows the entry set to just be a vector as we populate it from a uniqued source. It also removes the possibility for a linear scan of the entry set to actually do the removal which can make things go quadratic if we get unlucky. llvm-svn: 207312	2014-04-26 09:45:55 +00:00
Chandler Carruth	aca48d0443	[LCG] Hoist the main DFS loop out of the edge removal function. This makes working through the worklist much cleaner, and makes it possible to avoid the 'bool-to-continue-the-outer-loop' hack. Not a huge difference, but I think this is approaching as polished as I can make it. llvm-svn: 207310	2014-04-26 09:06:53 +00:00
Chandler Carruth	680af7a78c	[LCG] In the incremental SCC re-formation, lift the node currently being processed in the DFS out of the stack completely. Keep it exclusively in a variable. Re-shuffle some code structure to make this easier. This can have a very dramatic effect in some cases because call graphs tend to look like a high fan-out spanning tree. As a consequence, there are a large number of leaf nodes in the graph, and this technique causes leaf nodes to never even go into the stack. While this only reduces the max depth by 1, it may cause the total number of round trips through the stack to drop by a lot. Now, most of this isn't really relevant for the incremental version. =] But I wanted to prototype it first here as this variant is in ways more complex. As long as I can get the code factored well here, I'll next make the primary walk look the same. There are several refactorings this exposes I think. llvm-svn: 207306	2014-04-26 03:36:42 +00:00
Chandler Carruth	8f92d6db22	[LCG] Refactor the duplicated code I added in my last commit here into a helper function. Also factor the other two places where we did the same thing into the helper function. =] Much cleaner this way. NFC. llvm-svn: 207300	2014-04-26 01:03:46 +00:00
Duncan P. N. Exon Smith	42292ceaa9	Revert "blockfreq: Approximate irreducible control flow" This reverts commit r207286. It causes an ICE on the cmake-llvm-x86_64-linux buildbot [1]: llvm/lib/Analysis/BlockFrequencyInfo.cpp: In lambda function: llvm/lib/Analysis/BlockFrequencyInfo.cpp:182:1: internal compiler error: in get_expr_operands, at tree-ssa-operands.c:1035 [1]: http://bb.pgr.jp/builders/cmake-llvm-x86_64-linux/builds/12093/steps/build_llvm/logs/stdio llvm-svn: 207287	2014-04-25 23:16:58 +00:00
Duncan P. N. Exon Smith	384d0e8ad4	blockfreq: Approximate irreducible control flow Previously, irreducible backedges were ignored. With this commit, irreducible SCCs are discovered on the fly, and modelled as loops with multiple headers. This approximation specifies the headers of irreducible sub-SCCs as its entry blocks and all nodes that are targets of a backedge within it (excluding backedges within true sub-loops). Block frequency calculations act as if we insert a new block that intercepts all the edges to the headers. All backedges and entries to the irreducible SCC point to this imaginary block. This imaginary block has an edge (with even probability) to each header block. The result is now reasonable enough that I've added a number of testcases for irreducible control flow. I've outlined in `BlockFrequencyInfoImpl.h` ways to improve the approximation. <rdar://problem/14292693> llvm-svn: 207286	2014-04-25 23:08:57 +00:00
Tom Roeder	fd1bc602b3	Add an -mattr option to the gold plugin to support subtarget features in LTO This adds support for an -mattr option to the gold plugin and to llvm-lto. This allows the caller to specify details of the subtarget architecture, like +aes, or +ssse3 on x86. Note that this requires a change to the include/llvm-c/lto.h interface: it adds a function lto_codegen_set_attr and it increments the version of the interface. llvm-svn: 207279	2014-04-25 21:46:51 +00:00
Duncan P. N. Exon Smith	9f35117956	SCC: Use the reference typedef Actually use the `reference` typedef, and remove the private redefinition of `pointer` since it has no users. Using `reference` exposes a problem with r207257, which specified the wrong `value_type` to `iterator_facade_base` (fixed that too). llvm-svn: 207270	2014-04-25 20:52:08 +00:00
Adrian Prantl	32da88923a	This reapplies r207235 with an additional bugfixes caught by the msan buildbot - do not insert debug intrinsics before phi nodes. Debug info for optimized code: Support variables that are on the stack and described by DBG_VALUEs during their lifetime. Previously, when a variable was at a FrameIndex for any part of its lifetime, this would shadow all other DBG_VALUEs and only a single fbreg location would be emitted, which in fact is only valid for a small range and not the entire lexical scope of the variable. The included dbg-value-const-byref testcase demonstrates this. This patch fixes this by Local - emitting dbg.value intrinsics for allocas that are passed by reference - dropping all dbg.declares (they are now fully lowered to dbg.values) SelectionDAG - renamed constructors for SDDbgValue for better readability. - fix UserValue::match() to handle indirect values correctly - not inserting an MMI table entries for dbg.values that describe allocas. - lowering dbg.values that describe allocas into indirect DBG_VALUEs. CodeGenPrepare - leaving dbg.values for an alloca were they are (see comment) Other - regenerated/updated instcombine.ll testcase and included source rdar://problem/16679879 http://reviews.llvm.org/D3374 llvm-svn: 207269	2014-04-25 20:49:25 +00:00
David Blaikie	0651d7650a	MCAssembler: Simplify implementation of const variants of getSymbolData by calling one implementation from the other. Code review feedback by Rafael Espindola on r207124. llvm-svn: 207266	2014-04-25 20:19:11 +00:00
Duncan P. N. Exon Smith	da5eaeda01	blockfreq: Further shift logic to LoopData Move a lot of the loop-related logic that was sprinkled around the code into `LoopData`. <rdar://problem/14292693> llvm-svn: 207258	2014-04-25 18:47:04 +00:00
Duncan P. N. Exon Smith	eb6a582d13	SCC: Provide operator->() through iterator_facade_base Use the fancy new `iterator_facade_base` to add `scc_iterator::operator->()`. Remove other definitions where `iterator_facade_base` does the right thing. <rdar://problem/14292693> llvm-svn: 207257	2014-04-25 18:43:41 +00:00
Duncan P. N. Exon Smith	ef86928927	SCC: Remove non-const operator*() <rdar://problem/14292693> llvm-svn: 207254	2014-04-25 18:26:45 +00:00
Duncan P. N. Exon Smith	f4e1d6fd06	SCC: Doxygen-ize comments, NFC <rdar://problem/14292693> llvm-svn: 207251	2014-04-25 18:18:46 +00:00
Adrian Prantl	d2d9b76e48	Revert "This reapplies r207130 with an additional testcase+and a missing check for" This reverts commit 207235 to investigate msan buildbot breakage. llvm-svn: 207250	2014-04-25 18:18:09 +00:00
Duncan P. N. Exon Smith	a16a629ef6	SCC: Un-inline long functions These are long functions that really shouldn't be inlined. Otherwise, no functionality change. <rdar://problem/14292693> llvm-svn: 207249	2014-04-25 18:15:50 +00:00
Duncan P. N. Exon Smith	5547afed78	SCC: Remove redundant inline keywords, NFC Functions declared in line in a class are inlined by default. There's no reason for the `inline` keyword. <rdar://problem/14292693> llvm-svn: 207248	2014-04-25 18:10:23 +00:00
Saleem Abdulrasool	99f0d458c3	ARM: remove @llvm.arm.sevl This intrinsic is no longer needed with the new @llvm.arm.hint(i32) intrinsic which provides a generic, extensible manner for adding hint instructions. This functionality can now be represented as @llvm.arm.hint(i32 5). llvm-svn: 207246	2014-04-25 17:51:25 +00:00
Saleem Abdulrasool	7e7c2f9ca6	ARM: provide a new generic hint intrinsic Introduce the llvm.arm.hint(i32) intrinsic that can be used to inject hints into the instruction stream. This is particularly useful for generating IR from a compiler where the user may inject an intrinsic (e.g. __yield). These are then pattern substituted into the correct instruction which already existed. llvm-svn: 207242	2014-04-25 17:24:24 +00:00
Adrian Prantl	f5834a4b49	This reapplies r207130 with an additional testcase+and a missing check for AllocaInst that was missing in one location. Debug info for optimized code: Support variables that are on the stack and described by DBG_VALUEs during their lifetime. Previously, when a variable was at a FrameIndex for any part of its lifetime, this would shadow all other DBG_VALUEs and only a single fbreg location would be emitted, which in fact is only valid for a small range and not the entire lexical scope of the variable. The included dbg-value-const-byref testcase demonstrates this. This patch fixes this by Local - emitting dbg.value intrinsics for allocas that are passed by reference - dropping all dbg.declares (they are now fully lowered to dbg.values) SelectionDAG - renamed constructors for SDDbgValue for better readability. - fix UserValue::match() to handle indirect values correctly - not inserting an MMI table entries for dbg.values that describe allocas. - lowering dbg.values that describe allocas into indirect DBG_VALUEs. CodeGenPrepare - leaving dbg.values for an alloca were they are (see comment) Other - regenerated/updated instcombine.ll testcase and included source rdar://problem/16679879 http://reviews.llvm.org/D3374 llvm-svn: 207235	2014-04-25 17:01:00 +00:00
Craig Topper	f40110f4d8	[C++] Use 'nullptr'. Transforms edition. llvm-svn: 207196	2014-04-25 05:29:35 +00:00
Duncan P. N. Exon Smith	cb7d29d30c	blockfreq: Only one mass distribution per node Remove the concepts of "forward" and "general" mass distributions, which was wrong. The split might have made sense in an early version of the algorithm, but it's definitely wrong now. <rdar://problem/14292693> llvm-svn: 207195	2014-04-25 04:38:43 +00:00
Duncan P. N. Exon Smith	3f086789ff	blockfreq: Document high-level functions <rdar://problem/14292693> llvm-svn: 207191	2014-04-25 04:38:32 +00:00
Duncan P. N. Exon Smith	71f07451b6	blockfreq: Remove dead code <rdar://problem/14292693> llvm-svn: 207190	2014-04-25 04:38:30 +00:00
Duncan P. N. Exon Smith	46d9a56ce6	blockfreq: Separate unwrapLoops() from finalizeMetrics() <rdar://problem/14292693> llvm-svn: 207185	2014-04-25 04:38:17 +00:00
Duncan P. N. Exon Smith	50a1bb85b8	blockfreq: LoopData::MemberList => NodeList <rdar://problem/14292693> llvm-svn: 207184	2014-04-25 04:38:15 +00:00
Duncan P. N. Exon Smith	c9b7cfea2f	blockfreq: Expose getPackagedNode() Make `getPackagedNode()` a member function of `BlockFrequencyInfoImplBase` so that it's available for templated code. <rdar://problem/14292693> llvm-svn: 207183	2014-04-25 04:38:12 +00:00
Duncan P. N. Exon Smith	1cab8a0708	blockfreq: Store the header with the members <rdar://problem/14292693> llvm-svn: 207182	2014-04-25 04:38:09 +00:00
Duncan P. N. Exon Smith	39cc64827e	blockfreq: Encapsulate LoopData::Header <rdar://problem/14292693> llvm-svn: 207181	2014-04-25 04:38:06 +00:00
Duncan P. N. Exon Smith	4bbaff75e0	blockfreq: Embed Loop hierarchy in LoopData Continue refactoring to make `LoopData` first-class. Here I'm making the `LoopData` hierarchy explicit, instead of bouncing back and forth with `WorkingData`. This simplifies the logic and better matches the `LoopInfo` design. (Eventually, `LoopInfo` should be restructured so that it supports this pass, and `LoopData` can be removed.) <rdar://problem/14292693> llvm-svn: 207180	2014-04-25 04:38:03 +00:00
Duncan P. N. Exon Smith	d132040ed6	blockfreq: Use LoopData directly Instead of passing around loop headers, pass around `LoopData` directly. <rdar://problem/14292693> llvm-svn: 207179	2014-04-25 04:38:01 +00:00
Duncan P. N. Exon Smith	e005c7c496	blockfreq: Stop using range-based for to traverse Loops A follow-up commit will need the actual iterators. <rdar://problem/14292693> llvm-svn: 207178	2014-04-25 04:37:58 +00:00
Duncan P. N. Exon Smith	fc7dc93031	blockfreq: Use a std::list for Loops As pointed out by David Blaikie in code review, a `std::list<T>` is simpler than a `std::vector<std::unique_ptr<T>>`. Another option is a `std::deque<T>` (which allocates in chunks), but I'd like to leave open the option of inserting in the middle of the sequence for handling irreducible control flow on the fly. <rdar://problem/14292693> llvm-svn: 207177	2014-04-25 04:30:06 +00:00
Karthik Bhat	6a48f7d66e	Allow vectorization of bit intrinsics in BB Vectorizer. This patch adds support for vectorization of bit intrinsics such as bswap,ctpop,ctlz,cttz. llvm-svn: 207174	2014-04-25 03:33:48 +00:00
Adrian Prantl	6e5de2ea06	Revert "This reapplies r207130 with an additional testcase+and a missing check for" Typo in testcase. llvm-svn: 207166	2014-04-25 00:42:50 +00:00
Adrian Prantl	3512190ab3	This reapplies r207130 with an additional testcase+and a missing check for AllocaInst that was missing in one location. Debug info for optimized code: Support variables that are on the stack and described by DBG_VALUEs during their lifetime. Previously, when a variable was at a FrameIndex for any part of its lifetime, this would shadow all other DBG_VALUEs and only a single fbreg location would be emitted, which in fact is only valid for a small range and not the entire lexical scope of the variable. The included dbg-value-const-byref testcase demonstrates this. This patch fixes this by Local - emitting dbg.value intrinsics for allocas that are passed by reference - dropping all dbg.declares (they are now fully lowered to dbg.values) SelectionDAG - renamed constructors for SDDbgValue for better readability. - fix UserValue::match() to handle indirect values correctly - not inserting an MMI table entries for dbg.values that describe allocas. - lowering dbg.values that describe allocas into indirect DBG_VALUEs. CodeGenPrepare - leaving dbg.values for an alloca were they are (see comment) Other - regenerated/updated instcombine.ll testcase and included source rdar://problem/16679879 http://reviews.llvm.org/D3374 llvm-svn: 207165	2014-04-25 00:38:40 +00:00
Adrian Prantl	ff4282a204	Revert "Debug info for optimized code: Support variables that are on the stack and" This reverts commit 207130 for buildbot breakage. llvm-svn: 207162	2014-04-25 00:04:49 +00:00
Richard Smith	ab1cb0990d	Add missing include, found by modules build. llvm-svn: 207158	2014-04-24 23:29:25 +00:00
Richard Smith	80429c42ab	Function defined in a header should be inline. Found by modules build. llvm-svn: 207157	2014-04-24 23:14:32 +00:00
Chandler Carruth	d5835ee368	[ADT] Generalize pointee_iterator to smart pointers by using decltype. Based on review feedback from Dave on the original patch. llvm-svn: 207146	2014-04-24 21:10:35 +00:00
Reid Kleckner	3981faecbd	Remove dead inline function that doesn't compile MSVC doesn't diagnose this, interestingly. llvm-svn: 207144	2014-04-24 20:19:22 +00:00
Reid Kleckner	5772b77789	Add 'musttail' marker to call instructions This is similar to the 'tail' marker, except that it guarantees that tail call optimization will occur. It also comes with convervative IR verification rules that ensure that tail call optimization is possible. Reviewers: nicholas Differential Revision: http://llvm-reviews.chandlerc.com/D3240 llvm-svn: 207143	2014-04-24 20:14:34 +00:00
Richard Smith	0d9ec713e7	[modules] "Specialize" a function by actually specializing a function template rather than by adding an overload and hoping that it's declared before the code that calls it. (In a modules build, it isn't.) llvm-svn: 207133	2014-04-24 18:27:29 +00:00
Adrian Prantl	f4223918de	Debug info for optimized code: Support variables that are on the stack and described by DBG_VALUEs during their lifetime. Previously, when a variable was at a FrameIndex for any part of its lifetime, this would shadow all other DBG_VALUEs and only a single fbreg location would be emitted, which in fact is only valid for a small range and not the entire lexical scope of the variable. The included dbg-value-const-byref testcase demonstrates this. This patch fixes this by Local - emitting dbg.value intrinsics for allocas that are passed by reference - dropping all dbg.declares (they are now fully lowered to dbg.values) SelectionDAG - renamed constructors for SDDbgValue for better readability. - fix UserValue::match() to handle indirect values correctly - not inserting an MMI table entries for dbg.values that describe allocas. - lowering dbg.values that describe allocas into indirect DBG_VALUEs. CodeGenPrepare - leaving dbg.values for an alloca were they are (see comment) Other - regenerated/updated instcombine-intrinsics testcase and included source rdar://problem/16679879 http://reviews.llvm.org/D3374 llvm-svn: 207130	2014-04-24 17:41:45 +00:00
Andrea Di Biagio	d1ab866868	[X86] Add support for Read Time Stamp Counter x86 builtin intrinsics. This patch: - Adds two new X86 builtin intrinsics ('int_x86_rdtsc' and 'int_x86_rdtscp') as GCCBuiltin intrinsics; - Teaches the backend how to lower the two new builtins; - Introduces a common function to lower READCYCLECOUNTER dag nodes and the two new rdtsc/rdtscp intrinsics; - Improves (and extends) the existing x86 test 'rdtsc.ll'; now test 'rdtsc.ll' correctly verifies that both READCYCLECOUNTER and the two new intrinsics work fine for both 64bit and 32bit Subtargets. llvm-svn: 207127	2014-04-24 17:18:27 +00:00
David Blaikie	908f4d4bf5	Spread some const around for non-mutating uses of MCSymbolData. I discovered this const-hole while attempting to coalesnce the Symbol and SymbolMap data structures. There's some pending issues with that, but I figured this change was easy to flush early. llvm-svn: 207124	2014-04-24 16:59:40 +00:00
Chandler Carruth	24553934f8	[LCG] Incorporate the core trick of improvements on the naive Tarjan's algorithm here: http://dl.acm.org/citation.cfm?id=177301. The idea of isolating the roots has even more relevance when using the stack not just to implement the DFS but also to implement the recursive step. Because we use it for the recursive step, to isolate the roots we need to maintain two stacks: one for our recursive DFS walk, and another of the nodes that have been walked. The nice thing is that the latter will be half the size. It also fixes a complete hack where we scanned backwards over the stack to find the next potential-root to continue processing. Now that is always the top of the DFS stack. While this is a really nice improvement already (IMO) it further opens the door for two important simplifications: 1) De-duplicating some of the code across the two different walks. I've actually made the duplication a bit worse in some senses with this patch because the two are starting to converge. 2) Dramatically simplifying the loop structures of both walks. I wanted to do those separately as they'll be essentially just CFG restructuring. This patch on the other hand actually uses different datastructures to implement the algorithm itself. llvm-svn: 207098	2014-04-24 11:05:20 +00:00
Chandler Carruth	493e0a6ad0	[LCG] Switch the parent SCC tracking from a SmallSetVector to a SmallPtrSet. Currently, there is no need for stable iteration in this dimension, and I now thing there won't need to be going forward. If this is ever re-introduced in any form, it needs to not be a SetVector based solution because removal cannot be linear. There will be many SCCs with large numbers of parents. When encountering these, the incremental SCC update for intra-SCC edge removal was quadratic due to linear removal (kind of). I'm really hoping we can avoid having an ordering property here at all though... llvm-svn: 207091	2014-04-24 09:22:31 +00:00
Chandler Carruth	d52f8e0e4d	[LCG] We don't actually need a set in each SCC to track the nodes. We can use the node -> SCC mapping in the top-level graph to test this on the rare occasions we need it. llvm-svn: 207090	2014-04-24 08:55:36 +00:00
Chandler Carruth	944b9acddd	[LCG] Switch the SCC's parent iterators to be value iterators rather than pointer iterators. llvm-svn: 207086	2014-04-24 07:48:18 +00:00
Chandler Carruth	3478d4b164	[ADT] Attempt to appease another MSVC oddity by moving the injected class name usage into a context we can put typename on it. llvm-svn: 207084	2014-04-24 06:59:50 +00:00
Craig Topper	353eda484c	[C++] Use 'nullptr'. llvm-svn: 207083	2014-04-24 06:44:33 +00:00
Chandler Carruth	150a5f1dd3	[ADT] Try to appease MSVC by sinking the enable_if from a default template argument to a default argument to the constructor. llvm-svn: 207082	2014-04-24 06:16:12 +00:00
Chandler Carruth	a3211b5dca	Use the shiny new iterator adaptor tool to implement the value_op_iterator. llvm-svn: 207078	2014-04-24 05:33:53 +00:00
Chandler Carruth	2803df5ae6	[ADT] Factor out the facade aspect of the iterator_adaptor_base into its own CRTP base class for more general purpose use. Add some clarifying comments for the exact way in which the adaptor uses it. Hopefully this will help us write increasingly full featured iterators. This is becoming important as they start to be used heavily inside of ranges. llvm-svn: 207072	2014-04-24 04:07:06 +00:00
Chandler Carruth	9a6be8b3b1	[ADT] Add a generic iterator utility for adapting iterators much like Boost's iterator_adaptor, and a specific adaptor which iterates over pointees when wrapped around an iterator over pointers. This is the result of a long discussion on IRC with Duncan Smith, Dave Blaikie, Richard Smith, and myself. Essentially, I could use some subset of the iterator facade facilities often used from Boost, and everyone seemed interested in having the functionality in a reasonably generic form. I've tried to strike a balance between the pragmatism and the established Boost design. The primary differences are: 1) Delegating to the standard iterator interface names rather than special names that then make up a second iterator-like API. 2) Using the name 'pointee_iterator' which seems more clear than 'indirect_iterator'. The whole business of calling the '*p' operation 'pointer indirection' in the standard is ... quite confusing. And 'dereference' is no better of a term for moving from a pointer to a reference. Hoping Duncan, and others continue to provide comments on this until we've got a nice, minimal abstraction. llvm-svn: 207069	2014-04-24 03:31:23 +00:00
Chandler Carruth	6a4fee87bc	[LCG] Normalize the post-order SCC iterator to just iterate over the SCC values rather than having pointers in weird places. llvm-svn: 207053	2014-04-23 23:51:07 +00:00
Chandler Carruth	a800e28818	[LCG] Remove two unused typedefs from the iterators. llvm-svn: 207052	2014-04-23 23:51:02 +00:00
Chandler Carruth	bd5d3082c4	[LCG] Switch the primary node iterator to be a much more normal C++ iterator, returning a Node by reference on dereference. llvm-svn: 207048	2014-04-23 23:34:48 +00:00
Chandler Carruth	2a898e0df6	[LCG] Make the insertion and query paths into the LCG which cannot fail return references to better model this property. No functionality changed. llvm-svn: 207047	2014-04-23 23:20:36 +00:00
Chandler Carruth	a10e240377	[LCG] Switch the SCC lookup to be in terms of call graph nodes rather than functions. So far, this access pattern is much more common. It seems likely that any user of this interface is going to have nodes at the point that they are querying the SCCs. No functionality changed. llvm-svn: 207045	2014-04-23 23:12:06 +00:00
Jordan Rose	001080b375	Use std::less instead of < in array_pod_sort's default comparator. This makes array_pod_sort portably safe to use with pointers. llvm-svn: 207043	2014-04-23 22:44:11 +00:00
Justin Bogner	c67f0250ef	llvm-cov: Add support for gcov's --long-file-names option GCOV provides an option to prepend output file names with the source file name, to disambiguate between covered data that's included from multiple sources. Add a flag to llvm-cov that does the same. llvm-svn: 207035	2014-04-23 21:44:55 +00:00
Rafael Espindola	6992778176	Remove AssemblyAnnotationWriter from NamedMDNode::print. No functionality change, this parameter was always set to nullptr. Patch by Robert Matusewicz! llvm-svn: 206972	2014-04-23 12:23:05 +00:00
Evgeniy Stepanov	0a951b775e	Create MCTargetOptions. For now it contains a single flag, SanitizeAddress, which enables AddressSanitizer instrumentation of inline assembly. Patch by Yuri Gorshenin. llvm-svn: 206971	2014-04-23 11:16:03 +00:00
Simon Atanasyan	62fce0a975	[yaml2obj][ELF] Add a virtual destructor to the ELFYAML::Section class to prevent memory leaks. llvm-svn: 206969	2014-04-23 11:10:55 +00:00
Chandler Carruth	9302fbf0ae	[LCG] Add the first round of mutation support to the lazy call graph. This implements the core functionality necessary to remove an edge from the call graph and correctly update both the basic graph and the SCC structure. As part of that it has to run a tiny (in number of nodes) Tarjan-style DFS walk of an SCC being mutated to compute newly formed SCCs, etc. This is very rough and a WIP. I have a bunch of FIXMEs for code cleanup that will reduce the boilerplate in this change substantially. I also have a bunch of simplifications to various parts of both algorithms that I want to make, but first I'd like to have a more holistic picture. Ideally, I'd also like more testing. I'll probably add quite a few more unit tests as I go here to cover the various different aspects and corner cases of removing edges from the graph. Still, this is, so far, successfully updating the SCC graph in-place without disrupting the identity established for the existing SCCs even when we do challenging things like delete the critical edge that made an SCC cycle at all and have to reform things as a tree of smaller SCCs. Getting this to work is really critical for the new pass manager as it is going to associate significant state with the SCC instance and needs it to be stable. That is also the motivation behind the return of the newly formed SCCs. Eventually, I'll wire this all the way up to the public API so that the pass manager can use it to correctly re-enqueue newly formed SCCs into a fresh postorder traversal. llvm-svn: 206968	2014-04-23 11:03:03 +00:00
Chandler Carruth	cace6623c4	[LCG] Implement Tarjan's algorithm correctly this time. We have to walk up the stack finishing the exploration of each entries children before we're finished in addition to accounting for their low-links. Added a unittest that really hammers home the need for this with interlocking cycles that would each appear distinct otherwise and crash or compute the wrong result. As part of this, nuke a stale fixme and bring the rest of the implementation still more closely in line with the original algorithm. llvm-svn: 206966	2014-04-23 10:31:17 +00:00
Chandler Carruth	d27fc468a7	[LCG] Add some accessor methods to the SCC to allow iterating over the parents of an SCC, and add a lookup method for finding the SCC for a given function. These aren't used yet, but will be used shortly in some unit tests I'm adding and are really part of the broader intended interface for the analysis. llvm-svn: 206959	2014-04-23 09:57:18 +00:00
Chandler Carruth	c7bad9a5a0	[LCG] Add a unittest for the LazyCallGraph. I had a weak moment and resisted this for too long. Just with the basic testing here I was able to exercise the analysis in more detail and sift out both type signature bugs in the API and a bug in the DFS numbering. All of these are fixed here as well. The unittests will be much more important for the mutation support where it is necessary to craft minimal mutations and then inspect the state of the graph. There is just no way to do that with a standard FileCheck test. However, unittesting these kinds of analyses is really quite easy, especially as they're designed with the new pass manager where there is essentially no infrastructure required to rig up the core logic and exercise it at an API level. As a minor aside about the DFS numbering bug, the DFS numbering used in LCG is a bit unusual. Rather than numbering from 0, we number from 1, and use 0 as the sentinel "unvisited" state. Other implementations often use '-1' for this, but I find it easier to deal with 0 and it shouldn't make any real difference provided someone doesn't write silly bugs like forgetting to actually initialize the DFS numbering. Oops. ;] llvm-svn: 206954	2014-04-23 08:08:49 +00:00
Chandler Carruth	3f9869a8e2	[LCG] Hoist the logic for forming a new SCC from the top of the DFSStack into a helper function. I plan to re-use it for doing incremental DFS-based updates to the SCCs when we mutate the call graph. llvm-svn: 206948	2014-04-23 06:09:03 +00:00
Chandler Carruth	0b623baeb3	[LCG] Switch the Callee sets to be DenseMaps pointing to the index into the Callee list. This is going to be quite important to prevent removal from going quadratic. No functionality changed at this point, this is one of the refactoring patches I've broken out of my initial work toward mutation updates of the call graph. llvm-svn: 206938	2014-04-23 04:00:17 +00:00
Kevin Enderby	7ee97cebfc	Change the prototype for MCContext::FatalError() so it can be called from places like MCCodeEmitter() in the MC backend when the MCContext is const. I was going to use this in my change for r206669 but Jim convinced me to use an assert there. But this still is a good tweak. llvm-svn: 206923	2014-04-22 21:42:18 +00:00
Rui Ueyama	71a26346d3	Whitespace llvm-svn: 206919	2014-04-22 19:52:05 +00:00
Rui Ueyama	17a9a84f5c	No need to check condition after grow() r206916 was not logically the same as the previous code because the goto statements did not create loop. This should be the same as the previous code. llvm-svn: 206918	2014-04-22 19:47:26 +00:00
Rui Ueyama	70bcf4222e	Replace loops using goto with plain while loops Goto statements jumping into previous inner blocks are pretty confusing to read even though in this case they are valid. No reason to not use while loops there. llvm-svn: 206916	2014-04-22 19:07:14 +00:00
Kevin Enderby	96918bc406	Fix the assembler to print a better relocatable expression error diagnostic that includes location information. Currently if one has this assembly: .quad (0x1234 + (4 * SOME_VALUE)) where SOME_VALUE is undefined ones gets the less than useful error message with no location information: % clang -c x.s clang -cc1as: fatal error: error in backend: expected relocatable expression With this fix one now gets a more useful error message with location information: % clang -c x.s x.s:5:8: error: expected relocatable expression .quad (0x1234 + (4 * SOME_VALUE)) ^ To do this I plumbed the SMLoc through the MCObjectStreamer EmitValue() and EmitValueImpl() interfaces so it could be used when creating the MCFixup. rdar://12391022 llvm-svn: 206906	2014-04-22 17:27:29 +00:00
Tim Northover	e74fb0d7b9	AArch64/ARM64: mark fmul intrinsic as commutative. This gives DAG patterns matching indexed patterns where either side is an indexed vector. llvm-svn: 206875	2014-04-22 10:10:14 +00:00
Duncan P. N. Exon Smith	d1aec79d7a	blockfreq: Rename PackagedLoops => Loops llvm-svn: 206859	2014-04-22 03:31:50 +00:00
Duncan P. N. Exon Smith	2984a64bae	blockfreq: Use a pointer for ContainingLoop too llvm-svn: 206858	2014-04-22 03:31:44 +00:00
Duncan P. N. Exon Smith	e1423639bb	blockfreq: Use pointers to loops instead of an index Store pointers directly to loops inside the nodes. This could have been done without changing the type stored in `std::vector<>`. However, rather than computing the number of loops before constructing them (which `LoopInfo` doesn't provide directly), I've switched to a `vector<unique_ptr<LoopData>>`. This adds some heap overhead, but the number of loops is typically small. llvm-svn: 206857	2014-04-22 03:31:37 +00:00
Duncan P. N. Exon Smith	cc88ebfa5f	blockfreq: Rename PackagedLoopData => LoopData No functionality change. llvm-svn: 206855	2014-04-22 03:31:31 +00:00
Duncan P. N. Exon Smith	f2eb5bc3ff	blockfreq: Move PackagedLoopData above WorkingData llvm-svn: 206854	2014-04-22 03:31:25 +00:00
Duncan P. N. Exon Smith	84749e52a3	blockfreq: Remove "dead" comment llvm-svn: 206853	2014-04-22 03:31:23 +00:00
Chandler Carruth	1b9dde087e	[Modules] Remove potential ODR violations by sinking the DEBUG_TYPE define below all header includes in the lib/CodeGen/... tree. While the current modules implementation doesn't check for this kind of ODR violation yet, it is likely to grow support for it in the future. It also removes one layer of macro pollution across all the included headers. Other sub-trees will follow. llvm-svn: 206837	2014-04-22 02:02:50 +00:00
Rui Ueyama	97d484342c	Fix wrong iterator type ELFEntityIterator does not implement RandomAccessIterator. It does not even implement BidirectionalIterator. This patch fixes LLD build issue when compiled with MSVC2013 with debug: MSVC's find_if checks if the start iterator is before the end iterator in the sense of operator< if it declares implementing RandomAccessIterator. If a class does not have operator<, it fails to compile. llvm-svn: 206825	2014-04-21 23:00:42 +00:00
Chandler Carruth	e96dd8975f	[Modules] Make Support/Debug.h modular. This requires it to not change behavior based on other files defining DEBUG_TYPE, which means it cannot define DEBUG_TYPE at all. This is actually better IMO as it forces folks to define relevant DEBUG_TYPEs for their files. However, it requires all files that currently use DEBUG(...) to define a DEBUG_TYPE if they don't already. I've updated all such files in LLVM and will do the same for other upstream projects. This still leaves one important change in how LLVM uses the DEBUG_TYPE macro going forward: we need to only define the macro after header files have been #include-ed. Previously, this wasn't possible because Debug.h required the macro to be pre-defined. This commit removes that. By defining DEBUG_TYPE after the includes two things are fixed: - Header files that need to provide a DEBUG_TYPE for some inline code can do so by defining the macro before their inline code and undef-ing it afterward so the macro does not escape. - We no longer have rampant ODR violations due to including headers with different DEBUG_TYPE definitions. This may be mostly an academic violation today, but with modules these types of violations are easy to check for and potentially very relevant. Where necessary to suppor headers with DEBUG_TYPE, I have moved the definitions below the includes in this commit. I plan to move the rest of the DEBUG_TYPE macros in LLVM in subsequent commits; this one is big enough. The comments in Debug.h, which were hilariously out of date already, have been updated to reflect the recommended practice going forward. llvm-svn: 206822	2014-04-21 22:55:11 +00:00
David Blaikie	09757491d6	Use unique_ptr to manage ownership of GCOVFunctions, Blocks, and Edges. llvm-svn: 206796	2014-04-21 21:40:16 +00:00
David Blaikie	422b93dcf1	Use unique_ptr to manage objects owned by the ScheduleDAGMI. llvm-svn: 206784	2014-04-21 20:32:32 +00:00
Yi Jiang	d069f6393a	ARM64: Combine shifts and uses from different basic block to bit-extract instruction llvm-svn: 206774	2014-04-21 19:34:27 +00:00
Duncan P. N. Exon Smith	254689fcf9	blockfreq: Some cleanup of UnsignedFloat Change `PositiveFloat` to `UnsignedFloat`, and fix some of the comments to indicate that it's disappearing eventually. llvm-svn: 206771	2014-04-21 18:31:58 +00:00
Jim Grosbach	81ab4cc97a	Tidy up. Remove extraneous typedef. llvm-svn: 206768	2014-04-21 18:10:29 +00:00
Jim Grosbach	c5c881ee82	Object: iterator_range accessors for ObjectImage symbols and sections. llvm-svn: 206767	2014-04-21 18:10:26 +00:00
Duncan P. N. Exon Smith	10be9a8868	Reapply "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commit r206707, reapplying r206704. The preceding commit to CalcSpillWeights should have sorted out the failing buildbots. <rdar://problem/14292693> llvm-svn: 206766	2014-04-21 17:57:07 +00:00
Rafael Espindola	6956b1a517	Convert getFileOffset to getOffset and move it to its only user. We normally don't drop functions from the C API's, but in this case I think we can: * The old implementation of getFileOffset was fairly broken * The introduction of LLVMGetSymbolFileOffset was itself a C api breaking change as it removed LLVMGetSymbolOffset. * It is an incredibly specialized use case. The only reason MCJIT needs it is because of its odd position of being a dynamic linker of .o files. llvm-svn: 206750	2014-04-21 13:45:32 +00:00
Michael Zolotukhin	f2ba994bf6	Reapply r206732. This time without optimization of branches. llvm-svn: 206749	2014-04-21 12:01:33 +00:00
Chandler Carruth	572e3407c3	[PM] Add a new-PM-style CGSCC pass manager using the newly added LazyCallGraph analysis framework. Wire it up all the way through the opt driver and add some very basic testing that we can build pass pipelines including these components. Still a lot more to do in terms of testing that all of this works, but the basic pieces are here. There is a lot of boiler plate here. It's something I'm going to actively look at reducing, but I don't have any immediate ideas that don't end up making the code terribly complex in order to fold away the boilerplate. Until I figure out something to minimize the boilerplate, almost all of this is based on the code for the existing pass managers, copied and heavily adjusted to suit the needs of the CGSCC pass management layer. The actual CG management still has a bunch of FIXMEs in it. Notably, we don't do any updating of the CG as it is potentially invalidated. I wanted to get this in place to motivate the new analysis, and add update APIs to the analysis and the pass management layers in concert to make sure that the right APIs are present. llvm-svn: 206745	2014-04-21 11:12:00 +00:00
Benjamin Kramer	d2da720ead	[C++11] Replace OwningPtr with std::unique_ptr in places where it doesn't break the API. No functionality change. llvm-svn: 206740	2014-04-21 09:34:48 +00:00
Chandler Carruth	a2533a7bef	Revert r206732 which is causing llc to crash on most of the build bots. Original commit message: Implement builtins for safe division: safe.sdiv.iN, safe.udiv.iN, safe.srem.iN, safe.urem.iN (iN = i8, i61, i32, or i64). llvm-svn: 206735	2014-04-21 07:11:15 +00:00
Michael Zolotukhin	137a84616c	Implement builtins for safe division: safe.sdiv.iN, safe.udiv.iN, safe.srem.iN, safe.urem.iN (iN = i8, i16, i32, or i64). llvm-svn: 206732	2014-04-21 05:33:09 +00:00
David Blaikie	e9907ba16e	Protect the ArgList dtor It could even be made non-virtual if it weren't for bad compiler warnings. This demonstrates that ArgList objects aren't destroyed polymorphically and possibly that they aren't even used polymorphically. If that's the case, it might be possible to refactor the two ArgList types more separately and simplify the Arg ownership model. continues experimenting llvm-svn: 206727	2014-04-20 23:59:00 +00:00
Richard Smith	c5d5340eeb	Add missing #include found by modules build. llvm-svn: 206726	2014-04-20 23:39:19 +00:00
David Blaikie	f6e403f3c8	Remove comment that hasn't been true for 5 years llvm-svn: 206725	2014-04-20 22:40:43 +00:00
David Blaikie	f70b21a4b8	Use unique_ptr to handle ownership of synthesized args in DerivedArgList This might be able to be simplified further by using Arg as a value type in a linked list (to maintain pointer validity), but here's something simple to start with. llvm-svn: 206724	2014-04-20 22:37:46 +00:00
Simon Atanasyan	f54f8ff094	[Mips] Add more special values for the st_other field in the symbol table entry for MIPS. llvm-svn: 206716	2014-04-20 21:05:36 +00:00
Duncan P. N. Exon Smith	e63327e967	Revert "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commit r206704, as expected. llvm-svn: 206707	2014-04-19 22:46:00 +00:00
Duncan P. N. Exon Smith	875ddfac75	Reapply "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commit r206677, reapplying my BlockFrequencyInfo rewrite. I've done a careful audit, added some asserts, and fixed a couple of bugs (unfortunately, they were in unlikely code paths). There's a small chance that this will appease the failing bots [1][2]. (If so, great!) If not, I have a follow-up commit ready that will temporarily add -debug-only=block-freq to the two failing tests, allowing me to compare the code path between what the failing bots and what my machines (and the rest of the bots) are doing. Once I've triggered those builds, I'll revert both commits so the bots go green again. [1]: http://bb.pgr.jp/builders/ninja-x64-msvc-RA-centos6/builds/1816 [2]: http://llvm-amd64.freebsd.your.org/b/builders/clang-i386-freebsd/builds/18445 <rdar://problem/14292693> llvm-svn: 206704	2014-04-19 22:34:26 +00:00
Yaron Keren	d7ba46b287	Patch by Vadim Chugunov Win64 stack unwinder gets confused when execution flow "falls through" after a call to 'noreturn' function. This fixes the "missing epilogue" problem by emitting a trap instruction for IR 'unreachable' on x86_x64-pc-windows. A secondary use for it would be for anyone wanting to make double-sure that 'noreturn' functions, indeed, do not return. llvm-svn: 206684	2014-04-19 13:47:43 +00:00
Duncan P. N. Exon Smith	76b813619a	Revert "blockfreq: Rewrite BlockFrequencyInfoImpl" (#2 ) This reverts commit r206666, as planned. Still stumped on why the bots are failing. Sanitizer bots haven't turned anything up. If anyone can help me debug either of the failures (referenced in r206666) I'll owe them a beer. (In the meantime, I'll be auditing my patch for undefined behaviour.) llvm-svn: 206677	2014-04-19 00:42:46 +00:00
Justin Bogner	e808171628	OnDiskHashTable: Audit types and use offset_type consistently llvm-svn: 206675	2014-04-19 00:33:15 +00:00
Justin Bogner	4435e4157a	ProfileData: Avoid UB when reading llvm-svn: 206674	2014-04-19 00:33:12 +00:00
Justin Bogner	4bc13f6b47	OnDiskHashTable: Fix a think-o with offset_type llvm-svn: 206672	2014-04-18 23:50:07 +00:00
Duncan P. N. Exon Smith	b3caf3646f	Reapply "blockfreq: Rewrite BlockFrequencyInfoImpl" (#2 ) This reverts commit r206628, reapplying r206622 (and r206626). Two tests are failing only on buildbots [1][2]: i.e., I can't reproduce on Darwin, and Chandler can't reproduce on Linux. Asan and valgrind don't tell us anything, but we're hoping the msan bot will catch it. So, I'm applying this again to get more feedback from the bots. I'll leave it in long enough to trigger builds in at least the sanitizer buildbots (it was failing for reasons unrelated to my commit last time it was in), and hopefully a few others.... and then I expect to revert a third time. [1]: http://bb.pgr.jp/builders/ninja-x64-msvc-RA-centos6/builds/1816 [2]: http://llvm-amd64.freebsd.your.org/b/builders/clang-i386-freebsd/builds/18445 llvm-svn: 206666	2014-04-18 22:30:03 +00:00
Justin Bogner	b5d368e838	ProfileData: Don't forward declare ComputeHash and make it static inline llvm-svn: 206663	2014-04-18 22:00:22 +00:00
Justin Bogner	b7aa26303b	ProfileData: Add support for the indexed instrprof format This adds support for an indexed instrumentation based profiling format, which is just a small header and an on disk hash table. This format will be used by clang's -fprofile-instr-use= for PGO. llvm-svn: 206656	2014-04-18 21:48:40 +00:00
Alexey Samsonov	d010999abe	[DWARF parser] Turn DILineInfo into a struct. Immutable DILineInfo doesn't bring any benefits and complicates code. Also, use std::string instead of SmallString<16> for file and function names - their length can vary significantly. No functionality change. llvm-svn: 206654	2014-04-18 21:36:39 +00:00
Justin Bogner	12d6c3b4d7	OnDiskHashTable: Expect the Info type to declare the offset type This changes the on-disk hash to get the type to use for offsets from the Info type, so that clients can be more flexible with the size of table they support. llvm-svn: 206643	2014-04-18 20:39:46 +00:00
Justin Bogner	8b56488749	OnDiskHashTable: Expect the Info type to declare the hash size This changes the on-disk hash to get the size of a hash value from the Info type, so that clients can be more flexible with the types of hash they use. llvm-svn: 206642	2014-04-18 20:39:43 +00:00
Benjamin Kramer	147644d400	Remove a couple of redundant copies of SmallVector::operator==. No functionality change. llvm-svn: 206635	2014-04-18 19:48:03 +00:00
David Blaikie	583a31c976	Add range access to MCAssembler's symbol collection. llvm-svn: 206631	2014-04-18 18:24:25 +00:00
Reid Kleckner	d861811e46	Update comment in LLVMBitCodes.h to reflect the actual bitcode record llvm-svn: 206630	2014-04-18 18:19:18 +00:00
Matt Arsenault	3add036dc7	Fix uint -> size_t conversion warning. This warning is disabled for the LLVM build, but external users of the header can still run into this. Patch by Ke Bai llvm-svn: 206629	2014-04-18 18:08:31 +00:00
Duncan P. N. Exon Smith	0842ff36a6	Revert "blockfreq: Rewrite BlockFrequencyInfoImpl" (#2 ) This reverts commit r206622 and the MSVC fixup in r206626. Apparently the remotely failing tests are still failing, despite my attempt to fix the nondeterminism in r206621. llvm-svn: 206628	2014-04-18 17:56:08 +00:00
Duncan P. N. Exon Smith	f8361d127a	Reapply "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commit r206556, effectively reapplying commit r206548 and its fixups in r206549 and r206550. In an intervening commit I've added target triples to the tests that were failing remotely [1] (but passing locally). I'm hoping the mystery is solved? I'll revert this again if the tests are still failing remotely. [1]: http://bb.pgr.jp/builders/ninja-x64-msvc-RA-centos6/builds/1816 llvm-svn: 206622	2014-04-18 17:22:25 +00:00
Benjamin Kramer	889873d890	LineIterator: Add DataTypes.h for int64_t on MSVC. llvm-svn: 206617	2014-04-18 16:57:01 +00:00
Benjamin Kramer	753a7ab8c5	Add some missing includes for various standard library implementations. llvm-svn: 206616	2014-04-18 16:46:29 +00:00
Benjamin Kramer	e677b8dab1	Make the copy member of StringRef/ArrayRef generic wrt allocators. Doesn't make sense to restrict this to BumpPtrAllocator. While there replace an explicit loop with std::equal. Some standard libraries know how to compile this down to a ::memcmp call if possible. llvm-svn: 206615	2014-04-18 16:36:15 +00:00
Benjamin Kramer	29af3ed457	Allocator: Remove ReferenceAdder hack. This was a workaround for compilers that had issues with reference collapsing. llvm-svn: 206612	2014-04-18 14:54:51 +00:00
Chandler Carruth	d8d865e266	[LCG] Remove all of the complexity stemming from supporting copying. Reality is that we're never going to copy one of these. Supporting this was becoming a nightmare because nothing even causes it to compile most of the time. Lots of subtle errors built up that wouldn't have been caught by any "normal" testing. Also, make the move assignment actually work rather than the bogus swap implementation that would just infloop if used. As part of that, factor out the graph pointer updates into a helper to share between move construction and move assignment. llvm-svn: 206583	2014-04-18 11:02:33 +00:00
Chandler Carruth	54125a2ba8	[Allocator] Fix an obvious think-o with the move assignment implementation of the SpecificBumpPtrAllocator -- we have to actually move the subobject. =] Noticed when using this code more directly. llvm-svn: 206582	2014-04-18 11:02:29 +00:00
Chandler Carruth	18eadd9260	[LCG] Add support for building persistent and connected SCCs to the LazyCallGraph. This is the start of the whole point of this different abstraction, but it is just the initial bits. Here is a run-down of what's going on here. I'm planning to incorporate some (or all) of this into comments going forward, hopefully with better editing and wording. =] The crux of the problem with the traditional way of building SCCs is that they are ephemeral. The new pass manager however really needs the ability to associate analysis passes and results of analysis passes with SCCs in order to expose these analysis passes to the SCC passes. Making this work is kind-of the whole point of the new pass manager. =] So, when we're building SCCs for the call graph, we actually want to build persistent nodes that stick around and can be reasoned about later. We'd also like the ability to walk the SCC graph in more complex ways than just the traditional postorder traversal of the current CGSCC walk. That means that in addition to being persistent, the SCCs need to be connected into a useful graph structure. However, we still want the SCCs to be formed lazily where possible. These constraints are quite hard to satisfy with the SCC iterator. Also, using that would bypass our ability to actually add data to the nodes of the call graph to facilite implementing the Tarjan walk. So I've re-implemented things in a more direct and embedded way. This immediately makes it easy to get the persistence and connectivity correct, and it also allows leveraging the existing nodes to simplify the algorithm. I've worked somewhat to make this implementation more closely follow the traditional paper's nomenclature and strategy, although it is still a bit obtuse because it isn't recursive, using an explicit stack and a tail call instead, and it is interruptable, resuming each time we need another SCC. The other tricky bit here, and what actually took almost all the time and trials and errors I spent building this, is exactly what graph structure to build for the SCCs. The naive thing to build is the call graph in its newly acyclic form. I wrote about 4 versions of this which did precisely this. Inevitably, when I experimented with them across various use cases, they became incredibly awkward. It was all implementable, but it felt like a complete wrong fit. Square peg, round hole. There were two overriding aspects that pushed me in a different direction: 1) We want to discover the SCC graph in a postorder fashion. That means the root node will be the last node we find. Using the call-SCC DAG as the graph structure of the SCCs results in an orphaned graph until we discover a root. 2) We will eventually want to walk the SCC graph in parallel, exploring distinct sub-graphs independently, and synchronizing at merge points. This again is not helped by the call-SCC DAG structure. The structure which, quite surprisingly, ended up being completely natural to use is the inverse of the call-SCC DAG. We add the leaf SCCs to the graph as "roots", and have edges to the caller SCCs. Once I switched to building this structure, everything just fell into place elegantly. Aside from general cleanups (there are FIXMEs and too few comments overall) that are still needed, the other missing piece of this is support for iterating across levels of the SCC graph. These will become useful for implementing #2, but they aren't an immediate priority. Once SCCs are in good shape, I'll be working on adding mutation support for incremental updates and adding the pass manager that this analysis enables. llvm-svn: 206581	2014-04-18 10:50:32 +00:00
Lang Hames	bc876017c2	[ExecutionEngine] Allow JIT clients to enable/disable module verification. Previously module verification was always enabled, with no way to turn it off. As of this commit, module verification is on by default in Debug builds, and off by default in release builds. The default behaviour can be overridden by calling setVerifyModules(bool) on the JIT instance (this works for both the old JIT, and MCJIT). <rdar://problem/16150008> llvm-svn: 206561	2014-04-18 06:48:23 +00:00
Duncan P. N. Exon Smith	e576167df8	Revert "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commits r206548, r206549 and r206549. There are some unit tests failing that aren't failing locally [1], so reverting until I have time to investigate. [1]: http://bb.pgr.jp/builders/ninja-x64-msvc-RA-centos6/builds/1816 llvm-svn: 206556	2014-04-18 02:17:43 +00:00
Justin Bogner	e877eda50d	OnDiskHashTable: Provide iterator_range for keys and data llvm-svn: 206555	2014-04-18 02:10:26 +00:00
Duncan P. N. Exon Smith	12e68e1733	blockfreq: Rewrite BlockFrequencyInfoImpl Rewrite the shared implementation of BlockFrequencyInfo and MachineBlockFrequencyInfo entirely. The old implementation had a fundamental flaw: precision losses from nested loops (or very wide branches) compounded past loop exits (and convergence points). The @nested_loops testcase at the end of test/Analysis/BlockFrequencyAnalysis/basic.ll is motivating. This function has three nested loops, with branch weights in the loop headers of 1:4000 (exit:continue). The old analysis gives non-sensical results: Printing analysis 'Block Frequency Analysis' for function 'nested_loops': ---- Block Freqs ---- entry = 1.0 for.cond1.preheader = 1.00103 for.cond4.preheader = 5.5222 for.body6 = 18095.19995 for.inc8 = 4.52264 for.inc11 = 0.00109 for.end13 = 0.0 The new analysis gives correct results: Printing analysis 'Block Frequency Analysis' for function 'nested_loops': block-frequency-info: nested_loops - entry: float = 1.0, int = 8 - for.cond1.preheader: float = 4001.0, int = 32007 - for.cond4.preheader: float = 16008001.0, int = 128064007 - for.body6: float = 64048012001.0, int = 512384096007 - for.inc8: float = 16008001.0, int = 128064007 - for.inc11: float = 4001.0, int = 32007 - for.end13: float = 1.0, int = 8 Most importantly, the frequency leaving each loop matches the frequency entering it. The new algorithm leverages BlockMass and PositiveFloat to maintain precision, separates "probability mass distribution" from "loop scaling", and uses dithering to eliminate probability mass loss. I have unit tests for these types out of tree, but it was decided in the review to make the classes private to BlockFrequencyInfoImpl, and try to shrink them (or remove them entirely) in follow-up commits. The new algorithm should generally have a complexity advantage over the old. The previous algorithm was quadratic in the worst case. The new algorithm is still worst-case quadratic in the presence of irreducible control flow, but it's linear without it. The key difference between the old algorithm and the new is that control flow within a loop is evaluated separately from control flow outside, limiting propagation of precision problems and allowing loop scale to be calculated independently of mass distribution. Loops are visited bottom-up, their loop scales are calculated, and they are replaced by pseudo-nodes. Mass is then distributed through the function, which is now a DAG. Finally, loops are revisited top-down to multiply through the loop scales and the masses distributed to pseudo nodes. There are some remaining flaws. - Irreducible control flow isn't modelled correctly. LoopInfo and MachineLoopInfo ignore irreducible edges, so this algorithm will fail to scale accordingly. There's a note in the class documentation about how to get closer. See also the comments in test/Analysis/BlockFrequencyInfo/irreducible.ll. - Loop scale is limited to 4096 per loop (2^12) to avoid exhausting the 64-bit integer precision used downstream. - The "bias" calculation proposed on llvmdev is not incorporated here. This will be added in a follow-up commit, once comments from this review have been handled. llvm-svn: 206548	2014-04-18 01:57:45 +00:00
Duncan P. N. Exon Smith	49f3ec80c2	PMBuilder: Expose an option to disable tail calls Adds API to allow frontends to disable tail calls in PassManagerBuilder. <rdar://problem/16050591> llvm-svn: 206542	2014-04-18 01:05:15 +00:00
Diego Novillo	0915c047c2	Fix bug 19437 - Only add discriminators for DWARF 4 and above. Summary: This prevents the discriminator generation pass from triggering if the DWARF version being used in the module is prior to 4. Reviewers: echristo, dblaikie CC: llvm-commits Differential Revision: http://reviews.llvm.org/D3413 llvm-svn: 206507	2014-04-17 22:33:50 +00:00
Tim Northover	0129f298c4	ARM64: add acquire/release versions of the existing atomic intrinsics. These will be needed to support IR-level lowering of atomic operations. llvm-svn: 206489	2014-04-17 20:00:24 +00:00
Tim Northover	037f26f212	Atomics: promote ARM's IR-based atomics pass to CodeGen. Still only 32-bit ARM using it at this stage, but the promotion allows direct testing via opt and is a reasonably self-contained patch on the way to switching ARM64. At this point, other targets should be able to make use of it without too much difficulty if they want. (See ARM64 commit coming soon for an example). llvm-svn: 206485	2014-04-17 18:22:47 +00:00
NAKAMURA Takumi	cd1fc4bc1b	Inliner::OptimizationRemark: Fix crash in clang/test/Frontend/optimization-remark.c on some hosts, including --vg. DebugLoc in Callsite would not live after Inliner. It should be copied before Inliner. llvm-svn: 206459	2014-04-17 12:22:14 +00:00
Chandler Carruth	7e107dabd6	[LCG] Remove a dead declaration. This stopped being used when I switched to a more normal move operation on the graph itself. The definition already got removed, but I missed the declaration. llvm-svn: 206455	2014-04-17 09:41:54 +00:00
Chandler Carruth	b5f938dc00	[LCG] Move the call graph node class into the graph class's definition. This will become necessary to build up the SCC iterators and SCC definitions. Moving it now so that subsequent diffs are incremental. llvm-svn: 206454	2014-04-17 09:40:13 +00:00
Chandler Carruth	4c1b05f822	Make the User::value_op_iterator a random access iterator. I had written this code ages ago and lost track of it. Seems worth doing though -- this thing can get called from places that would benefit from knowing that std::distance is O(1). Also add a very fledgeling unittest for Users and make sure various aspects of this seem to work reasonably. llvm-svn: 206453	2014-04-17 09:07:50 +00:00
Chandler Carruth	b60cb315bc	[LCG] Just move the allocator (now that we can) when moving a call graph. This simplifies the custom move constructor operation to one of walking the graph and updating the 'up' pointers to point to the new location of the graph. Switch the nodes from a reference to a pointer for the 'up' edge to facilitate this. llvm-svn: 206450	2014-04-17 07:25:59 +00:00
Chandler Carruth	81f497d176	[LCG] Remove the Module reference member which we weren't using for anything and doesn't make sense if assigning. llvm-svn: 206449	2014-04-17 07:22:19 +00:00
Chandler Carruth	75f2ca4787	[Allocator] Make SpecificBumpPtrAllocator also movable and move assignable. llvm-svn: 206448	2014-04-17 07:08:56 +00:00
Justin Bogner	033135c5eb	Support: Move OnDiskHashTable from clang to llvm This introduces clang's Basic/OnDiskHashTable.h into llvm as Support/OnDiskHashTable.h. I've taken the opportunity to add doxygen comments and run the file through clang-format, but other than the namespace changing from clang:: to llvm:: the API is identical. llvm-svn: 206438	2014-04-17 02:16:53 +00:00
Jim Grosbach	6623e7f94a	[c++11] Tidy up AsmPrinter.cpp. Range'ify loops and tidy up some by-reference handling. No functional change. llvm-svn: 206422	2014-04-16 22:38:02 +00:00
Jim Grosbach	4800dff007	iterator_range for machine block terminators. llvm-svn: 206421	2014-04-16 22:37:58 +00:00
Tom Stellard	1580dc78ae	Added new functionality to LLVM C API to use DiagnosticInfo to handle errors Patch by: Darren Powell llvm-svn: 206407	2014-04-16 17:45:04 +00:00
Matheus Almeida	0051f2dc78	[mips] Add initial support for NaN2008 in the back-end. This is so that EF_MIPS_NAN2008 is set if we are using IEEE 754-2008 NaN encoding (-mnan=2008). This patch also adds support for parsing '.nan legacy' and '.nan 2008' assembly directives. The handling of these directives should match GAS' behaviour i.e., the last directive in use sets the ELF header bit (EF_MIPS_NAN2008). Differential Revision: http://reviews.llvm.org/D3346 llvm-svn: 206396	2014-04-16 15:48:55 +00:00
Chandler Carruth	eacd996daf	[LCG] Stop playing fast and loose with reference members and assignment. It doesn't work. I'm still cleaning up all the places where I blindly followed this pattern. There are more to come in this code too. As a benefit, this lets the default copy and move operations Just Work. llvm-svn: 206375	2014-04-16 11:14:28 +00:00
Chandler Carruth	0e31ed9058	[Allocator] Make BumpPtrAllocator movable and move assignable. llvm-svn: 206372	2014-04-16 10:48:27 +00:00
Chandler Carruth	448ce011ab	[Allocator] Nuke to useless functions. The implicit ones are sufficient here (obviously). llvm-svn: 206369	2014-04-16 09:21:29 +00:00
Craig Topper	abb4ac7f87	Convert SelectionDAG::getVTList to use ArrayRef llvm-svn: 206357	2014-04-16 06:10:51 +00:00
Chandler Carruth	a073f253f3	[Allocator] Fold the two templated overloads into a single one with a default argument. The allocator interface we're modeling doesn't distinguish between array and non-array allocation. llvm-svn: 206327	2014-04-15 21:51:14 +00:00
Chandler Carruth	9c2a3958f0	[Allocator] Remove a really problematic overload. This is very confusing because there is another (size_t, size_t) overload of Allocator, and the only distinguishing factor is that one is a tempalte and the other isn't. There was only one usage of this and that one was easily converted to carry the alignment constraint in the type itself. llvm-svn: 206325	2014-04-15 21:36:02 +00:00
David Blaikie	ec649acb82	Use unique_ptr to manage ownership of child Regions within llvm::Region llvm-svn: 206310	2014-04-15 18:32:43 +00:00
Duncan P. N. Exon Smith	6ef5f284d6	verify-di: Implement DebugInfoVerifier Implement DebugInfoVerifier, which steals verification relying on DebugInfoFinder from Verifier. - Adds LegacyDebugInfoVerifierPassPass, a ModulePass which wraps DebugInfoVerifier. Uses -verify-di command-line flag. - Change verifyModule() to invoke DebugInfoVerifier as well as Verifier. - Add a call to createDebugInfoVerifierPass() wherever there was a call to createVerifierPass(). This implementation as a module pass should sidestep efficiency issues, allowing us to turn debug info verification back on. <rdar://problem/15500563> llvm-svn: 206300	2014-04-15 16:27:38 +00:00
Tim Northover	2f553f326a	FastISel: constrain the RegClass of operands when emitting instructions. ARM64 suffered multiple -verify-machineinstr failures (principally over the xsp/xzr issue) because FastISel was completely ignoring which subset of the general-purpose registers each instruction required. More fixes are coming in ARM64 specific FastISel, but this should cover the generic problems. llvm-svn: 206283	2014-04-15 13:59:49 +00:00
Chandler Carruth	785a9228b6	[Allocator] Finally, finish nuking the redundant code that led me here by removing the MallocSlabAllocator entirely and just using MallocAllocator directly. This makes all off these allocators expose and utilize the same core interface. The only ugly part of this is that it exposes the fact that the JIT allocator has no real handling of alignment, any more than the malloc allocator does. =/ It would be nice to fix both of these to support alignments, and then to leverage that in the BumpPtrAllocator to do less over allocation in order to manually align pointers. But, that's another patch for another day. This patch has no functional impact, it just removes the somewhat meaningless wrapper around MallocAllocator. llvm-svn: 206267	2014-04-15 09:44:09 +00:00
Chandler Carruth	bf4e0f86c9	[Allocator] Pass the size to the deallocation function. This, on some allocation libraries, may allow more efficient allocation and deallocation. It at least makes the interface implementable by the JIT memory manager. However, this highlights problematic overloading between the void* and the T* deallocation functions. I'm looking into a better way to do this, but as it happens, it comes up rarely in the codebase. llvm-svn: 206265	2014-04-15 08:59:52 +00:00
Chandler Carruth	553283e57d	[Allocator] Fix r206256 which got the enabling case backwards on these overloads. This doesn't matter that much yet, but it will in a subsequent patch. I had tested the original pattern, but not my attempt to pacify MSVC. This at least appears to work. Still fixing the rest of the fallout in the final patch that uses these overloads, but it will follow shortly. llvm-svn: 206259	2014-04-15 08:14:48 +00:00
Nick Lewycky	8589cc7e86	Fix broken build of llvm using clang. llvm-svn: 206257	2014-04-15 08:10:46 +00:00
Chandler Carruth	4eeaafcdef	[Allocator] MSVC apparantly has broken SFINAE context handling of 'sizeof(T)' for T == void and produces a hard error. I cannot fathom why this is OK. Oh well. switch to an explicit test for being the (potentially qualified) void type, which is the only specific case I was worried about. Hopefully this survives the libstdc++ build bots which have limited type traits implementations... llvm-svn: 206256	2014-04-15 08:02:29 +00:00
Nick Lewycky	aad475b324	Break PseudoSourceValue out of the Value hierarchy. It is now the root of its own tree containing FixedStackPseudoSourceValue (which you can use isa/dyn_cast on) and MipsCallEntry (which you can't). Anything that needs to use either a PseudoSourceValue* and Value* is strongly encouraged to use a MachinePointerInfo instead. llvm-svn: 206255	2014-04-15 07:22:52 +00:00
Craig Topper	2406477179	[C++11] More 'nullptr' conversion. In some cases just using a boolean check instead of comparing to nullptr. llvm-svn: 206254	2014-04-15 07:20:03 +00:00
Nick Lewycky	3cdb5cd00a	Add a DenseMapInfo specialization for PointerUnion. In tree user to land shortly. llvm-svn: 206253	2014-04-15 07:08:40 +00:00
Craig Topper	2617dccea2	[C++11] More 'nullptr' conversion. In some cases just using a boolean check instead of comparing to nullptr. llvm-svn: 206252	2014-04-15 06:32:26 +00:00
Chandler Carruth	d82d4cc356	[Allocator] Constrain the Deallocate templated overloads to only apply to types which we can compute the size of. The comparison with zero isn't actually interesting here, it's mostly about putting sizeof into a sfinae context. This is particular important for Deallocate as otherwise the void* overload can quickly become ambiguous. llvm-svn: 206251	2014-04-15 06:29:04 +00:00
David Blaikie	dc72f9774d	Use unique_ptr to manage ownership of GCFunctionInfos in GCStrategy llvm-svn: 206249	2014-04-15 06:07:26 +00:00
David Blaikie	ec528ee93f	Use unique_ptr for the result of Registry entries. llvm-svn: 206248	2014-04-15 05:53:26 +00:00
David Blaikie	88368bae4c	Use unique_ptr to manage ownership of GCStrategy objects in GCMetadata llvm-svn: 206246	2014-04-15 05:34:49 +00:00
David Blaikie	bb97e1b52e	Use unique_ptr to own MCFunctions within MCModule. MCModule's ctor had to be moved out of line so the definition of MCFunction was available. (ctor requires the dtor of members (in case the ctor throws) which required access to the dtor of MCFunction) llvm-svn: 206244	2014-04-15 05:15:19 +00:00
Craig Topper	9f008867c0	[C++11] More 'nullptr' conversion. In some cases just using a boolean check instead of comparing to nullptr. llvm-svn: 206243	2014-04-15 04:59:12 +00:00
David Blaikie	4a7a050910	Use std::unique_ptr to manage MCBasicBlocks in MCFunction. llvm-svn: 206242	2014-04-15 04:56:29 +00:00
Lang Hames	a1bc0f5662	[MC] Require an MCContext when constructing an MCDisassembler. This patch re-introduces the MCContext member that was removed from MCDisassembler in r206063, and requires that an MCContext be passed in at MCDisassembler construction time. (Previously the MCContext member had been initialized in an ad-hoc fashion after construction). The MCCContext member can be used by MCDisassembler sub-classes to construct constant or target-specific MCExprs. This patch updates disassemblers for in-tree targets, and provides the MCRegisterInfo instance that some disassemblers were using through the MCContext (previously those backends were constructing their own MCRegisterInfo instances). llvm-svn: 206241	2014-04-15 04:40:56 +00:00
Jim Grosbach	3ace407630	Add iterator_range for MachineInstr defs. llvm-svn: 206238	2014-04-15 02:14:06 +00:00
Chandler Carruth	ddfadb4654	[Allocator] Add Deallocate support to the AllocatorBase CRTP class, along with templated overloads much like we have for Allocate. These will facilitate switching the Deallocate interface of all the Allocator classes to accept the size by pre-filling it from the type size where we can do so. I plan to convert several uses to the template variants in subsequent patches prior to adding the Size parameter. No functionality changed, WIP. llvm-svn: 206230	2014-04-15 00:47:47 +00:00
Chandler Carruth	ce020670a4	[Allocator] Hack around the fact that GCC can't compile the static_assert added in r206225. I'm looking into a proper fix, but wanted the bots back. llvm-svn: 206226	2014-04-15 00:22:53 +00:00
Chandler Carruth	761af74802	[Allocator] Factor the Allocate template overloads into a base class rather than defining them (differently!) in both allocators. This also serves as a basis for documenting and even enforcing some of the LLVM-style "allocator" concept methods which must exist with various signatures. I plan on extending and changing the signatures of these to further simplify our allocator model in subsequent commits, so I wanted to factor things as best as I could first. Notably, I'm working to add the 'Size' to the deallocation method of all allocators. This has several implications not the least of which are faster deallocation times on certain allocation libraries (tcmalloc). It also will allow the JIT allocator to fully model the existing allocation interfaces and allow sanitizer poisoning of deallocated regions. The list of advantages goes on. =] But by factoring things first I'll be able to make this easier by first introducing template helpers for the deallocation path. llvm-svn: 206225	2014-04-15 00:19:41 +00:00
Chandler Carruth	3a8c087cb9	[cleanup] Run clang-format over most of YAMLParser.h to fix a bunch of small formatting inconsistencies with the rest of LLVM and even this file. I looked at all the changes and they seemed like just better formatting. llvm-svn: 206209	2014-04-14 21:12:15 +00:00
James Molloy	09a53b960b	[ARM64] Add big endian target arm64_be. llvm-svn: 206197	2014-04-14 17:37:53 +00:00

... 2 3 4 5 6 ...

20544 Commits