llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	963c5d5ef8	Remove an unused version of getMemIntrinsicNode and getNode. Additionally, these were calling makeVTList with the pointers passed in which would were unlikely to belong to SelectionDAG and likely would have just been stack pointers. llvm-svn: 207326	2014-04-26 18:35:13 +00:00
Argyrios Kyrtzidis	ea75aad379	[Sema] Adjust Sema::getCurBlock()/getCurLambda() to take into account that we may have switch CurContext due to class template instantiation. Fixes crash of the included test case. rdar://16527205 llvm-svn: 207325	2014-04-26 18:29:13 +00:00
David Blaikie	9c34526c17	Include C++ source for debug info test case committed in r207323 llvm-svn: 207324	2014-04-26 18:25:07 +00:00
David Blaikie	e12b49a6e8	DWARF Type Units: Avoid emitting type units under fission if the type requires an address. Since there's no way to ensure the type unit in the .dwo and the type unit skeleton in the .o are correlated, this cannot work. This implementation is a bit inefficient for a few reasons, called out in comments. llvm-svn: 207323	2014-04-26 17:27:38 +00:00
Benjamin Kramer	c2ad8f3ef1	Print X86ISD::PMULDQ nodes properly in debug output. llvm-svn: 207322	2014-04-26 16:26:41 +00:00
David Blaikie	f3de2ab46c	DwarfDebug: Minor refactoring around type unit construction Sinking addition of the declaration attribute down to where the signature is added. So that if the signature is not added neither is the declaration attribute (this will come in handy when aborting type unit construction to instead emit the type into the CU directly in some cases) Pull out type unit identifier hashing just to simplify the function a little, it'll be getting longer. llvm-svn: 207321	2014-04-26 16:26:41 +00:00
Benjamin Kramer	7c3722724b	X86TTI: i16/i32 vector div with a constant (splat) divisor are reasonably cheap now. Turn vectorization back on. llvm-svn: 207320	2014-04-26 14:53:05 +00:00
Alp Toker	87d3975369	libclang: remove 'CXDiagnostic_Remark' The change was landed without review or test cases. It trivially broke almost any stable application checking for Severity >= CXDiagnostic_Error or indeed any other kind of severity comparison upon encountering a 'remark'. Mapped to CXDiagnostic_Warning until a workable solution is proposed to the list that preserves API stability. (It's also not clear why the rest of r202475 wasn't simply implemented as a modifier to the existing 'warning' level.) llvm-svn: 207319	2014-04-26 14:43:53 +00:00
Benjamin Kramer	6d2dff61f9	X86: Lower SMUL_LOHI of v4i32 to pmuldq when SSE4.1 is available. llvm-svn: 207318	2014-04-26 14:12:19 +00:00
Benjamin Kramer	c9827ab103	X86: Add patterns for MULHU/MULHS of v8i16 and v16i16. This gets us pretty code for divs of i16 vectors. Turn the existing intrinsics into the corresponding nodes. llvm-svn: 207317	2014-04-26 13:01:03 +00:00
Benjamin Kramer	ad0168702a	Rip out X86-specific vector SDIV lowering, make the corresponding DAGCombiner transform work on vectors. llvm-svn: 207316	2014-04-26 13:00:53 +00:00
Benjamin Kramer	4dae598bc8	DAGCombiner: Turn divs of vector splats into vectorized multiplications. Otherwise the legalizer would just scalarize everything. Support for mulhi in the targets isn't that great yet so on most targets we get exactly the same scalarized output. Add a test for x86 vector udiv. I had to disable the mulhi nodes on ARM because there aren't any patterns for it. As far as I know ARM has instructions for getting the high part of a multiply so this should be fixed. llvm-svn: 207315	2014-04-26 12:06:28 +00:00
Benjamin Kramer	29139d5cb5	X86: Custom lower v4i32 UMUL_LOHI into 2 pmuludqs. Test will follow soon. llvm-svn: 207314	2014-04-26 12:06:11 +00:00
Michael Zolotukhin	1a97a7bcbf	Revert r206749 till a final decision about the intrinsics is made. llvm-svn: 207313	2014-04-26 09:56:41 +00:00
Chandler Carruth	90821c2a93	[LCG] Rather than removing nodes from the SCC entry set when we process them, just skip over any DFS-numbered nodes when finding the next root of a DFS. This allows the entry set to just be a vector as we populate it from a uniqued source. It also removes the possibility for a linear scan of the entry set to actually do the removal which can make things go quadratic if we get unlucky. llvm-svn: 207312	2014-04-26 09:45:55 +00:00
Chandler Carruth	5e2d70b9a3	[LCG] Rotate the full SCC finding algorithm to avoid round-trips through the DFS stack for leaves in the call graph. As mentioned in my previous commit, this is particularly interesting for graphs which have high fan out but low connectivity resulting in many leaves. For such graphs, this can remove a large % of the DFS stack traffic even though it doesn't make the stack much smaller. It's a bit easier to formulate this for the full algorithm because that one stops completely for each SCC. For example, I was able to directly eliminate the "Recurse" boolean used to continue an outer loop from the inner loop. llvm-svn: 207311	2014-04-26 09:28:00 +00:00
Chandler Carruth	aca48d0443	[LCG] Hoist the main DFS loop out of the edge removal function. This makes working through the worklist much cleaner, and makes it possible to avoid the 'bool-to-continue-the-outer-loop' hack. Not a huge difference, but I think this is approaching as polished as I can make it. llvm-svn: 207310	2014-04-26 09:06:53 +00:00
Gerolf Hoflehner	af7a87d2e3	RecursivelyDeleteTriviallyDeadInstructions() could remove more than 1 instruction. The caller need to be aware of this and adjust instruction iterators accordingly. rdar://16679376 Repaired r207302. llvm-svn: 207309	2014-04-26 05:58:11 +00:00
Gerolf Hoflehner	1da7cbd584	Restore CloneFunction.cpp which got accidently overwritten by previous backout of r207303 llvm-svn: 207308	2014-04-26 05:43:41 +00:00
Marshall Clow	85d3e7a729	Fix bug #18350 . Add tests for tuples of all the smart pointers (except auto_ptr) llvm-svn: 207307	2014-04-26 05:19:48 +00:00
Chandler Carruth	680af7a78c	[LCG] In the incremental SCC re-formation, lift the node currently being processed in the DFS out of the stack completely. Keep it exclusively in a variable. Re-shuffle some code structure to make this easier. This can have a very dramatic effect in some cases because call graphs tend to look like a high fan-out spanning tree. As a consequence, there are a large number of leaf nodes in the graph, and this technique causes leaf nodes to never even go into the stack. While this only reduces the max depth by 1, it may cause the total number of round trips through the stack to drop by a lot. Now, most of this isn't really relevant for the incremental version. =] But I wanted to prototype it first here as this variant is in ways more complex. As long as I can get the code factored well here, I'll next make the primary walk look the same. There are several refactorings this exposes I think. llvm-svn: 207306	2014-04-26 03:36:42 +00:00
Chandler Carruth	a7205b6154	[LCG] Special case the removal of self edges. These don't impact the SCC graph in any way because we don't track edges in the SCC graph, just nodes. This also lets us add a nice assert about the invariant that we're working on at least a certain number of nodes within the SCC. llvm-svn: 207305	2014-04-26 03:36:37 +00:00
Juergen Ributzka	a6bda8bae2	[DAG] During DAG legalization keep opaque constants even after expanding. The included test case would return the incorrect results, because the expansion of an shift with a constant shift amount of 0 would generate undefined behavior. This is because ExpandShiftByConstant assumes that all shifts by constants with a value of 0 have already been optimized away. This doesn't happen for opaque constants and usually this isn't a problem, because opaque constants won't take this code path - they are not supposed to. In the case that the opaque constant has to be expanded by the legalizer, the legalizer would drop the opaque flag. In this case we hit the limitations of ExpandShiftByConstant and create incorrect code. This commit fixes the legalizer by not dropping the opaque flag when expanding opaque constants and adding an assertion to ExpandShiftByConstant to catch this not supported case in the future. This fixes <rdar://problem/16718472> llvm-svn: 207304	2014-04-26 02:58:04 +00:00
Gerolf Hoflehner	c46e9b0423	Revert commit r207302 since build failures have been reported. llvm-svn: 207303	2014-04-26 02:03:17 +00:00
Gerolf Hoflehner	34210108b3	RecursivelyDeleteTriviallyDeadInstructions() could remove more than 1 instruction. The caller need to be aware of this and adjust instruction iterators accordingly. rdar://16679376 llvm-svn: 207302	2014-04-26 01:19:16 +00:00
Quentin Colombet	ea18933d97	[X86] Implement TargetLowering::getScalingFactorCost hook. Scaling factors are not free on X86 because every "complex" addressing mode breaks the related instruction into 2 allocations instead of 1. <rdar://problem/16730541> llvm-svn: 207301	2014-04-26 01:11:26 +00:00
Chandler Carruth	8f92d6db22	[LCG] Refactor the duplicated code I added in my last commit here into a helper function. Also factor the other two places where we did the same thing into the helper function. =] Much cleaner this way. NFC. llvm-svn: 207300	2014-04-26 01:03:46 +00:00
Andrea Di Biagio	8cc9059ce8	[InstCombine][X86] Teach how to fold calls to SSE2/AVX2 packed logical shift right intrinsics. A packed logical shift right with a shift count bigger than or equal to the element size always produces a zero vector. In all other cases, it can be safely replaced by a 'lshr' instruction. llvm-svn: 207299	2014-04-26 01:03:22 +00:00
Richard Smith	8d039e4420	Add missing include guards and missing #include, found by modules build. llvm-svn: 207298	2014-04-26 00:53:26 +00:00
Rui Ueyama	f33946d51d	[PECOFF] Allow multiple directives in one module-definition file. I'm a bit surprised that I have not implemented this yet. This is definitely needed to handle real-world module definition files. This patch contains a unit test for r207294. llvm-svn: 207297	2014-04-26 00:25:02 +00:00
Nick Lewycky	0c2986f78e	Add mangling for attribute enable_if. The demangling patch for libcxxabi is still in review. llvm-svn: 207296	2014-04-26 00:14:00 +00:00
Filipe Cabecinhas	d71f110fe9	Appease the almighty buildbots. llvm-svn: 207295	2014-04-26 00:02:37 +00:00
Rui Ueyama	637300ea4e	[PECOFF] Fix off-by-one error in .def file parser. I'm fixing another bug in the parser, and I wanted to submit this fix as a separate change as it's logically independent from the other. I'll add a test for this shortly. llvm-svn: 207294	2014-04-25 23:59:27 +00:00
Greg Clayton	28432c24e9	Since one or more Editline instances of the same kind (lldb commands, expressions, etc) can exist at once, they should all shared a ref counted history object. Now they do. llvm-svn: 207293	2014-04-25 23:55:26 +00:00
Greg Clayton	ed6499fe64	Free the strong reference to a lldb::SBDebugger that the script interpreter was holding onto in the "lldb.debugger" global variable. llvm-svn: 207292	2014-04-25 23:55:12 +00:00
Filipe Cabecinhas	363b570d2a	Optimization for certain shufflevector by using insertps. Summary: If we're doing a v4f32/v4i32 shuffle on x86 with SSE4.1, we can lower certain shufflevectors to an insertps instruction: When most of the shufflevector result's elements come from one vector (and keep their index), and one element comes from another vector or a memory operand. Added tests for insertps optimizations on shufflevector. Added support and tests for v4i32 vector optimization. Reviewers: nadav Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3475 llvm-svn: 207291	2014-04-25 23:51:17 +00:00
Duncan P. N. Exon Smith	42292ceaa9	Revert "blockfreq: Approximate irreducible control flow" This reverts commit r207286. It causes an ICE on the cmake-llvm-x86_64-linux buildbot [1]: llvm/lib/Analysis/BlockFrequencyInfo.cpp: In lambda function: llvm/lib/Analysis/BlockFrequencyInfo.cpp:182:1: internal compiler error: in get_expr_operands, at tree-ssa-operands.c:1035 [1]: http://bb.pgr.jp/builders/cmake-llvm-x86_64-linux/builds/12093/steps/build_llvm/logs/stdio llvm-svn: 207287	2014-04-25 23:16:58 +00:00
Duncan P. N. Exon Smith	384d0e8ad4	blockfreq: Approximate irreducible control flow Previously, irreducible backedges were ignored. With this commit, irreducible SCCs are discovered on the fly, and modelled as loops with multiple headers. This approximation specifies the headers of irreducible sub-SCCs as its entry blocks and all nodes that are targets of a backedge within it (excluding backedges within true sub-loops). Block frequency calculations act as if we insert a new block that intercepts all the edges to the headers. All backedges and entries to the irreducible SCC point to this imaginary block. This imaginary block has an edge (with even probability) to each header block. The result is now reasonable enough that I've added a number of testcases for irreducible control flow. I've outlined in `BlockFrequencyInfoImpl.h` ways to improve the approximation. <rdar://problem/14292693> llvm-svn: 207286	2014-04-25 23:08:57 +00:00
Todd Fiala	260323dba6	Added TestLldbGdbServer test for A start exe packet. Fixed up bug in XFAIL tests where I appended an array when I intended to merge an array. llvm-svn: 207285	2014-04-25 23:08:24 +00:00
Adrian Prantl	232897feaa	Unbreak the gdb buildbot by not lowering dbg.declare intrinsics for arrays. llvm-svn: 207284	2014-04-25 23:00:25 +00:00
Eric Christopher	ece0e90e33	Make sure that rangelists are also relative to the compile unit low_pc similar to location lists. Fixes PR19563 llvm-svn: 207283	2014-04-25 22:23:54 +00:00
Matt Arsenault	de1c3410c3	R600: Fix function name printing in LowerCall v2: Check both ExternalSymbol and GlobalAddress Patch by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 207282	2014-04-25 22:22:01 +00:00
David Blaikie	772ab8ae5a	DwarfAccelTable: Store the string symbol in the accelerator table to avoid duplicate lookup. This also avoids the need for subtly side-effecting calls to manifest strings in the string table at the point where items are added to the accelerator tables. llvm-svn: 207281	2014-04-25 22:21:35 +00:00
Warren Hunt	f0ffdb2e60	Fixed Assert In CGRecordLowering Prior to this patch, CGRecordLower assumed that virtual bases could not be placed before the nvsize of an object. This isn't true in Itanium mode, virtual bases are placed at dsize rather than vnsize and in the case of zero sized non-virtual bases nvsize can be larger than dsize. This patch fixes CGRecordLowering to avoid an assert and to clip bitfields properly in this case. A test case is included. llvm-svn: 207280	2014-04-25 21:56:30 +00:00
Tom Roeder	fd1bc602b3	Add an -mattr option to the gold plugin to support subtarget features in LTO This adds support for an -mattr option to the gold plugin and to llvm-lto. This allows the caller to specify details of the subtarget architecture, like +aes, or +ssse3 on x86. Note that this requires a change to the include/llvm-c/lto.h interface: it adds a function lto_codegen_set_attr and it increments the version of the interface. llvm-svn: 207279	2014-04-25 21:46:51 +00:00
Alexey Samsonov	b54d0f4020	Fix missing include llvm-svn: 207278	2014-04-25 21:42:35 +00:00
David Blaikie	daefdbf3ad	Encapsulate the DWARF string pool in a separate type. Pulls out some more code from some of the rather monolithic DWARF classes. Unlike the address table, the string table won't move up into DwarfDebug - each DWARF file has its own string table (but there can be only one address table). llvm-svn: 207277	2014-04-25 21:34:35 +00:00
Alexey Samsonov	001ecd9aa9	[DWARF parser] Cleanup code in DWARFDebugAranges. No functionality change. llvm-svn: 207276	2014-04-25 21:30:03 +00:00
Saleem Abdulrasool	b9f07e3dbc	CodeGen: add __yield intrinsic for ARM The __yield intrinsic generates a hint instruction to indicate that the thread is not performing any useful operations at the moment. This is for compatibility with MSVC, although, the intrinsic is also part of the ACLE, and is enabled globally as a result. llvm-svn: 207275	2014-04-25 21:13:29 +00:00
Alexey Samsonov	4316df5921	[DWARF parser] Cleanup code in DWARFDebugAbbrev. No functionality change. llvm-svn: 207274	2014-04-25 21:10:56 +00:00

1 2 3 4 5 ...

172965 Commits All Branches Search

172965 Commits

All Branches