llvm-project

Commit Graph

Author	SHA1	Message	Date
Sebastian Pop	47fe7de1b5	move findArrayDimensions to ScalarEvolution we do not use the information from SCEVAddRecExpr to compute the shape of the array, so a better place for this function is in ScalarEvolution. llvm-svn: 208456	2014-05-09 22:45:07 +00:00
Reid Kleckner	7941856445	Allow sret on the second parameter as well as the first MSVC always places the implicit sret parameter after the implicit this parameter of instance methods. We used to handle this for x86_thiscallcc by allocating the sret parameter on the stack and leaving the this pointer in ecx, but that doesn't handle alternative calling conventions like cdecl, stdcall, fastcall, or the win64 convention. Instead, change the verifier to allow sret on the second parameter. This also requires changing the Mips and X86 backends to return the argument with the sret parameter, instead of assuming that the sret parameter comes first. The Sparc backend also returns sret parameters in a register, but I wasn't able to update it to handle secondary sret parameters. It currently calls report_fatal_error if you feed it an sret in the second parameter. Reviewers: rafael.espindola, majnemer Differential Revision: http://reviews.llvm.org/D3617 llvm-svn: 208453	2014-05-09 22:32:13 +00:00
Rafael Espindola	b2ea33975f	Run clang-format in small sections of code to make a patch easier to read. llvm-svn: 208419	2014-05-09 15:49:02 +00:00
Oliver Stannard	c24f2171ca	ARM: HFAs must be passed in consecutive registers When using the ARM AAPCS, HFAs (Homogeneous Floating-point Aggregates) must be passed in a block of consecutive floating-point registers, or on the stack. This means that unused floating-point registers cannot be back-filled with part of an HFA, however this can currently happen. This patch, along with the corresponding clang patch (http://reviews.llvm.org/D3083) prevents this. llvm-svn: 208413	2014-05-09 14:01:47 +00:00
Rafael Espindola	9ad68a908f	Remove trailing white space. llvm-svn: 208411	2014-05-09 13:54:40 +00:00
Rafael Espindola	3d7c778d6d	Don't indent inside a namespace. Don't duplicate a function name in comment. llvm-svn: 208389	2014-05-09 02:56:16 +00:00
Nick Lewycky	ad1b3d1de5	printCustom is only used in PseudoSourceValue, remove it from Value. llvm-svn: 208383	2014-05-09 00:49:03 +00:00
Rafael Espindola	529c8462dd	Add missing linkage predicates. llvm-svn: 208379	2014-05-09 00:36:18 +00:00
David Blaikie	2f143e0c30	Reapply r207876 (Try simplifying LexicalScopes ownership again) including a workaround for an MSVC2012 bug regarding forward_as_tuple (r207876 was reverted in r208131 after seeing some consistent buildbot failure for MSVC 2012. The original commits were in r207724-r207726) Takumi was nice enough to dig into this and locate this Microsoft Connect issue: http://connect.microsoft.com/VisualStudio/feedback/details/814899/forward-as-tuple-debug-implementation-error describing a bug in MSVC2012's forward_as_tuple implementation. Since the parameters in this instance are trivial/small, pass them by value (using make_tuple) instead of perfectly-forwarded tuple of rvalue references (involving the broken forward_as_tuple). Hopefully this will satisfy MSVC2012. llvm-svn: 208364	2014-05-08 22:24:51 +00:00
David Blaikie	e08c540e68	Missed formatting llvm-svn: 208362	2014-05-08 21:53:33 +00:00
David Blaikie	8ae8fd08ff	StringMap: Move assignment and move construction. llvm-svn: 208361	2014-05-08 21:52:29 +00:00
David Blaikie	70a14fc4d6	StringMap: Replace faux-copyability with faux-movability, which is sufficient. This behavior was added to support StringMaps of StringMaps, default + move construction are sufficient for this. Real move construction support coming soon (& probably copy construction too). llvm-svn: 208360	2014-05-08 21:52:26 +00:00
David Blaikie	9cb331f9fb	StringMap support for move-only values. llvm-svn: 208359	2014-05-08 21:52:23 +00:00
Ed Maste	6b008bf205	Add isOSFreeBSD triple test For http://reviews.llvm.org/D3448 llvm-svn: 208309	2014-05-08 13:00:15 +00:00
Hal Finkel	6532c20faa	Move late partial-unrolling thresholds into the processor definitions The old method used by X86TTI to determine partial-unrolling thresholds was messy (because it worked by testing target features), and also would not correctly identify the target CPU if certain target features were disabled. After some discussions on IRC with Chandler et al., it was decided that the processor scheduling models were the right containers for this information (because it is often tied to special uop dispatch-buffer sizes). This does represent a small functionality change: - For generic x86-64 (which uses the SB model and, thus, will get some unrolling). - For AMD cores (because they still currently use the SB scheduling model) - For Haswell (based on benchmarking by Louis Gerbarg, it was decided to bump the default threshold to 50; we're working on a test case for this). Otherwise, nothing has changed for any other targets. The logic, however, has been moved into BasicTTI, so other targets may now also opt-in to this functionality simply by setting LoopMicroOpBufferSize in their processor model definitions. llvm-svn: 208289	2014-05-08 09:14:44 +00:00
Richard Smith	789d3007fb	[modules] Add missing #include. llvm-svn: 208276	2014-05-08 02:34:32 +00:00
Duncan P. N. Exon Smith	e60adfdbd0	GlobalValue: Assert symbols with local linkage have default visibility The change to ExtractGV.cpp has no functionality change except to avoid the asserts. Existing testcases already cover this, so I didn't add a new one. llvm-svn: 208264	2014-05-07 23:00:22 +00:00
Justin Bogner	c9124a54c3	llvm-cov: Fix some funny indentation (NFC) Noticed by Duncan Exon Smith. Thanks! llvm-svn: 208253	2014-05-07 21:50:43 +00:00
Nico Weber	bc8a35f093	Let OnDiskHashTable call the destructor of its Items. OnDiskHashTable::insert() calls the Item constructor via placement new, but nothing called the destructor. This matters in cases when the Info template parameter has key_type or data_type typedefs that have a destructor, for example like IdentifierIndexWriterTrait in clang's GlobalModuleIndex.cpp. This fixes a 5-year old bug that's been around since the OnDiskHashTable code was added in r64192. Bug found by LSan! llvm-svn: 208243	2014-05-07 19:55:38 +00:00
Matt Arsenault	5f2fd4b22a	Fix using wrong result type for setcc. When reducing the bitwidth of a comparison against a constant, the original setcc's result type was used, which was incorrect. No test since I don't think any other in tree targets change the bitwidth of the setcc type depending on the bitwidth of the compared type. llvm-svn: 208236	2014-05-07 18:26:58 +00:00
Sebastian Pop	448712b1a6	split delinearization pass in 3 steps To compute the dimensions of the array in a unique way, we split the delinearization analysis in three steps: - find parametric terms in all memory access functions - compute the array dimensions from the set of terms - compute the delinearized access functions for each dimension The first step is executed on all the memory access functions such that we gather all the patterns in which an array is accessed. The second step reduces all this information in a unique description of the sizes of the array. The third step is delinearizing each memory access function following the common description of the shape of the array computed in step 2. This rewrite of the delinearization pass also solves a problem we had with the previous implementation: because the previous algorithm was by induction on the structure of the SCEV, it would not correctly recognize the shape of the array when the memory access was not following the nesting of the loops: for example, see polly/test/ScopInfo/multidim_only_ivs_3d_reverse.ll ; void foo(long n, long m, long o, double A[n][m][o]) { ; ; for (long i = 0; i < n; i++) ; for (long j = 0; j < m; j++) ; for (long k = 0; k < o; k++) ; A[i][k][j] = 1.0; Starting with this patch we no longer delinearize access functions that do not contain parameters, for example in test/Analysis/DependenceAnalysis/GCD.ll ;; for (long int i = 0; i < 100; i++) ;; for (long int j = 0; j < 100; j++) { ;; A[2i - 4j] = i; ;; B++ = A[6i + 8*j]; these accesses will not be delinearized as the upper bound of the loops are constants, and their access functions do not contain SCEVUnknown parameters. llvm-svn: 208232	2014-05-07 18:01:20 +00:00
Rafael Espindola	764ac3677d	Style update: don't duplicate the function name. llvm-svn: 208227	2014-05-07 17:04:45 +00:00
Rafael Espindola	031c890221	Style update: don't duplicate the function name. llvm-svn: 208224	2014-05-07 16:43:23 +00:00
Rafael Espindola	566fcfe69b	Remove the UseCFI option from createAsmStreamer. We were already always passing true, this just removes the option. llvm-svn: 208205	2014-05-07 13:00:43 +00:00
Ed Maste	fd122267c4	DebugInfo: Use enum instead of unsigned This makes debuging DebugInfo generation with LLDB a little more pleasant. Differential Revision: http://reviews.llvm.org/D3626 llvm-svn: 208202	2014-05-07 12:49:08 +00:00
Daniel Sanders	314e80e5f8	[tablegen] Add !listconcat operator with the similar semantics as !strconcat Summary: It concatenates two or more lists. In addition to the !strconcat semantics the lists must have the same element type. My overall aim is to make it easy to append to Instruction.Predicates rather than override it. This can be done by concatenating lists passed as arguments, or by concatenating lists passed in additional fields. Reviewers: dsanders Reviewed By: dsanders Subscribers: hfinkel, llvm-commits Differential Revision: http://reviews.llvm.org/D3506 llvm-svn: 208183	2014-05-07 10:13:19 +00:00
Zinovy Nis	da925c0d7c	[BUG][REFACTOR] 1) Fix for printing debug locations for absolute paths. 2) Location printing is moved into public method DebugLoc::print() to avoid re-inventing the wheel. Differential Revision: http://reviews.llvm.org/D3513 llvm-svn: 208177	2014-05-07 09:51:22 +00:00
Tobias Grosser	924221cb37	[C++11] Add NArySCEV->Operands iterator range llvm-svn: 208158	2014-05-07 06:07:47 +00:00
Justin Bogner	cf27e1b996	llvm-cov: Handle missing source files as GCOV does If the source files referenced by a gcno file are missing, gcov outputs a coverage file where every line is simply /EOF/. This also occurs for lines in the coverage that are past the end of a file that is found. This change mimics gcov. llvm-svn: 208149	2014-05-07 02:11:23 +00:00
Justin Bogner	1a18d7caa3	llvm-cov: Implement --no-output In gcov, there's a -n/--no-output option, which disables the writing of any .gcov files, so that it emits only the summary info on stdout. This implements the same behaviour in llvm-cov. llvm-svn: 208148	2014-05-07 02:11:18 +00:00
Rafael Espindola	8d8f100c57	Special case aliases in GlobalValue::getSection. This is similar to the getAlignment patch, but is done just for completeness. It looks like we never call getSection on an alias. All the tests still pass if the if is replaced with an assert. llvm-svn: 208139	2014-05-06 22:44:30 +00:00
David Blaikie	9dabbf6228	Revert "Try simplifying LexicalScopes ownership again." Speculatively reverting due to a suspicious failure on a Windows buildbot. This reverts commit 10c37a012ea11596d44cd9059fe09c959caf30c8. llvm-svn: 208131	2014-05-06 21:07:17 +00:00
Eric Christopher	dc5072d60e	ArrayRef-ize the Feature and Processor tables for SubtargetFeatures. This removes arguments passed everywhere and allows the use of standard iteration over lists. Should be no functional change. llvm-svn: 208127	2014-05-06 20:23:04 +00:00
Renato Golin	c7aea40ec6	Implememting named register intrinsics This patch implements the infrastructure to use named register constructs in programs that need access to specific registers (bare metal, kernels, etc). So far, only the stack pointer is supported as a technology preview, but as it is, the intrinsic can already support all non-allocatable registers from any architecture. llvm-svn: 208104	2014-05-06 16:51:25 +00:00
Rafael Espindola	52dc5d828f	Special case aliases in GlobalValue::getAlignment. An alias has the address of what it points to, so it also has the same alignment. This allows a few optimizations to see past aliases for free. llvm-svn: 208103	2014-05-06 16:48:58 +00:00
Rafael Espindola	8fbbfbbec3	Be more strict about not allowing setSection on aliases. llvm-svn: 208095	2014-05-06 14:59:14 +00:00
Owen Anderson	4cf4e664c2	Fix some obvious Doxygen comment bugs. llvm-svn: 208059	2014-05-06 05:05:59 +00:00
David Blaikie	945cdd07d3	Update comment from a recent commit. llvm-svn: 208057	2014-05-06 03:53:10 +00:00
David Blaikie	d3f094a33b	PR19598: Provide the ability to RAUW a declaration with itself, creating a non-temporary copy and using that to RAUW. Also, provide the ability to create temporary and non-temporary declarations, as not all declarations may be replaced by definitions later on. This provides the necessary infrastructure for Clang to fix PR19598, leaking temporary MDNodes in Clang's debug info generation. llvm-svn: 208054	2014-05-06 03:41:57 +00:00
Eric Christopher	7eba3f90ae	Revert "Walk back commits for unused function parameters - they're still being" this reapplies 208012 and 208002. llvm-svn: 208037	2014-05-06 02:37:26 +00:00
Duncan P. N. Exon Smith	87c40fdfdb	blockfreq: Move include to .cpp llvm-svn: 208035	2014-05-06 01:57:42 +00:00
Richard Smith	c167d656e7	Re-commit r208025, reverted in r208030, with a fix for a conformance issue which GCC detects and Clang does not! llvm-svn: 208033	2014-05-06 01:44:26 +00:00
Richard Smith	09bf116939	Revert r208025, which made buildbots unhappy for unknown reasons. llvm-svn: 208030	2014-05-06 01:26:00 +00:00
Argyrios Kyrtzidis	8c1eafc9b0	[Support/MemoryBuffer] Rename IsVolatile -> IsVolatileSize and add a comment about the use case for the new parameter. llvm-svn: 208026	2014-05-06 01:03:52 +00:00
Richard Smith	6cf1d744d8	Add llvm::function_ref (and a couple of uses of it), representing a type-erased reference to a callable object. llvm-svn: 208025	2014-05-06 01:01:29 +00:00
Nick Lewycky	5ef6bc8815	Improve 'tail' call marking in TRE. A bootstrap of clang goes from 375k calls marked tail in the IR to 470k, however this improvement does not carry into an improvement of the call/jmp ratio on x86. The most common pattern is a tail call + br to a block with nothing but a 'ret'. The number of tail call to loop conversions remains the same (1618 by my count). The new algorithm does a local scan over the use-def chains to identify local "alloca-derived" values, as well as points where the alloca could escape. Then, a visit over the CFG marks blocks as being before or after the allocas have escaped, and annotates the calls accordingly. llvm-svn: 208017	2014-05-05 23:59:03 +00:00
Eric Christopher	4b33ec96d3	Walk back commits for unused function parameters - they're still being used via dragonegg for now. llvm-svn: 208016	2014-05-05 23:26:59 +00:00
Argyrios Kyrtzidis	20a92ae3d2	[Support/MemoryBuffer] Introduce a boolean parameter (false by default) 'IsVolatile' for the open file functions. This provides a hint that the file may be changing often so mmap is avoided. llvm-svn: 208007	2014-05-05 21:55:51 +00:00
Eric Christopher	6beaa8adb8	Remove unused argument from AddFeature. llvm-svn: 208002	2014-05-05 21:40:44 +00:00
Eric Christopher	aa1641e564	Fix typo (also tab character). llvm-svn: 208001	2014-05-05 21:40:41 +00:00
Rafael Espindola	595f54205c	Remove the -disable-cfi option. This also add a release note about it. If this stays I will cleanup MC next week. llvm-svn: 207977	2014-05-05 17:33:26 +00:00
Simon Atanasyan	d2a822d3ca	Add range access to ELFFile's sections collection. llvm-svn: 207952	2014-05-05 06:48:34 +00:00
Chandler Carruth	312dddfb81	[LCG] Add the last (and most complex) of the edge insertion mutation operations on the call graph. This one forms a cycle, and while not as complex as removing an internal edge from an SCC, it involves a reasonable amount of work to find all of the nodes newly connected in a cycle. Also somewhat alarming is the worst case complexity here: it might have to walk roughly the entire SCC inverse DAG to insert a single edge. This is carefully documented in the API (I hope). llvm-svn: 207935	2014-05-04 09:38:32 +00:00
David Majnemer	cf63a79818	IR: Cleanup AttributeSet::get for AttrBuilder We don't modify the AttrBuilder in AttributeSet::get, make the reference argument const. llvm-svn: 207924	2014-05-03 23:00:35 +00:00
Rafael Espindola	3d082fa507	Fix pr19645. The fix itself is fairly simple: move getAccessVariant to MCValue so that we replace the old weak expression evaluation with the far more general EvaluateAsRelocatable. This then requires that EvaluateAsRelocatable stop when it finds a non trivial reference kind. And that in turn requires the ELF writer to look harder for weak references. Last but not least, this found a case where we were being bug by bug compatible with gas and accepting an invalid input. I reported pr19647 to track it. llvm-svn: 207920	2014-05-03 19:57:04 +00:00
Rafael Espindola	80df4bb10f	Rename member variable to try to fix the bots. llvm-svn: 207915	2014-05-03 15:28:13 +00:00
Rafael Espindola	83ceb8edfb	Move LTOModule and LTOCodeGenerator to the llvm namespace. llvm-svn: 207911	2014-05-03 14:59:52 +00:00
Rafael Espindola	9d4f24a34b	Style fix: don't duplicate the method names. llvm-svn: 207910	2014-05-03 14:46:47 +00:00
Rafael Espindola	b62e6b4535	Style update: don't duplicate comments, they were getting out of sync. llvm-svn: 207909	2014-05-03 14:34:48 +00:00
Karthik Bhat	ddd0cb5ecf	Vectorize intrinsic math function calls in SLPVectorizer. This patch adds support to recognize and vectorize intrinsic math functions in SLPVectorizer. Review: http://reviews.llvm.org/D3560 and http://reviews.llvm.org/D3559 llvm-svn: 207901	2014-05-03 09:59:54 +00:00
David Blaikie	658a20b04d	Try simplifying LexicalScopes ownership again. Committed initially in r207724-r207726 and reverted due to compiler-rt crashes in r207732. Instead, fix this harder with unordered_map and store the LexicalScopes by value in the map. This did necessitate moving the definition of LexicalScope above the definition of LexicalScopes. Let's see how the buildbots/compilers tolerate unordered_map::emplace + std::piecewise_construct + std::forward_as_tuple... llvm-svn: 207876	2014-05-02 22:21:05 +00:00
Rafael Espindola	7cdc8a1f30	Remove dead declaration. llvm-svn: 207857	2014-05-02 18:37:07 +00:00
Nico Weber	4b2acde21a	Teach GlobalDCE how to remove empty global_ctor entries. This moves most of GlobalOpt's constructor optimization code out of GlobalOpt into Transforms/Utils/CDtorUtils.{h,cpp}. The public interface is a single function OptimizeGlobalCtorsList() that takes a predicate returning which constructors to remove. GlobalOpt calls this with a function that statically evaluates all constructors, just like it did before. This part of the change is behavior-preserving. Also add a call to this from GlobalDCE with a filter that removes global constructors that contain a "ret" instruction and nothing else – this fixes PR19590. llvm-svn: 207856	2014-05-02 18:35:25 +00:00
Juergen Ributzka	37fc0a8ae8	[Stackmaps] Pacify windows buildbot. llvm-svn: 207807	2014-05-01 22:39:26 +00:00
Juergen Ributzka	673a762b80	[Stackmaps] Add command line option to specify the stackmap version. llvm-svn: 207805	2014-05-01 22:21:30 +00:00
Juergen Ributzka	6340195abd	[Stackmaps] Refactor serialization code. No functional change intended. llvm-svn: 207804	2014-05-01 22:21:27 +00:00
Juergen Ributzka	f01e809383	[Stackmaps] Replace the custom ConstantPool class with a MapVector. llvm-svn: 207803	2014-05-01 22:21:24 +00:00
Eli Bendersky	a108a65df2	Add an optimization that does CSE in a group of similar GEPs. This optimization merges the common part of a group of GEPs, so we can compute each pointer address by adding a simple offset to the common part. The optimization is currently only enabled for the NVPTX backend, where it has a large payoff on some benchmarks. Review: http://reviews.llvm.org/D3462 Patch by Jingyue Wu. llvm-svn: 207783	2014-05-01 18:38:36 +00:00
Rafael Espindola	2aeac7a321	Move getBaseSymbol somewhere the COFF writer can use. I will use it there in a second. llvm-svn: 207761	2014-05-01 13:24:25 +00:00
Chandler Carruth	7cc4ed8202	[LCG] Add the other simple edge insertion API to the call graph. This just connects an SCC to one of its descendants directly. Not much of an impact. The last one is the hard one -- connecting an SCC to one of its ancestors, and thereby forming a cycle such that we have to merge all the SCCs participating in the cycle. llvm-svn: 207751	2014-05-01 12:18:20 +00:00
Chandler Carruth	4b096741b4	[LCG] Add some basic methods for querying the parent/child relationships of SCCs in the SCC DAG. Exercise them in the big graph test case. These will be especially useful for establishing invariants in insertion logic. llvm-svn: 207749	2014-05-01 12:12:42 +00:00
Chandler Carruth	2629ef6e41	[LCG] Fix a bad bug in the new fancy iterator scheme I added to support removal. We can't just blindly increment (or decrement) the adapted iterator when the value is null because doing so can walk past the end (or beginning) and keep inspecting the value. The fix I've implemented is to restrict this further to a forward iterator and add an end iterator to the members (replacing a member that had become dead when I switched to the adaptor base!) and using that to stop the iteration. I'm not entirely pleased with this solution. I feel like forward iteration is too restrictive. I wasn't even happy about bidirectional iteration. It also makes the iterator objects larger and the iteration loops more complex. However, I also don't really like the other alternative that seems obvious: a sentinel node. I'm still hoping to come up with a more elegant solution here, but this at least fixes the MSan and Valgrind errors on this code. llvm-svn: 207743	2014-05-01 10:41:51 +00:00
Oliver Stannard	7eacbd5a71	Record the DWARF version in MCContext Record the DWARF version in MCContext, and use it when emitting the dwarf version into the debug info. llvm-svn: 207739	2014-05-01 08:46:02 +00:00
Richard Smith	d730500706	Speculatively roll back r207724-r207726, which are code cleanup changes and appear to be breaking a bootstrapped build of compiler-rt. llvm-svn: 207732	2014-05-01 00:46:58 +00:00
David Blaikie	6b71cc7bac	LexicalScopes: Use unique_ptr to manage ownership of abstract LexicalScopes. llvm-svn: 207726	2014-04-30 23:46:27 +00:00
David Blaikie	b36914421b	LexicalScopes: use unique_ptr to own LexicalScope objects. Ownership of abstract scopes coming soon. llvm-svn: 207724	2014-04-30 23:40:59 +00:00
Rafael Espindola	fee224f942	Provide a version of getSymbolOffset that returns false on error. This simplifies ELFObjectWriter::SymbolValue a bit more. This new version will also be used in the COFF writer to fix pr19147. llvm-svn: 207711	2014-04-30 21:51:13 +00:00
Jay Foad	f517c0f21b	Remove unused field hash_state::seed. llvm-svn: 207703	2014-04-30 21:12:17 +00:00
Weiming Zhao	7f6daf1799	[ARM64] Prevent bit extraction to be adjusted by following shift For pattern like ((x >> C1) & Mask) << C2, DAG combiner may convert it into (x >> (C1-C2)) & (Mask << C2), which makes pattern matching of ubfx more difficult. For example: Given %shr = lshr i64 %x, 4 %and = and i64 %shr, 15 %arrayidx = getelementptr inbounds [8 x [64 x i64]]* @arr, i64 0, %i64 2, i64 %and %0 = load i64* %arrayidx With current shift folding, it takes 3 instrs to compute base address: lsr x8, x0, #1 and x8, x8, #0x78 add x8, x9, x8 If using ubfx, it only needs 2 instrs: ubfx x8, x0, #4, #4 add x8, x9, x8, lsl #3 This fixes bug 19589 llvm-svn: 207702	2014-04-30 21:07:24 +00:00
Hans Wennborg	83e6e1e926	ELFObjectWriter: deduplicate suffices in strtab We already do this for shstrtab, so might as well do it for strtab. This extracts the string table building code into a separate class. The idea is to use it for other object formats too. I mostly wanted to do this for the general principle, but it does save a little bit on object file size. I tried this on a clang bootstrap and saved 0.54% on the sum of object file sizes (1.14 MB out of 212 MB for a release build). Differential Revision: http://reviews.llvm.org/D3533 llvm-svn: 207670	2014-04-30 16:25:02 +00:00
Douglas Gregor	8451cdff2f	Fix a use of uninitialized memory in SmallVector's move-assignment operator. When we were moving from a larger vector to a smaller one but didn't need to re-allocate, we would move-assign over uninitialized memory in the target, then move-construct that same data again. llvm-svn: 207663	2014-04-30 15:49:06 +00:00
Matheus Almeida	c0284d118f	[mips] Emit all three relocation operations for each relocation entry on Mips64 big-endian systems. Summary: The N64 ABI allows up to three operations to be specified per relocation record independently of the endianness. Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3529 llvm-svn: 207636	2014-04-30 11:21:10 +00:00
Chandler Carruth	5217c94522	[LCG] Add the really, really boring edge insertion case: adding an edge entirely within an existing SCC. Shockingly, making the connected component more connected is ... a total snooze fest. =] Anyways, its wired up, and I even added a test case to make sure it pretty much sorta works. =D llvm-svn: 207631	2014-04-30 10:48:36 +00:00
NAKAMURA Takumi	d112b82066	raw_ostream::operator<<(StringRef): Avoid potential overflow in pointer arithmetic. (OutBufCur + Size) might overflow if Size were large. For example on i686-linux, OutBufCur: 0xFFFDF27D OutBufEnd: 0xFFFDF370 Size: 0x0002BF20 (180,000) It caused flaky error in MC/COFF/section-name-encoding.s. llvm-svn: 207621	2014-04-30 09:33:50 +00:00
Chandler Carruth	c5026b670e	[LCG] Actually test the basic edge removal bits (IE, the non-SCC bits), and discover that it's totally broken. Yay tests. Boo bug. Fix the basic edge removal so that it works by nulling out the removed edges rather than actually removing them. This leaves the indices valid in the map from callee to index, and preserves some of the locality for iterating over edges. The iterator is made bidirectional to reflect that it now has to skip over null entries, and the skipping logic is layered onto it. As future work, I would like to track essentially the "load factor" of the edge list, and when it falls below a threshold do a compaction. An alternative I considered (and continue to consider) is storing the callees in a doubly linked list where each element of the list is in a set (which is essentially the classical linked-hash-table datastructure). The problem with that approach is that either you need to heap allocate the linked list nodes and use pointers to them, or use a bucket hash table (with even more linked list pointer overhead!), etc. It's pretty easy to get 5x overhead for values that are just pointers. So far, I think punching holes in the vector, and periodic compaction is likely to be much more efficient overall in the space/time tradeoff. llvm-svn: 207619	2014-04-30 07:45:27 +00:00
Chandler Carruth	8b9663e8cc	[ADT] Provide some helpful static_asserts for using operations of the wrong iterator category. These aren't comprehensive, but they have caught the common cases for me and produce much nicer errors. llvm-svn: 207601	2014-04-30 00:49:32 +00:00
Benjamin Kramer	d59664f4f7	raw_ostream: Forward declare OpenFlags and include FileSystem.h only where necessary. llvm-svn: 207593	2014-04-29 23:26:49 +00:00
David Blaikie	35907d8e23	Fix MSVC build broken by r207580 Seems MSVC wants to be able to codegen inline-definitions of virtual functions even in TUs that don't define the key function - and it's well within its rights to do so. llvm-svn: 207581	2014-04-29 22:04:55 +00:00
David Blaikie	7a1e775a7e	PR19553: Memory leak in RuntimeDyldELF::createObjectImageFromFile This starts in MCJIT::getSymbolAddress where the unique_ptr<object::Binary> is release()d and (after a cast) passed to a single caller, MCJIT::addObjectFile. addObjectFile calls RuntimeDyld::loadObject. RuntimeDld::loadObject calls RuntimeDyldELF::createObjectFromFile And the pointer is never owned at this point. I say this point, because the alternative codepath, RuntimeDyldMachO::createObjectFile certainly does take ownership, so this seemed like a good hint that this was a/the right place to take ownership. llvm-svn: 207580	2014-04-29 21:52:46 +00:00
Andrea Di Biagio	a12dae37d5	[Windows] Fix assertion failure when passing 'nul' in input to clang. Before this patch, if 'nul' was passed in input to clang, function getStatus() (in Path.inc) always returned an instance of file_status with field 'nFileSizeHigh' and 'nFileSizeLow' left uninitialized. This was causing the triggering of an assertion failure in MemoryBuffer.cpp due to an invalid FileSize for device 'nul'. This patch fixes the assertion failure modifying the constructors of class file_status (in llvm/Support/FileSystem.h) so that every field of the class gets initialized to zero by default. A clang test will be submitted on a separate patch. llvm-svn: 207575	2014-04-29 20:17:28 +00:00
Duncan P. N. Exon Smith	bdc1e2abdb	BranchProb: Simplify printing code llvm-svn: 207559	2014-04-29 17:07:42 +00:00
Duncan P. N. Exon Smith	134b2af618	Support: Remove out-of-date comments The code is now shared... no need for a note. llvm-svn: 207555	2014-04-29 16:47:39 +00:00
Duncan P. N. Exon Smith	547183bf87	blockfreq: Defer to BranchProbability::scale() (again) Change `BlockFrequency` to defer to `BranchProbability::scale()` and `BranchProbability::scaleByInverse()`. This removes `BlockFrequency::scale()` from its API (and drops the ability to see the remainder), but the only user was the unit tests. If some code in the future needs an API that exposes the remainder, we can add something to `BranchProbability`, but I find that unlikely. llvm-svn: 207550	2014-04-29 16:31:29 +00:00
Duncan P. N. Exon Smith	d22bea7dad	blockfreq: Defer to BranchProbability::scale() `BlockMass` can now defer to `BranchProbability::scale()`. llvm-svn: 207547	2014-04-29 16:20:05 +00:00
Duncan P. N. Exon Smith	4ac56cf249	blockfreq: Remove BlockMassBlockMass Since `BlockMass` is an implementation detail and there are no current users of this, delete `BlockMass::operator=(BlockMass)`. I might need this when I try to strip out `UnsignedFloat`, but I can pull it back in at that point. llvm-svn: 207546	2014-04-29 16:20:01 +00:00
Duncan P. N. Exon Smith	415e7656f6	Support: Add BranchProbability::scale() and ::scaleByInverse() Add API to `BranchProbability` for scaling big integers. Next job is to rip the logic out of `BlockMass` and `BlockFrequency`. llvm-svn: 207544	2014-04-29 16:15:35 +00:00
Duncan P. N. Exon Smith	7fcce45847	Support: Simplify BranchProbability operators llvm-svn: 207541	2014-04-29 16:12:16 +00:00
Diego Novillo	34fc8a7c4c	Add optimization remarks to the loop unroller and vectorizer. Summary: This calls emitOptimizationRemark from the loop unroller and vectorizer at the point where they make a positive transformation. For the vectorizer, it reports vectorization and interleave factors. For the loop unroller, it reports all the different supported types of unrolling. Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3456 llvm-svn: 207528	2014-04-29 14:27:31 +00:00
Yaron Keren	aa0e88acbf	Updated the link to the correct URL. llvm-svn: 207523	2014-04-29 13:21:05 +00:00
Rafael Espindola	b60c829a2a	Centralize the handling of the thumb bit. This patch centralizes the handling of the thumb bit around MCStreamer::isThumbFunc and makes isThumbFunc handle aliases. This fixes a corner case, but the main advantage is having just one way to check if a MCSymbol is thumb or not. This should still be refactored to be ARM only, but at least now it is just one predicate that has to be refactored instead of 3 (isThumbFunc, ELF_Other_ThumbFunc, and SF_ThumbFunc). llvm-svn: 207522	2014-04-29 12:46:50 +00:00
Elena Demikhovsky	299cf511c4	AVX-512: optimized a shuffle pattern to VINSERTI64x4. Added intrinsics for VPERMT2PS/PD/D/Q instructions. llvm-svn: 207513	2014-04-29 09:09:15 +00:00
Chandler Carruth	3ab40727a7	[ADT] Make the iterator adaptor utility a touch more general by requiring full control over the various parameters to the std::iterator concept / trait thing. This is a precursor for adjusting these things to where you can write a bidirectional iterator wrapping a random access iterator with custom increment and decrement logic. llvm-svn: 207487	2014-04-29 01:57:35 +00:00
Chandler Carruth	d24465f443	[ADT] Teach PointerUnion to support assignment directly from nullptr to clear it out. llvm-svn: 207471	2014-04-29 00:14:27 +00:00
Rafael Espindola	bc91d7e25a	Add an option for evaluating past symbols. When evaluating an assembly expression for a relocation, we want to stop at MCSymbols that are in the symbol table, even if they are variables. This is needed since the semantics may require that the relocation use them. That is not the case when computing the value of a symbol in the symbol table. There are no relocations in this case and we have to keep going until we hit a section or find out that the expression doesn't have an assembly time value. llvm-svn: 207445	2014-04-28 20:53:11 +00:00
Duncan P. N. Exon Smith	a375e711f6	blockfreq: Remove extra typename from r207438 llvm-svn: 207439	2014-04-28 20:08:23 +00:00
Duncan P. N. Exon Smith	c5a3139ebd	Reapply "blockfreq: Approximate irreducible control flow" This reverts commit r207287, reapplying r207286. I'm hoping that declaring an explicit struct and instantiating `addBlockEdges()` directly works around the GCC crash from r207286. This is a lot more boilerplate, though. llvm-svn: 207438	2014-04-28 20:02:29 +00:00
Juergen Ributzka	4989255432	[PM] Add pass run listeners to the pass manager. This commit provides the necessary C/C++ APIs and infastructure to enable fine- grain progress report and safe suspension points after each pass in the pass manager. Clients can provide a callback function to the pass manager to call after each pass. This can be used in a variety of ways (progress report, dumping of IR between passes, safe suspension of threads, etc). The run listener list is maintained in the LLVMContext, which allows a multi- threaded client to be only informed for it's own thread. This of course assumes that the client created a LLVMContext for each thread. This fixes <rdar://problem/16728690> llvm-svn: 207430	2014-04-28 18:19:25 +00:00
Joerg Sonnenberger	4482dcd072	Fix comment llvm-svn: 207429	2014-04-28 18:11:51 +00:00
Chandler Carruth	c00a7ff4b7	[LCG] Add the most basic of edge insertion to the lazy call graph. This just handles the pre-DFS case. Also add some test cases for this case to make sure it works. llvm-svn: 207411	2014-04-28 11:10:23 +00:00
Chandler Carruth	1fcee98ddc	Fix very poor compile-time in PR19499 due to excessive tree walks in domtree. When finding a nearest common dominator, if neither A dominates B nor B dominates A, we immediately resorted to a tree walk. The tree walk here is particularly expensive because we have to build a (potentially very large) set for one side's dominators and compare it with the other side's. If at any point we have DFS info, we don't need to do any of this. We can just walk up one side's immediate dominators and return the first one which dominates the other side. Because of the DFS info, the dominates queries are trivially constant time. This reduces the optimizers time in the test case on PR19499 by 70%. It now optimizes in about 30 seconds for me. And there is still more to be done for this case. llvm-svn: 207406	2014-04-28 09:34:03 +00:00
Craig Topper	8c0b4d0791	Convert more SelectionDAG functions to use ArrayRef. llvm-svn: 207397	2014-04-28 05:57:50 +00:00
Craig Topper	e73658ddbb	[C++] Use 'nullptr'. llvm-svn: 207394	2014-04-28 04:05:08 +00:00
NAKAMURA Takumi	4495f83826	CodeGen/AsmPrinter.h: Fix \param in r207369. [-Wdocumentation] llvm-svn: 207384	2014-04-27 23:57:57 +00:00
Craig Topper	633d99b62d	Convert AddNodeIDNode and SelectionDAG::getNodeIfExiists to use ArrayRef<SDValue> llvm-svn: 207383	2014-04-27 23:22:43 +00:00
Rafael Espindola	466d66358d	Add emitThumbSet to the arm target streamer. This fixes the asm printer implementation and lets the parser be unaware of what .thumb_set is. llvm-svn: 207381	2014-04-27 20:23:58 +00:00
Craig Topper	2893b2e1da	Fix an assert I accidentally broke to hopefully fix the build bots. llvm-svn: 207380	2014-04-27 19:40:43 +00:00
Craig Topper	b2ba83cd30	Convert SelectionDAGISel::MorphNode to use ArrayRef. llvm-svn: 207379	2014-04-27 19:21:20 +00:00
Craig Topper	131de82adb	Convert SelectionDAG::MorphNodeTo to use ArrayRef. llvm-svn: 207378	2014-04-27 19:21:16 +00:00
Craig Topper	481fb2879f	Convert SelectionDAG::SelectNodeTo to use ArrayRef. llvm-svn: 207377	2014-04-27 19:21:11 +00:00
Craig Topper	dd5e16dd34	Convert one last signature of getNode to take an ArrayRef of SDUse. llvm-svn: 207376	2014-04-27 19:21:06 +00:00
Craig Topper	bb5330725e	Convert SDNode constructor to use ArrayRef. llvm-svn: 207375	2014-04-27 19:21:02 +00:00
Craig Topper	64941d9786	Convert SelectionDAG::getMergeValues to use ArrayRef. llvm-svn: 207374	2014-04-27 19:20:57 +00:00
Craig Topper	2d7d6052c6	Const-correct SelectionDAG::getAtomic. llvm-svn: 207373	2014-04-27 19:20:47 +00:00
Adrian Prantl	42a0d8c6ef	Clarify the doxygen comment for AsmPrinter::EmitDwarfRegOpPiece and add default arguments to the function. No functional change. llvm-svn: 207372	2014-04-27 18:50:45 +00:00
Adrian Prantl	d34db65c84	Debug info: Refactor EmitDwarfRegOpPiece to be a member function of AsmPrinter. No functional change. http://reviews.llvm.org/D3373 rdar://problem/15928306 llvm-svn: 207369	2014-04-27 18:25:45 +00:00
Rafael Espindola	aa0242723e	Make getOrCreateSymbolData non virtual. llvm-svn: 207367	2014-04-27 17:23:37 +00:00
Saleem Abdulrasool	a8b1f7204b	MC: create X86WinCOFFStreamer for target specific behaviour This introduces a target specific streamer, X86WinCOFFStreamer, which handles the target specific behaviour (e.g. WinEH). This is mostly to ensure that differences between ARM and X86 remain disjoint and do not accidentally cross boundaries. This is the final staging change for enabling object emission for Windows on ARM. llvm-svn: 207344	2014-04-27 03:48:12 +00:00
Saleem Abdulrasool	cf1a29ffee	MC: rename WinCOFFStreamer and move declaration out-of-line This is in preparation for promoting WinCOFFStreamer to a base class which will be shared by the X86 and ARM specific target COFF streamers. Also add a new getOrCreateSymbolData interface (like MCELFStreamer) for the ARM COFF Streamer. This makes the COFFStreamer more similar to the ELFStreamer. llvm-svn: 207343	2014-04-27 03:48:05 +00:00
Chandler Carruth	aa839b22c9	[LCG] Re-organize the methods for mutating a call graph to make their API requirements much more obvious. The key here is that there are two totally different use cases for mutating the graph. Prior to doing any SCC formation, it is very easy to mutate the graph. There may be users that want to do small tweaks here, and then use the already-built graph for their SCC-based operations. This method remains on the graph itself and is documented carefully as being cheap but unavailable once SCCs are formed. Once SCCs are formed, and there is some in-flight DFS building them, we have to be much more careful in how we mutate the graph. These mutation operations are sunk onto the SCCs themselves, which both simplifies things (the code was already there!) and helps make it obvious that these interfaces are only applicable within that context. The other primary constraint is that the edge being mutated is actually related to the SCC on which we call the method. This helps make it obvious that you cannot arbitrarily mutate some other SCC. I've tried to write much more complete documentation for the interesting mutation API -- intra-SCC edge removal. Currently one aspect of this documentation is a lie (the result list of SCCs) but we also don't even have tests for that API. =[ I'm going to add tests and fix it to match the documentation next. llvm-svn: 207339	2014-04-27 01:59:50 +00:00
Chandler Carruth	1129e9cec1	[LCG] Add some pedantry to the use of ptrdiff_t to appease build bots. llvm-svn: 207337	2014-04-26 22:59:28 +00:00
Chandler Carruth	27a5c6713b	[LCG] Eliminate more boiler plate by using the iterator facade base class. llvm-svn: 207336	2014-04-26 22:51:31 +00:00
Chandler Carruth	68ba2085d7	[LCG] Switch the node iterator to use the new fancy adaptor base. This is much cleaner, makes the iterator a full random access iterator, etc. llvm-svn: 207335	2014-04-26 22:43:56 +00:00
Benjamin Kramer	ccf45ebc24	Mark the growing path in SmallVector::push_back as cold. It's vital for performance that the cold path of push_back isn't inlined. llvm-svn: 207331	2014-04-26 20:10:49 +00:00
Craig Topper	206fcd450a	Convert getMemIntrinsicNode to take ArrayRef of SDValue instead of pointer and size. llvm-svn: 207329	2014-04-26 19:29:41 +00:00
Craig Topper	48d114bed1	Convert SelectionDAG::getNode methods to use ArrayRef<SDValue>. llvm-svn: 207327	2014-04-26 18:35:24 +00:00
Craig Topper	963c5d5ef8	Remove an unused version of getMemIntrinsicNode and getNode. Additionally, these were calling makeVTList with the pointers passed in which would were unlikely to belong to SelectionDAG and likely would have just been stack pointers. llvm-svn: 207326	2014-04-26 18:35:13 +00:00
Benjamin Kramer	4dae598bc8	DAGCombiner: Turn divs of vector splats into vectorized multiplications. Otherwise the legalizer would just scalarize everything. Support for mulhi in the targets isn't that great yet so on most targets we get exactly the same scalarized output. Add a test for x86 vector udiv. I had to disable the mulhi nodes on ARM because there aren't any patterns for it. As far as I know ARM has instructions for getting the high part of a multiply so this should be fixed. llvm-svn: 207315	2014-04-26 12:06:28 +00:00
Michael Zolotukhin	1a97a7bcbf	Revert r206749 till a final decision about the intrinsics is made. llvm-svn: 207313	2014-04-26 09:56:41 +00:00
Chandler Carruth	90821c2a93	[LCG] Rather than removing nodes from the SCC entry set when we process them, just skip over any DFS-numbered nodes when finding the next root of a DFS. This allows the entry set to just be a vector as we populate it from a uniqued source. It also removes the possibility for a linear scan of the entry set to actually do the removal which can make things go quadratic if we get unlucky. llvm-svn: 207312	2014-04-26 09:45:55 +00:00
Chandler Carruth	aca48d0443	[LCG] Hoist the main DFS loop out of the edge removal function. This makes working through the worklist much cleaner, and makes it possible to avoid the 'bool-to-continue-the-outer-loop' hack. Not a huge difference, but I think this is approaching as polished as I can make it. llvm-svn: 207310	2014-04-26 09:06:53 +00:00
Chandler Carruth	680af7a78c	[LCG] In the incremental SCC re-formation, lift the node currently being processed in the DFS out of the stack completely. Keep it exclusively in a variable. Re-shuffle some code structure to make this easier. This can have a very dramatic effect in some cases because call graphs tend to look like a high fan-out spanning tree. As a consequence, there are a large number of leaf nodes in the graph, and this technique causes leaf nodes to never even go into the stack. While this only reduces the max depth by 1, it may cause the total number of round trips through the stack to drop by a lot. Now, most of this isn't really relevant for the incremental version. =] But I wanted to prototype it first here as this variant is in ways more complex. As long as I can get the code factored well here, I'll next make the primary walk look the same. There are several refactorings this exposes I think. llvm-svn: 207306	2014-04-26 03:36:42 +00:00
Chandler Carruth	8f92d6db22	[LCG] Refactor the duplicated code I added in my last commit here into a helper function. Also factor the other two places where we did the same thing into the helper function. =] Much cleaner this way. NFC. llvm-svn: 207300	2014-04-26 01:03:46 +00:00
Duncan P. N. Exon Smith	42292ceaa9	Revert "blockfreq: Approximate irreducible control flow" This reverts commit r207286. It causes an ICE on the cmake-llvm-x86_64-linux buildbot [1]: llvm/lib/Analysis/BlockFrequencyInfo.cpp: In lambda function: llvm/lib/Analysis/BlockFrequencyInfo.cpp:182:1: internal compiler error: in get_expr_operands, at tree-ssa-operands.c:1035 [1]: http://bb.pgr.jp/builders/cmake-llvm-x86_64-linux/builds/12093/steps/build_llvm/logs/stdio llvm-svn: 207287	2014-04-25 23:16:58 +00:00
Duncan P. N. Exon Smith	384d0e8ad4	blockfreq: Approximate irreducible control flow Previously, irreducible backedges were ignored. With this commit, irreducible SCCs are discovered on the fly, and modelled as loops with multiple headers. This approximation specifies the headers of irreducible sub-SCCs as its entry blocks and all nodes that are targets of a backedge within it (excluding backedges within true sub-loops). Block frequency calculations act as if we insert a new block that intercepts all the edges to the headers. All backedges and entries to the irreducible SCC point to this imaginary block. This imaginary block has an edge (with even probability) to each header block. The result is now reasonable enough that I've added a number of testcases for irreducible control flow. I've outlined in `BlockFrequencyInfoImpl.h` ways to improve the approximation. <rdar://problem/14292693> llvm-svn: 207286	2014-04-25 23:08:57 +00:00
Tom Roeder	fd1bc602b3	Add an -mattr option to the gold plugin to support subtarget features in LTO This adds support for an -mattr option to the gold plugin and to llvm-lto. This allows the caller to specify details of the subtarget architecture, like +aes, or +ssse3 on x86. Note that this requires a change to the include/llvm-c/lto.h interface: it adds a function lto_codegen_set_attr and it increments the version of the interface. llvm-svn: 207279	2014-04-25 21:46:51 +00:00
Duncan P. N. Exon Smith	9f35117956	SCC: Use the reference typedef Actually use the `reference` typedef, and remove the private redefinition of `pointer` since it has no users. Using `reference` exposes a problem with r207257, which specified the wrong `value_type` to `iterator_facade_base` (fixed that too). llvm-svn: 207270	2014-04-25 20:52:08 +00:00
Adrian Prantl	32da88923a	This reapplies r207235 with an additional bugfixes caught by the msan buildbot - do not insert debug intrinsics before phi nodes. Debug info for optimized code: Support variables that are on the stack and described by DBG_VALUEs during their lifetime. Previously, when a variable was at a FrameIndex for any part of its lifetime, this would shadow all other DBG_VALUEs and only a single fbreg location would be emitted, which in fact is only valid for a small range and not the entire lexical scope of the variable. The included dbg-value-const-byref testcase demonstrates this. This patch fixes this by Local - emitting dbg.value intrinsics for allocas that are passed by reference - dropping all dbg.declares (they are now fully lowered to dbg.values) SelectionDAG - renamed constructors for SDDbgValue for better readability. - fix UserValue::match() to handle indirect values correctly - not inserting an MMI table entries for dbg.values that describe allocas. - lowering dbg.values that describe allocas into indirect DBG_VALUEs. CodeGenPrepare - leaving dbg.values for an alloca were they are (see comment) Other - regenerated/updated instcombine.ll testcase and included source rdar://problem/16679879 http://reviews.llvm.org/D3374 llvm-svn: 207269	2014-04-25 20:49:25 +00:00
David Blaikie	0651d7650a	MCAssembler: Simplify implementation of const variants of getSymbolData by calling one implementation from the other. Code review feedback by Rafael Espindola on r207124. llvm-svn: 207266	2014-04-25 20:19:11 +00:00
Duncan P. N. Exon Smith	da5eaeda01	blockfreq: Further shift logic to LoopData Move a lot of the loop-related logic that was sprinkled around the code into `LoopData`. <rdar://problem/14292693> llvm-svn: 207258	2014-04-25 18:47:04 +00:00
Duncan P. N. Exon Smith	eb6a582d13	SCC: Provide operator->() through iterator_facade_base Use the fancy new `iterator_facade_base` to add `scc_iterator::operator->()`. Remove other definitions where `iterator_facade_base` does the right thing. <rdar://problem/14292693> llvm-svn: 207257	2014-04-25 18:43:41 +00:00
Duncan P. N. Exon Smith	ef86928927	SCC: Remove non-const operator*() <rdar://problem/14292693> llvm-svn: 207254	2014-04-25 18:26:45 +00:00
Duncan P. N. Exon Smith	f4e1d6fd06	SCC: Doxygen-ize comments, NFC <rdar://problem/14292693> llvm-svn: 207251	2014-04-25 18:18:46 +00:00
Adrian Prantl	d2d9b76e48	Revert "This reapplies r207130 with an additional testcase+and a missing check for" This reverts commit 207235 to investigate msan buildbot breakage. llvm-svn: 207250	2014-04-25 18:18:09 +00:00
Duncan P. N. Exon Smith	a16a629ef6	SCC: Un-inline long functions These are long functions that really shouldn't be inlined. Otherwise, no functionality change. <rdar://problem/14292693> llvm-svn: 207249	2014-04-25 18:15:50 +00:00
Duncan P. N. Exon Smith	5547afed78	SCC: Remove redundant inline keywords, NFC Functions declared in line in a class are inlined by default. There's no reason for the `inline` keyword. <rdar://problem/14292693> llvm-svn: 207248	2014-04-25 18:10:23 +00:00
Saleem Abdulrasool	99f0d458c3	ARM: remove @llvm.arm.sevl This intrinsic is no longer needed with the new @llvm.arm.hint(i32) intrinsic which provides a generic, extensible manner for adding hint instructions. This functionality can now be represented as @llvm.arm.hint(i32 5). llvm-svn: 207246	2014-04-25 17:51:25 +00:00
Saleem Abdulrasool	7e7c2f9ca6	ARM: provide a new generic hint intrinsic Introduce the llvm.arm.hint(i32) intrinsic that can be used to inject hints into the instruction stream. This is particularly useful for generating IR from a compiler where the user may inject an intrinsic (e.g. __yield). These are then pattern substituted into the correct instruction which already existed. llvm-svn: 207242	2014-04-25 17:24:24 +00:00
Adrian Prantl	f5834a4b49	This reapplies r207130 with an additional testcase+and a missing check for AllocaInst that was missing in one location. Debug info for optimized code: Support variables that are on the stack and described by DBG_VALUEs during their lifetime. Previously, when a variable was at a FrameIndex for any part of its lifetime, this would shadow all other DBG_VALUEs and only a single fbreg location would be emitted, which in fact is only valid for a small range and not the entire lexical scope of the variable. The included dbg-value-const-byref testcase demonstrates this. This patch fixes this by Local - emitting dbg.value intrinsics for allocas that are passed by reference - dropping all dbg.declares (they are now fully lowered to dbg.values) SelectionDAG - renamed constructors for SDDbgValue for better readability. - fix UserValue::match() to handle indirect values correctly - not inserting an MMI table entries for dbg.values that describe allocas. - lowering dbg.values that describe allocas into indirect DBG_VALUEs. CodeGenPrepare - leaving dbg.values for an alloca were they are (see comment) Other - regenerated/updated instcombine.ll testcase and included source rdar://problem/16679879 http://reviews.llvm.org/D3374 llvm-svn: 207235	2014-04-25 17:01:00 +00:00
Craig Topper	f40110f4d8	[C++] Use 'nullptr'. Transforms edition. llvm-svn: 207196	2014-04-25 05:29:35 +00:00
Duncan P. N. Exon Smith	cb7d29d30c	blockfreq: Only one mass distribution per node Remove the concepts of "forward" and "general" mass distributions, which was wrong. The split might have made sense in an early version of the algorithm, but it's definitely wrong now. <rdar://problem/14292693> llvm-svn: 207195	2014-04-25 04:38:43 +00:00
Duncan P. N. Exon Smith	3f086789ff	blockfreq: Document high-level functions <rdar://problem/14292693> llvm-svn: 207191	2014-04-25 04:38:32 +00:00
Duncan P. N. Exon Smith	71f07451b6	blockfreq: Remove dead code <rdar://problem/14292693> llvm-svn: 207190	2014-04-25 04:38:30 +00:00
Duncan P. N. Exon Smith	46d9a56ce6	blockfreq: Separate unwrapLoops() from finalizeMetrics() <rdar://problem/14292693> llvm-svn: 207185	2014-04-25 04:38:17 +00:00
Duncan P. N. Exon Smith	50a1bb85b8	blockfreq: LoopData::MemberList => NodeList <rdar://problem/14292693> llvm-svn: 207184	2014-04-25 04:38:15 +00:00
Duncan P. N. Exon Smith	c9b7cfea2f	blockfreq: Expose getPackagedNode() Make `getPackagedNode()` a member function of `BlockFrequencyInfoImplBase` so that it's available for templated code. <rdar://problem/14292693> llvm-svn: 207183	2014-04-25 04:38:12 +00:00
Duncan P. N. Exon Smith	1cab8a0708	blockfreq: Store the header with the members <rdar://problem/14292693> llvm-svn: 207182	2014-04-25 04:38:09 +00:00
Duncan P. N. Exon Smith	39cc64827e	blockfreq: Encapsulate LoopData::Header <rdar://problem/14292693> llvm-svn: 207181	2014-04-25 04:38:06 +00:00
Duncan P. N. Exon Smith	4bbaff75e0	blockfreq: Embed Loop hierarchy in LoopData Continue refactoring to make `LoopData` first-class. Here I'm making the `LoopData` hierarchy explicit, instead of bouncing back and forth with `WorkingData`. This simplifies the logic and better matches the `LoopInfo` design. (Eventually, `LoopInfo` should be restructured so that it supports this pass, and `LoopData` can be removed.) <rdar://problem/14292693> llvm-svn: 207180	2014-04-25 04:38:03 +00:00
Duncan P. N. Exon Smith	d132040ed6	blockfreq: Use LoopData directly Instead of passing around loop headers, pass around `LoopData` directly. <rdar://problem/14292693> llvm-svn: 207179	2014-04-25 04:38:01 +00:00
Duncan P. N. Exon Smith	e005c7c496	blockfreq: Stop using range-based for to traverse Loops A follow-up commit will need the actual iterators. <rdar://problem/14292693> llvm-svn: 207178	2014-04-25 04:37:58 +00:00
Duncan P. N. Exon Smith	fc7dc93031	blockfreq: Use a std::list for Loops As pointed out by David Blaikie in code review, a `std::list<T>` is simpler than a `std::vector<std::unique_ptr<T>>`. Another option is a `std::deque<T>` (which allocates in chunks), but I'd like to leave open the option of inserting in the middle of the sequence for handling irreducible control flow on the fly. <rdar://problem/14292693> llvm-svn: 207177	2014-04-25 04:30:06 +00:00
Karthik Bhat	6a48f7d66e	Allow vectorization of bit intrinsics in BB Vectorizer. This patch adds support for vectorization of bit intrinsics such as bswap,ctpop,ctlz,cttz. llvm-svn: 207174	2014-04-25 03:33:48 +00:00
Adrian Prantl	6e5de2ea06	Revert "This reapplies r207130 with an additional testcase+and a missing check for" Typo in testcase. llvm-svn: 207166	2014-04-25 00:42:50 +00:00
Adrian Prantl	3512190ab3	This reapplies r207130 with an additional testcase+and a missing check for AllocaInst that was missing in one location. Debug info for optimized code: Support variables that are on the stack and described by DBG_VALUEs during their lifetime. Previously, when a variable was at a FrameIndex for any part of its lifetime, this would shadow all other DBG_VALUEs and only a single fbreg location would be emitted, which in fact is only valid for a small range and not the entire lexical scope of the variable. The included dbg-value-const-byref testcase demonstrates this. This patch fixes this by Local - emitting dbg.value intrinsics for allocas that are passed by reference - dropping all dbg.declares (they are now fully lowered to dbg.values) SelectionDAG - renamed constructors for SDDbgValue for better readability. - fix UserValue::match() to handle indirect values correctly - not inserting an MMI table entries for dbg.values that describe allocas. - lowering dbg.values that describe allocas into indirect DBG_VALUEs. CodeGenPrepare - leaving dbg.values for an alloca were they are (see comment) Other - regenerated/updated instcombine.ll testcase and included source rdar://problem/16679879 http://reviews.llvm.org/D3374 llvm-svn: 207165	2014-04-25 00:38:40 +00:00
Adrian Prantl	ff4282a204	Revert "Debug info for optimized code: Support variables that are on the stack and" This reverts commit 207130 for buildbot breakage. llvm-svn: 207162	2014-04-25 00:04:49 +00:00
Richard Smith	ab1cb0990d	Add missing include, found by modules build. llvm-svn: 207158	2014-04-24 23:29:25 +00:00
Richard Smith	80429c42ab	Function defined in a header should be inline. Found by modules build. llvm-svn: 207157	2014-04-24 23:14:32 +00:00
Chandler Carruth	d5835ee368	[ADT] Generalize pointee_iterator to smart pointers by using decltype. Based on review feedback from Dave on the original patch. llvm-svn: 207146	2014-04-24 21:10:35 +00:00
Reid Kleckner	3981faecbd	Remove dead inline function that doesn't compile MSVC doesn't diagnose this, interestingly. llvm-svn: 207144	2014-04-24 20:19:22 +00:00
Reid Kleckner	5772b77789	Add 'musttail' marker to call instructions This is similar to the 'tail' marker, except that it guarantees that tail call optimization will occur. It also comes with convervative IR verification rules that ensure that tail call optimization is possible. Reviewers: nicholas Differential Revision: http://llvm-reviews.chandlerc.com/D3240 llvm-svn: 207143	2014-04-24 20:14:34 +00:00
Richard Smith	0d9ec713e7	[modules] "Specialize" a function by actually specializing a function template rather than by adding an overload and hoping that it's declared before the code that calls it. (In a modules build, it isn't.) llvm-svn: 207133	2014-04-24 18:27:29 +00:00
Adrian Prantl	f4223918de	Debug info for optimized code: Support variables that are on the stack and described by DBG_VALUEs during their lifetime. Previously, when a variable was at a FrameIndex for any part of its lifetime, this would shadow all other DBG_VALUEs and only a single fbreg location would be emitted, which in fact is only valid for a small range and not the entire lexical scope of the variable. The included dbg-value-const-byref testcase demonstrates this. This patch fixes this by Local - emitting dbg.value intrinsics for allocas that are passed by reference - dropping all dbg.declares (they are now fully lowered to dbg.values) SelectionDAG - renamed constructors for SDDbgValue for better readability. - fix UserValue::match() to handle indirect values correctly - not inserting an MMI table entries for dbg.values that describe allocas. - lowering dbg.values that describe allocas into indirect DBG_VALUEs. CodeGenPrepare - leaving dbg.values for an alloca were they are (see comment) Other - regenerated/updated instcombine-intrinsics testcase and included source rdar://problem/16679879 http://reviews.llvm.org/D3374 llvm-svn: 207130	2014-04-24 17:41:45 +00:00
Andrea Di Biagio	d1ab866868	[X86] Add support for Read Time Stamp Counter x86 builtin intrinsics. This patch: - Adds two new X86 builtin intrinsics ('int_x86_rdtsc' and 'int_x86_rdtscp') as GCCBuiltin intrinsics; - Teaches the backend how to lower the two new builtins; - Introduces a common function to lower READCYCLECOUNTER dag nodes and the two new rdtsc/rdtscp intrinsics; - Improves (and extends) the existing x86 test 'rdtsc.ll'; now test 'rdtsc.ll' correctly verifies that both READCYCLECOUNTER and the two new intrinsics work fine for both 64bit and 32bit Subtargets. llvm-svn: 207127	2014-04-24 17:18:27 +00:00
David Blaikie	908f4d4bf5	Spread some const around for non-mutating uses of MCSymbolData. I discovered this const-hole while attempting to coalesnce the Symbol and SymbolMap data structures. There's some pending issues with that, but I figured this change was easy to flush early. llvm-svn: 207124	2014-04-24 16:59:40 +00:00
Chandler Carruth	24553934f8	[LCG] Incorporate the core trick of improvements on the naive Tarjan's algorithm here: http://dl.acm.org/citation.cfm?id=177301. The idea of isolating the roots has even more relevance when using the stack not just to implement the DFS but also to implement the recursive step. Because we use it for the recursive step, to isolate the roots we need to maintain two stacks: one for our recursive DFS walk, and another of the nodes that have been walked. The nice thing is that the latter will be half the size. It also fixes a complete hack where we scanned backwards over the stack to find the next potential-root to continue processing. Now that is always the top of the DFS stack. While this is a really nice improvement already (IMO) it further opens the door for two important simplifications: 1) De-duplicating some of the code across the two different walks. I've actually made the duplication a bit worse in some senses with this patch because the two are starting to converge. 2) Dramatically simplifying the loop structures of both walks. I wanted to do those separately as they'll be essentially just CFG restructuring. This patch on the other hand actually uses different datastructures to implement the algorithm itself. llvm-svn: 207098	2014-04-24 11:05:20 +00:00
Chandler Carruth	493e0a6ad0	[LCG] Switch the parent SCC tracking from a SmallSetVector to a SmallPtrSet. Currently, there is no need for stable iteration in this dimension, and I now thing there won't need to be going forward. If this is ever re-introduced in any form, it needs to not be a SetVector based solution because removal cannot be linear. There will be many SCCs with large numbers of parents. When encountering these, the incremental SCC update for intra-SCC edge removal was quadratic due to linear removal (kind of). I'm really hoping we can avoid having an ordering property here at all though... llvm-svn: 207091	2014-04-24 09:22:31 +00:00
Chandler Carruth	d52f8e0e4d	[LCG] We don't actually need a set in each SCC to track the nodes. We can use the node -> SCC mapping in the top-level graph to test this on the rare occasions we need it. llvm-svn: 207090	2014-04-24 08:55:36 +00:00
Chandler Carruth	944b9acddd	[LCG] Switch the SCC's parent iterators to be value iterators rather than pointer iterators. llvm-svn: 207086	2014-04-24 07:48:18 +00:00
Chandler Carruth	3478d4b164	[ADT] Attempt to appease another MSVC oddity by moving the injected class name usage into a context we can put typename on it. llvm-svn: 207084	2014-04-24 06:59:50 +00:00
Craig Topper	353eda484c	[C++] Use 'nullptr'. llvm-svn: 207083	2014-04-24 06:44:33 +00:00
Chandler Carruth	150a5f1dd3	[ADT] Try to appease MSVC by sinking the enable_if from a default template argument to a default argument to the constructor. llvm-svn: 207082	2014-04-24 06:16:12 +00:00
Chandler Carruth	a3211b5dca	Use the shiny new iterator adaptor tool to implement the value_op_iterator. llvm-svn: 207078	2014-04-24 05:33:53 +00:00
Chandler Carruth	2803df5ae6	[ADT] Factor out the facade aspect of the iterator_adaptor_base into its own CRTP base class for more general purpose use. Add some clarifying comments for the exact way in which the adaptor uses it. Hopefully this will help us write increasingly full featured iterators. This is becoming important as they start to be used heavily inside of ranges. llvm-svn: 207072	2014-04-24 04:07:06 +00:00
Chandler Carruth	9a6be8b3b1	[ADT] Add a generic iterator utility for adapting iterators much like Boost's iterator_adaptor, and a specific adaptor which iterates over pointees when wrapped around an iterator over pointers. This is the result of a long discussion on IRC with Duncan Smith, Dave Blaikie, Richard Smith, and myself. Essentially, I could use some subset of the iterator facade facilities often used from Boost, and everyone seemed interested in having the functionality in a reasonably generic form. I've tried to strike a balance between the pragmatism and the established Boost design. The primary differences are: 1) Delegating to the standard iterator interface names rather than special names that then make up a second iterator-like API. 2) Using the name 'pointee_iterator' which seems more clear than 'indirect_iterator'. The whole business of calling the '*p' operation 'pointer indirection' in the standard is ... quite confusing. And 'dereference' is no better of a term for moving from a pointer to a reference. Hoping Duncan, and others continue to provide comments on this until we've got a nice, minimal abstraction. llvm-svn: 207069	2014-04-24 03:31:23 +00:00
Chandler Carruth	6a4fee87bc	[LCG] Normalize the post-order SCC iterator to just iterate over the SCC values rather than having pointers in weird places. llvm-svn: 207053	2014-04-23 23:51:07 +00:00
Chandler Carruth	a800e28818	[LCG] Remove two unused typedefs from the iterators. llvm-svn: 207052	2014-04-23 23:51:02 +00:00
Chandler Carruth	bd5d3082c4	[LCG] Switch the primary node iterator to be a much more normal C++ iterator, returning a Node by reference on dereference. llvm-svn: 207048	2014-04-23 23:34:48 +00:00
Chandler Carruth	2a898e0df6	[LCG] Make the insertion and query paths into the LCG which cannot fail return references to better model this property. No functionality changed. llvm-svn: 207047	2014-04-23 23:20:36 +00:00
Chandler Carruth	a10e240377	[LCG] Switch the SCC lookup to be in terms of call graph nodes rather than functions. So far, this access pattern is much more common. It seems likely that any user of this interface is going to have nodes at the point that they are querying the SCCs. No functionality changed. llvm-svn: 207045	2014-04-23 23:12:06 +00:00
Jordan Rose	001080b375	Use std::less instead of < in array_pod_sort's default comparator. This makes array_pod_sort portably safe to use with pointers. llvm-svn: 207043	2014-04-23 22:44:11 +00:00
Justin Bogner	c67f0250ef	llvm-cov: Add support for gcov's --long-file-names option GCOV provides an option to prepend output file names with the source file name, to disambiguate between covered data that's included from multiple sources. Add a flag to llvm-cov that does the same. llvm-svn: 207035	2014-04-23 21:44:55 +00:00
Rafael Espindola	6992778176	Remove AssemblyAnnotationWriter from NamedMDNode::print. No functionality change, this parameter was always set to nullptr. Patch by Robert Matusewicz! llvm-svn: 206972	2014-04-23 12:23:05 +00:00
Evgeniy Stepanov	0a951b775e	Create MCTargetOptions. For now it contains a single flag, SanitizeAddress, which enables AddressSanitizer instrumentation of inline assembly. Patch by Yuri Gorshenin. llvm-svn: 206971	2014-04-23 11:16:03 +00:00
Simon Atanasyan	62fce0a975	[yaml2obj][ELF] Add a virtual destructor to the ELFYAML::Section class to prevent memory leaks. llvm-svn: 206969	2014-04-23 11:10:55 +00:00
Chandler Carruth	9302fbf0ae	[LCG] Add the first round of mutation support to the lazy call graph. This implements the core functionality necessary to remove an edge from the call graph and correctly update both the basic graph and the SCC structure. As part of that it has to run a tiny (in number of nodes) Tarjan-style DFS walk of an SCC being mutated to compute newly formed SCCs, etc. This is very rough and a WIP. I have a bunch of FIXMEs for code cleanup that will reduce the boilerplate in this change substantially. I also have a bunch of simplifications to various parts of both algorithms that I want to make, but first I'd like to have a more holistic picture. Ideally, I'd also like more testing. I'll probably add quite a few more unit tests as I go here to cover the various different aspects and corner cases of removing edges from the graph. Still, this is, so far, successfully updating the SCC graph in-place without disrupting the identity established for the existing SCCs even when we do challenging things like delete the critical edge that made an SCC cycle at all and have to reform things as a tree of smaller SCCs. Getting this to work is really critical for the new pass manager as it is going to associate significant state with the SCC instance and needs it to be stable. That is also the motivation behind the return of the newly formed SCCs. Eventually, I'll wire this all the way up to the public API so that the pass manager can use it to correctly re-enqueue newly formed SCCs into a fresh postorder traversal. llvm-svn: 206968	2014-04-23 11:03:03 +00:00
Chandler Carruth	cace6623c4	[LCG] Implement Tarjan's algorithm correctly this time. We have to walk up the stack finishing the exploration of each entries children before we're finished in addition to accounting for their low-links. Added a unittest that really hammers home the need for this with interlocking cycles that would each appear distinct otherwise and crash or compute the wrong result. As part of this, nuke a stale fixme and bring the rest of the implementation still more closely in line with the original algorithm. llvm-svn: 206966	2014-04-23 10:31:17 +00:00
Chandler Carruth	d27fc468a7	[LCG] Add some accessor methods to the SCC to allow iterating over the parents of an SCC, and add a lookup method for finding the SCC for a given function. These aren't used yet, but will be used shortly in some unit tests I'm adding and are really part of the broader intended interface for the analysis. llvm-svn: 206959	2014-04-23 09:57:18 +00:00
Chandler Carruth	c7bad9a5a0	[LCG] Add a unittest for the LazyCallGraph. I had a weak moment and resisted this for too long. Just with the basic testing here I was able to exercise the analysis in more detail and sift out both type signature bugs in the API and a bug in the DFS numbering. All of these are fixed here as well. The unittests will be much more important for the mutation support where it is necessary to craft minimal mutations and then inspect the state of the graph. There is just no way to do that with a standard FileCheck test. However, unittesting these kinds of analyses is really quite easy, especially as they're designed with the new pass manager where there is essentially no infrastructure required to rig up the core logic and exercise it at an API level. As a minor aside about the DFS numbering bug, the DFS numbering used in LCG is a bit unusual. Rather than numbering from 0, we number from 1, and use 0 as the sentinel "unvisited" state. Other implementations often use '-1' for this, but I find it easier to deal with 0 and it shouldn't make any real difference provided someone doesn't write silly bugs like forgetting to actually initialize the DFS numbering. Oops. ;] llvm-svn: 206954	2014-04-23 08:08:49 +00:00
Chandler Carruth	3f9869a8e2	[LCG] Hoist the logic for forming a new SCC from the top of the DFSStack into a helper function. I plan to re-use it for doing incremental DFS-based updates to the SCCs when we mutate the call graph. llvm-svn: 206948	2014-04-23 06:09:03 +00:00
Chandler Carruth	0b623baeb3	[LCG] Switch the Callee sets to be DenseMaps pointing to the index into the Callee list. This is going to be quite important to prevent removal from going quadratic. No functionality changed at this point, this is one of the refactoring patches I've broken out of my initial work toward mutation updates of the call graph. llvm-svn: 206938	2014-04-23 04:00:17 +00:00
Kevin Enderby	7ee97cebfc	Change the prototype for MCContext::FatalError() so it can be called from places like MCCodeEmitter() in the MC backend when the MCContext is const. I was going to use this in my change for r206669 but Jim convinced me to use an assert there. But this still is a good tweak. llvm-svn: 206923	2014-04-22 21:42:18 +00:00
Rui Ueyama	71a26346d3	Whitespace llvm-svn: 206919	2014-04-22 19:52:05 +00:00
Rui Ueyama	17a9a84f5c	No need to check condition after grow() r206916 was not logically the same as the previous code because the goto statements did not create loop. This should be the same as the previous code. llvm-svn: 206918	2014-04-22 19:47:26 +00:00
Rui Ueyama	70bcf4222e	Replace loops using goto with plain while loops Goto statements jumping into previous inner blocks are pretty confusing to read even though in this case they are valid. No reason to not use while loops there. llvm-svn: 206916	2014-04-22 19:07:14 +00:00
Kevin Enderby	96918bc406	Fix the assembler to print a better relocatable expression error diagnostic that includes location information. Currently if one has this assembly: .quad (0x1234 + (4 * SOME_VALUE)) where SOME_VALUE is undefined ones gets the less than useful error message with no location information: % clang -c x.s clang -cc1as: fatal error: error in backend: expected relocatable expression With this fix one now gets a more useful error message with location information: % clang -c x.s x.s:5:8: error: expected relocatable expression .quad (0x1234 + (4 * SOME_VALUE)) ^ To do this I plumbed the SMLoc through the MCObjectStreamer EmitValue() and EmitValueImpl() interfaces so it could be used when creating the MCFixup. rdar://12391022 llvm-svn: 206906	2014-04-22 17:27:29 +00:00
Tim Northover	e74fb0d7b9	AArch64/ARM64: mark fmul intrinsic as commutative. This gives DAG patterns matching indexed patterns where either side is an indexed vector. llvm-svn: 206875	2014-04-22 10:10:14 +00:00
Duncan P. N. Exon Smith	d1aec79d7a	blockfreq: Rename PackagedLoops => Loops llvm-svn: 206859	2014-04-22 03:31:50 +00:00
Duncan P. N. Exon Smith	2984a64bae	blockfreq: Use a pointer for ContainingLoop too llvm-svn: 206858	2014-04-22 03:31:44 +00:00
Duncan P. N. Exon Smith	e1423639bb	blockfreq: Use pointers to loops instead of an index Store pointers directly to loops inside the nodes. This could have been done without changing the type stored in `std::vector<>`. However, rather than computing the number of loops before constructing them (which `LoopInfo` doesn't provide directly), I've switched to a `vector<unique_ptr<LoopData>>`. This adds some heap overhead, but the number of loops is typically small. llvm-svn: 206857	2014-04-22 03:31:37 +00:00
Duncan P. N. Exon Smith	cc88ebfa5f	blockfreq: Rename PackagedLoopData => LoopData No functionality change. llvm-svn: 206855	2014-04-22 03:31:31 +00:00
Duncan P. N. Exon Smith	f2eb5bc3ff	blockfreq: Move PackagedLoopData above WorkingData llvm-svn: 206854	2014-04-22 03:31:25 +00:00
Duncan P. N. Exon Smith	84749e52a3	blockfreq: Remove "dead" comment llvm-svn: 206853	2014-04-22 03:31:23 +00:00
Chandler Carruth	1b9dde087e	[Modules] Remove potential ODR violations by sinking the DEBUG_TYPE define below all header includes in the lib/CodeGen/... tree. While the current modules implementation doesn't check for this kind of ODR violation yet, it is likely to grow support for it in the future. It also removes one layer of macro pollution across all the included headers. Other sub-trees will follow. llvm-svn: 206837	2014-04-22 02:02:50 +00:00
Rui Ueyama	97d484342c	Fix wrong iterator type ELFEntityIterator does not implement RandomAccessIterator. It does not even implement BidirectionalIterator. This patch fixes LLD build issue when compiled with MSVC2013 with debug: MSVC's find_if checks if the start iterator is before the end iterator in the sense of operator< if it declares implementing RandomAccessIterator. If a class does not have operator<, it fails to compile. llvm-svn: 206825	2014-04-21 23:00:42 +00:00
Chandler Carruth	e96dd8975f	[Modules] Make Support/Debug.h modular. This requires it to not change behavior based on other files defining DEBUG_TYPE, which means it cannot define DEBUG_TYPE at all. This is actually better IMO as it forces folks to define relevant DEBUG_TYPEs for their files. However, it requires all files that currently use DEBUG(...) to define a DEBUG_TYPE if they don't already. I've updated all such files in LLVM and will do the same for other upstream projects. This still leaves one important change in how LLVM uses the DEBUG_TYPE macro going forward: we need to only define the macro after header files have been #include-ed. Previously, this wasn't possible because Debug.h required the macro to be pre-defined. This commit removes that. By defining DEBUG_TYPE after the includes two things are fixed: - Header files that need to provide a DEBUG_TYPE for some inline code can do so by defining the macro before their inline code and undef-ing it afterward so the macro does not escape. - We no longer have rampant ODR violations due to including headers with different DEBUG_TYPE definitions. This may be mostly an academic violation today, but with modules these types of violations are easy to check for and potentially very relevant. Where necessary to suppor headers with DEBUG_TYPE, I have moved the definitions below the includes in this commit. I plan to move the rest of the DEBUG_TYPE macros in LLVM in subsequent commits; this one is big enough. The comments in Debug.h, which were hilariously out of date already, have been updated to reflect the recommended practice going forward. llvm-svn: 206822	2014-04-21 22:55:11 +00:00
David Blaikie	09757491d6	Use unique_ptr to manage ownership of GCOVFunctions, Blocks, and Edges. llvm-svn: 206796	2014-04-21 21:40:16 +00:00
David Blaikie	422b93dcf1	Use unique_ptr to manage objects owned by the ScheduleDAGMI. llvm-svn: 206784	2014-04-21 20:32:32 +00:00
Yi Jiang	d069f6393a	ARM64: Combine shifts and uses from different basic block to bit-extract instruction llvm-svn: 206774	2014-04-21 19:34:27 +00:00
Duncan P. N. Exon Smith	254689fcf9	blockfreq: Some cleanup of UnsignedFloat Change `PositiveFloat` to `UnsignedFloat`, and fix some of the comments to indicate that it's disappearing eventually. llvm-svn: 206771	2014-04-21 18:31:58 +00:00
Jim Grosbach	81ab4cc97a	Tidy up. Remove extraneous typedef. llvm-svn: 206768	2014-04-21 18:10:29 +00:00
Jim Grosbach	c5c881ee82	Object: iterator_range accessors for ObjectImage symbols and sections. llvm-svn: 206767	2014-04-21 18:10:26 +00:00
Duncan P. N. Exon Smith	10be9a8868	Reapply "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commit r206707, reapplying r206704. The preceding commit to CalcSpillWeights should have sorted out the failing buildbots. <rdar://problem/14292693> llvm-svn: 206766	2014-04-21 17:57:07 +00:00
Rafael Espindola	6956b1a517	Convert getFileOffset to getOffset and move it to its only user. We normally don't drop functions from the C API's, but in this case I think we can: * The old implementation of getFileOffset was fairly broken * The introduction of LLVMGetSymbolFileOffset was itself a C api breaking change as it removed LLVMGetSymbolOffset. * It is an incredibly specialized use case. The only reason MCJIT needs it is because of its odd position of being a dynamic linker of .o files. llvm-svn: 206750	2014-04-21 13:45:32 +00:00
Michael Zolotukhin	f2ba994bf6	Reapply r206732. This time without optimization of branches. llvm-svn: 206749	2014-04-21 12:01:33 +00:00
Chandler Carruth	572e3407c3	[PM] Add a new-PM-style CGSCC pass manager using the newly added LazyCallGraph analysis framework. Wire it up all the way through the opt driver and add some very basic testing that we can build pass pipelines including these components. Still a lot more to do in terms of testing that all of this works, but the basic pieces are here. There is a lot of boiler plate here. It's something I'm going to actively look at reducing, but I don't have any immediate ideas that don't end up making the code terribly complex in order to fold away the boilerplate. Until I figure out something to minimize the boilerplate, almost all of this is based on the code for the existing pass managers, copied and heavily adjusted to suit the needs of the CGSCC pass management layer. The actual CG management still has a bunch of FIXMEs in it. Notably, we don't do any updating of the CG as it is potentially invalidated. I wanted to get this in place to motivate the new analysis, and add update APIs to the analysis and the pass management layers in concert to make sure that the right APIs are present. llvm-svn: 206745	2014-04-21 11:12:00 +00:00
Benjamin Kramer	d2da720ead	[C++11] Replace OwningPtr with std::unique_ptr in places where it doesn't break the API. No functionality change. llvm-svn: 206740	2014-04-21 09:34:48 +00:00
Chandler Carruth	a2533a7bef	Revert r206732 which is causing llc to crash on most of the build bots. Original commit message: Implement builtins for safe division: safe.sdiv.iN, safe.udiv.iN, safe.srem.iN, safe.urem.iN (iN = i8, i61, i32, or i64). llvm-svn: 206735	2014-04-21 07:11:15 +00:00
Michael Zolotukhin	137a84616c	Implement builtins for safe division: safe.sdiv.iN, safe.udiv.iN, safe.srem.iN, safe.urem.iN (iN = i8, i16, i32, or i64). llvm-svn: 206732	2014-04-21 05:33:09 +00:00
David Blaikie	e9907ba16e	Protect the ArgList dtor It could even be made non-virtual if it weren't for bad compiler warnings. This demonstrates that ArgList objects aren't destroyed polymorphically and possibly that they aren't even used polymorphically. If that's the case, it might be possible to refactor the two ArgList types more separately and simplify the Arg ownership model. continues experimenting llvm-svn: 206727	2014-04-20 23:59:00 +00:00
Richard Smith	c5d5340eeb	Add missing #include found by modules build. llvm-svn: 206726	2014-04-20 23:39:19 +00:00
David Blaikie	f6e403f3c8	Remove comment that hasn't been true for 5 years llvm-svn: 206725	2014-04-20 22:40:43 +00:00
David Blaikie	f70b21a4b8	Use unique_ptr to handle ownership of synthesized args in DerivedArgList This might be able to be simplified further by using Arg as a value type in a linked list (to maintain pointer validity), but here's something simple to start with. llvm-svn: 206724	2014-04-20 22:37:46 +00:00
Simon Atanasyan	f54f8ff094	[Mips] Add more special values for the st_other field in the symbol table entry for MIPS. llvm-svn: 206716	2014-04-20 21:05:36 +00:00
Duncan P. N. Exon Smith	e63327e967	Revert "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commit r206704, as expected. llvm-svn: 206707	2014-04-19 22:46:00 +00:00
Duncan P. N. Exon Smith	875ddfac75	Reapply "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commit r206677, reapplying my BlockFrequencyInfo rewrite. I've done a careful audit, added some asserts, and fixed a couple of bugs (unfortunately, they were in unlikely code paths). There's a small chance that this will appease the failing bots [1][2]. (If so, great!) If not, I have a follow-up commit ready that will temporarily add -debug-only=block-freq to the two failing tests, allowing me to compare the code path between what the failing bots and what my machines (and the rest of the bots) are doing. Once I've triggered those builds, I'll revert both commits so the bots go green again. [1]: http://bb.pgr.jp/builders/ninja-x64-msvc-RA-centos6/builds/1816 [2]: http://llvm-amd64.freebsd.your.org/b/builders/clang-i386-freebsd/builds/18445 <rdar://problem/14292693> llvm-svn: 206704	2014-04-19 22:34:26 +00:00
Yaron Keren	d7ba46b287	Patch by Vadim Chugunov Win64 stack unwinder gets confused when execution flow "falls through" after a call to 'noreturn' function. This fixes the "missing epilogue" problem by emitting a trap instruction for IR 'unreachable' on x86_x64-pc-windows. A secondary use for it would be for anyone wanting to make double-sure that 'noreturn' functions, indeed, do not return. llvm-svn: 206684	2014-04-19 13:47:43 +00:00
Duncan P. N. Exon Smith	76b813619a	Revert "blockfreq: Rewrite BlockFrequencyInfoImpl" (#2 ) This reverts commit r206666, as planned. Still stumped on why the bots are failing. Sanitizer bots haven't turned anything up. If anyone can help me debug either of the failures (referenced in r206666) I'll owe them a beer. (In the meantime, I'll be auditing my patch for undefined behaviour.) llvm-svn: 206677	2014-04-19 00:42:46 +00:00
Justin Bogner	e808171628	OnDiskHashTable: Audit types and use offset_type consistently llvm-svn: 206675	2014-04-19 00:33:15 +00:00
Justin Bogner	4435e4157a	ProfileData: Avoid UB when reading llvm-svn: 206674	2014-04-19 00:33:12 +00:00
Justin Bogner	4bc13f6b47	OnDiskHashTable: Fix a think-o with offset_type llvm-svn: 206672	2014-04-18 23:50:07 +00:00

... 3 4 5 6 7 ...

20715 Commits