llvm-project

Commit Graph

Author	SHA1	Message	Date
Eli Bendersky	9966b26dac	Fix PR19270 - type mismatch caused by invalid optimization. Patch by Jingyue Wu. llvm-svn: 205547	2014-04-03 17:51:58 +00:00
Juergen Ributzka	c81000b8e9	Revert "[Constant Hoisting] Lazily compute the idom and cache the result." This code is no longer usefull, because we only compute and use the IDom once. There is no benefit in caching it anymore. llvm-svn: 205498	2014-04-03 01:38:47 +00:00
Duncan P. N. Exon Smith	4680f40d28	Revert "Reapply "LTO: add API to set strategy for -internalize"" This reverts commit r199244. Conflicts: include/llvm-c/lto.h include/llvm/LTO/LTOCodeGenerator.h lib/LTO/LTOCodeGenerator.cpp llvm-svn: 205471	2014-04-02 22:05:57 +00:00
Tim Northover	670df3d937	SLPVectorizer: compare entire intrinsic for SLP compatibility. Some Intrinsics are overloaded to the extent that return type equality (all that's been checked up to now) does not guarantee that the arguments are the same. In these cases SLP vectorizer should not recurse into the operands, which can be achieved by comparing them as "Function *" rather than simply the ID. llvm-svn: 205424	2014-04-02 14:39:02 +00:00
Hal Finkel	b0ebdc0f43	[LoopVectorizer] Count dependencies of consecutive pointers as uniforms For the purpose of calculating the cost of the loop at various vectorization factors, we need to count dependencies of consecutive pointers as uniforms (which means that the VF = 1 cost is used for all overall VF values). For example, the TSVC benchmark function s173 has: ... %3 = add nsw i64 %indvars.iv, 16000 %arrayidx8 = getelementptr inbounds %struct.GlobalData* @global_data, i64 0, i32 0, i64 %3 ... and we must realize that the add will be a scalar in order to correctly deduce it to be profitable to vectorize this on PowerPC with VSX enabled. In fact, all dependencies of a consecutive pointer must be a scalar (uniform), and so we simply need to add all consecutive pointers to the worklist that currently detects collects uniforms. Fixes PR19296. llvm-svn: 205387	2014-04-02 02:34:49 +00:00
Hal Finkel	6386cb8d4d	Add some additional fields to TTI::UnrollingPreferences In preparation for an upcoming commit implementing unrolling preferences for x86, this adds additional fields to the UnrollingPreferences structure: - PartialThreshold and PartialOptSizeThreshold - Like Threshold and OptSizeThreshold, but used when not fully unrolling. These are necessary because we need different thresholds for full unrolling from those used when partially unrolling (the full unrolling thresholds are generally going to be larger). - MaxCount - A cap on the unrolling factor when partially unrolling. This can be used by a target to prevent the unrolled loop from exceeding some resource limit independent of the loop size (such as number of branches). There should be no functionality change for any in-tree targets. llvm-svn: 205347	2014-04-01 18:50:30 +00:00
Hal Finkel	86b3064f2b	Move partial/runtime unrolling late in the pipeline The generic (concatenation) loop unroller is currently placed early in the standard optimization pipeline. This is a good place to perform full unrolling, but not the right place to perform partial/runtime unrolling. However, most targets don't enable partial/runtime unrolling, so this never mattered. However, even some x86 cores benefit from partial/runtime unrolling of very small loops, and follow-up commits will enable this. First, we need to move partial/runtime unrolling late in the optimization pipeline (importantly, this is after SLP and loop vectorization, as vectorization can drastically change the size of a loop), while keeping the full unrolling where it is now. This change does just that. llvm-svn: 205264	2014-03-31 23:23:51 +00:00
Arnold Schwaighofer	15262e6703	Revert "SLPVectorizer: Ignore users that are insertelements we can reschedule them" This reverts commit r205018. Conflicts: lib/Transforms/Vectorize/SLPVectorizer.cpp test/Transforms/SLPVectorizer/X86/insert-element-build-vector.ll This is breaking libclc build. llvm-svn: 205260	2014-03-31 23:05:56 +00:00
Rafael Espindola	5e66a7e699	Add a missing break. Patch by Tobias Güntner. I tried to write a test, but the only difference is the Changed value that gets returned. It can be tested with "opt -debug-pass=Executions -functionattrs, but that doesn't seem worth it. llvm-svn: 205121	2014-03-30 03:26:17 +00:00
Tim Northover	00ed9964c6	ARM64: initial backend import This adds a second implementation of the AArch64 architecture to LLVM, accessible in parallel via the "arm64" triple. The plan over the coming weeks & months is to merge the two into a single backend, during which time thorough code review should naturally occur. Everything will be easier with the target in-tree though, hence this commit. llvm-svn: 205090	2014-03-29 10:18:08 +00:00
Arnold Schwaighofer	c9d58e8d32	SLPVectorizer: Take credit for free extractelement instructions Extract element instructions that will be removed when vectorzing lower the cost. Patch by Arch D. Robison! llvm-svn: 205020	2014-03-28 17:21:32 +00:00
Arnold Schwaighofer	b0d3bcdd32	SLPVectorizer: Fix typos Patch by Arch D. Robison! llvm-svn: 205019	2014-03-28 17:21:27 +00:00
Arnold Schwaighofer	b190cb30c3	SLPVectorizer: Ignore users that are insertelements we can reschedule them Patch by Arch D. Robison! llvm-svn: 205018	2014-03-28 17:21:22 +00:00
Erik Verbruggen	5e1bac3a38	Revert "InstCombine: merge constants in both operands of icmp." This reverts commit r204912, and follow-up commit r204948. This introduced a performance regression, and the fix is not completely clear yet. llvm-svn: 205010	2014-03-28 14:50:57 +00:00
Erik Verbruggen	2074ebd8af	Revert "GVN: merge overflow intrinsics with non-overflow instructions." This reverts commit r203553, and follow-up commits r203558 and r203574. I will follow this up on the mailinglist to do it in a way that won't cause subtle PRE bugs. llvm-svn: 205009	2014-03-28 14:42:34 +00:00
Adrian Prantl	79c8e8f046	C++11: convert verbose loops to range-based loops. llvm-svn: 204981	2014-03-27 23:30:04 +00:00
Reid Kleckner	3bdf9bc48b	InstCombine: Don't combine constants on unsigned icmps Fixes a miscompile introduced in r204912. It would miscompile code like (unsigned)(a + -49) <= 5U. The transform would turn this into (unsigned)a < 55U, which would return true for values in [0, 49], when it should not. llvm-svn: 204948	2014-03-27 17:49:27 +00:00
Rafael Espindola	24a669d225	Prevent alias from pointing to weak aliases. This adds back r204781. Original message: Aliases are just another name for a position in a file. As such, the regular symbol resolutions are not applied. For example, given define void @my_func() { ret void } @my_alias = alias weak void ()* @my_func @my_alias2 = alias void ()* @my_alias We produce without this patch: .weak my_alias my_alias = my_func .globl my_alias2 my_alias2 = my_alias That is, in the resulting ELF file my_alias, my_func and my_alias are just 3 names pointing to offset 0 of .text. That is not the semantics of IR linking. For example, linking in a @my_alias = alias void ()* @other_func would require the strong my_alias to override the weak one and my_alias2 would end up pointing to other_func. There is no way to represent that with aliases being just another name, so the best solution seems to be to just disallow it, converting a miscompile into an error. llvm-svn: 204934	2014-03-27 15:26:56 +00:00
Erik Verbruggen	59a1219846	InstCombine: merge constants in both operands of icmp. Transform: icmp X+Cst2, Cst into: icmp X, Cst-Cst2 when Cst-Cst2 does not overflow, and the add has nsw. llvm-svn: 204912	2014-03-27 11:16:05 +00:00
Nick Lewycky	77d5fb40c8	Treat lifetime.start'd memory like we treat freshly alloca'd memory. Patch by Björn Steinbrink! llvm-svn: 204876	2014-03-26 23:45:15 +00:00
Reid Kleckner	23798a9731	CloneFunction: Clone all attributes, including the CC Summary: Tested with a unit test because we don't appear to have any transforms that use this other than ASan, I think. Fixes PR17935. Reviewers: nicholas CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D3194 llvm-svn: 204866	2014-03-26 22:26:35 +00:00
Rafael Espindola	65481d7b97	Revert "Prevent alias from pointing to weak aliases." This reverts commit r204781. I will follow up to with msan folks to see what is what they were trying to do with aliases to weak aliases. llvm-svn: 204784	2014-03-26 06:14:40 +00:00
Rafael Espindola	3b712a84a9	Prevent alias from pointing to weak aliases. Aliases are just another name for a position in a file. As such, the regular symbol resolutions are not applied. For example, given define void @my_func() { ret void } @my_alias = alias weak void ()* @my_func @my_alias2 = alias void ()* @my_alias We produce without this patch: .weak my_alias my_alias = my_func .globl my_alias2 my_alias2 = my_alias That is, in the resulting ELF file my_alias, my_func and my_alias are just 3 names pointing to offset 0 of .text. That is not the semantics of IR linking. For example, linking in a @my_alias = alias void ()* @other_func would require the strong my_alias to override the weak one and my_alias2 would end up pointing to other_func. There is no way to represent that with aliases being just another name, so the best solution seems to be to just disallow it, converting a miscompile into an error. llvm-svn: 204781	2014-03-26 04:48:47 +00:00
Juergen Ributzka	7be410f5d5	[Constant Hoisting] Make the constant candidate map local to the collectConstantCandidates method. llvm-svn: 204758	2014-03-25 21:21:10 +00:00
Richard Osborne	0af4aa9a19	[InstCombine] Don't fold bitcast into store if it would need addrspacecast Summary: Previously the code didn't check if the before and after types for the store were pointers to different address spaces. This resulted in instcombine using a bitcast to convert between pointers to different address spaces, causing an assertion due to the invalid cast. It is not be appropriate to use addrspacecast this case because it is not guaranteed to be a no-op cast. Instead bail out and do not do the transformation. CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D3117 llvm-svn: 204733	2014-03-25 17:21:41 +00:00
Richard Osborne	9805ec457d	Reuse earlier variables to make it clear the types involved in the cast. No functionality change. llvm-svn: 204732	2014-03-25 17:21:35 +00:00
Evgeniy Stepanov	fc742acc8c	[msan] More precise instrumentation of select IR. Some bits of select result may be initialized even if select condition is not. https://code.google.com/p/memory-sanitizer/issues/detail?id=50 llvm-svn: 204716	2014-03-25 13:08:34 +00:00
Andrew Trick	c8ac7ea261	SLP vectorizer: Don't hoist vector extracts of phis. Extracts coming from phis were being hoisted, while all others were sunk to their uses. This was inconsistent and didn't seem to serve a purpose. Changing all extracts to be sunk to uses is a prerequisite for adding block frequency to the SLP vectorizer's cost model. I benchmarked the change in isolation (without block frequency). I only saw noise on x86 and some potentially significant improvements on ARM. No major regressions is good enough for me. llvm-svn: 204699	2014-03-25 02:18:47 +00:00
Nuno Lopes	31617266ea	remove a bunch of unused private methods found with a smarter version of -Wunused-member-function that I'm playwing with. Appologies in advance if I removed someone's WIP code. include/llvm/CodeGen/MachineSSAUpdater.h \| 1 include/llvm/IR/DebugInfo.h \| 3 lib/CodeGen/MachineSSAUpdater.cpp \| 10 -- lib/CodeGen/PostRASchedulerList.cpp \| 1 lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp \| 10 -- lib/IR/DebugInfo.cpp \| 12 -- lib/MC/MCAsmStreamer.cpp \| 2 lib/Support/YAMLParser.cpp \| 39 --------- lib/TableGen/TGParser.cpp \| 16 --- lib/TableGen/TGParser.h \| 1 lib/Target/AArch64/AArch64TargetTransformInfo.cpp \| 9 -- lib/Target/ARM/ARMCodeEmitter.cpp \| 12 -- lib/Target/ARM/ARMFastISel.cpp \| 84 -------------------- lib/Target/Mips/MipsCodeEmitter.cpp \| 11 -- lib/Target/Mips/MipsConstantIslandPass.cpp \| 12 -- lib/Target/NVPTX/NVPTXISelDAGToDAG.cpp \| 21 ----- lib/Target/NVPTX/NVPTXISelDAGToDAG.h \| 2 lib/Target/PowerPC/PPCFastISel.cpp \| 1 lib/Transforms/Instrumentation/AddressSanitizer.cpp \| 2 lib/Transforms/Instrumentation/BoundsChecking.cpp \| 2 lib/Transforms/Instrumentation/MemorySanitizer.cpp \| 1 lib/Transforms/Scalar/LoopIdiomRecognize.cpp \| 8 - lib/Transforms/Scalar/SCCP.cpp \| 1 utils/TableGen/CodeEmitterGen.cpp \| 2 24 files changed, 2 insertions(+), 261 deletions(-) llvm-svn: 204560	2014-03-23 17:09:26 +00:00
Lang Hames	459b5dc39e	Revert r204076 for now - it caused significant regressions in a number of benchmarks. <rdar://problem/16368461> llvm-svn: 204558	2014-03-23 04:22:31 +00:00
Juergen Ributzka	e474752f4c	[Constant Hoisting] Erase dead cast instructions. The cleanup code that removes dead cast instructions only removed them from the basic block, but didn't delete them. This fix erases them now too. llvm-svn: 204538	2014-03-22 01:49:30 +00:00
Juergen Ributzka	e802d507b0	[Constant Hoisting] Fix multiple entries for the same basic block in PHI nodes. A PHI node usually has only one value/basic block pair per incoming basic block. In the case of a switch statement it is possible that a following PHI node may have more than one such pair per incoming basic block. E.g.: %0 = phi i64 [ 123456, %case2 ], [ 654321, %Entry ], [ 654321, %Entry ] This is valid and the verfier doesn't complain, because both values are the same. Constant hoisting materializes the constant for each operand separately and the value is still the same, but the variable names have changed. As a result the verfier can't recognize anymore that they are the same value and complains. This fix adds special update code for PHI node in constant hoisting to prevent this corner case. This fixes <rdar://problem/16394449> llvm-svn: 204537	2014-03-22 01:49:27 +00:00
Arnaud A. de Grandmaison	c97727a492	Remove some dead assignements found by scan-build llvm-svn: 204526	2014-03-21 21:54:46 +00:00
Tom Stellard	edfd81d965	Sink: Don't sink static allocas from the entry block CodeGen treats allocas outside the entry block as dynamically sized stack objects. llvm-svn: 204473	2014-03-21 15:51:51 +00:00
Juergen Ributzka	f0dff49ad0	[Constant Hoisting] Make the constant materialization cost operand dependent Extend the target hook to take also the operand index into account when calculating the cost of the constant materialization. Related to <rdar://problem/16381500> llvm-svn: 204435	2014-03-21 06:04:45 +00:00
Juergen Ributzka	500abd48d1	[Constant Hoisting] Lazily compute the idom and cache the result. Related to <rdar://problem/16381500> llvm-svn: 204434	2014-03-21 06:04:39 +00:00
Juergen Ributzka	5429c06b90	[Constant Hoisting] Change the algorithm to only track constants for instructions. Originally the algorithm would search for expensive constants and track their users, which could be instructions and constant expressions. This change only tracks the constants for instructions, but constant expressions are indirectly covered too. If an operand is an constant expression, then we look through the expression to find anny expensive constants. The algorithm keep now track of the instruction and the operand index where the constant is used. This allows more precise hoisting of constant materialization code for PHI instructions, because we only hoist to the basic block of the incoming operand. Before we had to find the idom of all PHI operands and hoist the materialization code there. This also makes updating of instructions easier. Before we had to keep track of the original constant, find it in the instructions, and then replace it. Now we can just simply update the operand. Related to <rdar://problem/16381500> llvm-svn: 204433	2014-03-21 06:04:36 +00:00
Juergen Ributzka	b8489b3167	[Constant Hoisting] Fix capitalization of function names. llvm-svn: 204432	2014-03-21 06:04:33 +00:00
Juergen Ributzka	a29a5b8475	[Constant Hoisting] Replace the MapVector with a separate Map and Vector to keep track of constant candidates. This simplifies working with the constant candidates and removes the tight coupling between the map and the vector. Related to <rdar://problem/16381500> llvm-svn: 204431	2014-03-21 06:04:30 +00:00
Juergen Ributzka	46357931ab	Revert "[Constant Hoisting] Extend coverage of the constant hoisting pass." I will break this up into smaller pieces for review and recommit. llvm-svn: 204393	2014-03-20 20:17:13 +00:00
Juergen Ributzka	6dab520c70	[Constant Hoisting] Extend coverage of the constant hoisting pass. This commit extends the coverage of the constant hoisting pass, adds additonal debug output and updates the function names according to the style guide. Related to <rdar://problem/16381500> llvm-svn: 204389	2014-03-20 19:55:52 +00:00
Mark Seaborn	b6118c5b17	Remove LowerInvoke's obsolete "-enable-correct-eh-support" option This option caused LowerInvoke to generate code using SJLJ-based exception handling, but there is no code left that interprets the jmp_buf stack that the resulting code maintained (llvm.sjljeh.jblist). This option has been obsolete for a while, and replaced by SjLjEHPrepare. This leaves the default behaviour of LowerInvoke, which is to convert invokes to calls. Differential Revision: http://llvm-reviews.chandlerc.com/D3136 llvm-svn: 204388	2014-03-20 19:54:47 +00:00
Alexander Potapenko	04969e8b31	[ASan] Do not instrument globals from the llvm.metadata section. Fixes https://code.google.com/p/address-sanitizer/issues/detail?id=279. llvm-svn: 204331	2014-03-20 10:48:34 +00:00
Evgeniy Stepanov	2275a01a44	Set debug info for instructions inserted in SplitBlockAndInsertIfThen. llvm-svn: 204230	2014-03-19 12:56:38 +00:00
Duncan P. N. Exon Smith	cb1c81afa0	Fix use_iterator crash in ObjCArc from r203364 The use_iterator redesign in r203364 introduced an increment past the end of a range in -objc-arc-contract. Added an explicit check for the end of the range. <rdar://problem/16333235> llvm-svn: 204195	2014-03-18 22:32:43 +00:00
Chandler Carruth	4c5001cc9c	[LV] While I'm here, use range based for loops which are so much cleaner for this kind of walk. llvm-svn: 204188	2014-03-18 22:00:32 +00:00
Chandler Carruth	ae324439d0	[LV] The actual change I intended to commit in r204148. Sorry for the noise. Original commit log: Replace some dead code with an assert. When I first ported this pass from a loop pass to a function pass I did so in the naive, recursive way. It doesn't actually work, we need a worklist instead. When I switched to the worklist I didn't delete the naive recursion. That recursion was also buggy because it was dead and never really exercised. llvm-svn: 204187	2014-03-18 21:58:38 +00:00
Chandler Carruth	f73079ca89	[LV] Replace some dead code with an assert. When I first ported this pass from a loop pass to a function pass I did so in the naive, recursive way. It doesn't actually work, we need a worklist instead. When I switched to the worklist I didn't delete the naive recursion. That recursion was also buggy because it was dead and never really exercised. llvm-svn: 204184	2014-03-18 21:51:46 +00:00
Evgeniy Stepanov	302964ee92	[msan] Origin tracking with history. LLVM part of MSan implementation of advanced origin tracking, when we record not only creation point, but all locations where an uninitialized value was stored to memory, too. llvm-svn: 204151	2014-03-18 13:30:56 +00:00
Diego Novillo	213bb00245	Tolerate unmangled names in sample profiles. Summary: The compiler does not always generate linkage names. If a function has been inlined and its body elided, its linkage name may not be generated. When the binary executes, the profiler will use its unmangled name when attributing samples. This results in unmangled names in the input profile. We are currently failing hard when this happens. However, in this case all that happens is that we fail to attribute samples to the inlined function. While this means fewer optimization opportunities, it should not cause a compilation failure. This patch accepts all valid function names, regardless of whether they were mangled or not. Reviewers: chandlerc CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D3087 llvm-svn: 204142	2014-03-18 12:03:12 +00:00
Evgeniy Stepanov	2aac07349d	[msan] Kill -msan-store-clean-origin flag. Not only is it slower than the alternative, but also subtly broken. This commit does not change the default behavior. llvm-svn: 204131	2014-03-18 09:47:06 +00:00
Alon Mishne	ad312155a6	[C++11] Change DebugInfoFinder to use range-based loops Also changes the iterators to return actual DI type over MDNode. llvm-svn: 204130	2014-03-18 09:41:07 +00:00
Evgeniy Stepanov	dba42d6652	[msan] Remove unused code. llvm-svn: 204125	2014-03-18 08:29:42 +00:00
Dan Gohman	172c5d3451	Use range metadata instead of introducing selects. When GlobalOpt has determined that a GlobalVariable only ever has two values, it would convert the GlobalVariable to a boolean, and introduce SelectInsts at every load, to choose between the two possible values. These SelectInsts introduce overhead and other unpleasantness. This patch makes GlobalOpt just add range metadata to loads from such GlobalVariables instead. This enables the same main optimization (as seen in test/Transforms/GlobalOpt/integer-bool.ll), without introducing selects. The main downside is that it doesn't get the memory savings of shrinking such GlobalVariables, but this is expected to be negligible. llvm-svn: 204076	2014-03-17 19:57:04 +00:00
Eli Bendersky	576ef3c667	Consistent use of the noduplicate attribute. The "noduplicate" attribute of call instructions is sometimes queried directly and sometimes through the cannotDuplicate() predicate. This patch streamlines all queries to use the cannotDuplicate() predicate. It also adds this predicate to InvokeInst, to mirror what CallInst has. llvm-svn: 204049	2014-03-17 16:19:07 +00:00
David Blaikie	6107968f55	Remove named Twine. While technically correct, we generally disallow any instance of named Twines due to their subtlety. llvm-svn: 204016	2014-03-16 01:36:18 +00:00
Arnaud A. de Grandmaison	75c9e6dedf	Remove some dead assignements found by scan-build llvm-svn: 204013	2014-03-15 22:13:15 +00:00
Benjamin Kramer	62fb0cfb97	LSR: Compress a pair (and get rid of the DenseMapInfo for it). Also convert a horrible hash function to use our hashing infrastructure. No functionality change. llvm-svn: 204008	2014-03-15 17:17:48 +00:00
NAKAMURA Takumi	6cf7e5d341	SampleProfile.cpp: Fix take #2 . The issue was abuse of StringRef here. llvm-svn: 203996	2014-03-15 01:56:17 +00:00
NAKAMURA Takumi	bfb172897e	SampleProfile.cpp: Quick fix to r203976 about abuse of Twine. The life of Twine was too short. FIXME: DiagnosticInfoSampleProfile should not hold Twine&. llvm-svn: 203990	2014-03-15 00:10:12 +00:00
Diego Novillo	709590838f	Re-format SampleProfile.cpp with clang-format. No functional changes. llvm-svn: 203977	2014-03-14 22:07:18 +00:00
Diego Novillo	a32aa3251c	Use DiagnosticInfo facility. Summary: The sample profiler pass emits several error messages. Instead of just aborting the compiler with report_fatal_error, we can emit better messages using DiagnosticInfo. This adds a new sub-class of DiagnosticInfo to handle the sample profiler. Reviewers: chandlerc, qcolombet CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D3086 llvm-svn: 203976	2014-03-14 21:58:59 +00:00
Alexander Potapenko	b76ea32834	[ASan] Fix https://code.google.com/p/address-sanitizer/issues/detail?id=274 by ignoring globals from __TEXT,__cstring,cstring_literals during instrumenation. Add a regression test. llvm-svn: 203916	2014-03-14 10:41:49 +00:00
Stepan Dyatkovskiy	a53cf970a1	MergeFunctions, cmpType: fixed variable names from XXTy1 and XXTy2 to XXTyL and XXTyR. llvm-svn: 203907	2014-03-14 08:48:52 +00:00
Stepan Dyatkovskiy	90c4436962	MergeFunctions, cmpType: Fixed comments wrapping. llvm-svn: 203905	2014-03-14 08:17:19 +00:00
Owen Anderson	9b8f9c3d95	Fix a bug in InstCombine where we would incorrectly attempt to construct a bitcast between pointers of two different address spaces if they happened to have the same pointer size. llvm-svn: 203862	2014-03-13 22:51:43 +00:00
Evgeniy Stepanov	7ab838eb56	[msan] Fix handling of byval arguments in VarArg calls. llvm-svn: 203794	2014-03-13 13:17:11 +00:00
Stepan Dyatkovskiy	d8eb0bcb5b	First patch of patch series that improves MergeFunctions performance time from O(NN) to O(Nlog(N)). The idea is to introduce total ordering among functions set. That allows to build binary tree and perform function look-up procedure in O(log(N)) time. This patch description: Introduced total ordering among Type instances. Actually it is improvement for existing isEquivalentType. 0. Coerce pointer of 0 address space to integer. 1. If left and right types are equal (the same Type* value), return 0 (means equal). 2. If types are of different kind (different type IDs). Return result of type IDs comparison, treating them as numbers. 3. If types are vectors or integers, return result of its pointers comparison (casted to numbers). 4. Check whether type ID belongs to the next group: * Void * Float * Double * X86_FP80 * FP128 * PPC_FP128 * Label * Metadata If so, return 0. 5. If left and right are pointers, return result of address space comparison (numbers comparison). 6. If types are complex. Then both LEFT and RIGHT will be expanded and their element types will be checked with the same way. If we get Res != 0 on some stage, return it. Otherwise return 0. 7. For all other cases put llvm_unreachable. llvm-svn: 203788	2014-03-13 11:54:50 +00:00
Mark Seaborn	07e7486128	Fix typo in comment: "inwoke" -> "invoke" llvm-svn: 203739	2014-03-13 00:04:17 +00:00
Raul E. Silvera	62f0236d36	Resubmit "[SLPV] Recognize vectorizable intrinsics during SLP vectorization ..." This reverts commit 86cb795388643710dab34941ddcb5a9470ac39d8. The problems previously found have been resolved through other CLs. llvm-svn: 203707	2014-03-12 20:21:50 +00:00
Hans Wennborg	b73c0b041d	Allow switch-to-lookup table for tables with holes by adding bitmask check This allows us to generate table lookups for code such as: unsigned test(unsigned x) { switch (x) { case 100: return 0; case 101: return 1; case 103: return 2; case 105: return 3; case 107: return 4; case 109: return 5; case 110: return 6; default: return f(x); } } Since cases 102, 104, etc. are not constants, the lookup table has holes in those positions. We therefore guard the table lookup with a bitmask check. Patch by Jasper Neumann! llvm-svn: 203694	2014-03-12 18:35:40 +00:00
Evan Cheng	ad6efbfa0f	Revert r203488 and r203520. llvm-svn: 203687	2014-03-12 18:09:37 +00:00
Eli Bendersky	95b540f221	Revive SizeOptLevel-explaining comments that were dropped in r203669 llvm-svn: 203675	2014-03-12 16:44:17 +00:00
Eli Bendersky	49f6565267	Move duplicated code into a helper function (exposed through overload). There's a bit of duplicated "magic" code in opt.cpp and Clang's CodeGen that computes the inliner threshold from opt level and size opt level. This patch moves the code to a function that lives alongside the inliner itself, providing a convenient overload to the inliner creation. A separate patch can be committed to Clang to use this once it's committed to LLVM. Standalone tools that use the inlining pass can also avoid duplicating this code and fearing it will go out of sync. Note: this patch also restructures the conditinal logic of the computation to be cleaner. llvm-svn: 203669	2014-03-12 16:12:36 +00:00
Alon Mishne	07d949f39a	Cloning a function now also clones its debug metadata if 'ModuleLevelChanges' is true. llvm-svn: 203662	2014-03-12 14:42:51 +00:00
Erik Verbruggen	3f5dcc97e0	Fix crash in PRE. After r203553 overflow intrinsics and their non-intrinsic (normal) instruction get hashed to the same value. This patch prevents PRE from moving an instruction into a predecessor block, and trying to add a phi node that gets two different types (the intrinsic result and the non-intrinsic result), resulting in a failing assert. llvm-svn: 203574	2014-03-11 15:07:32 +00:00
Tim Northover	e94a518a22	IR: add a second ordering operand to cmpxhg for failure The syntax for "cmpxchg" should now look something like: cmpxchg i32* %addr, i32 42, i32 3 acquire monotonic where the second ordering argument gives the required semantics in the case that no exchange takes place. It should be no stronger than the first ordering constraint and cannot be either "release" or "acq_rel" (since no store will have taken place). rdar://problem/15996804 llvm-svn: 203559	2014-03-11 10:48:52 +00:00
Erik Verbruggen	aab3cfe023	GVN: fix hashing of extractvalue. My last commit did not add the indexes to the hashed value for extractvalue. Adding that back in. llvm-svn: 203558	2014-03-11 10:21:30 +00:00
Erik Verbruggen	e2d437148a	GVN: merge overflow intrinsics with non-overflow instructions. When an overflow intrinsic is followed by a non-overflow instruction, replace the latter with an extract. For example: %sadd = tail call { i32, i1 } @llvm.sadd.with.overflow.i32(i32 %a, i32 %b) %sadd3 = add i32 %a, %b Here the add statement will be replaced by an extract. When an overflow intrinsic follows a non-overflow instruction, a clone of the intrinsic is inserted before the normal instruction, which makes it the same as the previous case. Subsequent runs of GVN can then clean up the duplicate instructions and insert the extract. This fixes PR8817. llvm-svn: 203553	2014-03-11 09:36:48 +00:00
Duncan P. N. Exon Smith	cec1c2486a	Cleanup whitespace llvm-svn: 203529	2014-03-11 02:44:45 +00:00
Evan Cheng	bf371db951	Follow up to r203488. Code clean up to eliminate a lot of copy+paste. llvm-svn: 203520	2014-03-11 00:24:20 +00:00
Diego Novillo	92aa8c220a	Use discriminator information in sample profiles. Summary: When the sample profiles include discriminator information, use the discriminator values to distinguish instruction weights in different basic blocks. This modifies the BodySamples mapping to map <line, discriminator> pairs to weights. Instructions on the same line but different blocks, will use different discriminator values. This, in turn, means that the blocks may have different weights. Other changes in this patch: - Add tests for positive values of line offset, discriminator and samples. - Change data types from uint32_t to unsigned and int and do additional validation. Reviewers: chandlerc CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2857 llvm-svn: 203508	2014-03-10 22:41:28 +00:00
Benjamin Kramer	3ef5e46b6d	MemCpyOpt: When merging memsets also merge the trivial case of two memsets with the same destination. The testcase is from PR19092, but I think the bug described there is actually a clang issue. llvm-svn: 203489	2014-03-10 21:05:13 +00:00
Evan Cheng	0e8f4612a9	For functions with ARM target specific calling convention, when simplify-libcall optimize a call to a llvm intrinsic to something that invovles a call to a C library call, make sure it sets the right calling convention on the call. e.g. extern double pow(double, double); double t(double x) { return pow(10, x); } Compiles to something like this for AAPCS-VFP: define arm_aapcs_vfpcc double @t(double %x) #0 { entry: %0 = call double @llvm.pow.f64(double 1.000000e+01, double %x) ret double %0 } declare double @llvm.pow.f64(double, double) #1 Simplify libcall (part of instcombine) will turn the above into: define arm_aapcs_vfpcc double @t(double %x) #0 { entry: %__exp10 = call double @__exp10(double %x) #1 ret double %__exp10 } declare double @__exp10(double) The pre-instcombine code works because calls to LLVM builtins are special. Instruction selection will chose the right calling convention for the call. However, the code after instcombine is wrong. The call to __exp10 will use the C calling convention. I can think of 3 options to fix this. 1. Make "C" calling convention just work since the target should know what CC is being used. This doesn't work because each function can use different CC with the "pcs" attribute. 2. Have Clang add the right CC keyword on the calls to LLVM builtin. This will work but it doesn't match the LLVM IR specification which states these are "Standard C Library Intrinsics". 3. Fix simplify libcall so the resulting calls to the C routines will have the proper CC keyword. e.g. %__exp10 = call arm_aapcs_vfpcc double @__exp10(double %x) #1 This works and is the solution I implemented here. Both solutions #2 and #3 would work. After carefully considering the pros and cons, I decided to implement #3 for the following reasons. 1. It doesn't change the "spec" of the intrinsics. 2. It's a self-contained fix. There are a couple of potential downsides. 1. There could be other places in the optimizer that is broken in the same way that's not addressed by this. 2. There could be other calling conventions that need to be propagated by simplify-libcall that's not handled. But for now, this is the fix that I'm most comfortable with. llvm-svn: 203488	2014-03-10 20:49:45 +00:00
Benjamin Kramer	79da941f3a	SimplifyCFG: Simplify the weight scaling algorithm. No change in functionality. llvm-svn: 203413	2014-03-09 14:42:55 +00:00
Ahmed Charles	5d461ede5b	Fix build break. llvm-svn: 203366	2014-03-09 03:50:36 +00:00
Chandler Carruth	cdf4788401	[C++11] Add range based accessors for the Use-Def chain of a Value. This requires a number of steps. 1) Move value_use_iterator into the Value class as an implementation detail 2) Change it to actually be a Use iterator rather than a User iterator. 3) Add an adaptor which is a User iterator that always looks through the Use to the User. 4) Wrap these in Value::use_iterator and Value::user_iterator typedefs. 5) Add the range adaptors as Value::uses() and Value::users(). 6) Update all of the callers to correctly distinguish between whether they wanted a use_iterator (and to explicitly dig out the User when needed), or a user_iterator which makes the Use itself totally opaque. Because #6 requires churning essentially everything that walked the Use-Def chains, I went ahead and added all of the range adaptors and switched them to range-based loops where appropriate. Also because the renaming requires at least churning every line of code, it didn't make any sense to split these up into multiple commits -- all of which would touch all of the same lies of code. The result is still not quite optimal. The Value::use_iterator is a nice regular iterator, but Value::user_iterator is an iterator over Users rather than over the User objects themselves. As a consequence, it fits a bit awkwardly into the range-based world and it has the weird extra-dereferencing 'operator->' that so many of our iterators have. I think this could be fixed by providing something which transforms a range of T&s into a range of Ts, but that can be separated into another patch, and it isn't yet 100% clear whether this is the right move. However, this change gets us most of the benefit and cleans up a substantial amount of code around Use and User. =] llvm-svn: 203364	2014-03-09 03:16:01 +00:00
Benjamin Kramer	adf1ea8227	[C++11] Revert uses of lambdas with array_pod_sort. Looks like GCC implements the lambda->function pointer conversion differently. llvm-svn: 203294	2014-03-07 21:52:38 +00:00
Benjamin Kramer	b0f74b24fa	[C++11] Convert sort predicates into lambdas. No functionality change. llvm-svn: 203288	2014-03-07 21:35:39 +00:00
Tim Northover	fad2761ca0	InstCombine: form shuffles from wider range of insert/extractelements Sequences of insertelement/extractelements are sometimes used to build vectorsr; this code tries to put them back together into shuffles, but could only produce a completely uniform shuffle types (<N x T> from two <N x T> sources). This should allow shuffles with different numbers of elements on the input and output sides as well. llvm-svn: 203229	2014-03-07 10:24:44 +00:00
Ahmed Charles	56440fd820	Replace OwningPtr<T> with std::unique_ptr<T>. This compiles with no changes to clang/lld/lldb with MSVC and includes overloads to various functions which are used by those projects and llvm which have OwningPtr's as parameters. This should allow out of tree projects some time to move. There are also no changes to libs/Target, which should help out of tree targets have time to move, if necessary. llvm-svn: 203083	2014-03-06 05:51:42 +00:00
Chandler Carruth	7da14f1ab9	[Layering] Move InstVisitor.h into the IR library as it is pretty obviously coupled to the IR. llvm-svn: 203064	2014-03-06 03:23:41 +00:00
Chandler Carruth	9a4c9e597b	[Layering] Move DebugInfo.h into the IR library where its implementation already lives. llvm-svn: 203046	2014-03-06 00:46:21 +00:00
Chandler Carruth	12664a0b17	[Layering] Move DIBuilder.h into the IR library where its implementation already lives. llvm-svn: 203038	2014-03-06 00:22:06 +00:00
Arnold Schwaighofer	ab12363c02	LoopVectorizer: Preserve fast-math flags Fixes PR19045. llvm-svn: 203008	2014-03-05 21:10:47 +00:00
Chandler Carruth	9205140772	[Layering] Move DebugLoc.h into the IR library. The implementation already lived there and it is where it belongs -- this is the in-memory debug location representation. This is just cleanup -- Modules can actually cope with this, but that doesn't make it right. After chatting with folks that have out-of-tree stuff, going ahead and moving the rest of the headers seems preferable. llvm-svn: 202960	2014-03-05 10:30:38 +00:00
Chandler Carruth	64e9aa5c93	[C++11] Make this interface accept const Use pointers and use override to ensure we don't mess up any of the overrides. Necessary for cleaning up the Value use iterators and enabling range-based traversing of use lists. llvm-svn: 202958	2014-03-05 10:21:48 +00:00
Ahmed Charles	96c9d95f51	[C++11] Replace OwningPtr::take() with OwningPtr::release(). llvm-svn: 202957	2014-03-05 10:19:29 +00:00
Craig Topper	3e4c697ca1	[C++11] Add 'override' keyword to virtual methods that override their base class. llvm-svn: 202953	2014-03-05 09:10:37 +00:00
Chandler Carruth	8cd041ef19	[Modules] Move the ConstantRange class into the IR library. This is a bit surprising, as the class is almost entirely abstracted away from any particular IR, however it encodes the comparsion predicates which mutate ranges as ICmp predicate codes. This is reasonable as they're used for both instructions and constants. Thus, it belongs in the IR library with instructions and constants. llvm-svn: 202838	2014-03-04 12:24:34 +00:00

1 2 3 4 5 ...

11351 Commits