llvm-project

Commit Graph

Author	SHA1	Message	Date
Chandler Carruth	d177f86124	[SROA] Don't preserve the IR names in release builds. This is espcially important because the new SROA pass goes to great lengths to provide helpful names for debugging, and as a consequence they can become very slow to render. Good for between 5% and 15% of the SROA runtime on some slow test cases such as the one in PR15412. llvm-svn: 177495	2013-03-20 07:30:36 +00:00
Chandler Carruth	0941b66283	Move the endif to the correct line so we don't have warnings about unused statistics variables. llvm-svn: 177494	2013-03-20 06:47:00 +00:00
Chandler Carruth	5f5b616344	Introduce some new statistics to help track the exact behavior of the new SROA pass. llvm-svn: 177493	2013-03-20 06:30:46 +00:00
Quentin Colombet	2393cb92b8	Update global merge pass according to Duncan's advices: - Remove useless includes - Change misleading comments - Move code into doFinalization llvm-svn: 177445	2013-03-19 21:46:49 +00:00
Bill Wendling	04d57c7b2c	Register the GCOV writeout functions so that they're emitted serially. We don't want to write out >1000 files at the same time. That could make things prohibitively expensive. Instead, register the "writeout" function so that it's emitted serially. <rdar://problem/12439551> llvm-svn: 177437	2013-03-19 21:03:22 +00:00
Arnaud A. de Grandmaison	87c473f0d1	IndVarSimplify: do not recompute an IV value outside of the loop if : - it is trivially known to be used inside the loop in a way that can not be optimized away - there is no use outside of the loop which can take advantage of the computation hoisting llvm-svn: 177432	2013-03-19 20:00:22 +00:00
Andrew Trick	f3a2544dba	Revert "Cleanup some SCEV logic a bit." This reverts commit 82cd8f7382322bee7a71cdc31f7a923c44d37d32. Just add a comment instead! llvm-svn: 177377	2013-03-19 05:10:27 +00:00
Andrew Trick	de78866594	Cleanup some SCEV logic a bit. Make the code more obvious to scan-build and humans. llvm-svn: 177375	2013-03-19 04:14:59 +00:00
Andrew Trick	a1c01ba8c7	Tighten up an internal LSR API that should check for NULL. No test case, but should fix a scan_build warning. llvm-svn: 177374	2013-03-19 04:14:57 +00:00
Nick Lewycky	d67186337a	Emit the linkage name instead of the function name, when available. This means that we'll prefer to emit the mangled C++ name (pending a clang change). llvm-svn: 177371	2013-03-19 01:37:55 +00:00
Jakub Staszak	bc421efddf	Make method private. Keep coding standard. llvm-svn: 177348	2013-03-18 23:31:30 +00:00
Bill Wendling	c3cab816bb	Register the flush function for each compile unit. For each compile unit, we want to register a function that will flush that compile unit. Otherwise, __gcov_flush() would only flush the counters within the current compile unit, and not any outside of it. PR15191 & <rdar://problem/13167507> llvm-svn: 177340	2013-03-18 23:04:39 +00:00
Quentin Colombet	8fc340976d	Extend global merge pass to optionally consider global constant variables. Also add some checks to not merge globals used within landing pad instructions or marked as "used". llvm-svn: 177331	2013-03-18 22:30:07 +00:00
Kostya Serebryany	10cc12f2b7	[asan] when creating string constants, set unnamed_attr and align 1 so that equal strings are merged by the linker. Observed up to 1% binary size reduction. Thanks to Anton Korobeynikov for the suggestion llvm-svn: 177264	2013-03-18 09:38:39 +00:00
Chandler Carruth	f74654d274	Mark internal classes as POD-like to get better behavior out of SmallVector and DenseMap. This speeds up SROA by 25% on PR15412. llvm-svn: 177259	2013-03-18 08:36:46 +00:00
Kostya Serebryany	bd016bb614	[asan] while generating the description of a global variable, emit the module name in a separate field, thus not duplicating this information if every description. This decreases the binary size (observed up to 3%). https://code.google.com/p/address-sanitizer/issues/detail?id=168 . This changes the asan API version. llvm-part llvm-svn: 177254	2013-03-18 08:05:29 +00:00
Kostya Serebryany	6b5b58deeb	[asan] don't instrument functions with available_externally linkage. This saves a bit of compile time and reduces the number of redundant global strings generated by asan (https://code.google.com/p/address-sanitizer/issues/detail?id=167 ) llvm-svn: 177250	2013-03-18 07:33:49 +00:00
Arnold Schwaighofer	c63cf3a0ae	LoopVectorize: Invert case when we use a vector cmp value to query select cost We generate a select with a vectorized condition argument when the condition is NOT loop invariant. Not the other way around. llvm-svn: 177098	2013-03-14 18:54:36 +00:00
Shuxin Yang	2eca602f8b	Perform factorization as a last resort of unsafe fadd/fsub simplification. Rules include: 1)1 xy +/- xz => x*(y +/- z) (the order of operands dosen't matter) 2) y/x +/- z/x => (y +/- z)/x The transformation is disabled if the new add/sub expr "y +/- z" is a denormal/naz/inifinity. rdar://12911472 llvm-svn: 177088	2013-03-14 18:08:26 +00:00
Alexey Samsonov	819eddc3ce	[ASan] emit instrumentation for initialization order checking by default llvm-svn: 177063	2013-03-14 12:38:58 +00:00
Chandler Carruth	a1c54bbe34	PR14972: SROA vs. GVN exposed a really bad bug in SROA. The fundamental problem is that SROA didn't allow for overly wide loads where the bits past the end of the alloca were masked away and the load was sufficiently aligned to ensure there is no risk of page fault, or other trapping behavior. With such widened loads, SROA would delete the load entirely rather than clamping it to the size of the alloca in order to allow mem2reg to fire. This was exposed by a test case that neatly arranged for GVN to run first, widening certain loads, followed by an inline step, and then SROA which miscompiles the code. However, I see no reason why this hasn't been plaguing us in other contexts. It seems deeply broken. Diagnosing all of the above took all of 10 minutes of debugging. The really annoying aspect is that fixing this completely breaks the pass. ;] There was an implicit reliance on the fact that no loads or stores extended past the alloca once we decided to rewrite them in the final stage of SROA. This was used to encode information about whether the loads and stores had been split across multiple partitions of the original alloca. That required threading explicit tracking of whether a use of a partition is split across multiple partitions. Once that was done, another problem arose: we allowed splitting of integer loads and stores iff they were loads and stores to the entire alloca. This is a really arbitrary limitation, and splitting at least some integer loads and stores is crucial to maximize promotion opportunities. My first attempt was to start removing the restriction entirely, but currently that does Very Bad Things by causing many common alloca patterns to be fully decomposed into i8 operations and lots of or-ing together to produce larger integers on demand. The code bloat is terrifying. That is still the right end-goal, but substantial work must be done to either merge partitions or ensure that small i8 values are eagerly merged in some other pass. Sadly, figuring all this out took essentially all the time and effort here. So the end result is that we allow splitting only when the load or store at least covers the alloca. That ensures widened loads and stores don't hurt SROA, and that we don't rampantly decompose operations more than we have previously. All of this was already fairly well tested, and so I've just updated the tests to cover the wide load behavior. I can add a test that crafts the pass ordering magic which caused the original PR, but that seems really brittle and to provide little benefit. The fundamental problem is that widened loads should Just Work. llvm-svn: 177055	2013-03-14 11:32:24 +00:00
Nick Lewycky	307a1d03b5	Remove accidentally committed debug line. llvm-svn: 177005	2013-03-14 05:19:12 +00:00
Nick Lewycky	fdfed3e9c9	Refactor GCOV's six constructor arguments into a struct with a getter that constructs default arguments. It can now take default arguments from cl::opt'ions. Add a new -default-gcov-version=... option, and actually test it! Sink the reverse-order of the version into GCOVProfiling, hiding it from our users. llvm-svn: 177002	2013-03-14 05:13:26 +00:00
Nick Lewycky	ad145509eb	No functionality change. Rename emitGCNO() to the more sensible emitProfileNotes(), similar to emitProfileArcs(). Also update its comment. Also add a comment on Version[4] (there will be another comment in clang later), and compress lines that exceeded 80 columns. llvm-svn: 176994	2013-03-13 22:55:42 +00:00
Arnaud A. de Grandmaison	7153305b92	Fix a performance regression when combining to smaller types in icmp (shl %v, C1), C2 : Only combine when the shl is only used by the icmp llvm-svn: 176950	2013-03-13 14:40:37 +00:00
Dan Gohman	00253592c7	Change the order of the operands in patchAndReplaceAllUsesWith so that they're more consistent with Value::replaceAllUsesWith. llvm-svn: 176872	2013-03-12 16:22:56 +00:00
Meador Inge	20255ef24d	LibCallSimplifier: optimize speed for short-lived instances Nadav reported a performance regression due to the work I did to merge the library call simplifier into instcombine [1]. The issue is that a new LibCallSimplifier object is being created whenever InstCombiner::runOnFunction is called. Every time a LibCallSimplifier object is used to optimize a call it creates a hash table to map from a function name to an object that optimizes functions of that name. For short-lived LibCallSimplifier instances this is quite inefficient. Especially for cases where no calls are actually simplified. This patch fixes the issue by dropping the hash table and implementing an explicit lookup function to correlate the function name to the object that optimizes functions of that name. This avoids the cost of always building and destroying the hash table in cases where the LibCallSimplifier object is short-lived and avoids the cost of building the table when no simplifications are actually preformed. On a benchmark containing 100,000 calls where none of them are simplified I noticed a 30% speedup. On a benchmark containing 100,000 calls where all of them are simplified I noticed an 8% speedup. [1] http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20130304/167639.html llvm-svn: 176840	2013-03-12 00:08:29 +00:00
Bill Wendling	9534d8885f	Don't remove a landing pad if the invoke requires a table entry. An invoke may require a table entry. For instance, when the function it calls is expected to throw. <rdar://problem/13360379> llvm-svn: 176827	2013-03-11 20:53:00 +00:00
Nick Lewycky	5f50854186	Use LLVMBool instead of 'bool' in the C API. Based on a patch by Peter Zotov! llvm-svn: 176793	2013-03-10 21:58:22 +00:00
Hal Finkel	f610be9f36	BBVectorize: Fixup debugging statements After the recent data-structure improvements, a couple of debugging statements were broken (printing pointer values). llvm-svn: 176791	2013-03-10 20:57:42 +00:00
Benjamin Kramer	6eda79f69a	Remove a source of nondeterminism from the LoopVectorizer. This made us emit runtime checks in a random order. Hopefully bootstrap miscompares will go away now. llvm-svn: 176775	2013-03-09 19:22:40 +00:00
Arnold Schwaighofer	8b3dc09400	LoopVectorizer: Ignore all dbg intrinisic Ignore all DbgIntriniscInfo instructions instead of just DbgValueInst. llvm-svn: 176769	2013-03-09 16:27:27 +00:00
Arnold Schwaighofer	4090b61ac3	LoopVectorizer: Ignore dbg.value instructions We want vectorization to happen at -g. Ignore calls to the dbg.value intrinsic and don't transfer them to the vectorized code. radar://13378964 llvm-svn: 176768	2013-03-09 15:56:34 +00:00
Jakub Staszak	2ef36b633b	Simplify code. No functionality change. llvm-svn: 176765	2013-03-09 11:18:59 +00:00
Nick Lewycky	291df6ec42	Use the correct index variable. This is the meat of what was supposed to be in r176751. Also, learn a lesson about applying patches by hand/eyeball. llvm-svn: 176764	2013-03-09 10:13:26 +00:00
Nick Lewycky	03aed11cdb	Fix bug introduced in r176616 when making function identifier numbers stable. Count the subprograms, not the compile units. llvm-svn: 176751	2013-03-09 02:06:37 +00:00
Nick Lewycky	88f1d0d64e	Don't emit the extra checksum into the .gcda file if the user hasn't asked for it. Fortunately, versions of gcov that predate the extra checksum also ignore any extra data, so this isn't a problem. There will be a matching commit in compiler-rt. llvm-svn: 176745	2013-03-09 01:33:06 +00:00
Benjamin Kramer	37c2d65c5a	Insert the reduction start value into the first bypass block to preserve domination. Fixes PR15344. llvm-svn: 176701	2013-03-08 16:58:37 +00:00
Jakub Staszak	fd56611b49	Keep coding stanard. llvm-svn: 176661	2013-03-07 22:20:06 +00:00
Jakub Staszak	db4579d796	Don't create IRBuilder if we can return from the method earlier. llvm-svn: 176660	2013-03-07 22:10:33 +00:00
Pekka Jaaskelainen	093cf41e86	Fixed a crash when cloning a function into a function with different size argument list and without attributes in the arguments. llvm-svn: 176632	2013-03-07 16:46:43 +00:00
Nick Lewycky	492afe8127	Switch from a version 4.2/4.4 switch to a four-byte version string to be put into the actual gcov file. Instead of using the bottom 4 bytes as the function identifier, use a counter. This makes the identifier numbers stable across multiple runs. llvm-svn: 176616	2013-03-07 08:28:49 +00:00
Andrew Trick	a0a5ca06b9	SimplifyCFG fix for volatile load/store. Fixes rdar:13349374. Volatile loads and stores need to be preserved even if the language standard says they are undefined. "volatile" in this context means "get out of the way compiler, let my platform handle it". Additionally, this is the only way I know of with llvm to write to the first page (when hardware allows) without dropping to assembly. llvm-svn: 176599	2013-03-07 01:03:35 +00:00
Andrew Trick	fcb37243f9	Generalize my previous fix for -print-options. Always print options that differ from their implicit default. At least for simple option types. llvm-svn: 176572	2013-03-06 19:04:56 +00:00
Andrew Trick	946c2b32e6	Give -loop-vectorize an explicit default. This way, clang -mllvm -print-options shows that the driver is overriding it. llvm-svn: 176569	2013-03-06 18:22:22 +00:00
Jim Grosbach	95d2eb95c3	InstCombine: Don't shrink allocas when combining with a bitcast. When considering folding a bitcast of an alloca into the alloca itself, make sure we don't shrink the amount of memory being allocated, or things rapidly go sideways. rdar://13324424 llvm-svn: 176547	2013-03-06 05:44:53 +00:00
Lang Hames	30be8a30cc	Check isDiscardableIfUnused, rather than hasLocalLinkage, when bumping GlobalValue linkage up to ExternalLinkage in the ExtractGV pass. This prevents linkonce and linkonce_odr symbols from being DCE'd. llvm-svn: 176459	2013-03-04 22:40:44 +00:00
Preston Gurd	485296d1e8	Bypass Slow Divides * Only apply divide bypass optimization when not optimizing for size. * Fixed bug caused by constant for 0 value of type Int32, used dividend type to generate the constant instead. * For atom x86-64 apply the divide bypass to use 16-bit divides instead of 64-bit divides when operand values are small enough. * Added lit tests for 64-bit divide bypass. Patch by Tyler Nowicki! llvm-svn: 176442	2013-03-04 18:13:57 +00:00
Nadav Rotem	739e37a0d2	PR14448 - prevent the loop vectorizer from vectorizing the same loop twice. The LoopVectorizer often runs multiple times on the same function due to inlining. When this happens the loop vectorizer often vectorizes the same loops multiple times, increasing code size and adding unneeded branches. With this patch, the vectorizer during vectorization puts metadata on scalar loops and marks them as 'already vectorized' so that it knows to ignore them when it sees them a second time. PR14448. llvm-svn: 176399	2013-03-02 01:33:49 +00:00
Peter Collingbourne	1b97a9c82a	Modify {Call,Invoke}Inst::addAttribute to take an AttrKind. llvm-svn: 176397	2013-03-02 01:20:18 +00:00
Benjamin Kramer	12f98fae98	LoopVectorize: Don't hang forever if a PHI only has skipped PHI uses. Fixes PR15384. llvm-svn: 176366	2013-03-01 19:07:31 +00:00
Quentin Colombet	e684a6d4aa	Fix a bug in instcombine for fmul in fast math mode. The instcombine recognized pattern looks like: a = b * c d = a +/- Cst or a = b * c d = Cst +/- a When creating the new operands for fadd or fsub instruction following the related fmul, the first operand was created with the second original operand (M0 was created with C1) and the second with the first (M1 with Opnd0). The fix consists in creating the new operands with the appropriate original operand, i.e., M0 with Opnd0 and M1 with C1. llvm-svn: 176300	2013-02-28 21:12:40 +00:00
Evgeniy Stepanov	00062b4498	[msan] Implement sanitize_memory attribute. Shadow checks are disabled and memory loads always produce fully initialized values in functions that don't have a sanitize_memory attribute. Value and argument shadow is propagated as usual. This change also updates blacklist behaviour to match the above. llvm-svn: 176247	2013-02-28 11:25:14 +00:00
Evgeniy Stepanov	4c9300e630	Remove unused leftover declarations. llvm-svn: 176240	2013-02-28 08:42:11 +00:00
Benjamin Kramer	dc145816fd	LoopVectorize: Vectorize math builtin calls. This properly asks TargetLibraryInfo if a call is available and if it is, it can be translated into the corresponding LLVM builtin. We don't vectorize sqrt() yet because I'm not sure about the semantics for negative numbers. The other intrinsic should be exact equivalents to the libm functions. Differential Revision: http://llvm-reviews.chandlerc.com/D465 llvm-svn: 176188	2013-02-27 15:24:19 +00:00
Nick Lewycky	6fd43e4071	In GCC 4.7, function names are now forbidden from .gcda files. Support this by passing a null pointer to the function name in to GCDAProfiling, and add another switch onto GCOVProfiling. llvm-svn: 176173	2013-02-27 06:22:56 +00:00
Nick Lewycky	625f395663	Doh, fix behaviour change introduced in r176168 which is tested in clang, not llvm. llvm-svn: 176172	2013-02-27 06:21:30 +00:00
Nadav Rotem	464e807d41	For each function that we optimize we initialize a new list of lib functions. For each function name we malloc memory. This patch changes the Libcall map to use BumpPtrAllocator. Now we malloc only once. This speeds up instcombine by a few % on a large c++ program. llvm-svn: 176170	2013-02-27 05:53:43 +00:00
Nick Lewycky	8e94d80aab	IRBuilder has grown all sorts of useful utility functions. Make use of them to clean up this code a tiny bit. No functionality change. llvm-svn: 176168	2013-02-27 05:46:30 +00:00
Pedro Artigas	e40467b589	Enhance integer division emulation support to handle types smaller than 32 bits, enhancement done the trivial way; by extending inputs and truncating outputs which is addequate for targets with little or no support for integer arithmetic on integer types less than 32 bits. llvm-svn: 176139	2013-02-26 23:33:20 +00:00
Kostya Serebryany	cf880b9443	Unify clang/llvm attributes for asan/tsan/msan (LLVM part) These are two related changes (one in llvm, one in clang). LLVM: - rename address_safety => sanitize_address (the enum value is the same, so we preserve binary compatibility with old bitcode) - rename thread_safety => sanitize_thread - rename no_uninitialized_checks -> sanitize_memory CLANG: - add __attribute__((no_sanitize_address)) as a synonym for __attribute__((no_address_safety_analysis)) - add __attribute__((no_sanitize_thread)) - add __attribute__((no_sanitize_memory)) for S in address thread memory If -fsanitize=S is present and __attribute__((no_sanitize_S)) is not set llvm attribute sanitize_S llvm-svn: 176075	2013-02-26 06:58:09 +00:00
Benjamin Kramer	ee40b9a2d4	CVP: If we have a PHI with an incoming select, try to skip the select. This is a common pattern with dyn_cast and similar constructs, when the PHI no longer depends on the select it can often be turned into a simpler construct or even get hoisted out of the loop. PR15340. llvm-svn: 175995	2013-02-24 15:34:43 +00:00
Michael Gottesman	f4b7761ed7	Fixed a careless mistake. rdar://13273675. llvm-svn: 175939	2013-02-23 00:31:32 +00:00
Bill Wendling	09bd1f71ee	Implement the NoBuiltin attribute. The 'nobuiltin' attribute is applied to call sites to indicate that LLVM should not treat the callee function as a built-in function. I.e., it shouldn't try to replace that function with different code. llvm-svn: 175835	2013-02-22 00:12:35 +00:00
Renato Golin	cf928cb53f	Allow GlobalValues to vectorize with AliasAnalysis Storing the load/store instructions with the values and inspect them using Alias Analysis to make sure they don't alias, since the GEP pointer operand doesn't take the offset into account. Trying hard to not add any extra cost to loads and stores that don't overlap on global values, AA is only calculated if all of the previous attempts failed. Using biggest vector register size as the stride for the vectorization access, as we're being conservative and the cost model (which calculates the real vectorization factor) is only run after the legalization phase. We might re-think this relationship in the future, but for now, I'd rather be safe than sorry. llvm-svn: 175818	2013-02-21 22:39:03 +00:00
Chad Rosier	9b7f9c3e9e	Remove dead code and whitespace. llvm-svn: 175804	2013-02-21 21:40:51 +00:00
Chad Rosier	4d87d45a05	Update a comment that looks to have been accidentally deleted many moons ago. llvm-svn: 175658	2013-02-20 20:15:55 +00:00
Kostya Serebryany	699ac28aa5	[asan] instrument invoke insns with noreturn attribute (as well as call insns) llvm-svn: 175617	2013-02-20 12:35:15 +00:00
Jakub Staszak	ae2fd9c97d	Remove unused variable. llvm-svn: 175568	2013-02-19 22:17:58 +00:00
Jakub Staszak	3c6583a1b1	Minor cleanups. No functionality change. llvm-svn: 175567	2013-02-19 22:14:45 +00:00
Jakub Staszak	90fbe91c58	Remove unneeded #includes. llvm-svn: 175565	2013-02-19 22:06:38 +00:00
Jakub Staszak	086f6cde5d	Fix typos. llvm-svn: 175562	2013-02-19 22:02:21 +00:00
Kostya Serebryany	3ece9beaf1	[asan] instrument memory accesses with unusual sizes This patch makes asan instrument memory accesses with unusual sizes (e.g. 5 bytes or 10 bytes), e.g. long double or packed structures. Instrumentation is done with two 1-byte checks (first and last bytes) and if the error is found __asan_report_load_n(addr, real_size) or __asan_report_store_n(addr, real_size) is called. Also, call these two new functions in memset/memcpy instrumentation. asan-rt part will follow. llvm-svn: 175507	2013-02-19 11:29:21 +00:00
Bill Wendling	c98e4fef1a	Temporarily revert r175470 for more review. llvm-svn: 175476	2013-02-19 00:52:45 +00:00
Bill Wendling	66651e4c2f	Check to see if the 'no-builtin' attribute is set before simplifying a library call. llvm-svn: 175470	2013-02-18 23:17:16 +00:00
Kostya Serebryany	7ca384bc1a	[asan] revert r175266 as it breaks code with packed structures. supporting long double will require a more general solution llvm-svn: 175442	2013-02-18 13:47:02 +00:00
Hal Finkel	76e65e4542	BBVectorize: Fix an invalid reference bug This fixes PR15289. This bug was introduced (recently) in r175215; collecting all std::vector references for candidate pairs to delete at once is invalid because subsequent lookups in the owning DenseMap could invalidate the references. bugpoint was able to reduce a useful test case. Unfortunately, because whether or not this asserts depends on memory layout, this test case will sometimes appear to produce valid output. Nevertheless, running under valgrind will reveal the error. llvm-svn: 175397	2013-02-17 15:59:26 +00:00
Bill Wendling	23242098e7	The transform is: (or (bool?A:B),(bool?C:D)) --> (bool?(or A,C):(or B,D)) By the time the OR is visited, both the SELECTs have been visited and not optimized and the OR itself hasn't been transformed so we do this transform in the hopes that the new ORs will be optimized. The transform is explicitly disabled for vector-selects until "codegen matures to handle them better". Patch by Muhammad Tauqir! llvm-svn: 175380	2013-02-16 23:41:36 +00:00
Jakub Staszak	11bd83551c	Reduce indents in LSRInstance::NarrowSearchSpaceByCollapsingUnrolledCode method. No functionality change. llvm-svn: 175364	2013-02-16 16:08:15 +00:00
Hal Finkel	89909397a1	BBVectorize: Call a DAG and DAG instead of a tree Several functions and variable names used the term 'tree' to refer to what is actually a DAG. Correcting this mistake will, hopefully, prevent confusion in the future. No functionality change intended. llvm-svn: 175278	2013-02-15 17:20:54 +00:00
Arnaud A. de Grandmaison	1fd843eee7	Fix refactoring mistake in "Teach InstCombine to work with smaller legal types..." llvm-svn: 175273	2013-02-15 15:18:17 +00:00
Arnaud A. de Grandmaison	61c167c62b	Teach InstCombine to work with smaller legal types in icmp (shl %v, C1), C2 It enables to work with a smaller constant, which is target friendly for those which can compare to immediates. It also avoids inserting a shift in favor of a trunc, which can be free on some targets. This used to work until LLVM-3.1, but regressed with the 3.2 release. llvm-svn: 175270	2013-02-15 14:35:47 +00:00
Kostya Serebryany	a968568165	[asan] support long double on 64-bit. See https://code.google.com/p/address-sanitizer/issues/detail?id=151 llvm-svn: 175266	2013-02-15 12:46:06 +00:00
Benjamin Kramer	6ecb1e78a9	Make helpers static. Add missing include so LLVMInitializeObjCARCOpts gets C linkage. llvm-svn: 175264	2013-02-15 12:30:38 +00:00
Hal Finkel	283f4f0e66	BBVectorize: Cap the number of candidate pairs in each instruction group For some basic blocks, it is possible to generate many candidate pairs for relatively few pairable instructions. When many (tens of thousands) of these pairs are generated for a single instruction group, the time taken to generate and rank the different vectorization plans can become quite large. As a result, we now cap the number of candidate pairs within each instruction group. This is done by closing out the group once the threshold is reached (set now at 3000 pairs). Although this will limit the overall compile-time impact, this may not be the best way to achieve this result. It might be better, for example, to prune excessive candidate pairs after the fact the prevent the generation of short, but highly-connected groups. We can experiment with this in the future. This change reduces the overall compile-time slowdown of the csa.ll test case in PR15222 to ~5x. If 5x is still considered too large, a lower limit can be used as the default. This represents a functionality change, but only for very large inputs (thus, there is no regression test). llvm-svn: 175251	2013-02-15 04:28:42 +00:00
Hal Finkel	e7a1ef422b	BBVectorize: Remove the remaining instances of std::multimap All instances of std::multimap have now been replaced by DenseMap<K, std::vector<V> >, and this yields a speedup of 5% on the csa.ll test case from PR15222. No functionality change intended. llvm-svn: 175216	2013-02-14 22:38:04 +00:00
Hal Finkel	c3a4425c34	BBVectorize: Don't store candidate pairs in a std::multimap This is another commit on the road to removing std::multimap from BBVectorize. This gives an ~1% speedup on the csa.ll test case in PR15222. No functionality change intended. llvm-svn: 175215	2013-02-14 22:37:09 +00:00
Bill Wendling	7297b864a4	Retain the name of the new internal global that's been shrunk. It's possible (e.g. after an LTO build) that an internal global may be used for debugging purposes. If that's the case appending a '.b' to it makes it hard to find that variable. Steal the name from the old GV before deleting it so that they can find that variable again. llvm-svn: 175104	2013-02-13 23:00:51 +00:00
Benjamin Kramer	0aa2ad6104	LoopVectorize: Simplify code for clarity. No functionality change. llvm-svn: 175076	2013-02-13 21:12:29 +00:00
Pekka Jaaskelainen	0d23725a8d	Metadata for annotating loops as parallel. The first consumer for this metadata is the loop vectorizer. See the documentation update for more info. llvm-svn: 175060	2013-02-13 18:08:57 +00:00
Kostya Serebryany	caf11af9d3	[asan] fix confusing indentation llvm-svn: 175033	2013-02-13 05:14:12 +00:00
Arnaud A. de Grandmaison	2e4df4f7c2	Fix comment visitSExt is an adapted copy of the related visitZExt method, so adapt the comment accordingly. llvm-svn: 175019	2013-02-13 00:19:19 +00:00
Michael Gottesman	27029f4642	Changed isStoredObjCPointer => IsStoredObjCPointer. No functionality change. llvm-svn: 175017	2013-02-12 23:35:08 +00:00
Dan Gohman	a6307574d6	Actually delete this code, since it's really not clear what it's trying to do. llvm-svn: 175014	2013-02-12 22:26:41 +00:00
Dan Gohman	f377160d2f	Record PRE predecessors with a SmallVector instead of a DenseMap, and avoid a second pred_iterator traversal. llvm-svn: 175001	2013-02-12 19:49:10 +00:00
Dan Gohman	2001cd8f9e	When disabling PRE for a value is directly redundant with itself (through a loop), don't continue to iterate through the reamining predecessors. llvm-svn: 174994	2013-02-12 19:05:10 +00:00
Dan Gohman	fd41de0b10	Check that pointers are removed from maps before calling delete on the pointers, for tidiness' sake. llvm-svn: 174988	2013-02-12 18:44:43 +00:00
Dan Gohman	f60667020a	Minor code simplification. llvm-svn: 174985	2013-02-12 18:38:36 +00:00
Alexander Potapenko	259e8127ad	[ASan] Do not use kDefaultShort64bitShadowOffset on Mac, where the binaries may get mapped at 0x100000000+ and thus may interleave with the shadow. llvm-svn: 174964	2013-02-12 12:41:12 +00:00
Kostya Serebryany	be73337ad2	[asan] change the default mapping offset on x86_64 to 0x7fff8000. This gives roughly 5% speedup. Since this is an ABI change, bump the asan ABI version by renaming __asan_init to __asan_init_v1. llvm part, compiler-rt part will follow llvm-svn: 174957	2013-02-12 11:11:02 +00:00
Hal Finkel	6ae564b4a0	BBVectorize: Don't over-search when building the dependency map When building the pairable-instruction dependency map, don't search past the last pairable instruction. For large blocks that have been divided into multiple instruction groups, searching past the last instruction in each group is very wasteful. This gives a 32% speedup on the csa.ll test case from PR15222 (when using 50 instructions in each group). No functionality change intended. llvm-svn: 174915	2013-02-11 23:02:17 +00:00
Hal Finkel	39a95032d2	BBVectorize: Omit unnecessary entries in PairableInstUsers This map is queried only for instructions in pairs of pairable instructions; so make sure that only pairs of pairable instructions are added to the map. This gives a 3.5% speedup on the csa.ll test case from PR15222. No functionality change intended. llvm-svn: 174914	2013-02-11 23:02:09 +00:00
Michael Ilseman	74a6da963b	Optimization: bitcast (<1 x ...> insertelement ..., X, ...) to ... ==> bitcast X to ... llvm-svn: 174905	2013-02-11 21:41:44 +00:00
Hal Finkel	0b8ae895b4	BBVectorize: Eliminate one more restricted linear search This eliminates one more linear search over a range of std::multimap entries. This gives a 22% speedup on the csa.ll test case from PR15222. No functionality change intended. llvm-svn: 174893	2013-02-11 17:19:34 +00:00
Kostya Serebryany	c5f44bc62d	[asan] added a flag -mllvm asan-short-64bit-mapping-offset=1 (0 by default) This flag makes asan use a small (<2G) offset for 64-bit asan shadow mapping. On x86_64 this saves us a register, thus achieving ~2/3 of the zero-base-offset's benefits in both performance and code size. Thanks Jakub Jelinek for the idea. llvm-svn: 174886	2013-02-11 14:36:01 +00:00
Hal Finkel	cb268f7995	BBVectorize: Remove the linear searches from pair connection searching This removes the last of the linear searches over ranges of std::multimap iterators, giving a 7% speedup on the doduc.bc input from PR15222. No functionality change intended. llvm-svn: 174859	2013-02-11 05:29:51 +00:00
Hal Finkel	fee38f9754	BBVectorize: Avoid linear searches within the load-move set This is another cleanup aimed at eliminating linear searches in ranges of std::multimap. No functionality change intended. llvm-svn: 174858	2013-02-11 05:29:49 +00:00
Hal Finkel	dd4bc66593	BBVectorize: isa/cast cleanup in getInstructionTypes Profiling suggests that getInstructionTypes is performance-sensitive, this cleans up some double-casting in that function in favor of using dyn_cast. No functionality change intended. llvm-svn: 174857	2013-02-11 05:29:48 +00:00
Hal Finkel	c1cc166948	BBVectorize: Make the bookkeeping to support full cycle checking less expensive By itself, this does not have much of an effect, but only because in the default configuration the full cycle checks are used only for small problem sizes. This is part of a general cleanup of uses of iteration over std::multimap ranges only for the purpose of checking membership. No functionality change intended. llvm-svn: 174856	2013-02-11 05:29:41 +00:00
Andrew Trick	bc7059032b	LSR IVChain improvement. Handle chains in which the same offset is used for both loads and stores to the same array. Fixes rdar://11410078. llvm-svn: 174789	2013-02-09 01:11:01 +00:00
Jakub Staszak	f23980aba5	Remove #includes from the commonly used LoopInfo.h. llvm-svn: 174786	2013-02-09 01:04:28 +00:00
Bob Wilson	bfb44ef9cb	Revert "Add LLVMContext::emitWarning methods and use them. <rdar://problem/12867368>" This reverts r171041. This was a nice idea that didn't work out well. Clang warnings need to be associated with warning groups so that they can be selectively disabled, promoted to errors, etc. This simplistic patch didn't allow for that. Enhancing it to provide some way for the backend to specify a front-end warning type seems like overkill for the few uses of this, at least for now. llvm-svn: 174748	2013-02-08 21:48:29 +00:00
Hal Finkel	dd2721842d	BBVectorize: Use TTI->getAddressComputationCost This is a follow-up to the cost-model change in r174713 which splits the cost of a memory operation between the address computation and the actual memory access. In r174713, this cost is always added to the memory operation cost, and so BBVectorize will do the same. Currently, this new cost function is used only by ARM, and I don't have any ARM test cases for BBVectorize. Assistance in generating some good ARM test cases for BBVectorize would be greatly appreciated! llvm-svn: 174743	2013-02-08 21:13:39 +00:00
Chad Rosier	22d275f7b8	[SimplifyLibCalls] Library call simplification doen't work if the call site isn't using the default calling convention. However, if the transformation is from a call to inline IR, then the calling convention doesn't matter. rdar://13157990 llvm-svn: 174724	2013-02-08 18:00:14 +00:00
Jakob Stoklund Olesen	479e5a9313	Typos. llvm-svn: 174723	2013-02-08 17:43:32 +00:00
Arnold Schwaighofer	594fa2dc2b	ARM cost model: Address computation in vector mem ops not free Adds a function to target transform info to query for the cost of address computation. The cost model analysis pass now also queries this interface. The code in LoopVectorize adds the cost of address computation as part of the memory instruction cost calculation. Only there, we know whether the instruction will be scalarized or not. Increase the penality for inserting in to D registers on swift. This becomes necessary because we now always assume that address computation has a cost and three is a closer value to the architecture. radar://13097204 llvm-svn: 174713	2013-02-08 14:50:48 +00:00
Michael Kuperstein	f63b77be7f	Test Commit llvm-svn: 174709	2013-02-08 12:58:29 +00:00
Andrew Trick	1bd53c3675	Revert "Have InstCombine call SipmlifyCall when handling calls. Test case included." This reverts commit 3854a5d90fee52af1065edbed34521fff6cdc18d. This causes a clang unit test to hang: vtable-available-externally.cpp. llvm-svn: 174692	2013-02-08 01:55:39 +00:00
Michael Ilseman	6092dc5455	Have InstCombine call SipmlifyCall when handling calls. Test case included. llvm-svn: 174675	2013-02-07 23:01:35 +00:00
Nadav Rotem	a9100f3609	fix 80-col violation and fix the docs. llvm-svn: 174671	2013-02-07 22:34:07 +00:00
Arnold Schwaighofer	3476fc8c82	Loop Vectorizer: Refactor Memory Cost Computation We don't want too many classes in a pass and the classes obscure the details. I was going a little overboard with object modeling here. Replace classes by generic code that handles both loads and stores. No functionality change intended. llvm-svn: 174646	2013-02-07 19:05:21 +00:00
Michael Gottesman	697d8b9a26	Moved some comments due to the recent refactoring of ObjCARC. 1. Moved a comment from ObjCARCOpts.cpp -> ObjCARCContract.cpp. 2. Removed a comment from ObjCARCOpts.cpp that was already moved to ObjCARCAliasAnalysis.h/.cpp. llvm-svn: 174581	2013-02-07 04:12:57 +00:00
Michael Ilseman	1dd6f2a5ba	Preserve fast-math flags after reassociation and commutation. Update test cases llvm-svn: 174571	2013-02-07 01:40:15 +00:00
Benjamin Kramer	944e0abf04	InstCombine: Fix and simplify the inttoptr side too. llvm-svn: 174438	2013-02-05 20:22:40 +00:00
Michael Gottesman	415ddd7e13	Removed explicit inline as per the LLVM style guide. llvm-svn: 174432	2013-02-05 19:32:18 +00:00
Benjamin Kramer	e477875873	InstCombine: Harden code to work with vectors of pointers and simplify it a bit. Found by running instcombine on a fabricated test case for the constant folder. llvm-svn: 174430	2013-02-05 19:21:56 +00:00
Arnold Schwaighofer	3be40b56c5	Loop Vectorizer: Refactor code to compute vectorized memory instruction cost Introduce a helper class that computes the cost of memory access instructions. No functionality change intended. llvm-svn: 174422	2013-02-05 18:46:41 +00:00
Chad Rosier	92a54f6d4c	[SjLj Prepare] When demoting an invoke instructions to the stack, if the normal edge is critical, then split it so we can insert the store. rdar://13126179 llvm-svn: 174418	2013-02-05 18:23:10 +00:00
Arnold Schwaighofer	22174f5d5a	Loop Vectorizer: Handle pointer stores/loads in getWidestType() In the loop vectorizer cost model, we used to ignore stores/loads of a pointer type when computing the widest type within a loop. This meant that if we had only stores/loads of pointers in a loop we would return a widest type of 8bits (instead of 32 or 64 bit) and therefore a vector factor that was too big. Now, if we see a consecutive store/load of pointers we use the size of a pointer (from data layout). This problem occured in SingleSource/Benchmarks/Shootout-C++/hash.cpp (reduced test case is the first test in vector_ptr_load_store.ll). radar://13139343 llvm-svn: 174377	2013-02-05 15:08:02 +00:00
Nick Lewycky	535d97cc86	Revert accidental commit (ran svn commit from wrong directory). llvm-svn: 174241	2013-02-02 00:25:26 +00:00
Nick Lewycky	a8c77e4266	This patch makes "&Cls::purevfn" not an odr use. This isn't what the standard says, but that's a defect (to be filed). "Cls::purevfn()" is still an odr use. Also fixes a bug in the previous patch that caused us to not mark the function referenced just because we didn't want to mark it odr used. llvm-svn: 174240	2013-02-02 00:22:37 +00:00
Preston Gurd	25c3b6acc0	This patch aims to improve compile time performance by increasing the SCEV vector size in LoopStrengthReduce. It is observed that the BaseRegs vector size is 4 in most cases, and elements are frequently copied when it is initialized as SmallVector<const SCEV *, 2> BaseRegs. Our benchmark results show that the compilation time performance improved by ~0.5%. Patch by Wan Xiaofei. llvm-svn: 174219	2013-02-01 20:41:27 +00:00
Nadav Rotem	4349f6963e	Revert r174152. The shift amount may overflow and in that case this transformation is illegal. llvm-svn: 174156	2013-02-01 07:59:33 +00:00
Nadav Rotem	1d584029ae	Optimize shift lefts of a constant by a value plus constant into a single shift. llvm-svn: 174152	2013-02-01 06:45:40 +00:00
Manman Ren	aec2ce7db4	Linker: correctly link in dbg.declare This is a re-worked version of r174048. Given source IR: call void @llvm.dbg.declare(metadata !{i32* %argc.addr}, metadata !14), !dbg !15 we used to generate call void @llvm.dbg.declare(metadata !27, metadata !28), !dbg !29 !27 = metadata !{null} With this patch, we will correctly generate call void @llvm.dbg.declare(metadata !{i32* %argc.addr}, metadata !27), !dbg !28 Looking up %argc.addr in ValueMap will return null, since %argc.addr is already correctly set up, we can use identity mapping. rdar://problem/13089880 llvm-svn: 174093	2013-01-31 21:19:18 +00:00
Alexey Samsonov	5234a8ed9f	Revert r173946. This breaks compilation of googletest with Clang llvm-svn: 174048	2013-01-31 08:02:11 +00:00
Dan Gohman	20a2ae9df5	Change GetPointerBaseWithConstantOffset's DataLayout argument from a reference to a pointer, so that it can handle the case where DataLayout is not available and behave conservatively. llvm-svn: 174024	2013-01-31 02:00:45 +00:00
Bill Wendling	785afdf3a4	Remove addRetAttributes and addFnAttributes, which aren't useful abstractions. llvm-svn: 173992	2013-01-30 23:40:31 +00:00
Bill Wendling	d219675c2a	Convert typeIncompatible to return an AttributeSet. There are still places which treat the Attribute object as a collection of attributes. I'm systematically removing them. llvm-svn: 173990	2013-01-30 23:07:40 +00:00
Manman Ren	81dcc62805	Linker: correctly link in dbg.declare Given source IR: call void @llvm.dbg.declare(metadata !{i32* %argc.addr}, metadata !14), !dbg !15 we used to generate call void @llvm.dbg.declare(metadata !27, metadata !28), !dbg !29 !27 = metadata !{null} With this patch, we will correctly generate call void @llvm.dbg.declare(metadata !{i32* %argc.addr}, metadata !27), !dbg !28 Looking up %argc.addr in ValueMap will return null, since %argc.addr is already correctly set up, we can use identity mapping. llvm-svn: 173946	2013-01-30 17:42:15 +00:00
Nadav Rotem	513bd8a73c	InstCombine: canonicalize sext-and --> select sext-not-and --> select. Patch by Muhammad Tauqir Ahmad. llvm-svn: 173901	2013-01-30 06:35:22 +00:00
Michael Gottesman	e52dec1695	Made certain small functions in PtrState inlined. llvm-svn: 173842	2013-01-29 22:29:59 +00:00
Pekka Jaaskelainen	f50ab84bb1	LoopVectorize: convert TinyTripCountVectorThreshold constant to a command line switch. llvm-svn: 173837	2013-01-29 21:42:08 +00:00
Michael Gottesman	9bdab2bf6b	Removed trailing comma in last element of enum declaration. llvm-svn: 173836	2013-01-29 21:41:44 +00:00
Michael Gottesman	386241ce5b	Moved S_Stop back to its previous position in the sequence order. llvm-svn: 173834	2013-01-29 21:39:02 +00:00
Michael Gottesman	23cda0cd39	Fixed a few debug messages and some 80+ violations. llvm-svn: 173832	2013-01-29 21:07:53 +00:00
Michael Gottesman	53fd20bdbd	Added some periods to some comments and added an overload for operator<< for type Sequence so I can print out Sequences in debug statements. llvm-svn: 173831	2013-01-29 21:07:51 +00:00
Michael Gottesman	774d2c014e	Changed DoesObjCBlockEscape => DoesRetainableObjPtrEscape so I can use it to perform escape analysis of other retainable object pointers in other locations. llvm-svn: 173829	2013-01-29 21:00:52 +00:00
Edwin Vane	82f80d4967	Fixing warnings revealed by gcc release build Fixed set-but-not-used warnings. Reviewer: gribozavr llvm-svn: 173810	2013-01-29 17:42:24 +00:00
Benjamin Kramer	cf406756ce	LoopVectorize: Clean up ValueMap a bit and avoid double lookups. No intended functionality change. llvm-svn: 173809	2013-01-29 17:31:33 +00:00
Timur Iskhodzhanov	5d7ff00456	Hopefully fix the Windows build failure introduced in r173769 llvm-svn: 173781	2013-01-29 09:09:27 +00:00
Michael Gottesman	1e29ca1501	Fixed 2 more header comments... llvm-svn: 173774	2013-01-29 05:07:18 +00:00
Michael Gottesman	5a8f9e7c54	Fixed header comment. llvm-svn: 173773	2013-01-29 05:05:17 +00:00
Michael Gottesman	23a1ee5f5b	Fixed some whitespace/80+ violations. Also added a space after a namespace declaration. llvm-svn: 173772	2013-01-29 04:58:30 +00:00
Michael Gottesman	7bf48af498	Added missing dashes from header declaration comment. llvm-svn: 173770	2013-01-29 04:53:55 +00:00
Michael Gottesman	13a5f1a8b7	Juggled Debug.h from ObjCARC.h to only the including cpp files that actually have DEBUG statements. Also changed raw_ostream in said header to be a forward declaration (removing an include). llvm-svn: 173769	2013-01-29 04:51:59 +00:00
Michael Gottesman	278266faa8	Sorted includes using utils/sort_includes. llvm-svn: 173767	2013-01-29 04:20:52 +00:00
Michael Gottesman	f823dd2ef7	Added two missing headers from ObjCARCAliasAnalysis.h. This was missed since whenever I was including ObjCARCAliasAnalysis.h, I was including ObjCARC.h before it which included these includes (resulting in no compilation breakage). llvm-svn: 173764	2013-01-29 04:09:24 +00:00
Michael Gottesman	7f387ae6e3	Removed InstCombine/Targets as library dependencies for libObjCARCOpts since they are unnecessary. llvm-svn: 173763	2013-01-29 04:05:17 +00:00
Michael Gottesman	778138e960	Extracted ObjCARCContract from ObjCARCOpts into its own file. This also required adding 2x headers Dependency Analysis.h/Provenance Analysis.h and a .cpp file DependencyAnalysis.cpp to unentangle the dependencies inbetween ObjCARCContract and ObjCARCOpts. llvm-svn: 173760	2013-01-29 03:03:03 +00:00
Michael Gottesman	50a622f120	Removed some cruft from ObjCARCAliasAnalysis.cpp. llvm-svn: 173759	2013-01-29 03:02:59 +00:00
Hal Finkel	bf4db4fe11	Unroll again after running BBVectorize Because BBVectorize may significantly shorten a loop body, unroll again after vectorization. This is especially important when using runtime or partial unrolling. llvm-svn: 173730	2013-01-29 00:22:49 +00:00
Renato Golin	1258519674	Vectorization Factor clarification llvm-svn: 173691	2013-01-28 16:02:45 +00:00
Evgeniy Stepanov	6f85ef300d	[msan] Mostly disable msan-handle-icmp-exact. It is way too slow. Change the default option value to 0. Always do exact shadow propagation for unsigned ICmp with constants, it is cheap (under 1% cpu time) and required for correctness. llvm-svn: 173682	2013-01-28 11:42:28 +00:00
Evgeniy Stepanov	52c7b1b98f	Revert r173678. Broken tests. llvm-svn: 173679	2013-01-28 09:18:40 +00:00
Evgeniy Stepanov	5ec2ff57e9	[msan] Make msan-handle-icmp-exact=0 by default. 50% slowdown on one of the specs. llvm-svn: 173678	2013-01-28 09:15:15 +00:00
Michael Gottesman	5ed40afe17	Created ObjCARCUtil.cpp for functions which in my humble opinion are too large to static inline and place in a header file such as ObjCARC.h. llvm-svn: 173666	2013-01-28 06:39:31 +00:00
Michael Gottesman	9bfcf28d88	Cleaned up includes in various ObjCARC files and removed some whitespace violations. llvm-svn: 173663	2013-01-28 05:51:58 +00:00
Michael Gottesman	294e7daaac	Refactor ObjCARCAliasAnalysis into its own file. llvm-svn: 173662	2013-01-28 05:51:54 +00:00
Michael Gottesman	fa0939f790	Refactored out pass ObjCARCAPElim from ObjCARCOpts.cpp => ObjCARCAPElim.cpp. llvm-svn: 173654	2013-01-28 04:12:07 +00:00
Michael Gottesman	283e079fa6	Fixed case insensitive issue. llvm-svn: 173653	2013-01-28 03:35:20 +00:00
Michael Gottesman	0d90b12acc	Removed extraneous doxygen end module statement. llvm-svn: 173652	2013-01-28 03:30:34 +00:00
Michael Gottesman	08904e3ba4	Extracted pass ObjCARCExpand from ObjCARC.cpp => ObjCARCExpand.cpp. I also added the local header ObjCARC.h for common functions used by the various passes. llvm-svn: 173651	2013-01-28 03:28:38 +00:00
Michael Gottesman	79d8d81226	Extracted ObjCARC.cpp into its own library libLLVMObjCARCOpts in preparation for refactoring the ARC Optimizer. llvm-svn: 173647	2013-01-28 01:35:51 +00:00
Hal Finkel	293a41d14f	BBVectorize: Better use of TTI->getShuffleCost When flipping the pair of subvectors that form a vector, if the vector length is 2, we can use the SK_Reverse shuffle kind to get more-accurate cost information. Also we can use the SK_ExtractSubvector shuffle kind to get accurate subvector extraction costs. The current cost model implementations don't yet seem complex enough for this to make a difference (thus, there are no test cases with this commit), but it should help in future. Depending on how the various targets optimize and combine shuffles in practice, we might be able to get more-accurate costs by combining the costs of multiple shuffle kinds. For example, the cost of flipping the subvector pairs could be modeled as two extractions and two subvector insertions. These changes, however, should probably be motivated by specific test cases. llvm-svn: 173621	2013-01-27 20:07:01 +00:00
Chandler Carruth	329b590e6e	Re-revert r173342, without losing the compile time improvements, flat out bug fixes, or functionality preserving refactorings. llvm-svn: 173610	2013-01-27 06:42:03 +00:00
Michael Gottesman	5300cdd8f2	Renamed function IsPotentialUse to IsPotentialRetainableObjPtr. This name change does the following: 1. Causes the function name to use proper ARC terminology. 2. Makes it clear what the function truly does. llvm-svn: 173609	2013-01-27 06:19:48 +00:00
Bill Wendling	3575c8c6d6	Use the AttributeSet instead of AttributeWithIndex. In the future, AttributeWithIndex won't be used anymore. Besides, it exposes the internals of the AttributeSet to outside users, which isn't goodness. llvm-svn: 173602	2013-01-27 02:08:22 +00:00
Bill Wendling	37a52df920	Use the AttributeSet instead of AttributeWithIndex. In the future, AttributeWithIndex won't be used anymore. Besides, it exposes the internals of the AttributeSet to outside users, which isn't goodness. llvm-svn: 173601	2013-01-27 01:57:28 +00:00
Bill Wendling	6eaab61bb5	Use the AttributeSet instead of AttributeWithIndex. In the future, AttributeWithIndex won't be used anymore. Besides, it exposes the internals of the AttributeSet to outside users, which isn't goodness. llvm-svn: 173600	2013-01-27 01:44:34 +00:00
Hal Finkel	2d443e94b4	BBVectorize: Add a additional comment about the cost computation llvm-svn: 173580	2013-01-26 16:49:04 +00:00
Hal Finkel	351a75b6d7	BBVectorize: Fix anomalous capital letter in comment llvm-svn: 173579	2013-01-26 16:49:03 +00:00
Bill Wendling	201d7b2545	Convert BuildLibCalls.cpp to using the AttributeSet methods instead of AttributeWithIndex. llvm-svn: 173536	2013-01-26 00:03:11 +00:00
Bill Wendling	57625a4966	Remove some introspection functions. The 'getSlot' function and its ilk allow introspection into the AttributeSet class. However, that class should be opaque. Allow access through accessor methods instead. llvm-svn: 173522	2013-01-25 23:09:36 +00:00
Nadav Rotem	69a040d3eb	LoopVectorize: Refactor the code that vectorizes loads/stores to remove duplication. llvm-svn: 173500	2013-01-25 21:47:42 +00:00
Bill Wendling	8649283e75	Use the new 'getSlotIndex' method to retrieve the attribute's slot index. llvm-svn: 173499	2013-01-25 21:46:52 +00:00
Benjamin Kramer	21e8da5990	LoopVectorize: Simplify code. No functionality change. llvm-svn: 173475	2013-01-25 19:43:15 +00:00
Pedro Artigas	b95c98faa2	added ability to dynamically change the ExportList of an already created InternalizePass (useful for pass reuse) llvm-svn: 173474	2013-01-25 19:41:03 +00:00
Nadav Rotem	8e9ca2f8cb	LoopVectorizer: Refactor more code to use the IRBuilder. llvm-svn: 173471	2013-01-25 19:26:23 +00:00
Nadav Rotem	c8adf3ff6e	Refactor some code to use the IRBuilder. llvm-svn: 173467	2013-01-25 18:34:09 +00:00
Evgeniy Stepanov	2cb0fa10c2	[msan] A comment on ICmp handling logic. llvm-svn: 173453	2013-01-25 15:35:29 +00:00
Evgeniy Stepanov	fac8403249	[msan] Implement exact shadow propagation for relational ICmp. Only for integers, pointers, and vectors of those. No floats. Instrumentation seems very heavy, and may need to be replaced with some approximation in the future. llvm-svn: 173452	2013-01-25 15:31:10 +00:00
Chandler Carruth	ceff222dea	Switch this code away from Value::isUsedInBasicBlock. That code either loops over instructions in the basic block or the use-def list of the value, neither of which are really efficient when repeatedly querying about values in the same basic block. What's more, we already know that the CondBB is small, and so we can do a much more efficient test by counting the uses in CondBB, and seeing if those account for all of the uses. Finally, we shouldn't blanket fail on any such instruction, instead we should conservatively assume that those instructions are part of the cost. Note that this actually fixes a bug in the pass because isUsedInBasicBlock has a really terrible bug in it. I'll fix that in my next commit, but the fix for it would make this code suddenly take the compile time hit I thought it already was taking, so I wanted to go ahead and migrate this code to a faster & better pattern. The bug in isUsedInBasicBlock was also causing other tests to test the wrong thing entirely: for example we weren't actually disabling speculation for floating point operations as intended (and tested), but the test passed because we failed to speculate them due to the isUsedInBasicBlock failure. llvm-svn: 173417	2013-01-25 05:40:09 +00:00
Michael Gottesman	12780c2d97	Added comment to ObjCARC elaborating what is meant by the term 'Provenance' in 'Provenance Analysis'. llvm-svn: 173374	2013-01-24 21:35:00 +00:00
Benjamin Kramer	1c4e323fdd	Reapply chandlerc's r173342 now that the miscompile it was triggering is fixed. Original commit message: Plug TTI into the speculation logic, giving it a real cost interface that can be specialized by targets. The goal here is not to be more aggressive, but to just be more accurate with very obvious cases. There are instructions which are known to be truly free and which were not being modeled as such in this code -- see the regression test which is distilled from an inner loop of zlib. Everywhere the TTI cost model is insufficiently conservative I've added explicit checks with FIXME comments to go add proper modelling of these cost factors. If this causes regressions, the likely solution is to make TTI even more conservative in its cost estimates, but test cases will help here. llvm-svn: 173357	2013-01-24 16:44:25 +00:00
Chandler Carruth	321c6a7c50	Revert r173342 temporarily. It appears to cause a very late miscompile of stage2 in a bootstrap. Still investigating.... llvm-svn: 173343	2013-01-24 13:24:24 +00:00
Chandler Carruth	5f4519309f	Plug TTI into the speculation logic, giving it a real cost interface that can be specialized by targets. The goal here is not to be more aggressive, but to just be more accurate with very obvious cases. There are instructions which are known to be truly free and which were not being modeled as such in this code -- see the regression test which is distilled from an inner loop of zlib. Everywhere the TTI cost model is insufficiently conservative I've added explicit checks with FIXME comments to go add proper modelling of these cost factors. If this causes regressions, the likely solution is to make TTI even more conservative in its cost estimates, but test cases will help here. llvm-svn: 173342	2013-01-24 12:39:29 +00:00
Chandler Carruth	01bffaad03	Address a large chunk of this FIXME by accumulating the cost for unfolded constant expressions rather than checking each one independently. llvm-svn: 173341	2013-01-24 12:05:17 +00:00
Chandler Carruth	8a21005cca	Switch the constant expression speculation cost evaluation away from a cost fuction that seems both a bit ad-hoc and also poorly suited to evaluating constant expressions. Notably, it is missing any support for trivial expressions such as 'inttoptr'. I could fix this routine, but it isn't clear to me all of the constraints its other users are operating under. The core protection that seems relevant here is avoiding the formation of a select instruction wich a further chain of select operations in a constant expression operand. Just explicitly encode that constraint. Also, update the comments and organization here to make it clear where this needs to go -- this should be driven off of real cost measurements which take into account the number of constants expressions and the depth of the constant expression tree. llvm-svn: 173340	2013-01-24 11:53:01 +00:00
Chandler Carruth	7481ca8ff5	Rephrase the speculating scan of the conditional BB to be phrased in terms of cost rather than hoisting a single instruction. This does not change the cost model! We still set the cost threshold at 1 here, it's just that we track it by accumulating cost rather than by storing an instruction. The primary advantage is that we no longer leave no-op intrinsics in the basic block. For example, this will now move both debug info intrinsics and a single instruction, instead of only moving the instruction and leaving a basic block with nothing bug debug info intrinsics in it, and those intrinsics now no longer ordered correctly with the hoisted value. Instead, we now splice the entire conditional basic block's instruction sequence. This also places the code for checking the safety of hoisting next to the code computing the cost. Currently, the only observable side-effect of this change is that debug info intrinsics are no longer abandoned. I'm not sure how to craft a test case for this, and my real goal was the refactoring, but I'll talk to Dave or Eric about how to add a test case for this. llvm-svn: 173339	2013-01-24 11:52:58 +00:00

... 2 3 4 5 6 ...

10238 Commits