llvm-project

Commit Graph

Author	SHA1	Message	Date
Bill Wendling	32854e2727	Add a flag to the struct type finder to collect only those types which have names. This saves collecting types we normally don't care about. llvm-svn: 155300	2012-04-21 23:59:16 +00:00
Bill Wendling	1bf41faca2	Revert r155241, which is causing some breakage. llvm-svn: 155253	2012-04-20 23:11:38 +00:00
Bill Wendling	c5fae47a63	If we discover all of the named structs in a module, then don't bother to process any more Values. llvm-svn: 155241	2012-04-20 21:56:24 +00:00
Craig Topper	d3c9e404ba	Remove AVX vpermil intrinsics. I removed their uses from clang headers and builtins a while back. llvm-svn: 154985	2012-04-18 05:24:00 +00:00
Eric Christopher	7df0240e52	Typo. llvm-svn: 154879	2012-04-16 23:54:31 +00:00
Duncan Sands	9af6298293	Remove support for the special 'fast' value for fpmath accuracy for the moment. llvm-svn: 154850	2012-04-16 19:39:33 +00:00
Duncan Sands	05f4df8d72	Make it possible to indicate relaxed floating point requirements at the IR level through the use of 'fpmath' metadata. Currently this only provides a 'fpaccuracy' value, which may be a number in ULPs or the keyword 'fast', however the intent is that this will be extended with additional information about NaN's, infinities etc later. No optimizations have been hooked up to this so far. llvm-svn: 154822	2012-04-16 16:28:59 +00:00
Duncan Sands	34bd91a49f	Rename "fpaccuracy" metadata to the more generic "fpmath". That's because I'm thinking of generalizing it to be able to specify other freedoms beyond accuracy (such as that NaN's don't have to be respected). I'd like the 3.1 release (the first one with this metadata) to have the more generic name already rather than having to auto-upgrade it in 3.2. llvm-svn: 154744	2012-04-14 12:36:06 +00:00
Dan Gohman	4f8ced58a7	Def here is an Instruction, so !isa<Instruction>(Def) is always false, as Eli noticed. llvm-svn: 154641	2012-04-13 00:50:57 +00:00
Dan Gohman	73273275a4	Add forms of dominates and isReachableFromEntry that accept a Use directly instead of a user Instruction. This allows them to test whether a def dominates a particular operand if the user instruction is a PHI. llvm-svn: 154631	2012-04-12 23:31:46 +00:00
Benjamin Kramer	2335a5cb85	Cache the hash value of the operands in the MDNode. FoldingSet is implemented as a chained hash table. When there is a hash collision during insertion, which is common as we fill the table until a load factor of 2.0 is hit, we walk the chained elements, comparing every operand with the new element's operands. This can be very expensive if the MDNode has many operands. We sacrifice a word of space in MDNode to cache the full hash value, reducing compares on collision to a minimum. MDNode grows from 28 to 32 bytes + operands on x86. On x86_64 the new bits fit nicely into existing padding, not growing the struct at all. The actual speedup depends a lot on the test case and is typically between 1% and 2% for C++ code with clang -c -O0 -g. llvm-svn: 154497	2012-04-11 14:06:54 +00:00
Benjamin Kramer	7a426b5f2e	Compute hashes directly with hash_combine instead of taking a detour through FoldingSetNodeID. llvm-svn: 154495	2012-04-11 14:06:39 +00:00
Bill Wendling	c4c568b2d9	The MDString class stored a StringRef to the string which was already in a StringMap. This was redundant and unnecessarily bloated the MDString class. Because the MDString class is a "Value" and will never have a "name", and because the Name field in the Value class is a pointer to a StringMap entry, we repurpose the Name field for an MDString. It stores the StringMap entry in the Name field, and uses the normal methods to get the string (name) back. PR12474 llvm-svn: 154429	2012-04-10 20:12:16 +00:00
Duncan Sands	af06b26c8e	Express the number of ULPs in fpaccuracy metadata as a real rather than a rational number, eg as 2.5 rather than 5, 2. OK'd by Peter Collingbourne. llvm-svn: 154387	2012-04-10 08:22:43 +00:00
Bill Wendling	5c0068f807	Remove the 'Parent' pointer from the MDNodeOperand class. An MDNode has a list of MDNodeOperands allocated directly after it as part of its allocation. Therefore, the Parent of the MDNodeOperands can be found by walking back through the operands to the beginning of that list. Mark the first operand's value pointer as being the 'first' operand so that we know where the beginning of said list is. This saves a lot of space during LTO with -O0 -g flags. llvm-svn: 154280	2012-04-08 10:20:49 +00:00
Bill Wendling	9b2503a006	Allow subclasses of the ValueHandleBase to store information as part of the value pointer by making the value pointer into a pointer-int pair with 2 bits available for flags. llvm-svn: 154279	2012-04-08 10:16:43 +00:00
Bill Wendling	e2cf674310	The speedup doesn't appear to have been from this, but was an anomaly of my testing machine. llvm-svn: 153951	2012-04-03 11:19:21 +00:00
Bill Wendling	dd91e73409	Reserve space for the eventual filling of the vector. This gives a small speedup. llvm-svn: 153949	2012-04-03 10:50:09 +00:00
Duncan Sands	26a80f3ddb	I noticed in passing that the Metadata getIfExists method was creating a new node and returning it if one didn't exist. llvm-svn: 153798	2012-03-31 08:20:11 +00:00
Rafael Espindola	a53c46aaa3	Handle unreachable code in the dominates functions. This changes users when needed for correctness, but still doesn't clean up code that now unnecessary checks for reachability. llvm-svn: 153755	2012-03-30 16:46:21 +00:00
Douglas Gregor	c0f6380464	Add missing include of <new> llvm-svn: 153436	2012-03-26 14:04:17 +00:00
Rafael Espindola	8e5b40eb08	Remove always true variable. llvm-svn: 153392	2012-03-24 20:02:25 +00:00
Rafael Espindola	ef9f5504ea	First part of PR12251. Add documentation and verifier support for the range metadata. llvm-svn: 153359	2012-03-24 00:14:51 +00:00
Eric Christopher	bdb64495c4	Fix up cmake build. llvm-svn: 153306	2012-03-23 03:55:14 +00:00
Eric Christopher	3c0d51661f	Take out the debug info probe stuff. It's making some changes to the PassManager annoying and should be reimplemented as a decorator on top of existing passes (as should the timing data). llvm-svn: 153305	2012-03-23 03:54:05 +00:00
Chris Lattner	2cc6f9dd90	add load/store volatility control to the C API, patch by Yiannis Tsiouris! llvm-svn: 153238	2012-03-22 03:54:15 +00:00
Chandler Carruth	4d1d34fbfc	Extend the inline cost calculation to account for bonuses due to correlated pairs of pointer arguments at the callsite. This is designed to recognize the common C++ idiom of begin/end pointer pairs when the end pointer is a constant offset from the begin pointer. With the C-based idiom of a pointer and size, the inline cost saw the constant size calculation, and this provides the same level of information for begin/end pairs. In order to propagate this information we have to search for candidate operations on a pair of pointer function arguments (or derived from them) which would be simplified if the pointers had a known constant offset. Then the callsite analysis looks for such pointer pairs in the argument list, and applies the appropriate bonus. This helps LLVM detect that half of bounds-checked STL algorithms (such as hash_combine_range, and some hybrid sort implementations) disappear when inlined with a constant size input. However, it's not a complete fix due the inaccuracy of our cost metric for constants in general. I'm looking into that next. Benchmarks showed no significant code size change, and very minor performance changes. However, specific code such as hashing is showing significantly cleaner inlining decisions. llvm-svn: 152752	2012-03-14 23:19:53 +00:00
Stepan Dyatkovskiy	97b02fc1b3	llvm::SwitchInst Renamed methods caseBegin, caseEnd and caseDefault with case_begin, case_end, and case_default. Added some notes relative to case iterators. llvm-svn: 152532	2012-03-11 06:09:17 +00:00
Chandler Carruth	97f6f03c42	Refactor some methods to look through bitcasts and GEPs on pointers into a common collection of methods on Value, and share their implementation. We had two variations in two different places already, and I need the third variation for inline cost estimation. Reviewed by Duncan Sands on IRC, but further comments here welcome. llvm-svn: 152490	2012-03-10 08:39:09 +00:00
Stepan Dyatkovskiy	5b648afb4d	Taken into account Duncan's comments for r149481 dated by 2nd Feb 2012: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20120130/136146.html Implemented CaseIterator and it solves almost all described issues: we don't need to mix operand/case/successor indexing anymore. Base iterator class is implemented as a template since it may be initialized either from "const SwitchInst" or from "SwitchInst". ConstCaseIt is just a read-only iterator. CaseIt is read-write iterator; it allows to change case successor and case value. Usage of iterator allows totally remove resolveXXXX methods. All indexing convertions done automatically inside the iterator's getters. Main way of iterator usage looks like this: SwitchInst SI = ... // intialize it somehow for (SwitchInst::CaseIt i = SI->caseBegin(), e = SI->caseEnd(); i != e; ++i) { BasicBlock BB = i.getCaseSuccessor(); ConstantInt *V = i.getCaseValue(); // Do something. } If you want to convert case number to TerminatorInst successor index, just use getSuccessorIndex iterator's method. If you want initialize iterator from TerminatorInst successor index, use CaseIt::fromSuccessorIndex(...) method. There are also related changes in llvm-clients: klee and clang. llvm-svn: 152297	2012-03-08 07:06:20 +00:00
Chandler Carruth	d4ba3eb480	Switch this code to use hash_combine_range rather than incremental calls to hash_combine. One of the interfaces could already do this, and the other can just use a small buffer. This is a much more efficient way to use the hash_combine interface, although I don't have any particular benchmark where this code was hot, so I can't measure much of an impact. It at least doesn't slow anything down. llvm-svn: 152200	2012-03-07 03:22:32 +00:00
Chandler Carruth	cee7a12d40	Cache the sized-ness of struct types, once we reach the steady state of "is sized". This prevents every query to isSized() from recursing over every sub-type of a struct type. This could get very slow for extremely deep nesting of structs, as in 177.mesa. This change is a 45% speedup for 'opt -O2' of 177.mesa.linked.bc, and likely a significant speedup for other cases as well. It even impacts -O0 cases because so many part of the code try to check whether a type is sized. Thanks for the review from Nick Lewycky and Benjamin Kramer on IRC. llvm-svn: 152197	2012-03-07 02:33:09 +00:00
Jay Foad	cc5fd3e25d	Change ConstantAggrUniqueMap to use Chandler's new hashing implementation. Patch by Meador Inge llvm-svn: 152116	2012-03-06 10:43:52 +00:00
Chandler Carruth	71bd7d1e54	Replace the hashing functions on APInt and APFloat with overloads of the new hash_value infrastructure, and replace their implementations using hash_combine. This removes a complete copy of Jenkin's lookup3 hash function (which is both significantly slower and lower quality than the one implemented in hash_combine) along with a somewhat scary xor-only hash function. Now that APInt and APFloat can be passed directly to hash_combine, simplify the rest of the LLVMContextImpl hashing to use the new infrastructure. llvm-svn: 152004	2012-03-04 12:02:57 +00:00
Chandler Carruth	1d03a3b6b1	Rewrite LLVM's generalized support library for hashing to follow the API of the proposed standard hashing interfaces (N3333), and to use a modified and tuned version of the CityHash algorithm. Some of the highlights of this change: -- Significantly higher quality hashing algorithm with very well distributed results, and extremely few collisions. Should be close to a checksum for up to 64-bit keys. Very little clustering or clumping of hash codes, to better distribute load on probed hash tables. -- Built-in support for reserved values. -- Simplified API that composes cleanly with other C++ idioms and APIs. -- Better scaling performance as keys grow. This is the fastest algorithm I've found and measured for moderately sized keys (such as show up in some of the uniquing and folding use cases) -- Support for enabling per-execution seeds to prevent table ordering or other artifacts of hashing algorithms to impact the output of LLVM. The seeding would make each run different and highlight these problems during bootstrap. This implementation was tested extensively using the SMHasher test suite, and pased with flying colors, doing better than the original CityHash algorithm even. I've included a unittest, although it is somewhat minimal at the moment. I've also added (or refactored into the proper location) type traits necessary to implement this, and converted users of GeneralHash over. My only immediate concerns with this implementation is the performance of hashing small keys. I've already started working to improve this, and will continue to do so. Currently, the only algorithms faster produce lower quality results, but it is likely there is a better compromise than the current one. Many thanks to Jeffrey Yasskin who did most of the work on the N3333 paper, pair-programmed some of this code, and reviewed much of it. Many thanks also go to Geoff Pike Pike and Jyrki Alakuijala, the original authors of CityHash on which this is heavily based, and Austin Appleby who created MurmurHash and the SMHasher test suite. Also thanks to Nadav, Tobias, Howard, Jay, Nick, Ahmed, and Duncan for all of the review comments! If there are further comments or concerns, please let me know and I'll jump on 'em. llvm-svn: 151822	2012-03-01 18:55:25 +00:00
Benjamin Kramer	acd78d5092	Emit the "is an intrinsic overloaded" table as a bitfield. llvm-svn: 151792	2012-03-01 02:16:57 +00:00
Rafael Espindola	654320a0bb	Use the DT dominates function in the verifier. llvm-svn: 151470	2012-02-26 02:23:37 +00:00
Rafael Espindola	94df267db3	Change the implementation of dominates(inst, inst) to one based on what the verifier does. This correctly handles invoke. Thanks to Duncan, Andrew and Chris for the comments. Thanks to Joerg for the early testing. llvm-svn: 151469	2012-02-26 02:19:19 +00:00
Rafael Espindola	bfa7579801	Don't call dominates on unreachable instructions. llvm-svn: 151468	2012-02-26 02:14:25 +00:00
Nick Lewycky	d489edff7f	Remove spurious emacs mode marker. llvm-svn: 151440	2012-02-25 07:20:06 +00:00
Jay Foad	529776c786	Reinstate r151049 now that GeneralHash is fixed. llvm-svn: 151248	2012-02-23 09:17:40 +00:00
Chad Rosier	5dfe6dab25	Remove extra semi-colons. llvm-svn: 151169	2012-02-22 17:25:00 +00:00
Jay Foad	af3cf11fec	Revert r151049 cos it broke the buildbots. llvm-svn: 151052	2012-02-21 11:44:46 +00:00
Jay Foad	3f99d381b4	PR1210: make uniquing of struct and function types more efficient by using a DenseMap and Talin's new GeneralHash, avoiding the need for a temporary std::vector on every lookup. Patch by Meador Inge! llvm-svn: 151049	2012-02-21 09:25:52 +00:00
Ahmed Charles	636a3d618c	Remove dead code. Improve llvm_unreachable text. Simplify some control flow. llvm-svn: 150918	2012-02-19 11:37:01 +00:00
Rafael Espindola	082d482981	White space fixes. llvm-svn: 150886	2012-02-18 19:46:02 +00:00
Bill Wendling	89cc16637d	s/ModAttrBehavior/ModFlagBehavior/g to be consistent with how module flags are named elsewhere. llvm-svn: 150679	2012-02-16 10:28:10 +00:00
NAKAMURA Takumi	35d19c0f1c	VMCore/AsmWriter.cpp: Tweak to check #INF and #NAN earlier. With MSVCRT, prior checker missed emission of #INF and #NAN. FIXME: Checking should be simpler. llvm-svn: 150667	2012-02-16 08:12:24 +00:00
NAKAMURA Takumi	aec412337c	VMCore/AsmWriter.cpp: Use APFloat instead of atof(3). atof(3) might behave differently among platforms. llvm-svn: 150661	2012-02-16 04:19:15 +00:00
Bill Wendling	399eae61e9	Use the enum instead of 'unsigned'. llvm-svn: 150632	2012-02-15 23:27:50 +00:00

4159 Commits