llvm-project

Commit Graph

Author	SHA1	Message	Date
Zachary Turner	8bd42a1a98	[Support] Add StringRef::getAsDouble. Differential Revision: https://reviews.llvm.org/D29918 llvm-svn: 295089	2017-02-14 19:06:37 +00:00
Chandler Carruth	ecbe61966f	Tweak the core loop in StringRef::find to avoid calling memcmp on every iteration. Instead, load the byte at the needle length, compare it directly, and save it to use in the lookup table of lengths we can skip forward. I also added an annotation to expect that the comparison fails so that the loop gets laid out contiguously without the call to memcpy (and the substantial register shuffling that the ABI requires of that call). Finally, because this behaves especially badly with a needle length of one (by calling memcmp with a zero length) special case that to directly call memchr, which is what we should have been doing anyways. This was motivated by the fact that there are a large number of test cases in 'check-llvm' where FileCheck's performance is dominated by calls to StringRef::find (in a release, no-asserts build). I'm working on patches to generally improve matters there, but this alone was worth a 12.5% improvement in one test case where FileCheck spent 92% of its time in this routine. I experimented a bunch with different minor variations on this theme, for example setting the pointer at the last byte and indexing backwards for the call to memcmp. That didn't improve anything on this version and seemed more complex. I also tried other things to make the loop flow more nicely and none worked. =/ It is a bit unfortunate, the generated code here remains pretty gross, but I don't see any obvious ways to improve it. At this point, most of my ideas would be really elaborate: 1) While the remainder of the string is long enough, we could load a 16-byte or 32-byte vector at the address of the last byte and use palignr to rotate that and check the first 15- or 31-bytes at the front of the next segment, essentially pre-loading the first several bytes of the next iteration so we could quickly detect a mismatch in those bytes without an additional memory access. Down side would be the code complexity, having a fallback loop, and likely misaligned vector load. Plus it would make the common case of the last byte not matching somewhat slower (need some extraction from a vector). 2) While we have space, we could do an aligned load of a 16- or 32-byte vector that contains the end byte, and use any peceding bytes to have a more precise "no" test, and any subsequent bytes could be saved for the next iteration. This remove any unaligned load penalty, but still requires us to pay the overhead of vector extraction for the cases where we didn't need to do anything other than load and compare the last byte. 3) Try to walk from the last byte in a way that is more friendly to cache and/or memory pre-fetcher considering we have to poke the last byte anyways. No idea if any of these are really worth pursuing though. They all seem somewhat unlikely to yield big wins in practice and to be a lot of work and complexity. So I settled here, which at least seems like a strict improvement over the previous version. llvm-svn: 289373	2016-12-11 07:46:21 +00:00
Zachary Turner	17412b03b2	[Support] Add StringRef::find_lower and contains_lower. Differential Revision: https://reviews.llvm.org/D25299 llvm-svn: 286724	2016-11-12 17:17:12 +00:00
Zachary Turner	d5d57635ba	Speculative fix for build failures due to consumeInteger. A recent patch added support for consumeInteger() and made getAsInteger delegate to this function. A few buildbots are failing as a result with an assertion failure. On a hunch, I tested what happens if I call getAsInteger() on an empty string, and sure enough it crashes the same way that the buildbots are crashing. I confirmed that getAsInteger() on an empty string did not crash before my patch, so I suspect this to be the cause. I also added a unit test for the empty string. llvm-svn: 282170	2016-09-22 15:55:05 +00:00
Zachary Turner	65fd2fc7b4	[Support] Add StringRef::consumeInteger. StringRef::getInteger() exists and treats the entire string as an integer of the specified radix, failing if any invalid characters are encountered or the number overflows. Sometimes you might have something like "123456foo" and you want to get the number 123456 and leave the string "foo" remaining. This is similar to what would be possible by using the standard runtime library functions strtoul et al and specifying an end pointer. This patch adds consumeInteger(), which does exactly that. It consumes as much as possible until an invalid character is found, and modifies the StringRef in place so that upon return only the portion of the StringRef after the number remains. Differential Revision: https://reviews.llvm.org/D24778 llvm-svn: 282164	2016-09-22 15:05:19 +00:00
Colin LeMahieu	0143146514	[MCParser] Accept uppercase radix variants 0X and 0B Differential Revision: http://reviews.llvm.org/D14781 llvm-svn: 263802	2016-03-18 18:22:07 +00:00
Chandler Carruth	233edd20a7	[ADT] Rewrite the StringRef::find implementation to be simpler, clearer, and tremendously less reliant on the optimizer to fix things. The code is always necessarily looking for the entire length of the string when doing the equality tests in this find implementation, but it previously was needlessly re-checking the size each time among other annoyances. By writing this so simply an ddirectly in terms of memcmp, it also is about 8x faster in a debug build, which in turn makes FileCheck about 2x faster in 'ninja check-llvm'. This saves about 8% of the time for FileCheck-heavy parts of the test suite like the x86 backend tests. llvm-svn: 247269	2015-09-10 11:17:49 +00:00
Chandler Carruth	4425c91dea	[ADT] Fix a confusing interface spec and some annoying peculiarities with the StringRef::split method when used with a MaxSplit argument other than '-1' (which nobody really does today, but which should actually work). The spec claimed both to split up to MaxSplit times, but also to append <= MaxSplit strings to the vector. One of these doesn't make sense. Given the name "MaxSplit", let's go with it being a max over how many splits occur, which means the max on how many strings get appended is MaxSplit+1. I'm not actually sure the implementation correctly provided this logic either, as it used a really opaque loop structure. The implementation was also playing weird games with nullptr in the data field to try to rely on a totally opaque hidden property of the split method that returns a pair. Nasty IMO. Replace all of this with what is (IMO) simpler code that doesn't use the pair returning split method, and instead just finds each separator and appends directly. I think this is a lot easier to read, and it most definitely matches the spec. Added some tests that exercise the corner cases around StringRef() and StringRef("") that all now pass. I'll start using this in code in the next commit. llvm-svn: 247249	2015-09-10 07:51:37 +00:00
Chandler Carruth	477121721b	[ADT] Add a single-character version of the small vector split routine on StringRef. Finding and splitting on a single character is substantially faster than doing it on even a single character StringRef -- we immediately get to a very tuned memchr call this way. Even nicer, we get to this even in a debug build, shaving 18% off the runtime of TripleTest.Normalization, helping PR23676 some more. llvm-svn: 247244	2015-09-10 06:07:03 +00:00
Craig Topper	e1d1294853	Simplify creation of a bunch of ArrayRefs by using None, makeArrayRef or just letting them be implicitly created. llvm-svn: 216525	2014-08-27 05:25:25 +00:00
Craig Topper	3ced27c835	Remove custom implementations of max/min in StringRef that was originally added to work an old gcc bug. I believe its been fixed by now. llvm-svn: 216156	2014-08-21 04:31:10 +00:00
Craig Topper	c10719f55d	[C++11] Make use of 'nullptr' in the Support library. llvm-svn: 205697	2014-04-07 04:17:22 +00:00
Ahmed Charles	56440fd820	Replace OwningPtr<T> with std::unique_ptr<T>. This compiles with no changes to clang/lld/lldb with MSVC and includes overloads to various functions which are used by those projects and llvm which have OwningPtr's as parameters. This should allow out of tree projects some time to move. There are also no changes to libs/Target, which should help out of tree targets have time to move, if necessary. llvm-svn: 203083	2014-03-06 05:51:42 +00:00
Rui Ueyama	00e24e48b6	Add {start,end}with_lower methods to StringRef. startswith_lower is ocassionally useful and I think worth adding. endwith_lower is added for completeness. Differential Revision: http://llvm-reviews.chandlerc.com/D2041 llvm-svn: 193706	2013-10-30 18:32:26 +00:00
Dmitri Gribenko	292c9200fc	Added const qualifier to StringRef::edit_distance member function Patch by Ismail Pazarbasi. llvm-svn: 189162	2013-08-24 01:50:41 +00:00
Manman Ren	9c5e998043	Revert r185852. llvm-svn: 185861	2013-07-08 20:27:34 +00:00
Manman Ren	c6fe5bc77c	StringRef: add DenseMapInfo for StringRef. Remove the implementation in include/llvm/Support/YAMLTraits.h. Added a DenseMap type DITypeHashMap in DebugInfo.h: DenseMap<std::pair<StringRef, unsigned>, MDNode*> llvm-svn: 185852	2013-07-08 19:17:48 +00:00
Chandler Carruth	ed0881b2a6	Use the new script to sort the includes of every file under lib. Sooooo many of these had incorrect or strange main module includes. I have manually inspected all of these, and fixed the main module include to be the nearest plausible thing I could find. If you own or care about any of these source files, I encourage you to take some time and check that these edits were sensible. I can't have broken anything (I strictly added headers, and reordered them, never removed), but they may not be the headers you'd really like to identify as containing the API being implemented. Many forward declarations and missing includes were added to a header files to allow them to parse cleanly when included first. The main module rule does in fact have its merits. =] llvm-svn: 169131	2012-12-03 16:50:05 +00:00
Nick Kledzik	35c79da3f8	Improve overflow detection in StringRef::getAsUnsignedInteger(). llvm-svn: 165038	2012-10-02 20:01:48 +00:00
Michael J. Spencer	93303819ac	[Support/StringRef] Add find_last_not_of and {r,l,}trim. llvm-svn: 156652	2012-05-11 22:08:50 +00:00
Chris Lattner	5e14666149	Don't die with an assertion if the Result bitwidth is already correct. This fixes an assert reading "1239123123123123" when the result is already 64-bit. llvm-svn: 155329	2012-04-23 00:27:54 +00:00
Chris Lattner	0a1bafed7b	No need for "else if" after a return. Autosense "0o123" as octal in StringRef::getAsInteger llvm-svn: 155298	2012-04-21 22:03:05 +00:00
Michael J. Spencer	cfa95f66a1	Make StringRef::getAsInteger work with all integer types. Before this change it would fail with {,u}int64_t on x86-64 Linux. This also removes code duplication. llvm-svn: 152517	2012-03-10 23:02:54 +00:00
Chandler Carruth	ca99ad3f0d	Add generic support for hashing StringRef objects using the new hashing library. llvm-svn: 152003	2012-03-04 10:55:27 +00:00
Duncan Sands	69d7a91334	Workaround a miscompilation by gcc-4.3 that showed up as a failure of the StringRef.Split2 unittest on 32 bit machines. llvm-svn: 151358	2012-02-24 09:01:34 +00:00
Duncan Sands	8570b29dfe	Move the implementation of StringRef::split out of StringExtras.cpp and into StringRef.cpp, which is where the other StringRef stuff is. llvm-svn: 151054	2012-02-21 12:00:25 +00:00
Kaelyn Uhrain	7a9ccf4c09	Add function for computing the edit distance of two arrays. Accomplished by moving the body of StringRef::edit_distance into a separate function that accepts two ArrayRefs, and making StringRef::edit_distance a wrapper around the new function. llvm-svn: 150621	2012-02-15 22:13:07 +00:00
Benjamin Kramer	e3b94d1b55	Fix a typo. llvm-svn: 143890	2011-11-06 20:36:50 +00:00
Daniel Dunbar	3fa528d67c	ADT/StringRef: Add ::lower() and ::upper() methods. llvm-svn: 143880	2011-11-06 18:04:43 +00:00
Benjamin Kramer	e664de33b1	Fix handling of the From parameter in StringRef::find. Enable bounds checking to catch this kind of bug earlier. llvm-svn: 142247	2011-10-17 20:49:40 +00:00
Benjamin Kramer	4d681d7dc4	Add a bad char heuristic to StringRef::find. Based on Horspool's simplified version of Boyer-Moore. We use a constant-sized table of uint8_ts to keep cache thrashing low, needles bigger than 255 bytes are uncommon anyways. The worst case is still O(n*m) but we do a lot better on the average case now. llvm-svn: 142061	2011-10-15 10:08:31 +00:00
Jakob Stoklund Olesen	c874e2d8fb	Fix a bug in compare_numeric(). Thanks to Alexandru Dura and Jonas Paulsson for finding it. llvm-svn: 140859	2011-09-30 17:03:55 +00:00
Lenny Maiorani	367342e209	Remove bounded StringRef::compare() since nothing but Clang SA was using it and it is just as easy to use StringRef::substr() preceding StringRef::compare() to achieve the same thing. llvm-svn: 130430	2011-04-28 20:20:12 +00:00
Lenny Maiorani	fad9d95722	Implements StringRef::compare with bounds. It is behaves similarly to strncmp(). Unit tests also included. llvm-svn: 129582	2011-04-15 17:56:50 +00:00
Chris Lattner	0ab5e2cded	Fix a ton of comment typos found by codespell. Patch by Luis Felipe Strano Moraes! llvm-svn: 129558	2011-04-15 05:18:47 +00:00
Jay Foad	583abbc4df	PR5207: Change APInt methods trunc(), sext(), zext(), sextOrTrunc() and zextOrTrunc(), and APSInt methods extend(), extOrTrunc() and new method trunc(), to be const and to return a new value instead of modifying the object in place. llvm-svn: 121120	2010-12-07 08:25:19 +00:00
Michael J. Spencer	e1d3603dc6	Support/ADT/StringRef: Add find_last_of. llvm-svn: 120495	2010-11-30 23:27:35 +00:00
Michael J. Spencer	f13f442b1a	Fix Whitespace. llvm-svn: 120166	2010-11-26 04:16:08 +00:00
Ted Kremenek	3e100cf582	Fix memory leak in StringRef::edit_distance(). 'Allocated' could be leaked on an early return. llvm-svn: 118370	2010-11-07 06:09:02 +00:00
Douglas Gregor	21afc3b012	Extend StringRef's edit-distance algorithm to permit an upper bound on the allowed edit distance llvm-svn: 116867	2010-10-19 22:13:48 +00:00
Benjamin Kramer	9bf0380a54	StringRef::compare_numeric also differed from StringRef::compare for characters > 127. llvm-svn: 112189	2010-08-26 15:25:35 +00:00
Benjamin Kramer	b04d4af057	Do unsigned char comparisons in StringRef::compare_lower to be more consistent with compare in corner cases. llvm-svn: 112185	2010-08-26 14:21:08 +00:00
Benjamin Kramer	08fd2cf26a	Avoid O(n*m) complexity in StringRef::find_first(_not)_of(StringRef). - Cache used characters in a bitset to reduce memory overhead to just 32 bytes. - On my core2 this code is faster except when the checked string was very short (smaller than the list of delimiters). llvm-svn: 111817	2010-08-23 18:16:08 +00:00
Jakob Stoklund Olesen	d1d7ed63ff	Add StringRef::compare_numeric and use it to sort TableGen register records. This means that our Registers are now ordered R7, R8, R9, R10, R12, ... Not R1, R10, R11, R12, R2, R3, ... llvm-svn: 104745	2010-05-26 21:47:28 +00:00
John McCall	512b650210	Add an override to StringRef::getAsInteger which parses into an APInt. It gets its own implementation totally divorced from the (presumably performance-sensitive) routines which parse into a uint64_t. Add APInt::operator\|=(uint64_t), which is situationally much better than using a full APInt. llvm-svn: 97381	2010-02-28 09:55:58 +00:00
Douglas Gregor	47ed966813	More trivial optimizations to a function well outside the critical path llvm-svn: 92896	2010-01-07 02:24:06 +00:00
Douglas Gregor	09470e6a4e	Switch StringRef::edit_distance over to using raw pointers, since both std::vector and llvm::SmallVector have annoying performance tradeoffs. No, I don't expect this to matter, and now it won't. llvm-svn: 92884	2010-01-07 00:51:54 +00:00
Douglas Gregor	5639af4eac	Document the edit-distance algorithm used in StringRef, switch it over to SmallVector, and add a unit test. llvm-svn: 92340	2009-12-31 04:24:34 +00:00
Douglas Gregor	165882c240	Implement edit distance for StringRef llvm-svn: 92309	2009-12-30 17:23:44 +00:00
Daniel Dunbar	956c1581fa	Use StringRef::min instead of std::min. llvm-svn: 89372	2009-11-19 18:53:18 +00:00

1 2

59 Commits