Seth Cantrell
4902e033b5
isLegalUTF8() was giving the wrong answer
invalid but not caught by isLegalUTF8(): 0xED 0x75 0x84
llvm-svn: 167063
2012-10-30 23:50:26 +00:00
Richard Smith
639b8d05dd
When a bad UTF-8 encoding or bogus escape sequence is encountered in a
string literal, produce a diagnostic pointing at the erroneous character
range, not at the start of the literal.
llvm-svn: 163459
2012-09-08 07:16:20 +00:00
Enrico Granata
4e0b1eda9a
Enabling the UTF conversion routines that were previously conditionally ifdef'ed out. This change is required to support enhancements in the LLDB data formatters.
llvm-svn: 163331
2012-09-06 17:03:42 +00:00
Dmitri Gribenko
ce675c52ba
Add a function to convert a single Unicode code point to a UTF8 sequence.
llvm-svn: 160890
2012-07-27 20:36:22 +00:00
Richard Smith
812924502b
When checking the encoding of an 8-bit string literal, don't just check the
first codepoint! Also, don't reject empty raw string literals for spurious
"encoding" issues. Also, don't rely on undefined behavior in ConvertUTF.c.
llvm-svn: 152344
2012-03-08 21:59:28 +00:00
Eli Friedman
2ae0e1e279
Move ConvertUTF8toUTF32 out of #if 0, in preparation for a patch which needs it.
llvm-svn: 143415
2011-11-01 02:10:54 +00:00
Mike Stump
11289f4280
Remove tabs, and whitespace cleanups.
llvm-svn: 81346
2009-09-09 15:08:12 +00:00
Mike Stump
d73e4419f5
Fixup whitespacing.
llvm-svn: 69055
2009-04-14 18:24:37 +00:00
Steve Naroff
29cae66bde
Add ConvertUTF module from http://www.unicode.org/Public/PROGRAMS/CVTUTF .
#ifdef'd out the 5 conversion routines that we don't currently need.
Still need a bit more work in GetAddrOfConstantCFString(). Added a FIXME to indicate this.
Expect to remove the FIXME today...
llvm-svn: 68208
2009-04-01 15:50:34 +00:00