llvm-project

Commit Graph

Author	SHA1	Message	Date
Tobias Grosser	fb4842ff95	Add the runtime library for GPGPU code generation. Contributed by: Yabin Hu <yabin.hwu@gmail.com> llvm-svn: 158304	2012-06-11 09:25:01 +00:00
Bill Wendling	4b79647a6e	Re-enable the CMN instruction. We turned off the CMN instruction because it had semantics which we weren't getting correct. If we are comparing with an immediate, then it's okay to use the CMN instruction. <rdar://problem/7569620> llvm-svn: 158302	2012-06-11 08:07:26 +00:00
Benjamin Kramer	2150145ae4	InstCombine: factor code better. No functionality change. llvm-svn: 158301	2012-06-11 08:01:25 +00:00
Craig Topper	ce8dbaadb6	Add XOP shift and compare intrinsics. llvm-svn: 158300	2012-06-11 07:01:43 +00:00
James Dennett	1355bd17c6	Documentation cleanup, fixing Doxygen markup. Mostly this avoids common terms such as "protocol" and "expression" being implicitly turned into links to mistakenly-generated Doxygen pages: - Escaping @ symbols when Doxygen would otherwise incorrectly interpret them; - Escaping # symbols when they're not intended as explicit Doxygen link requests, such as when discussing preprocessor directives; - In one odd case, unescaping @ in @__experimental_modules_import, because Doxygen wrote '\@' to the output in that case, causing the example in the description of ImportDecl to be wrong; and - Fixing a typo: @breif -> @brief. llvm-svn: 158299	2012-06-11 06:19:40 +00:00
Argyrios Kyrtzidis	ff32674d3a	Const'ify CompilerInvocation::toArgs(). llvm-svn: 158298	2012-06-11 03:34:26 +00:00
Benjamin Kramer	8b8a76974f	InstCombine: Turn (zext A) == (B & (1<<X)-1) into A == (trunc B), narrowing the compare. This saves a cast, and zext is more expensive on platforms with subreg support than trunc is. This occurs in the BSD implementation of memchr(3), see PR12750. On the synthetic benchmark from that bug stupid_memchr and bsd_memchr have the same performance now when not inlining either function. stupid_memchr: 323.0us bsd_memchr: 321.0us memchr: 479.0us where memchr is the llvm-gcc compiled bsd_memchr from osx lion's libc. When inlining is enabled bsd_memchr still regresses down to llvm-gcc memchr time, I haven't fully understood the issue yet, something is grossly mangling the loop after inlining. llvm-svn: 158297	2012-06-10 20:35:00 +00:00
Hal Finkel	4e9f1a859f	Enable ILP scheduling for all nodes by default on PPC. Over the entire test-suite, this has an insignificantly negative average performance impact, but reduces some of the worst slowdowns from the anti-dep. change (r158294). Largest speedups: SingleSource/Benchmarks/Stanford/Quicksort - 28% SingleSource/Benchmarks/Stanford/Towers - 24% SingleSource/Benchmarks/Shootout-C++/matrix - 23% MultiSource/Benchmarks/SciMark2-C/scimark2 - 19% MultiSource/Benchmarks/MiBench/automotive-bitcount/automotive-bitcount - 15% (matrix and automotive-bitcount were both in the top-5 slowdown list from the anti-dep. change) Largest slowdowns: MultiSource/Benchmarks/McCat/03-testtrie/testtrie - 28% MultiSource/Benchmarks/mediabench/gsm/toast/toast - 26% MultiSource/Benchmarks/MiBench/automotive-susan/automotive-susan - 21% SingleSource/Benchmarks/CoyoteBench/lpbench - 20% MultiSource/Applications/d/make_dparser - 16% llvm-svn: 158296	2012-06-10 19:32:29 +00:00
Nadav Rotem	17ee58a792	Add AutoUpgrade support for the SSE4 ptest intrinsics. Patch by Michael Kuperstein. llvm-svn: 158295	2012-06-10 18:42:51 +00:00
Hal Finkel	a8100281ae	Use critical anti-dep. breaking on all PPC targets, but also add other register classes. Using 'all' instead of 'critical' would be better because it would make it easier to satisfy the bundling constraints, but, as noted in the FIXME, that is currently not possible with the crs. This yields an average 1% speedup over the entire test suite (on Power 7). Largest speedups: SingleSource/Benchmarks/Shootout-C++/moments - 40% MultiSource/Benchmarks/McCat/03-testtrie/testtrie - 28% SingleSource/Benchmarks/BenchmarkGame/nsieve-bits - 26% SingleSource/Benchmarks/McGill/misr - 23% MultiSource/Applications/JM/ldecod/ldecod - 22% Largest slowdowns: SingleSource/Benchmarks/Shootout-C++/matrix - -29% SingleSource/Benchmarks/Shootout-C++/ary3 - -22% MultiSource/Benchmarks/BitBench/uuencode/uuencode - -18% SingleSource/Benchmarks/Shootout-C++/ary - -17% MultiSource/Benchmarks/MiBench/automotive-bitcount/automotive-bitcount - -15% llvm-svn: 158294	2012-06-10 11:15:36 +00:00
Richard Smith	521ecc1f97	PR12964: __int128 and unsigned __int128 are promoted integral types, be sure to consider them when enumerating builtin operator candidates. llvm-svn: 158293	2012-06-10 08:00:26 +00:00
Craig Topper	a3c5fbf54b	Add XOP vprot* instruction intrinsics llvm-svn: 158292	2012-06-10 07:47:32 +00:00
Craig Topper	7afe343be5	Add intrinsics for immediate form of XOP vprot instructions. Use i128mem instead of f128mem for integer XOP instructions. llvm-svn: 158291	2012-06-10 07:31:56 +00:00
Richard Smith	4086a13df8	Remove CXXRecordDecl flags which are unused after r158289. We need an efficient mechanism to determine whether a defaulted default constructor is constexpr, in order to determine whether a class is a literal type, so keep the incrementally-built form on CXXRecordDecl. Remove the on-demand computation of same, so that we only have one method for determining whether a default constructor is constexpr. This doesn't affect correctness, since default constructor lookup is much simpler than selecting a constructor for copying or moving. We don't need a corresponding mechanism for defaulted copy or move constructors, since they can't affect whether a type is a literal type. Conversely, checking whether such functions are constexpr can require non-trivial effort, so we defer such checks until the copy or move constructor is required. Thus we now only compute whether a copy or move constructor is constexpr on demand, and only compute whether a default constructor is constexpr in advance. This is unfortunate, but seems like the best solution. llvm-svn: 158290	2012-06-10 07:07:24 +00:00
Richard Smith	b58000950e	Fix PR13052 properly, by performing special member lookup to determine whether an explicitly-defaulted default constructor would be constexpr. This is necessary in weird (but well-formed) cases where a class has more than one copy or move constructor. Cleanup of now-unused parts of CXXRecordDecl to follow. llvm-svn: 158289	2012-06-10 05:43:50 +00:00
Richard Smith	2b013185f8	PR13064: Store whether an in-class initializer uses direct or copy initialization, and use that information to produce the right kind of initialization during template instantiation. llvm-svn: 158288	2012-06-10 03:12:00 +00:00
Craig Topper	02b3d81a97	More XOP intrinsics llvm-svn: 158287	2012-06-10 02:46:15 +00:00
Craig Topper	33b6d5e20b	Begin adding XOP intrinsics llvm-svn: 158286	2012-06-10 00:39:38 +00:00
James Dennett	f3d90890ec	Fix the top-of-file comment in Attr.h to say that it's about attributes, not expressions. llvm-svn: 158285	2012-06-09 23:00:13 +00:00
Craig Topper	f561a9562d	Add XOP feature flag. llvm-svn: 158284	2012-06-09 22:24:14 +00:00
Hal Finkel	2edfbddcf0	Improve ext/trunc patterns on PPC64. The PPC64 backend had patterns for i32 <-> i64 extensions and truncations that would leave self-moves in the final assembly. Replacing those patterns with ones based on the SUBREG builtins yields better-looking code. Thanks to Jakob and Owen for their suggestions in this matter. llvm-svn: 158283	2012-06-09 22:10:19 +00:00
Ted Kremenek	2d7fc64274	Add CSS style for FAQ questions, and restate FAQ questions as actual questions. llvm-svn: 158282	2012-06-09 20:10:45 +00:00
Ted Kremenek	41bf2cc177	Wordsmith a bit, and turn </h1> into </h4> when appropriate. llvm-svn: 158281	2012-06-09 20:10:42 +00:00
Howard Hinnant	c418654096	Revert pair constructors back to using is_convertible instead of is_constructible. This should pull things into alignment with the final draft. Fixes http://llvm.org/bugs/show_bug.cgi?id=13063#add_comment . llvm-svn: 158280	2012-06-09 20:01:23 +00:00
Craig Topper	a54893c662	Use XOP vpcom intrinsics in patterns instead of a target specific SDNode type. Remove the custom lowering code that selected the SDNode type. llvm-svn: 158279	2012-06-09 17:02:24 +00:00
Craig Topper	3352ba55b9	Replace XOP vpcom intrinsics with fewer intrinsics that take the immediate as an argument. llvm-svn: 158278	2012-06-09 16:46:13 +00:00
Gregory Szorc	531880af77	[clang.py] Implement Cursor.is_static_method llvm-svn: 158277	2012-06-09 16:21:34 +00:00
Benjamin Kramer	df97aa1628	Hashing: Remove outdated comment. Support for reserved hash values was removed in r151865. llvm-svn: 158276	2012-06-09 15:33:28 +00:00
Benjamin Kramer	8ea1fe028c	Reorder ParsedTemplateArg's members to reduce padding on x86_64. llvm-svn: 158275	2012-06-09 15:15:27 +00:00
Aaron Ballman	36a978cca2	Disabling a spurious deprecation warning about using PathV1 from within the PathV1 implementation file. llvm-svn: 158274	2012-06-09 13:59:29 +00:00
Aaron Ballman	503bbff367	Fixing a typo in the comments. llvm-svn: 158273	2012-06-09 13:46:36 +00:00
Benjamin Kramer	7132638995	Replace a char counting helper function with std::count. No functionality change. llvm-svn: 158272	2012-06-09 13:18:14 +00:00
Benjamin Kramer	a74cf08e68	Replace constant-sized BitVector with std::bitset. llvm-svn: 158270	2012-06-09 12:56:12 +00:00
Benjamin Kramer	0748008df5	Allocate the contents of DwarfDebug's StringMaps in a single big BumpPtrAllocator. llvm-svn: 158265	2012-06-09 10:34:15 +00:00
Duncan Sands	556eab8878	Silence a gcc-4.6 warning: GCC fails to understand that secondReg and cmpOp2 are correlated, and thinks that cmpOp2 may be used uninitialized. llvm-svn: 158263	2012-06-09 10:04:03 +00:00
Alexey Samsonov	bb62f555f3	[Sanitizer] Use __libc_malloc/__libc_free instead of malloc/free inside internal allocator on Linux (important for TSan) llvm-svn: 158261	2012-06-09 09:21:44 +00:00
Alexey Samsonov	f516d39c1f	[TSan] use efficient real_memcpy inside runtime llvm-svn: 158260	2012-06-09 09:14:24 +00:00
Hal Finkel	eb50c2d4a4	Enable tail merging on PPC. Tail merging had been disabled on PPC because it would disturb bundling decisions made during pre-RA scheduling on the 970 cores. Now, however, all bundling decisions are made during post-RA scheduling, and tail merging is generally beneficial (the average test-suite speedup is insignificantly positive). Largest test-suite speedups: MultiSource/Benchmarks/mediabench/gsm/toast/toast - 30% MultiSource/Benchmarks/BitBench/uuencode/uuencode - 23% SingleSource/Benchmarks/Shootout-C++/ary - 21% SingleSource/Benchmarks/Stanford/Queens - 17% Largest slowdowns: MultiSource/Benchmarks/MiBench/security-sha/security-sha - 24% MultiSource/Benchmarks/McCat/03-testtrie/testtrie - 22% MultiSource/Applications/JM/ldecod/ldecod - 14% MultiSource/Benchmarks/mediabench/g721/g721encode/encode - 9% This is improved by using full (instead of just critical) anti-dependency breaking, but doing so still causes miscompiles and so cannot yet be enabled by default. llvm-svn: 158259	2012-06-09 03:14:50 +00:00
Argyrios Kyrtzidis	091b87bb9d	[libclang] When doing the cursor visitation make sure declarations in the same line do not override getting a cursor for the previous declaration. e.g: int x, y; @synthesize prop1, prop2; pointing at 'x'/'prop1' would give 'y'/'prop2' because their source ranges overlap. rdar://11361113 llvm-svn: 158258	2012-06-09 03:03:02 +00:00
Andrew Trick	fc8ce08be3	Register pressure: added getPressureAfterInstr. llvm-svn: 158256	2012-06-09 02:16:58 +00:00
Jakob Stoklund Olesen	c26fbbfba5	Sketch a LiveRegMatrix analysis pass. The LiveRegMatrix represents the live range of assigned virtual registers in a Live interval union per register unit. This is not fundamentally different from the interference tracking in RegAllocBase that both RABasic and RAGreedy use. The important differences are: - LiveRegMatrix tracks interference per register unit instead of per physical register. This makes interference checks cheaper and assignments slightly more expensive. For example, the ARM D7 reigster has 24 aliases, so we would check 24 physregs before assigning to one. With unit-based interference, we check 2 units before assigning to 2 units. - LiveRegMatrix caches regmask interference checks. That is currently duplicated functionality in RABasic and RAGreedy. - LiveRegMatrix is a pass which makes it possible to insert target-dependent passes between register allocation and rewriting. Such passes could tweak the register assignments with interference checking support from LiveRegMatrix. Eventually, RABasic and RAGreedy will be switched to LiveRegMatrix. llvm-svn: 158255	2012-06-09 02:13:10 +00:00
Anna Zaks	ab24b83b01	[analyzer] FAQ page fixup llvm-svn: 158254	2012-06-09 01:05:01 +00:00
Anna Zaks	40d64fcd55	[analyzer] Add FAQ and How to Deal with Common False Positives page Still not linked in; comments and additions are very welcome. llvm-svn: 158253	2012-06-09 01:04:54 +00:00
Jack Carter	2db37e8226	Test commit llvm-svn: 158250	2012-06-09 00:27:55 +00:00
Jakob Stoklund Olesen	be336295cd	Also compute MBB live-in lists in the new rewriter pass. This deduplicates some code from the optimizing register allocators, and it means that it is now possible to change the register allocators' solutions simply by editing the VirtRegMap between the register allocator pass and the rewriter. llvm-svn: 158249	2012-06-09 00:14:47 +00:00
Dmitri Gribenko	dbeafa773a	Convert comments to proper Doxygen comments. llvm-svn: 158248	2012-06-09 00:01:45 +00:00
Andrew Trick	ce679ad89d	Removing strange "using" declarations form TargetInstrInfo. I can't imagine why these were added. Trial and error. llvm-svn: 158247	2012-06-08 23:56:26 +00:00
Jason Molenda	b68c2d96ce	Incorporate suggestions from Jim, add "USING LLDB" and "FILES" sections. llvm-svn: 158246	2012-06-08 23:49:32 +00:00
Michael J. Spencer	d1e09a4282	[C++11 Compat] Fix breaking change in C++11 pair copyctor. While this code is valid C++98, it is not valid C++11. The problem can be reduced to: class MDNode; class DIType { operator MDNode() const {return 0;} }; class WeakVH { WeakVH(MDNode) {} }; int main() { DIType di; std::pair<void, WeakVH> p(std::make_pair((void)0, di))); } This was not detected by any of the bots we have because they either compile C++98 with libstdc++ (which allows it), or C++11 with libc++ (which incorrectly allows it). I ran into the problem when compiling with VS 2012 RC. Thanks to Richard for explaining the issue. llvm-svn: 158245	2012-06-08 23:47:12 +00:00
Jakob Stoklund Olesen	1224312f5b	Reintroduce VirtRegRewriter. OK, not really. We don't want to reintroduce the old rewriter hacks. This patch extracts virtual register rewriting as a separate pass that runs after the register allocator. This is possible now that CodeGen/Passes.cpp can configure the full optimizing register allocator pipeline. The rewriter pass uses register assignments in VirtRegMap to rewrite virtual registers to physical registers, and it inserts kill flags based on live intervals. These finalization steps are the same for the optimizing register allocators: RABasic, RAGreedy, and PBQP. llvm-svn: 158244	2012-06-08 23:44:45 +00:00

1 2 3 4 5 ...

128701 Commits All Branches Search

128701 Commits

All Branches