llvm-project

Commit Graph

Author	SHA1	Message	Date
Puyan Lotfi	5eb1004889	The following patch' purpose is to reduce compile time for compilation of small programs on targets with large register files. The root of the compile time overhead was in the use of llvm::SmallVector to hold PhysRegEntries, which resulted in slow-down from calling llvm::SmallVector::assign(N, 0). In contrast std::vector uses the faster __platform_bzero to zero out primitive buffers when assign is called, while SmallVector uses an iterator. The fix for this was simply to replace the SmallVector with a dynamically allocated buffer and to initialize or reinitialize the buffer based on the total registers that the target architecture requires. The changes support cases where a pass manager may be reused for different targets, and note that the PhysRegEntries is allocated using calloc mainly for good for, and also to quite tools like Valgrind (see comments for more info on this). There is an rdar to track the fact that SmallVector doesn't have platform specific speedup optimizations inside of it for things like this, and I'll create a bugzilla entry at some point soon as well. TL;DR: This fix replaces the expensive llvm::SmallVector<unsigned char>::assign(N, 0) with a call to calloc for N bytes which is much faster because SmallVector's assign uses iterators. llvm-svn: 200917	2014-02-06 09:23:24 +00:00
Dmitry Vyukov	9ba840865f	tsan: simplify Go build script we don't use assembly files llvm-svn: 200916	2014-02-06 09:23:12 +00:00
Dmitry Vyukov	447bb46e03	tsan: remove unused functions llvm-svn: 200915	2014-02-06 09:22:50 +00:00
Dmitry Vyukov	a5d1fcfde1	tsan: improve error message for Go llvm-svn: 200914	2014-02-06 09:22:29 +00:00
Puyan Lotfi	12ae04bd17	This small change reduces compile time for small programs on targets that have large register files. The omission of Queries.clear() is perfectly safe because LiveIntervalUnion::Query doesn't contain any data that needs freeing and because LiveRegMatrix::runOnFunction happens to reset the OwningArrayPtr holding Queries every time it is run, so there's no need to zero out the queries either. Not having to do this for very large numbers of physregs is a noticeable constant cost reduction in compilation of small programs. llvm-svn: 200913	2014-02-06 08:42:01 +00:00
Simon Atanasyan	0743a72caa	Accept and handle absolute symbols with empty name. llvm-svn: 200911	2014-02-06 07:35:16 +00:00
Kostya Serebryany	1f5d17c57d	[asan] fix testing on Mac llvm-svn: 200910	2014-02-06 07:19:52 +00:00
NAKAMURA Takumi	0c81c716eb	check-clang: Introduce the feature "utf8-capable-terminal". clang/test/FixIt/fixit-unicode-with-utf8-output.c has begun complained since LLVM r200885. Although it is changes for StringRef, it brought LLVM_ON_WIN32 to Support/Locale.cpp. Before r200885, LLVM_ON_WIN32 was undefined in Locale.cpp! FIXME: We should consider i18n on win32. llvm-svn: 200909	2014-02-06 07:15:59 +00:00
Kostya Serebryany	1ee681305f	[asan] introduce two functions that will allow implementations of C++ garbage colection to work with asan's fake stack llvm-svn: 200908	2014-02-06 06:56:22 +00:00
Nick Lewycky	993849490e	A memcpy out of an fresh alloca is a no-op, delete it. Patch by Patrick Walton! llvm-svn: 200907	2014-02-06 06:29:19 +00:00
Craig Topper	f1aab4502e	Delete all of the CodeGenInstructions from CodeGenTarget destructor. llvm-svn: 200906	2014-02-06 06:27:59 +00:00
Chandler Carruth	d1ba2efb8f	[PM] Fix horrible typos that somehow didn't cause a failure in a C++11 build but spectacularly changed behavior of the C++98 build. =] This shows my one problem with not having unittests -- basic API expectations aren't well exercised by the integration tests because they happen to not come up, even though they might later. I'll probably add a basic unittest to complement the integration testing later, but I wanted to revive the bots. llvm-svn: 200905	2014-02-06 05:17:02 +00:00
Marshall Clow	d230a3d1f6	Fix PR17221 - can't catch virtual base classes when throwing derived NULL pointers. Specifically, libc++abi would crash when you tried it. llvm-svn: 200904	2014-02-06 04:47:02 +00:00
Chandler Carruth	bf71a34eb9	[PM] Add a new "lazy" call graph analysis pass for the new pass manager. The primary motivation for this pass is to separate the call graph analysis used by the new pass manager's CGSCC pass management from the existing call graph analysis pass. That analysis pass is (somewhat unfortunately) over-constrained by the existing CallGraphSCCPassManager requirements. Those requirements make it really hard to cleanly layer the needed functionality for the new pass manager on top of the existing analysis. However, there are also a bunch of things that the pass manager would specifically benefit from doing differently from the existing call graph analysis, and this new implementation tries to address several of them: - Be lazy about scanning function definitions. The existing pass eagerly scans the entire module to build the initial graph. This new pass is significantly more lazy, and I plan to push this even further to maximize locality during CGSCC walks. - Don't use a single synthetic node to partition functions with an indirect call from functions whose address is taken. This node creates a huge choke-point which would preclude good parallelization across the fanout of the SCC graph when we got to the point of looking at such changes to LLVM. - Use a memory dense and lightweight representation of the call graph rather than value handles and tracking call instructions. This will require explicit update calls instead of some updates working transparently, but should end up being significantly more efficient. The explicit update calls ended up being needed in many cases for the existing call graph so we don't really lose anything. - Doesn't explicitly model SCCs and thus doesn't provide an "identity" for an SCC which is stable across updates. This is essential for the new pass manager to work correctly. - Only form the graph necessary for traversing all of the functions in an SCC friendly order. This is a much simpler graph structure and should be more memory dense. It does limit the ways in which it is appropriate to use this analysis. I wish I had a better name than "call graph". I've commented extensively this aspect. This is still very much a WIP, in fact it is really just the initial bits. But it is about the fourth version of the initial bits that I've implemented with each of the others running into really frustrating problms. This looks like it will actually work and I'd like to split the actual complexity across commits for the sake of my reviewers. =] The rest of the implementation along with lots of wiring will follow somewhat more rapidly now that there is a good path forward. Naturally, this doesn't impact any of the existing optimizer. This code is specific to the new pass manager. A bunch of thanks are deserved for the various folks that have helped with the design of this, especially Nick Lewycky who actually sat with me to go through the fundamentals of the final version here. llvm-svn: 200903	2014-02-06 04:37:03 +00:00
Chandler Carruth	e309d3768c	[PM] Back out one hunk of the patch in r200901 that was supposed to go in my next patch. Sorry for the breakage. llvm-svn: 200902	2014-02-06 04:32:33 +00:00
Chandler Carruth	c68d08241b	[PM] Wire up the analysis managers in the opt driver. This isn't really necessary until we add analyses to the driver, but I have such an analysis ready and wanted to split this out. This is actually exercised by the existing tests of the new pass manager as the analysis managers are cross-checked and validated by the function and module managers. llvm-svn: 200901	2014-02-06 04:25:13 +00:00
Juergen Ributzka	fa0eba6c8b	[DAG] Don't pull the binary operation though the shift if the operands have opaque constants. During DAGCombine visitShiftByConstant assumes that certain binary operations with only constant operands can always be folded successfully. This is no longer true when the constant is opaque. This commit fixes visitShiftByConstant by not performing the optimization for opaque constants. Otherwise we would end up in an infinite DAGCombine loop. llvm-svn: 200900	2014-02-06 04:09:06 +00:00
Serge Pavlov	774c6d03b2	Allow transformation of VariableArray to ConstantArray. In the following code: struct A { static const int sz; }; template<class T> void f() { T arr[A::sz]; } the array 'arr' is represented as a variable size array in the template. If 'A::sz' gets value below in the translation unit, the array in instantiation can turn into constant size array. This change fixes PR18633. Differential Revision: http://llvm-reviews.chandlerc.com/D2688 llvm-svn: 200899	2014-02-06 03:49:11 +00:00
Manman Ren	d461244972	Set default of inlinecold-threshold to 225. 225 is the default value of inline-threshold. This change will make sure we have the same inlining behavior as prior to r200886. As Chandler points out, even though we don't have code in our testing suite that uses cold attribute, there are larger applications that do use cold attribute. r200886 + this commit intend to keep the same behavior as prior to r200886. We can later on tune the inlinecold-threshold. The main purpose of r200886 is to help performance of instrumentation based PGO before we actually hook up inliner with analysis passes such as BPI and BFI. For instrumentation based PGO, we try to increase inlining of hot functions and reduce inlining of cold functions by setting inlinecold-threshold. Another option suggested by Chandler is to use a boolean flag that controls if we should use OptSizeThreshold for cold functions. The default value of the boolean flag should not change the current behavior. But it gives us less freedom in controlling inlining of cold functions. llvm-svn: 200898	2014-02-06 01:59:22 +00:00
Richard Smith	18819307d3	DR101, PR12770: If a function is declared in the same context as a using-declaration, and they declare the same function (either because the using-declaration is in the same namespace as the declaration it imports, or because they're both extern "C"), they do not conflict. llvm-svn: 200897	2014-02-06 01:31:33 +00:00
Kevin Enderby	d6b107136a	Update the X86 assembler for .intel_syntax to accept the << and >> bitwise operators. rdar://15975725 llvm-svn: 200896	2014-02-06 01:21:15 +00:00
Rafael Espindola	6a383f9a54	don't set HasReliableSymbolDifference for ELF. It is only used in MachObjectWriter.cpp. Another leftover from early days of ELF in MC. llvm-svn: 200895	2014-02-06 01:06:31 +00:00
Rafael Espindola	12f04984f8	doesSectionRequireSymbols is meaningless on ELF, remove. This is a nop. doesSectionRequireSymbols is only used from isSymbolLinkerVisible. isSymbolLinkerVisible only use from ELF was in if (!Asm.isSymbolLinkerVisible(Symbol) && !Symbol.isUndefined()) return false; if (Symbol.isTemporary()) return false; If the symbol is a temporary this code returns false and it is irrelevant if we take the first if or not. If the symbol is not a temporary, Asm.isSymbolLinkerVisible returns true without ever calling doesSectionRequireSymbols. This was an horrible leftover from when support for ELF was first added. llvm-svn: 200894	2014-02-06 00:54:53 +00:00
Manman Ren	9724752f4b	Simplify code by combining ifs. llvm-svn: 200893	2014-02-06 00:08:15 +00:00
Paul Robinson	af4e64d095	Disable most IR-level transform passes on functions marked 'optnone'. Ideally only those transform passes that run at -O0 remain enabled, in reality we get as close as we reasonably can. Passes are responsible for disabling themselves, it's not the job of the pass manager to do it for them. llvm-svn: 200892	2014-02-06 00:07:05 +00:00
Manman Ren	f9e58778bc	Fix Werror introduced at r200874. llvm-svn: 200891	2014-02-06 00:03:20 +00:00
Rafael Espindola	4998280fdf	Just returning false is the default. llvm-svn: 200890	2014-02-06 00:03:15 +00:00
Nick Lewycky	1f529663bb	Fix -Wunused-variable 'FD' by using it instead of ND when they're equal but FD has a more precise type. llvm-svn: 200889	2014-02-05 23:53:29 +00:00
Matt Arsenault	1b55dd9a81	Pass address space to allowsUnalignedMemoryAccesses llvm-svn: 200888	2014-02-05 23:16:05 +00:00
Matt Arsenault	25793a3f22	Add address space argument to allowsUnalignedMemoryAccess. On R600, some address spaces have more strict alignment requirements than others. llvm-svn: 200887	2014-02-05 23:15:53 +00:00
Manman Ren	e8781b1a36	Inliner uses a smaller inline threshold for callees with cold attribute. Added command line option inlinecold-threshold to set threshold for inlining functions with cold attribute. Listen to the cold attribute when it would decrease the inline threshold. llvm-svn: 200886	2014-02-05 22:53:44 +00:00
Nick Kledzik	4d6d981297	Fix layering StringRef copy using BumpPtrAllocator. Now to copy a string into a BumpPtrAllocator and get a StringRef to the copy: StringRef myCopy = myStr.copy(myAllocator); llvm-svn: 200885	2014-02-05 22:22:56 +00:00
Ben Langmuir	2cb4a78f93	Add a CC1 option -verify-pch This option will: - load the given pch file - verify it is not out of date by stat'ing dependencies, and - return 0 on success and non-zero on error llvm-svn: 200884	2014-02-05 22:21:15 +00:00
Quentin Colombet	87769713cf	[RegAlloc] Add a last chance recoloring mechanism when everything else failed to find a register. The idea is to choose a color for the variable that cannot be allocated and recolor its interferences around. Unlike the current register allocation scheme, it is allowed to change the color of an already assigned (but maybe not splittable or spillable) live interval while propagating this change to its neighbors. In other word, there are two things that may help finding an available color: - Already assigned variables (RS_Done) can be recolored to different color. - The recoloring allows to catch solutions that needs to touch more that just the neighbors of the current allocated variable. E.g., vA can use {R1, R2 } vB can use { R2, R3} vC can use {R1 } Where vA, vB, and vC cannot be split anymore (they are reloads for instance) and they all interfere. vA is assigned R1 vB is assigned R2 vC tries to evict vA but vA is already done. => Regular register allocation heuristic fails. Last chance recoloring kicks in: vC does as if vA was evicted => vC uses R1. vC is marked as fixed. vA needs to find a color. None are available. vA cannot evict vC: vC is a fixed virtual register now. vA does as if vB was evicted => vA uses R2. vB needs to find a color. R3 is available. Recoloring => vC = R1, vA = R2, vB = R3. <rdar://problem/15947839> llvm-svn: 200883	2014-02-05 22:13:59 +00:00
Greg Clayton	8ee673141b	Don't print out "script" results twice. We now properly detect when a result object has an immediate output stream and don't echo the results a second time. <rdar://problem/15954906> llvm-svn: 200882	2014-02-05 21:46:20 +00:00
Chandler Carruth	eedf9fca28	[PM] Don't require analysis results to be const in the new pass manager. I think this was just over-eagerness on my part. The analysis results need to often be non-const because they need to (in some cases at least) be updated by the transformation pass in order to remain correct. It also makes lazy analyses (a common case) needlessly annoying to write in order to make their entire state mutable. llvm-svn: 200881	2014-02-05 21:41:42 +00:00
Manman Ren	215893317b	Try to fix ppc bot failure. llvm-svn: 200880	2014-02-05 21:40:10 +00:00
Enrico Granata	9b55aa4e8f	An example summary provider for PyObject and the LLDB wrapper PythonObject hierarchy - this would have probably helped track down those refcount bugs.. llvm-svn: 200879	2014-02-05 21:38:50 +00:00
Jim Ingham	f0c63b97d6	Fix the --source-quietly option to the driver so that it actually works. Clean up the help output a bit. llvm-svn: 200878	2014-02-05 21:35:09 +00:00
Benjamin Kramer	c24767b4ad	Clean up some particularly ugly casting. No functionality change. llvm-svn: 200877	2014-02-05 21:29:05 +00:00
Alexander Kornienko	4fa81df455	Changed OptionCategory variables to be static. llvm-svn: 200876	2014-02-05 21:28:03 +00:00
Greg Clayton	e4e462c42c	Fixed output to display correctly for "command source" by fixing the correct flags being set. Also emit the "Executing commands" message so it properly only comes out when desired and so it comes out in the right place. <rdar://problem/15992208> llvm-svn: 200875	2014-02-05 21:03:22 +00:00
Manman Ren	67a28136ad	PGO: instrumentation based profiling sets function attributes. We collect a maximal function count among all functions in the pgo data file. For functions that are hot, we set its InlineHint attribute. For functions that are cold, we set its Cold attribute. We currently treat functions with >= 30% of the maximal function count as hot and functions with <= 1% of the maximal function count are treated as cold. These two numbers are from preliminary tuning on SPEC. This commit should not affect non-PGO builds and should boost performance on instrumentation based PGO. llvm-svn: 200874	2014-02-05 20:40:15 +00:00
Sergey Matveev	efefe5e225	[sanitizer] Fix build. llvm-svn: 200873	2014-02-05 20:04:12 +00:00
Sergey Matveev	c5c84a1d86	[sanitizer] Implement ioctl decoding. When an unknown ioctl is encountered, try to guess the parameter size from the request id. llvm-svn: 200872	2014-02-05 19:35:24 +00:00
Ed Maste	f697a1ef94	Enable lldb-gdbserver for FreeBSD in the (g)make build llvm-svn: 200871	2014-02-05 19:28:47 +00:00
Ed Maste	fb29fa3e35	Enable lldb-gdbserver on Linux as well in the cmake build llvm-svn: 200870	2014-02-05 19:03:18 +00:00
Reid Kleckner	09b47d166b	MS ABI: Fix mangling of static methods and function references Function references always use $1? like function pointers and never $E? like var decl references. Static methods are mangled like function pointers. llvm-svn: 200869	2014-02-05 18:59:38 +00:00
Kaelyn Uhrain	21a6617c34	Don't consider records with a NULL identifier as a name for typo correction. Because in C++, "anonymous" doesn't mean "nameless" for records. In other words, RecordDecl::isAnonymousStructOrUnion only returns true if the record lacks a name and is not used as the type in an object's declaration. llvm-svn: 200868	2014-02-05 18:57:51 +00:00
Ed Maste	ba7cc706d9	Remove leftover debug printf llvm-svn: 200866	2014-02-05 18:49:10 +00:00

1 2 3 4 5 ...

166850 Commits All Branches Search

166850 Commits

All Branches