llvm-project

Commit Graph

Author	SHA1	Message	Date
Dmitri Gribenko	801e76900d	GettingStarted: improve formatting and document that configure checks for 'clang' to use it as the compiler. llvm-svn: 171630	2013-01-05 18:10:06 +00:00
Michael Gottesman	def07bba3e	Added debug message to ObjCARC when we transform objc_retainAutorelasedReturnValue => objc_retain since the operand to said function is not a return value. llvm-svn: 171629	2013-01-05 17:55:42 +00:00
Michael Gottesman	5c32ce9d3e	Added debug message for ObjCARC when we zap an objc_autoreleaseReturnValue/objc_retainAutoreleasedValue pair. llvm-svn: 171628	2013-01-05 17:55:35 +00:00
Chris Lattner	473988cf54	switch from pointer equality comparison to MDNode::getMostGenericTBAA when merging two TBAA tags, pointed out by Nuno. llvm-svn: 171627	2013-01-05 16:44:07 +00:00
Chandler Carruth	42e9611f15	Funnel the actual TargetTransformInfo pass from the SelectionDAGISel pass into the SelectionDAG itself rather than snooping on the implementation of that pass as exposed by the TargetMachine. This removes the last direct client of the ScalarTargetTransformInfo class outside of the TTI pass implementation. llvm-svn: 171625	2013-01-05 12:32:17 +00:00
Benjamin Kramer	d60321ba19	Attribute: Make hashes match when looking up AttributeImpls. This isn't optimal either but fixes a massive compile time regression from the attribute uniquing work. llvm-svn: 171624	2013-01-05 12:08:00 +00:00
Chandler Carruth	441c2ac98a	Fix another place where we build the TTI pass to the new interface. Sorry for the noise here, 'make check' doesn't build this code. =/ llvm-svn: 171623	2013-01-05 11:54:35 +00:00
Chandler Carruth	539edf4ee0	Convert the TargetTransformInfo from an immutable pass with dynamic interfaces which could be extracted from it, and must be provided on construction, to a chained analysis group. The end goal here is that TTI works much like AA -- there is a baseline "no-op" and target independent pass which is in the group, and each target can expose a target-specific pass in the group. These passes will naturally chain allowing each target-specific pass to delegate to the generic pass as needed. In particular, this will allow a much simpler interface for passes that would like to use TTI -- they can have a hard dependency on TTI and it will just be satisfied by the stub implementation when that is all that is available. This patch is a WIP however. In particular, the "stub" pass is actually the one and only pass, and everything there is implemented by delegating to the target-provided interfaces. As a consequence the tools still have to explicitly construct the pass. Switching targets to provide custom passes and sinking the stub behavior into the NoTTI pass is the next step. llvm-svn: 171621	2013-01-05 11:43:11 +00:00
Chandler Carruth	21b3c586ab	Switch the loop vectorizer from VTTI to just use TTI directly. llvm-svn: 171620	2013-01-05 10:16:02 +00:00
Chandler Carruth	cf569a8cdf	Switch the cost model analysis over to just the TTI interface. llvm-svn: 171619	2013-01-05 10:09:33 +00:00
Chandler Carruth	7c4f91dea5	Switch the BB vectorizer from the VTTI interface to the simple TTI interface. llvm-svn: 171618	2013-01-05 10:05:28 +00:00
Chandler Carruth	6db43e6ca3	Switch SimplifyCFG over to the TargetTransformInfo interface rather than the ScalarTargetTransformInfo interface. llvm-svn: 171617	2013-01-05 10:05:26 +00:00
Chandler Carruth	6fe147fb3a	Switch LoopIdiomRecognize to directly use the TargetTransformInfo interface rather than the ScalarTargetTransformInterface. llvm-svn: 171616	2013-01-05 10:00:09 +00:00
Chandler Carruth	8f37342b38	Replicate the APIs of ScalarTargetTransformInfo and VectorTargetTransformInfo into the TargetTransformInfo pass, implementing them be delegating back out to the two subobjects. This is the first step to folding the interfaces together and making TargetTransformInfo a normal analysis pass (specifically an analysis group which targets can provide target-specific analysis pass implementations of). No callers are migrated here, this just stubs out the interface. Next step will be to migrate all the callers to directly operate on TTI instead of STTI or VTTI respectively. That will allow replacing the machinery for delivering TTI without changing every caller at once. WIP, I promise all the duplicated interfaces will be removed in the end, this just decouples the steps of the process. llvm-svn: 171615	2013-01-05 09:56:20 +00:00
Chandler Carruth	5394e11a55	Switch the empty and tombstone key enumerators to not have explicit values -- that's not required to fix the bug that was cropping up, and the values selected made the enumeration's underlying type signed and introduced some warnings. This fixes the -Werror build. The underlying issue here was that the DenseMapInfo was casting values completely outside the range of the underlying storage of the enumeration to the enumeration's type. GCC went and "optimized" that into infloops and other misbehavior. By providing designated special values for these keys in the dense map, we ensure they are indeed representable and that they won't be used for anything else. It might be better to reuse None for the empty key and have the tombstone share the value of the sentinel enumerator, but honestly having 2 extra enumerators seemed not to matter and this seems a bit simpler. I'll let Bill shuffle this around (or ask me to shuffle it around) if he prefers it to look a different way. I also made the switch a bit more clear (and produce a better assert) that the enumerators are never going to show up and are errors if they do. llvm-svn: 171614	2013-01-05 08:47:26 +00:00
Chandler Carruth	e0900ec533	While the struct being defined in the AddressingMode.h header was unused, there were transitive includes needed. llvm-svn: 171613	2013-01-05 08:19:20 +00:00
Chandler Carruth	a56c3200aa	Remove unnecessary include. llvm-svn: 171612	2013-01-05 08:12:59 +00:00
NAKAMURA Takumi	c91006f741	IR/Attributes: Provide EmptyKey and TombstoneKey in part of enum, as workaround for gcc-4.4 take #2 . I will investigate, later, what was wrong. I am too tired for now. llvm-svn: 171611	2013-01-05 07:55:47 +00:00
David Blaikie	800a916f99	Emit DW_TAG_formal_parameter for unnamed parameters. This change essentially reverts r87069 which came without a test case. It causes no regressions in the GDB 7.5 test suite & fixes 25 xfails (commit to the test suite to follow). If anyone can present a test case that demonstrates why this check is necessary I'd be happy to account for it in one way or another. llvm-svn: 171609	2013-01-05 07:43:02 +00:00
Craig Topper	92a70b1e65	Recommit r171461 which was incorrectly reverted. Mark DIV/IDIV instructions hasSideEffects=1 because they can trap when dividing by 0. This is needed to keep early if conversion from moving them across basic blocks. llvm-svn: 171608	2013-01-05 07:39:25 +00:00
Nadav Rotem	478b6a47ec	Revert revision 171524. Original message: URL: http://llvm.org/viewvc/llvm-project?rev=171524&view=rev Log: The current Intel Atom microarchitecture has a feature whereby when a function returns early then it is slightly faster to execute a sequence of NOP instructions to wait until the return address is ready, as opposed to simply stalling on the ret instruction until the return address is ready. When compiling for X86 Atom only, this patch will run a pass, called "X86PadShortFunction" which will add NOP instructions where less than four cycles elapse between function entry and return. It includes tests. Patch by Andy Zhang. llvm-svn: 171603	2013-01-05 05:42:48 +00:00
NAKAMURA Takumi	6f3ef2d70e	Whitespace. llvm-svn: 171601	2013-01-05 05:16:53 +00:00
NAKAMURA Takumi	3b4d2c99ad	DenseMap: Appease -fstrict-aliasing on g++-4.4. With DenseMapInfo<Enum>, it is miscompiled on g++-4.4. static inline Enum getEmptyKey() { return Enum(<arbitrary int/unsigned value>); } isEauql(getEmptyKey(), ...) The compiler mis-assumes the return value is not aliased to Enum. llvm-svn: 171600	2013-01-05 05:14:23 +00:00
Jakob Stoklund Olesen	dc5285f102	Don't call destructors on MachineInstr and MachineOperand. The series of patches leading up to this one makes llc -O0 run 8% faster. When deallocating a MachineFunction, there is no need to visit all MachineInstr and MachineOperand objects to deallocate them. All their memory come from a BumpPtrAllocator that is about to be purged, and they have empty destructors anyway. This only applies when deallocating the MachineFunction. DeleteMachineInstr() should still be used to recycle MI memory during the codegen passes. Remove the LeakDetector support for MachineInstr. I've never seen it used before, and now it definitely doesn't work. With this patch, leaked MachineInstrs would be much less of a problem since all of their memory will be reclaimed by ~MachineFunction(). llvm-svn: 171599	2013-01-05 05:05:51 +00:00
Jakob Stoklund Olesen	1bfeecb491	Use ArrayRecycler for MachineInstr operand lists. Instead of an std::vector<MachineOperand>, use MachineOperand arrays from an ArrayRecycler living in MachineFunction. This has several advantages: - MachineInstr now has a trivial destructor, making it possible to delete them in batches when destroying MachineFunction. This will be enabled in a later patch. - Bypassing malloc() and free() can be faster, depending on the system library. - MachineInstr objects and their operands are allocated from the same BumpPtrAllocator, so they will usually be next to each other in memory, providing better locality of reference. - Reduce MachineInstr footprint. A std::vector is 24 bytes, the new operand array representation only uses 8+4+1 bytes in MachineInstr. - Better control over operand array reallocations. In the old representation, the use-def chains would be reordered whenever a std::vector reached its capacity. The new implementation never changes the use-def chain order. Note that some decisions in the code generator depend on the use-def chain orders, so this patch may cause different assembly to be produced in a few cases. llvm-svn: 171598	2013-01-05 05:00:09 +00:00
Jakob Stoklund Olesen	fe445cd646	Add MachineRegisterInfo::moveOperands(). This function works like memmove() for MachineOperands, except it also updates any use-def chains containing the moved operands. The use-def chains are updated without affecting the order of operands in the list. That isn't possible when using the removeRegOperandFromUseList() and addRegOperandToUseList() functions. Callers to follow soon. llvm-svn: 171597	2013-01-05 04:38:12 +00:00
Chandler Carruth	4a7c311008	Refactor the ScalarTargetTransformInfo API for querying about the legality of an address mode to not use a struct of four values and instead to accept them as parameters. I'd love to have named parameters here as most callers only care about one or two of these, but the defaults aren't terribly scary to write out. That said, there is no real impact of this as the passes aren't yet using STTI for this and are still relying upon TargetLowering. llvm-svn: 171595	2013-01-05 03:36:17 +00:00
Chandler Carruth	c892591596	Sink the AddressingModeMatcher helper class into an anonymous namespace next to its only user. This helper relies on TargetLowering information that shouldn't be generally used throughout the Transfoms library, and so it made little sense as a generic utility. This also consolidates the file where we need to remove the remaining uses of TargetLowering in favor of the IR-layer abstract interface in TargetTransformInfo. llvm-svn: 171590	2013-01-05 02:09:22 +00:00
Chandler Carruth	0dba59ae37	Rename the unittest from ArrayRecylerTest.cpp to ArrayRecyclerTest.cpp. Fixes the CMake build. It took me cutting and pasting this before I managed to see the missing character. =] llvm-svn: 171589	2013-01-05 02:08:43 +00:00
Akira Hatanaka	d35a263076	[mips] Fix data layout string. Add 64 to the list of native integer widths and add stack alignment information. llvm-svn: 171587	2013-01-05 02:00:56 +00:00
Bill Wendling	960f52a132	Add a method to create an AttributeSet from an AttrBuilder. The Attribute class is eventually going to represent one attribute. So we need this class to create the set of attributes. Add some iterator methods to the builder to access its internal bits in a nice way. llvm-svn: 171586	2013-01-05 01:36:54 +00:00
Nadav Rotem	f19d515316	Fix a typo. Remove the duplicated test. llvm-svn: 171584	2013-01-05 01:17:46 +00:00
Nadav Rotem	e9f5bfd5e9	iLoopVectorize: Non commutative operators can be used as reduction variables as long as the reduction chain is used in the LHS. PR14803. llvm-svn: 171583	2013-01-05 01:15:47 +00:00
Nadav Rotem	6d9dafe3ff	Force a fixed unroll count on the target independent tests. This should fix clang-native-arm-cortex-a9. Thanks Renato. llvm-svn: 171582	2013-01-05 00:58:48 +00:00
Jakob Stoklund Olesen	17a7d22d89	Add an ArrayRecycler class. This is similar to the existing Recycler allocator, but instead of recycling individual objects from a BumpPtrAllocator, arrays of different sizes can be allocated. llvm-svn: 171581	2013-01-05 00:57:11 +00:00
Chandler Carruth	b5429f43b8	Eric thought that Darwin was right to use -1 consistently rather than leaving this undefined, and despite the sentence in the standard that seems to require it, I'll cede the point and assume its a bug in the wording. Other parts of POSIX regularly allow for things to be -1 instead of undefined, this should too. Makes things more consistent too. This should have to real impact for folks though. llvm-svn: 171574	2013-01-05 00:42:50 +00:00
Chandler Carruth	4a77863bce	Fix a stray 'dnl' that my editor line-wrapped into this comment. Thanks to filcab on IRC for spotting. llvm-svn: 171573	2013-01-05 00:34:40 +00:00
Eric Christopher	770c550990	Make this an integer so we have enumeral types in the conditional expression. llvm-svn: 171571	2013-01-05 00:32:04 +00:00
Chandler Carruth	d121a7b0c0	Finally, fix the autoconf setup to allow for a missing clock_gettime; the source code should now be set up to handle this. llvm-svn: 171570	2013-01-05 00:29:06 +00:00
Chandler Carruth	e46cf6c509	Provide a default constructor for TimeValue. This was used, but only in if-ed out code paths and on Windows. Hopefully restores the Windows build. Thanks to Reid Kleckner for helping triage this. llvm-svn: 171568	2013-01-05 00:23:09 +00:00
Alex Rosenberg	0d6ecec69d	Fix warnings from llvm-gcc as seen on darwin10 (10.6). llvm-svn: 171567	2013-01-05 00:21:12 +00:00
Chandler Carruth	2aaec89fd0	Try to suppress the use of clock_gettime on Darwin which apparantly defines _POSIX_CPUTIME but doesn't support the clock_* functions. I don't test the value of _POSIX_CPUTIME because the spec merely says that if it is defined, the CPU-specific timers are available, whereas it says that _POSIX_TIMERS must be defined and defined to a value greater than zero. However, this may not work, as the POSIX spec clearly states: "If the symbolic constant _POSIX_CPUTIME is defined, then the symbolic constant _POSIX_TIMERS shall also be defined by the implementation to have the value 200112L." If this doesn't work, I'll add more hacks for Darwin. llvm-svn: 171565	2013-01-05 00:11:21 +00:00
Chandler Carruth	b79a7aa541	Fix an obvious typo spotted by Reid Kleckner, and breaking windows builds. llvm-svn: 171559	2013-01-04 23:46:04 +00:00
Bill Wendling	cd330348f5	Get rid of the 'Bits' mask in the attribute builder. The bit mask thing will be a thing of the past. It's not extensible enough. Get rid of its use here. Opt instead for using a vector to hold the attributes. Note: Some of this code will become obsolete once the rewrite is further along. llvm-svn: 171553	2013-01-04 23:27:34 +00:00
Chandler Carruth	ef7f968e09	Add time getters to the process interface for requesting the elapsed wall time, user time, and system time since a process started. For walltime, we currently use TimeValue's interface and a global initializer to compute a close approximation of total process runtime. For user time, this adds support for an somewhat more precise timing mechanism -- clock_gettime with the CLOCK_PROCESS_CPUTIME_ID clock selected. For system time, we have to do a full getrusage call to extract the system time from the OS. This is expensive but unavoidable. In passing, clean up the implementation of the old APIs and fix some latent bugs in the Windows code. This might have manifested on Windows ARM systems or other systems with strange 64-bit integer behavior. The old API for this both user time and system time simultaneously from a single getrusage call. While this results in fewer system calls, it also results in a lower precision user time and if only user time is desired, it introduces a higher overhead. It may be worthwhile to switch some of the pass timers to not track system time and directly track user and wall time. The old API also tracked walltime in a confusing way -- it just set it to the current walltime rather than providing any measure of wall time since the process started the way buth user and system time are tracked. The new API is more consistent here. The plan is to eventually implement these methods for a child process by using the wait3(2) system call to populate an rusage struct representing the whole subprocess execution. That way, after waiting on a child process its stats will become accurate and cheap to query. llvm-svn: 171551	2013-01-04 23:19:55 +00:00
Andrew Trick	18021a45aa	tabs-to-spaces llvm-svn: 171550	2013-01-04 23:11:35 +00:00
Jakub Staszak	43fafaf496	Move 'break' to the right place to prevent fallthru. There is no test-case because conditions in the next case prevented from doing anything nasty. llvm-svn: 171549	2013-01-04 23:01:26 +00:00
Jakob Stoklund Olesen	83d5d19aea	Special case Recycler::clear(BumpPtrAllocator). A BumpPtrAllocator has an empty Deallocate() method, but Recycler::clear() would still call it for every single object ever allocated, bringing all those objects into cache. As a bonus, iplist::remove() will also write to the Prev/Next pointers on all the objects, so all those cache lines have to be written back to RAM before the pages are given back to the OS. Stop wasting time and memory bandwith by using the new clearAndLeakUnsafely() function to jettison all the recycled objects. llvm-svn: 171541	2013-01-04 22:35:45 +00:00
Jakob Stoklund Olesen	4ccabc1da9	Add an iplist::clearAndLeakNodesUnsafely() function. The iplist::clear() function can be quite expensive because it traverses the entire list, calling deleteNode() and removeNodeFromList() on each element. If node destruction and deallocation can be handled some other way, clearAndLeakNodesUnsafely() can be used to jettison all nodes without bringing them into cache. The function name is meant to be ominous. llvm-svn: 171540	2013-01-04 22:35:42 +00:00
Jakob Stoklund Olesen	7f92b7ad0a	Move an assertion so it doesn't dereference end(). The R600 target has test cases that exercises this code. llvm-svn: 171538	2013-01-04 22:17:31 +00:00

1 2 3 4 5 ...

88046 Commits