llvm-project

Commit Graph

Author	SHA1	Message	Date
Sean Silva	93be7d59d2	Simplify TableGen type-compatibility checks. Patch by Elior Malul! llvm-svn: 171684	2013-01-07 02:30:19 +00:00
Chandler Carruth	664e354de7	Switch TargetTransformInfo from an immutable analysis pass that requires a TargetMachine to construct (and thus isn't always available), to an analysis group that supports layered implementations much like AliasAnalysis does. This is a pretty massive change, with a few parts that I was unable to easily separate (sorry), so I'll walk through it. The first step of this conversion was to make TargetTransformInfo an analysis group, and to sink the nonce implementations in ScalarTargetTransformInfo and VectorTargetTranformInfo into a NoTargetTransformInfo pass. This allows other passes to add a hard requirement on TTI, and assume they will always get at least on implementation. The TargetTransformInfo analysis group leverages the delegation chaining trick that AliasAnalysis uses, where the base class for the analysis group delegates to the previous analysis pass, allowing all but tho NoFoo analysis passes to only implement the parts of the interfaces they support. It also introduces a new trick where each pass in the group retains a pointer to the top-most pass that has been initialized. This allows passes to implement one API in terms of another API and benefit when some other pass above them in the stack has more precise results for the second API. The second step of this conversion is to create a pass that implements the TargetTransformInfo analysis using the target-independent abstractions in the code generator. This replaces the ScalarTargetTransformImpl and VectorTargetTransformImpl classes in lib/Target with a single pass in lib/CodeGen called BasicTargetTransformInfo. This class actually provides most of the TTI functionality, basing it upon the TargetLowering abstraction and other information in the target independent code generator. The third step of the conversion adds support to all TargetMachines to register custom analysis passes. This allows building those passes with access to TargetLowering or other target-specific classes, and it also allows each target to customize the set of analysis passes desired in the pass manager. The baseline LLVMTargetMachine implements this interface to add the BasicTTI pass to the pass manager, and all of the tools that want to support target-aware TTI passes call this routine on whatever target machine they end up with to add the appropriate passes. The fourth step of the conversion created target-specific TTI analysis passes for the X86 and ARM backends. These passes contain the custom logic that was previously in their extensions of the ScalarTargetTransformInfo and VectorTargetTransformInfo interfaces. I separated them into their own file, as now all of the interface bits are private and they just expose a function to create the pass itself. Then I extended these target machines to set up a custom set of analysis passes, first adding BasicTTI as a fallback, and then adding their customized TTI implementations. The fourth step required logic that was shared between the target independent layer and the specific targets to move to a different interface, as they no longer derive from each other. As a consequence, a helper functions were added to TargetLowering representing the common logic needed both in the target implementation and the codegen implementation of the TTI pass. While technically this is the only change that could have been committed separately, it would have been a nightmare to extract. The final step of the conversion was just to delete all the old boilerplate. This got rid of the ScalarTargetTransformInfo and VectorTargetTransformInfo classes, all of the support in all of the targets for producing instances of them, and all of the support in the tools for manually constructing a pass based around them. Now that TTI is a relatively normal analysis group, two things become straightforward. First, we can sink it into lib/Analysis which is a more natural layer for it to live. Second, clients of this interface can depend on it always being available which will simplify their code and behavior. These (and other) simplifications will follow in subsequent commits, this one is clearly big enough. Finally, I'm very aware that much of the comments and documentation needs to be updated. As soon as I had this working, and plausibly well commented, I wanted to get it committed and in front of the build bots. I'll be doing a few passes over documentation later if it sticks. Commits to update DragonEgg and Clang will be made presently. llvm-svn: 171681	2013-01-07 01:37:14 +00:00
Michael J. Spencer	c445408710	[Object][ELF] Fix incorrect size of members for the 64 version of Elf_Phdr_Impl. llvm-svn: 171650	2013-01-06 03:57:11 +00:00
Michael J. Spencer	9ed47c76af	[Object][ELF] Add program header iterator. llvm-svn: 171648	2013-01-06 03:56:27 +00:00
Michael J. Spencer	960b5d2b75	[Object][ELF] Refactor ELFRelocationIterator into ELFEntityIterator. No functionality change. llvm-svn: 171647	2013-01-06 03:56:14 +00:00
Chandler Carruth	42e9611f15	Funnel the actual TargetTransformInfo pass from the SelectionDAGISel pass into the SelectionDAG itself rather than snooping on the implementation of that pass as exposed by the TargetMachine. This removes the last direct client of the ScalarTargetTransformInfo class outside of the TTI pass implementation. llvm-svn: 171625	2013-01-05 12:32:17 +00:00
Chandler Carruth	539edf4ee0	Convert the TargetTransformInfo from an immutable pass with dynamic interfaces which could be extracted from it, and must be provided on construction, to a chained analysis group. The end goal here is that TTI works much like AA -- there is a baseline "no-op" and target independent pass which is in the group, and each target can expose a target-specific pass in the group. These passes will naturally chain allowing each target-specific pass to delegate to the generic pass as needed. In particular, this will allow a much simpler interface for passes that would like to use TTI -- they can have a hard dependency on TTI and it will just be satisfied by the stub implementation when that is all that is available. This patch is a WIP however. In particular, the "stub" pass is actually the one and only pass, and everything there is implemented by delegating to the target-provided interfaces. As a consequence the tools still have to explicitly construct the pass. Switching targets to provide custom passes and sinking the stub behavior into the NoTTI pass is the next step. llvm-svn: 171621	2013-01-05 11:43:11 +00:00
Chandler Carruth	8f37342b38	Replicate the APIs of ScalarTargetTransformInfo and VectorTargetTransformInfo into the TargetTransformInfo pass, implementing them be delegating back out to the two subobjects. This is the first step to folding the interfaces together and making TargetTransformInfo a normal analysis pass (specifically an analysis group which targets can provide target-specific analysis pass implementations of). No callers are migrated here, this just stubs out the interface. Next step will be to migrate all the callers to directly operate on TTI instead of STTI or VTTI respectively. That will allow replacing the machinery for delivering TTI without changing every caller at once. WIP, I promise all the duplicated interfaces will be removed in the end, this just decouples the steps of the process. llvm-svn: 171615	2013-01-05 09:56:20 +00:00
Chandler Carruth	5394e11a55	Switch the empty and tombstone key enumerators to not have explicit values -- that's not required to fix the bug that was cropping up, and the values selected made the enumeration's underlying type signed and introduced some warnings. This fixes the -Werror build. The underlying issue here was that the DenseMapInfo was casting values completely outside the range of the underlying storage of the enumeration to the enumeration's type. GCC went and "optimized" that into infloops and other misbehavior. By providing designated special values for these keys in the dense map, we ensure they are indeed representable and that they won't be used for anything else. It might be better to reuse None for the empty key and have the tombstone share the value of the sentinel enumerator, but honestly having 2 extra enumerators seemed not to matter and this seems a bit simpler. I'll let Bill shuffle this around (or ask me to shuffle it around) if he prefers it to look a different way. I also made the switch a bit more clear (and produce a better assert) that the enumerators are never going to show up and are errors if they do. llvm-svn: 171614	2013-01-05 08:47:26 +00:00
Chandler Carruth	e0900ec533	While the struct being defined in the AddressingMode.h header was unused, there were transitive includes needed. llvm-svn: 171613	2013-01-05 08:19:20 +00:00
Chandler Carruth	a56c3200aa	Remove unnecessary include. llvm-svn: 171612	2013-01-05 08:12:59 +00:00
NAKAMURA Takumi	c91006f741	IR/Attributes: Provide EmptyKey and TombstoneKey in part of enum, as workaround for gcc-4.4 take #2 . I will investigate, later, what was wrong. I am too tired for now. llvm-svn: 171611	2013-01-05 07:55:47 +00:00
NAKAMURA Takumi	6f3ef2d70e	Whitespace. llvm-svn: 171601	2013-01-05 05:16:53 +00:00
NAKAMURA Takumi	3b4d2c99ad	DenseMap: Appease -fstrict-aliasing on g++-4.4. With DenseMapInfo<Enum>, it is miscompiled on g++-4.4. static inline Enum getEmptyKey() { return Enum(<arbitrary int/unsigned value>); } isEauql(getEmptyKey(), ...) The compiler mis-assumes the return value is not aliased to Enum. llvm-svn: 171600	2013-01-05 05:14:23 +00:00
Jakob Stoklund Olesen	dc5285f102	Don't call destructors on MachineInstr and MachineOperand. The series of patches leading up to this one makes llc -O0 run 8% faster. When deallocating a MachineFunction, there is no need to visit all MachineInstr and MachineOperand objects to deallocate them. All their memory come from a BumpPtrAllocator that is about to be purged, and they have empty destructors anyway. This only applies when deallocating the MachineFunction. DeleteMachineInstr() should still be used to recycle MI memory during the codegen passes. Remove the LeakDetector support for MachineInstr. I've never seen it used before, and now it definitely doesn't work. With this patch, leaked MachineInstrs would be much less of a problem since all of their memory will be reclaimed by ~MachineFunction(). llvm-svn: 171599	2013-01-05 05:05:51 +00:00
Jakob Stoklund Olesen	1bfeecb491	Use ArrayRecycler for MachineInstr operand lists. Instead of an std::vector<MachineOperand>, use MachineOperand arrays from an ArrayRecycler living in MachineFunction. This has several advantages: - MachineInstr now has a trivial destructor, making it possible to delete them in batches when destroying MachineFunction. This will be enabled in a later patch. - Bypassing malloc() and free() can be faster, depending on the system library. - MachineInstr objects and their operands are allocated from the same BumpPtrAllocator, so they will usually be next to each other in memory, providing better locality of reference. - Reduce MachineInstr footprint. A std::vector is 24 bytes, the new operand array representation only uses 8+4+1 bytes in MachineInstr. - Better control over operand array reallocations. In the old representation, the use-def chains would be reordered whenever a std::vector reached its capacity. The new implementation never changes the use-def chain order. Note that some decisions in the code generator depend on the use-def chain orders, so this patch may cause different assembly to be produced in a few cases. llvm-svn: 171598	2013-01-05 05:00:09 +00:00
Jakob Stoklund Olesen	fe445cd646	Add MachineRegisterInfo::moveOperands(). This function works like memmove() for MachineOperands, except it also updates any use-def chains containing the moved operands. The use-def chains are updated without affecting the order of operands in the list. That isn't possible when using the removeRegOperandFromUseList() and addRegOperandToUseList() functions. Callers to follow soon. llvm-svn: 171597	2013-01-05 04:38:12 +00:00
Chandler Carruth	4a7c311008	Refactor the ScalarTargetTransformInfo API for querying about the legality of an address mode to not use a struct of four values and instead to accept them as parameters. I'd love to have named parameters here as most callers only care about one or two of these, but the defaults aren't terribly scary to write out. That said, there is no real impact of this as the passes aren't yet using STTI for this and are still relying upon TargetLowering. llvm-svn: 171595	2013-01-05 03:36:17 +00:00
Chandler Carruth	c892591596	Sink the AddressingModeMatcher helper class into an anonymous namespace next to its only user. This helper relies on TargetLowering information that shouldn't be generally used throughout the Transfoms library, and so it made little sense as a generic utility. This also consolidates the file where we need to remove the remaining uses of TargetLowering in favor of the IR-layer abstract interface in TargetTransformInfo. llvm-svn: 171590	2013-01-05 02:09:22 +00:00
Bill Wendling	960f52a132	Add a method to create an AttributeSet from an AttrBuilder. The Attribute class is eventually going to represent one attribute. So we need this class to create the set of attributes. Add some iterator methods to the builder to access its internal bits in a nice way. llvm-svn: 171586	2013-01-05 01:36:54 +00:00
Jakob Stoklund Olesen	17a7d22d89	Add an ArrayRecycler class. This is similar to the existing Recycler allocator, but instead of recycling individual objects from a BumpPtrAllocator, arrays of different sizes can be allocated. llvm-svn: 171581	2013-01-05 00:57:11 +00:00
Eric Christopher	770c550990	Make this an integer so we have enumeral types in the conditional expression. llvm-svn: 171571	2013-01-05 00:32:04 +00:00
Chandler Carruth	e46cf6c509	Provide a default constructor for TimeValue. This was used, but only in if-ed out code paths and on Windows. Hopefully restores the Windows build. Thanks to Reid Kleckner for helping triage this. llvm-svn: 171568	2013-01-05 00:23:09 +00:00
Alex Rosenberg	0d6ecec69d	Fix warnings from llvm-gcc as seen on darwin10 (10.6). llvm-svn: 171567	2013-01-05 00:21:12 +00:00
Bill Wendling	cd330348f5	Get rid of the 'Bits' mask in the attribute builder. The bit mask thing will be a thing of the past. It's not extensible enough. Get rid of its use here. Opt instead for using a vector to hold the attributes. Note: Some of this code will become obsolete once the rewrite is further along. llvm-svn: 171553	2013-01-04 23:27:34 +00:00
Chandler Carruth	ef7f968e09	Add time getters to the process interface for requesting the elapsed wall time, user time, and system time since a process started. For walltime, we currently use TimeValue's interface and a global initializer to compute a close approximation of total process runtime. For user time, this adds support for an somewhat more precise timing mechanism -- clock_gettime with the CLOCK_PROCESS_CPUTIME_ID clock selected. For system time, we have to do a full getrusage call to extract the system time from the OS. This is expensive but unavoidable. In passing, clean up the implementation of the old APIs and fix some latent bugs in the Windows code. This might have manifested on Windows ARM systems or other systems with strange 64-bit integer behavior. The old API for this both user time and system time simultaneously from a single getrusage call. While this results in fewer system calls, it also results in a lower precision user time and if only user time is desired, it introduces a higher overhead. It may be worthwhile to switch some of the pass timers to not track system time and directly track user and wall time. The old API also tracked walltime in a confusing way -- it just set it to the current walltime rather than providing any measure of wall time since the process started the way buth user and system time are tracked. The new API is more consistent here. The plan is to eventually implement these methods for a child process by using the wait3(2) system call to populate an rusage struct representing the whole subprocess execution. That way, after waiting on a child process its stats will become accurate and cheap to query. llvm-svn: 171551	2013-01-04 23:19:55 +00:00
Jakob Stoklund Olesen	83d5d19aea	Special case Recycler::clear(BumpPtrAllocator). A BumpPtrAllocator has an empty Deallocate() method, but Recycler::clear() would still call it for every single object ever allocated, bringing all those objects into cache. As a bonus, iplist::remove() will also write to the Prev/Next pointers on all the objects, so all those cache lines have to be written back to RAM before the pages are given back to the OS. Stop wasting time and memory bandwith by using the new clearAndLeakUnsafely() function to jettison all the recycled objects. llvm-svn: 171541	2013-01-04 22:35:45 +00:00
Jakob Stoklund Olesen	4ccabc1da9	Add an iplist::clearAndLeakNodesUnsafely() function. The iplist::clear() function can be quite expensive because it traverses the entire list, calling deleteNode() and removeNodeFromList() on each element. If node destruction and deallocation can be handled some other way, clearAndLeakNodesUnsafely() can be used to jettison all nodes without bringing them into cache. The function name is meant to be ominous. llvm-svn: 171540	2013-01-04 22:35:42 +00:00
Bill Wendling	9ac69f9d37	General cleanups. * Remove dead methods. * Use the 'operator==' method instead of 'contains', which isn't needed. * Fix some comments. No functionality change. llvm-svn: 171523	2013-01-04 20:54:35 +00:00
Michael J. Spencer	bae14cef80	[Object][ELF] Add a maximum alignment. This is used by createELFObjectFile to create a properly aligned reader. llvm-svn: 171520	2013-01-04 20:36:28 +00:00
Nick Kledzik	11964f2a8f	Fix how YAML I/O detects flow sequences. Update test case to verify flow sequence is written as a flow sequence. llvm-svn: 171514	2013-01-04 19:32:00 +00:00
Adhemerval Zanella	9b0b781395	PowerPC: Fix eh_frame relocation for PIC This patch fixes the PPC eh_frame definitions for the personality and frame unwinding for PIC objects. It makes PIC build correctly creates relative relocations in the '.rela.eh_frame' segments and thus avoiding a text relocation that generates a DT_TEXTREL segments in link phase. llvm-svn: 171506	2013-01-04 19:08:13 +00:00
Eric Christopher	c0fa867c7b	Add section information for the DWARF5 split debug proposal string offset section. llvm-svn: 171474	2013-01-04 17:59:22 +00:00
Eric Christopher	aa082063cb	Make comment a bit more clear. llvm-svn: 171473	2013-01-04 17:59:17 +00:00
Nadav Rotem	e1d5c4b8b9	LoopVectorizer: 1. Add code to estimate register pressure. 2. Add code to select the unroll factor based on register pressure. 3. Add bits to TargetTransformInfo to provide the number of registers. llvm-svn: 171469	2013-01-04 17:48:25 +00:00
Hal Finkel	c8c12bc0ff	Better comment on VTTI::getShuffleCost llvm-svn: 171459	2013-01-03 05:02:41 +00:00
NAKAMURA Takumi	600361b3c1	Compiler.h: Leave LLVM_BUILTIN_UNREACHABLE undefined if it is unavailable in host compiler. Users of LLVM_BUILTIN_UNREACHABLE should be responsible in the case when LLVM_BUILTIN_UNREACHABLE is undefined. Actually, (0, (p)) in LLVM_ASSUME_ALIGNED(p, a) caused thousands of warnings on g++-4.4. It was a motivation in this commit. llvm-svn: 171455	2013-01-03 03:30:22 +00:00
Hal Finkel	95de3f3018	Add a subtype parameter to VTTI::getShuffleCost In order to cost subvector insertion and extraction, we need to know the type of the subvector being extracted. No functionality change. llvm-svn: 171453	2013-01-03 02:34:09 +00:00
Bill Wendling	04949fa998	Try again to revert the bad patch. The tree was reverted for some unknown reason before the last time. --- Reverse-merging r171442 into '.': U include/llvm/IR/Attributes.h U lib/IR/Attributes.cpp U lib/IR/AttributeImpl.h llvm-svn: 171448	2013-01-03 01:54:39 +00:00
Hal Finkel	b168708ad8	Add a default Index for VTTI::getShuffleCost When Kind == (Broadcast or Reverse) then Index is not used; make it an optional parameter. No functionality change. llvm-svn: 171447	2013-01-03 01:50:51 +00:00
Bill Wendling	40785ae18f	Revert patch. Something snuck in there that shouldn't be. --- Reverse-merging r171441 into '.': U include/llvm/IR/Attributes.h U lib/IR/Attributes.cpp llvm-svn: 171444	2013-01-03 01:46:27 +00:00
Bill Wendling	af9a90cc00	Remove the 'contains' methods in favor of the 'operator==' method. The 'operator==' method is a bit clearer and much less verbose for somethings that should have only one value. Remove from the AttrBuilder for consistency. llvm-svn: 171442	2013-01-03 01:43:05 +00:00
NAKAMURA Takumi	cedab7ecf3	Revert r171427, "An intermediate step in the Attributes rewrite." llvm-svn: 171441	2013-01-03 01:42:06 +00:00
Bill Wendling	91055cfbaf	An intermediate step in the Attributes rewrite. Modify the AttrBuilder class to store the attributes as a set instead of as a bit mask. The Attribute class will represent only one attribute instead of a collection of attributes. This is the wave of the future! llvm-svn: 171427	2013-01-02 23:45:09 +00:00
Argyrios Kyrtzidis	1b52f1e364	Use a bool instead of a bitfield in llvm/ADT/Optional. Fixes Valgrind failures and removes bitwise operations that don't provide any benefit. Valgrind failures reported by NAKAMURA Takumi. llvm-svn: 171413	2013-01-02 21:19:08 +00:00
Michael J. Spencer	a4295f2812	Restrict __builtin_assume_aligned to gcc 4.7+ llvm-svn: 171408	2013-01-02 20:23:49 +00:00
Michael J. Spencer	30589ce60e	[Support][Endian] Add support for specifying the alignment and native unaligned types. * Add support for specifying the alignment to use. * Add the concept of native endianness. Used for unaligned native types. The native alignment and read/write simplification is based on a patch by Richard Smith. llvm-svn: 171406	2013-01-02 20:14:11 +00:00
Argyrios Kyrtzidis	0b46809d1e	Don't #include stuff outside the include guards. This defeats the include-guard optimization when parsing. llvm-svn: 171405	2013-01-02 19:42:53 +00:00
Shuxin Yang	98c844fd89	- Add comment to two functions which might be considered as dead code. - Fix a typo llvm-svn: 171399	2013-01-02 18:26:31 +00:00
Chandler Carruth	db25c6cf8e	Actually update the CMake and Makefile builds correctly, and update the code that includes Intrinsics.gen directly. This never showed up in my testing because the old Intrinsics.gen was still kicking around in the make build system and was correct there. =[ Thankfully, some of the bots to clean rebuilds and that caught this. llvm-svn: 171373	2013-01-02 12:09:16 +00:00
Chandler Carruth	9fb823bbd4	Move all of the header files which are involved in modelling the LLVM IR into their new header subdirectory: include/llvm/IR. This matches the directory structure of lib, and begins to correct a long standing point of file layout clutter in LLVM. There are still more header files to move here, but I wanted to handle them in separate commits to make tracking what files make sense at each layer easier. The only really questionable files here are the target intrinsic tablegen files. But that's a battle I'd rather not fight today. I've updated both CMake and Makefile build systems (I think, and my tests think, but I may have missed something). I've also re-sorted the includes throughout the project. I'll be committing updates to Clang, DragonEgg, and Polly momentarily. llvm-svn: 171366	2013-01-02 11:36:10 +00:00
Chandler Carruth	be81023d74	Resort the #include lines in include/... and lib/... with the utils/sort_includes.py script. Most of these are updating the new R600 target and fixing up a few regressions that have creeped in since the last time I sorted the includes. llvm-svn: 171362	2013-01-02 10:22:59 +00:00
Benjamin Kramer	614b5e85b9	Add IRBuilder::CreateVectorSplat and use it to simplify code. llvm-svn: 171349	2013-01-01 19:55:16 +00:00
Chandler Carruth	2da2cab63a	Make it explicit that the only entry points to the Program object are through the static helper functions. This is already true throughout the codebase. Slowly, I'm going to re-implement these static helpers in terms of a new process based interface which can expose more information, and remove the program object entirely. llvm-svn: 171335	2012-12-31 23:44:49 +00:00
Chandler Carruth	76fbeef95a	Remove an unused method on Program. I'm simplifying this interface as much as I can before merging it with the new process interface. llvm-svn: 171334	2012-12-31 23:44:47 +00:00
Chandler Carruth	db8842f9f3	Remove an unused method on the Program class. llvm-svn: 171332	2012-12-31 23:38:28 +00:00
Chandler Carruth	acd64becb1	Go ahead and get rid of the old page size interface and convert all the users over to the new one. No sense maintaining this "compatibility" layer it seems. llvm-svn: 171331	2012-12-31 23:31:56 +00:00
Chandler Carruth	15dcad9e36	Flesh out a page size accessor in the new API. Implement the old API in terms of the new one. This simplifies the implementation on Windows which can now re-use the self_process's once initialization. llvm-svn: 171330	2012-12-31 23:23:35 +00:00
Chandler Carruth	3fa2010c7d	Remove the declspecs from small alignments that we can force with a union. These don't actually work for by-value function arguments, and MSVC warns if they exist even while (we hope) it aligns the argument correctly due to the other union member. This means MSVC will miss out on optimizations based on the alignment of the buffer, but really, there aren't that many for x86 and MSVC is likely not doing a great job of optimizing LLVM and Clang anyways. llvm-svn: 171328	2012-12-31 22:18:01 +00:00
Chandler Carruth	b12634bf80	Remove an unused function in the old Process interface. llvm-svn: 171327	2012-12-31 22:17:59 +00:00
Nuno Lopes	d896a400f1	recommit r171298 (add support for PHI nodes to ObjectSizeOffsetVisitor). Hopefully with bugs corrected now. llvm-svn: 171325	2012-12-31 20:45:10 +00:00
Michael J. Spencer	da3e31a419	[AlignOf] Add AlignedCharArray and refactor AlignedCharArrayUnion. This adds AlignedCharArray<Alignment, Size>. A templated struct that contains a member named buffer of type char[Size] that is aligned to Alignment. llvm-svn: 171319	2012-12-31 19:54:45 +00:00
Rafael Espindola	c8288c103d	Fix bits check in ELFObjectFile::isSectionZeroInit(). Fixes PR14723. Patch by Sami Liedes! llvm-svn: 171309	2012-12-31 18:20:51 +00:00
Nuno Lopes	e9d6dbf7a2	add support for GlobalAlias to ObjectSizeOffsetVisitor llvm-svn: 171303	2012-12-31 16:23:48 +00:00
Bill Wendling	f50ea7109d	Remove dead method. llvm-svn: 171295	2012-12-31 11:52:55 +00:00
Bill Wendling	e10f76c640	Add some comparison operators to compare the Attribute object with the AttrKind value. llvm-svn: 171294	2012-12-31 11:51:54 +00:00
Chandler Carruth	97683aa2fa	Begin sketching out the process interface. The coding style used here is not LLVM's style because this is modeled after a Boost interface and thus done in the style of a candidate C++ standard library interface. I'll probably end up proposing it as a standard C++ library if it proves to be reasonably portable and useful. This is just the most basic parts of the interface -- getting the process ID out of it. However, it helps sketch out some of the boiler plate such as the base class, derived class, shared code, and static factory function. It also introduces a unittest so that I can incrementally ensure this stuff works. However, I've not even compiled this code for Windows yet. I'll try to fix any Windows fallout from the bots, and if I can't fix it I'll revert and get someone on Windows to help out. There isn't a lot more that is mandatory, so soon I'll switch to just stubbing out the Windows side and get Michael Spencer to help with implementation as he can test it directly. llvm-svn: 171289	2012-12-31 11:17:50 +00:00
Chandler Carruth	c457907708	Start sketching out a roadmap for better subprocess management in the LLVM libraries. Also, clean up the doxygen and formatting of the existing interfaces. With this change I'm calling the existing interface "legacy" because I'd like to replace it with something much better. My end goal is to expose a common set of interfaces for inspecting various properties of a process, and implementations to expose those both for the current process and for child processes. This will also expose more rich interfaces for spawning and controling a subprocess, notably to use system calls like wait3 and wait4 where available and gather detailed resource usage stats about the subprocess. My plan (discussed with Michael Spencer on IRC) is to base this loosely around the proposed Boost.Process interface, but to implement a relatively small subset of that functionality based around the needs of LLVM, Clang, the Clang driver, etc. llvm-svn: 171285	2012-12-31 09:29:16 +00:00
Bill Wendling	6e95ae803a	Remove the getAttributesAtIndex and getNumAttrs methods in favor of using the getAttrSomewhere predicate. This prevents the uses of 'Attribute' as a collection of attributes. llvm-svn: 171271	2012-12-31 00:49:59 +00:00
Bill Wendling	749a43d874	Use the predicate methods off of AttributeSet instead of Attribute. llvm-svn: 171257	2012-12-30 13:50:49 +00:00
Bill Wendling	74dba875e2	Remove the Function::getRetAttributes method in favor of using the AttributeSet accessor method. llvm-svn: 171256	2012-12-30 13:01:51 +00:00
Bill Wendling	94dcaf8e2b	Remove Function::getParamAttributes and use the AttributeSet accessor methods instead. llvm-svn: 171255	2012-12-30 12:45:13 +00:00
Bill Wendling	698e84fc4f	Remove the Function::getFnAttributes method in favor of using the AttributeSet directly. This is in preparation for removing the use of the 'Attribute' class as a collection of attributes. That will shift to the AttributeSet class instead. llvm-svn: 171253	2012-12-30 10:32:01 +00:00
Bill Wendling	6190254e0f	s/hasAttribute/contains/g to be more consistent with other method names. llvm-svn: 171252	2012-12-30 09:17:46 +00:00
Bill Wendling	3e4c4c9607	s/Raw/getBitMask/g to be more in line with current naming conventions. This method won't be sticking around. llvm-svn: 171244	2012-12-30 01:05:42 +00:00
Chandler Carruth	f6182155f6	Teach instsimplify to use the constant folder where appropriate for constant folding calls. Add the initial tests for this which show that now instsimplify can simplify blindingly obvious code patterns expressed with both intrinsics and library calls. llvm-svn: 171194	2012-12-28 14:23:29 +00:00
Chandler Carruth	9dc3558920	Add entry points to instsimplify for simplifying calls. The entry points are nice and decomposed so that we can simplify synthesized calls as easily as actually call instructions. The internal utility still has the same behavior, it just now operates on a more generic interface so that I can extend the set of call simplifications that instsimplify knows about. llvm-svn: 171189	2012-12-28 11:30:55 +00:00
Alexey Samsonov	3efc87e92d	Add proper support for -fsanitize-blacklist= flag for TSan and MSan. LLVM part. llvm-svn: 171183	2012-12-28 09:30:44 +00:00
Chandler Carruth	3edd52c1d0	Add support to BasicBlocks for iterating backwards over the instructions. This just exposes the already present reverse iterators of the instruction ilist. llvm-svn: 171159	2012-12-27 12:00:56 +00:00
Chandler Carruth	a3c0d67d5b	Provide a common half-open interval map info implementation, and just re-use that for SlotIndexes. This way other users who want half-open semantics can share the implementation. llvm-svn: 171158	2012-12-27 11:29:17 +00:00
Nadav Rotem	9aa00f0363	DAGCombinerInformation: add a getter that exposes the dagcombine level. llvm-svn: 171152	2012-12-27 08:44:35 +00:00
Alexey Samsonov	75ceb5b56b	Fix new[]/delete mismatch in FullDependence spotted by AddressSanitizer llvm-svn: 171150	2012-12-27 08:40:37 +00:00
Nadav Rotem	b1dd52450e	Refactor DAGCombinerInfo. Change the different booleans that indicate if we are before or after different runs of DAGCo, with the CombineLevel enum. Also, added a new API for checking if we are running before or after the LegalizeVectorOps phase. llvm-svn: 171142	2012-12-27 06:47:41 +00:00
Nadav Rotem	b3f6751df5	whitespace llvm-svn: 171129	2012-12-27 02:04:12 +00:00
Eric Christopher	5a6acfa4c8	Right now all of the relocations are 32-bit dwarf, and the relocation information doesn't return an addend for Rel relocations. Go ahead and use this information to fix relocation handling inside dwarfdump for 32-bit ELF REL. llvm-svn: 171126	2012-12-27 01:07:07 +00:00
Nadav Rotem	0e1d662d56	white space llvm-svn: 171090	2012-12-26 04:58:12 +00:00
Hal Finkel	2ebe6d08cd	Loosen scheduling restrictions on the PPC dcbt intrinsic As with the prefetch intrinsic to which it maps, simply have dcbt marked as reading from and writing to its arguments instead of having unmodeled side effects. While this might cause unwanted code motion (because aliasing checks don't really capture cache-line sharing), it is more important that prefetches in unrolled loops don't block the scheduler from rearranging the unrolled loop body. llvm-svn: 171073	2012-12-25 18:51:18 +00:00
Bob Wilson	fe73ac34c5	Rename LLVMContext diagnostic handler types and functions. These are now generally used for all diagnostics from the backend, not just for inline assembly, so this drops the "InlineAsm" from the names. No functional change. (I've left aliases for the old names but only for long enough to let me switch over clang to use the new ones.) llvm-svn: 171047	2012-12-25 00:07:12 +00:00
Bob Wilson	4ed23578da	Add LLVMContext::emitWarning methods and use them. <rdar://problem/12867368> When the backend is used from clang, it should produce proper diagnostics instead of just printing messages to errs(). Other clients may also want to register their own error handlers with the LLVMContext, and the same handler should work for warnings in the same way as the existing emitError methods. llvm-svn: 171041	2012-12-24 18:15:21 +00:00
Nadav Rotem	3ee6b10dd4	CostModel: We have API for checking the costs of known shuffles. This patch adds support for the insert-subvector and extract-subvector kinds. llvm-svn: 171027	2012-12-24 10:04:03 +00:00
Elena Demikhovsky	517afbff01	Added 6 more value types: v32i1, v64i1, v32i16, v32i8, v64i8, v8f64 llvm-svn: 171026	2012-12-24 10:03:57 +00:00
Nadav Rotem	7e1599e100	Change the codegen Cost Model API for shuffeles. This patch removes the API for broadcast and adds a more general API that accepts an enum of known shuffles. llvm-svn: 171022	2012-12-24 08:57:47 +00:00
NAKAMURA Takumi	fec2ea1b3d	llvm/MC/MCMachObjectWriter.h: ComputeSymbolTable(): Prune one description in the comment. [-Wdocumentation] /// \param StringIndexMap [out] - Map from symbol names to offsets in the string table. llvm-svn: 171010	2012-12-24 01:24:04 +00:00
Nadav Rotem	cf9999d9d5	CostModel: Change the default target-independent implementation for finding the cost of arithmetic functions. We now assume that the cost of arithmetic operations that are marked as Legal or Promote is low, but ops that are marked as custom are higher. llvm-svn: 171002	2012-12-23 17:31:23 +00:00
Nadav Rotem	2cade68025	Loop Vectorizer: Update the cost model of scatter/gather operations and make them more expensive. llvm-svn: 170995	2012-12-23 07:23:55 +00:00
Craig Topper	fc5ee3516c	Add a comma to fix the build. llvm-svn: 170982	2012-12-22 08:22:01 +00:00
Craig Topper	c9dcbe6987	Use a negative value to represent INVALID_SIMPLE_VALUE_TYPE instead of 256. Its much cheaper for the isSimple() checks to look for values less than 0 rather than a value greater than 255. This shaves ~8k off the size of the llc binary on x86-64. llvm-svn: 170981	2012-12-22 08:16:17 +00:00
Craig Topper	8289b327ac	Add vAny and Metadata to the switch in getSizeInBits for consistency since every other enum was listed. llvm-svn: 170977	2012-12-22 03:08:37 +00:00
Bill Wendling	c79e42c5ce	Change 'AttrVal' to 'AttrKind' to better reflect that it's a kind of attribute instead of the value of the attribute. llvm-svn: 170972	2012-12-22 00:37:52 +00:00
Richard Smith	2450f1c2c5	Fix some undefined behavior when parsing YAML input: don't try to compare an uninitialized value against a default value. Found by -fsanitize=enum. llvm-svn: 170970	2012-12-22 00:31:54 +00:00
Jakob Stoklund Olesen	0edb164723	Add a missing assertion, the null register has no register units. llvm-svn: 170916	2012-12-21 18:38:09 +00:00
Evgeniy Stepanov	4fbc0d08bf	[msan] Remove unreachable blocks before instrumenting a function. llvm-svn: 170883	2012-12-21 11:18:49 +00:00
Rafael Espindola	a9f810b6b5	Add a function to get the segment name of a section. On MachO, sections also have segment names. When a tool looking at a .o file prints a segment name, this is what they mean. In reality, a .o has only one anonymous, segment. This patch adds a MachO only function to fetch that segment name. I named it getSectionFinalSegmentName since the main use for the name seems to be inform the linker with segment this section should go to. The patch also changes MachOObjectFile::getSectionName to return just the section name instead of computing SegmentName,SectionName. The main difference from the previous patch is that it doesn't use InMemoryStruct. It is extremely dangerous: if the endians match it returns a pointer to the file buffer, if not, it returns a pointer to an internal buffer that is overwritten in the next API call. We should change all of this code to use support::detail::packed_endian_specific_integral like ELF, but since these functions only handle strings, they work with big and little endian machines as is. I have tested this by installing ubuntu 12.10 ppc on qemu, that is why it took so long :-) llvm-svn: 170838	2012-12-21 03:47:03 +00:00
Evan Cheng	59421aee3d	Add targets to skip running the GC passes. llvm-svn: 170836	2012-12-21 02:57:04 +00:00
Jakob Stoklund Olesen	2455b58551	Require the two-argument MI::addOperand(MF, MO) for dangling instructions. Instructions that are inserted in a basic block can still be decorated with addOperand(MO). Make the two-argument addOperand() function contain the actual implementation. This function will now always have a valid MF reference that it can use for memory allocation. llvm-svn: 170798	2012-12-20 22:54:05 +00:00
Jakob Stoklund Olesen	33f5d1492d	Add an MF argument to MI::copyImplicitOps(). This function is often used to decorate dangling instructions, so a context reference is required to allocate memory for the operands. Also add a corresponding MachineInstrBuilder method. llvm-svn: 170797	2012-12-20 22:54:02 +00:00
Jakob Stoklund Olesen	ac4210eacb	Use two-arg addOperand(MF, MO) internally in MachineInstr when possible. llvm-svn: 170796	2012-12-20 22:53:58 +00:00
Bill Wendling	66e978f904	Some random comment, naming, and format changes. Rename the AttributeImpl* from Attrs to pImpl to be consistent with other code. Add comments where none were before. Or doxygen-ify other comments. llvm-svn: 170767	2012-12-20 21:28:43 +00:00
Jakob Stoklund Olesen	00b28ecfae	Remove two dead functions. llvm-svn: 170766	2012-12-20 21:12:42 +00:00
Bob Wilson	7bba4f8957	Revert "Adding support for llvm.arm.neon.vaddl[su].* and" This reverts r170694. The operations can be represented in IR without adding any new intrinsics. llvm-svn: 170765	2012-12-20 21:09:38 +00:00
Eli Bendersky	f483ff9204	Aligned bundling support. Following the discussion here: http://lists.cs.uiuc.edu/pipermail/llvmdev/2012-December/056754.html The proposal and implementation are fully documented here: https://sites.google.com/a/chromium.org/dev/nativeclient/pnacl/aligned-bundling-support-in-llvm Tests will follow shortly. llvm-svn: 170718	2012-12-20 19:05:53 +00:00
Jim Grosbach	759292c93f	Fix inadvertant delete of 'has'. llvm-svn: 170713	2012-12-20 18:09:48 +00:00
Jakob Stoklund Olesen	b109a7b430	Use MachineInstrBuilder in InstrEmitter. This is supposed to be a mechanical change with no functional effects. InstrEmitter can generate all types of MachineOperands which revealed that MachineInstrBuilder was missing a few methods, added by this patch. Besides providing a context pointer to MI::addOperand(), MachineInstrBuilder seems like a better fit for this code. llvm-svn: 170712	2012-12-20 18:08:09 +00:00
James Molloy	4f6fb953a7	Add a new attribute, 'noduplicate'. If a function contains a noduplicate call, the call cannot be duplicated - Jump threading, loop unrolling, loop unswitching, and loop rotation are inhibited if they would duplicate the call. Similarly inlining of the function is inhibited, if that would duplicate the call (in particular inlining is still allowed when there is only one callsite and the function has internal linkage). llvm-svn: 170704	2012-12-20 16:04:27 +00:00
Roman Divacky	ff95a1dc12	Remove MCTargetAsmLexer and its derived classes now that edis, its only user, is gone. llvm-svn: 170699	2012-12-20 14:43:30 +00:00
Renato Golin	6b2ea4a48f	Adding support for llvm.arm.neon.vaddl[su].* and llvm.arm.neon.vsub[su].* intrinsics. Patch by Pete Couperus <pjcoup@gmail.com> llvm-svn: 170694	2012-12-20 13:52:11 +00:00
Richard Smith	e7701ebfec	Don't use -1 as a value of an unsigned 7-bit enumeration; that has undefined behavior and violates the !range constraints we put on loads of this enum. Found by clang -fsanitize=enum. llvm-svn: 170653	2012-12-20 04:02:58 +00:00
Richard Smith	3287fac591	Don't leave IsUnsigned uninitialized in a default-constructed APSInt. Copying such a structure has undefined behavior. Caught by -fsanitize=bool. llvm-svn: 170652	2012-12-20 03:59:24 +00:00
Bill Wendling	4607f4bdad	s/AttributesImpl/AttributeImpl/g This is going to apply to Attribute, not Attributes. llvm-svn: 170631	2012-12-20 01:36:59 +00:00
Jim Grosbach	f9c2e5e450	Clean up some DOxygen comments. llvm-svn: 170629	2012-12-20 01:14:48 +00:00
Jim Grosbach	23f1f957d5	Clean up some DOxygen comments. llvm-svn: 170628	2012-12-20 01:14:45 +00:00
Jim Grosbach	6df94846ec	MC: Add MCInstrDesc::mayAffectControlFlow() method. MC disassembler clients (LLDB) are interested in querying if an instruction may affect control flow other than by virtue of being an explicit branch instruction. For example, instructions which write directly to the PC on some architectures. llvm-svn: 170610	2012-12-19 23:38:53 +00:00
Jim Grosbach	01ab714758	Add isSubRegisterEq() and isSuperRegisterEq(). isSub and isSuper return false if RegA == RegB. Add variants which also include the identity function. llvm-svn: 170609	2012-12-19 23:38:49 +00:00
Jim Grosbach	74c6944a31	Move isSubRegister() and isSuperRegister to MCRegisterInfo. These were defined on TargetRegisterInfo, but they don't use any information that's not available in MCRegisterInfo, so sink them down to be available at the MC layer. llvm-svn: 170608	2012-12-19 23:38:46 +00:00
Jim Grosbach	98e0b8e273	Fix doc comment. '///' not '//'. llvm-svn: 170607	2012-12-19 23:38:44 +00:00
Michael Ilseman	b99f80dea7	Refactor isIntrinsic() to be quicker, and change classof() (and thus, isa<IntrinsicInst>()) to use it. This decreases the number of occurrences of the slow-path string matching performed by getIntrinsicID(). llvm-svn: 170602	2012-12-19 23:17:20 +00:00
Bill Wendling	6848e38daf	s/AttributeListImpl/AttributeSetImpl/g to match the namechange of AttributeList. llvm-svn: 170600	2012-12-19 22:42:22 +00:00
Jakob Stoklund Olesen	8fb0c99a12	Always use addOperand(MF, MO) from MachineInstrBuilder. The single-argument MachineInstr::addOperand(MO) will be removed soon. llvm-svn: 170599	2012-12-19 22:35:46 +00:00
Jakob Stoklund Olesen	b159b5ff0d	Remove the explicit MachineInstrBuilder(MI) constructor. Use the version that also takes an MF reference instead. It would technically be possible to extract an MF reference from the MI as MI->getParent()->getParent(), but that would not work for MIs that are not inserted into any basic block. Given the reasonably small number of places this constructor was used at all, I preferred the compile time check to a run time assertion. llvm-svn: 170588	2012-12-19 21:31:56 +00:00
Benjamin Kramer	870f4fe261	Remove edis remnant. llvm-svn: 170580	2012-12-19 20:11:17 +00:00
Roman Divacky	e3d323052f	Remove edis - the enhanced disassembler. Fixes PR14654. llvm-svn: 170578	2012-12-19 19:55:47 +00:00
Jakob Stoklund Olesen	35641e41eb	Add an MF argument to MachineInstr::addOperand(). Just like for addMemOperand(), the function pointer provides a context for allocating memory. This will make it possible to use a better memory allocation strategy for the MI operand list, which is currently a slow std::vector. Most calls to addOperand() come from MachineInstrBuilder, so give that class an MF reference as well. Code using BuildMI() won't need changing at all since the MF reference is already required to allocate a MachineInstr. Future patches will fix code that calls MI::addOperand(Op) directly, as well as code that uses the now deprecated MachineInstrBuilder(MI) constructor. llvm-svn: 170574	2012-12-19 19:19:01 +00:00
Chad Rosier	5f69df3f03	Remove superfluous brief command from getAsString. llvm-svn: 170569	2012-12-19 18:06:44 +00:00
Patrik Hagglund	f9934613e8	Change AsmOperandInfo::ConstraintVT to MVT, instead of EVT. Accordingly, add MVT::getVT. llvm-svn: 170550	2012-12-19 15:19:11 +00:00
Rafael Espindola	0f00de40dd	Revert 170545 while I debug the ppc failures. llvm-svn: 170547	2012-12-19 14:48:05 +00:00
Benjamin Kramer	ae0bb61053	Make TargetLowering::getTypeConversion more resilient against odd illegal MVTs. - An MVT can become an EVT when being split (e.g. v2i8 -> v1i8, the latter doesn't exist) - Return the scalar value when an MVT is scalarized (v1i64 -> i64) Fixes PR14639ff. llvm-svn: 170546	2012-12-19 14:34:28 +00:00
Rafael Espindola	aa7b27801c	Add r170095 back. I cannot reproduce it the failures locally, so I will keep an eye at the ppc bots. This patch does add the change to the "Disassembly of section" message, but that is not what was failing on the bots. Original message: Add a funciton to get the segment name of a section. On MachO, sections also have segment names. When a tool looking at a .o file prints a segment name, this is what they mean. In reality, a .o has only one anonymous, segment. This patch adds a MachO only function to fetch that segment name. I named it getSectionFinalSegmentName since the main use for the name seems to be infor the linker with segment this section should go to. The patch also changes MachOObjectFile::getSectionName to return just the section name instead of computing SegmentName,SectionName. llvm-svn: 170545	2012-12-19 14:15:04 +00:00
Evgeniy Stepanov	abeae5c7d5	[msan] Add track-origins argument to the pass constructor. llvm-svn: 170544	2012-12-19 13:55:51 +00:00
Patrik Hagglund	e09cac9a67	Change TargetLowering::getTypeForExtArgOrReturn to take and return MVTs, instead of EVTs. llvm-svn: 170537	2012-12-19 12:02:25 +00:00
Patrik Hagglund	3f1905199b	Change a parameter of TargetLowering::getVectorTypeBreakdown to MVT, from EVT. llvm-svn: 170536	2012-12-19 11:53:21 +00:00
Patrik Hagglund	bad545ccba	Change TargetLowering::RegisterTypeForVT to contain MVTs, instead of EVTs. llvm-svn: 170535	2012-12-19 11:48:16 +00:00
Patrik Hagglund	93060569ba	Change TargetLowering::TransformToType to contain MVTs, instead of EVTs. llvm-svn: 170534	2012-12-19 11:42:00 +00:00
Patrik Hagglund	2fc3c59a45	Change TargetLowering::getRepRegClassCostFor, getIndexedLoadAction, getIndexedStoreAction, and addRegisterClass to take and MVT, instead of EVT. llvm-svn: 170533	2012-12-19 11:37:12 +00:00
Patrik Hagglund	f9eb168ef4	Change TargetLowering::findRepresentativeClass to take an MVT, instead of EVT. llvm-svn: 170532	2012-12-19 11:30:36 +00:00
Patrik Hagglund	fd41b5b969	Change TargetLowering::getTypeToPromoteTo to take and return MVTs, instead of EVTs. llvm-svn: 170529	2012-12-19 11:21:04 +00:00
Benjamin Kramer	44ba3753ad	MapVector: Add lookup(). llvm-svn: 170527	2012-12-19 11:08:33 +00:00
Patrik Hagglund	ffd057a3e1	Change TargetLowering::isCondCodeLegal to take an MVT, instead of EVT. llvm-svn: 170524	2012-12-19 10:19:55 +00:00
Patrik Hagglund	deee9003ed	Change TargetLowering::getCondCodeAction to take an MVT, instead of EVT. llvm-svn: 170522	2012-12-19 10:09:26 +00:00
Bill Wendling	a87cdc27d9	Inline hasFunctionOnlyAttrs into its only use. llvm-svn: 170518	2012-12-19 09:15:11 +00:00
Bill Wendling	e9506a211f	Inline the only use of the hasParameterOnlyAttrs method. llvm-svn: 170517	2012-12-19 09:04:58 +00:00
Bill Wendling	d97b75d816	Inline the 'hasIncompatibleWithVarArgsAttrs' method into its only uses. And some minor comment reformatting. llvm-svn: 170516	2012-12-19 08:57:40 +00:00
Patrik Hagglund	d7cdcf8cb5	Change TargetLowering::getTruncStoreAction to take MVTs, instead of EVTs. llvm-svn: 170510	2012-12-19 08:28:51 +00:00
Bill Wendling	3d7b0b8ac7	Rename the 'Attributes' class to 'Attribute'. It's going to represent a single attribute in the future. llvm-svn: 170502	2012-12-19 07:18:57 +00:00
Kevin Enderby	85cf531593	Add to the disassembler C API an option to print the disassembled instructions in the assembly code variant if one exists. The intended use for this is so tools like lldb and darwin's otool(1) can be switched to print Intel-flavored disassembly. I discussed extensively this API with Jim Grosbach and we feel while it may not be fully general, in reality there is only one syntax for each assembly with the exception of X86 which has exactly two for historical reasons. rdar://10989182 llvm-svn: 170477	2012-12-18 23:47:28 +00:00
Jakob Stoklund Olesen	7aafc4bcdd	Remove MachineInstr::setIsInsideBundle(). The bundle flags are now maintained by the slightly higher-level functions bundleWithPred() / bundleWithSucc() which enforce consistent bundle flags between neighboring instructions. See also MIBundleBuilder for an even higher-level approach to building bundles. llvm-svn: 170475	2012-12-18 23:40:14 +00:00
Jakob Stoklund Olesen	d742533dbc	Use bidirectional bundle flags to simplify important functions. The bundle_iterator::operator++ function now doesn't need to dig out the basic block and check against end(). It can use the isBundledWithSucc() flag to find the last bundled instruction safely. Similarly, MachineInstr::isBundled() no longer needs to look at iterators etc. It only has to look at flags. llvm-svn: 170473	2012-12-18 23:21:49 +00:00
Jakob Stoklund Olesen	a33f504b3e	Don't allow the automatically updated MI flags to be set directly. The bundle-related MI flags need to be kept in sync with the neighboring instructions. Don't allow the bulk flag-setting setFlags() function to change them. Also don't copy MI flags when cloning an instruction. The clone's bundle flags will be set when it is explicitly inserted into a bundle. llvm-svn: 170459	2012-12-18 21:36:05 +00:00
Jakob Stoklund Olesen	78eaf05fa7	Tighten up the splice() API for bundled instructions. Remove the instr_iterator versions of the splice() functions. It doesn't seem useful to be able to splice sequences of instructions that don't consist of full bundles. The normal splice functions that take MBB::iterator arguments are not changed, and they can move whole bundles around without any problems. llvm-svn: 170456	2012-12-18 20:59:41 +00:00
Jakob Stoklund Olesen	b8d29bf2e4	Add an assertion for a likely ilist::splice() contract violation. The single-element ilist::splice() function supports a noop move: List.splice(I, List, I); The corresponding std::list function doesn't allow that, so add a unit test to document that behavior. This also means that List.splice(I, List, F); is somewhat surprisingly not equivalent to List.splice(I, List, F, next(F)); This patch adds an assertion to catch the illegal case I == F above. Alternatively, we could make I == F a legal noop, but that would make ilist differ even more from std::list. llvm-svn: 170443	2012-12-18 19:28:37 +00:00
Jakob Stoklund Olesen	422e07b091	Tighten the insert() API for bundled instructions. The normal insert() function takes an MBB::iterator position, and inserts a stand-alone MachineInstr as before. The insert() function that takes an MBB::instr_iterator position can insert instructions inside a bundle, and will now update the bundle flags correctly when that happens. When the insert position is between two bundles, it is unclear whether the instruction should be appended to the previous bundle, prepended to the next bundle, or stand on its own. The MBB::insert() function doesn't bundle the instruction in that case, use the MIBundleBuilder class for that. llvm-svn: 170437	2012-12-18 17:54:53 +00:00
Eli Bendersky	fede6b1d62	Cleanup comment and formatting llvm-svn: 170398	2012-12-18 00:53:36 +00:00
Eric Christopher	906da23229	Add support for passing -main-file-name all the way through to the assembler. Part of PR14624 llvm-svn: 170390	2012-12-18 00:31:01 +00:00
Jakob Stoklund Olesen	ccfb5fb472	Tighten up the erase/remove API for bundled instructions. Most code is oblivious to bundles and uses the MBB::iterator which only visits whole bundles. MBB::erase() operates on whole bundles at a time as before. MBB::remove() now refuses to remove bundled instructions. It is not safe to remove all instructions in a bundle without deleting them since there is no way of returning pointers to all the removed instructions. MBB::remove_instr() and MBB::erase_instr() will now update bundle flags correctly, lifting individual instructions out of bundles while leaving the remaining bundle intact. The MachineInstr convenience functions are updated so eraseFromParent() erases a whole bundle as before eraseFromBundle() erases a single instruction, leaving the rest of its bundle. removeFromParent() refuses to operate on bundled instructions, and removeFromBundle() lifts a single instruction out of its bundle. These functions will no longer accidentally split or coalesce bundles - bundle flags are updated to preserve the existing bundling, and explicit bundleWith* / unbundleFrom* functions should be used to change the instruction bundling. This API update is still a work in progress. I am going to update APIs first so they maintain bundle flags automatically when possible. Then I'll add stricter verification of the bundle flags. llvm-svn: 170384	2012-12-17 23:55:38 +00:00
Chandler Carruth	10700aad85	Prepare LLVM to fix PR14625, exposing a hook in MCContext to manage the compilation directory. This defaults to the current working directory, just as it always has, but now an assembler can choose to override it with a custom directory. I've taught llvm-mc about this option and added a test case. llvm-svn: 170371	2012-12-17 21:32:42 +00:00
Michael Ilseman	5feb4e17d0	Remove trailing whitespace llvm-svn: 170368	2012-12-17 20:40:14 +00:00
Nick Kledzik	95850c24a4	Use different trait techniques to be compatible with g++ llvm-svn: 170355	2012-12-17 19:02:05 +00:00
Tim Northover	5edabc131a	Teach MachO which sections contain code llvm-svn: 170349	2012-12-17 17:59:32 +00:00
Duncan Sands	7cb52522fe	Fix typo that results in new landing pads not getting a name, fixing PR14617. Patch by Chris Toshok. llvm-svn: 170318	2012-12-17 12:02:36 +00:00
Duncan Sands	66c2cd3d88	Fix comment typo. llvm-svn: 170317	2012-12-17 11:43:15 +00:00
Reed Kotler	aee4d5d194	This patch is needed to make c++ exceptions work for mips16. Mips16 is really a processor decoding mode (ala thumb 1) and in the same program, mips16 and mips32 functions can exist and can call each other. If a jal type instruction encounters an address with the lower bit set, then the processor switches to mips16 mode (if it is not already in it). If the lower bit is not set, then it switches to mips32 mode. The linker knows which functions are mips16 and which are mips32. When relocation is performed on code labels, this lower order bit is set if the code label is a mips16 code label. In general this works just fine, however when creating exception handling tables and dwarf, there are cases where you don't want this lower order bit added in. This has been traditionally distinguished in gas assembly source by using a different syntax for the label. lab1: ; this will cause the lower order bit to be added lab2=. ; this will not cause the lower order bit to be added In some cases, it does not matter because in dwarf and debug tables the difference of two labels is used and in that case the lower order bits subtract each other out. To fix this, I have added to mcstreamer the notion of a debuglabel. The default is for label and debug label to be the same. So calling EmitLabel and EmitDebugLabel produce the same result. For various reasons, there is only one set of labels that needs to be modified for the mips exceptions to work. These are the "$eh_func_beginXXX" labels. Mips overrides the debug label suffix from ":" to "=." . This initial patch fixes exceptions. More changes most likely will be needed to DwarfCFException to make all of this work for actual debugging. These changes will be to emit debug labels in some places where a simple label is emitted now. Some historical discussion on this from gcc can be found at: http://gcc.gnu.org/ml/gcc-patches/2008-08/msg00623.html http://gcc.gnu.org/ml/gcc-patches/2008-11/msg01273.html llvm-svn: 170279	2012-12-16 04:00:45 +00:00
Pedro Artigas	b95c53e216	Add more reset methods to make all objects that the backend may use for outputting code have a reset, some are not used but were declared for completeness llvm-svn: 170227	2012-12-14 18:52:11 +00:00
NAKAMURA Takumi	92eb254c3c	[CMake] Move libxml2 stuff from clang to llvm/cmake. llvm-svn: 170225	2012-12-14 18:30:20 +00:00
Bill Schmidt	9f0b4ec0f5	This patch improves the 64-bit PowerPC InitialExec TLS support by providing for a wider range of GOT entries that can hold thread-relative offsets. This matches the behavior of GCC, which was not documented in the PPC64 TLS ABI. The ABI will be updated with the new code sequence. Former sequence: ld 9,x@got@tprel(2) add 9,9,x@tls New sequence: addis 9,2,x@got@tprel@ha ld 9,x@got@tprel@l(9) add 9,9,x@tls Note that a linker optimization exists to transform the new sequence into the shorter sequence when appropriate, by replacing the addis with a nop and modifying the base register and relocation type of the ld. llvm-svn: 170209	2012-12-14 17:02:38 +00:00
Patrik Hagglund	55d6f47a37	Change TargetLowering::getLoadExtAction to take an MVT, instead of EVT. llvm-svn: 170183	2012-12-14 09:05:13 +00:00
Chris Lattner	1117cb2f16	fix comment. llvm-svn: 170155	2012-12-13 22:34:43 +00:00
Patrik Hagglund	13abe5ec3c	Change TargetLowering::setTypeAction to take an MVT, instead fo EVT. llvm-svn: 170148	2012-12-13 20:42:43 +00:00
Patrik Hagglund	05394352c0	Change TargetLowering::getRepRegClassFor to take an MVT, instead of EVT. Accordingly, change RegDefIter to contain MVTs instead of EVTs. llvm-svn: 170140	2012-12-13 18:45:35 +00:00
NAKAMURA Takumi	887c8586e2	JITEventListener.h: Use llvm-config.h instead of config.h. llvm-svn: 170129	2012-12-13 15:03:38 +00:00
Eric Christopher	c859c2912f	Revert "Add a funciton to get the segment name of a section." This reverts commit r170095 since it appears to be breaking the bots. llvm-svn: 170105	2012-12-13 06:36:18 +00:00
Patrik Hagglund	5e6c361bc0	Change TargetLowering::getRegClassFor to take an MVT, instead of EVT. Accordingly, add helper funtions getSimpleValueType (in parallel to getValueType) in SDValue, SDNode, and TargetLowering. This is the first, in a series of patches. This is the second attempt. In the first attempt (r169837), a few getSimpleVT() were hoisted too far, detected by bootstrap failures. llvm-svn: 170104	2012-12-13 06:34:11 +00:00
Rafael Espindola	bc8016d062	Add a funciton to get the segment name of a section. On MachO, sections also have segment names. When a tool looking at a .o file prints a segment name, this is what they mean. In reality, a .o has only one, anonymous, segment. This patch adds a MachO only function to fetch that segment name. I named it getSectionFinalSegmentName since the main use for the name seems to be informing the linker with segment this section should go to. The patch also changes MachOObjectFile::getSectionName to return just the section name instead of computing SegmentName,SectionName. llvm-svn: 170095	2012-12-13 04:07:18 +00:00
Rafael Espindola	319f74cd11	Rename isPowerOfTwo to isKnownToBeAPowerOfTwo. In a previous thread it was pointed out that isPowerOfTwo is not a very precise name since it can return false for powers of two if it is unable to show that they are powers of two. llvm-svn: 170093	2012-12-13 03:37:24 +00:00
Michael Ilseman	536cc32ba0	Pattern matching code for intrinsics. Provides m_Argument that allows matching against a CallSite's specified argument. Provides m_Intrinsic pattern that can be templatized over the intrinsic id and bind/match arguments similarly to other pattern matchers. Implementations provided for 0 to 4 arguments, though it's very simple to extend for more. Also provides example template specialization for bswap (m_BSwap) and example of code cleanup for its use. llvm-svn: 170091	2012-12-13 03:13:36 +00:00
Eric Christopher	cf6d6317cc	Remove extraneous debugging code. llvm-svn: 170090	2012-12-13 03:07:28 +00:00
Eric Christopher	80882db88f	Add a way of printing out an arbitrary label name for a section given the section. llvm-svn: 170087	2012-12-13 03:00:35 +00:00
Michael Ilseman	eae9752381	m_CombineOr and m_CombineAnd pattern combinators llvm-svn: 170086	2012-12-13 02:55:53 +00:00
Jakob Stoklund Olesen	a9f236cc88	Express prepend and append in terms of a more generic insert(). Also add an MIBundleBuilder constructor that takes an existing bundle. Together these functions make it possible to add instructions to existing bundles. llvm-svn: 170063	2012-12-13 00:59:36 +00:00
Pedro Artigas	7212ee4534	Make the MCStreamer have a reset method and call that after finalization of the asm printer, also changed MCContext to a single reset only method for simplicity as requested on the list llvm-svn: 170041	2012-12-12 22:59:46 +00:00
Benjamin Kramer	36b0f12474	YAMLIO: Remove all of the template instantiation hacks, I don't see why they're necessary and it breaks linking of the unit tests. Also comes with a clang-format run on the cpp file, it had major style violations. llvm-svn: 170036	2012-12-12 22:40:02 +00:00
Nick Kledzik	323bcb9eb9	AlignedCharArrayUnion is erroring with non-clang compilers llvm-svn: 170031	2012-12-12 22:03:57 +00:00
Nick Kledzik	f60a9279ea	Initial implementation of a utility for converting native data structures to and from YAML using traits. The first client will be the test suite of lld. The documentation will show up at: http://llvm.org/docs/YamlIO.html llvm-svn: 170019	2012-12-12 20:46:15 +00:00
Eli Bendersky	e11ab3aafe	Make naming consistent, add comments and sanity asserts llvm-svn: 170007	2012-12-12 19:54:05 +00:00
Nadav Rotem	d0bb22bba3	LoopVectorizer: Use the "optsize" attribute to decide if we are allowed to increase the function size. llvm-svn: 170004	2012-12-12 19:29:45 +00:00
Bill Schmidt	24b8dd6eb7	This patch implements local-dynamic TLS model support for the 64-bit PowerPC target. This is the last of the four models, so we now have full TLS support. This is mostly a straightforward extension of the general dynamic model. I had to use an additional Chain operand to tie ADDIS_DTPREL_HA to the register copy following ADDI_TLSLD_L; otherwise everything above the ADDIS_DTPREL_HA appeared dead and was removed. As before, there are new test cases to test the assembly generation, and the relocations output during integrated assembly. The expected code gen sequence can be read in test/CodeGen/PowerPC/tls-ld.ll. There are a couple of things I think can be done more efficiently in the overall TLS code, so there will likely be a clean-up patch forthcoming; but for now I want to be sure the functionality is in place. Bill llvm-svn: 170003	2012-12-12 19:29:35 +00:00
Rafael Espindola	e40238069e	The TargetData is not used for the isPowerOfTwo determination. It has never been used in the first place. It simply was passed to the function and to the recursive invocations. Simply drop the parameter and update the callers for the new signature. Patch by Saleem Abdulrasool! llvm-svn: 169988	2012-12-12 16:52:40 +00:00
Alexey Samsonov	3d43b63a6e	Improve debug info generated with enabled AddressSanitizer. When ASan replaces <alloca instruction> with <offset into a common large alloca>, it should also patch llvm.dbg.declare calls and replace debug info descriptors to mark that we've replaced alloca with a value that stores an address of the user variable, not the user variable itself. See PR11818 for more context. llvm-svn: 169984	2012-12-12 14:31:53 +00:00
Logan Chien	4dd14fb5eb	Add ARM NONE and PREL31 relocation types. Add R_ARM_NONE and R_ARM_PREL31 relocation types to MCExpr. Both of them will be used while generating .ARM.extab and .ARM.exidx sections. llvm-svn: 169965	2012-12-12 07:14:46 +00:00
Evan Cheng	962711ee71	Sorry about the churn. One more change to getOptimalMemOpType() hook. Did I mention the inline memcpy / memset expansion code is a mess? This patch split the ZeroOrLdSrc argument into two: IsMemset and ZeroMemset. The first indicates whether it is expanding a memset or a memcpy / memmove. The later is whether the memset is a memset of zero. It's totally possible (likely even) that targets may want to do different things for memcpy and memset of zero. llvm-svn: 169959	2012-12-12 02:34:41 +00:00
Evan Cheng	c3d1aca657	- Rename isLegalMemOpType to isSafeMemOpType. "Legal" is a very overloade term. Also added more comments to explain why it is generally ok to return true. - Rename getOptimalMemOpType argument IsZeroVal to ZeroOrLdSrc. It's meant to be true for loaded source (memcpy) or zero constants (memset). The poor name choice is probably some kind of legacy issue. llvm-svn: 169954	2012-12-12 01:32:07 +00:00
Nadav Rotem	aeb17df802	LoopVectorizer: When -Os is used, vectorize only loops that dont require a tail loop. There is no testcase because I dont know of a way to initialize the loop vectorizer pass without adding an additional hidden flag. llvm-svn: 169950	2012-12-12 01:11:46 +00:00
Evan Cheng	04e5518783	Avoid using lossy load / stores for memcpy / memset expansion. e.g. f64 load / store on non-SSE2 x86 targets. llvm-svn: 169944	2012-12-12 00:42:09 +00:00
Michael Ilseman	bb6f691b01	Added a slew of SimplifyInstruction floating-point optimizations, many of which take advantage of fast-math flags. Test cases included. fsub X, +0 ==> X fsub X, -0 ==> X, when we know X is not -0 fsub +/-0.0, (fsub -0.0, X) ==> X fsub nsz +/-0.0, (fsub +/-0.0, X) ==> X fsub nnan ninf X, X ==> 0.0 fadd nsz X, 0 ==> X fadd [nnan ninf] X, (fsub [nnan ninf] 0, X) ==> 0 where nnan and ninf have to occur at least once somewhere in this expression fmul X, 1.0 ==> X llvm-svn: 169940	2012-12-12 00:27:46 +00:00
Michael Ilseman	5cd69b4ce3	Pattern matchers for floating point values m_ConstantFP - match and bind a float constant m_SpecificConstantFP - match a specific floating point value or vector of floats of that value m_FPOne - match a floating point 1.0 or vector of 1.0s m_NegZero - match -0.0 m_AnyZero - match 0 or -0.0 llvm-svn: 169939	2012-12-12 00:23:43 +00:00
Michael Ilseman	d9d61793e6	Remove FIXMEs surrounding Constant[Data]Vectors, instead llvm-svn: 169938	2012-12-12 00:21:43 +00:00
Evan Cheng	eb54240dc2	Replace TargetLowering::isIntImmLegal() with ScalarTargetTransformInfo::getIntImmCost() instead. "Legal" is a poorly defined term for something like integer immediate materialization. It is always possible to materialize an integer immediate. Whether to use it for memcpy expansion is more a "cost" conceern. llvm-svn: 169929	2012-12-11 23:26:14 +00:00
Tom Stellard	75aadc2813	Add R600 backend A new backend supporting AMD GPUs: Radeon HD2XXX - HD7XXX llvm-svn: 169915	2012-12-11 21:25:42 +00:00
Bill Schmidt	c56f1d34bc	This patch implements the general dynamic TLS model for 64-bit PowerPC. Given a thread-local symbol x with global-dynamic access, the generated code to obtain x's address is: Instruction Relocation Symbol addis ra,r2,x@got@tlsgd@ha R_PPC64_GOT_TLSGD16_HA x addi r3,ra,x@got@tlsgd@l R_PPC64_GOT_TLSGD16_L x bl __tls_get_addr(x@tlsgd) R_PPC64_TLSGD x R_PPC64_REL24 __tls_get_addr nop <use address in r3> The implementation borrows from the medium code model work for introducing special forms of ADDIS and ADDI into the DAG representation. This is made slightly more complicated by having to introduce a call to the external function __tls_get_addr. Using the full call machinery is overkill and, more importantly, makes it difficult to add a special relocation. So I've introduced another opcode GET_TLS_ADDR to represent the function call, and surrounded it with register copies to set up the parameter and return value. Most of the code is pretty straightforward. I ran into one peculiarity when I introduced a new PPC opcode BL8_NOP_ELF_TLSGD, which is just like BL8_NOP_ELF except that it takes another parameter to represent the symbol ("x" above) that requires a relocation on the call. Something in the TblGen machinery causes BL8_NOP_ELF and BL8_NOP_ELF_TLSGD to be treated identically during the emit phase, so this second operand was never visited to generate relocations. This is the reason for the slightly messy workaround in PPCMCCodeEmitter.cpp:getDirectBrEncoding(). Two new tests are included to demonstrate correct external assembly and correct generation of relocations using the integrated assembler. Comments welcome! Thanks, Bill llvm-svn: 169910	2012-12-11 20:30:11 +00:00
Rafael Espindola	a92da5b34f	Use an ArrayRef instead of a std::vector&. llvm-svn: 169881	2012-12-11 16:36:02 +00:00
Patrik Hagglund	e98b7a0389	Revert EVT->MVT changes, r169836-169851, due to buildbot failures. llvm-svn: 169854	2012-12-11 11:14:33 +00:00
Chandler Carruth	7ec41c7827	Holding my nose and moving the accumulation routine to GEPOperator instead of the instruction. I've left a forwarding wrapper for the instruction so users with the instruction don't need to create a GEPOperator themselves. This lets us remove the copy of this code in instsimplify. I've looked at most of the other copies of similar code, and this is the only one I've found that is actually exactly the same. The one in InlineCost is very close, but it requires re-mapping non-constant indices through the cost analysis value simplification map. I could add direct support for this to the generic routine, but it seems overly specific. llvm-svn: 169853	2012-12-11 11:05:15 +00:00
Chandler Carruth	1e14053d84	Hoist the GEP constant address offset computation to a common home on the GEP instruction class. This is part of the continued refactoring and cleaning of the infrastructure used by SROA. This particular operation is also done in a few other places which I'll try to refactor to share this implementation. llvm-svn: 169852	2012-12-11 10:29:10 +00:00
Patrik Hagglund	ad432a8e70	Change TargetLowering::getTypeForExtArgOrReturn to take and return MVTs, instead of EVTs. Accordingly, add bitsLT (and similar) to MVT. llvm-svn: 169850	2012-12-11 10:20:51 +00:00
Patrik Hagglund	d34337495e	Change a parameter of TargetLowering::getVectorTypeBreakdown to MVT, from EVT. llvm-svn: 169849	2012-12-11 10:16:19 +00:00
Patrik Hagglund	03e9628cfa	Change TargetLowering::RegisterTypeForVT to contain MVTs, instead of EVTs. llvm-svn: 169848	2012-12-11 10:09:23 +00:00
Patrik Hagglund	c50489e203	Change TargetLowering::TransformToType to contain MVTs, instead of EVTs. llvm-svn: 169847	2012-12-11 10:05:04 +00:00
Patrik Hagglund	7d0ba05894	Change TargetLowering::getRepRegClassCostFor, getIndexedLoadAction, getIndexedStoreAction, and addRegisterClass to take an MVT, instead of EVT. llvm-svn: 169846	2012-12-11 10:00:35 +00:00
Patrik Hagglund	8d2e7cf561	Change TargetLowering::findRepresentativeClass to take an MVT, instead of EVT. llvm-svn: 169845	2012-12-11 09:57:18 +00:00
Patrik Hagglund	ffb60f7c08	Change TargetLowering::getTypeToPromoteTo to take and return MVTs, instead of EVTs. llvm-svn: 169844	2012-12-11 09:54:23 +00:00
Patrik Hagglund	a970281106	Change TargetLowering::isCondCodeLegal to take an MVT, instead of EVT. llvm-svn: 169843	2012-12-11 09:51:27 +00:00
Patrik Hagglund	e3bec6365a	Change TargetLowering::getCondCodeAction to take an MVT, instead of EVT. llvm-svn: 169842	2012-12-11 09:48:14 +00:00
Patrik Hagglund	7ffcd226dd	Change TargetLowering::getTruncStoreAction to take MVTs, instead of EVTs. llvm-svn: 169841	2012-12-11 09:42:24 +00:00
Patrik Hagglund	cbc9d4d0f9	Change TargetLowering::getLoadExtAction to take an MVT, instead of EVT. llvm-svn: 169840	2012-12-11 09:39:09 +00:00
Patrik Hagglund	40e1afe970	Change TargetLowering::setTypeAction to take an MVT, instead fo EVT. llvm-svn: 169839	2012-12-11 09:32:56 +00:00
Patrik Hagglund	57b1694df1	Change TargetLowering::getRepRegClassFor to take an MVT, instead of EVT. Accordingly, change RegDefIter to contain MVTs instead of EVTs. llvm-svn: 169838	2012-12-11 09:31:43 +00:00
Patrik Hagglund	3708e548f8	Change TargetLowering::getRegClassFor to take an MVT, instead of EVT. Accordingly, add helper funtions getSimpleValueType (in parallel to getValueType) in SDValue, SDNode, and TargetLowering. This is the first, in a series of patches. llvm-svn: 169837	2012-12-11 09:10:33 +00:00
NAKAMURA Takumi	15c8df025d	llvm/Target/TargetMachine.h: Remove two dependent headers. -#include "llvm/Target/TargetTransformImpl.h" -#include "llvm/TargetTransformInfo.h" llvm-svn: 169818	2012-12-11 05:53:43 +00:00
Chad Rosier	df42cf39ab	Fall back to the selection dag isel to select tail calls. This shouldn't affect codegen for -O0 compiles as tail call markers are not emitted in unoptimized compiles. Testing with the external/internal nightly test suite reveals no change in compile time performance. Testing with -O1, -O2 and -O3 with fast-isel enabled did not cause any compile-time or execution-time failures. All tests were performed on my x86 machine. I'll monitor our arm testers to ensure no regressions occur there. In an upcoming clang patch I will be marking the objc_autoreleaseReturnValue and objc_retainAutoreleaseReturnValue as tail calls unconditionally. While it's theoretically true that this is just an optimization, it's an optimization that we very much want to happen even at -O0, or else ARC applications become substantially harder to debug. Part of rdar://12553082 llvm-svn: 169796	2012-12-11 00:18:02 +00:00
Evan Cheng	79e2ca90bc	Some enhancements for memcpy / memset inline expansion. 1. Teach it to use overlapping unaligned load / store to copy / set the trailing bytes. e.g. On 86, use two pairs of movups / movaps for 17 - 31 byte copies. 2. Use f64 for memcpy / memset on targets where i64 is not legal but f64 is. e.g. x86 and ARM. 3. When memcpy from a constant string, do not replace the load with a constant if it's not possible to materialize an integer immediate with a single instruction (required a new target hook: TLI.isIntImmLegal()). 4. Use unaligned load / stores more aggressively if target hooks indicates they are "fast". 5. Update ARM target hooks to use unaligned load / stores. e.g. vld1.8 / vst1.8. Also increase the threshold to something reasonable (8 for memset, 4 pairs for memcpy). This significantly improves Dhrystone, up to 50% on ARM iOS devices. rdar://12760078 llvm-svn: 169791	2012-12-10 23:21:26 +00:00
Lang Hames	517fc8b264	Defer call to InitSections until after MCContext has been initialized. If InitSections is called before the MCContext is initialized it could cause duplicate temporary symbols to be emitted later (after context initialization resets the temporary label counter). llvm-svn: 169785	2012-12-10 22:49:11 +00:00
Eric Christopher	200dd760fa	Fix a coding style nit. llvm-svn: 169776	2012-12-10 22:00:20 +00:00
Bill Wendling	4a8fc8f271	Revert r169656. The linker will call `lto_codegen_add_must_preserve_symbol' on all globals that should be kept around. The linker will pretend that a dylib is being created. <rdar://problem/12528059> llvm-svn: 169770	2012-12-10 21:33:45 +00:00
Eli Bendersky	4c7296fd1a	Cleanup formatting, comments and naming. llvm-svn: 169762	2012-12-10 20:13:43 +00:00
Bill Wendling	74f334e476	Don't use a red zone for code coverage if the user specified `-mno-red-zone'. The `-mno-red-zone' flag wasn't being propagated to the functions that code coverage generates. This allowed some of them to use the red zone when that wasn't allowed. <rdar://problem/12843084> llvm-svn: 169754	2012-12-10 19:46:49 +00:00
Sean Silva	aab278fbba	Fix funky copy-pasted grammatical error. PR14343 llvm-svn: 169742	2012-12-10 18:37:26 +00:00
Chandler Carruth	e41e7b7901	Add a new visitor for walking the uses of a pointer value. This visitor provides infrastructure for recursively traversing the use-graph of a pointer-producing instruction like an alloca or a malloc. It maintains a worklist of uses to visit, so it can handle very deep recursions. It automatically looks through instructions which simply translate one pointer to another (bitcasts and GEPs). It tracks the offset relative to the original pointer as long as that offset remains constant and exposes it during the visit as an APInt offset. Finally, it performs conservative escape analysis. However, currently it has some limitations that should be addressed going forward: 1) It doesn't handle vectors of pointers. 2) It doesn't provide a cheaper visitor when the constant offset tracking isn't needed. 3) It doesn't support non-instruction pointer values. The current functionality is exactly what is required to implement the SROA pointer-use visitors in terms of this one, rather than in terms of their own ad-hoc base visitor, which was always very poorly specified. SROA has been converted to use this, and the code there deleted which this utility now provides. Technically speaking, using this new visitor allows SROA to handle a few more cases than it previously did. It is now more aggressive in ignoring chains of instructions which look like they would defeat SROA, but in fact do not because they never result in a read or write of memory. While this is "neat", it shouldn't be interesting for real programs as any such chains should have been removed by others passes long before we get to SROA. As a consequence, I've not added any tests for these features -- it shouldn't be part of SROA's contract to perform such heroics. The goal is to extend the functionality of this visitor going forward, and re-use it from passes like ASan that can benefit from doing a detailed walk of the uses of a pointer. Thanks to Ben Kramer for the code review rounds and lots of help reviewing and debugging this patch. llvm-svn: 169728	2012-12-10 08:28:39 +00:00
Michael Ilseman	65f1435a6f	Reorganize FastMathFlags to be a wrapper around unsigned, and streamline some interfaces. llvm-svn: 169712	2012-12-09 21:12:04 +00:00
Paul Redmond	2adb13c100	LoopVectorize: support vectorizing intrinsic calls - added function to VectorTargetTransformInfo to query cost of intrinsics - vectorize trivially vectorizable intrinsic calls such as sin, cos, log, etc. Reviewed by: Nadav llvm-svn: 169711	2012-12-09 20:42:17 +00:00
Michael Ilseman	6d2ffa1858	Have the bitcode reader/writer just use FPMathOperator's fast math enum directly llvm-svn: 169710	2012-12-09 20:23:16 +00:00
Shuxin Yang	95de7c37e2	- Re-enable population count loop idiom recognization - fix a bug which cause sigfault. - add two testing cases which was causing crash llvm-svn: 169687	2012-12-09 03:12:46 +00:00
Chandler Carruth	91e47532fe	Revert the patches adding a popcount loop idiom recognition pass. There are still bugs in this pass, as well as other issues that are being worked on, but the bugs are crashers that occur pretty easily in the wild. Test cases have been sent to the original commit's review thread. This reverts the commits: r169671: Fix a logic error. r169604: Move the popcnt tests to an X86 subdirectory. r168931: Initial commit adding the pass. llvm-svn: 169683	2012-12-08 22:18:29 +00:00
Logan Chien	6ebca4be33	Fix Windows build breakage. Windows does not have <stdint.h>, should include "llvm/Support/DataTypes.h" instead. llvm-svn: 169672	2012-12-08 05:19:49 +00:00
Bill Wendling	65a6ee11dd	Add the `lto_codegen_set_export_dynamic' function. This function sets the `_exportDynamic' ivar. When that's set, we export all symbols (e.g. we don't run the internalize pass). This is equivalent to the `--export-dynamic' linker flag in GNU land: --export-dynamic When creating a dynamically linked executable, add all symbols to the dynamic symbol table. The dynamic symbol table is the set of symbols which are visible from dynamic objects at run time. If you do not use this option, the dynamic symbol table will normally contain only those symbols which are referenced by some dynamic object mentioned in the link. If you use dlopen to load a dynamic object which needs to refer back to the symbols defined by the program, rather than some other dynamic object, then you will probably need to use this option when linking the program itself. The Darwin linker will support this via the `-export_dynamic' flag. We should modify clang to support this via the `-rdynamic' flag. llvm-svn: 169656	2012-12-08 00:18:16 +00:00
Jim Grosbach	0ca9d5b7a5	Add C API for specifying CPU to the disassembler. It was a nasty oversight that we didn't include this when we added this API in the first place. Blech. rdar://12839439 llvm-svn: 169653	2012-12-07 23:53:27 +00:00
Bill Wendling	e94d843e43	s/AttrListPtr/AttributeSet/g to better label what this class is going to be in the near future. llvm-svn: 169651	2012-12-07 23:16:57 +00:00
Eli Bendersky	84b2a79570	Make the contents of encoded sections SmallVector<char, N> instead of SmallString. This makes it possible to use the length-erased SmallVectorImpl in the interface without imposing buffer size. Thus, the size of MCInstFragment is back down since a preallocated 8-byte contents buffer is enough. It would be generally a good idea to rid all the fragments of SmallString as contents, because a vector just makes more sense. llvm-svn: 169644	2012-12-07 22:06:56 +00:00
Michael Ilseman	e76c1e5aec	Remove trailing whitespace llvm-svn: 169637	2012-12-07 21:41:53 +00:00
Ted Kremenek	8047dea32d	Mark ImmutableMap::remove/add() const. llvm-svn: 169629	2012-12-07 19:44:12 +00:00
Eli Bendersky	a31a894eed	Refactor MCInstFragment and MCDataFragment to adhere to a common interface, which removes code duplication and prepares the ground for future additions. Full discussion: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20121203/158233.html llvm-svn: 169626	2012-12-07 19:13:57 +00:00
Eli Bendersky	ea2824dc88	Lift EmitAssignment into MCObjectStreamer which gets rid of at least three duplicate implementations in format-specific streamers. llvm-svn: 169613	2012-12-07 17:42:41 +00:00
Tim Northover	5cc3dc86bb	Added Mapping Symbols for ARM ELF Before this patch, when you objdump an LLVM-compiled file, objdump tried to decode data-in-code sections as if they were code. This patch adds the missing Mapping Symbols, as defined by "ELF for the ARM Architecture" (ARM IHI 0044D). Patch based on work by Greg Fitzgerald. llvm-svn: 169609	2012-12-07 16:50:23 +00:00
Logan Chien	59ff070376	Split MCELFStreamer into a header file. llvm-svn: 169603	2012-12-07 15:50:40 +00:00
Jakob Stoklund Olesen	8001d2ef11	Add an MIBundleBuilder class. Like the MachineInstrBuilder, this class makes it easier to build bundles of MachineInstrs. llvm-svn: 169584	2012-12-07 04:23:35 +00:00
Jakob Stoklund Olesen	fead62d4f4	Add higher-level API for dealing with bundled MachineInstrs. This is still a work in progress. The purpose is to make bundling and unbundling operations explicit, and to catch errors where bundles are broken or created inadvertently. The old IsInsideBundle flag is replaced by two MI flags: BundledPred which has the same meaning as IsInsideBundle, and BundledSucc which is set on instructions that are bundled with a successor. Having two flags provdes redundancy to detect when a bundle is inadvertently torn by a splice() or insert(), and it makes it possible to write bundle iterators that don't need to peek at adjacent instructions. The new flags can't be manipulated directly (once setIsInsideBundle is gone). Instead there are MI functions to make and break bundle bonds. The setIsInsideBundle function will be removed in a future commit. It should be replaced by bundleWithPred(). llvm-svn: 169583	2012-12-07 04:23:29 +00:00
Ted Kremenek	e483d87bd9	Add manualRetain() and manualRelease() to ImmutableMapRef, and add a new constructor. llvm-svn: 169572	2012-12-07 02:03:00 +00:00
Eli Bendersky	0159cab70c	Add convenience accessor to Triple for OS == NaCl llvm-svn: 169565	2012-12-07 00:01:53 +00:00
Pedro Artigas	e84b13f039	fixed valgrind issues of prior commit, this change applies r169456 changes back to the tree with fixes. on darwin no valgrind issues exist in the tests that used to fail. original change description: change MCContext to work on the doInitialization/doFinalization model reviewed by Evan Cheng <evan.cheng@apple.com> llvm-svn: 169553	2012-12-06 22:12:44 +00:00
Ted Kremenek	625add4c46	Revert "Allow modifying an ImmutableMap without canonicalizing it immediately." Jordan and I discussed this, and we don't want this in the API. llvm-svn: 169541	2012-12-06 19:41:30 +00:00
Evan Cheng	9ec512d768	Replace r169459 with something safer. Rather than having computeMaskedBits to understand target implementation of any_extend / extload, just generate zero_extend in place of any_extend for liveouts when the target knows the zero_extend will be implicit (e.g. ARM ldrb / ldrh) or folded (e.g. x86 movz). rdar://12771555 llvm-svn: 169536	2012-12-06 19:13:27 +00:00
Jordan Rose	9163978b2c	Allow modifying an ImmutableMap without canonicalizing it immediately. This is an alternative to the ImmutableMapRef interface where a factory should still be canonicalizing by default, but in certain cases an improvement can be made by delaying the canonicalization. llvm-svn: 169532	2012-12-06 19:01:24 +00:00
Bill Wendling	28fe9e7a36	Handle non-default array bounds. Some languages, e.g. Ada and Pascal, allow you to specify that the array bounds are different from the default (1 in these cases). If we have a lower bound that's non-default, then we emit the lower bound. We also calculate the correct upper bound in those cases. llvm-svn: 169484	2012-12-06 07:38:10 +00:00
NAKAMURA Takumi	d985d76040	Revert r169456, "change MCContext to work on the doInitialization/doFinalization model" It broke many builders. llvm-svn: 169462	2012-12-06 02:00:13 +00:00
Evan Cheng	5213139f48	Let targets provide hooks that compute known zero and ones for any_extend and extload's. If they are implemented as zero-extend, or implicitly zero-extend, then this can enable more demanded bits optimizations. e.g. define void @foo(i16* %ptr, i32 %a) nounwind { entry: %tmp1 = icmp ult i32 %a, 100 br i1 %tmp1, label %bb1, label %bb2 bb1: %tmp2 = load i16* %ptr, align 2 br label %bb2 bb2: %tmp3 = phi i16 [ 0, %entry ], [ %tmp2, %bb1 ] %cmp = icmp ult i16 %tmp3, 24 br i1 %cmp, label %bb3, label %exit bb3: call void @bar() nounwind br label %exit exit: ret void } This compiles to the followings before: push {lr} mov r2, #0 cmp r1, #99 bhi LBB0_2 @ BB#1: @ %bb1 ldrh r2, [r0] LBB0_2: @ %bb2 uxth r0, r2 cmp r0, #23 bhi LBB0_4 @ BB#3: @ %bb3 bl _bar LBB0_4: @ %exit pop {lr} bx lr The uxth is not needed since ldrh implicitly zero-extend the high bits. With this change it's eliminated. rdar://12771555 llvm-svn: 169459	2012-12-06 01:28:01 +00:00
Pedro Artigas	bf7d3bab26	change MCContext to work on the doInitialization/doFinalization model reviewed by Evan Cheng <evan.cheng@apple.com> llvm-svn: 169456	2012-12-06 00:50:55 +00:00
Andrew Trick	d3226eee03	RegPressureTracker::dump(): Remove unnecessary argument. llvm-svn: 169443	2012-12-05 23:05:22 +00:00
Eli Bendersky	02631c4e31	Change std::vector to SmallVector<4> and remove some unused methods. This is more consistent with other vectors in this code. In addition, I ran some tests compiling a large program and >96% of fragments have 4 or less fixups, so SmallVector<4> is a good optimization. llvm-svn: 169433	2012-12-05 22:11:02 +00:00
Andrew Trick	7bbcad7bcd	RegisterPressureTracker: unify virtual registers and physical regunits. Now that live register units are tracked individually, the code can be simplified. llvm-svn: 169426	2012-12-05 21:37:47 +00:00
Andrew Trick	7f7cee39ab	RegisterPresssureTracker: Track live physical register by unit. This is much simpler to reason about, more efficient, and fixes some corner cases involving implicit super-register defs. Fixed rdar://12797931. llvm-svn: 169425	2012-12-05 21:37:42 +00:00
Eli Bendersky	a5b779a87c	Remove unused methods llvm-svn: 169419	2012-12-05 20:56:39 +00:00
Michael J. Spencer	fa8e8c5193	Updates to Win64EH.h structures. Change member types of RuntimeFunction and UnwindInfo from uint64_t to uint32_t: These members represent addresses. According to MSDN, they are image relative, that is, they are 32-bit offsets from the starting address of the image that contains the function table entry. See MSDN for more information: RUNTIME_FUNCTION: http://msdn.microsoft.com/en-us/library/ft9x1kdx.aspx UNWIND_INFO: http://msdn.microsoft.com/en-us/library/ddssxxy8.aspx Make Win64.h platform-neutral: The standard types unit8_t, uint16_t and uint32_t are replaced with their counterparts from Endian.h. Accessor functions are introduced to replace bit fields. Patch by João Matos and Kai Nacke. llvm-svn: 169414	2012-12-05 20:12:13 +00:00
Eli Bendersky	80119eae27	Remove the non-const getInst accessor. It wasn't being used, and isn't very good for enacpsulation anyway. llvm-svn: 169407	2012-12-05 19:31:33 +00:00
Andrew Trick	7dba3952fd	Remove two dead functions resulting from a bad rebase. llvm-svn: 169401	2012-12-05 18:52:15 +00:00
Benjamin Kramer	507aca835e	Try to unbreak the build on hosts that don't transitively pull in a definition for int64_t. Also use the portable (ugly) format string macros, for MSVC compatibility. llvm-svn: 169396	2012-12-05 18:31:11 +00:00
Jakob Stoklund Olesen	a97cec790f	Remove unused MachineInstr constructors. A MachineInstr can only ever be constructed by CreateMachineInstr() and CloneMachineInstr(), and those factories don't use the removed constructors. llvm-svn: 169395	2012-12-05 18:27:39 +00:00
Kevin Enderby	168ffb36a5	Added a option to the disassembler to print immediates as hex. This is for the lldb team so most of but not all of the values are to be printed as hex with this option. Some small values like the scale in an X86 address were requested to printed in decimal without the leading 0x. There may be some tweaks need to places that may still be in decimal that they want in hex. Specially for arm. I made my best guess. Any tweaks from here should be simple. I also did the best I know now with help from the C++ gurus creating the cleanest formatImm() utility function and containing the changes. But if someone has a better idea to make something cleaner I'm all ears and game for changing the implementation. rdar://8109283 llvm-svn: 169393	2012-12-05 18:13:19 +00:00
Pedro Artigas	41b98843e8	- Added calls to doInitialization/doFinalization to immutable passes - fixed ordering of calls to doFinalization to be the reverse of the pass run order due to potential dependencies - fixed machine module info to operate in the doInitialization/doFinalization model, also fixes some FIXMEs reviewed by Evan Cheng <evan.cheng@apple.com> llvm-svn: 169391	2012-12-05 17:12:22 +00:00
Andrew Trick	510e606e19	RegisterPressure API. Add support for physical register units. At build-time register pressure was always computed in terms of register units. But the compile-time API was expressed in terms of register classes because it was intended for virtual registers (and physical register units weren't yet used anywhere in codegen). Now that the codegen uses physreg units consistently, prepare for tracking register pressure also in terms of live units, not live registers. llvm-svn: 169360	2012-12-05 06:47:12 +00:00
Andrew Trick	d52ab339cb	Added RegisterPressureTracker::dump() for debugging. llvm-svn: 169359	2012-12-05 06:47:08 +00:00
Andrew Trick	73b8a8a5e4	Comment formatting. llvm-svn: 169358	2012-12-05 06:47:06 +00:00
Michael J. Spencer	41ee041d4f	Copy clang/Driver/<Option parsing stuff> to llvm. llvm-svn: 169344	2012-12-05 00:29:32 +00:00
Bill Wendling	d7767125d5	Use the 'count' attribute to calculate the upper bound of an array. The count attribute is more accurate with regards to the size of an array. It also obviates the upper bound attribute in the subrange. We can also better handle an unbound array by setting the count to -1 instead of the lower bound to 1 and upper bound to 0. llvm-svn: 169312	2012-12-04 21:34:03 +00:00
Eli Bendersky	57e09e87a4	Remove a URL from code llvm-svn: 169293	2012-12-04 19:08:43 +00:00
Eli Bendersky	abe546368b	Make NaCl naming consistent. The triple OSType is called NaCl and is represented textually as NativeClient. Also added a link to the native client project for readers unfamiliar with it. A Clang patch will follow shortly. llvm-svn: 169291	2012-12-04 18:37:26 +00:00
Duncan Sands	ab6db722ce	Fix comment typo. llvm-svn: 169282	2012-12-04 16:36:05 +00:00
Bill Schmidt	ca4a0c9dbd	This patch introduces initial-exec model support for thread-local storage on 64-bit PowerPC ELF. The patch includes code to handle external assembly and MC output with the integrated assembler. It intentionally does not support the "old" JIT. For the initial-exec TLS model, the ABI requires the following to calculate the address of external thread-local variable x: Code sequence Relocation Symbol ld 9,x@got@tprel(2) R_PPC64_GOT_TPREL16_DS x add 9,9,x@tls R_PPC64_TLS x The register 9 is arbitrary here. The linker will replace x@got@tprel with the offset relative to the thread pointer to the generated GOT entry for symbol x. It will replace x@tls with the thread-pointer register (13). The two test cases verify correct assembly output and relocation output as just described. PowerPC-specific selection node variants are added for the two instructions above: LD_GOT_TPREL and ADD_TLS. These are inserted when an initial-exec global variable is encountered by PPCTargetLowering::LowerGlobalTLSAddress(), and later lowered to machine instructions LDgotTPREL and ADD8TLS. LDgotTPREL is a pseudo that uses the same LDrs support added for medium code model's LDtocL, with a different relocation type. The rest of the processing is straightforward. llvm-svn: 169281	2012-12-04 16:18:08 +00:00
Bill Wendling	bfc0e5725f	Add a 'count' field to the DWARF subrange. The count field is necessary because there isn't a difference between the 'lo' and 'hi' attributes for a one-element array and a zero-element array. When the count is '0', we know that this is a zero-element array. When it's >=1, then it's a normal constant sized array. When it's -1, then the array is unbounded. llvm-svn: 169218	2012-12-04 06:20:49 +00:00
Bill Wendling	36ed3024ed	Add a 'getCount' method to get the number of elements in the subrange. llvm-svn: 169215	2012-12-04 06:12:44 +00:00
Matt Beaumont-Gay	abfc446063	Add 'using' declarations to suppress -Woverloaded-virtual warnings. llvm-svn: 169214	2012-12-04 05:41:27 +00:00
Manman Ren	f563941adc	Stack Alignment: when creating stack objects in MachineFrameInfo, make sure the alignment is clamped to TargetFrameLowering.getStackAlignment if the target does not support stack realignment or the option "realign-stack" is off. This will cause miscompile if the address is treated as aligned and add is replaced with or in DAGCombine. Added a bool StackRealignable to TargetFrameLowering to check whether stack realignment is implemented for the target. Also added a bool RealignOption to MachineFrameInfo to check whether the option "realign-stack" is on. rdar://12713765 llvm-svn: 169197	2012-12-04 00:52:33 +00:00
Jakob Stoklund Olesen	a32d85b39d	Remove the old TRI::ResolveRegAllocHint() and getRawAllocationOrder() hooks. These functions have been replaced by TRI::getRegAllocationHints() which provides the same capabilities. llvm-svn: 169192	2012-12-04 00:46:13 +00:00
Jakob Stoklund Olesen	084665fa6d	Remove VirtRegMap::getRegAllocPref(). Now that there can be multiple hint registers from targets, it doesn't make sense to have a function that returns 'the' preferred register. llvm-svn: 169190	2012-12-04 00:35:59 +00:00
Jakob Stoklund Olesen	1dd82dd3fc	Use MRI::getSimpleHint() instead of getRegAllocPref() in remaining cases. Targets can provide multiple hints now, so getRegAllocPref() doesn't make sense any longer because it only returns one preferred register. Replace it with getSimpleHint() in the remaining heuristics. This function only llvm-svn: 169188	2012-12-04 00:30:22 +00:00
Manman Ren	26c73f93e0	Stack Alignment: move functions from header file MachineFrameInfo.h. No functional change for this commit. The follow-up patch will add more stuff to these functions. rdar://12713765 llvm-svn: 169186	2012-12-04 00:26:44 +00:00
Jakob Stoklund Olesen	74052b041b	Add VirtRegMap::hasKnownPreference(). Virtual registers with a known preferred register are prioritized by RAGreedy. This function makes the condition explicit without depending on getRegAllocPref(). llvm-svn: 169179	2012-12-03 23:23:50 +00:00
Nadav Rotem	d479a57f68	minor renaming, documentation and cleanups. llvm-svn: 169175	2012-12-03 22:57:09 +00:00
Michael J. Spencer	7fe24f5744	[Support] Make FileOutputBuffer work on Windows. llvm-svn: 169167	2012-12-03 22:09:52 +00:00
Pedro Artigas	e4348b0412	moves doInitialization and doFinalization to the Pass class and removes some unreachable code in MachineModuleInfo reviewed by Evan Cheng <evan.cheng@apple.com> llvm-svn: 169164	2012-12-03 21:56:57 +00:00
Argyrios Kyrtzidis	8c114534ff	Add a getMemorySize() function for DenseSet. llvm-svn: 169163	2012-12-03 21:46:21 +00:00
Jakob Stoklund Olesen	499cac486a	Add a new hook for providing register allocator hints more flexibly. The TargetRegisterInfo::getRegAllocationHints() function is going to replace the existing mechanisms for providing target-dependent hints to the register allocator: ResolveRegAllocHint() and getRawAllocationOrder(). The new hook is more flexible because it allows the target to provide multiple preferred candidate registers for each virtual register, and it is easier to use because targets are not required to return a reference to a constant array like getRawAllocationOrder(). An optional VirtRegMap argument can be used to provide target-dependent hints that depend on the provisional assignments of other virtual registers. llvm-svn: 169154	2012-12-03 21:17:00 +00:00
Argyrios Kyrtzidis	479d37ed63	Eliminate redundant bitwise operations when using a llvm/ADT/PointerUnion. For comparison, with this code sample: PointerUnion<int , char > Data; PointerUnion<int , char > foo1() { Data = new int; return new int; } PointerUnion<int , char > foo2() { Data = new char; return new char; } Before this patch we would get: define i64 @_Z4foo1v() uwtable ssp { %1 = tail call noalias i8* @_Znwm(i64 4) %2 = ptrtoint i8* %1 to i64 %3 = load i64* getelementptr inbounds (%"class.llvm::PointerUnion"* @Data, i64 0, i32 0, i32 0), align 8 %4 = and i64 %3, 1 %.masked.i = and i64 %2, -3 %5 = or i64 %4, %.masked.i store i64 %5, i64* getelementptr inbounds (%"class.llvm::PointerUnion"* @Data, i64 0, i32 0, i32 0), align 8 %6 = tail call noalias i8* @_Znwm(i64 4) %7 = ptrtoint i8* %6 to i64 %8 = and i64 %7, -3 ret i64 %8 } define i64 @_Z4foo2v() uwtable ssp { %1 = tail call noalias i8* @_Znwm(i64 1) %2 = ptrtoint i8* %1 to i64 %3 = load i64* getelementptr inbounds (%"class.llvm::PointerUnion"* @Data, i64 0, i32 0, i32 0), align 8 %4 = and i64 %3, 1 %5 = or i64 %2, %4 %6 = or i64 %5, 2 store i64 %6, i64* getelementptr inbounds (%"class.llvm::PointerUnion"* @Data, i64 0, i32 0, i32 0), align 8 %7 = tail call noalias i8* @_Znwm(i64 1) %8 = ptrtoint i8* %7 to i64 %9 = or i64 %8, 2 ret i64 %9 } After the patch: define i64 @_Z4foo1v() uwtable ssp { %1 = tail call noalias i8* @_Znwm(i64 4) %2 = ptrtoint i8* %1 to i64 store i64 %2, i64* getelementptr inbounds (%"class.llvm::PointerUnion"* @Data, i64 0, i32 0, i32 0), align 8 %3 = tail call noalias i8* @_Znwm(i64 4) %4 = ptrtoint i8* %3 to i64 ret i64 %4 } declare noalias i8* @_Znwm(i64) define i64 @_Z4foo2v() uwtable ssp { %1 = tail call noalias i8* @_Znwm(i64 1) %2 = ptrtoint i8* %1 to i64 %3 = or i64 %2, 2 store i64 %3, i64* getelementptr inbounds (%"class.llvm::PointerUnion"* @Data, i64 0, i32 0, i32 0), align 8 %4 = tail call noalias i8* @_Znwm(i64 1) %5 = ptrtoint i8* %4 to i64 %6 = or i64 %5, 2 ret i64 %6 } llvm-svn: 169147	2012-12-03 19:59:23 +00:00
Bill Wendling	7b246c3872	Add 'getInt64Field()' method to get the signed integer instead of unsigned. llvm-svn: 169145	2012-12-03 19:44:25 +00:00
Alexey Samsonov	ef51c3ff81	ASan: add blacklist file to ASan pass options. Clang patch for this will follow. llvm-svn: 169143	2012-12-03 19:09:26 +00:00
Chandler Carruth	a79a28b7a8	Sort the #include lines for the include/... tree with the script. AKA: Recompile ALL the source code! This one went much better. No manual edits here. I spot-checked for silliness and grep-checked for really broken edits and everything seemed good. It all still compiles. Yell if you see something that looks goofy. llvm-svn: 169133	2012-12-03 17:02:12 +00:00
Chandler Carruth	ed0881b2a6	Use the new script to sort the includes of every file under lib. Sooooo many of these had incorrect or strange main module includes. I have manually inspected all of these, and fixed the main module include to be the nearest plausible thing I could find. If you own or care about any of these source files, I encourage you to take some time and check that these edits were sensible. I can't have broken anything (I strictly added headers, and reordered them, never removed), but they may not be the headers you'd really like to identify as containing the API being implemented. Many forward declarations and missing includes were added to a header files to allow them to parse cleanly when included first. The main module rule does in fact have its merits. =] llvm-svn: 169131	2012-12-03 16:50:05 +00:00
James Molloy	e901b5fda2	Remove bugzilla link. llvm-svn: 169091	2012-12-01 14:44:23 +00:00
Andrew Trick	b767d1eba8	misched: Fix RegisterPressureTracker handling of DebugVals. Assertion failed: (TopRPTracker.getPos() == RegionBegin && "bad initial Top tracker"). rdar://12790302. llvm-svn: 169072	2012-12-01 01:22:49 +00:00
Bill Wendling	c786b31233	Replace r168930 with a more reasonable patch. The original patch removed a bunch of code that the SjLjEHPrepare pass placed into the entry block if all of the landing pads were removed during the CodeGenPrepare class. The more natural way of doing things is to run the CGP before we run the SjLjEHPrepare pass. Make it so! llvm-svn: 169044	2012-11-30 22:08:55 +00:00
Chandler Carruth	f12e3a67db	Switch LLVM_USE_RVALUE_REFERENCES to LLVM_HAS_RVALUE_REFERENCES. Rationale: 1) This was the name in the comment block. ;] 2) It matches Clang's __has_feature naming convention. 3) It matches other compiler-feature-test conventions. Sorry for the noise. =] I've also switch the comment block to use a \brief tag and not duplicate the name. llvm-svn: 168996	2012-11-30 11:45:22 +00:00
Chandler Carruth	9c7462a8b8	Separate out the tests for whether the compiler suports R-value references from whether it supports an R-value reference *this. No version of GCC today supports the latter, which breaks GCC C++11 compiles of LLVM and Clang now. Also add doxygen comments clarifying what's going on here, and update the usage in Optional. I'll update the usages in Clang next. llvm-svn: 168993	2012-11-30 11:04:18 +00:00
Patrik Hagglund	086ee1ee50	More strict error checking in parseSpecifier + simplified code. For example, don't allow empty strings to be passed to getInt. Move asserts inside parseSpecifier. (One day we may want to pass parse error messages to the user - from LLParser - instead of using asserts, but keep the code simple until then. There have been an attempt to do this. See r142288, which got reverted, and r142605.) llvm-svn: 168991	2012-11-30 10:06:59 +00:00
Eric Christopher	3c23009117	Add the rest of the experimental fission sections to MC. llvm-svn: 168986	2012-11-30 06:47:06 +00:00
Chandler Carruth	dbd6958183	Move the InstVisitor utility into VMCore where it belongs. It heavily depends on the IR infrastructure, there is no sense in it being off in Support land. This is in preparation to start working to expand InstVisitor into more special-purpose visitors that are still generic and can be re-used across different passes. The expansion will go into the Analylis tree though as nothing in VMCore needs it. llvm-svn: 168972	2012-11-30 03:08:41 +00:00
Jordan Rose	142e56d157	Add a new C++11 compatibility macro, LLVM_LVALUE_FUNCTION. This expands to '&', and is intended to be used when an /optional/ rvalue override is available. Before: void foo() const { ... } After: void foo() const LLVM_LVALUE_FUNCTION { ... } void foo() && { ... } This is used to allow moving the contents of an Optional. llvm-svn: 168963	2012-11-30 00:38:53 +00:00
Dan Gohman	913c96da43	Update comment for malloc being a library call now, rather than an instruction. llvm-svn: 168946	2012-11-29 21:58:47 +00:00
Michael Ilseman	05d3bf77a1	copyFastMathFlags utility and test case llvm-svn: 168943	2012-11-29 21:25:12 +00:00
Chad Rosier	0987dd1a48	Whitespace. llvm-svn: 168937	2012-11-29 20:58:08 +00:00
Chad Rosier	c19b0695ba	Fix 80-column violations. llvm-svn: 168936	2012-11-29 20:56:58 +00:00
Shuxin Yang	abcc370423	rdar://12100355 (part 1) This revision attempts to recognize following population-count pattern: while(a) { c++; ... ; a &= a - 1; ... }, where <c> and <a>could be used multiple times in the loop body. TODO: On X8664 and ARM, __buildin_ctpop() are not expanded to a efficent instruction sequence, which need to be improved in the following commits. Reviewed by Nadav, really appreciate! llvm-svn: 168931	2012-11-29 19:38:54 +00:00
Jim Grosbach	aae0a4bd87	Fix a memory leak in MachOObjectFile. MachOObjectFile owns a MachOObj, but never frees it. Both MachOObjectFile and MachOObj want to own the MemoryBuffer, though, so we have to be careful and give them each one of their own. Thanks to Greg Clayton, Eric Christopher and Michael Spencer for helping figure out what's going wrong here. rdar://12561773 llvm-svn: 168923	2012-11-29 19:14:11 +00:00
Alexey Samsonov	df6245233c	Add options to AddressSanitizer passes to make them configurable by frontend. llvm-svn: 168910	2012-11-29 18:14:24 +00:00
Pedro Artigas	d6b092bbd5	One more step towards making doInitialization and doFinalization useful for start up and clean up module passes, now that ASAN and TSAN are fixed the tests pass llvm-svn: 168905	2012-11-29 17:47:05 +00:00
Justin Holewinski	bc45119b44	Allow targets to prefer TypeSplitVector over TypePromoteInteger when computing the legalization method for vectors For some targets, it is desirable to prefer scalarizing <N x i1> instead of promoting to a larger legal type, such as <N x i32>. llvm-svn: 168882	2012-11-29 14:26:24 +00:00
Evgeniy Stepanov	d4bd7b73e3	Initial commit of MemorySanitizer. Compiler pass only. llvm-svn: 168866	2012-11-29 09:57:20 +00:00
Jakob Stoklund Olesen	bdb55e0c59	Use MCPhysReg for RegisterClassInfo allocation orders. This saves a bit of memory. llvm-svn: 168852	2012-11-29 03:34:17 +00:00
Jakob Stoklund Olesen	7afe1663e9	Add an MCPhysReg typedef to replace naked uint16_t. Use this type for arrays of physical registers. llvm-svn: 168850	2012-11-29 02:39:28 +00:00
Shuxin Yang	01ab5d718b	Instruction::isAssociative() returns true for fmul/fadd if they are tagged "unsafe" mode. Approved by: Eli and Michael. llvm-svn: 168848	2012-11-29 01:47:31 +00:00
Michael Ilseman	be6871db58	Fast-math: Extend IRBuilder to have settable FastMathFlags to create instructions with Also extended IRBuilder's documentation to mention the convenience state for DefaultFPMathTag and FastMathFlags that can be set. llvm-svn: 168812	2012-11-28 21:16:19 +00:00
Michael Ilseman	8e5db2d07a	Fast-math comments and convenience method llvm-svn: 168811	2012-11-28 21:11:25 +00:00
Jakob Stoklund Olesen	26c9d70d28	Make the LiveRegMatrix analysis available to targets. No functional change, just moved header files. Targets can inject custom passes between register allocation and rewriting. This makes it possible to tweak the register allocation before rewriting, using the full global interference checking available from LiveRegMatrix. llvm-svn: 168806	2012-11-28 19:13:06 +00:00
Eli Bendersky	10f22d7054	Add backreference matching capabilities to Support/Regex, with appropriate unit tests. This change in itself is not expected to affect any functionality at this point, but it will serve as a stepping stone to improve FileCheck's variable matching capabilities. Luckily, our regex implementation already supports backreferences, although a bit of hacking is required to enable it. It supports both Basic Regular Expressions (BREs) and Extended Regular Expressions (EREs), without supporting backrefs for EREs, following POSIX strictly in this respect. And EREs is what we actually use (rightly). This is contrary to many implementations (including the default on Linux) of POSIX regexes, that do allow backrefs in EREs. Adding backref support to our EREs is a very simple change in the regcomp parsing code. I fail to think of significant cases where it would clash with existing things, and can bring more versatility to the regexes we write. There's always the danger of a backref in a specially crafted regex causing exponential matching times, but since we mainly use them for testing purposes I don't think it's a big problem. [it can also be placed behind a flag specific to FileCheck, if needed]. For more details, see: * http://lists.cs.uiuc.edu/pipermail/llvmdev/2012-November/055840.html * http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20121126/156878.html llvm-svn: 168802	2012-11-28 19:00:02 +00:00
Kostya Serebryany	dfe9e7933e	[asan] Split AddressSanitizer into two passes (FunctionPass, ModulePass), LLVM part. This requires a clang part which will follow. llvm-svn: 168781	2012-11-28 10:31:36 +00:00
Bill Wendling	706d3d66e9	Add back support for reading and parsing 'deplibs'. This is for backwards compatibility for pre-3.x bc files. The code reads the code, but does nothing with it. llvm-svn: 168779	2012-11-28 08:41:48 +00:00
Andrew Trick	48d392e81e	misched: Analysis that partitions the DAG into subtrees. This is a simple, cheap infrastructure for analyzing the shape of a DAG. It recognizes uniform DAGs that take the shape of bottom-up subtrees, such as the included matrix multiplication example. This is useful for heuristics that balance register pressure with ILP. Two canonical expressions of the heuristic are implemented in scheduling modes: -misched-ilpmin and -misched-ilpmax. llvm-svn: 168773	2012-11-28 05:13:28 +00:00
Andrew Trick	cd1c2f9fb1	misched: rename ScheduleDAGILP to ScheduleDFS to prepare for other heuristics. llvm-svn: 168772	2012-11-28 05:13:24 +00:00
Eric Christopher	c3b434b76c	Add brief support for the fission .debug_info.dwo section for ELF output. llvm-svn: 168764	2012-11-28 02:49:38 +00:00
Eric Christopher	038e22ca3f	Rearrange ordering of sections. llvm-svn: 168762	2012-11-28 02:49:32 +00:00
Eric Christopher	4d33f67217	Move and comment accessor routines. llvm-svn: 168761	2012-11-28 02:49:28 +00:00
Jakob Stoklund Olesen	9de596e650	Remove all references to TargetInstrInfoImpl. This class has been merged into its super-class TargetInstrInfo. llvm-svn: 168760	2012-11-28 02:35:17 +00:00
Jakob Stoklund Olesen	c351aed4b1	Move the guts of TargetInstrInfoImpl into the TargetInstrInfo class. The *Impl class no longer serves a purpose now that the super-class implementation is in CodeGen. llvm-svn: 168759	2012-11-28 02:35:13 +00:00
Andrew Kaylor	41fadd56b8	Fix comment formatting in RuntimeDyld.h llvm-svn: 168739	2012-11-27 22:53:57 +00:00
Andrew Kaylor	ab5ba51a6e	Moving SectionMemoryManager into RuntimeDyld and adding unit tests for it. The SectionMemoryManager now supports (and requires) applying section-specific page permissions. Clients using this memory manager must call either MCJIT::finalizeObject() or SectionMemoryManager::applyPermissions() before executing JITed code. See r168718 for changes from the previous implementation. llvm-svn: 168721	2012-11-27 19:42:02 +00:00
Pedro Artigas	0ee1b50949	Test commit only modifying comments llvm-svn: 168709	2012-11-27 17:39:20 +00:00
Bill Schmidt	34627e3434	This patch implements medium code model support for 64-bit PowerPC. The default for 64-bit PowerPC is small code model, in which TOC entries must be addressable using a 16-bit offset from the TOC pointer. Additionally, only TOC entries are addressed via the TOC pointer. With medium code model, TOC entries and data sections can all be addressed via the TOC pointer using a 32-bit offset. Cooperation with the linker allows 16-bit offsets to be used when these are sufficient, reducing the number of extra instructions that need to be executed. Medium code model also does not generate explicit TOC entries in ".section toc" for variables that are wholly internal to the compilation unit. Consider a load of an external 4-byte integer. With small code model, the compiler generates: ld 3, .LC1@toc(2) lwz 4, 0(3) .section .toc,"aw",@progbits .LC1: .tc ei[TC],ei With medium model, it instead generates: addis 3, 2, .LC1@toc@ha ld 3, .LC1@toc@l(3) lwz 4, 0(3) .section .toc,"aw",@progbits .LC1: .tc ei[TC],ei Here .LC1@toc@ha is a relocation requesting the upper 16 bits of the 32-bit offset of ei's TOC entry from the TOC base pointer. Similarly, .LC1@toc@l is a relocation requesting the lower 16 bits. Note that if the linker determines that ei's TOC entry is within a 16-bit offset of the TOC base pointer, it will replace the "addis" with a "nop", and replace the "ld" with the identical "ld" instruction from the small code model example. Consider next a load of a function-scope static integer. For small code model, the compiler generates: ld 3, .LC1@toc(2) lwz 4, 0(3) .section .toc,"aw",@progbits .LC1: .tc test_fn_static.si[TC],test_fn_static.si .type test_fn_static.si,@object .local test_fn_static.si .comm test_fn_static.si,4,4 For medium code model, the compiler generates: addis 3, 2, test_fn_static.si@toc@ha addi 3, 3, test_fn_static.si@toc@l lwz 4, 0(3) .type test_fn_static.si,@object .local test_fn_static.si .comm test_fn_static.si,4,4 Again, the linker may replace the "addis" with a "nop", calculating only a 16-bit offset when this is sufficient. Note that it would be more efficient for the compiler to generate: addis 3, 2, test_fn_static.si@toc@ha lwz 4, test_fn_static.si@toc@l(3) The current patch does not perform this optimization yet. This will be addressed as a peephole optimization in a later patch. For the moment, the default code model for 64-bit PowerPC will remain the small code model. We plan to eventually change the default to medium code model, which matches current upstream GCC behavior. Note that the different code models are ABI-compatible, so code compiled with different models will be linked and execute correctly. I've tested the regression suite and the application/benchmark test suite in two ways: Once with the patch as submitted here, and once with additional logic to force medium code model as the default. The tests all compile cleanly, with one exception. The mandel-2 application test fails due to an unrelated ABI compatibility with passing complex numbers. It just so happens that small code model was incredibly lucky, in that temporary values in floating-point registers held the expected values needed by the external library routine that was called incorrectly. My current thought is to correct the ABI problems with _Complex before making medium code model the default, to avoid introducing this "regression." Here are a few comments on how the patch works, since the selection code can be difficult to follow: The existing logic for small code model defines three pseudo-instructions: LDtoc for most uses, LDtocJTI for jump table addresses, and LDtocCPT for constant pool addresses. These are expanded by SelectCodeCommon(). The pseudo-instruction approach doesn't work for medium code model, because we need to generate two instructions when we match the same pattern. Instead, new logic in PPCDAGToDAGISel::Select() intercepts the TOC_ENTRY node for medium code model, and generates an ADDIStocHA followed by either a LDtocL or an ADDItocL. These new node types correspond naturally to the sequences described above. The addis/ld sequence is generated for the following cases: * Jump table addresses * Function addresses * External global variables * Tentative definitions of global variables (common linkage) The addis/addi sequence is generated for the following cases: * Constant pool entries * File-scope static global variables * Function-scope static variables Expanding to the two-instruction sequences at select time exposes the instructions to subsequent optimization, particularly scheduling. The rest of the processing occurs at assembly time, in PPCAsmPrinter::EmitInstruction. Each of the instructions is converted to a "real" PowerPC instruction. When a TOC entry needs to be created, this is done here in the same manner as for the existing LDtoc, LDtocJTI, and LDtocCPT pseudo-instructions (I factored out a new routine to handle this). I had originally thought that if a TOC entry was needed for LDtocL or ADDItocL, it would already have been generated for the previous ADDIStocHA. However, at higher optimization levels, the ADDIStocHA may appear in a different block, which may be assembled textually following the block containing the LDtocL or ADDItocL. So it is necessary to include the possibility of creating a new TOC entry for those two instructions. Note that for LDtocL, we generate a new form of LD called LDrs. This allows specifying the @toc@l relocation for the offset field of the LD instruction (i.e., the offset is replaced by a SymbolLo relocation). When the peephole optimization described above is added, we will need to do similar things for all immediate-form load and store operations. The seven "mcm-n.ll" test cases are kept separate because otherwise the intermingling of various TOC entries and so forth makes the tests fragile and hard to understand. The above assumes use of an external assembler. For use of the integrated assembler, new relocations are added and used by PPCELFObjectWriter. Testing is done with "mcm-obj.ll", which tests for proper generation of the various relocations for the same sequences tested with the external assembler. llvm-svn: 168708	2012-11-27 17:35:46 +00:00
Bill Wendling	ee5984df39	Remove the dependent libraries feature. The dependent libraries feature was never used and has bit-rotted. Remove it. llvm-svn: 168694	2012-11-27 09:55:56 +00:00
Craig Topper	b9773650ec	Make PrintReg constructor explicit to prevent weird implicit conversions from accidentally being triggered. llvm-svn: 168686	2012-11-27 08:14:24 +00:00
NAKAMURA Takumi	2e4a30709d	llvm/CodeGen: Remove empty files in r168659. llvm-svn: 168663	2012-11-27 01:21:50 +00:00
Jakub Staszak	0820b2a360	Remove unused MachineLoopRanges analysis. llvm-svn: 168659	2012-11-27 01:14:34 +00:00
Owen Anderson	1db12f5135	Revert r168635 "Step towards implementation of pass manager with doInitialization and doFinalization per module detangled from runOn?? calls, still has temporary code not to break ASAN to be removed when that pass conforms to the proposed model". It appears to have broken at least one buildbot. llvm-svn: 168654	2012-11-27 00:53:24 +00:00
Michael Ilseman	be9137a5c5	Fast-math optimization: fold multiply by zero Added in first optimization using fast-math flags to serve as an example for following optimizations. SimplifyInstruction will now try to optimize an fmul observing its FastMathFlags to see if it can fold multiply by zero when 'nnan' and 'nsz' flags are set. llvm-svn: 168648	2012-11-27 00:46:26 +00:00
Michael Ilseman	9978d7e9e5	Fast-math flags for the bitcode Added in bitcode enum for the serializing of fast-math flags. Added in the reading/writing of fast-math flags from the OptimizationFlags record for BinaryOps. llvm-svn: 168646	2012-11-27 00:43:38 +00:00
Michael Ilseman	149209ec4c	Fast-math interfaces for Instructions Add in getter/setter methods for Instructions, allowing them to be the interface to FPMathOperator similarly to now NUS/NSW is handled. llvm-svn: 168642	2012-11-27 00:41:22 +00:00

... 5 6 7 8 9 ...

17478 Commits