llvm-project

Commit Graph

Author	SHA1	Message	Date
Chandler Carruth	5394e11a55	Switch the empty and tombstone key enumerators to not have explicit values -- that's not required to fix the bug that was cropping up, and the values selected made the enumeration's underlying type signed and introduced some warnings. This fixes the -Werror build. The underlying issue here was that the DenseMapInfo was casting values completely outside the range of the underlying storage of the enumeration to the enumeration's type. GCC went and "optimized" that into infloops and other misbehavior. By providing designated special values for these keys in the dense map, we ensure they are indeed representable and that they won't be used for anything else. It might be better to reuse None for the empty key and have the tombstone share the value of the sentinel enumerator, but honestly having 2 extra enumerators seemed not to matter and this seems a bit simpler. I'll let Bill shuffle this around (or ask me to shuffle it around) if he prefers it to look a different way. I also made the switch a bit more clear (and produce a better assert) that the enumerators are never going to show up and are errors if they do. llvm-svn: 171614	2013-01-05 08:47:26 +00:00
NAKAMURA Takumi	c91006f741	IR/Attributes: Provide EmptyKey and TombstoneKey in part of enum, as workaround for gcc-4.4 take #2 . I will investigate, later, what was wrong. I am too tired for now. llvm-svn: 171611	2013-01-05 07:55:47 +00:00
David Blaikie	800a916f99	Emit DW_TAG_formal_parameter for unnamed parameters. This change essentially reverts r87069 which came without a test case. It causes no regressions in the GDB 7.5 test suite & fixes 25 xfails (commit to the test suite to follow). If anyone can present a test case that demonstrates why this check is necessary I'd be happy to account for it in one way or another. llvm-svn: 171609	2013-01-05 07:43:02 +00:00
Craig Topper	92a70b1e65	Recommit r171461 which was incorrectly reverted. Mark DIV/IDIV instructions hasSideEffects=1 because they can trap when dividing by 0. This is needed to keep early if conversion from moving them across basic blocks. llvm-svn: 171608	2013-01-05 07:39:25 +00:00
Nadav Rotem	478b6a47ec	Revert revision 171524. Original message: URL: http://llvm.org/viewvc/llvm-project?rev=171524&view=rev Log: The current Intel Atom microarchitecture has a feature whereby when a function returns early then it is slightly faster to execute a sequence of NOP instructions to wait until the return address is ready, as opposed to simply stalling on the ret instruction until the return address is ready. When compiling for X86 Atom only, this patch will run a pass, called "X86PadShortFunction" which will add NOP instructions where less than four cycles elapse between function entry and return. It includes tests. Patch by Andy Zhang. llvm-svn: 171603	2013-01-05 05:42:48 +00:00
Jakob Stoklund Olesen	dc5285f102	Don't call destructors on MachineInstr and MachineOperand. The series of patches leading up to this one makes llc -O0 run 8% faster. When deallocating a MachineFunction, there is no need to visit all MachineInstr and MachineOperand objects to deallocate them. All their memory come from a BumpPtrAllocator that is about to be purged, and they have empty destructors anyway. This only applies when deallocating the MachineFunction. DeleteMachineInstr() should still be used to recycle MI memory during the codegen passes. Remove the LeakDetector support for MachineInstr. I've never seen it used before, and now it definitely doesn't work. With this patch, leaked MachineInstrs would be much less of a problem since all of their memory will be reclaimed by ~MachineFunction(). llvm-svn: 171599	2013-01-05 05:05:51 +00:00
Jakob Stoklund Olesen	1bfeecb491	Use ArrayRecycler for MachineInstr operand lists. Instead of an std::vector<MachineOperand>, use MachineOperand arrays from an ArrayRecycler living in MachineFunction. This has several advantages: - MachineInstr now has a trivial destructor, making it possible to delete them in batches when destroying MachineFunction. This will be enabled in a later patch. - Bypassing malloc() and free() can be faster, depending on the system library. - MachineInstr objects and their operands are allocated from the same BumpPtrAllocator, so they will usually be next to each other in memory, providing better locality of reference. - Reduce MachineInstr footprint. A std::vector is 24 bytes, the new operand array representation only uses 8+4+1 bytes in MachineInstr. - Better control over operand array reallocations. In the old representation, the use-def chains would be reordered whenever a std::vector reached its capacity. The new implementation never changes the use-def chain order. Note that some decisions in the code generator depend on the use-def chain orders, so this patch may cause different assembly to be produced in a few cases. llvm-svn: 171598	2013-01-05 05:00:09 +00:00
Jakob Stoklund Olesen	fe445cd646	Add MachineRegisterInfo::moveOperands(). This function works like memmove() for MachineOperands, except it also updates any use-def chains containing the moved operands. The use-def chains are updated without affecting the order of operands in the list. That isn't possible when using the removeRegOperandFromUseList() and addRegOperandToUseList() functions. Callers to follow soon. llvm-svn: 171597	2013-01-05 04:38:12 +00:00
Chandler Carruth	4a7c311008	Refactor the ScalarTargetTransformInfo API for querying about the legality of an address mode to not use a struct of four values and instead to accept them as parameters. I'd love to have named parameters here as most callers only care about one or two of these, but the defaults aren't terribly scary to write out. That said, there is no real impact of this as the passes aren't yet using STTI for this and are still relying upon TargetLowering. llvm-svn: 171595	2013-01-05 03:36:17 +00:00
Chandler Carruth	c892591596	Sink the AddressingModeMatcher helper class into an anonymous namespace next to its only user. This helper relies on TargetLowering information that shouldn't be generally used throughout the Transfoms library, and so it made little sense as a generic utility. This also consolidates the file where we need to remove the remaining uses of TargetLowering in favor of the IR-layer abstract interface in TargetTransformInfo. llvm-svn: 171590	2013-01-05 02:09:22 +00:00
Akira Hatanaka	d35a263076	[mips] Fix data layout string. Add 64 to the list of native integer widths and add stack alignment information. llvm-svn: 171587	2013-01-05 02:00:56 +00:00
Bill Wendling	960f52a132	Add a method to create an AttributeSet from an AttrBuilder. The Attribute class is eventually going to represent one attribute. So we need this class to create the set of attributes. Add some iterator methods to the builder to access its internal bits in a nice way. llvm-svn: 171586	2013-01-05 01:36:54 +00:00
Nadav Rotem	e9f5bfd5e9	iLoopVectorize: Non commutative operators can be used as reduction variables as long as the reduction chain is used in the LHS. PR14803. llvm-svn: 171583	2013-01-05 01:15:47 +00:00
Chandler Carruth	b5429f43b8	Eric thought that Darwin was right to use -1 consistently rather than leaving this undefined, and despite the sentence in the standard that seems to require it, I'll cede the point and assume its a bug in the wording. Other parts of POSIX regularly allow for things to be -1 instead of undefined, this should too. Makes things more consistent too. This should have to real impact for folks though. llvm-svn: 171574	2013-01-05 00:42:50 +00:00
Chandler Carruth	2aaec89fd0	Try to suppress the use of clock_gettime on Darwin which apparantly defines _POSIX_CPUTIME but doesn't support the clock_* functions. I don't test the value of _POSIX_CPUTIME because the spec merely says that if it is defined, the CPU-specific timers are available, whereas it says that _POSIX_TIMERS must be defined and defined to a value greater than zero. However, this may not work, as the POSIX spec clearly states: "If the symbolic constant _POSIX_CPUTIME is defined, then the symbolic constant _POSIX_TIMERS shall also be defined by the implementation to have the value 200112L." If this doesn't work, I'll add more hacks for Darwin. llvm-svn: 171565	2013-01-05 00:11:21 +00:00
Chandler Carruth	b79a7aa541	Fix an obvious typo spotted by Reid Kleckner, and breaking windows builds. llvm-svn: 171559	2013-01-04 23:46:04 +00:00
Bill Wendling	cd330348f5	Get rid of the 'Bits' mask in the attribute builder. The bit mask thing will be a thing of the past. It's not extensible enough. Get rid of its use here. Opt instead for using a vector to hold the attributes. Note: Some of this code will become obsolete once the rewrite is further along. llvm-svn: 171553	2013-01-04 23:27:34 +00:00
Chandler Carruth	ef7f968e09	Add time getters to the process interface for requesting the elapsed wall time, user time, and system time since a process started. For walltime, we currently use TimeValue's interface and a global initializer to compute a close approximation of total process runtime. For user time, this adds support for an somewhat more precise timing mechanism -- clock_gettime with the CLOCK_PROCESS_CPUTIME_ID clock selected. For system time, we have to do a full getrusage call to extract the system time from the OS. This is expensive but unavoidable. In passing, clean up the implementation of the old APIs and fix some latent bugs in the Windows code. This might have manifested on Windows ARM systems or other systems with strange 64-bit integer behavior. The old API for this both user time and system time simultaneously from a single getrusage call. While this results in fewer system calls, it also results in a lower precision user time and if only user time is desired, it introduces a higher overhead. It may be worthwhile to switch some of the pass timers to not track system time and directly track user and wall time. The old API also tracked walltime in a confusing way -- it just set it to the current walltime rather than providing any measure of wall time since the process started the way buth user and system time are tracked. The new API is more consistent here. The plan is to eventually implement these methods for a child process by using the wait3(2) system call to populate an rusage struct representing the whole subprocess execution. That way, after waiting on a child process its stats will become accurate and cheap to query. llvm-svn: 171551	2013-01-04 23:19:55 +00:00
Jakub Staszak	43fafaf496	Move 'break' to the right place to prevent fallthru. There is no test-case because conditions in the next case prevented from doing anything nasty. llvm-svn: 171549	2013-01-04 23:01:26 +00:00
Jakob Stoklund Olesen	7f92b7ad0a	Move an assertion so it doesn't dereference end(). The R600 target has test cases that exercises this code. llvm-svn: 171538	2013-01-04 22:17:31 +00:00
Paul Redmond	874f01e956	Do not vectorize loops with subtraction reductions Since subtraction does not commute the loop vectorizer incorrectly vectorizes reductions such as x = A[i] - x. Disabling for now. llvm-svn: 171537	2013-01-04 22:10:16 +00:00
Eric Christopher	cad9b53c02	Add a name for the anonymous type we're creating for subrange types and a FIXME for what we should be doing. Should solve the immediacy of PR12069 where our debug info is crashing another tool. llvm-svn: 171536	2013-01-04 21:51:53 +00:00
Michael Gottesman	1e00ac6256	Added DEBUG message to ObjCARC when we optimize objc_retain => objc_retainAutorelasedReturnValue. llvm-svn: 171535	2013-01-04 21:30:38 +00:00
Michael Gottesman	9f848aeddd	Fixed up some DEBUG messages where I was putting in the text of a message the method where it was being called when I should have just prefixed the actual message with Pass::Method. Additionally I fixed some whitespace issues. llvm-svn: 171534	2013-01-04 21:29:57 +00:00
Nadav Rotem	93bd30be9b	Fix a warning llvm-svn: 171525	2013-01-04 21:08:44 +00:00
Preston Gurd	e36b685a94	The current Intel Atom microarchitecture has a feature whereby when a function returns early then it is slightly faster to execute a sequence of NOP instructions to wait until the return address is ready, as opposed to simply stalling on the ret instruction until the return address is ready. When compiling for X86 Atom only, this patch will run a pass, called "X86PadShortFunction" which will add NOP instructions where less than four cycles elapse between function entry and return. It includes tests. Patch by Andy Zhang. llvm-svn: 171524	2013-01-04 20:54:54 +00:00
Bill Wendling	9ac69f9d37	General cleanups. * Remove dead methods. * Use the 'operator==' method instead of 'contains', which isn't needed. * Fix some comments. No functionality change. llvm-svn: 171523	2013-01-04 20:54:35 +00:00
Michael J. Spencer	bae14cef80	[Object][ELF] Add a maximum alignment. This is used by createELFObjectFile to create a properly aligned reader. llvm-svn: 171520	2013-01-04 20:36:28 +00:00
Akira Hatanaka	b13b33359b	[mips] MipsTargetLowering::getSetCCResultType should return a vector type if vectors are being compared. llvm-svn: 171517	2013-01-04 20:06:01 +00:00
Akira Hatanaka	e067e5a13f	[mips] 80 columns. llvm-svn: 171515	2013-01-04 19:38:05 +00:00
Nick Kledzik	11964f2a8f	Fix how YAML I/O detects flow sequences. Update test case to verify flow sequence is written as a flow sequence. llvm-svn: 171514	2013-01-04 19:32:00 +00:00
Akira Hatanaka	f412e7501a	[mips] Reorder template parameters. Remove class shift_rotate_imm32 and shift_rotate_imm64. llvm-svn: 171513	2013-01-04 19:25:46 +00:00
Manman Ren	fe5a61edbe	Memory Dependence Analysis: fix a miscompile that uses DT to approxmiate the reachablity. We conservatively approximate the reachability analysis by saying it is not reachable if there is a single path starting from "From" and the path does not reach "To". rdar://12801584 llvm-svn: 171512	2013-01-04 19:19:47 +00:00
Akira Hatanaka	a7a9fa1c16	[mips] Refactor conditional move instructions. llvm-svn: 171511	2013-01-04 19:16:38 +00:00
Akira Hatanaka	e36e2f6876	[mips] Refactor instructions which move data from or to coprocessors. llvm-svn: 171510	2013-01-04 19:13:49 +00:00
Eli Bendersky	46468d2fda	Remove unused #include llvm-svn: 171507	2013-01-04 19:08:43 +00:00
Adhemerval Zanella	9b0b781395	PowerPC: Fix eh_frame relocation for PIC This patch fixes the PPC eh_frame definitions for the personality and frame unwinding for PIC objects. It makes PIC build correctly creates relative relocations in the '.rela.eh_frame' segments and thus avoiding a text relocation that generates a DT_TEXTREL segments in link phase. llvm-svn: 171506	2013-01-04 19:08:13 +00:00
Nadav Rotem	e6bb35435d	Change the default number of registers to prevent unrolling on targets that dont have this hook. llvm-svn: 171489	2013-01-04 18:40:39 +00:00
Eric Christopher	7d1d713f59	Add a space to the end of the line so we don't get "itsbounds" in output. llvm-svn: 171487	2013-01-04 18:30:36 +00:00
Pedro Artigas	3383225167	small fixes to enable the reuse of the pass manager across multiple modules llvm-svn: 171475	2013-01-04 18:04:42 +00:00
Eric Christopher	c0fa867c7b	Add section information for the DWARF5 split debug proposal string offset section. llvm-svn: 171474	2013-01-04 17:59:22 +00:00
Nadav Rotem	be6570d429	Move the loop vectorizer from O2 to O3. It looks like the increase in code size actually hurts the performance on many programs. llvm-svn: 171471	2013-01-04 17:57:44 +00:00
Nadav Rotem	e1d5c4b8b9	LoopVectorizer: 1. Add code to estimate register pressure. 2. Add code to select the unroll factor based on register pressure. 3. Add bits to TargetTransformInfo to provide the number of registers. llvm-svn: 171469	2013-01-04 17:48:25 +00:00
Nadav Rotem	c616a5408a	Revert revision: 171467. This transformation is incorrect and makes some tests fail. Original message: Simplified TRUNCATE operation that comes after SETCC. It is possible since SETCC result is 0 or -1. Added a test. llvm-svn: 171468	2013-01-04 17:35:21 +00:00
Elena Demikhovsky	5f2f06d2d9	Simplified TRUNCATE operation that comes after SETCC. It is possible since SETCC result is 0 or -1. Added a test. llvm-svn: 171467	2013-01-03 08:48:33 +00:00
Michael Gottesman	820aac1c78	Revert "Mark DIV/IDIV instructions hasSideEffects=1 because they can trap when dividing by 0. This is needed to keep early if conversion from moving them across basic blocks." This reverts commit r171461 since it breaks the following tests: Clang :: Analysis/outofbound-notwork.c Clang :: Analysis/string-fail.c Clang :: CXX/basic/basic.lookup/basic.lookup.qual/p6-0x.cpp Clang :: CXX/basic/basic.lookup/basic.lookup.unqual/p15.cpp Clang :: CXX/dcl.dcl/dcl.spec/dcl.fct.spec/p4.cpp Clang :: CXX/dcl.dcl/dcl.spec/dcl.stc/p10.cpp Clang :: CXX/temp/temp.param/p14.cpp Clang :: CXX/temp/temp.res/temp.dep.res/temp.point/p1.cpp Clang :: CodeGen/2009-02-13-zerosize-union-field-ppc.c Clang :: CodeGen/blocks-2.c Clang :: CodeGen/libcalls-d.c Clang :: CodeGen/libcalls-ld.c Clang :: CodeGenCXX/conversion-function.cpp Clang :: CodeGenCXX/debug-info-limit-type.cpp Clang :: CodeGenCXX/inheriting-constructor.cpp Clang :: FixIt/fixit-errors.c Clang :: FixIt/fixit-pmem.cpp Clang :: Modules/namespaces.cpp Clang :: PCH/changed-files.c Clang :: PCH/pr4489.c Clang :: PCH/source-manager-stack.c Clang :: Parser/cxx-ambig-decl-expr-xfail.cpp Clang :: SemaCXX/switch-implicit-fallthrough-cxx98.cpp Clang :: SemaTemplate/instantiate-function-1.mm llvm-svn: 171466	2013-01-03 08:18:30 +00:00
Michael Gottesman	50ae5b28e9	Changed two debug statements that state that a queue had finished being processed when said queue was really a list to state a list had finished being processed. llvm-svn: 171465	2013-01-03 08:09:27 +00:00
Michael Gottesman	ef682c5430	Added DEBUG message for ObjCARC when we zap a push/pop pair in ObjCARCAPElim::OptimizeBB. llvm-svn: 171464	2013-01-03 08:09:17 +00:00
Michael Gottesman	416dc00cad	Added DEBUG message to ObjCARC when we transform objc_initWeak(p, null) => *p = null. llvm-svn: 171463	2013-01-03 07:32:53 +00:00
Michael Gottesman	00d1f966b4	Added DEBUG message for ObjCARC when an inline asm marker is inserted for architectures where this is required to perform a retainAutoreleasedReturnValue optimization. llvm-svn: 171462	2013-01-03 07:32:41 +00:00
Craig Topper	7c27cc9fd0	Mark DIV/IDIV instructions hasSideEffects=1 because they can trap when dividing by 0. This is needed to keep early if conversion from moving them across basic blocks. llvm-svn: 171461	2013-01-03 06:40:20 +00:00
Hal Finkel	95de3f3018	Add a subtype parameter to VTTI::getShuffleCost In order to cost subvector insertion and extraction, we need to know the type of the subvector being extracted. No functionality change. llvm-svn: 171453	2013-01-03 02:34:09 +00:00
Bill Wendling	c0df112941	Revert everything to r171366 to try to fix the build. llvm-svn: 171450	2013-01-03 02:01:50 +00:00
Bill Wendling	04949fa998	Try again to revert the bad patch. The tree was reverted for some unknown reason before the last time. --- Reverse-merging r171442 into '.': U include/llvm/IR/Attributes.h U lib/IR/Attributes.cpp U lib/IR/AttributeImpl.h llvm-svn: 171448	2013-01-03 01:54:39 +00:00
Bill Wendling	40785ae18f	Revert patch. Something snuck in there that shouldn't be. --- Reverse-merging r171441 into '.': U include/llvm/IR/Attributes.h U lib/IR/Attributes.cpp llvm-svn: 171444	2013-01-03 01:46:27 +00:00
Bill Wendling	af9a90cc00	Remove the 'contains' methods in favor of the 'operator==' method. The 'operator==' method is a bit clearer and much less verbose for somethings that should have only one value. Remove from the AttrBuilder for consistency. llvm-svn: 171442	2013-01-03 01:43:05 +00:00
NAKAMURA Takumi	cedab7ecf3	Revert r171427, "An intermediate step in the Attributes rewrite." llvm-svn: 171441	2013-01-03 01:42:06 +00:00
Michael J. Spencer	088925ea96	[MC][COFF] Switch the COFF streamer over to using the MCObjectStreamer version of EmitInstruction. llvm-svn: 171437	2013-01-03 01:09:22 +00:00
Nadav Rotem	72f984b596	LoopVectorizer: Add support for loop-unrolling during vectorization for increasing the ILP. At the moment this feature is disabled by default and this commit should not cause any functional changes. llvm-svn: 171436	2013-01-03 00:52:27 +00:00
Jakob Stoklund Olesen	725d57682b	Fix PR14732 by handling all kinds of IMPLICIT_DEF live ranges. Most IMPLICIT_DEF instructions are removed by the ProcessImplicitDefs pass, and a few are reinserted by PHIElimination when a PHI argument is <undef>. RegisterCoalescer was assuming that all IMPLICIT_DEF live ranges look like those created by PHIElimination, and that their live range never leaves the basic block. The PR14732 test case does tricks with PHI nodes that causes a longer IMPLICIT_DEF live range to appear. This happens very rarely, but RegisterCoalescer should be able to handle it. llvm-svn: 171435	2013-01-03 00:47:51 +00:00
Bill Wendling	0793f4501c	Make the type signature more strict. llvm-svn: 171434	2013-01-03 00:46:43 +00:00
Nadav Rotem	4897392360	Avoid vectorization when the function has the "noimplicitflot" attribute. llvm-svn: 171429	2013-01-02 23:54:43 +00:00
Eric Christopher	da4b2195fc	Extend the dumping infrastructure to deal with additional sections for debug info. These are some of the dwo sections from the DWARF5 split debug info proposal. Update the fission-cu.ll testcase to show what we should be able to dump more of now. Work in progress: Ultimately the relocations will be gone for the dwo section and the strings will be a different form (as well as the rest of the sections will be included). llvm-svn: 171428	2013-01-02 23:52:13 +00:00
Bill Wendling	91055cfbaf	An intermediate step in the Attributes rewrite. Modify the AttrBuilder class to store the attributes as a set instead of as a bit mask. The Attribute class will represent only one attribute instead of a collection of attributes. This is the wave of the future! llvm-svn: 171427	2013-01-02 23:45:09 +00:00
Tom Stellard	567f886eb0	DAGCombiner: Avoid generating illegal vector INT_TO_FP nodes DAGCombiner::reduceBuildVecConvertToConvertBuildVec() was making two mistakes: 1. It was checking the legality of scalar INT_TO_FP nodes and then generating vector nodes. 2. It was passing the result value type to TargetLoweringInfo::getOperationAction() when it should have been passing the value type of the first operand. llvm-svn: 171420	2013-01-02 22:13:01 +00:00
Kevin Enderby	726e0ea6eb	Adds missing aliases for fcom and fcomp instructions without arguments. Patch by Michael M Kuperstein! llvm-svn: 171414	2013-01-02 21:20:15 +00:00
Shuxin Yang	98c844fd89	- Add comment to two functions which might be considered as dead code. - Fix a typo llvm-svn: 171399	2013-01-02 18:26:31 +00:00
Nadav Rotem	761937a757	AVX: Fix a bug in WidenMaskArithmetic. llvm-svn: 171398	2013-01-02 17:41:03 +00:00
Chandler Carruth	db25c6cf8e	Actually update the CMake and Makefile builds correctly, and update the code that includes Intrinsics.gen directly. This never showed up in my testing because the old Intrinsics.gen was still kicking around in the make build system and was correct there. =[ Thankfully, some of the bots to clean rebuilds and that caught this. llvm-svn: 171373	2013-01-02 12:09:16 +00:00
Chandler Carruth	9fb823bbd4	Move all of the header files which are involved in modelling the LLVM IR into their new header subdirectory: include/llvm/IR. This matches the directory structure of lib, and begins to correct a long standing point of file layout clutter in LLVM. There are still more header files to move here, but I wanted to handle them in separate commits to make tracking what files make sense at each layer easier. The only really questionable files here are the target intrinsic tablegen files. But that's a battle I'd rather not fight today. I've updated both CMake and Makefile build systems (I think, and my tests think, but I may have missed something). I've also re-sorted the includes throughout the project. I'll be committing updates to Clang, DragonEgg, and Polly momentarily. llvm-svn: 171366	2013-01-02 11:36:10 +00:00
Chandler Carruth	be81023d74	Resort the #include lines in include/... and lib/... with the utils/sort_includes.py script. Most of these are updating the new R600 target and fixing up a few regressions that have creeped in since the last time I sorted the includes. llvm-svn: 171362	2013-01-02 10:22:59 +00:00
Chandler Carruth	ef860a2488	Rename VMCore directory to IR. Aside from moving the actual files, this patch only updates the build system and the source file comments under lib/... that are relevant. I'll be updating other docs and other files in smaller subsequnet commits. While I've tried to test this, but it is entirely possible that there will still be some build system fallout. Also, note that I've not changed the library name itself: libLLVMCore.a is still the library name. I'd be interested in others' opinions about whether we should rename this as well (I think we should, just not sure what it might break) llvm-svn: 171359	2013-01-02 09:10:48 +00:00
Craig Topper	9791afb182	Merge SSE and AVX instruction definitions for scalar forms of SQRT, RSQRT, and RCP. llvm-svn: 171356	2013-01-02 08:00:39 +00:00
Craig Topper	4bc5c4e152	Merge SSE and AVX instruction definitions for PSHUFD/PSHUFHW/PSHUFLW. llvm-svn: 171355	2013-01-02 07:27:49 +00:00
Rafael Espindola	db1a84c84a	Revert 171351. It broke MC/X86/x86-32-avx.s. llvm-svn: 171352	2013-01-02 01:35:11 +00:00
Craig Topper	86d0cdb82f	Merge SSE and AVX instruction definitions for scalar forms of SQRT, RSQRT, and RCP. llvm-svn: 171351	2013-01-01 20:53:20 +00:00
Benjamin Kramer	614b5e85b9	Add IRBuilder::CreateVectorSplat and use it to simplify code. llvm-svn: 171349	2013-01-01 19:55:16 +00:00
Benjamin Kramer	c003a4521b	SROA: Clean up unused assignment warnings from clang's analyzer. No functionality change. llvm-svn: 171348	2013-01-01 16:13:35 +00:00
Michael Gottesman	c8a11df33b	Added DEBUG message when ObjCARC replaces a call which returns its argument verbatim with its argument to temporarily undo an optimization. Specifically these calls return their argument verbatim, as a low-level optimization. However, this makes high-level optimizations harder. We undo any uses of this optimization that the front-end emitted. We redo them later in the contract pass. llvm-svn: 171346	2013-01-01 16:05:54 +00:00
Michael Gottesman	3f146e204e	Added DEBUG messages to the top of several processing loops in ObjCARC.cpp that emit what instructions are being visited. This is a part of a larger effort of adding DEBUG messages to the ARC Optimizer Backend. llvm-svn: 171345	2013-01-01 16:05:48 +00:00
Craig Topper	12ed9cd6ae	Remove unused argument from a multiclass. llvm-svn: 171340	2013-01-01 03:42:44 +00:00
Craig Topper	2edafc059d	Merge intrinsic instruction definitions for SSE and AVX versions of RCPPS and RSQRTPS. llvm-svn: 171339	2013-01-01 03:30:21 +00:00
Craig Topper	d04dbec6c9	Remove 2 unused multiclasses. llvm-svn: 171338	2013-01-01 02:02:45 +00:00
Craig Topper	7cc4f322cf	Merge AVX/SSE instruction definitions for SQRTPS/PD, RSQRTPS, RCPPS. No funcitonal change intended. llvm-svn: 171337	2013-01-01 00:11:07 +00:00
Craig Topper	c2521cd309	Use packed instead of scalar itineraries for SSE1/2 SQRTPS/PD, RCPPS, and RSQRTPS. VEX-encoded forms already use packed. llvm-svn: 171336	2012-12-31 23:49:05 +00:00
Chandler Carruth	76fbeef95a	Remove an unused method on Program. I'm simplifying this interface as much as I can before merging it with the new process interface. llvm-svn: 171334	2012-12-31 23:44:47 +00:00
Chandler Carruth	db8842f9f3	Remove an unused method on the Program class. llvm-svn: 171332	2012-12-31 23:38:28 +00:00
Chandler Carruth	acd64becb1	Go ahead and get rid of the old page size interface and convert all the users over to the new one. No sense maintaining this "compatibility" layer it seems. llvm-svn: 171331	2012-12-31 23:31:56 +00:00
Chandler Carruth	15dcad9e36	Flesh out a page size accessor in the new API. Implement the old API in terms of the new one. This simplifies the implementation on Windows which can now re-use the self_process's once initialization. llvm-svn: 171330	2012-12-31 23:23:35 +00:00
Chandler Carruth	b12634bf80	Remove an unused function in the old Process interface. llvm-svn: 171327	2012-12-31 22:17:59 +00:00
Nuno Lopes	368c4d0e1b	reimplement GetPointerBaseWithConstantOffset(). The new code is an improved copy of the code I deleted from Analysis/Loads.cpp. One less compute-constant-gep-offset implementation. yay :) llvm-svn: 171326	2012-12-31 20:48:35 +00:00
Nuno Lopes	d896a400f1	recommit r171298 (add support for PHI nodes to ObjectSizeOffsetVisitor). Hopefully with bugs corrected now. llvm-svn: 171325	2012-12-31 20:45:10 +00:00
Benjamin Kramer	af463573cb	Revert "add support for PHI nodes to ObjectSizeOffsetVisitor" This reverts r171298. Breaks clang selfhost. llvm-svn: 171318	2012-12-31 19:51:10 +00:00
Jakub Staszak	c48bbe7170	Add extra CHECK to make sure that 'or' instruction was replaced. Also add an assert to avoid confusion in the code where is known that C1 <= C2. llvm-svn: 171310	2012-12-31 18:26:42 +00:00
Nuno Lopes	4b47f82ac2	revert r171306, since we cannot compare APInts with different bitwidths llvm-svn: 171308	2012-12-31 18:01:36 +00:00
Nuno Lopes	69dcc7deec	use ValueTracking's GetPointerBaseWithConstantOffset() function instead of a local implementation llvm-svn: 171307	2012-12-31 17:42:11 +00:00
Nuno Lopes	556b7de2c0	minor code simplification llvm-svn: 171306	2012-12-31 17:25:24 +00:00
Nuno Lopes	e9d6dbf7a2	add support for GlobalAlias to ObjectSizeOffsetVisitor llvm-svn: 171303	2012-12-31 16:23:48 +00:00
Nuno Lopes	7ab7c02d23	add support for PHI nodes to ObjectSizeOffsetVisitor llvm-svn: 171298	2012-12-31 13:52:36 +00:00
Bill Wendling	e10f76c640	Add some comparison operators to compare the Attribute object with the AttrKind value. llvm-svn: 171294	2012-12-31 11:51:54 +00:00
Chandler Carruth	5473dfb099	Switch this code to a more idiomatic double using namespace directive. Fix a truly odd namespace qualifier that was flat out wrong in the process. The fully qualified namespace would have been llvm::sys::TimeValue, llvm::TimeValue makes no sense. llvm-svn: 171292	2012-12-31 11:45:20 +00:00
Chandler Carruth	304de3c424	Delete a cut/paste-o from r171290. Very sorry about the noise. llvm-svn: 171291	2012-12-31 11:40:04 +00:00
Chandler Carruth	5412246d8b	Suppress a MSVC warning complaining about the code working as intended. llvm-svn: 171290	2012-12-31 11:39:02 +00:00
Chandler Carruth	97683aa2fa	Begin sketching out the process interface. The coding style used here is not LLVM's style because this is modeled after a Boost interface and thus done in the style of a candidate C++ standard library interface. I'll probably end up proposing it as a standard C++ library if it proves to be reasonably portable and useful. This is just the most basic parts of the interface -- getting the process ID out of it. However, it helps sketch out some of the boiler plate such as the base class, derived class, shared code, and static factory function. It also introduces a unittest so that I can incrementally ensure this stuff works. However, I've not even compiled this code for Windows yet. I'll try to fix any Windows fallout from the bots, and if I can't fix it I'll revert and get someone on Windows to help out. There isn't a lot more that is mandatory, so soon I'll switch to just stubbing out the Windows side and get Michael Spencer to help with implementation as he can test it directly. llvm-svn: 171289	2012-12-31 11:17:50 +00:00
Chris Lattner	f5cca68c2c	Fix LICM's memory promotion optimization to preserve TBAA tags when promoting a store in a loop. This was noticed when working on PR14753, but isn't directly related. llvm-svn: 171281	2012-12-31 08:37:17 +00:00
Chris Lattner	eeefe1bc07	teach instcombine to preserve TBAA tag when merging two stores, part of PR14753 llvm-svn: 171279	2012-12-31 08:10:58 +00:00
Jakub Staszak	f584977df2	Grammo. llvm-svn: 171272	2012-12-31 01:40:44 +00:00
Bill Wendling	6e95ae803a	Remove the getAttributesAtIndex and getNumAttrs methods in favor of using the getAttrSomewhere predicate. This prevents the uses of 'Attribute' as a collection of attributes. llvm-svn: 171271	2012-12-31 00:49:59 +00:00
Jakub Staszak	ea2b9b9d67	Transform (A == C1 \|\| A == C2) into (A & ~(C1 ^ C2)) == C1 if C1 and C2 differ only with one bit. Fixes PR14708. llvm-svn: 171270	2012-12-31 00:34:55 +00:00
Hal Finkel	6dbdd4307b	Support ppcf128 in SelectionDAG::getConstantFP Fixes pr14751. Patch by Kai; Thanks! llvm-svn: 171261	2012-12-30 19:03:32 +00:00
Nuno Lopes	b6ad98224a	convert a bunch of callers from DataLayout::getIndexedOffset() to GEP::accumulateConstantOffset(). The later API is nicer than the former, and is correct regarding wrap-around offsets (if anyone cares). There are a few more places left with duplicated code, which I'll remove soon. llvm-svn: 171259	2012-12-30 16:25:48 +00:00
Bill Wendling	749a43d874	Use the predicate methods off of AttributeSet instead of Attribute. llvm-svn: 171257	2012-12-30 13:50:49 +00:00
Bill Wendling	74dba875e2	Remove the Function::getRetAttributes method in favor of using the AttributeSet accessor method. llvm-svn: 171256	2012-12-30 13:01:51 +00:00
Bill Wendling	94dcaf8e2b	Remove Function::getParamAttributes and use the AttributeSet accessor methods instead. llvm-svn: 171255	2012-12-30 12:45:13 +00:00
Bill Wendling	698e84fc4f	Remove the Function::getFnAttributes method in favor of using the AttributeSet directly. This is in preparation for removing the use of the 'Attribute' class as a collection of attributes. That will shift to the AttributeSet class instead. llvm-svn: 171253	2012-12-30 10:32:01 +00:00
Bill Wendling	6190254e0f	s/hasAttribute/contains/g to be more consistent with other method names. llvm-svn: 171252	2012-12-30 09:17:46 +00:00
Nadav Rotem	0b37f14371	LoopVectorizer: Fix a bug in the code that updates the loop exiting block. LCSSA PHIs may have undef values. The vectorizer updates values that are used by outside users such as PHIs. The bug happened because undefs are not loop values. This patch handles these PHIs. PR14725 llvm-svn: 171251	2012-12-30 07:47:00 +00:00
Bill Wendling	3a554ac065	Add a few more c'tors: * One that accepts a single Attribute::AttrKind. * One that accepts an Attribute::AttrKind plus a list of values. This is for attributes defined like this: #1 = attributes { align = 4 } * One that accepts a string, for target-specific attributes like this: #2 = attributes { "cpu=cortex-a8" } llvm-svn: 171249	2012-12-30 02:22:16 +00:00
Bill Wendling	b1d12619c9	Add a few (as yet unused) query methods to determine if the attribute that's stored here is of a certain kind. This is in preparation for when an Attribute object represents a single attribute, instead of a bitmask of attributes. llvm-svn: 171247	2012-12-30 01:38:39 +00:00
Bill Wendling	5e8ff877f4	Uniquify the AttributeImpl based on the Constant pointer, since those are already uniquified. Note: This will be expanded in the future to add more than just one pointer value. llvm-svn: 171245	2012-12-30 01:23:08 +00:00
Bill Wendling	3e4c4c9607	s/Raw/getBitMask/g to be more in line with current naming conventions. This method won't be sticking around. llvm-svn: 171244	2012-12-30 01:05:42 +00:00
Craig Topper	fe82eb6bcd	Remove intrinsic specific instructions for (V)SQRTPS/PD. Instead lower to target-independent ISD nodes and use the existing patterns for those. llvm-svn: 171237	2012-12-29 18:18:20 +00:00
Craig Topper	f4a9c6e21b	Merge similar functionality using a nested switch. llvm-svn: 171229	2012-12-29 17:19:06 +00:00
Craig Topper	6b27251a76	Remove intrinsic specific instructions for SSE/SSE2/AVX floating point max/min instructions. Lower them to target specific nodes and use those patterns instead. This also allows them to be commuted if UnsafeFPMath is enabled. llvm-svn: 171227	2012-12-29 16:44:25 +00:00
Jakub Staszak	215f94143c	Simplify code, no functionality change. llvm-svn: 171226	2012-12-29 15:57:26 +00:00
Jakub Staszak	afe8109fce	Delete executive bit on ./lib/Target/Hexagon/HexagonAsmPrinter.h. llvm-svn: 171225	2012-12-29 15:23:06 +00:00
Bill Wendling	0cd0f7f832	Use a 'Constant' object instead of a bit field to store the attribute data. llvm-svn: 171221	2012-12-29 12:29:38 +00:00
Bill Wendling	4fdde84613	Use the accessor method instead of the raw ivar to get the bits. llvm-svn: 171220	2012-12-29 12:10:46 +00:00
Chandler Carruth	405d681340	Nuke some dead code that snuck in some how. I thought I had already deleted this, but apparantly not. Charmingly, Clang didn't warn on it but GCC did. llvm-svn: 171197	2012-12-28 14:50:51 +00:00
Chandler Carruth	86ed53089f	Fix a stunning oversight in the inline cost analysis. It was never propagating one of the values it simplified to a constant across a myriad of instructions. Notably, ptrtoint instructions when we had a constant pointer (say, 0) didn't propagate that, blocking a massive number of down-stream optimizations. This was uncovered when investigating why we fail to inline and delete the boilerplate in: void f() { std::vector<int> v; v.push_back(1); } It turns out most of the efforts I've made thus far to improve the analysis weren't making it far purely because of this. After this is fixed, the store-to-load forwarding patch enables LLVM to optimize the above to an empty function. We still can't nuke a second push_back, but for different reasons. There is a very real chance this will cause somewhat noticable changes in inlining behavior, so please let me know if you see regressions (or improvements!) because of this patch. llvm-svn: 171196	2012-12-28 14:43:42 +00:00
Chandler Carruth	753e21d057	Teach the inline cost analysis about calls that can be simplified and how to propagate constants through insert and extract value instructions. With the recent improvements to instsimplify, this allows inline cost analysis to constant fold through intrinsic functions, including notably the with.overflow intrinsic math routines which often show up inside of STL abstractions. This is yet another piece in the puzzle of breaking down the code for: void f() { std::vector<int> v; v.push_back(1); } But it still isn't enough. There are a pile of bugs in inline cost still blocking this. llvm-svn: 171195	2012-12-28 14:23:32 +00:00
Chandler Carruth	f6182155f6	Teach instsimplify to use the constant folder where appropriate for constant folding calls. Add the initial tests for this which show that now instsimplify can simplify blindingly obvious code patterns expressed with both intrinsics and library calls. llvm-svn: 171194	2012-12-28 14:23:29 +00:00
Chandler Carruth	9dc3558920	Add entry points to instsimplify for simplifying calls. The entry points are nice and decomposed so that we can simplify synthesized calls as easily as actually call instructions. The internal utility still has the same behavior, it just now operates on a more generic interface so that I can extend the set of call simplifications that instsimplify knows about. llvm-svn: 171189	2012-12-28 11:30:55 +00:00
Alexey Samsonov	3efc87e92d	Add proper support for -fsanitize-blacklist= flag for TSan and MSan. LLVM part. llvm-svn: 171183	2012-12-28 09:30:44 +00:00
Nadav Rotem	9785f519b4	CostModel: initial checkin for code that estimates the cost of special shuffles. llvm-svn: 171180	2012-12-28 08:19:03 +00:00
Nadav Rotem	c982a2dc25	wrap 80-col lines. llvm-svn: 171179	2012-12-28 07:28:43 +00:00
Nadav Rotem	3da9ac72fa	AVX: Move the ZEXT/ANYEXT DAGCo optimizations to the lowering of these optimizations. The old test cases still cover all of these lowering/optimizations. The single change that we have is that now anyext does not need to zero a register, because it does not use the exact code path as the zero_extend. llvm-svn: 171178	2012-12-28 05:45:24 +00:00
Nadav Rotem	68441914a5	Reverse the 'if' condition and reduce the indentation. llvm-svn: 171172	2012-12-27 23:08:05 +00:00
Craig Topper	ab2e6842cc	Merge basic_sse12_fp_binop_p_int and basic_sse12_fp_binop_p_y_int multiclasses. llvm-svn: 171171	2012-12-27 22:53:47 +00:00
Nadav Rotem	3b34190100	AVX/AVX2: Move the SEXT lowering code from a target specific DAGco to a lowering function. llvm-svn: 171170	2012-12-27 22:47:16 +00:00
Craig Topper	e2eec3c52b	Merge basic_sse12_fp_binop_p and basic_sse12_fp_binop_p_y multiclasses. llvm-svn: 171166	2012-12-27 18:51:50 +00:00
Chandler Carruth	e40e60eed5	Make this parameter be named consistently with most other getAnalysisUsage implementations. llvm-svn: 171157	2012-12-27 11:17:15 +00:00
Alexey Samsonov	29dd7f2090	[ASan] Fix lifetime intrinsics handling. Now for each intrinsic we check if it describes one of 'interesting' allocas. Assume that allocas can go through casts and phi-nodes before apperaring as llvm.lifetime arguments llvm-svn: 171153	2012-12-27 08:50:58 +00:00
Nadav Rotem	2a054b4475	On AVX/AVX2 the type v8i1 is legalized to v8i16, which is an XMM sized register. In most cases we actually compare or select YMM-sized registers and mixing the two types creates horrible code. This commit optimizes some of the transition sequences. PR14657. llvm-svn: 171148	2012-12-27 08:15:45 +00:00
Nadav Rotem	8e5d80eba3	AVX/AVX2: Move the code that lowers vector-trunc from a DAGCo-hook to custom lowering hook. The vector truncs were scalarized during LegalizeVectorOps, later vectorized again by some DAGCombine optimization and finally, lowered by a dagcombing optimization. Now, they are properly lowered during LegalizeVectorOps. No new testcase because the original testcases still work. llvm-svn: 171146	2012-12-27 07:45:10 +00:00
Craig Topper	757f3fc394	Add hasSideEffects=0 to some forms of ROUND, RCP, and RSQRT. llvm-svn: 171143	2012-12-27 07:16:08 +00:00
Nadav Rotem	b1dd52450e	Refactor DAGCombinerInfo. Change the different booleans that indicate if we are before or after different runs of DAGCo, with the CombineLevel enum. Also, added a new API for checking if we are running before or after the LegalizeVectorOps phase. llvm-svn: 171142	2012-12-27 06:47:41 +00:00
Craig Topper	09ce4b9efe	Move single letter 'P' prefix out of multiclass now that tablegen allows defm to start with #NAME. This makes instruction names more searchable again. llvm-svn: 171141	2012-12-27 06:34:54 +00:00
Craig Topper	8f0b73942e	Update tablegen parser to allow defm names to start with #NAME. llvm-svn: 171140	2012-12-27 06:32:52 +00:00
Craig Topper	396cb795bc	Add hasSideEffects=0 to some shift and rotate instructions. None of which are currently used by code generation. llvm-svn: 171137	2012-12-27 03:35:44 +00:00
Craig Topper	c7910828e4	Mark the divide instructions as hasSideEffects=0. llvm-svn: 171136	2012-12-27 03:01:18 +00:00
Eric Christopher	3bf29fda91	For the dwarf5 split debug info code split out the string section per compile unit/skeleton compile unit. Update tests accordingly. llvm-svn: 171133	2012-12-27 02:14:01 +00:00
Craig Topper	5b807aaa38	Add hasSideEffects=0 to CMP*rr_REV. llvm-svn: 171130	2012-12-27 02:08:46 +00:00
Craig Topper	89e8607755	Add mayLoad, mayStore, and hasSideEffects tags to BT/BTS/BTR/BTC instructions. Shouldn't change any functionality since they don't have patterns to select them. llvm-svn: 171128	2012-12-27 02:01:33 +00:00
Eric Christopher	5a6acfa4c8	Right now all of the relocations are 32-bit dwarf, and the relocation information doesn't return an addend for Rel relocations. Go ahead and use this information to fix relocation handling inside dwarfdump for 32-bit ELF REL. llvm-svn: 171126	2012-12-27 01:07:07 +00:00
Nadav Rotem	5350cd314b	If all of the write objects are identified then we can vectorize the loop even if the read objects are unidentified. PR14719. llvm-svn: 171124	2012-12-26 23:30:53 +00:00
Craig Topper	c557343956	Fix operands and encoding form for ARPL instruction. Register form had and reversed. Memory form writes memory, but was marked as MRMSrcMem. llvm-svn: 171123	2012-12-26 23:27:57 +00:00
Craig Topper	d47a70de9f	Add hasSideEffects=0 to some atomic instructions. llvm-svn: 171122	2012-12-26 23:08:12 +00:00
Craig Topper	af2372087b	Mark the AL/AX/EAX forms of the basic arithmetic operations has never having side effects. llvm-svn: 171121	2012-12-26 22:19:23 +00:00
Nick Lewycky	fca2acb618	80 columns. No functionality change. llvm-svn: 171120	2012-12-26 22:00:49 +00:00
Nick Lewycky	90053a1214	Remove mid-optimizer warning. This situation should be handled differently, such as by a compiler warning, a check in clang -fsanitizer=undefined, being optimized to unreachable, or a combination of the above. PR14722. llvm-svn: 171119	2012-12-26 22:00:35 +00:00
Craig Topper	1b8c0750ee	Mark all the _REV instructions as not having side effects. They aren't really emitted by the backend, but it reduces the number of instructions in the output files with unmodelled side effects to make auditing easier. llvm-svn: 171118	2012-12-26 21:30:22 +00:00
Craig Topper	18f2675e9b	Remove a special conditional setting of neverHasSideEffects if the instruction didn't have a pattern. This was leftover from when tablegen used to complain if things were already inferred from patterns. llvm-svn: 171117	2012-12-26 21:04:30 +00:00
Nadav Rotem	3f7c4f36ba	LoopVectorizer: Optimize the vectorization of consecutive memory access when the iteration step is -1 llvm-svn: 171114	2012-12-26 19:08:17 +00:00
Evgeniy Stepanov	5eb5bf8b46	[msan] Raise alignment of origin stores/loads when possible. Origin alignment is as high as the alignment of the corresponding application location, but never less than 4. llvm-svn: 171110	2012-12-26 11:55:09 +00:00
Evgeniy Stepanov	d8be0c510c	[msan] Expand the file comment with track-origins info. llvm-svn: 171109	2012-12-26 10:59:00 +00:00
Craig Topper	24f316e4db	Merge still more SSE/AVX instruction definitions. llvm-svn: 171103	2012-12-26 07:54:43 +00:00
Craig Topper	af629e2700	Merge more SSE/AVX instruction definitions. llvm-svn: 171102	2012-12-26 07:20:35 +00:00
Craig Topper	65fe30450d	Fix 80 column violation. llvm-svn: 171097	2012-12-26 06:15:53 +00:00
Craig Topper	f4d0fe8fcd	Fix class name in comment. llvm-svn: 171096	2012-12-26 06:15:09 +00:00
Craig Topper	59747c4dbd	Merge SSE/AVX PCMPEQ/PCMPGT instruction definitions. llvm-svn: 171095	2012-12-26 06:14:15 +00:00
Craig Topper	8a48677586	Remove 'v' from mnemonic to fix asm matching failures. llvm-svn: 171093	2012-12-26 06:02:15 +00:00
Craig Topper	b4ef0fa3a1	Use an additional multiclass to merge the 128/256-bit SSE/AVX instruction definitions for a bunch of SSE2 integer arithmetic instructions. llvm-svn: 171092	2012-12-26 05:49:15 +00:00
Nadav Rotem	5267bb71b8	Reformat the docs. llvm-svn: 171091	2012-12-26 04:59:20 +00:00
Craig Topper	a2594dd5f0	Use an additional multiclass to merge the 128/256-bit SSE/AVX instruction definitions for PAND/POR/PXOR/PANDN llvm-svn: 171087	2012-12-26 04:36:03 +00:00
Craig Topper	97730a0d6a	Merge an AVX/SSE 256-bit and 128-bit multiclass. llvm-svn: 171086	2012-12-26 03:56:47 +00:00
Craig Topper	8b59746390	Mark VANDNPD/VANDNPDS as not commutable. llvm-svn: 171085	2012-12-26 03:48:10 +00:00
Craig Topper	81d1e596bb	Remove alignment from a bunch more VEX encoded operations in the folding tables. llvm-svn: 171082	2012-12-26 02:44:47 +00:00
Craig Topper	b2922164f0	Remove alignment from folding table for VMOVUPD as an unaligned instruction it shouldn't require alignment... llvm-svn: 171081	2012-12-26 02:14:19 +00:00
Craig Topper	d09a9af9b6	Remove alignment requirements from (V)EXTRACTPS. This instruction does 32-bit stores which aren't required to be aligned on SSE or AVX. llvm-svn: 171080	2012-12-26 01:47:12 +00:00
Hal Finkel	30e95a8ebb	BBVectorize: Use VTTI to compute costs for intrinsics vectorization For the time being this includes only some dummy test cases. Once the generic implementation of the intrinsics cost function does something other than assuming scalarization in all cases, or some target specializes the interface, some real test cases can be added. Also, for consistency, I changed the type of IID from unsigned to Intrinsic::ID in a few other places. llvm-svn: 171079	2012-12-26 01:36:57 +00:00
Craig Topper	caef1c5d86	Remove alignment requirement from VCVTSS2SD in folding tables. Reverting r171049. This instruction doesn't require alignment. llvm-svn: 171078	2012-12-26 00:35:47 +00:00
Hal Finkel	b44f890133	LoopVectorize: Enable vectorization of the fmuladd intrinsic llvm-svn: 171076	2012-12-25 23:21:29 +00:00
Hal Finkel	2a456112ec	BBVectorize: Enable vectorization of the fmuladd intrinsic llvm-svn: 171075	2012-12-25 22:36:08 +00:00
Hal Finkel	1b5ff08d43	Expand PPC64 atomic load and store Use of store or load with the atomic specifier on 64-bit types would cause instruction-selection failures. As with the 32-bit case, these can use the default expansion in terms of cmp-and-swap. llvm-svn: 171072	2012-12-25 17:22:53 +00:00
Evgeniy Stepanov	f19c086d1e	[msan] Fix handling of vectors of pointers. VectorType::getInteger() can not be used with them, because pointer size depends on the target. llvm-svn: 171070	2012-12-25 16:04:38 +00:00
Evgeniy Stepanov	ec8371283b	[msan] Fix handling of select with vector condition. llvm-svn: 171069	2012-12-25 14:56:21 +00:00
Benjamin Kramer	81b5a8fd2e	X86: Shave off one shuffle from the pcmpeqq sequence for SSE2 by making use of and commutativity. llvm-svn: 171064	2012-12-25 13:09:08 +00:00
Benjamin Kramer	df4af41b9b	X86: Custom lower <2 x i64> eq and ne when SSE41 is not available. pcmpeqd, pshufd, pshufd, pand is cheaper than unpack + cmpq, sbbq, cmpq, sbbq + pack. Small speedup on loop-vectorized viterbi (-march=core2). llvm-svn: 171063	2012-12-25 12:54:19 +00:00
Alexey Samsonov	788381b8ac	ASan: initialize callbacks from ASan module pass in a separate function for consistency llvm-svn: 171061	2012-12-25 12:28:20 +00:00
Alexey Samsonov	1e3f7ba8f7	ASan: move stack poisoning logic into FunctionStackPoisoner struct llvm-svn: 171060	2012-12-25 12:04:36 +00:00
Nick Lewycky	d192517cf3	Fix whitespace. No functionality change. llvm-svn: 171051	2012-12-25 06:13:25 +00:00
Nadav Rotem	00410ae625	VCVTSS2SD requires a strict alignment. Thanks Elena. llvm-svn: 171049	2012-12-25 03:29:18 +00:00
Bob Wilson	fe73ac34c5	Rename LLVMContext diagnostic handler types and functions. These are now generally used for all diagnostics from the backend, not just for inline assembly, so this drops the "InlineAsm" from the names. No functional change. (I've left aliases for the old names but only for long enough to let me switch over clang to use the new ones.) llvm-svn: 171047	2012-12-25 00:07:12 +00:00
Nick Lewycky	521e0d59f3	Quiet gcc's -Wparenthesis warning. No functionality change. llvm-svn: 171044	2012-12-24 19:58:45 +00:00
Benjamin Kramer	9d46110ff1	Use a std::string rather than a dynamically allocated char* buffer. This affords us to use std::string's allocation routines and use the destructor for the memory management. Switching to that also means that we can use operator==(const std::string&, const char *) to perform the string comparison rather than resorting to libc functionality (i.e. strcmp). Patch by Saleem Abdulrasool! Differential Revision: http://llvm-reviews.chandlerc.com/D230 llvm-svn: 171042	2012-12-24 19:23:30 +00:00
Bob Wilson	4ed23578da	Add LLVMContext::emitWarning methods and use them. <rdar://problem/12867368> When the backend is used from clang, it should produce proper diagnostics instead of just printing messages to errs(). Other clients may also want to register their own error handlers with the LLVMContext, and the same handler should work for warnings in the same way as the existing emitError methods. llvm-svn: 171041	2012-12-24 18:15:21 +00:00
Nadav Rotem	3ee6b10dd4	CostModel: We have API for checking the costs of known shuffles. This patch adds support for the insert-subvector and extract-subvector kinds. llvm-svn: 171027	2012-12-24 10:04:03 +00:00
Elena Demikhovsky	517afbff01	Added 6 more value types: v32i1, v64i1, v32i16, v32i8, v64i8, v8f64 llvm-svn: 171026	2012-12-24 10:03:57 +00:00
Elena Demikhovsky	2fdeb6da8d	Removed "static" from "__jit_debug_descriptor" because "static" adds C++ mangling prefix to this symbol. llvm-svn: 171025	2012-12-24 09:42:27 +00:00
Nadav Rotem	dc0ad92b64	Some x86 instructions can load/store one of the operands to memory. On SSE, this memory needs to be aligned. When these instructions are encoded in VEX (on AVX) there is no such requirement. This changes the folding tables and removes the alignment restrictions from VEX-encoded instructions. llvm-svn: 171024	2012-12-24 09:40:33 +00:00
Nadav Rotem	5f7c12cfbd	LoopVectorizer: When checking for vectorizable types, also check the StoreInst operands. PR14705. llvm-svn: 171023	2012-12-24 09:14:18 +00:00
Nadav Rotem	7e1599e100	Change the codegen Cost Model API for shuffeles. This patch removes the API for broadcast and adds a more general API that accepts an enum of known shuffles. llvm-svn: 171022	2012-12-24 08:57:47 +00:00
Alexey Samsonov	098842b401	Fix typo in comments llvm-svn: 171021	2012-12-24 08:52:53 +00:00
Nadav Rotem	99868e4f9d	Update the docs of the cost model. llvm-svn: 171016	2012-12-24 05:51:12 +00:00
Nadav Rotem	bd5d1d832a	LoopVectorizer: Fix an endless loop in the code that looks for reductions. The bug was in the code that detects PHIs in if-then-else block sequence. PR14701. llvm-svn: 171008	2012-12-24 01:22:06 +00:00
Nadav Rotem	cf9999d9d5	CostModel: Change the default target-independent implementation for finding the cost of arithmetic functions. We now assume that the cost of arithmetic operations that are marked as Legal or Promote is low, but ops that are marked as custom are higher. llvm-svn: 171002	2012-12-23 17:31:23 +00:00
Benjamin Kramer	28691400dd	LoopVectorize: Fix accidentaly inverted condition. llvm-svn: 171001	2012-12-23 13:21:41 +00:00
Benjamin Kramer	855ba03408	LoopVectorize: For scalars and void types there is no need to compute vector insert/extract costs. Fixes an assert during the build of oggenc in the test suite. llvm-svn: 171000	2012-12-23 13:19:18 +00:00
Nadav Rotem	b15c69a725	whitespace llvm-svn: 170997	2012-12-23 07:33:44 +00:00
Nadav Rotem	1bef5a0509	Rename a function. llvm-svn: 170996	2012-12-23 07:30:09 +00:00
Nadav Rotem	2cade68025	Loop Vectorizer: Update the cost model of scatter/gather operations and make them more expensive. llvm-svn: 170995	2012-12-23 07:23:55 +00:00
Craig Topper	1bef2c859f	Remove trailing whitespace. llvm-svn: 170991	2012-12-22 19:15:35 +00:00
Craig Topper	4c94775198	Remove trailing whitespace llvm-svn: 170990	2012-12-22 18:09:02 +00:00
Jakob Stoklund Olesen	7bca670a8b	Remove a special case that doesn't seem necessary any longer. Back when this exception was added, it was skipping a lot more code, but now it just looks like a premature optimization. llvm-svn: 170989	2012-12-22 17:33:22 +00:00
Jakob Stoklund Olesen	b089483993	Use getNumOperands() instead of Operands.size(). The representation of the Operands array is going to change soon so it can be allocated from a BumpPtrAllocator. llvm-svn: 170988	2012-12-22 17:13:06 +00:00
Benjamin Kramer	76268ac682	X86: Turn mul of <4 x i32> into pmuludq when no SSE4.1 is available. pmuludq is slow, but it turns out that all the unpacking and packing of the scalarized mul is even slower. 10% speedup on loop-vectorized paq8p. llvm-svn: 170985	2012-12-22 16:07:56 +00:00
Benjamin Kramer	b2f0a2bd4b	X86: Emit vector sext as shuffle + sra if vpmovsx is not available. Also loosen the SSSE3 dependency a bit, expanded pshufb + psra is still better than scalarized loads. Fixes PR14590. llvm-svn: 170984	2012-12-22 11:34:28 +00:00
Bill Wendling	c79e42c5ce	Change 'AttrVal' to 'AttrKind' to better reflect that it's a kind of attribute instead of the value of the attribute. llvm-svn: 170972	2012-12-22 00:37:52 +00:00
Richard Smith	045e4f1365	Don't call back() on an empty SmallVector. Found by -fsanitize=enum! llvm-svn: 170968	2012-12-22 00:15:13 +00:00
Nadav Rotem	d5aae980cb	In some cases, due to scheduling constraints we copy the EFLAGS. The only way to read the eflags is using push and pop. If we don't adjust the stack then we run over the first frame index. This is not something that we want to do, so we have to make sure that our machine function does not copy the flags. If it does then we have to emit the prolog that adjusts the stack. rdar://12896831 llvm-svn: 170961	2012-12-21 23:48:49 +00:00
Akira Hatanaka	6ac2fc4976	[mips] Refactor subword-swap, EXT/INS, load-effective-address and read-hardware instructions. llvm-svn: 170956	2012-12-21 23:21:32 +00:00
Akira Hatanaka	beea8a34c3	[mips] Refactor SYNC and multiply/divide instructions. llvm-svn: 170955	2012-12-21 23:17:36 +00:00
Akira Hatanaka	31ddec5887	[mips] Refactor BAL instructions. llvm-svn: 170954	2012-12-21 23:15:59 +00:00
Akira Hatanaka	d6b694f036	[mips] Fix encoding of BAL instruction. Also, fix assembler test case which was not catching the error. llvm-svn: 170953	2012-12-21 23:13:59 +00:00
Akira Hatanaka	a158042a56	[mips] Refactor jump, jump register, jump-and-link and nop instructions. llvm-svn: 170952	2012-12-21 23:03:50 +00:00
Akira Hatanaka	e1826d7464	[mips] Refactor load/store left/right and load-link and store-conditional instructions. llvm-svn: 170950	2012-12-21 23:01:24 +00:00
Akira Hatanaka	d9bf8424e5	[mips] Refactor load/store instructions. llvm-svn: 170948	2012-12-21 22:58:55 +00:00
Akira Hatanaka	b59b047fbe	[mips] Remove unnecessary isPseudo parameter. llvm-svn: 170947	2012-12-21 22:57:26 +00:00
Akira Hatanaka	e738efc95b	[mips] Refactor LUI instruction. llvm-svn: 170944	2012-12-21 22:46:07 +00:00
Akira Hatanaka	895e1cb2aa	[mips] Refactor count leading zero or one instructions. llvm-svn: 170942	2012-12-21 22:43:58 +00:00
Akira Hatanaka	4f4c4aa05e	[mips] Refactor sign-extension-in-register instructions. llvm-svn: 170940	2012-12-21 22:41:52 +00:00
Akira Hatanaka	b14c6e4e5f	[mips] Refactor instructions which copy from and to HI/LO registers. llvm-svn: 170939	2012-12-21 22:39:17 +00:00
Akira Hatanaka	9e89195dce	[mips] Refactor logical NOR instructions. llvm-svn: 170937	2012-12-21 22:35:47 +00:00
Akira Hatanaka	ac10697207	[mips] Move instruction definitions in MipsInstrInfo.td. llvm-svn: 170936	2012-12-21 22:33:43 +00:00
Tom Stellard	09ef8425e9	R600: Coding style - remove empty spaces from the beginning of functions No functionality change. llvm-svn: 170923	2012-12-21 20:12:02 +00:00
Tom Stellard	41398026e7	R600: Fix MAX_UINT definition Patch by: Vadim Girlin Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 170922	2012-12-21 20:12:01 +00:00
Tom Stellard	4fa7ac29f1	R600: Add SHADOWCUBE to TEX_SHADOW pattern Patch by: Vadim Girlin Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 170921	2012-12-21 20:11:59 +00:00
Benjamin Kramer	5521b94b07	Cleanup compiler warnings on discarding type qualifiers in casts. Switch to C++ style casts. Patch by Saleem Abdulrasool! Differential Revision: http://llvm-reviews.chandlerc.com/D204 llvm-svn: 170917	2012-12-21 19:09:53 +00:00
Benjamin Kramer	82d1c371e2	X86: Match pmin/pmax as a target specific dag combine. This occurs during vectorization. Part of PR14667. llvm-svn: 170908	2012-12-21 17:46:58 +00:00
Roman Divacky	a229186a82	Remove duplicate includes. llvm-svn: 170902	2012-12-21 17:06:44 +00:00
Tom Stellard	a8b0351720	R600: Expand vec4 INT <-> FP conversions llvm-svn: 170901	2012-12-21 16:33:24 +00:00
Benjamin Kramer	4669d18893	X86: Match the SSE/AVX min/max vector ops using a custom node instead of intrinsics This is very mechanical, no functionality change. Preparation for PR14667. llvm-svn: 170898	2012-12-21 14:04:55 +00:00
Evgeniy Stepanov	4fbc0d08bf	[msan] Remove unreachable blocks before instrumenting a function. llvm-svn: 170883	2012-12-21 11:18:49 +00:00
Nadav Rotem	eacbb731d3	Add a missing "virtual" keyword. llvm-svn: 170842	2012-12-21 05:02:12 +00:00
Nadav Rotem	3b850b70b3	Enable if-conversion. llvm-svn: 170841	2012-12-21 04:47:54 +00:00
Quentin Colombet	b1b66e7a25	Add ARM cortex-r5 subtarget. llvm-svn: 170840	2012-12-21 04:35:05 +00:00
Rafael Espindola	73bf9fa7ba	Don't skip __DWARF, Now that we don't merge section and segment names, we don't need to skip the segment name to get to the section name. llvm-svn: 170839	2012-12-21 04:08:03 +00:00
Rafael Espindola	a9f810b6b5	Add a function to get the segment name of a section. On MachO, sections also have segment names. When a tool looking at a .o file prints a segment name, this is what they mean. In reality, a .o has only one anonymous, segment. This patch adds a MachO only function to fetch that segment name. I named it getSectionFinalSegmentName since the main use for the name seems to be inform the linker with segment this section should go to. The patch also changes MachOObjectFile::getSectionName to return just the section name instead of computing SegmentName,SectionName. The main difference from the previous patch is that it doesn't use InMemoryStruct. It is extremely dangerous: if the endians match it returns a pointer to the file buffer, if not, it returns a pointer to an internal buffer that is overwritten in the next API call. We should change all of this code to use support::detail::packed_endian_specific_integral like ELF, but since these functions only handle strings, they work with big and little endian machines as is. I have tested this by installing ubuntu 12.10 ppc on qemu, that is why it took so long :-) llvm-svn: 170838	2012-12-21 03:47:03 +00:00
Evan Cheng	59421aee3d	Add targets to skip running the GC passes. llvm-svn: 170836	2012-12-21 02:57:04 +00:00
Evan Cheng	99cafb1db2	Every pass deserves a name, even codegenprep. llvm-svn: 170831	2012-12-21 01:48:14 +00:00
Nadav Rotem	6d4fdd6d2c	Improve the X86 cost model for loads and stores. llvm-svn: 170830	2012-12-21 01:33:59 +00:00
Nadav Rotem	a4b53f20a3	BB-Vectorizer: Check the cost of the store pointer type and not the return type, which is void. A number of test cases fail after adding the assertion in TTImpl. llvm-svn: 170828	2012-12-21 01:24:36 +00:00
Reed Kotler	9bff1ead0e	Call llvm_unreachable instead of assert. llvm-svn: 170822	2012-12-21 00:44:59 +00:00
Nadav Rotem	e7785686a5	Fix a bug in the code that checks if we can vectorize loops while using dynamic memory bound checks. Before the fix we were able to vectorize this loop from the Livermore Loops benchmark: for ( k=1 ; k<n ; k++ ) x[k] = x[k-1] + y[k]; llvm-svn: 170811	2012-12-21 00:07:35 +00:00
Jakob Stoklund Olesen	2455b58551	Require the two-argument MI::addOperand(MF, MO) for dangling instructions. Instructions that are inserted in a basic block can still be decorated with addOperand(MO). Make the two-argument addOperand() function contain the actual implementation. This function will now always have a valid MF reference that it can use for memory allocation. llvm-svn: 170798	2012-12-20 22:54:05 +00:00
Jakob Stoklund Olesen	33f5d1492d	Add an MF argument to MI::copyImplicitOps(). This function is often used to decorate dangling instructions, so a context reference is required to allocate memory for the operands. Also add a corresponding MachineInstrBuilder method. llvm-svn: 170797	2012-12-20 22:54:02 +00:00
Jakob Stoklund Olesen	ac4210eacb	Use two-arg addOperand(MF, MO) internally in MachineInstr when possible. llvm-svn: 170796	2012-12-20 22:53:58 +00:00
Jakob Stoklund Olesen	2ea203694d	MachineInstrBuilderize ARM. llvm-svn: 170795	2012-12-20 22:53:55 +00:00
Jakob Stoklund Olesen	4255c96aed	MachineInstrBuilderize NVPTX. llvm-svn: 170794	2012-12-20 22:53:53 +00:00
Eli Bendersky	75a7a338fc	Fix an unitialized member variable that may have caused sporadic failures for code that wasn't even in bundling mode. llvm-svn: 170793	2012-12-20 22:51:52 +00:00
Eric Christopher	48fef599a4	Whitespace and 80-column cleanup. llvm-svn: 170771	2012-12-20 21:58:40 +00:00
Eric Christopher	e698f53740	Start splitting out the debug string section handling by moving it into the DwarfUnits class. llvm-svn: 170770	2012-12-20 21:58:36 +00:00
Bill Wendling	66e978f904	Some random comment, naming, and format changes. Rename the AttributeImpl* from Attrs to pImpl to be consistent with other code. Add comments where none were before. Or doxygen-ify other comments. llvm-svn: 170767	2012-12-20 21:28:43 +00:00
Jakob Stoklund Olesen	00b28ecfae	Remove two dead functions. llvm-svn: 170766	2012-12-20 21:12:42 +00:00
Bob Wilson	7bba4f8957	Revert "Adding support for llvm.arm.neon.vaddl[su].* and" This reverts r170694. The operations can be represented in IR without adding any new intrinsics. llvm-svn: 170765	2012-12-20 21:09:38 +00:00
Nadav Rotem	2ababf68d7	LoopVectorize: Fix a bug in the scalarization of instructions. Before if-conversion we could check if a value is loop invariant if it was declared inside the basic block. Now that loops have multiple blocks this check is incorrect. This fixes External/SPEC/CINT95/099_go/099_go llvm-svn: 170756	2012-12-20 20:24:40 +00:00
Evan Cheng	ddc0cb6dc5	On some ARM cpus, flags setting movs with shifter operand, i.e. lsl, lsr, asr, are more expensive than the non-flag setting variant. Teach thumb2 size reduction pass to avoid generating them unless we are optimizing for size. rdar://12892707 llvm-svn: 170728	2012-12-20 19:59:30 +00:00
Eli Bendersky	f483ff9204	Aligned bundling support. Following the discussion here: http://lists.cs.uiuc.edu/pipermail/llvmdev/2012-December/056754.html The proposal and implementation are fully documented here: https://sites.google.com/a/chromium.org/dev/nativeclient/pnacl/aligned-bundling-support-in-llvm Tests will follow shortly. llvm-svn: 170718	2012-12-20 19:05:53 +00:00
Jakob Stoklund Olesen	2705333253	Use MachineInstrBuilder for PHI nodes in SelectionDAGISel. llvm-svn: 170716	2012-12-20 18:46:29 +00:00
Jakob Stoklund Olesen	b109a7b430	Use MachineInstrBuilder in InstrEmitter. This is supposed to be a mechanical change with no functional effects. InstrEmitter can generate all types of MachineOperands which revealed that MachineInstrBuilder was missing a few methods, added by this patch. Besides providing a context pointer to MI::addOperand(), MachineInstrBuilder seems like a better fit for this code. llvm-svn: 170712	2012-12-20 18:08:09 +00:00
Jakob Stoklund Olesen	f623e9870d	Use MachineInstrBuilder in a few CodeGen passes. This automatically passes a context pointer to MI->addOperand(). llvm-svn: 170711	2012-12-20 18:08:06 +00:00
Nadav Rotem	8b20c0a814	Loop Vectorizer: turn-off if-conversion. llvm-svn: 170708	2012-12-20 17:42:53 +00:00
James Molloy	4f6fb953a7	Add a new attribute, 'noduplicate'. If a function contains a noduplicate call, the call cannot be duplicated - Jump threading, loop unrolling, loop unswitching, and loop rotation are inhibited if they would duplicate the call. Similarly inlining of the function is inhibited, if that would duplicate the call (in particular inlining is still allowed when there is only one callsite and the function has internal linkage). llvm-svn: 170704	2012-12-20 16:04:27 +00:00
Roman Divacky	ff95a1dc12	Remove MCTargetAsmLexer and its derived classes now that edis, its only user, is gone. llvm-svn: 170699	2012-12-20 14:43:30 +00:00
Renato Golin	6b2ea4a48f	Adding support for llvm.arm.neon.vaddl[su].* and llvm.arm.neon.vsub[su].* intrinsics. Patch by Pete Couperus <pjcoup@gmail.com> llvm-svn: 170694	2012-12-20 13:52:11 +00:00
Craig Topper	ae48cb2e5a	Formatting fixes. Remove some unnecessary 'else' after 'return'. No functional change. llvm-svn: 170676	2012-12-20 07:15:54 +00:00
Craig Topper	9d4171afed	Removing trailing whitespace llvm-svn: 170675	2012-12-20 07:09:41 +00:00
Reed Kotler	d11acc7dc0	Implement cfi_def_cfa_offset. "Make check" test case for this comming in the next few days but it's already tested a lot from test-suite and works fine. This patch completes almost 100% pass of test-suite for mips 16. llvm-svn: 170674	2012-12-20 06:59:37 +00:00
Reed Kotler	8965d24a2a	There is one more patch to finish large frames. Make sure we assert on code that has large frames which will not yet compile correctly. llvm-svn: 170673	2012-12-20 06:57:00 +00:00
Jyotsna Verma	56605448f2	Add constant extender support to GP-relative load/store instructions. llvm-svn: 170672	2012-12-20 06:52:46 +00:00
Jyotsna Verma	bf75aaf53e	Add TSFlags to ALU32 type instructions for constant-extender/Relationship maps. llvm-svn: 170671	2012-12-20 06:45:39 +00:00
Reed Kotler	7bff8f1d7a	set register class properly for mips16 here llvm-svn: 170669	2012-12-20 06:06:35 +00:00
Rafael Espindola	fb8ac2df09	Undefine PPC harder. This was causing a build failure while trying to build on ppc ubuntu 12.10 with cmake. llvm-svn: 170668	2012-12-20 05:13:09 +00:00
Reed Kotler	92fc33bc97	This assert is overly restrictive and does not work for mips16. llvm-svn: 170667	2012-12-20 05:09:15 +00:00
Reed Kotler	fd633229f7	Turn on register scavenger for Mips 16 We use an unused Mips 32 register for the emergency slot instead of using the stack. llvm-svn: 170665	2012-12-20 04:44:58 +00:00
Akira Hatanaka	e7f1acc7c0	[mips] Refactor SLT (set on less than) instructions. Separate encoding information from the rest. llvm-svn: 170664	2012-12-20 04:27:52 +00:00
Akira Hatanaka	bbd197e9c4	[mips] Refactor unconditional branch instruction. Separate encoding information from the rest. llvm-svn: 170663	2012-12-20 04:22:39 +00:00
Akira Hatanaka	b1527b7505	[mips] Remove asm string parameter from pseudo instructions. Add InstrItinClass parameter. llvm-svn: 170661	2012-12-20 04:20:09 +00:00
Akira Hatanaka	14f9ce0f83	[mips] Delete definition of CPRESTORE instruction. llvm-svn: 170660	2012-12-20 04:15:30 +00:00
Akira Hatanaka	c0ea0bb99b	[mips] Refactor conditional branch instructions with one register operand. Separate encoding information from the rest. llvm-svn: 170659	2012-12-20 04:13:23 +00:00
Richard Smith	4a8e454ab2	Don't use isa<CallInst>(this) in the constructor for CallInst's base class. This has undefined behavior, because the classof implementation attempts to access parts of the not-yet-constructed derived class. Found by clang -fsanitize=vptr. llvm-svn: 170658	2012-12-20 04:11:02 +00:00
Akira Hatanaka	f71ffd29d9	[mips] Refactor conditional branch instructions with two register operands. Separate encoding information from the rest. llvm-svn: 170657	2012-12-20 04:10:13 +00:00
Reed Kotler	d019dbf75e	fix most of remaining issues with large frames. these patches are tested a lot by test-suite but make check tests are forthcoming once the next few patches that complete this are committed. with the next few patches the pass rate for mips16 is near 100% llvm-svn: 170656	2012-12-20 04:07:42 +00:00
Akira Hatanaka	f423672117	[mips] Use "or $r0, $r1, $zero" instead of "addu $r0, $zero, $r1" to copy physical register $r1 to $r0. GNU disassembler recognizes an "or" instruction as a "move", and this change makes the disassembled code easier to read. Original patch by Reed Kotler. llvm-svn: 170655	2012-12-20 04:06:06 +00:00
Richard Smith	15b1e3727b	Fix use-before-construction of X86TargetLowering. llvm-svn: 170654	2012-12-20 04:04:17 +00:00
Richard Smith	e7701ebfec	Don't use -1 as a value of an unsigned 7-bit enumeration; that has undefined behavior and violates the !range constraints we put on loads of this enum. Found by clang -fsanitize=enum. llvm-svn: 170653	2012-12-20 04:02:58 +00:00
Akira Hatanaka	7d75f9e3d3	[mips] Change the order of template parameters. Move the default parameters to the end. llvm-svn: 170651	2012-12-20 03:52:08 +00:00
Akira Hatanaka	244f9e874c	[mips] Refactor shift instructions with register operands. Separate encoding information from the rest. llvm-svn: 170650	2012-12-20 03:48:24 +00:00
Akira Hatanaka	7f96ad325f	[mips] Refactor shift immediate instructions. Separate encoding information from the rest. llvm-svn: 170649	2012-12-20 03:44:41 +00:00
Akira Hatanaka	ab1b715bf2	[mips] Refactor arithmetic and logic instructions with immediate operands. Separate encoding information from the rest. llvm-svn: 170648	2012-12-20 03:40:03 +00:00
Akira Hatanaka	1b37c4af01	[mips] Refactor arithmetic and logic instructions. Separate encoding information from the rest. llvm-svn: 170647	2012-12-20 03:34:05 +00:00
Akira Hatanaka	73495897b1	[mips] Delete ArithOverflowR and ArithOverflow and use ArithLogicR and ArithLogicI as the instruction base classes. llvm-svn: 170642	2012-12-20 03:00:16 +00:00
Nadav Rotem	7bdc45b570	Loop Vectorizer: Enable if-conversion. llvm-svn: 170632	2012-12-20 02:00:02 +00:00
Bill Wendling	4607f4bdad	s/AttributesImpl/AttributeImpl/g This is going to apply to Attribute, not Attributes. llvm-svn: 170631	2012-12-20 01:36:59 +00:00
Bob Wilson	3365b80290	Do not introduce vector operations in functions marked with noimplicitfloat. <rdar://problem/12879313> llvm-svn: 170630	2012-12-20 01:36:20 +00:00
Nadav Rotem	28408a20c9	whitespace llvm-svn: 170626	2012-12-20 00:49:56 +00:00
NAKAMURA Takumi	2a0b40f584	Target/R600: Update MIB according to r170588. llvm-svn: 170620	2012-12-20 00:22:11 +00:00
Bill Wendling	6ad6c3b1c2	Add a context so that once we uniquify strings we can access them easily. llvm-svn: 170615	2012-12-19 23:55:43 +00:00
Jim Grosbach	6df94846ec	MC: Add MCInstrDesc::mayAffectControlFlow() method. MC disassembler clients (LLDB) are interested in querying if an instruction may affect control flow other than by virtue of being an explicit branch instruction. For example, instructions which write directly to the PC on some architectures. llvm-svn: 170610	2012-12-19 23:38:53 +00:00
Michael Ilseman	b99f80dea7	Refactor isIntrinsic() to be quicker, and change classof() (and thus, isa<IntrinsicInst>()) to use it. This decreases the number of occurrences of the slow-path string matching performed by getIntrinsicID(). llvm-svn: 170602	2012-12-19 23:17:20 +00:00
Bill Wendling	6848e38daf	s/AttributeListImpl/AttributeSetImpl/g to match the namechange of AttributeList. llvm-svn: 170600	2012-12-19 22:42:22 +00:00
Dmitri Gribenko	349d1a35ff	Add a missing 'else'. Found by grep '} if' No testcase because it is apparently not so trivial to construct. llvm-svn: 170595	2012-12-19 22:13:01 +00:00
Tom Stellard	1c315d5411	R600: Remove unecessary VREG alignment. Unlike SGPRs VGPRs doesn't need to be aligned. Patch by: Christian König Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 170593	2012-12-19 22:10:34 +00:00
Tom Stellard	e7b907d85c	R600: control flow optimization Branch if we have enough instructions so that it makes sense. Also remove branches if they don't make sense. Patch by: Christian König Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> Signed-off-by: Christian König <deathsimple@vodafone.de> llvm-svn: 170592	2012-12-19 22:10:33 +00:00
Tom Stellard	f8794354b2	R600: New control flow for SI v2 This patch replaces the control flow handling with a new pass which structurize the graph before transforming it to machine instruction. This has a couple of different advantages and currently fixes 20 piglit tests without a single regression. It is now a general purpose transformation that could be not only be used for SI/R6xx, but also for other hardware implementations that use a form of structurized control flow. v2: further cleanup, fixes and documentation Patch by: Christian König Signed-off-by: Christian König <deathsimple@vodafone.de> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 170591	2012-12-19 22:10:31 +00:00
Eric Christopher	3c5a1914b6	Split out abbreviations for the skeleton info from the rest of the abbreviations. Part of implementing split dwarf. llvm-svn: 170589	2012-12-19 22:02:53 +00:00
Jakob Stoklund Olesen	b159b5ff0d	Remove the explicit MachineInstrBuilder(MI) constructor. Use the version that also takes an MF reference instead. It would technically be possible to extract an MF reference from the MI as MI->getParent()->getParent(), but that would not work for MIs that are not inserted into any basic block. Given the reasonably small number of places this constructor was used at all, I preferred the compile time check to a run time assertion. llvm-svn: 170588	2012-12-19 21:31:56 +00:00
Nadav Rotem	11350aafb4	Fix a bug that was found by building clang with -fsanitize. I introduced it in r166785. PR14291. If TD is unavailable use getScalarSizeInBits, but don't optimize pointers or vectors of pointers. llvm-svn: 170586	2012-12-19 20:47:04 +00:00
Evan Cheng	eae6d2ccea	LLVM sdisel normalize bit extraction of the form: ((x & 0xff00) >> 8) << 2 to (x >> 6) & 0x3fc This is general goodness since it folds a left shift into the mask. However, the trailing zeros in the mask prevents the ARM backend from using the bit extraction instructions. And worse since the mask materialization may require an addition instruction. This comes up fairly frequently when the result of the bit twiddling is used as memory address. e.g. = ptr[(x & 0xFF0000) >> 16] We want to generate: ubfx r3, r1, #16, #8 ldr.w r3, [r0, r3, lsl #2] vs. mov.w r9, #1020 and.w r2, r9, r1, lsr #14 ldr r2, [r0, r2] Add a late ARM specific isel optimization to ARMDAGToDAGISel::PreprocessISelDAG(). It folds the left shift to the 'base + offset' address computation; change the mask to one which doesn't have trailing zeros and enable the use of ubfx. Note the optimization has to be done late since it's target specific and we don't want to change the DAG normalization. It's also fairly restrictive as shifter operands are not always free. It's only done for lsh 1 / 2. It's known to be free on some cpus and they are most common for address computation. This is a slight win for blowfish, rijndael, etc. rdar://12870177 llvm-svn: 170581	2012-12-19 20:16:09 +00:00
Roman Divacky	e3d323052f	Remove edis - the enhanced disassembler. Fixes PR14654. llvm-svn: 170578	2012-12-19 19:55:47 +00:00
Paul Redmond	5917f4c715	Transform (x&C)>V into (x&C)!=0 where possible When the least bit of C is greater than V, (x&C) must be greater than V if it is not zero, so the comparison can be simplified. Although this was suggested in Target/X86/README.txt, it benefits any architecture with a directly testable form of AND. Patch by Kevin Schoedel llvm-svn: 170576	2012-12-19 19:47:13 +00:00
Benjamin Kramer	c5071466d4	PowerPC: Expand VSELECT nodes. There's probably a better expansion for those nodes than the default for altivec, but this is better than crashing. VSELECTs occur in loop vectorizer output. llvm-svn: 170551	2012-12-19 15:49:14 +00:00
Patrik Hagglund	f9934613e8	Change AsmOperandInfo::ConstraintVT to MVT, instead of EVT. Accordingly, add MVT::getVT. llvm-svn: 170550	2012-12-19 15:19:11 +00:00
Rafael Espindola	0f00de40dd	Revert 170545 while I debug the ppc failures. llvm-svn: 170547	2012-12-19 14:48:05 +00:00
Rafael Espindola	aa7b27801c	Add r170095 back. I cannot reproduce it the failures locally, so I will keep an eye at the ppc bots. This patch does add the change to the "Disassembly of section" message, but that is not what was failing on the bots. Original message: Add a funciton to get the segment name of a section. On MachO, sections also have segment names. When a tool looking at a .o file prints a segment name, this is what they mean. In reality, a .o has only one anonymous, segment. This patch adds a MachO only function to fetch that segment name. I named it getSectionFinalSegmentName since the main use for the name seems to be infor the linker with segment this section should go to. The patch also changes MachOObjectFile::getSectionName to return just the section name instead of computing SegmentName,SectionName. llvm-svn: 170545	2012-12-19 14:15:04 +00:00
Evgeniy Stepanov	abeae5c7d5	[msan] Add track-origins argument to the pass constructor. llvm-svn: 170544	2012-12-19 13:55:51 +00:00
Patrik Hagglund	00e7a11904	Split the usage of 'EVT PartVT' into 'MVT PartVT' and 'EVT PartEVT'. llvm-svn: 170540	2012-12-19 12:33:30 +00:00
Patrik Hagglund	4e0f828686	Change RegVT in BitTestBlock and RegsForValue, to contain MVTs, instead of EVTs. llvm-svn: 170538	2012-12-19 12:23:01 +00:00
Patrik Hagglund	e09cac9a67	Change TargetLowering::getTypeForExtArgOrReturn to take and return MVTs, instead of EVTs. llvm-svn: 170537	2012-12-19 12:02:25 +00:00
Patrik Hagglund	3f1905199b	Change a parameter of TargetLowering::getVectorTypeBreakdown to MVT, from EVT. llvm-svn: 170536	2012-12-19 11:53:21 +00:00
Patrik Hagglund	bad545ccba	Change TargetLowering::RegisterTypeForVT to contain MVTs, instead of EVTs. llvm-svn: 170535	2012-12-19 11:48:16 +00:00
Patrik Hagglund	93060569ba	Change TargetLowering::TransformToType to contain MVTs, instead of EVTs. llvm-svn: 170534	2012-12-19 11:42:00 +00:00
Patrik Hagglund	f9eb168ef4	Change TargetLowering::findRepresentativeClass to take an MVT, instead of EVT. llvm-svn: 170532	2012-12-19 11:30:36 +00:00
Evgeniy Stepanov	d7571cd4bc	[msan] Heuristically instrument unknown intrinsics. This changes adds shadow and origin propagation for unknown intrinsics by examining the arguments and ModRef behaviour. For now, only 3 classes of intrinsics are handled: - those that look like simple SIMD store - those that look like simple SIMD load - those that don't have memory effects and look like arithmetic/logic/whatever operation on simple types. llvm-svn: 170530	2012-12-19 11:22:04 +00:00
Patrik Hagglund	fd41b5b969	Change TargetLowering::getTypeToPromoteTo to take and return MVTs, instead of EVTs. llvm-svn: 170529	2012-12-19 11:21:04 +00:00
Benjamin Kramer	e300004bd5	LoopVectorize: Make iteration over induction variables not depend on pointer values. MapVector is a bit heavyweight, but I don't see a simpler way. Also the InductionList is unlikely to be large. This should help 3-stage selfhost compares (PR14647). llvm-svn: 170528	2012-12-19 11:09:15 +00:00
Patrik Hagglund	ffd057a3e1	Change TargetLowering::isCondCodeLegal to take an MVT, instead of EVT. llvm-svn: 170524	2012-12-19 10:19:55 +00:00
NAKAMURA Takumi	89209462fe	X86ISelLowering.cpp: Fix warnings. [-Wlogical-op-parentheses] llvm-svn: 170523	2012-12-19 10:12:48 +00:00
Patrik Hagglund	deee9003ed	Change TargetLowering::getCondCodeAction to take an MVT, instead of EVT. llvm-svn: 170522	2012-12-19 10:09:26 +00:00
Bill Wendling	a87cdc27d9	Inline hasFunctionOnlyAttrs into its only use. llvm-svn: 170518	2012-12-19 09:15:11 +00:00
Bill Wendling	e9506a211f	Inline the only use of the hasParameterOnlyAttrs method. llvm-svn: 170517	2012-12-19 09:04:58 +00:00
Bill Wendling	d97b75d816	Inline the 'hasIncompatibleWithVarArgsAttrs' method into its only uses. And some minor comment reformatting. llvm-svn: 170516	2012-12-19 08:57:40 +00:00
Patrik Hagglund	d7cdcf8cb5	Change TargetLowering::getTruncStoreAction to take MVTs, instead of EVTs. llvm-svn: 170510	2012-12-19 08:28:51 +00:00
Elena Demikhovsky	14a4af0e66	Optimized load + SIGN_EXTEND patterns in the X86 backend. llvm-svn: 170506	2012-12-19 07:50:20 +00:00
Nadav Rotem	33360d8ae9	After reducing the size of an operation in the DAG we zero-extend the reduced bitwidth op back to the original size. If we reduce ANDs then this can cause an endless loop. This patch changes the ZEXT to ANY_EXTEND if the demanded bits are equal or smaller than the size of the reduced operation. llvm-svn: 170505	2012-12-19 07:39:08 +00:00
Bill Wendling	3d7b0b8ac7	Rename the 'Attributes' class to 'Attribute'. It's going to represent a single attribute in the future. llvm-svn: 170502	2012-12-19 07:18:57 +00:00
Craig Topper	3f194c8f4f	Remove more of 'else's after 'returns'. No functional change. llvm-svn: 170497	2012-12-19 06:43:58 +00:00
Craig Topper	5dd8291cbe	Remove a bunch of 'else's after 'returns' llvm-svn: 170496	2012-12-19 06:39:17 +00:00
Craig Topper	63f5921776	Teach SimplifySetCC that comparing AssertZext i1 against a constant 1 can be rewritten as a compare against a constant 0 with the opposite condition. llvm-svn: 170495	2012-12-19 06:12:28 +00:00

... 5 6 7 8 9 ...

58713 Commits