llvm-project

Commit Graph

Author	SHA1	Message	Date
Jakub Staszak	27da123d66	Adjust file to the coding standard. llvm-svn: 187808	2013-08-06 17:03:42 +00:00
Hal Finkel	11b9e452f6	Add PPC64 mulli pattern The PPC backend had been missing a pattern to generate mulli for 64-bit multiples. We had been generating it only for 32-bit multiplies. Unfortunately, generating li + mulld unnecessarily increases register pressure. llvm-svn: 187807	2013-08-06 17:03:03 +00:00
Jakub Staszak	340c780dd6	Remove extraneous semicolon. llvm-svn: 187806	2013-08-06 16:40:40 +00:00
Mihai Popa	c34bf73ebb	This corrects creation of operands for t2PLDW. It also removes the definition of t2PLDWpci, as pldw does not have a literal variant (i.e. pc relative version) llvm-svn: 187804	2013-08-06 16:07:46 +00:00
Mihai Popa	8f49a45c68	Support APSR_nzcv as operand for Thumb2 mrc. Deprecate pre-UAL syntax (pc instead of apsr_nzcv) llvm-svn: 187803	2013-08-06 15:52:36 +00:00
Justin Holewinski	debe686f05	[NVPTX] Add missing patterns for i1 [s,u]int_to_fp llvm-svn: 187800	2013-08-06 14:13:34 +00:00
Justin Holewinski	871ec93909	[NVPTX] Fix bug in stack code generation causes by MC conversion We do use a very small set of physical registers, so account for them in the virtual register encoding between MachineInstr and MC llvm-svn: 187799	2013-08-06 14:13:31 +00:00
Justin Holewinski	a2a63d28df	[NVPTX] Start conversion to MC infrastructure This change converts the NVPTX target to use the MC infrastructure instead of directly emitting MachineInstr instances. This brings the target more up-to-date with LLVM TOT, and should fix PR15175 and PR15958 (libNVPTXInstPrinter is empty) as a side-effect. llvm-svn: 187798	2013-08-06 14:13:27 +00:00
Tim Northover	cc2e903bda	ARM: implement allowTruncateForTailCall Now that it's in place, it seems silly not to let ARM make use of the extra tail call opportunities. llvm-svn: 187795	2013-08-06 13:58:03 +00:00
Rafael Espindola	b2be0b41af	Add a release not about llvm-ar. Thanks to Bill Wendling for the reminder. llvm-svn: 187794	2013-08-06 13:16:28 +00:00
Tim Northover	35989340a4	Remove oddly named libraries with "make uninstall-local" Patch by Edward-san. llvm-svn: 187793	2013-08-06 12:50:45 +00:00
Alexey Samsonov	3211e61b6d	Store compile unit corresponding to each chain of inlined debug info entries. No functionality change. llvm-svn: 187792	2013-08-06 10:49:15 +00:00
Elena Demikhovsky	63bd63e4a3	LLVM Interpreter: fixed bug 16694 fix for: Bug 16694 - ExecutionEngine/test-interp-vec-loadstore.ll failing on powerpc-darwin8 (http://llvm.org/bugs/show_bug.cgi?id=16694) The ExecutionEngine/test-interp-vec-loadstore.ll test has been failing on powerpc-darwin8 (on other platforms it passed) the reason of fail was wrong output by printf. this output is checked by FileCheck, but on little-endian powerpc the output numeric data were printed inside out and FileCheck reported fail. the printfs have been replaced by checking data inside test and numeric output has been replaced by the text output like : "int test passed, float test passed". The text output is checked by FileCheck. the dependency on data layout has been removed. done by Yuri Veselov (Intel) llvm-svn: 187791	2013-08-06 10:40:45 +00:00
Alexey Samsonov	c2e008734b	Add LLVM-style RTTI to DIContext/DWARFContext classes llvm-svn: 187790	2013-08-06 10:32:39 +00:00
Tim Northover	a4415854db	Refactor isInTailCallPosition handling This change came about primarily because of two issues in the existing code. Niether of: define i64 @test1(i64 %val) { %in = trunc i64 %val to i32 tail call i32 @ret32(i32 returned %in) ret i64 %val } define i64 @test2(i64 %val) { tail call i32 @ret32(i32 returned undef) ret i32 42 } should be tail calls, and the function sameNoopInput is responsible. The main problem is that it is completely symmetric in the "tail call" and "ret" value, but in reality different things are allowed on each side. For these cases: 1. Any truncation should lead to a larger value being generated by "tail call" than needed by "ret". 2. Undef should only be allowed as a source for ret, not as a result of the call. Along the way I noticed that a mismatch between what this function treats as a valid truncation and what the backends see can lead to invalid calls as well (see x86-32 test case). This patch refactors the code so that instead of being based primarily on values which it recurses into when necessary, it starts by inspecting the type and considers each fundamental slot that the backend will see in turn. For example, given a pathological function that returned {{}, {{}, i32, {}}, i32} we would consider each "real" i32 in turn, and ask if it passes through unchanged. This is much closer to what the backend sees as a result of ComputeValueVTs. Aside from the bug fixes, this eliminates the recursion that's going on and, I believe, makes the bulk of the code significantly easier to understand. The trade-off is the nasty iterators needed to find the real types inside a returned value. llvm-svn: 187787	2013-08-06 09:12:35 +00:00
Serge Pavlov	71044cbe16	Unbreak Debug build on Windows llvm-svn: 187786	2013-08-06 08:44:18 +00:00
Craig Topper	cf969eadaf	Simplify vector lane handling math a bit. No functional change intended. llvm-svn: 187783	2013-08-06 07:23:12 +00:00
Craig Topper	7418ff460c	Simplify math a little bit. llvm-svn: 187781	2013-08-06 06:54:25 +00:00
NAKAMURA Takumi	aaf66c7357	Target//CMakeLists.txt: Add the dependency to CommonTableGen explicitly for each corresponding CodeGen. Without explicit dependencies, both per-file action and in-CommonTableGen action could run in parallel. It races to emit .inc files simultaneously. llvm-svn: 187780	2013-08-06 06:38:37 +00:00
Craig Topper	9bc00b65b6	Replace EVT with MVT in isHorizontalBinOp as it is only called with legal types. llvm-svn: 187779	2013-08-06 06:05:05 +00:00
NAKAMURA Takumi	e359e85649	AsmPrinter/CMakeLists.txt: Add explicit dependency to intrinsics_gen here. llvm-svn: 187778	2013-08-06 05:56:39 +00:00
NAKAMURA Takumi	752f1ec651	Option/CMakeLists.txt: Don't use target_link_libraries. LLVMBuild knows dependencies. llvm-svn: 187777	2013-08-06 05:56:32 +00:00
Craig Topper	5ba12d75df	Put an llvm_unreachable at the end of getSplatIndex as its loop should never find all undef elements. llvm-svn: 187775	2013-08-06 05:41:22 +00:00
Craig Topper	770547db15	Check against >= 0 instead of != -1 in getSplatIndex because it generally compiles to better code and is equivalent for shuffle indices. llvm-svn: 187774	2013-08-06 05:07:37 +00:00
Craig Topper	7d60e7ca7f	Remove trailing whitespace and fix an 80-column violation. No functional change. llvm-svn: 187773	2013-08-06 05:01:21 +00:00
Craig Topper	47d7c5c8fe	Simplify code slightly. No functional change. llvm-svn: 187771	2013-08-06 04:12:40 +00:00
Tom Stellard	aa664d9b92	Factor FlattenCFG out from SimplifyCFG Patch by: Mei Ye llvm-svn: 187764	2013-08-06 02:43:45 +00:00
Eric Christopher	f7d848d0b9	Allow 4 as a valid debug info version. llvm-svn: 187763	2013-08-06 01:38:27 +00:00
Shuxin Yang	6f7213cb93	Add some comment to LTOCodeGenerator class llvm-svn: 187761	2013-08-06 00:45:32 +00:00
Matt Arsenault	ff7dc7248e	Fix missing -- C++ --s llvm-svn: 187758	2013-08-06 00:16:21 +00:00
Bill Wendling	dc17270968	FileCheckize some of the testcases. llvm-svn: 187756	2013-08-05 23:43:18 +00:00
Bill Wendling	7c8a4a4346	Fix grammar. llvm-svn: 187755	2013-08-05 23:29:16 +00:00
Tom Stellard	eef2ad92c7	R600/SI: Add missing test for r187749 llvm-svn: 187754	2013-08-05 22:45:56 +00:00
Eric Christopher	0062f2edc0	Recommit previous cleanup with a fix for c++98 ambiguity. llvm-svn: 187752	2013-08-05 22:32:28 +00:00
Tom Stellard	28d06de6f6	R600: Implement TargetLowering::getVectorIdxTy() We use MVT::i32 for the vector index type, because we use 32-bit operations to caculate offsets when dynamically indexing vectors. llvm-svn: 187749	2013-08-05 22:22:07 +00:00
Tom Stellard	d42c594960	TargetLowering: Add getVectorIdxTy() function v2 This virtual function can be implemented by targets to specify the type to use for the index operand of INSERT_VECTOR_ELT, EXTRACT_VECTOR_ELT, INSERT_SUBVECTOR, EXTRACT_SUBVECTOR. The default implementation returns the result from TargetLowering::getPointerTy() The previous code was using TargetLowering::getPointerTy() for vector indices, because this is guaranteed to be legal on all targets. However, using TargetLowering::getPointerTy() can be a problem for targets with pointer sizes that differ across address spaces. On such targets, when vectors need to be loaded or stored to an address space other than the default 'zero' address space (which is the address space assumed by TargetLowering::getPointerTy()), having an index that is a different size than the pointer can lead to inefficient pointer calculations, (e.g. 64-bit adds for a 32-bit address space). There is no intended functionality change with this patch. llvm-svn: 187748	2013-08-05 22:22:01 +00:00
Eric Christopher	432c99af0b	Revert "Use existing builtin hashing functions to make this routine more" This reverts commit r187745. llvm-svn: 187747	2013-08-05 22:07:30 +00:00
Eric Christopher	d728355a1c	Use existing builtin hashing functions to make this routine more simple. llvm-svn: 187745	2013-08-05 22:00:50 +00:00
Eric Christopher	0369ad7053	Change parent hashing algorithm to be non-recursive and elaborate greatly on many comments in the code. llvm-svn: 187742	2013-08-05 21:40:57 +00:00
Michael Gottesman	6964f33fc9	[bugpoint] Allow the user to specify the path to opt on the commandline. llvm-svn: 187739	2013-08-05 21:07:07 +00:00
Peter Collingbourne	bace606657	Introduce an optimisation for special case lists with large numbers of literal entries. Our internal regex implementation does not cope with large numbers of anchors very efficiently. Given a ~3600-entry special case list, regex compilation can take on the order of seconds. This patch solves the problem for the special case of patterns matching literal global names (i.e. patterns with no regex metacharacters). Rather than forming regexes from literal global name patterns, add them to a StringSet which is checked before matching against the regex. This reduces regex compilation time by an order of roughly thousands when reading the aforementioned special case list, according to a completely unscientific study. No test cases. I figure that any new tests for this code should check that regex metacharacters are properly recognised. However, I could not find any documentation which documents the fact that the syntax of global names in special case lists is based on regexes. The extent to which regex syntax is supported in special case lists should probably be decided on/documented before writing tests. Differential Revision: http://llvm-reviews.chandlerc.com/D1150 llvm-svn: 187732	2013-08-05 17:48:04 +00:00
Peter Collingbourne	fe8cd75971	Introduce Regex::isLiteralERE function. This will be used to implement an optimisation for literal entries in special case lists. Differential Revision: http://llvm-reviews.chandlerc.com/D1278 llvm-svn: 187731	2013-08-05 17:47:59 +00:00
Aaron Ballman	5b4634576e	Silencing an MSVC11 type conversion warning. llvm-svn: 187727	2013-08-05 13:47:03 +00:00
Alexey Samsonov	f52b717db3	80-cols llvm-svn: 187725	2013-08-05 13:19:49 +00:00
Elena Demikhovsky	62d19c8bdf	LLVM Interpreter: This patch implements vector support for cast operations (zext, sext, uitofp, sitofp, trunc, fpext, fptosi, fptrunc, bitcast) and shift operations (shl, ashr, lshr) for integer and floating point data types. Added tests. Done by Yuri Veselov (mailto:Yuri.Veselov@intel.com). llvm-svn: 187724	2013-08-05 12:17:06 +00:00
Richard Sandiford	c212125d27	[SystemZ] Use BRCT and BRCTG to eliminate add-&-compare sequences This patch just uses a peephole test for "add; compare; branch" sequences within a single block. The IR optimizers already convert loops to decrement-and-branch-on-nonzero form in some cases, so even this simplistic test triggers many times during a clang bootstrap and projects/test-suite run. It looks like there are still cases where we need to more strongly prefer branches on nonzero though. E.g. I saw a case where a loop that started out with a check for 0 ended up with a check for -1. I'll try to look at that sometime. I ended up adding the Reference class because MachineInstr::readsRegister() doesn't check for subregisters (by design, as far as I could tell). llvm-svn: 187723	2013-08-05 11:23:46 +00:00
Benjamin Kramer	483b9fbddb	Don't leak passes if added outside of the area determined by Started/Stopped flags. llvm-svn: 187722	2013-08-05 11:11:11 +00:00
Richard Sandiford	9795d8e611	[SystemZ] Add definitions for BRCT and BRCTG llvm-svn: 187721	2013-08-05 11:07:38 +00:00
Richard Sandiford	b49a3ab262	[SystemZ] Use LOAD AND TEST to eliminate comparisons against zero llvm-svn: 187720	2013-08-05 11:03:20 +00:00
Richard Sandiford	c62c64a038	[SystemZ] Add LOAD AND TEST instructions Just the definitions and MC support. The next patch uses them for codegen. llvm-svn: 187719	2013-08-05 11:00:53 +00:00
Richard Sandiford	bdbb8af7e6	[SystemZ] Split out comparison elimination into a separate pass Perhaps predictably, doing comparison elimination on the fly during SystemZLongBranch turned out to be a bad idea. The next patches make use of LOAD AND TEST and BRANCH ON COUNT, both of which require changes to earlier instructions. No functionality change intended. llvm-svn: 187718	2013-08-05 10:58:53 +00:00
Elena Demikhovsky	40864b690b	AVX-512 set: added mask operations, lowering BUILD_VECTOR for i1 vector types. Added intrinsics and tests. llvm-svn: 187717	2013-08-05 08:52:21 +00:00
Nadav Rotem	eae928acd2	Update the release notes about the status of the vectorizers. llvm-svn: 187714	2013-08-05 04:31:05 +00:00
Nadav Rotem	2da8b3e99e	Update the docs. llvm-svn: 187713	2013-08-05 04:27:34 +00:00
Reed Kotler	9c285b300d	Add the saving of S2. This is needed for some of the floating point helper functions. This can be optimized out later when the remaining parts of the helper function work is moved into the Mips16HardFloat pass. For now it forces us to use the 32 bit save/restore instructions instead of the 16 bit ones. llvm-svn: 187712	2013-08-04 23:56:53 +00:00
Bob Wilson	b9549baf7f	Remove "lto_on_osx" xfails, now that -rdynamic works on Darwin. Note that this will require a recent version of the linker for Darwin builds with LTO to pass these tests. llvm-svn: 187711	2013-08-04 23:55:24 +00:00
Bob Wilson	9fcf545575	Build with the $RDYNAMIC flag on Darwin as well as other platforms. Part of <rdar://problem/14620988> llvm-svn: 187710	2013-08-04 22:06:11 +00:00
Benjamin Kramer	5bc180c14f	X86: Turn fp selects into mask operations. double test(double a, double b, double c, double d) { return a<b ? c : d; } before: _test: ucomisd %xmm0, %xmm1 ja LBB0_2 movaps %xmm3, %xmm2 LBB0_2: movaps %xmm2, %xmm0 after: _test: cmpltsd %xmm1, %xmm0 andpd %xmm0, %xmm2 andnpd %xmm3, %xmm0 orpd %xmm2, %xmm0 Small speedup on Benchmarks/SmallPT llvm-svn: 187706	2013-08-04 12:05:16 +00:00
Elena Demikhovsky	cd46691728	AVX-512 set: added VEXTRACTPS instruction llvm-svn: 187705	2013-08-04 10:46:07 +00:00
Tim Northover	adb550068a	X86: specify CPU on new test to fix atom buildbot Apparently Atoms use lea for stack adjustment, which we weren't looking for. llvm-svn: 187704	2013-08-04 10:00:45 +00:00
Tim Northover	ecc018c7b7	X86: correct tail return address calculation Due to the weird and wondeful usual arithmetic conversions, some calculations involving negative values were getting performed in uint32_t and then promoted to int64_t, which is really not a good idea. Patch by Katsuhiro Ueno. llvm-svn: 187703	2013-08-04 09:35:57 +00:00
Benjamin Kramer	1df3a1f678	AsmParser: Store MacroLikeBodies on the side so they don't get leaked. llvm-svn: 187702	2013-08-04 09:06:29 +00:00
Reed Kotler	30cedf65ef	Clean up code for Mips16 large frame handling. llvm-svn: 187701	2013-08-04 01:13:25 +00:00
Benjamin Kramer	72d45cc846	PPCAsmParser: Stop leaking names. Store them in a place that gets cleaned up properly. llvm-svn: 187700	2013-08-03 22:43:29 +00:00
Benjamin Kramer	5d62ad2aff	Unbreak llvm-rtdyld build. llvm-svn: 187699	2013-08-03 22:18:45 +00:00
Benjamin Kramer	097e09abba	MachObjectFile: Don't leak on error. llvm-svn: 187698	2013-08-03 22:16:37 +00:00
Benjamin Kramer	9ce7708abb	llvm-rtdyld: Don't leak memory managers. Dyld never outlives MemMgr, just put both on the stack. llvm-svn: 187697	2013-08-03 22:16:31 +00:00
Benjamin Kramer	23632bd466	ARMAsmParser: Plug a leak. Using an object to do the cleanup may look like overkill, but it's safer and nicer than putting deletes everywhere. llvm-svn: 187696	2013-08-03 22:16:24 +00:00
Benjamin Kramer	dcfd5b525a	Stop leaking register infos in the disassemblers. llvm-svn: 187695	2013-08-03 22:16:16 +00:00
Hal Finkel	b176acb6b7	Fix PPC64 64-bit GPR inline asm constraint matching Internally, the PowerPC backend names the 32-bit GPRs R[0-9]+, and names the 64-bit parent GPRs X[0-9]+. When matching inline assembly constraints with explicit register names, on PPC64 when an i64 MVT has been requested, we need to follow gcc's convention of using r[0-9]+ to refer to the 64-bit (parent) registers. At some point, we'll probably want to arrange things so that the generic code in TargetLowering uses the AsmName fields declared in *RegisterInfo.td in order to match these inline asm register constraints. If we do that, this change can be reverted. llvm-svn: 187693	2013-08-03 12:25:10 +00:00
Matt Arsenault	2f9cce2cd6	Minor address space code simplification. Remove assertion that the verifier should catch. llvm-svn: 187692	2013-08-03 01:03:12 +00:00
Bob Wilson	8d7e6906d1	Regenerate with changes for -rdynamic. llvm-svn: 187687	2013-08-02 22:51:11 +00:00
Bob Wilson	8658b9147d	Link with -rdynamic instead of -Wl,-export-dynamic. Recent versions of the OS X linker support this but follow the existing OS X linker convention of using an underscore in the option name, i.e., -export_dynamic. Rather than changing our configure scripts to check for that alternate spelling, it is simpler to just use the compiler's -rdynamic option and let it deal with translating that to the appropriate linker option. One potential disadvantage of this approach is that the compiler will typically ignore -rdynamic on platforms where it is not supported, so the HAVE_LINK_EXPORT_DYNAMIC in config.h will not necessarily show whether that option has any effect or not. I don't see any in-tree uses of that macro, so I'm assuming it is OK. llvm-svn: 187686	2013-08-02 22:51:06 +00:00
Peter Collingbourne	abca2ecaab	Add a AttributeSetImpl::dump function. This is for the benefit of those of us with inferior debuggers which do not permit member function calls on value types. llvm-svn: 187685	2013-08-02 22:34:30 +00:00
Peter Collingbourne	bd6c7459bb	Make one of the AttributeSet ctors maintain the invariant that the attribute list is ordered by index. Differential Revision: http://llvm-reviews.chandlerc.com/D1265 llvm-svn: 187682	2013-08-02 22:29:40 +00:00
Hans Wennborg	b8f3420d1e	Option parsing: recognize the special -- token Everything that comes after -- should be treated as a filename. This enables passing in filenames that would otherwise be conflated with command-line options. This is especially important for clang-cl which supports options starting with /, which are easily conflatable with Unix-style path names. Differential Revision: http://llvm-reviews.chandlerc.com/D1274 llvm-svn: 187675	2013-08-02 21:20:27 +00:00
Hal Finkel	e9efbf140b	Fix invalid function pointers in bugpoint ExtractLoops The ExtractLoops function tries to reduce the failing test case by extracting one or more loops from the misoptimized piece of the program. In doing this, ExtractLoops must keep the MiscompiledFunctions vector up-to-date by ensuring that the pointers refer to functions in the current failing program. Unfortunately, this is not trivial because: - ExtractLoops is iterative, and there are several early exits (and the MiscompiledFunctions vector must be consistent with the current program at every non-fatal exit point). - Several of the utility functions used by ExtractLoops (such as TestOptimizer, some of which are called through the TestFn callback parameter, and Linker::LinkModules) delete their inputs upon success. This change adds several updates of the MiscompiledFunctions vector at different points. The first is after the initial call to TestMergedProgram which checks that the loop-extracted program still works. The second is after the call to TestFn (TestOptimizer, for example). This function will delete its inputs (which is why the existing ExtractLoops logic cloned the inputs first). llvm-svn: 187674	2013-08-02 21:13:42 +00:00
Joey Gouly	fcf6778172	Add a missing 'return' statement. llvm-svn: 187671	2013-08-02 20:50:01 +00:00
Akira Hatanaka	7be35cb1bf	[mips] Expand vector truncating stores and extending loads. llvm-svn: 187667	2013-08-02 19:23:33 +00:00
Joey Gouly	5d0564d2e6	[ARMv8] Add an assembler warning for the deprecated 'setend' instruction. llvm-svn: 187666	2013-08-02 19:18:12 +00:00
Nadav Rotem	5defea90e6	SLPVectorizer: Fix PR16777. PHInodes may use multiple extracted values that come from different blocks. Thanks Alexey Samsonov. llvm-svn: 187663	2013-08-02 18:40:24 +00:00
Matt Arsenault	0e5df35556	Teach EmitGEPOffset about address spaces llvm-svn: 187662	2013-08-02 18:33:34 +00:00
Renato Golin	0178a25fc5	Fixes ARM LNT bot from SLP change in O3 This patch fixes the multiple breakages on ARM test-suite after the SLP vectorizer was introduced by default on O3. The problem was an illegal vector type on ARMTTI::getCmpSelInstrCost() <3 x i1> which is not simple. The guard protects this code from breaking (cause of the problems) but doesn't fix the issue that is generating the odd vector in the first place, which also needs to be investigated. llvm-svn: 187658	2013-08-02 17:10:04 +00:00
Carlo Kok	4382da983a	Bugfix for making the DWARF debug strings and labels to code emitted as secrel32 instead of long opcodes (only for coff). This makes them debuggable with GDB (with fix for 64bits msvc) llvm-svn: 187656	2013-08-02 16:14:15 +00:00
Tim Northover	cf708c3284	Fix handling of CHECK-DAG combined with CHECK-NOT Patch by Daniel Sanders. llvm-svn: 187651	2013-08-02 11:32:50 +00:00
Duncan Sands	3194fca45f	Pacify GCC, which worries about falling off the end of the switch. llvm-svn: 187649	2013-08-02 09:37:20 +00:00
Alexey Samsonov	9096968de5	Fix dereferencing end iterator in SimplifyCFG. Patch by Ye Mei. llvm-svn: 187646	2013-08-02 08:06:43 +00:00
NAKAMURA Takumi	6fda3b4b86	Revert r187597, "Bugfix for making the DWARF debug strings and labels to code emitted as secrel32 instead of long opcodes (only for coff). This makes them debuggable with GDB." It broke x86_64-win32 builder in llvm/test/DebugInfo. llvm-svn: 187642	2013-08-02 03:46:05 +00:00
Eric Christopher	3b5ea5178d	Use @rpath for libraries rather than @executable_path on OSX. Patch by Benjamin Scarlet! llvm-svn: 187641	2013-08-02 01:51:52 +00:00
Eric Christopher	cdc78961d3	Temporarily revert "Debug Info Finder\|Verifier: handle DbgLoc attached to instructions." in an attempt to bring back some bots. This reverts commit r187609. llvm-svn: 187638	2013-08-02 00:49:44 +00:00
Matt Arsenault	1c349ef7e8	Teach InstructionSimplify about pointer address spaces llvm-svn: 187635	2013-08-02 00:10:44 +00:00
Akira Hatanaka	21f334372e	[mips] Make load/store accumulator pseudo instructions codeGenOnly. Also, remove lines that are setting DecoderNamespace for pseudo atomic instructions. No intended functionality change. llvm-svn: 187632	2013-08-01 23:14:16 +00:00
Matt Arsenault	87dc60761f	Teach getOrEnforceKnownAlignment about address spaces llvm-svn: 187629	2013-08-01 22:42:18 +00:00
Nadav Rotem	e4e6e9ed47	Move the optlevel check to the frontend. llvm-svn: 187628	2013-08-01 22:41:58 +00:00
Carlo Kok	3e8c33cff5	fix for LLVM debug info on llvm-mips-linux where the label name uses % instead of L as a prefix. llvm-svn: 187623	2013-08-01 22:15:34 +00:00
Bill Wendling	a5c536e1ee	Use function attributes to indicate that we don't want to realign the stack. Function attributes are the future! So just query whether we want to realign the stack directly from the function instead of through a random target options structure. llvm-svn: 187618	2013-08-01 21:42:05 +00:00
Rafael Espindola	4d305dca52	Expose that the unique file ID has a device and a file component. The use of sd_dev and st_ino has reached libclang, so expose the two components in UniqueID so that we can use it in clang. llvm-svn: 187616	2013-08-01 21:36:02 +00:00
Daniel Malea	a3d4245a72	Fixed the Intel-syntax X86 disassembler to respect the (existing) option for hexadecimal immediates, to match AT&T syntax. This also brings a new option for C-vs-MASM-style hex. Patch by Richard Mitton Reviewed: http://llvm-reviews.chandlerc.com/D1243 llvm-svn: 187614	2013-08-01 21:18:16 +00:00
Reed Kotler	83f879ddb2	Fix some issues with Mips16 floating when certain intrinsics are present. This is actually an LLVM bug in the way it generates signatures for these when soft float is enabled. For example, floor ends up having the signature of int64(int64). The signature part is not the same as where the actual parameter types are recorded, and those ARE of course int64(int64) when soft float is enabled. (Yes, Mips16 hard float uses soft float but with different runtime rounes but then has to interoperate with Mips32 using normal floating point). This logic will eventually be moved to the Mips16HardFloat pass so it's not worth sorting out these issues in LLVM since nobody but Mips16 cares about these signatures, as far as I know, and even I won't eventually either. llvm-svn: 187613	2013-08-01 21:17:53 +00:00
Carlo Kok	aad6a6a3e0	ARM/Hexagon testcases can't compile x86 only testcase. Reverting change to testcase & fixing check for all. llvm-svn: 187610	2013-08-01 20:53:57 +00:00
Manman Ren	4c065e779c	Debug Info Finder\|Verifier: handle DbgLoc attached to instructions. Also remove checking of llvm.dbg.sp since it is not used in generating dwarf. Current state of Finder: DebugInfoFinder tries to list all debug info MDNodes used in a module. To list debug info MDNodes used by an instruction, DebugInfoFinder provides processDeclare, processValue and processLocation to handle DbgDeclareInst, DbgValueInst and DbgLoc attached to instructions. processModule will go through all DICompileUnits in llvm.dbg.cu and list debug info MDNodes used by the CUs. TODO: 1> Finder has a list of CUs, SPs, Types, Scopes and global variables. We need to add a list of variables that are used by DbgDeclareInst and DbgValueInst. 2> MDString fields should be null or isa<MDString> and MDNode fields should be null or isa<MDNode>. We currently use empty string or int 0 to represent null. 3> Go though Verify functions and make sure that they check field types. 4> Clean up existing testing cases to remove llvm.dbg.sp and make sure each testing case has a llvm.dbg.cu. llvm-svn: 187609	2013-08-01 20:52:39 +00:00
David Blaikie	a1ae0e6ecb	DebugInfo: Emit definitions for types with no members. The absence of members was a poor/incorrect proxy for "is definition". llvm-svn: 187607	2013-08-01 20:30:22 +00:00
Carlo Kok	d0b09c42a3	change the inlinefnlocalvar testcase so it uses a triple that's not coff (doesn't seem to matter for the testcase itself, what it tests isn't triple specific), as coff has a slightly different way of emitting what it checks for. llvm-svn: 187604	2013-08-01 20:17:37 +00:00
Bob Wilson	dd52d58680	Temporarily xfail a test that breaks on OS X when building with LTO. This is another case where internalize hides a symbol that is needed by a loadable module. I am currently investigating a proper fix but this patch will get our buildbot to pass in the meantime. <rdar://problem/14578094> llvm-svn: 187601	2013-08-01 19:29:26 +00:00
Sean Silva	d544a9dc87	Update incorrect file headers. One of these was spotted in review by Rafael. llvm-svn: 187598	2013-08-01 18:42:28 +00:00
Carlo Kok	afcc62024e	Bugfix for making the DWARF debug strings and labels to code emitted as secrel32 instead of long opcodes (only for coff). This makes them debuggable with GDB. fixes Bug 16249 - LLVM generates broken debug info on Windows llvm-svn: 187597	2013-08-01 18:38:14 +00:00
Nadav Rotem	9153b3871d	Only enable SLP-vectorization on O3 builds. llvm-svn: 187595	2013-08-01 18:28:15 +00:00
Simon Atanasyan	cc53ec4c84	Pass -G argument to cmake with the same generator's name as used for the initial cmake invocation. Patch reviewed by Reid Kleckner. llvm-svn: 187591	2013-08-01 18:04:07 +00:00
Robert Lytton	4075315ad0	remove executable permission from IntrinsicsXCore.td llvm-svn: 187584	2013-08-01 17:17:59 +00:00
Tom Stellard	0344cdfe39	R600: Add 64-bit float load/store support * Added R600_Reg64 class * Added T#Index#.XY registers definition * Added v2i32 register reads from parameter and global space * Added f32 and i32 elements extraction from v2f32 and v2i32 * Added v2i32 -> v2f32 conversions Tom Stellard: - Mark vec2 operations as expand. The addition of a vec2 register class made them all legal. Patch by: Dmitry Cherkassov Signed-off-by: Dmitry Cherkassov <dcherkassov@gmail.com> llvm-svn: 187582	2013-08-01 15:23:42 +00:00
Tom Stellard	53698938a4	R600: Use 64-bit alignment for 64-bit kernel arguments llvm-svn: 187581	2013-08-01 15:23:31 +00:00
Tom Stellard	98f675a994	R600/SI: Custom lower i64 ZERO_EXTEND llvm-svn: 187580	2013-08-01 15:23:26 +00:00
Elena Demikhovsky	b1266b5447	EVEX and compressed displacement encoding for AVX512 llvm-svn: 187576	2013-08-01 13:34:06 +00:00
Richard Sandiford	fd7f4ae6d4	[SystemZ] Reuse CC results for integer comparisons with zero This also fixes a bug in the predication of LR to LOCR: I'd forgotten that with these in-place instruction builds, the implicit operands need to be added manually. I think this was latent until now, but is tested by int-cmp-45.c. It also adds a CC valid mask to STOC, again tested by int-cmp-45.c. llvm-svn: 187573	2013-08-01 10:39:40 +00:00
Richard Sandiford	a075708abe	[SystemZ] Prefer comparisons with zero Convert >= 1 to > 0, etc. Using comparison with zero isn't a win on its own, but it exposes more opportunities for CC reuse (the next patch). llvm-svn: 187571	2013-08-01 10:29:45 +00:00
Vladimir Medic	deaa618cdc	Add tests for Mips DSP instructions. llvm-svn: 187570	2013-08-01 09:35:25 +00:00
Vladimir Medic	d3dade29f5	Moving definition of MnemonicContainsDot field from class Instruction to class AsmParser as suggested. llvm-svn: 187569	2013-08-01 09:25:27 +00:00
Tim Northover	40e9efd725	AArch64: add initial NEON support Patch by Ana Pazos. - Completed implementation of instruction formats: AdvSIMD three same AdvSIMD modified immediate AdvSIMD scalar pairwise - Completed implementation of instruction classes (some of the instructions in these classes belong to yet unfinished instruction formats): Vector Arithmetic Vector Immediate Vector Pairwise Arithmetic - Initial implementation of instruction formats: AdvSIMD scalar two-reg misc AdvSIMD scalar three same - Intial implementation of instruction class: Scalar Arithmetic - Initial clang changes to support arm v8 intrinsics. Note: no clang changes for scalar intrinsics function name mangling yet. - Comprehensive test cases for added instructions To verify auto codegen, encoding, decoding, diagnosis, intrinsics. llvm-svn: 187567	2013-08-01 09:20:35 +00:00
Robert Lytton	ba05bfb4f6	XCore target: add GCCBuiltin to four intrinsics The following are made available by clang in the XCore ABI __builtin_bitrev __builtin_getid __builtin_getps __builtin_setps llvm-svn: 187566	2013-08-01 08:41:32 +00:00
Robert Lytton	4be00f8ad1	XCore target: Fix Vararg handling llvm-svn: 187565	2013-08-01 08:29:44 +00:00
Robert Lytton	4e60a3f4e3	XCore target: Add byval handling llvm-svn: 187563	2013-08-01 08:18:55 +00:00
Robert Lytton	b4787a159d	Xcore target Fix emitArrayBound() calling OutStreamer.Emit*() multiple times when trying to print a single line llvm-svn: 187562	2013-08-01 07:52:05 +00:00
Reed Kotler	302ae6b002	Fix some misc. issues with Mips16 fp stubs. 1) They should never be inlined. 2) A naming inconsistency with gcc mips16 3) Stubs should not have the global attribute llvm-svn: 187555	2013-08-01 02:26:31 +00:00
Eric Christopher	a8d49794e4	Formatting. llvm-svn: 187554	2013-08-01 01:38:16 +00:00
Reed Kotler	fd132b99bb	Add an omitted IsCall=1. llvm-svn: 187553	2013-08-01 00:59:06 +00:00
Hans Wennborg	8669b97490	Option parsing: remove non-SUPPORT_ALIASARGS fall-back The clients of this code have been updated to all support AliasArgs. This depends on Clang r187538 and lld r187541. llvm-svn: 187546	2013-07-31 23:28:51 +00:00
Hans Wennborg	5fdcf86861	Option parsing: add support for alias arguments. This makes option aliases more powerful by enabling them to pass along arguments to the option they're aliasing. For example, if we have a joined option "-foo=", we can now specify a flag option "-bar" to be an alias of that, with the argument "baz". This is especially useful for the cl.exe compatible clang driver, where many options are aliases. For example, this patch enables us to alias "/Ox" to "-O3" (-O is a joined option), and "/WX" to "-Werror" (again, -W is a joined option). Differential Revision: http://llvm-reviews.chandlerc.com/D1245 llvm-svn: 187537	2013-07-31 22:44:41 +00:00
Nadav Rotem	25f15358d2	80-col llvm-svn: 187535	2013-07-31 22:17:45 +00:00
Andrew Trick	753663ccce	comment typo. llvm-svn: 187531	2013-07-31 21:05:54 +00:00
Kevin Enderby	78f9572f39	Added the B9.3.19 SUBS PC, LR, #imm (Thumb2) system instruction. While the .td entry is nice and all, it takes a pretty gross hack in ARMAsmParser::ParseInstruction() because of handling of other "subs" instructions to get it to match. Ran it by Jim Grosbach and he said it was about what he expected to make this work given the existing code. rdar://14214063 llvm-svn: 187530	2013-07-31 21:05:30 +00:00
Tom Stellard	ca69a53bae	Revert "R600: Non vector only instruction can be scheduled on trans unit" This reverts commit 98ce62780ea7185ba710868bf83c8077e8d7f6d6. llvm-svn: 187526	2013-07-31 20:43:27 +00:00
Tom Stellard	0ebf29d41f	Revert "TableGen: Enumerate Schedule Model too." This reverts commit 2ca1e4a39c7e0d7a00e66ff5437c6d7ace2404a0. llvm-svn: 187525	2013-07-31 20:43:08 +00:00
Tom Stellard	4dd41845ec	Revert "R600: Use SchedModel enum for is{Trans,Vector}Only functions" This reverts commit 3f1de26cb5cc0543a6a1d71259a7a39d97139051. llvm-svn: 187524	2013-07-31 20:43:03 +00:00
Vincent Lejeune	220db748b0	R600: Do not mergevector after a vector reg is used If we merge vector when a vector is used, it will generate an artificial antidependency that can prevent 2 tex/vtx instructions to use the same clause and thus generate extra clauses that reduce performance. There is no test case as such situation is really hard to predict. llvm-svn: 187516	2013-07-31 19:32:12 +00:00
Vincent Lejeune	bb3f931123	R600: Avoid more than 4 literals in the same instruction group at scheduling llvm-svn: 187515	2013-07-31 19:32:07 +00:00
Vincent Lejeune	df18804e26	R600: Non vector only instruction can be scheduled on trans unit llvm-svn: 187514	2013-07-31 19:31:56 +00:00
Vincent Lejeune	21de8baa15	R600: Don't mix LDS and non-LDS instructions in the same group There are a lot of restrictions on instruction groups that contain LDS instructions, so for now we will be conservative and not packetize anything else with them. llvm-svn: 187513	2013-07-31 19:31:41 +00:00
Vincent Lejeune	79afe17e99	R600: Use SchedModel enum for is{Trans,Vector}Only functions llvm-svn: 187512	2013-07-31 19:31:35 +00:00
Vincent Lejeune	22e6ddd475	TableGen: Enumerate Schedule Model too. llvm-svn: 187511	2013-07-31 19:31:20 +00:00
Vincent Lejeune	0c5ed2b437	R600: Remove predicated_break inst We were using two instructions for similar purpose : break and predicated break. Only predicated_break was emitted and it was lowered at R600ControlFlowFinalizer to JUMP;CF_BREAK;POP. This commit simplify the situation by making AMDILCFGStructurizer emit IF_PREDICATE;BREAK;ENDIF; instead of predicated_break (which is now removed). There is no functionality change. llvm-svn: 187510	2013-07-31 19:31:14 +00:00
Matt Arsenault	24b49c411c	Reject bitcasts between address spaces with different sizes llvm-svn: 187506	2013-07-31 17:49:08 +00:00
Richard Sandiford	791bea4182	[SystemZ] Implement isLegalAddressingMode() The loop optimizers were assuming that scales > 1 were OK. I think this is actually a bug in TargetLoweringBase::isLegalAddressingMode(), since it seems to be trying to reject anything that isn't r+i or r+r, but it has no default case for scales other than 0, 1 or 2. Implementing the hook for z means that z can no longer test any change there though. llvm-svn: 187497	2013-07-31 12:58:26 +00:00
Richard Sandiford	ee8343822e	[SystemZ] Be more careful about inverting CC masks (conditional loads) Extend r187495 to conditional loads. I split this out because the easiest way seemed to be to force a particular operand order in SystemZISelDAGToDAG.cpp. llvm-svn: 187496	2013-07-31 12:38:08 +00:00
Richard Sandiford	3d768e334b	[SystemZ] Be more careful about inverting CC masks System z branches have a mask to select which of the 4 CC values should cause the branch to be taken. We can invert a branch by inverting the mask. However, not all instructions can produce all 4 CC values, so inverting the branch like this can lead to some oddities. For example, integer comparisons only produce a CC of 0 (equal), 1 (less) or 2 (greater). If an integer EQ is reversed to NE before instruction selection, the branch will test for 1 or 2. If instead the branch is reversed after instruction selection (by inverting the mask), it will test for 1, 2 or 3. Both are correct, but the second isn't really canonical. This patch therefore keeps track of which CC values are possible and uses this when inverting a mask. Although this is mostly cosmestic, it fixes undefined behavior for the CIJNLH in branch-08.ll. Another fix would have been to mask out bit 0 when generating the fused compare and branch, but the point of this patch is that we shouldn't need to do that in the first place. The patch also makes it easier to reuse CC results from other instructions. llvm-svn: 187495	2013-07-31 12:30:20 +00:00
Richard Sandiford	8a757bba10	[SystemZ] Move compare-and-branch generation even later r187116 moved compare-and-branch generation from the instruction-selection pass to the peephole optimizer (via optimizeCompare). It turns out that even this is a bit too early. Fused compare-and-branch instructions don't interact well with predication, where a CC result is needed. They also make it harder to reuse the CC side-effects of earlier instructions (not yet implemented, but the subject of a later patch). Another problem was that the AnalyzeBranch family of routines weren't handling compares and branches, so we weren't able to reverse the fused form in cases where we would reverse a separate branch. This could have been fixed by extending AnalyzeBranch, but given the other problems, I've instead moved the fusing to the long-branch pass, which is also responsible for the opposite transformation: splitting out-of-range compares and branches into separate compares and long branches. I've added a test for the AnalyzeBranch problem. A test for the predication problem is included in the next patch, which fixes a bug in the choice of CC mask. llvm-svn: 187494	2013-07-31 12:11:07 +00:00
Elena Demikhovsky	b0a75431ad	Fixed assertion in Extract128BitVector() llvm-svn: 187493	2013-07-31 12:03:08 +00:00
Richard Sandiford	6a06ba36ba	[SystemZ] Postpone NI->RISBG conversion to convertToThreeAddress() r186399 aggressively used the RISBG instruction for immediate ANDs, both because it can handle some values that AND IMMEDIATE can't, and because it allows the destination register to be different from the source. I realized later while implementing the distinct-ops support that it would be better to leave the choice up to convertToThreeAddress() instead. The AND IMMEDIATE form is shorter and is less likely to be cracked. This is a problem for 32-bit ANDs because we assume that all 32-bit operations will leave the high word untouched, whereas RISBG used in this way will either clear the high word or copy it from the source register. The patch uses the z196 instruction RISBLG for this instead. This means that z10 will be restricted to NILL, NILH and NILF for 32-bit ANDs, but I think that should be OK for now. Although we're using z10 as the base architecture, the optimization work is going to be focused more on z196 and zEC12. llvm-svn: 187492	2013-07-31 11:36:35 +00:00
Elena Demikhovsky	67b05fc0b3	Added INSERT and EXTRACT intructions from AVX-512 ISA. All insertf/extractf functions replaced with insert/extract since we have insertf and inserti forms. Added lowering for INSERT_VECTOR_ELT / EXTRACT_VECTOR_ELT for 512-bit vectors. Added lowering for EXTRACT/INSERT subvector for 512-bit vectors. Added a test. llvm-svn: 187491	2013-07-31 11:35:14 +00:00
Richard Sandiford	6cf80b3ec0	[SystemZ] Add RISBLG and RISBHG instruction definitions The next patch will make use of RISBLG for codegen. llvm-svn: 187490	2013-07-31 11:17:35 +00:00
Richard Trieu	8dc432314e	Add parentheses to silence gcc warning. llvm-svn: 187482	2013-07-31 04:07:28 +00:00
Andrew Trick	9447cce0ed	Fix register pressure tables on ARM. The heuristic that merges register pressure sets was bogus for ARM's S/D regs. llvm-svn: 187479	2013-07-31 03:24:31 +00:00
Andrew Trick	301dd8d795	Add tracing to the tblgen register pressure table generator. llvm-svn: 187478	2013-07-31 03:24:28 +00:00
Craig Topper	62cb2bc837	Increment arg_count inside the loop in printInline. Patch by Joe Matarazzo. llvm-svn: 187477	2013-07-31 03:22:07 +00:00
Craig Topper	efd67d4612	Changed register names (and pointer keywords) to be lower case when using Intel X86 assembler syntax. Patch by Richard Mitton. llvm-svn: 187476	2013-07-31 02:47:52 +00:00
Andrew Trick	c3bc8b8de6	Fix a severe compile time problem when forming large SCEV expressions. This fix is very lightweight. The same fix already existed for AddRec but was missing for NAry expressions. This is obviously an improvement and I'm unsure how to test compile time problems. Patch by Xiaoyi Guo! llvm-svn: 187475	2013-07-31 02:43:40 +00:00
Craig Topper	75a5ba7ed0	Remove trailing whitespace and some tab characters. llvm-svn: 187472	2013-07-31 02:00:15 +00:00
Craig Topper	6e8cd80def	Fixed incorrect disassembly for MOV16o16a when using Intel syntax. Patch by Richard Mitton. llvm-svn: 187471	2013-07-31 01:50:26 +00:00
Eric Christopher	e6656ac870	Fix crashing on invalid inline asm with matching constraints. For a testcase like the following: typedef unsigned long uint64_t; typedef struct { uint64_t lo; uint64_t hi; } blob128_t; void add_128_to_128(const blob128_t in, blob128_t res) { asm ("PAND %1, %0" : "+Q"(res) : "Q"(in)); } where we'll fail to allocate the register for the output constraint, our matching input constraint will not find a register to match, and could try to search past the end of the current operands array. On the idea that we'd like to attempt to keep compilation going to find more errors in the module, change the error cases when we're visiting inline asm IR to return immediately and avoid trying to create a node in the DAG. This leaves us with only a single error message per inline asm instruction, but allows us to safely keep going in the general case. llvm-svn: 187470	2013-07-31 01:26:24 +00:00
Akira Hatanaka	d6445686a9	[mips] Rename instruction DANDi to ANDi64. No functionality change. llvm-svn: 187469	2013-07-31 00:57:41 +00:00
Akira Hatanaka	f8fff213d5	[mips] Define instruction itineraries IIArith and IILogic. No functionality change. llvm-svn: 187468	2013-07-31 00:55:34 +00:00
Matt Arsenault	065ced9bed	Fix ptr vector inconsistency in CreatePointerCast One form would accept a vector of pointers, and the other did not. Make both accept vectors of pointers, and add an assertion for the number of elements. llvm-svn: 187464	2013-07-31 00:17:33 +00:00
Rafael Espindola	107b74c6c3	Fix windows' implementation of status when a file doesn't exist. The unix one was returning no_such_file_or_directory, but the windows one was return success. Update the one one caller that was depending on the old behavior. llvm-svn: 187463	2013-07-31 00:10:25 +00:00
Owen Anderson	c7be519dc0	Preserve fast-math flags when folding (fsub x, (fneg y)) to (fadd x, y). llvm-svn: 187462	2013-07-30 23:53:17 +00:00
Eric Christopher	029af15086	Reflow this to be easier to read. llvm-svn: 187459	2013-07-30 22:50:44 +00:00
Eric Christopher	ca6384aeeb	Make these just inline, not static inline. llvm-svn: 187457	2013-07-30 22:35:06 +00:00
Eric Christopher	83afc1e43b	Make sure that -gsplit-dwarf isn't passed to the linker. llvm-svn: 187456	2013-07-30 22:34:30 +00:00
Matt Arsenault	130e0ef6f4	Respect address space sizes in isEliminableCastPair. This avoids constant folding bitcast/ptrtoint/inttoptr combinations that have illegal bitcasts between differently sized address spaces. llvm-svn: 187455	2013-07-30 22:27:10 +00:00
Matt Arsenault	b4019ae13c	Revert "Remove isCastable since nothing uses it now" Apparently dragonegg uses it. llvm-svn: 187454	2013-07-30 22:02:14 +00:00
Eric Christopher	3987806087	Add capability for building with -gsplit-dwarf to the cmake build. In limited testing this seems to work. Caveat emptor. llvm-svn: 187452	2013-07-30 21:44:10 +00:00
Matt Arsenault	f63dfbb198	Remove isCastable since nothing uses it now llvm-svn: 187448	2013-07-30 21:11:17 +00:00
David Majnemer	b7d5409ad2	isKnownToBeAPowerOfTwo: Strengthen isKnownToBeAPowerOfTwo's analysis on add instructions Call into ComputeMaskedBits to figure out which bits are set on both add operands and determine if the value is a power-of-two-or-zero or not. llvm-svn: 187445	2013-07-30 21:01:36 +00:00
Matt Arsenault	cacbb2377a	Change behavior of calling bitcasted alias functions. It will now only convert the arguments / return value and call the underlying function if the types are able to be bitcasted. This avoids using fp<->int conversions that would occur before. llvm-svn: 187444	2013-07-30 20:45:05 +00:00
Akira Hatanaka	8f69d7f0c0	[mips] Delete instruction format for "bal". llvm-svn: 187443	2013-07-30 20:42:19 +00:00
Andrew Trick	3f423dec77	This test may have been sensitive to the ARM ABI... llvm-svn: 187442	2013-07-30 20:34:59 +00:00
Rafael Espindola	a5932afef0	Implement getUniqueID for directories on windows. llvm-svn: 187441	2013-07-30 20:25:53 +00:00
Akira Hatanaka	5973e8371a	[mips] Define "bal" as a pseudo instruction. Also, fix bug in the InstAlias that turns "bal" into "bgezal". llvm-svn: 187440	2013-07-30 20:24:24 +00:00
Rafael Espindola	62b418e2de	Remove dead code. llvm-svn: 187439	2013-07-30 20:02:18 +00:00
Andrew Trick	c7934b3e37	Down-scale slot index distance to save bits. llvm-svn: 187438	2013-07-30 19:59:19 +00:00
Andrew Trick	9b866051e5	whitespace llvm-svn: 187437	2013-07-30 19:59:15 +00:00
Andrew Trick	9c17eab761	MI Sched: Track live-thru registers. When registers must be live throughout the scheduling region, increase the limit for the register class. Once we exceed the original limit, they will be spilled, and there's no point further reducing pressure. This isn't a perfect heuristics but avoids a situation where the scheduler could become trapped by trying to achieve the impossible. llvm-svn: 187436	2013-07-30 19:59:12 +00:00
Andrew Trick	d9761776bc	MI Sched fix: assert "Disconnected LRG within the scheduling region." llvm-svn: 187435	2013-07-30 19:59:08 +00:00
Venkatraman Govindaraju	fee76fac2f	[Sparc] Rewrite MBB's live-in registers for leaf functions. Also, add register i7 as a live-in if current function's return address is taken. This revision fixes PR16269. llvm-svn: 187433	2013-07-30 19:53:10 +00:00
Rui Ueyama	a2222b573b	Implement TokenizeWindowsCommandLine. This is a follow up patch for r187390 to implement the parser for the Windows-style command line. This should follow the rule as described at http://msdn.microsoft.com/en-us/library/windows/desktop/17w5ykft(v=vs.85).aspx Differential Revision: http://llvm-reviews.chandlerc.com/D1235 llvm-svn: 187430	2013-07-30 19:03:20 +00:00
Daniel Malea	788d126ca1	Fix parameter ordering bug in createDebugIRPass() - Thanks to Ilia Filippov for pointing out the inconsistency! llvm-svn: 187424	2013-07-30 16:16:11 +00:00
Tom Stellard	aa313d0a74	R600/SI: Expand vector fp <-> int conversions llvm-svn: 187421	2013-07-30 14:31:03 +00:00
Vladimir Medic	643b398786	This patch implements parsing of mips FCC register operands. The example instructions have been added to test files. llvm-svn: 187410	2013-07-30 10:12:14 +00:00
Bill Wendling	c02da467f4	Fix underscore to be the proper length. llvm-svn: 187406	2013-07-30 08:26:24 +00:00
Saleem Abdulrasool	0c2ee5a2cb	[ARM] check bitwidth in PerformORCombine When simplifying a (or (and B A) (and C ~A)) to a (VBSL A B C) ensure that the bitwidth of the second operands to both ands match before comparing the negation of the values. Split the check of the value of the second operands to the ands. Move the cast and variable declaration slightly higher to make it slightly easier to follow. Bug-Id: 16700 Signed-off-by: Saleem Abdulrasool <compnerd@compnerd.org> llvm-svn: 187404	2013-07-30 04:43:08 +00:00
Rafael Espindola	d20830dfe5	Remove more dead documentation. llvm-svn: 187403	2013-07-30 04:06:06 +00:00
Venkatraman Govindaraju	fdcc498a25	[Sparc] Use call's debugloc for the unimp instruction. llvm-svn: 187402	2013-07-30 02:26:29 +00:00
Bill Schmidt	0cf702fa61	[PowerPC] Skeletal FastISel support for 64-bit PowerPC ELF. This is the first of many upcoming patches for PowerPC fast instruction selection support. This patch implements the minimum necessary for a functional (but extremely limited) FastISel pass. It allows the table-generated portions of the selector to be created and used, but in most cases selection will fall back to the DAG selector. None of the block terminator instructions are implemented yet, and most interesting instructions require some special handling. Therefore there aren't any new test cases with this patch. There will be quite a few tests coming with future patches. This patch adds the make/CMake support for the new code (including tablegen -gen-fast-isel) and creates the FastISel object for PPC64 ELF only. It instantiates the necessary virtual functions (TargetSelectInstruction, TargetMaterializeConstant, TargetMaterializeAlloca, tryToFoldLoadIntoMI, and FastLowerArguments), but of these, only TargetMaterializeConstant contains any useful implementation. This is present since the table-generated code requires the ability to materialize integer constants for some instructions. This patch has been tested by building and running the projects/test-suite code with -O0. All tests passed with the exception of a couple of long-running tests that time out using -O0 code generation. llvm-svn: 187399	2013-07-30 00:50:39 +00:00
Quentin Colombet	e2e0548d77	[R600] Replicate old DAGCombiner behavior in target specific DAG combine. build_vector is lowered to REG_SEQUENCE, which is something the register allocator does a good job at optimizing. llvm-svn: 187397	2013-07-30 00:27:16 +00:00
Quentin Colombet	6bf4baa408	[DAGCombiner] insert_vector_elt: Avoid building a vector twice. This patch prevents the following combine when the input vector is used more than once. insert_vector_elt (build_vector elt0, ..., eltN), NewEltIdx, idx => build_vector elt0, ..., NewEltIdx, ..., eltN The reasons are: - Building a vector may be expensive, so try to reuse the existing part of a vector instead of creating a new one (think big vectors). - elt0 to eltN now have two users instead of one. This may prevent some other optimizations. llvm-svn: 187396	2013-07-30 00:24:09 +00:00
Eric Christopher	4ed04e2ee3	Move file to X86 and add a triple to fix darwin bots for now. The problem is due to the section name being explicitly mentioned in the IR and differing between the two platforms. llvm-svn: 187394	2013-07-30 00:20:06 +00:00
Eric Christopher	e414ece79a	Fix a truly egregious thinko in anonymous namespace check, update testcase to make sure we generate debug info for walrus by adding a non-trivial constructor and verify that we don't emit an ODR signature for the type. llvm-svn: 187393	2013-07-29 23:53:08 +00:00
Eric Christopher	d853ea3142	Make sure we don't emit an ODR hash for types with no name and make sure the comments for each testcase are a bit easier to distinguish. llvm-svn: 187392	2013-07-29 23:53:05 +00:00
Eric Christopher	32d1531a33	Clarify comments for types contained in anonymous namespaces and odr hashes. llvm-svn: 187391	2013-07-29 23:53:01 +00:00
Eric Christopher	f8542ec305	Elaborate a bit on the type unit and ODR conditional code. llvm-svn: 187385	2013-07-29 22:24:32 +00:00
Rafael Espindola	d123099abc	Make file_status::getUniqueID const. llvm-svn: 187383	2013-07-29 21:55:38 +00:00
Rafael Espindola	cb87e35ed6	Delete documentation for deleted options. llvm-svn: 187380	2013-07-29 21:35:48 +00:00
Rafael Espindola	7f822a9306	Include st_dev to make the result of getUniqueID actually unique. This will let us use getUniqueID instead of st_dev directly on clang. llvm-svn: 187378	2013-07-29 21:26:49 +00:00
Manman Ren	620e978f69	Debug Info: enable verifier for testing cases. llvm-svn: 187375	2013-07-29 20:18:19 +00:00
Akira Hatanaka	52dd808bc3	[mips] Add comment and simplify function. llvm-svn: 187371	2013-07-29 19:08:34 +00:00
Nadav Rotem	16e9dd4dd2	Add the C source code to the test to make it easier to update when debug info changes. Thanks Eric. llvm-svn: 187368	2013-07-29 18:47:36 +00:00
Nadav Rotem	d9c74cc6d3	SLPVectorier: update the debug location for the new instructions. llvm-svn: 187363	2013-07-29 18:18:46 +00:00
Manman Ren	e9a52e18da	Debug Info: update testing cases to pass verifier. llvm-svn: 187362	2013-07-29 18:12:58 +00:00
Nico Rieck	7fdaee8f15	Use proper section suffix for COFF weak symbols 32-bit symbols have "_" as global prefix, but when forming the name of COMDAT sections this prefix is ignored. The current behavior assumes that this prefix is always present which is not the case for 64-bit and names are truncated. llvm-svn: 187356	2013-07-29 13:58:39 +00:00
Nico Rieck	06d17c80cc	Proper va_arg/va_copy lowering on win64 Win64 uses CharPtrBuiltinVaList instead of X86_64ABIBuiltinVaList like other 64-bit targets. llvm-svn: 187355	2013-07-29 13:07:06 +00:00
Aaron Ballman	f5aacd34e4	Re-application of 187310. Re-enabling warning C4275 for MSVC 11 and up, but not MSVC 10 since it is still required there. llvm-svn: 187354	2013-07-29 13:02:08 +00:00
Rafael Espindola	b6b5f52ee8	Add support for the 's' operation to llvm-ar. If no other operation is specified, 's' becomes an operation instead of an modifier. The s operation just creates a symbol table. It is the same as running ranlib. We assume the archive was created by a sane ar (like llvm-ar or gnu ar) and if the symbol table is present, then it is current. We use that to optimize the most common case: a broken build system that thinks it has to run ranlib. llvm-svn: 187353	2013-07-29 12:40:31 +00:00
Nico Rieck	2c9c89b21d	MC: Support larger COFF string tables Single-slash encoded entries do not require a terminating null. This bumps the maximum table size from ~1MB to ~9.5MB. llvm-svn: 187352	2013-07-29 12:30:12 +00:00
NAKAMURA Takumi	1373b743bb	ExceptionDemo.cpp: Tweak a @param. [-Wdocumentation] llvm-svn: 187351	2013-07-29 11:03:50 +00:00
Benjamin Kramer	fb34989a82	Some Intel Penryn CPUs come with SSE4 disabled. Detect them as core 2. PR16721. llvm-svn: 187350	2013-07-29 11:02:08 +00:00
Silviu Baranga	91ddaa1b48	Allow generation of vmla.f32 instructions when targeting Cortex-A15. The patch also adds the VFP4 feature to Cortex-A15 and fixes the DontUseFusedMAC predicate so that we can still generate vmla.f32 instructions on non-darwin targets with VFP4. llvm-svn: 187349	2013-07-29 09:25:50 +00:00
Robert Lytton	862b04516f	test commit llvm-svn: 187348	2013-07-29 09:23:13 +00:00
Chandler Carruth	cd7c8cdfa1	Teach the AllocaPromoter which is wrapped around the SSAUpdater infrastructure to do promotion without a domtree the same smarts about looking through GEPs, bitcasts, etc., that I just taught mem2reg about. This way, if SROA chooses to promote an alloca which still has some noisy instructions this code can cope with them. I've not used as principled of an approach here for two reasons: 1) This code doesn't really need it as we were already set up to zip through the instructions used by the alloca. 2) I view the code here as more of a hack, and hopefully a temporary one. The SSAUpdater path in SROA is a real sore point for me. It doesn't make a lot of architectural sense for many reasons: - We're likely to end up needing the domtree anyways in a subsequent pass, so why not compute it earlier and use it. - In the future we'll likely end up needing the domtree for parts of the inliner itself. - If we need to we could teach the inliner to preserve the domtree. Part of the re-work of the pass manager will allow this to be very powerful even in large SCCs with many functions. - Ultimately, computing a domtree has gotten significantly faster since the original SSAUpdater-using code went into ScalarRepl. We no longer use domfrontiers, and much of domtree is lazily done based on queries rather than eagerly. - At this point keeping the SSAUpdater-based promotion saves a total of 0.7% on a build of the 'opt' tool for me. That's not a lot of performance given the complexity! So I'm leaving this a bit ugly in the hope that eventually we just remove all of this nonsense. I can't even readily test this because this code isn't reachable except through SROA. When I re-instate the patch that fast-tracks allocas already suitable for promotion, I'll add a testcase there that failed before this change. Before that, SROA will fix any test case I give it. llvm-svn: 187347	2013-07-29 09:06:53 +00:00
Nadav Rotem	750e42cba3	Don't vectorize when the attribute NoImplicitFloat is used. llvm-svn: 187340	2013-07-29 05:13:00 +00:00
Rafael Espindola	caa776be91	Fix -Wdocumentation warnings. llvm-svn: 187336	2013-07-28 23:43:28 +00:00
Chandler Carruth	6b55dbea86	Update comments for SSAUpdater to use the modern doxygen comment standards for LLVM. Remove duplicated comments on the interface from the implementation file (implementation comments are left there of course). Also clean up, re-word, and fix a few typos and errors in the commenst spotted along the way. This is in preparation for changes to these files and to keep the uninteresting tidying in a separate commit. llvm-svn: 187335	2013-07-28 22:00:33 +00:00
Craig Topper	9469e906a5	Remove use of sprintf added to X86 disassembler tablegen code. Send message with instruction name to errs() instead and use a generic message for the llvm_unreachable. Consistent with other places in this file. llvm-svn: 187333	2013-07-28 21:28:02 +00:00
Aaron Ballman	d1594bdaa9	Partial revert of 187310; it seems MSVC 10 still spits out this warning, but MSVC 11 does not. llvm-svn: 187331	2013-07-28 18:04:26 +00:00
Chandler Carruth	d31370e060	Temporarily revert r187323 until I update SSAUpdater to match mem2reg. I forgot that we had two totally independent things here. :: sigh :: llvm-svn: 187327	2013-07-28 09:05:49 +00:00
Elena Demikhovsky	baf51e3e61	fixed compilation issue llvm-svn: 187325	2013-07-28 08:45:12 +00:00
Elena Demikhovsky	003e7d73b9	Added encoding prefixes for KNL instructions (EVEX). Added 512-bit operands printing. Added instruction formats for KNL instructions. llvm-svn: 187324	2013-07-28 08:28:38 +00:00
Chandler Carruth	9d96100ff0	Now that mem2reg understands how to cope with a slightly wider set of uses of an alloca, we can pre-compute promotability while analyzing an alloca for splitting in SROA. That lets us short-circuit the common case of a bunch of trivially promotable allocas. This cuts 20% to 30% off the run time of SROA for typical frontend-generated IR sequneces I'm seeing. It gets the new SROA to within 20% of ScalarRepl for such code. My current benchmark for these numbers is PR15412, but it fits the general pattern of IR emitted by Clang so it should be widely applicable. llvm-svn: 187323	2013-07-28 08:27:12 +00:00
Chandler Carruth	d5b806a27f	Thread DataLayout through the callers and into mem2reg. This will be useful in a subsequent patch, but causes an unfortunate amount of noise, so I pulled it out into a separate patch. llvm-svn: 187322	2013-07-28 06:43:11 +00:00
Bill Schmidt	40f78a2a86	[PowerPC] Add comment explaining preprocessor directive. llvm-svn: 187320	2013-07-28 03:23:32 +00:00
Bill Schmidt	20573225ed	Revert 187318 llvm-svn: 187319	2013-07-28 02:13:24 +00:00
Bill Schmidt	f5b32e3935	[PowerPC] Remove unnecessary preprocessor checking. The tests !defined(__ppc__) && !defined(__powerpc__) are not needed or helpful when verifying that code is being compiled for a 64-bit target. The simpler test provided by this revision is sufficient to tell if the target is 64-bit. llvm-svn: 187318	2013-07-28 02:08:13 +00:00
Nadav Rotem	3e50c68956	Update the comment llvm-svn: 187316	2013-07-27 23:28:47 +00:00
Michael Gottesman	b0e688e87c	[APFloat] Make all arithmetic operations with NaN produce positive NaNs. IEEE-754R 1.4 Exclusions states that IEEE-754R does not specify the interpretation of the sign of NaNs. In order to remove an irrelevant variable that most floating point implementations do not use, standardize add, sub, mul, div, mod so that operating anything with NaN always yields a positive NaN. In a later commit I am going to update the APIs for creating NaNs so that one can not even create a negative NaN. llvm-svn: 187314	2013-07-27 21:49:25 +00:00
Michael Gottesman	30a90eb1a5	[APFloat] Move setting fcNormal in zeroSignificand() to calling code. Zeroing the significand of a floating point number does not necessarily cause a floating point number to become finite non zero. For instance, if one has a NaN, zeroing the significand will cause it to become +/- infinity. llvm-svn: 187313	2013-07-27 21:49:21 +00:00
Michael Gottesman	aae69c0a1d	[APFloat] Removed nextafter from missing operations since it is implemented in APFloat::next. llvm-svn: 187312	2013-07-27 21:49:19 +00:00
Aaron Ballman	437c9f92bc	Re-enabling some more MSVC warnings; all of these compile cleanly with no further changes required. llvm-svn: 187310	2013-07-27 20:20:28 +00:00
Matt Arsenault	517cf483c0	Minor code simplification suggested by Duncan llvm-svn: 187309	2013-07-27 19:22:28 +00:00
Benjamin Kramer	409afcf174	DwarfDebug: MD5 is always little endian, bswap on big endian platforms. This makes LLVM emit the same signature regardless of host and target endianess. llvm-svn: 187304	2013-07-27 14:14:43 +00:00
Chandler Carruth	26ad41ed6e	Create a constant pool symbol for the GOT in the ARMCGBR the same way we do in the SDag when lowering references to the GOT: use ARMConstantPoolSymbol rather than creating a dummy global variable. The computation of the alignment still feels weird (it uses IR types and datalayout) but it preserves the exact previous behavior. This change fixes the memory leak of the global variable detected on the valgrind leak checking bot. Thanks to Benjamin Kramer for pointing me at ARMConstantPoolSymbol to handle this use case. llvm-svn: 187303	2013-07-27 11:58:26 +00:00
Chandler Carruth	1c82d3310e	Fix yet another memory leak found by the vg-leak bot. Folks (including me) should start watching this bot more as its catching lots of bugs. The fix here is to not construct the global if we aren't going to need it. That's cheaper anyways, and globals have highly predictable types in practice. I've added an assert to catch skew between our manual testing of the type and the actual type just for paranoia's sake. Note that this pattern is actually fine in most globals because when you build a global with a module it automatically is moved to be owned by that module. But here, we're in isel and don't really want to do that. The solution of not creating a global is simpler anyways. llvm-svn: 187302	2013-07-27 11:23:08 +00:00
Chandler Carruth	2a1c0d2c03	Fix a memory leak in the debug emission by simply not allocating memory. There doesn't appear to be any reason to put this variable on the heap. I'm suspicious of the LexicalScope above that we stuff in a map and then delete afterward, but I'm just trying to get the valgrind bot clean. llvm-svn: 187301	2013-07-27 11:09:58 +00:00
Chandler Carruth	c18e39ca83	Fix a memory leak in the hexagon scheduler. We call initialize here more than once, and the second time through we leaked memory. Found thanks to the vg-leak bot, but I can't locally reproduce it with valgrind. The debugger confirms that it is in fact leaking here. This whole code is totally gross. Why is initialize being called on each runOnFunction??? Why aren't these OwningPtr<>s, and why aren't their lifetimes better defined? Anyways, this is just a surgical change to help out the leak checking bots. llvm-svn: 187299	2013-07-27 10:48:45 +00:00
Chandler Carruth	8e3c4dc50e	Don't use all the #ifdefs to hide the stats counters and instead rely on their being optimized out in debug mode. Realistically, this just isn't going to be the slow part anyways. This also fixes unused variable warnings that are breaking LLD build bots. =/ I didn't see these at first, and kept losing track of the fact that they were broken. llvm-svn: 187297	2013-07-27 10:17:49 +00:00
Chandler Carruth	e8f5812a30	Merge the removal of dead instructions and lifetime markers with the analysis of the alloca. We don't need to visit all the users twice for this. We build up a kill list during the analysis and then just process it afterward. This recovers the tiny bit of performance lost by moving to the visitor based analysis system as it removes one entire use-list walk from mem2reg. In some cases, this is now faster than mem2reg was previously. llvm-svn: 187296	2013-07-27 09:43:30 +00:00
Aaron Ballman	6436940e38	Re-enabling a warning in MSVC mode now that r187292 fixed the only instance of the warning. llvm-svn: 187293	2013-07-27 03:35:44 +00:00
Tom Stellard	9ba44da833	SimplifyCFG: Add missing tests from r187278 llvm-svn: 187291	2013-07-27 02:54:44 +00:00
Nick Lewycky	e51b4bcd66	Update this CMakeLists.txt for r187283 too. llvm-svn: 187286	2013-07-27 01:26:30 +00:00
Manman Ren	921382ed78	Debug Info Verifier: verify SPs in llvm.dbg.sp. Also always add DIType, DISubprogram and DIGlobalVariable to the list in DebugInfoFinder without checking them, so we can verify them later on. llvm-svn: 187285	2013-07-27 01:26:08 +00:00
Nick Lewycky	cd1e8930ae	Also update CMakeLists.txt for r187283. llvm-svn: 187284	2013-07-27 01:25:51 +00:00
Nick Lewycky	0b68245ec8	Reimplement isPotentiallyReachable to make nocapture deduction much stronger. Adds unit tests for it too. Split BasicBlockUtils into an analysis-half and a transforms-half, and put the analysis bits into a new Analysis/CFG.{h,cpp}. Promote isPotentiallyReachable into llvm::isPotentiallyReachable and move it into Analysis/CFG. llvm-svn: 187283	2013-07-27 01:24:00 +00:00
Aaron Ballman	568cb27833	Re-enabling some more MSVC warnings; all of these compile cleanly with no further changes required. llvm-svn: 187279	2013-07-27 00:13:11 +00:00
Tom Stellard	8b1e021e85	SimplifyCFG: Use parallel-and and parallel-or mode to consolidate branch conditions Merge consecutive if-regions if they contain identical statements. Both transformations reduce number of branches. The transformation is guarded by a target-hook, and is currently enabled only for +R600, but the correctness has been tested on X86 target using a variety of CPU benchmarks. Patch by: Mei Ye llvm-svn: 187278	2013-07-27 00:01:07 +00:00

... 3 4 5 6 7 ...

94725 Commits