llvm-project

Commit Graph

Author	SHA1	Message	Date
Chad Rosier	676c093758	Add missing functions. llvm-svn: 145608	2011-12-01 18:26:19 +00:00
Benjamin Kramer	3ced545ccf	Autodetect bulldozers. llvm-svn: 145607	2011-12-01 18:24:17 +00:00
Chad Rosier	10fe1fe39e	Add a few more functions to TargetLibraryInfo. More of rdar://10500969. llvm-svn: 145596	2011-12-01 17:54:37 +00:00
Chris Lattner	f433c08659	fix broken tag llvm-svn: 145590	2011-12-01 17:25:28 +00:00
Duncan Sands	b8e6cee9ca	Revert commit 145449 (ddunbar) since it is breaking the dragonegg buildbots. Original commit message: llvm-config: Replace with C++ version (was llvm-config-2). - Reapply of r144300, with lots of fixes/migration easement in between. llvm-svn: 145582	2011-12-01 10:50:19 +00:00
Eric Christopher	9da7f305a4	For 64-bit the rest of the general regs are ok for the q constraint. Make sure we can emit both the high and low versions of those registers. Fixes rdar://10392864 llvm-svn: 145579	2011-12-01 08:12:41 +00:00
David Blaikie	3a15e14520	Add some missing anchors. llvm-svn: 145578	2011-12-01 08:00:17 +00:00
Eli Friedman	d61887dd0a	Pass AVX vectors which are arguments to varargs functions on the stack. <rdar://problem/10463281>. llvm-svn: 145573	2011-12-01 04:49:21 +00:00
Pete Cooper	3b7f35bf08	Removed use of grep from test and moved it to be with other icmp tests llvm-svn: 145570	2011-12-01 04:35:26 +00:00
Pete Cooper	bc5c524b71	Added instcombine pattern to spot comparing -val or val against 0. (val != 0) == (-val != 0) so "abs(val) != 0" becomes "val != 0" Fixes <rdar://problem/10482509> llvm-svn: 145563	2011-12-01 03:58:40 +00:00
Chad Rosier	c24b86ffbe	Propagate TargetLibraryInfo throughout ConstantFolding.cpp and InstructionSimplify.cpp. Other fixups as needed. Part of rdar://10500969 llvm-svn: 145559	2011-12-01 03:08:23 +00:00
Nick Lewycky	e659b8459e	Make use of "getScalarType()". No functionality change. llvm-svn: 145556	2011-12-01 02:39:36 +00:00
Eli Friedman	c1870b2633	Small fix for assembler generation on Darwin PPC64. Patch by Michael Kostylev. PR11437. llvm-svn: 145553	2011-12-01 01:43:47 +00:00
Jim Grosbach	8eeb353071	Replace an assert() with an actual diagnostic. llvm-svn: 145535	2011-11-30 23:16:25 +00:00
Kostya Serebryany	dc436f95d2	make asan work at -O0, llvm part. Patch by glider@google.com llvm-svn: 145530	2011-11-30 22:19:26 +00:00
Jan Sjödin	9430e284a9	Support for encoding all FMA4 instructions and tablegen patterns for all remaining FMA4 instructions and intrinsics with tests. llvm-svn: 145525	2011-11-30 22:09:42 +00:00
Eli Friedman	6cff9df298	Make GlobalMerge honor the preferred alignment on globals without an explicitly specified alignment. <rdar://problem/10497732>. llvm-svn: 145523	2011-11-30 21:54:15 +00:00
Bill Wendling	ad8b58b2ac	More cleanups. No content change. llvm-svn: 145522	2011-11-30 21:52:43 +00:00
Bill Wendling	b6c2220600	Minor cleanup. No content change. llvm-svn: 145521	2011-11-30 21:43:43 +00:00
Bob Wilson	a0c69014f8	Remove the install-clang-c makefile target. When I did this before it broke a buildbot that was testing that target, but we've removed that buildbot now. llvm-svn: 145519	2011-11-30 21:06:12 +00:00
Bill Wendling	3eb84cc75b	Remove an XXX which hasn't been fixed yet. It's too late now. llvm-svn: 145518	2011-11-30 20:53:52 +00:00
Matt Beaumont-Gay	23c30b90e3	Remove unused variable llvm-svn: 145517	2011-11-30 19:53:11 +00:00
Jim Grosbach	7d8517b1d4	Add some tests for all-lanes VLD1 parsing. llvm-svn: 145512	2011-11-30 19:37:38 +00:00
Jim Grosbach	a68c9a847e	ARM parsing for VLD1 all lanes, with writeback. llvm-svn: 145510	2011-11-30 19:35:44 +00:00
Chad Rosier	738da252ab	Add a few functions to TargetLibraryInfo. llvm-svn: 145508	2011-11-30 19:19:00 +00:00
Jim Grosbach	3ecf976ca9	ARM parsing for VLD1 two register all lanes, no writeback. llvm-svn: 145504	2011-11-30 18:21:25 +00:00
Nadav Rotem	0a1801015c	Add test arch to make it pass on non x86 targets llvm-svn: 145498	2011-11-30 17:34:28 +00:00
Benjamin Kramer	5feb3dab79	X86: Turns out bulldozer also supports sse42 and lzcnt. While at it remove the barcelona/instanbul/shanghai subtargets, they're unsupported by GCC and look pretty broken. llvm-svn: 145494	2011-11-30 15:48:16 +00:00
Benjamin Kramer	981f32327d	X86: Add subtargets for AMD's bulldozer. llvm-svn: 145493	2011-11-30 15:27:46 +00:00
Nadav Rotem	66427bcce9	Add a tripple to the test llvm-svn: 145489	2011-11-30 11:20:56 +00:00
Nadav Rotem	96923cc2bb	X86: PerformOrCombine introduced a vselect node with a wrong order of operands. This bug was introduced when a dedicated blend sdnode was replaced with the vselect node (in 139479). llvm-svn: 145488	2011-11-30 10:13:37 +00:00
Craig Topper	c4977ba413	Add instruction selection support for AVX2 horizontal add/sub instructions. llvm-svn: 145487	2011-11-30 09:10:50 +00:00
Duncan Sands	e9adf5c860	Mention that -O4 does result in more optimization when used with -fplugin-arg-dragonegg-enable-gcc-optzns, though it usually isn't a win. llvm-svn: 145486	2011-11-30 08:46:05 +00:00
Craig Topper	0a672eaf9e	Merge VPERM2F128/VPERM2I128 ISD node types. llvm-svn: 145485	2011-11-30 07:47:51 +00:00
Andrew Trick	613c67e475	Better test case found in duplicate PR10570. llvm-svn: 145484	2011-11-30 06:26:42 +00:00
Craig Topper	bafd224c8b	Merge decoding of VPERMILPD and VPERMILPS shuffle masks. Merge X86ISD node type for VPERMILPD/PS. Add instruction selection support for VINSERTI128/VEXTRACTI128. llvm-svn: 145483	2011-11-30 06:25:25 +00:00
Andrew Trick	ceafa2c746	LSR: handle the expansion of phi operands that use postinc forms of the IV. Fixes PR11431: SCEVExpander::expandAddRecExprLiterally(const llvm::SCEVAddRecExpr*): Assertion `(!isa<Instruction>(Result) \|\| SE.DT->dominates(cast<Instruction>(Result), Builder.GetInsertPoint())) && "postinc expansion does not dominate use"' failed. llvm-svn: 145482	2011-11-30 06:07:54 +00:00
Chad Rosier	385d9f6c24	Whitespace. llvm-svn: 145470	2011-11-30 01:59:59 +00:00
Chad Rosier	abba0947db	Alphabetize TargetLibraryInfo enum and fix doxygen comments. No functional change intended. llvm-svn: 145468	2011-11-30 01:51:49 +00:00
Jim Grosbach	f09b1c46cf	llvm_unreachable() is not for user diagnostics.... llvm-svn: 145465	2011-11-30 01:15:55 +00:00
Jim Grosbach	cd6f5e757c	ARM parsing aliases for VLD1 single register all lanes. llvm-svn: 145464	2011-11-30 01:09:44 +00:00
Chad Rosier	82e1bd8e94	Add support for sqrt, sqrtl, and sqrtf in TargetLibraryInfo. Disable (fptrunc (sqrt (fpext x))) -> (sqrtf x) transformation if -fno-builtin is specified. rdar://10466410 llvm-svn: 145460	2011-11-29 23:57:10 +00:00
Jim Grosbach	182b6a077e	Tidy up a bit. llvm-svn: 145458	2011-11-29 23:51:09 +00:00
Jim Grosbach	ae672f8118	Add comment. llvm-svn: 145456	2011-11-29 23:33:40 +00:00
Jim Grosbach	e1154eef0b	ARM parsing aliases for data-size suffices on VST1. llvm-svn: 145454	2011-11-29 23:21:31 +00:00
Jakob Stoklund Olesen	f50d2eafdb	FileCheckize. llvm-svn: 145452	2011-11-29 23:09:16 +00:00
Akira Hatanaka	dc25f9f38a	Change names for MIPS "generic" processors defined in Mips.td to match what GNU tools use. Patch by Simon Atanasyan. "mips32r1" => "mips32" "4ke" => mips32r2" "mips64r1" => "mips64" llvm-svn: 145451	2011-11-29 23:08:41 +00:00
Jim Grosbach	5ee209ce3a	ARM assembly parsing and encoding for four-register VST1. llvm-svn: 145450	2011-11-29 22:58:48 +00:00
Daniel Dunbar	8d5cc33ad8	llvm-config: Replace with C++ version (was llvm-config-2). - Reapply of r144300, with lots of fixes/migration easement in between. llvm-svn: 145449	2011-11-29 22:56:31 +00:00
Evan Cheng	648e48d02e	Add another missing pattern. llvm-gcc likes f64 but clang likes i64 so it was generating poor code for some SSE builtins. llvm-svn: 145448	2011-11-29 22:48:34 +00:00
Jim Grosbach	2a9c43649a	Enable some VST1 tests and add a few more. llvm-svn: 145443	2011-11-29 22:40:32 +00:00
Jim Grosbach	98d032fd67	ARM assembly parsing and encoding for three-register VST1. llvm-svn: 145442	2011-11-29 22:38:04 +00:00
Jakob Stoklund Olesen	bde32d36bb	Make X86::FsFLD0SS / FsFLD0SD real pseudo-instructions. Like V_SET0, these instructions are expanded by ExpandPostRA to xorps / vxorps so they can participate in execution domain swizzling. This also makes the AVX variants redundant. llvm-svn: 145440	2011-11-29 22:27:25 +00:00
Stepan Dyatkovskiy	31798ef3c0	Potential bug in RewriteLoopBodyWithConditionConstant: use iterator should not be changed inside the uses enumeration loop. llvm-svn: 145432	2011-11-29 20:34:39 +00:00
Eric Christopher	b2d03a6d00	Update the docs for some of the test-suite configure changes and be more clear about what to do and how to do it. llvm-svn: 145426	2011-11-29 19:40:56 +00:00
Chad Rosier	46addb9e07	If fast-isel fails, remove dead instructions generated during the failed attempt. llvm-svn: 145425	2011-11-29 19:40:47 +00:00
Rafael Espindola	b61cc44265	grammar. llvm-svn: 145423	2011-11-29 19:38:09 +00:00
Andrew Trick	312b97c267	comment. llvm-svn: 145422	2011-11-29 19:33:49 +00:00
Daniel Dunbar	539d0a8a09	build/CMake: Finish removal of add_llvm_library_dependencies. llvm-svn: 145420	2011-11-29 19:25:30 +00:00
Benjamin Kramer	098c1d9680	Add a link to Bill's blog post. llvm-svn: 145419	2011-11-29 19:24:11 +00:00
Rafael Espindola	866a22aba7	Release notes for segmented stacks. Patch by Sanjoy Das. llvm-svn: 145416	2011-11-29 19:08:23 +00:00
Duncan Sands	ca6f8ddbf8	Fix a theoretical problem (not seen in the wild): if different instances of a weak variable are compiled by different compilers, such as GCC and LLVM, while LLVM may increase the alignment to the preferred alignment there is no reason to think that GCC will use anything more than the ABI alignment. Since it is the GCC version that might end up in the final program (as the linkage is weak), it is wrong to increase the alignment of loads from the global up to the preferred alignment as the alignment might only be the ABI alignment. Increasing alignment up to the ABI alignment might be OK, but I'm not totally convinced that it is. It seems better to just leave the alignment of weak globals alone. llvm-svn: 145413	2011-11-29 18:26:38 +00:00
Michael J. Spencer	de3a2118db	MC/X86/COFF: Allow quotes in names when targeting MS/Windows, as MC is the only assembler we support. This splits MS/Windows and GNU/Windows ASM infos into two seperate classes. While there is currently only one difference, full MS C++ ABI support will require many more. llvm-svn: 145409	2011-11-29 18:00:06 +00:00
Danil Malyshev	cbe72fc959	Fixed ObjectFile functions: - getSymbolOffset() renamed as getSymbolFileOffset() - getSymbolFileOffset(), getSymbolAddress(), getRelocationAddress() returns same result for ELFObjectFile, MachOObjectFile and COFFObjectFile. - added getRelocationOffset() - fixed MachOObjectFile::getSymbolSize() - fixed MachOObjectFile::getSymbolSection() - fixed MachOObjectFile::getSymbolOffset() for symbols without section data. llvm-svn: 145408	2011-11-29 17:40:10 +00:00
Elena Demikhovsky	7a81dea516	Fixed vsqrt.ss intrinsic usage - order of input operands was wrong. Added a test. Thanks Bruno for reviewing the patch. llvm-svn: 145403	2011-11-29 15:00:45 +00:00
Craig Topper	1d63ae3731	Fix shuffle decoding for memory forms for (V)SHUFPS/D. llvm-svn: 145392	2011-11-29 07:58:09 +00:00
Craig Topper	c16db840be	Fix issues in shuffle decoding around VPERM* instructions. Fix shuffle decoding for VSHUFPS/D for 256-bit types. Add pattern matching for memory forms of VPERMILPS/VPERMILPD. llvm-svn: 145390	2011-11-29 07:49:05 +00:00
NAKAMURA Takumi	64404a3b2c	[Win32] Catch exceptions (eg. segfault) on waiting for invoked clang from the driver. clang/lib/Driver/Driver.cpp: Don't pass through negative exit status, or parent would be confused. llvm::sys::Program::Wait(): Suppose 0x8000XXXX and 0xC000XXXX as abnormal exit code and pass it as negative value. Win32 Exception Handler: Exit with ExceptionCode on an unhandle exception. llvm-svn: 145389	2011-11-29 07:47:04 +00:00
NAKAMURA Takumi	0e5bae7191	lit/TestRunner.py: Try to catch ERROR_FILE_NOT_FOUND, too. Thanks to Francois, to let me know. llvm-svn: 145381	2011-11-29 06:40:50 +00:00
Bob Wilson	b103fbf005	Install llvmCore to /usr/local. <rdar://problem/10390708> llvm-svn: 145378	2011-11-29 06:11:56 +00:00
Craig Topper	12b72def4e	Fix VINSERTF128/VEXTRACTF128 to be marked as FP instructions. Allow execution dependency fix pass to convert them to their integer equivalents when AVX2 is enabled. llvm-svn: 145376	2011-11-29 05:37:58 +00:00
Craig Topper	897a7d4b9c	Correctly mark VPERM2F128 as being an FP instruction and add execution domain fixing support to convert it to VPERM2I128 for AVX2. llvm-svn: 145370	2011-11-29 03:57:34 +00:00
Bill Wendling	11b9894234	MachO doesn't support the protected visibility. Don't default to 'global' here. <rdar://problem/10396775> llvm-svn: 145368	2011-11-29 02:39:58 +00:00
Andrew Trick	d25089f8e0	SCEV fix. In general, Add/Mul expressions should not inherit NSW/NUW. This reverts r139450, fixes r139453, and adds much needed comments and a unit test. llvm-svn: 145367	2011-11-29 02:16:38 +00:00
Andrew Trick	d912a5b2e3	Make SCEV print <nsw><nuw> for Add/MulExpr. llvm-svn: 145364	2011-11-29 02:06:35 +00:00
Andrew Trick	5ec136c57e	Filecheckize. llvm-svn: 145363	2011-11-29 02:05:23 +00:00
Peter Collingbourne	7e09afb833	Remove content that has been moved to Clang release notes. llvm-svn: 145362	2011-11-29 02:04:48 +00:00
Peter Collingbourne	7d9c13d81f	Fix grammar. llvm-svn: 145361	2011-11-29 02:04:44 +00:00
Bill Wendling	e4cc332729	On MachO, the pointer to the personality function should always be in the non_lazy_symbol_pointers section (__IMPORT,__pointers). Ignore the 'hidden' part since that will place it in the wrong section. <rdar://problem/10443720> llvm-svn: 145356	2011-11-29 01:43:20 +00:00
Daniel Dunbar	faaa76d1b7	build/cmake: Switch to using llvm-build computed dependencies. - I verified locally that the current dependency lists are identical. - This makes add_llvm_library_dependencies() a no-op. I'll remove it once this change passes the bots. llvm-svn: 145355	2011-11-29 01:31:52 +00:00
Eli Friedman	7534b46884	Zap some completely ridiculous code. There's probably a miscompile here, but I don't really want to try to write a testcase involving an invoke returning a pointer to a varargs function... llvm-svn: 145347	2011-11-29 01:18:23 +00:00
Jim Grosbach	ae9132207f	Better fix for ARM MOVT relocation encoding of thumb bit. Replaces r145318 with a more targetted fix for the relocation handling. llvm-svn: 145346	2011-11-29 01:15:25 +00:00
Andrew Trick	e756031a62	Reenable this IndVars unit test. SCEV can't optimize undef in all cases, which is a separate issue from this test case. llvm-svn: 145343	2011-11-29 00:52:04 +00:00
Daniel Dunbar	fe2d028ab1	build: Update CMakeLists.txt. llvm-svn: 145341	2011-11-29 00:33:14 +00:00
Chandler Carruth	60062ed5dc	Add a link from the LLVM release notes to the Clang release notes. I suspect we could profitably remove/move some of the bullet points under Clang here to the Clang notes in order to keep things clean on both sides. Unless I hear objections I'll start doing that once folks have read over the Clang notes a bit. llvm-svn: 145340	2011-11-29 00:32:43 +00:00
Daniel Dunbar	b074d102a3	edis: Sink EDMain.cpp into lib/MC/MCDisassembler. - This fixes some layering violations and matches how we handle the llvm-c lib, for example. llvm-svn: 145338	2011-11-29 00:25:57 +00:00
Daniel Dunbar	d690111dd4	edis: Don't do the target initialization in EDGetDisassembler, this is contrary to the way we currently expect target selection to work -- clients are supposed to have control over what targets are available. llvm-svn: 145331	2011-11-29 00:06:58 +00:00
Daniel Dunbar	69987abde6	llvm-c: Add a few missing InitializeAll* functions. llvm-svn: 145330	2011-11-29 00:06:55 +00:00
Daniel Dunbar	2266cfcc77	build/Make: edis isn't built as a shared library anymore, remove related cruft from the Makefile. llvm-svn: 145329	2011-11-29 00:06:53 +00:00
Daniel Dunbar	4128db91c2	llvmbuild/CMake: Update CMake output fragment to include explicit library dependency information. llvm-svn: 145328	2011-11-29 00:06:50 +00:00
Rafael Espindola	8fa4bd048b	Expand the part about CFI a bit. llvm-svn: 145324	2011-11-28 23:55:49 +00:00
Devang Patel	8e5bfd8349	Add documentation for llvm-cov. llvm-svn: 145319	2011-11-28 23:39:25 +00:00
Jim Grosbach	30168fbde5	Thumb2 only force the fixup thumb bit for data relocations. rdar://10493453 llvm-svn: 145318	2011-11-28 23:39:00 +00:00
Eli Friedman	b3f9b0676a	Add a missing safety check to ProcessUGT_ADDCST_ADD. Fixes PR11438. llvm-svn: 145316	2011-11-28 23:32:19 +00:00
Jim Grosbach	faa8efb482	Remove obsolete FIXME. llvm-svn: 145313	2011-11-28 23:23:58 +00:00
Eli Friedman	e7ab1a2f0f	Make SelectionDAG::InferPtrAlignment use llvm::ComputeMaskedBits instead of duplicating the logic for globals. Make llvm::ComputeMaskedBits handle GlobalVariables slightly more aggressively, to match what InferPtrAlignment knew how to do. llvm-svn: 145304	2011-11-28 22:48:22 +00:00
Evan Cheng	4a5b2040e2	Revert r145273 and fix in SelectionDAG::InferPtrAlignment() instead. Conservatively returns zero when the GV does not specify an alignment nor is it initialized. Previously it returns ABI alignment for type of the GV. However, if the type is a "packed" type, then the under-specified alignments is attached to the load / store instructions. In that case, the alignment of the type cannot be trusted. rdar://10464621 llvm-svn: 145300	2011-11-28 22:37:34 +00:00
Daniel Dunbar	9339d4556e	Fix some possible gcc-4.2 may be used uninitialized warnings. llvm-svn: 145292	2011-11-28 22:19:32 +00:00
Nick Lewycky	76c6299f88	Don't define these unless we plan to use them. llvm-svn: 145289	2011-11-28 22:14:02 +00:00
Joe Abbey	3483e11496	Merging two bullet points into one llvm-svn: 145287	2011-11-28 22:07:12 +00:00
Evan Cheng	a4b6404cf0	DAG combine should not increase alignment of loads / stores with alignment less than ABI alignment. These are loads / stores from / to "packed" data structures. Their alignments are intentionally under-specified. rdar://10301431 llvm-svn: 145273	2011-11-28 20:42:56 +00:00
Evan Cheng	aa93ceb164	Add missing avx pattern. llvm-svn: 145272	2011-11-28 20:27:23 +00:00
Peter Collingbourne	952adc32d9	Add OpenCL blurb to release notes. llvm-svn: 145270	2011-11-28 20:04:12 +00:00
Chad Rosier	61e8d1026f	80-column. llvm-svn: 145267	2011-11-28 19:59:09 +00:00
Bill Wendling	5ebc95ff4c	Remove dead llvm.eh.sjlj.dispatchsetup intrinsic. llvm-svn: 145263	2011-11-28 19:23:13 +00:00
Andrew Trick	a8bdb7cbf1	Remove the temporary flag -disable-unroll-scev and dead code. SCEV should now be used for trip count analysis, not LoopInfo. llvm-svn: 145262	2011-11-28 19:22:09 +00:00
Eli Friedman	31f0116173	Add back a line I deleted by accident in r145141. Fixes uninitialized variable warnings and runtime failures. llvm-svn: 145256	2011-11-28 18:50:37 +00:00
Michael J. Spencer	f7a4ed7d7f	Add object file related release notes. llvm-svn: 145254	2011-11-28 18:20:09 +00:00
Jakob Stoklund Olesen	979dad7bd2	Explain what ExeDepsFix does. llvm-svn: 145253	2011-11-28 18:03:11 +00:00
Rafael Espindola	c87cebf2bf	Fix spelling/grammar errors found by Duncan. llvm-svn: 145250	2011-11-28 17:06:58 +00:00
Benjamin Kramer	4c16431a67	Handle more cases in APInt::getLowBitsSet's fast path. llvm-svn: 145249	2011-11-28 16:56:38 +00:00
Bill Wendling	957cc212bb	Support a 'final' release candidate tag. llvm-svn: 145243	2011-11-28 11:45:10 +00:00
Duncan Sands	12330650f8	Silence wrong warnings from GCC about variables possibly being used uninitialized: GCC doesn't understand that the variables are only used if !UseImm, in which case they have been initialized. llvm-svn: 145239	2011-11-28 10:31:27 +00:00
Craig Topper	818a983e93	Add X86 instruction selection for VPERM2I128 when AVX2 is enabled. Merge VPERMILPS/VPERMILPD detection since they are pretty similar. llvm-svn: 145238	2011-11-28 10:14:51 +00:00
Bob Wilson	3f35470fc7	Add an optional separate install prefix for internal components. rdar://10217046 Some files installed by clang are not relevant for general users and we'd like to be able to install them to a different location. This adds a new --with-internal-prefix configure option and a corresponding PROJ_internal_prefix makefile variable, which defaults to the standard prefix. A tool makefile can specify that it should be installed to this internal prefix by defining INTERNAL_TOOL. llvm-svn: 145234	2011-11-28 07:59:52 +00:00
NAKAMURA Takumi	8284ec46b6	test/lit.cfg: Enable the feature 'asserts' to check output of llc -version. llc knows whether he is compiled with -DNDEBUG. \| Optimized build with assertions. llvm-svn: 145230	2011-11-28 05:09:15 +00:00
NAKAMURA Takumi	a0d652e71b	lit/TestRunner.py: Use RemoveForce(). llvm-svn: 145223	2011-11-28 01:55:08 +00:00
NAKAMURA Takumi	57fc5adca0	lit/TestRunner.py: [Win32] Introduce WinWaitReleased(f), to wait for file handles to be released by children. When wait() has finished, opened handles (especially writing stdout to file) might not be released immediately. To wait for released, poll to attempt renaming. llvm-svn: 145222	2011-11-28 01:55:01 +00:00
Jakob Stoklund Olesen	6d110aa84d	Add a blurb about the new ExecutionDepsFix pass. llvm-svn: 145220	2011-11-28 01:46:19 +00:00
Craig Topper	b0456936da	Make isCommutedVSHUFP more like the way isCommutedSHUFP is handled. llvm-svn: 145218	2011-11-28 01:14:24 +00:00
NAKAMURA Takumi	4ad52a54b9	configure, config.h.in: Regenerate. config.h.cmake: Synchronize to config.h.in. llvm-svn: 145217	2011-11-28 01:07:19 +00:00
Dylan Noblesmith	3e79ef1d45	use llvm-config.h in public header The config.h file's macros collide with other projects that include LLVM and shouldn't get exported. llvm-svn: 145215	2011-11-28 00:49:01 +00:00
Dylan Noblesmith	efddf20126	rename ENABLE_THREADS to LLVM_ENABLE_THREADS Now that it needs to be exported in a public header (Valgrind.h) it should be prefixed to avoid collision with other projects. Add it to llvm-config.h as well. This'll require regenerating the configure script after this commit, but I don't have the required autoconf version. llvm-svn: 145214	2011-11-28 00:48:58 +00:00
Dylan Noblesmith	daef41b1d1	update description of LLVM_DEFAULT_TARGET_TRIPLE It was out of sync with the description in configure.ac/config.h.in. Also re-alphabetize it from its position when it was LLVM_HOST_TRIPLE. llvm-svn: 145213	2011-11-28 00:48:53 +00:00
Nick Lewycky	6404d97a99	Place the "cfg checksum" around a test. This was recently added in April 2011 to gcc, though I thought it was older (my gcc 4.4 has it as a local patch. Whoops!) This fixes PR10589. Also add some debugging statements. Remove GcnoFiles, the mapping from CompilationUnit to raw_ostream. Now that we start by iterating over each CU and descending into them, there's no need to maintain a mapping. llvm-svn: 145208	2011-11-27 23:22:20 +00:00
Chris Lattner	ef714c0b05	dwarf parsing stuff. llvm-svn: 145207	2011-11-27 22:39:23 +00:00
Chris Lattner	b035c31215	first pass of writing complete! llvm-svn: 145206	2011-11-27 22:36:22 +00:00
Chris Lattner	7b32d97e02	arm and carve out a place ot mention segmented stacks. llvm-svn: 145204	2011-11-27 22:12:32 +00:00
Rafael Espindola	799ca897e7	Add a description of the status of segmented stacks. llvm-svn: 145201	2011-11-27 22:05:46 +00:00
Chris Lattner	7257f76728	optimize, mc, x86 llvm-svn: 145200	2011-11-27 22:03:34 +00:00
Craig Topper	79ee88a511	Merge detecting and handling for VSHUFPSY and VSHUFPDY since a lot of the code was similar for both. llvm-svn: 145199	2011-11-27 21:41:12 +00:00
Chris Lattner	644976405f	some writing. llvm-svn: 145198	2011-11-27 21:30:28 +00:00
Chris Lattner	9661de7d30	fix some out-of-date attribution. llvm-svn: 145197	2011-11-27 21:02:12 +00:00
Chris Lattner	6442197819	distribute various bullets to different sections. llvm-svn: 145196	2011-11-27 20:51:47 +00:00
Chandler Carruth	4f56720754	Prevent rotating the blocks of a loop (and thus getting a backedge to be fallthrough) in cases where we might fail to rotate an exit to an outer loop onto the end of the loop chain. Having some rotation, but not performing this rotation, is the primary fix of thep performance regression with -enable-block-placement for Olden/em3d (a whopping 30% regression). Still working on reducing the test case that actually exercises this and the new rotation strategy out of this code, but I want to check if this regresses other test cases first as that may indicate it isn't the correct fix. llvm-svn: 145195	2011-11-27 20:18:00 +00:00
Chris Lattner	080dd7ce30	rewrite the known problems section. Including a short list of individual bugs per target isn't particularly useful. Link to the target features matrix. llvm-svn: 145193	2011-11-27 19:38:20 +00:00
Chris Lattner	4857190a50	move the detailed information about the EH rewrite to a comment, Bill is blog'izing it. llvm-svn: 145192	2011-11-27 19:26:30 +00:00
Chris Lattner	e9a31c40b6	tweak subprojects' section llvm-svn: 145191	2011-11-27 18:53:41 +00:00
Chris Lattner	25a7790603	some random notes. llvm-svn: 145190	2011-11-27 18:47:37 +00:00
Chris Lattner	251d827d2c	remove a test that is using old-style llvm.dbg intrinsics, apparently only fails on ppc and arm hosts. llvm-svn: 145188	2011-11-27 18:13:47 +00:00
Chandler Carruth	03adbd46ca	Take two on rotating the block ordering of loops. My previous attempt was centered around the premise of laying out a loop in a chain, and then rotating that chain. This is good for preserving contiguous layout, but bad for actually making sane rotations. In order to keep it safe, I had to essentially make it impossible to rotate deeply nested loops. The information needed to correctly reason about a deeply nested loop is actually available -- before we layout the loop. We know the inner loops are already fused into chains, etc. We lose information the moment we actually lay out the loop. The solution was the other alternative for this algorithm I discussed with Benjamin and some others: rather than rotating the loop after-the-fact, try to pick a profitable starting block for the loop's layout, and then use our existing layout logic. I was worried about the complexity of this "pick" step, but it turns out such complexity is needed to handle all the important cases I keep teasing out of benchmarks. This is, I'm afraid, a bit of a work-in-progress. It is still misbehaving on some likely important cases I'm investigating in Olden. It also isn't really tested. I'm going to try to craft some interesting nested-loop test cases, but it's likely to be extremely time consuming and I don't want to go there until I'm sure I'm testing the correct behavior. Sadly I can't come up with a way of getting simple, fine grained test cases for this logic. We need complex loop structures to even trigger much of it. llvm-svn: 145183	2011-11-27 13:34:33 +00:00
Chandler Carruth	37ab257b88	Revert r145180 as it is causing test failures on all the bots. Original commit message: Fixed ObjectFile functions: - getSymbolOffset() renamed as getSymbolFileOffset() - getSymbolFileOffset(), getSymbolAddress(), getRelocationAddress() returns same result for ELFObjectFile, MachOObjectFile and COFFObjectFile. - added getRelocationOffset() - fixed MachOObjectFile::getSymbolSize() - fixed MachOObjectFile::getSymbolSection() - fixed MachOObjectFile::getSymbolOffset() for symbols without section data. llvm-svn: 145182	2011-11-27 10:37:47 +00:00
Chandler Carruth	9e46684154	Fix an impressive type-o / spell-o Duncan noticed. llvm-svn: 145181	2011-11-27 10:32:16 +00:00
Danil Malyshev	2631f93f7d	Fixed ObjectFile functions: - getSymbolOffset() renamed as getSymbolFileOffset() - getSymbolFileOffset(), getSymbolAddress(), getRelocationAddress() returns same result for ELFObjectFile, MachOObjectFile and COFFObjectFile. - added getRelocationOffset() - fixed MachOObjectFile::getSymbolSize() - fixed MachOObjectFile::getSymbolSection() - fixed MachOObjectFile::getSymbolOffset() for symbols without section data. llvm-svn: 145180	2011-11-27 10:12:52 +00:00
Chandler Carruth	a054580993	Rework a bit of the implementation of loop block rotation to not rely so heavily on AnalyzeBranch. That routine doesn't behave as we want given that rotation occurs mid-way through re-ordering the function. Instead merely check that there are not unanalyzable branching constructs present, and then reason about the CFG via successor lists. This actually simplifies my mental model for all of this as well. The concrete result is that we now will rotate more loop chains. I've added a test case from Olden highlighting the effect. There is still a bit more to do here though in order to regain all of the performance in Olden. llvm-svn: 145179	2011-11-27 09:22:53 +00:00
Chris Lattner	0bcbde46e2	Eli managed to kill off llvm.membarrier in llvm 3.0 also, this means that mainline needs no autoupgrade logic for intrinsics yet, woohoo! llvm-svn: 145178	2011-11-27 08:42:07 +00:00
Chris Lattner	3dcdc29d11	add some final random notes, I've completed my pass over all the commits. I'll work on turning this into something intelligible tomorrow. llvm-svn: 145177	2011-11-27 08:32:32 +00:00
Chris Lattner	410f3d7f5d	The llvm.atomic intrinsics were removed in LLVM 3.0 (in r141333), remove the autoupgrade logic for 2.9 and before. llvm-svn: 145176	2011-11-27 08:18:55 +00:00
Chris Lattner	ee471c484a	remove autoupgrade support for old forms of llvm.prefetch and the old trampoline forms. Both of these were correct in LLVM 3.0, and we don't need to support LLVM 2.9 and earlier in mainline. llvm-svn: 145174	2011-11-27 07:42:04 +00:00
Chris Lattner	d5bb9e6c4c	add some notes. llvm-svn: 145173	2011-11-27 07:37:53 +00:00
Chris Lattner	bc639298e5	remove asmparsing and documentation support for "volatile load", which was only produced by LLVM 2.9 and earlier. LLVM 3.0 and later prefers "load volatile". llvm-svn: 145172	2011-11-27 06:56:53 +00:00
Chris Lattner	6a144a2227	Upgrade syntax of tests using volatile instructions to use 'load volatile' instead of 'volatile load', which is archaic. llvm-svn: 145171	2011-11-27 06:54:59 +00:00
Chris Lattner	ebed15e973	some notes. llvm-svn: 145170	2011-11-27 06:24:49 +00:00
Chris Lattner	90ef78c07f	remove autoupgrade support for really old-style debug info intrinsics. I think this is the last of autoupgrade that can be removed in 3.1. Can the atomic upgrade stuff also go? llvm-svn: 145169	2011-11-27 06:18:33 +00:00
Chris Lattner	6aa6c0c3b7	remove some old autoupgrade logic llvm-svn: 145167	2011-11-27 06:10:54 +00:00
Chris Lattner	db89153969	remove autoupgrade support for LLVM 2.9 exception stuff. Mainline supports LLVM 3.0 and later. llvm-svn: 145165	2011-11-27 05:56:16 +00:00
Chris Lattner	1c9e5678b8	remove support for reading llvm 2.9 .bc files. LLVM 3.1 is only compatible back to 3.0 llvm-svn: 145164	2011-11-27 05:48:27 +00:00
Chris Lattner	74a3e00ebf	add some notes llvm-svn: 145163	2011-11-27 05:47:57 +00:00
Wesley Peck	97b3da5433	Add several new instructions supported by the latest MicroBlaze. These instructions are not generated by the backend yet, this will come in a later commit. llvm-svn: 145161	2011-11-27 05:16:58 +00:00
Bob Wilson	8e6d9da04c	Partially revert r145157 to quiet an unhappy buildbot. Removing that buildbot would be a better solution, but this is at least a temporary workaround. llvm-svn: 145160	2011-11-27 01:48:54 +00:00
Wesley Peck	d2e2e1782f	Optimize comparison against 0 in conditional instructions. Fix a couple of 80-column violations. llvm-svn: 145159	2011-11-27 01:36:20 +00:00
Chandler Carruth	9ffb97e631	Introduce a loop block rotation optimization to the new block placement pass. This is designed to achieve one of the important optimizations that the old code placement pass did, but more simply. This is a somewhat rough and very conservative version of the transform. We could get a lot fancier here if there are profitable cases to do so. In particular, this only looks for a single pattern, it insists that the loop backedge being rotated away is the last backedge in the chain, and it doesn't provide any means of doing better in-loop placement due to the rotation. However, it appears that it will handle the important loops I am finding in the LLVM test suite. llvm-svn: 145158	2011-11-27 00:38:03 +00:00
Bob Wilson	4eefd2d52f	Merge the install-clang-c target into install-clang. <rdar://problem/10217046> llvm-svn: 145157	2011-11-27 00:26:22 +00:00
Benjamin Kramer	7ba71be392	Move code into anonymous namespaces. llvm-svn: 145154	2011-11-26 23:01:57 +00:00
Craig Topper	51280d565b	Merge 128-bit and 256-bit X86ISD node types for VPERMILPS and VPERMILPD. Simplify some shuffle lowering code since V1 can never be UNDEF due to canonalizing that occurs when shuffle nodes are created. llvm-svn: 145153	2011-11-26 22:55:48 +00:00
Wesley Peck	69d5040485	Rename a couple of options and fix some simple typos. llvm-svn: 145152	2011-11-26 21:50:38 +00:00
Craig Topper	7704bd7ac3	Collapse X86ISD node types for PUNPCKH, PUNPCKL, UNPCKLP, and UNPCKHP to not be type specific. Now we just have integer high and low and floating point high and low. Pattern matching will choose the correct instruction based on the vector type. llvm-svn: 145148	2011-11-26 20:47:44 +00:00
Benjamin Kramer	8c8486dbb2	Move the branch probability blurb into the optimizer section. Add a minimal bullet for AVX. llvm-svn: 145145	2011-11-26 11:14:54 +00:00
David Chisnall	07618783f3	Added Objective-C and libc++ details to the 3.0 release notes. llvm-svn: 145144	2011-11-26 10:56:17 +00:00
Chandler Carruth	f156f0cf57	FileCheck-ize this test and make it more precise. This is in preparation for adding other tests. llvm-svn: 145143	2011-11-26 08:24:25 +00:00
Eli Friedman	a84ad7d0d0	Fix APFloat::convert so that it handles narrowing conversions correctly; it was returning incorrect values in rare cases, and incorrectly marking exact conversions as inexact in some more common cases. Fixes PR11406, and a missed optimization in test/CodeGen/X86/fp-stack-O0.ll. llvm-svn: 145141	2011-11-26 03:38:02 +00:00
Benjamin Kramer	a02af616b1	shpelling llvm-svn: 145138	2011-11-25 21:26:00 +00:00
Benjamin Kramer	889b243fd6	Remove ZooLib from the projects list. I don't see how the project is using LLVM and we really can't list every user of the clang analyzer. Sorry. llvm-svn: 145137	2011-11-25 21:03:06 +00:00
Chris Lattner	c3e4fdcc10	add a user llvm-svn: 145136	2011-11-25 20:36:17 +00:00
Chris Lattner	614d0391e9	add some notes llvm-svn: 145135	2011-11-25 20:33:27 +00:00
Chris Lattner	e5b37be30a	add faust llvm-svn: 145134	2011-11-25 20:28:16 +00:00
Bruno Cardoso Lopes	0f9a1f5e6c	This patch contains support for encoding FMA4 instructions and tablegen patterns for scalar FMA4 operations and intrinsic. Also add tests for vfmaddsd. Patch by Jan Sjodin llvm-svn: 145133	2011-11-25 19:33:42 +00:00
NAKAMURA Takumi	989eaf6e3f	ARMLoadStoreOptimizer.cpp: Fix MSVC(Debug) build. llvm-svn: 145129	2011-11-25 09:19:57 +00:00
Craig Topper	d65a444478	Remove 256-bit specific node types for UNPCKHPS/D and instead use the 128-bit versions and let the operand type disinquish. Also fix the load form of the v8i32 patterns for these to realize that the load would be promoted to v4i64. llvm-svn: 145126	2011-11-24 22:57:10 +00:00
Craig Topper	d26466748b	Remove AVX2 specific X86ISD node types for PUNPCKH/L and instead just reuse the 128-bit versions and let the vector type distinguish. llvm-svn: 145125	2011-11-24 22:20:08 +00:00
Benjamin Kramer	8a2d143672	Devirtualize Pass::getPassID, overriding it isn't useful and it gets called a lot. While at it pull the trivial ctor in line. llvm-svn: 145124	2011-11-24 21:14:11 +00:00
Benjamin Kramer	6709e05012	Make ConstantRange::truncate a bit more efficient. llvm-svn: 145122	2011-11-24 17:24:33 +00:00
Benjamin Kramer	651db37352	X86: alias cqo to cqto. llvm-svn: 145121	2011-11-24 12:02:46 +00:00
Chandler Carruth	7adee1a01a	Fix a silly use-after-free issue. A much earlier version of this code need lots of fanciness around retaining a reference to a Chain's slot in the BlockToChain map, but that's all gone now. We can just go directly to allocating the new chain (which will update the mapping for us) and using it. Somewhat gross mechanically generated test case replicates the issue Duncan spotted when actually testing this out. llvm-svn: 145120	2011-11-24 11:23:15 +00:00
Chandler Carruth	d394bafd2d	When adding blocks to the list of those which no longer have any CFG conflicts, we should only be adding the first block of the chain to the list, lest we try to merge into the middle of that chain. Most of the places we were doing this we already happened to be looking at the first block, but there is no reason to assume that, and in some cases it was clearly wrong. I've added a couple of tests here. One already worked, but I like having an explicit test for it. The other is reduced from a test case Duncan reduced for me and used to crash. Now it is handled correctly. llvm-svn: 145119	2011-11-24 08:46:04 +00:00
Jim Grosbach	651e2ee792	Add a few notes for ARM and a blurb about the MCJIT. llvm-svn: 145118	2011-11-24 00:49:21 +00:00
Akira Hatanaka	049e9e4d22	This patch makes the following changes necessary for MIPS' direct code emission. - lower unaligned loads/stores. - encode the size operand of instructions INS and EXT. - emit relocation information needed for JAL (jump-and-link). llvm-svn: 145113	2011-11-23 22:19:28 +00:00
Akira Hatanaka	f5ddf13f79	This patch addresses gp relative fixups/relocations for jump tables. llvm-svn: 145112	2011-11-23 22:18:04 +00:00
Richard Smith	4f9a8081c3	Correctly byte-swap APInts with bit-widths greater than 64. llvm-svn: 145111	2011-11-23 21:33:37 +00:00
Benjamin Kramer	6e013bf96c	Validate the return type when checking if a function is malloc. Fixes PR11426. Not sure if a test case with a "wrong" malloc would be useful. llvm-svn: 145106	2011-11-23 17:58:47 +00:00
Duncan Sands	81a2af12d6	Fix a crash in which a multiplication was being reported as being both negative and positive: positive, because it could be directly computed to be positive; negative, because the nsw flags means it is either negative or undefined (the multiplication always overflowed). llvm-svn: 145104	2011-11-23 16:26:47 +00:00
Benjamin Kramer	ebcb451874	X86: Use btq for bit tests if the immediate can't be encoded in 32 bits. Before: movabsq $4294967296, %rax ## encoding: [0x48,0xb8,0x00,0x00,0x00,0x00,0x01,0x00,0x00,0x00] testq %rax, %rdi ## encoding: [0x48,0x85,0xf8] jne LBB0_2 ## encoding: [0x75,A] After: btq $32, %rdi ## encoding: [0x48,0x0f,0xba,0xe7,0x20] jb LBB0_2 ## encoding: [0x72,A] btq is usually slower than testq because it doesn't fuse with the jump, but here we're better off saving one register and a giant movabsq. llvm-svn: 145103	2011-11-23 13:54:17 +00:00
NAKAMURA Takumi	0b3e996485	test/CodeGen/X86/block-placement.ll: Add explicit -mtriple=i686-linux. X86 Win32 CodeGen does not support EH yet. llvm-svn: 145101	2011-11-23 12:18:22 +00:00
Chandler Carruth	99fe42fbd9	Relax an invariant that block placement was trying to assert a bit further. This invariant just wasn't going to work in the face of unanalyzable branches; we need to be resillient to the phenomenon of chains poking into a loop and poking out of a loop. In fact, we already were, we just needed to not assert on it. This was found during a bootstrap with block placement turned on. llvm-svn: 145100	2011-11-23 10:35:36 +00:00
Elena Demikhovsky	779ba6d7b7	I added several lines in X86 code generator that allow to choose VSHUFPS/VSHUFPD instructions while lowering VECTOR_SHUFFLE node. I check a commuted VSHUFP mask. The patch was reviewed by Bruno. llvm-svn: 145099	2011-11-23 10:23:16 +00:00
Chandler Carruth	8c68f1f3c8	Handle the case of a no-return invoke correctly. It actually still has successors, they just are all landing pad successors. We handle this the same way as no successors. Comments attached for the next person to wade through here and another lovely test case courtesy of Benjamin Kramer's bugpoint reduction. llvm-svn: 145098	2011-11-23 08:23:54 +00:00
Bob Wilson	ebb44646c4	Enable stack protectors for all arrays, not just char arrays. rdar://5875909 Patch by Bill Wendling. llvm-svn: 145097	2011-11-23 07:13:56 +00:00
Jakob Stoklund Olesen	02845410f9	Fix PR11422. This was a bug in keeping track of the available domains when merging domain values. The wrong domain mask caused ExecutionDepsFix to try to move VANDPSYrr to the integer domain which is only available in AVX2. Also add an assertion to catch future attempts at emitting AVX2 instructions. llvm-svn: 145096	2011-11-23 04:03:08 +00:00
Rafael Espindola	5d03d46127	Point to libLTO with -L/PATH/ -lLTO so that it is found in the install directory. Patch by Markus Trippelsdorf. llvm-svn: 145095	2011-11-23 03:07:25 +00:00
Chandler Carruth	4a87aa0c31	Fix a crash in block placement due to an inner loop that happened to be reversed in the function's original ordering, and we happened to encounter it while handling an outer unnatural CFG structure. Thanks to the test case reduced from GCC's source by Benjamin Kramer. This may also fix a crasher in gzip that Duncan reduced for me, but I haven't yet gotten to testing that one. llvm-svn: 145094	2011-11-23 03:03:21 +00:00
Kostya Serebryany	8b5c7a56a3	[asan] do not instrument threadlocal globals, this is buggy llvm-svn: 145092	2011-11-23 02:10:54 +00:00
Anshuman Dasgupta	bcf6a37a58	Undo test commit llvm-svn: 145079	2011-11-22 20:05:48 +00:00
Anshuman Dasgupta	9ff0894703	Test commit llvm-svn: 145078	2011-11-22 20:03:30 +00:00
Hal Finkel	6f0ae783fe	add basic PPC register-pressure feedback; adjust the vaarg test to match the new register-allocation pattern llvm-svn: 145065	2011-11-22 16:21:04 +00:00
Craig Topper	83c4592619	More fixes to the X86InstComments for shuffle instructions. In particular add AVX flavors of many instructions and fix the destination operand for some of the existing AVX entries. llvm-svn: 145063	2011-11-22 14:27:57 +00:00
Chandler Carruth	ee54feb6f6	Fix a devilish miscompile exposed by block placement. The updateTerminator code didn't correctly handle EH terminators in one very specific case. AnalyzeBranch would find no terminator instruction, and so the fallback in updateTerminator is to assume fallthrough. This is correct, but the destination of the fallthrough was assumed to be the first successor. This is almost always true, but in certain cases the loop transformations will cause the landing pad to be the first successor! Instead of this brittle logic, actually look through the successors for a non-landing-pad accessor, and to assert if more than one is found. This will hopefully fix some (if not all) of the self host miscompiles with block placement. Thanks to Benjamin Kramer for reporting, Nick Lewycky for an initial stab at a reduction, and Duncan for endless advice on EH (which I know nothing about) as well as reviewing the actual fix. llvm-svn: 145062	2011-11-22 13:13:16 +00:00
Benjamin Kramer	e1effb0da2	Add configure checking for pread(2) and use it to save a syscall when reading files. llvm-svn: 145061	2011-11-22 12:31:53 +00:00
Chandler Carruth	e2530dc889	Fix an obvious omission in the SelectionDAGBuilder where we were dropping weights on the floor for invokes. This was impeding my writing further test cases for invoke when interacting with probabilities and block placement. No test case as there doesn't appear to be a way to test this stuff. =/ Suggestions for a test case of course welcome. I hope to be able to add test cases that indirectly cover this eventually by adding probabilities to the exceptional edge and reordering blocks as a result. llvm-svn: 145060	2011-11-22 11:37:46 +00:00
Benjamin Kramer	f22623b78b	Turn error recovery into an assert. This was put in because in a certain version of DragonFlyBSD stat(2) lied about the size of some files. This was fixed a long time ago so we can remove the workaround. llvm-svn: 145059	2011-11-22 11:37:11 +00:00
Rafael Espindola	c55e1af137	Add triple to the test. llvm-svn: 145057	2011-11-22 06:36:25 +00:00
Rafael Espindola	2021f38281	If a register is both an early clobber and part of a tied use, handle the use before the clobber so that we copy the value if needed. Fixes pr11415. llvm-svn: 145056	2011-11-22 06:27:18 +00:00
Craig Topper	ccb7097509	Fix shuffle decoding logic to handle UNPCKLPS/UNPCKLPD on 256-bit vectors correctly. Add support for decoding UNPCKHPS/UNPCKHPD for AVX 128-bit and 256-bit forms. llvm-svn: 145055	2011-11-22 01:57:35 +00:00
Craig Topper	f563977795	Add methods for querying minimum SSE version along with AVX. Simplifies all the places that had to check a version of SSE and AVX. llvm-svn: 145053	2011-11-22 00:44:41 +00:00
Sebastian Pop	74e1bc7933	fix typo in comment llvm-svn: 145048	2011-11-21 20:46:55 +00:00
Nick Lewycky	063ae5897c	Fix crasher in GVN due to my recent capture tracking changes. llvm-svn: 145047	2011-11-21 19:42:56 +00:00
Nick Lewycky	aa2a00db35	Add virtual destructor. Whoops! llvm-svn: 145044	2011-11-21 18:32:21 +00:00
Craig Topper	6270d072c5	Lowering for v32i8 to VPUNPCKLBW/VPUNPCKHBW when AVX2 is enabled. llvm-svn: 145028	2011-11-21 08:26:50 +00:00
Craig Topper	d12d6f4b1c	Test case for r145026 llvm-svn: 145027	2011-11-21 06:58:09 +00:00
Craig Topper	669199ca94	Add support for lowering 256-bit shuffles to VPUNPCKL/H for i16, i32, i64 if AVX2 is enabled. llvm-svn: 145026	2011-11-21 06:57:39 +00:00
Joe Abbey	96e89f6412	Fixing a comment llvm-svn: 145025	2011-11-21 04:42:21 +00:00
Craig Topper	a065238c6e	Make LowerSIGN_EXTEND_INREG split 256-bit vectors when AVX1 is enabled and use AVX2 shifts when AVX2 is enabled. llvm-svn: 145022	2011-11-21 01:12:36 +00:00
Nick Lewycky	6ae03c3378	Less template, more virtual! Refactoring suggested by Chris in code review. llvm-svn: 145014	2011-11-20 19:37:06 +00:00
Nick Lewycky	612d70b19d	Refactor code to use new attribute getters on CallSite for NoCapture and ByVal. Suggested in code review by Eli. That code in InstCombine looks kinda suspicious. llvm-svn: 145013	2011-11-20 19:09:04 +00:00
NAKAMURA Takumi	76dfa03874	test/CodeGen/X86/block-placement.ll: Relax expressions for Win32. llvm-svn: 145011	2011-11-20 12:49:45 +00:00
Chandler Carruth	18dfac385b	The logic for breaking the CFG in the presence of hot successors didn't properly account for the global probability of the edge being taken. This manifested as a very large number of unconditional branches to blocks being merged against the CFG even though they weren't particularly hot within the CFG. The fix is to check whether the edge being merged is both locally hot relative to other successors for the source block, and globally hot compared to other (unmerged) predecessors of the destination block. This introduces a new crasher on GCC single-source, but it's currently behind a flag, and Ben has offered to work on the reduction. =] llvm-svn: 145010	2011-11-20 11:22:06 +00:00
Chandler Carruth	bcb5f39526	Make an obviously const interface actually be marked as const. llvm-svn: 145009	2011-11-20 11:22:03 +00:00
Benjamin Kramer	650c09aa4d	XFAIL this test until I figure out what indvars is doing here (or find someone who does) llvm-svn: 145008	2011-11-20 11:10:03 +00:00
Benjamin Kramer	b5ba2eef2d	SCEV: Actually set overflow flags on add expressions. setFlags doesn't modify its arguments. llvm-svn: 145007	2011-11-20 10:24:36 +00:00
Chandler Carruth	20df3953d3	Add some comments to the latest test case I added here to document what is actually being tested. Also add some FileCheck goodness to much more carefully ensure that the result is the desired result. Before this test would only have failed through an assert failure if the underlying fix were reverted. Also, add some weight metadata and a comment explaining exactly what is going on to a trick section of the test case. Originally, we were getting very unlucky and trying to form a block chain that isn't actually profitable. I'm working on a fix to avoid forming these unprofitable chains, and that would also have masked any failure from this test case. The easy solution is to add some metadata that makes it really profitable to form the bad chain here. llvm-svn: 145006	2011-11-20 09:30:40 +00:00
Craig Topper	e79761df73	Add code for lowering v32i8 shifts by a splat to AVX2 immediate shift instructions. Remove 256-bit splat handling from LowerShift as it was already handled by PerformShiftCombine. llvm-svn: 145005	2011-11-20 00:12:05 +00:00
Craig Topper	a3a6583694	Use 256-bit vcmpeqd for creating an all ones vector when AVX2 is enabled. llvm-svn: 145004	2011-11-19 22:34:59 +00:00
Craig Topper	bac86038ac	Remove some of the special classes that worked around an old tablegen limitation of not being able to remove redundant bitconverts from patterns. llvm-svn: 145003	2011-11-19 21:01:54 +00:00
Craig Topper	3af6ae089f	Custom lower AVX2 variable shift intrinsics to shl/srl/sra nodes and remove the intrinsic patterns. llvm-svn: 144999	2011-11-19 17:46:46 +00:00
Chandler Carruth	f3dc9eff16	Move the handling of unanalyzable branches out of the loop-driven chain formation phase and into the initial walk of the basic blocks. We essentially pre-merge all blocks where unanalyzable fallthrough exists, as we won't be able to update the terminators effectively after any reorderings. This is quite a bit more principled as there may be CFGs where the second half of the unanalyzable pair has some analyzable predecessor that gets placed first. Then it may get placed next, implicitly breaking the unanalyzable branch even though we never even looked at the part that isn't analyzable. I've included a test case that triggers this (thanks Benjamin yet again!), and I'm hoping to synthesize some more general ones as I dig into related issues. Also, to make this new scheme work we have to be able to handle branches into the middle of a chain, so add this check. We always fallback on the incoming ordering. Finally, this starts to really underscore a known limitation of the current implementation -- we don't consider broken predecessors when merging successors. This can caused major missed opportunities, and is something I'm planning on looking at next (modulo more bug reports). llvm-svn: 144994	2011-11-19 10:26:02 +00:00
Craig Topper	6d77f4ae14	Test cases for SSSE3/AVX integer horizontal add/sub. llvm-svn: 144990	2011-11-19 09:03:33 +00:00
Craig Topper	f984efbfce	Synthesize SSSE3/AVX 128-bit horizontal integer add/sub instructions from add/sub of appropriate shuffle vectors. llvm-svn: 144989	2011-11-19 09:02:40 +00:00
Craig Topper	81390be00f	Collapse X86 PSIGNB/PSIGNW/PSIGND node types. llvm-svn: 144988	2011-11-19 07:33:10 +00:00
Craig Topper	de6b73bb4d	Extend VPBLENDVB and VPSIGN lowering to work for AVX2. llvm-svn: 144987	2011-11-19 07:07:26 +00:00
Craig Topper	75ffc5fbb5	Remove some unnecessary filtering checks from X86 disassembler table build. llvm-svn: 144986	2011-11-19 05:48:20 +00:00
Craig Topper	66e2b5a61e	Remove unused parameters from the AVX maskmov classes. llvm-svn: 144985	2011-11-19 04:49:22 +00:00
Andrew Trick	6b4d578f54	Fix a corner case in updating LoopInfo after fully unrolling an outer loop. The loop tree's inclusive block lists are painful and expensive to update. (I have no idea why they're inclusive). The design was supposed to handle this case but the implementation missed it and my unit tests weren't thorough enough. Fixes PR11335: loop unroll update. llvm-svn: 144970	2011-11-18 03:42:41 +00:00
Nadav Rotem	1ec141d0f9	Add AVX2 vpbroadcast support llvm-svn: 144967	2011-11-18 02:49:55 +00:00
Kostya Serebryany	1cdc6e9567	[asan] workaround for reg alloc bug 11395: don't instrument functions with large chunks of inline assembler llvm-svn: 144962	2011-11-18 01:41:06 +00:00
Chad Rosier	ee93ff736a	Guard call to getRegForValue with isTypeLegal check to avoid unnecessary work/dead code. llvm-svn: 144959	2011-11-18 01:17:34 +00:00
Devang Patel	107e8ec30d	DISubrange supports unsigned lower/upper array bounds, so let's not fake it in the end while emitting DWARF. If a FE needs to encode signed lower/upper array bounds then we need to extend DISubrange or ad DISignedSubrange. llvm-svn: 144937	2011-11-17 23:43:15 +00:00
Kostya Serebryany	a6edf4c21f	quick fix: remove GlobalVariable::GlobalVariable mistakenly commited at r144933. For some reason this compiles on linux llvm-svn: 144936	2011-11-17 23:37:53 +00:00
Andrew Trick	949045864d	Fix an overly general check in SimplifyIndvar to handle useless phi cycles. The right way to check for a binary operation is cast<BinaryOperator>. The original check: cast<Instruction> && numOperands() == 2 would match phi "instructions", leading to an infinite loop in extreme corner case: a useless phi with operands [self, constant] that prior optimization passes failed to remove, being used in the loop by another useless phi, in turn being used by an lshr or udiv. Fixes PR11350: runaway iteration assertion. llvm-svn: 144935	2011-11-17 23:36:35 +00:00
Kostya Serebryany	65e2211b95	fall back to explicit list of allowed linkages when instrumenting globals in asan; add a test check that asan does not touch linkonce_odr llvm-svn: 144933	2011-11-17 23:14:59 +00:00
Ted Kremenek	b42cfa0015	Fix bug in RefCountedBase/RefCountedBaseVPTR where the reference count was accidentally copied as part of the copy constructor. This could result in objects getting leaked because there reference count was too high. llvm-svn: 144931	2011-11-17 23:02:14 +00:00
Chad Rosier	0eff3e5c21	Add TODO comment. llvm-svn: 144920	2011-11-17 21:46:13 +00:00

... 3 4 5 6 7 ...

78486 Commits