llvm-project

Author	SHA1	Message	Date
Jim Grosbach	dc1e36e9f5	Tidy up. Trailing whitespace. llvm-svn: 156602	2012-05-11 01:41:30 +00:00
Eli Friedman	e0a64d83fc	Fix a minor logic mistake transforming compares in instcombine. PR12514. llvm-svn: 156600	2012-05-11 01:32:59 +00:00
Manman Ren	dc8ad0058f	ARM: peephole optimization to remove cmp instruction This patch will optimize the following cases: sub r1, r3 \| sub r1, imm cmp r3, r1 or cmp r1, r3 \| cmp r1, imm bge L1 TO subs r1, r3 bge L1 or ble L1 If the branch instruction can use flag from "sub", then we can replace "sub" with "subs" and eliminate the "cmp" instruction. rdar: 10734411 llvm-svn: 156599	2012-05-11 01:30:47 +00:00
Dan Gohman	dfab443ae8	Define a new intrinsic, @llvm.debugger. It will be similar to __builtin_trap(), but it generates int3 on x86 instead of ud2. llvm-svn: 156593	2012-05-11 00:19:32 +00:00
Eric Christopher	b6148ed72c	Allow unique_file to take a mode for file permissions, but default to user only read/write. Part of rdar://11325849 llvm-svn: 156591	2012-05-11 00:07:44 +00:00
Chad Rosier	8244b1dc7e	Fix indentation. llvm-svn: 156589	2012-05-10 23:38:07 +00:00
Nuno Lopes	f573030391	objectsize: add support for GEPs with non-constant indexes add an additional parameter to InstCombiner::EmitGEPOffset() to force it to not emit operations with NUW flag llvm-svn: 156585	2012-05-10 23:17:35 +00:00
Preston Gurd	4fe10a5d9a	Added X86 Atom latencies for instructions in X86InstrInfo.td. llvm-svn: 156579	2012-05-10 21:58:35 +00:00
Eric Christopher	ed51b9ec0b	Add support for the 'X' inline asm operand modifier. Patch by Jack Carter. llvm-svn: 156577	2012-05-10 21:48:22 +00:00
Andrew Trick	c5d7008f27	misched: Print machineinstrs with -debug-only=misched llvm-svn: 156576	2012-05-10 21:06:21 +00:00
Andrew Trick	419eae2db7	misched: tracing register pressure heuristics. llvm-svn: 156575	2012-05-10 21:06:19 +00:00
Andrew Trick	7ee9de51f2	misched: Add register pressure backoff to ConvergingScheduler. Prioritize the instruction that comes closest to keeping pressure under the target's limit. Then prioritize instructions that avoid increasing the max pressure in the scheduled region. The max pressure heuristic is a tad aggressive. Later I'll fix it to consider the unscheduled pressure as well. WIP: This is mostly functional but untested and not likely to do much good yet. llvm-svn: 156574	2012-05-10 21:06:16 +00:00
Andrew Trick	795c1120a6	misched: Release only unscheduled nodes into ReadyQ. llvm-svn: 156573	2012-05-10 21:06:14 +00:00
Andrew Trick	95dafd8b31	misched: Added ReadyQ container wrapper for Top and Bottom Queues. llvm-svn: 156572	2012-05-10 21:06:12 +00:00
Andrew Trick	4add42f439	misched: Introducing Top and Bottom register pressure trackers during scheduling. llvm-svn: 156571	2012-05-10 21:06:10 +00:00
Sirish Pande	fc8118bf41	Hexagon V5 Support - V5 td file. llvm-svn: 156569	2012-05-10 20:24:28 +00:00
Sirish Pande	69295b8963	Hexagon V5 FP Support. llvm-svn: 156568	2012-05-10 20:20:25 +00:00
Andrew Trick	75812f815c	RegPressure: API for speculatively checking instruction pressure. Added getMaxExcessUpward/DownwardPressure. They somewhat abuse the tracker by speculatively handling an instruction out of order. But it is convenient for now. In the future, we will cache each instruction's pressure contribution to make this efficient. llvm-svn: 156561	2012-05-10 19:11:52 +00:00
Andrew Trick	1df762abf4	RegPressure: fix array index iteration style. llvm-svn: 156560	2012-05-10 19:11:49 +00:00
Dan Gohman	ed7c24e2d9	Teach DeadStoreElimination to eliminate exit-block stores with phi addresses. llvm-svn: 156558	2012-05-10 18:57:38 +00:00
Manman Ren	b555b382bd	Revert: 156550 "ARM: peephole optimization to remove cmp instruction" This commit broke an external linux bot and gave a compile-time warning. llvm-svn: 156556	2012-05-10 18:49:43 +00:00
Dan Gohman	0291246ce7	Rewrite ScalarEvolution::hasOperand to use an explicit worklist instead of recursion, to avoid excessive stack usage on deep expressions. llvm-svn: 156554	2012-05-10 17:21:30 +00:00
Nuno Lopes	300d629924	teach DSE and isInstructionTriviallyDead() about calloc llvm-svn: 156553	2012-05-10 17:14:00 +00:00
Manman Ren	c860887b2d	ARM: peephole optimization to remove cmp instruction This patch will optimize the following cases: sub r1, r3 \| sub r1, imm cmp r3, r1 or cmp r1, r3 \| cmp r1, imm bge L1 TO subs r1, r3 bge L1 or ble L1 If the branch instruction can use flag from "sub", then we can replace "sub" with "subs" and eliminate the "cmp" instruction. rdar: 10734411 llvm-svn: 156550	2012-05-10 16:48:21 +00:00
Joel Jones	3d90a9ae65	Fix a problem with incomplete equality testing of PHINodes in Instruction::IsIdenticalToWhenDefined. This manifested itself when inlining two calls to the same function. The inlined function had a switch statement that returned one of a set of global variables. Without this modification, the two phi instructions that chose values from the branches of the switch instruction inlined from the callee were considered equivalent and jump-threading replaced a load for the first switch value with a phi selecting from the second switch, thereby producing incorrect code. This patch has been tested with "make check-all", "lnt runtest nt", and llvm self-hosted, and on the original program that had this problem, wireshark. <rdar://problem/11025519> llvm-svn: 156548	2012-05-10 15:59:41 +00:00
Nadav Rotem	1a65397017	Fix merge-typo and cleanup llvm-svn: 156541	2012-05-10 12:50:02 +00:00
Nadav Rotem	15946e50c1	AVX2: Add an additional broadcast idiom. llvm-svn: 156540	2012-05-10 12:39:13 +00:00
Nadav Rotem	b86a3fb8d0	Generate AVX/AVX2 shuffles even when there is a memory op somewhere else in the program. Starting r155461 we are able to select patterns for vbroadcast even when the load op is used by other users. Fix PR11900. llvm-svn: 156539	2012-05-10 12:22:05 +00:00
Jim Grosbach	1ace8b6a76	ExecutionEngine: Check for NULL ErrorStr before using it. Patch by Yury Mikhaylov <yury.mikhaylov@gmail.com>. llvm-svn: 156523	2012-05-10 00:31:50 +00:00
Dan Gohman	f8b19d09ba	Fix the objc_storeStrong recognizer to stop before walking off the end of a basic block if there's no store. llvm-svn: 156520	2012-05-09 23:08:33 +00:00
Nuno Lopes	7100f463b0	objectsize: refactor code a bit to enable future changes to support run-time information add support to compute allocation sizes at run-time if penalty > 1 (e.g., malloc(x), calloc(x, y), and VLAs) llvm-svn: 156515	2012-05-09 21:30:57 +00:00
Roman Divacky	e07cc042f6	Mark .opd @progbits, thus avoiding a warning from asm. llvm-svn: 156494	2012-05-09 18:24:23 +00:00
Chad Rosier	9d7b1cee39	Set the default iOS version to 3.0. llvm-svn: 156492	2012-05-09 18:23:00 +00:00
Bob Wilson	8d4e2fab63	Use the cpuid 64 bit flag to pick the default CPU name for an unknown model. For the Family 6 switch in sys::getHostCPUName, an unrecognized model was reported as "i686". That's a really bad default since it means that new CPUs will be treated as if they can only use 32-bit code. This just looks at the cpuid extended feature flag for 64 bit support, and if that is set, it uses a default x86-64 cpu. Similar logic is already used for the Family 15 code. <rdar://problem/11314502> llvm-svn: 156486	2012-05-09 17:47:03 +00:00
Chad Rosier	2778cbc880	Don't return true on a function with a void return type. llvm-svn: 156484	2012-05-09 17:38:47 +00:00
Chad Rosier	d84eaac673	Add Triple::getiOSVersion. This new function provides a way to get the iOS version number from ios triples. Part of rdar://11409204 llvm-svn: 156483	2012-05-09 17:23:48 +00:00
Hans Wennborg	b7ef2fe8ae	Introduce llvm-c function LLVMPrintModuleToFile. This lets you save the textual representation of the LLVM IR to a file. Before this patch it could only be printed to STDERR from llvm-c. Patch by Carlo Kok! llvm-svn: 156479	2012-05-09 16:54:17 +00:00
Nuno Lopes	01547b3ad2	change the objectsize intrinsic signature: add a 3rd parameter to denote the maximum runtime performance penalty that the user is willing to accept. This commit only adds the parameter. Code taking advantage of it will follow. llvm-svn: 156473	2012-05-09 15:52:43 +00:00
Bill Wendling	a3aeb980d2	Supply a C interface to the "LinkModules" method. Patch by Andrew Wilkins! llvm-svn: 156469	2012-05-09 08:55:40 +00:00
Craig Topper	28540adfcf	Remove unused variable to get rid of warning. llvm-svn: 156466	2012-05-09 07:08:58 +00:00
Akira Hatanaka	ca41d13bbd	Add another peephole pattern for conditional moves. llvm-svn: 156460	2012-05-09 02:29:29 +00:00
Jakob Stoklund Olesen	7e21d617ef	Use ptr_rc_tailcall instead of GR32_TC. The getPointerRegClass() hook will return GR32_TC, or whatever is appropriate for the current function. Patch by Yiannis Tsiouris! llvm-svn: 156459	2012-05-09 01:50:09 +00:00
Akira Hatanaka	05b9dad1e6	Make register FP allocatable if the compiled function does not have dynamic allocas. llvm-svn: 156458	2012-05-09 01:38:13 +00:00
Akira Hatanaka	0a8ab718cb	Expand 64-bit shifts if target ABI is O32. llvm-svn: 156457	2012-05-09 00:55:21 +00:00
Richard Trieu	edf46e6b6e	Remove unused variable to silence compiler warning. llvm-svn: 156456	2012-05-09 00:30:21 +00:00
Dan Gohman	41375a3545	Miscellaneous accumulated cleanups. llvm-svn: 156445	2012-05-08 23:39:44 +00:00
Kevin Enderby	fe3d005ca5	Fix it so llvm-objdump -arch does accept x86 and x86-64 as valid arch names. PR12731. Patch by Meador Inge! llvm-svn: 156444	2012-05-08 23:38:45 +00:00
Dan Gohman	61708d37d6	Fix objc_storeStrong pattern matching to catch a potential use of the old value after the store but before it is released. This fixes rdar:/11116986. llvm-svn: 156442	2012-05-08 23:34:08 +00:00
Jakob Stoklund Olesen	10191fd44f	Use a shared function for a common operation. llvm-svn: 156441	2012-05-08 23:27:30 +00:00
Eric Christopher	8d2a77de63	Fix thinko in conditional. Part of rdar://11352000 and should bring the buildbots back. llvm-svn: 156421	2012-05-08 21:24:39 +00:00
Jim Grosbach	92f6adc8be	DAGCombiner should not change the type of an extract_vector index. When a combine twiddles an extract_vector, care should be taken to preserve the type of the index operand. No luck extracting a reasonable testcase, unfortunately. rdar://11391009 llvm-svn: 156419	2012-05-08 20:56:07 +00:00
Eric Christopher	d666bb0dd8	Remove excess semi-colons to quiet warnings. llvm-svn: 156416	2012-05-08 20:45:04 +00:00
Daniel Dunbar	5f1c956eb0	[Support] Fix sys::GetRandomNumber() to always use a high quality seed. llvm-svn: 156414	2012-05-08 20:38:00 +00:00
Sirish Pande	1c9f7dbc10	Update load/store instruction patterns in Hexagon V4. llvm-svn: 156411	2012-05-08 19:50:20 +00:00
Akira Hatanaka	fd82286e62	Formatting fixes. Patch by Jack Carter. llvm-svn: 156409	2012-05-08 19:14:42 +00:00
Akira Hatanaka	c515bfb9e7	Define mips16 instruction formats. Patch by Reed Kotler. llvm-svn: 156408	2012-05-08 19:08:58 +00:00
Eric Christopher	4d25052a9a	Handle OpDeref in case it comes in as a register operand. Part of rdar://11352000 llvm-svn: 156405	2012-05-08 18:56:00 +00:00
Nuno Lopes	24ac479a7d	remove autoupgrade code for old function attributes format. I still left another fixme regarding alignment, because I'm unsure how to remove that code without breaking things llvm-svn: 156387	2012-05-08 17:07:35 +00:00
Nuno Lopes	f7596c91af	remove TYPE_CODE_FUNCTION_OLD type code. it is no longer in use and it was marked for removal in 3.0 llvm-svn: 156383	2012-05-08 16:16:20 +00:00
Jakob Stoklund Olesen	276ae14023	s/CSR_Ghc/CSR_NoRegs/ Share the CalleeSavedRegs defs between all calling conventions having no callee-saved registers. Patch by Yiannis Tsiouris! llvm-svn: 156382	2012-05-08 15:07:29 +00:00
NAKAMURA Takumi	3b7f995b75	Windows/PathV2.inc: Retry rename() for (maximum) 2 seconds. Files might be opened by system scanners (e.g. file indexer, virus scanner, &c). llvm-svn: 156380	2012-05-08 14:31:46 +00:00
Duncan Sands	3bbb1d50df	Calling ReassociateExpression recursively is extremely dangerous since it will replace the operands of expressions with only one use with undef and generate a new expression for the original without using RAUW to update the original. Thus any copies of the original expression held in a vector may end up referring to some bogus value - and using a ValueHandle won't help since there is no RAUW. There is already a mechanism for getting the effect of recursion non-recursively: adding the value to be recursed on to RedoInsts. But it wasn't being used systematically. Have various places where recursion had snuck in at some point use the RedoInsts mechanism instead. Fixes PR12169. llvm-svn: 156379	2012-05-08 12:16:05 +00:00
Stepan Dyatkovskiy	5eafce5c88	Rejected r156374: Ordinary PR1255 patch. Due to clang-x86_64-debian-fnt buildbot failure. llvm-svn: 156377	2012-05-08 08:33:21 +00:00
Craig Topper	7daf897678	Remove 256-bit AVX non-temporal store intrinsics. Similar was previously done for 128-bit. llvm-svn: 156375	2012-05-08 06:58:15 +00:00
Stepan Dyatkovskiy	b6a4640163	Ordinary patch for PR1255. Added new case-range-oriented methods for adding/removing cases in SwitchInst. After this patch, cases are internally represented as ConstantArrays instead of ConstantInt; externally, cases are wrapped within the ConstantRangesSet object. The old methods of SwitchInst still work, but are marked as deprecated. So at this stage we have no side effects, except that I added support for case ranges in BitcodeReader/Writer; a test for Bitcode is also added. The old "switch" format is also supported. llvm-svn: 156374	2012-05-08 06:36:08 +00:00
Andrew Trick	d29cd732d4	Allow NULL LoopPassManager argument in UnrollLoop. PR12734. llvm-svn: 156358	2012-05-08 02:52:09 +00:00
Jakob Stoklund Olesen	952b4c11fe	Extract methods for joining physregs. No functional change. llvm-svn: 156345	2012-05-08 00:08:35 +00:00
Jakob Stoklund Olesen	9e8ae6c37f	Naming convention and whitespace. No functional change. llvm-svn: 156342	2012-05-07 23:46:16 +00:00
Jakob Stoklund Olesen	98595b5a61	Coalesce subreg-subreg copies. At least some of them: %vreg1:sub_16bit = COPY %vreg2:sub_16bit; GR64:%vreg1, GR32: %vreg2 Previously, we couldn't figure out that the above copy could be eliminated by coalescing %vreg2 with %vreg1:sub_32bit. The new getCommonSuperRegClass() hook makes it possible. This is not very useful yet since the unmodified part of the destination register usually interferes with the source register. The coalescer needs to understand sub-register interference checking first. llvm-svn: 156334	2012-05-07 22:57:55 +00:00
Jakob Stoklund Olesen	3c52f0281f	Add an MF argument to TRI::getPointerRegClass() and TII::getRegClass(). The getPointerRegClass() hook can return register classes that depend on the calling convention of the current function (ptr_rc_tailcall). So far, we have been able to infer the calling convention from the subtarget alone, but as we add support for multiple calling conventions per target, that no longer works. Patch by Yiannis Tsiouris! llvm-svn: 156328	2012-05-07 22:10:26 +00:00
Jakob Stoklund Olesen	c4b3a7a1d7	Fix bug in TRI::getCommonSuperRegClass(). Test cases for this code are coming. It is not used for anything yet. llvm-svn: 156327	2012-05-07 21:59:31 +00:00
Owen Anderson	ab63d84252	Teach DAG combine to fold x-x to 0.0 when unsafe FP math is enabled. llvm-svn: 156324	2012-05-07 20:51:25 +00:00
Owen Anderson	f4f80e1f39	Teach reassociate to commute FMul's and FAdd's in order to canonicalize the order of their operands across instructions. This allows for greater CSE opportunities. llvm-svn: 156323	2012-05-07 20:47:23 +00:00
Preston Gurd	e65f4e66ac	Make IntelJITEvents and OProfileJIT optional libraries and add optional library support to the llvm-build tool: - Add new command line parameter to llvm-build: “--enable-optional-libraries” - Add handling of new llvm-build library type “OptionalLibrary” - Update CMake and automake build systems to pass correct flags to llvm-build based on configuration Patch by Dan Malea! llvm-svn: 156319	2012-05-07 19:38:40 +00:00
Jakob Stoklund Olesen	65a6dafc8d	Add TRI::getCommonSuperRegClass(). This function is a generalization of getMatchingSuperRegClass() to the symmetric case where both sides are using a sub-register index. It will find a super-register class and sub-register indexes that make this diagram commute: PreA SuperRC ----------> RCA \| \| \| \| PreB \| \| SubA \| \| \| \| V V RCB ----------> SubRC SubB This can be used to coalesce copies like: %vreg1:sub16 = COPY %vreg2:sub16; GR64:%vreg1, GR32: %vreg2 llvm-svn: 156317	2012-05-07 19:14:58 +00:00
Chad Rosier	d8287fec17	Fix a regression from r147481. This combine should only happen if there is a single use. rdar://11360370 llvm-svn: 156316	2012-05-07 18:47:44 +00:00
Matt Beaumont-Gay	a1b3b007f3	Don't assume size_t is unsigned long long. Fixes a -Woverflow warning from gcc when building for 32-bit platforms. llvm-svn: 156313	2012-05-07 18:12:42 +00:00
Manman Ren	ef4e0479ec	X86: optimization for -(x != 0) This patch will optimize -(x != 0) on X86 FROM cmpl $0x01,%edi sbbl %eax,%eax notl %eax TO negl %edi sbbl %eax,%eax In order to generate negl, I added patterns in Target/X86/X86InstrCompiler.td: def : Pat<(X86sub_flag 0, GR32:$src), (NEG32r GR32:$src)>; rdar: 10961709 llvm-svn: 156312	2012-05-07 18:06:23 +00:00
Eric Christopher	0d8c15d20f	Add support for the 'x' constraint. Patch by Jack Carter. llvm-svn: 156295	2012-05-07 06:25:19 +00:00
Eric Christopher	9c492e6ebf	Add support for the 'l' constraint. Patch by Jack Carter. llvm-svn: 156294	2012-05-07 06:25:15 +00:00
Eric Christopher	e3c494de82	Add support for the 'c' constraint. Patch by Jack Carter. llvm-svn: 156293	2012-05-07 06:25:10 +00:00
Eric Christopher	c18ae4a3b1	Add support for the 'P' constraint. Patch by Jack Carter. llvm-svn: 156292	2012-05-07 06:25:02 +00:00
Craig Topper	dbb98b4917	Fix some issues in the f16c instructions. llvm-svn: 156287	2012-05-07 06:00:15 +00:00
Eric Christopher	470578a91b	Add support for the 'O' constraint. Patch by Jack Carter. llvm-svn: 156285	2012-05-07 05:46:48 +00:00
Eric Christopher	e07aa430b8	Add support for the 'N' inline asm constraint. Patch by Jack Carter. llvm-svn: 156284	2012-05-07 05:46:43 +00:00
Eric Christopher	1109b3406d	Add support for the 'L' inline asm constraint. Patch by Jack Carter. llvm-svn: 156283	2012-05-07 05:46:37 +00:00
Eric Christopher	3ff88a05b7	Add support for the inline asm constraint 'K'. llvm-svn: 156282	2012-05-07 05:46:29 +00:00
Craig Topper	d4e1894ec1	Add SSE4A MOVNTSS/MOVNTSD instructions. llvm-svn: 156281	2012-05-07 05:36:19 +00:00
Eric Christopher	7201e1b4b9	Support the 'J' constraint. Patch by Jack Carter. llvm-svn: 156280	2012-05-07 03:13:42 +00:00
Eric Christopher	1d6c89eea1	Add support for the 'I' inline asm constraint. Also add tests from the previous 2 patches. Patch by Jack Carter. llvm-svn: 156279	2012-05-07 03:13:32 +00:00
Eric Christopher	58daf04681	Allow 64 bit integer values in GPR registers if arch and abi are 64 bit. Patch by Jack Carter. llvm-svn: 156278	2012-05-07 03:13:22 +00:00
Eric Christopher	cfcd77b0bc	When using inline asm constraints representing non-floating point general registers allow 8 and 16-bit elements. Patch by Jack Carter. llvm-svn: 156277	2012-05-07 03:13:16 +00:00
Craig Topper	00a1e6d48b	Use MVT instead of EVT as the argument to all the shuffle decode functions. Simplify some of the decode functions. llvm-svn: 156268	2012-05-06 19:46:21 +00:00
Craig Topper	804be3b546	Add VPERMQ/VPERMPD to the list of target specific shuffles that can be looked through for DAG combine purposes. llvm-svn: 156266	2012-05-06 18:54:26 +00:00
Craig Topper	54bdb350e2	Add shuffle decode support for VPERMQ/VPERMPD. llvm-svn: 156265	2012-05-06 18:44:02 +00:00
Chris Lattner	854f366a1f	make SourceMgr tolerate empty SMLoc()'s better. llvm-svn: 156260	2012-05-06 16:20:49 +00:00
Benjamin Kramer	3d38c17b59	Switch the select to branch transformation on by default. The primitive conservative heuristic seems to give a slight overall improvement while not regressing stuff. Make it available to wider testing. If you notice any speed regressions (or significant code size regressions) let me know! llvm-svn: 156258	2012-05-06 14:25:16 +00:00
Jakub Staszak	cfc46f82ff	Remove trailing spaces. llvm-svn: 156257	2012-05-06 13:52:31 +00:00
NAKAMURA Takumi	7bec74112d	Unix/Process.inc: Give more useful random seed to srand. Workaround for PR12743. llvm-svn: 156252	2012-05-06 08:24:24 +00:00
NAKAMURA Takumi	54acb28882	Support/Process: Move llvm::sys::Process::GetRandomNumber() from Process.cpp to Unix/Process.inc. FIXME: GetRandomNumber() is not implemented in Win32. llvm-svn: 156251	2012-05-06 08:24:18 +00:00
Chris Lattner	9322ba824c	reapply my patch, with a fix for an off-by-one error. Turned out to be a lot of work for a drive-by fix :) llvm-svn: 156246	2012-05-05 22:17:32 +00:00
Chris Lattner	64f65d33df	revert my patches, which are causing problems. llvm-svn: 156245	2012-05-05 22:11:04 +00:00
Chris Lattner	cd60bc491e	refactor some code to expose column numbers more and make diagnostic printing slightly more efficient. llvm-svn: 156243	2012-05-05 21:39:51 +00:00
Jim Grosbach	7ce129268e	Nuke a few dead remnants of the CBE. llvm-svn: 156241	2012-05-05 17:45:12 +00:00
Daniel Dunbar	d5f82d92f3	[Support] Add missing include. llvm-svn: 156240	2012-05-05 16:49:11 +00:00
Daniel Dunbar	58ed0c6c09	[Support] Fix up comments. llvm-svn: 156239	2012-05-05 16:39:22 +00:00
Daniel Dunbar	3f0fa19bc4	[Support] Rewrite sys::fs::unique_file to not be stupid with /dev/urandom. - Just use sys::Process::GetRandomNumber instead of having two poor implementations. - This is ~70 times (!) faster on my OS X machine. llvm-svn: 156238	2012-05-05 16:36:24 +00:00
Daniel Dunbar	b57ddd4e29	[Support] Add sys::Process::GetRandomNumber(). - Primitive API, but we rarely have need for random numbers. llvm-svn: 156237	2012-05-05 16:36:20 +00:00
Benjamin Kramer	047d7ca0b1	CodeGenPrepare: Add a transform to turn selects into branches in some cases. This came up when a change in block placement formed a cmov and slowed down a hot loop by 50%: ucomisd (%rdi), %xmm0 cmovbel %edx, %esi cmov is a really bad choice in this context because it doesn't get branch prediction. If we emit it as a branch, an out-of-order CPU can do a better job (if the branch is predicted right) and avoid waiting for the slow load+compare instruction to finish. Of course it won't help if the branch is unpredictable, but those are really rare in practice. This patch uses a dumb conservative heuristic, it turns all cmovs that have one use and a direct memory operand into branches. cmovs usually save some code size, so we disable the transform in -Os mode. In-Order architectures are unlikely to benefit as well, those are included in the "predictableSelectIsExpensive" flag. It would be better to reuse branch probability info here, but BPI doesn't support select instructions currently. It would make sense to use the same heuristics as the if-converter pass, which does the opposite direction of this transform. Test suite shows a small improvement here and there on corei7-level machines, but the actual results depend a lot on the used microarchitecture. The transformation is currently disabled by default and available by passing the -enable-cgp-select2branch flag to the code generator. Thanks to Chandler for the initial test case to him and Evan Cheng for providing me with comments and test-suite numbers that were more stable than mine :) llvm-svn: 156234	2012-05-05 12:49:22 +00:00
Benjamin Kramer	e31f31e5c0	Add a new target hook "predictableSelectIsExpensive". This will be used to determine whether it's profitable to turn a select into a branch when the branch is likely to be predicted. Currently enabled for everything but Atom on X86 and Cortex-A9 devices on ARM. I'm not entirely happy with the name of this flag, suggestions welcome ;) llvm-svn: 156233	2012-05-05 12:49:14 +00:00
Benjamin Kramer	a25a61b9e8	NVPTX: Initialize the UseF32FTZ flag. llvm-svn: 156232	2012-05-05 11:22:02 +00:00
Stepan Dyatkovskiy	cb2a1a34e2	Small fix in InstCombineCasts.cpp. Restored "alloca + bitcast" reducing for case when alloca's size is calculated within the "add/sub/... nsw". Also added fix to 2011-06-13-nsw-alloca.ll test. llvm-svn: 156231	2012-05-05 07:09:40 +00:00
Eric Christopher	de9e92ed9b	Typo. llvm-svn: 156226	2012-05-05 01:16:06 +00:00
Jakob Stoklund Olesen	e326ed33a8	Make sure findRepresentativeClass picks the widest super-register. We want the representative register class to contain the largest super-registers available. This makes the function less sensitive to the register class numbering. llvm-svn: 156220	2012-05-04 22:53:28 +00:00
Jakob Stoklund Olesen	e89496fe63	Remove extra comma in debug output. llvm-svn: 156219	2012-05-04 22:53:26 +00:00
David Blaikie	891d0a3d20	Fix warnings in release build. This fixes a couple of Clang warnings in release builds of LLVM: * Missing return in ISelLowering * Unused variable in NVPTXutil.cpp llvm-svn: 156216	2012-05-04 22:34:16 +00:00
Kevin Enderby	cabbae653e	Tweak to the fix in r156212, as with the change in removing the shift the SignExtend32<22>(Val<<1) also needs to change to SignExtend32<21>(Val) . llvm-svn: 156213	2012-05-04 22:09:52 +00:00
Kevin Enderby	8ce1ada1be	Fix a bug in the ARM disassembler for wide branch conditional instructions where the symbolic operand's displacement was incorrectly shifted left by 1. rdar://11387046 llvm-svn: 156212	2012-05-04 22:02:27 +00:00
Chandler Carruth	cd3464ee22	Fix a Clang warning in the new NVPTX backend: In file included from ../lib/Target/NVPTX/VectorElementize.cpp:53: ../lib/Target/NVPTX/NVPTX.h:44:3: warning: default label in switch which covers all enumeration values [-Wcovered-switch-default] default: assert(0 && "Unknown condition code"); ^ 1 warning generated. The prevailing pattern in LLVM is to not use a default label, and instead to use llvm_unreachable to denote that the switch in fact covers all return paths from the function. llvm-svn: 156209	2012-05-04 21:35:49 +00:00
Chandler Carruth	6781821c01	Teach the code extractor how to extract a sequence of blocks from RegionInfo's RegionNode. This mirrors the logic for automating the extraction from a Loop. llvm-svn: 156208	2012-05-04 21:33:30 +00:00
Chandler Carruth	8880325a92	Rename the Region::block_iterator to Region::block_node_iterator, and add a new Region::block_iterator which actually iterates over the basic blocks of the region. The old iterator, now called 'block_node_iterator', iterates over RegionNodes which contain a single basic block. This works well with the GraphTraits-based iterator design, however most users actually want an iterator over the BasicBlocks inside these RegionNodes. Now the 'block_iterator' is a wrapper which exposes exactly this interface. Internally it uses the block_node_iterator to walk all nodes which are single basic blocks, but transparently unwraps the basic block to make user code simpler. While this patch is a bit of a wash, most of the updates are to internal users, not external users of the RegionInfo. I have an accompanying patch to Polly that is a strict simplification of every user of this interface, and I'm working on a pass that also wants the same simplified interface. This patch alone should have no functional impact. llvm-svn: 156202	2012-05-04 20:55:23 +00:00
Justin Holewinski	ae556d3ef7	This patch adds a new NVPTX back-end to LLVM which supports code generation for NVIDIA PTX 3.0. This back-end will (eventually) replace the current PTX back-end, while maintaining compatibility with it. The new target machines are: nvptx (old ptx32) => 32-bit PTX nvptx64 (old ptx64) => 64-bit PTX The sources are based on the internal NVIDIA NVPTX back-end, and contain more functionality than the current PTX back-end currently provides. NV_CONTRIB llvm-svn: 156196	2012-05-04 20:18:50 +00:00
Sebastian Pop	2420e8b7d5	Added missing CMN case in Thumb2SizeReduction pass so that LLVM emits 16-bits encoding of CMN instructions. llvm-svn: 156195	2012-05-04 19:53:56 +00:00
Preston Gurd	d6c440cd4c	Adds Intel Atom scheduling latencies to X86InstrSystem.td. llvm-svn: 156194	2012-05-04 19:26:37 +00:00
Matt Beaumont-Gay	e82ab6baa7	Pacify GCC's -Wreturn-type llvm-svn: 156189	2012-05-04 18:34:27 +00:00
Chandler Carruth	14316fcf7d	Factor the computation of input and output sets into a public interface of the CodeExtractor utility. This allows speculatively computing input and output sets to measure the likely size impact of the code extraction. These sets cannot be reused sadly -- we mutate the function prior to forming the final sets used by the actual extraction. The interface has been revamped slightly to make it easier to use correctly by making the interface const and sinking the computation of the number of exit blocks into the full extraction function and away from the rest of this logic which just computed two output parameters. llvm-svn: 156168	2012-05-04 11:20:27 +00:00
Chandler Carruth	44e13911bc	Rather than trying to gracefully handle input sequences with repeated blocks, assert that this doesn't happen. We don't want to bother trying to support this call pattern as it isn't necessary. llvm-svn: 156167	2012-05-04 11:17:06 +00:00
Chandler Carruth	0a570552d1	Fix a goof with my previous commit by completely returning when we detect an ineligible block rather than just breaking out of the loop. llvm-svn: 156166	2012-05-04 11:14:19 +00:00
Chandler Carruth	2f5d0191f7	Hoist a safety assert from the extraction method into the construction of the extractor itself. llvm-svn: 156164	2012-05-04 10:26:45 +00:00
Chandler Carruth	0fde00150d	Move the CodeExtractor utility to a dedicated header file / source file, and expose it as a utility class rather than as free function wrappers. The simple free-function interface works well for the bugpoint-specific pass's uses of code extraction, but in an upcoming patch for more advanced code extraction, they simply don't expose a rich enough interface. I need to expose various stages of the process of doing the code extraction and query information to decide whether or not to actually complete the extraction or give up. Rather than build up a new predicate model and pass that into these functions, just take the class that was actually implementing the functions and lift it up into a proper interface that can be used to perform code extraction. The interface is cleaned up and re-documented to work better in a header. It also is now setup to accept the blocks to be extracted in the constructor rather than in a method. In passing this essentially reverts my previous commit here exposing a block-level query for eligibility of extraction. That is no longer necessary with the more rich interface as clients can query the extraction object for eligibility directly. This will reduce the number of walks of the input basic block sequence by quite a bit which is useful if this enters the normal optimization pipeline. llvm-svn: 156163	2012-05-04 10:18:49 +00:00
Hans Wennborg	aea412008e	Make ARM and Mips use TargetMachine::getTLSModel() This moves the logic for selecting a TLS model to a single place, instead of the previous three (ARM, Mips, and X86 which already uses this function). llvm-svn: 156162	2012-05-04 09:40:39 +00:00
Craig Topper	bdd2e34b1f	Fix some loops to match coding standards. No functional change intended. llvm-svn: 156159	2012-05-04 06:39:13 +00:00
Craig Topper	d4d3237bb8	Fix up some spacing. No functional change. llvm-svn: 156158	2012-05-04 06:18:33 +00:00
Craig Topper	e2ae413746	Simplify broadcast lowering code. No functional change intended. llvm-svn: 156157	2012-05-04 05:49:51 +00:00
Craig Topper	42f2182366	Allow v16i16 and v32i8 shuffles to be rewritten as narrower shuffles. llvm-svn: 156156	2012-05-04 04:44:49 +00:00
Bill Wendling	fa0ebcd1b0	Add 'landingpad' instructions to the list of instructions to ignore. Also combine the code in the 'assert' statement. llvm-svn: 156155	2012-05-04 04:22:32 +00:00
Craig Topper	59063c0a3d	Simplify shuffle narrowing code a bit. No functional change intended. llvm-svn: 156154	2012-05-04 04:08:44 +00:00
Jakob Stoklund Olesen	796e5272ab	Remove the SubRegClasses field from RegisterClass descriptions. This information is now computed by TableGen. llvm-svn: 156152	2012-05-04 03:30:34 +00:00
Jakob Stoklund Olesen	75fbe90839	Use SuperRegClassIterator for findRepresentativeClass(). The masks returned by SuperRegClassIterator are computed automatically by TableGen. This is better than depending on the manually specified SuperRegClasses. llvm-svn: 156147	2012-05-04 02:19:22 +00:00
Jakob Stoklund Olesen	34a8f13e5f	Initialize SparcInstrInfo before SparcTargetLowering. The TargetLowering construction needs to use a valid TargetRegisterInfo instance. llvm-svn: 156146	2012-05-04 02:16:39 +00:00
Jakob Stoklund Olesen	57c7050675	Add a SuperRegClassIterator class. This iterator class provides a more abstract interface to the (Idx, Mask) lists of super-registers for a register class. The layout of the tables shouldn't be exposed to clients. llvm-svn: 156144	2012-05-04 01:48:29 +00:00
Chandler Carruth	da7513a834	A pile of long over-due refactorings here. There are some very, very minor behavior changes with this, but nothing I have seen evidence of in the wild or expect to be meaningful. The real goal is unifying our logic and simplifying the interfaces. A summary of the changes follows: - Make 'callIsSmall' actually accept a callsite so it can handle intrinsics, and simplify callers appropriately. - Nuke a completely bogus declaration of 'callIsSmall' that was still lurking in InlineCost.h... No idea how this got missed. - Teach the 'isInstructionFree' about the various more intelligent 'free' heuristics that got added to the inline cost analysis during review and testing. This mostly surrounds int->ptr and ptr->int casts. - Switch most of the interesting parts of the inline cost analysis that were essentially computing 'is this instruction free?' to use the code metrics routine instead. This way we won't keep duplicating logic. All of this is motivated by the desire to allow other passes to compute a roughly equivalent 'cost' metric for a particular basic block as the inline cost analysis. Sadly, re-using the same analysis for both is really messy because only the actual inline cost analysis is ever going to go to the contortions required for simplification, SROA analysis, etc. llvm-svn: 156140	2012-05-04 00:58:03 +00:00
Jakob Stoklund Olesen	2f460ae3b4	Use a shared implementation of getMatchingSuperRegClass(). TargetRegisterClass now gives access to the necessary tables. llvm-svn: 156122	2012-05-03 22:49:04 +00:00
Kevin Enderby	914223010c	Fix issues with the ARM bl and blx thumb instructions and the J1 and J2 bits for the assembler and disassembler, which were not being set/read correctly for offsets greater than 22 bits in some cases. Changes to lib/Target/ARM/ARMAsmBackend.cpp from Gideon Myles! llvm-svn: 156118	2012-05-03 22:41:56 +00:00
Chandler Carruth	a46e62424b	Factor the logic for testing whether a basic block is viable for code extraction into a public interface. Also clean it up and apply it more consistently such that we check for landing pads anywhere in the extracted code, not just in single-block extraction. This will be used to guide decisions in passes that are planning to eventually perform a round of code extraction. llvm-svn: 156114	2012-05-03 22:26:53 +00:00
Nuno Lopes	d4cf35d775	remove calls to calloc if the allocated memory is not used (it was already being done for malloc) fix a few typos found by Chad in my previous commit llvm-svn: 156110	2012-05-03 22:08:19 +00:00
Sirish Pande	f8e5e3c072	Support for target dependent Hexagon VLIW packetizer. This patch creates and optimizes packets as per Hexagon ISA rules. llvm-svn: 156109	2012-05-03 21:52:53 +00:00
Nuno Lopes	d2b71e7fa9	add support for calloc to objectsize lowering llvm-svn: 156102	2012-05-03 21:19:58 +00:00
Silviu Baranga	9560af848c	Fixed disassembler for vstm/vldm ARM VFP instructions. llvm-svn: 156077	2012-05-03 16:38:40 +00:00
Sirish Pande	c92c31674e	Extensions of Hexagon V4 instructions. This adds new instructions for Hexagon V4 architecture. llvm-svn: 156071	2012-05-03 16:18:50 +00:00
Nuno Lopes	22f6f3b055	replace 'break's with 'return 0' in visitCallInst code for objectsize, since there is no need to fallback to visitCallSite. This gives a 0.9% in a test case llvm-svn: 156069	2012-05-03 16:06:07 +00:00
Craig Topper	242183834a	Use 'unsigned' instead of 'int' in a few places dealing with counts of vector elements. llvm-svn: 156060	2012-05-03 07:26:59 +00:00
Craig Topper	315a5cc789	Fix 256-bit vpshuflw and vpshufhw immediate encoding to handle undefs in the lower half correctly. Missed in r155982. llvm-svn: 156059	2012-05-03 07:12:59 +00:00
Evan Cheng	b64e7b778b	Fix two-address pass's aggressive instruction commuting heuristics. It's meant to catch cases like: %reg1024<def> = MOV r1 %reg1025<def> = MOV r0 %reg1026<def> = ADD %reg1024, %reg1025 r0 = MOV %reg1026 By commuting ADD, it lets the coalescer eliminate all of the copies. However, there was a bug in the heuristics where it ended up commuting the ADD in: %reg1024<def> = MOV r0 %reg1025<def> = MOV 0 %reg1026<def> = ADD %reg1024, %reg1025 r0 = MOV %reg1026 That provided no benefit and instead ensured the last MOV would not be coalesced. rdar://11355268 llvm-svn: 156048	2012-05-03 01:45:13 +00:00
Andrew Trick	32aea358e1	Added TargetRegisterInfo::getAllocatableClass. This ensures that virtual registers always belong to an allocatable class. If your target attempts to create a vreg for an operand that has no allocatable register subclass, you will crash quickly. This ensures that targets define register classes as intended. llvm-svn: 156046	2012-05-03 01:14:37 +00:00
Bill Wendling	c94d86c4ad	Whitespace cleanup. llvm-svn: 156034	2012-05-02 23:43:23 +00:00
Owen Anderson	41b0665b5b	Teach DAGCombine the same multiply-by-1.0 folding trick when doing FMAs, just like it now knows for FMULs. llvm-svn: 156029	2012-05-02 22:17:40 +00:00
Preston Gurd	926afd7401	For Intel Atom, use ILP scheduling always, instead of ILP for 64 bit and Hybrid for 32 bit, since benchmarks show ILP scheduling is better most of the time. llvm-svn: 156028	2012-05-02 22:02:02 +00:00
Preston Gurd	c0b976c42a	Change the Intel Atom detection code to recognize Lincroft and Medfield. llvm-svn: 156025	2012-05-02 21:38:46 +00:00
Owen Anderson	b5f167c660	Teach DAG combine that multiplication by 1.0 can always be constant folded. llvm-svn: 156023	2012-05-02 21:32:35 +00:00
Jim Grosbach	28b0b7279e	ARM: Add missing two-operand VBIC aliases. llvm-svn: 156019	2012-05-02 21:11:56 +00:00
Douglas Gregor	12c1cd33f4	Move llvm-tblgen's StringMatcher into the TableGen library so it can be used by clang-tblgen. llvm-svn: 156000	2012-05-02 17:32:48 +00:00
Preston Gurd	fa3f6cb830	This patch continues the work of adding instruction latencies for X86 Atom, by providing the latencies for the instructions in X86InstrFPStack.td. llvm-svn: 155996	2012-05-02 16:03:35 +00:00
Manman Ren	f02efc8731	Revert r155853 The commit is intended to fix rdar://10961709. But it is the root cause of PR12720. Revert it for now. llvm-svn: 155992	2012-05-02 15:24:32 +00:00
Kostya Serebryany	ae7188d9b9	[tsan] typo and style (thanks to Nick Lewycky) llvm-svn: 155986	2012-05-02 13:12:19 +00:00
Bill Wendling	274ba89d77	The value held in the vector may be RAUW'ed by some of the canonicalization methods. Use a weak value handle to keep up with this. PR12245 llvm-svn: 155984	2012-05-02 09:59:45 +00:00
Richard Barton	0fc56890ba	Disallow YIELD and other allocated nop hints in pre-ARMv6 architectures. llvm-svn: 155983	2012-05-02 09:43:18 +00:00
Craig Topper	c73bc39c22	Add support for selecting AVX2 vpshuflw and vpshufhw. Add decoding support for AsmPrinter. llvm-svn: 155982	2012-05-02 08:03:44 +00:00
Eli Friedman	4a80e94b86	Fix the implementation of MachOObjectFile::isSectionZeroInit so it follows the MachO spec. llvm-svn: 155976	2012-05-02 02:31:28 +00:00
Jim Grosbach	edcb868fe3	Tidy up. Naming conventions. llvm-svn: 155960	2012-05-01 23:21:41 +00:00
Jakub Staszak	6126401c83	Remove unneeded break. llvm-svn: 155959	2012-05-01 23:08:16 +00:00
Jakub Staszak	cd2353402d	Use dyn_cast instead of checking opcode and cast. llvm-svn: 155957	2012-05-01 23:06:00 +00:00
Jakub Staszak	339380286b	Remove trailing spaces. llvm-svn: 155956	2012-05-01 23:04:38 +00:00
Bill Wendling	b6b50c6638	Strip the pointer casts off of allocas so that the selection DAG can find them. PR10799 llvm-svn: 155954	2012-05-01 22:50:45 +00:00
Sirish Pande	94212168fc	Target independent Hexagon Packetizer fix. llvm-svn: 155947	2012-05-01 21:28:30 +00:00
Jim Grosbach	1d20efb837	ARM: Add a few missing add->sub aliases w/ 'w' suffix. Aliases for adding a negative immediate when using an explicit 'w' suffix. E.g., adds.w r2, #-16 adds.w r2, r2, #-16 addw r2, #-16 addw r2, #-16 addw r2, r2, #-16 rdar://11330769 llvm-svn: 155946	2012-05-01 21:17:34 +00:00
Jim Grosbach	70bed4faaf	ARM: allow vanilla expressions for movw/movt. Expressions for movw/movt don't always have an :upper16: or :lower16: on them and that's ok. When they don't, it's just a plain [0-65536] immediate result, effectively the same as a :lower16: variant kind. rdar://10550147 llvm-svn: 155941	2012-05-01 20:43:21 +00:00
Preston Gurd	5ae5278ca1	This patch marks the X86 floating point stack registers ST0-ST7 as reserved in order to avoid assertion failures in the register scavenger. The assertion failures were “Bad machine code: Using an undefined physical register” and “Bad machine code: MBB exits via unconditional fall-through but its successor differs from its CFG successor!”. llvm-svn: 155930	2012-05-01 19:50:22 +00:00
Jim Grosbach	758e0cc94a	MC: Unknown assembler directives are now hard errors. Previously, an unsupported/unknown assembler directive issued a warning. That's generally unsafe, and inconsistent with the behaviour of pretty much every system assembler. Now that the MC assemblers are mature enough to be the default on multiple targets, it's reasonable to issue errors for these. For target or platform directives that need to stay warnings, we should add explicit handlers for them in, e.g., ELFAsmParser.cpp, DarwinAsmParser.cpp, et al., and issue the warning there. rdar://9246275 llvm-svn: 155926	2012-05-01 18:38:27 +00:00
Jim Grosbach	a0c53f147a	MC: Remove errant EatToEndOfStatement() in asm parser. The caller is already responsible for eating any additional input on the line. Putting an additional EatToEndOfStatement() in ParseStatement() causes an entire extra statement to be consumed when treating warnings as errors. For example, test/MC/macros.s will assert() because the .endmacro directive is missed as a result. rdar://11355843 llvm-svn: 155925	2012-05-01 18:38:24 +00:00
Manman Ren	425a55c1ce	X86: optimization for max-like struct This patch will optimize the following cases on X86 (a > b) ? (a-b) : 0 (a >= b) ? (a-b) : 0 (b < a) ? (a-b) : 0 (b <= a) ? (a-b) : 0 FROM movl %edi, %ecx subl %esi, %ecx cmpl %edi, %esi movl $0, %eax cmovll %ecx, %eax TO xorl %eax, %eax subl %esi, %edi cmovll %eax, %edi movl %edi, %eax rdar: 10734411 llvm-svn: 155919	2012-05-01 17:16:15 +00:00
Alexey Samsonov	c4b3ad8195	X86: Use StackRegister instead of FrameRegister in getFrameIndexReference (to generate debug info for local variables) if stack needs realignment llvm-svn: 155917	2012-05-01 15:16:06 +00:00
Benjamin Kramer	cb3e98cf44	Move MipsDisassembler classes into an anonymous namespace. llvm-svn: 155915	2012-05-01 14:34:24 +00:00
Benjamin Kramer	512c1dce8f	Value-initialize global to avoid global construction. llvm-svn: 155909	2012-05-01 10:48:02 +00:00
Eli Bendersky	667b879e73	RuntimeDyld cleanup: - Improved parameter names for clarity - Added comments - emitCommonSymbols should return void because its return value is not being used anywhere - Attempt to reduce the usage of the RelocationValueRef type. Restricts it for a single goal and may serve as a step for eventual removal. llvm-svn: 155908	2012-05-01 10:41:12 +00:00
Benjamin Kramer	84b857e4e6	YAMLParser: get rid of global ctors & dtors. llvm-svn: 155907	2012-05-01 10:19:59 +00:00
Bill Wendling	b12f16e75f	Change the PassManager from a reference to a pointer. The TargetPassManager's default constructor wants to initialize the PassManager to 'null'. But it's illegal to bind a null reference to a null l-value. Make the ivar a pointer instead. PR12468 llvm-svn: 155902	2012-05-01 08:27:43 +00:00
Craig Topper	05eb6e096a	Allow BMI, AES, F16C, POPCNT, FMA3, and CLMUL to be detected on AMD processors. llvm-svn: 155899	2012-05-01 07:10:32 +00:00
Eli Bendersky	fc079081b7	RuntimeDyld code cleanup: - There's no point having a different type for the local and global symbol tables. - Renamed SymbolTable to GlobalSymbolTable to clarify the intention - Improved const correctness where relevant llvm-svn: 155898	2012-05-01 06:58:59 +00:00
Craig Topper	bae0e9ea1d	Make XOP and FMA4 require SSE4A to match GCC behavior. Use this to simplify Bulldozer feature list. llvm-svn: 155897	2012-05-01 06:54:48 +00:00
Craig Topper	d32ebcc36b	Attempt to handle MRMInitReg in emitVEXOpcodePrefix. Hopefully fixes PR12711. llvm-svn: 155896	2012-05-01 06:34:01 +00:00
Craig Topper	43518cc55f	Make XOP imply AVX as it's needed to legalize the register types. llvm-svn: 155891	2012-05-01 05:41:41 +00:00
Craig Topper	c0cef32b83	Remove HasSSE2 from AES and CLMUL predicates. It's now implied by the HasAES and HasCLMUL predicates. llvm-svn: 155890	2012-05-01 05:35:02 +00:00
Craig Topper	29dd148a71	Make CLMUL and AES imply SSE2 since it's needed to legalize the type. llvm-svn: 155888	2012-05-01 05:28:32 +00:00
Craig Topper	0eacda5f69	Enable AVX and FMA4 for AMD Bulldozer processors. llvm-svn: 155885	2012-05-01 05:18:13 +00:00
Nick Lewycky	78ee67e814	An instruction in a loop is not guaranteed to be executed just because the loop has no exit blocks. Fixes PR12706! llvm-svn: 155884	2012-05-01 04:03:01 +00:00
Lang Hames	3a90fabd85	Add support for llvm.arm.neon.vmull* intrinsics to InstCombine. Fixes <rdar://problem/11291436>. This is a second attempt at a fix for this, the first was r155468. Thanks to Chandler, Bob and others for the feedback that helped me improve this. llvm-svn: 155866	2012-05-01 00:20:38 +00:00
Jakub Staszak	cec09b2594	Add some constantness. No functionality change. llvm-svn: 155859	2012-04-30 23:41:30 +00:00
Manman Ren	4f4d5c8fc8	X86: optimization for -(x != 0) This patch will optimize -(x != 0) on X86 FROM cmpl $0x01,%edi sbbl %eax,%eax notl %eax TO negl %edi sbbl %eax,%eax llvm-svn: 155853	2012-04-30 22:51:25 +00:00
Jim Grosbach	e78031a9f3	ARM: Diagnostics for out of range fixups. Replace some assert() calls w/ actual diagnostics. In a perfect world, there'd be range checks on these values long before things ever reached this code. For now, though, issuing a better-late-than-never diagnostic is still a big improvement over assert(). rdar://11347287 llvm-svn: 155851	2012-04-30 22:30:43 +00:00

54483 Commits