llvm-project

Commit Graph

Author	SHA1	Message	Date
Akira Hatanaka	bff84e1914	Add support for local dynamic TLS model in LowerGlobalTLSAddress. Direct object emission is not supported yet, but a patch that adds the support should follow soon. llvm-svn: 146572	2011-12-14 18:26:41 +00:00
Jim Grosbach	a342667fd0	ARM/Thumb2 'cmp rn, #imm' alias to cmn. When 'cmp rn #imm' doesn't match due to the immediate not being representable, but 'cmn rn, #-imm' does match, use the latter in place of the former, as it's equivalent. rdar://10552389 llvm-svn: 146567	2011-12-14 17:30:24 +00:00
Jim Grosbach	ab5830e51b	ARM assembler support for the target-specific .req directive. rdar://10549683 llvm-svn: 146543	2011-12-14 02:16:11 +00:00
Evan Cheng	7fae11b231	- Add MachineInstrBundle.h and MachineInstrBundle.cpp. This includes a function to finalize MI bundles (i.e. add BUNDLE instruction and computing register def and use lists of the BUNDLE instruction) and a pass to unpack bundles. - Teach more of MachineBasic and MachineInstr methods to be bundle aware. - Switch Thumb2 IT block to MI bundles and delete the hazard recognizer hack to prevent IT blocks from being broken apart. llvm-svn: 146542	2011-12-14 02:11:42 +00:00
Chad Rosier	4020ae75ea	Add newline at EOF. llvm-svn: 146538	2011-12-14 01:34:39 +00:00
Jim Grosbach	485e5622f4	Thumb2 assembler aliases for "mov(shifted register)" rdar://10549767 llvm-svn: 146520	2011-12-13 22:45:11 +00:00
Jim Grosbach	18bf363078	ARM LDM/STM system instruction variants. rdar://10550269 llvm-svn: 146519	2011-12-13 21:48:29 +00:00
Jim Grosbach	dce106940e	Test for 146516 llvm-svn: 146517	2011-12-13 21:06:59 +00:00
Jim Grosbach	1f1a3598c2	ARM thumb2 parsing of "rsb rd, rn, #0". rdar://10549741 llvm-svn: 146515	2011-12-13 20:50:38 +00:00
Jim Grosbach	4b0844e191	ARM NEON two-operand aliases for VQDMULH. llvm-svn: 146514	2011-12-13 20:40:37 +00:00
Jim Grosbach	561e4e18cf	ARM pre-UAL NEG mnemonic for convenience when porting old code. llvm-svn: 146511	2011-12-13 20:23:22 +00:00
Chad Rosier	563de603f7	[fast-isel] Unaligned loads of floats are not supported. Therefore, convert to a regular load and then move the result from a GPR to a FPR. llvm-svn: 146502	2011-12-13 19:22:14 +00:00
Akira Hatanaka	7200123fa3	Add test/MC/Mips/dg.exp. llvm-svn: 146472	2011-12-13 04:12:49 +00:00
Akira Hatanaka	341850fdc6	Move direct object emitter test to directory test/MC/Mips. Rename it to elf-relsym.ll. llvm-svn: 146470	2011-12-13 03:50:34 +00:00
Akira Hatanaka	e41963ce47	Relocation against a symbol, instead of against section. We had some extreme test cases where there were a lot of relocations applied relative to a large rodata section. Gas would create a symbol for each of these whereas we would be relative to the beginning of the rodata section. This change mimics what gas does. Patch by Jack Carter. llvm-svn: 146468	2011-12-13 02:27:40 +00:00
Nick Lewycky	86ffb03c79	Don't rely on a particular version string for llvm. llvm-svn: 146456	2011-12-13 00:34:14 +00:00
Tony Linthicum	525ca5fc69	Temporarily disable Hexagon tests. They are failing on OS X llvm-svn: 146455	2011-12-13 00:33:45 +00:00
Akira Hatanaka	9e5908ae3a	Test case for r146432 by Jack Carter. llvm-svn: 146433	2011-12-12 22:41:39 +00:00
Bob Wilson	fadc2c83e5	Implement 'e' and 'f' modifiers for Neon inline asm. <rdar://problem/10551006> These modifiers simply select either the low or high D subregister of a Neon Q register. I've also removed the unimplemented 'p' modifier, which turns out to be a bit different than the comment here suggests and as far as I can tell was only intended for internal use in Apple's version of gcc. llvm-svn: 146417	2011-12-12 21:45:15 +00:00
Tony Linthicum	1213a7a57f	Hexagon backend support llvm-svn: 146412	2011-12-12 21:14:40 +00:00
Joerg Sonnenberger	45c4164166	Only replace fwrite with fputc, if the return value is unused. llvm-svn: 146411	2011-12-12 20:18:31 +00:00
Jan Sjödin	7c0face455	XOP instructions and encoding tests. llvm-svn: 146407	2011-12-12 19:37:49 +00:00
Roman Divacky	735cb8bcdc	Add support for gnu_indirect_function. llvm-svn: 146377	2011-12-12 17:34:04 +00:00
Chandler Carruth	6b0e34c445	Manually upgrade the test suite to specify the flag to cttz and ctlz. I followed three heuristics for deciding whether to set 'true' or 'false': - Everything target independent got 'true' as that is the expected common output of the GCC builtins. - If the target arch only has one way of implementing this operation, set the flag in the way that exercises the most of codegen. For most architectures this is also the likely path from a GCC builtin, with 'true' being set. It will (eventually) require lowering away that difference, and then lowering to the architecture's operation. - Otherwise, set the flag differently dependending on which target operation should be tested. Let me know if anyone has any issue with this pattern or would like specific tests of another form. This should allow the x86 codegen to just iteratively improve as I teach the backend how to differentiate between the two forms, and everything else should remain exactly the same. llvm-svn: 146370	2011-12-12 11:59:10 +00:00
Chandler Carruth	f13db84794	Add an explicit test of the auto-upgrade functionality for the new intrinsic syntax. Now that this is explicitly covered, I plan to upgrade the existing test suite to use an explicit immediate. Note that I plan to specify 'true' in most places rather than the auto-upgraded value as that is the far more common value to end up here as that is the value coming from GCC's builtins. The only place I'm likely to put a 'false' in is when testing x86 which actually has different instructions for the two variants. llvm-svn: 146369	2011-12-12 11:23:11 +00:00
Chandler Carruth	026cc37e48	Teach the verifier to reject all non-constant arguments to the second argument of the cttz and ctlz intrinsics. llvm-svn: 146360	2011-12-12 04:36:02 +00:00
Stepan Dyatkovskiy	4683740967	Fixed bug 9905: Failure in code selection for llvm intrinsics sqrt/exp (fix for FSQRT, FSIN, FCOS, FPOWI, FPOW, FLOG, FLOG2, FLOG10, FEXP, FEXP2). Third attempt: simplified checks in test for armv7-apple-darwin11. llvm-svn: 146341	2011-12-11 14:35:48 +00:00
Chandler Carruth	1d76d4196a	Don't assume things about the exact details of the LLVM version number, such as what VCS information is attached. llvm-svn: 146333	2011-12-10 21:40:31 +00:00
Chad Rosier	1c468af854	Revert associate SelectInsertValue test as well. llvm-svn: 146332	2011-12-10 21:34:28 +00:00
Chad Rosier	6641294e3b	Revert r146322 to appease buildbots. Original commit message: Fixed bug 9905: Failure in code selection for llvm intrinsics sqrt/exp (fix for FSQRT, FSIN, FCOS, FPOWI, FPOW, FLOG, FLOG2, FLOG10, FEXP, FEXP2). Second attempt. llvm-svn: 146328	2011-12-10 19:55:03 +00:00
Stepan Dyatkovskiy	df0b779e9f	Fixed bug 9905: Failure in code selection for llvm intrinsics sqrt/exp (fix for FSQRT, FSIN, FCOS, FPOWI, FPOW, FLOG, FLOG2, FLOG10, FEXP, FEXP2). Second attempt. llvm-svn: 146322	2011-12-10 08:42:24 +00:00
Hal Finkel	67a7f18faf	Make CR spill and restore use a reserved register. These operations cannot use the register scavenger because the scavenger can only scavenge one register and frame-index elimination may have already grabbed it. llvm-svn: 146318	2011-12-10 04:50:53 +00:00
Rafael Espindola	c7f355b8e1	Handle expressions of the form _GLOBAL_OFFSET_TABLE_-symbol the same way gas does. The _GLOBAL_OFFSET_TABLE_ is still magical in that we get a R_386_GOTPC, but it doesn't change the immediate in the same way as when the expression has no right hand side symbol. llvm-svn: 146311	2011-12-10 02:28:43 +00:00
Eli Friedman	4e36a934dc	Splats can contain undef's; make sure to handle them correctly. PR11526. llvm-svn: 146299	2011-12-09 23:54:42 +00:00
Jim Grosbach	6192b6570d	ARM assembly aliases for BIC<-->AND (immediate). When the immediate operand of an AND or BIC instruction isn't representable in the immediate field of the instruction, but the bitwise negation of the immediate is, assemble the instruction as the inverse operation instead with the inverted immediate as the operand. rdar://10550057 llvm-svn: 146283	2011-12-09 22:02:17 +00:00
Evan Cheng	1d54d2210a	Update test to something more sensible. llvm-svn: 146282	2011-12-09 21:54:10 +00:00
Jim Grosbach	d146a02c79	ARM assembly parsing and encoding for VLD2 with writeback. Refactor the instructions into fixed writeback and register-stride writeback variants to simplify the offset operand (no more optional register operand using reg0). This is a simpler representation and allows the assembly parser to more easily handle these instructions. Add tests for the instruction variants now supported. llvm-svn: 146278	2011-12-09 21:28:25 +00:00
Chad Rosier	dd998ff4df	[fast-isel] Add support for selecting insertvalue. rdar://10530851 llvm-svn: 146276	2011-12-09 20:09:54 +00:00
Rafael Espindola	7e0a793183	Handle reloc_signed_4byte in here. Not doing so was a regression from my previous commit. It is strange that we see it in 32 bits. We already have a fixme about it. llvm-svn: 146273	2011-12-09 19:57:29 +00:00
Kevin Enderby	e7739d484f	The second part of support for generating dwarf for assembly source files. This generates the dwarf Compile Unit DIE and a dwarf subprogram DIE for each non-temporary label. The next part will be to get the clang driver to enable this when assembling a .s file. rdar://9275556 llvm-svn: 146262	2011-12-09 18:09:40 +00:00
Benjamin Kramer	16bbfbec66	X86: Add patterns for the various rounding ops for SSE4.1 and AVX. llvm-svn: 146257	2011-12-09 15:44:03 +00:00
Andrew Trick	d04d152998	Add -unroll-runtime for unrolling loops with run-time trip counts. Patch by Brendon Cahoon! This extends the existing LoopUnroll and LoopUnrollPass. Brendon measured no regressions in the llvm test suite with -unroll-runtime enabled. This implementation works by using the existing loop unrolling code to unroll the loop by a power-of-two (default 8). It generates an if-then-else sequence of code prior to the loop to execute the extra iterations before entering the unrolled loop. llvm-svn: 146245	2011-12-09 06:19:40 +00:00
Evan Cheng	5895fa79d6	Forgot setting -march. llvm-svn: 146244	2011-12-09 06:15:00 +00:00
Rafael Espindola	0a7f336475	Handle the case of the magical _GLOBAL_OFFSET_TABLE_ showing up in a symbol difference. This matches gas behavior and fixes PR11513. We still don't handle _GLOBAL_OFFSET_TABLE_ in data sections. llvm-svn: 146238	2011-12-09 03:03:58 +00:00
Akira Hatanaka	8e16aac534	jalr should use t9 ($25) for indirect calls regardless of the relocation model specified. llvm-svn: 146229	2011-12-09 01:45:12 +00:00
Eli Friedman	053a724483	Fix a couple of logic bugs in TargetLowering::SimplifyDemandedBits. PR11514. llvm-svn: 146219	2011-12-09 01:16:26 +00:00
Nick Lewycky	fe970725cc	Fix infinite loop in DSE when deleting a free in a reachable loop that's also trivially infinite. llvm-svn: 146197	2011-12-08 22:36:35 +00:00
Evan Cheng	b96bca81e7	Add 256-bit variant vmovss and vmovsd patterns. rdar://10538417 llvm-svn: 146196	2011-12-08 22:30:45 +00:00
Jim Grosbach	db731be7b8	ARM 64-bit VEXT assembly uses a .64 suffix, not .32, amazingly enough. llvm-svn: 146194	2011-12-08 22:19:04 +00:00
Jim Grosbach	ba7d6ed05d	ARM VSHR implied destination operand form aliases. llvm-svn: 146192	2011-12-08 22:06:06 +00:00
Evan Cheng	2a217be25f	Add various missing AVX patterns which was causing crashes. Sadly, the generated code looks pretty bad compared to SSE. rdar://10538793 llvm-svn: 146191	2011-12-08 22:05:28 +00:00
Jim Grosbach	3a97d946d2	Tidy up a bit. llvm-svn: 146190	2011-12-08 22:04:40 +00:00
Jim Grosbach	ab9c8bb45b	ARM VSUB implied destination operand form aliases. llvm-svn: 146182	2011-12-08 20:56:26 +00:00
Jim Grosbach	27a33edfa0	Tidy up a bit. llvm-svn: 146181	2011-12-08 20:53:19 +00:00
Jim Grosbach	66c9ad7642	ARM VQADD implied destination operand form aliases. llvm-svn: 146179	2011-12-08 20:49:43 +00:00
Jim Grosbach	e9ee1092e1	ARM a few more VMUL implied destination operand form aliases. llvm-svn: 146177	2011-12-08 20:42:35 +00:00
Owen Anderson	0b9b9da6c8	Teach SelectionDAG to match more calls to libm functions onto existing SDNodes. Mark these nodes as illegal by default, unless the target declares otherwise. llvm-svn: 146171	2011-12-08 19:32:14 +00:00
Evan Cheng	3294538546	Add test for r146163. llvm-svn: 146167	2011-12-08 19:21:39 +00:00
Daniel Dunbar	c09e4593b2	Revert r146143, "Fix bug 9905: Failure in code selection for llvm intrinsics sqrt/exp (fix for FSQRT, FSIN, FCOS, FPOWI, FPOW, FLOG, FLOG2, FLOG10, FEXP, FEXP2).", it is failing tests. llvm-svn: 146157	2011-12-08 17:32:18 +00:00
NAKAMURA Takumi	0faa233439	test/CodeGen/X86/vec_compare-2.ll: Add explicit -mtriple=i686-linux. llvm-svn: 146152	2011-12-08 15:24:09 +00:00
Nadav Rotem	26edb291ac	Fix a bug in the integer-promotion of bitcast operations on vector types. We must not issue a bitcast operation for integer-promotion of vector types, because the location of the values in the vector may be different. llvm-svn: 146150	2011-12-08 13:10:01 +00:00
Stepan Dyatkovskiy	a4bcf27dae	Fix bug 9905: Failure in code selection for llvm intrinsics sqrt/exp (fix for FSQRT, FSIN, FCOS, FPOWI, FPOW, FLOG, FLOG2, FLOG10, FEXP, FEXP2). llvm-svn: 146143	2011-12-08 07:55:03 +00:00
Jim Grosbach	00326406d4	ARM NEON two-operand aliases for VSHL(immediate). llvm-svn: 146125	2011-12-08 01:30:04 +00:00
Jim Grosbach	f10a635eb4	ARM NEON two-operand aliases for VSHL(register). llvm-svn: 146123	2011-12-08 01:12:35 +00:00
Jim Grosbach	6600f520b0	ARM optional destination operand variants for VEXT instructions. llvm-svn: 146114	2011-12-08 00:43:47 +00:00
Jim Grosbach	5ff64c7141	Tidy up. llvm-svn: 146113	2011-12-08 00:41:54 +00:00
Jim Grosbach	3050625a50	ARM assembler aliases for "add Rd, #-imm" to "sub Rd, #imm". llvm-svn: 146111	2011-12-08 00:31:07 +00:00
Jim Grosbach	3b559ff3c5	ARM assembly, allow 'asl' as a synonym for 'lsl' in shifted-register operands. For 'gas' compatibility. llvm-svn: 146106	2011-12-07 23:40:58 +00:00
Akira Hatanaka	ae378af667	32 to 64-bit zext pattern. llvm-svn: 146096	2011-12-07 23:14:41 +00:00
Jim Grosbach	90d961250b	ARM two-operand aliases for VAND/VEOR/VORR instructions. llvm-svn: 146095	2011-12-07 23:08:12 +00:00
Jim Grosbach	3744a7febb	ARM two-operand aliases for VADDW instructions. llvm-svn: 146093	2011-12-07 23:01:10 +00:00
Jim Grosbach	552691556c	ARM two-operand aliases for VADD instructions. llvm-svn: 146091	2011-12-07 22:52:54 +00:00
Akira Hatanaka	b2e05cb6b1	64-bit WrapperPICPat patterns. llvm-svn: 146086	2011-12-07 22:11:43 +00:00
Akira Hatanaka	c5b5a8d8b1	Modify LowerFCOPYSIGN to handle Mips64. llvm-svn: 146080	2011-12-07 21:48:50 +00:00
Akira Hatanaka	4a04a56a36	Fix 64-bit immediate patterns. llvm-svn: 146059	2011-12-07 20:10:24 +00:00
Jim Grosbach	d6ae4ba002	Darwin assembler improved relocs when w/o subsections_via_symbols. When the file isn't being built with subsections-via-symbols, symbol differences involving non-local symbols can be resolved more aggressively. Needed for gas compatibility. llvm-svn: 146054	2011-12-07 19:46:59 +00:00
Jim Grosbach	18b0e5dca0	Thumb2 alias for long-form pop and friends. rdar://10542474 llvm-svn: 146046	2011-12-07 18:32:28 +00:00
Jim Grosbach	7f882399b8	ARM support the .arm and .thumb directives for assembly mode switching. llvm-svn: 146042	2011-12-07 18:04:19 +00:00
Jim Grosbach	721042fa3a	ARM NEON VCLT(register) is a pseudo aliasing VCGT(register). llvm-svn: 146039	2011-12-07 17:51:15 +00:00
Jim Grosbach	a4337ced68	Tidy up. Move MachO tests to MachO directory. llvm-svn: 146038	2011-12-07 17:50:28 +00:00
Eli Friedman	ed8b3e38ec	Support vector bitcasts in the AsmPrinter. PR11495. llvm-svn: 146001	2011-12-07 00:50:54 +00:00
Eli Friedman	0e58cba286	Fix an optimization involving EXTRACT_SUBVECTOR in DAGCombine so it behaves correctly. PR11494. llvm-svn: 145996	2011-12-07 00:11:56 +00:00
Hal Finkel	0fc34bc2d3	delaying restore-cr changed assigned registers in some tests llvm-svn: 145963	2011-12-06 20:55:46 +00:00
Hal Finkel	0702bc1b28	add a test case that uses RESTORE_CR llvm-svn: 145962	2011-12-06 20:55:41 +00:00
Justin Holewinski	04424665c3	PTX: Continue to fix up the register mess. llvm-svn: 145947	2011-12-06 17:39:48 +00:00
Craig Topper	6572e0f203	Fix a bunch of SSE/AVX patterns to use v2i64/v4i64 loads since all other integer vector loads are promoted to those. llvm-svn: 145927	2011-12-06 09:04:59 +00:00
NAKAMURA Takumi	51416d5f00	test/MC: Introduce MC/MachO/ARM, and relocate relax-thumb2-branches.s into it. FIXME: Restore more other arch-dependent MachO tests. (eg. r126401 and r133856) llvm-svn: 145925	2011-12-06 06:48:26 +00:00
Jim Grosbach	e303e24d77	ARM mode 'mul' operand ordering tweak. Same as r145922, just for ARM mode. llvm-svn: 145923	2011-12-06 05:28:00 +00:00
Jim Grosbach	5f143be8c5	Thumb2: MUL two-operand form encoding operand order fix. Fix the alias to encode 'mul r5, r6' as if it were 'mul r5, r6, r5' so we match gas. rdar://10532439 llvm-svn: 145922	2011-12-06 05:03:45 +00:00
Craig Topper	bf41eb3a98	Merge isSHUFPMask and isCommutedSHUFPMask into single function that can do both. Do the same for the 256-bit version. Use loops to reduce size of isVSHUFPYMask. Fix test cases that were incorrectly passing due to isCommutedSHUFPMask not checking for the vector being 128-bit. This caused some 256-bit shuffles to be incorrectly commuted. llvm-svn: 145921	2011-12-06 04:59:07 +00:00
Jim Grosbach	175c7d0da5	Thumb2 encoding choice correction for PLD. Using encoding T1 for offset of #0 and encoding T2 for #-0. rdar://10532413 llvm-svn: 145919	2011-12-06 04:49:29 +00:00
NAKAMURA Takumi	5bdc0fbabd	test/MC: Move relax-thumb2-branches.s from MC/MachO/ to MC/ARM. MC/MachO assumes x86. llvm-svn: 145916	2011-12-06 03:56:05 +00:00
Andrew Trick	5df9096584	LSR: prune undesirable formulae early. It's always good to prune early, but formulae that are unsatisfactory in their own right need to be removed before running any other pruning heuristics. We easily avoid generating such formulae, but we need them as an intermediate basis for forming other good formulae. llvm-svn: 145906	2011-12-06 03:13:31 +00:00
Chad Rosier	c77830d21e	[arm-fast-isel] Doublewords only require word-alignment. rdar://10528060 llvm-svn: 145891	2011-12-06 01:44:17 +00:00
Jakob Stoklund Olesen	2e05db2fa0	Align ARM constant pool islands via their basic block. Previously, all ARM::CONSTPOOL_ENTRY instructions had a hardwired alignment of 4 bytes emitted by ARMAsmPrinter. Now the same alignment is set on the basic block. This is in preparation of supporting ARM constant pool islands with different alignments. llvm-svn: 145890	2011-12-06 01:43:02 +00:00
Jim Grosbach	9105085b4a	Fix ARM handling of tBcc branch relaxation. rdar://10069056 llvm-svn: 145885	2011-12-06 01:08:19 +00:00
Chad Rosier	8abf65a130	Probably not a good idea to convert a single vector load into a memcpy. We don't do this now, but add a test case to prevent this from happening in the future. Additional test for rdar://9892684 llvm-svn: 145879	2011-12-06 00:19:08 +00:00
Chad Rosier	19446a07a7	Make the MemCpyOptimizer a bit more aggressive. I can't think of a scenerio where this would be bad as the backend shouldn't have a problem inlining small memcpys. rdar://10510150 llvm-svn: 145865	2011-12-05 22:37:00 +00:00
Jim Grosbach	b8c719ccc6	Tweak ADDrr fix. Bad check for explicit .w llvm-svn: 145863	2011-12-05 22:27:04 +00:00
Jim Grosbach	8b5e92577b	Update tests for r145860. Add a few new ones. llvm-svn: 145861	2011-12-05 22:21:28 +00:00
Akira Hatanaka	20cee2eba1	Add definitions of 64-bit extract and insert instrucions and make PerformANDCombine and PerformOrCombine aware of them. Test cases are included too. llvm-svn: 145853	2011-12-05 21:26:34 +00:00
Jim Grosbach	ec9ba98299	Thumb2 prefer encoding T3 to T4 for ADD/SUB immediate instructions. rdar://10529348 llvm-svn: 145851	2011-12-05 21:06:26 +00:00
Akira Hatanaka	34e3df76f9	Have LowerJumpTable support Mips64. Modify 2010-07-20-Switch.ll to test N64 and O32 with relocation-model=pic too. llvm-svn: 145850	2011-12-05 21:03:03 +00:00
Jim Grosbach	fdf9e1587a	ARM assembly parsing for the rest of the VMUL data type aliases. Finish up rdar://10522016. llvm-svn: 145846	2011-12-05 20:29:59 +00:00
Hal Finkel	97a6028b3a	Add test case - this input used to crash because of duplicate generation of SPILL_CRs llvm-svn: 145820	2011-12-05 17:55:22 +00:00
Hal Finkel	8f6834dfa5	enable PPC register scavenging by default (update tests and remove some FIXMEs) llvm-svn: 145819	2011-12-05 17:55:17 +00:00
Hal Finkel	e18c72689c	remove wasted space for extra bit copies of CR2 subregs llvm-svn: 145817	2011-12-05 17:55:06 +00:00
NAKAMURA Takumi	e6efe405de	test/CodeGen/X86/pointer-vector.ll: Add explicit -mtriple=i686-linux. llvm-svn: 145805	2011-12-05 07:54:57 +00:00
Nadav Rotem	3924cb0267	Add support for vectors of pointers. llvm-svn: 145801	2011-12-05 06:29:09 +00:00
Anton Korobeynikov	965e0c6de2	Emit the ctors in the proper order on ARM/EABI. Maybe some targets should use this as well. Patch by Evgeniy Stepanov! llvm-svn: 145781	2011-12-03 23:49:37 +00:00
Venkatraman Govindaraju	6dae604f50	Sparc CodeGen: Fix AnalyzeBranch for PR 10282. Removing addSuccessor() since AnalyzeBranch doesn't change the successor, just the order. llvm-svn: 145779	2011-12-03 21:24:48 +00:00
Sanjoy Das	006e43bcc0	Check for stack space more intelligently. libgcc sets the stack limit field in TCB to 256 bytes above the actual allocated stack limit. This means if the function's stack frame needs less than 256 bytes, we can just compare the stack pointer with the stack limit. This should result in lesser calls to __morestack. llvm-svn: 145766	2011-12-03 09:32:07 +00:00
Sanjoy Das	165ca1d4ba	Fix a bug in the x86-32 code generated for segmented stacks. Currently LLVM pads the call to __morestack with a add and sub of 8 bytes to esp. This isn't correct since __morestack expects the call to be followed directly by a ret. This commit also adjusts the relevant test-case. llvm-svn: 145765	2011-12-03 09:21:07 +00:00
Chad Rosier	ec3b77e00d	[arm-fast-isel] Unaligned stores of floats require special care. rdar://10510150 llvm-svn: 145742	2011-12-03 02:21:57 +00:00
Pete Cooper	e03fe83d98	Fixed deadstoreelimination bug where negative indices were incorrectly causing the optimisation to occur Turns out long long + unsigned long long is unsigned. Doh! Fixes http://llvm.org/bugs/show_bug.cgi?id=11455 llvm-svn: 145731	2011-12-03 00:04:30 +00:00
Chad Rosier	0155a63513	Add support for constant folding the pow intrinsic. rdar://10514247 llvm-svn: 145730	2011-12-03 00:00:03 +00:00
Akira Hatanaka	430f917fbe	Test cases for 64-bit multiplication and division. llvm-svn: 145717	2011-12-02 22:31:36 +00:00
Akira Hatanaka	bbc5555bee	Fix test cases to use FileCheck. llvm-svn: 145716	2011-12-02 22:28:09 +00:00
Jim Grosbach	7276397f41	ARM tests for VLD1 single lane w/ writeback. llvm-svn: 145713	2011-12-02 22:03:52 +00:00
Chad Rosier	9fd0e55e91	[arm-fast-isel] After promoting a function parameter be sure to update the argument value type. Otherwise, the sign/zero-extend has no effect on arguments passed via the stack (i.e., undefined high-order bits). rdar://10515467 llvm-svn: 145701	2011-12-02 20:25:18 +00:00
Hal Finkel	d87f7af1f3	specify cpu for test to fix failure on some darwin systems with a g4+ cpu llvm-svn: 145699	2011-12-02 19:38:17 +00:00
Jim Grosbach	e7dcbc8691	Clean up aliases for ARM VLD1 single-lane assembly parsing a bit. Add the 16-bit lane variants while I'm at it. llvm-svn: 145693	2011-12-02 18:52:30 +00:00
Craig Topper	abeb79eee3	Add instruction selection support for horizontal add/sub of 256-bit floating point vectors. Also add the test case for 256-bit integer vectors. llvm-svn: 145680	2011-12-02 07:16:01 +00:00
Hal Finkel	9286705955	adjust the instruction ordering in some PPC tests: changes due to postRA haz. rec. llvm-svn: 145678	2011-12-02 04:58:12 +00:00
Chad Rosier	3367123b12	Prevent library calls from being folded if -fno-builtin has been specified. rdar://10500969 llvm-svn: 145639	2011-12-01 22:14:50 +00:00
Pete Cooper	fdddc27143	Improved fix for abs(val) != 0 to check other similar case. Also fixed style issues and confusing comment llvm-svn: 145618	2011-12-01 19:13:26 +00:00
Eric Christopher	9da7f305a4	For 64-bit the rest of the general regs are ok for the q constraint. Make sure we can emit both the high and low versions of those registers. Fixes rdar://10392864 llvm-svn: 145579	2011-12-01 08:12:41 +00:00
Eli Friedman	d61887dd0a	Pass AVX vectors which are arguments to varargs functions on the stack. <rdar://problem/10463281>. llvm-svn: 145573	2011-12-01 04:49:21 +00:00
Pete Cooper	3b7f35bf08	Removed use of grep from test and moved it to be with other icmp tests llvm-svn: 145570	2011-12-01 04:35:26 +00:00
Pete Cooper	bc5c524b71	Added instcombine pattern to spot comparing -val or val against 0. (val != 0) == (-val != 0) so "abs(val) != 0" becomes "val != 0" Fixes <rdar://problem/10482509> llvm-svn: 145563	2011-12-01 03:58:40 +00:00
Jan Sjödin	9430e284a9	Support for encoding all FMA4 instructions and tablegen patterns for all remaining FMA4 instructions and intrinsics with tests. llvm-svn: 145525	2011-11-30 22:09:42 +00:00
Eli Friedman	6cff9df298	Make GlobalMerge honor the preferred alignment on globals without an explicitly specified alignment. <rdar://problem/10497732>. llvm-svn: 145523	2011-11-30 21:54:15 +00:00
Jim Grosbach	7d8517b1d4	Add some tests for all-lanes VLD1 parsing. llvm-svn: 145512	2011-11-30 19:37:38 +00:00
Nadav Rotem	0a1801015c	Add test arch to make it pass on non x86 targets llvm-svn: 145498	2011-11-30 17:34:28 +00:00
Nadav Rotem	66427bcce9	Add a tripple to the test llvm-svn: 145489	2011-11-30 11:20:56 +00:00
Nadav Rotem	96923cc2bb	X86: PerformOrCombine introduced a vselect node with a wrong order of operands. This bug was introduced when a dedicated blend sdnode was replaced with the vselect node (in 139479). llvm-svn: 145488	2011-11-30 10:13:37 +00:00
Andrew Trick	613c67e475	Better test case found in duplicate PR10570. llvm-svn: 145484	2011-11-30 06:26:42 +00:00
Andrew Trick	ceafa2c746	LSR: handle the expansion of phi operands that use postinc forms of the IV. Fixes PR11431: SCEVExpander::expandAddRecExprLiterally(const llvm::SCEVAddRecExpr*): Assertion `(!isa<Instruction>(Result) \|\| SE.DT->dominates(cast<Instruction>(Result), Builder.GetInsertPoint())) && "postinc expansion does not dominate use"' failed. llvm-svn: 145482	2011-11-30 06:07:54 +00:00
Chad Rosier	82e1bd8e94	Add support for sqrt, sqrtl, and sqrtf in TargetLibraryInfo. Disable (fptrunc (sqrt (fpext x))) -> (sqrtf x) transformation if -fno-builtin is specified. rdar://10466410 llvm-svn: 145460	2011-11-29 23:57:10 +00:00
Jakob Stoklund Olesen	f50d2eafdb	FileCheckize. llvm-svn: 145452	2011-11-29 23:09:16 +00:00
Akira Hatanaka	dc25f9f38a	Change names for MIPS "generic" processors defined in Mips.td to match what GNU tools use. Patch by Simon Atanasyan. "mips32r1" => "mips32" "4ke" => mips32r2" "mips64r1" => "mips64" llvm-svn: 145451	2011-11-29 23:08:41 +00:00
Jim Grosbach	5ee209ce3a	ARM assembly parsing and encoding for four-register VST1. llvm-svn: 145450	2011-11-29 22:58:48 +00:00
Evan Cheng	648e48d02e	Add another missing pattern. llvm-gcc likes f64 but clang likes i64 so it was generating poor code for some SSE builtins. llvm-svn: 145448	2011-11-29 22:48:34 +00:00
Jim Grosbach	2a9c43649a	Enable some VST1 tests and add a few more. llvm-svn: 145443	2011-11-29 22:40:32 +00:00
Jakob Stoklund Olesen	bde32d36bb	Make X86::FsFLD0SS / FsFLD0SD real pseudo-instructions. Like V_SET0, these instructions are expanded by ExpandPostRA to xorps / vxorps so they can participate in execution domain swizzling. This also makes the AVX variants redundant. llvm-svn: 145440	2011-11-29 22:27:25 +00:00
Chad Rosier	46addb9e07	If fast-isel fails, remove dead instructions generated during the failed attempt. llvm-svn: 145425	2011-11-29 19:40:47 +00:00
Duncan Sands	ca6f8ddbf8	Fix a theoretical problem (not seen in the wild): if different instances of a weak variable are compiled by different compilers, such as GCC and LLVM, while LLVM may increase the alignment to the preferred alignment there is no reason to think that GCC will use anything more than the ABI alignment. Since it is the GCC version that might end up in the final program (as the linkage is weak), it is wrong to increase the alignment of loads from the global up to the preferred alignment as the alignment might only be the ABI alignment. Increasing alignment up to the ABI alignment might be OK, but I'm not totally convinced that it is. It seems better to just leave the alignment of weak globals alone. llvm-svn: 145413	2011-11-29 18:26:38 +00:00
Michael J. Spencer	de3a2118db	MC/X86/COFF: Allow quotes in names when targeting MS/Windows, as MC is the only assembler we support. This splits MS/Windows and GNU/Windows ASM infos into two seperate classes. While there is currently only one difference, full MS C++ ABI support will require many more. llvm-svn: 145409	2011-11-29 18:00:06 +00:00
Danil Malyshev	cbe72fc959	Fixed ObjectFile functions: - getSymbolOffset() renamed as getSymbolFileOffset() - getSymbolFileOffset(), getSymbolAddress(), getRelocationAddress() returns same result for ELFObjectFile, MachOObjectFile and COFFObjectFile. - added getRelocationOffset() - fixed MachOObjectFile::getSymbolSize() - fixed MachOObjectFile::getSymbolSection() - fixed MachOObjectFile::getSymbolOffset() for symbols without section data. llvm-svn: 145408	2011-11-29 17:40:10 +00:00
Elena Demikhovsky	7a81dea516	Fixed vsqrt.ss intrinsic usage - order of input operands was wrong. Added a test. Thanks Bruno for reviewing the patch. llvm-svn: 145403	2011-11-29 15:00:45 +00:00
Craig Topper	1d63ae3731	Fix shuffle decoding for memory forms for (V)SHUFPS/D. llvm-svn: 145392	2011-11-29 07:58:09 +00:00
Craig Topper	c16db840be	Fix issues in shuffle decoding around VPERM* instructions. Fix shuffle decoding for VSHUFPS/D for 256-bit types. Add pattern matching for memory forms of VPERMILPS/VPERMILPD. llvm-svn: 145390	2011-11-29 07:49:05 +00:00
Craig Topper	12b72def4e	Fix VINSERTF128/VEXTRACTF128 to be marked as FP instructions. Allow execution dependency fix pass to convert them to their integer equivalents when AVX2 is enabled. llvm-svn: 145376	2011-11-29 05:37:58 +00:00
Craig Topper	897a7d4b9c	Correctly mark VPERM2F128 as being an FP instruction and add execution domain fixing support to convert it to VPERM2I128 for AVX2. llvm-svn: 145370	2011-11-29 03:57:34 +00:00
Andrew Trick	d25089f8e0	SCEV fix. In general, Add/Mul expressions should not inherit NSW/NUW. This reverts r139450, fixes r139453, and adds much needed comments and a unit test. llvm-svn: 145367	2011-11-29 02:16:38 +00:00
Andrew Trick	5ec136c57e	Filecheckize. llvm-svn: 145363	2011-11-29 02:05:23 +00:00
Andrew Trick	e756031a62	Reenable this IndVars unit test. SCEV can't optimize undef in all cases, which is a separate issue from this test case. llvm-svn: 145343	2011-11-29 00:52:04 +00:00
Eli Friedman	b3f9b0676a	Add a missing safety check to ProcessUGT_ADDCST_ADD. Fixes PR11438. llvm-svn: 145316	2011-11-28 23:32:19 +00:00
Eli Friedman	e7ab1a2f0f	Make SelectionDAG::InferPtrAlignment use llvm::ComputeMaskedBits instead of duplicating the logic for globals. Make llvm::ComputeMaskedBits handle GlobalVariables slightly more aggressively, to match what InferPtrAlignment knew how to do. llvm-svn: 145304	2011-11-28 22:48:22 +00:00
Evan Cheng	4a5b2040e2	Revert r145273 and fix in SelectionDAG::InferPtrAlignment() instead. Conservatively returns zero when the GV does not specify an alignment nor is it initialized. Previously it returns ABI alignment for type of the GV. However, if the type is a "packed" type, then the under-specified alignments is attached to the load / store instructions. In that case, the alignment of the type cannot be trusted. rdar://10464621 llvm-svn: 145300	2011-11-28 22:37:34 +00:00
Evan Cheng	a4b6404cf0	DAG combine should not increase alignment of loads / stores with alignment less than ABI alignment. These are loads / stores from / to "packed" data structures. Their alignments are intentionally under-specified. rdar://10301431 llvm-svn: 145273	2011-11-28 20:42:56 +00:00
Craig Topper	818a983e93	Add X86 instruction selection for VPERM2I128 when AVX2 is enabled. Merge VPERMILPS/VPERMILPD detection since they are pretty similar. llvm-svn: 145238	2011-11-28 10:14:51 +00:00
NAKAMURA Takumi	8284ec46b6	test/lit.cfg: Enable the feature 'asserts' to check output of llc -version. llc knows whether he is compiled with -DNDEBUG. \| Optimized build with assertions. llvm-svn: 145230	2011-11-28 05:09:15 +00:00
Chris Lattner	251d827d2c	remove a test that is using old-style llvm.dbg intrinsics, apparently only fails on ppc and arm hosts. llvm-svn: 145188	2011-11-27 18:13:47 +00:00
Chandler Carruth	03adbd46ca	Take two on rotating the block ordering of loops. My previous attempt was centered around the premise of laying out a loop in a chain, and then rotating that chain. This is good for preserving contiguous layout, but bad for actually making sane rotations. In order to keep it safe, I had to essentially make it impossible to rotate deeply nested loops. The information needed to correctly reason about a deeply nested loop is actually available -- before we layout the loop. We know the inner loops are already fused into chains, etc. We lose information the moment we actually lay out the loop. The solution was the other alternative for this algorithm I discussed with Benjamin and some others: rather than rotating the loop after-the-fact, try to pick a profitable starting block for the loop's layout, and then use our existing layout logic. I was worried about the complexity of this "pick" step, but it turns out such complexity is needed to handle all the important cases I keep teasing out of benchmarks. This is, I'm afraid, a bit of a work-in-progress. It is still misbehaving on some likely important cases I'm investigating in Olden. It also isn't really tested. I'm going to try to craft some interesting nested-loop test cases, but it's likely to be extremely time consuming and I don't want to go there until I'm sure I'm testing the correct behavior. Sadly I can't come up with a way of getting simple, fine grained test cases for this logic. We need complex loop structures to even trigger much of it. llvm-svn: 145183	2011-11-27 13:34:33 +00:00
Chandler Carruth	a054580993	Rework a bit of the implementation of loop block rotation to not rely so heavily on AnalyzeBranch. That routine doesn't behave as we want given that rotation occurs mid-way through re-ordering the function. Instead merely check that there are not unanalyzable branching constructs present, and then reason about the CFG via successor lists. This actually simplifies my mental model for all of this as well. The concrete result is that we now will rotate more loop chains. I've added a test case from Olden highlighting the effect. There is still a bit more to do here though in order to regain all of the performance in Olden. llvm-svn: 145179	2011-11-27 09:22:53 +00:00
Chris Lattner	ee471c484a	remove autoupgrade support for old forms of llvm.prefetch and the old trampoline forms. Both of these were correct in LLVM 3.0, and we don't need to support LLVM 2.9 and earlier in mainline. llvm-svn: 145174	2011-11-27 07:42:04 +00:00
Chris Lattner	6a144a2227	Upgrade syntax of tests using volatile instructions to use 'load volatile' instead of 'volatile load', which is archaic. llvm-svn: 145171	2011-11-27 06:54:59 +00:00
Chris Lattner	90ef78c07f	remove autoupgrade support for really old-style debug info intrinsics. I think this is the last of autoupgrade that can be removed in 3.1. Can the atomic upgrade stuff also go? llvm-svn: 145169	2011-11-27 06:18:33 +00:00
Chris Lattner	6aa6c0c3b7	remove some old autoupgrade logic llvm-svn: 145167	2011-11-27 06:10:54 +00:00
Chris Lattner	1c9e5678b8	remove support for reading llvm 2.9 .bc files. LLVM 3.1 is only compatible back to 3.0 llvm-svn: 145164	2011-11-27 05:48:27 +00:00
Wesley Peck	97b3da5433	Add several new instructions supported by the latest MicroBlaze. These instructions are not generated by the backend yet, this will come in a later commit. llvm-svn: 145161	2011-11-27 05:16:58 +00:00
Chandler Carruth	9ffb97e631	Introduce a loop block rotation optimization to the new block placement pass. This is designed to achieve one of the important optimizations that the old code placement pass did, but more simply. This is a somewhat rough and very conservative version of the transform. We could get a lot fancier here if there are profitable cases to do so. In particular, this only looks for a single pattern, it insists that the loop backedge being rotated away is the last backedge in the chain, and it doesn't provide any means of doing better in-loop placement due to the rotation. However, it appears that it will handle the important loops I am finding in the LLVM test suite. llvm-svn: 145158	2011-11-27 00:38:03 +00:00
Chandler Carruth	f156f0cf57	FileCheck-ize this test and make it more precise. This is in preparation for adding other tests. llvm-svn: 145143	2011-11-26 08:24:25 +00:00
Eli Friedman	a84ad7d0d0	Fix APFloat::convert so that it handles narrowing conversions correctly; it was returning incorrect values in rare cases, and incorrectly marking exact conversions as inexact in some more common cases. Fixes PR11406, and a missed optimization in test/CodeGen/X86/fp-stack-O0.ll. llvm-svn: 145141	2011-11-26 03:38:02 +00:00
Bruno Cardoso Lopes	0f9a1f5e6c	This patch contains support for encoding FMA4 instructions and tablegen patterns for scalar FMA4 operations and intrinsic. Also add tests for vfmaddsd. Patch by Jan Sjodin llvm-svn: 145133	2011-11-25 19:33:42 +00:00
Craig Topper	d65a444478	Remove 256-bit specific node types for UNPCKHPS/D and instead use the 128-bit versions and let the operand type disinquish. Also fix the load form of the v8i32 patterns for these to realize that the load would be promoted to v4i64. llvm-svn: 145126	2011-11-24 22:57:10 +00:00
Benjamin Kramer	651db37352	X86: alias cqo to cqto. llvm-svn: 145121	2011-11-24 12:02:46 +00:00
Chandler Carruth	7adee1a01a	Fix a silly use-after-free issue. A much earlier version of this code need lots of fanciness around retaining a reference to a Chain's slot in the BlockToChain map, but that's all gone now. We can just go directly to allocating the new chain (which will update the mapping for us) and using it. Somewhat gross mechanically generated test case replicates the issue Duncan spotted when actually testing this out. llvm-svn: 145120	2011-11-24 11:23:15 +00:00
Chandler Carruth	d394bafd2d	When adding blocks to the list of those which no longer have any CFG conflicts, we should only be adding the first block of the chain to the list, lest we try to merge into the middle of that chain. Most of the places we were doing this we already happened to be looking at the first block, but there is no reason to assume that, and in some cases it was clearly wrong. I've added a couple of tests here. One already worked, but I like having an explicit test for it. The other is reduced from a test case Duncan reduced for me and used to crash. Now it is handled correctly. llvm-svn: 145119	2011-11-24 08:46:04 +00:00
Richard Smith	4f9a8081c3	Correctly byte-swap APInts with bit-widths greater than 64. llvm-svn: 145111	2011-11-23 21:33:37 +00:00
Duncan Sands	81a2af12d6	Fix a crash in which a multiplication was being reported as being both negative and positive: positive, because it could be directly computed to be positive; negative, because the nsw flags means it is either negative or undefined (the multiplication always overflowed). llvm-svn: 145104	2011-11-23 16:26:47 +00:00
Benjamin Kramer	ebcb451874	X86: Use btq for bit tests if the immediate can't be encoded in 32 bits. Before: movabsq $4294967296, %rax ## encoding: [0x48,0xb8,0x00,0x00,0x00,0x00,0x01,0x00,0x00,0x00] testq %rax, %rdi ## encoding: [0x48,0x85,0xf8] jne LBB0_2 ## encoding: [0x75,A] After: btq $32, %rdi ## encoding: [0x48,0x0f,0xba,0xe7,0x20] jb LBB0_2 ## encoding: [0x72,A] btq is usually slower than testq because it doesn't fuse with the jump, but here we're better off saving one register and a giant movabsq. llvm-svn: 145103	2011-11-23 13:54:17 +00:00
NAKAMURA Takumi	0b3e996485	test/CodeGen/X86/block-placement.ll: Add explicit -mtriple=i686-linux. X86 Win32 CodeGen does not support EH yet. llvm-svn: 145101	2011-11-23 12:18:22 +00:00
Chandler Carruth	99fe42fbd9	Relax an invariant that block placement was trying to assert a bit further. This invariant just wasn't going to work in the face of unanalyzable branches; we need to be resillient to the phenomenon of chains poking into a loop and poking out of a loop. In fact, we already were, we just needed to not assert on it. This was found during a bootstrap with block placement turned on. llvm-svn: 145100	2011-11-23 10:35:36 +00:00
Elena Demikhovsky	779ba6d7b7	I added several lines in X86 code generator that allow to choose VSHUFPS/VSHUFPD instructions while lowering VECTOR_SHUFFLE node. I check a commuted VSHUFP mask. The patch was reviewed by Bruno. llvm-svn: 145099	2011-11-23 10:23:16 +00:00
Chandler Carruth	8c68f1f3c8	Handle the case of a no-return invoke correctly. It actually still has successors, they just are all landing pad successors. We handle this the same way as no successors. Comments attached for the next person to wade through here and another lovely test case courtesy of Benjamin Kramer's bugpoint reduction. llvm-svn: 145098	2011-11-23 08:23:54 +00:00
Bob Wilson	ebb44646c4	Enable stack protectors for all arrays, not just char arrays. rdar://5875909 Patch by Bill Wendling. llvm-svn: 145097	2011-11-23 07:13:56 +00:00
Jakob Stoklund Olesen	02845410f9	Fix PR11422. This was a bug in keeping track of the available domains when merging domain values. The wrong domain mask caused ExecutionDepsFix to try to move VANDPSYrr to the integer domain which is only available in AVX2. Also add an assertion to catch future attempts at emitting AVX2 instructions. llvm-svn: 145096	2011-11-23 04:03:08 +00:00
Chandler Carruth	4a87aa0c31	Fix a crash in block placement due to an inner loop that happened to be reversed in the function's original ordering, and we happened to encounter it while handling an outer unnatural CFG structure. Thanks to the test case reduced from GCC's source by Benjamin Kramer. This may also fix a crasher in gzip that Duncan reduced for me, but I haven't yet gotten to testing that one. llvm-svn: 145094	2011-11-23 03:03:21 +00:00
Kostya Serebryany	8b5c7a56a3	[asan] do not instrument threadlocal globals, this is buggy llvm-svn: 145092	2011-11-23 02:10:54 +00:00
Hal Finkel	6f0ae783fe	add basic PPC register-pressure feedback; adjust the vaarg test to match the new register-allocation pattern llvm-svn: 145065	2011-11-22 16:21:04 +00:00
Chandler Carruth	ee54feb6f6	Fix a devilish miscompile exposed by block placement. The updateTerminator code didn't correctly handle EH terminators in one very specific case. AnalyzeBranch would find no terminator instruction, and so the fallback in updateTerminator is to assume fallthrough. This is correct, but the destination of the fallthrough was assumed to be the first successor. This is almost always true, but in certain cases the loop transformations will cause the landing pad to be the first successor! Instead of this brittle logic, actually look through the successors for a non-landing-pad accessor, and to assert if more than one is found. This will hopefully fix some (if not all) of the self host miscompiles with block placement. Thanks to Benjamin Kramer for reporting, Nick Lewycky for an initial stab at a reduction, and Duncan for endless advice on EH (which I know nothing about) as well as reviewing the actual fix. llvm-svn: 145062	2011-11-22 13:13:16 +00:00
Rafael Espindola	c55e1af137	Add triple to the test. llvm-svn: 145057	2011-11-22 06:36:25 +00:00
Rafael Espindola	2021f38281	If a register is both an early clobber and part of a tied use, handle the use before the clobber so that we copy the value if needed. Fixes pr11415. llvm-svn: 145056	2011-11-22 06:27:18 +00:00
Nick Lewycky	063ae5897c	Fix crasher in GVN due to my recent capture tracking changes. llvm-svn: 145047	2011-11-21 19:42:56 +00:00
Craig Topper	6270d072c5	Lowering for v32i8 to VPUNPCKLBW/VPUNPCKHBW when AVX2 is enabled. llvm-svn: 145028	2011-11-21 08:26:50 +00:00
Craig Topper	d12d6f4b1c	Test case for r145026 llvm-svn: 145027	2011-11-21 06:58:09 +00:00
Craig Topper	a065238c6e	Make LowerSIGN_EXTEND_INREG split 256-bit vectors when AVX1 is enabled and use AVX2 shifts when AVX2 is enabled. llvm-svn: 145022	2011-11-21 01:12:36 +00:00
NAKAMURA Takumi	76dfa03874	test/CodeGen/X86/block-placement.ll: Relax expressions for Win32. llvm-svn: 145011	2011-11-20 12:49:45 +00:00

... 2 3 4 5 6 ...

15370 Commits