llvm-project

Commit Graph

Author	SHA1	Message	Date
Evan Cheng	093e124256	Fix a coaelescer bug. If a copy val# is extended to eliminate a non-trivially coalesced copy, and the copy kills its source register. Trim the source register's live range to the last use if possible. This fixes up kill marker to make the scavenger happy. llvm-svn: 77967	2009-08-03 08:41:59 +00:00
Anton Korobeynikov	71386e08fe	Unbreak Win64 CC. Step one: honour register save area, fix some alignment and provide a different set of call-clobberred registers. llvm-svn: 77962	2009-08-03 08:12:53 +00:00
Rafael Espindola	70e9816624	Use movd instead of movq llvm-svn: 77956	2009-08-03 05:21:05 +00:00
Daniel Dunbar	0f16ea5c30	Pass target triple string in to TargetMachine constructor. This is not just a matter of passing in the target triple from the module; currently backends are making decisions based on the build and host architecture. The goal is to migrate to making these decisions based off of the triple (in conjunction with the feature string). Thus most clients pass in the target triple, or the host triple if that is empty. This has one important change in the way behavior of the JIT and llc. For the JIT, it was previously selecting the Target based on the host (naturally), but it was setting the target machine features based on the triple from the module. Now it is setting the target machine features based on the triple of the host. For LLC, -march was previously only used to select the target, the target machine features were initialized from the module's triple (which may have been empty). Now the target triple is taken from the module, or the host's triple is used if that is empty. Then the triple is adjusted to match -march. The take away is that -march for llc is now used in conjunction with the host triple to initialize the subtarget. If users want more deterministic behavior from llc, they should use -mtriple, or set the triple in the input module. llvm-svn: 77946	2009-08-03 04:03:51 +00:00
Rafael Espindola	18ba271a79	Use movq to move 64 bits in and out of mmx registers. Fixes PR4669 llvm-svn: 77940	2009-08-03 02:45:34 +00:00
Evan Cheng	8b9deebba3	Use the i12 variant of load / store opcodes if offset is zero. Now we pass all of multisource as well. llvm-svn: 77939	2009-08-03 02:38:06 +00:00
Richard Osborne	bbb772ace9	Add extra SEXT pattern. llvm-svn: 77920	2009-08-02 22:45:24 +00:00
Jakob Stoklund Olesen	7dc3b72685	Remove unneeded intrinsics from Blackfin backend. __builtin_bfin_ones does the same as ctpop, so it can be implemented in the front-end. __builtin_bfin_loadbytes loads from an unaligned pointer with the disalignexcpt instruction. It does the same as loading from a pointer with the low bits masked. It is better if the front-end creates a masked load. We can always instruction select the masked to disalignexcpt+load. We keep csync/ssync/idle. These intrinsics represent instructions that need workarounds for some silicon revisions. We may even want to convert inline assembler to intrinsics to enable the workarounds. llvm-svn: 77917	2009-08-02 21:49:05 +00:00
Jakob Stoklund Olesen	185eb035e9	Fix issue in regscavenger when scavenging a callee-saved register that has not been spilled. llvm-svn: 77912	2009-08-02 20:29:41 +00:00
Jakob Stoklund Olesen	c59cd9bcd0	Never add a kill flag to a constrained physical register in a two-addr instruction. llvm-svn: 77906	2009-08-02 19:13:03 +00:00
Jakob Stoklund Olesen	5d52bfbbc9	Scavenger asserts. Allow imp-def and imp-use of anything in the scavenger asserts, just like the machine code verifier. Allow redefinition of a sub-register of a live register. llvm-svn: 77904	2009-08-02 18:28:41 +00:00
Jakob Stoklund Olesen	2a21149b20	Add some basic blackfin intrinsics. llvm-svn: 77903	2009-08-02 18:28:11 +00:00
Jakob Stoklund Olesen	b052972a58	Inline assembly support for Blackfin. We use the same constraints as GCC, including those that are slightly insane for inline assembler. llvm-svn: 77899	2009-08-02 17:39:17 +00:00
Jakob Stoklund Olesen	552d8d6618	Analog Devices Blackfin back-end. Generate code for the Blackfin family of DSPs from Analog Devices: http://www.analog.com/en/embedded-processing-dsp/blackfin/processors/index.html We aim to be compatible with the exsisting GNU toolchain found at: http://blackfin.uclinux.org/gf/project/toolchain The back-end is experimental. llvm-svn: 77897	2009-08-02 17:32:10 +00:00
Evan Cheng	8e3889f12e	Test both darwin and linux. llvm-svn: 77852	2009-08-02 02:54:34 +00:00
Chris Lattner	c4d6f83f20	switch to filecheck format llvm-svn: 77841	2009-08-02 00:32:26 +00:00
Chris Lattner	b4b1012d29	fix a problem Eli noticed where we would compile the attached ptrtoint to: .quad X even on a 32-bit system, where X is not 64-bits. There isn't much that we can do here, so we just print: .quad ((X) & 4294967295) instead. llvm-svn: 77818	2009-08-01 22:25:12 +00:00
Dan Gohman	9023fd2b2a	Add nounwind to this test. llvm-svn: 77792	2009-08-01 19:11:04 +00:00
Eli Friedman	f165160724	Hack to make this test work on platforms which aren't Macs. Fixing this myself because I'm getting tired of seeing the red buildbots, which have been red since 5:30PM PDT last night. Proposed supplement to developer policy: committers should make sure to be around to watch for buildbot failures after committing. llvm-svn: 77785	2009-08-01 16:37:18 +00:00
Evan Cheng	e64f48ba8b	Workaround a couple of Darwin assembler bugs. llvm-svn: 77781	2009-08-01 06:13:52 +00:00
Evan Cheng	e6e8289d72	Split t2MOVCCs since some assemblers do not recognize mov shifted register alias with predicate. llvm-svn: 77764	2009-08-01 01:43:45 +00:00
Evan Cheng	6ab54fdb0a	Fix Thumb2 function call isel. Thumb1 and Thumb2 should share the same instructions for calls since BL and BLX are always 32-bit long and BX is always 16-bit long. Also, we should be using BLX to call external function stubs. llvm-svn: 77756	2009-08-01 00:16:10 +00:00
David Greene	81bcae5fda	Simplify operand padding by keying off tabs in the asm stream. If padding is disabled, tabs get replaced by spaces except in the case of the first operand, where the tab is output to line up the operands after the mnemonics. Add some better comments and eliminate redundant code. Fix some testcases to not assume tabs. llvm-svn: 77740	2009-07-31 21:57:10 +00:00
Chris Lattner	4d2c0f9008	switch off of 'Section' onto MCSection. We're not properly using MCSection subclasses yet, but this is a step in the right direction. llvm-svn: 77708	2009-07-31 18:48:30 +00:00
Evan Cheng	be8422e8e0	Until we have a "ALIGN" pseudo instruction, have asm printer emitted a .align to ensure the instruction that follows a TBB (when the number of table entries is odd) is 2-byte aligned. Patch by Sandeep Patel. llvm-svn: 77705	2009-07-31 18:35:56 +00:00
Chris Lattner	fc0264a38e	fix PR4650: we only track sizes for certain objects, so only put something into the mergable section if it is one of our special cases. This could obviously be improved, but this is the minimal fix and restores us to the previous behavior. llvm-svn: 77679	2009-07-31 16:17:13 +00:00
Evan Cheng	5811ab5cf3	When fp is not eliminated, instructions with T2_i12 modes will be changed to T2_i8 ones. Take that into consideration when determining stack size limit for reserving register scavenging slot. llvm-svn: 77642	2009-07-30 23:29:25 +00:00
David Goodwin	0bfc8312c2	Darwin assembler now recognizes "orn", so remove workaround. llvm-svn: 77627	2009-07-30 21:51:41 +00:00
David Goodwin	ce774e2383	Darwin assembler now supports "rrx", so remove workaround. llvm-svn: 77625	2009-07-30 21:38:40 +00:00
David Goodwin	79c079b478	Cleanup and include code selection for some frame index cases. llvm-svn: 77622	2009-07-30 18:56:48 +00:00
Evan Cheng	e62288fdd4	Optimize some common usage patterns of atomic built-ins __sync_add_and_fetch() and __sync_sub_and_fetch. When the return value is not used (i.e. only care about the value in the memory), x86 does not have to use add to implement these. Instead, it can use add, sub, inc, dec instructions with the "lock" prefix. This is currently implemented using a bit of instruction selection trick. The issue is the target independent pattern produces one output and a chain and we want to map it into one that just output a chain. The current trick is to select it into a merge_values with the first definition being an implicit_def. The proper solution is to add new ISD opcodes for the no-output variant. DAG combiner can then transform the node before it gets to target node selection. Problem #2 is we are adding a whole bunch of x86 atomic instructions when in fact these instructions are identical to the non-lock versions. We need a way to add target specific information to target nodes and have this information carried over to machine instructions. Asm printer (or JIT) can use this information to add the "lock" prefix. llvm-svn: 77582	2009-07-30 08:33:02 +00:00
Dan Gohman	49a6f16b7c	Add a new register class to describe operands that can't be SP, due to x86 encoding restrictions. This is currently off by default because it may cause code quality regressions. This is for PR4572. llvm-svn: 77565	2009-07-30 01:56:29 +00:00
Evan Cheng	e3493a91cc	tbb / tbh instructions only branch forward, not backwards. llvm-svn: 77522	2009-07-29 23:20:20 +00:00
Evan Cheng	1f58eed638	Add VFP3 D registers to the DPR register class. llvm-svn: 77521	2009-07-29 23:03:41 +00:00
Bob Wilson	cf19885a32	Change Neon VLDn intrinsics to return multiple values instead of really wide vectors. Likewise, change VSTn intrinsics to take separate arguments for each vector in a multi-vector struct. Adjust tests accordingly. llvm-svn: 77468	2009-07-29 16:39:22 +00:00
Chris Lattner	c5397abb52	fix PR4584 with a trivial patch now that the pieces are in place. llvm-svn: 77434	2009-07-29 05:20:33 +00:00
Evan Cheng	c6d70ae063	Optimize Thumb2 jumptable to use tbb / tbh when all the offsets fit in byte / halfword. llvm-svn: 77422	2009-07-29 02:18:14 +00:00
Eric Christopher	dce1e4949e	Add a couple more tests for the ptest intrinsics to make sure we're grabbing them all correctly. llvm-svn: 77413	2009-07-29 00:51:15 +00:00
Eric Christopher	f7802a33ce	Add support for gcc __builtin_ia32_ptest{z,c,nzc} intrinsics. Lower to ptest instruction plus setcc. Revamp ptest instruction. Add test. llvm-svn: 77407	2009-07-29 00:28:05 +00:00
Evan Cheng	c8bed03349	In thumb2 mode, add pc is unpredictable. Use add + mov pc instead (that is until more optimization goes in). llvm-svn: 77364	2009-07-28 20:53:24 +00:00
David Goodwin	68bb69d6e3	Remove support for ORN to workaround <rdar://problem/7096522>. llvm-svn: 77363	2009-07-28 20:51:25 +00:00
David Goodwin	865c6298d7	Add workaround for <rdar://problem/7098328>. llvm-svn: 77340	2009-07-28 18:15:38 +00:00
Chris Lattner	ebbbf451c9	fix testcase for previous patch. llvm-svn: 77338	2009-07-28 18:04:18 +00:00
Chris Lattner	513a36b63d	Fix PR4639, a ELF-TLS regression from some of my refactoring. llvm-svn: 77336	2009-07-28 17:57:51 +00:00
David Goodwin	e82862e24e	Add Thumb-2 patterns for ARMsrl_flag and ARMsra_flag. llvm-svn: 77329	2009-07-28 17:06:49 +00:00
Evan Cheng	12da273f90	tADDrSPI doesn't have a predicate operand, but tADDhirr and tADDi3 have. llvm-svn: 77305	2009-07-28 07:38:35 +00:00
Evan Cheng	780748d565	- More refactoring. This gets rid of all of the getOpcode calls. - This change also makes it possible to switch between ARM / Thumb on a per-function basis. - Fixed thumb2 routine which expand reg + arbitrary immediate. It was using using ARM so_imm logic. - Use movw and movt to do reg + imm when profitable. - Other code clean ups and minor optimizations. llvm-svn: 77300	2009-07-28 05:48:47 +00:00
David Goodwin	57b51d9f82	ORN does not require (and can not have) the ".w" suffix. "Orthogonality" is a dirty word at ARM. llvm-svn: 77275	2009-07-27 23:34:12 +00:00
David Goodwin	782f242fd7	Add ".w" suffix for wide thumb-2 instructions. llvm-svn: 77199	2009-07-27 16:31:55 +00:00
Sanjiv Gupta	a77a182b04	Test case to check that separate section is created for a global variable specified with section attribute. llvm-svn: 77195	2009-07-27 16:20:41 +00:00
Chris Lattner	57af4ece60	update testcase. llvm-svn: 77192	2009-07-27 15:52:58 +00:00
Chris Lattner	8e58bc9ed4	put normal data into .data instead of .data.rel on elf systems. llvm-svn: 77116	2009-07-26 03:06:11 +00:00
Chris Lattner	397792d981	finish simplifying DarwinTargetAsmInfo::SelectSectionForGlobal for now. Make the section switching directives more consistent by not including \n and including \t for them all. llvm-svn: 77107	2009-07-26 01:24:18 +00:00
Chris Lattner	5b42b45fb9	simplify DarwinTargetAsmInfo::SelectSectionForGlobal a bit and make it more aggressive, we now put: const int G2 __attribute__((weak)) = 42; into the text (readonly) segment like gcc, previously we put it into the data (readwrite) segment. llvm-svn: 77104	2009-07-26 00:51:36 +00:00
Bob Wilson	8a37bbebfd	Add support for ARM Neon VREV instructions. Patch by Anton Korzh, with some modifications from me. llvm-svn: 77101	2009-07-26 00:39:34 +00:00
Chris Lattner	2de9510572	add the most expedient hack to fix PR4619, along with a testcase. Thanks to Rafael for the great example. llvm-svn: 77083	2009-07-25 17:57:37 +00:00
Evan Cheng	3b5791f982	I've lost my mind. PR4572 has not been fixed. llvm-svn: 77031	2009-07-25 01:11:46 +00:00
Evan Cheng	f3a1fce8ae	Change Thumb2 jumptable codegen to one that uses two level jumps: Before: adr r12, #LJTI3_0_0 ldr pc, [r12, +r0, lsl #2] LJTI3_0_0: .long LBB3_24 .long LBB3_30 .long LBB3_31 .long LBB3_32 After: adr r12, #LJTI3_0_0 add pc, r12, +r0, lsl #2 LJTI3_0_0: b.w LBB3_24 b.w LBB3_30 b.w LBB3_31 b.w LBB3_32 This has several advantages. 1. This will make it easier to optimize this to a TBB / TBH instruction + (smaller) table. 2. This eliminate the need for ugly asm printer hack to force the address into thumb addresses (bit 0 is one). 3. Same codegen for pic and non-pic. 4. This eliminate the need to align the table so constantpool island pass won't have to over-estimate the size. Based on my calculation, the later is probably slightly faster as well since ldr pc with shifter address is very slow. That is, it should be a win as long as the HW implementation can do a reasonable job of branch predict the second branch. llvm-svn: 77024	2009-07-25 00:33:29 +00:00
Evan Cheng	8c8e88bd39	Remove a duplicated test. llvm-svn: 77020	2009-07-25 00:24:40 +00:00
Evan Cheng	01740ab57b	Forgot this test earlier. llvm-svn: 77007	2009-07-24 22:42:45 +00:00
Evan Cheng	aee0e1f48c	Fix these tests. llvm-svn: 77006	2009-07-24 22:42:22 +00:00
Eric Christopher	fae639c9ad	Move insertps tests to sse41 combo test file, convert to filecheck format and add an extract/insert test. llvm-svn: 76994	2009-07-24 19:24:26 +00:00
Evan Cheng	3990850a7d	Convert a test to FileCheck. llvm-svn: 76954	2009-07-24 06:01:46 +00:00
Chris Lattner	26aff56462	Remove SectionKind::Small*. This was only used on mips, and is apparently a sad mistake that is regretted. :) llvm-svn: 76935	2009-07-24 03:11:51 +00:00
Richard Osborne	fc39e417a8	Add tests for handling of globals and tls on the XCore. These currently fail but pass when run against r76652. llvm-svn: 76923	2009-07-24 00:38:20 +00:00
Dan Gohman	17151155ed	Remove the IA-64 backend. llvm-svn: 76920	2009-07-24 00:30:09 +00:00
Evan Cheng	dc99f07113	Thumb2 does not allow the use of "pc" register as part of the load / store address. llvm-svn: 76909	2009-07-23 23:09:51 +00:00
Evan Cheng	d2919a1773	Fix up ARM constant island pass for Thumb2. Also fixed up code to fully use the SoImm field for ADR on ARM mode. llvm-svn: 76890	2009-07-23 18:27:47 +00:00
Chris Lattner	dc13b7c637	merge one more sse41 test into sse41.ll llvm-svn: 76853	2009-07-23 04:49:39 +00:00
Chris Lattner	70d5783535	merge another sse41 test into sse41.ll llvm-svn: 76852	2009-07-23 04:43:48 +00:00
Chris Lattner	08fc6e6e40	merge sse41-pmovx.ll into sse41.ll llvm-svn: 76850	2009-07-23 04:39:09 +00:00
Chris Lattner	b9cdd3153c	change a test to run in filecheck style. Rename it to be a general dumping ground of various SSE4.1 tests, since filecheck can reasonably handle them all in one file. Generalize it to check x86-64 stuff as well since it has a different ABI (a convenient way to test both the reg and mem forms of these instructions). llvm-svn: 76848	2009-07-23 04:33:02 +00:00
Eric Christopher	b1b77ca862	Support insertps via the intrinsic and add a couple of simple testcases to make sure it's being generated. llvm-svn: 76843	2009-07-23 02:22:41 +00:00
Eric Christopher	327cb795a1	Add test for pinsrd and pinsrb instructions. llvm-svn: 76840	2009-07-23 01:58:04 +00:00
Dan Gohman	b215100c7c	Revert r75663 (and r76805), as it is causing regressions on powerpc. llvm-svn: 76823	2009-07-23 00:09:46 +00:00
Dan Gohman	824ab40381	x86 isel tweak: use lea (%reg,%reg) instead of lea (,%reg,2). llvm-svn: 76817	2009-07-22 23:26:55 +00:00
Dan Gohman	cdbef5f2c0	Add -march=ppc32 lines so that this test doesn't ever default to ppc64. llvm-svn: 76805	2009-07-22 22:08:31 +00:00
Evan Cheng	e270d4a4dd	Use getTargetConstant instead of getConstant since it's meant as an constant operand. llvm-svn: 76803	2009-07-22 22:03:29 +00:00
Dan Gohman	c510293251	Make the grep line in this test more specific, to avoid unintended matches. llvm-svn: 76802	2009-07-22 22:02:42 +00:00
Evan Cheng	d2d52d1906	Ignore undef uses. llvm-svn: 76799	2009-07-22 21:51:42 +00:00
Duncan Sands	0cf7f5d6d2	Revert commit 76707, it was breaking the llvm-gcc build on linux platforms. The binutils assembler does not recognize the "s" flag, see for example http://sourceware.org/binutils/docs/as/Section.html llvm-svn: 76733	2009-07-22 10:35:05 +00:00
Chris Lattner	8ebaec6b27	set the ELF "small" flag on objects that end up in .rodata.cst4 consistently, updating a mips testcase to expect it. llvm-svn: 76707	2009-07-22 00:41:56 +00:00
Evan Cheng	332a6590ae	Remove a big test case. llvm-svn: 76669	2009-07-21 22:52:04 +00:00
Evan Cheng	38e88cb53f	Do not select tSXTB / tSXTH in thumb2 mode. llvm-svn: 76600	2009-07-21 18:15:26 +00:00
Chris Lattner	8e55200089	convert this test to filecheck format, which is faster and avoids false matches of "st" -> "stdin" llvm-svn: 76591	2009-07-21 17:36:24 +00:00
Chris Lattner	b61f9c8c8d	add a testcase for the pic16 section handling stuff. llvm-svn: 76579	2009-07-21 16:48:20 +00:00
Evan Cheng	07a6ac6b29	Another rewriter bug exposed by recent coalescer changes. ReuseInfo::GetRegForReload() should make sure the "switched" register is in the desired register class. I'm surprised this hasn't caused more failures in the past. llvm-svn: 76558	2009-07-21 09:15:00 +00:00
Chris Lattner	83423aa276	remove a very large testcase for now. llvm-svn: 76537	2009-07-21 06:28:36 +00:00
Evan Cheng	a7bb55ebb6	Fix a dagga combiner bug: avoid creating illegal constant. Is this really a winning transformation? fold (shl (srl x, c1), c2) -> (shl (and x, (shl -1, c1)), (sub c2, c1)) or (srl (and x, (shl -1, c1)), (sub c1, c2)) llvm-svn: 76535	2009-07-21 05:40:15 +00:00
Evan Cheng	0d8b0cf3b8	Fix ARM isle code that optimize multiply by constants which are power-of-2 +/- 1. llvm-svn: 76520	2009-07-21 00:31:12 +00:00
Evan Cheng	9a47392f2e	Cross RC coalescing is now on by default. llvm-svn: 76519	2009-07-21 00:22:59 +00:00
David Greene	40c68ad3bb	Re-apply 75490, 75806 and 76177 with fixes and tests. Efficiency comes next. llvm-svn: 76486	2009-07-20 22:02:59 +00:00
Evan Cheng	a2b8c3f98f	Forgot this test earlier. llvm-svn: 76485	2009-07-20 21:46:42 +00:00
Evan Cheng	57106d6dc0	Use TII->findCommutedOpIndices to find the commute operands (rather than guessing). llvm-svn: 76472	2009-07-20 21:16:08 +00:00
Evan Cheng	027d9f93ea	Fix some sub-reg coalescing bugs where the coalescer wasn't updating the resulting interval's register class. llvm-svn: 76458	2009-07-20 19:47:55 +00:00
Dan Gohman	33a3fd0b9c	Revert the addition of hasNoPointerOverflow to GEPOperator. Getelementptrs that are defined to wrap are virtually useless to optimization, and getelementptrs that are undefined on any kind of overflow are too restrictive -- it's difficult to ensure that all intermediate addresses are within bounds. I'm going to take a different approach. Remove a few optimizations that depended on this flag. llvm-svn: 76437	2009-07-20 17:43:30 +00:00
Chris Lattner	58f9bb2ccd	implement a new magic global "llvm.compiler.used" which is like llvm.used, but doesn't cause ".no_dead_strip" to be emitted on darwin. llvm-svn: 76399	2009-07-20 06:14:25 +00:00
Evan Cheng	4e4eb0b00c	Restore AsmWriterEmitter.cpp back to 74742. The recent changes broke Thumb. llvm-svn: 76398	2009-07-20 06:10:07 +00:00
Jakob Stoklund Olesen	aba695c7d0	Fix http://llvm.org/bugs/show_bug.cgi?id=4583 Inline asm instructions may have additional <imp-def,kill> register operands. These operands are not marked with a flag like the normal asm operands, so we must not assert that there is a flag. llvm-svn: 76373	2009-07-19 19:09:59 +00:00
Evan Cheng	090db9b7a9	Catch more coalescing opportunities. llvm-svn: 76282	2009-07-18 04:52:23 +00:00
Evan Cheng	e20cbf3068	Enable cross register class coalescing. llvm-svn: 76281	2009-07-18 02:10:10 +00:00
Evan Cheng	a776067d3f	Fix pr4552. Stack slot coloring with register must take care not to generate illegal ams. llvm-svn: 76258	2009-07-17 22:42:51 +00:00
Evan Cheng	18fe458103	Fix x86 inline ams 'q' constraint support. In 32-bit mode, it's just like 'Q', i.e. EAX, EDX, ECX, EBX. In 64-bit mode, it just means all the i64r registers. Yeah, that makes sense. llvm-svn: 76248	2009-07-17 22:13:25 +00:00
Chris Lattner	52d436e98b	rename test. llvm-svn: 76197	2009-07-17 18:05:55 +00:00
Eli Friedman	97f3f965eb	Make promotion in operation legalization for SETCC work correctly. llvm-svn: 76153	2009-07-17 05:16:04 +00:00
Anton Korobeynikov	c5df7e2dc1	Emit cross regclass register moves for thumb2. Minor code duplication cleanup. llvm-svn: 76124	2009-07-16 23:26:06 +00:00
Dale Johannesen	c4148c4ec7	Assume an inline asm might be a call, so we get stack alignment right when it is. This is not ideal but conservatively correct. Adjust a test to compensate for changed stack offset value. gcc.apple/asm-block-57.c llvm-svn: 76120	2009-07-16 22:34:45 +00:00
Jakob Stoklund Olesen	070fab8a1f	Teach MachineInstr::isRegTiedToDefOperand() to correctly parse inline asm operands. The inline asm operands must be parsed from the first flag, you cannot assume that an immediate operand preceeding a register use operand is the flag. PowerPC "m" operands are represented as (flag, imm, reg) triples. isRegTiedToDefOperand() would incorrectly interpret the imm as the flag. llvm-svn: 76101	2009-07-16 20:58:34 +00:00
Evan Cheng	357645efad	Changed my mind. We now allow remat of instructions whose defs have subreg indices. llvm-svn: 76100	2009-07-16 20:15:00 +00:00
Evan Cheng	fdd0eb4011	With recent MC changes, RIP base register is explicitly modeled. Make sure we add it when x86 V_SET0 / V_SETALLONES (by transforming it into a constpool load) into the use instruction. llvm-svn: 76094	2009-07-16 18:44:05 +00:00
Anton Korobeynikov	77a50bd3a8	Make xfail proper llvm-svn: 76065	2009-07-16 14:53:47 +00:00
Anton Korobeynikov	73fcd3d962	Temporary disable 16 bit bswap llvm-svn: 76063	2009-07-16 14:35:57 +00:00
Anton Korobeynikov	902facfe96	Add bswap patterns llvm-svn: 76061	2009-07-16 14:34:52 +00:00
Anton Korobeynikov	3ae30e08ef	Fix logic inversion for RI-mode address selection llvm-svn: 76052	2009-07-16 14:31:14 +00:00
Anton Korobeynikov	6c2c47ecb2	Unbreak the test llvm-svn: 76051	2009-07-16 14:30:49 +00:00
Anton Korobeynikov	4121039bef	Expand 32-bit bitconverts via memory llvm-svn: 76050	2009-07-16 14:30:29 +00:00
Anton Korobeynikov	bc2ead6ea3	Fix incomin arg stack frame offset in case we need to generate stack frame llvm-svn: 76049	2009-07-16 14:29:57 +00:00
Anton Korobeynikov	bd41c83ab0	Revert the commit, it just hides the real bug llvm-svn: 76045	2009-07-16 14:28:26 +00:00
Anton Korobeynikov	2acdac0f8e	Lower anyext to zext, 32-bit stuff does not have any implicit zero-extension side effects llvm-svn: 76035	2009-07-16 14:24:41 +00:00
Anton Korobeynikov	b25949b0f5	Provide consistent subreg idx scheme. This (hopefully) fixes remaining divide problems llvm-svn: 76011	2009-07-16 14:18:17 +00:00
Anton Korobeynikov	091872cb37	Implement 'large' PIC model llvm-svn: 76006	2009-07-16 14:16:05 +00:00
Anton Korobeynikov	569a94c4d0	Implement shifts properly (hopefilly - finally!) llvm-svn: 76005	2009-07-16 14:15:24 +00:00
Anton Korobeynikov	fe8df8ff61	Properly handle divides. As a bonus - implement memory versions of them. llvm-svn: 76003	2009-07-16 14:14:33 +00:00
Anton Korobeynikov	34ad780d0d	32 bit shifts have only 12 bit displacements llvm-svn: 76000	2009-07-16 14:13:24 +00:00
Anton Korobeynikov	1eb6262b4b	Consolidate reg-imm / reg-reg-imm address mode selection logic in one place. llvm-svn: 75990	2009-07-16 14:10:17 +00:00
Anton Korobeynikov	62f8515b1c	Add support for 12 bit displacements llvm-svn: 75988	2009-07-16 14:09:35 +00:00
Anton Korobeynikov	43d33bd6d2	Emit proper lowering of load from arg stack slot llvm-svn: 75986	2009-07-16 14:08:42 +00:00
Anton Korobeynikov	a8197bb651	Implement dynamic allocas llvm-svn: 75985	2009-07-16 14:08:15 +00:00
Anton Korobeynikov	7193e2670e	Add jump tables llvm-svn: 75984	2009-07-16 14:07:50 +00:00
Anton Korobeynikov	2ff298fad0	Add rotates llvm-svn: 75981	2009-07-16 14:06:49 +00:00
Anton Korobeynikov	9362d9aa76	Add patterns for integer negate llvm-svn: 75980	2009-07-16 14:06:27 +00:00
Anton Korobeynikov	f07c7941f0	Provide proper patterns for and with imm instructions. Tune the tests accordingly. llvm-svn: 75979	2009-07-16 14:06:00 +00:00
Anton Korobeynikov	59049d9176	Add 32 bit and reg-imm and disable invalid patterns for now llvm-svn: 75978	2009-07-16 14:05:32 +00:00
Anton Korobeynikov	2d218394c6	Add z9 and z10 target processors. Mark z10-only instructions as such. llvm-svn: 75977	2009-07-16 14:05:00 +00:00
Anton Korobeynikov	d568f6dce2	Proper lower 'small' results llvm-svn: 75962	2009-07-16 13:58:24 +00:00
Anton Korobeynikov	f1bf3176c6	Completel forgot about unconditional branches llvm-svn: 75961	2009-07-16 13:57:52 +00:00
Anton Korobeynikov	15d6e8785b	Lower addresses of globals llvm-svn: 75960	2009-07-16 13:57:27 +00:00
Anton Korobeynikov	a442cdfb04	Test (incomplete) for easy muls llvm-svn: 75959	2009-07-16 13:57:03 +00:00
Anton Korobeynikov	f0d7d6ce65	Provide "wide" muls and divs/rems llvm-svn: 75958	2009-07-16 13:56:42 +00:00
Anton Korobeynikov	b04a4fa5c1	Tests for cmp / br_cc / select_cc llvm-svn: 75949	2009-07-16 13:53:15 +00:00
Anton Korobeynikov	8695a30066	Emit callee-saved regs spills / restores llvm-svn: 75943	2009-07-16 13:51:12 +00:00
Anton Korobeynikov	d694b9ff8b	Some preliminary call lowering llvm-svn: 75941	2009-07-16 13:50:21 +00:00
Anton Korobeynikov	018599fc0b	Prologue / epilogue emission llvm-svn: 75940	2009-07-16 13:49:49 +00:00
Anton Korobeynikov	09890bd434	Add simple frame index elimination llvm-svn: 75939	2009-07-16 13:49:25 +00:00
Anton Korobeynikov	5dc5629100	Provide proper test :) llvm-svn: 75938	2009-07-16 13:48:59 +00:00
Anton Korobeynikov	405833dfb6	Add address computation stuff llvm-svn: 75935	2009-07-16 13:47:59 +00:00
Anton Korobeynikov	df99232d27	Add mem-imm stores llvm-svn: 75933	2009-07-16 13:47:14 +00:00
Anton Korobeynikov	44f8bbfb3f	Add stores and truncstores llvm-svn: 75931	2009-07-16 13:45:00 +00:00
Anton Korobeynikov	11b91b4e2e	Add patterns for various extloads llvm-svn: 75930	2009-07-16 13:44:30 +00:00
Anton Korobeynikov	04be818918	Add shifts and reg-imm address matching llvm-svn: 75927	2009-07-16 13:43:18 +00:00
Anton Korobeynikov	cf7ea6a94f	Add bunch of 32-bit patterns... Uffff :) llvm-svn: 75926	2009-07-16 13:42:31 +00:00
Anton Korobeynikov	ebe2de0e14	Add bunch of reg-imm movs llvm-svn: 75921	2009-07-16 13:34:50 +00:00
Anton Korobeynikov	28234bcde2	Provide masked reg-imm 'or' and 'and' llvm-svn: 75919	2009-07-16 13:33:57 +00:00
Anton Korobeynikov	1c4c7823ae	Fix test running lines llvm-svn: 75918	2009-07-16 13:33:21 +00:00
Anton Korobeynikov	0d76b17a78	Add reg-reg and pattern llvm-svn: 75917	2009-07-16 13:32:49 +00:00
Anton Korobeynikov	f9fe4036f2	Add sub reg-reg pattern llvm-svn: 75916	2009-07-16 13:32:16 +00:00
Anton Korobeynikov	a083d7af53	Add xor reg-reg pattern llvm-svn: 75915	2009-07-16 13:31:28 +00:00
Anton Korobeynikov	65096d6a60	Add or reg-reg pattern. llvm-svn: 75914	2009-07-16 13:30:53 +00:00
Anton Korobeynikov	18172d786f	Add add reg-reg and reg-imm patterns llvm-svn: 75913	2009-07-16 13:30:15 +00:00
Anton Korobeynikov	09082fa01a	Add simple reg-reg and reg-imm moves llvm-svn: 75912	2009-07-16 13:29:38 +00:00
Anton Korobeynikov	cf4ba97dba	Minimal lowering for formal_arguments / ret llvm-svn: 75911	2009-07-16 13:28:59 +00:00
Anton Korobeynikov	a3ceeaeda5	Add testsuite dir for systemz stuff llvm-svn: 75910	2009-07-16 13:28:22 +00:00
Richard Osborne	0cceec520c	Combine an unaligned store of unaligned load into a memmove. llvm-svn: 75908	2009-07-16 12:50:48 +00:00
Richard Osborne	bfdc557c8a	Expand unaligned 32 bit loads from an address which is a constant offset from a 32 bit aligned base as follows: ldw low, base[offset >> 2] ldw high, base[(offset >> 2) + 1] shr low_shifted, low, (offset & 0x3) * 8 shl high_shifted, high, 32 - (offset & 0x3) * 8 or result, low_shifted, high_shifted Expand 32 bit loads / stores with 16 bit alignment into two 16 bit loads / stores. llvm-svn: 75902	2009-07-16 10:42:35 +00:00
Richard Osborne	25b33cb035	Custom lower unaligned 32 bit stores and loads into libcalls. This is a big code size win since before they were expanding to upto 16 instructions. llvm-svn: 75901	2009-07-16 10:21:18 +00:00
Evan Cheng	84517443ca	Let callers decide the sub-register index on the def operand of rematerialized instructions. Avoid remat'ing instructions whose def have sub-register indices for now. It's just really really hard to get all the cases right. llvm-svn: 75900	2009-07-16 09:20:10 +00:00
Evan Cheng	43229fb489	ShortenDeadCopySrcLiveRange needs to be more conservative in multi-kill situations. llvm-svn: 75838	2009-07-15 21:39:50 +00:00
Richard Osborne	a8edd048c2	Fix pattern for LD16S_3r, add basic tests to check load / store instructions are being properly selected. llvm-svn: 75797	2009-07-15 17:06:59 +00:00
Richard Osborne	57489b0658	Fix XCoreTargetLowering::isLegalAddressingMode to handle non simple VTs. llvm-svn: 75788	2009-07-15 15:46:56 +00:00
Chris Lattner	55452c2bea	fix an arm codegen bug (the same as PR4482 on ppc) where available_externally symbols were not getting stubs. While I'm at it, add a big testcase for stub generation to make sure I don't break anything. llvm-svn: 75737	2009-07-15 04:12:33 +00:00
Chris Lattner	7d1f9542c2	get the PPC stub temporary label from the mangler instead of using horrible string hacking. This gives us a different label, but it's just an assembler temporary, so the name doesn't matter. llvm-svn: 75733	2009-07-15 02:56:53 +00:00
Chris Lattner	dab248ac95	convert this to filecheck style and make it a test of darwin/PPC's extremely elaborate pic/nopic stubs. llvm-svn: 75726	2009-07-15 01:43:31 +00:00
Chris Lattner	815337abd6	simplify this test to test the esentials. llvm-svn: 75725	2009-07-15 01:32:33 +00:00
Chris Lattner	d7fec20cba	convert to filecheck style, simplify RUN line, and add comment. llvm-svn: 75667	2009-07-14 19:49:11 +00:00
Chris Lattner	109866bf21	convert this test to filecheck style llvm-svn: 75663	2009-07-14 18:57:40 +00:00
Chris Lattner	8c9a96b966	Reapply my previous asmprinter changes now with more testing and two additional bug fixes: 1. The bug that everyone hit was a problem in the asmprinter where it would remove $stub but keep the L prefix on a name when emitting the indirect symbol. This is easy to fix by keeping the name of the stub and the name of the symbol in a StringMap instead of just keeping a StringSet and trying to reconstruct it late. 2. There was a problem printing the personality function. The current logic to print out the personality function from the DWARF information is a bit of a cesspool right now that duplicates a bunch of other logic in the asm printer. The short version of it is that it depends on emitting both the L and _ prefix for symbols (at least on darwin) and until I can untangle it, it is best to switch the mangler back to emitting both prefixes. llvm-svn: 75646	2009-07-14 18:17:16 +00:00
Daniel Dunbar	966932ccb7	Revert r75610 (and r75620, which was blocking the revert), in the hopes of unbreaking llvm-gcc (on Darwin). --- Reverse-merging r75620 into '.': U include/llvm/Support/Mangler.h --- Reverse-merging r75610 into '.': U test/CodeGen/X86/loop-hoist.ll G include/llvm/Support/Mangler.h U lib/Target/X86/AsmPrinter/X86ATTAsmPrinter.cpp U lib/VMCore/Mangler.cpp llvm-svn: 75636	2009-07-14 15:57:55 +00:00
Chris Lattner	774f2a2d51	Change the X86 asmprinter to use the mangler to apply suffixes like "$non_lazy_ptr" to symbols instead of doing it with "printSuffixedName". This gets us to the point where there is a real separation between computing a symbol name and printing it, something I need for MC printer stuff. This patch also fixes a corner case bug where unnamed private globals wouldn't get the private label prefix. Next up, rename all uses of getValueName -> getMangledName for better greppability, and then tackle the ppc/arm backends to eliminate "printSuffixedName". llvm-svn: 75610	2009-07-14 06:04:35 +00:00
Chris Lattner	f34815b32f	Change the internal interface to makeNameProper to take a bool that indicates whether the label is private or not, instead of taking prefix stuff. One effect of this is that symbols will be generated with just the private prefix, instead of both the private prefix and the user-label-prefix, but this doesn't matter as long as it is consistent. For example we'll now get "Lfoo" instead of "L_foo". These are just assembler temporary labels anyway, so they never even make it into the .o file. llvm-svn: 75607	2009-07-14 04:50:12 +00:00
David Goodwin	72b80ac9b1	Fix detection of valid BFC immediates. llvm-svn: 75576	2009-07-14 00:57:56 +00:00
Bill Wendling	e604b776a7	Check for the correct unnamed name. llvm-svn: 75573	2009-07-14 00:53:58 +00:00
Dan Gohman	dbaddda21f	Check in a reduced version of this testcase. llvm-svn: 75544	2009-07-13 23:04:44 +00:00
Chris Lattner	ec8efcb44e	Two changes: 1) unique globals with the existing "Count" local in Mangler, not with atomic nonsense. Using atomics will give us nondeterminstic output from the compiler when using multiple threads, which is bad. 2) Do not mangle an unknown global name with a type suffix. We don't need this anymore now that llvm ir doesn't have type planes. llvm-svn: 75541	2009-07-13 22:48:46 +00:00
Dan Gohman	054d2a7837	Add testcases for PR4538, PR4537, and PR4534. llvm-svn: 75533	2009-07-13 22:30:31 +00:00
Chris Lattner	92ce8381f5	remove tests for removed intrinsics. llvm-svn: 75433	2009-07-12 21:30:06 +00:00
Chris Lattner	f39f55d46c	add nounwind llvm-svn: 75407	2009-07-12 00:46:16 +00:00
Nick Lewycky	d57fb023e0	Darwin prepends an _ to internal globals, Linux doesn't. llvm-svn: 75405	2009-07-11 23:48:59 +00:00
Chris Lattner	38df005e12	fix x86-64 static codegen to materialize the address of a global with movl instead of lea. It is better for code size (and presumably efficiency) to use: movl $foo, %eax rather than: leal foo, eax Both give a nice zero extending "move immediate" instruction, the former is just smaller. Note that global addresses should be handled different by the x86 backend, but I chose to follow the style already in place and add more fixme's. llvm-svn: 75403	2009-07-11 23:17:29 +00:00
Chris Lattner	056dfc6f90	this test was incorrect for x86-64 static. It passed on darwin, because darwin doesn't have static x86-64 mode. llvm-svn: 75392	2009-07-11 22:30:05 +00:00
Chris Lattner	e91900097e	Fix PR4533, which is about buggy codegen in x86-64 -static mode. Basically, using: lea symbol(%rip), %rax is not valid in -static mode, because the current RIP may not be within 32-bits of "symbol" when an app is built partially pic and partially static. The fix for this is to compile it to: lea symbol, %rax It would be better to codegen this as: movq $symbol, %rax but that will come next. The hard part of fixing this bug was fixing abi-isel, which was actively testing for the wrong behavior. Also, the RUN lines are completely impossible to understand what they are testing. To help with this, convert the -static x86-64 codegen tests to use filecheck. This is much more stable and makes it more clear what the codegen is expected to be. llvm-svn: 75382	2009-07-11 20:29:19 +00:00
Chris Lattner	20adc670b2	We get the P modifier wrong in a lot of cases, just add some more rigorous testing. In addition to fixing this, I still need to do some more testing on darwin. llvm-svn: 75362	2009-07-11 08:30:22 +00:00
Evan Cheng	017288a4fc	Don't put IT instruction before conditional branches. llvm-svn: 75361	2009-07-11 07:26:20 +00:00
Evan Cheng	0794c6a083	Smarter isel of ldrsb / ldrsh. Only make use of these when [r,r] address is feasible. llvm-svn: 75360	2009-07-11 07:08:13 +00:00
Evan Cheng	cd4cdd1157	Major changes to Thumb (not Thumb2). Many 16-bit instructions either modifies CPSR when they are outside the IT blocks, or they can predicated when in Thumb2. Move the implicit def of CPSR to an optional def which defaults CPSR. This allows the 's' bit to be toggled dynamically. A side-effect of this change is asm printer is now using unified assembly. There are some minor clean ups and fixes as well. llvm-svn: 75359	2009-07-11 06:43:01 +00:00
Chris Lattner	e3c4765bac	convert test to use FileCheck, which is much more precise and faster than the previous RUN lines. Hopefully this will be an inspiration for future tests :) llvm-svn: 75261	2009-07-10 18:34:47 +00:00
Evan Cheng	0f9cce7951	Add a thumb2 pass to insert IT blocks. llvm-svn: 75218	2009-07-10 01:54:42 +00:00
Evan Cheng	223ac25930	Remove a bogus assertion. llvm-svn: 75206	2009-07-10 00:23:48 +00:00
Bob Wilson	9ce44e2521	Handle 'a' modifier on inline assembly operands. This is part of the fix for pr4521. llvm-svn: 75201	2009-07-09 23:54:51 +00:00
Eli Friedman	2b77eef160	Make EXTRACT_VECTOR_ELT a bit more flexible in terms of the returned value. Adjust other code to deal with that correctly. Make DAGTypeLegalizer::PromoteIntRes_EXTRACT_VECTOR_ELT take advantage of this new flexibility to simplify the code and make it deal with unusual vectors (like <4 x i1>) correctly. Fixes PR3037. llvm-svn: 75176	2009-07-09 22:01:03 +00:00
Evan Cheng	7452c968e4	Targets sometimes assign fixed stack object to spill certain callee-saved registers based on dynamic conditions. For example, X86 EBP/RBP, when used as frame register has to be spilled in the first fixed object. It should inform PEI this so it doesn't get allocated another stack object. Also, it should not be spilled as other callee-saved registers but rather its spilling and restoring are being handled by emitPrologue and emitEpilogue. Avoid spilling it twice. llvm-svn: 75116	2009-07-09 06:53:48 +00:00

... 2 3 4 5 6 ...

2235 Commits