llvm-project

Commit Graph

Author	SHA1	Message	Date
Bob Wilson	a3f1901531	Use a target-specific VMOVIMM DAG node instead of BUILD_VECTOR to represent NEON VMOV-immediate instructions. This simplifies some things. llvm-svn: 108275	2010-07-13 21:16:48 +00:00
Evan Cheng	0cc4ad983d	Extend the r107852 optimization which turns some fp compare to code sequence using only i32 operations. It now optimize some f64 compares when fp compare is exceptionally slow (e.g. cortex-a8). It also catches comparison against 0.0. llvm-svn: 108258	2010-07-13 19:27:42 +00:00
Evan Cheng	25f9364cbd	Optimize some vfp comparisons to integer ones. This patch implements the simplest case when the following conditions are met: 1. The arguments are f32. 2. The arguments are loads and they have no uses other than the comparison. 3. The comparison code is EQ or NE. e.g. vldr.32 s0, [r1] vldr.32 s1, [r0] vcmpe.f32 s1, s0 vmrs apsr_nzcv, fpscr beq LBB0_2 => ldr r1, [r1] ldr r0, [r0] cmp r0, r1 beq LBB0_2 More complicated cases will be implemented in subsequent patches. llvm-svn: 107852	2010-07-08 02:08:50 +00:00
Dan Gohman	fe7532a308	Split the SDValue out of OutputArg so that SelectionDAG-independent code can do calling-convention queries. This obviates OutputArgReg. llvm-svn: 107786	2010-07-07 15:54:55 +00:00
Dale Johannesen	ce97d55ad9	The hasMemory argument is irrelevant to how the argument for an "i" constraint should get lowered; PR 6309. While this argument was passed around a lot, this is the only place it was used, so it goes away from a lot of other places. llvm-svn: 106893	2010-06-25 21:55:36 +00:00
Bob Wilson	f3f7a770b7	Add basic support for NEON modified immediates besides VMOV. llvm-svn: 106030	2010-06-15 19:05:35 +00:00
Bob Wilson	5b2b504038	Rename functions referring to VMOV immediates to refer to NEON "modified immediate" operands. These functions have so far only been used for VMOV but they also apply to other NEON instructions with modified immediate operands. No functional changes. llvm-svn: 105969	2010-06-14 22:19:57 +00:00
Bob Wilson	d8a9a04739	For NEON vectors with 32- or 64-bit elements, select BUILD_VECTORs and VECTOR_SHUFFLEs to REG_SEQUENCE instructions. The standard ISD::BUILD_VECTOR node corresponds closely to REG_SEQUENCE but I couldn't use it here because its operands do not get legalized. That is pretty awful, but I guess it makes sense for other targets. Instead, I have added an ARM-specific version of BUILD_VECTOR that will have its operands properly legalized. This fixes the rest of Radar 7872877. llvm-svn: 105439	2010-06-04 00:04:02 +00:00
Dale Johannesen	d679ff7330	Early implementation of tail call for ARM. A temporary flag -arm-tail-calls defaults to off, so there is no functional change by default. Intrepid users may try this; simple cases work but there are bugs. llvm-svn: 105413	2010-06-03 21:09:53 +00:00
Jim Grosbach	84511e1526	Clean up 80 column violations. No functional change. llvm-svn: 105350	2010-06-02 21:53:11 +00:00
Jim Grosbach	c9f532dddc	back out 104862/104869. Can reuse stacksave after all. Very cool. llvm-svn: 104897	2010-05-27 23:11:57 +00:00
Jim Grosbach	5cde219fb1	add ISD::STACKADDR to get the current stack pointer. Will be used by sjlj EH to update the jmpbuf in the presence of VLAs. llvm-svn: 104862	2010-05-27 18:23:48 +00:00
Jim Grosbach	c98892fdaa	Adjust eh.sjlj.setjmp to properly have a chain and to have an opcode entry in ISD::. No functional change. llvm-svn: 104734	2010-05-26 20:22:18 +00:00
Evan Cheng	168ced94d8	Implement @llvm.returnaddress. rdar://8015977. llvm-svn: 104421	2010-05-22 01:47:14 +00:00
Jim Grosbach	bd9485db63	Implement eh.sjlj.longjmp for ARM. Clean up the intrinsic a bit. Followups: docs patch for the builtin and eh.sjlj.setjmp cleanup to match longjmp. llvm-svn: 104419	2010-05-22 01:06:18 +00:00
Evan Cheng	4401f8873c	Allow targets more controls on what nodes are scheduled by reg pressure, what for latency in hybrid mode. llvm-svn: 104293	2010-05-20 23:26:43 +00:00
Evan Cheng	4cad68eb34	Allow TargetLowering::getRegClassFor() to be called on illegal types. Also allow target to override it in order to map register classes to illegal but synthesizable types. e.g. v4i64, v8i64 for ARM / NEON. llvm-svn: 103854	2010-05-15 02:18:07 +00:00
Dan Gohman	bb919dfb6b	Implement a bunch more TargetSelectionDAGInfo infrastructure. Move EmitTargetCodeForMemcpy, EmitTargetCodeForMemset, and EmitTargetCodeForMemmove out of TargetLowering and into SelectionDAGInfo to exercise this. llvm-svn: 103481	2010-05-11 17:31:57 +00:00
Dan Gohman	4df9d9ce11	Remove the TargetLowering::getSubtarget() virtual function, which was unused. TargetMachine::getSubtarget() is used instead. llvm-svn: 103474	2010-05-11 16:21:03 +00:00
Dan Gohman	25c1653700	Get rid of the EdgeMapping map. Instead, just check for BasicBlock changes before doing phi lowering for switches. llvm-svn: 102809	2010-05-01 00:01:06 +00:00
Dan Gohman	21cea8ac2e	Use const qualifiers with TargetLowering. This eliminates several const_casts, and it reinforces the design of the Target classes being immutable. SelectionDAGISel::IsLegalToFold is now a static member function, because PIC16 uses it in an unconventional way. There is more room for API cleanup here. And PIC16's AsmPrinter no longer uses TargetLowering. llvm-svn: 101635	2010-04-17 15:26:15 +00:00
Dan Gohman	31ae586c74	Move per-function state out of TargetLowering subclasses and into MachineFunctionInfo subclasses. llvm-svn: 101634	2010-04-17 14:41:14 +00:00
Mon P Wang	c576ee9040	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100304	2010-04-04 03:10:48 +00:00
Mon P Wang	999c1b927b	Revert r100191 since it breaks objc in clang llvm-svn: 100199	2010-04-02 18:43:02 +00:00
Mon P Wang	a972ab8564	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100191	2010-04-02 18:04:15 +00:00
Bob Wilson	6f7fd28824	Revert Mon Ping's change 99928, since it broke all the llvm-gcc buildbots. llvm-svn: 99948	2010-03-30 22:27:04 +00:00
Mon P Wang	7460571381	Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) A update of langref will occur in a subsequent checkin. llvm-svn: 99928	2010-03-30 20:55:56 +00:00
Bob Wilson	e4191e719b	Revert this change, since it was causing ARM performance regressions. --- Reverse-merging r98889 into '.': U lib/Target/ARM/ARMInstrNEON.td U lib/Target/ARM/ARMISelLowering.h U lib/Target/ARM/ARMInstrInfo.td U lib/Target/ARM/ARMInstrVFP.td U lib/Target/ARM/ARMISelLowering.cpp U lib/Target/ARM/ARMInstrFormats.td llvm-svn: 99010	2010-03-19 22:51:32 +00:00
Anton Korobeynikov	f11aa9e7b4	Get rid of target-specific fp <-> int nodes when still I'm here. llvm-svn: 98889	2010-03-18 22:35:45 +00:00
Anton Korobeynikov	64578d5599	Get rid of target-specific nodes for fp16 <-> fp32 conversion. llvm-svn: 98888	2010-03-18 22:35:37 +00:00
Anton Korobeynikov	d7fece38fc	Add codegen support for FP16 on ARM llvm-svn: 98502	2010-03-14 18:42:31 +00:00
Bob Wilson	c6c13a3515	Use NEON vmin/vmax instructions for floating-point selects. Radar 7461718. llvm-svn: 96572	2010-02-18 06:05:53 +00:00
Jim Grosbach	a570d05228	tighten up eh.setjmp sequence a bit. llvm-svn: 95603	2010-02-08 23:22:00 +00:00
Evan Cheng	6f36a083ef	Revert 95130. llvm-svn: 95160	2010-02-02 23:55:14 +00:00
Evan Cheng	c1b0116ff1	Pass callsite return type to TargetLowering::LowerCall and use that to check sibcall eligibility. llvm-svn: 95130	2010-02-02 21:29:10 +00:00
Evan Cheng	67a69dd2ed	Eliminate target hook IsEligibleForTailCallOptimization. Target independent isel should always pass along the "tail call" property. Change target hook LowerCall's parameter "isTailCall" into a refernce. If the target decides it's impossible to honor the tail call request, it should set isTailCall to false to make target independent isel happy. llvm-svn: 94626	2010-01-27 00:07:07 +00:00
Jim Grosbach	8546ec9c14	Patch by David Conrad: "On ARMv6T2 this turns cttz into rbit, clz instead of the 4 instruction sequence it is now." llvm-svn: 93758	2010-01-18 19:58:49 +00:00
Jim Grosbach	8f9a3ac12c	Framework for atomic binary operations. The emitter for the pseudo instructions just issues an error for the moment. The front end won't yet generate these intrinsics for ARM, so this is behind the scenes until complete. llvm-svn: 91200	2009-12-12 01:40:06 +00:00
Jim Grosbach	5c4e99fca6	Rough first pass at compare_and_swap atomic builtins for ARM mode. Work in progress. llvm-svn: 91090	2009-12-11 01:42:04 +00:00
Jim Grosbach	53e8854443	Add memory barrier intrinsic support for ARM. Moving towards adding the atomic operations intrinsics. llvm-svn: 91003	2009-12-10 00:11:09 +00:00
Evan Cheng	15b80e4a9f	isLegalICmpImmediate should take a signed integer; code clean up. llvm-svn: 86964	2009-11-12 07:13:11 +00:00
Evan Cheng	3d3c24a82c	Add TargetLowering::isLegalICmpImmediate. It tells LSR what immediate can be folded into target icmp instructions. llvm-svn: 86858	2009-11-11 19:05:52 +00:00
Jim Grosbach	d7cf55cd0e	Use Unified Assembly Syntax for the ARM backend. llvm-svn: 86494	2009-11-09 00:11:35 +00:00
Bob Wilson	1cf0b03064	Add ARM codegen for indirect branches. clang/test/CodeGen/indirect-goto.c runs! (unoptimized) llvm-svn: 85577	2009-10-30 05:45:42 +00:00
Evan Cheng	4a609f3cef	Use fconsts and fconstd to materialize small fp constants. llvm-svn: 85362	2009-10-28 01:44:26 +00:00
Anton Korobeynikov	29a44df5f8	ARM does not support offset folding (yet). Disable it for now. This fixes PR5031. Unfortunately, there is no small testcase :( llvm-svn: 82643	2009-09-23 19:04:09 +00:00
Evan Cheng	270d0f986f	Enhance EmitInstrWithCustomInserter() so target can specify CFG changes that sdisel will use to properly complete phi nodes. Not functionality change yet. llvm-svn: 82273	2009-09-18 21:02:19 +00:00
Sandeep Patel	68c5f477fa	Retype from unsigned to CallingConv::ID accordingly. Approved by Bob Wilson. llvm-svn: 80773	2009-09-02 08:44:58 +00:00
Bob Wilson	e0636a7aed	Remove unneeded ARM-specific DAG nodes for VLD* and VST* Neon operations. The instructions can be selected directly from the intrinsics. We will need to add some ARM-specific nodes for VLD/VST of 3 and 4 128-bit vectors, but those are not yet implemented. llvm-svn: 80117	2009-08-26 17:39:53 +00:00
Bob Wilson	a70623102e	Match VTRN, VZIP, and VUZP shuffles. Restore the tests for these operations, now using shuffles instead of intrinsics. llvm-svn: 79673	2009-08-21 20:54:19 +00:00
Anton Korobeynikov	232b19c3d5	Fix some typos and use type-based isel for VZIP/VUZP/VTRN llvm-svn: 79625	2009-08-21 12:41:42 +00:00
Anton Korobeynikov	9a232f46a8	Add lowering of ARM 4-element shuffles to multiple instructios via perfectshuffle-generated table. llvm-svn: 79624	2009-08-21 12:41:24 +00:00
Anton Korobeynikov	c32e99e3ed	Use masks not nodes for vector shuffle predicates. Provide set of 'legal' masks, so legalizer won't infinite cycle llvm-svn: 79619	2009-08-21 12:40:07 +00:00
Bob Wilson	32cd8550ce	Add support for Neon VEXT (vector extract) shuffles. This is derived from a patch by Anton Korzh. I modified it to recognize the VEXT shuffles during legalization and lower them to a target-specific DAG node. llvm-svn: 79428	2009-08-19 17:03:43 +00:00
Bill Wendling	bae6b2cca3	Reapply r79127. It was fixed by d0k. llvm-svn: 79136	2009-08-15 21:21:19 +00:00
Bill Wendling	d3fade656f	Revert r79127. It was causing compilation errors. llvm-svn: 79135	2009-08-15 21:14:01 +00:00
Evan Cheng	52d4e64711	Change allowsUnalignedMemoryAccesses to take type argument since some targets support unaligned mem access only for certain types. (Should it be size instead?) ARM v7 supports unaligned access for i16 and i32, some v6 variants support it as well. llvm-svn: 79127	2009-08-15 19:23:44 +00:00
Evan Cheng	dc49a8d3f1	Add Thumb2 lsr hooks. llvm-svn: 79032	2009-08-14 20:09:37 +00:00
Bob Wilson	eb54d51759	Create a new ARM-specific DAG node, VDUP, to represent a splat from a scalar_to_vector. Generate these VDUP nodes during legalization instead of trying to recognize the pattern during selection. llvm-svn: 78994	2009-08-14 05:13:08 +00:00
Bob Wilson	cce31f6831	During legalization, change Neon vdup_lane operations from shuffles to target-specific VDUPLANE nodes. This allows the subreg handling for the quad-register version to be done easily with Pats in the .td file, instead of with custom code in ARMISelDAGToDAG.cpp. llvm-svn: 78993	2009-08-14 05:08:32 +00:00
Bob Wilson	ef6e602bf4	Revert r78852 for now. I want to do this differently, but I don't have time to fix it tonight. llvm-svn: 78896	2009-08-13 05:58:56 +00:00
Bob Wilson	ff2db10211	Recognize Neon VDUP shuffles during legalization instead of selection. llvm-svn: 78852	2009-08-12 22:54:19 +00:00
Bob Wilson	ea3a402ae7	Recognize Neon VREV shuffles during legalization instead of selection. llvm-svn: 78850	2009-08-12 22:31:50 +00:00
Owen Anderson	53aa7a960c	Rename MVT to EVT, in preparation for splitting SimpleValueType out into its own struct type. llvm-svn: 78610	2009-08-10 22:56:29 +00:00
Evan Cheng	b972e5633f	It turns out most of the thumb2 instructions are not allowed to touch SP. The semantics of such instructions are unpredictable. We have just been lucky that tests have been passing. This patch takes pain to ensure all the PEI lowering code does the right thing when lowering frame indices, insert code to manipulate stack pointers, etc. It's also custom lowering dynamic stack alloc into pseudo instructions so we can insert the right instructions at scheduling time. This fixes PR4659 and PR4682. llvm-svn: 78361	2009-08-07 00:34:42 +00:00
Bob Wilson	0127031c20	Implement Neon VST[234] operations. llvm-svn: 78330	2009-08-06 18:47:44 +00:00
Anton Korobeynikov	22ef75155e	Missed pieces for ARM HardFP ABI. Patch by Sandeep Patel! llvm-svn: 78225	2009-08-05 19:04:42 +00:00
Dan Gohman	f9bbcd1afd	Major calling convention code refactoring. Instead of awkwardly encoding calling-convention information with ISD::CALL, ISD::FORMAL_ARGUMENTS, ISD::RET, and ISD::ARG_FLAGS nodes, TargetLowering provides three virtual functions for targets to override: LowerFormalArguments, LowerCall, and LowerRet, which replace the custom lowering done on the special nodes. They provide the same information, but in a more immediately usable format. This also reworks much of the target-independent tail call logic. The decision of whether or not to perform a tail call is now cleanly split between target-independent portions, and the target dependent portion in IsEligibleForTailCallOptimization. This also synchronizes all in-tree targets, to help enable future refactoring and feature work. llvm-svn: 78142	2009-08-05 01:29:28 +00:00
Bob Wilson	f45dee3ad2	Lower Neon VLD* intrinsics to custom DAG nodes, and manually allocate the results to fixed registers. llvm-svn: 78025	2009-08-04 00:36:16 +00:00
Evan Cheng	c6d70ae063	Optimize Thumb2 jumptable to use tbb / tbh when all the offsets fit in byte / halfword. llvm-svn: 77422	2009-07-29 02:18:14 +00:00
Evan Cheng	c8bed03349	In thumb2 mode, add pc is unpredictable. Use add + mov pc instead (that is until more optimization goes in). llvm-svn: 77364	2009-07-28 20:53:24 +00:00
Bob Wilson	8a37bbebfd	Add support for ARM Neon VREV instructions. Patch by Anton Korzh, with some modifications from me. llvm-svn: 77101	2009-07-26 00:39:34 +00:00
Evan Cheng	f3a1fce8ae	Change Thumb2 jumptable codegen to one that uses two level jumps: Before: adr r12, #LJTI3_0_0 ldr pc, [r12, +r0, lsl #2] LJTI3_0_0: .long LBB3_24 .long LBB3_30 .long LBB3_31 .long LBB3_32 After: adr r12, #LJTI3_0_0 add pc, r12, +r0, lsl #2 LJTI3_0_0: b.w LBB3_24 b.w LBB3_30 b.w LBB3_31 b.w LBB3_32 This has several advantages. 1. This will make it easier to optimize this to a TBB / TBH instruction + (smaller) table. 2. This eliminate the need for ugly asm printer hack to force the address into thumb addresses (bit 0 is one). 3. Same codegen for pic and non-pic. 4. This eliminate the need to align the table so constantpool island pass won't have to over-estimate the size. Based on my calculation, the later is probably slightly faster as well since ldr pc with shifter address is very slow. That is, it should be a win as long as the HW implementation can do a reasonable job of branch predict the second branch. llvm-svn: 77024	2009-07-25 00:33:29 +00:00
Bob Wilson	844d6c82a7	Fix comment typos. llvm-svn: 75479	2009-07-13 18:11:36 +00:00
Bill Wendling	512ff7353e	Update comments to make it clear that the function alignment is the Log2 of the bytes and not bytes. llvm-svn: 74624	2009-07-01 18:50:55 +00:00
Bill Wendling	31ceb1bcba	Add an "alignment" field to the MachineFunction object. It makes more sense to have the alignment be calculated up front, and have the back-ends obey whatever alignment is decided upon. This allows for future work that would allow for precise no-op placement and the like. llvm-svn: 74564	2009-06-30 22:38:32 +00:00
David Goodwin	dbf11ba800	Rename ARMcmpNZ to ARMcmpZ and use it to represent comparisons that set only the Z flag (i.e. eq and ne). Make ARMcmpZ commutative. llvm-svn: 74423	2009-06-29 15:33:01 +00:00
Bob Wilson	2e076c4e02	Add support for ARM's Advanced SIMD (NEON) instruction set. This is still a work in progress but most of the NEON instruction set is supported. llvm-svn: 73919	2009-06-22 23:27:02 +00:00
Anton Korobeynikov	a8fd40b50a	Address review comments: add 3 ARM calling conventions. Dispatch C calling conv. to one of these conventions based on target triple and subtarget features. llvm-svn: 73530	2009-06-16 18:50:49 +00:00
Bob Wilson	dd0e23610a	Minor formatting fixes. llvm-svn: 72172	2009-05-20 16:30:25 +00:00
Jim Grosbach	06928192ae	Update the names of the exception handling sjlj instrinsics to llvm.eh.sjlj.* for better clarity as to their purpose and scope. Add a description of llvm.eh.sjlj.setjmp to ExceptionHandling.html. (llvm.eh.sjlj.longjmp documentation coming when that implementation is added). llvm-svn: 71758	2009-05-14 00:46:35 +00:00
Jim Grosbach	91fa781df3	Spelling correction s/builting/builtin/ and remove trailing whitespace in a few places llvm-svn: 71735	2009-05-13 22:32:43 +00:00
Jim Grosbach	aeca45dd6f	Add support for GCC compatible builtin setjmp and longjmp intrinsics. This is a supporting preliminary patch for GCC-compatible SjLJ exception handling. Note that these intrinsics are not designed to be invoked directly by the user, but rather used by the front-end as target hooks for exception handling. llvm-svn: 71610	2009-05-12 23:59:14 +00:00
Bob Wilson	ea09d4aca8	Clean up formatting, remove trailing whitespace, fix comment typos and punctuation. No functional changes. llvm-svn: 69378	2009-04-17 20:35:10 +00:00
Bob Wilson	a4c2290e5f	Use CallConvLower.h and TableGen descriptions of the calling conventions for ARM. Patch by Sandeep Patel. llvm-svn: 69371	2009-04-17 19:07:39 +00:00
Bob Wilson	cf1ec2cc68	Fix PR3862: Recognize some ARM-specific constraints for immediates in inline assembly. llvm-svn: 68218	2009-04-01 17:58:54 +00:00
Dan Gohman	747e55bc9a	Constify TargetInstrInfo::EmitInstrWithCustomInserter, allowing ScheduleDAG's TLI member to use const. llvm-svn: 64018	2009-02-07 16:15:20 +00:00
Dale Johannesen	abf66b8343	Add some DL propagation to places that didn't have it yet. More coming. llvm-svn: 63673	2009-02-03 22:26:09 +00:00
Dan Gohman	02b93136e9	Const-qualify getPreIndexedAddressParts and friends. llvm-svn: 62259	2009-01-15 16:29:45 +00:00
Duncan Sands	6ed40141f7	Change the interface to the type legalization method ReplaceNodeResults: rather than returning a node which must have the same number of results as the original node (which means mucking around with MERGE_VALUES, and which is also easy to get wrong since SelectionDAG folding may mean you don't get the node you expect), return the results in a vector. llvm-svn: 60348	2008-12-01 11:39:25 +00:00
Dan Gohman	ed1cf1a8f1	Fix these enums' starting values to reflect the way that instruction opcodes are now numbered. No functionality change. llvm-svn: 56497	2008-09-23 18:42:32 +00:00
Dan Gohman	2ce6f2ad5e	Rename SDOperand to SDValue. llvm-svn: 54128	2008-07-27 21:46:04 +00:00
Duncan Sands	93e180342a	Rather than having a different custom legalization hook for each way in which a result type can be legalized (promotion, expansion, softening etc), just use one: ReplaceNodeResults, which returns a node with exactly the same result types as the node passed to it, but presumably with a bunch of custom code behind the scenes. No change if the new LegalizeTypes infrastructure is not turned on. llvm-svn: 53137	2008-07-04 11:47:58 +00:00
Duncan Sands	13237ac3b9	Wrap MVT::ValueType in a struct to get type safety and better control the abstraction. Rename the type to MVT. To update out-of-tree patches, the main thing to do is to rename MVT::ValueType to MVT, and rewrite expressions like MVT::getSizeInBits(VT) in the form VT.getSizeInBits(). Use VT.getSimpleVT() to extract a MVT::SimpleValueType for use in switch statements (you will get an assert failure if VT is an extended value type - these shouldn't exist after type legalization). This results in a small speedup of codegen and no new testsuite failures (x86-64 linux). llvm-svn: 52044	2008-06-06 12:08:01 +00:00
Dan Gohman	da44054867	Fix the SVOffset values for loads and stores produced by memcpy/memset expansion. It was a bug for the SVOffset value to be used in the actual address calculations. llvm-svn: 50359	2008-04-28 17:15:20 +00:00
Dan Gohman	2505d86783	Fix const-correctness issues with the SrcValue handling in the memory intrinsic expansion code. llvm-svn: 49666	2008-04-14 17:55:48 +00:00
Dan Gohman	544ab2c50b	Drop ISD::MEMSET, ISD::MEMMOVE, and ISD::MEMCPY, which are not Legal on any current target and aren't optimized in DAGCombiner. Instead of using intermediate nodes, expand the operations, choosing between simple loads/stores, target-specific code, and library calls, immediately. Previously, the code to emit optimized code for these operations was only used at initial SelectionDAG construction time; now it is used at all times. This fixes some cases where rep;movs was being used for small copies where simple loads/stores would be better. This also cleans up code that checks for alignments less than 4; let the targets make that decision instead of doing it in target-independent code. This allows x86 to use rep;movs in low-alignment cases. Also, this fixes a bug that resulted in the use of rep;stos for memsets of 0 with non-constant memory size when the alignment was at least 4. It's better to use the library in this case, which can be significantly faster when the size is large. This also preserves more SourceValue information when memory intrinsics are lowered into simple loads/stores. llvm-svn: 49572	2008-04-12 04:36:06 +00:00
Dan Gohman	e1d9ee66ed	Simplify some logic in ComputeMaskedBits. And change ComputeMaskedBits to pass the mask APInt by value, not by reference. llvm-svn: 47096	2008-02-13 22:28:48 +00:00
Dan Gohman	f990faf23b	Convert SelectionDAG::ComputeMaskedBits to use APInt instead of uint64_t. Add an overload that supports the uint64_t interface for use by clients that haven't been updated yet. llvm-svn: 47039	2008-02-13 00:35:47 +00:00
Nate Begeman	5420516b3f	This method should be virtual llvm-svn: 46723	2008-02-04 23:04:24 +00:00
Evan Cheng	29cfb67e28	Even though InsertAtEndOfBasicBlock is an ugly hack it still deserves a proper name. Rename it to EmitInstrWithCustomInserter since it does not necessarily insert instruction at the end. llvm-svn: 46562	2008-01-30 18:18:23 +00:00
Chris Lattner	f3ebc3f3d2	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Chris Lattner	f3f4ad9dd6	implement a trivial readme entry. llvm-svn: 44380	2007-11-27 22:36:16 +00:00
Chris Lattner	f81d5886c6	Several changes: 1) Change the interface to TargetLowering::ExpandOperationResult to take and return entire NODES that need a result expanded, not just the value. This allows us to handle things like READCYCLECOUNTER, which returns two values. 2) Implement (extremely limited) support in LegalizeDAG::ExpandOp for MERGE_VALUES. 3) Reimplement custom lowering in LegalizeDAGTypes in terms of the new ExpandOperationResult. This makes the result simpler and fully general. 4) Implement (fully general) expand support for MERGE_VALUES in LegalizeDAGTypes. 5) Implement ExpandOperationResult support for ARM f64->i64 bitconvert and ARM i64 shifts, allowing them to work with LegalizeDAGTypes. 6) Implement ExpandOperationResult support for X86 READCYCLECOUNTER and FP_TO_SINT, allowing them to work with LegalizeDAGTypes. LegalizeDAGTypes now passes several more X86 codegen tests when enabled and when type legalization in LegalizeDAG is ifdef'd out. llvm-svn: 44300	2007-11-24 07:07:01 +00:00
Rafael Espindola	fa0df55bdd	Move the LowerMEMCPY and LowerMEMCPYCall to a common place. Thanks for the suggestions Bill :-) llvm-svn: 43742	2007-11-05 23:12:20 +00:00
Rafael Espindola	419b6d7ce4	Make ARM and X86 LowerMEMCPY identical by moving the isThumb check into getMaxInlineSizeThreshold and by restructuring the X86 version. New I just have to move this to a common place :-) llvm-svn: 43554	2007-10-31 14:39:58 +00:00
Evan Cheng	1f2dd35898	Fix memcpy lowering when addresses are 4-byte aligned but size is not multiple of 4. llvm-svn: 43234	2007-10-22 22:11:27 +00:00
Rafael Espindola	18a831d783	split LowerMEMCPY into LowerMEMCPYCall and LowerMEMCPYInline in the ARM backend. llvm-svn: 43176	2007-10-19 14:35:17 +00:00
Dan Gohman	a160361c85	Migrate X86 and ARM from using X86ISD::{,I}DIV and ARMISD::MULHILO{U,S} to use ISD::{S,U}DIVREM and ISD::{S,U}MUL_HIO. Move the lowering code associated with these operators into target-independent in LegalizeDAG.cpp and TargetLowering.cpp. llvm-svn: 42762	2007-10-08 18:33:35 +00:00
Duncan Sands	86e0119822	Fold the adjust_trampoline intrinsic into init_trampoline. There is now only one trampoline intrinsic. llvm-svn: 41841	2007-09-11 14:10:23 +00:00
Dan Gohman	5f6a9da530	More explicit keywords. llvm-svn: 40757	2007-08-02 21:21:54 +00:00
Duncan Sands	644f917358	Support for trampolines, except for X86 codegen which is still under discussion. llvm-svn: 40549	2007-07-27 12:58:54 +00:00
Dan Gohman	309d3d51b3	Move ComputeMaskedBits, MaskedValueIsZero, and ComputeNumSignBits from TargetLowering to SelectionDAG so that they have more convenient access to the current DAG, in preparation for the ValueType routines being changed from standalone functions to members of SelectionDAG for the pre-legalize vector type changes. llvm-svn: 37704	2007-06-22 14:59:07 +00:00
Evan Cheng	c3c949b473	Allow predicated immediate ARM to ARM calls. llvm-svn: 37659	2007-06-19 21:05:09 +00:00
Dale Johannesen	58698d2534	More effective breakdown of memcpy into repeated load/store. These are now in the order lod;lod;lod;sto;sto;sto which means the load-store optimizer has a better chance of producing ldm/stm. Ideally you would get cooperation from the RA as well but this is not there yet. llvm-svn: 37179	2007-05-17 21:31:21 +00:00
Lauro Ramos Venancio	c39c12a3fa	ARM TLS: implement "general dynamic", "initial exec" and "local exec" models. llvm-svn: 36506	2007-04-27 13:54:47 +00:00
Lauro Ramos Venancio	ee2d164f0f	Implement PIC for arm-linux. llvm-svn: 36324	2007-04-22 00:04:12 +00:00
Chris Lattner	d44e24c896	remove dead target hooks llvm-svn: 35846	2007-04-09 23:33:39 +00:00
Chris Lattner	39f65335d5	remove some dead target hooks, subsumed by isLegalAddressingMode llvm-svn: 35840	2007-04-09 22:27:04 +00:00
Lauro Ramos Venancio	6be85337b0	- Divides the comparisons in two types: comparisons that only use N and Z flags (ARMISD::CMPNZ) and comparisons that use all flags (ARMISD::CMP). - Defines the instructions: TST, TEQ (ARM) and TST (Thumb). llvm-svn: 35573	2007-04-02 01:30:03 +00:00
Chris Lattner	1eb94d973a	implement the new addressing mode description hook. llvm-svn: 35521	2007-03-30 23:15:24 +00:00
Evan Cheng	c2cba18f2b	Remove isLegalAddressImmediate. llvm-svn: 35406	2007-03-28 01:53:55 +00:00
Chris Lattner	d685514e2e	switch TargetLowering::getConstraintType to take the entire constraint, not just the first letter. No functionality change. llvm-svn: 35322	2007-03-25 02:14:49 +00:00
Dale Johannesen	0c6bb5eab7	repair x86 performance, dejagnu problems from previous change llvm-svn: 35245	2007-03-21 21:51:52 +00:00
Dale Johannesen	bacf4acf65	do not share old induction variables when this would result in invalid instructions (that would have to be split later) llvm-svn: 35227	2007-03-20 21:54:54 +00:00
Dale Johannesen	8447d34903	fix obvious comment bug llvm-svn: 35196	2007-03-20 00:30:56 +00:00
Evan Cheng	0e34d6af6b	Added isLegalAddressExpression(). Only allows X +/- C for now. llvm-svn: 35122	2007-03-16 08:43:56 +00:00
Evan Cheng	2150b9286f	Updated TargetLowering LSR addressing mode hooks for ARM and Thumb. llvm-svn: 35075	2007-03-12 23:30:29 +00:00
Evan Cheng	83f35170fa	- Fix codegen for pc relative constant (e.g. JT) in thumb mode: .set PCRELV0, (LJTI1_0_0-(LPCRELL0+4)) LPCRELL0: add r1, pc, #PCRELV0 This is not legal since add r1, pc, #c requires the constant be a multiple of 4. Do the following instead: .set PCRELV0, (LJTI1_0_0-(LPCRELL0+4)) LPCRELL0: mov r1, #PCRELV0 add r1, pc - In thumb mode, it's not possible to use .set generate a pc relative stub address. The stub is ARM code which is in a different section from the thumb code. Load the value from a constpool instead. - Some asm printing clean up. llvm-svn: 33664	2007-01-30 20:37:08 +00:00
Evan Cheng	10043e215b	ARM backend contribution from Apple. llvm-svn: 33353	2007-01-19 07:51:42 +00:00

1 2 3 4 5

230 Commits