llvm-project

Commit Graph

Author	SHA1	Message	Date
Eli Friedman	406c471b69	Make the fast-isel code for literal 0.0 a bit shorter/faster, since 0.0 is common. rdar://problem/9303592 . llvm-svn: 130338	2011-04-27 22:41:55 +00:00
Kevin Enderby	886894cb70	Fix a bug in the case that there is no add or subtract symbol and the offset value is zero so it does not add a NULL expr operand. llvm-svn: 130330	2011-04-27 21:02:27 +00:00
Devang Patel	e3745fdcf3	Revert r130178. It turned out to be not the optimal path to emit complex location expressions. llvm-svn: 130326	2011-04-27 20:29:27 +00:00
Eli Friedman	bcc6914146	Refactor out code to fast-isel a memcpy operation with a small constant length. (I'm planning to use this to implement byval.) llvm-svn: 130274	2011-04-27 01:45:07 +00:00
Eli Friedman	0eea0293d9	Fix an edge case involving branches in fast-isel on x86. rdar://problem/9303306 . llvm-svn: 130272	2011-04-27 01:34:27 +00:00
Chris Lattner	1b06c71668	Transform: "icmp eq (trunc (lshr(X, cst1)), cst" to "icmp (and X, mask), cst" when X has multiple uses. This is useful for exposing secondary optimizations, but the X86 backend isn't ready for this when X has a single use. For example, this can disable load folding. This is inching towards resolving PR6627. llvm-svn: 130238	2011-04-26 20:18:20 +00:00
Jim Grosbach	d4b733e4d8	ARM and Thumb2 support for atomic MIN/MAX/UMIN/UMAX loads. rdar://9326019 llvm-svn: 130234	2011-04-26 19:44:18 +00:00
Jakob Stoklund Olesen	803a200077	Add a TRI::getLargestLegalSuperClass hook to provide an upper limit on register class inflation. The hook will be used by the register allocator when recomputing register classes after removing constraints. Thumb1 code doesn't allow anything larger than tGPR, and x86 needs to ensure that the spill size doesn't change. llvm-svn: 130228	2011-04-26 18:52:33 +00:00
Rafael Espindola	80cb3cb1d6	Print all the moves at a given label instead of just the first one. Remove previous DwarfCFI hack. llvm-svn: 130187	2011-04-26 03:58:56 +00:00
Devang Patel	cae2fbd6fc	Let dwarf writer allocate extra space in the debug location expression. This space, if requested, will be used for complex addresses of the Blocks' variables. llvm-svn: 130178	2011-04-26 00:12:46 +00:00
Chris Lattner	6e29892430	add a missed bitfield instcombine. llvm-svn: 130137	2011-04-25 18:44:26 +00:00
Akira Hatanaka	0e7ee666b7	Lower BlockAddress node when relocation-model is static. llvm-svn: 130131	2011-04-25 17:10:45 +00:00
Chandler Carruth	9b73c8e293	Remove some hard coded CR-LFs. Some of these were the entire files, one of these was just one line of a file. Explicitly set the eol-style property on the files to try and ensure this fix stays. llvm-svn: 130125	2011-04-25 07:11:23 +00:00
Duncan Sands	56ca6292dc	Fix comment typo. Noticed by Liu. llvm-svn: 130120	2011-04-25 06:21:43 +00:00
Sebastian Redl	5519ff9d4e	Fix Target/ARM/Thumb1FrameLowering.h header guard. llvm-svn: 130097	2011-04-24 15:47:01 +00:00
Jay Foad	1a180156b6	Remove unused STL header includes. llvm-svn: 130068	2011-04-23 19:53:52 +00:00
Benjamin Kramer	3db054650b	Silence an overzealous uninitialized variable warning from GCC. llvm-svn: 130053	2011-04-23 08:21:06 +00:00
Andrew Trick	0ed5778a1e	Thumb2 and ARM add/subtract with carry fixes. Fixes Thumb2 ADCS and SBCS lowering: <rdar://problem/9275821>. t2ADCS/t2SBCS are now pseudo instructions, consistent with ARM, so the assembly printer correctly prints the 's' suffix. Fixes Thumb2 adde -> SBC matching to check for live/dead carry flags. Fixes the internal ARM machine opcode mnemonic for ADCS/SBCS. Fixes ARM SBC lowering to check for live carry (potential bug). llvm-svn: 130048	2011-04-23 03:55:32 +00:00
Andrew Trick	1a1f8d4640	whitespace llvm-svn: 130046	2011-04-23 03:24:11 +00:00
Johnny Chen	57c892860e	Disassembly of A8.6.59 LDR (literal) Encoding T1 (16-bit thumb instruction) should print out ldr, not ldr.n. rdar://problem/9267772 llvm-svn: 130008	2011-04-22 19:12:43 +00:00
Benjamin Kramer	341c11da3b	DAGCombine: fold "(zext x) == C" into "x == (trunc C)" if the trunc is lossless. On x86 this allows to fold a load into the cmp, greatly reducing register pressure. movzbl (%rdi), %eax cmpl $47, %eax -> cmpb $47, (%rdi) This shaves 8k off gcc.o on i386. I'll leave applying the patch in README.txt to Chris :) llvm-svn: 130005	2011-04-22 18:47:44 +00:00
Devang Patel	3c39ec2933	Add asserts. llvm-svn: 129995	2011-04-22 16:44:29 +00:00
Benjamin Kramer	4c81624735	X86: Try to use a smaller encoding by transforming (X << C1) & C2 into (X & (C2 >> C1)) & C1. (Part of PR5039) This tends to happen a lot with bitfield code generated by clang. A simple example for x86_64 is uint64_t foo(uint64_t x) { return (x&1) << 42; } which used to compile into bloated code: shlq $42, %rdi ## encoding: [0x48,0xc1,0xe7,0x2a] movabsq $4398046511104, %rax ## encoding: [0x48,0xb8,0x00,0x00,0x00,0x00,0x00,0x04,0x00,0x00] andq %rdi, %rax ## encoding: [0x48,0x21,0xf8] ret ## encoding: [0xc3] with this patch we can fold the immediate into the and: andq $1, %rdi ## encoding: [0x48,0x83,0xe7,0x01] movq %rdi, %rax ## encoding: [0x48,0x89,0xf8] shlq $42, %rax ## encoding: [0x48,0xc1,0xe0,0x2a] ret ## encoding: [0xc3] It's possible to save another byte by using 'andl' instead of 'andq' but I currently see no way of doing that without making this code even more complicated. See the TODOs in the code. llvm-svn: 129990	2011-04-22 15:30:40 +00:00
Evan Cheng	c0d2004e3c	In Thumb2 mode, lower frame indix references to: add <rd>, sp, #<imm8> ldr <rd>, [sp, #<imm8>] When the offset from sp is multiple of 4 and in range of 0-1020. This saves code size by utilizing 16-bit instructions. rdar://9321541 llvm-svn: 129971	2011-04-22 01:42:52 +00:00
Rafael Espindola	5395f44fe8	Compute the size of the FDE encoding instead of hard coding it. Update X8664_ELFTargetObjectFile::getFDEEncoding to match reality. llvm-svn: 129959	2011-04-22 00:08:43 +00:00
Rafael Espindola	6aea59268a	Remove unused argument. llvm-svn: 129955	2011-04-21 23:39:26 +00:00
Devang Patel	94ad6ac13c	Fix DWARF description of Q registers. llvm-svn: 129952	2011-04-21 23:22:35 +00:00
Devang Patel	3712c14be9	Fix DWARF description of S registers. llvm-svn: 129947	2011-04-21 22:48:26 +00:00
Devang Patel	46bda61a81	As per ARM docs, register Dx is described as DW_OP_regx(256+x) in DWARF. llvm-svn: 129922	2011-04-21 17:51:06 +00:00
Justin Holewinski	d74d88a861	PTX: Expand useable register space llvm-svn: 129913	2011-04-21 16:08:02 +00:00
Che-Liang Chiou	14c48e5d66	ptx: fix parameter ordering This patch depends on the prior fix r129908 that changes to use std::find, rather than std::binary_search, on unordered array. Patch by Dan Bailey llvm-svn: 129909	2011-04-21 10:56:58 +00:00
Che-Liang Chiou	cdc51569ee	ptx: PTXMachineFunctionInfo no longer sort registers and so should not use std::binary_search llvm-svn: 129908	2011-04-21 10:16:20 +00:00
Evan Cheng	5f1ba4cd2d	Remove -use-divmod-libcall. Let targets opt in when they are available. llvm-svn: 129884	2011-04-20 22:20:12 +00:00
Eli Friedman	c93d399eed	Revert r129846; it's breaking a buildbot. See http://google1.osuosl.org:8011/builders/llvm-x86_64-linux-checks/builds/825/steps/test.llvm.stage2/logs/st.ll llvm-svn: 129869	2011-04-20 19:00:08 +00:00
Jakob Stoklund Olesen	0e34c1dfac	Prefer cheap registers for busy live ranges. On the x86-64 and thumb2 targets, some registers are more expensive to encode than others in the same register class. Add a CostPerUse field to the TableGen register description, and make it available from TRI->getCostPerUse. This represents the cost of a REX prefix or a 32-bit instruction encoding required by choosing a high register. Teach the greedy register allocator to prefer cheap registers for busy live ranges (as indicated by spill weight). llvm-svn: 129864	2011-04-20 18:19:48 +00:00
Stuart Hastings	7850af6ea0	Excise unintended hunk in 129858. <rdar://problem/7662569> llvm-svn: 129862	2011-04-20 18:09:26 +00:00
Stuart Hastings	45fe3c38c5	ARM byval support. Will be enabled by another patch to the FE. <rdar://problem/7662569> llvm-svn: 129858	2011-04-20 16:47:52 +00:00
Justin Holewinski	7d8895e767	PTX: Add intrinsics to list of built-in intrinsics, which allows them to be used by Clang. To help Clang integration, the PTX target has been split into two targets: ptx32 and ptx64, depending on the desired pointer size. - Add GCCBuiltin class to all intrinsics - Split PTX target into ptx32 and ptx64 llvm-svn: 129851	2011-04-20 15:37:17 +00:00
Che-Liang Chiou	6586f84685	ptx: add integer div and rem instruction Patched by Dan Bailey llvm-svn: 129848	2011-04-20 09:28:55 +00:00
Che-Liang Chiou	5a952b3c67	ptx: add floating-point comparison to setp Patched by Dan Bailey llvm-svn: 129847	2011-04-20 09:28:20 +00:00
Che-Liang Chiou	49160f9a71	ptx: fix parameter ordering Patched by Dan Bailey llvm-svn: 129846	2011-04-20 09:27:19 +00:00
Nick Lewycky	4dae63e35b	This should always be signed chars, so use int8_t. This fixes a miscompile when llvm is built with unsigned chars where an immediate such as 0xff would be zero extended to 64-bits, turning "cmp $0xff,%eax" into "cmp $0xffffffffffffffff,%eax". llvm-svn: 129845	2011-04-20 03:19:42 +00:00
Rafael Espindola	e473aaf540	Remove unused arguments. llvm-svn: 129844	2011-04-20 03:08:09 +00:00
Daniel Dunbar	cd01ed5bd6	ADT/Triple: Renambe isOSX... methods to isMacOSX for consistency with the OS triple component. llvm-svn: 129838	2011-04-20 00:14:25 +00:00
Johnny Chen	dc62e59776	Fix typo in the comment. llvm-svn: 129837	2011-04-19 23:58:52 +00:00
Daniel Dunbar	2b9b0e3748	ADT/Triple: Move a variety of clients to using isOSDarwin() and isOSWindows() predicates. llvm-svn: 129816	2011-04-19 21:14:45 +00:00
Daniel Dunbar	100455a3c8	Target/X86: Eliminate uses of getDarwinVers(). llvm-svn: 129813	2011-04-19 21:04:12 +00:00
Daniel Dunbar	44b530369d	Target/X86: Add getTargetTriple() accessor. llvm-svn: 129812	2011-04-19 21:01:47 +00:00
Daniel Dunbar	e3de896b5e	Target/PPC: Kill off DarwinVers, which is now dead. llvm-svn: 129811	2011-04-19 20:59:24 +00:00
Daniel Dunbar	f954a0f028	Target/PPC: Eliminate a use of getDarwinVers(). llvm-svn: 129810	2011-04-19 20:57:03 +00:00
Daniel Dunbar	a37aab2515	Target/PPC: Add a TargetTriple field. llvm-svn: 129809	2011-04-19 20:54:28 +00:00
Daniel Dunbar	9483bb6bf3	Target: Eliminate a use of getDarwinMajorNumber(). llvm-svn: 129803	2011-04-19 20:44:08 +00:00
Eric Christopher	c721b0db6d	Remove some duplicate op action entries and reorganize. llvm-svn: 129781	2011-04-19 18:49:19 +00:00
Bob Wilson	0858c3aaed	This patch combines several changes from Evan Cheng for rdar://8659675. Making use of VFP / NEON floating point multiply-accumulate / subtraction is difficult on current ARM implementations for a few reasons. 1. Even though a single vmla has latency that is one cycle shorter than a pair of vmul + vadd, a RAW hazard during the first (4? on Cortex-a8) can cause additional pipeline stall. So it's frequently better to single codegen vmul + vadd. 2. A vmla folowed by a vmul, vmadd, or vsub causes the second fp instruction to stall for 4 cycles. We need to schedule them apart. 3. A vmla followed vmla is a special case. Obvious issuing back to back RAW vmla + vmla is very bad. But this isn't ideal either: vmul vadd vmla Instead, we want to expand the second vmla: vmla vmul vadd Even with the 4 cycle vmul stall, the second sequence is still 2 cycles faster. Up to now, isel simply avoid codegen'ing fp vmla / vmls. This works well enough but it isn't the optimial solution. This patch attempts to make it possible to use vmla / vmls in cases where it is profitable. A. Add missing isel predicates which cause vmla to be codegen'ed. B. Make sure the fmul in (fadd (fmul)) has a single use. We don't want to compute a fmul and a fmla. C. Add additional isel checks for vmla, avoid cases where vmla is feeding into fp instructions (except for the #3 exceptional case). D. Add ARM hazard recognizer to model the vmla / vmls hazards. E. Add a special pre-regalloc case to expand vmla / vmls when it's likely the vmla / vmls will trigger one of the special hazards. Enable these fp vmlx codegen changes for Cortex-A9. llvm-svn: 129775	2011-04-19 18:11:57 +00:00
Bob Wilson	d04a83f8f2	Add -mcpu=cortex-a9-mp. It's cortex-a9 with MP extension. rdar://8648637. llvm-svn: 129774	2011-04-19 18:11:52 +00:00
Bob Wilson	a2881ee8a4	Avoid some 's' 16-bit instruction which partially update CPSR (and add false dependency) when it isn't dependent on last CPSR defining instruction. rdar://8928208 llvm-svn: 129773	2011-04-19 18:11:49 +00:00
Bob Wilson	df612ba006	Avoid write-after-write issue hazards for Cortex-A9. Add a avoidWriteAfterWrite() target hook to identify register classes that suffer from write-after-write hazards. For those register classes, try to avoid writing the same register in two consecutive instructions. This is currently disabled by default. We should not spill to avoid hazards! The command line flag -avoid-waw-hazard can be used to enable waw avoidance. llvm-svn: 129772	2011-04-19 18:11:45 +00:00
Bob Wilson	3e5944d96b	Some single-precision VFP instructions can execute in either the VPF or Neon pipelines, at least on Cortex-A9. llvm-svn: 129771	2011-04-19 18:11:38 +00:00
Bob Wilson	f33715e554	Improvements for the Cortex-A9 scheduling itineraries. llvm-svn: 129770	2011-04-19 18:11:36 +00:00
Eli Friedman	ee92a6b332	Add support for FastISel'ing varargs calls. llvm-svn: 129765	2011-04-19 17:22:22 +00:00
Chris Lattner	91328b317b	Implement support for x86 fastisel of small fixed-sized memcpys, which are generated en-mass for C++ PODs. On my c++ test file, this cuts the fast isel rejects by 10x and shrinks the generated .s file by 5% llvm-svn: 129755	2011-04-19 05:52:03 +00:00
Chris Lattner	34a08c2344	tidy up llvm-svn: 129753	2011-04-19 05:15:59 +00:00
Chris Lattner	5f4b783426	Implement support for fast isel of calls of i1 arguments, even though they are illegal, when they are a truncate from something else. This eliminates fully half of all the fastisel rejections on a test c++ file I'm working with, which should make a substantial improvement for -O0 compile of c++ code. This fixed rdar://9297003 - fast isel bails out on all functions taking bools llvm-svn: 129752	2011-04-19 05:09:50 +00:00
Chris Lattner	d7f7c93914	Handle i1/i8/i16 constant integer arguments to calls by prepromoting them. Before we would bail out on i1 arguments all together, now we just bail on non-constant ones. Also, we used to emit extraneous code. e.g. test12 was: movb $0, %al movzbl %al, %edi callq _test12 and test13 was: movb $0, %al xorl %edi, %edi movb %al, 7(%rsp) callq _test13f Now we get: movl $0, %edi callq _test12 and: movl $0, %edi callq _test13f llvm-svn: 129751	2011-04-19 04:42:38 +00:00
Chris Lattner	c59290a34c	be layout aware, to produce: testb $1, %al je LBB0_2 ## BB#1: ## %if.then movb $0, %al instead of: testb $1, %al jne LBB0_1 jmp LBB0_2 LBB0_1: ## %if.then movb $0, %al how 'bout that. llvm-svn: 129749	2011-04-19 04:26:32 +00:00
Chris Lattner	2c8a4c3b1b	fix rdar://9297006 - fast isel bails out on trunc to i1 -> bools cry, a common cause of fast isel rejects on c++ code. llvm-svn: 129748	2011-04-19 04:22:17 +00:00
Evan Cheng	7d6cd4902e	Change A9 scheduling itineraries VLD* / VST* entries default to "aligned". That is, it assumes addresses are 64-bit aligned (which should be the more common case). If the alignment is found not to be aligned, then getOperandLatency() would adjust the operand latency computation by one to compensate for it. rdar://9294833 llvm-svn: 129742	2011-04-19 01:21:49 +00:00
Evan Cheng	4079133796	Do not lose mem_operands while lowering VLD / VST intrinsics. llvm-svn: 129738	2011-04-19 00:04:03 +00:00
Jim Grosbach	ddac5dd269	Trim a few unneeded includes. llvm-svn: 129723	2011-04-18 21:35:54 +00:00
Eric Christopher	2e3fbaab39	Invert the meaning of printAliasInstr's return value. It now returns true on success and false on failure. Update callers. llvm-svn: 129722	2011-04-18 21:28:11 +00:00
Sean Callanan	5d73033e0f	Small fix to the ARM AsmParser to ensure that a superclass variable is instantiated properly. llvm-svn: 129713	2011-04-18 20:20:44 +00:00
Chris Lattner	80254a53cc	Add a new bit that ImmLeaf's can opt into, which allows them to duck out of the generated FastISel. X86 doesn't need to generate code to match ADD16ri8 since ADD16ri will do just fine. This is a small codesize win in the generated instruction selector. llvm-svn: 129692	2011-04-18 06:36:55 +00:00
Chris Lattner	c479e0631f	switch the rest of the x86 immediate patterns over to ImmLeaf, simplifying them and exposing more information to tblgen. It would be nice if other target authors adopted this as well, particularly arm since it has fastisel. llvm-svn: 129676	2011-04-17 22:12:55 +00:00
Chris Lattner	2ff8c1a25f	now that predicates have a decent abstraction layer on them, introduce a new kind of predicate: one that is specific to imm nodes. The predicate function specified here just checks an int64_t directly instead of messing around with SDNode's. The virtue of this is that it means that fastisel and other things can reason about these predicates. llvm-svn: 129675	2011-04-17 22:05:17 +00:00
Chris Lattner	514e292b72	Rework our internal representation of node predicates to expose more structure and fix some fixmes. We now have a TreePredicateFn class that handles all of the decoding of these things. This is an internal cleanup that has no impact on the code generated by tblgen. llvm-svn: 129670	2011-04-17 21:38:24 +00:00
Chris Lattner	b53ccb8e36	1. merge fast-isel-shift-imm.ll into fast-isel-x86-64.ll 2. implement rdar://9289501 - fast isel should fold trivial multiplies to shifts 3. teach tblgen to handle shift immediates that are different sizes than the shifted operands, eliminating some code from the X86 fast isel backend. 4. Have FastISel::SelectBinaryOp use (the poorly named) FastEmit_ri_ function instead of FastEmit_ri to simplify code. llvm-svn: 129666	2011-04-17 20:23:29 +00:00
Chris Lattner	eb729d48ff	fix an x86 fast isel issue where we'd completely give up on folding an address when we have a global variable base an an index. Instead, just give up on folding the global variable. Before we'd geenrate: _test: ## @test ## BB#0: movq _rtx_length@GOTPCREL(%rip), %rax leaq (%rax), %rax addq %rdi, %rax movzbl (%rax), %eax ret now we generate: _test: ## @test ## BB#0: movq _rtx_length@GOTPCREL(%rip), %rax movzbl (%rax,%rdi), %eax ret The difference is even more significant when there is a scale involved. This fixes rdar://9289558 - total fail with addr mode formation at -O0/x86-64 llvm-svn: 129664	2011-04-17 17:47:38 +00:00
Chris Lattner	4832660b4d	fix an oversight which caused us to compile the testcase (and other less trivial things) into a dummy lea. Before we generated: _test: ## @test movq _G@GOTPCREL(%rip), %rax leaq (%rax), %rax ret now we produce: _test: ## @test movq _G@GOTPCREL(%rip), %rax ret This is part of rdar://9289558 llvm-svn: 129662	2011-04-17 17:12:08 +00:00
Chris Lattner	4b026b962a	tidy up and reduce indentation. llvm-svn: 129661	2011-04-17 17:05:12 +00:00
Eli Friedman	55f7bf3289	Remove working entry from README. llvm-svn: 129654	2011-04-17 02:36:27 +00:00
Francois Pichet	47f86e6c60	MSVC needs the return 0 to compile. llvm-svn: 129640	2011-04-16 13:59:23 +00:00
Rafael Espindola	a83b177035	Put each personality function in a section. This fixes the gnu ld warning: error in foo.o; no .eh_frame_hdr table will be created. llvm-svn: 129635	2011-04-16 03:51:21 +00:00
Stuart Hastings	ebddfe60a0	Correct result when a branch condition is live across a block boundary. <rdar://problem/8933028> llvm-svn: 129634	2011-04-16 03:31:26 +00:00
Johnny Chen	48592ee5af	Thumb2 BFC was insufficiently encoded. rdar://problem/9292717 llvm-svn: 129619	2011-04-15 22:52:15 +00:00
Johnny Chen	761e1e3512	A8.6.315 VLD3 (single 3-element structure to all lanes) The a bit must be encoded as 0. rdar://problem/9292625 llvm-svn: 129618	2011-04-15 22:49:08 +00:00
Akira Hatanaka	e24891251c	Reverse unnecessary changes made in r129606 and r129608. There is no change in functionality. llvm-svn: 129612	2011-04-15 21:51:11 +00:00
Cameron Zwarich	9c65e4d69c	Add ORR and EOR to the CMP peephole optimizer. It's hard to get isel to generate a case involving EOR, so I only added a test for ORR. llvm-svn: 129610	2011-04-15 21:24:38 +00:00
Akira Hatanaka	d56f2d910b	Fix lines that exceed 80 columns. There is no change in functionality. llvm-svn: 129608	2011-04-15 21:06:38 +00:00
Akira Hatanaka	aef55c8801	Fix lines that have incorrect indentation or exceed 80 columns. There is no change in functionality. llvm-svn: 129606	2011-04-15 21:00:26 +00:00
Cameron Zwarich	0829b3065a	The AND instruction leaves the V flag unmodified, so it falls victim to the same problem as all of the other instructions we fold with CMPs. llvm-svn: 129602	2011-04-15 20:45:00 +00:00
Rafael Espindola	7583dbdc88	Fix cmake build. llvm-svn: 129601	2011-04-15 20:34:45 +00:00
Cameron Zwarich	93eae1571c	Add missing register forms of instructions to the ARM CMP-folding code. This fixes <rdar://problem/9287901>. llvm-svn: 129599	2011-04-15 20:28:28 +00:00
Akira Hatanaka	279169771b	Add pass that expands pseudo instructions into target instructions after register allocation. Define pseudos that get expanded into mtc1 or mfc1 instructions. llvm-svn: 129594	2011-04-15 19:52:08 +00:00
Evan Cheng	a2e61292f0	Increase SubtargetFeatureKV Value and Implies fields to 64 bits since some targets are getting very close to 32 subtarget features. Also teach tablegen to error when there are more than 64 features to guard against undefined behavior. rdar://9282332 llvm-svn: 129590	2011-04-15 19:35:46 +00:00
Rafael Espindola	a01cdb0e37	Add 129518 back with a fix for when we are producing eh just because of debug info. Change ELF systems to use CFI for producing the EH tables. This reduces the size of the clang binary in Debug builds from 690MB to 679MB. llvm-svn: 129571	2011-04-15 15:11:06 +00:00
Chris Lattner	0ab5e2cded	Fix a ton of comment typos found by codespell. Patch by Luis Felipe Strano Moraes! llvm-svn: 129558	2011-04-15 05:18:47 +00:00
NAKAMURA Takumi	b5e3e9dd27	Revert r129518, "Change ELF systems to use CFI for producing the EH tables. This reduces the" It broke several builds. llvm-svn: 129557	2011-04-15 03:35:57 +00:00
Evan Cheng	12bb05b75b	Fix another fcopysign lowering bug. If src is f64 and destination is f32, don't forget to right shift the source by 32 first. rdar://9287902 llvm-svn: 129556	2011-04-15 01:31:00 +00:00
Johnny Chen	681fef5986	For t2BFI, both Inst{26} and Inst{5} "should" be 0. Ref: I.1 Instruction encoding diagrams and pseudocode llvm-svn: 129552	2011-04-15 00:35:08 +00:00
Michael J. Spencer	30088ba110	Add 3DNow! intrinsics. llvm-svn: 129551	2011-04-15 00:32:41 +00:00
Johnny Chen	421316178e	The ARM disassembler did not handle the alignment correctly for VLDDUP instructions (single element or n-element structure to all lanes). llvm-svn: 129550	2011-04-15 00:10:45 +00:00
Evan Cheng	44887f9c7e	Follow up on r127913. Fix Thumb revsh isel. rdar://9286766 llvm-svn: 129548	2011-04-14 23:27:44 +00:00
Johnny Chen	4251b151b1	Add sanity checkings for Thumb2 Load/Store Register Exclusive family of operations. llvm-svn: 129531	2011-04-14 19:13:28 +00:00
Chris Lattner	6f195469b1	move PR9661 out to here. llvm-svn: 129527	2011-04-14 18:47:18 +00:00
Rafael Espindola	aa2a7cd828	Change ELF systems to use CFI for producing the EH tables. This reduces the size of the clang binary in Debug builds from 690MB to 679MB. llvm-svn: 129518	2011-04-14 15:18:53 +00:00
Michael J. Spencer	b88784c185	Fix whitespace and tabs. llvm-svn: 129517	2011-04-14 14:33:36 +00:00
Chris Lattner	1d313c6f6d	add a minor missed dag combine that is blocking mid-level optimization improvements, that will lead to fixing PR6627. llvm-svn: 129504	2011-04-14 04:21:42 +00:00
Bill Wendling	410ec4aad1	As Dan pointed out, movzbl, movsbl, and friends are nicer than their alias (movzx/movsx) because they give more information. Revert that part of the patch. llvm-svn: 129498	2011-04-14 01:46:37 +00:00
Bill Wendling	7e07d6fb69	Have the X86 back-end emit the alias instead of what's being aliased. In most cases, it's much nicer and more informative reading the alias. llvm-svn: 129497	2011-04-14 01:11:51 +00:00
Bill Wendling	6dd69d9241	Add an option to not print the alias of an instruction. It defaults to "print the alias". llvm-svn: 129485	2011-04-13 23:36:21 +00:00
Johnny Chen	d0fb04f437	Thumb disassembler did not handle tBRIND (indirect branch) properly. rdar://problem/9280370 llvm-svn: 129480	2011-04-13 21:59:01 +00:00
Johnny Chen	b6a37bff21	Check for unallocated instruction encodings when disassembling Thumb Branch instructions (tBcc and t2Bcc). rdar://problem/9280470 llvm-svn: 129471	2011-04-13 21:35:49 +00:00
Johnny Chen	ffa6378fd6	The LDRT/STRT (unpriviledged load/store) operations don't take SP or PC as Rt. rdar://problem/9279440 llvm-svn: 129469	2011-04-13 21:04:32 +00:00
Cameron Zwarich	415b5e8341	Fix a typo in an ARM-specific DAG combine. This fixes <rdar://problem/9278274>. llvm-svn: 129468	2011-04-13 21:01:19 +00:00
Cameron Zwarich	9398197ef1	Fix a regression caused by r102515 where explicit alignment on globals is ignored. There was a test to catch this, but it was just blindly updated in a large change. This fixes another part of <rdar://problem/9275290>. llvm-svn: 129466	2011-04-13 20:36:04 +00:00
Johnny Chen	70591cbc60	Check the corner cases for t2LDRSHi12 correctly and mark invalid encodings as such. rdar://problem/9276651 llvm-svn: 129462	2011-04-13 19:46:05 +00:00
Johnny Chen	0d306a7840	Fix a bug where for t2MOVCCi disassembly, the TIED_TO register operand was not properly handled. rdar://problem/9276427 llvm-svn: 129456	2011-04-13 17:51:02 +00:00
Johnny Chen	b2f9fa1fce	Forgot to add this change for http://llvm.org/viewvc/llvm-project?view=rev&revision=129387 . llvm-svn: 129451	2011-04-13 16:56:08 +00:00
Cameron Zwarich	70be27e913	Fix an obvious problem with an alignment computation. AsmPrinter actually does the max itself, so it is not easy to write a test case for this, but I added a test case that would fail if the code in AsmPrinter were removed. llvm-svn: 129432	2011-04-13 09:02:43 +00:00
Cameron Zwarich	8001850ee8	Fix a typo. llvm-svn: 129429	2011-04-13 06:39:16 +00:00
Cameron Zwarich	cdf59f7016	If a global variable has a specified alignment that is less than the preferred alignment for its type, use the minimum of the specified alignment and the ABI alignment. This fixes <rdar://problem/9275290>. llvm-svn: 129428	2011-04-13 06:03:16 +00:00
Bill Wendling	b902f1dd88	Reapply r129401 with patch for clang. llvm-svn: 129419	2011-04-13 00:36:11 +00:00
Johnny Chen	3c2f74c9f3	Add sanity check for Ld/St Dual forms of Thumb2 instructions. rdar://problem/9273947 llvm-svn: 129411	2011-04-12 23:31:00 +00:00
Jakob Stoklund Olesen	987164043c	Add @earlyclobber constraints to the writeback register of all ARM store instructions. The ARMARM specifies these instructions as unpredictable when storing the writeback register. This shouldn't affect code generation much since storing a pointer to itself is quite rare. llvm-svn: 129409	2011-04-12 23:27:48 +00:00
Bill Wendling	dbfde42468	Revert r129401 for now. Clang is using the old way of doing things. llvm-svn: 129403	2011-04-12 22:59:27 +00:00
Bill Wendling	47c24875a1	Remove the unaligned load intrinsics in favor of using native unaligned loads. Now that we have a first-class way to represent unaligned loads, the unaligned load intrinsics are superfluous. First part of <rdar://problem/8460511>. llvm-svn: 129401	2011-04-12 22:46:31 +00:00
Johnny Chen	960eef3db3	The Thumb2 RFE instructions need to have their second halfword fully specified. In addition, the base register is not rGPR, but GPR with th exception that: if n == 15 then UNPREDICTABLE rdar://problem/9273836 llvm-svn: 129391	2011-04-12 21:41:51 +00:00
Johnny Chen	01637b9acb	Add bad register checks for Thumb2 Ld/St instructions. rdar://problem/9269047 llvm-svn: 129387	2011-04-12 21:17:51 +00:00
Johnny Chen	ab86a519f8	The Thumb2 Ld, St, and Preload instructions with the i12 forms should have its Inst{23} be specified as '1' (add = TRUE). Also add a utility function for Thumb2. llvm-svn: 129377	2011-04-12 18:48:00 +00:00
Johnny Chen	d0e2be39ea	Print out a debug message when the reglist fails the sanity check for Thumb Ld/St Multiple. llvm-svn: 129365	2011-04-12 17:09:04 +00:00
Cameron Zwarich	fbcd69b96a	Split a store of a VMOVDRR into two integer stores to avoid mixing NEON and ARM stores of arguments in the same cache line. This fixes the second half of <rdar://problem/8674845>. llvm-svn: 129345	2011-04-12 02:24:17 +00:00
Johnny Chen	672ef14a62	A8.6.16 B Encoding T1 (tBcc) if cond == '1110' then UNDEFINED; rdar://problem/9268681 llvm-svn: 129325	2011-04-12 00:14:49 +00:00
Johnny Chen	dc8bf9ec08	Thumb disassembler was erroneously rejecting "blx sp" instruction. rdar://problem/9267838 llvm-svn: 129320	2011-04-11 23:33:30 +00:00
Wesley Peck	f30a0e2d80	Fix an error in the MBlaze delay slot filler. llvm-svn: 129313	2011-04-11 22:45:02 +00:00
Wesley Peck	1914c39bd4	Add scheduling information for the MBlaze backend. llvm-svn: 129311	2011-04-11 22:31:52 +00:00
Wesley Peck	e3685217d0	Don't crash on invalid instructions when disassembling MBlaze code. This fixes http://llvm.org/bugs/show_bug.cgi?id=9653 llvm-svn: 129303	2011-04-11 21:35:21 +00:00
Johnny Chen	f79d5365de	Fix the bug where the immediate shift amount for Thumb logical shift instructions are incorrectly disassembled. rdar://problem/9266265 llvm-svn: 129298	2011-04-11 21:14:35 +00:00
Owen Anderson	5140802cd9	Fix another using-CPSR-twice bug in my ADCS/SBCS cleanups, and make proper use of the Commutable bit. llvm-svn: 129294	2011-04-11 20:12:19 +00:00
Johnny Chen	74adbddade	Trivial comment fix. llvm-svn: 129288	2011-04-11 18:51:50 +00:00
Johnny Chen	66fab75920	Check invalid register encodings for LdFrm/StFrm ARM instructions and flag them as invalid instructions. llvm-svn: 129286	2011-04-11 18:34:12 +00:00
Kevin Enderby	9377a52c12	Adding support for printing operands symbolically to llvm's public 'C' disassembler API. Hooked this up to the ARM target so such tools as Darwin's otool(1) can now print things like branch targets for example this: blx _puts instead of this: blx #-36 And even print the expression encoded in the Mach-O relocation entried for things like this: movt r0, :upper16:((_foo-_bar)+1234) llvm-svn: 129284	2011-04-11 18:08:50 +00:00
Jay Foad	7c14a558fe	Don't include Operator.h from InstrTypes.h. llvm-svn: 129271	2011-04-11 09:35:34 +00:00
Nicolas Geoffray	9137ee85de	Bugfix in the Cpp backend after API change on PHINode::Create. llvm-svn: 129248	2011-04-10 17:39:40 +00:00
Chris Lattner	fc4fe00a65	fix rdar://8735979 - "int 3" doesn't match to "int3". Unfortunately, InstAlias doesn't allow matching immediate operands, so we have to write C++ code to do this. llvm-svn: 129223	2011-04-09 19:41:05 +00:00
Matt Beaumont-Gay	4e1796e8d1	Fix an apparent typo that made GCC complain llvm-svn: 129160	2011-04-08 21:59:49 +00:00
Evan Cheng	74d92c1924	Change -arm-trap-func= into a non-arm specific option. Now Intrinsic::trap is lowered into a call to the specified trap function at sdisel time. llvm-svn: 129152	2011-04-08 21:37:21 +00:00
Johnny Chen	f2faf4e53a	Check opcoe (dmb, dsb) instead of bitfields matching. llvm-svn: 129148	2011-04-08 20:03:46 +00:00
Johnny Chen	a9570f77d5	Hanlde the checking of bad regs for SMMLAR properly, instead of asserting. PR9650 rdar://problem/9257565 llvm-svn: 129147	2011-04-08 19:41:22 +00:00
Johnny Chen	875e0e4626	Sanity check the option operand for DMB/DSB. PR9648 rdar://problem/9257634 llvm-svn: 129146	2011-04-08 19:18:07 +00:00
Jim Grosbach	a5dcd98a47	Mark hasExtraDefRegAllocReq=1 on LDRD. The previous cleanup of LDRD got overzealous and removed it, causing post-RA scheduling to get overzealous in breaking antidependencies and invalidate these instructions. Hilarity and invalid assembly ensued. rdar://9244161 llvm-svn: 129144	2011-04-08 18:47:05 +00:00
Johnny Chen	7e51b4640f	Add sanity checking for bad register specifier(s) for the DPFrm instructions. Add more test cases to exercise the logical branches related to the above change. llvm-svn: 129117	2011-04-08 00:29:09 +00:00
Bill Wendling	bc3f79044a	Replace the old algorithm that emitted the "print the alias for an instruction" with the newer, cleaner model. It uses the IAPrinter class to hold the information that is needed to match an instruction with its alias. This also takes into account the available features of the platform. There is one bit of ugliness. The way the logic determines if a pattern is unique is O(N2), which is gross. But in reality, the number of items it's checking against isn't large. So while it's N2, it shouldn't be a massive time sink. llvm-svn: 129110	2011-04-07 21:20:06 +00:00
Evan Cheng	9a3f2772f0	Add option to emit @llvm.trap as a function call instead of a trap instruction. rdar://9249183. llvm-svn: 129107	2011-04-07 20:31:12 +00:00
Akira Hatanaka	052163e6d3	Fix indentation. llvm-svn: 129105	2011-04-07 20:25:10 +00:00
Akira Hatanaka	94ee37e487	Update ATUsed every time after expandRegLargeImmPair is called. llvm-svn: 129104	2011-04-07 20:23:26 +00:00
Mon P Wang	27f3330132	Fixed encoding for VEXTqf llvm-svn: 129101	2011-04-07 19:56:12 +00:00
Akira Hatanaka	d6f1c58914	Fix handling of functions with internal linkage. llvm-svn: 129099	2011-04-07 19:51:44 +00:00
Johnny Chen	04efb8f6ce	Add sanity checking for invalid register encodings for signed/unsigned extend instructions. Add some test cases. llvm-svn: 129098	2011-04-07 19:28:58 +00:00
Johnny Chen	07606661f9	Add sanity checking for invalid register encodings for saturating instructions. llvm-svn: 129096	2011-04-07 19:02:08 +00:00
Johnny Chen	194a2267ad	Add some more comments about checkings of invalid register numbers. And two test cases. llvm-svn: 129090	2011-04-07 18:33:19 +00:00
Tanya Lattner	266792a55a	Prevent ARM DAG Combiner from doing an AND or OR combine on an illegal vector type (vectors of size 3). Also included test cases. llvm-svn: 129074	2011-04-07 15:24:20 +00:00
Johnny Chen	313ec7953a	Sanity check MSRi for invalid mask values and reject it as invalid. rdar://problem/9246844 llvm-svn: 129050	2011-04-07 01:37:34 +00:00
Johnny Chen	c0e86fb965	The ARM disassembler was not recognizing USADA8 instruction. Need to add checking for register values for USAD8 and USADA8. rdar://problem/9247060 llvm-svn: 129047	2011-04-07 01:05:52 +00:00
Evan Cheng	a7c7b54dde	Change -arm-divmod-libcall to a target neutral option. llvm-svn: 129045	2011-04-07 00:58:44 +00:00
Johnny Chen	d4cced54b3	Should also check SMLAD for invalid register values. rdar://problem/9246650 llvm-svn: 129042	2011-04-07 00:50:25 +00:00
Owen Anderson	bdff1c997a	Teach the ARM peephole optimizer that RSB, RSC, ADC, and SBC can be used for folded comparisons, just like ADD and SUB. llvm-svn: 129038	2011-04-06 23:35:59 +00:00
Owen Anderson	f9bd6bad8a	Cleanups from Jim: remove redundant constraints and a dead FIXME. llvm-svn: 129036	2011-04-06 22:45:55 +00:00
Jim Grosbach	6ade7e0bac	Tidy up. llvm-svn: 129034	2011-04-06 22:35:47 +00:00
Johnny Chen	bd9a4f8d07	A8.6.393 The ARM disassembler should reject invalid (type, align) encodings as invalid instructions. So, instead of: Opcode=1641 Name=VST2b32_UPD Format=ARM_FORMAT_NLdSt(30) 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 ------------------------------------------------------------------------------------------------- \| 1: 1: 1: 1\| 0: 1: 0: 0\| 0: 0: 0: 0\| 0: 0: 1: 1\| 0: 0: 0: 0\| 1: 0: 0: 1\| 1: 0: 1: 1\| 0: 0: 1: 1\| ------------------------------------------------------------------------------------------------- vst2.32 {d0, d2}, [r3, :256], r3 we now have: Opcode=1641 Name=VST2b32_UPD Format=ARM_FORMAT_NLdSt(30) 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 ------------------------------------------------------------------------------------------------- \| 1: 1: 1: 1\| 0: 1: 0: 0\| 0: 0: 0: 0\| 0: 0: 1: 1\| 0: 0: 0: 0\| 1: 0: 0: 1\| 1: 0: 1: 1\| 0: 0: 1: 1\| ------------------------------------------------------------------------------------------------- mc-input.txt:1:1: warning: invalid instruction encoding 0xb3 0x9 0x3 0xf4 ^ llvm-svn: 129033	2011-04-06 22:14:48 +00:00
Johnny Chen	2ac486e387	A8.6.92 MCR (Encoding A1): if coproc == '101x' then SEE "Advanced SIMD and VFP" Since these "Advanced SIMD and VFP" instructions have more specfic encoding bits specified, if coproc == 10 or 11, we should reject the insn as invalid. rdar://problem/9239922 rdar://problem/9239596 llvm-svn: 129027	2011-04-06 20:49:02 +00:00
Johnny Chen	8bca174f48	Fix a bug in the disassembly of VGETLNs8 where the lane index was wrong. Also set the encoding bits (for A8.6.303, A8.6.328, A8.6.329) Inst{3-0} = 0b0000, in class NVLaneOp. rdar://problem/9240648 llvm-svn: 129015	2011-04-06 18:27:46 +00:00
Rafael Espindola	b4dd95b4f9	Add another case we are not optimizing. llvm-svn: 129012	2011-04-06 17:35:32 +00:00
Rafael Espindola	7a3b244d45	The original issue has been fixed by not doing unnecessary sign extensions. Change the test to force a sign extension and expose the problem again. llvm-svn: 129011	2011-04-06 17:19:35 +00:00
Johnny Chen	0ec0e98a6a	Add a missing opcode (SMLSLDX) to BadRegsMulFrm() function. Add more complete sanity check for LdStFrm instructions where if IBit (Inst{25}) is 1, Inst{4} should be 0. Otherwise, we should reject the insn as invalid. rdar://problem/9239347 rdar://problem/9239467 llvm-svn: 128977	2011-04-06 01:18:32 +00:00
Owen Anderson	867846b1f0	Reapply r128946 (pseudoization of various instructions), and fix the extra imp-def of CPSR it was adding. llvm-svn: 128965	2011-04-05 23:55:28 +00:00
Johnny Chen	f6e327c6a3	Fix a typo in the handling of PKHTB opcode, plus add sanity check for illegal register encodings for DisassembleArithMiscFrm(). rdar://problem/9238659 llvm-svn: 128958	2011-04-05 23:28:00 +00:00
Bob Wilson	d135c696c0	Clean up some code for clarity. llvm-svn: 128953	2011-04-05 23:03:25 +00:00
Owen Anderson	61e7a935bd	Revert r128946 while I figure out why it broke the buildbots. llvm-svn: 128951	2011-04-05 23:03:06 +00:00
Johnny Chen	c3656d29f6	A7.3 register encoding Qd -> bit[12] == 0 Qn -> bit[16] == 0 Qm -> bit[0] == 0 If one of these bits is 1, the instruction is UNDEFINED. rdar://problem/9238399 rdar://problem/9238445 llvm-svn: 128949	2011-04-05 22:57:07 +00:00
Owen Anderson	3501655ad9	Give RSBS and RSCS the pseudo treatment. llvm-svn: 128946	2011-04-05 22:42:54 +00:00
Johnny Chen	9da60e016b	ARM disassembler was erroneously accepting an invalid RSC instruction. Added checks for regs which should not be 15. rdar://problem/9237734 llvm-svn: 128945	2011-04-05 22:18:07 +00:00
Johnny Chen	25883487a1	ARM disassembler was erroneously accepting an invalid LSL instruction. For register-controlled shifts, we should check that the encoding constraint Inst{7} = 0 and Inst{4} = 1 is satisfied. rdar://problem/9237693 llvm-svn: 128941	2011-04-05 21:49:44 +00:00
Owen Anderson	77aa266de8	Fix bugs in the pseuo-ization of ADCS/SBCS pointed out by Jim, as well as doing the expansion earlier (using a custom inserter) to allow for the chance of predicating these instructions. llvm-svn: 128940	2011-04-05 21:48:57 +00:00
Johnny Chen	e9c644d4a0	The r128085 checkin modified the operand ordering for MRC/MRC2 instructions. Modify DisassembleCoprocessor() of ARMDisassemblerCore.cpp to react to the change. rdar://problem/9236873 llvm-svn: 128922	2011-04-05 20:32:23 +00:00
Johnny Chen	151582492d	ARM disassembler should flag (rGPRRegClassID, r13\|r15) as an error. llvm-svn: 128913	2011-04-05 19:42:11 +00:00
Jim Grosbach	d9dce561b6	Make second source operand of LDRD pre/post explicit. Finish what r128736 started. llvm-svn: 128903	2011-04-05 18:40:13 +00:00
Johnny Chen	33d3a9fadc	Constants with multiple encodings (ARM): An alternative syntax is available for a modified immediate constant that permits the programmer to specify the encoding directly. In this syntax, #<const> is instead written as #<byte>,#<rot>, where: <byte> is the numeric value of abcdefgh, in the range 0-255 <rot> is twice the numeric value of rotation, an even number in the range 0-30. llvm-svn: 128897	2011-04-05 18:02:46 +00:00
Johnny Chen	268d63f307	Check for invalid register encodings for UMAAL and friends where: if dLo == 15 \|\| dHi == 15 \|\| n == 15 \|\| m == 15 then UNPREDICTABLE; if dHi == dLo then UNPREDICTABLE; rdar://problem/9230202 llvm-svn: 128895	2011-04-05 17:43:10 +00:00
Owen Anderson	f7678b83d2	Convert ADCS and SBCS instructions into pseudos that are expanded to the ADC/ABC with the appropriate S-bit input value. llvm-svn: 128892	2011-04-05 17:24:25 +00:00
Bill Wendling	dd4dcd549b	Revamp the SjLj "dispatch setup" intrinsic. It needed to be moved closer to the setjmp statement, because the code directly after the setjmp needs to know about values that are on the stack. Also, the 'bitcast' of the function context was causing a dead load. This wouldn't be too horrible, except that at -O0 it wasn't optimized out, and because it wasn't using the correct base pointer (if there is a VLA), it would try to access a value from a garbage address. <rdar://problem/9130540> llvm-svn: 128873	2011-04-05 01:37:43 +00:00
Eric Christopher	b968f4defe	Just use BL all the time. It's safer that way. Fixes rdar://9184526 llvm-svn: 128869	2011-04-05 00:39:26 +00:00
Johnny Chen	9b3ccba636	Fix SRS/SRSW encoding bits. rdar://problem/9230801 ARM disassembler discrepancy: erroneously accepting SRS Plus add invalid-RFEorLDMIA-arm.txt test which should have been checked in with http://llvm.org/viewvc/llvm-project?view=rev&revision=128859. llvm-svn: 128864	2011-04-05 00:16:18 +00:00
Johnny Chen	782a60c117	A8.6.105 MUL Inst{15-12} should be specified as 0b0000. rdar://problem/9231168 ARM disassembler discrepancy: erroneously accepting MUL llvm-svn: 128862	2011-04-04 23:57:05 +00:00
Johnny Chen	a6129b4a7f	RFE encoding should also specify the "should be" encoding bits. rdar://problem/9229922 ARM disassembler discrepancy: erroneously accepting RFE Also LDC/STC instructions are predicated while LDC2/STC2 instructions are not, fixed while doing regression testings. llvm-svn: 128859	2011-04-04 23:39:08 +00:00
Joerg Sonnenberger	418f186a4b	Make OpcodeMask an unsigned long long literal to deal with overflow. llvm-svn: 128847	2011-04-04 21:38:17 +00:00
Johnny Chen	8372006296	Fix incorrect alignment for NEON VST2b32_UPD. rdar://problem/9225433 llvm-svn: 128841	2011-04-04 20:35:31 +00:00
Jakob Stoklund Olesen	13ce236c4c	Insert code in the right location when lowering PowerPC atomics. This causes defs to dominate uses, no instructions after terminators, and other goodness. llvm-svn: 128836	2011-04-04 17:57:29 +00:00
Bruno Cardoso Lopes	bda3632bcd	- Implement asm parsing support for LDRSBT, LDRHT, LDRSHT and STRHT also fix the encoding of the later. - Add a new encoding bit to describe the index mode used in AM3. - Teach printAddrMode3Operand to check by the addressing mode which index mode to print. - Testcases. llvm-svn: 128832	2011-04-04 17:18:19 +00:00
Akira Hatanaka	5ec2ead9b0	Move transformation of JmpLink and related nodes done during instruction selection to Legalize phase. llvm-svn: 128830	2011-04-04 17:11:07 +00:00
Jakob Stoklund Olesen	86e1a65ce5	PowerPC atomic pseudos clobber CR0, they don't read it. llvm-svn: 128829	2011-04-04 17:07:09 +00:00
Jakob Stoklund Olesen	7067bff976	Use X0 instead of R0 for the zero register on ppc64. The 32-bit R0 cannot be used where a 64-bit register is expected. llvm-svn: 128828	2011-04-04 17:07:06 +00:00
Joerg Sonnenberger	fc4789da4a	Add support for the VIA PadLock instructions. llvm-svn: 128826	2011-04-04 16:58:13 +00:00
Joerg Sonnenberger	cc53d9919f	Expand Op0Mask by one bit in preparation for the PadLock prefixes. Define most shift masks incrementally to reduce the redundant hard-coding. Introduce new shift for the VEX flags to replace the magic constant 32 in various places. llvm-svn: 128822	2011-04-04 15:58:30 +00:00
Jay Foad	11522097be	Remove some support for ReturnInsts with multiple operands, and for returning a scalar value in a function whose return type is a single- element structure or array. llvm-svn: 128810	2011-04-04 07:44:02 +00:00
Che-Liang Chiou	e34b271718	ptx: support setp's 4-operand format llvm-svn: 128767	2011-04-02 08:51:39 +00:00
Cameron Zwarich	6fe5c29430	Do some peephole optimizations to remove pointless VMOVs from Neon to integer registers that arise from argument shuffling with the soft float ABI. These instructions are particularly slow on Cortex A8. This fixes one half of <rdar://problem/8674845>. llvm-svn: 128759	2011-04-02 02:40:43 +00:00
Johnny Chen	8904cc49db	Fixed a bug in disassembly of STR_POST, where the immediate is the second operand in am2offset; instead of the second operand in addrmode_imm12. rdar://problem/9225289 llvm-svn: 128757	2011-04-02 02:24:54 +00:00
Akira Hatanaka	4111db6575	Undo changes mistakenly made in revision 128750. llvm-svn: 128751	2011-04-02 00:26:12 +00:00
Akira Hatanaka	977f555a76	Insert space before ';' to prevent warnings. llvm-svn: 128750	2011-04-02 00:15:58 +00:00
Johnny Chen	387b36eaae	Fixed MOVr for "should be" encoding bits for Inst{19-16} = 0b0000. rdar://problem/9224276 llvm-svn: 128749	2011-04-01 23:30:25 +00:00
Johnny Chen	6615fa1de0	MOVs should have Inst{19-16} as 0b0000, otherwise, the instruction is UNPREDICTABLE. rdar://problem/9224120 llvm-svn: 128748	2011-04-01 23:15:50 +00:00
Johnny Chen	1e1010f56f	Fix the instruction table entries for AI1_adde_sube_s_irs multiclass definition so that all the instruction have: let Inst{31-27} = 0b1110; // non-predicated Before, the ARM decoder was confusing: > 0x40 0xf3 0xb8 0x80 as: Opcode=16 Name=ADCSSrs Format=ARM_FORMAT_DPSOREGFRM(5) 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 ------------------------------------------------------------------------------------------------- \| 1: 0: 0: 0\| 0: 0: 0: 0\| 1: 0: 1: 1\| 1: 0: 0: 0\| 1: 1: 1: 1\| 0: 0: 1: 1\| 0: 1: 0: 0\| 0: 0: 0: 0\| ------------------------------------------------------------------------------------------------- adcs pc, r8, r0, asr #6 since the cond field for ADCSSrs is a wild card, and so is ADCrs, with the ADCSSrs having Inst{20} as '1'. Now, the AR decoder behaves correctly: > 0x40 0xf3 0xb8 0x80 > END Executing command: /Volumes/data/lldb/llvm/Debug+Asserts/bin/llvm-mc -disassemble -triple=arm-apple-darwin -debug-only=arm-disassembler mc-input.txt Opcode=19 Name=ADCrs Format=ARM_FORMAT_DPSOREGFRM(5) 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 ------------------------------------------------------------------------------------------------- \| 1: 0: 0: 0\| 0: 0: 0: 0\| 1: 0: 1: 1\| 1: 0: 0: 0\| 1: 1: 1: 1\| 0: 0: 1: 1\| 0: 1: 0: 0\| 0: 0: 0: 0\| ------------------------------------------------------------------------------------------------- adcshi pc, r8, r0, asr #6 > rdar://problem/9223094 llvm-svn: 128746	2011-04-01 22:32:51 +00:00
Evan Cheng	88530e6568	Avoid de-referencing pass beginning of a basic block. No small test case possible. rdar://9216009 llvm-svn: 128743	2011-04-01 22:09:28 +00:00
Akira Hatanaka	3d9df607ba	Remove redundant code. There are assignments to variables Base and Offset right after the code that is removed. llvm-svn: 128742	2011-04-01 21:56:02 +00:00
Akira Hatanaka	56d9ef53a2	Simplifies logic for printing target flags. llvm-svn: 128741	2011-04-01 21:41:06 +00:00
Owen Anderson	975ddf8035	When the architecture is explicitly armv6 or thumbv6, we need to mark the object file appropriately. llvm-svn: 128739	2011-04-01 21:07:39 +00:00
Jim Grosbach	360c369967	LDRD/STRD instructions should print both Rt and Rt2 in the asm string. llvm-svn: 128736	2011-04-01 20:26:57 +00:00
Johnny Chen	3dfb80afbf	Fix a LDRT/LDRBT decoding bug where for Encoding A2, if Inst{4} != 0, we should reject the instruction as invalid. llvm-svn: 128734	2011-04-01 20:21:38 +00:00
Akira Hatanaka	e625ba46b7	Modifies MipsAsmPrinter::isBlockOnlyReachableByFallthrough so that it handles delay slots correctly. llvm-svn: 128724	2011-04-01 18:57:38 +00:00
Johnny Chen	fe6fba3fe6	Fix LDRi12 immediate operand, which was changed to be the second operand in $addrmode_imm12 => (ops GPR:$base, i32imm:$offsimm). rdar://problem/9219356 llvm-svn: 128722	2011-04-01 18:26:38 +00:00
Akira Hatanaka	93f898f643	Add code for analyzing FP branches. Clean up branch Analysis functions. llvm-svn: 128718	2011-04-01 17:39:08 +00:00
Benjamin Kramer	bb21fac250	Initialize HasVMLxForwarding. llvm-svn: 128709	2011-04-01 09:20:31 +00:00
Evan Cheng	bd76679700	Issue libcalls __udivmodi4 / __divmodi4 for div / rem pairs. rdar://8911343 llvm-svn: 128696	2011-04-01 00:42:02 +00:00
Matt Beaumont-Gay	d911f92c46	Remove unused variables llvm-svn: 128692	2011-04-01 00:06:01 +00:00
Bruno Cardoso Lopes	ab8305063b	Apply again changes to support ARM memory asm parsing. I removed all LDR/STR changes and left them to a future patch. Passing all checks now. - Implement asm parsing support for LDRT, LDRBT, STRT, STRBT and fix the encoding wherever is possible. - Add a new encoding bit to describe the index mode used and teach printAddrMode2Operand to check by the addressing mode which index mode to print. - Testcases llvm-svn: 128689	2011-03-31 23:26:08 +00:00
Jakob Stoklund Olesen	0709342652	Provide a legal pointer register class when targeting thumb1. The LocalStackSlotAllocation pass was creating illegal registers. llvm-svn: 128687	2011-03-31 23:02:15 +00:00
Evan Cheng	38bf5adcea	Distribute (A + B) * C to (A * C) + (B * C) to make use of NEON multiplier accumulator forwarding: vadd d3, d0, d1 vmul d3, d3, d2 => vmul d3, d0, d2 vmla d3, d1, d2 llvm-svn: 128665	2011-03-31 19:38:48 +00:00
Johnny Chen	7b203f9cae	Fix single word and unsigned byte data transfer instruction encodings so that Inst{4} = 0. rdar://problem/9213022 llvm-svn: 128662	2011-03-31 19:28:35 +00:00
Akira Hatanaka	a535270d91	Added support for FP conditional move instructions and fixed bugs in handling of FP comparisons. llvm-svn: 128650	2011-03-31 18:26:17 +00:00
Johnny Chen	13baa0e650	Add BLXi to the instruction table for disassembly purpose. A8.6.23 BLX (immediate) rdar://problem/9212921 llvm-svn: 128644	2011-03-31 17:53:50 +00:00
Bruno Cardoso Lopes	c2452a6f1d	Revert r128632 again, until I figure out what break the tests llvm-svn: 128635	2011-03-31 15:54:36 +00:00
Richard Osborne	9a827b30ab	Add XCore intrinsics for initializing / starting / synchronizing threads. llvm-svn: 128633	2011-03-31 15:13:13 +00:00
Bruno Cardoso Lopes	4c0aebfb91	Reapply r128585 without generating a lib depedency cycle. An updated log: - Implement asm parsing support for LDRT, LDRBT, STRT, STRBT and {STR,LDC}{2}_{PRE,POST} fixing the encoding wherever is possible. - Move all instructions which use am2offset without a pattern to use addrmode2. - Add a new encoding bit to describe the index mode used and teach printAddrMode2Operand to check by the addressing mode which index mode to print. - Testcases llvm-svn: 128632	2011-03-31 14:52:28 +00:00
Matt Beaumont-Gay	73906b05ca	Revert "- Implement asm parsing support for LDRT, LDRBT, STRT, STRBT and" This revision introduced a dependency cycle, as nlewycky mentioned by email. llvm-svn: 128597	2011-03-31 00:39:16 +00:00
Owen Anderson	abda3caf67	Somehow we managed to forget to encode the lane index for a large swathe of NEON instructions. With this fix, the entire test-suite passes with the Thumb integrated assembler. llvm-svn: 128587	2011-03-30 23:45:29 +00:00
Evan Cheng	ee9d45dd55	Don't try to create zero-sized stack objects. llvm-svn: 128586	2011-03-30 23:44:13 +00:00
Bruno Cardoso Lopes	280264b889	- Implement asm parsing support for LDRT, LDRBT, STRT, STRBT and {STR,LDC}{2}_PRE. - Fixed the encoding in some places. - Some of those instructions were using am2offset and now use addrmode2. Codegen isn't affected, instructions which use SelectAddrMode2Offset were not touched. - Teach printAddrMode2Operand to check by the addressing mode which index mode to print. - This is a work in progress, more work to come. The idea is to change places which use am2offset to use addrmode2 instead, as to unify assembly parser. - Add testcases for assembly parser llvm-svn: 128585	2011-03-30 23:32:32 +00:00
Cameron Zwarich	53dd03d537	Add a ARM-specific SD node for VBSL so that forms with a constant first operand can be recognized. This fixes <rdar://problem/9183078>. llvm-svn: 128584	2011-03-30 23:01:21 +00:00
Akira Hatanaka	4e9ca1b3ba	fixed typo llvm-svn: 128574	2011-03-30 21:15:35 +00:00
Jay Foad	52131344a2	Remove PHINode::reserveOperandSpace(). Instead, add a parameter to PHINode::Create() giving the (known or expected) number of operands. llvm-svn: 128537	2011-03-30 11:28:46 +00:00
Evan Cheng	18381b4257	Add intrinsics @llvm.arm.neon.vmulls and @llvm.arm.neon.vmullu.* back. Frontends was lowering them to sext / uxt + mul instructions. Unfortunately the optimization passes may hoist the extensions out of the loop and separate them. When that happens, the long multiplication instructions can be broken into several scalar instructions, causing significant performance issue. Note the vmla and vmls intrinsics are not added back. Frontend will codegen them as intrinsics vmull* + add / sub. Also note the isel optimizations for catching mul + sext / zext are not changed either. First part of rdar://8832507, rdar://9203134 llvm-svn: 128502	2011-03-29 23:06:19 +00:00
Cameron Zwarich	143f9aea2b	Add Neon SINT_TO_FP and UINT_TO_FP lowering from v4i16 to v4f32. Fixes <rdar://problem/8875309> and <rdar://problem/9057191>. llvm-svn: 128492	2011-03-29 21:41:55 +00:00
Owen Anderson	7ac53ad643	Check early if this is an unsupported opcode, so that we can avoid needlessly instantiating the base register in some cases. llvm-svn: 128481	2011-03-29 20:27:38 +00:00
Johnny Chen	4bc2baeb28	A8.6.188 STC, STC2 The STC_OPTION and STC2_OPTION instructions should have their coprocessor option enclosed in {}. rdar://problem/9200661 llvm-svn: 128478	2011-03-29 19:49:38 +00:00
Owen Anderson	c48981f729	Add safety check that didn't show up in testing. llvm-svn: 128467	2011-03-29 17:42:25 +00:00
Owen Anderson	d6c5a741b5	Get rid of the non-writeback versions VLDMDB and VSTMDB, which don't actually exist. llvm-svn: 128461	2011-03-29 16:45:53 +00:00
Evan Cheng	e2086e740f	Optimizing (zext A + zext B) * C, to (VMULL A, C) + (VMULL B, C) during isel lowering to fold the zero-extend's and take advantage of no-stall back to back vmul + vmla: vmull q0, d4, d6 vmlal q0, d5, d6 is faster than vaddl q0, d4, d5 vmovl q1, d6 vmul q0, q0, q1 This allows us to vmull + vmlal for: f = vmull_u8( vget_high_u8(s), c); f = vmlal_u8(f, vget_low_u8(s), c); rdar://9197392 llvm-svn: 128444	2011-03-29 01:56:09 +00:00
Daniel Dunbar	3e2b335903	Integrated-As: Add support for setting the AllowTemporaryLabels flag via integrated-as. llvm-svn: 128431	2011-03-28 22:49:19 +00:00
Johnny Chen	f9cd139369	Fix ARM disassembly for PLD/PLDW/PLI which suffers from code rot and add some test cases. Add comments to ThumbDisassemblerCore.h for recent change made for t2PLD disassembly. llvm-svn: 128417	2011-03-28 18:41:58 +00:00
Che-Liang Chiou	cdedaf1f7d	ptx: clean up branch code a bit llvm-svn: 128405	2011-03-28 10:23:13 +00:00
Benjamin Kramer	8d2227373d	Make helper static. llvm-svn: 128338	2011-03-26 12:38:19 +00:00
Johnny Chen	923f3dac01	Fixed the t2PLD and friends disassembly and add two test cases. llvm-svn: 128322	2011-03-26 01:32:48 +00:00
Eric Christopher	d553096688	Fix the bfi handling for or (and a mask) (and b mask). We need the two masks to match inversely for the code as is to work. For the example given we actually want: bfi r0, r2, #1, #1 not #0, however, given the way the pattern is written it's not possible at the moment. Fixes rdar://9177502 llvm-svn: 128320	2011-03-26 01:21:03 +00:00
Johnny Chen	49316e40ba	Fix DisassembleThumb2DPReg()'s handling of RegClass. Cannot hardcode GPRRegClassID. Also add some test cases. rdar://problem/9189829 llvm-svn: 128304	2011-03-25 22:19:07 +00:00
Johnny Chen	aaf2c69400	DisassembleThumb2LdSt() did not handle t2LDRs correctly with respect to RegClass. Add two test cases. rdar://problem/9182892 llvm-svn: 128299	2011-03-25 19:35:37 +00:00
Johnny Chen	b35548f44d	Modify DisassembleThumb2LdStEx() to be more robust/correct in light of recent change to t2LDREX/t2STREX instructions. Add two test cases. llvm-svn: 128293	2011-03-25 18:29:49 +00:00
Benjamin Kramer	dc0082b087	Add a note. llvm-svn: 128286	2011-03-25 17:32:40 +00:00
Johnny Chen	aa84d41dfc	Instruction formats of SWP/SWPB were changed from LdStExFrm to MiscFrm. Modify the disassembler to handle that. rdar://problem/9184053 llvm-svn: 128285	2011-03-25 17:31:16 +00:00
Johnny Chen	757ca69770	Also need to handle invalid imod values for CPS2p. rdar://problem/9186136 llvm-svn: 128283	2011-03-25 17:03:12 +00:00
Jakob Stoklund Olesen	a1e3156ebd	Ignore special ARM allocation hints for unexpected register classes. Add an assertion to linear scan to prevent it from allocating registers outside the register class. <rdar://problem/9183021> llvm-svn: 128254	2011-03-25 01:48:18 +00:00
Johnny Chen	a52143bff3	Modify the wrong logic in the assert of DisassembleThumb2LdStDual() (the register classes were changed), modify the comment to be up-to-date, and add a test case for A8.6.66 LDRD (immediate) Encoding T1. llvm-svn: 128252	2011-03-25 01:09:48 +00:00
Matt Beaumont-Gay	303e3161bb	Suppress an unused variable warning in -asserts builds llvm-svn: 128244	2011-03-24 22:05:48 +00:00
Johnny Chen	9302df0ad9	Handle the added VBICivi NEON instructions, too. llvm-svn: 128243	2011-03-24 22:04:39 +00:00
Johnny Chen	02e59ad506	Plug a leak by ThumbDisassembler::getInstruction(), thanks to Benjamin Kramer! llvm-svn: 128241	2011-03-24 21:42:55 +00:00
Johnny Chen	6469ca0c33	T2 Load/Store Multiple: These instructions were changed to not embed the addressing mode within the MC instructions We also need to update the corresponding assert stmt. Also add a test case. llvm-svn: 128240	2011-03-24 21:36:56 +00:00
Benjamin Kramer	dd9eb21c3f	Plug a leak in the arm disassembler and put the tests back. llvm-svn: 128238	2011-03-24 21:14:28 +00:00
Bruno Cardoso Lopes	f170f8bff6	Add asm parsing support w/ testcases for strex/ldrex family of instructions llvm-svn: 128236	2011-03-24 21:04:58 +00:00
Johnny Chen	8bbc12824a	ADR was added with the wrong encoding for inst{24-21}, and the ARM decoder was fooled. Set the encoding bits to {0,?,?,0}, not 0. Plus delegate the disassembly of ADR to the more generic ADDri/SUBri instructions, and add a test case for that. llvm-svn: 128234	2011-03-24 20:42:48 +00:00
Jim Grosbach	a3df87fb01	Clean up assembly statement separator support. The MC asm lexer wasn't honoring a non-default (anything but ';') statement separator. Fix that, and generalize a bit to support multi-character statement separators. llvm-svn: 128227	2011-03-24 18:46:34 +00:00
Johnny Chen	c5207f7167	The r118201 added support for VORR (immediate). Update ARMDisassemblerCore.cpp to disassemble the VORRivi instructions properly within the DisassembleN1RegModImmFrm() function. Add a test case. llvm-svn: 128226	2011-03-24 18:40:38 +00:00
Johnny Chen	1dd041083d	Add comments to the handling of opcode CPS3p to reject invalid instruction encoding, a test case of invalid CPS3p encoding and one for invalid VLDMSDB due to regs out of range. llvm-svn: 128220	2011-03-24 17:04:22 +00:00
NAKAMURA Takumi	521eb7c11e	Target/X86: [PR8777][PR8778] Tweak alloca/chkstk for Windows targets. FIXME: Some cleanups would be needed. llvm-svn: 128206	2011-03-24 07:07:00 +00:00
Evan Cheng	f098bf1199	Nasty bug in ARMBaseInstrInfo::produceSameValue(). The MachineConstantPoolEntry entries being compared may not be ARMConstantPoolValue. Without checking whether they are ARMConstantPoolValue first, and if the stars and moons are aligned properly, the equality test may return true (when the first few words of two Constants' values happen to be identical) and very bad things can happen. rdar://9125354 llvm-svn: 128203	2011-03-24 06:20:03 +00:00
Johnny Chen	a75d158c41	CPS3p: Let's reject impossible imod values by returning false from the DisassembleMiscFrm() function. Fixed rdar://problem/9179416 ARM disassembler crash: "Unknown imod operand" (fuzz testing) Opcode=98 Name=CPS3p Format=ARM_FORMAT_MISCFRM(26) 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 ------------------------------------------------------------------------------------------------- \| 1: 1: 1: 1\| 0: 0: 0: 1\| 0: 0: 0: 0\| 0: 0: 1: 0\| 0: 0: 0: 1\| 1: 1: 0: 0\| 1: 0: 0: 1\| 0: 0: 1: 1\| ------------------------------------------------------------------------------------------------- Before: cpsUnknown imod operand UNREACHABLE executed at /Volumes/data/lldb/llvm/lib/Target/ARM/InstPrinter/../ARMBaseInfo.h:123! After: /Volumes/data/Radar/9179416/mc-input-arm.txt:1:1: warning: invalid instruction encoding 0x93 0x1c 0x2 0xf1 ^ llvm-svn: 128192	2011-03-24 02:24:36 +00:00
Johnny Chen	0f5d52d658	Load/Store Multiple: These instructions were changed to not embed the addressing mode within the MC instructions We also need to update the corresponding assert stmt. Also add two test cases. llvm-svn: 128191	2011-03-24 01:40:42 +00:00
Johnny Chen	1de8cc6f95	STRT and STRBT was incorrectly tagged as IndexModeNone during the refactorings (r119821). We now tag them as IndexModePost. llvm-svn: 128189	2011-03-24 01:07:26 +00:00
Johnny Chen	f949d8e13d	The r128103 fix to cope with the removal of addressing modes from the MC instructions were incomplete. The assert stmt needs to be updated and the operand index incrment is wrong. Fix the bad logic and add some sanity checking to detect bad instruction encoding; and add a test case. llvm-svn: 128186	2011-03-24 00:28:38 +00:00
Devang Patel	abc77347a7	Enable GlobalMerge on darwin. llvm-svn: 128183	2011-03-23 23:34:19 +00:00
Andrew Trick	4ab9a16569	Revert r128175. I'm backing this out for the second time. It was supposed to be fixed by r128164, but the mingw self-host must be defeating the fix. llvm-svn: 128181	2011-03-23 23:11:02 +00:00
Evan Cheng	425489d397	Cmp peephole optimization isn't always safe for signed arithmetics. int tries = INT_MAX; while (tries > 0) { tries--; } The check should be: subs r4, #1 cmp r4, #0 bgt LBB0_1 The subs can set the overflow V bit when r4 is INT_MAX+1 (which loop canonicalization apparently does in this case). cmp #0 would have cleared it while not changing the N and Z bits. Since BGT is dependent on the V bit, i.e. (N == V) && !Z, it is not safe to eliminate the cmp #0. rdar://9172742 llvm-svn: 128179	2011-03-23 22:52:04 +00:00
Andrew Trick	4046a0de91	Reapply Eli's r127852 now that the pre-RA scheduler can spill EFLAGS. (target-specific branchless method for double-width relational comparisons on x86) llvm-svn: 128175	2011-03-23 22:16:02 +00:00
Owen Anderson	8543d4f8a1	The high bit of a Thumb2 ADR's offset is stored in bit 26, not bit 25. This fixes 464.h264ref with the integrated assembler. llvm-svn: 128172	2011-03-23 22:03:44 +00:00
Justin Holewinski	06c8a38223	PTX: Improve support for 64-bit addressing - Fix bug in ADDRrr/ADDRri/ADDRii selection for 64-bit addresses - Add comparison selection for i64 - Add zext selection for i32 -> i64 - Add shl/shr/sha support for i64 llvm-svn: 128153	2011-03-23 16:58:51 +00:00
Johnny Chen	7ca3ddc233	For ARM Disassembler, start a newline to dump the opcode and friends for an instruction. Change inspired by llvm-bug 9530 submitted by Jyun-Yan You. llvm-svn: 128122	2011-03-22 23:49:46 +00:00
Johnny Chen	30350cdbdf	LDRT and LDRBT was incorrectly tagged as IndexModeNone during the refactorings (r119821). We now tag them as IndexModePost. This fixed http://llvm.org/bugs/show_bug.cgi?id=9530. llvm-svn: 128113	2011-03-22 22:28:49 +00:00
Eli Friedman	822e7bc061	A bit more analysis of a memset-related README entry. llvm-svn: 128107	2011-03-22 20:49:53 +00:00
Johnny Chen	230268261b	A8.6.399 VSTM: VFP Load/Store Multiple Instructions used to embed the IA/DB addressing mode within the MC instruction; that has been changed so that now, for example, VSTMDDB_UPD and VSTMDIA_UPD are two instructions. Update the ARMDisassemblerCore.cpp's DisassembleVFPLdStMulFrm() to reflect the change. Also add a test case. llvm-svn: 128103	2011-03-22 20:00:10 +00:00
Eric Christopher	a5a779ef45	Migrate the fix in r128041 to ARM's fastisel support as well. Fixes rdar://9169640 llvm-svn: 128100	2011-03-22 19:39:17 +00:00
Bruno Cardoso Lopes	f922b20922	Change MRC and MRC2 instructions to model the output register properly llvm-svn: 128085	2011-03-22 15:06:24 +00:00
Che-Liang Chiou	7413080cea	ptx: add analyze/insert/remove branch llvm-svn: 128084	2011-03-22 14:12:00 +00:00
Matt Beaumont-Gay	bfd23e4009	Avoid -Wunused-variable in -asserts builds llvm-svn: 128048	2011-03-22 00:37:28 +00:00
Dan Gohman	c1783b31a4	Fix fast-isel address mode folding to avoid folding instructions outside of the current basic block. This fixes PR9500, rdar://9156159. llvm-svn: 128041	2011-03-22 00:04:35 +00:00
Bill Wendling	00f0cddfd4	We need to pass the TargetMachine object to the InstPrinter if we are printing the alias of an InstAlias instead of the thing being aliased. Because we need to know the features that are valid for an InstAlias. This is part of a work-in-progress. llvm-svn: 127986	2011-03-21 04:13:46 +00:00
Eli Friedman	8e15a661bf	This README entry was fixed recently. llvm-svn: 127982	2011-03-21 01:33:03 +00:00
Evan Cheng	0663f23bd8	Re-apply r127953 with fixes: eliminate empty return block if it has no predecessors; update dominator tree if cfg is modified. llvm-svn: 127981	2011-03-21 01:19:09 +00:00
Daniel Dunbar	327cd36f74	Revert r127953, "SimplifyCFG has stopped duplicating returns into predecessors to canonicalize IR", it broke a lot of things. llvm-svn: 127954	2011-03-19 21:47:14 +00:00
Evan Cheng	824a711305	SimplifyCFG has stopped duplicating returns into predecessors to canonicalize IR to have single return block (at least getting there) for optimizations. This is general goodness but it would prevent some tailcall optimizations. One specific case is code like this: int f1(void); int f2(void); int f3(void); int f4(void); int f5(void); int f6(void); int foo(int x) { switch(x) { case 1: return f1(); case 2: return f2(); case 3: return f3(); case 4: return f4(); case 5: return f5(); case 6: return f6(); } } => LBB0_2: ## %sw.bb callq _f1 popq %rbp ret LBB0_3: ## %sw.bb1 callq _f2 popq %rbp ret LBB0_4: ## %sw.bb3 callq _f3 popq %rbp ret This patch teaches codegenprep to duplicate returns when the return value is a phi and where the phi operands are produced by tail calls followed by an unconditional branch: sw.bb7: ; preds = %entry %call8 = tail call i32 @f5() nounwind br label %return sw.bb9: ; preds = %entry %call10 = tail call i32 @f6() nounwind br label %return return: %retval.0 = phi i32 [ %call10, %sw.bb9 ], [ %call8, %sw.bb7 ], ... [ 0, %entry ] ret i32 %retval.0 This allows codegen to generate better code like this: LBB0_2: ## %sw.bb jmp _f1 ## TAILCALL LBB0_3: ## %sw.bb1 jmp _f2 ## TAILCALL LBB0_4: ## %sw.bb3 jmp _f3 ## TAILCALL rdar://9147433 llvm-svn: 127953	2011-03-19 17:17:39 +00:00
Nadav Rotem	e7a101ccab	Add support for legalizing UINT_TO_FP of vectors on platforms which do not have native support for this operation (such as X86). The legalized code uses two vector INT_TO_FP operations and is faster than scalarizing. llvm-svn: 127951	2011-03-19 13:09:10 +00:00
Johnny Chen	0c5f670fe7	Fixed an assert by the ARM disassembler for LDRD_PRE/POST. The relevant instruction table entries were changed sometime ago to no longer take <Rt2> as an operand. Modify ARMDisassemblerCore.cpp to accomodate the change and add a test case. llvm-svn: 127935	2011-03-19 01:16:20 +00:00
Owen Anderson	1d2f5cebe4	Add support to the ARM asm parser for the register-shifted-register forms of basic instructions like ADD. More work left to be done to support other instances of shifter ops in the ISA. llvm-svn: 127917	2011-03-18 22:50:18 +00:00

... 4 5 6 7 8 ...

17890 Commits