llvm-project

Commit Graph

Author	SHA1	Message	Date
Devang Patel	2266aa84a1	Refactor. llvm-svn: 129938	2011-04-21 21:07:35 +00:00
Jay Foad	5514afe6b2	PR9214: Convert Metadata API to use ArrayRef. llvm-svn: 129932	2011-04-21 19:59:31 +00:00
Matt Beaumont-Gay	70597d4e50	Don't recycle loop variables. llvm-svn: 129928	2011-04-21 19:46:23 +00:00
Jakob Stoklund Olesen	6a663b8dc8	Allow allocatable ranges from global live range splitting to be split again. These intervals are allocatable immediately after splitting, but they may be evicted because of later splitting. This is rare, but when it happens they should be split again. The remainder intervals that cannot be allocated after splitting still move directly to spilling. SplitEditor::finish can optionally provide a mapping from new live intervals back to the original interval indexes returned by openIntv(). Each original interval index can map to multiple new intervals after connected components have been separated. Dead code elimination may also add existing intervals to the list. The reverse mapping allows the SplitEditor client to treat the new intervals differently depending on the split region they came from. llvm-svn: 129925	2011-04-21 18:38:15 +00:00
Rafael Espindola	c3dc486752	Fix relative relocations. This is sufficient for running the rust testsuite with MC :-) llvm-svn: 129923	2011-04-21 18:36:50 +00:00
Devang Patel	46bda61a81	As per ARM docs, register Dx is described as DW_OP_regx(256+x) in DWARF. llvm-svn: 129922	2011-04-21 17:51:06 +00:00
Devang Patel	28f2719d83	Add comment in output stream. llvm-svn: 129921	2011-04-21 17:50:24 +00:00
Daniel Dunbar	6309828206	Revert r129656, "Fix rdar://9289512 - not folding load into compare at -O0...", which broke a couple GCC test suite tests at -O0. llvm-svn: 129914	2011-04-21 16:14:46 +00:00
Justin Holewinski	d74d88a861	PTX: Expand useable register space llvm-svn: 129913	2011-04-21 16:08:02 +00:00
Che-Liang Chiou	14c48e5d66	ptx: fix parameter ordering This patch depends on the prior fix r129908 that changes to use std::find, rather than std::binary_search, on unordered array. Patch by Dan Bailey llvm-svn: 129909	2011-04-21 10:56:58 +00:00
Che-Liang Chiou	cdc51569ee	ptx: PTXMachineFunctionInfo no longer sorts registers and so should not use std::binary_search llvm-svn: 129908	2011-04-21 10:16:20 +00:00
Nick Lewycky	8411b5511e	In gcov profiling, give all functions an extra unified return block. This is necessary since gcov counts transitions between blocks. It can't see if you've run every line in a straight-line function, so we add an edge for it to notice. llvm-svn: 129905	2011-04-21 03:18:00 +00:00
Nick Lewycky	ed749d8c94	Fix think-o: emit all 8 bytes of the EOF marker. Also reflow a line in a comment for 80 columns. llvm-svn: 129904	2011-04-21 02:48:39 +00:00
Nick Lewycky	8e0a38f88a	Add independent controls for whether GCOV profiling should emit .gcno files or instrument the program to emit .gcda. TODO: we should emit slightly different .gcda files when .gcno emission is off. llvm-svn: 129903	2011-04-21 01:56:25 +00:00
Nick Lewycky	f735b7b845	Structs have elements not parameters. I'm surprised this ever compiled... llvm-svn: 129888	2011-04-20 22:52:37 +00:00
Evan Cheng	5f1ba4cd2d	Remove -use-divmod-libcall. Let targets opt in when they are available. llvm-svn: 129884	2011-04-20 22:20:12 +00:00
Jakob Stoklund Olesen	86e53ced08	Add debug output for rematerializable instructions. llvm-svn: 129883	2011-04-20 22:14:20 +00:00
Jakob Stoklund Olesen	90d79bdcd2	Permit remat when a virtual register has multiple defs. TII::isTriviallyReMaterializable() shouldn't depend on any properties of the register being defined by the instruction. Rematerialization is going to create a new virtual register anyway. llvm-svn: 129882	2011-04-20 22:14:17 +00:00
Cameron Zwarich	ca4c633489	Fix another case of <rdar://problem/9184212> that only occurs with code generated by llvm-gcc, since llvm-gcc uses 2 i64s for passing a 4 x float vector on ARM rather than an i64 array like Clang. llvm-svn: 129878	2011-04-20 21:48:38 +00:00
Cameron Zwarich	76dfa226cf	The bitcast case here is actually handled uniformly earlier in the function, so delete it. llvm-svn: 129877	2011-04-20 21:48:34 +00:00
Cameron Zwarich	4cd9a4a975	Cleanup some code to better use an early return style in preparation for adding more cases. llvm-svn: 129876	2011-04-20 21:48:16 +00:00
Eli Friedman	c93d399eed	Revert r129846; it's breaking a buildbot. See http://google1.osuosl.org:8011/builders/llvm-x86_64-linux-checks/builds/825/steps/test.llvm.stage2/logs/st.ll llvm-svn: 129869	2011-04-20 19:00:08 +00:00
Jakob Stoklund Olesen	0e34c1dfac	Prefer cheap registers for busy live ranges. On the x86-64 and thumb2 targets, some registers are more expensive to encode than others in the same register class. Add a CostPerUse field to the TableGen register description, and make it available from TRI->getCostPerUse. This represents the cost of a REX prefix or a 32-bit instruction encoding required by choosing a high register. Teach the greedy register allocator to prefer cheap registers for busy live ranges (as indicated by spill weight). llvm-svn: 129864	2011-04-20 18:19:48 +00:00
Stuart Hastings	7850af6ea0	Excise unintended hunk in 129858. <rdar://problem/7662569> llvm-svn: 129862	2011-04-20 18:09:26 +00:00
Stuart Hastings	45fe3c38c5	ARM byval support. Will be enabled by another patch to the FE. <rdar://problem/7662569> llvm-svn: 129858	2011-04-20 16:47:52 +00:00
Daniel Dunbar	8991d611fd	sys/Host: Change getHostTriple() to return the full Darwin version on OS X. llvm-svn: 129852	2011-04-20 15:44:33 +00:00
Justin Holewinski	7d8895e767	PTX: Add intrinsics to list of built-in intrinsics, which allows them to be used by Clang. To help Clang integration, the PTX target has been split into two targets: ptx32 and ptx64, depending on the desired pointer size. - Add GCCBuiltin class to all intrinsics - Split PTX target into ptx32 and ptx64 llvm-svn: 129851	2011-04-20 15:37:17 +00:00
Rafael Espindola	ed16477cb9	Behave like gnu as when a relocation crosses sections. llvm-svn: 129850	2011-04-20 14:01:45 +00:00
Che-Liang Chiou	6586f84685	ptx: add integer div and rem instruction Patched by Dan Bailey llvm-svn: 129848	2011-04-20 09:28:55 +00:00
Che-Liang Chiou	5a952b3c67	ptx: add floating-point comparison to setp Patched by Dan Bailey llvm-svn: 129847	2011-04-20 09:28:20 +00:00
Che-Liang Chiou	49160f9a71	ptx: fix parameter ordering Patched by Dan Bailey llvm-svn: 129846	2011-04-20 09:27:19 +00:00
Nick Lewycky	4dae63e35b	This should always be signed chars, so use int8_t. This fixes a miscompile when llvm is built with unsigned chars where an immediate such as 0xff would be zero extended to 64-bits, turning "cmp $0xff,%eax" into "cmp $0xffffffffffffffff,%eax". llvm-svn: 129845	2011-04-20 03:19:42 +00:00
Rafael Espindola	e473aaf540	Remove unused arguments. llvm-svn: 129844	2011-04-20 03:08:09 +00:00
Eric Christopher	bcaedb5ce0	Rewrite the expander for umulo/smulo to remember to sign extend the input manually and pass all (now) 4 arguments to the mul libcall. Add a new ExpandLibCall for just this (copied gratuitously from type legalization). Fixes rdar://9292577 llvm-svn: 129842	2011-04-20 01:19:45 +00:00
Sean Callanan	d897f39797	Made the MC disassembler check before accessing MCInst operands for ARM. This allows it to be more tolerant of malformed MCInsts or incorrect instruction metadata. llvm-svn: 129840	2011-04-20 00:43:34 +00:00
Daniel Dunbar	cd01ed5bd6	ADT/Triple: Rename isOSX... methods to isMacOSX for consistency with the OS triple component. llvm-svn: 129838	2011-04-20 00:14:25 +00:00
Johnny Chen	dc62e59776	Fix typo in the comment. llvm-svn: 129837	2011-04-19 23:58:52 +00:00
Daniel Dunbar	924699845a	ADT/Triple: Drop support for -osx style triples, we are going with -macosx instead. llvm-svn: 129836	2011-04-19 23:55:20 +00:00
Daniel Dunbar	0854f347d2	ADT/Triple: Add support for Triple::MacOSX per feedback from Chris, will remove Triple::OSX once Clang has moved. llvm-svn: 129833	2011-04-19 23:34:12 +00:00
Daniel Dunbar	2b9b0e3748	ADT/Triple: Move a variety of clients to using isOSDarwin() and isOSWindows() predicates. llvm-svn: 129816	2011-04-19 21:14:45 +00:00
Daniel Dunbar	163a0966a9	ADT/Triple: Add isOSDarwin() and isOSWindows() helper functions. llvm-svn: 129815	2011-04-19 21:12:05 +00:00
Daniel Dunbar	3c0fbce10b	ADT/Triple: Fix Triple::getArchNameForAssembler to support OSX and iOS enumeration values. llvm-svn: 129814	2011-04-19 21:07:03 +00:00
Daniel Dunbar	100455a3c8	Target/X86: Eliminate uses of getDarwinVers(). llvm-svn: 129813	2011-04-19 21:04:12 +00:00
Daniel Dunbar	44b530369d	Target/X86: Add getTargetTriple() accessor. llvm-svn: 129812	2011-04-19 21:01:47 +00:00
Daniel Dunbar	e3de896b5e	Target/PPC: Kill off DarwinVers, which is now dead. llvm-svn: 129811	2011-04-19 20:59:24 +00:00
Daniel Dunbar	f954a0f028	Target/PPC: Eliminate a use of getDarwinVers(). llvm-svn: 129810	2011-04-19 20:57:03 +00:00
Daniel Dunbar	a37aab2515	Target/PPC: Add a TargetTriple field. llvm-svn: 129809	2011-04-19 20:54:28 +00:00
Daniel Dunbar	9483bb6bf3	Target: Eliminate a use of getDarwinMajorNumber(). llvm-svn: 129803	2011-04-19 20:44:08 +00:00
Daniel Dunbar	4a7783b0c2	CodeGen: Eliminate a use of getDarwinMajorNumber(). - There is a minor semantic change here (evidenced by the test change) for Darwin triples that have no version component. I debated changing the default behavior of isOSVersionLT, but decided it made more sense for triples to be explicit. llvm-svn: 129802	2011-04-19 20:32:39 +00:00
Daniel Dunbar	99f904c72d	ADT/Triple: Generalize and simplify getDarwinNumber to just be getOSVersion. llvm-svn: 129799	2011-04-19 20:24:34 +00:00
Daniel Dunbar	d74bac70c4	ADT/Triple: Add support for more explicit "osx" and "ios" OS names. llvm-svn: 129798	2011-04-19 20:19:27 +00:00
Stuart Hastings	468086d5e1	Delete unnecessary variable. <rdar://problem/7662569> llvm-svn: 129796	2011-04-19 20:09:38 +00:00
Eric Christopher	c721b0db6d	Remove some duplicate op action entries and reorganize. llvm-svn: 129781	2011-04-19 18:49:19 +00:00
Bob Wilson	0858c3aaed	This patch combines several changes from Evan Cheng for rdar://8659675. Making use of VFP / NEON floating point multiply-accumulate / subtraction is difficult on current ARM implementations for a few reasons. 1. Even though a single vmla has latency that is one cycle shorter than a pair of vmul + vadd, a RAW hazard during the first (4? on Cortex-a8) can cause additional pipeline stall. So it's frequently better to simply codegen vmul + vadd. 2. A vmla followed by a vmul, vmadd, or vsub causes the second fp instruction to stall for 4 cycles. We need to schedule them apart. 3. A vmla followed by a vmla is a special case. Obviously, issuing back to back RAW vmla + vmla is very bad. But this isn't ideal either: vmul vadd vmla Instead, we want to expand the second vmla: vmla vmul vadd Even with the 4 cycle vmul stall, the second sequence is still 2 cycles faster. Up to now, isel simply avoided codegen'ing fp vmla / vmls. This works well enough but it isn't the optimal solution. This patch attempts to make it possible to use vmla / vmls in cases where it is profitable. A. Add missing isel predicates which cause vmla to be codegen'ed. B. Make sure the fmul in (fadd (fmul)) has a single use. We don't want to compute a fmul and a fmla. C. Add additional isel checks for vmla, avoid cases where vmla is feeding into fp instructions (except for the #3 exceptional case). D. Add ARM hazard recognizer to model the vmla / vmls hazards. E. Add a special pre-regalloc case to expand vmla / vmls when it's likely the vmla / vmls will trigger one of the special hazards. Enable these fp vmlx codegen changes for Cortex-A9. llvm-svn: 129775	2011-04-19 18:11:57 +00:00
Bob Wilson	d04a83f8f2	Add -mcpu=cortex-a9-mp. It's cortex-a9 with MP extension. rdar://8648637. llvm-svn: 129774	2011-04-19 18:11:52 +00:00
Bob Wilson	a2881ee8a4	Avoid some 's' 16-bit instruction which partially update CPSR (and add false dependency) when it isn't dependent on last CPSR defining instruction. rdar://8928208 llvm-svn: 129773	2011-04-19 18:11:49 +00:00
Bob Wilson	df612ba006	Avoid write-after-write issue hazards for Cortex-A9. Add an avoidWriteAfterWrite() target hook to identify register classes that suffer from write-after-write hazards. For those register classes, try to avoid writing the same register in two consecutive instructions. This is currently disabled by default. We should not spill to avoid hazards! The command line flag -avoid-waw-hazard can be used to enable waw avoidance. llvm-svn: 129772	2011-04-19 18:11:45 +00:00
Bob Wilson	3e5944d96b	Some single-precision VFP instructions can execute in either the VFP or Neon pipelines, at least on Cortex-A9. llvm-svn: 129771	2011-04-19 18:11:38 +00:00
Bob Wilson	f33715e554	Improvements for the Cortex-A9 scheduling itineraries. llvm-svn: 129770	2011-04-19 18:11:36 +00:00
Eli Friedman	ee92a6b332	Add support for FastISel'ing varargs calls. llvm-svn: 129765	2011-04-19 17:22:22 +00:00
Jakob Stoklund Olesen	af12138d10	Force the greedy register allocator to be linked alongside linear scan. This means that the new register allocator can be used with 'clang -mllvm -regalloc=greedy'. llvm-svn: 129764	2011-04-19 17:17:58 +00:00
Eli Friedman	bcd09b3a7f	SelectBasicBlock is rather slow even when it doesn't do anything; skip the unnecessary work where possible. llvm-svn: 129763	2011-04-19 17:01:08 +00:00
Stuart Hastings	0b68c1219f	Support nested CALLSEQ_BEGIN/END; necessary for ARM byval support. <rdar://problem/7662569> llvm-svn: 129761	2011-04-19 16:16:58 +00:00
Jay Foad	6a85be25a4	Trivial simplification. llvm-svn: 129759	2011-04-19 15:23:29 +00:00
Chris Lattner	91328b317b	Implement support for x86 fastisel of small fixed-sized memcpys, which are generated en masse for C++ PODs. On my c++ test file, this cuts the fast isel rejects by 10x and shrinks the generated .s file by 5% llvm-svn: 129755	2011-04-19 05:52:03 +00:00
Chris Lattner	34a08c2344	tidy up llvm-svn: 129753	2011-04-19 05:15:59 +00:00
Chris Lattner	5f4b783426	Implement support for fast isel of calls of i1 arguments, even though they are illegal, when they are a truncate from something else. This eliminates fully half of all the fastisel rejections on a test c++ file I'm working with, which should make a substantial improvement for -O0 compile of c++ code. This fixed rdar://9297003 - fast isel bails out on all functions taking bools llvm-svn: 129752	2011-04-19 05:09:50 +00:00
Chris Lattner	d7f7c93914	Handle i1/i8/i16 constant integer arguments to calls by prepromoting them. Before we would bail out on i1 arguments altogether, now we just bail on non-constant ones. Also, we used to emit extraneous code. e.g. test12 was: movb $0, %al movzbl %al, %edi callq _test12 and test13 was: movb $0, %al xorl %edi, %edi movb %al, 7(%rsp) callq _test13f Now we get: movl $0, %edi callq _test12 and: movl $0, %edi callq _test13f llvm-svn: 129751	2011-04-19 04:42:38 +00:00
Chris Lattner	c59290a34c	be layout aware, to produce: testb $1, %al je LBB0_2 ## BB#1: ## %if.then movb $0, %al instead of: testb $1, %al jne LBB0_1 jmp LBB0_2 LBB0_1: ## %if.then movb $0, %al how 'bout that. llvm-svn: 129749	2011-04-19 04:26:32 +00:00
Chris Lattner	2c8a4c3b1b	fix rdar://9297006 - fast isel bails out on trunc to i1 -> bools cry, a common cause of fast isel rejects on c++ code. llvm-svn: 129748	2011-04-19 04:22:17 +00:00
Evan Cheng	7d6cd4902e	Change A9 scheduling itineraries VLD* / VST* entries default to "aligned". That is, it assumes addresses are 64-bit aligned (which should be the more common case). If the address is found not to be aligned, then getOperandLatency() would adjust the operand latency computation by one to compensate for it. rdar://9294833 llvm-svn: 129742	2011-04-19 01:21:49 +00:00
Evan Cheng	4079133796	Do not lose mem_operands while lowering VLD / VST intrinsics. llvm-svn: 129738	2011-04-19 00:04:03 +00:00
Devang Patel	0c7732499b	Use ArrayRef variants. llvm-svn: 129735	2011-04-18 23:51:03 +00:00
Ted Kremenek	28af26d878	Add BumpPtrAllocator::getTotalMemory() to allow clients to query how much memory a BumpPtrAllocator allocated. llvm-svn: 129727	2011-04-18 22:44:46 +00:00
Jim Grosbach	ddac5dd269	Trim a few unneeded includes. llvm-svn: 129723	2011-04-18 21:35:54 +00:00
Eric Christopher	2e3fbaab39	Invert the meaning of printAliasInstr's return value. It now returns true on success and false on failure. Update callers. llvm-svn: 129722	2011-04-18 21:28:11 +00:00
Eli Friedman	ec138b4b27	Simplify declarations slightly by using typedefs. llvm-svn: 129720	2011-04-18 21:21:37 +00:00
Eli Friedman	b2545fbc2a	malloc elimination: it's a bad idea to use raw_svector_ostream on a small heap-allocated SmallString because it unconditionally forces a malloc. (Revised version of r129688, with the necessary flush() call.) llvm-svn: 129716	2011-04-18 20:54:46 +00:00
Devang Patel	17740e70d5	Reduce clutter in asm output. Do not emit source location as comment for each instruction. llvm-svn: 129715	2011-04-18 20:26:49 +00:00
Jakob Stoklund Olesen	9f294a9e52	Handle spilling around an instruction that has an early-clobber re-definition of the spilled register. This is quite common on ARM now that some stores have early-clobber defines. llvm-svn: 129714	2011-04-18 20:23:27 +00:00
Sean Callanan	5d73033e0f	Small fix to the ARM AsmParser to ensure that a superclass variable is instantiated properly. llvm-svn: 129713	2011-04-18 20:20:44 +00:00
Eric Christopher	c37aa0b26a	Fix a bug where we were counting the alias sets as completely used registers for fast allocation a different way. This has us updating used registers only when we're using that exact register. Fixes rdar://9207598 llvm-svn: 129711	2011-04-18 19:26:25 +00:00
Chandler Carruth	2b1ba48f8d	Mark some functions as used which are used within debug-only code. This silences Clang's -Wunused-function when building in release mode. llvm-svn: 129709	2011-04-18 18:49:44 +00:00
Chris Lattner	48f75ad678	while we're at it, handle 'sdiv exact' of a power of 2 also, this fixes a few rejects on c++ iterator loops. llvm-svn: 129694	2011-04-18 07:00:40 +00:00
Chris Lattner	562d6e82bd	fix rdar://9297011 - udiv by power of two causing fast-isel rejects llvm-svn: 129693	2011-04-18 06:55:51 +00:00
Chris Lattner	80254a53cc	Add a new bit that ImmLeaf's can opt into, which allows them to duck out of the generated FastISel. X86 doesn't need to generate code to match ADD16ri8 since ADD16ri will do just fine. This is a small codesize win in the generated instruction selector. llvm-svn: 129692	2011-04-18 06:36:55 +00:00
Eli Friedman	3f8ecf5cc5	Revert r129688; it's breaking buildbots. llvm-svn: 129689	2011-04-18 05:54:54 +00:00
Eli Friedman	2dc287a147	More malloc elimination: it's a bad idea to use raw_svector_ostream on a small heap-allocated SmallString because it unconditionally forces a malloc. llvm-svn: 129688	2011-04-18 05:38:58 +00:00
Eli Friedman	0e40208d7b	Make the StringMaps attached to MCContext use the MCContext's allocator; reduces the number of calls to malloc(). llvm-svn: 129687	2011-04-18 05:02:31 +00:00
Chris Lattner	c479e0631f	switch the rest of the x86 immediate patterns over to ImmLeaf, simplifying them and exposing more information to tblgen. It would be nice if other target authors adopted this as well, particularly arm since it has fastisel. llvm-svn: 129676	2011-04-17 22:12:55 +00:00
Chris Lattner	2ff8c1a25f	now that predicates have a decent abstraction layer on them, introduce a new kind of predicate: one that is specific to imm nodes. The predicate function specified here just checks an int64_t directly instead of messing around with SDNode's. The virtue of this is that it means that fastisel and other things can reason about these predicates. llvm-svn: 129675	2011-04-17 22:05:17 +00:00
Chris Lattner	514e292b72	Rework our internal representation of node predicates to expose more structure and fix some fixmes. We now have a TreePredicateFn class that handles all of the decoding of these things. This is an internal cleanup that has no impact on the code generated by tblgen. llvm-svn: 129670	2011-04-17 21:38:24 +00:00
Chris Lattner	b53ccb8e36	1. merge fast-isel-shift-imm.ll into fast-isel-x86-64.ll 2. implement rdar://9289501 - fast isel should fold trivial multiplies to shifts 3. teach tblgen to handle shift immediates that are different sizes than the shifted operands, eliminating some code from the X86 fast isel backend. 4. Have FastISel::SelectBinaryOp use (the poorly named) FastEmit_ri_ function instead of FastEmit_ri to simplify code. llvm-svn: 129666	2011-04-17 20:23:29 +00:00
Chris Lattner	eb729d48ff	fix an x86 fast isel issue where we'd completely give up on folding an address when we have a global variable base and an index. Instead, just give up on folding the global variable. Before we'd generate: _test: ## @test ## BB#0: movq _rtx_length@GOTPCREL(%rip), %rax leaq (%rax), %rax addq %rdi, %rax movzbl (%rax), %eax ret now we generate: _test: ## @test ## BB#0: movq _rtx_length@GOTPCREL(%rip), %rax movzbl (%rax,%rdi), %eax ret The difference is even more significant when there is a scale involved. This fixes rdar://9289558 - total fail with addr mode formation at -O0/x86-64 llvm-svn: 129664	2011-04-17 17:47:38 +00:00
Chris Lattner	4832660b4d	fix an oversight which caused us to compile the testcase (and other less trivial things) into a dummy lea. Before we generated: _test: ## @test movq _G@GOTPCREL(%rip), %rax leaq (%rax), %rax ret now we produce: _test: ## @test movq _G@GOTPCREL(%rip), %rax ret This is part of rdar://9289558 llvm-svn: 129662	2011-04-17 17:12:08 +00:00
Chris Lattner	4b026b962a	tidy up and reduce indentation. llvm-svn: 129661	2011-04-17 17:05:12 +00:00
Chris Lattner	045c43855c	Fix rdar://9289512 - not folding load into compare at -O0 The basic issue here is that bottom-up isel is matching the branch and compare, and was failing to fold the load into the branch/compare combo. Fixing this (by allowing folding into any instruction of a sequence that is selected) allows us to produce things like: cmpb $0, 52(%rax) je LBB4_2 instead of: movb 52(%rax), %cl cmpb $0, %cl je LBB4_2 This makes the generated -O0 code run a bit faster, but also speeds up compile time by putting less pressure on the register allocator and generating less code. This was one of the biggest classes of missing load folding. Implementing this shrinks 176.gcc's c-decl.s (as a random example) by about 4% in (verbose-asm) line count. llvm-svn: 129656	2011-04-17 06:35:44 +00:00
Chris Lattner	d70ff0d807	split a complex predicate out to a helper function. Simplify two for loops, which don't need to check for falling off the end of a block and end of phi nodes, since terminators are never phis. llvm-svn: 129655	2011-04-17 06:03:19 +00:00
Eli Friedman	55f7bf3289	Remove working entry from README. llvm-svn: 129654	2011-04-17 02:36:27 +00:00
Chris Lattner	fba7ca63cc	fix rdar://9289583 - fast isel should handle non-canonical commutative binops allowing us to fold the immediate into the 'and' in this case: int test1(int i) { return 8&i; } llvm-svn: 129653	2011-04-17 01:16:47 +00:00
Eli Friedman	55b0acd624	PR9055: extend the fix to PR4050 (r70179) to apply to zext and anyext. Returning a new node makes the code try to replace the old node, which in the included testcase is killed by CSE. llvm-svn: 129650	2011-04-16 23:25:34 +00:00
Frits van Bommel	d6d4f987b4	Rename a misleadingly-named variable. llvm-svn: 129644	2011-04-16 14:32:34 +00:00
Francois Pichet	beb17d9359	Unbreak the MSVC 2010 build. For further information on this particular issue see: http://connect.microsoft.com/VisualStudio/feedback/details/520043/error-converting-from-null-to-a-pointer-type-in-std-pair llvm-svn: 129642	2011-04-16 14:20:39 +00:00
Jay Foad	7d03e9be47	Fix bug when checking phi operands in InstCombiner::visitPHINode(), found by code inspection. llvm-svn: 129641	2011-04-16 14:17:37 +00:00
Francois Pichet	47f86e6c60	MSVC needs the return 0 to compile. llvm-svn: 129640	2011-04-16 13:59:23 +00:00
Benjamin Kramer	659bfb34ff	Remove unused variable. llvm-svn: 129639	2011-04-16 10:30:47 +00:00
Rafael Espindola	a83b177035	Put each personality function in a section. This fixes the gnu ld warning: error in foo.o; no .eh_frame_hdr table will be created. llvm-svn: 129635	2011-04-16 03:51:21 +00:00
Stuart Hastings	ebddfe60a0	Correct result when a branch condition is live across a block boundary. <rdar://problem/8933028> llvm-svn: 129634	2011-04-16 03:31:26 +00:00
Evan Cheng	b14ce09fca	Fix divmod libcall lowering. Convert to {S\|U}DIVREM first and then expand the node to a libcall. rdar://9280991 llvm-svn: 129633	2011-04-16 03:08:26 +00:00
Rafael Espindola	c715e724de	Fix cmake build. llvm-svn: 129632	2011-04-16 02:06:46 +00:00
Nick Lewycky	c5ea8528cc	Move the re-stemming function up top and use it where it's currently inlined. Break the arc-profile code out to a function like the notes emission code is, and reorder the functions in the file. The only functionality change is that we no longer modify the Module when the Module has no debug info to use. llvm-svn: 129631	2011-04-16 02:05:18 +00:00
Nick Lewycky	966edd068f	Rename LineProfiling to GCOVProfiling to more accurately represent what it does. Also mostly implement it. Still a work-in-progress, but generates legal output on crafted test cases. llvm-svn: 129630	2011-04-16 01:20:23 +00:00
Devang Patel	514b4006c2	Introduce support to encode Objective-C property information in debugging information generated for an interface. llvm-svn: 129624	2011-04-16 00:11:51 +00:00
Johnny Chen	48592ee5af	Thumb2 BFC was insufficiently encoded. rdar://problem/9292717 llvm-svn: 129619	2011-04-15 22:52:15 +00:00
Johnny Chen	761e1e3512	A8.6.315 VLD3 (single 3-element structure to all lanes) The a bit must be encoded as 0. rdar://problem/9292625 llvm-svn: 129618	2011-04-15 22:49:08 +00:00
Akira Hatanaka	e24891251c	Reverse unnecessary changes made in r129606 and r129608. There is no change in functionality. llvm-svn: 129612	2011-04-15 21:51:11 +00:00
Cameron Zwarich	9c65e4d69c	Add ORR and EOR to the CMP peephole optimizer. It's hard to get isel to generate a case involving EOR, so I only added a test for ORR. llvm-svn: 129610	2011-04-15 21:24:38 +00:00
Akira Hatanaka	d56f2d910b	Fix lines that exceed 80 columns. There is no change in functionality. llvm-svn: 129608	2011-04-15 21:06:38 +00:00
Akira Hatanaka	aef55c8801	Fix lines that have incorrect indentation or exceed 80 columns. There is no change in functionality. llvm-svn: 129606	2011-04-15 21:00:26 +00:00
Cameron Zwarich	0829b3065a	The AND instruction leaves the V flag unmodified, so it falls victim to the same problem as all of the other instructions we fold with CMPs. llvm-svn: 129602	2011-04-15 20:45:00 +00:00
Rafael Espindola	7583dbdc88	Fix cmake build. llvm-svn: 129601	2011-04-15 20:34:45 +00:00
Rafael Espindola	beb74c3f00	Some refactoring suggested by Anton Korobeynikov. llvm-svn: 129600	2011-04-15 20:32:03 +00:00
Cameron Zwarich	93eae1571c	Add missing register forms of instructions to the ARM CMP-folding code. This fixes <rdar://problem/9287901>. llvm-svn: 129599	2011-04-15 20:28:28 +00:00
Akira Hatanaka	279169771b	Add pass that expands pseudo instructions into target instructions after register allocation. Define pseudos that get expanded into mtc1 or mfc1 instructions. llvm-svn: 129594	2011-04-15 19:52:08 +00:00
Evan Cheng	a2e61292f0	Increase SubtargetFeatureKV Value and Implies fields to 64 bits since some targets are getting very close to 32 subtarget features. Also teach tablegen to error when there are more than 64 features to guard against undefined behavior. rdar://9282332 llvm-svn: 129590	2011-04-15 19:35:46 +00:00
Lenny Maiorani	fad9d95722	Implements StringRef::compare with bounds. It behaves similarly to strncmp(). Unit tests also included. llvm-svn: 129582	2011-04-15 17:56:50 +00:00
Jakob Stoklund Olesen	1af8b4dc92	Teach the SplitKit blitter to handle multiply defined values as well. The transferValues() function can now handle both singly and multiply defined values, as long as the resulting live range is known. Only rematerialized values have their live range recomputed by extendRange(). The updateSSA() function can now insert PHI values in bulk across multiple values in multiple target registers in one pass. The list of blocks received from transferValues() is in layout order which seems to work well for the iterative algorithm. Blocks from extendRange() are still in reverse BFS order, but this function is used so rarely now that it doesn't matter. llvm-svn: 129580	2011-04-15 17:24:49 +00:00
Jakob Stoklund Olesen	871f70609a	Remember to set flag. llvm-svn: 129579	2011-04-15 17:24:46 +00:00
Rafael Espindola	a01cdb0e37	Add 129518 back with a fix for when we are producing eh just because of debug info. Change ELF systems to use CFI for producing the EH tables. This reduces the size of the clang binary in Debug builds from 690MB to 679MB. llvm-svn: 129571	2011-04-15 15:11:06 +00:00
Chris Lattner	0ab5e2cded	Fix a ton of comment typos found by codespell. Patch by Luis Felipe Strano Moraes! llvm-svn: 129558	2011-04-15 05:18:47 +00:00
NAKAMURA Takumi	b5e3e9dd27	Revert r129518, "Change ELF systems to use CFI for producing the EH tables. This reduces the" It broke several builds. llvm-svn: 129557	2011-04-15 03:35:57 +00:00
Evan Cheng	12bb05b75b	Fix another fcopysign lowering bug. If src is f64 and destination is f32, don't forget to right shift the source by 32 first. rdar://9287902 llvm-svn: 129556	2011-04-15 01:31:00 +00:00
Johnny Chen	681fef5986	For t2BFI, both Inst{26} and Inst{5} "should" be 0. Ref: I.1 Instruction encoding diagrams and pseudocode llvm-svn: 129552	2011-04-15 00:35:08 +00:00
Michael J. Spencer	30088ba110	Add 3DNow! intrinsics. llvm-svn: 129551	2011-04-15 00:32:41 +00:00
Johnny Chen	421316178e	The ARM disassembler did not handle the alignment correctly for VLDDUP instructions (single element or n-element structure to all lanes). llvm-svn: 129550	2011-04-15 00:10:45 +00:00
Evan Cheng	44887f9c7e	Follow up on r127913. Fix Thumb revsh isel. rdar://9286766 llvm-svn: 129548	2011-04-14 23:27:44 +00:00
Eli Friedman	2395626605	Add an instcombine for constructs like a \| -(b != c); a select is more canonical, and generally leads to better code. Found while looking at an article about saturating arithmetic. llvm-svn: 129545	2011-04-14 22:41:27 +00:00
Owen Anderson	92651ec374	Fix an infinite alternation in JumpThreading where two transforms would repeatedly undo each other. The solution is to perform more aggressive constant folding to make one of the edges just folded away rather than trying to thread it. Fixes <rdar://problem/9284786>. Discovered with CSmith. llvm-svn: 129538	2011-04-14 21:35:50 +00:00
Mon P Wang	1cde91674a	Cleanup r129509 based on comments by Chris llvm-svn: 129532	2011-04-14 19:20:42 +00:00
Johnny Chen	4251b151b1	Add sanity checks for Thumb2 Load/Store Register Exclusive family of operations. llvm-svn: 129531	2011-04-14 19:13:28 +00:00
Chris Lattner	6f195469b1	move PR9661 out to here. llvm-svn: 129527	2011-04-14 18:47:18 +00:00
Owen Anderson	a519284fec	Fix another instance of the DAG combiner not using the correct type for the RHS of a shift. llvm-svn: 129522	2011-04-14 17:30:49 +00:00
Rafael Espindola	aa2a7cd828	Change ELF systems to use CFI for producing the EH tables. This reduces the size of the clang binary in Debug builds from 690MB to 679MB. llvm-svn: 129518	2011-04-14 15:18:53 +00:00
Michael J. Spencer	b88784c185	Fix whitespace and tabs. llvm-svn: 129517	2011-04-14 14:33:36 +00:00
Mon P Wang	0f6bad7b6e	Cleanup r129472 by using a utility routine as suggested by Eli. llvm-svn: 129509	2011-04-14 08:04:01 +00:00
Andrew Trick	bfbd972b1f	In the pre-RA scheduler, maintain cmp+br proximity. This is done by pushing physical register definitions close to their use, which happens to handle flag definitions if they're not glued to the branch. This seems to be generally a good thing though, so I didn't need to add a target hook yet. The primary motivation is to generate code closer to what people expect and rule out missed opportunity from enabling macro-op fusion. As a side benefit, we get several 2-5% gains on x86 benchmarks. There is one regression: SingleSource/Benchmarks/Shootout/lists slows down by 10%. But this is an independent scheduler bug that will be tracked separately. See rdar://problem/9283108. Incidentally, pre-RA scheduling is only half the solution. Fixing the later passes is tracked by: <rdar://problem/8932804> [pre-RA-sched] on x86, attempt to schedule CMP/TEST adjacent with condition jump Fixes: <rdar://problem/9262453> Scheduler unnecessary break of cmp/jump fusion llvm-svn: 129508	2011-04-14 05:15:06 +00:00
Chris Lattner	1d313c6f6d	add a minor missed dag combine that is blocking mid-level optimization improvements, that will lead to fixing PR6627. llvm-svn: 129504	2011-04-14 04:21:42 +00:00
Chris Lattner	493b3e72f2	sink a call into its only use. llvm-svn: 129503	2011-04-14 04:12:47 +00:00
Chris Lattner	fba5cdfce1	rework FoldBranchToCommonDest to exit earlier when there is a bonus instruction around, reducing work. Greatly simplify handling of debug instructions. There is no need to build up a vector of them and then move them into the one predecessor if we're processing a block. Instead just rescan the block and copy them into the pred. If a block gets merged into multiple preds, this will retain more debug info. llvm-svn: 129502	2011-04-14 02:44:53 +00:00
Chris Lattner	35a65b2aa6	fix a couple -Wsign-compare warnings. llvm-svn: 129501	2011-04-14 02:27:25 +00:00
Bill Wendling	410ec4aad1	As Dan pointed out, movzbl, movsbl, and friends are nicer than their alias (movzx/movsx) because they give more information. Revert that part of the patch. llvm-svn: 129498	2011-04-14 01:46:37 +00:00
Bill Wendling	7e07d6fb69	Have the X86 back-end emit the alias instead of what's being aliased. In most cases, it's much nicer and more informative reading the alias. llvm-svn: 129497	2011-04-14 01:11:51 +00:00
Bill Wendling	6dd69d9241	Add an option to not print the alias of an instruction. It defaults to "print the alias". llvm-svn: 129485	2011-04-13 23:36:21 +00:00
Owen Anderson	9c12834eed	During post-legalization DAG combining, be careful to only create shifts where the RHS is of the legal type for the new operation. llvm-svn: 129484	2011-04-13 23:22:23 +00:00
Johnny Chen	d0fb04f437	Thumb disassembler did not handle tBRIND (indirect branch) properly. rdar://problem/9280370 llvm-svn: 129480	2011-04-13 21:59:01 +00:00
Mon P Wang	2e5528f0b2	Vectors with different number of elements of the same element type can have the same allocation size but different primitive sizes(e.g., <3xi32> and <4xi32>). When ScalarRepl promotes them, it can't use a bit cast but should use a shuffle vector instead. llvm-svn: 129472	2011-04-13 21:40:02 +00:00
Johnny Chen	b6a37bff21	Check for unallocated instruction encodings when disassembling Thumb Branch instructions (tBcc and t2Bcc). rdar://problem/9280470 llvm-svn: 129471	2011-04-13 21:35:49 +00:00
Johnny Chen	ffa6378fd6	The LDRT/STRT (unprivileged load/store) operations don't take SP or PC as Rt. rdar://problem/9279440 llvm-svn: 129469	2011-04-13 21:04:32 +00:00
Cameron Zwarich	415b5e8341	Fix a typo in an ARM-specific DAG combine. This fixes <rdar://problem/9278274>. llvm-svn: 129468	2011-04-13 21:01:19 +00:00
Cameron Zwarich	9398197ef1	Fix a regression caused by r102515 where explicit alignment on globals is ignored. There was a test to catch this, but it was just blindly updated in a large change. This fixes another part of <rdar://problem/9275290>. llvm-svn: 129466	2011-04-13 20:36:04 +00:00
Devang Patel	2772f662da	Fix debug message. llvm-svn: 129463	2011-04-13 19:47:41 +00:00
Johnny Chen	70591cbc60	Check the corner cases for t2LDRSHi12 correctly and mark invalid encodings as such. rdar://problem/9276651 llvm-svn: 129462	2011-04-13 19:46:05 +00:00
Devang Patel	e141234940	Remove extra bytes that were added for gdb. We do not have a good pointer to understand the actual reason behind this fixme. Spot checking suggests that newer gdb does not need this. llvm-svn: 129461	2011-04-13 19:41:17 +00:00
Johnny Chen	0d306a7840	Fix a bug where for t2MOVCCi disassembly, the TIED_TO register operand was not properly handled. rdar://problem/9276427 llvm-svn: 129456	2011-04-13 17:51:02 +00:00
Johnny Chen	b2f9fa1fce	Forgot to add this change for http://llvm.org/viewvc/llvm-project?view=rev&revision=129387 . llvm-svn: 129451	2011-04-13 16:56:08 +00:00
Junjie Gu	377cc31a74	Fixed the revision 129449. llvm-svn: 129450	2011-04-13 16:45:49 +00:00
Junjie Gu	7c3b4593b5	Passing unroll parameters (unroll-count, threshold, and partial unroll) via LoopUnroll class's ctor. Doing so will allow multiple contexts with different loop unroll parameters to run. This is a minor change and has no effect on existing applications. llvm-svn: 129449	2011-04-13 16:15:29 +00:00
Rafael Espindola	6aafb64daf	Add the alias analysis to the C api. llvm-svn: 129447	2011-04-13 15:44:58 +00:00
Jim Grosbach	956de1ff66	MCJIT relocation resolution. llvm-svn: 129445	2011-04-13 15:28:10 +00:00
Jay Foad	0091fe8ca1	PR9214: Convert ConstantExpr::getIndices() to return an ArrayRef, plus related tweaks to ExprMapKeyType. llvm-svn: 129443	2011-04-13 15:22:40 +00:00
Jakob Stoklund Olesen	cda53febec	Stop using dead function. llvm-svn: 129442	2011-04-13 15:00:11 +00:00
Jay Foad	47f89e0f55	Remove some redundant llvm:: prefixes. llvm-svn: 129441	2011-04-13 14:39:42 +00:00
Jay Foad	5c984e563b	PR9214: Convert ConstantExpr::getWithOperands() to use ArrayRef. llvm-svn: 129439	2011-04-13 13:46:01 +00:00
Jay Foad	3b422a1249	Like the coding standards say, do not use "using namespace std". llvm-svn: 129435	2011-04-13 12:46:01 +00:00
Cameron Zwarich	70be27e913	Fix an obvious problem with an alignment computation. AsmPrinter actually does the max itself, so it is not easy to write a test case for this, but I added a test case that would fail if the code in AsmPrinter were removed. llvm-svn: 129432	2011-04-13 09:02:43 +00:00
Cameron Zwarich	8001850ee8	Fix a typo. llvm-svn: 129429	2011-04-13 06:39:16 +00:00
Cameron Zwarich	cdf59f7016	If a global variable has a specified alignment that is less than the preferred alignment for its type, use the minimum of the specified alignment and the ABI alignment. This fixes <rdar://problem/9275290>. llvm-svn: 129428	2011-04-13 06:03:16 +00:00
Andrew Trick	b53a00d2cb	Recommit r129383. PreRA scheduler heuristic fixes: VRegCycle, TokenFactor latency. Additional fixes: Do something reasonable for subtargets with generic itineraries by handling node latency the same as for an empty itinerary. Now nodes default to unit latency unless an itinerary explicitly specifies a zero cycle stage or it is a TokenFactor chain. Original fixes: UnitsSharePred was a source of randomness in the scheduler: node priority depended on the queue data structure. I rewrote the recent VRegCycle heuristics to completely replace the old heuristic without any randomness. To make the node latency adjustments work, I also needed to do something a little more reasonable with TokenFactor. I gave it zero latency to its consumers and always schedule it as low as possible. llvm-svn: 129421	2011-04-13 00:38:32 +00:00
Bill Wendling	b902f1dd88	Reapply r129401 with patch for clang. llvm-svn: 129419	2011-04-13 00:36:11 +00:00
Eric Christopher	28f4c729f7	Temporarily revert r129408 to see if it brings the bots back. llvm-svn: 129417	2011-04-13 00:20:59 +00:00
Rafael Espindola	539d96f01d	Be consistent about being virtual and returning void in the cfi methods. Implement the ones that were missing in the asm streamer. llvm-svn: 129413	2011-04-12 23:59:07 +00:00
Johnny Chen	3c2f74c9f3	Add sanity check for Ld/St Dual forms of Thumb2 instructions. rdar://problem/9273947 llvm-svn: 129411	2011-04-12 23:31:00 +00:00
Jakob Stoklund Olesen	987164043c	Add @earlyclobber constraints to the writeback register of all ARM store instructions. The ARMARM specifies these instructions as unpredictable when storing the writeback register. This shouldn't affect code generation much since storing a pointer to itself is quite rare. llvm-svn: 129409	2011-04-12 23:27:48 +00:00
Eric Christopher	d829f43c06	Fix a bug where we were counting the alias sets as completely used registers for fast allocation. Fixes rdar://9207598 llvm-svn: 129408	2011-04-12 23:23:14 +00:00
Devang Patel	0e821f4673	I missed this new file in previous commit. llvm-svn: 129407	2011-04-12 23:21:44 +00:00
Devang Patel	28dce70364	Simplify. There is no need to use static variable. llvm-svn: 129406	2011-04-12 23:10:47 +00:00
Devang Patel	13d47f0ddc	Do not reuse parameter name. llvm-svn: 129405	2011-04-12 23:09:06 +00:00
Bill Wendling	dbfde42468	Revert r129401 for now. Clang is using the old way of doing things. llvm-svn: 129403	2011-04-12 22:59:27 +00:00
Devang Patel	f20c4f715f	This mechanical patch moves type handling into CompileUnit from DwarfDebug. In case of multiple compile unit in one object file, each compile unit is responsible for its own set of type entries anyway. This refactoring makes this obvious. llvm-svn: 129402	2011-04-12 22:53:02 +00:00
Bill Wendling	47c24875a1	Remove the unaligned load intrinsics in favor of using native unaligned loads. Now that we have a first-class way to represent unaligned loads, the unaligned load intrinsics are superfluous. First part of <rdar://problem/8460511>. llvm-svn: 129401	2011-04-12 22:46:31 +00:00
Eric Christopher	de9d58569f	Add more comments... err debug statements to the fast allocator. llvm-svn: 129400	2011-04-12 22:17:44 +00:00
Johnny Chen	960eef3db3	The Thumb2 RFE instructions need to have their second halfword fully specified. In addition, the base register is not rGPR, but GPR with the exception that: if n == 15 then UNPREDICTABLE rdar://problem/9273836 llvm-svn: 129391	2011-04-12 21:41:51 +00:00
Jakob Stoklund Olesen	c49df2c05a	SparseBitVector is SLOW. Use a Bitvector instead, we didn't need the smaller memory footprint anyway. This makes the greedy register allocator 10% faster. llvm-svn: 129390	2011-04-12 21:30:53 +00:00
Jim Grosbach	733d305fee	MCJIT lazy relocation resolution and symbol address re-assignment. Add handling for tracking the relocations on symbols and resolving them. Keep track of the relocations even after they are resolved so that if the RuntimeDyld client moves the object, it can update the address and any relocations to that object will be updated. For our trivial object file load/run test harness (llvm-rtdyld), this enables relocations between functions located in the same object module. It should be trivially extendable to load multiple objects with mutual references. As a simple example, the following now works (running on x86_64 Darwin 10.6): $ cat t.c int bar() { return 65; } int main() { return bar(); } $ clang t.c -fno-asynchronous-unwind-tables -o t.o -c $ otool -vt t.o t.o: (__TEXT,__text) section _bar: 0000000000000000 pushq %rbp 0000000000000001 movq %rsp,%rbp 0000000000000004 movl $0x00000041,%eax 0000000000000009 popq %rbp 000000000000000a ret 000000000000000b nopl 0x00(%rax,%rax) _main: 0000000000000010 pushq %rbp 0000000000000011 movq %rsp,%rbp 0000000000000014 subq $0x10,%rsp 0000000000000018 movl $0x00000000,0xfc(%rbp) 000000000000001f callq 0x00000024 0000000000000024 addq $0x10,%rsp 0000000000000028 popq %rbp 0000000000000029 ret $ llvm-rtdyld t.o -debug-only=dyld ; echo $? Function sym: '_bar' @ 0 Function sym: '_main' @ 16 Extracting function: _bar from [0, 15] allocated to 0x100153000 Extracting function: _main from [16, 41] allocated to 0x100154000 Relocation at '_main' + 16 from '_bar(Word1: 0x2d000000) Resolving relocation at '_main' + 16 (0x100154010) from '_bar (0x100153000)(pcrel, type: 2, Size: 4). loaded '_main' at: 0x100154000 65 $ llvm-svn: 129388	2011-04-12 21:20:41 +00:00
Johnny Chen	01637b9acb	Add bad register checks for Thumb2 Ld/St instructions. rdar://problem/9269047 llvm-svn: 129387	2011-04-12 21:17:51 +00:00
Andrew Trick	1b60ad6644	Revert 129383. It causes some targets to hit a scheduler assert. llvm-svn: 129385	2011-04-12 20:14:07 +00:00
Andrew Trick	c5dd24a542	PreRA scheduler heuristic fixes: VRegCycle, TokenFactor latency. UnitsSharePred was a source of randomness in the scheduler: node priority depended on the queue data structure. I rewrote the recent VRegCycle heuristics to completely replace the old heuristic without any randomness. To make these heuristic adjustments to node latency work, I also needed to do something a little more reasonable with TokenFactor. I gave it zero latency to its consumers and always schedule it as low as possible. llvm-svn: 129383	2011-04-12 19:54:36 +00:00
Jakob Stoklund Olesen	c70b697a40	Create new intervals for isolated blocks during region splitting. This merges the behavior of splitSingleBlocks into splitAroundRegion, so the RS_Region and RS_Block register stages can be coalesced. That means the leftover intervals after region splitting go directly to spilling instead of a second pass of per-block splitting. llvm-svn: 129379	2011-04-12 19:32:53 +00:00
Rafael Espindola	fd794affe5	Remove LastOffset from the asm parser. llvm-svn: 129378	2011-04-12 18:53:30 +00:00
Johnny Chen	ab86a519f8	The Thumb2 Ld, St, and Preload instructions with the i12 forms should have its Inst{23} be specified as '1' (add = TRUE). Also add a utility function for Thumb2. llvm-svn: 129377	2011-04-12 18:48:00 +00:00
Jakob Stoklund Olesen	0840f50b76	Add SplitKit API to query and select the current interval being worked on. This makes it possible to target multiple registers in one pass. llvm-svn: 129374	2011-04-12 18:11:31 +00:00
Jakob Stoklund Olesen	68e84581c5	Fix a bug in RegAllocBase::addMBBLiveIns() where a basic block could accidentally be skipped. llvm-svn: 129373	2011-04-12 18:11:28 +00:00
Devang Patel	4547a9e658	Remove dead typedef. llvm-svn: 129368	2011-04-12 17:43:12 +00:00
Devang Patel	5eb4319dba	Refactor CompileUnit into a separate header. llvm-svn: 129367	2011-04-12 17:40:32 +00:00
Johnny Chen	d0e2be39ea	Print out a debug message when the reglist fails the sanity check for Thumb Ld/St Multiple. llvm-svn: 129365	2011-04-12 17:09:04 +00:00
Rafael Espindola	1ec0f46169	Fix the case of a .cfi_rel_offset before any .cfi_def_cfa_offset. llvm-svn: 129362	2011-04-12 16:12:03 +00:00
Rafael Espindola	2e1c9d2188	Implement .cfi_same_value. llvm-svn: 129361	2011-04-12 15:31:05 +00:00
Cameron Zwarich	fbcd69b96a	Split a store of a VMOVDRR into two integer stores to avoid mixing NEON and ARM stores of arguments in the same cache line. This fixes the second half of <rdar://problem/8674845>. llvm-svn: 129345	2011-04-12 02:24:17 +00:00
NAKAMURA Takumi	3f28443a07	lib/Transforms/Instrumentation/CMakeLists.txt: Add LineProfiling.cpp to fix up r129340. llvm-svn: 129343	2011-04-12 01:54:40 +00:00
Nick Lewycky	9d60e373cf	Add support for line profiling. Very work-in-progress. Use debug info in the IR to find the directory/file:line:col. Each time that location changes, bump a counter. Unlike the existing profiling system, we don't try to look at argv[], and thusly don't require main() to be present in the IR. This matches GCC's technique where you specify the profiling flag when producing each .o file. The runtime library is minimal, currently just calling printf at program shutdown time. The API is designed to make it possible to emit GCOV data later on. llvm-svn: 129340	2011-04-12 01:06:09 +00:00
Nick Lewycky	fbc5a4004c	Consider ConstantAggregateZero as well as ConstantArray/Struct. llvm-svn: 129338	2011-04-12 01:02:45 +00:00
Eric Christopher	c37833625a	Fix typo. llvm-svn: 129334	2011-04-12 00:48:08 +00:00
Nick Lewycky	11168326f8	Make IRBuilder support StringRef for building strings. Also document that the global variables produced are mergeable. llvm-svn: 129330	2011-04-12 00:29:07 +00:00
Jim Grosbach	3ed03f18d1	Tidy up a bit now that we're using the MemoryManager interface. llvm-svn: 129328	2011-04-12 00:23:32 +00:00
Eric Christopher	ffc0e1f6e6	Match case for invalid constant error messages and add a new test for invalid hexadecimals. llvm-svn: 129326	2011-04-12 00:18:03 +00:00
Johnny Chen	672ef14a62	A8.6.16 B Encoding T1 (tBcc) if cond == '1110' then UNDEFINED; rdar://problem/9268681 llvm-svn: 129325	2011-04-12 00:14:49 +00:00
Dan Gohman	1c6c34834b	Fix reassociate to use a worklist instead of recursing when new reassociation opportunities are exposed. This fixes a bug where the nested reassociation expects the IR to be consistent, but it isn't, because the outer reassociation has disconnected some of the operands. rdar://9167457 llvm-svn: 129324	2011-04-12 00:11:56 +00:00
Eric Christopher	104af0619e	To avoid printing out multiple error messages for cases like: .long 80+08 go ahead and assume that if we've got an Error token that we handled it already. Otherwise if it's a token we can't handle then go ahead and return the default error. llvm-svn: 129322	2011-04-12 00:03:13 +00:00
Jakob Stoklund Olesen	507992e909	Reuse live interval union between functions. This saves a bit of compile time when compiling many small functions. llvm-svn: 129321	2011-04-11 23:57:14 +00:00
Johnny Chen	dc8bf9ec08	Thumb disassembler was erroneously rejecting "blx sp" instruction. rdar://problem/9267838 llvm-svn: 129320	2011-04-11 23:33:30 +00:00
Chris Lattner	7d4cdae564	comment cleanup, use moveBefore instead of removeFromParent+insertBefore. llvm-svn: 129319	2011-04-11 23:24:57 +00:00
Chris Lattner	e81d045d94	remove the StructRetPromotion pass. It is unused, not maintained and has some bugs. If this is interesting functionality, it should be reimplemented in the argpromotion pass. llvm-svn: 129314	2011-04-11 23:09:44 +00:00
Wesley Peck	f30a0e2d80	Fix an error in the MBlaze delay slot filler. llvm-svn: 129313	2011-04-11 22:45:02 +00:00
Wesley Peck	1914c39bd4	Add scheduling information for the MBlaze backend. llvm-svn: 129311	2011-04-11 22:31:52 +00:00
Eric Christopher	64749f2a89	Lex, and then fail on invalid constants. Testcase forthcoming. rdar://8490596 llvm-svn: 129309	2011-04-11 22:24:56 +00:00
Nick Lewycky	0f85789800	Just because a GlobalVariable's initializer is [N x { i32, void ()* }] doesn't mean that it has to be ConstantArray of ConstantStruct. We might have ConstantAggregateZero, at either level, so don't crash on that. Also, semi-deprecate the sentinel value. The linker isn't aware of sentinels so we end up with the two lists appended, each with their "sentinels" on them. Different parts of LLVM treated sentinels differently, so make them all just ignore the single entry and continue on with the rest of the list. llvm-svn: 129307	2011-04-11 22:11:20 +00:00
Rafael Espindola	82065cb6cf	Implement cfi_rel_offset llvm-svn: 129306	2011-04-11 21:49:50 +00:00
Jakob Stoklund Olesen	0f175ebc32	Speed up eviction by stopping collectInterferingVRegs as soon as the spill weight limit has been exceeded. llvm-svn: 129305	2011-04-11 21:47:01 +00:00
Wesley Peck	e3685217d0	Don't crash on invalid instructions when disassembling MBlaze code. This fixes http://llvm.org/bugs/show_bug.cgi?id=9653 llvm-svn: 129303	2011-04-11 21:35:21 +00:00
Bill Wendling	1e1f1c9ce1	The default of the dispatch switch statement was to branch to a BB that executed the 'unwind' instruction. However, later on that instruction was converted into a jump to the basic block it was located in, causing an infinite loop when we get there. It turns out, we get there if the _Unwind_Resume_or_Rethrow call returns (which it's not supposed to do). It returns if it cannot find a place to unwind to. Thus we would get what appears to be a "hang" when in reality it's just that the EH couldn't be propagated further along. Instead of infinitely looping (or calling `unwind', which none of our back-ends support (it's lowered into nothing...)), call the @llvm.trap() intrinsic instead. This may not conform to specific rules of a particular language, but it's rather better than infinitely looping. <rdar://problem/9175843&9233582> llvm-svn: 129302	2011-04-11 21:32:34 +00:00
Johnny Chen	f79d5365de	Fix the bug where the immediate shift amount for Thumb logical shift instructions are incorrectly disassembled. rdar://problem/9266265 llvm-svn: 129298	2011-04-11 21:14:35 +00:00
Evan Cheng	ef42bea704	Look past copies when determining whether hoisting would end up inserting more copies. rdar://9266679 llvm-svn: 129297	2011-04-11 21:09:18 +00:00
Rafael Espindola	ffd2e5163b	implement .cfi_adjust_cfa_offset. llvm-svn: 129296	2011-04-11 20:29:16 +00:00
Owen Anderson	5140802cd9	Fix another using-CPSR-twice bug in my ADCS/SBCS cleanups, and make proper use of the Commutable bit. llvm-svn: 129294	2011-04-11 20:12:19 +00:00
Jakob Stoklund Olesen	7d05bce70c	Use a faster algorithm for computing MBB live-in registers after register allocation. LiveIntervals::findLiveInMBBs has to do a full binary search for each segment. llvm-svn: 129292	2011-04-11 20:01:41 +00:00
Johnny Chen	74adbddade	Trivial comment fix. llvm-svn: 129288	2011-04-11 18:51:50 +00:00
Evan Cheng	fe917efc8b	Fix a couple of places where changes are made but not tracked. llvm-svn: 129287	2011-04-11 18:47:20 +00:00
Johnny Chen	66fab75920	Check invalid register encodings for LdFrm/StFrm ARM instructions and flag them as invalid instructions. llvm-svn: 129286	2011-04-11 18:34:12 +00:00
Kevin Enderby	9377a52c12	Adding support for printing operands symbolically to llvm's public 'C' disassembler API. Hooked this up to the ARM target so such tools as Darwin's otool(1) can now print things like branch targets, for example this: blx _puts instead of this: blx #-36 And even print the expression encoded in the Mach-O relocation entries for things like this: movt r0, :upper16:((_foo-_bar)+1234) llvm-svn: 129284	2011-04-11 18:08:50 +00:00
Jakob Stoklund Olesen	f8beafe207	Don't add live ranges for sub-registers when clobbering a physical register. Both coalescing and register allocation already check aliases for interference, so these extra segments are only slowing us down. This speeds up both linear scan and the greedy register allocator. llvm-svn: 129283	2011-04-11 18:08:10 +00:00
Jakob Stoklund Olesen	4fbbe3689d	Speed up LiveIntervalUnion::unify by handling end insertion specially. This particularly helps with the initial transfer of fixed intervals. llvm-svn: 129277	2011-04-11 15:00:44 +00:00
Jakob Stoklund Olesen	bfabc494f5	Time the initial seeding of live registers llvm-svn: 129276	2011-04-11 15:00:42 +00:00
Jakob Stoklund Olesen	96d04c8e00	Don't shrink live ranges after dead code elimination unless it is going to help. In particular, don't repeatedly recompute the PIC base live range after rematerialization. llvm-svn: 129275	2011-04-11 15:00:39 +00:00
Jay Foad	0159a1ee11	Fix or remove code which seemed to think that the operand of a Constant was always a User. llvm-svn: 129272	2011-04-11 09:48:55 +00:00
Jay Foad	7c14a558fe	Don't include Operator.h from InstrTypes.h. llvm-svn: 129271	2011-04-11 09:35:34 +00:00
Jay Foad	29426e87c1	Phi nodes always use an even number of operands, so don't ever allocate an odd number. llvm-svn: 129270	2011-04-11 09:25:51 +00:00
Bill Wendling	35a9c3cd72	Revert r129235 pending a vetting of the EH rewrite. --- Reverse-merging r129235 into '.': D test/Feature/bb_attrs.ll U include/llvm/BasicBlock.h U include/llvm/Bitcode/LLVMBitCodes.h U lib/VMCore/AsmWriter.cpp U lib/VMCore/BasicBlock.cpp U lib/AsmParser/LLParser.cpp U lib/AsmParser/LLLexer.cpp U lib/AsmParser/LLToken.h U lib/Bitcode/Reader/BitcodeReader.cpp U lib/Bitcode/Writer/BitcodeWriter.cpp llvm-svn: 129259	2011-04-10 23:18:04 +00:00
Nicolas Geoffray	9137ee85de	Bugfix in the Cpp backend after API change on PHINode::Create. llvm-svn: 129248	2011-04-10 17:39:40 +00:00
Bill Wendling	3d5450d809	Beginning of the Great Exception Handling Rewrite. * Add a "landing pad" attribute to the BasicBlock. * Modify the bitcode reader and writer to handle said attribute. Later: The verifier will ensure that the landing pad attribute is used in the appropriate manner. I.e., not applied to the entry block, and applied only to basic blocks that are branched to via a `dispatch' instruction. (This is a work-in-progress.) llvm-svn: 129235	2011-04-10 00:04:27 +00:00
Chris Lattner	fc4fe00a65	fix rdar://8735979 - "int 3" doesn't match to "int3". Unfortunately, InstAlias doesn't allow matching immediate operands, so we have to write C++ code to do this. llvm-svn: 129223	2011-04-09 19:41:05 +00:00

46821 Commits