llvm-project

Commit Graph

Author	SHA1	Message	Date
Michael J. Spencer	98c7a114de	Support/PathV2: Use SmallVector::clear instead of set_size. llvm-svn: 121092	2010-12-07 01:23:19 +00:00
Michael J. Spencer	5529c57cfe	Support/PathV2: Clarify and correct documentation. llvm-svn: 121091	2010-12-07 01:23:08 +00:00
Michael J. Spencer	20daa28344	Support/PathV2: Move current_path from path to fs and fix the Unix implementation. Unix bug spotted by Dan Gohman. llvm-svn: 121090	2010-12-07 01:22:31 +00:00
Rafael Espindola	a2421ec705	Fix a crash reduced from gcc produced assembly. llvm-svn: 121085	2010-12-07 01:09:54 +00:00
Owen Anderson	99ea8a3510	Second attempt at converting Thumb2's LDRpci, including updating the gazillion places that need to know about it. llvm-svn: 121082	2010-12-07 00:45:21 +00:00
Rafael Espindola	93e3cf0ebd	Sorry for such a large commit. The summary is that only MachO cares about the actuall addresses in a .o file, so it is better to let the MachO writer compute it. This is good for two reasons. First, areas that shouldn't care about addresses now don't have access to it. Second, the layout of each section is independent. I should use this in a subsequent commit to speed it up. Most of the patch is just removing the section address computation. The two interesting parts are the change on how we handle padding in the end of sections and how MachO can get the address of a-b when a and b are in different sections. Since now the expression evaluation normally doesn't know the section address, it will think that a-b needs relocation and let the MachO writer know. Once it has computed the section addresses, it calls back the expression evaluation with the section addresses to resolve these expressions. The remaining problem is the handling of padding. Currently it will create a special alignment fragment at the end. Since that fragment doesn't update the alignment of the section, it needs the real address to be computed. Since now the layout will not compute a-b with a and b in different sections, the only effect that the special alignment fragment has is update the address size of the section. This can also be done by the MachO writer. llvm-svn: 121076	2010-12-07 00:27:36 +00:00
Jim Grosbach	9e1994698d	Add fixup for Thumb1 BL/BLX instructions. llvm-svn: 121072	2010-12-06 23:57:07 +00:00
Frits van Bommel	d9df6eaa9c	Implement jump threading of 'indirectbr' by keeping track of whether we're looking for ConstantInts or BlockAddresss. llvm-svn: 121066	2010-12-06 23:36:56 +00:00
Devang Patel	bca5b25721	Undefined value in reg 0 may need a marker to identify end of source range. This will be used to truncate live range of DBG_VALUE instruction by register allocator and friends. llvm-svn: 121061	2010-12-06 22:48:22 +00:00
Devang Patel	c24048a718	If dbg_declare() or dbg_value() is not lowered by isel then emit DEBUG message instead of creating DBG_VALUE for undefined value in reg0. llvm-svn: 121059	2010-12-06 22:39:26 +00:00
Rafael Espindola	1055f72b97	Use references to simplify the code a bit. llvm-svn: 121050	2010-12-06 22:30:54 +00:00
Wesley Peck	dba03b050f	Adding bug fix that was suppose to be part of 121044. patch contributed by Jack Whitham! llvm-svn: 121049	2010-12-06 22:19:28 +00:00
Wesley Peck	8da34b6c35	Fixed reversed operands for IDIV and CMP instructions in MBlaze backend. Use BRAD instead of BRD for indirect branches in MBlaze backend. patch contributed by Jack Whitham! llvm-svn: 121044	2010-12-06 22:06:49 +00:00
Jason W Kim	495c2bb9a6	Refactor ELFObjectWriter. + ARM/X86/MBlaze now share a common RecordRelocation + ARM/X86/MBlaze arch specific routines are limited to GetRelocType() llvm-svn: 121043	2010-12-06 21:57:34 +00:00
Chris Lattner	7ff0ba41bd	replace a linear scan with a symtab lookup, reduce indentation. No functionality change. llvm-svn: 121042	2010-12-06 21:53:07 +00:00
Rafael Espindola	e134b08daa	use getSymbolOffset. llvm-svn: 121041	2010-12-06 21:51:55 +00:00
Chris Lattner	4dc53e37d9	Use a stronger predicate here, pointed out by Duncan llvm-svn: 121040	2010-12-06 21:48:10 +00:00
Chris Lattner	ca335e38cf	add some DEBUG statements. llvm-svn: 121038	2010-12-06 21:13:51 +00:00
Wesley Peck	6ce9b60811	Fix a 16-bit immediate value detection bug in the MBlaze delay slot filler. Address more hazards in the MBlaze delay slot filler. patch contributed by Jack Whitham! llvm-svn: 121037	2010-12-06 21:11:01 +00:00
Rafael Espindola	408344017e	Another use of getSymbolOffset. llvm-svn: 121034	2010-12-06 19:55:05 +00:00
Rafael Espindola	0f30fec0bd	Remove the instruction fragment to data fragment lowering since it was causing freed data to be read. I will open a bug to track it being reenabled. llvm-svn: 121028	2010-12-06 19:08:48 +00:00
Owen Anderson	c1ee8e35d2	Revert r121021, which broke the buildbots. llvm-svn: 121026	2010-12-06 18:57:40 +00:00
Jim Grosbach	67f13b19b5	Trailing whitespace. llvm-svn: 121024	2010-12-06 18:47:44 +00:00
Owen Anderson	bb4a76fc95	Improve handling of Thumb2 PC-relative loads by converting LDRpci (and friends) to Pseudos. llvm-svn: 121021	2010-12-06 18:35:51 +00:00
Jim Grosbach	968c927201	Encode the register operand of ARM CondCode operands correctly. ARM::CPSR if the instruction is predicated, reg0 otherwise. llvm-svn: 121020	2010-12-06 18:30:57 +00:00
Jim Grosbach	0bfb4d5043	The ARM AsmMatcher needs to know that the CCOut operand is a register value, not an immediate. It stores either ARM::CPSR or reg0. llvm-svn: 121018	2010-12-06 18:21:12 +00:00
Rafael Espindola	44bbe36de6	Second try at making direct object emission produce the same results as llc + llvm-mc. This time ELF is not changed and I tested that llvm-gcc bootstrap on darwin10 using darwin9's assembler and linker. llvm-svn: 121006	2010-12-06 17:27:56 +00:00
Rafael Espindola	dee3062373	Revert previous two patches while I try to find out how to make both linux and darwin assemblers happy :-( llvm-svn: 121004	2010-12-06 15:35:15 +00:00
Rafael Espindola	34a06a0802	Add an EmitAbsValue helper method and use it in cases where we want to be sure that no relocations are used (on MochO). Fixes llc producing different output from llc + llvm-mc. llvm-svn: 121000	2010-12-06 14:53:14 +00:00
Chris Lattner	fb212de06d	Fix PR8735, a really terrible problem in the inliner's "alloca merging" optimization. Consider: static void foo() { A = alloca ... } static void bar() { B = alloca ... call foo(); } void main() { bar() } The inliner proceeds bottom up, but lets pretend it decides not to inline foo into bar. When it gets to main, it inlines bar into main(), and says "hey, I just inlined an alloca "B" into main, lets remember that. Then it keeps going and finds that it now contains a call to foo. It decides to inline foo into main, and says "hey, foo has an alloca A, and I have an alloca B from another inlined call site, lets reuse it". The problem with this of course, is that the lifetime of A and B are nested, not disjoint. Unfortunately I can't create a reasonable testcase for this: the one in the PR is both huge and extremely sensitive, because you minor tweaks end up causing foo to get inlined into bar too early. We already have tests for the basic alloca merging optimization and this does not break them. llvm-svn: 120995	2010-12-06 07:52:42 +00:00
Chris Lattner	cd3af96a8f	improve comment llvm-svn: 120994	2010-12-06 07:43:04 +00:00
Chris Lattner	5b6a865f2e	improve -debug output and comments a little. llvm-svn: 120993	2010-12-06 07:38:40 +00:00
Michael J. Spencer	0d025b6b42	Support/Windows: Make MinGW happy. llvm-svn: 120991	2010-12-06 06:02:07 +00:00
Michael J. Spencer	7ecd94cc0b	Support/FileSystem: Add directory_iterator implementation. llvm-svn: 120989	2010-12-06 04:28:42 +00:00
Michael J. Spencer	95e4ac16a5	Support/PathV2: Fix append to not add a slash to empty or root paths. llvm-svn: 120988	2010-12-06 04:28:23 +00:00
Michael J. Spencer	39c4621f42	Support/Windows: Add ScopedHandle and move some clients over to it. llvm-svn: 120987	2010-12-06 04:28:13 +00:00
Che-Liang Chiou	9f2af628a6	ptx: add shift instructions llvm-svn: 120982	2010-12-06 04:00:03 +00:00
Rafael Espindola	baf2f3b3eb	Remove the getAddress getter, initialize Ordinal in the constructor and use that on the ELF writer to detect a section we created. llvm-svn: 120981	2010-12-06 03:48:09 +00:00
Rafael Espindola	46e40188f7	Simplify a bit. llvm-svn: 120980	2010-12-06 03:36:43 +00:00
Rafael Espindola	45790b82af	Use getSymbolOffset on the COFF writer. llvm-svn: 120979	2010-12-06 03:24:04 +00:00
Rafael Espindola	ac60adb38d	Don't use PadSectionToAlignment on windows. llvm-svn: 120978	2010-12-06 03:03:44 +00:00
Rafael Espindola	e7284c3671	Add a getSymbolOffset method and use it in the ELF writer. llvm-svn: 120977	2010-12-06 02:57:26 +00:00
Chris Lattner	94fbdf3814	Fix PR8728, a miscompilation I recently introduced. When optimizing memcpy's like: memcpy(A, B) memcpy(A, C) we cannot delete the first memcpy as dead if A and C might be aliases. If so, we actually get: memcpy(A, B) memcpy(A, A) which is not correct to transform into: memcpy(A, A) This patch was heavily influenced by Jakub Staszak's patch in PR8728, thanks Jakub! llvm-svn: 120974	2010-12-06 01:48:06 +00:00
Evan Cheng	abd6d2742a	Eliminate unneeded #include's. llvm-svn: 120971	2010-12-05 23:41:43 +00:00
NAKAMURA Takumi	70fbbf534b	ARM/CMakeLists.txt: Add missing MLxExpansionPass.cpp since r120960. llvm-svn: 120966	2010-12-05 23:08:57 +00:00
Evan Cheng	12f4d615ab	Code clean up. llvm-svn: 120965	2010-12-05 23:03:45 +00:00
Evan Cheng	b8a662f0d1	Remove an unused variable. llvm-svn: 120964	2010-12-05 23:03:35 +00:00
Cameron Zwarich	c7223a3e37	Some cleanup before I start committing some incremental progress on StrongPHIElimination. llvm-svn: 120961	2010-12-05 22:34:08 +00:00
Evan Cheng	62c7b5bf76	Making use of VFP / NEON floating point multiply-accumulate / subtraction is difficult on current ARM implementations for a few reasons. 1. Even though a single vmla has latency that is one cycle shorter than a pair of vmul + vadd, a RAW hazard during the first (4? on Cortex-a8) can cause additional pipeline stall. So it's frequently better to single codegen vmul + vadd. 2. A vmla folowed by a vmul, vmadd, or vsub causes the second fp instruction to stall for 4 cycles. We need to schedule them apart. 3. A vmla followed vmla is a special case. Obvious issuing back to back RAW vmla + vmla is very bad. But this isn't ideal either: vmul vadd vmla Instead, we want to expand the second vmla: vmla vmul vadd Even with the 4 cycle vmul stall, the second sequence is still 2 cycles faster. Up to now, isel simply avoid codegen'ing fp vmla / vmls. This works well enough but it isn't the optimial solution. This patch attempts to make it possible to use vmla / vmls in cases where it is profitable. A. Add missing isel predicates which cause vmla to be codegen'ed. B. Make sure the fmul in (fadd (fmul)) has a single use. We don't want to compute a fmul and a fmla. C. Add additional isel checks for vmla, avoid cases where vmla is feeding into fp instructions (except for the #3 exceptional case). D. Add ARM hazard recognizer to model the vmla / vmls hazards. E. Add a special pre-regalloc case to expand vmla / vmls when it's likely the vmla / vmls will trigger one of the special hazards. Work in progress, only A+B are enabled. llvm-svn: 120960	2010-12-05 22:04:16 +00:00
Cameron Zwarich	a3fb8cb3d4	Remove the PHIElimination.h header, as it is no longer needed. llvm-svn: 120959	2010-12-05 21:39:42 +00:00

1 2 3 4 5 ...

43841 Commits