llvm-project

Commit Graph

Author	SHA1	Message	Date
Ulrich Weigand	994f49ed79	[PowerPC] Fix processing of ha16/lo16 fixups The current PowerPC MC back end distinguishes between fixup_ppc_ha16 and fixup_ppc_lo16, which are determined by the instruction the fixup applies to, and uses this distinction to decide whether a fixup ought to resolve to the high or the low part of a symbol address. This isn't quite correct, however. It is valid -if unusual- assembler to use, e.g. li 1, symbol@ha or lis 1, symbol@l Whether the high or the low part of the address is used depends solely on the @ suffix, not on the instruction. In addition, both li 1, symbol and lis 1, symbol are valid, assuming the symbol address fits into 16 bits; again, both will then refer to the actual symbol value (so li will load the value itself, while lis will load the value shifted by 16). To fix this, two places need to be adapted. If the fixup cannot be resolved at assembler time, a relocation needs to be emitted via PPCELFObjectWriter::getRelocType. This routine already looks at the VK_ type to determine the relocation. The only problem is that it will reject any _LO modifier in a ha16 fixup and vice versa. This is simply incorrect; any of those modifiers ought to be accepted for either fixup type. If the fixup can be resolved at assembler time, adjustFixupValue currently selects the high bits of the symbol value if the fixup type is ha16. Again, this is incorrect; see the above example lis 1, symbol Now, in theory we'd have to respect a VK_ modifier here. However, in fact common code never even attempts to resolve symbol references using any nontrivial VK_ modifier at assembler time; it will always fall back to emitting a reloc and letting the linker handle it. If this ever changes, presumably there'd have to be a target callback to resolve VK_ modifiers. We'd then have to handle @ha etc. there. llvm-svn: 182091	2013-05-17 12:36:29 +00:00
Benjamin Kramer	2057a2b86f	Don't cast away constness. llvm-svn: 182086	2013-05-17 11:39:41 +00:00
David Tweed	2e7efedd39	Minor changes to the MCJITTest unittests to use the correct API for finalizing the JIT object (including XFAIL an ARM test that now needs fixing). Also renames internal function for consistency. llvm-svn: 182085	2013-05-17 10:01:46 +00:00
Christian Konig	b7be72df5b	R600/SI: return undef instead of null for skipped arguments This is a candidate for the stable branch. Fixes: https://bugs.freedesktop.org/show_bug.cgi?id=64694 Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> Tested-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 182084	2013-05-17 09:46:48 +00:00
Venkatraman Govindaraju	54bf611c79	[Sparc] Prevent instructions that define or use %o7 from being in a call's delay slot. llvm-svn: 182063	2013-05-16 23:53:29 +00:00
Adrian Prantl	9c93059aa4	Generate debug info for by-value struct args even if they are not used. radar://problem/13865940 llvm-svn: 182062	2013-05-16 23:44:12 +00:00
Akira Hatanaka	252f54f769	[mips] Improve instruction selection for pattern (store (fp_to_sint $src), $ptr). Previously, three instructions were needed: trunc.w.s $f0, $f2 mfc1 $4, $f0 sw $4, 0($2) Now we need only two: trunc.w.s $f0, $f2 swc1 $f0, 0($2) llvm-svn: 182053	2013-05-16 21:17:15 +00:00
Rafael Espindola	b08d2c2db0	Remove addFrameMove. Now that we have good testing, remove addFrameMove and create cfi instructions directly. llvm-svn: 182052	2013-05-16 21:02:15 +00:00
Akira Hatanaka	d82ee940c3	[mips] Factor out unaligned store lowering code. llvm-svn: 182050	2013-05-16 20:45:17 +00:00
Jack Carter	03f0fd37a9	Mips assembler: Add TwoOperandConstraint definitions This patch removes alias definition for addiu $rs,$imm and instead uses the TwoOperandAliasConstraint field in the ArithLogicI instruction class. This way all instructions that inherit ArithLogicI class have the same macro defined. The usage examples are added to test files. Patch by Vladimir Medic llvm-svn: 182048	2013-05-16 20:24:27 +00:00
Jack Carter	59817110ff	Mips td file formatting: white space and long lines llvm-svn: 182047	2013-05-16 20:08:49 +00:00
Hal Finkel	5f587c59a5	Create a new preheader in PPCCTRLoops to avoid counter register clobbers Some IR-level instructions (such as FP <-> i64 conversions) are not chained w.r.t. the mtctr intrinsic and yet may become function calls that clobber the counter register. At the selection-DAG level, these might be reordered with the mtctr intrinsic causing miscompiles. To avoid this situation, if an existing preheader has instructions that might use the counter register, create a new preheader for the mtctr intrinsic. This extra block will be remerged with the old preheader at the MI level, but will prevent unwanted reordering at the selection-DAG level. llvm-svn: 182045	2013-05-16 19:58:38 +00:00
Akira Hatanaka	fce4dd7974	[mips] Test case for r182042. Add comment. llvm-svn: 182044	2013-05-16 19:57:23 +00:00
Akira Hatanaka	39d40f7baf	[mips] Fix instruction selection pattern for sint_to_fp node to avoid emitting an invalid instruction sequence. Rather than emitting an int-to-FP move instruction and an int-to-FP conversion instruction during instruction selection, we emit a pseudo instruction which gets expanded post-RA. Without this change, register allocation can possibly insert a floating point register move instruction between the two instructions, which is not valid according to the ISA manual. mtc1 $f4, $4 # int-to-fp move instruction. mov.s $f2, $f4 # move contents of $f4 to $f2. cvt.s.w $f0, $f2 # int-to-fp conversion. llvm-svn: 182042	2013-05-16 19:48:37 +00:00
Jack Carter	51785c4715	Mips assembler: Add branch macro definitions This patch adds bnez and beqz instructions which represent alias definitions for bne and beq instructions as follows: bnez $rs,$imm => bne $rs,$zero,$imm beqz $rs,$imm => beq $rs,$zero,$imm The corresponding test cases are added. Patch by Vladimir Medic llvm-svn: 182040	2013-05-16 19:40:19 +00:00
Benjamin Kramer	fc88c3761f	DAGCombine: Also shrink eq compares where the constant is exactly as large as the smaller type. if ((x & 255) == 255) before: movzbl %al, %eax cmpl $255, %eax after: cmpb $-1, %al llvm-svn: 182038	2013-05-16 18:47:58 +00:00
Akira Hatanaka	21bab5badc	[mips] Fix indentation. llvm-svn: 182036	2013-05-16 18:42:42 +00:00
Akira Hatanaka	7b6e4f1366	[mips] Delete unused enum value. llvm-svn: 182035	2013-05-16 18:40:12 +00:00
Jakob Stoklund Olesen	9ae96c7aab	Add TargetRegisterInfo::getCoveringLanes(). This lane mask provides information about which register lanes completely cover super-registers. See the block comment before getCoveringLanes(). llvm-svn: 182034	2013-05-16 18:03:08 +00:00
Ulrich Weigand	9d980cbdb9	[PowerPC] Use true offset value in "memrix" machine operands This is the second part of the change to always return "true" offset values from getPreIndexedAddressParts, tackling the case of "memrix" type operands. This is about instructions like LD/STD that only have a 14-bit field to encode immediate offsets, which are implicitly extended by two zero bits by the machine, so that in effect we can access 16-bit offsets as long as they are a multiple of 4. The PowerPC back end currently handles such instructions by carrying the 14-bit value (as it will get encoded into the actual machine instructions) in the machine operand fields for such instructions. This means that those values are in fact not the true offset, but rather the offset divided by 4 (and then truncated to an unsigned 14-bit value). Like in the case fixed in r182012, this makes common code operations on such offset values not work as expected. Furthermore, there doesn't really appear to be any strong reason why we should encode machine operands this way. This patch therefore changes the encoding of "memrix" type machine operands to simply contain the "true" offset value as a signed immediate value, while enforcing the rules that it must fit in a 16-bit signed value and must also be a multiple of 4. This change must be made simultaneously in all places that access machine operands of this type. However, just about all those changes make the code simpler; in many cases we can now just share the same code for memri and memrix operands. llvm-svn: 182032	2013-05-16 17:58:02 +00:00
Hal Finkel	47db66d43f	PPC32 cannot form counter loops around i64 FP conversions On PPC32, i64 FP conversions are implemented using runtime calls (which clobber the counter register). These must be excluded. llvm-svn: 182023	2013-05-16 16:52:41 +00:00
Aaron Ballman	b4284e6cb6	Fixing a 64-bit conversion warning in MSVC. llvm-svn: 182018	2013-05-16 16:03:36 +00:00
Rafael Espindola	63d2e0ad9a	Remove dead calls to addFrameMove. Without a PROLOG_LABEL present, the cfi instructions are never printed. llvm-svn: 182016	2013-05-16 15:08:37 +00:00
Ulrich Weigand	7aa76b6a07	[PowerPC] Report true displacement value from getPreIndexedAddressParts DAGCombiner::CombineToPreIndexedLoadStore calls a target routine to decompose a memory address into a base/offset pair. It expects the offset (if constant) to be the true displacement value in order to perform optional additional optimizations; in particular, to convert other uses of the original pointer into uses of the new base pointer after pre-increment. The PowerPC implementation of getPreIndexedAddressParts, however, simply calls SelectAddressRegImm, which returns a TargetConstant. This value is appropriate for encoding into the instruction, but it is not always usable as true displacement value: - Its type is always MVT::i32, even on 64-bit, where addresses ought to be i64 ... this causes the optimization to simply always fail on 64-bit due to this line in DAGCombiner: // FIXME: In some cases, we can be smarter about this. if (Op1.getValueType() != Offset.getValueType()) { - Its value is truncated to an unsigned 16-bit value if negative. This causes the above optimization to generate wrong code. This patch fixes both problems by simply returning the true displacement value (in its original type). This doesn't affect any other user of the displacement. llvm-svn: 182012	2013-05-16 14:53:05 +00:00
Richard Sandiford	7fdd268b68	[SystemZ] Tweak register array comment llvm-svn: 182007	2013-05-16 13:39:02 +00:00
Evgeniy Stepanov	1e7643243d	[msan] Switch TLS globals to initial-exec model. They are always defined in the main executable. llvm-svn: 181994	2013-05-16 09:14:05 +00:00
Patrik Hagglund	b3391b58f7	Removed unused variable, detected by gcc -Wunused-but-set-variable. Leftover from r181979. llvm-svn: 181993	2013-05-16 08:37:22 +00:00
Rafael Espindola	7242186b10	Delete dead code. llvm-svn: 181982	2013-05-16 04:59:17 +00:00
Rafael Espindola	e3d5e5354e	Don't call addFrameMove on XCore. getExceptionHandlingType is not ExceptionHandling::DwarfCFI on xcore, so getFrameInstructions is never called. There is no point creating cfi instructions if they are never used. llvm-svn: 181979	2013-05-16 04:16:25 +00:00
Richard Smith	e04f0d34d1	Respect the 'nobuiltin' attribute when determining if a call is to a memory builtin. llvm-svn: 181978	2013-05-16 04:12:04 +00:00
Rafael Espindola	6e8c0d94f8	Removed dead code. llvm-svn: 181975	2013-05-16 03:34:58 +00:00
Reed Kotler	515e937685	Patch number 2 for mips16/32 floating point interoperability stubs. This creates stubs that help Mips32 functions call Mips16 functions which have floating point parameters that are normally passed in floating point registers. llvm-svn: 181972	2013-05-16 02:17:42 +00:00
Derek Schuff	36f00d9f02	Revert "Support unaligned load/store on more ARM targets" This reverts r181898. llvm-svn: 181944	2013-05-15 23:07:43 +00:00
Eli Bendersky	b8cd7a0d7f	Remove dead code. This method is not being used/tested anywhere. llvm-svn: 181943	2013-05-15 22:41:28 +00:00
Arnold Schwaighofer	88e7fddc8c	LoopVectorize: Move call of canHoistAllLoads to canVectorizeWithIfConvert We only want to check this once, not for every conditional block in the loop. No functionality change (except that we don't perform a check redundantly anymore). llvm-svn: 181942	2013-05-15 22:38:14 +00:00
Rafael Espindola	84ee6c40a8	Delete dead code. llvm-svn: 181941	2013-05-15 22:27:35 +00:00
Hal Finkel	80267a0a37	undef setjmp in PPCCTRLoops Trying to unbreak the VS build by copying some undef code from Utils/LowerInvoke.cpp. llvm-svn: 181938	2013-05-15 22:20:24 +00:00
David Majnemer	8f16974273	X86: Remove redundant test instructions Increase the number of instructions LLVM recognizes as setting the ZF flag. This allows us to remove test instructions that redundantly recalculate the flag. llvm-svn: 181937	2013-05-15 22:03:08 +00:00
Hal Finkel	25c1992bc7	Implement PPC counter loops as a late IR-level pass The old PPCCTRLoops pass, like the Hexagon pass version from which it was derived, could only handle some simple loops in canonical form. We cannot directly adapt the new Hexagon hardware loops pass, however, because the Hexagon pass contains a fundamental assumption that non-constant-trip-count loops will contain a guard, and this is not always true (the result being that incorrect negative counts can be generated). With this commit, we replace the pass with a late IR-level pass which makes use of SE to calculate the backedge-taken counts and safely generate the loop-count expressions (including any necessary max() parts). This IR level pass inserts custom intrinsics that are lowered into the desired decrement-and-branch instructions. The most fragile part of this new implementation is that interfering uses of the counter register must be detected on the IR level (and, on PPC, this also includes any indirect branches in addition to function calls). Also, to make all of this work, we need a variant of the mtctr instruction that is marked as having side effects. Without this, machine-code level CSE, DCE, etc. illegally transform the resulting code. Hopefully, this can be improved in the future. This new pass is smaller than the original (and much smaller than the new Hexagon hardware loops pass), and can handle many additional cases correctly. In addition, the preheader-creation code has been copied from LoopSimplify, and after we decide on where it belongs, this code will be refactored so that it can be explicitly shared (making this implementation even smaller). The new test-case files ctrloop-{le,lt,ne}.ll have been adapted from tests for the new Hexagon pass. There are a few classes of loops that this pass does not transform (noted by FIXMEs in the files), but these deficiencies can be addressed within the SE infrastructure (thus helping many other passes as well). llvm-svn: 181927	2013-05-15 21:37:41 +00:00
Hal Finkel	1f6a7f53d8	Fix legalization of SETCC with promoted integer intrinsics If the input operands to SETCC are promoted, we need to make sure that we either use the promoted form of both operands (or neither); a mixture is not allowed. This can happen, for example, if a target has a custom promoted i1-returning intrinsic (where i1 is not a legal type). In this case, we need to use the promoted form of both operands. This change only augments the behavior of the existing logic in the case where the input types (which may or may not have already been legalized) disagree, and should not affect existing target code because this case would otherwise cause an assert in the SETCC operand promotion code. This will be covered by (essentially all of the) tests for the new PPCCTRLoops infrastructure. llvm-svn: 181926	2013-05-15 21:37:27 +00:00
Derek Schuff	d2c42d766d	Fix miscompile due to StackColoring incorrectly merging stack slots (PR15707) IR optimisation passes can result in a basic block that contains: llvm.lifetime.start(%buf) ... llvm.lifetime.end(%buf) ... llvm.lifetime.start(%buf) Before this change, calculateLiveIntervals() was ignoring the second lifetime.start() and was regarding %buf as being dead from the lifetime.end() through to the end of the basic block. This can cause StackColoring to incorrectly merge %buf with another stack slot. Fix by removing the incorrect Starts[pos].isValid() and Finishes[pos].isValid() checks. Just doing: Starts[pos] = Indexes->getMBBStartIdx(MBB); Finishes[pos] = Indexes->getMBBEndIdx(MBB); unconditionally would be enough to fix the bug, but it causes some test failures due to stack slots not being merged when they were before. So, in order to keep the existing tests passing, treat LiveIn and LiveOut separately rather than approximating the live ranges by merging LiveIn and LiveOut. This fixes PR15707. Patch by Mark Seaborn. llvm-svn: 181922	2013-05-15 21:15:09 +00:00
Rafael Espindola	0f2a6fe613	Cleanup relocation sorting for ELF. We want the order to be deterministic on all platforms. NAKAMURA Takumi fixed that in r181864. This patch is just two small cleanups: * Move the function to the cpp file. It is only passed to array_pod_sort. * Remove the ppc implementation which is now redundant llvm-svn: 181910	2013-05-15 18:22:01 +00:00
NAKAMURA Takumi	dc9f013a5d	PPCISelLowering.h: Escape \@ in comments. [-Wdocumentation] llvm-svn: 181907	2013-05-15 18:01:35 +00:00
NAKAMURA Takumi	dcc66456cc	Whitespace. llvm-svn: 181906	2013-05-15 18:01:28 +00:00
Michael Gottesman	b4e7f4d841	[objc-arc] Fixed a spelling error and made the statistic descriptions be consistent about their usage of periods. llvm-svn: 181901	2013-05-15 17:43:03 +00:00
Derek Schuff	72ddaba785	Support unaligned load/store on more ARM targets This patch matches GCC behavior: the code used to only allow unaligned load/store on ARM for v6+ Darwin, it will now allow unaligned load/store for v6+ Darwin as well as for v7+ on other targets. The distinction is made because v6 doesn't guarantee support (but LLVM assumes that Apple controls hardware+kernel and therefore have conformant v6 CPUs), whereas v7 does provide this guarantee (and Linux behaves sanely). Overall this should slightly improve performance in most cases because of reduced I$ pressure. Patch by JF Bastien llvm-svn: 181897	2013-05-15 16:08:30 +00:00
Ulrich Weigand	0684076858	Remove MCELFObjectTargetWriter::adjustFixupOffset hack Now that PowerPC no longer uses adjustFixupOffset, and no other back-end (ever?) did, we can remove the infrastructure itself (incidentally addressing a FIXME to that effect). llvm-svn: 181895	2013-05-15 15:07:42 +00:00
Ulrich Weigand	2fb140ef31	[PowerPC] Remove need for adjustFixupOffset hack Now that applyFixup understands differently-sized fixups, we can define fixup_ppc_lo16/fixup_ppc_lo16_ds/fixup_ppc_ha16 to properly be 2-byte fixups, applied at an offset of 2 relative to the start of the instruction text. This has the benefit that if we actually need to generate a real relocation record, its address will come out correctly automatically, without having to fiddle with the offset in adjustFixupOffset. Tested on both 64-bit and 32-bit PowerPC, using external and integrated assembler. llvm-svn: 181894	2013-05-15 15:07:06 +00:00
Richard Sandiford	ffd144174d	[SystemZ] Make use of SUBTRACT HALFWORD Thanks to Ulrich Weigand for noticing that this instruction was missing. llvm-svn: 181893	2013-05-15 15:05:29 +00:00
Ulrich Weigand	56f5b28d2e	[PowerPC] Correctly handle fixups of other than 4 byte size The PPCAsmBackend::applyFixup routine handles the case where a fixup can be resolved within the same object file. However, this routine is currently hard-coded to assume the size of any fixup is always exactly 4 bytes. This is sort-of correct for fixups on instruction text; even though it only works because several of what really would be 2-byte fixups are presented as 4-byte fixups instead (requiring another hack in PPCELFObjectWriter::adjustFixupOffset to clean it up). However, this assumption breaks down completely for fixups on data, which legitimately can be of any size (1, 2, 4, or 8). This patch makes applyFixup aware of fixups of varying sizes, introducing a new helper routine getFixupKindNumBytes (along the lines of what the ARM back end does). Note that in order to handle fixups of size 8, we also need to fix the return type of adjustFixupValue to uint64_t to avoid truncation. Tested on both 64-bit and 32-bit PowerPC, using external and integrated assembler. llvm-svn: 181891	2013-05-15 15:01:46 +00:00

61297 Commits