llvm-project

Commit Graph

Author	SHA1	Message	Date
Hal Finkel	41e9b1d559	[PowerPC] Remove unused TM member variable to unbreak build Fix "error: private field 'TM' is not used [-Werror,-Wunused-private-field]" llvm-svn: 205660	2014-04-05 00:16:28 +00:00
Hal Finkel	de0b413ec0	[PowerPC] Adjust load/store costs in PPCTTI This provides more realistic costs for the insert/extractelement instructions (which are load/store pairs), accounts for the cheap unaligned Altivec load sequence, and for unaligned VSX load/stores. Bad news: MultiSource/Applications/sgefa/sgefa - 35% slowdown (this will require more investigation) SingleSource/Benchmarks/McGill/queens - 20% slowdown (we no longer vectorize this, but it was a constant store that was scalarized) MultiSource/Benchmarks/FreeBench/pcompress2/pcompress2 - 2% slowdown Good news: SingleSource/Benchmarks/Shootout/ary3 - 54% speedup SingleSource/Benchmarks/Shootout-C++/ary - 40% speedup MultiSource/Benchmarks/Ptrdist/ks/ks - 35% speedup MultiSource/Benchmarks/FreeBench/neural/neural - 30% speedup MultiSource/Benchmarks/TSVC/Symbolics-flt/Symbolics-flt - 20% speedup Unfortunately, estimating the costs of the stack-based scalarization sequences is hard, and adjusting these costs is like a game of whac-a-mole :( I'll revisit this again after we have better codegen for vector extloads and truncstores and unaligned load/stores. llvm-svn: 205658	2014-04-04 23:51:18 +00:00
Hal Finkel	b1308d525c	[PowerPC] PPCTTI Cleanup Remove the declaration of an unimplemented function. llvm-svn: 205657	2014-04-04 23:51:11 +00:00
Andrew Trick	326c1f6804	Minor change to StackMapLiveness DEBUG output. llvm-svn: 205656	2014-04-04 23:49:35 +00:00
Matt Arsenault	cf6f688a40	Add DAG parameter to ComputeNumSignBitsForTargetNode This way, you can check the number of sign bits in the operands. The depth parameter it already has is pretty useless without this. llvm-svn: 205649	2014-04-04 20:13:13 +00:00
Matt Arsenault	5e1e4316c4	Fix tabs llvm-svn: 205648	2014-04-04 20:13:08 +00:00
Juergen Ributzka	9dff139025	Update the test to use FileCheck. llvm-svn: 205647	2014-04-04 19:57:01 +00:00
Jim Grosbach	938fd46d2e	Tidy up naming. llvm-svn: 205633	2014-04-04 17:36:55 +00:00
Kai Nacke	6da86e8529	[mips] Add Octeon cnMips instructions seqi/snei and v3mulu/vmm0/vmulu. This patch adds the Octeon cnMips instructions seqi/snei and v3mulu/vmm0/vmulu. It is only for the assembler. Test case is included. Reviewed by: Daniel.Sanders@imgtec.com llvm-svn: 205631	2014-04-04 16:21:59 +00:00
Hal Finkel	fbf7e2a1a1	[PowerPC] Add a full condition code register to make the "cc" clobber work gcc inline asm supports specifying "cc" as a clobber of all condition registers. Add just enough modeling of the full register to make this work. Fixed PR19326. llvm-svn: 205630	2014-04-04 15:15:57 +00:00
Daniel Sanders	d4341a0ad7	[mips] abs.[ds], and neg.[ds] should be allowed regardless of -enable-no-nans-fp-math Summary: They behave in accordance with the Has2008 and ABS2008 configuration bits of the processor which are used to select between the 1985 and 2008 versions of IEEE 754. In 1985 mode, these instructions are arithmetic (i.e. they raise invalid operation exceptions when given NaN), in 2008 mode they are non-arithmetic (i.e. they are copies). nmadd.[ds], and nmsub.[ds] are still subject to -enable-no-nans-fp-math because the ISA spec does not explicitly state that they obey Has2008 and ABS2008. Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3274 llvm-svn: 205628	2014-04-04 14:52:54 +00:00
Tim Northover	0e5eaae1cb	DAGLegalize: add last-ditch type-legalization for VSELECT. When LLVM sees something like (v1iN (vselect v1i1, v1iN, v1iN)) it can decide that the result is OK (v1i64 is legal on AArch64, for example) but it still needs scalarising because of that v1i1. There was no code to do this though. AArch64 and ARM64 have DAG combines to produce efficient code and prevent that occurring in most such situations, but there are edge cases that they miss. This adds a legalization to cope with that. llvm-svn: 205626	2014-04-04 14:49:30 +00:00
Tim Northover	07a8ff4892	ARM64: handle v1i1 types arising from setcc properly. There were several overlapping problems here, and this solution is closely inspired by the one adopted in AArch64 in r201381. Firstly, scalarisation of v1i1 setcc operations simply fails if the input types are legal. This is fixed in LegalizeVectorTypes.cpp this time, and allows AArch64 code to be simplified slightly. Second, vselect with such a setcc feeding into it ends up in ScalarizeVectorOperand, where it's not handled. I experimented with an implementation, but found that whatever DAG came out was rather horrific. I think Hao's DAG combine approach is a good one for quality, though there are edge cases it won't catch (to be fixed separately). Should fix PR19335. llvm-svn: 205625	2014-04-04 14:49:21 +00:00
Stepan Dyatkovskiy	3f1fa3d545	Fix for PR18921 (LDRD/STRD part): Removed "GNU Assembler extension (compatibility)" definitions from ARMInstrInfo.td Fixed ARMAsmParser::ParseInstruction GNU compatibility branch, so it also works for thumb mode from now. Added new tests. llvm-svn: 205622	2014-04-04 10:17:56 +00:00
NAKAMURA Takumi	a25ac912eb	Tweak unconditional-branch.ll passing on any hosts, while investigating x86_64-mingw32. Sorry for the breakage. For now, it will fail in two ways: 1. To fail for targeting x86_64-mingw32. <stdin>:131:8: note: possible intended match here 0x30830a0100000002 3 0 1 0 0 is_stmt 2. To fail not to find the target x86. llc: : error: unable to get target for 'x86_64-unknown-unknown', see --version and --triple. llvm-svn: 205621	2014-04-04 10:16:51 +00:00
Tim Northover	85d6a16c46	ARM64: use regalloc-friendly COPY_TO_REGCLASS for bitcasts The previous patterns directly inserted FMOV or INS instructions into the DAG for scalar_to_vector & bitconvert patterns. This is horribly inefficient and can generate lots more GPR <-> FPR register traffic than necessary. It's much better to emit instructions the register allocator understands so it can coalesce the copies when appropriate. It led to at least one ISelLowering hack to avoid the problems, which was incorrect for v1i64 (FPR64 has no dsub). It can now be removed entirely. This should also fix PR19331. llvm-svn: 205616	2014-04-04 09:03:09 +00:00
Tim Northover	1e4f2c5e5f	ARM64: add 128-bit MLA operations to the custom selection code. Without this change, the llvm_unreachable kicked in. The code pattern being spotted is rather non-canonical for 128-bit MLAs, but it can happen and there's no point in generating sub-optimal code for it just because it looks odd. Should fix PR19332. llvm-svn: 205615	2014-04-04 09:03:02 +00:00
Stepan Dyatkovskiy	a09bd2379c	Fixed register class in STRD instruction for Thumb2 mode. llvm-svn: 205612	2014-04-04 08:14:13 +00:00
Craig Topper	840beec2d0	Make consistent use of MCPhysReg instead of uint16_t throughout the tree. llvm-svn: 205610	2014-04-04 05:16:06 +00:00
Jim Grosbach	08d57b951c	Fix spelling. Sigh. llvm-svn: 205605	2014-04-04 02:14:38 +00:00
Jim Grosbach	537f3ed838	ARM: Range based for-loop over block predecessors. No functional change. llvm-svn: 205604	2014-04-04 02:11:03 +00:00
Jim Grosbach	9ef3ad960d	Add iterator_ranges for block pred/succ. llvm-svn: 205603	2014-04-04 02:10:59 +00:00
Jim Grosbach	f92e8f5a8b	ARM: Use range-based for loops in frame lowering. No functional change. llvm-svn: 205602	2014-04-04 02:10:55 +00:00
Quentin Colombet	96bd2a1490	[RegAllocGreedy][Last Chance Recoloring] Emit diagnostics when last chance recoloring cut-offs are encountered and register allocation failed. This is related to PR18747 Patch by MAYUR PANDEY <mayur.p@samsung.com>. llvm-svn: 205601	2014-04-04 02:05:21 +00:00
Quentin Colombet	9c816f39ad	Revert r205599, the commit was not intended to have so many changes llvm-svn: 205600	2014-04-04 02:02:49 +00:00
Quentin Colombet	7ee4e79dec	[RegAllocGreedy][Last Chance Recoloring] Emit diagnostics when last chance recoloring cut-offs are hit. This is related to PR18747. Patch by MAYUR PANDEY <mayur.p@samsung.com> llvm-svn: 205599	2014-04-04 01:58:57 +00:00
Saleem Abdulrasool	c351ed2966	ARM: fix test case missed in previous roundup This should hopefully bring the last MSVC buildbot back to green! llvm-svn: 205596	2014-04-04 01:19:56 +00:00
Saleem Abdulrasool	a7a8a3e3ee	MIPS: remove vim swap file llvm-svn: 205595	2014-04-04 01:19:54 +00:00
Rafael Espindola	7247546ba3	Add an assert that this is only used with .o files. I am not sure how to get a relocation in a .dylib, but this function would return the wrong value if passed one. llvm-svn: 205592	2014-04-04 00:31:12 +00:00
Rafael Espindola	7e91bc9e32	Implement getRelocationAddress for MachO and ET_REL elf files. With that, fix the symbolizer to work with any ELF file. llvm-svn: 205588	2014-04-03 23:54:35 +00:00
Rafael Espindola	128b8111d7	Implement macho relocation iterators with section number + relocation number. This will make it possible to implement getRelocationAddress. llvm-svn: 205587	2014-04-03 23:51:28 +00:00
Saleem Abdulrasool	905b6d192c	ARM: yet another round of ARM test clean ups llvm-svn: 205586	2014-04-03 23:47:24 +00:00
Jim Grosbach	b8bd4a5e2a	Tidy up. Space before ':' in range-based for loops. llvm-svn: 205585	2014-04-03 23:43:26 +00:00
Jim Grosbach	bb1af943bb	Tidy up. 80 columns. llvm-svn: 205584	2014-04-03 23:43:22 +00:00
Jim Grosbach	1a59711505	Tidy up. Trailing whitespace. llvm-svn: 205583	2014-04-03 23:43:18 +00:00
Jim Grosbach	e04eb1dc12	Fix typo. llvm-svn: 205582	2014-04-03 23:43:12 +00:00
Rafael Espindola	0cc9ba116f	Fix llvm-objdump crash. llvm-svn: 205581	2014-04-03 23:20:02 +00:00
Rafael Espindola	77314aa014	Remove section_rel_empty. Just compare begin() and end() instead. llvm-svn: 205577	2014-04-03 22:42:22 +00:00
Rafael Espindola	c498415086	Reuse existing variable. llvm-svn: 205572	2014-04-03 21:48:41 +00:00
Eli Bendersky	bbef172f19	Optimize away unnecessary address casts. Removes unnecessary casts from non-generic address spaces to the generic address space for certain code patterns. Patch by Jingyue Wu. llvm-svn: 205571	2014-04-03 21:18:25 +00:00
Lang Hames	cb74fa696b	[ARM64] Teach the ARM64DeadRegisterDefinition pass to respect implicit-defs. When rematerializing through truncates, the coalescer may produce instructions with dead defs, but live implicit-defs of subregs: E.g. %X1<def,dead> = MOVi64imm 2, %W1<imp-def>; %X1:GPR64, %W1:GPR32 These instructions are live, and their definitions should not be rewritten. Fixes <rdar://problem/16492408> llvm-svn: 205565	2014-04-03 20:51:08 +00:00
NAKAMURA Takumi	4dca4d8bbd	unconditional-branch.ll is broken for targeting x86_64-cygming. Add an explicit triple for now. llvm-svn: 205563	2014-04-03 20:40:37 +00:00
Tom Stellard	a0150cb6a9	R600: Correct opcode for BFE_INT According to AMD documentation, the correct opcode for BFE_INT is 0x5, not 0x4 Fixes Arithm/Absdiff.Mat/3 OpenCV test Patch by: Bruno Jiménez llvm-svn: 205562	2014-04-03 20:19:29 +00:00
Tom Stellard	7ed0b5235a	R600/SI: Lower 64-bit immediates using REG_SEQUENCE llvm-svn: 205561	2014-04-03 20:19:27 +00:00
NAKAMURA Takumi	c5acee0f20	Revert r205551, "Attempt to XFAIL this on mingw and cygwin hosts." It didn't fail on cygming. That said, it emits errors to the stderr (with exit(0)); error: failed to compute relocation: IMAGE_REL_I386_SECREL error: failed to compute relocation: IMAGE_REL_I386_SECREL error: failed to compute relocation: IMAGE_REL_I386_SECREL error: failed to compute relocation: IMAGE_REL_I386_SECREL error: failed to compute relocation: IMAGE_REL_I386_SECREL error: failed to compute relocation: IMAGE_REL_I386_DIR32 error: failed to compute relocation: IMAGE_REL_I386_SECREL error: failed to compute relocation: IMAGE_REL_I386_DIR32 error: failed to compute relocation: IMAGE_REL_I386_SECREL error: failed to compute relocation: IMAGE_REL_I386_SECREL error: failed to compute relocation: IMAGE_REL_I386_DIR32 llvm-svn: 205560	2014-04-03 20:08:02 +00:00
NAKAMURA Takumi	8ff866c24e	llvm/test/CodeGen/X86/peephole-multiple-folds.ll: Relax expressions to satisfy win32. llvm-svn: 205559	2014-04-03 20:07:51 +00:00
Eric Christopher	5bdaea85cb	Attempt to XFAIL this on mingw and cygwin hosts. The line table on these is very much off and is more than just the branch from this bug incorrect: Address Line Column File ISA Discriminator Flags ------------------ ------ ------ ------ --- ------------- ------------- 0x30830a0100000002 3 0 1 0 0 is_stmt 0x30830a0100000008 3 0 1 0 0 is_stmt end_sequence llvm-svn: 205551	2014-04-03 18:23:52 +00:00
Eli Bendersky	9966b26dac	Fix PR19270 - type mismatch caused by invalid optimization. Patch by Jingyue Wu. llvm-svn: 205547	2014-04-03 17:51:58 +00:00
Eric Christopher	bc79fddb9a	Loosen up check so that we can pass on platforms that generate slightly more verbose than needed line tables, e.g.: Address Line Column File ISA Discriminator Flags ------------------ ------ ------ ------ --- ------------- ------------- 0x0000000000000000 1 0 1 0 0 is_stmt 0x0000000000000000 1 0 1 0 0 is_stmt prologue_end 0x0000000000000010 2 0 1 0 0 is_stmt 0x0000000000000018 4 0 1 0 0 is_stmt these should probably be looked at, but it isn't affecting the correctness of the testcase. llvm-svn: 205546	2014-04-03 17:40:08 +00:00
Saleem Abdulrasool	717c991923	ARM: update even more tests More updating of tests to be explicit about the target triple rather than relying on the default target triple supporting ARM mode. Indicate to lit that object emission is not yet available for Windows on ARM. llvm-svn: 205545	2014-04-03 17:35:22 +00:00


101897 Commits