llvm-project

Commit Graph

Author	SHA1	Message	Date
Tim Northover	be1d1b6681	ARM64: don't emit .subsections_via_symbols on ELF. Part of PR19455. llvm-svn: 206610	2014-04-18 14:54:41 +00:00
Tim Northover	be3941cc79	ARM64: add extra NEG pattern. llvm-svn: 206609	2014-04-18 14:54:35 +00:00
Tim Northover	e3028832d1	AArch64/ARM64: add non-scalar lowering for more FCVT operations. llvm-svn: 206591	2014-04-18 13:16:42 +00:00
Tim Northover	01f315a556	AArch64/ARM64: improve spotting of EXT instructions from VECTOR_SHUFFLE. We couldn't cope if the first mask element was UNDEF before, which isn't ideal. llvm-svn: 206588	2014-04-18 12:50:58 +00:00
Tim Northover	a2c4c71c12	AArch64/ARM64: spot a greater variety of concat_vector operations. Code mostly copied from AArch64, just tidied up a trifle and plumbed into the ARM64 way of doing things. This also enables the AArch64 tests which inspired the previous untested commits. llvm-svn: 206574	2014-04-18 09:31:27 +00:00
Tim Northover	848bb3ced5	ARM64: implement cunning optimisation from AArch64 A vector extract followed by a dup can become a single instruction even if the types don't match. AArch64 handled this in ISelLowering, but a few reasonably simple patterns can take care of it in TableGen, so that's where I've put it. llvm-svn: 206573	2014-04-18 09:31:20 +00:00
Tim Northover	5ec51a8981	ARM64: spot a vector_shuffle that maps to INS and expand. Tests will be coming very shortly when all the optimisations needed to support AArch64's neon-copy.ll file are committed. llvm-svn: 206572	2014-04-18 09:31:15 +00:00
Tim Northover	46d98ea8de	ARM64: nick some AArch64 patterns for extract/insert -> INS. Tests will be committed shortly when all optimisations needed to support AArch64's neon-copy.ll file are supported. llvm-svn: 206571	2014-04-18 09:31:11 +00:00
Tim Northover	8b2fa3dfef	AArch64/ARM64: emit all vector FP comparisons as such. ARM64 was scalarizing some vector comparisons which don't quite map to AArch64's compare and mask instructions. AArch64's approach of sacrificing a little efficiency to emulate them with the limited set available was better, so I ported it across. More "inspired by" than copy/paste since the backend's internal expectations were a bit different, but the tests were invaluable. llvm-svn: 206570	2014-04-18 09:31:07 +00:00
Tim Northover	0a44e66bb8	AArch64/ARM64: port BSL logic from AArch64 & enable test. I enhanced it a little in the process. The decision shouldn't really be beased on whether a BUILD_VECTOR is a splat: any set of constants will do the job provided they're related in the correct way. Also, the BUILD_VECTOR could be any operand of the incoming AND nodes, so it's best to check for all 4 possibilities rather than assuming it'll be the RHS. llvm-svn: 206569	2014-04-18 09:31:01 +00:00
Tim Northover	547a4ae6fa	AArch64/ARM64: copy byval implementation from AArch64. It's not actually used to handle C or C++ ABI rules on ARM64, but could well be emitted by other language front-ends, so it's as well to have a sensible implementation. llvm-svn: 206568	2014-04-18 09:30:52 +00:00
Jim Grosbach	6bfe18a365	[ARM64,C++11] Range'ify another loop. llvm-svn: 206539	2014-04-17 23:41:57 +00:00
Louis Gerbarg	153e695ee2	Improve ARM64 vector creation This patch improves the performance of vector creation in caseiswhere where several of the lanes in the vector are a constant floating point value. It also includes new patterns to fold together some of the instructions when the value is 0.0f. Test cases included. rdar://16349427 llvm-svn: 206496	2014-04-17 20:51:50 +00:00
Jim Grosbach	0fba6d98fc	ARM64: [su]xtw use W regs as inputs, not X regs. Update the SXT[BHW]/UXTW instruction aliases and the shifted reg addressing mode handling. PR19455 and rdar://16650642 llvm-svn: 206495	2014-04-17 20:47:31 +00:00
Tim Northover	11a6082e33	ARM64: switch to IR-based atomic operations. Goodbye code! (Game: spot the bug fixed by the change). llvm-svn: 206490	2014-04-17 20:00:33 +00:00
Tim Northover	0129f298c4	ARM64: add acquire/release versions of the existing atomic intrinsics. These will be needed to support IR-level lowering of atomic operations. llvm-svn: 206489	2014-04-17 20:00:24 +00:00
Adam Nemet	287f989dde	[ARM64] Fix "Cannot select" for vector ctpop The commit of r205855: Author: Arnold Schwaighofer <aschwaighofer@apple.com> Date: Wed Apr 9 14:20:47 2014 +0000 SLPVectorizer: Only vectorize intrinsics whose operands are widened equally The vectorizer only knows how to vectorize intrinics by widening all operands by the same factor. Patch by Tyler Nowicki! exposed a backend bug causing a regression (Cannot select ctpop). The commit msg is a bit confusing because the patch actually changes the behavior for the loop-vectorizer as well. As things got refactored into a helper ctpop got snuck in to the trivially-vectorizable helper which is now used by both vectorizers. In other words, we started seeing vector-ctpops in the backend. This change makes ctpop LegalizeAction::Expand for the types not supported by the byte-only CNT instruction. We may be able to custom-lower these later to a single CNT but this is to fix the compiler crash first. Fixes <rdar://problem/16578951> llvm-svn: 206433	2014-04-17 01:01:37 +00:00
Aaron Ballman	5f1378c2a4	Replacing a non-ASCII character in a comment with an ASCII character. Fixes a C4819 warning in MSVC. llvm-svn: 206403	2014-04-16 17:09:20 +00:00
Tim Northover	ef7b34d403	ARM64: silence sign-comparison warning. llvm-svn: 206393	2014-04-16 15:28:06 +00:00
Tim Northover	3e69958b6b	AArch64/ARM64: produce correct relocation for conditional branches. llvm-svn: 206391	2014-04-16 15:27:52 +00:00
Tim Northover	3ec1de7767	AArch64/ARM64: port across stub handling for ELF C++ exceptions. The most important part here is that we should actuall emit the stubs we refer to in the exception table, but as a side issue this uses more sensible & GCC compatible representations for some of the bits of information. llvm-svn: 206380	2014-04-16 11:52:55 +00:00
Tim Northover	18f68f6d1a	ARM64: use 32-bit moves for constants where possible. If we know that a particular 64-bit constant has all high bits zero, then we can rely on the fact that 32-bit ARM64 instructions automatically zero out the high bits of an x-register. This gives the expansion logic less constraints to satisfy and so sometimes allows it to pick better sequences. Came up while porting test/CodeGen/AArch64/movw-consts.ll: this will allow a 32-bit MOVN to be used in @test8 soon. llvm-svn: 206379	2014-04-16 11:52:51 +00:00
Tim Northover	9cfb57dafa	ARM64: use the integrated assembler on ELF. llvm-svn: 206378	2014-04-16 11:52:40 +00:00
Aaron Ballman	58ce7f24cd	Fixing a compile error in debug versions of MSVC. It seems that the range-based for loop is confused by the DEBUG macro expansion unless a compound statement is used. llvm-svn: 206376	2014-04-16 11:15:57 +00:00
Tim Northover	97c5b6fe4f	ARM64: mark x7 as used when an i128 gets shunted onto the stack. The second half of a split i128 was ending up in x7, which is not a good thing. This is another part of PR19432. llvm-svn: 206366	2014-04-16 09:03:25 +00:00
Jim Grosbach	36c6a50512	[ARM64,C++11] Tidy up branch relaxation a bit w/ c++11. No functional change. llvm-svn: 206344	2014-04-16 00:42:46 +00:00
Jim Grosbach	01fc5887ad	ARM64: Nuke some dead code. Missed in previous commit. llvm-svn: 206343	2014-04-16 00:42:43 +00:00
Jim Grosbach	80633094f8	[ARM64,C++11] Clean up the ARM64 LOH collection pass. Range'ify a bunch of loops, mainly. As a result, we have a variety of objects via reference rather than by pointer, so propogate that through the various helper functions where it makes sense. llvm-svn: 206337	2014-04-15 22:57:02 +00:00
Quentin Colombet	72dad56c53	[ARM64] Set default CPU to generic instead of cyclone. llvm-svn: 206313	2014-04-15 19:08:46 +00:00
Tim Northover	ebb3123a5f	AArch64/ARM64: add missing pattern for extending load. llvm-svn: 206290	2014-04-15 14:00:19 +00:00
Tim Northover	cbcb7a37f7	AArch64/ARM64: only mangle MOVZ/MOVN during encoding when needed Sometimes we need emit the bits that would actually be a MOVN when producing a relocated MOVZ instruction (don't ask). But not always, a check which ARM64 got wrong until now. llvm-svn: 206289	2014-04-15 14:00:15 +00:00
Tim Northover	6e27b8ded5	AArch64/ARM64: add support for large code-model jump tables. I've left the MachO CodeGen as it is, there's a reasonable chance it should use the GOT like ConstPools, but I'm not certain. llvm-svn: 206288	2014-04-15 14:00:11 +00:00
Tim Northover	221b583951	AArch64/ARM64: add patterns for various commutations of FNMADD. llvm-svn: 206287	2014-04-15 14:00:06 +00:00
Tim Northover	b37cff1ae2	AArch64/ARM64: add half as a storage type on ARM64. This brings it into line with the AArch64 behaviour and should open the way for certain OpenCL features. llvm-svn: 206286	2014-04-15 14:00:03 +00:00
Tim Northover	80a70a265a	AArch64/ARM64: copy patterns for fixed-point conversions Code is mostly copied directly across, with a slight extension of the ISelDAGToDAG function so that it can cope with the floating-point constants being behind a litpool. llvm-svn: 206285	2014-04-15 13:59:57 +00:00
Tim Northover	f70577b1cd	ARM64: add constraints to various FastISel operations llvm-svn: 206284	2014-04-15 13:59:53 +00:00
Tim Northover	20603726ce	AArch64/ARM64: add dp tests from AArch64 llvm-svn: 206281	2014-04-15 13:59:40 +00:00
NAKAMURA Takumi	6091e1aed5	ARM64AsmParser.cpp: Fix vg_leak in MC/ARM64/fp-encoding.s. llvm-svn: 206279	2014-04-15 13:22:11 +00:00
Stepan Dyatkovskiy	95cdac43af	Optional hash symbol feature support for ARM64 http://reviews.llvm.org/D3328 llvm-svn: 206276	2014-04-15 11:43:09 +00:00
Lang Hames	a1bc0f5662	[MC] Require an MCContext when constructing an MCDisassembler. This patch re-introduces the MCContext member that was removed from MCDisassembler in r206063, and requires that an MCContext be passed in at MCDisassembler construction time. (Previously the MCContext member had been initialized in an ad-hoc fashion after construction). The MCCContext member can be used by MCDisassembler sub-classes to construct constant or target-specific MCExprs. This patch updates disassemblers for in-tree targets, and provides the MCRegisterInfo instance that some disassemblers were using through the MCContext (previously those backends were constructing their own MCRegisterInfo instances). llvm-svn: 206241	2014-04-15 04:40:56 +00:00
Jim Grosbach	2c6ff0cbb4	[ARM64,C++11]: Range'ify the dead-register-definition pass. Range-based for loops. No functional change intended. llvm-svn: 206239	2014-04-15 02:14:09 +00:00
Quentin Colombet	f9b61e6afd	[ARM64][MC] Set the default CPU string to generic. llvm-svn: 206228	2014-04-15 00:28:39 +00:00
Quentin Colombet	4097c8959c	[ARM64][MC] Set the default CPU to cyclone when initilizating the MC layer. This matches that ARM64Subtarget does for now. This is related to <rdar://problem/16573920> llvm-svn: 206211	2014-04-14 21:25:53 +00:00
Louis Gerbarg	cfc05450e5	Fix for codegen bug that could cause illegal cmn instruction generation In rare cases the dead definition elimination pass code can cause illegal cmn instructions when it replaces dead registers on instructions that use unmaterialized frame indexes. This patch disables the dead definition optimization for instructions which include frame index operands. rdar://16438284 llvm-svn: 206208	2014-04-14 21:05:05 +00:00
Louis Gerbarg	6d2e3c638f	Add a flag to disable the ARM64DeadRegisterDefinitionsPass This patch adds a -arm64-dead-def-elimination flag so that it is possible to disable dead definition elimination. Includes test case. llvm-svn: 206207	2014-04-14 21:05:02 +00:00
James Molloy	d60571bad7	[ARM64] Port over missing subtarget features, and CPU definitions from AArch64. llvm-svn: 206198	2014-04-14 17:38:00 +00:00
Tim Northover	cb9c3cfb58	ARM64: remove buggy REV16 pattern. The 32-bit pattern is still valid: 0123 -> 3210 -> 1032. llvm-svn: 206172	2014-04-14 12:59:52 +00:00
Tim Northover	b6abe806c7	AArch64/ARM64: enable directcond.ll test on ARM64. Code change is because optimizeCompareInstr didn't know how to pull the condition code out of FCSEL instructions. llvm-svn: 206171	2014-04-14 12:51:06 +00:00
Tim Northover	0d7bd4f444	ARM64: add patterns for csXYZ with reversed operands. AArch64 tests for this, and it's obviously a good idea. Have to invert the condition code, of course. llvm-svn: 206170	2014-04-14 12:51:02 +00:00
Tim Northover	2f48303436	ARM64: add support for AArch64's addsub_ext.ll There was one definite issue in ARM64 (the off-by-1 check for whether a shift could be folded in) and one difference that is probably correct: ARM64 didn't fold nodes with multiple uses into the arithmetic operations unless optimising for code size. llvm-svn: 206168	2014-04-14 12:50:50 +00:00
Tim Northover	23b1f08282	ARM64: optimise (cmp x, (sub 0, y)) to (cmn x, y). This transformation is only valid when being used for an EQ or NE comparison since the flags change otherwise. llvm-svn: 206167	2014-04-14 12:50:47 +00:00
Benjamin Kramer	30120c0626	Make helper static and place random global into the llvm namespace. llvm-svn: 206116	2014-04-12 18:39:57 +00:00
Juergen Ributzka	cf03068d91	[ARM64] Never hoist the shift value of a shift instruction. There is no need to check if we want to hoist the immediate value of an shift instruction. Simply return TCC_Free right away. llvm-svn: 206101	2014-04-12 02:53:51 +00:00
Juergen Ributzka	6e17aa45a3	[ARM64] Fix the cost model for cheap large constants. Originally the cost model would give up for large constants and just return the maximum cost. This is not what we want for constant hoisting, because some of these constants are large in bitwidth, but are still cheap to materialize. This commit fixes the cost model to either return TCC_Free if the cost cannot be determined, or accurately calculate the cost even for large constants (bitwidth > 128). This fixes <rdar://problem/16591573>. llvm-svn: 206100	2014-04-12 02:36:28 +00:00
Louis Gerbarg	b9a0551862	Add ARM64 CLS patterns This patch adds patterns to generate the cls instruction ARM64. Includes tests for 64 bit and 32 bit operands. rdar://15611957 llvm-svn: 206079	2014-04-11 22:27:58 +00:00
Lang Hames	95400e22f9	Remove redundant symbolization support from MCDisassembler interface. MCDisassembler has an MCSymbolizer member that is meant to take care of symbolizing during disassembly, but it also has several methods that enable the disassembler to do symbolization internally (i.e. without an attached symbolizer object). There is no need for this duplication, but ARM64 had been making use of it. This patch moves the ARM64 symbolization logic out of ARM64Disassembler and into an ARM64ExternalSymbolizer class, and removes the duplicated MCSymbolizer functionality from the MCDisassembler interface. Symbolization will now be done exclusively through MCSymbolizers. There should be no impact on disassembly for any platform, but this allows us to tidy up the MCDisassembler interface and simplify the process of (and invariants related to) disassembler setup. llvm-svn: 206063	2014-04-11 20:07:58 +00:00
David Blaikie	ceec2bdaa5	Implement depth_first and inverse_depth_first range factory functions. Also updated as many loops as I could find using df_begin/idf_begin - strangely I found no uses of idf_begin. Is that just used out of tree? Also a few places couldn't use df_begin because either they used the member functions of the depth first iterators or had specific ordering constraints (I added a comment in the latter case). Based on a patch by Jim Grosbach. (Jim - you just had iterator_range<T> where you needed iterator_range<idf_iterator<T>>) llvm-svn: 206016	2014-04-11 01:50:01 +00:00
Jim Grosbach	f77265bfee	[ARM64,C++11] Range'ify use-lists iterators in address type promotion. llvm-svn: 206013	2014-04-11 01:13:10 +00:00
Jim Grosbach	8838d793b7	[ARM64,C++11]: Range'ify use-list iterators in DAGToDAG. llvm-svn: 206007	2014-04-11 00:27:22 +00:00
Jim Grosbach	d3249d0923	[ARM64,C++11]: More range-based loop simplification. llvm-svn: 206006	2014-04-11 00:27:19 +00:00
Jim Grosbach	577e921344	[ARM64,C++11]: Range'ify loops in InstrInfo. llvm-svn: 205992	2014-04-10 22:00:18 +00:00
Jim Grosbach	8a0c50e5a9	[ARM64,C++11]: Range'ify loops in the conditional-compare pass. llvm-svn: 205988	2014-04-10 21:49:24 +00:00
NAKAMURA Takumi	12fbced6e8	ARM64/*/LLVMBuild.txt: Prune redundant deps. llvm-svn: 205963	2014-04-10 12:46:13 +00:00
NAKAMURA Takumi	554c287262	LLVMBuild.txt: Add missing dependencies. llvm-svn: 205962	2014-04-10 11:16:47 +00:00
NAKAMURA Takumi	98905d3f85	LLVMBuild.txt: Reformat. llvm-svn: 205961	2014-04-10 11:16:17 +00:00
NAKAMURA Takumi	d8570e5bc2	Fix abuse of StringRef on ARM64SysReg::MRSMapper::toString(Val, Valid). FIXME: Could we use SmallString here? llvm-svn: 205950	2014-04-10 03:05:59 +00:00
Saleem Abdulrasool	c5e0099ffc	ARM64: add an explicit cast to silence a silly warning GCC 4.8 complains with: warning: enumeral and non-enumeral type in conditional expression Although this is silly and harmless in this case, add an explicit cast to silence the warning. llvm-svn: 205949	2014-04-10 02:48:10 +00:00
Juergen Ributzka	48c8c07d0a	[ARM64] Fix immediate cost calculation for types larger than i64. The immediate cost calculation code was hitting an assertion in the included test case, because APInt was still internally 128-bits. Truncating it to 64-bits fixed the issue. Fixes <rdar://problem/16572521>. llvm-svn: 205947	2014-04-10 01:36:59 +00:00
Bob Wilson	ae89ddedff	Simple fix for build failures resulting from r205867. llvm-svn: 205918	2014-04-09 18:34:45 +00:00
Alp Toker	16f98b255d	Fix some doc and comment typos llvm-svn: 205899	2014-04-09 14:47:27 +00:00
Bradley Smith	246b0b617d	[ARM64] Change SYS without a register to an alias to make disassembling more consistant. llvm-svn: 205898	2014-04-09 14:44:58 +00:00
Bradley Smith	2cef19a2e6	[ARM64] Correctly disassemble ISB operand as ISB not DBarrier. llvm-svn: 205897	2014-04-09 14:44:54 +00:00
Bradley Smith	239120cada	[ARM64] Properly support both apple and standard syntax for FMOV llvm-svn: 205896	2014-04-09 14:44:49 +00:00
Bradley Smith	a2308f47d3	[ARM64] Flag setting logical/add/sub immediate instructions don't use SP. llvm-svn: 205895	2014-04-09 14:44:44 +00:00
Bradley Smith	f280e91849	[ARM64] Conditional branches must always print their condition code, even AL. llvm-svn: 205894	2014-04-09 14:44:39 +00:00
Bradley Smith	a19b7e83dc	[ARM64] Fix disassembly logic for extended loads/stores with 32-bit registers. llvm-svn: 205893	2014-04-09 14:44:36 +00:00
Bradley Smith	a0d7a9a12f	[ARM64] When printing a pre-indexed address with #0 , the ', #0' is not optional. llvm-svn: 205892	2014-04-09 14:44:31 +00:00
Bradley Smith	70c6acbbfd	[ARM64] Add missing shifted register MVN alias to ORN llvm-svn: 205891	2014-04-09 14:44:26 +00:00
Bradley Smith	403bbf95c0	[ARM64] SXTW/UXTW are only valid aliases for 32-bit operations. llvm-svn: 205890	2014-04-09 14:44:22 +00:00
Bradley Smith	779238a216	[ARM64] Fix canonicalisation of MOVs. MOV is too complex to be modelled by a dumb alias. llvm-svn: 205889	2014-04-09 14:44:18 +00:00
Bradley Smith	f823079acd	[ARM64] Fixup ADR/ADRP parsing such that they accept immediates and all labels types llvm-svn: 205888	2014-04-09 14:44:12 +00:00
Bradley Smith	af2710c96f	[ARM64] Ensure sp is decoded as SP, not XZR in LD1 instructions. llvm-svn: 205887	2014-04-09 14:44:07 +00:00
Bradley Smith	a0dce246ed	[ARM64] Tighten up the special casing in emitting arithmetic extends. UXTW should only be translated when the instruction uses WSP, not SP. Vice versa for UXTX and 64-bit instructions. llvm-svn: 205886	2014-04-09 14:44:03 +00:00
Bradley Smith	3971d3dc75	[ARM64] Rename LR to the UAL-compliant 'X30'. llvm-svn: 205885	2014-04-09 14:43:59 +00:00
Bradley Smith	6f1aa59c31	[ARM64] Rename FP to the UAL-compliant 'X29'. llvm-svn: 205884	2014-04-09 14:43:50 +00:00
Bradley Smith	5511f08055	[ARM64] Add a PostEncoderMethod to FCMP - the Rm field should canonically be zero but should be decoded/disassembled with any value. llvm-svn: 205883	2014-04-09 14:43:40 +00:00
Bradley Smith	eb4ca04db2	[ARM64] SCVTF and FCVTZS/U are undefined if scale<5> == 0. llvm-svn: 205882	2014-04-09 14:43:35 +00:00
Bradley Smith	db7b9b17eb	[ARM64] EXT and EXTR instructions on v8i8 and W regs respectively must have the top bit of their immediate clear. llvm-svn: 205881	2014-04-09 14:43:31 +00:00
Bradley Smith	60e7667886	[ARM64] Scaled fixed-point FCVTZSs should also have bit 29 set to zero. llvm-svn: 205880	2014-04-09 14:43:27 +00:00
Bradley Smith	7525b47208	[ARM64] UBFM/BFM is undefined on w registers when imms<5> or immr<5> is 1. llvm-svn: 205879	2014-04-09 14:43:24 +00:00
Bradley Smith	0243aa33fa	[ARM64] Floating point to fixed point scaled conversions are only available on fcvtzs and fcvtzu. llvm-svn: 205878	2014-04-09 14:43:20 +00:00
Bradley Smith	8f906a3c5f	[ARM64] Port over the PostEncoderMethod fix for SMULH/UMULH from AArch64. llvm-svn: 205877	2014-04-09 14:43:15 +00:00
Bradley Smith	9f29b726d5	[ARM64] Add missing tlbi operands and error for extra/missing register on tlbi aliases. llvm-svn: 205876	2014-04-09 14:43:11 +00:00
Bradley Smith	e8b4166acc	[ARM64] Rework system register parsing to overcome SPSel clash in MSR variants. llvm-svn: 205875	2014-04-09 14:43:06 +00:00
Bradley Smith	bc35b1f138	[ARM64] Port over the PostEncoderMethod from AArch64 for exclusive loads and stores, so the unused register fields are set to all-ones canonically but are recognised with any value. llvm-svn: 205874	2014-04-09 14:43:01 +00:00
Bradley Smith	4925be9b56	[ARM64] Use PStateMapper to ensure that MSRcpsr operands are validated during disassembly. llvm-svn: 205873	2014-04-09 14:42:56 +00:00
Bradley Smith	3339427e2a	[ARM64] Remove PrefetchOp and use ARM64PRFM instead. llvm-svn: 205872	2014-04-09 14:42:53 +00:00
Bradley Smith	16478c4ccf	[ARM64] Add WZR to isGPR32Register, since every use needs to check for this anyway. llvm-svn: 205871	2014-04-09 14:42:49 +00:00
Bradley Smith	3db2a85853	[ARM64] Remove ARM64SYS. llvm-svn: 205870	2014-04-09 14:42:45 +00:00
Bradley Smith	fb90df563f	[ARM64] Move CPSRField and DBarrier operands over to AArch64-style disassembly and assembly. This removes the last users of namespace ARM64SYS. llvm-svn: 205869	2014-04-09 14:42:42 +00:00
Bradley Smith	08c391c156	[ARM64] Switch the decoder, disassembler, instprinter and asmparser over to using AArch64-style system registers, and fix up test failures discovered in the process. llvm-svn: 205868	2014-04-09 14:42:36 +00:00
Bradley Smith	2ba17a4a17	[ARM64] Move ARM64BaseInfo.{cpp,h} into a Utils/ subdirectory, a la AArch64. These files are required in the decoder, disassembler and parser, and a layering violation was imminent. llvm-svn: 205867	2014-04-09 14:42:27 +00:00
Bradley Smith	ceeb04df60	[ARM64] Copy the named immediate operand mapping logic and enums from AArch64. AArch64's named immediate mapping and parsing is much more advanced than ARM64's. No functionality change - they're currently living side by side while I switch uses over. llvm-svn: 205866	2014-04-09 14:42:16 +00:00
Bradley Smith	8c0b88c987	[ARM64] Shifted register ALU ops are reserved if sf=0 and imm6<5>=1, and also (for add/sub only) if shift=11. llvm-svn: 205865	2014-04-09 14:42:11 +00:00
Bradley Smith	527bf86e56	[ARM64] Add support for NV condition code (exists only for valid assembly/disassembly, equivilant to AL) llvm-svn: 205864	2014-04-09 14:42:07 +00:00
Bradley Smith	6d7af17a3f	[ARM64] Add missing 1Q -> 1q vector kind alias llvm-svn: 205863	2014-04-09 14:42:01 +00:00
Bradley Smith	7d253f29a4	[ARM64] Add parsing for vector lists such as {v0.8b-v3.8b} llvm-svn: 205862	2014-04-09 14:41:58 +00:00
Bradley Smith	664aa67153	[ARM64] Correctly alias LSL to UXTW for 32bit instruction variants, rather than UXTX llvm-svn: 205861	2014-04-09 14:41:53 +00:00
Bradley Smith	35cadc58c9	[ARM64] STRHro and STRBro were not being decoded at all. llvm-svn: 205860	2014-04-09 14:41:49 +00:00
Bradley Smith	87c60e00d5	[ARM64] MOVK with sf=0 and hw<1>=1 is unallocated. Shift amount for ADD/SUB instructions is unallocated if shift > 4. llvm-svn: 205859	2014-04-09 14:41:45 +00:00
Bradley Smith	cd91e5cd0c	[ARM64] Register-offset loads and stores with the 'option' field equal to 00x or 10x are undefined. llvm-svn: 205858	2014-04-09 14:41:38 +00:00
Tim Northover	b36d428d27	ARM64: scalarize v1i64 mul operation This is the second part of fixing PR19367. llvm-svn: 205836	2014-04-09 07:07:02 +00:00
Tim Northover	b430cf6681	ARM64: add pattern for <1 x i64> custom not node. This should fix PR19367. llvm-svn: 205835	2014-04-09 06:55:39 +00:00
Juergen Ributzka	c11e8b67bb	[Constant Hoisting][ARM64] Enable constant hoisting for ARM64. This implements the target-hooks for ARM64 to enable constant hoisting. This fixes <rdar://problem/14774662> and <rdar://problem/16381500>. llvm-svn: 205791	2014-04-08 20:39:59 +00:00
Tim Northover	33d07468bc	ARM64: fix fmsub patterns which assumed accum operand was first Confusingly, the NEON fmla instructions put the accumulator first but the scalar versions put it at the end (like the fma lib function & LLVM's intrinsic). This should fix PR19345, assuming there's only one issue. llvm-svn: 205758	2014-04-08 12:23:51 +00:00
Jim Grosbach	e75c048ab9	Tidy up comments a bit. Punctuation, grammar, formatting, etc.. llvm-svn: 205749	2014-04-07 23:47:23 +00:00
Jim Grosbach	75010e7712	ARM64: Range based for loop in ARM64PromoteConstant pass llvm-svn: 205748	2014-04-07 23:47:21 +00:00
Jim Grosbach	64a28e70c8	ARM64: Clean up file header comment a bit. llvm-svn: 205747	2014-04-07 23:14:38 +00:00
David Blaikie	2f7711242a	MachineInstr: introduce explicit_operands and implicit_operands ranges Makes iteration over implicit and explicit machine operands more explicit (har har). Insipired by code review discussion for r205565. llvm-svn: 205680	2014-04-05 22:42:04 +00:00
Tim Northover	07a8ff4892	ARM64: handle v1i1 types arising from setcc properly. There were several overlapping problems here, and this solution is closely inspired by the one adopted in AArch64 in r201381. Firstly, scalarisation of v1i1 setcc operations simply fails if the input types are legal. This is fixed in LegalizeVectorTypes.cpp this time, and allows AArch64 code to be simplified slightly. Second, vselect with such a setcc feeding into it ends up in ScalarizeVectorOperand, where it's not handled. I experimented with an implementation, but found that whatever DAG came out was rather horrific. I think Hao's DAG combine approach is a good one for quality, though there are edge cases it won't catch (to be fixed separately). Should fix PR19335. llvm-svn: 205625	2014-04-04 14:49:21 +00:00
Tim Northover	85d6a16c46	ARM64: use regalloc-friendly COPY_TO_REGCLASS for bitcasts The previous patterns directly inserted FMOV or INS instructions into the DAG for scalar_to_vector & bitconvert patterns. This is horribly inefficient and can generated lots more GPR <-> FPR register traffic than necessary. It's much better to emit instructions the register allocator understands so it can coalesce the copies when appropriate. It led to at least one ISelLowering hack to avoid the problems, which was incorrect for v1i64 (FPR64 has no dsub). It can now be removed entirely. This should also fix PR19331. llvm-svn: 205616	2014-04-04 09:03:09 +00:00
Tim Northover	1e4f2c5e5f	ARM64: add 128-bit MLA operations to the custom selection code. Without this change, the llvm_unreachable kicked in. The code pattern being spotted is rather non-canonical for 128-bit MLAs, but it can happen and there's no point in generating sub-optimal code for it just because it looks odd. Should fix PR19332. llvm-svn: 205615	2014-04-04 09:03:02 +00:00
Craig Topper	840beec2d0	Make consistent use of MCPhysReg instead of uint16_t throughout the tree. llvm-svn: 205610	2014-04-04 05:16:06 +00:00
Jim Grosbach	b8bd4a5e2a	Tidy up. Space before ':' in range-based for loops. llvm-svn: 205585	2014-04-03 23:43:26 +00:00
Jim Grosbach	e04eb1dc12	Fix typo. llvm-svn: 205582	2014-04-03 23:43:12 +00:00
Lang Hames	cb74fa696b	[ARM64] Teach the ARM64DeadRegisterDefinition pass to respect implicit-defs. When rematerializing through truncates, the coalescer may produce instructions with dead defs, but live implicit-defs of subregs: E.g. %X1<def,dead> = MOVi64imm 2, %W1<imp-def>; %X1:GPR64, %W1:GPR32 These instructions are live, and their definitions should not be rewritten. Fixes <rdar://problem/16492408> llvm-svn: 205565	2014-04-03 20:51:08 +00:00
Tim Northover	2ad88d3aab	ARM64: always use i64 for the RHS of shift operations Switching between i32 and i64 based on the LHS type is a good idea in theory, but pre-legalisation uses i64 regardless of our choice, leading to potential ISel errors. Should fix PR19294. llvm-svn: 205519	2014-04-03 09:26:16 +00:00
Tim Northover	c7c6a93704	ARM64: don't generate __sincos_stret calls unless on MachO This should fix PR19314. llvm-svn: 205514	2014-04-03 07:06:13 +00:00
Jim Grosbach	2a2459f365	Make a few more range-based loops use explicit types. No functional change. llvm-svn: 205458	2014-04-02 20:21:22 +00:00
Jim Grosbach	36c4953348	Simplify resolveFrameIndex() signature. Just pass a MachineInstr reference rather than an MBB iterator. Creating a MachineInstr& is the first thing every implementation did anyway. llvm-svn: 205453	2014-04-02 19:28:18 +00:00
Jim Grosbach	df1e05bb8a	Make some range based loop types more explicit. No functional change, but more readable code. llvm-svn: 205451	2014-04-02 19:28:08 +00:00
Jim Grosbach	20b0790df7	[C++11,ARM64] Range based for and explicit 'override' in STP cleanup. No functional change intended. llvm-svn: 205446	2014-04-02 18:00:59 +00:00
Jim Grosbach	05abd709f3	[C++11,ARM64] Range based for loops in constant promotion. No functional change intended. llvm-svn: 205445	2014-04-02 18:00:56 +00:00
Jim Grosbach	7dc9edeaa5	[C++11,ARM64] Range based for loops in load/store pair optimizer. No functional change intended. llvm-svn: 205444	2014-04-02 18:00:53 +00:00
Jim Grosbach	020e657790	[C++11,ARM64] Range based for loops in target lowering. No functional change intended. llvm-svn: 205443	2014-04-02 18:00:51 +00:00
Jim Grosbach	91f1f47751	[C++11,ARM64] Range based for loops in frame lowering. No functional change intended. llvm-svn: 205442	2014-04-02 18:00:49 +00:00
Jim Grosbach	f39d752b03	[C++11,ARM64] Range based for loops in pseudo expansion. No functional change intended. llvm-svn: 205441	2014-04-02 18:00:46 +00:00
Jim Grosbach	673825ebac	[C++11,ARM64] Range based for loops for LOH No functional change intended. llvm-svn: 205440	2014-04-02 18:00:44 +00:00
Jim Grosbach	2539c3d07a	[C++11,ARM64] Range based for loops TLS cleanup. No functional change intended. llvm-svn: 205439	2014-04-02 18:00:41 +00:00
Jim Grosbach	0d0c5a614a	[C++11,ARM64] Range based for loops in branch relaxation. No functional change intended. llvm-svn: 205438	2014-04-02 18:00:39 +00:00
Jim Grosbach	1c762ca9bd	[C++11,ARM64] Range based for loops in address type promotion. No functional change intended. llvm-svn: 205437	2014-04-02 18:00:36 +00:00
Quentin Colombet	7bf9d8cd13	[ARM64][CollectLOH] Remove the link to the radar from the comments. llvm-svn: 205435	2014-04-02 16:40:49 +00:00
Tim Northover	6d69168ffd	ARM64: use GOT for weak symbols & PIC. Weak symbols cannot use the small code model's usual ADRP sequences since the instruction simply may not be able to encode a value of 0. This redirects them to use the GOT, which hopefully linkers are able to cope with even in the static relocation model. llvm-svn: 205426	2014-04-02 14:39:11 +00:00
Tim Northover	0d80f70530	ARM64: fix lowering of fp128 fptosi/fptoui We were creating libcall nodes that returned an MVT::f128, when these particular operations actually return an int of some stripe. llvm-svn: 205425	2014-04-02 14:39:07 +00:00
Tim Northover	ebd37ab382	ARM64: make sure first argument to INSERT_SUBVECTOR has right type. Again, coalescing and other optimisations swiftly made the MachineInstrs consistent again, but when compiled at -O0 a bad INSERT_SUBREGISTER was produced. llvm-svn: 205423	2014-04-02 14:38:58 +00:00
Tim Northover	5e3a484e3b	ARM64: convert fp16 narrowing ISel to pseudo-instruction The previous attempt was fine with optimisations, but was actually rather cavalier with its types. When compiled at -O0, it produced invalid COPY MachineInstrs. llvm-svn: 205422	2014-04-02 14:38:54 +00:00
Quentin Colombet	3c2b13b258	[ARM64][CollectLOH] Add some comments to explain how the LOHs framework works (for the compiler part), since the design document is not available. llvm-svn: 205379	2014-04-02 01:02:28 +00:00
Aaron Ballman	0947bb20d8	Fixing an MSVC warning about widening the result of a 32-bit shift implicitly. No functional change intended. llvm-svn: 205304	2014-04-01 12:24:25 +00:00
Tim Northover	4f1dd58e2e	ARM64: add intrinsic for pmull (p64 x p64 = p128) operations. llvm-svn: 205302	2014-04-01 12:22:37 +00:00
Aaron Ballman	d1726ee8fa	Fixing warnings in the MSVC build. No functional changes intended. llvm-svn: 205301	2014-04-01 12:22:20 +00:00

1 2 3 4 5 ...

277 Commits