llvm-project

Commit Graph

Author	SHA1	Message	Date
Chad Rosier	86a8f72041	[AArch64] This is a work in progress to provide a machine description for the Cortex-A53 subtarget in the AArch64 backend. This patch lays the ground work to annotate each AArch64 instruction (no NEON yet) with a list of SchedReadWrite types. The patch also provides the Cortex-A53 processor resources, maps those the the default SchedReadWrites, and provides basic latency. NEON support will be added in a subsequent patch with proper forwarding logic. Verification was done by setting the pre-RA scheduler to linearize to better gauge the effect of the MIScheduler. Even without modeling the forward logic, the results show a modest improvement for Cortex-A53. Reviewers: apazos, mcrosier, atrick Patch by Dave Estes <cestes@codeaurora.org>! llvm-svn: 203125	2014-03-06 16:04:00 +00:00
Chandler Carruth	9a4c9e597b	[Layering] Move DebugInfo.h into the IR library where its implementation already lives. llvm-svn: 203046	2014-03-06 00:46:21 +00:00
Kevin Qin	b08c6746c4	[AArch64]Fix improper diagnostics about offset range of load/store instructions. llvm-svn: 202775	2014-03-04 02:05:13 +00:00
Chad Rosier	70cb2311ab	Revert "[AArch64] This is a work in progress to provide a machine description" This reverts commit ff717c8fc786a0cfa1602982b91895fa09e514fc. llvm-svn: 202773	2014-03-04 00:32:07 +00:00
Chad Rosier	fe45290566	[AArch64] This is a work in progress to provide a machine description for the Cortex-A53 subtarget in the AArch64 backend. This patch lays the ground work to annotate each AArch64 instruction (no NEON yet) with a list of SchedReadWrite types. The patch also provides the Cortex-A53 processor resources, maps those the the default SchedReadWrites, and provides basic latency. NEON support will be added in a subsequent patch with proper forwarding logic. Verification was done by setting the pre-RA scheduler to linearize to better gauge the effect of the MIScheduler. Even without modeling the forward logic, the results show a modest improvement for Cortex-A53. Reviewers: apazos, mcrosier, atrick Patch by Dave Estes <cestes@codeaurora.org>! llvm-svn: 202767	2014-03-03 23:32:47 +00:00
Benjamin Kramer	b6d0bd48bd	[C++11] Replace llvm::next and llvm::prior with std::next and std::prev. Remove the old functions. llvm-svn: 202636	2014-03-02 12:27:27 +00:00
Craig Topper	73156025e0	Switch all uses of LLVM_OVERRIDE to just use 'override' directly. llvm-svn: 202621	2014-03-02 09:09:27 +00:00
Craig Topper	77dfe45f81	Switch all uses of LLVM_FINAL to just use 'final', and remove the macro. llvm-svn: 202618	2014-03-02 08:08:51 +00:00
Albrecht Kadlec	fbd12d35c8	trivial test commit llvm-svn: 202084	2014-02-24 22:18:38 +00:00
Christian Pirker	6c2f4d45e1	Add AArch64 big endian Target (aarch64_be) llvm-svn: 202024	2014-02-24 11:34:50 +00:00
Kevin Qin	07334d37de	[AArch64] Add register constraints to avoid generating STLXR and STXR with unpredictable behavior. llvm-svn: 201841	2014-02-21 07:45:48 +00:00
Oliver Stannard	7b2f2fba7f	AArch64: __va_list.__stack must be 8-byte aligned The va_start macro for AArch64 must set va_list.__stack to the address following the last named argument on the stack, rounded up to an alignment of 8 bytes. llvm-svn: 201797	2014-02-20 17:19:26 +00:00
Chad Rosier	63bfeb993b	[AArch64] Add support for TargetTransformInfo Analysis. llvm-svn: 201793	2014-02-20 16:00:08 +00:00
Christian Pirker	bd1eb0db1f	Test commit - remove the new line to lib/Target/AArch64/AArch64TargetMachine.cpp. llvm-svn: 201698	2014-02-19 16:58:28 +00:00
Christian Pirker	25ff038545	Test commit - added a new line to lib/Target/AArch64/AArch64TargetMachine.cpp. llvm-svn: 201692	2014-02-19 16:07:32 +00:00
Ana Pazos	7c27a265dc	[AArch64] Expanded sin, cos, pow with FP vector types inputs llvm-svn: 201601	2014-02-18 20:31:05 +00:00
Jiangning Liu	742c588edc	Fix a typo about lowering AArch64 va_copy. llvm-svn: 201541	2014-02-18 02:37:42 +00:00
Kevin Qin	edc95ee196	[AArch64 NEON] Fix a bug to avoid using floating type as condition type in lowering SELECT_CC. llvm-svn: 201395	2014-02-14 09:41:15 +00:00
Jiangning Liu	293349e4d7	Enable AArch64 NEON by default. llvm-svn: 201385	2014-02-14 04:38:09 +00:00
Hao Liu	7146ef8542	[AArch64]Fix the assertion failure caused by "v1i1 SETCC" DAG node. As v1i1 is illegal, the type legalizer tries to scalarize such node. But if the type operands of SETCC is legal, the scalarization algorithm will cause an assertion failure. llvm-svn: 201381	2014-02-14 02:21:56 +00:00
Daniel Sanders	753e17629d	Re-commit: Demote EmitRawText call in AsmPrinter::EmitInlineAsm() and remove hasRawTextSupport() call Summary: AsmPrinter::EmitInlineAsm() will no longer use the EmitRawText() call for targets with mature MC support. Such targets will always parse the inline assembly (even when emitting assembly). Targets without mature MC support continue to use EmitRawText() for assembly output. The hasRawTextSupport() check in AsmPrinter::EmitInlineAsm() has been replaced with MCAsmInfo::UseIntegratedAs which when true, causes the integrated assembler to parse inline assembly (even when emitting assembly output). UseIntegratedAs is set to true for targets that consider any failure to parse valid assembly to be a bug. Target specific subclasses generally enable the integrated assembler in their constructor. The default value can be overridden with -no-integrated-as. All tests that rely on inline assembly supporting invalid assembly (for example, those that use mnemonics such as 'foo' or 'hello world') have been updated to disable the integrated assembler. Changes since review (and last commit attempt): - Fixed test failures that were missed due to configuration of local build. (fixes crash.ll and a couple others). - Fixed tests that happened to pass because the local build was on X86 (should fix 2007-12-17-InvokeAsm.ll) - mature-mc-support.ll's should no longer require all targets to be compiled. (should fix ARM and PPC buildbots) - Object output (-filetype=obj and similar) now forces the integrated assembler to be enabled regardless of default setting or -no-integrated-as. (should fix SystemZ buildbots) Reviewers: rafael Reviewed By: rafael CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2686 llvm-svn: 201333	2014-02-13 14:44:26 +00:00
Oliver Stannard	5bbb72f37e	Add Cortex-A53 and Cortex-A57 cores to the AArch64 backend llvm-svn: 201305	2014-02-13 09:46:11 +00:00
Hao Liu	7b6dfcf06a	[AArch64]Fix the problems that can't select mul/add/sub of v1i8/v1i16/v1i32 types. As this problems are similar to shl/sra/srl, also add patterns for shift nodes. llvm-svn: 201298	2014-02-13 05:42:33 +00:00
Hao Liu	4f345f3c03	[AArch64]Add support for spilling FPR8/FPR16. llvm-svn: 201287	2014-02-13 02:36:58 +00:00
Daniel Sanders	abe212a3b8	Revert r201237+r201238: Demote EmitRawText call in AsmPrinter::EmitInlineAsm() and remove hasRawTextSupport() call It introduced multiple test failures in the buildbots. llvm-svn: 201241	2014-02-12 15:39:20 +00:00
Daniel Sanders	a7d504cf58	Demote EmitRawText call in AsmPrinter::EmitInlineAsm() and remove hasRawTextSupport() call Summary: AsmPrinter::EmitInlineAsm() will no longer use the EmitRawText() call for targets with mature MC support. Such targets will always parse the inline assembly (even when emitting assembly). Targets without mature MC support continue to use EmitRawText() for assembly output. The hasRawTextSupport() check in AsmPrinter::EmitInlineAsm() has been replaced with MCAsmInfo::UseIntegratedAs which when true, causes the integrated assembler to parse inline assembly (even when emitting assembly output). UseIntegratedAs is set to true for targets that consider any failure to parse valid assembly to be a bug. Target specific subclasses generally enable the integrated assembler in their constructor. The default value can be overridden with -no-integrated-as. All tests that rely on inline assembly supporting invalid assembly (for example, those that use mnemonics such as 'foo' or 'hello world') have been updated to disable the integrated assembler. Reviewers: rafael Reviewed By: rafael CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2686 llvm-svn: 201237	2014-02-12 14:44:54 +00:00
Chad Rosier	bcde0c49cb	[AArch64] Handle aliases of conditional branches without b.pred form. llvm-svn: 201091	2014-02-10 15:43:11 +00:00
Hao Liu	6e73761dc8	[AArch64]Implement the copy of two FPR8 registers by using FMOVss of two FPR32 registers in copyPhysReg. llvm-svn: 201061	2014-02-10 03:16:22 +00:00
Jim Grosbach	e9008de652	X86: Resolve a long standing FIXME and properly isel pextr[bw]. Generalize the AArch64 .td nodes for AssertZext and AssertSext. Use them to match the relevant pextr store instructions. The test widen_load-2.ll requires a slight change because with the stores gone, the remaining instructions are scheduled in a different order. Add test cases for SSE4 and AVX variants. Resolves rdar://13414672. Patch by Adam Nemet <anemet@apple.com>. llvm-svn: 200957	2014-02-07 00:16:33 +00:00
Tim Northover	fdbdb4b6d5	ARM & AArch64: merge NEON absolute compare intrinsics There was an extremely confusing proliferation of LLVM intrinsics to implement the vacge & vacgt instructions. This combines them all into two polymorphic intrinsics, shared across both backends. llvm-svn: 200768	2014-02-04 14:55:42 +00:00
Tim Northover	24979d8e10	AArch64 & ARM: refactor crypto intrinsics to take scalars Some of the SHA instructions take a scalar i32 as one argument (largely because they work on 160-bit hash fragments). This wasn't reflected in the IR previously, with ARM and AArch64 choosing different types (<4 x i32> and <1 x i32> respectively) which was ugly. This makes all the affected intrinsics take a uniform "i32", allowing them to become non-polymorphic at the same time. llvm-svn: 200706	2014-02-03 17:27:49 +00:00
Craig Topper	e7a9ee5c4a	Remove unnecessary include of AArch64GenInstrInfo.inc from AArch64Disassembler.cpp. None of the GET_ defines were set that would make the include do anything. llvm-svn: 200677	2014-02-03 06:33:17 +00:00
Chad Rosier	fe5ab2f5ba	[AArch64] Custom lower concat_vector patterns with v4i16, v4i32, v8i8, v8i16, v16i8 types. llvm-svn: 200491	2014-01-30 21:46:54 +00:00
Kevin Qin	92d64d2d56	[AArch64 NEON] Lower SELECT_CC with vector operand. When the scalar compare is between floating point and operands are vector, we custom lower SELECT_CC to use NEON SIMD compare for generating less instructions. llvm-svn: 200365	2014-01-29 01:57:30 +00:00
David Woodhouse	3fa98a65e9	Propagate MCSubtargetInfo through TableGen's getBinaryCodeForInstr() llvm-svn: 200349	2014-01-28 23:13:18 +00:00
David Woodhouse	9784cef38d	Explictly pass MCSubtargetInfo to MCCodeEmitter::EncodeInstruction() llvm-svn: 200348	2014-01-28 23:13:07 +00:00
David Woodhouse	e6c13e4abd	Change MCStreamer EmitInstruction interface to take subtarget info llvm-svn: 200345	2014-01-28 23:12:42 +00:00
Kevin Qin	4a183d7094	[AArch64 NEON] Try to generate CONCAT_VECTOR when lowering BUILD_VECTOR or SHUFFLE_VECTOR. Replace r199791. llvm-svn: 200180	2014-01-27 02:53:54 +00:00
Kevin Qin	9eeedfbaa6	Revert r199791. It's old version which has some bugs. I'll commit lattest patch soon. llvm-svn: 200179	2014-01-27 02:53:41 +00:00
Rafael Espindola	e41383f899	Pass a MCSubtargetInfo down to the TargetStreamer creation. With this the target streamers will be able to know the target features that are in use. llvm-svn: 200135	2014-01-26 06:38:58 +00:00
Rafael Espindola	24ea09ef7d	Construct the MCStreamer before constructing the MCTargetStreamer. This has a few advantages: * Only targets that use a MCTargetStreamer have to worry about it. * There is never a MCTargetStreamer without a MCStreamer, so we can use a reference. * A MCTargetStreamer can talk to the MCStreamer in its constructor. llvm-svn: 200129	2014-01-26 06:06:37 +00:00
Jiangning Liu	fb3c17b6c9	Improve pattern match from v1i8 to v1i32 for AArch64 Neon. llvm-svn: 200119	2014-01-26 04:55:53 +00:00
Jiangning Liu	6398d839c6	Implement pattern match from v1xx to v1xx for AArch64 Neon. llvm-svn: 200113	2014-01-26 03:27:40 +00:00
Kevin Qin	18662f4b7c	[AArch64 NEON] Add patterns for concat_vector on v2i32. llvm-svn: 200111	2014-01-26 02:46:15 +00:00
Ana Pazos	cd3b9f763e	[AArch64] Removed unused i8 type from FPR8 register class. The i8 type is not registered with any register class. This causes a segmentation fault in MachineLICM::getRegisterClassIDAndCost. The code selects the first type associated with register class FPR8, which happens to be i8. It uses this type (i8) to get the representative class pointer, which is 0. It then uses this pointer to access a field, resulting in segmentation fault. Since i8 type is not being used for printing any neon instruction we can safely remove it. llvm-svn: 200046	2014-01-24 22:36:53 +00:00
Alp Toker	cb40291100	Fix known typos Sweep the codebase for common typos. Includes some changes to visible function names that were misspelt. llvm-svn: 200018	2014-01-24 17:20:08 +00:00
Kevin Qin	21cd2152d3	[AArch64 NEON] Fix a bug in implementing register copy bwtween FPR16. llvm-svn: 199978	2014-01-24 07:53:04 +00:00
Ana Pazos	5d31f6945b	[AArch64] Added vselect patterns with float and double types llvm-svn: 199925	2014-01-23 19:18:57 +00:00
Kevin Qin	50944eb638	fix some spell mistakes around 'ConcatVector' and 'ShuffleVector' in AArch64 backend. llvm-svn: 199858	2014-01-23 01:35:13 +00:00
Kevin Qin	ce0190c6d5	[AArch64 NEON] Try to generate CONCAT_VECTOR when lowering BUILD_VECTOR or SHUFFLE_VECTOR. llvm-svn: 199791	2014-01-22 06:11:03 +00:00
Kevin Qin	6d379abd8f	[AArch64 NEON] Fix a bug caused by undef lane when generating VEXT. It was commited as r199628 but reverted in r199628 as causing regression test failed. It's because of old vervsion of patch I used to commit. Sorry for mistake. llvm-svn: 199704	2014-01-21 01:48:52 +00:00
Chandler Carruth	f835fc6f4f	Revert r199628: "[AArch64 NEON] Fix a bug caused by undef lane when generating VEXT." This test fails the newly added regression tests. llvm-svn: 199631	2014-01-20 08:18:01 +00:00
Kevin Qin	ff42e06ef4	[AArch64 NEON] Fix a bug caused by undef lane when generating VEXT. llvm-svn: 199628	2014-01-20 07:32:26 +00:00
Kevin Qin	ef66ff78ca	[AArch64 NEON] Accept both #0.0 and #0 for comparing with floating point zero in asm parser. For FCMEQ, FCMGE, FCMGT, FCMLE and FCMLT, floating point zero will be printed as #0.0 instead of #0. To support the history codes using #0, we consider to let asm parser accept both #0.0 and #0. llvm-svn: 199621	2014-01-20 02:14:05 +00:00
Kevin Qin	e0faea11b1	[AArch64 NEON] Expand vector for UDIV/SDIV/UREM/SREM/FREM as neon doesn't support these operations. llvm-svn: 199485	2014-01-17 09:54:30 +00:00
Hao Liu	17457a2ee2	[AArch64]Fix the problem can't select f16_to_f32 and f32_to_f16. Also add copy support for FPR16. Also add a missing test case file belongs to commit r197361. llvm-svn: 199463	2014-01-17 06:23:30 +00:00
Kevin Qin	212d9b4a56	[AArch64 NEON] Custom lower conversion between vector integer and vector floating point if element bit-width doesn't match. llvm-svn: 199462	2014-01-17 05:52:35 +00:00
Hao Liu	18d92262c5	[AArch64]Fix the problem can't select concat_vectors of two v1i32 types. Also fix the problem can't select scalar_to_vector from f32 to v2f32/v4f32. llvm-svn: 199461	2014-01-17 05:44:46 +00:00
Jiangning Liu	0a791c348b	For AArch64, lowering sext_inreg and generate optimized code by using SXTL. llvm-svn: 199296	2014-01-15 05:08:01 +00:00
Tim Northover	6e219cd588	AArch64: don't try to handle [SU]MUL_LOHI nodes We should set them to expand for now since there are no patterns dealing with them. Actually, there are no instructions either so I doubt they'll ever be acceptable. llvm-svn: 199265	2014-01-14 22:53:22 +00:00
Lang Hames	06234ec147	Add FPExt option to CCValAssign::LocInfo. When generating calling-convention promotion code, Tablegen will now select FPExt for floating point promotions (previously it had returned AExt, which is not valid for floating point types). Any out-of-tree targets that were relying on AExt being returned for FP promotions will need to update their code check for FPExt instead. llvm-svn: 199252	2014-01-14 19:56:36 +00:00
Rafael Espindola	08ff298d51	Revert "[AArch64] Added vselect patterns with float and double types" This reverts commit r199242. It is causing CodeGen/AArch64/neon-bsl.ll to fail. llvm-svn: 199248	2014-01-14 19:24:08 +00:00
Ana Pazos	787f540daa	[AArch64] Added vselect patterns with float and double types llvm-svn: 199242	2014-01-14 18:45:48 +00:00
Andrea Di Biagio	9bc0415c1f	[AArch64] Fix assertion failure caused by an invalid comparison between APInt values. APInt only knows how to compare values with the same BitWidth and asserts in all other cases. With this fix, function PerformORCombine does not use the APInt equality operator if the APInt values returned by 'isConstantSplat' differ in BitWidth. In that case they are different and no comparison is needed. llvm-svn: 199119	2014-01-13 16:51:00 +00:00
Kevin Qin	cfef55d6d4	[AArch64 NEON] Add missing patterns for bitcast from or to v1f64 llvm-svn: 199070	2014-01-13 01:58:38 +00:00
Kevin Qin	21e8f1c4eb	[AArch64 NEON] Add more scenarios to use perm instructions when lowering shuffle_vector This patch covered 2 more scenarios: 1. Two operands of shuffle_vector are the same, like %shuffle.i = shufflevector <8 x i8> %a, <8 x i8> %a, <8 x i32> <i32 0, i32 2, i32 4, i32 6, i32 8, i32 10, i32 12, i32 14> 2. One of operands is undef, like %shuffle.i = shufflevector <8 x i8> %a, <8 x i8> undef, <8 x i32> <i32 0, i32 2, i32 4, i32 6, i32 8, i32 10, i32 12, i32 14> After this patch, perm instructions will have chance to be emitted instead of lots of INS. llvm-svn: 199069	2014-01-13 01:56:29 +00:00
Saleem Abdulrasool	a6505ca4c2	correct target directive handling error handling The target specific parser should return `false' if the target AsmParser handles the directive, and `true' if the generic parser should handle the directive. Many of the target specific directive handlers would `return Error' which does not follow these semantics. This change simply changes the target specific routines to conform to the semantis of the ParseDirective correctly. Conformance to the semantics improves diagnostics emitted for the invalid directives. X86 is taken as a sample to ensure that multiple diagnostics are not presented for a single error. llvm-svn: 199068	2014-01-13 01:15:39 +00:00
Kristof Beyls	90ff80e329	Silence unused variable warning for non-asserting builds that was introduced in r198937. llvm-svn: 198941	2014-01-10 14:20:45 +00:00
Kristof Beyls	58306ad903	Make sure -use-init-array has intended effect on all AArch64 ELF targets, not just linux. llvm-svn: 198937	2014-01-10 13:41:49 +00:00
Ana Pazos	cfd2ca5826	[AArch64][NEON] Added UXTL and UXTL2 instruction aliases llvm-svn: 198791	2014-01-08 21:02:13 +00:00
Kevin Qin	44946439e1	[AArch64 NEON] Fix generating incorrect value type of NEON_VDUPLANE when lower build_vector if result value type mismatch with operand value type. llvm-svn: 198743	2014-01-08 08:06:14 +00:00
Rafael Espindola	894843cb4e	Move the llvm mangler to lib/IR. This makes it available to tools that don't link with target (like llvm-ar). llvm-svn: 198708	2014-01-07 21:19:40 +00:00
Chandler Carruth	8a8cd2bab9	Re-sort all of the includes with ./utils/sort_includes.py so that subsequent changes are easier to review. About to fix some layering issues, and wanted to separate out the necessary churn. Also comment and sink the include of "Windows.h" in three .inc files to match the usage in Memory.inc. llvm-svn: 198685	2014-01-07 11:48:04 +00:00
Hao Liu	7d11d99d20	[AArch64]Add support to spill/fill D tuples such as DPair/DTriple/DQuad. There is no test cases for D tuple as the original test cases are too large. As the spill/fill of the D tuple is similar to the Q tuple, the correctness can be guaranteed. llvm-svn: 198684	2014-01-07 10:50:43 +00:00
Hao Liu	27d88376bc	[AArch64]Add support to copy D tuples such as DPair/DTriple/DQuad and Q tuples such as QPair/QTriple/QQuad. There is no test case for D tuple as the original test cases are too large. As the copy of the D tuple is similar to the Q tuple, the correctness can be guaranteed. llvm-svn: 198682	2014-01-07 10:00:03 +00:00
Kevin Qin	cfa41a2569	[AArch64 NEON] Fixed incorrect immediate used in BIC instruction. llvm-svn: 198675	2014-01-07 05:10:47 +00:00
Bill Wendling	13199b17f8	Remove unnecessary #includes. llvm-svn: 198585	2014-01-06 06:00:00 +00:00
Bill Wendling	908bf814e7	Refactor function that checks that __builtin_returnaddress's argument is constant. This moves the check up into the parent class so that all targets can use it without having to copy (and keep in sync) the same error message. llvm-svn: 198579	2014-01-06 00:43:20 +00:00
Bill Wendling	df7dd28dc8	Emit an error message if the value passed to __builtin_returnaddress isn't a constant __builtin_returnaddress requires that the value passed into is be a constant. However, at -O0 even a constant expression may not be converted to a constant. Emit an error message intead of crashing. llvm-svn: 198531	2014-01-05 01:47:20 +00:00
Rafael Espindola	58873566b3	Make the llvm mangler depend only on DataLayout. Before this patch any program that wanted to know the final symbol name of a GlobalValue had to link with Target. This patch implements a compromise solution where the mangler uses DataLayout. This way, any tool that already links with Target (llc, clang) gets the exact behavior as before and new IR files can be mangled without linking with Target. With this patch the mangler is constructed with just a DataLayout and DataLayout is extended to include the information the Mangler needs. llvm-svn: 198438	2014-01-03 19:21:54 +00:00
Ana Pazos	e891c5f264	[AArch64][NEON] Added SXTL and SXTL2 instruction aliases llvm-svn: 198437	2014-01-03 19:20:31 +00:00
Rafael Espindola	6994fdf33c	Remove the 's' DataLayout specification During the years there have been some attempts at figuring out how to align byval arguments. A look at the commit log suggests that they were * Use the ABI alignment. * When that was not sufficient for x86-64, I added the 's' specification to DataLayout. * When that was not sufficient Evan added the virtual getByValTypeAlignment. * When even that was not sufficient, we just got the FE to add the alignment to the byval. This patch is just a simple cleanup that removes my first attempt at fixing the problem. I also added an AArch64 implementation of getByValTypeAlignment to make sure this patch is a nop. I also left the 's' parsing for backward compatibility. I will send a short email to llvmdev about the change for anyone maintaining an out of tree target. llvm-svn: 198287	2014-01-01 22:29:43 +00:00
Jiangning Liu	a0acf70af1	For AArch64 Neon, simplify scalar dup by lane0 for fp. llvm-svn: 198194	2013-12-30 02:44:35 +00:00
Hao Liu	fe3bfc8c41	[AArch64]Add code to spill/fill Q register tuples such as QPair/QTriple/QQuad. llvm-svn: 198193	2013-12-30 02:38:12 +00:00
Hao Liu	b591f835d6	[AArch64]Can't select shift left 0 of type v1i64 llvm-svn: 198192	2013-12-30 02:12:46 +00:00
Hao Liu	74107fe526	[AArch64]Fix the problem that can't select mul of v1i64/v2i64 types. E.g. Can't select such IR: %tmp = mul <2 x i64> %a, %b llvm-svn: 198188	2013-12-30 01:38:41 +00:00
Hao Liu	83799741fb	[AArch64]Fix a problem that the register order of fmls/fmla by element is incorrect. E.g. the codegen result is fmls v1.2s, v0.2s, v2.s[3] which is expected to be fmls v0.2s, v1.2s, v2.s[3] llvm-svn: 198001	2013-12-25 07:12:34 +00:00
Hao Liu	ce7a12be8f	[AArch64]Add patterns to match normal shift nodes: shl, sra and srl. llvm-svn: 197969	2013-12-24 09:00:21 +00:00
Kevin Qin	82bd84aadf	[AArch64 NEON] Fix a bug when lowering BUILD_VECTOR. DAG.getVectorShuffle() doesn't always return a vector_shuffle node. If mask is the exact sequence of it's operand(For example, operand_0 is v8i8, and the mask is 0, 1, 2, 3, 4, 5, 6, 7), it will directly return that operand. So a check is added here. llvm-svn: 197967	2013-12-24 08:16:06 +00:00
Kevin Qin	cd5f3153f5	[AArch64 NEON] Fix a pattern match failure with NEON_VDUP. This failure caused by improper condition when lowering shuffle_vector to scalar_to_vector. After this patch NEON_VDUP with v1i64 will not be generated. llvm-svn: 197966	2013-12-24 08:11:47 +00:00
Ana Pazos	bc2996b30f	[AArch64] Check fmul node single use in fused multiply patterns Check for single use of fmul node in fused multiply patterns to allow generation of fused multiply add/sub instructions. Otherwise fmul operation ends up being repeated more than once which does not help peformance on targets with only one MAC unit, as for example cortex-a53. llvm-svn: 197929	2013-12-24 00:47:29 +00:00
Ana Pazos	3ca23915cd	[AArch64 NEON] Fixed fused multiply negate add/sub patterns The correct pattern matching should be: - fnmadd is (-Ra) + (-Rn)Rm which should be matched as: fma (fneg node:$Rn), node:$Rm, (fneg node:$Ra) and as (f32 (fsub (f32 (fneg FPR32:$Ra)), (f32 (fmul FPR32:$Rn, FPR32:$Rm)))) - fnmsub is (-Ra) + RnRm which should be matched as fma node:$Rn, node:$Rm, (fneg node:$Ra) and as (f32 (fsub (f32 (fmul FPR32:$Rn, FPR32:$Rm)), FPR32:$Ra)))) llvm-svn: 197928	2013-12-24 00:40:10 +00:00
Kevin Qin	53eaea0104	[AArch64 NEON]Implment loading vector constant form constant pool. llvm-svn: 197551	2013-12-18 06:26:04 +00:00
Chad Rosier	5f87edb484	[AArch64] Fix v1fx patterns for Floating-point Multiply Extend and Floating-point Compare to Zero. llvm-svn: 197402	2013-12-16 18:29:35 +00:00
Rafael Espindola	bccb9d45ad	The preferred alignment defaults to the abi alignment. Omit if it is the same. llvm-svn: 197400	2013-12-16 18:01:51 +00:00
Rafael Espindola	8afbb28cea	On DataLayout, omit the default of p:64:64:64. llvm-svn: 197397	2013-12-16 17:15:29 +00:00
Hao Liu	774cabb538	[AArch64]Fix the pattern match failure for v1i8/v1i16/v1i32 types. Currently we have such types as legal vector types. The DAG combiner may generate some DAG nodes having such types but we don't have patterns to match them. E.g. a load i32 and a bitcast i32 to v1i32 will be combined into a load v1i32: bitcast (load i32) to v1i32 -> load v1i32. So this patch fixes such problems for load/dup instructions. If v1i8/v1i16/v1i32 are not legal any more, the code in this patch can be deleted. So I also add some FIXME. llvm-svn: 197361	2013-12-16 02:51:28 +00:00
Chad Rosier	e139dd4fe6	[AArch64] Simplify the Neon Scalar3Same patterns for floating-point reciprocal step, floating-point reciprocal square root step, floating-point absolute difference, and integer/floating-point compare instructions. Also, move the scalar general arithmetic operation patterns closer to similar code. No functional change intended. llvm-svn: 197250	2013-12-13 17:56:44 +00:00
Rafael Espindola	720ae4f885	Simplify the datalayout string of ARM and AArch64. No functionality change. Reviewed by Tim Northover. llvm-svn: 197172	2013-12-12 17:43:37 +00:00
Chad Rosier	4055f42d22	[AArch64] Removed unnecessary copy patterns with v1fx types. - Copy patterns with float/double types are enough. - Fix typos in test case names that were using v1fx. - There is no ACLE intrinsic that uses v1f32 type. And there is no conflict of neon and non-neon ovelapped operations with this type, so there is no need to support operations with this type. - Remove v1f32 from FPR32 register and disallow v1f32 as a legal type for operations. Patch by Ana Pazos! llvm-svn: 197159	2013-12-12 15:46:29 +00:00
Hao Liu	46a10eec28	[AArch64]Fix the problem that AArch64 backend fails to select scalar_to_vector of vector types having more than one element. llvm-svn: 197135	2013-12-12 07:36:26 +00:00
Chad Rosier	446d8ea0fb	[AArch64] Refactor NEON floating-point Max/Min/Maxnm/Minnm across vector AArch64 intrinsics to use f32 types, rather than their vector equivalents. llvm-svn: 197090	2013-12-11 23:21:25 +00:00
Chad Rosier	088f93d4b5	[AArch64] Add NEON scalar floating-point compare LLVM AArch64 intrinsics that use f32/f64 types, rather than their vector equivalents. llvm-svn: 197068	2013-12-11 21:03:46 +00:00
Chad Rosier	473a01e1c9	[AArch64] Refactor the NEON scalar floating-point reciprocal step and floating-point reciprocal square root step LLVM AArch64 intrinsics to use f32/f64 types, rather than their vector equivalents. llvm-svn: 197067	2013-12-11 21:03:43 +00:00
Chad Rosier	7098fcc062	[AArch64] Refactor the NEON scalar floating-point reciprocal estimate, floating- point reciprocal exponent, and floating-point reciprocal square root estimate LLVM AArch64 intrinsics to use f32/f64 types, rather than their vector equivalents. llvm-svn: 197066	2013-12-11 21:03:40 +00:00
Kevin Qin	310b6c08ba	[AArch64 NEON] Get instruction BSL matched to VSELECT. llvm-svn: 196998	2013-12-11 02:33:50 +00:00
NAKAMURA Takumi	8bc9bfaa5a	Prune redundant dependencies in LLVMBuild.txt. llvm-svn: 196988	2013-12-11 00:30:57 +00:00
Chad Rosier	f70af21651	[AArch64] Refactor the NEON floating-point absolute difference LLVM AArch64 intrinsic to use f32/f64 types, rather than their vector equivalents. llvm-svn: 196965	2013-12-10 21:33:59 +00:00
Chad Rosier	07cc3f9100	[AArch64] Refactor the NEON signed/unsigned floating-point convert to fixed-point LLVM AArch64 intrinsics to use f32/f64, rather than their vector equivalents. llvm-svn: 196964	2013-12-10 21:33:56 +00:00
Chad Rosier	98b8baa35c	[AArch64] Overload NEON signed/unsigned floating-point convert to fixed-point and fixed-point convert to floating-point LLVM AArch64 intrinsics. llvm-svn: 196963	2013-12-10 21:33:53 +00:00
Chad Rosier	cc34d187b8	[AArch64] Overload NEON signed/unsigned integer convert to floating-point LLVM AArch64 intrinsics. llvm-svn: 196962	2013-12-10 21:33:50 +00:00
Chad Rosier	7a9bba442f	[AArch64] Refactor the Neon vector/scalar floating-point convert intrinsics so that they use float/double rather than the vector equivalents when appropriate. llvm-svn: 196930	2013-12-10 16:11:39 +00:00
Chad Rosier	fcc4c366d1	[AArch64] Refactor the Neon vector/scalar floating-point convert implementation. Specifically, reuse the ARM intrinsics when possible. llvm-svn: 196926	2013-12-10 15:35:33 +00:00
Kevin Qin	43385c7065	[AArch64 NEON] Replace fpimm with fpz32 for floating compare with zero. This is a small change to be strict. Just want get pattern safer. llvm-svn: 196889	2013-12-10 06:51:07 +00:00
Kevin Qin	04396d1e69	[AArch64 NEON] Support poly128_t and implement relevant intrinsic. llvm-svn: 196887	2013-12-10 06:48:35 +00:00
NAKAMURA Takumi	396d4d3c7e	Add proper dependencies to LLVMBuild.txt in llvm/lib. I'll prune redundant deps in LLVMBuild.txt, later. llvm-svn: 196881	2013-12-10 05:39:34 +00:00
NAKAMURA Takumi	e3afe2ef62	Whitespaces. llvm-svn: 196880	2013-12-10 05:39:12 +00:00
Chad Rosier	5c8bf9c3db	[AArch64] Refactor the NEON scalar reduce pairwise intrinsics, so that they use float/double rather than the vector equivalents when appropriate. llvm-svn: 196833	2013-12-09 22:47:38 +00:00
Chad Rosier	3b0b3ee71e	[AArch64] Refactor NEON scalar reduce pairwise front-end codegen to remove unnecessary patterns in tablegen. llvm-svn: 196832	2013-12-09 22:47:34 +00:00
Chad Rosier	397ff3945c	[AArch64] Remove q and non-q intrinsic definitions in the NEON scalar reduce pairwise implementation, using an overloaded definition instead. llvm-svn: 196831	2013-12-09 22:47:31 +00:00
Ana Pazos	bde2828ae0	Fix pattern match for movi with 0D result Patch by Jiangning Liu. With some test case changes: - intrinsic test added to the existing /test/CodeGen/AArch64/neon-aba-abd.ll. - New test cases to cover movi 1D scenario without using the intrinsic in test/CodeGen/AArch64/neon-mov.ll. llvm-svn: 196806	2013-12-09 19:29:14 +00:00
Hao Liu	96a587a9f7	[AArch64]Add missing pair intrinsics such as: int32_t vminv_s32(int32x2_t a) which should be compiled into SMINP Vd.2S,Vn.2S,Vm.2S llvm-svn: 196749	2013-12-09 03:51:42 +00:00
Hao Liu	868caea6d1	[AArch64]Pattern match failures for truncate store and extend load llvm-svn: 196748	2013-12-09 03:34:08 +00:00
Ana Pazos	6b0a8c50dd	Implemented vget/vset_lane_f16 intrinsics llvm-svn: 196533	2013-12-05 21:07:49 +00:00
Jiangning Liu	65d8e3422a	For AArch64, add missing register cost calculation for big value types like v4i64 and v8i64. llvm-svn: 196456	2013-12-05 02:12:01 +00:00
Kevin Qin	afd095de8b	[AArch64 Neon] Add ACLE intrinsic vceqz_f64. llvm-svn: 196362	2013-12-04 08:02:34 +00:00
Kevin Qin	f9832e8de7	[AArch64 NEON] Add missing compare intrinsics. llvm-svn: 196360	2013-12-04 07:53:28 +00:00
Hao Liu	dca64f4a20	[AArch64]Add missing floating point convert, round and misc intrinsics. E.g. int64x1_t vcvt_s64_f64(float64x1_t a) -> FCVTZS Dd, Dn llvm-svn: 196210	2013-12-03 06:06:55 +00:00
Hao Liu	c250cbc095	AArch64: add missing ACLE intrinsics mapping to general arithmetic operation from VFP instructions. E.g. float64x1_t vadd_f64(float64x1_t a, float64x1_t b) -> FADD Dd, Dn, Dm. llvm-svn: 196208	2013-12-03 05:58:30 +00:00
NAKAMURA Takumi	bc815b2d21	Whitespace. llvm-svn: 196203	2013-12-03 05:28:27 +00:00
Hao Liu	21a461353a	AArch64: Add missing scalar pair intrinsics. E.g. "float32_t vaddv_f32(float32x2_t a)" to be matched into "faddp s0, v1.2s". llvm-svn: 196198	2013-12-03 03:39:47 +00:00
Jiangning Liu	3a541d46a1	Add some missing pattern matches for AArch64 Neon intrinsics like vuqadd_s64 and friends. llvm-svn: 196192	2013-12-03 01:33:52 +00:00
Jiangning Liu	94a7bb2130	Add some missing pattern matches for AArch64 Neon intrinsics like vmull_high_n_s16 and friends. llvm-svn: 196190	2013-12-03 01:29:32 +00:00
Rafael Espindola	5113d166f5	Refactor the setting of PrivateGlobalPrefix. No functionality change. llvm-svn: 196170	2013-12-02 23:39:26 +00:00
Chad Rosier	3106de3f9d	[AArch64] Implemented vcopy_lane patterns using scalar DUP instruction. Patch by Ana Pazos! llvm-svn: 196151	2013-12-02 21:05:16 +00:00
Rafael Espindola	957cf6f9e1	Remove dead code. MO_JumpTableIndex and MO_ExternalSymbol don't show up on inline asm. Keeping parts of the old asm printer just to print inline asm to a string that we then parse back looks like a hack. llvm-svn: 196111	2013-12-02 15:36:37 +00:00
Rafael Espindola	50712a456d	Change the default of AsmWriterClassName and isMCAsmWriter. llvm-svn: 196065	2013-12-02 04:55:42 +00:00
Hao Liu	ba38eee8ac	AArch64: The pattern match should check the range of the immediate value. Or we can generate some illegal instructions. E.g. shrn2 v0.4s, v1.2d, #35. The legal range should be in [1, 16]. llvm-svn: 195941	2013-11-29 02:11:22 +00:00
Jiangning Liu	c429c00f3b	Add missing pattern for supporting intrinsic function vbsl_f64 with argument double floating point. llvm-svn: 195938	2013-11-29 01:37:15 +00:00
Kevin Qin	337cfcc83c	[AArch64 NEON]Fix a assertion failure when disassemble SHLL instruction. llvm-svn: 195936	2013-11-29 01:29:16 +00:00
Benjamin Kramer	ea1982aff9	Silence sign-compare warning and reduce nesting. No functionality change. llvm-svn: 195932	2013-11-28 19:58:56 +00:00
NAKAMURA Takumi	ce746c6c49	[CMake] Let add_public_tablegen_target responsible to provide dependency to CommonTableGen. add_public_tablegen_target adds *CommonTableGen to LLVM_COMMON_DEPENDS. LLVM_COMMON_DEPENDS affects add_llvm_library (and other add_target stuff) within its scope. llvm-svn: 195927	2013-11-28 17:04:04 +00:00
NAKAMURA Takumi	413518f1f8	[CMake] Prune include_directories() in llvm/lib/Target. add_llvm_target() sets them. llvm-svn: 195921	2013-11-28 14:53:30 +00:00
NAKAMURA Takumi	979e604d8c	Add newline at eof. llvm-svn: 195920	2013-11-28 14:52:52 +00:00
Jiangning Liu	4bc9dbd846	Remove the variable only used by assert to avoid the build failure caused by build options [-Werror,-Wunused-variable]. llvm-svn: 195905	2013-11-28 01:34:55 +00:00
Hao Liu	f9f468abee	AArch64: Fix a bug about disassembling post-index load single element to 4 vectors llvm-svn: 195903	2013-11-28 01:07:45 +00:00
Jiangning Liu	97aa8cf8b7	Fix the AArch64 NEON bug exposed by checking constant integer argument range of ACLE intrinsics. llvm-svn: 195843	2013-11-27 14:02:25 +00:00
Chad Rosier	75290c6307	[AArch64] Add support for NEON scalar floating-point absolute difference. llvm-svn: 195803	2013-11-27 01:45:58 +00:00
Chad Rosier	9653d5c989	[AArch64] Add support for NEON scalar floating-point to integer convert instructions. llvm-svn: 195788	2013-11-26 22:17:37 +00:00
Kevin Qin	599c47d0de	Refactored the implementation of AArch64 NEON instruction ZIP, UZP and TRN. Fix a bug when mixed use of vget_high_u8() and vuzp_u8(). llvm-svn: 195716	2013-11-26 03:26:47 +00:00
Kevin Qin	33ca18fdcf	[AArch64]Implement 128 bit register copy with NEON. llvm-svn: 195713	2013-11-26 02:33:42 +00:00
Hao Liu	fbd2b4484c	Fixed a bug about disassembling AArch64 post-index load/store single element instructions. ie. echo "0x00 0x04 0x80 0x0d" \| ../bin/llvm-mc -triple=aarch64 -mattr=+neon -disassemble echo "0x00 0x00 0x80 0x0d" \| ../bin/llvm-mc -triple=aarch64 -mattr=+neon -disassemble will be disassembled into the same instruction st1 {v0b}[0], [x0], x0. llvm-svn: 195591	2013-11-25 01:53:26 +00:00
Hao Liu	e8bdc8c864	Fix a Cygwin build failure caused by enum values starting with '_', which is conflicted with some platform macros. This patch only renames variables, no functional change. llvm-svn: 195432	2013-11-22 09:24:41 +00:00
Hao Liu	25aed9bb5b	Fix the bugs about AArch64 Load/Store vector types and bitcast between i64 and vector types. e.g. "%tmp = load <2 x i64>* %ptr" can't be selected. "%tmp = bitcast i64 %in to <2 x i32>" can't be selected. llvm-svn: 195424	2013-11-22 08:47:22 +00:00
Hao Liu	91ae869692	Revert last change by haoliu because of buildbot failure. llvm-svn: 195423	2013-11-22 08:34:54 +00:00
Hao Liu	b75d80fdf0	Fix a Cygwin build failure caused by enum values starting with '_', which is conflicted with some platform macros. This solution only renames variables, no functional change. NOTE: This is a candidate for the 3.4 branch. llvm-svn: 195421	2013-11-22 08:17:16 +00:00
Jiangning Liu	a91633a435	For AArch64 back-end instruction selection, lower Neon_Lowxxx with EXTRCT_SUBREG. llvm-svn: 195408	2013-11-22 02:45:13 +00:00
Ana Pazos	9ac2fc85d2	Implemented Neon scalar vdup_lane intrinsics. Fixed scalar dup alias and added test case. llvm-svn: 195330	2013-11-21 08:16:15 +00:00
Ana Pazos	fbc1adbaa7	Implemented Neon scalar by element intrinsics. Intrinsics implemented: vqdmull_lane, vqdmulh_lane, vqrdmulh_lane, vqdmlal_lane, vqdmlsl_lane scalar Neon intrinsics. llvm-svn: 195327	2013-11-21 07:37:04 +00:00
Hao Liu	16edc4675c	Implement AArch64 neon instructions class SIMD lsone and SIMD lone-post. llvm-svn: 195078	2013-11-19 02:17:05 +00:00
Jiangning Liu	0c0c1e8598	Implement AArch64 SISD intrinsics for vget_high and vget_low. llvm-svn: 195074	2013-11-19 01:46:48 +00:00
Kevin Qin	7f8073edc2	implement MC layer of AArch64 neon instruction PMULL and PMULL2 with 128 bit integer. llvm-svn: 195072	2013-11-19 01:40:25 +00:00
Jiangning Liu	e329114ae5	Add predicate for AArch64 crypto instructions. llvm-svn: 195071	2013-11-19 01:38:31 +00:00
Juergen Ributzka	d12ccbd343	[weak vtables] Remove a bunch of weak vtables This patch removes most of the trivial cases of weak vtables by pinning them to a single object file. The memory leaks in this version have been fixed. Thanks Alexey for pointing them out. Differential Revision: http://llvm-reviews.chandlerc.com/D2068 Reviewed by Andy llvm-svn: 195064	2013-11-19 00:57:56 +00:00
Alexey Samsonov	49109a279c	Revert r194865 and r194874. This change is incorrect. If you delete virtual destructor of both a base class and a subclass, then the following code: Base *foo = new Child(); delete foo; will not cause the destructor for members of Child class. As a result, I observe plently of memory leaks. Notable examples I investigated are: ObjectBuffer and ObjectBufferStream, AttributeImpl and StringSAttributeImpl. llvm-svn: 194997	2013-11-18 09:31:53 +00:00
Kevin Qin	6588c1a638	[AArch64 NEON]Add mov alias for simd copy instructions. Set some unspecified bits of INS/DUP to zero as ARMARM requested. llvm-svn: 194996	2013-11-18 09:20:32 +00:00
Hao Liu	5a4e4e107d	Implement the newly added ACLE functions for ld1/st1 with 2/3/4 vectors. The functions are like: vst1_s8_x2 ... llvm-svn: 194990	2013-11-18 06:31:53 +00:00
Ana Pazos	d035209bd7	Implemented aarch64 Neon scalar vmulx_lane intrinsics Implemented aarch64 Neon scalar vfma_lane intrinsics Implemented aarch64 Neon scalar vfms_lane intrinsics Implemented legacy vmul_n_f64, vmul_lane_f64, vmul_laneq_f64 intrinsics (v1f64 parameter type) using Neon scalar instructions. Implemented legacy vfma_lane_f64, vfms_lane_f64, vfma_laneq_f64, vfms_laneq_f64 intrinsics (v1f64 parameter type) using Neon scalar instructions. llvm-svn: 194888	2013-11-15 23:32:10 +00:00
Juergen Ributzka	dbedae89b9	[weak vtables] Remove a bunch of weak vtables This patch removes most of the trivial cases of weak vtables by pinning them to a single object file. Differential Revision: http://llvm-reviews.chandlerc.com/D2068 Reviewed by Andy llvm-svn: 194865	2013-11-15 22:34:48 +00:00
Chad Rosier	0c57c3402e	[AArch64] Fix the scalar NEON ACLE functions so that they return float/double rather than the vector equivalent. llvm-svn: 194853	2013-11-15 21:28:10 +00:00
Alexey Samsonov	0d4f1c51db	Hopefully fix uninitialized memory read in AArch64AsmParser found by MSan bootstrap bot llvm-svn: 194818	2013-11-15 15:49:30 +00:00
Chad Rosier	fd675d932c	[AArch64] Remove redundant Neon_immAllOnes/Neon_immAllZeros leaf patterns. llvm-svn: 194733	2013-11-14 22:02:46 +00:00
NAKAMURA Takumi	b155fa5fb5	AArch64DAGToDAGISel::SelectVTBL(): Fix a warning. [-Wunused-variable] llvm-svn: 194679	2013-11-14 07:04:07 +00:00
Kevin Qin	afc8bdfd57	[AArch64 neon] support poly64 and relevant intrinsic functions. llvm-svn: 194659	2013-11-14 03:27:58 +00:00
Kevin Qin	aec95baf1a	Implement aarch64 neon instruction class SIMD misc. llvm-svn: 194656	2013-11-14 02:44:13 +00:00
Jiangning Liu	bb60ccf355	Implement AArch64 NEON instruction set AdvSIMD (table). llvm-svn: 194648	2013-11-14 01:57:32 +00:00
Chad Rosier	d3ae5f895e	[AArch64] Add support for legacy AArch32 NEON scalar shift by immediate instructions. This patch does not include the shift right and accumulate instructions. A number of non-overloaded intrinsics have been remove in favor of their overloaded counterparts. llvm-svn: 194598	2013-11-13 20:05:37 +00:00
Chad Rosier	1eb0ecf8ce	[AArch64] Implemented AdvSIMD scalar x indexed element format and AdvSIMD scalar copy in MC layer. Added the MC layer tests. Fixed triple setting in test cases. Patch by Ana Pazos <apazos@codeaurora.org>. llvm-svn: 194501	2013-11-12 19:13:08 +00:00
Chad Rosier	d3684a0566	[AArch64] The shift right/left and insert immediate builtins expect 3 source operands, a vector, an element to insert, and a shift amount. llvm-svn: 194406	2013-11-11 19:11:11 +00:00
Chad Rosier	35575e737c	[AArch64] Add support for NEON scalar floating-point convert to fixed-point instructions. llvm-svn: 194394	2013-11-11 18:04:07 +00:00
Tim Northover	ef276df244	AArch64: refactor vector list creation to be more uniform Instructions taking a vector list (e.g. "ld2 {v0.2d, v1.d2}, [x0]") need a special register-class to deal with the constraints, and C++ code to support selection. However, that C++ code can be made reasonably uniform to simplify the selection process. Hence this patch. No functionality change, so no tests. llvm-svn: 194361	2013-11-11 03:35:43 +00:00
Benjamin Kramer	3e9237a313	Remove some unnecessary temporary strings. llvm-svn: 194335	2013-11-09 22:48:13 +00:00
Richard Barton	5f54c655c1	Make PrintAsmOperand call to the superclass to handle 'n' and 'c' operand modifiers. llvm-svn: 194270	2013-11-08 18:09:57 +00:00
Amara Emerson	5e45b5f194	[AArch64] Remove NEON from "generic" CPU target. We can change this back when NEON support is complete and ready to become enabled by default. llvm-svn: 194152	2013-11-06 16:19:08 +00:00
Jiangning Liu	f4226f1d7b	Implement AArch64 Neon instruction set Perm. llvm-svn: 194123	2013-11-06 03:35:27 +00:00
Jiangning Liu	a50e22ca4f	Implement AArch64 Neon instruction set Bitwise Extract. llvm-svn: 194118	2013-11-06 02:25:49 +00:00
Jiangning Liu	d7c52676f6	Implement AArch64 Neon Crypto instruction classes AES, SHA, and 3 SHA. llvm-svn: 194085	2013-11-05 17:42:05 +00:00
Hao Liu	d6b40b51c7	Implement AArch64 post-index vector load/store multiple N-element structure class SIMD(lselem-post). Including following 14 instructions: 4 ld1 insts: post-index load multiple 1-element structure to sequential 1/2/3/4 registers. ld2/ld3/ld4: post-index load multiple N-element structure to sequential N registers (N=2,3,4). 4 st1 insts: post-index store multiple 1-element structure from sequential 1/2/3/4 registers. st2/st3/st4: post-index store multiple N-element structure from sequential N registers (N = 2,3,4). llvm-svn: 194043	2013-11-05 03:39:32 +00:00
Kevin Qin	97f6aaa8ad	Implemented aarch64 neon intrinsic vcopy_lane with float type. llvm-svn: 194041	2013-11-05 02:03:59 +00:00
Tim Northover	ace0bd4d33	AArch64: use default asm operand printing when modifier inapplicable If an inline assembly operand has multiple constraints (e.g. "Ir" for immediate or register) and an operand modifier (E.g. "w" for "print register as wN") then we need to decide behaviour when the modifier doesn't apply to the constraint. Previousely produced some combination of an assertion failure and a fatal error. GCC's behaviour appears to be to ignore the modifier and print the operand in the default way. This patch should implement that. llvm-svn: 194024	2013-11-04 23:04:07 +00:00
Chad Rosier	995d9c2fdc	[AArch64] Simplify a few of the instruction patterns. No functional change intended. llvm-svn: 193867	2013-11-01 17:13:44 +00:00
Chad Rosier	a4bfb44a9e	[AArch64] Fix assembly string formatting and other coding standard violations. llvm-svn: 193866	2013-11-01 17:13:42 +00:00
Chad Rosier	74b65cd811	[AArch64] Add support for NEON scalar fixed-point convert to floating-point instructions. llvm-svn: 193816	2013-10-31 22:36:59 +00:00
Chad Rosier	20e1f20d69	[AArch64] Add support for NEON scalar shift immediate instructions. llvm-svn: 193790	2013-10-31 19:28:44 +00:00
Amara Emerson	f80f95fcc7	[AArch64] Make the use of FP instructions optional, but enabled by default. This adds a new subtarget feature called FPARMv8 (implied by NEON), and predicates the support of the FP instructions and registers on this feature. llvm-svn: 193739	2013-10-31 09:32:11 +00:00
Chad Rosier	be020d0309	[AArch64] Add support for NEON scalar floating-point compare instructions. llvm-svn: 193691	2013-10-30 15:19:37 +00:00
Rafael Espindola	79858aa3df	Add a helper getSymbol to AsmPrinter. llvm-svn: 193627	2013-10-29 17:07:16 +00:00
Weiming Zhao	ffade617bd	[AArch64] Implement FrameAddr and ReturnAddr Fixes PR17690 llvm-svn: 193625	2013-10-29 17:00:25 +00:00
Tim Northover	d29ddf6713	AArch64: add 'a' inline asm operand modifier This is used in the Linux kernel, and effectively just means "print an address". llvm-svn: 193593	2013-10-29 08:22:33 +00:00
Nadav Rotem	d369d4bdf9	Optimize concat_vectors(X, undef) -> scalar_to_vector(X). This optimization is not SSE specific so I am moving it to DAGco. The new scalar_to_vector dag node exposed a missing pattern in the AArch64 target that I needed to add. llvm-svn: 193393	2013-10-25 06:41:18 +00:00
Amara Emerson	c5cae0f20c	[AArch64] Fix NZCV reg live-in bug in F128CSEL codegen. When generating the IfTrue basic block during the F128CSEL pseudo-instruction handling, the NZCV live-in for the newly created BB wasn't being added. This caused a fault during MI-sched/live range calculation when the predecessor for the fall-through BB didn't have a live-in for phys-reg as expected. llvm-svn: 193316	2013-10-24 08:28:24 +00:00
Chad Rosier	e012cb3783	[AArch64] Add the constraint to NEON scalar mla/mls instructions. llvm-svn: 193117	2013-10-21 20:11:47 +00:00
Chad Rosier	fe2f58c8a1	[AArch64] Add support for NEON scalar extract narrow instructions. llvm-svn: 192970	2013-10-18 14:03:24 +00:00
Chad Rosier	37d29173aa	[AArch64] Add support for NEON scalar three register different instruction class. The instruction class includes the signed saturating doubling multiply-add long, signed saturating doubling multiply-subtract long, and the signed saturating doubling multiply long instructions. llvm-svn: 192908	2013-10-17 18:12:29 +00:00
Chad Rosier	846a72539c	[AArch64] Add support for NEON scalar negate instruction. llvm-svn: 192843	2013-10-16 21:04:39 +00:00
Chad Rosier	175601d997	[AArch64] Add support for NEON scalar absolute value instruction. llvm-svn: 192842	2013-10-16 21:04:34 +00:00
Chad Rosier	f2b254558f	Fix comment. llvm-svn: 192805	2013-10-16 16:22:15 +00:00
Chad Rosier	178b1cefc7	[AArch64] Add support for NEON scalar signed saturating accumulated of unsigned value and unsigned saturating accumulate of signed value instructions. llvm-svn: 192800	2013-10-16 16:09:02 +00:00
Rafael Espindola	43c4e24fad	Add a MCAsmInfoELF class and factor some code into it. We had a MCAsmInfoCOFF, but no common class for all the ELF MCAsmInfos before. llvm-svn: 192760	2013-10-16 01:34:32 +00:00
Chad Rosier	9d51708677	[AArch64] Add support for NEON scalar signed saturating absolute value and scalar signed saturating negate instructions. llvm-svn: 192733	2013-10-15 21:18:44 +00:00
Chad Rosier	d1f40d760a	[AArch64] Add support for NEON scalar integer compare instructions. llvm-svn: 192596	2013-10-14 14:37:20 +00:00
Kevin Qin	a89e7a0e1c	Implement aarch64 neon instruction set AdvSIMD (copy). llvm-svn: 192410	2013-10-11 02:33:55 +00:00
Hao Liu	99eac7ee44	Implement AArch64 vector load/store multiple N-element structure class SIMD(lselem). Including following 14 instructions: 4 ld1 insts: load multiple 1-element structure to sequential 1/2/3/4 registers. ld2/ld3/ld4: load multiple N-element structure to sequential N registers (N=2,3,4). 4 st1 insts: store multiple 1-element structure from sequential 1/2/3/4 registers. st2/st3/st4: store multiple N-element structure from sequential N registers (N = 2,3,4). llvm-svn: 192361	2013-10-10 17:00:52 +00:00
Rafael Espindola	9558af461d	Revert "Implement AArch64 vector load/store multiple N-element structure class SIMD(lselem). Including following 14 instructions: 4 ld1 insts: load multiple 1-element structure to sequential 1/2/3/4 registers. ld2/ld3/ld4: load multiple N-element structure to sequential N registers (N=2,3,4). 4 st1 insts: store multiple 1-element structure from sequential 1/2/3/4 registers. st2/st3/st4: store multiple N-element structure from sequential N registers (N = 2,3,4)." This reverts commit r192352. It broke the build. llvm-svn: 192354	2013-10-10 15:15:17 +00:00
Hao Liu	9123ad8ab9	Implement AArch64 vector load/store multiple N-element structure class SIMD(lselem). Including following 14 instructions: 4 ld1 insts: load multiple 1-element structure to sequential 1/2/3/4 registers. ld2/ld3/ld4: load multiple N-element structure to sequential N registers (N=2,3,4). 4 st1 insts: store multiple 1-element structure from sequential 1/2/3/4 registers. st2/st3/st4: store multiple N-element structure from sequential N registers (N = 2,3,4). llvm-svn: 192352	2013-10-10 15:01:24 +00:00
Tim Northover	1fdb076a31	AArch64: enable MISched by default. Substantial SelectionDAG scheduling is going away soon, and is interfering with Hao's attempts to implement LDn/STn instructions, so I say we make the leap first. There were a few reorderings (inevitably) which broke some tests. I tried to replace them with CHECK-DAG variants mostly, but some too complex for that to be useful and I just reordered them. llvm-svn: 192282	2013-10-09 07:53:57 +00:00
Chad Rosier	9849cc6696	[AArch64] Add support for NEON scalar floating-point reciprocal estimate, reciprocal exponent, and reciprocal square root estimate instructions. llvm-svn: 192242	2013-10-08 22:09:04 +00:00
Chad Rosier	f7ed96ef76	[AArch64] Add support for NEON scalar signed/unsigned integer to floating-point convert instructions. llvm-svn: 192231	2013-10-08 20:43:30 +00:00
Rafael Espindola	a17151ad5a	Add a MCTargetStreamer interface. This patch fixes an old FIXME by creating a MCTargetStreamer interface and moving the target specific functions for ARM, Mips and PPC to it. The ARM streamer is still declared in a common place because it is used from lib/CodeGen/ARMException.cpp, but the Mips and PPC are completely hidden in the corresponding Target directories. I will send an email to llvmdev with instructions on how to use this. llvm-svn: 192181	2013-10-08 13:08:17 +00:00
Chad Rosier	b6ceeb9126	[AArch64] Add support for NEON scalar arithmetic instructions: SQDMULH, SQRDMULH, FMULX, FRECPS, and FRSQRTS. llvm-svn: 192107	2013-10-07 16:36:15 +00:00
Jiangning Liu	ad242fbb71	Implement aarch64 neon instruction set AdvSIMD (Across). llvm-svn: 192028	2013-10-05 08:22:10 +00:00
Jiangning Liu	ac5fd7e5d3	Implement aarch64 neon instruction set AdvSIMD (3V elem). llvm-svn: 191944	2013-10-04 09:20:44 +00:00
Jiangning Liu	63dc840fc5	Initial support for Neon scalar instructions. Patch by Ana Pazos. 1.Added support for v1ix and v1fx types. 2.Added Scalar Pairwise Reduce instructions. 3.Added initial implementation of Scalar Arithmetic instructions. llvm-svn: 191263	2013-09-24 02:47:27 +00:00
Tim Northover	31d093c705	ISelDAG: spot chain cycles involving MachineNodes Previously, the DAGISel function WalkChainUsers was spotting that it had entered already-selected territory by whether a node was a MachineNode (amongst other things). Since it's fairly common practice to insert MachineNodes during ISelLowering, this was not the correct check. Looking around, it seems that other nodes get their NodeId set to -1 upon selection, so this makes sure the same thing happens to all MachineNodes and uses that characteristic to determine whether we should stop looking for a loop during selection. This should fix PR15840. llvm-svn: 191165	2013-09-22 08:21:56 +00:00
Richard Mitton	21101b3231	Added support for generate DWARF .debug_aranges sections automatically. llvm-svn: 191052	2013-09-19 23:21:01 +00:00
Kevin Qin	36399e6b68	Implement 3 AArch64 neon instructions : umov smov ins. llvm-svn: 190839	2013-09-17 02:21:02 +00:00
Tim Northover	635a979038	AArch64: use RegisterOperand for NEON registers. Previously we modelled VPR128 and VPR64 as essentially identical register-classes containing V0-V31 (which had Q0-Q31 as "sub_alias" sub-registers). This model is starting to cause significant problems for code generation, particularly writing EXTRACT/INSERT_SUBREG patterns for converting between the two. The change here switches to classifying VPR64 & VPR128 as RegisterOperands, which are essentially aliases for RegisterClasses with different parsing and printing behaviour. This fits almost exactly with their real status (VPR128 == FPR128 printed strangely, VPR64 == FPR64 printed strangely). llvm-svn: 190665	2013-09-13 07:26:52 +00:00
Joey Gouly	0e76fa7df5	Add an instruction deprecation feature to TableGen. The 'Deprecated' class allows you to specify a SubtargetFeature that the instruction is deprecated on. The 'ComplexDeprecationPredicate' class allows you to define a custom predicate that is called to check for deprecation. For example: ComplexDeprecationPredicate<"MCR"> would mean you would have to define the following function: bool getMCRDeprecationInfo(MCInst &MI, MCSubtargetInfo &STI, std::string &Info) Which returns 'false' for not deprecated, and 'true' for deprecated and store the warning message in 'Info'. The MCTargetAsmParser constructor was chaned to take an extra argument of the MCInstrInfo class, so out-of-tree targets will need to be changed. llvm-svn: 190598	2013-09-12 10:28:05 +00:00
Bill Wendling	58e2d3d856	Generate compact unwind encoding from CFI directives. We used to generate the compact unwind encoding from the machine instructions. However, this had the problem that if the user used `-save-temps' or compiled their hand-written `.s' file (with CFI directives), we wouldn't generate the compact unwind encoding. Move the algorithm that generates the compact unwind encoding into the MCAsmBackend. This way we can generate the encoding whether the code is from a `.ll' or `.s' file. <rdar://problem/13623355> llvm-svn: 190290	2013-09-09 02:37:14 +00:00
Jiangning Liu	2878dc8fe7	Implement aarch64 neon instruction set AdvSIMD (3V Diff), covering the following 26 instructions, SADDL, UADDL, SADDW, UADDW, SSUBL, USUBL, SSUBW, USUBW, ADDHN, RADDHN, SABAL, UABAL, SUBHN, RSUBHN, SABDL, UABDL, SMLAL, UMLAL, SMLSL, UMLSL, SQDMLAL, SQDMLSL, SMULL, UMULL, SQDMULL, PMULL llvm-svn: 190288	2013-09-09 02:20:27 +00:00
Hao Liu	d4aede098f	Inplement aarch64 neon instructions in AdvSIMD(shift). About 24 shift instructions: sshr,ushr,ssra,usra,srshr,urshr,srsra,ursra,sri,shl,sli,sqshlu,sqshl,uqshl,shrn,sqrshrun,sqshrn,uqshr,sqrshrn,uqrshrn,sshll,ushll and 4 convert instructions: scvtf,ucvtf,fcvtzs,fcvtzu llvm-svn: 189925	2013-09-04 09:28:24 +00:00
Cameron Esfahani	943908b78d	Clean up some usage of Triple. The base class has methods for determining if the target is iOS and Linux. llvm-svn: 189604	2013-08-29 20:23:14 +00:00
Hao Liu	546bcd2f50	A minor change for an obvous problem caused by r188451: def imm0_63 : Operand<i32>, ImmLeaf<i32, [{ return Imm >= 0 && Imm < 63;}]>{ As it seems Imm <63 should be Imm <= 63. ImmLeaf is used in pattern match, but there is already a function check the shift amount range, so just remove ImmLeaf. Also add a test to check 63. llvm-svn: 188911	2013-08-21 17:47:53 +00:00
Hao Liu	cd8b02dce3	Clang and AArch64 backend patches to support shll/shl and vmovl instructions and ACLE functions llvm-svn: 188451	2013-08-15 08:26:11 +00:00
Michael Gottesman	7a8017290a	Update makeLibCall to return both the call and the chain associated with the libcall instead of just the call. This allows us to specify libcalls that return void. LowerCallTo returns a pair with the return value of the call as the first element and the chain associated with the return value as the second element. If we lower a call that has a void return value, LowerCallTo returns an SDValue with a NULL SDNode and the chain for the call. Thus makeLibCall by just returning the first value makes it impossible for you to set up the chain so that the call is not eliminated as dead code. I also updated all references to makeLibCall to reflect the new return type. llvm-svn: 188300	2013-08-13 17:54:56 +00:00
NAKAMURA Takumi	aaf66c7357	Target//CMakeLists.txt: Add the dependency to CommonTableGen explicitly for each corresponding CodeGen. Without explicit dependencies, both per-file action and in-CommonTableGen action could run in parallel. It races to emit .inc files simultaneously. llvm-svn: 187780	2013-08-06 06:38:37 +00:00
Tim Northover	40e9efd725	AArch64: add initial NEON support Patch by Ana Pazos. - Completed implementation of instruction formats: AdvSIMD three same AdvSIMD modified immediate AdvSIMD scalar pairwise - Completed implementation of instruction classes (some of the instructions in these classes belong to yet unfinished instruction formats): Vector Arithmetic Vector Immediate Vector Pairwise Arithmetic - Initial implementation of instruction formats: AdvSIMD scalar two-reg misc AdvSIMD scalar three same - Intial implementation of instruction class: Scalar Arithmetic - Initial clang changes to support arm v8 intrinsics. Note: no clang changes for scalar intrinsics function name mangling yet. - Comprehensive test cases for added instructions To verify auto codegen, encoding, decoding, diagnosis, intrinsics. llvm-svn: 187567	2013-08-01 09:20:35 +00:00
Tim Northover	caaf23852c	AArch64: fix even more JIT failures The last patch corrected some issues, but constant-pool entries had actual codegen bugs in the large memory model (which MCJIT uses). llvm-svn: 187126	2013-07-25 16:03:54 +00:00
Craig Topper	e952ad0bc1	Make some arrays 'static const' llvm-svn: 186311	2013-07-15 07:22:00 +00:00
Craig Topper	de1f151115	Add const qualifier to some static arrays. llvm-svn: 186309	2013-07-15 07:02:45 +00:00
Craig Topper	b94011fd28	Use SmallVectorImpl& instead of SmallVector to avoid repeating small vector size. llvm-svn: 186274	2013-07-14 04:42:23 +00:00
Stephen Lin	73de7bf5de	AArch64/PowerPC/SystemZ/X86: This patch fixes the interface, usage, and all in-tree implementations of TargetLoweringBase::isFMAFasterThanMulAndAdd in order to resolve the following issues with fmuladd (i.e. optional FMA) intrinsics: 1. On X86(-64) targets, ISD::FMA nodes are formed when lowering fmuladd intrinsics even if the subtarget does not support FMA instructions, leading to laughably bad code generation in some situations. 2. On AArch64 targets, ISD::FMA nodes are formed for operations on fp128, resulting in a call to a software fp128 FMA implementation. 3. On PowerPC targets, FMAs are not generated from fmuladd intrinsics on types like v2f32, v8f32, v4f64, etc., even though they promote, split, scalarize, etc. to types that support hardware FMAs. The function has also been slightly renamed for consistency and to force a merge/build conflict for any out-of-tree target implementing it. To resolve, see comments and fixed in-tree examples. llvm-svn: 185956	2013-07-09 18:16:56 +00:00
Rafael Espindola	9a21854513	Use a OwningPtr instead of a manual delete. llvm-svn: 185673	2013-07-04 22:15:33 +00:00
Rafael Espindola	dcc8935499	Fix leak. Should bring back the valgrind bot. llvm-svn: 185663	2013-07-04 19:20:00 +00:00
Jakob Stoklund Olesen	db429d9483	Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes. These exception-related opcodes are not used any longer. llvm-svn: 185625	2013-07-04 13:54:20 +00:00
Jakob Stoklund Olesen	a1f5b901a5	Revert r185595-185596 which broke buildbots. Revert "Simplify landing pad lowering." Revert "Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes." llvm-svn: 185600	2013-07-04 00:26:30 +00:00
Jakob Stoklund Olesen	f33ec531fa	Remove the EXCEPTIONADDR, EHSELECTION, and LSDAADDR ISD opcodes. These exception-related opcodes are not used any longer. llvm-svn: 185596	2013-07-03 23:56:31 +00:00
Rafael Espindola	64e1af8eb9	Remove address spaces from MC. This is dead code since PIC16 was removed in 2010. The result was an odd mix, where some parts would carefully pass it along and others would assert it was zero (most of the object streamer for example). llvm-svn: 185436	2013-07-02 15:49:13 +00:00
Tim Northover	8625fd8cad	AArch64: correct CodeGen of MOVZ/MOVK combinations. According to the AArch64 ELF specification (4.6.8), it's the assembler's responsibility to make sure the shift amount is correct in relocated MOVZ/MOVK instructions. This wasn't being obeyed by either the MCJIT CodeGen or RuntimeDyldELF (which happened to work out well for JIT tests). This commit should make us compliant in this area. llvm-svn: 185360	2013-07-01 19:23:10 +00:00
Chad Rosier	295bd43adb	The getRegForInlineAsmConstraint function should only accept MVT value types. llvm-svn: 184642	2013-06-22 18:37:38 +00:00

... 3 4 5 6 7 ...

523 Commits