llvm-project

Commit Graph

Author	SHA1	Message	Date
Tim Northover	d8d65a69cf	TableGen/ARM64: print aliases even if they have syntax variants. To get at least one use of the change (and some actual tests) in with its commit, I've enabled the AArch64 & ARM64 NEON mov aliases. llvm-svn: 208867	2014-05-15 11:16:32 +00:00
Tim Northover	ee20caaf82	TableGen: use PrintMethods to print more aliases llvm-svn: 208607	2014-05-12 18:04:06 +00:00
Hal Finkel	f0e086a0bc	Pass the value type to TLI::getRegisterByName We must validate the value type in TLI::getRegisterByName, because if we don't and the wrong type was used with the IR intrinsic, then we'll assert (because we won't be able to find a valid register class with which to construct the requested copy operation). For PPC64, additionally, the type information is necessary to decide between the 64-bit register and the 32-bit subregister. No functionality change. llvm-svn: 208508	2014-05-11 19:29:07 +00:00
Hal Finkel	b33e9872a0	Add 'override' to getRegisterByName in *ISelLowering.h No functionality change intended. llvm-svn: 208507	2014-05-11 19:28:55 +00:00
Renato Golin	c7aea40ec6	Implememting named register intrinsics This patch implements the infrastructure to use named register constructs in programs that need access to specific registers (bare metal, kernels, etc). So far, only the stack pointer is supported as a technology preview, but as it is, the intrinsic can already support all non-allocatable registers from any architecture. llvm-svn: 208104	2014-05-06 16:51:25 +00:00
Benjamin Kramer	e1ab3f062e	AArch64: Mark vector long multiplication as expand. There are no patterns for this. This was already fixed for ARM64 but I forgot to apply it to AArch64 too. llvm-svn: 207515	2014-04-29 09:37:54 +00:00
Craig Topper	e06fc4f0ca	[C++11] Add 'override' keywords and remove 'virtual'. Additionally add 'final' and leave 'virtual' on some methods that are marked virtual without overriding anything and have no obvious overrides themselves. AArch64 edition llvm-svn: 207510	2014-04-29 07:58:34 +00:00
Craig Topper	e73658ddbb	[C++] Use 'nullptr'. llvm-svn: 207394	2014-04-28 04:05:08 +00:00
Craig Topper	481fb2879f	Convert SelectionDAG::SelectNodeTo to use ArrayRef. llvm-svn: 207377	2014-04-27 19:21:11 +00:00
Craig Topper	64941d9786	Convert SelectionDAG::getMergeValues to use ArrayRef. llvm-svn: 207374	2014-04-27 19:20:57 +00:00
Craig Topper	206fcd450a	Convert getMemIntrinsicNode to take ArrayRef of SDValue instead of pointer and size. llvm-svn: 207329	2014-04-26 19:29:41 +00:00
Craig Topper	48d114bed1	Convert SelectionDAG::getNode methods to use ArrayRef<SDValue>. llvm-svn: 207327	2014-04-26 18:35:24 +00:00
Craig Topper	062a2baef0	[C++] Use 'nullptr'. Target edition. llvm-svn: 207197	2014-04-25 05:30:21 +00:00
Reid Kleckner	5772b77789	Add 'musttail' marker to call instructions This is similar to the 'tail' marker, except that it guarantees that tail call optimization will occur. It also comes with convervative IR verification rules that ensure that tail call optimization is possible. Reviewers: nicholas Differential Revision: http://llvm-reviews.chandlerc.com/D3240 llvm-svn: 207143	2014-04-24 20:14:34 +00:00
Tim Northover	6331d4b975	AArch64: print NEON lists with a space. This matches ARM64 behaviour, which I think is clearer. It also puts all the churn from that difference into one easily ignored commit. llvm-svn: 207116	2014-04-24 14:06:20 +00:00
Evgeniy Stepanov	0a951b775e	Create MCTargetOptions. For now it contains a single flag, SanitizeAddress, which enables AddressSanitizer instrumentation of inline assembly. Patch by Yuri Gorshenin. llvm-svn: 206971	2014-04-23 11:16:03 +00:00
Kevin Enderby	96918bc406	Fix the assembler to print a better relocatable expression error diagnostic that includes location information. Currently if one has this assembly: .quad (0x1234 + (4 * SOME_VALUE)) where SOME_VALUE is undefined ones gets the less than useful error message with no location information: % clang -c x.s clang -cc1as: fatal error: error in backend: expected relocatable expression With this fix one now gets a more useful error message with location information: % clang -c x.s x.s:5:8: error: expected relocatable expression .quad (0x1234 + (4 * SOME_VALUE)) ^ To do this I plumbed the SMLoc through the MCObjectStreamer EmitValue() and EmitValueImpl() interfaces so it could be used when creating the MCFixup. rdar://12391022 llvm-svn: 206906	2014-04-22 17:27:29 +00:00
Jiangning Liu	87486e0bac	[AArch64] Enable global merge pass. llvm-svn: 206861	2014-04-22 03:33:26 +00:00
Chandler Carruth	84e68b2994	[Modules] Fix potential ODR violations by sinking the DEBUG_TYPE definition below all of the header #include lines, lib/Target/... edition. llvm-svn: 206842	2014-04-22 02:41:26 +00:00
Chandler Carruth	ff55593c40	[cleanup] Fix two headers where we included a standard library header after including the generated code from tablegen. llvm-svn: 206841	2014-04-22 02:28:45 +00:00
Chandler Carruth	d174b72a28	[cleanup] Lift using directives, DEBUG_TYPE definitions, and even some system headers above the includes of generated '.inc' files that actually contain code. In a few targets this was already done pretty consistently, but it wasn't done really consistently anywhere. It is strictly cleaner IMO and necessary in a bunch of places where the DEBUG_TYPE is referenced from the generated code. Consistency with the necessary places trumps. Hopefully the build bots are OK with the movement of intrin.h... llvm-svn: 206838	2014-04-22 02:03:14 +00:00
Chandler Carruth	e96dd8975f	[Modules] Make Support/Debug.h modular. This requires it to not change behavior based on other files defining DEBUG_TYPE, which means it cannot define DEBUG_TYPE at all. This is actually better IMO as it forces folks to define relevant DEBUG_TYPEs for their files. However, it requires all files that currently use DEBUG(...) to define a DEBUG_TYPE if they don't already. I've updated all such files in LLVM and will do the same for other upstream projects. This still leaves one important change in how LLVM uses the DEBUG_TYPE macro going forward: we need to only define the macro after header files have been #include-ed. Previously, this wasn't possible because Debug.h required the macro to be pre-defined. This commit removes that. By defining DEBUG_TYPE after the includes two things are fixed: - Header files that need to provide a DEBUG_TYPE for some inline code can do so by defining the macro before their inline code and undef-ing it afterward so the macro does not escape. - We no longer have rampant ODR violations due to including headers with different DEBUG_TYPE definitions. This may be mostly an academic violation today, but with modules these types of violations are easy to check for and potentially very relevant. Where necessary to suppor headers with DEBUG_TYPE, I have moved the definitions below the includes in this commit. I plan to move the rest of the DEBUG_TYPE macros in LLVM in subsequent commits; this one is big enough. The comments in Debug.h, which were hilariously out of date already, have been updated to reflect the recommended practice going forward. llvm-svn: 206822	2014-04-21 22:55:11 +00:00
Jiangning Liu	ad874fca28	This commit allows vectorized loops to be unrolled by a factor of 2 for AArch64. A new test case is also added for ARM64. Patched by Z.Zheng llvm-svn: 206563	2014-04-18 07:57:54 +00:00
Jiangning Liu	40d81e10c5	This is one of the optimizations ported from ARM64 to AArch64 to address the performance gap between these two back ends. The test case newly added for AArch64 already exists in ARM64. Patched by Z.Zheng llvm-svn: 206559	2014-04-18 05:58:09 +00:00
Jiangning Liu	e56c30614f	This commit enables unaligned memory accesses of vector types on AArch64 back end. This should boost vectorized code performance. Patched by Z. Zheng llvm-svn: 206557	2014-04-18 03:58:38 +00:00
Chad Rosier	c4eb4f8827	[AArch64] Implement the getCSRFirstUseCost API, mirroring that in ARM64. llvm-svn: 206473	2014-04-17 16:19:54 +00:00
Craig Topper	abb4ac7f87	Convert SelectionDAG::getVTList to use ArrayRef llvm-svn: 206357	2014-04-16 06:10:51 +00:00
Nick Lewycky	aad475b324	Break PseudoSourceValue out of the Value hierarchy. It is now the root of its own tree containing FixedStackPseudoSourceValue (which you can use isa/dyn_cast on) and MipsCallEntry (which you can't). Anything that needs to use either a PseudoSourceValue* and Value* is strongly encouraged to use a MachinePointerInfo instead. llvm-svn: 206255	2014-04-15 07:22:52 +00:00
Lang Hames	a1bc0f5662	[MC] Require an MCContext when constructing an MCDisassembler. This patch re-introduces the MCContext member that was removed from MCDisassembler in r206063, and requires that an MCContext be passed in at MCDisassembler construction time. (Previously the MCContext member had been initialized in an ad-hoc fashion after construction). The MCCContext member can be used by MCDisassembler sub-classes to construct constant or target-specific MCExprs. This patch updates disassemblers for in-tree targets, and provides the MCRegisterInfo instance that some disassemblers were using through the MCContext (previously those backends were constructing their own MCRegisterInfo instances). llvm-svn: 206241	2014-04-15 04:40:56 +00:00
Chad Rosier	4ec124bc3e	[AArch64] Implement the isLegalAddressingMode and getScalingFactorCost APIs. llvm-svn: 206089	2014-04-12 00:14:23 +00:00
NAKAMURA Takumi	554c287262	LLVMBuild.txt: Add missing dependencies. llvm-svn: 205962	2014-04-10 11:16:47 +00:00
Chad Rosier	5f8d6a6c15	[AArch64] Implement the isZExtFree APIs. llvm-svn: 205926	2014-04-09 20:51:21 +00:00
Chad Rosier	9ce19fb65c	[AArch64] Implement the isTruncateFree API. In AArch64 i64 to i32 truncate operation is a subregister access. This allows more opportunities for LSR optmization to eliminate variables of different types (i32 and i64). llvm-svn: 205925	2014-04-09 20:43:40 +00:00
Alp Toker	16f98b255d	Fix some doc and comment typos llvm-svn: 205899	2014-04-09 14:47:27 +00:00
Tim Northover	07a8ff4892	ARM64: handle v1i1 types arising from setcc properly. There were several overlapping problems here, and this solution is closely inspired by the one adopted in AArch64 in r201381. Firstly, scalarisation of v1i1 setcc operations simply fails if the input types are legal. This is fixed in LegalizeVectorTypes.cpp this time, and allows AArch64 code to be simplified slightly. Second, vselect with such a setcc feeding into it ends up in ScalarizeVectorOperand, where it's not handled. I experimented with an implementation, but found that whatever DAG came out was rather horrific. I think Hao's DAG combine approach is a good one for quality, though there are edge cases it won't catch (to be fixed separately). Should fix PR19335. llvm-svn: 205625	2014-04-04 14:49:21 +00:00
Craig Topper	840beec2d0	Make consistent use of MCPhysReg instead of uint16_t throughout the tree. llvm-svn: 205610	2014-04-04 05:16:06 +00:00
Stepan Dyatkovskiy	8baf17fc5f	PR18929: According to ARM assembler language hash symbol is optional before immediates. For example, see here for more details: http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.dui0473j/dom1359731154529.html llvm-svn: 205157	2014-03-30 17:09:54 +00:00
Rafael Espindola	5904e12bfa	Completely rewrite ELFObjectWriter::RecordRelocation. I started trying to fix a small issue, but this code has seen a small fix too many. The old code was fairly convoluted. Some of the issues it had: * It failed to check if a symbol difference was in the some section when converting a relocation to pcrel. * It failed to check if the relocation was already pcrel. * The pcrel value computation was wrong in some cases (relocation-pc.s) * It was missing quiet a few cases where it should not convert symbol relocations to section relocations, leaving the backends to patch it up. * It would not propagate the fact that it had changed a relocation to pcrel, requiring a quiet nasty work around in ARM. * It was missing comments. llvm-svn: 205076	2014-03-29 06:26:49 +00:00
Rafael Espindola	3e3de5e353	Add const. llvm-svn: 205013	2014-03-28 16:06:09 +00:00
Rafael Espindola	c03f44ca8a	Remove another unused argument. llvm-svn: 204961	2014-03-27 20:49:35 +00:00
Rafael Espindola	9ab380122a	Remove unused argument. llvm-svn: 204956	2014-03-27 20:41:17 +00:00
Logan Chien	30eb9f47c6	[AArch64] Lower SHL_PARTS, SRA_PARTS and SRL_PARTS Lower SHL_PARTS, SRA_PARTS and SRL_PARTS to perform 128-bit integer shift Patch by GuanHong Liu. llvm-svn: 204940	2014-03-27 16:28:09 +00:00
Christian Pirker	3aa0e6a1f9	AArch64_BE function argument passing for ARM ABI llvm-svn: 204814	2014-03-26 14:51:22 +00:00
Nuno Lopes	31617266ea	remove a bunch of unused private methods found with a smarter version of -Wunused-member-function that I'm playwing with. Appologies in advance if I removed someone's WIP code. include/llvm/CodeGen/MachineSSAUpdater.h \| 1 include/llvm/IR/DebugInfo.h \| 3 lib/CodeGen/MachineSSAUpdater.cpp \| 10 -- lib/CodeGen/PostRASchedulerList.cpp \| 1 lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp \| 10 -- lib/IR/DebugInfo.cpp \| 12 -- lib/MC/MCAsmStreamer.cpp \| 2 lib/Support/YAMLParser.cpp \| 39 --------- lib/TableGen/TGParser.cpp \| 16 --- lib/TableGen/TGParser.h \| 1 lib/Target/AArch64/AArch64TargetTransformInfo.cpp \| 9 -- lib/Target/ARM/ARMCodeEmitter.cpp \| 12 -- lib/Target/ARM/ARMFastISel.cpp \| 84 -------------------- lib/Target/Mips/MipsCodeEmitter.cpp \| 11 -- lib/Target/Mips/MipsConstantIslandPass.cpp \| 12 -- lib/Target/NVPTX/NVPTXISelDAGToDAG.cpp \| 21 ----- lib/Target/NVPTX/NVPTXISelDAGToDAG.h \| 2 lib/Target/PowerPC/PPCFastISel.cpp \| 1 lib/Transforms/Instrumentation/AddressSanitizer.cpp \| 2 lib/Transforms/Instrumentation/BoundsChecking.cpp \| 2 lib/Transforms/Instrumentation/MemorySanitizer.cpp \| 1 lib/Transforms/Scalar/LoopIdiomRecognize.cpp \| 8 - lib/Transforms/Scalar/SCCP.cpp \| 1 utils/TableGen/CodeEmitterGen.cpp \| 2 24 files changed, 2 insertions(+), 261 deletions(-) llvm-svn: 204560	2014-03-23 17:09:26 +00:00
Chad Rosier	b7747e31ef	[AArch64] Add SchedRW lists to NEON instructions. Previously, only regular AArch64 instructions were annotated with SchedRW lists. This patch does the same for NEON enabling these instructions to be scheduled by the MIScheduler. Additionally, store operations are now modeled and a few SchedRW lists were updated for bug fixes (e.g. multiple def operands). Reviewers: apazos, mcrosier, atrick Patch by Dave Estes <cestes@codeaurora.org>! llvm-svn: 204505	2014-03-21 19:34:41 +00:00
Kevin Qin	b2c78b07d6	[AArch64] Remove .data_region directive from AArch64. .data_region is only used in Darwin, so it shouldn't be generated for other OS. Currently AArch64 doesn't support darwin yet, so I removed it from AArch64. When Darwin is supported someday, we can add it back and associate it with Darwin. llvm-svn: 204424	2014-03-21 02:12:48 +00:00
Tim Northover	38a93aaaa1	AArch64: error when both positional & named operands are used. Only one instruction pair needed changing: SMULH & UMULH. The previous code worked, but MC was doing extra work treating Ra as a valid operand (which then got completely overwritten in MCCodeEmitter). No behaviour change, so no tests. llvm-svn: 203772	2014-03-13 09:00:13 +00:00
Rafael Espindola	3d5d464df8	Try harder to evaluate expressions when printing assembly. When printing assembly we don't have a Layout object, but we can still try to fold some constants. Testcase by Ulrich Weigand. llvm-svn: 203677	2014-03-12 16:55:59 +00:00
Tim Northover	2a661f3f73	AArch64: fix LowerCONCAT_VECTORS for new CodeGen. The function was making too many assumptions about its input: 1. The NEON_VDUP optimisation was far too aggressive, assuming (I think) that the input would always be BUILD_VECTOR. 2. We were treating most unknown concats as legal (by returning Op rather than SDValue()). I think only concats of pairs of vectors are actually legal. http://llvm.org/PR19094 llvm-svn: 203450	2014-03-10 09:34:07 +00:00
Chandler Carruth	e42bafece1	[AArch64] Fix a use of uninitialized memory introduced in r203125, and caught by the MSan bootstrap build bot. This should hopefully get the bot green at long last. llvm-svn: 203441	2014-03-10 03:52:47 +00:00
Chandler Carruth	aee3ca6cfd	[TTI] There is actually no realistic way to pop TTI implementations off the stack of the analysis group because they are all immutable passes. This is made clear by Craig's recent work to use override systematically -- we weren't overriding anything for 'finalizePass' because there is no such thing. This is kind of a lame restriction on the API -- we can no longer push and pop things, we just set up the stack and run. However, I'm not invested in building some better solution on top of the existing (terrifying) immutable pass and legacy pass manager. llvm-svn: 203437	2014-03-10 02:45:14 +00:00
Tim Northover	fad2761ca0	InstCombine: form shuffles from wider range of insert/extractelements Sequences of insertelement/extractelements are sometimes used to build vectorsr; this code tries to put them back together into shuffles, but could only produce a completely uniform shuffle types (<N x T> from two <N x T> sources). This should allow shuffles with different numbers of elements on the input and output sides as well. llvm-svn: 203229	2014-03-07 10:24:44 +00:00
Rafael Espindola	b1f25f1b93	Replace PROLOG_LABEL with a new CFI_INSTRUCTION. The old system was fairly convoluted: * A temporary label was created. * A single PROLOG_LABEL was created with it. * A few MCCFIInstructions were created with the same label. The semantics were that the cfi instructions were mapped to the PROLOG_LABEL via the temporary label. The output position was that of the PROLOG_LABEL. The temporary label itself was used only for doing the mapping. The new CFI_INSTRUCTION has a 1:1 mapping to MCCFIInstructions and points to one by holding an index into the CFI instructions of this function. I did consider removing MMI.getFrameInstructions completelly and having CFI_INSTRUCTION own a MCCFIInstruction, but MCCFIInstructions have non trivial constructors and destructors and are somewhat big, so the this setup is probably better. The net result is that we don't create temporary labels that are never used. llvm-svn: 203204	2014-03-07 06:08:31 +00:00
Rafael Espindola	0233bf19fd	Simplify. No functionality change. llvm-svn: 203202	2014-03-07 04:58:32 +00:00
Chad Rosier	86a8f72041	[AArch64] This is a work in progress to provide a machine description for the Cortex-A53 subtarget in the AArch64 backend. This patch lays the ground work to annotate each AArch64 instruction (no NEON yet) with a list of SchedReadWrite types. The patch also provides the Cortex-A53 processor resources, maps those the the default SchedReadWrites, and provides basic latency. NEON support will be added in a subsequent patch with proper forwarding logic. Verification was done by setting the pre-RA scheduler to linearize to better gauge the effect of the MIScheduler. Even without modeling the forward logic, the results show a modest improvement for Cortex-A53. Reviewers: apazos, mcrosier, atrick Patch by Dave Estes <cestes@codeaurora.org>! llvm-svn: 203125	2014-03-06 16:04:00 +00:00
Chandler Carruth	9a4c9e597b	[Layering] Move DebugInfo.h into the IR library where its implementation already lives. llvm-svn: 203046	2014-03-06 00:46:21 +00:00
Kevin Qin	b08c6746c4	[AArch64]Fix improper diagnostics about offset range of load/store instructions. llvm-svn: 202775	2014-03-04 02:05:13 +00:00
Chad Rosier	70cb2311ab	Revert "[AArch64] This is a work in progress to provide a machine description" This reverts commit ff717c8fc786a0cfa1602982b91895fa09e514fc. llvm-svn: 202773	2014-03-04 00:32:07 +00:00
Chad Rosier	fe45290566	[AArch64] This is a work in progress to provide a machine description for the Cortex-A53 subtarget in the AArch64 backend. This patch lays the ground work to annotate each AArch64 instruction (no NEON yet) with a list of SchedReadWrite types. The patch also provides the Cortex-A53 processor resources, maps those the the default SchedReadWrites, and provides basic latency. NEON support will be added in a subsequent patch with proper forwarding logic. Verification was done by setting the pre-RA scheduler to linearize to better gauge the effect of the MIScheduler. Even without modeling the forward logic, the results show a modest improvement for Cortex-A53. Reviewers: apazos, mcrosier, atrick Patch by Dave Estes <cestes@codeaurora.org>! llvm-svn: 202767	2014-03-03 23:32:47 +00:00
Benjamin Kramer	b6d0bd48bd	[C++11] Replace llvm::next and llvm::prior with std::next and std::prev. Remove the old functions. llvm-svn: 202636	2014-03-02 12:27:27 +00:00
Craig Topper	73156025e0	Switch all uses of LLVM_OVERRIDE to just use 'override' directly. llvm-svn: 202621	2014-03-02 09:09:27 +00:00
Craig Topper	77dfe45f81	Switch all uses of LLVM_FINAL to just use 'final', and remove the macro. llvm-svn: 202618	2014-03-02 08:08:51 +00:00
Albrecht Kadlec	fbd12d35c8	trivial test commit llvm-svn: 202084	2014-02-24 22:18:38 +00:00
Christian Pirker	6c2f4d45e1	Add AArch64 big endian Target (aarch64_be) llvm-svn: 202024	2014-02-24 11:34:50 +00:00
Kevin Qin	07334d37de	[AArch64] Add register constraints to avoid generating STLXR and STXR with unpredictable behavior. llvm-svn: 201841	2014-02-21 07:45:48 +00:00
Oliver Stannard	7b2f2fba7f	AArch64: __va_list.__stack must be 8-byte aligned The va_start macro for AArch64 must set va_list.__stack to the address following the last named argument on the stack, rounded up to an alignment of 8 bytes. llvm-svn: 201797	2014-02-20 17:19:26 +00:00
Chad Rosier	63bfeb993b	[AArch64] Add support for TargetTransformInfo Analysis. llvm-svn: 201793	2014-02-20 16:00:08 +00:00
Christian Pirker	bd1eb0db1f	Test commit - remove the new line to lib/Target/AArch64/AArch64TargetMachine.cpp. llvm-svn: 201698	2014-02-19 16:58:28 +00:00
Christian Pirker	25ff038545	Test commit - added a new line to lib/Target/AArch64/AArch64TargetMachine.cpp. llvm-svn: 201692	2014-02-19 16:07:32 +00:00
Ana Pazos	7c27a265dc	[AArch64] Expanded sin, cos, pow with FP vector types inputs llvm-svn: 201601	2014-02-18 20:31:05 +00:00
Jiangning Liu	742c588edc	Fix a typo about lowering AArch64 va_copy. llvm-svn: 201541	2014-02-18 02:37:42 +00:00
Kevin Qin	edc95ee196	[AArch64 NEON] Fix a bug to avoid using floating type as condition type in lowering SELECT_CC. llvm-svn: 201395	2014-02-14 09:41:15 +00:00
Jiangning Liu	293349e4d7	Enable AArch64 NEON by default. llvm-svn: 201385	2014-02-14 04:38:09 +00:00
Hao Liu	7146ef8542	[AArch64]Fix the assertion failure caused by "v1i1 SETCC" DAG node. As v1i1 is illegal, the type legalizer tries to scalarize such node. But if the type operands of SETCC is legal, the scalarization algorithm will cause an assertion failure. llvm-svn: 201381	2014-02-14 02:21:56 +00:00
Daniel Sanders	753e17629d	Re-commit: Demote EmitRawText call in AsmPrinter::EmitInlineAsm() and remove hasRawTextSupport() call Summary: AsmPrinter::EmitInlineAsm() will no longer use the EmitRawText() call for targets with mature MC support. Such targets will always parse the inline assembly (even when emitting assembly). Targets without mature MC support continue to use EmitRawText() for assembly output. The hasRawTextSupport() check in AsmPrinter::EmitInlineAsm() has been replaced with MCAsmInfo::UseIntegratedAs which when true, causes the integrated assembler to parse inline assembly (even when emitting assembly output). UseIntegratedAs is set to true for targets that consider any failure to parse valid assembly to be a bug. Target specific subclasses generally enable the integrated assembler in their constructor. The default value can be overridden with -no-integrated-as. All tests that rely on inline assembly supporting invalid assembly (for example, those that use mnemonics such as 'foo' or 'hello world') have been updated to disable the integrated assembler. Changes since review (and last commit attempt): - Fixed test failures that were missed due to configuration of local build. (fixes crash.ll and a couple others). - Fixed tests that happened to pass because the local build was on X86 (should fix 2007-12-17-InvokeAsm.ll) - mature-mc-support.ll's should no longer require all targets to be compiled. (should fix ARM and PPC buildbots) - Object output (-filetype=obj and similar) now forces the integrated assembler to be enabled regardless of default setting or -no-integrated-as. (should fix SystemZ buildbots) Reviewers: rafael Reviewed By: rafael CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2686 llvm-svn: 201333	2014-02-13 14:44:26 +00:00
Oliver Stannard	5bbb72f37e	Add Cortex-A53 and Cortex-A57 cores to the AArch64 backend llvm-svn: 201305	2014-02-13 09:46:11 +00:00
Hao Liu	7b6dfcf06a	[AArch64]Fix the problems that can't select mul/add/sub of v1i8/v1i16/v1i32 types. As this problems are similar to shl/sra/srl, also add patterns for shift nodes. llvm-svn: 201298	2014-02-13 05:42:33 +00:00
Hao Liu	4f345f3c03	[AArch64]Add support for spilling FPR8/FPR16. llvm-svn: 201287	2014-02-13 02:36:58 +00:00
Daniel Sanders	abe212a3b8	Revert r201237+r201238: Demote EmitRawText call in AsmPrinter::EmitInlineAsm() and remove hasRawTextSupport() call It introduced multiple test failures in the buildbots. llvm-svn: 201241	2014-02-12 15:39:20 +00:00
Daniel Sanders	a7d504cf58	Demote EmitRawText call in AsmPrinter::EmitInlineAsm() and remove hasRawTextSupport() call Summary: AsmPrinter::EmitInlineAsm() will no longer use the EmitRawText() call for targets with mature MC support. Such targets will always parse the inline assembly (even when emitting assembly). Targets without mature MC support continue to use EmitRawText() for assembly output. The hasRawTextSupport() check in AsmPrinter::EmitInlineAsm() has been replaced with MCAsmInfo::UseIntegratedAs which when true, causes the integrated assembler to parse inline assembly (even when emitting assembly output). UseIntegratedAs is set to true for targets that consider any failure to parse valid assembly to be a bug. Target specific subclasses generally enable the integrated assembler in their constructor. The default value can be overridden with -no-integrated-as. All tests that rely on inline assembly supporting invalid assembly (for example, those that use mnemonics such as 'foo' or 'hello world') have been updated to disable the integrated assembler. Reviewers: rafael Reviewed By: rafael CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2686 llvm-svn: 201237	2014-02-12 14:44:54 +00:00
Chad Rosier	bcde0c49cb	[AArch64] Handle aliases of conditional branches without b.pred form. llvm-svn: 201091	2014-02-10 15:43:11 +00:00
Hao Liu	6e73761dc8	[AArch64]Implement the copy of two FPR8 registers by using FMOVss of two FPR32 registers in copyPhysReg. llvm-svn: 201061	2014-02-10 03:16:22 +00:00
Jim Grosbach	e9008de652	X86: Resolve a long standing FIXME and properly isel pextr[bw]. Generalize the AArch64 .td nodes for AssertZext and AssertSext. Use them to match the relevant pextr store instructions. The test widen_load-2.ll requires a slight change because with the stores gone, the remaining instructions are scheduled in a different order. Add test cases for SSE4 and AVX variants. Resolves rdar://13414672. Patch by Adam Nemet <anemet@apple.com>. llvm-svn: 200957	2014-02-07 00:16:33 +00:00
Tim Northover	fdbdb4b6d5	ARM & AArch64: merge NEON absolute compare intrinsics There was an extremely confusing proliferation of LLVM intrinsics to implement the vacge & vacgt instructions. This combines them all into two polymorphic intrinsics, shared across both backends. llvm-svn: 200768	2014-02-04 14:55:42 +00:00
Tim Northover	24979d8e10	AArch64 & ARM: refactor crypto intrinsics to take scalars Some of the SHA instructions take a scalar i32 as one argument (largely because they work on 160-bit hash fragments). This wasn't reflected in the IR previously, with ARM and AArch64 choosing different types (<4 x i32> and <1 x i32> respectively) which was ugly. This makes all the affected intrinsics take a uniform "i32", allowing them to become non-polymorphic at the same time. llvm-svn: 200706	2014-02-03 17:27:49 +00:00
Craig Topper	e7a9ee5c4a	Remove unnecessary include of AArch64GenInstrInfo.inc from AArch64Disassembler.cpp. None of the GET_ defines were set that would make the include do anything. llvm-svn: 200677	2014-02-03 06:33:17 +00:00
Chad Rosier	fe5ab2f5ba	[AArch64] Custom lower concat_vector patterns with v4i16, v4i32, v8i8, v8i16, v16i8 types. llvm-svn: 200491	2014-01-30 21:46:54 +00:00
Kevin Qin	92d64d2d56	[AArch64 NEON] Lower SELECT_CC with vector operand. When the scalar compare is between floating point and operands are vector, we custom lower SELECT_CC to use NEON SIMD compare for generating less instructions. llvm-svn: 200365	2014-01-29 01:57:30 +00:00
David Woodhouse	3fa98a65e9	Propagate MCSubtargetInfo through TableGen's getBinaryCodeForInstr() llvm-svn: 200349	2014-01-28 23:13:18 +00:00
David Woodhouse	9784cef38d	Explictly pass MCSubtargetInfo to MCCodeEmitter::EncodeInstruction() llvm-svn: 200348	2014-01-28 23:13:07 +00:00
David Woodhouse	e6c13e4abd	Change MCStreamer EmitInstruction interface to take subtarget info llvm-svn: 200345	2014-01-28 23:12:42 +00:00
Kevin Qin	4a183d7094	[AArch64 NEON] Try to generate CONCAT_VECTOR when lowering BUILD_VECTOR or SHUFFLE_VECTOR. Replace r199791. llvm-svn: 200180	2014-01-27 02:53:54 +00:00
Kevin Qin	9eeedfbaa6	Revert r199791. It's old version which has some bugs. I'll commit lattest patch soon. llvm-svn: 200179	2014-01-27 02:53:41 +00:00
Rafael Espindola	e41383f899	Pass a MCSubtargetInfo down to the TargetStreamer creation. With this the target streamers will be able to know the target features that are in use. llvm-svn: 200135	2014-01-26 06:38:58 +00:00
Rafael Espindola	24ea09ef7d	Construct the MCStreamer before constructing the MCTargetStreamer. This has a few advantages: * Only targets that use a MCTargetStreamer have to worry about it. * There is never a MCTargetStreamer without a MCStreamer, so we can use a reference. * A MCTargetStreamer can talk to the MCStreamer in its constructor. llvm-svn: 200129	2014-01-26 06:06:37 +00:00
Jiangning Liu	fb3c17b6c9	Improve pattern match from v1i8 to v1i32 for AArch64 Neon. llvm-svn: 200119	2014-01-26 04:55:53 +00:00
Jiangning Liu	6398d839c6	Implement pattern match from v1xx to v1xx for AArch64 Neon. llvm-svn: 200113	2014-01-26 03:27:40 +00:00
Kevin Qin	18662f4b7c	[AArch64 NEON] Add patterns for concat_vector on v2i32. llvm-svn: 200111	2014-01-26 02:46:15 +00:00
Ana Pazos	cd3b9f763e	[AArch64] Removed unused i8 type from FPR8 register class. The i8 type is not registered with any register class. This causes a segmentation fault in MachineLICM::getRegisterClassIDAndCost. The code selects the first type associated with register class FPR8, which happens to be i8. It uses this type (i8) to get the representative class pointer, which is 0. It then uses this pointer to access a field, resulting in segmentation fault. Since i8 type is not being used for printing any neon instruction we can safely remove it. llvm-svn: 200046	2014-01-24 22:36:53 +00:00
Alp Toker	cb40291100	Fix known typos Sweep the codebase for common typos. Includes some changes to visible function names that were misspelt. llvm-svn: 200018	2014-01-24 17:20:08 +00:00
Kevin Qin	21cd2152d3	[AArch64 NEON] Fix a bug in implementing register copy bwtween FPR16. llvm-svn: 199978	2014-01-24 07:53:04 +00:00
Ana Pazos	5d31f6945b	[AArch64] Added vselect patterns with float and double types llvm-svn: 199925	2014-01-23 19:18:57 +00:00
Kevin Qin	50944eb638	fix some spell mistakes around 'ConcatVector' and 'ShuffleVector' in AArch64 backend. llvm-svn: 199858	2014-01-23 01:35:13 +00:00
Kevin Qin	ce0190c6d5	[AArch64 NEON] Try to generate CONCAT_VECTOR when lowering BUILD_VECTOR or SHUFFLE_VECTOR. llvm-svn: 199791	2014-01-22 06:11:03 +00:00
Kevin Qin	6d379abd8f	[AArch64 NEON] Fix a bug caused by undef lane when generating VEXT. It was commited as r199628 but reverted in r199628 as causing regression test failed. It's because of old vervsion of patch I used to commit. Sorry for mistake. llvm-svn: 199704	2014-01-21 01:48:52 +00:00
Chandler Carruth	f835fc6f4f	Revert r199628: "[AArch64 NEON] Fix a bug caused by undef lane when generating VEXT." This test fails the newly added regression tests. llvm-svn: 199631	2014-01-20 08:18:01 +00:00
Kevin Qin	ff42e06ef4	[AArch64 NEON] Fix a bug caused by undef lane when generating VEXT. llvm-svn: 199628	2014-01-20 07:32:26 +00:00
Kevin Qin	ef66ff78ca	[AArch64 NEON] Accept both #0.0 and #0 for comparing with floating point zero in asm parser. For FCMEQ, FCMGE, FCMGT, FCMLE and FCMLT, floating point zero will be printed as #0.0 instead of #0. To support the history codes using #0, we consider to let asm parser accept both #0.0 and #0. llvm-svn: 199621	2014-01-20 02:14:05 +00:00
Kevin Qin	e0faea11b1	[AArch64 NEON] Expand vector for UDIV/SDIV/UREM/SREM/FREM as neon doesn't support these operations. llvm-svn: 199485	2014-01-17 09:54:30 +00:00
Hao Liu	17457a2ee2	[AArch64]Fix the problem can't select f16_to_f32 and f32_to_f16. Also add copy support for FPR16. Also add a missing test case file belongs to commit r197361. llvm-svn: 199463	2014-01-17 06:23:30 +00:00
Kevin Qin	212d9b4a56	[AArch64 NEON] Custom lower conversion between vector integer and vector floating point if element bit-width doesn't match. llvm-svn: 199462	2014-01-17 05:52:35 +00:00
Hao Liu	18d92262c5	[AArch64]Fix the problem can't select concat_vectors of two v1i32 types. Also fix the problem can't select scalar_to_vector from f32 to v2f32/v4f32. llvm-svn: 199461	2014-01-17 05:44:46 +00:00
Jiangning Liu	0a791c348b	For AArch64, lowering sext_inreg and generate optimized code by using SXTL. llvm-svn: 199296	2014-01-15 05:08:01 +00:00
Tim Northover	6e219cd588	AArch64: don't try to handle [SU]MUL_LOHI nodes We should set them to expand for now since there are no patterns dealing with them. Actually, there are no instructions either so I doubt they'll ever be acceptable. llvm-svn: 199265	2014-01-14 22:53:22 +00:00
Lang Hames	06234ec147	Add FPExt option to CCValAssign::LocInfo. When generating calling-convention promotion code, Tablegen will now select FPExt for floating point promotions (previously it had returned AExt, which is not valid for floating point types). Any out-of-tree targets that were relying on AExt being returned for FP promotions will need to update their code check for FPExt instead. llvm-svn: 199252	2014-01-14 19:56:36 +00:00
Rafael Espindola	08ff298d51	Revert "[AArch64] Added vselect patterns with float and double types" This reverts commit r199242. It is causing CodeGen/AArch64/neon-bsl.ll to fail. llvm-svn: 199248	2014-01-14 19:24:08 +00:00
Ana Pazos	787f540daa	[AArch64] Added vselect patterns with float and double types llvm-svn: 199242	2014-01-14 18:45:48 +00:00
Andrea Di Biagio	9bc0415c1f	[AArch64] Fix assertion failure caused by an invalid comparison between APInt values. APInt only knows how to compare values with the same BitWidth and asserts in all other cases. With this fix, function PerformORCombine does not use the APInt equality operator if the APInt values returned by 'isConstantSplat' differ in BitWidth. In that case they are different and no comparison is needed. llvm-svn: 199119	2014-01-13 16:51:00 +00:00
Kevin Qin	cfef55d6d4	[AArch64 NEON] Add missing patterns for bitcast from or to v1f64 llvm-svn: 199070	2014-01-13 01:58:38 +00:00
Kevin Qin	21e8f1c4eb	[AArch64 NEON] Add more scenarios to use perm instructions when lowering shuffle_vector This patch covered 2 more scenarios: 1. Two operands of shuffle_vector are the same, like %shuffle.i = shufflevector <8 x i8> %a, <8 x i8> %a, <8 x i32> <i32 0, i32 2, i32 4, i32 6, i32 8, i32 10, i32 12, i32 14> 2. One of operands is undef, like %shuffle.i = shufflevector <8 x i8> %a, <8 x i8> undef, <8 x i32> <i32 0, i32 2, i32 4, i32 6, i32 8, i32 10, i32 12, i32 14> After this patch, perm instructions will have chance to be emitted instead of lots of INS. llvm-svn: 199069	2014-01-13 01:56:29 +00:00
Saleem Abdulrasool	a6505ca4c2	correct target directive handling error handling The target specific parser should return `false' if the target AsmParser handles the directive, and `true' if the generic parser should handle the directive. Many of the target specific directive handlers would `return Error' which does not follow these semantics. This change simply changes the target specific routines to conform to the semantis of the ParseDirective correctly. Conformance to the semantics improves diagnostics emitted for the invalid directives. X86 is taken as a sample to ensure that multiple diagnostics are not presented for a single error. llvm-svn: 199068	2014-01-13 01:15:39 +00:00
Kristof Beyls	90ff80e329	Silence unused variable warning for non-asserting builds that was introduced in r198937. llvm-svn: 198941	2014-01-10 14:20:45 +00:00
Kristof Beyls	58306ad903	Make sure -use-init-array has intended effect on all AArch64 ELF targets, not just linux. llvm-svn: 198937	2014-01-10 13:41:49 +00:00
Ana Pazos	cfd2ca5826	[AArch64][NEON] Added UXTL and UXTL2 instruction aliases llvm-svn: 198791	2014-01-08 21:02:13 +00:00
Kevin Qin	44946439e1	[AArch64 NEON] Fix generating incorrect value type of NEON_VDUPLANE when lower build_vector if result value type mismatch with operand value type. llvm-svn: 198743	2014-01-08 08:06:14 +00:00
Rafael Espindola	894843cb4e	Move the llvm mangler to lib/IR. This makes it available to tools that don't link with target (like llvm-ar). llvm-svn: 198708	2014-01-07 21:19:40 +00:00
Chandler Carruth	8a8cd2bab9	Re-sort all of the includes with ./utils/sort_includes.py so that subsequent changes are easier to review. About to fix some layering issues, and wanted to separate out the necessary churn. Also comment and sink the include of "Windows.h" in three .inc files to match the usage in Memory.inc. llvm-svn: 198685	2014-01-07 11:48:04 +00:00
Hao Liu	7d11d99d20	[AArch64]Add support to spill/fill D tuples such as DPair/DTriple/DQuad. There is no test cases for D tuple as the original test cases are too large. As the spill/fill of the D tuple is similar to the Q tuple, the correctness can be guaranteed. llvm-svn: 198684	2014-01-07 10:50:43 +00:00
Hao Liu	27d88376bc	[AArch64]Add support to copy D tuples such as DPair/DTriple/DQuad and Q tuples such as QPair/QTriple/QQuad. There is no test case for D tuple as the original test cases are too large. As the copy of the D tuple is similar to the Q tuple, the correctness can be guaranteed. llvm-svn: 198682	2014-01-07 10:00:03 +00:00
Kevin Qin	cfa41a2569	[AArch64 NEON] Fixed incorrect immediate used in BIC instruction. llvm-svn: 198675	2014-01-07 05:10:47 +00:00
Bill Wendling	13199b17f8	Remove unnecessary #includes. llvm-svn: 198585	2014-01-06 06:00:00 +00:00
Bill Wendling	908bf814e7	Refactor function that checks that __builtin_returnaddress's argument is constant. This moves the check up into the parent class so that all targets can use it without having to copy (and keep in sync) the same error message. llvm-svn: 198579	2014-01-06 00:43:20 +00:00
Bill Wendling	df7dd28dc8	Emit an error message if the value passed to __builtin_returnaddress isn't a constant __builtin_returnaddress requires that the value passed into is be a constant. However, at -O0 even a constant expression may not be converted to a constant. Emit an error message intead of crashing. llvm-svn: 198531	2014-01-05 01:47:20 +00:00
Rafael Espindola	58873566b3	Make the llvm mangler depend only on DataLayout. Before this patch any program that wanted to know the final symbol name of a GlobalValue had to link with Target. This patch implements a compromise solution where the mangler uses DataLayout. This way, any tool that already links with Target (llc, clang) gets the exact behavior as before and new IR files can be mangled without linking with Target. With this patch the mangler is constructed with just a DataLayout and DataLayout is extended to include the information the Mangler needs. llvm-svn: 198438	2014-01-03 19:21:54 +00:00
Ana Pazos	e891c5f264	[AArch64][NEON] Added SXTL and SXTL2 instruction aliases llvm-svn: 198437	2014-01-03 19:20:31 +00:00
Rafael Espindola	6994fdf33c	Remove the 's' DataLayout specification During the years there have been some attempts at figuring out how to align byval arguments. A look at the commit log suggests that they were * Use the ABI alignment. * When that was not sufficient for x86-64, I added the 's' specification to DataLayout. * When that was not sufficient Evan added the virtual getByValTypeAlignment. * When even that was not sufficient, we just got the FE to add the alignment to the byval. This patch is just a simple cleanup that removes my first attempt at fixing the problem. I also added an AArch64 implementation of getByValTypeAlignment to make sure this patch is a nop. I also left the 's' parsing for backward compatibility. I will send a short email to llvmdev about the change for anyone maintaining an out of tree target. llvm-svn: 198287	2014-01-01 22:29:43 +00:00
Jiangning Liu	a0acf70af1	For AArch64 Neon, simplify scalar dup by lane0 for fp. llvm-svn: 198194	2013-12-30 02:44:35 +00:00
Hao Liu	fe3bfc8c41	[AArch64]Add code to spill/fill Q register tuples such as QPair/QTriple/QQuad. llvm-svn: 198193	2013-12-30 02:38:12 +00:00
Hao Liu	b591f835d6	[AArch64]Can't select shift left 0 of type v1i64 llvm-svn: 198192	2013-12-30 02:12:46 +00:00
Hao Liu	74107fe526	[AArch64]Fix the problem that can't select mul of v1i64/v2i64 types. E.g. Can't select such IR: %tmp = mul <2 x i64> %a, %b llvm-svn: 198188	2013-12-30 01:38:41 +00:00
Hao Liu	83799741fb	[AArch64]Fix a problem that the register order of fmls/fmla by element is incorrect. E.g. the codegen result is fmls v1.2s, v0.2s, v2.s[3] which is expected to be fmls v0.2s, v1.2s, v2.s[3] llvm-svn: 198001	2013-12-25 07:12:34 +00:00
Hao Liu	ce7a12be8f	[AArch64]Add patterns to match normal shift nodes: shl, sra and srl. llvm-svn: 197969	2013-12-24 09:00:21 +00:00
Kevin Qin	82bd84aadf	[AArch64 NEON] Fix a bug when lowering BUILD_VECTOR. DAG.getVectorShuffle() doesn't always return a vector_shuffle node. If mask is the exact sequence of it's operand(For example, operand_0 is v8i8, and the mask is 0, 1, 2, 3, 4, 5, 6, 7), it will directly return that operand. So a check is added here. llvm-svn: 197967	2013-12-24 08:16:06 +00:00
Kevin Qin	cd5f3153f5	[AArch64 NEON] Fix a pattern match failure with NEON_VDUP. This failure caused by improper condition when lowering shuffle_vector to scalar_to_vector. After this patch NEON_VDUP with v1i64 will not be generated. llvm-svn: 197966	2013-12-24 08:11:47 +00:00
Ana Pazos	bc2996b30f	[AArch64] Check fmul node single use in fused multiply patterns Check for single use of fmul node in fused multiply patterns to allow generation of fused multiply add/sub instructions. Otherwise fmul operation ends up being repeated more than once which does not help peformance on targets with only one MAC unit, as for example cortex-a53. llvm-svn: 197929	2013-12-24 00:47:29 +00:00
Ana Pazos	3ca23915cd	[AArch64 NEON] Fixed fused multiply negate add/sub patterns The correct pattern matching should be: - fnmadd is (-Ra) + (-Rn)Rm which should be matched as: fma (fneg node:$Rn), node:$Rm, (fneg node:$Ra) and as (f32 (fsub (f32 (fneg FPR32:$Ra)), (f32 (fmul FPR32:$Rn, FPR32:$Rm)))) - fnmsub is (-Ra) + RnRm which should be matched as fma node:$Rn, node:$Rm, (fneg node:$Ra) and as (f32 (fsub (f32 (fmul FPR32:$Rn, FPR32:$Rm)), FPR32:$Ra)))) llvm-svn: 197928	2013-12-24 00:40:10 +00:00
Kevin Qin	53eaea0104	[AArch64 NEON]Implment loading vector constant form constant pool. llvm-svn: 197551	2013-12-18 06:26:04 +00:00
Chad Rosier	5f87edb484	[AArch64] Fix v1fx patterns for Floating-point Multiply Extend and Floating-point Compare to Zero. llvm-svn: 197402	2013-12-16 18:29:35 +00:00
Rafael Espindola	bccb9d45ad	The preferred alignment defaults to the abi alignment. Omit if it is the same. llvm-svn: 197400	2013-12-16 18:01:51 +00:00
Rafael Espindola	8afbb28cea	On DataLayout, omit the default of p:64:64:64. llvm-svn: 197397	2013-12-16 17:15:29 +00:00
Hao Liu	774cabb538	[AArch64]Fix the pattern match failure for v1i8/v1i16/v1i32 types. Currently we have such types as legal vector types. The DAG combiner may generate some DAG nodes having such types but we don't have patterns to match them. E.g. a load i32 and a bitcast i32 to v1i32 will be combined into a load v1i32: bitcast (load i32) to v1i32 -> load v1i32. So this patch fixes such problems for load/dup instructions. If v1i8/v1i16/v1i32 are not legal any more, the code in this patch can be deleted. So I also add some FIXME. llvm-svn: 197361	2013-12-16 02:51:28 +00:00
Chad Rosier	e139dd4fe6	[AArch64] Simplify the Neon Scalar3Same patterns for floating-point reciprocal step, floating-point reciprocal square root step, floating-point absolute difference, and integer/floating-point compare instructions. Also, move the scalar general arithmetic operation patterns closer to similar code. No functional change intended. llvm-svn: 197250	2013-12-13 17:56:44 +00:00
Rafael Espindola	720ae4f885	Simplify the datalayout string of ARM and AArch64. No functionality change. Reviewed by Tim Northover. llvm-svn: 197172	2013-12-12 17:43:37 +00:00
Chad Rosier	4055f42d22	[AArch64] Removed unnecessary copy patterns with v1fx types. - Copy patterns with float/double types are enough. - Fix typos in test case names that were using v1fx. - There is no ACLE intrinsic that uses v1f32 type. And there is no conflict of neon and non-neon ovelapped operations with this type, so there is no need to support operations with this type. - Remove v1f32 from FPR32 register and disallow v1f32 as a legal type for operations. Patch by Ana Pazos! llvm-svn: 197159	2013-12-12 15:46:29 +00:00
Hao Liu	46a10eec28	[AArch64]Fix the problem that AArch64 backend fails to select scalar_to_vector of vector types having more than one element. llvm-svn: 197135	2013-12-12 07:36:26 +00:00
Chad Rosier	446d8ea0fb	[AArch64] Refactor NEON floating-point Max/Min/Maxnm/Minnm across vector AArch64 intrinsics to use f32 types, rather than their vector equivalents. llvm-svn: 197090	2013-12-11 23:21:25 +00:00
Chad Rosier	088f93d4b5	[AArch64] Add NEON scalar floating-point compare LLVM AArch64 intrinsics that use f32/f64 types, rather than their vector equivalents. llvm-svn: 197068	2013-12-11 21:03:46 +00:00
Chad Rosier	473a01e1c9	[AArch64] Refactor the NEON scalar floating-point reciprocal step and floating-point reciprocal square root step LLVM AArch64 intrinsics to use f32/f64 types, rather than their vector equivalents. llvm-svn: 197067	2013-12-11 21:03:43 +00:00
Chad Rosier	7098fcc062	[AArch64] Refactor the NEON scalar floating-point reciprocal estimate, floating- point reciprocal exponent, and floating-point reciprocal square root estimate LLVM AArch64 intrinsics to use f32/f64 types, rather than their vector equivalents. llvm-svn: 197066	2013-12-11 21:03:40 +00:00
Kevin Qin	310b6c08ba	[AArch64 NEON] Get instruction BSL matched to VSELECT. llvm-svn: 196998	2013-12-11 02:33:50 +00:00
NAKAMURA Takumi	8bc9bfaa5a	Prune redundant dependencies in LLVMBuild.txt. llvm-svn: 196988	2013-12-11 00:30:57 +00:00
Chad Rosier	f70af21651	[AArch64] Refactor the NEON floating-point absolute difference LLVM AArch64 intrinsic to use f32/f64 types, rather than their vector equivalents. llvm-svn: 196965	2013-12-10 21:33:59 +00:00
Chad Rosier	07cc3f9100	[AArch64] Refactor the NEON signed/unsigned floating-point convert to fixed-point LLVM AArch64 intrinsics to use f32/f64, rather than their vector equivalents. llvm-svn: 196964	2013-12-10 21:33:56 +00:00
Chad Rosier	98b8baa35c	[AArch64] Overload NEON signed/unsigned floating-point convert to fixed-point and fixed-point convert to floating-point LLVM AArch64 intrinsics. llvm-svn: 196963	2013-12-10 21:33:53 +00:00
Chad Rosier	cc34d187b8	[AArch64] Overload NEON signed/unsigned integer convert to floating-point LLVM AArch64 intrinsics. llvm-svn: 196962	2013-12-10 21:33:50 +00:00
Chad Rosier	7a9bba442f	[AArch64] Refactor the Neon vector/scalar floating-point convert intrinsics so that they use float/double rather than the vector equivalents when appropriate. llvm-svn: 196930	2013-12-10 16:11:39 +00:00
Chad Rosier	fcc4c366d1	[AArch64] Refactor the Neon vector/scalar floating-point convert implementation. Specifically, reuse the ARM intrinsics when possible. llvm-svn: 196926	2013-12-10 15:35:33 +00:00
Kevin Qin	43385c7065	[AArch64 NEON] Replace fpimm with fpz32 for floating compare with zero. This is a small change to be strict. Just want get pattern safer. llvm-svn: 196889	2013-12-10 06:51:07 +00:00
Kevin Qin	04396d1e69	[AArch64 NEON] Support poly128_t and implement relevant intrinsic. llvm-svn: 196887	2013-12-10 06:48:35 +00:00
NAKAMURA Takumi	396d4d3c7e	Add proper dependencies to LLVMBuild.txt in llvm/lib. I'll prune redundant deps in LLVMBuild.txt, later. llvm-svn: 196881	2013-12-10 05:39:34 +00:00
NAKAMURA Takumi	e3afe2ef62	Whitespaces. llvm-svn: 196880	2013-12-10 05:39:12 +00:00
Chad Rosier	5c8bf9c3db	[AArch64] Refactor the NEON scalar reduce pairwise intrinsics, so that they use float/double rather than the vector equivalents when appropriate. llvm-svn: 196833	2013-12-09 22:47:38 +00:00
Chad Rosier	3b0b3ee71e	[AArch64] Refactor NEON scalar reduce pairwise front-end codegen to remove unnecessary patterns in tablegen. llvm-svn: 196832	2013-12-09 22:47:34 +00:00
Chad Rosier	397ff3945c	[AArch64] Remove q and non-q intrinsic definitions in the NEON scalar reduce pairwise implementation, using an overloaded definition instead. llvm-svn: 196831	2013-12-09 22:47:31 +00:00
Ana Pazos	bde2828ae0	Fix pattern match for movi with 0D result Patch by Jiangning Liu. With some test case changes: - intrinsic test added to the existing /test/CodeGen/AArch64/neon-aba-abd.ll. - New test cases to cover movi 1D scenario without using the intrinsic in test/CodeGen/AArch64/neon-mov.ll. llvm-svn: 196806	2013-12-09 19:29:14 +00:00
Hao Liu	96a587a9f7	[AArch64]Add missing pair intrinsics such as: int32_t vminv_s32(int32x2_t a) which should be compiled into SMINP Vd.2S,Vn.2S,Vm.2S llvm-svn: 196749	2013-12-09 03:51:42 +00:00
Hao Liu	868caea6d1	[AArch64]Pattern match failures for truncate store and extend load llvm-svn: 196748	2013-12-09 03:34:08 +00:00
Ana Pazos	6b0a8c50dd	Implemented vget/vset_lane_f16 intrinsics llvm-svn: 196533	2013-12-05 21:07:49 +00:00
Jiangning Liu	65d8e3422a	For AArch64, add missing register cost calculation for big value types like v4i64 and v8i64. llvm-svn: 196456	2013-12-05 02:12:01 +00:00
Kevin Qin	afd095de8b	[AArch64 Neon] Add ACLE intrinsic vceqz_f64. llvm-svn: 196362	2013-12-04 08:02:34 +00:00
Kevin Qin	f9832e8de7	[AArch64 NEON] Add missing compare intrinsics. llvm-svn: 196360	2013-12-04 07:53:28 +00:00
Hao Liu	dca64f4a20	[AArch64]Add missing floating point convert, round and misc intrinsics. E.g. int64x1_t vcvt_s64_f64(float64x1_t a) -> FCVTZS Dd, Dn llvm-svn: 196210	2013-12-03 06:06:55 +00:00
Hao Liu	c250cbc095	AArch64: add missing ACLE intrinsics mapping to general arithmetic operation from VFP instructions. E.g. float64x1_t vadd_f64(float64x1_t a, float64x1_t b) -> FADD Dd, Dn, Dm. llvm-svn: 196208	2013-12-03 05:58:30 +00:00
NAKAMURA Takumi	bc815b2d21	Whitespace. llvm-svn: 196203	2013-12-03 05:28:27 +00:00
Hao Liu	21a461353a	AArch64: Add missing scalar pair intrinsics. E.g. "float32_t vaddv_f32(float32x2_t a)" to be matched into "faddp s0, v1.2s". llvm-svn: 196198	2013-12-03 03:39:47 +00:00
Jiangning Liu	3a541d46a1	Add some missing pattern matches for AArch64 Neon intrinsics like vuqadd_s64 and friends. llvm-svn: 196192	2013-12-03 01:33:52 +00:00
Jiangning Liu	94a7bb2130	Add some missing pattern matches for AArch64 Neon intrinsics like vmull_high_n_s16 and friends. llvm-svn: 196190	2013-12-03 01:29:32 +00:00
Rafael Espindola	5113d166f5	Refactor the setting of PrivateGlobalPrefix. No functionality change. llvm-svn: 196170	2013-12-02 23:39:26 +00:00
Chad Rosier	3106de3f9d	[AArch64] Implemented vcopy_lane patterns using scalar DUP instruction. Patch by Ana Pazos! llvm-svn: 196151	2013-12-02 21:05:16 +00:00
Rafael Espindola	957cf6f9e1	Remove dead code. MO_JumpTableIndex and MO_ExternalSymbol don't show up on inline asm. Keeping parts of the old asm printer just to print inline asm to a string that we then parse back looks like a hack. llvm-svn: 196111	2013-12-02 15:36:37 +00:00
Rafael Espindola	50712a456d	Change the default of AsmWriterClassName and isMCAsmWriter. llvm-svn: 196065	2013-12-02 04:55:42 +00:00
Hao Liu	ba38eee8ac	AArch64: The pattern match should check the range of the immediate value. Or we can generate some illegal instructions. E.g. shrn2 v0.4s, v1.2d, #35. The legal range should be in [1, 16]. llvm-svn: 195941	2013-11-29 02:11:22 +00:00
Jiangning Liu	c429c00f3b	Add missing pattern for supporting intrinsic function vbsl_f64 with argument double floating point. llvm-svn: 195938	2013-11-29 01:37:15 +00:00
Kevin Qin	337cfcc83c	[AArch64 NEON]Fix a assertion failure when disassemble SHLL instruction. llvm-svn: 195936	2013-11-29 01:29:16 +00:00
Benjamin Kramer	ea1982aff9	Silence sign-compare warning and reduce nesting. No functionality change. llvm-svn: 195932	2013-11-28 19:58:56 +00:00
NAKAMURA Takumi	ce746c6c49	[CMake] Let add_public_tablegen_target responsible to provide dependency to CommonTableGen. add_public_tablegen_target adds *CommonTableGen to LLVM_COMMON_DEPENDS. LLVM_COMMON_DEPENDS affects add_llvm_library (and other add_target stuff) within its scope. llvm-svn: 195927	2013-11-28 17:04:04 +00:00
NAKAMURA Takumi	413518f1f8	[CMake] Prune include_directories() in llvm/lib/Target. add_llvm_target() sets them. llvm-svn: 195921	2013-11-28 14:53:30 +00:00
NAKAMURA Takumi	979e604d8c	Add newline at eof. llvm-svn: 195920	2013-11-28 14:52:52 +00:00
Jiangning Liu	4bc9dbd846	Remove the variable only used by assert to avoid the build failure caused by build options [-Werror,-Wunused-variable]. llvm-svn: 195905	2013-11-28 01:34:55 +00:00
Hao Liu	f9f468abee	AArch64: Fix a bug about disassembling post-index load single element to 4 vectors llvm-svn: 195903	2013-11-28 01:07:45 +00:00

... 2 3 4 5 6 ...

527 Commits