llvm-project

Commit Graph

Author	SHA1	Message	Date
Javed Absar	99a9343ae6	[ARM] Cortex-R4F is not VFPOnlySP Cortex-R4F TRM states that fpu supports both single and double precision. This patch corrects the information in ARM.td file and corresponding test. Reviewers: rengolin Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10763 llvm-svn: 240776	2015-06-26 12:14:56 +00:00
Hao Liu	2cd34bb585	[ARM] Lower interleaved memory accesses to vldN/vstN intrinsics. This patch also adds a function to calculate the cost of interleaved memory accesses. E.g. Lower an interleaved load: %wide.vec = load <8 x i32>, <8 x i32>* %ptr, align 4 %v0 = shuffle %wide.vec, undef, <0, 2, 4, 6> %v1 = shuffle %wide.vec, undef, <1, 3, 5, 7> into: %vld2 = { <4 x i32>, <4 x i32> } call llvm.arm.neon.vld2(%ptr, 4) %vec0 = extractelement { <4 x i32>, <4 x i32> } %vld2, i32 0 %vec1 = extractelement { <4 x i32>, <4 x i32> } %vld2, i32 1 E.g. Lower an interleaved store: %i.vec = shuffle <8 x i32> %v0, <8 x i32> %v1, <0, 4, 8, 1, 5, 9, 2, 6, 10, 3, 7, 11> store <12 x i32> %i.vec, <12 x i32>* %ptr, align 4 into: %sub.v0 = shuffle <8 x i32> %v0, <8 x i32> v1, <0, 1, 2, 3> %sub.v1 = shuffle <8 x i32> %v0, <8 x i32> v1, <4, 5, 6, 7> %sub.v2 = shuffle <8 x i32> %v0, <8 x i32> v1, <8, 9, 10, 11> call void llvm.arm.neon.vst3(%ptr, %sub.v0, %sub.v1, %sub.v2, 4) Differential Revision: http://reviews.llvm.org/D10533 llvm-svn: 240755	2015-06-26 02:45:36 +00:00
Hao Liu	7ec8ee3119	[AArch64] Lower interleaved memory accesses to ldN/stN intrinsics. This patch also adds a function to calculate the cost of interleaved memory accesses. E.g. Lower an interleaved load: %wide.vec = load <8 x i32>, <8 x i32>* %ptr %v0 = shuffle %wide.vec, undef, <0, 2, 4, 6> %v1 = shuffle %wide.vec, undef, <1, 3, 5, 7> into: %ld2 = { <4 x i32>, <4 x i32> } call llvm.aarch64.neon.ld2(%ptr) %vec0 = extractelement { <4 x i32>, <4 x i32> } %ld2, i32 0 %vec1 = extractelement { <4 x i32>, <4 x i32> } %ld2, i32 1 E.g. Lower an interleaved store: %i.vec = shuffle <8 x i32> %v0, <8 x i32> %v1, <0, 4, 8, 1, 5, 9, 2, 6, 10, 3, 7, 11> store <12 x i32> %i.vec, <12 x i32>* %ptr into: %sub.v0 = shuffle <8 x i32> %v0, <8 x i32> v1, <0, 1, 2, 3> %sub.v1 = shuffle <8 x i32> %v0, <8 x i32> v1, <4, 5, 6, 7> %sub.v2 = shuffle <8 x i32> %v0, <8 x i32> v1, <8, 9, 10, 11> call void llvm.aarch64.neon.st3(%sub.v0, %sub.v1, %sub.v2, %ptr) Differential Revision: http://reviews.llvm.org/D10533 llvm-svn: 240754	2015-06-26 02:32:07 +00:00
Matthias Braun	7c6d6491dd	Revert "X86: Reject register operands with obvious type mismatches." Revert until http://llvm.org/PR23955 is investigated. This reverts commit r239309. llvm-svn: 240746	2015-06-26 00:26:49 +00:00
Matthias Braun	f3518215f7	Fix mismatched architectures in test llvm-svn: 240745	2015-06-26 00:26:46 +00:00
Matthias Braun	611ff519d0	aad/fix labels in test/CodeGen/X86/StackColoring.ll llvm-svn: 240744	2015-06-26 00:26:44 +00:00
Alexey Samsonov	85c7d66fdc	Make llvm-dwarfdump exit with non-zero exit code if error was occured. llvm-svn: 240729	2015-06-25 23:40:15 +00:00
Adrian Prantl	09086d5338	Split test up into two target-spcific directories. llvm-svn: 240726	2015-06-25 23:38:22 +00:00
Anna Zaks	785c075786	[asan] Do not instrument special purpose LLVM sections. Do not instrument globals that are placed in sections containing "__llvm" in their name. This fixes a bug in ASan / PGO interoperability. ASan interferes with LLVM's PGO, which places its globals into a special section, which is memcpy-ed by the linker as a whole. When those goals are instrumented, ASan's memcpy wrapper reports an issue. http://reviews.llvm.org/D10541 llvm-svn: 240723	2015-06-25 23:35:48 +00:00
Anna Zaks	4f652b69b1	[asan] Don't run stack malloc on functions containing inline assembly. It makes LLVM run out of registers even on 64-bit platforms. For example, the following test case fails on darwin. clang -cc1 -O0 -triple x86_64-apple-macosx10.10.0 -emit-obj -fsanitize=address -mstackrealign -o ~/tmp/ex.o -x c ex.c error: inline assembly requires more registers than available void TestInlineAssembly(const unsigned char S, unsigned int pS, unsigned char D, unsigned int pD, unsigned int h) { unsigned int sr = 4, pDiffD = pD - 5; unsigned int pDiffS = (pS << 1) - 5; char flagSA = ((pS & 15) == 0), flagDA = ((pD & 15) == 0); asm volatile ( "mov %0, %%"PTR_REG("si")"\n" "mov %2, %%"PTR_REG("cx")"\n" "mov %1, %%"PTR_REG("di")"\n" "mov %8, %%"PTR_REG("ax")"\n" : : "m" (S), "m" (D), "m" (pS), "m" (pDiffS), "m" (pDiffD), "m" (sr), "m" (flagSA), "m" (flagDA), "m" (h) : "%"PTR_REG("si"), "%"PTR_REG("di"), "%"PTR_REG("ax"), "%"PTR_REG("cx"), "%"PTR_REG("dx"), "memory" ); } http://reviews.llvm.org/D10719 llvm-svn: 240722	2015-06-25 23:35:45 +00:00
Adrian Prantl	5332e4251c	Debug Info: Add basic test coverage for the DWARF encoding of bitfields. While looking at a couple of bugs in the debug info output for bitfields I noticed that there wasn't a single regression test to test my changes against, so here's a start. llvm-svn: 240717	2015-06-25 23:19:19 +00:00
Frederic Riss	16238d90b2	IAS: Use the root macro instanciation for location r224810 fixed the handling of macro debug locations in AsmParser. This patch fixes the logic to actually do what was intended: it uses the first macro of the macro stack instead of the last one. The updated testcase shows that the current scheme doesn't work when macro instanciations are nested and multiple files are used. Reviewers: compnerd Differential Revision: http://reviews.llvm.org/D10463 llvm-svn: 240705	2015-06-25 21:57:33 +00:00
Michael J. Spencer	594c028183	[Object][ELF] Add support for dumping dynamic relocations when sections are stripped. llvm-svn: 240703	2015-06-25 21:47:32 +00:00
Rafael Espindola	101824d345	llvm-nm: Don't print mapping symbols. This matches the behavior of gnu nm. Fixes pr23930. llvm-svn: 240695	2015-06-25 21:00:51 +00:00
Jingyue Wu	5e34ce33f5	[InstCombine] call SimplifyICmpInst with correct context Summary: Fixes PR23809. Without passing the context to SimplifyICmpInst, we would use the assume to prove that the condition feeding the assume is trivially true (see isValidAssumeForContext in ValueTracking.cpp), causing the removal of the assume which may be useful for later optimizations. Test Plan: pr23800.ll Reviewers: hfinkel, majnemer Reviewed By: hfinkel Subscribers: henryhu, llvm-commits, wengxt, broune, meheff, eliben Differential Revision: http://reviews.llvm.org/D10695 llvm-svn: 240683	2015-06-25 20:14:47 +00:00
Rafael Espindola	6dff814cdf	Diagnose undefined temporary symbols. We already disallowed .global .Lfoo so this is reasonable. This is a small cherry pick from r240130. llvm-svn: 240681	2015-06-25 20:10:45 +00:00
Paul Robinson	e6c34b49d3	Make this test verify .debug_pubnames is actually missing. It was matching at EOF regardless of whether the section was present. llvm-svn: 240679	2015-06-25 19:37:13 +00:00
Peter Collingbourne	2a3443c7c5	GVN: If a branch has two identical successors, we cannot declare either dead. This previously caused miscompilations as a result of phi nodes receiving undef incoming values from blocks dominated by such successors. Differential Revision: http://reviews.llvm.org/D10726 llvm-svn: 240670	2015-06-25 18:32:02 +00:00
Rafael Espindola	d63d3cd507	Add a test for a recent regression. llvm-svn: 240656	2015-06-25 16:16:08 +00:00
Rafael Espindola	60c1a8c01a	llvm-nm: print 'n' instead of '?' This matches gnu nm and has the advantage that there is a upper case N. llvm-svn: 240655	2015-06-25 16:01:53 +00:00
Kit Barton	13894c7f35	[PPC] Implement vmrgew and vmrgow instructions This patch adds support for the vector merge even word and vector merge odd word instructions introduced in POWER8. Phabricator review: http://reviews.llvm.org/D10704 llvm-svn: 240650	2015-06-25 15:17:40 +00:00
Bruno Cardoso Lopes	edb876d52c	[AsmPrinter] Fix crash in handleIndirectSymViaGOTPCRel Check for symbols in MCValue before using them. Bail out early in case they are null. This fixes PR23779. Differential Revision: http://reviews.llvm.org/D10712 rdar://problem/21532830 llvm-svn: 240649	2015-06-25 15:17:23 +00:00
Artur Pilipenko	0e21d54b51	Take alignment into account in isSafeToLoadUnconditionally Reviewed By: hfinkel Differential Revision: http://reviews.llvm.org/D10475 llvm-svn: 240636	2015-06-25 12:18:43 +00:00
Toma Tabacu	7bc44dcb0c	[mips] [IAS] Fix parsing of memory offset expressions with parenthesis depth >1. Summary: In an expression such as "(((a+b)+c)+d)", parseParenExpression() would only parse the "a+b)+c", which would result in an error later on in the parser. This means that we can only parse one level of inner parentheses. In order to fix this, I added a new function called parseParenExprOfDepth(), which parses a specified number of trailing parenthesis expressions (except for the outermost parenthesis), and changed MipsAsmParser to use it in parseMemOffset instead of parseParenExpression(). Reviewers: dsanders, rafael Reviewed By: dsanders, rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9742 llvm-svn: 240625	2015-06-25 09:52:02 +00:00
Ahmed Bougacha	f1eccbecf8	[X86] Accept hasAVX512() as well as hasFMA() when generating FMA. We don't always have FMA, for example when using 'clang -mavx512f' without an explicit CPU. Also check for an explicit +avx512f instead of CPUs in a couple related tests. llvm-svn: 240616	2015-06-25 00:44:46 +00:00
Ahmed Bougacha	cee6d1bb3c	[X86] Cleanup fma tests a little bit. NFC. Reformat, isolate 213->231 xform, actually --check-prefix CHECK, and deduplicate the FMA intrinsic tests (FMA3 in AMD-land). llvm-svn: 240615	2015-06-25 00:40:25 +00:00
Swaroop Sridhar	e9247ab6d6	Enable StackMap Serialization for COFF Summary This change turns on the emission of __LLVM_Stackmaps section when generating COFF binaries. Test Plan Added a scenario to the test case: test\CodeGen\X86\statepoint-stackmap-format.ll. Code Review: http://reviews.llvm.org/D10680 llvm-svn: 240613	2015-06-25 00:28:42 +00:00
David Majnemer	63d606bdcb	[GVN] Intersect the IR flags when CSE'ing two instructions We performed a simple, but incomplete, intersection when it came time to CSE instructions. It didn't handle, for example, the 'exact' flag. This fixes PR23922. llvm-svn: 240595	2015-06-24 21:52:25 +00:00
David Majnemer	f6e500a0dc	[Reassociate] Don't propogate flags when creating negations Reassociate mutated existing instructions in order to form negations which would create additional reassociate opportunities. This fixes PR23926. llvm-svn: 240593	2015-06-24 21:27:36 +00:00
Jingyue Wu	9c71150bfb	Add NVPTXPeephole pass to reduce unnecessary address cast Summary: This patch first change the register that holds local address for stack frame to %SPL. Then the new NVPTXPeephole pass will try to scan the following pattern %vreg0<def> = LEA_ADDRi64 <fi#0>, 4 %vreg1<def> = cvta_to_local %vreg0 and transform it into %vreg1<def> = LEA_ADDRi64 %VRFrameLocal, 4 Patched by Xuetian Weng Test Plan: test/CodeGen/NVPTX/local-stack-frame.ll Reviewers: jholewinski, jingyue Reviewed By: jingyue Subscribers: eliben, jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D10549 llvm-svn: 240587	2015-06-24 20:20:16 +00:00
Matthias Braun	ba3ecc3c80	ARMLoadStoreOptimizer: Fix errata 602117 handling and make testcase actually test for it This fixes PR23912 Differential Revision: http://reviews.llvm.org/D10620 llvm-svn: 240582	2015-06-24 20:03:27 +00:00
Alex Lorenz	54565cf02b	MIR Serialization: Serialize simple MachineRegisterInfo attributes. This commit serializes the 3 scalar boolean attributes from the MachineRegisterInfo class: IsSSA, TracksRegLiveness, and TracksSubRegLiveness. These attributes are serialized as part of the machine function YAML mapping. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10618 llvm-svn: 240579	2015-06-24 19:56:10 +00:00
Jingyue Wu	6f72aed3ec	[LSR] canonicalize Prod*(1<<C) to Prod<<C Summary: Because LSR happens at a late stage where mul of a power of 2 is typically canonicalized to shl, this canonicalization emits code that can be better CSE'ed. Test Plan: Transforms/LoopStrengthReduce/shl.ll shows how this change makes GVN more powerful. Fixes some existing tests due to this change. Reviewers: sanjoy, majnemer, atrick Reviewed By: majnemer, atrick Subscribers: majnemer, llvm-commits Differential Revision: http://reviews.llvm.org/D10448 llvm-svn: 240573	2015-06-24 19:28:40 +00:00
Peter Collingbourne	f549598796	Object: Add XFAILed test case for r239560. We ought to also emit unmangled references to dllimported functions, but no existing linker needs this. llvm-svn: 240562	2015-06-24 18:03:39 +00:00
Alex Lorenz	12b554e6a7	MIR Serialization: Serialize the null register operands. This commit serializes the null register machine operands. It uses the '_' keyword to represent them, but the parser also allows the '%noreg' named register syntax. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10580 llvm-svn: 240558	2015-06-24 17:34:58 +00:00
Michael Zolotukhin	79ff564ef3	[LoopVectorizer] Fix bailing-out condition for OptForSize case. With option OptForSize enabled, the Loop Vectorizer is not supposed to create tail loop. The condition checking that was invalid and was not matching to the comment above. Patch by Marianne Mailhot-Sarrasin. llvm-svn: 240556	2015-06-24 17:26:24 +00:00
Rafael Espindola	d68fb74c2b	Don't get confused with sections whose section number is reserved. It is perfectly possible for SHNDX to contain indexes that have the same value as reserved st_shndx values. llvm-svn: 240544	2015-06-24 14:48:54 +00:00
Simon Pilgrim	51aa1f86fb	[X86][AVX] Added full set of 256-bit vector shift tests. llvm-svn: 240542	2015-06-24 13:52:25 +00:00
Pawel Bylica	cc35812877	Fix instruction scheduling live register tracking Summary: This patch fixes PR23405 (https://llvm.org/bugs/show_bug.cgi?id=23405). During a node unscheduling an entry in LiveRegGens can be replaced with a new value. That corrupts the live reg tracking and LiveReg* structure is not cleared as should be during unscheduling. Problematic condition that enforces Gen replacement is `I->getSUnit()->getHeight() < LiveRegGens[I->getReg()]->getHeight()`. This condition should be checked only if LiveRegGen was set in current node unscheduling. Test Plan: Regression test included. Reviewers: hfinkel, atrick Reviewed By: atrick Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9993 llvm-svn: 240538	2015-06-24 12:49:42 +00:00
Zoran Jovanovic	67e04be640	[mips][microMIPS] Implement BREAK, EHB and EI instructions http://reviews.llvm.org/D10090 llvm-svn: 240531	2015-06-24 10:32:16 +00:00
Rafael Espindola	d7a32ea4b8	Change how symbol sizes are handled in lib/Object. COFF and MachO only define symbol sizes for common symbols. Reflect that in the class hierarchy by having a method for common symbols only in the base and a general one in ELF. This avoids the need of using a magic value for the size, which had a few problems * Most callers didn't check for it. * The ones that did could not tell the magic value from a file actually having that value. llvm-svn: 240529	2015-06-24 10:20:30 +00:00
Ahmed Bougacha	dd5da3e7ed	[X86] Don't generate vbroadcasti128 for v4i64 splats from memory. We used to erroneously match: (v4i64 shuffle (v2i64 load), <0,0,0,0>) Whereas vbroadcasti128 is more like: (v4i64 shuffle (v2i64 load), <0,1,0,1>) This problem doesn't exist for vbroadcastf128, which kept matching the intrinsic after r231182. We should perhaps re-introduce the intrinsic here as well, but that's a separate issue still being discussed. While there, add some proper vbroadcastf128 tests. We don't currently match those, like for loading vbroadcastsd/ss on AVX (the reg-reg broadcasts where added in AVX2). Fixes PR23886. llvm-svn: 240488	2015-06-24 00:07:16 +00:00
Ahmed Bougacha	89ae9a1e28	[X86] update_llc_test_checks vector-shuffle-*. NFC. Some of them had gone stale. llvm-svn: 240485	2015-06-24 00:03:48 +00:00
Alex Lorenz	240fc1e0aa	MIR Serialization: Serialize immediate machine operands. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10573 llvm-svn: 240481	2015-06-23 23:42:28 +00:00
Alex Lorenz	51af160f4c	MIR Parser: Use correct source locations for machine instruction diagnostics. This commit translates the source locations for MIParser diagnostics from the locations in the machine instruction string to the locations in the MIR file. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10574 llvm-svn: 240474	2015-06-23 22:39:23 +00:00
Simon Pilgrim	a0d5c5924a	[X86][SSE] Added full set of 128-bit vector shift tests. Removed some old duplicate tests. llvm-svn: 240465	2015-06-23 21:18:15 +00:00
Alexey Samsonov	19ffcb900f	Let llvm::ReplaceInstWithInst copy debug location from old to new instruction. Currently some users of this function do this explicitly, and all the rest forget to do this. ThreadSanitizer was one of such users, and had missing debug locations for calls into TSan runtime handling atomic operations, eventually leading to poorly symbolized stack traces and malfunctioning suppressions. This is another change relevant to PR23837. llvm-svn: 240460	2015-06-23 21:00:08 +00:00
Artem Belevich	6c9627252d	[NVPTX] Added missing test case for llvm.nvvm.sqrt.f NVPTX intrinsic Differential Revision: http://reviews.llvm.org/D10663 llvm-svn: 240437	2015-06-23 18:22:17 +00:00
Rafael Espindola	ad3b6bfa2a	Pass -m to the linker in this test. Fixes the test on a ppc host. llvm-svn: 240431	2015-06-23 18:04:54 +00:00
Alex Lorenz	f3db51de5e	MIR Serialization: Serialize physical register machine operands. This commit introduces functionality that's used to serialize machine operands. Only the physical register operands are serialized by this commit. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10525 llvm-svn: 240425	2015-06-23 16:35:26 +00:00
Rafael Espindola	5f7ade26d0	objdump: Don't print a (always 0) size for MachO symbols. Only common symbol on MachO and COFF have a size. For COFF we already had a custom format. For MachO, there is no native objdump and we were printing it as ELF. Now we only print the sizes for symbols that actually have them. llvm-svn: 240422	2015-06-23 15:45:38 +00:00
Toma Tabacu	d88d79c79d	[mips] [IAS] Add partial support for the ULHU pseudo-instruction. Summary: This only adds support for ULHU of an immediate address with/without a source register. It does not include support for ULHU of the address of a symbol. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9671 llvm-svn: 240410	2015-06-23 14:39:42 +00:00
Petar Jovanovic	b7915a1f0b	[mips64] Emit correct addend for some PC-relative relocations So far, LLVM has not emitted correct addend for N64 and N32 ABI. This patch fixes that. It also removes fixup from MCJIT for R_MIPS_PC16 relocation. Patch by Vladimir Radosavljevic. Differential Revision: http://reviews.llvm.org/D10565 llvm-svn: 240404	2015-06-23 13:54:42 +00:00
Daniel Jasper	41de8027b1	Revert r240302 ("Bring r240130 back."). This causes errors like: ld: error: blah.o: requires dynamic R_X86_64_PC32 reloc against '' which may overflow at runtime; recompile with -fPIC blah.cc:function f(): error: undefined reference to '' blah.o:g(): error: undefined reference to '' I have not yet come up with an appropriate reproduction. llvm-svn: 240394	2015-06-23 11:31:32 +00:00
Daniel Sanders	70b5908d39	[mips] llvm-readobj can parse .MIPS.abiflags. No need to check the bytes. Summary: Reviewers: atanasyan Reviewed By: atanasyan Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10538 llvm-svn: 240392	2015-06-23 10:11:36 +00:00
Elena Demikhovsky	5e2f8c4231	AVX-512: Added all forms of VPABS instruction Added all intrinsics, tests for encoding, tests for intrinsics. llvm-svn: 240386	2015-06-23 08:19:46 +00:00
Justin Bogner	47ab1fa6d6	test: Move target dependent test in their own folder for c API test Dissasembly tests depends on target. The problem is that it disable all tests if all targets are not compiled. This moves things around in order to get target specific code in a target specific folder. Patch by Amaury Sechet. Thanks! llvm-svn: 240380	2015-06-23 06:46:54 +00:00
Weiming Zhao	f1abad57da	Fix PR13851: Preserve metadata for the unswitched branch This patch copies the metadata of the unswitched branch to the newly crreated branch in loop unswitch pass. llvm-svn: 240378	2015-06-23 05:31:09 +00:00
Rafael Espindola	14522db74f	Add a test for the previous commit. This shows how two symbols at the same address are handled. llvm-svn: 240374	2015-06-23 03:42:44 +00:00
David Majnemer	726901b638	[InstCombine] Optimize subtract of selects into a select of a sub This came up when examining some code generated by clang's IRGen for certain member pointers. llvm-svn: 240369	2015-06-23 02:49:24 +00:00
Rafael Espindola	aeef0618b9	Fix tests when X86 is not enabled. llvm-svn: 240368	2015-06-23 02:45:44 +00:00
Rafael Espindola	a4a4093ed8	Compute correct symbol sizes for MachO and COFF. Before this would dump from the symbol start to the end of the section. llvm-svn: 240367	2015-06-23 02:20:37 +00:00
Sanjay Patel	e79b43a01f	[x86] generalize reassociation optimization in machine combiner to 2 instructions Currently ( D10321, http://reviews.llvm.org/rL239486 ), we can use the machine combiner pass to reassociate the following sequence to reduce the critical path: A = ? op ? B = A op X C = B op Y --> A = ? op ? B = X op Y C = A op B 'op' is currently limited to x86 AVX scalar FP adds (with fast-math on), but in theory, it could be any associative math/logic op (see TODO in code comment). This patch generalizes the pattern match to ignore the instruction that defines 'A'. So instead of a sequence of 3 adds, we now only need to find 2 dependent adds and decide if it's worth reassociating them. This generalization has a compile-time cost because we can now match more instruction sequences and we rely more heavily on the machine combiner to discard sequences where reassociation doesn't improve the critical path. For example, in the new test case: A = M div N B = A add X C = B add Y We'll match 2 reassociation patterns, but this transform doesn't reduce the critical path: A = M div N B = A add Y C = B add X We need the combiner to reject that pattern but select this: A = M div N B = X add Y C = B add A Differential Revision: http://reviews.llvm.org/D10460 llvm-svn: 240361	2015-06-23 00:39:40 +00:00
Evgeniy Stepanov	9e0d41ab09	Fix PR23914. r226830 moved the declaration of Buf to a nested scope, resulting in a dangling reference (in StringRef Name), and a use-after-free. llvm-svn: 240357	2015-06-22 23:36:03 +00:00
Adam Nemet	f530b329c7	[LoopDist] Improve variable names and comments in LoopVersioning class, NFC As with the previous patch, the goal is to turn the class into a general loop-versioning class. This patch removes any references to loop distribution. llvm-svn: 240352	2015-06-22 22:59:40 +00:00
Pawel Bylica	e6fd8c4232	Revert r240291: causes problems in self-hosted builds. llvm-svn: 240343	2015-06-22 21:54:07 +00:00
Peter Collingbourne	ea45d834e0	Linker: Do not expect comdat to exist in source module. llvm-svn: 240341	2015-06-22 21:46:51 +00:00
Frederic Riss	ebc162a766	[Object] Search for architecures by name in MachOUniversalBinary::getObjectForArch() The reason we need to search by name rather than by Triple::ArchType is to handle subarchitecture correclty. There is no different ArchType for the x86_64h architecture (it identifies itself as x86_64), or for the various ARM subarches. The only way to get to the subarch slice in an universal binary is to search by name. This issue led to hard to debug and transient symbolication failures in Asan tests (it mostly works, because the files are very similar). This also affects the Profiling infrastucture as it is the other user of that API. Reviewers: samsonov, bogner Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10604 llvm-svn: 240339	2015-06-22 21:33:24 +00:00
Pawel Bylica	776b553438	Set missing x86 arch in a CodeGen regression test. Fixes the regression test added in r240291. llvm-svn: 240336	2015-06-22 21:18:10 +00:00
Simon Pilgrim	c5f409c1ec	[X86][AVX2] Added missing stack folding tests for vpshufhw/vpshuflw llvm-svn: 240332	2015-06-22 21:10:42 +00:00
Tom Stellard	f0296cee9b	R600/SI: Use ELF64 format instead of ELF32 Reviewers: arsenm, rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10392 llvm-svn: 240331	2015-06-22 21:03:54 +00:00
Tom Stellard	3aed34e947	R600: Use EM_AMDGPU for the ELF Machine type Reviewers: arsenm, rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10390 llvm-svn: 240330	2015-06-22 21:03:52 +00:00
Ahmed Bougacha	ed3c4d1a3d	[X86] Teach load folding to accept scalar _Int users of MOVSS/MOVSD. The _Int instructions are special, in that they operate on the full VR128 instead of FR32. The load folding then looks at MOVSS, at the user, and bails out when it sees a size mismatch. What we really know is that the rm_Int instructions don't load the higher lanes, so folding is fine. This happens for the straightforward intrinsic code, e.g.: _mm_add_ss(a, _mm_load_ss(p)); Fixes PR23349. Differential Revision: http://reviews.llvm.org/D10554 llvm-svn: 240326	2015-06-22 20:51:51 +00:00
Alex Lorenz	91370c5d62	MIR Serialization: Introduce a lexer for machine instructions. This commit adds a function that tokenizes the string containing the machine instruction. This commit also adds a struct called 'MIToken' which is used to represent the lexer's tokens. Reviewers: Sean Silva Differential Revision: http://reviews.llvm.org/D10521 llvm-svn: 240323	2015-06-22 20:37:46 +00:00
Peter Collingbourne	de26a918c1	SafeStack: Create the unsafe stack pointer on demand. This avoids creating an unnecessary undefined reference on targets such as NVPTX that require such references to be declared in asm output. llvm-svn: 240321	2015-06-22 20:26:54 +00:00
Pete Cooper	80d21cb40d	Change .thumb_set to have the same error checks as .set. According to the documentation, .thumb_set is 'the equivalent of a .set directive'. We didn't have equivalent behaviour in terms of all the errors we could throw, for example, when a symbol is redefined. This change refactors parseAssignment so that it can be used by .set and .thumb_set and implements tests for .thumb_set for all the errors thrown by that method. Reviewed by Rafael Espíndola. llvm-svn: 240318	2015-06-22 19:35:57 +00:00
Sanjay Patel	09b2c890af	[x86] set default reciprocal (division and square root) codegen to match GCC D8982 ( checked in at http://reviews.llvm.org/rL239001 ) added command-line options to allow reciprocal estimate instructions to be used in place of divisions and square roots. This patch changes the default settings for x86 targets to allow that recip codegen (except for scalar division because that breaks too much code) when using -ffast-math or its equivalent. This matches GCC behavior for this kind of codegen. Differential Revision: http://reviews.llvm.org/D10396 llvm-svn: 240310	2015-06-22 18:29:44 +00:00
Sanjoy Das	6f567a4b79	[FaultMaps] Add a parser for the __llvm__faultmaps section. Summary: The parser is exercised by llvm-objdump using -print-fault-maps. As is probably obvious, the code itself was "heavily inspired" by http://reviews.llvm.org/D10434. Reviewers: reames, atrick, JosephTremoulet Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10491 llvm-svn: 240304	2015-06-22 18:03:02 +00:00
Rafael Espindola	2d6bae2e09	Bring r240130 back. Now that pr23900 is fixed, we can bring it back with no changes. Original message: Make all temporary symbols unnamed. What this does is make all symbols that would otherwise start with a .L (or L on MachO) unnamed. Some of these symbols still show up in the symbol table, but we can just make them unnamed. In order to make sure we produce identical results when going thought assembly, all .L (not just the compiler produced ones), are now unnamed. Running llc on llvm-as.opt.bc, the peak memory usage goes from 208.24MB to 205.57MB. llvm-svn: 240302	2015-06-22 17:52:52 +00:00
Alex Lorenz	8e0a1b4857	MIR Serialization: Serialize machine instruction names. This commit implements initial machine instruction serialization. It serializes machine instruction names. The instructions are represented using a YAML sequence of string literals and are a part of machine basic block YAML mapping. This commit introduces a class called 'MIParser' which will be used to parse the machine instructions and operands. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10481 llvm-svn: 240295	2015-06-22 17:02:30 +00:00
Pawel Bylica	06407c0320	Fix shl folding in DAG combiner. Summary: The code responsible for shl folding in the DAGCombiner was assuming incorrectly that all constants are less than 64 bits. This patch simply changes the way values are compared. Test Plan: A regression test included. Reviewers: andreadb Reviewed By: andreadb Subscribers: andreadb, test, llvm-commits Differential Revision: http://reviews.llvm.org/D10602 llvm-svn: 240291	2015-06-22 15:58:11 +00:00
Rafael Espindola	bdf509aaaf	Add a triple to the test to fix it on some hosts. The slp vectorizer doesn't optimize this case in 32 bits. Fixes PR23453. llvm-svn: 240289	2015-06-22 15:44:20 +00:00
Toma Tabacu	8e0316d439	[mips] [IAS] Add support for LAReg with identical source and destination register operands. Summary: In this case, we're supposed to load the immediate in AT and then ADDu it with the source register and put it in the destination register. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9367 llvm-svn: 240278	2015-06-22 13:10:23 +00:00
Elena Demikhovsky	55a997437c	AVX-512: added VPSHUFB instruction - all SKX forms Added intrinsics and encoding tests. llvm-svn: 240277	2015-06-22 13:00:42 +00:00
Toma Tabacu	fb9d125592	[mips] [IAS] Add support for LASym with identical source and destination register operands. Summary: In this case, we're supposed to load the address of the symbol in AT and then ADDu it with the source register and put it in the destination register. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9366 llvm-svn: 240273	2015-06-22 12:08:39 +00:00
Elena Demikhovsky	ba5ab328e5	AVX-512: All forms of VCOPMRESS VEXPAND instructions, encoding tests. llvm-svn: 240272	2015-06-22 11:16:30 +00:00
Elena Demikhovsky	d78609a7ac	Reverted AVX-512 vector shuffle llvm-svn: 240258	2015-06-22 09:01:15 +00:00
Michael Kuperstein	fc21951cd7	[X86] Allow more call sequences to use push instructions for argument passing This allows more call sequences to use pushes instead of movs when optimizing for size. In particular, calling conventions that pass some parameters in registers (e.g. thiscall) are now supported. Differential Revision: http://reviews.llvm.org/D10500 llvm-svn: 240257	2015-06-22 08:31:22 +00:00
Elena Demikhovsky	e77566112c	AVX-512: Added intrinsics for VPERMT2W/D/Q/PS/PD and VPERMI2W/D/Q/PS/PD instructions. Added tests. llvm-svn: 240256	2015-06-22 06:45:48 +00:00
Rafael Espindola	ff373d2c73	Add the testcase from pr23900. llvm-svn: 240253	2015-06-22 01:29:24 +00:00
Duncan P. N. Exon Smith	3a73d9e067	AsmPrinter: Don't emit empty .debug_loc entries If we don't know how to represent a .debug_loc entry, skip the entry entirely rather than emitting an empty one. Similarly, if a .debug_loc list has no entries, don't create the list. We still want to create the variables, just in an optimized-out form that doesn't have a DW_AT_location. llvm-svn: 240244	2015-06-21 16:54:56 +00:00
Simon Pilgrim	fd704fe895	[X86][SSE] Added missing stack folding test for CVTSD2SS instruction. llvm-svn: 240241	2015-06-21 16:07:47 +00:00
Hans Wennborg	6ed81cbcdb	Switch lowering: add heuristic for filling leaf nodes in the weight-balanced binary search tree Sparse switches with profile info are lowered as weight-balanced BSTs. For example, if the node weights are {1,1,1,1,1,1000}, the right-most node would end up in a tree by itself, bringing it closer to the top. However, a leaf in this BST can contain up to 3 cases, and having a single case in a leaf node as in the example means the tree might become unnecessarily high. This patch adds a heauristic to the pivot selection algorithm that moves more cases into leaf nodes unless that would lower their rank. It still doesn't yield the optimal tree in every case, but I believe it's conservatibely correct. llvm-svn: 240224	2015-06-20 17:14:07 +00:00
Simon Pilgrim	056cbfe58d	[X86][SSE] Fix PerformSExtCombine bug that accessed the wrong return value of an aggregate type. Fix to rL237885 to ensure that it accesses the correct return value of an aggregate type. llvm-svn: 240223	2015-06-20 16:19:24 +00:00
Simon Pilgrim	d862e0f33a	[X86][SSE][CostModel] Added full set of sitofp/uitofp costings for SSE2/AVX/AVX2/AVX512F. Merged separate (but equivalent) SSE2/AVX512F tests. Removed codegen tests since these are already done better in test/CodeGen/X86. The actual cost values still need to be updated to match recent codegen improvements. llvm-svn: 240219	2015-06-20 14:58:01 +00:00
Peter Collingbourne	e3d2447f79	Use correct escaping for semicolon on Windows. llvm-svn: 240207	2015-06-20 01:28:20 +00:00
Peter Collingbourne	2e06b7d198	LibDriver tests require x86 target. llvm-svn: 240205	2015-06-20 01:14:37 +00:00
Peter Collingbourne	7070827be1	LibDriver: implement /libpath and $LIB; ignore /ignore and /machine. llvm-svn: 240203	2015-06-20 00:57:12 +00:00
Nico Weber	67e715ff7d	Revert 240130, it caused crashes (repro in PR23900). llvm-svn: 240193	2015-06-19 23:43:47 +00:00
Sanjoy Das	18c9dd31de	[CallGraph] Given -print-callgraph a stable printing order. Summary: Since FunctionMap has llvm::Function pointers as keys, the order in which the traversal happens can differ from run to run, causing spurious FileCheck failures. Have CallGraph::print sort the CallGraphNodes by name before printing them. Reviewers: bogner, chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10575 llvm-svn: 240191	2015-06-19 23:20:31 +00:00
Rafael Espindola	3dc0d05bf4	Improve error handling of getRelocationAddend. This patch changes getRelocationAddend to use ErrorOr and considers it an error to try to get the addend of a REL section. If, for example, a x86_64 file has a REL section, that file is corrupted and we should reject it. Using ErrorOr is not ideal since we check the section type once per relocation instead of once per section. Checking once per section would involve getRelocationAddend just asserting and callers checking the section before iterating over the relocations. In any case, this is an improvement and includes a test. llvm-svn: 240176	2015-06-19 20:58:43 +00:00
Alex Lorenz	00302df3fe	MIR Parser: report an error when a basic block isn't found. This commit reports an error when the MIR parser can't find a basic block with the machine basic block's name. llvm-svn: 240174	2015-06-19 20:12:03 +00:00
Alex Lorenz	4f093bf1ce	MIR Serialization: Serialize the list of machine basic blocks with simple attributes. This commit implements the initial serialization of machine basic blocks in a machine function. Only the simple, scalar MBB attributes are serialized. The reference to LLVM IR's basic block is preserved when that basic block has a name. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10465 llvm-svn: 240145	2015-06-19 17:43:07 +00:00
Michael Zolotukhin	4d8ffa082c	[SLP] Vectorize for all-constant entries. Differential Revision: http://reviews.llvm.org/D10531 llvm-svn: 240144	2015-06-19 17:40:15 +00:00
Matt Arsenault	5eb5eb59fc	AMDGPU: Fix some places missed in rename llvm-svn: 240143	2015-06-19 17:39:03 +00:00
Rafael Espindola	284a750c5f	Make all temporary symbols unnamed. What this does is make all symbols that would otherwise start with a .L (or L on MachO) unnamed. Some of these symbols still show up in the symbol table, but we can just make them unnamed. In order to make sure we produce identical results when going thought assembly, all .L (not just the compiler produced ones), are now unnamed. Running llc on llvm-as.opt.bc, the peak memory usage goes from 208.24MB to 205.57MB. llvm-svn: 240130	2015-06-19 12:16:55 +00:00
Ahmed Bougacha	9a9094260d	[ARM] Look through concat when lowering in-place shuffles (VZIP, ..) Currently, we canonicalize shuffles that produce a result larger than their operands with: shuffle(concat(v1, undef), concat(v2, undef)) -> shuffle(concat(v1, v2), undef) because we can access quad vectors (see PerformVECTOR_SHUFFLECombine). This is useful in the general case, but there are special cases where native shuffles produce larger results: the two-result ops. We can look through the concat when lowering them: shuffle(concat(v1, v2), undef) -> concat(VZIP(v1, v2):0, :1) This lets us generate the native shuffles instead of scalarizing to dozens of VMOVs. Differential Revision: http://reviews.llvm.org/D10424 llvm-svn: 240118	2015-06-19 02:32:35 +00:00
Ahmed Bougacha	7dbea8cec9	[ARM] Add D-sized vtrn/vuzp/vzip tests, and cleanup. NFC. llvm-svn: 240114	2015-06-19 02:15:34 +00:00
Eric Christopher	572e03a396	Fix "the the" in comments. llvm-svn: 240112	2015-06-19 01:53:21 +00:00
Alex Lorenz	82a9a7e42c	MIR Serialization: Reenable one of the MIRParser tests by reverting r239805. The test 'llvm/test/CodeGen/MIR/machine-function.mir' was disabled on x86 msc18 in r239805 as it failed. My commit r240054 have fixed the problem, so this commit reverts the commit that disabled the test as it should pass now. llvm-svn: 240074	2015-06-18 22:46:27 +00:00
Rafael Espindola	9ac06a0e6b	Improve the --expand-relocs handling of MachO. In a relocation target can take 3 basic forms * A r_value in scattered relocations. * A symbol in external relocations. * A section is non-external relocations. Have the dump reflect that. With this change we go from CHECK-NEXT: Extern: 0 CHECK-NEXT: Type: X86_64_RELOC_SUBTRACTOR (5) CHECK-NEXT: Symbol: 0x2 CHECK-NEXT: Scattered: 0 To just // CHECK-NEXT: Type: X86_64_RELOC_SUBTRACTOR (5) // CHECK-NEXT: Section: __data (2) Since the relocation is with a section, we print the seciton name and don't need to say that it is not scattered or external. Someone motivated can add further special cases for things like ARM64_RELOC_ADDEND and ARM_RELOC_PAIR. llvm-svn: 240073	2015-06-18 22:38:20 +00:00
Yi Jiang	e0b3499db7	Avoid redundant select node in early if-conversion pass llvm-svn: 240072	2015-06-18 22:34:09 +00:00
Hans Wennborg	67d492a544	Switch lowering: enable whole-switch jump tables at -O0. To same compile time, the analysis to find dense case-clusters in switches is not done at -O0. However, when the whole switch is dense enough, it is easy to turn it into a jump table, resulting in much faster code with no extra effort. llvm-svn: 240071	2015-06-18 22:22:30 +00:00
Rafael Espindola	cf022ba270	Pass --expand-relocs to a few more tests. llvm-svn: 240069	2015-06-18 22:12:47 +00:00
Sanjay Patel	c3e018e6fd	add test to show suboptimal load merging behavior llvm-svn: 240063	2015-06-18 21:34:26 +00:00
Simon Pilgrim	de94fa6438	[X86][SSE][CostModel] Fixed uitofp/sitofp cost target tests to specify sse2/avx2/avx512f directly instead of via a cpu model. llvm-svn: 240062	2015-06-18 21:26:01 +00:00
Sanjay Patel	9fce2bc7b1	fixed to test attributes and use better checks 1. Used update_llc_test_checks.py to tighten checks 2. Fixed triple (nothing Darwin-specific here) 3. Replaced CPU specifiers with attributes 4. Fixed comments 5. Removed IvyBridge run because it did not add any coverage llvm-svn: 240058	2015-06-18 21:12:24 +00:00
Rafael Espindola	aaaa575f71	Use --expand-relocs in a test. It will make the next change easier to read. llvm-svn: 240053	2015-06-18 20:57:35 +00:00
Colin LeMahieu	d2158755eb	[Hexagon] Printing packet brackets when asm printing and adding a number of tests that test packet brackets. llvm-svn: 240051	2015-06-18 20:43:50 +00:00
Sanjoy Das	c65d43e649	[CallGraph] Teach the CallGraph about non-leaf intrinsics. Summary: Currently intrinsics don't affect the creation of the call graph. This is not accurate with respect to statepoint and patchpoint intrinsics -- these do call (or invoke) LLVM level functions. This change fixes this inconsistency by adding a call to the external node for call sites that call these non-leaf intrinsics. This coupled with the fact that these intrinsics also escape the function pointer they call gives us a conservatively correct call graph. Reviewers: reames, chandlerc, atrick, pgavlin Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10526 llvm-svn: 240039	2015-06-18 19:28:26 +00:00
David Majnemer	46c852e438	[CodeGen] Don't emit a random reference to the personality function This should fix issues we've been seeing with Darwin. llvm-svn: 240036	2015-06-18 18:31:46 +00:00
James Y Knight	f90346f8f6	[SPARC] Repair GOT references to internal symbols. They had been getting emitted as a section + offset reference, which is bogus since the value needs to be the offset within the GOT, not the actual address of the symbol's object. Differential Revision: http://reviews.llvm.org/D10441 llvm-svn: 240020	2015-06-18 15:05:15 +00:00
Rafael Espindola	f14eec8d78	Convert a few tests to use llvm-mc. llvm-svn: 240017	2015-06-18 13:39:07 +00:00
Simon Pilgrim	1739421893	[X86][AVX2] Added AVX2 SINT_TO_FP/UINT_TO_FP tests llvm-svn: 240013	2015-06-18 12:32:28 +00:00
Asaf Badouh	81f03c30a5	[AVX512] add instructions: VPAVGB and VPAVGW review http://reviews.llvm.org/D10504 llvm-svn: 240012	2015-06-18 12:30:53 +00:00
Elena Demikhovsky	d3057e5e37	AVX-512: (fixed) Added encoding of all forms of VPERMT2W/D/Q/PS/PD and VPERMI2W/D/Q/PS/PD. Intrinsics and tests for them are comming in the next patch. llvm-svn: 240003	2015-06-18 08:56:19 +00:00
Elena Demikhovsky	4f13f3f9b8	reverted 239999 due to test failures llvm-svn: 240001	2015-06-18 08:06:49 +00:00
Elena Demikhovsky	975a637cd9	AVX-512: Added encoding of all forms of VPERMT2W/D/Q/PS/PD and VPERMI2W/D/Q/PS/PD. Intrinsics and tests for them are comming in the next patch. llvm-svn: 239999	2015-06-18 07:29:40 +00:00
Benjamin Kramer	c6e8bfc41d	[AsmPrinter] Make isRepeatedByteSequence smarter about odd integer types - zext the value to alloc size first, then check if the value repeats with zero padding included. If so we can still emit a .space - Do the checking with APInt.isSplat(8), which handles non-pow2 types - Also handle large constants (bit width > 64) - In a ConstantArray all elements have the same type, so it's sufficient to check the first constant recursively and then just compare if all following constants are the same by pointer compare llvm-svn: 239977	2015-06-17 23:55:17 +00:00
Simon Pilgrim	3aa039a4a8	[X86][SSE] Improved support for vector i16 to float conversions. Added explicit sign extension for v4i16/v8i16 to v4i32/v8i32 before conversion to floats. Matches existing support for v4i8/v8i8. Follow up to D10433 llvm-svn: 239966	2015-06-17 22:43:34 +00:00
Jingyue Wu	cd3afea451	Add NVPTXLowerAlloca pass to convert alloca'ed memory to local address Summary: This is done by first adding two additional instructions to convert the alloca returned address to local and convert it back to generic. Then replace all uses of alloca instruction with the converted generic address. Then we can rely NVPTXFavorNonGenericAddrSpace pass to combine the generic addresscast and the corresponding Load, Store, Bitcast, GEP Instruction together. Patched by Xuetian Weng (xweng@google.com). Test Plan: test/CodeGen/NVPTX/lower-alloca.ll Reviewers: jholewinski, jingyue Reviewed By: jingyue Subscribers: meheff, broune, eliben, jholewinski, llvm-commits Differential Revision: http://reviews.llvm.org/D10483 llvm-svn: 239964	2015-06-17 22:31:02 +00:00
David Majnemer	7fddeccb8b	Move the personality function from LandingPadInst to Function The personality routine currently lives in the LandingPadInst. This isn't desirable because: - All LandingPadInsts in the same function must have the same personality routine. This means that each LandingPadInst beyond the first has an operand which produces no additional information. - There is ongoing work to introduce EH IR constructs other than LandingPadInst. Moving the personality routine off of any one particular Instruction and onto the parent function seems a lot better than have N different places a personality function can sneak onto an exceptional function. Differential Revision: http://reviews.llvm.org/D10429 llvm-svn: 239940	2015-06-17 20:52:32 +00:00
Ahmed Bougacha	f32991461f	[CodeGenPrepare] Generalize inserted set from truncs to any inst. It's been used before to avoid infinite loops caused by separate CGP optimizations undoing one another. We found one more such issue caused by r238054. To avoid it, generalize the "InsertedTruncs" set to any inst, and use it to avoid touching those again. llvm-svn: 239938	2015-06-17 20:44:32 +00:00
Colin LeMahieu	bb71f7d251	[Hexagon] Adding a number of other tests for min/max instructions and loading i1s. llvm-svn: 239935	2015-06-17 20:29:33 +00:00
Peter Collingbourne	4fc603ded3	LowerBitSets: Do not assign names to aliases of unnamed bitset element objects. The restriction on unnamed aliases was removed in r239921. Mostly reverts r239590, but we keep the test. llvm-svn: 239923	2015-06-17 18:31:02 +00:00
Rafael Espindola	54fc298bbc	Allow aliases to be unnamed. If globals can be unnamed, there is no reason for aliases to be different. The restriction was there since the original implementation in r36435. I can only guess it was there because of the old bison parser for the old alias syntax. llvm-svn: 239921	2015-06-17 17:53:31 +00:00
Colin LeMahieu	ca8a82d5c7	[Hexagon] Adding some compare tests, fixing existing XFAILed tests, and removing mcpu=hexagonv4 since that's the minimum version anyway. llvm-svn: 239917	2015-06-17 17:19:05 +00:00
Diego Novillo	8c49a57266	Add documentation for new backedge mass propagation in irregular loops. Tweak test cases and rename headerIndexFor -> getHeaderIndex. llvm-svn: 239915	2015-06-17 16:28:22 +00:00
Benjamin Kramer	58675d4f84	[MC/Dwarf] Encode DW_CFA_advance_loc in target endianess. This matches GNU as output. llvm-svn: 239911	2015-06-17 15:14:35 +00:00
Toma Tabacu	f712ede932	[mips] [IAS] Add support for expanding LASym with a source register operand. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9348 llvm-svn: 239910	2015-06-17 14:31:51 +00:00
Toma Tabacu	1a1083285c	[mips] [IAS] Add support for the B{L,G}{T,E}(U) branch pseudo-instructions. Summary: This does not include support for the immediate variants of these pseudo-instructions. Fixes llvm.org/PR20968. Reviewers: dsanders Reviewed By: dsanders Subscribers: seanbruno, emaste, llvm-commits Differential Revision: http://reviews.llvm.org/D8537 llvm-svn: 239905	2015-06-17 13:20:24 +00:00
Toma Tabacu	9e7b90c244	[mips] [IAS] Fix LA with relative label operands. Summary: Call MCSymbolRefExpr::create() with a MCSymbol* argument, not with a StringRef of the Symbol's name, in order to avoid creating invalid temporary symbols for relative labels (e.g. {$,.L}tmp00, {$,.L}tmp10 etc.). Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10498 llvm-svn: 239901	2015-06-17 12:30:37 +00:00
Toma Tabacu	6a1e0eb27d	[mips] [IAS] Add test for SW with relative label operands. NFC. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10497 llvm-svn: 239899	2015-06-17 11:46:37 +00:00
Toma Tabacu	07c97b3b7e	[mips] [IAS] Fix LW with relative label operands. Summary: Previously, MCSymbolRefExpr::create() was called with a StringRef of the symbol name, which it would then search for in the Symbols StringMap (from MCContext). However, relative labels (which are temporary symbols) are apparently not stored in the Symbols StringMap, so we end up creating a new {$,.L}tmp symbol ({$,.L}tmp00, {$,.L}tmp10 etc.) each time we create an MCSymbolRefExpr by passing in the symbol name as a StringRef. Fortunately, there is a version of MCSymbolRefExpr::create() which takes an MCSymbol* and we already have an MCSymbol* at that point, so we can just pass that in instead of the StringRef. I also removed the local StringRef calls to MCSymbolRefExpr::create() from expandMemInst(), as those cases can be handled by evaluateRelocExpr() anyway. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D9938 llvm-svn: 239897	2015-06-17 10:43:45 +00:00
Igor Breger	dfcc3d31a7	AVX-512: cvtusi2ss/d intrinsics. Change builtin function name and signature ( add third parameter - rounding mode ). Added tests for intrinsics. Differential Revision: http://reviews.llvm.org/D10473 llvm-svn: 239888	2015-06-17 07:23:57 +00:00
Matthias Braun	8321006d44	Revert "AArch64: Use CMP;CCMP sequences for and/or/setcc trees." The patch triggers a miscompile on SPEC 2006 403.gcc with the (ref) 200.i and scilab.i inputs. I opened PR23866 to track analysis of this. This reverts commit r238793. llvm-svn: 239880	2015-06-17 04:02:32 +00:00
Colin LeMahieu	be99a02b1b	[Hexagon] Adding MC ELF streamer and updating addend relocation test which shows correct ELF symbol. llvm-svn: 239876	2015-06-17 03:06:16 +00:00
Sanjay Patel	0848a8be92	Add some tests based on PR21711 These were originally added in r227242, but that patch was reverted because it caused a failure on AArch64. llvm-svn: 239860	2015-06-16 22:37:50 +00:00
Simon Atanasyan	6e07e9305b	[llvm-readobj] Print MIPS .reginfo section content llvm-svn: 239856	2015-06-16 21:47:43 +00:00
Simon Pilgrim	cae7b94cbd	[X86][SSE] Vectorize v2i32 to v2f64 conversions This patch enables support for the conversion of v2i32 to v2f64 to use the CVTDQ2PD xmm instruction and stay on the SSE unit instead of scalarizing, sign extending to i64 and using CVTSI2SDQ scalar conversions. Differential Revision: http://reviews.llvm.org/D10433 llvm-svn: 239855	2015-06-16 21:40:28 +00:00
Philip Reames	c25df11614	Reapply 239795 - [InstCombine] Propagate non-null facts to call parameters The original change broke clang side tests. I will be submitting those momentarily. This change includes post commit feedback on the original change from from Pete Cooper. Original Submission comments: If a parameter to a function is known non-null, use the existing parameter attributes to record that fact at the call site. This has no optimization benefit by itself - that I know of - but is an enabling change for http://reviews.llvm.org/D9129. Differential Revision: http://reviews.llvm.org/D9132 llvm-svn: 239849	2015-06-16 20:24:25 +00:00
Rafael Espindola	c6afe0d4e9	Improve handling of end of file in the bitcode reader. Before this patch the bitcode reader would read a module from a file that contained in order: * Any number of non MODULE_BLOCK sub blocks. * One MODULE_BLOCK * Any number of non MODULE_BLOCK sub blocks. * 4 '\n' characters to handle OS X's ranlib. Since we support lazy reading of modules, any information that is relevant for the module has to be in the MODULE_BLOCK or before it. We don't gain anything from checking what is after. This patch then changes the reader to stop once the MODULE_BLOCK has been successfully parsed. This avoids the ugly special case for .bc files in an archive and makes it easier to embed bitcode files. llvm-svn: 239845	2015-06-16 20:03:39 +00:00
Diego Novillo	9a779623d9	Fix PR 23525 - Separate header mass propagation in irregular loops. Summary: When propagating mass through irregular loops, the mass flowing through each loop header may not be equal. This was causing wrong frequencies to be computed for irregular loop headers. Fixed by keeping track of masses flowing through each of the headers in an irregular loop. To do this, we now keep track of per-header backedge weights. After the loop mass is distributed through the loop, the backedge weights are used to re-distribute the loop mass to the loop headers. Since each backedge will have a mass proportional to the different branch weights, the loop headers will end up with a more approximate weight distribution (as opposed to the current distribution that assumes that every loop header is the same). Reviewers: dexonsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10348 llvm-svn: 239843	2015-06-16 19:10:58 +00:00
Igor Laevsky	8f3fa0ec63	[Statepoints] Test only change. Check that statepoint lowering didn't generate more than expected amount of spills. See http://reviews.llvm.org/D10402 for related discussion. llvm-svn: 239842	2015-06-16 19:07:05 +00:00
Frederic Riss	40baa0aad4	Have MachOObjectFile::isValidArch() accept armv7 llvm-svn: 239833	2015-06-16 17:37:03 +00:00
Alex Lorenz	5ef16b8a7c	MIR Parser: Report an error when a machine function doesn't have a corresponding function. This commit reports an error when a machine function from a MIR file that contains LLVM IR can't find a function with the same name in the loaded LLVM IR module. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10468 llvm-svn: 239831	2015-06-16 17:06:29 +00:00
Rafael Espindola	35f6faed67	Add a test for padded bitcode files. llvm-svn: 239829	2015-06-16 16:36:15 +00:00
Kit Barton	4f79f96fd7	Properly handle the mftb instruction. The mftb instruction was incorrectly marked as deprecated in the PPC Backend. Instead, it should not be treated as deprecated, but rather be implemented using the mfspr instruction. A similar patch was put into GCC last year. Details can be found at: https://sourceware.org/ml/binutils/2014-11/msg00383.html. This change will replace instances of the mftb instruction with the mfspr instruction for all CPUs except 601 and pwr3. This will also be the default behaviour. Additional details can be found in: https://llvm.org/bugs/show_bug.cgi?id=23680 Phabricator review: http://reviews.llvm.org/D10419 llvm-svn: 239827	2015-06-16 16:01:15 +00:00
Matt Arsenault	ed891b5561	Revert "Revert "Fix merges of non-zero vector stores"" Reapply r239539. Don't assume the collected number of stores is the same vector size. Just take the first N stores to fill the vector. llvm-svn: 239825	2015-06-16 15:51:48 +00:00
Benjamin Kramer	1ee59cba5d	[InstSimplify] Allow folding of fdiv X, X with just NaNs ignored Any combination of +-inf/+-inf is NaN so it's already ignored with nnan and we can skip checking for ninf. Also rephrase logic in comments a bit. llvm-svn: 239821	2015-06-16 14:57:29 +00:00
Daniel Sanders	58405d856e	[mips][ias] Expand on r238751 to cover as many relocs as possible. Summary: Relocs that can be converted from absolute to PC-relative now do so if IsPCRel is true. Relocs that require PC-relative now call llvm_unreachable() if IsPCRel is false and similarly those that require absolute assert that IsPCRel is false. Note that while it looks like some relocs (e.g. R_MIPS_26) can be converted into the MIPS32r6/MIPS64r6 relocs (R_MIPS_PC*_S2), it isn't actually valid to do so. Placeholders have been left in the testcase for unsupported relocs and relocs that cannot be generated at the moment. Reviewers: vkalintiris Reviewed By: vkalintiris Subscribers: llvm-commits, rafael Differential Revision: http://reviews.llvm.org/D10184 llvm-svn: 239817	2015-06-16 13:46:26 +00:00
Daniel Sanders	c535d93b47	[llvm-mc] The object form of the GNU triple should be the same as the string form. Summary: GetTarget() may modify TripleName without also updating TheTriple. This can lead to situations where the MCObjectStreamer has a different triple to the rest of LLVM. This inconsistency caused sparc-little-endian.s to pass on Windows because most of LLVM had sparcel-pc-win32 while MCObjectStreamer had "". I believe the same kind of thing was also true of Darwin. Reviewers: rengolin Reviewed By: rengolin Subscribers: llvm-commits, rengolin, rafael Differential Revision: http://reviews.llvm.org/D10450 llvm-svn: 239808	2015-06-16 09:57:38 +00:00
Asaf Badouh	02d126cb9d	[AVX512] add integer min/max intrinsics support. review: http://reviews.llvm.org/D10439 llvm-svn: 239806	2015-06-16 08:39:27 +00:00
NAKAMURA Takumi	f68c7a27f4	Disable llvm/test/CodeGen/MIR/machine-function.mir on x86 msc18 for now. Investigating. The emission was as below; --- name: foo alignment: 31428584 exposesReturnsTwice: true hasInlineAsm: false ... --- name: bar alignment: 1701667182 exposesReturnsTwice: false hasInlineAsm: false ... --- name: func alignment: 8 exposesReturnsTwice: false hasInlineAsm: false ... --- name: func2 alignment: 16 exposesReturnsTwice: true hasInlineAsm: true ... llvm-svn: 239805	2015-06-16 06:57:35 +00:00
Elena Demikhovsky	77f0e9f662	X86: optimized i64 vector multiply with constant When we multiply two 64-bit vectors, we extract lower and upper part and use the PMULUDQ instruction. When one of the operands is a constant, the upper part may be zero, we know this at compile time. Example: %a = mul <4 x i64> %b, <4 x i64> < i64 5, i64 5, i64 5, i64 5>. I'm checking the value of the upper part and prevent redundant "multiply", "shift" and "add" operations. llvm-svn: 239802	2015-06-16 06:07:24 +00:00
Philip Reames	1a6305f313	Revert 239795 I forgot to update some clang test cases. I'll fix and resubmit tomorrow. llvm-svn: 239800	2015-06-16 01:20:53 +00:00
Ahmed Bougacha	8c7754b965	[AArch64] Generalize extract-high DUP extension to MOVI/MVNI. These are really immediate DUPs, and suffer from the same problem with long instructions with a high/2 variant (e.g. smull). By extending a MOVI (or DUP, before this patch), we can avoid an ext on the other operand of the long instruction, e.g. turning: ext.16b v0, v0, v0, #8 movi.4h v1, #0x53 smull.4s v0, v0, v1 into: movi.8h v1, #0x53 smull2.4s v0, v0, v1 While there, add a now-necessary combine to fold (VT NVCAST (VT x)). llvm-svn: 239799	2015-06-16 01:18:14 +00:00
Ahmed Bougacha	d300722b93	[AArch64] Robustize neon-2velem-high test. NFC. llvm-svn: 239798	2015-06-16 01:05:39 +00:00
Philip Reames	dfc29fba60	[InstCombine] Propagate non-null facts to call parameters If a parameter to a function is known non-null, use the existing parameter attributes to record that fact at the call site. This has no optimization benefit by itself - that I know of - but is an enabling change for http://reviews.llvm.org/D9129. Differential Revision: http://reviews.llvm.org/D9132 llvm-svn: 239795	2015-06-16 00:43:54 +00:00
Alex Lorenz	5b5f97537f	MIR Serialization: Print and parse simple machine function attributes. This commit serializes the simple, scalar attributes from the 'MachineFunction' class. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10449 llvm-svn: 239790	2015-06-16 00:10:47 +00:00
Alex Lorenz	8e7a58d7cc	MIR Serialization: Create dummy functions when the MIR file doesn't have LLVM IR. This commit creates a dummy LLVM IR function with one basic block and an unreachable instruction for each parsed machine function when the MIR file doesn't have LLVM IR. This change is required as the machine function analysis pass creates machine functions only for the functions that are defined in the current LLVM module. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10135 llvm-svn: 239778	2015-06-15 23:07:38 +00:00
Alex Lorenz	fe2aa97bab	MIR Serialization: Report an error when machine functions have the same name. This commit reports an error when the MIR parser encounters a machine function with the name that is the same as the name of a different machine function. Reviewers: Duncan P. N. Exon Smith Differential Revision: http://reviews.llvm.org/D10130 llvm-svn: 239774	2015-06-15 22:23:23 +00:00
Peter Collingbourne	58af6d1594	Add safestack attribute to LLVMAttribute enum and Go bindings. Correct constants in commented-out part of LLVMAttribute enum. Add tests that verify that the safestack attribute is only allowed as a function attribute. llvm-svn: 239772	2015-06-15 22:16:51 +00:00
Colin LeMahieu	ded2e90600	[Hexagon] Using readobj rather than objdump. llvm-svn: 239770	2015-06-15 21:57:41 +00:00
Colin LeMahieu	a071a8e5b6	[Hexagon] PC-relative offsets are relative to packet start rather than the offset of the relocation. Set relocation addend and check it's correct in the ELF. llvm-svn: 239769	2015-06-15 21:52:13 +00:00
Simon Pilgrim	aa9f712967	[X86][SSE] Added tests for vector i8/i16 to f32/f64 conversions llvm-svn: 239767	2015-06-15 21:49:31 +00:00
Peter Collingbourne	82437bf7a5	Protection against stack-based memory corruption errors using SafeStack This patch adds the safe stack instrumentation pass to LLVM, which separates the program stack into a safe stack, which stores return addresses, register spills, and local variables that are statically verified to be accessed in a safe way, and the unsafe stack, which stores everything else. Such separation makes it much harder for an attacker to corrupt objects on the safe stack, including function pointers stored in spilled registers and return addresses. You can find more information about the safe stack, as well as other parts of or control-flow hijack protection technique in our OSDI paper on code-pointer integrity (http://dslab.epfl.ch/pubs/cpi.pdf) and our project website (http://levee.epfl.ch). The overhead of our implementation of the safe stack is very close to zero (0.01% on the Phoronix benchmarks). This is lower than the overhead of stack cookies, which are supported by LLVM and are commonly used today, yet the security guarantees of the safe stack are strictly stronger than stack cookies. In some cases, the safe stack improves performance due to better cache locality. Our current implementation of the safe stack is stable and robust, we used it to recompile multiple projects on Linux including Chromium, and we also recompiled the entire FreeBSD user-space system and more than 100 packages. We ran unit tests on the FreeBSD system and many of the packages and observed no errors caused by the safe stack. The safe stack is also fully binary compatible with non-instrumented code and can be applied to parts of a program selectively. This patch is our implementation of the safe stack on top of LLVM. The patches make the following changes: - Add the safestack function attribute, similar to the ssp, sspstrong and sspreq attributes. - Add the SafeStack instrumentation pass that applies the safe stack to all functions that have the safestack attribute. This pass moves all unsafe local variables to the unsafe stack with a separate stack pointer, whereas all safe variables remain on the regular stack that is managed by LLVM as usual. - Invoke the pass as the last stage before code generation (at the same time the existing cookie-based stack protector pass is invoked). - Add unit tests for the safe stack. Original patch by Volodymyr Kuznetsov and others at the Dependable Systems Lab at EPFL; updates and upstreaming by myself. Differential Revision: http://reviews.llvm.org/D6094 llvm-svn: 239761	2015-06-15 21:07:11 +00:00
Alex Lorenz	735c47ec3e	MIR Serialization: Connect the machine function analysis pass to the MIR parser. This commit connects the machine function analysis pass (which creates machine functions) to the MIR parser, which will initialize the machine functions with the state from the MIR file and reconstruct the machine IR. This commit introduces a new interface called 'MachineFunctionInitializer', which can be used to provide custom initialization for the machine functions. This commit also introduces a new diagnostic class called 'DiagnosticInfoMIRParser' which is used for MIR parsing errors. This commit modifies the default diagnostic handling in LLVMContext - now the the diagnostics are printed directly into llvm::errs() so that the MIR parsing errors can be printed with colours. Reviewers: Justin Bogner Differential Revision: http://reviews.llvm.org/D9928 llvm-svn: 239753	2015-06-15 20:30:22 +00:00
Sanjoy Das	784582f116	Add "REQUIRES: asserts" to test case that uses -debug-only llvm-svn: 239748	2015-06-15 20:05:38 +00:00
Sanjoy Das	69fad0799e	[CodeGen] Add a pass to fold null checks into nearby memory operations. Summary: This change adds an "ImplicitNullChecks" target dependent pass. This pass folds null checks into memory operation using the FAULTING_LOAD pseudo-op introduced in previous patches. Depends on D10197 Depends on D10199 Depends on D10200 Reviewers: reames, rnk, pgavlin, JosephTremoulet, atrick Reviewed By: atrick Subscribers: ab, JosephTremoulet, llvm-commits Differential Revision: http://reviews.llvm.org/D10201 llvm-svn: 239743	2015-06-15 18:44:27 +00:00
Evgeny Astigeevich	ff1f4be4c7	On behalf of Alexandros Lamprineas: LLVM targeting aarch64 doesn't correctly produce aligned accesses for non-aligned data at -O0/fast-isel (-mno-unaligned-access). The root cause seems to be in fast-isel not producing unaligned access correctly for -mno-unaligned-access. The patch just aborts fast-isel for loads and stores when -mno-unaligned-access is present. The regression test is updated to check this new test case (-mno-unaligned-access together with fast-isel). Differential Revision: http://reviews.llvm.org/D10360 llvm-svn: 239732	2015-06-15 15:48:44 +00:00
Rafael Espindola	92200d237a	gold-plugin: save the .o when given -save-temps. The plugin now save the bitcode before and after optimizations and the .o that is passed to the linker. llvm-svn: 239726	2015-06-15 13:36:27 +00:00
Jingyue Wu	12b0c2835e	[ValueTracking] do not overwrite analysis results already computed Summary: ValueTracking used to overwrite the analysis results computed from assumes and dominating conditions. This patch fixes this issue. Test Plan: test/Analysis/ValueTracking/assume.ll Reviewers: hfinkel, majnemer Reviewed By: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10283 llvm-svn: 239718	2015-06-15 05:46:29 +00:00
Hao Liu	1c2e89a57a	[AArch64] Delete two empty files, which should be removed by r239713. llvm-svn: 239715	2015-06-15 02:56:40 +00:00
Hao Liu	d0ca8d7edd	[AArch64] Revert r239711 again. We need to discuss how to share code between AArch64 and ARM backend. llvm-svn: 239713	2015-06-15 01:56:40 +00:00
Hao Liu	cb070e3833	[AArch64] Match interleaved memory accesses into ldN/stN instructions. Re-commit after adding "-aarch64-neon-syntax=generic" to fix the failure on OS X. This patch was firstly committed in r239514, then reverted in r239544 because of a syntax incompatible failure on OS X. llvm-svn: 239711	2015-06-15 01:35:49 +00:00
Benjamin Kramer	228680ded8	[InstSimplify] fsub nnan x, x -> 0.0 is valid without ninf Both inf - inf and (-inf) - (-inf) are NaN, so it's already covered by nnan. llvm-svn: 239702	2015-06-14 21:01:20 +00:00
Benjamin Kramer	4f0524614e	[InstSimplify] Add self-fdiv identities for -ffinite-math-only. When NaNs and Infs are ignored we can fold X / X -> 1.0 -X / X -> -1.0 X / -X -> -1.0 llvm-svn: 239701	2015-06-14 18:53:58 +00:00
Igor Breger	5e49697138	AVX-512: Implemented DAG lowering for shuff62x2/shufi62x2 instuctions ( Shuffle Packed Values at 128-bit Granularity ) Tests added , vector-shuffle-512-v8.ll test re-generated. Differential Revision: http://reviews.llvm.org/D10300 llvm-svn: 239697	2015-06-14 13:07:47 +00:00
Michael Kuperstein	e3de07a529	Add support for parsing the XOR operator in Intel syntax inline assembly. Differential Revision: http://reviews.llvm.org/D10385 Patch by marina.yatsina@intel.com llvm-svn: 239695	2015-06-14 12:59:45 +00:00
Igor Breger	abe4a79b75	AVX-512: Implemented cvtsi2ss/d cvtusi2ss/d instructions with round control for KNL. Added intrinsics for cvtsi2ss/d instructions. Added tests for intrinsics and encoding. Differential Revision: http://reviews.llvm.org/D10430 llvm-svn: 239694	2015-06-14 12:44:55 +00:00
Colin LeMahieu	b8575b14be	[Hexagon] Adding some codegen tests and updating some to match spec. llvm-svn: 239690	2015-06-13 21:46:39 +00:00
Simon Pilgrim	d3f6427446	[DAGCombiner] Added BSWAP(BSWAP(x)) -> x combine pattern. llvm-svn: 239682	2015-06-13 16:25:12 +00:00
Simon Pilgrim	011381d48b	[DAGCombiner] Added BSWAP vector constant folding support. llvm-svn: 239675	2015-06-13 14:08:15 +00:00
Tom Stellard	45bb48ea19	R600 -> AMDGPU rename llvm-svn: 239657	2015-06-13 03:28:10 +00:00
Tim Northover	02cfdbb7f1	AArch64: map bare-metal arm64-macho triple to MachO MC layer. Far better than an assertion about expecting ELF. llvm-svn: 239647	2015-06-12 23:37:11 +00:00
Tom Stellard	12a1910e87	R600/SI: Add assembler support for FLAT instructions - Add glc, slc, and tfe operands to flat instructions - Add missing flat instructions - Fix the encoding of flat_load_dwordx3 and flat_store_dwordx3. llvm-svn: 239637	2015-06-12 20:47:06 +00:00
Colin LeMahieu	79ec06525e	[Hexagon] Making intrinsic tests agnostic to register allocation. Narrowing intrinsic parameters to appropriate width. llvm-svn: 239634	2015-06-12 19:57:32 +00:00
Rafael Espindola	de28b7375f	Don't depend on the interleaving of stdout and stderr. That can change as we change the buffering. llvm-svn: 239602	2015-06-12 12:20:03 +00:00
John Brawn	d9e39d53b6	[ARM] Disabling vfp4 should disable fp16 ARMTargetParser::getFPUFeatures should disable fp16 whenever it disables vfp4, as otherwise something like -mcpu=cortex-a7 -mfpu=none leaves us with fp16 enabled (though the only effect that will have is a wrong build attribute). Differential Revision: http://reviews.llvm.org/D10397 llvm-svn: 239599	2015-06-12 09:38:51 +00:00

... 2 3 4 5 6 ...

30774 Commits