llvm-project

Commit Graph

Author	SHA1	Message	Date
Jay Foad	a0653a3e6c	Rename ComputeMaskedBits to computeKnownBits. "Masked" has been inappropriate since it lost its Mask parameter in r154011. llvm-svn: 208811	2014-05-14 21:14:37 +00:00
David Majnemer	2d6c023576	InstSimplify: Optimize signed icmp of -(zext V) Summary: We know that -(zext V) will always be <= zero, simplify signed icmps that have these. Uncovered using http://www.cs.utah.edu/~regehr/souper/ Reviewers: nicholas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3754 llvm-svn: 208809	2014-05-14 20:16:28 +00:00
Christian Pirker	6692e7c116	ARM-BE: test files for vector argument passing Reviewed at http://reviews.llvm.org/D3766 llvm-svn: 208793	2014-05-14 16:59:44 +00:00
Christian Pirker	85cdab63c4	[ARM64-BE] Fix byte order of CIE and FDE frames for exception handling Reviewed at http://reviews.llvm.org/D3741 llvm-svn: 208792	2014-05-14 16:51:58 +00:00
Logan Chien	95188b9092	Fix ARM EHABI when function has landingpad and nounwind. If the function has the landingpad instruction, then the handlerdata should be emitted even if the function has the nounwind attribute. Otherwise, the following code will not work: void test1() noexcept { try { throw_exception(); } catch (...) { log_unexpected_exception(); } } This is because cantunwind was incorrectly emitted and the LSDA is not available. llvm-svn: 208791	2014-05-14 16:38:30 +00:00
Logan Chien	ba1b6951c3	More test case for r208715. The commit r208166 will cause some regression on ARM EHABI. This fix has been committed in r208715, and an assertion failure test case has been committed in r208770. This commit further extends the unittest so that the actual value in the handlerdata will be checked. llvm-svn: 208790	2014-05-14 16:37:32 +00:00
Benjamin Kramer	594f963ea6	X86: If we have an instruction that sets a flag and a zero test on the input of that instruction, try to eliminate the test. For example tzcntl %edi, %ebx testl %edi, %edi je .label can be rewritten into tzcntl %edi, %ebx jb .label A minor complication is that tzcnt sets CF instead of ZF when the input is zero, so we have to rewrite users of the flags from ZF to CF. Currently we recognize patterns using lzcnt, tzcnt and popcnt. Differential Revision: http://reviews.llvm.org/D3454 llvm-svn: 208788	2014-05-14 16:14:45 +00:00
Daniel Sanders	e92ab94d4a	[mips] Test that IAS for -mcpu=mips5 does not accept MIPS64 insns and -mcpu=mips(5|64) does not accept MIPS64r2 Summary: To limit the number of tests required, only one 64-bit ISA prior to MIPS64 is tested. rdhwr has been deliberately left without an ISA annotation for now. This is because the assembler and CodeGen disagree on when the instruction is available. Strictly speaking, it is only available in MIPS32r2 and MIPS64r2. However, it is emulated by a kernel trap on earlier ISAs and is necessary for TLS so CodeGen should emit it on older ISAs too. Depends on D3697 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3698 llvm-svn: 208785	2014-05-14 15:35:03 +00:00
Daniel Sanders	a2ffa21d80	[mips][mips64r6] Add sel.s and sel.d Summary: Also use named constants for common opcode fields. Depends on D3669 Reviewers: vmedic, zoran.jovanovic, jkolek Reviewed By: jkolek Differential Revision: http://reviews.llvm.org/D3670 llvm-svn: 208784	2014-05-14 15:29:44 +00:00
Evgeniy Stepanov	b4aa2b422b	Regression test for ARM EHABI breakage in r208166. llvm-svn: 208770	2014-05-14 11:13:31 +00:00
Evgeniy Stepanov	aaf4bb2394	[asan] Set debug location in ASan function prologue. Most importantly, it gives debug location info to the coverage callback. This change also removes 2 cases of unnecessary setDebugLoc when IRBuilder is created with the same debug location. llvm-svn: 208767	2014-05-14 10:30:15 +00:00
Serge Pavlov	e6de9e39a8	Fix the case when reordering shuffle and binop produces a constant. This resolves PR19737. llvm-svn: 208762	2014-05-14 09:05:09 +00:00
Simon Atanasyan	ae6bb33ac2	[obj2yaml] Support ELF input format in the obj2yaml tool. The ELF header e_flags field in the MIPS related test cases is handled incorrectly. The obj2yaml tool prints too many flags. I will fix that in the next patches. The patch was reviewed by Michael Spencer and Sean Silva. llvm-svn: 208752	2014-05-14 05:07:47 +00:00
Saleem Abdulrasool	27351f2022	ARM: implement support for the UDF mnemonic The UDF instruction is a reserved undefined instruction space. The assembler mnemonic was introduced with ARM ARM rev C.a. The instruction is not predicated and the immediate constant is ignored by the CPU. Add support for the three encodings for this instruction. The changes to the invalid instruction test is due to the fact that the invalid instructions actually overlap with the undefined instruction. Introduction of the new instruction results in a partial decode as an undefined sequence. Drop the tests as they are invalid instruction patterns anyways. llvm-svn: 208751	2014-05-14 03:47:39 +00:00
Nick Lewycky	f0cf8fa941	Optimize integral reciprocal (udiv 1, x and sdiv 1, x) to not use division. This fires exactly once in a clang bootstrap, but covers a few different results from http://www.cs.utah.edu/~regehr/souper/ llvm-svn: 208750	2014-05-14 03:03:05 +00:00
David Blaikie	9b8c8cda0d	Recommit r208506: DebugInfo: Include lexical scopes in inlined subroutines. This was reverted in r208642 due to regressions surrounding file changes within lexical scopes causing inlining information to be lost. The issue was in LexicalScopes::getOrCreateInlinedScope, where I was previously testing "isLexicalBlock" which is false for "DILexicalBlockFile" (a scope used to represent changes in the current file name) and assuming it was then a function (breaking out of the inlined scope path and reaching for the parent non-inlined scopes). By inverting the condition and testing for "isSubprogram" the correct behavior is attained. (also found some weirdness in Clang, see r208742 when reducing this test case - the resulting test case doesn't apply with the Clang fix, but I've added a more realistic test case to inline-scopes.ll which does reproduce the issue and demonstrate the fix) llvm-svn: 208748	2014-05-14 01:08:28 +00:00
Matt Arsenault	4b0402e317	R600/SI: Try to fix BFE operands when moving to VALU This was broken by r208479 llvm-svn: 208740	2014-05-13 23:45:50 +00:00
Christian Pirker	5b0e7a2be2	ARM: Additional test files for thumb fixups (checked with llvm-mc -show-encoding) llvm-svn: 208712	2014-05-13 17:06:51 +00:00
Christian Pirker	39db7ec81f	ARMEB: Fix byte order of EH frame unwinding instructions, with modified test file This commit was already committed as revision rL208689 and discussed in Phabricator revision D3704. But the test file was crashing on OS X and Windows. I fixed the test file in the same way as in rL208340. llvm-svn: 208711	2014-05-13 16:44:30 +00:00
Joey Gouly	12a8bf09d0	[CGP] r205941 changed the logic, so that a cast happens before 'Result' is compared to 'AddrMode.BaseReg'. In the case that 'AddrMode.BaseReg' is nullptr, 'Result' will also be nullptr, so the cast causes an assertion. We should use dyn_cast_or_null here to check 'Result' is not null and it is an instruction. Bug found by Mats Petersson, and I reduced his IR to get a test case. llvm-svn: 208705	2014-05-13 15:42:45 +00:00
Rafael Espindola	2e7eceb317	Revert "ARMEB: Fix byte order of EH frame unwinding instructions" This reverts commit r208689. The test was crashing on OS X and windows. llvm-svn: 208704	2014-05-13 15:19:56 +00:00
Christian Pirker	f2526a56fe	ARM: Additional test files for thumb fixups llvm-svn: 208691	2014-05-13 11:50:39 +00:00
Daniel Sanders	387fc15d2c	[mips] Marked up instructions added in MIPS32r2 and tested that IAS for -mcpu=mips(2|32) does not accept them Summary: This required a new instruction group representing the 32-bit subset of MIPS-3 that was available in MIPS32R2. To limit the number of tests required, only one 32-bit and one 64-bit ISA prior to MIPS32/MIPS64 are tested. rdhwr has been deliberately left without an ISA annotation for now. This is because the assembler and CodeGen disagree on when the instruction is available. Strictly speaking, it is only available in MIPS32r2 and MIPS64r2. However, it is emulated by a kernel trap on earlier ISAs and is necessary for TLS so CodeGen should emit it on older ISAs too. Depends on D3696 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3697 llvm-svn: 208690	2014-05-13 11:45:36 +00:00
Christian Pirker	ea3514ecdb	ARMEB: Fix byte order of EH frame unwinding instructions llvm-svn: 208689	2014-05-13 11:41:49 +00:00
Kevin Qin	97e5d98779	[ARM64] Fix the misleading diagnostic on bad extend amount of reg+reg addressing mode. A vague diagnostic replaced the misleading one. This can fix bug 19502. llvm-svn: 208669	2014-05-13 07:35:12 +00:00
Serge Pavlov	b575ee8294	Fix type of shuffle resulted from shuffle merge. This fix resolves PR19730. llvm-svn: 208666	2014-05-13 06:07:21 +00:00
Weiming Zhao	dd83691cc3	Folding into CSEL when there is ZEXT between SETCC and ADD Normally, patterns like (add x, (setcc cc ...)) will be folded into (csel x, x+1, not cc). However, if there is a ZEXT after SETCC, they won't be folded. This patch recognizes the ZEXT and allows the generation of CSINC. This patch fixes bug 19680. llvm-svn: 208660	2014-05-13 00:40:58 +00:00
Rafael Espindola	cae6590f30	Convert test to FileCheck. llvm-svn: 208658	2014-05-13 00:31:31 +00:00
Rafael Espindola	fbfcb533ba	Convert test to FileCheck. llvm-svn: 208644	2014-05-13 00:07:46 +00:00
David Blaikie	290e22872d	Revert "DebugInfo: Include lexical scopes in inlined subroutines." This reverts commit r208506. Some inlined subroutine scopes appear to be missing with this change. Reverting while I investigate. llvm-svn: 208642	2014-05-12 23:53:03 +00:00
Adam Nemet	5d78558c2b	[DAGCombiner] Split up an indexed load if only the base pointer value is live Right now the load may not get DCE'd because of the side-effect of updating the base pointer. This can happen if we lower a read-modify-write of an illegal larger type (e.g. i48) such that the modification only affects one of the subparts (the lower i32 part but not the higher i16 part). See the testcase. In order to spot the dead load we need to revisit it when SimplifyDemandedBits decided that the value of the load is masked off. This is the CommitTargetLoweringOpt piece. I checked compile time with ARM64 by sending SPEC bitcode files through llc. No measurable change. Fixes <rdar://problem/16031651> llvm-svn: 208640	2014-05-12 23:00:03 +00:00
Louis Gerbarg	b4013235e3	Fix ARM bswap16.ll test on Windows Windows on ARM only supports thumb mode execution, so we have to explicitly pick some non-Windows OS to test ARM mode codegen. llvm-svn: 208638	2014-05-12 22:13:07 +00:00
Reid Kleckner	7a59e0845f	Try to fix an SDAG dependence issue with sret r208453 added support for having sret on the second parameter. In that change, the code for copying sret into a virtual register was hoisted into the loop that lowers formal parameters. This caused a "Wrong topological sorting" assertion failure during scheduling when a parameter is passed in memory. This change undoes that by creating a second loop that deals with sret. I'm worried that this fix is incomplete. I don't fully understand the dependence issues. However, with this change we produce the same DAGs we used to produce, so if they are broken, they are just as broken as they have always been. llvm-svn: 208637	2014-05-12 22:01:27 +00:00
David Blaikie	e323670892	DebugInfo: Make gmlt debug info more gmlt-like by removing variables. For some impending improvements to debug info, LLVM will start assuming that when the CU specifies llvm::DIBuilder::LineTablesOnly, the IR for functions described by that CU will not include variables, types, etc. (might be worth having some test coverage for GMLT + non-GMLT CUs, especially with non-GMLT functions inlined into GMLT CU functions) llvm-svn: 208634	2014-05-12 21:33:03 +00:00
Adam Nemet	63e4b30f79	[Test] Trim unnecessary .c and .cpp from config.suffix in lit.local.cfg Tested by comparing make check VERBOSE=1 before and after to make sure no tests are missed. (VERBOSE=1 prints the list of tests.) Only one test :( remains where .cpp is required: tools/llvm-cov/range_based_for.cpp:// RUN: llvm-cov range_based_for.cpp | FileCheck %s --check-prefix=STDOUT The topic was discussed in this thread: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20140428/214905.html llvm-svn: 208621	2014-05-12 19:57:31 +00:00
Louis Gerbarg	efdcf23736	Add support for bswap16 to/from memory compiling to rev16 on ARM/Thumb The current patterns for REV16 miss most __builtin_bswap16() uses because legalization promotes the operands of loads/stores to i32s and then truncates/extends them. This patch adds new patterns that catch the resultant DAGs and codegens them to rev16 instructions. Tests included. rdar://15353652 llvm-svn: 208620	2014-05-12 19:53:52 +00:00
Sebastian Pop	b1a548f72d	do not assert when delinearization fails llvm-svn: 208615	2014-05-12 19:01:53 +00:00
David Blaikie	4abe19edad	DwarfDebug: Avoid an extra map lookup while constructing abstract scope DIEs and reduce nesting/conditionals. One test case had to be updated as it still had the extra indirection for the variable list - removing the extra indirection got it back to passing. llvm-svn: 208608	2014-05-12 18:23:35 +00:00
Tim Northover	ee20caaf82	TableGen: use PrintMethods to print more aliases llvm-svn: 208607	2014-05-12 18:04:06 +00:00
Matt Arsenault	62b1737081	R600: Add mul24 intrinsics llvm-svn: 208604	2014-05-12 17:49:57 +00:00
Matt Arsenault	2adca6090f	Make SimplifyDemandedBits understand BUILD_PAIR llvm-svn: 208598	2014-05-12 17:14:48 +00:00
Matheus Almeida	c0437c7782	[mips] Move disassembler test (test_2r_msa64) into correct folder. llvm-svn: 208594	2014-05-12 16:59:34 +00:00
Matheus Almeida	440000d6ec	[mips] Move disassembler test (Mips MSA test_vec) into correct folder. llvm-svn: 208592	2014-05-12 16:31:45 +00:00
Matheus Almeida	36c426e491	[mips] Move disassembler tests (Mips MSA test_i*, test_mi10) into correct folder. llvm-svn: 208590	2014-05-12 16:26:53 +00:00
Matheus Almeida	cfc8871596	[mips] Move disassembler tests (Mips MSA test_elm*) into correct folder. llvm-svn: 208589	2014-05-12 16:23:45 +00:00
Matheus Almeida	04092f5bc5	[mips] Move disassembler tests (Mips MSA test_lsa, test_dlsa) into correct folder. llvm-svn: 208588	2014-05-12 16:20:46 +00:00
Matheus Almeida	7fd9339e38	[mips] Move disassembler test (Mips MSA test_ctrlregs) into correct folder. llvm-svn: 208587	2014-05-12 16:16:59 +00:00
Matheus Almeida	38a9a8b675	[mips] Move disassembler test (Mips MSA test_bit) into correct folder. llvm-svn: 208586	2014-05-12 16:10:00 +00:00
Matheus Almeida	b4fce72b32	[mips] Move disassembler tests (Mips MSA test_2r, test_2rf, test_3r, test_3rf) into correct folder. llvm-svn: 208584	2014-05-12 16:03:20 +00:00
Daniel Sanders	f99637cb4d	Revert: r208582 - [mips][mips64r6] Add sel.s and sel.d Accidentally committed an unreviewed patch. Reverted it. llvm-svn: 208583	2014-05-12 15:43:41 +00:00
Daniel Sanders	52de11e475	[mips][mips64r6] Add sel.s and sel.d Summary: Also use named constants for common opcode fields. Depends on D3669 Reviewers: jkolek, vmedic, zoran.jovanovic Differential Revision: http://reviews.llvm.org/D3670 llvm-svn: 208582	2014-05-12 15:39:10 +00:00
Daniel Sanders	08e1e0a873	[mips][mips64r6] Add d?div, d?mod, d?divu, d?modu Summary: Depends on D3668 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3669 llvm-svn: 208579	2014-05-12 15:24:16 +00:00
Daniel Sanders	0ac5ec58b8	[mips][mips64r6] Added mul/mulu/muh/muhu Summary: The 'mul' line of the test is temporarily commented out because it currently matches the MIPS32 mul instead of the MIPS32r6 mul. This line will be uncommented when we disable the MIPS32 mul on MIPS32r6. Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3668 llvm-svn: 208576	2014-05-12 15:12:45 +00:00
Benjamin Kramer	3b36b72a9c	X86: Make sure that we have SSE4.1 before we generate insertps nodes. PR19721. llvm-svn: 208552	2014-05-12 13:12:08 +00:00
Daniel Sanders	aadc357e5f	[mips] Marked up instructions added in MIPS32 and tested that IAS for -mcpu=mips2 does not accept them Summary: To limit the number of tests required, only one 32-bit and one 64-bit ISA prior to MIPS32/MIPS64 are explicitly tested. Depends on D3695 Reviewers: vmedic Differential Revision: http://reviews.llvm.org/D3696 llvm-svn: 208549	2014-05-12 13:04:32 +00:00
Daniel Sanders	07cdea2baa	[mips] Marked up instructions added in MIPS-V and tested that IAS for -mcpu=mips[1234] does not accept them Summary: This required a new instruction group representing the 32-bit subset of MIPS-V that was available in MIPS32R2 Most of these instructions are correctly rejected but with the wrong error message. These have been placed in a separate test for now. It happens because many of the MIPS V instructions have not been implemented. Depends on D3694 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3695 llvm-svn: 208546	2014-05-12 12:52:44 +00:00
Daniel Sanders	070fd1c42a	[mips] Fold FeatureBitCount into FeatureMips32 and FeatureMips64 Summary: DCL[ZO] are now correctly marked as being MIPS64 instructions. This has no effect on the CodeGen tests since expansion of i64 prevented their use anyway. The check for MIPS16 to prevent the use of CLZ no longer prevents DCLZ as well. This is not a functional change since DCLZ is still prohibited by being a MIPS64 instruction (MIPS16 is only compatible with MIPS32). No functional change Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3694 llvm-svn: 208544	2014-05-12 12:41:59 +00:00
Daniel Sanders	39d0051847	[mips] Fold FeatureSwap into FeatureMips32r2 and FeatureMips64r2 Summary: dsbh and dshd are not available on Mips32r2. No codegen test changes required since expansion of i64 prevented the use of these instructions anyway. Depends on D3690 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3692 llvm-svn: 208542	2014-05-12 12:15:41 +00:00
Bradley Smith	bbec45a4f1	[ARM64] Add proper bounds checking/diagnostics to logical shifts llvm-svn: 208540	2014-05-12 11:49:16 +00:00
Christian Pirker	238c7c165b	ARM: Implement big endian bit-conversion for NEON type llvm-svn: 208538	2014-05-12 11:19:20 +00:00
Serge Pavlov	02ff620c7b	Fix type of shuffle obtained from reordering with binary operation In transformation: BinOp(shuffle(v1,undef), shuffle(v2,undef)) -> shuffle(BinOp(v1, v2),undef) type of the undef argument must be same as type of BinOp. llvm-svn: 208531	2014-05-12 10:11:27 +00:00
Bradley Smith	9ba3c963ff	[ARM64] Correct more bounds checks/diagnostics for arithmetic shift operands llvm-svn: 208528	2014-05-12 09:41:43 +00:00
Elena Demikhovsky	8e8fde8e93	AVX-512: changes in intrinsics 1) Changed gather and scatter intrinsics. Now they are aligned with GCC built-ins. There is no more non-masked form. Masked intrinsic receives -1 if all lanes are executed. 2) I changed the function that works with intrinsics inside X86ISelLowering.cpp. I put all intrinsics in one table. I did it for INTRINSICS_W_CHAIN and plan to put all intrinsics from WO_CHAIN set to the same table in order to avoid the long-long "switch". (I wanted to use static map initialization that allowed by C++11 but I wasn't able to compile it on VS2012). 3) I added gather/scatter prefetch intrinsics. 4) I fixed MRMm encoding for masked instructions. llvm-svn: 208522	2014-05-12 07:18:51 +00:00
Serge Pavlov	0581109708	Fix reordering of shuffles and binary operations Do not apply transformation: BinOp(shuffle(v1), shuffle(v2)) -> shuffle(BinOp(v1, v2)) if operands v1 and v2 are of different size. This change fixes PR19717, which was caused by r208488. llvm-svn: 208518	2014-05-12 05:44:53 +00:00
Hal Finkel	0d8db46799	[PowerPC] Add global named register support Support for the intrinsics that read from and write to global named registers is added for r1, r2 and r13 (depending on the subtarget). llvm-svn: 208509	2014-05-11 19:29:11 +00:00
David Blaikie	9576766be9	DebugInfo: Include lexical scopes in inlined subroutines. llvm-svn: 208506	2014-05-11 18:12:17 +00:00
David Blaikie	866c87f07b	DebugInfo: Simplify/correct test for correct constant emission when dealing with derived types. This test was using the inliner and other optimizations to test a case that's actually a bug anyway. Bug and possible fix/discussion described here ( http://reviews.llvm.org/D3714 ). But the functionality that was implemented along with this test is still desired, so simplify the test to verify a more obvious/less wrong case that the functionality addressed: looking through const sugar to the underlying type when emitting a constant (so the constant is emitted as signed/unsigned as appropriate depending on the signedness of the underlying type). llvm-svn: 208504	2014-05-11 17:25:50 +00:00
Hal Finkel	c4c6c87666	[PowerPC] On PPC32, 128-bit shifts might be runtime calls The counter-loops formation pass needs to know what operations might be function calls (because they can't appear in counter-based loops). On PPC32, 128-bit shifts might be runtime calls (even though you can't use __int128 on PPC32, it seems that SROA might form them). Fixes PR19709. llvm-svn: 208501	2014-05-11 16:23:29 +00:00
David Blaikie	c05c8f483b	DwarfUnit: Share common constant value emission between APInts of small (<= 64 bit) and MCOperand immediates. Doesn't seem a good reason to duplicate this code (it was more literally duplicated prior to r208494, and while the dataN code /does/ actually fire in this case, it doesn't seem necessary (and the DWARF standard recommends using udata/sdata pervasively instead of dataN, so as to indicate signedness of the values)) llvm-svn: 208495	2014-05-11 15:47:39 +00:00
Simon Atanasyan	ca8b562709	[llvm-readobj] Print values of FLAGS and MIPS_FLAGS dynamic table tags in a human readable form. llvm-svn: 208489	2014-05-11 08:48:09 +00:00
Serge Pavlov	9ef66a8266	Reorder shuffle and binary operation. This patch enables transformations: BinOp(shuffle(v1), shuffle(v2)) -> shuffle(BinOp(v1, v2)) BinOp(shuffle(v1), const1) -> shuffle(BinOp, const2) They allow extra shuffles to be eliminated in some cases. Differential Revision: http://reviews.llvm.org/D3525 llvm-svn: 208488	2014-05-11 08:46:12 +00:00
Filipe Cabecinhas	0e3d1cb5d6	Fixed a bug when lowering build_vector (PR19694) When lowering build_vector to an insertps, we would still lower it, even if the source vectors weren't v4x32. This would break on avx if the source was a v8x32. We now check the type of the source vectors. llvm-svn: 208487	2014-05-11 08:12:56 +00:00
Vincent Lejeune	29c0c210fc	R600/SI: Fold fabs/fneg into src input modifier llvm-svn: 208480	2014-05-10 19:18:39 +00:00
Vincent Lejeune	94af31fbe8	R600/SI: Prettier display of input modifiers llvm-svn: 208479	2014-05-10 19:18:33 +00:00
Tim Northover	55b3e22927	ARM64: fix SELECT_CC lowering in absence of NaNs. We were swapping the true & false results while testing for FMAX/FMIN, but not putting them back to the original state if the later checks failed. Should fix PR19700. llvm-svn: 208469	2014-05-10 07:37:50 +00:00
Kevin Enderby	6abc2e58ac	Fix llvm-nm to print the full 64-bit address for symbols in 64-bit object files. The implementation might be better to have a method is64Bit() in the class SymbolicFile instead of having the static routine isSymbolList64Bit() in llvm-nm.cpp. But this is very much in the spirit of isObject() and getNMTypeChar() in llvm-nm.cpp, which have a series of if else statements based on the specific class of the SymbolicFile. I can update this if folks would like. Also the tests were updated to be explicit about checking the address for 64-bits or 32-bits from object files. llvm-svn: 208463	2014-05-09 23:57:49 +00:00
Benjamin Kramer	8722aa5754	SLPVectorizer: When sorting by domination for CSE don't assert on unreachable code. There is no total ordering if the CFG is disconnected. We don't care if we catch all CSE opportunities in dead code either so just ignore them in the assert. PR19646 llvm-svn: 208461	2014-05-09 23:28:49 +00:00
Reid Kleckner	c487d73f41	Revert "[ms-cxxabi] Add a new calling convention that swaps 'this' and 'sret'" This reverts commit r200561. This calling convention was an attempt to match the MSVC C++ ABI for methods that return structures by value. This solution didn't scale, because it would have required splitting every CC available on Windows into two: one for methods and one for free functions. Now that we can put sret on the second arg (r208453), and Clang does that (r208458), revert this hack. llvm-svn: 208459	2014-05-09 22:56:42 +00:00
Reid Kleckner	7941856445	Allow sret on the second parameter as well as the first MSVC always places the implicit sret parameter after the implicit this parameter of instance methods. We used to handle this for x86_thiscallcc by allocating the sret parameter on the stack and leaving the this pointer in ecx, but that doesn't handle alternative calling conventions like cdecl, stdcall, fastcall, or the win64 convention. Instead, change the verifier to allow sret on the second parameter. This also requires changing the Mips and X86 backends to return the argument with the sret parameter, instead of assuming that the sret parameter comes first. The Sparc backend also returns sret parameters in a register, but I wasn't able to update it to handle secondary sret parameters. It currently calls report_fatal_error if you feed it an sret in the second parameter. Reviewers: rafael.espindola, majnemer Differential Revision: http://reviews.llvm.org/D3617 llvm-svn: 208453	2014-05-09 22:32:13 +00:00
Reid Kleckner	d0eda92845	Fix ARM intrinsics-overflow.ll test on Windows Windows on ARM only supports thumb mode execution, so we have to explicitly pick some non-Windows OS to test ARM mode codegen. llvm-svn: 208448	2014-05-09 21:52:48 +00:00
Rafael Espindola	aa2738291b	Don't crash on redefinitions. On error we were not deleting the alias or putting it in the Module. The end result is that there was a use left of the aliasee when the module was deleted. llvm-svn: 208447	2014-05-09 21:49:17 +00:00
Louis Gerbarg	3342bf1451	Add custom lowering for add/sub with overflow intrinsics to ARM This patch adds support to ARM for custom lowering of the llvm.{u|s}add.with.overflow.i32 intrinsics for i32/i64. This is particularly useful for handling idiomatic saturating math functions as generated by InstCombineCompare. Test cases included. rdar://14853450 llvm-svn: 208435	2014-05-09 17:02:49 +00:00
Louis Gerbarg	1f54b82164	Add ExtractValue instruction to SimplifyCFG's ComputeSpeculationCost Since ExtractValue is not included in ComputeSpeculationCost CFGs containing ExtractValueInsts cannot be simplified. In particular this interacts with InstCombineCompare's tendency to insert add.with.overflow intrinsics for certain idiomatic math operations, preventing optimization. This patch adds ExtractValue to the ComputeSpeculationCost. Test case included rdar://14853450 llvm-svn: 208434	2014-05-09 17:02:46 +00:00
Tom Stellard	4c00b52e1a	R600/SI: Teach SIInstrInfo::moveToVALU() how to move S_LOAD_*_IMM instructions llvm-svn: 208432	2014-05-09 16:42:22 +00:00
Tom Stellard	d6cb8e8efd	R600/SI: Fix SMRD pattern for offsets > 32 bits We were dropping the high bits of 64-bit immediate offsets. llvm-svn: 208431	2014-05-09 16:42:21 +00:00
Tom Stellard	a2acad785a	R600: Expand i64 SELECT_CC llvm-svn: 208430	2014-05-09 16:42:19 +00:00
Tom Stellard	afa8b532b1	R600: Move MIN/MAX matching from LowerOperation() to PerformDAGCombine() llvm-svn: 208429	2014-05-09 16:42:16 +00:00
James Molloy	dd1aa14a21	Attempt to pacify the bots - this commit requires asserts. llvm-svn: 208424	2014-05-09 16:20:53 +00:00
Daniel Sanders	e57d866ed0	[mips] Marked up instructions added in MIPS-IV and tested that IAS for -mcpu=mips[123] does not accept them Summary: This required a new instruction group representing the 32-bit subset of MIPS-IV that was available in MIPS32 A small number of instructions are correctly rejected but with the wrong error message. These have been placed in a separate test for now. Depends on D3676 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3677 llvm-svn: 208414	2014-05-09 14:06:17 +00:00
Oliver Stannard	c24f2171ca	ARM: HFAs must be passed in consecutive registers When using the ARM AAPCS, HFAs (Homogeneous Floating-point Aggregates) must be passed in a block of consecutive floating-point registers, or on the stack. This means that unused floating-point registers cannot be back-filled with part of an HFA, however this can currently happen. This patch, along with the corresponding clang patch (http://reviews.llvm.org/D3083) prevents this. llvm-svn: 208413	2014-05-09 14:01:47 +00:00
Simon Atanasyan	d6a20e5115	[yaml2obj] Follow-up to the r208228 and r208406. Remove duplicated YAML map keys. llvm-svn: 208412	2014-05-09 13:57:33 +00:00
Daniel Sanders	f2056bef32	[mips] Marked up instructions added in MIPS-III and tested that IAS for -mcpu=mips[12] does not accept them Summary: This required a new instruction group representing the 32-bit subset of MIPS-III that was available in MIPS32 A small number of instructions are correctly rejected but with the wrong error message. These have been placed in a separate test for now. There are some obvious InstAlias's that ought to be marked MIPS-III but aren't. This is because they are not currently tested. I intend to catch these with a final pass through the tablegen records to find tablegen records without ISA annotations. Depends on D3674 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3675 llvm-svn: 208408	2014-05-09 13:02:27 +00:00
NAKAMURA Takumi	f50871f460	Mark yaml2obj-elf-x86-rel.yaml as XFAIL:vg_leak for now. This has two pairs of duplicate hashes. llvm-svn: 208406	2014-05-09 11:24:18 +00:00
Daniel Sanders	b7f1c6ff3e	[mips][mips64r6] Add experimental support for MIPS32r6 and MIPS64r6 Summary: Adds MIPS32r6/MIPS64r6 and checks the compatibility requirements for these processors. I've also included comments to describe removed and re-encoded instructions, along with placeholder def's for the new instructions but there are no functional changes to codegen at this point. Reviewers: jkolek, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3622 llvm-svn: 208399	2014-05-09 09:46:21 +00:00
Daniel Sanders	f075173127	[mips] Added missing daddu test to the valid instruction tests. Summary: Depends on D3673 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3674 llvm-svn: 208398	2014-05-09 09:32:01 +00:00
Daniel Sanders	52bdd651e5	[mips] Added missing dsra -> dsrav and sra -> srav aliases. Summary: dsll, dsrl, sll, and srl already exist. Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3673 llvm-svn: 208397	2014-05-09 09:24:49 +00:00
NAKAMURA Takumi	370d5f808a	test/TableGen: Remove XFAIL:vg_leak out of 3 tests corresponding to r208293. llvm-svn: 208393	2014-05-09 08:18:33 +00:00
Saleem Abdulrasool	40bca0afab	ARM: support PIC on Windows on ARM Handle lowering of global addresses for PIC mode compilation on Windows. Always use the movw/movt load to load the address as Windows on ARM requires ARMv7+ and is a pure Thumb environment. llvm-svn: 208385	2014-05-09 00:58:32 +00:00
Filipe Cabecinhas	e4b482b3ed	Optimize shufflevector that copies an i64/f64 and zeros the rest. Summary: Also ran clang-format on the function. The code added is the last else if block. Reviewers: nadav, craig.topper, delena Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3518 llvm-svn: 208372	2014-05-08 23:16:08 +00:00
Justin Bogner	7833d9facb	test/CodeGen: Check that the correct register is used in a store This tightens up r208351 to ensure that a store is fed with the correct value. Thanks to Quentin Colombet for spotting this! llvm-svn: 208368	2014-05-08 22:45:07 +00:00
Michael Zolotukhin	292d3caa15	[InstCombine] Some cleanup in optimization of redundant insertvalue instructions. And one more test added. llvm-svn: 208355	2014-05-08 19:50:24 +00:00
Justin Bogner	1de42075fc	Make a CodeGen test more robust against vector register selection llvm-svn: 208351	2014-05-08 18:53:56 +00:00
Sebastian Pop	45dd14bac2	add testcase for r208237: do not collect undef terms llvm-svn: 208347	2014-05-08 18:38:58 +00:00
Andrea Di Biagio	e85ba4df52	[X86] Add target specific combine rules to fold SSE2/AVX2 packed arithmetic shift intrinsics. This patch teaches the backend how to combine packed SSE2/AVX2 arithmetic shift intrinsics. The rules are: - Always fold a packed arithmetic shift by zero to its first operand; - Convert a packed arithmetic shift intrinsic dag node into an ISD::SRA only if the shift count is known to be smaller than the vector element size. This patch also teaches the function 'getTargetVShiftByConstNode' how to fold target specific vector shifts by zero. Added two new tests to verify that the DAGCombiner is able to fold sequences of SSE2/AVX2 packed arithmetic shift calls. llvm-svn: 208342	2014-05-08 17:44:04 +00:00
Saleem Abdulrasool	39a939d7d2	test: fix test on Windows When building on Windows, the default target is Windows. Windows on ARM does not support ARM mode compilation, resulting in test failures. Simply specify a triple to ensure that we are testing the correct behaviour. llvm-svn: 208340	2014-05-08 17:11:29 +00:00
NAKAMURA Takumi	cc648f75eb	Mark test/TableGen/listconcat.td as XFAIL:vg_leak. llvm-tblgen is ignorant of vg_leak. llvm-svn: 208337	2014-05-08 17:06:10 +00:00
Daniel Sanders	f72791e464	[mips] Correct tests that are meant to test valid assembly. They were actually rejected by GAS. Summary: I've noticed a bug in my test generator script that caused 64-bit objects to be disassembled as if it were using the O32 ABI, giving the wrong register names. As a result, it generated assembly files that are rejected by GAS when assembling for the correct ABI. This was caused by the generator setting the ELF e_flags incorrectly before disassembling the object. This patch corrects the invalid tests that have already been committed by replacing the ABI-dependent register names with numeric registers. In addition to fixing the tests this allows the 32-bit and 64-bit ISA tests to be easily diffed to produce the invalid-*.s tests which test that instructions defined in later ISA's are not accepted. Depends on D3648 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3649 llvm-svn: 208327	2014-05-08 15:17:29 +00:00
Christian Pirker	b5728191c2	ARM big endian function argument passing llvm-svn: 208316	2014-05-08 14:06:24 +00:00
Daniel Sanders	cdbbe08b05	[mips] Implement l[wd]c3, and s[wd]c3. Summary: These instructions were added in MIPS-I, and MIPS-II but were removed in MIPS-III. Interestingly, GAS continues to accept them when assembling for MIPS-III. For the moment, these instructions will follow GAS and accept them for MIPS-III and newer but this will be tightened up when the invalid-*.s tests are added. Depends on D3647 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3648 llvm-svn: 208311	2014-05-08 13:02:11 +00:00
Dario Domizioli	058a8fd36c	Revert test commit. Removed blank line. llvm-svn: 208308	2014-05-08 12:54:43 +00:00
James Molloy	c42ea14f74	[ARM64-BE] Teach fast-isel about how to set up sub-word stack arguments for big endian calls. SelectionDAG already knows about this, but fast-isel was ignorant. llvm-svn: 208307	2014-05-08 12:53:50 +00:00
Daniel Sanders	d39320c6b6	[mips] Marked up instructions added in MIPS-II and tested that IAS for -mcpu=mips1 does not accept them Summary: A small number of instructions are rejected with the wrong error message. These have been placed in a separate test for now. There seems to be some parsing quirk that triggers when these instructions are disabled. Depends on D3571 Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D3647 llvm-svn: 208305	2014-05-08 12:40:48 +00:00
Daniel Sanders	8dcb116a3e	[mips] Implement tlbp, tlbr, tlbwi, and tlbwr Reviewers: vmedic, dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3571 llvm-svn: 208301	2014-05-08 11:51:18 +00:00
Dario Domizioli	a2fa84edbf	Test commit. Added blank line. llvm-svn: 208298	2014-05-08 11:28:14 +00:00
Tim Northover	18f8bb84fa	ARM64: make sure FastISel emits SSA MachineInstrs We need to use a temporary register for a 2-step operation like REM. llvm-svn: 208297	2014-05-08 10:30:56 +00:00
Evgeniy Stepanov	9661ec0ec3	[asan] Preserve flags in asm instrumentation. Patch by Yuri Gorshenin. llvm-svn: 208296	2014-05-08 09:55:24 +00:00
Hal Finkel	6532c20faa	Move late partial-unrolling thresholds into the processor definitions The old method used by X86TTI to determine partial-unrolling thresholds was messy (because it worked by testing target features), and also would not correctly identify the target CPU if certain target features were disabled. After some discussions on IRC with Chandler et al., it was decided that the processor scheduling models were the right containers for this information (because it is often tied to special uop dispatch-buffer sizes). This does represent a small functionality change: - For generic x86-64 (which uses the SB model and, thus, will get some unrolling). - For AMD cores (because they still currently use the SB scheduling model) - For Haswell (based on benchmarking by Louis Gerbarg, it was decided to bump the default threshold to 50; we're working on a test case for this). Otherwise, nothing has changed for any other targets. The logic, however, has been moved into BasicTTI, so other targets may now also opt-in to this functionality simply by setting LoopMicroOpBufferSize in their processor model definitions. llvm-svn: 208289	2014-05-08 09:14:44 +00:00
Hao Liu	1187a3d8db	AArch64/ARM64: Port NEON post-increment load/store with 2/3/4 vectors to ARM64 backend. llvm-svn: 208284	2014-05-08 07:38:13 +00:00
Saleem Abdulrasool	2ff4948035	test: fix silly typo Oh silly Darwin and your case insensitive file system. llvm-svn: 208274	2014-05-08 01:41:04 +00:00
Saleem Abdulrasool	fc6b85b185	ARM: support FK_SecRel_2 relocations on WoA This adds FK_SecRel_2 relocation support to ARM. This enables the building of object files for armv7-windows-msvc which enables CodeView line tables for debugging as opposed to armv7-windows-itanium which currently uses DWARF. llvm-svn: 208273	2014-05-08 01:35:57 +00:00
Filipe Cabecinhas	095d9d573a	Lower certain build_vectors to insertps instructions Summary: Vectors built with zeros and elements in the same order as another (source) vector are optimized to be built using a single insertps instruction. Also optimize when we move one element in a vector to a different place in that vector while zeroing out some of the other elements. Further optimizations are possible, described in TODO comments. I will be implementing at least some of them in the near future. Added some tests for different cases where this optimization triggers. Reviewers: nadav, delena, craig.topper Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3521 llvm-svn: 208271	2014-05-08 00:25:16 +00:00
Duncan P. N. Exon Smith	b80de1012a	IR: Don't allow non-default visibility on local linkage Visibilities of `hidden` and `protected` are meaningless for symbols with local linkage. - Change the assembler to reject non-default visibility on symbols with local linkage. - Change the bitcode reader to auto-upgrade `hidden` and `protected` to `default` when the linkage is local. - Update LangRef. <rdar://problem/16141113> llvm-svn: 208263	2014-05-07 22:57:20 +00:00
Quentin Colombet	60cdff65c7	[X86] Add a test case for r208252. Prior to r208252, the FMA 231 family was marked as isCommutable. However the memory variants of this family are not commutable. Therefore, we did not implement the findCommutedOpIndices for those variants and missed that the default implementation (more or less: commute indices 1 and 2) was firing behind our back. As a result, as demonstrated in the test case before the fix, we were transforming a = b * c + a into a = a * c + b. I.e., before r208252 we were generating for this test case: vmovaps %xmm0, %xmm1 vmovss (%rsi), %xmm0 vfmadd231ss (%rdi), %xmm1, %xmm0 Instead of: vmovss (%rsi), %xmm1 vfmadd231ss (%rdi), %xmm1, %xmm0 <rdar://problem/16800495> llvm-svn: 208260	2014-05-07 22:52:58 +00:00
Adam Nemet	2973efb866	[Test] Remove c-index-test from the list of substitutions All the tests are under the clang tests and none should be under llvm moving forward. The topic was discussed in this thread: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20140428/214905.html llvm-svn: 208234	2014-05-07 18:16:02 +00:00
Sebastian Pop	448712b1a6	split delinearization pass in 3 steps To compute the dimensions of the array in a unique way, we split the delinearization analysis in three steps: - find parametric terms in all memory access functions - compute the array dimensions from the set of terms - compute the delinearized access functions for each dimension The first step is executed on all the memory access functions such that we gather all the patterns in which an array is accessed. The second step reduces all this information in a unique description of the sizes of the array. The third step is delinearizing each memory access function following the common description of the shape of the array computed in step 2. This rewrite of the delinearization pass also solves a problem we had with the previous implementation: because the previous algorithm was by induction on the structure of the SCEV, it would not correctly recognize the shape of the array when the memory access was not following the nesting of the loops: for example, see polly/test/ScopInfo/multidim_only_ivs_3d_reverse.ll ; void foo(long n, long m, long o, double A[n][m][o]) { ; ; for (long i = 0; i < n; i++) ; for (long j = 0; j < m; j++) ; for (long k = 0; k < o; k++) ; A[i][k][j] = 1.0; Starting with this patch we no longer delinearize access functions that do not contain parameters, for example in test/Analysis/DependenceAnalysis/GCD.ll ;; for (long int i = 0; i < 100; i++) ;; for (long int j = 0; j < 100; j++) { ;; A[2i - 4j] = i; ;; B++ = A[6i + 8*j]; these accesses will not be delinearized as the upper bound of the loops are constants, and their access functions do not contain SCEVUnknown parameters. llvm-svn: 208232	2014-05-07 18:01:20 +00:00
Simon Atanasyan	68f6150156	[yaml2obj] Support ELF x86 relocations. llvm-svn: 208228	2014-05-07 17:06:38 +00:00
Chad Rosier	788e5e3d7c	[ARM64][fast-isel] Disable target specific optimizations at -O0. Functionally, this patch disables the dead register elimination pass and the load/store pair optimization pass at -O0. The ILP optimizations don't require the optimization level to be checked because the call to addILPOpts is predicated with the necessary check. The AdvSIMDScalar pass is disabled by default at all optimization levels. This patch leaves that pass disabled by default. Also, move command-line options into ARM64TargetMachine.cpp and add a few additional flags to aid in debugging. This fixes an issue with the -debug-pass=Structure flag where passes were printed, but not actually run (i.e., AdvSIMDScalar pass). llvm-svn: 208223	2014-05-07 16:41:55 +00:00
Daniel Sanders	d240953db2	[mips] Add highly experimental support for MIPS-I, MIPS-II, MIPS-III, and MIPS-V Summary: These processors will only be available for the integrated assembler at first (CodeGen will emit a fatal error saying they are not implemented). The intention is to work through the existing instructions and correctly annotate the ISA they were added in so that we have a sufficiently good base to start MIPS64r6 development. MIPS64r6 removes/re-encodes certain instructions and I believe it is best to define ISA's using set-union's as far as possible rather than using set-subtraction. Reviewers: vmedic Subscribers: emaste, llvm-commits Differential Revision: http://reviews.llvm.org/D3569 llvm-svn: 208221	2014-05-07 16:25:22 +00:00
Michael Zolotukhin	7d6293a0d3	[InstCombine] Add optimization of redundant insertvalue instructions. rdar://problem/11861387 llvm-svn: 208214	2014-05-07 14:30:18 +00:00
Evgeniy Stepanov	c14fc42137	[msan] Fix -fsanitize=memory -fno-integrated-as. llvm-svn: 208211	2014-05-07 14:10:51 +00:00
Tim Northover	88a51d983e	AArch64/ARM64: optimise vector selects & enable test When performing a scalar comparison that feeds into a vector select, it's actually better to do the comparison on the vector side: the scalar route would be "CMP -> CSEL -> DUP", the vector is "CM -> DUP" since the vector comparisons are all mask based. llvm-svn: 208210	2014-05-07 14:10:27 +00:00
James Molloy	d3c401a2d0	[ARM64-BE] Fix fast-isel, and add appropriate RUN lines to appropriate tests. llvm-svn: 208200	2014-05-07 12:33:55 +00:00
James Molloy	36132057da	[ARM64-BE] Fix variable-argument saving. llvm-svn: 208199	2014-05-07 12:33:48 +00:00
James Molloy	4049e4fd77	[ARM64-BE] Implement the lane-twiddling logic at AAPCS boundaries for big endian. The AAPCS states that values passed in registers must have a value as though they had been loaded with "LDR". LDR is equivalent to "LD1.64 vX.1D" - that is, loading scalars to vector registers and loading 1-element vectors is equivalent. The logic implemented here is to ensure that at all call boundaries and during formal argument lowering all vectors are treated as their bitwidth-based floating point scalar counterpart, which is always one of f64 or f128 (v2i32 -> f64, v4i32 -> f128 etc). A BITCAST is inserted so that the appropriate REV will be generated during code generation. llvm-svn: 208198	2014-05-07 12:33:41 +00:00
James Molloy	30e0e11eb4	[ARM64-BE] Implement the crazy bitcast handling for big endian vectors. Because we've canonicalised on using LD1/ST1, every time we do a bitcast between vector types we must do an equivalent lane reversal. Consider a simple memory load followed by a bitconvert then a store. v0 = load v2i32 v1 = BITCAST v2i32 v0 to v4i16 store v4i16 v2 In big endian mode every memory access has an implicit byte swap. LDR and STR do a 64-bit byte swap, whereas LD1/ST1 do a byte swap per lane - that is, they treat the vector as a sequence of elements to be byte-swapped. The two pairs of instructions are fundamentally incompatible. We've decided to use LD1/ST1 only to simplify compiler implementation. LD1/ST1 perform the equivalent of a sequence of LDR/STR + REV. This makes the original code sequence: v0 = load v2i32 v1 = REV v2i32 (implicit) v2 = BITCAST v2i32 v1 to v4i16 v3 = REV v4i16 v2 (implicit) store v4i16 v3 But this is now broken - the value stored is different to the value loaded due to lane reordering. To fix this, on every BITCAST we must perform two other REVs: v0 = load v2i32 v1 = REV v2i32 (implicit) v2 = REV v2i32 v3 = BITCAST v2i32 v2 to v4i16 v4 = REV v4i16 v5 = REV v4i16 v4 (implicit) store v4i16 v5 This means an extra two instructions, but actually in most cases the two REV instructions can be combined into one. For example: (REV64_2s (REV64_4h X)) === (REV32_4h X) There is also no 128-bit REV instruction. This must be synthesized with an EXT instruction. Most bitconverts require some sort of conversion. The only exceptions are: a) Identity conversions - vNfX <-> vNiX b) Single-lane-to-scalar - v1fX <-> fX or v1iX <-> iX Even though there are hundreds of changed lines, I have a fairly high confidence that they are somewhat correct. The changes to add two REV instructions per bitcast were pretty mechanical, and once I'd done that I threw the resulting .td at a script I wrote which combined the two REVs together (and added an EXT instruction, for f128) based on an instruction description I gave it. This was much less prone to error than doing it all manually, plus my brain would not just have melted but would have vapourised. llvm-svn: 208194	2014-05-07 11:28:53 +00:00
James Molloy	ccc7f982c1	[ARM64-BE] Make big endian (scalar) argument passing work correctly. This completes the port of r204814 (cpirker "AArch64_BE function argument passing for ARM ABI") from AArch64 to ARM64, and fixes a bunch of issues found during later development along the way. The biggest of these was that the alignment fixup logic wasn't replicated into all the places it should have been. llvm-svn: 208192	2014-05-07 11:28:36 +00:00
Tim Northover	df723343fa	AArch64/ARM64: run test on ARM64 too. llvm-svn: 208188	2014-05-07 10:47:04 +00:00
Tim Northover	76a94e6ead	AArch64/ARM64: put annotation in test It makes finding already covered tests much easier with "grep -L arm64". llvm-svn: 208187	2014-05-07 10:47:00 +00:00
Tim Northover	2d7cacd86b	AArch64/ARM64: disable test directory if ARM64 not present llvm-svn: 208186	2014-05-07 10:42:06 +00:00
Daniel Sanders	314e80e5f8	[tablegen] Add !listconcat operator with semantics similar to !strconcat Summary: It concatenates two or more lists. In addition to the !strconcat semantics the lists must have the same element type. My overall aim is to make it easy to append to Instruction.Predicates rather than override it. This can be done by concatenating lists passed as arguments, or by concatenating lists passed in additional fields. Reviewers: dsanders Reviewed By: dsanders Subscribers: hfinkel, llvm-commits Differential Revision: http://reviews.llvm.org/D3506 llvm-svn: 208183	2014-05-07 10:13:19 +00:00
Evgeniy Stepanov	3819f02819	[asan] Add a flag to control asm instrumentation. With this change, asm instrumentation is disabled by default. llvm-svn: 208167	2014-05-07 07:54:11 +00:00
Joerg Sonnenberger	cf86ce136c	Allow using normal .eh_frame based unwinding on ARM. Use the same encodings as x86. Use this exception model for NetBSD. llvm-svn: 208166	2014-05-07 07:49:34 +00:00
Saleem Abdulrasool	acd0338c61	ARM: fix WoA PEI instruction selection The ARM::BLX instruction is an ARM mode instruction. The Windows on ARM target is limited to Thumb instructions. Correctly use the thumb mode tBLXr instruction. This would manifest as an errant write into the object file as the instruction is 4-bytes in length rather than 2. The result would be a corrupted object file that would eventually result in an executable that would crash at runtime. llvm-svn: 208152	2014-05-07 03:03:27 +00:00
Justin Bogner	cf27e1b996	llvm-cov: Handle missing source files as GCOV does If the source files referenced by a gcno file are missing, gcov outputs a coverage file where every line is simply /EOF/. This also occurs for lines in the coverage that are past the end of a file that is found. This change mimics gcov. llvm-svn: 208149	2014-05-07 02:11:23 +00:00
Justin Bogner	1a18d7caa3	llvm-cov: Implement --no-output In gcov, there's a -n/--no-output option, which disables the writing of any .gcov files, so that it emits only the summary info on stdout. This implements the same behaviour in llvm-cov. llvm-svn: 208148	2014-05-07 02:11:18 +00:00
Joerg Sonnenberger	818e725158	If a function needs a frame pointer, but r11 (aka fp) has not been used, remove it from the list of unspilled registers. Otherwise the following attempt to keep the stack aligned by picking an extra GPR register to spill will not work as it picks up r11. llvm-svn: 208129	2014-05-06 20:43:01 +00:00
Diego Novillo	dd49157db1	Do not make -pass-remarks additive. Summary: When I initially introduced -pass-remarks, I thought it would be a neat idea to make it additive. So, if one used it as: $ llc -pass-remarks=inliner --pass-remarks=loop.* the compiler would build the regular expression '(inliner)\|(loop.*)'. The more I think about it, the more I regret it. This is not how other flags work. The standard semantics are right-to-left overrides. This is how clang interprets -Rpass. And I think the two should be compatible in this respect. Reviewers: qcolombet Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3614 llvm-svn: 208122	2014-05-06 19:14:00 +00:00
Benjamin Kramer	1625bfccbe	TTI: Estimate @llvm.fmuladd cost as fmul + fadd when FMA's aren't legal on the target. llvm-svn: 208115	2014-05-06 18:36:23 +00:00
Andrea Di Biagio	c14ccc9184	[X86] Improve the lowering of BITCAST dag nodes from type f64 to type v2i32 (and vice versa). Before this patch, the backend always emitted a store+load sequence to bitconvert from f64 to i64 the input operand of an ISD::BITCAST dag node that performed a bitconvert from type MVT::f64 to type MVT::v2i32. The resulting i64 node was then used to build a v2i32 vector. With this patch, the backend now produces a cheaper SCALAR_TO_VECTOR from MVT::f64 to MVT::v2f64. That SCALAR_TO_VECTOR is then followed by a "free" bitcast to type MVT::v4i32. The elements of the resulting v4i32 are then extracted to build a v2i32 vector (which is illegal and therefore promoted to MVT::v2i64). This is in general cheaper than emitting a stack store+load sequence to bitconvert the operand from type f64 to type i64. llvm-svn: 208107	2014-05-06 17:09:03 +00:00
Renato Golin	c7aea40ec6	Implementing named register intrinsics This patch implements the infrastructure to use named register constructs in programs that need access to specific registers (bare metal, kernels, etc). So far, only the stack pointer is supported as a technology preview, but as it is, the intrinsic can already support all non-allocatable registers from any architecture. llvm-svn: 208104	2014-05-06 16:51:25 +00:00
Rafael Espindola	52dc5d828f	Special case aliases in GlobalValue::getAlignment. An alias has the address of what it points to, so it also has the same alignment. This allows a few optimizations to see past aliases for free. llvm-svn: 208103	2014-05-06 16:48:58 +00:00
Tim Northover	618850b6a5	AArch64/ARM64: implement diagnosis of unpredictable loads & stores llvm-svn: 208091	2014-05-06 14:15:14 +00:00
Tim Northover	2ac82426f8	AArch64/ARM64: add two more MC tests to ARM64 set. llvm-svn: 208085	2014-05-06 12:50:58 +00:00
Tim Northover	d450746dc9	AArch64/ARM64: enable MC-level diagnostic tests for NEON insts. Obviously we can't expect the two backends to produce identical diagnostics, since what's possible depends quite a bit on how the .td files are structured. I think the ARM64 diagnostics are basically of the same quality in all the changed cases, so I've split the CHECK lines. llvm-svn: 208084	2014-05-06 12:50:55 +00:00
Tim Northover	15641cd4e1	AArch64/ARM64: make NEON vector list parsing a bit more robust It doesn't change the results, but it seems silly not to diagnose obvious problems early on. llvm-svn: 208083	2014-05-06 12:50:51 +00:00
Tim Northover	0f54f309bb	AArch64/ARM64: produce more informative diagnostic assembling some immediates No tests here, they'll be added when the entire neon-diagnostics.s test from AArch64 is enabled. llvm-svn: 208079	2014-05-06 11:18:53 +00:00
Christian Pirker	fdce7cea93	ARM: For thumb fixups store halfwords high first and low second llvm-svn: 208076	2014-05-06 10:05:11 +00:00
Kevin Qin	1353c3405d	[ARM64] Enable alignment control option in front-end for ARM64. This is the modification in llvm part. llvm-svn: 208074	2014-05-06 09:48:52 +00:00
Reid Kleckner	4a406d32e9	Fix i128 div/mod on mingw64 The Win64 docs are very clear that anything larger than 8 bytes is passed by reference, and GCC MinGW64 honors that for __modti3 and friends. Patch by Jameson Nash! llvm-svn: 208029	2014-05-06 01:20:42 +00:00
Nick Lewycky	5ef6bc8815	Improve 'tail' call marking in TRE. A bootstrap of clang goes from 375k calls marked tail in the IR to 470k, however this improvement does not carry into an improvement of the call/jmp ratio on x86. The most common pattern is a tail call + br to a block with nothing but a 'ret'. The number of tail call to loop conversions remains the same (1618 by my count). The new algorithm does a local scan over the use-def chains to identify local "alloca-derived" values, as well as points where the alloca could escape. Then, a visit over the CFG marks blocks as being before or after the allocas have escaped, and annotates the calls accordingly. llvm-svn: 208017	2014-05-05 23:59:03 +00:00
Tom Stellard	45b3dcd35b	R600: Expand i64 ISD:SUB llvm-svn: 208005	2014-05-05 21:47:15 +00:00
Filipe Cabecinhas	fe59062b75	Revert "Optimize shufflevector that copies an i64/f64 and zeros the rest." This reverts commit 207992. I misread the phab number on the LGTM. llvm-svn: 207993	2014-05-05 19:40:36 +00:00
Filipe Cabecinhas	263d98c19f	Optimize shufflevector that copies an i64/f64 and zeros the rest. Summary: Also ran clang-format on the function. The code added is the last else if block. Reviewers: nadav, craig.topper Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3518 llvm-svn: 207992	2014-05-05 19:36:28 +00:00
Michael Zolotukhin	e37f33c466	Move test from r207969 to another folder and rename it. llvm-svn: 207984	2014-05-05 18:10:15 +00:00
Yi Jiang	a4821fc9fb	Always set alignment of vectorized LD/ST in SLP-Vectorizer. <rdar://problem/16812145> llvm-svn: 207983	2014-05-05 17:59:14 +00:00
Joerg Sonnenberger	302be7e891	Fix spelling. llvm-svn: 207982	2014-05-05 17:58:46 +00:00
Duncan P. N. Exon Smith	1789fb6493	LTO: -internalize sets visibility to default Visibility is meaningless when the linkage is local. Change `-internalize` to reset the visibility to `default`. <rdar://problem/16141113> llvm-svn: 207979	2014-05-05 17:40:44 +00:00
Rafael Espindola	595f54205c	Remove the -disable-cfi option. This also adds a release note about it. If this stays I will clean up MC next week. llvm-svn: 207977	2014-05-05 17:33:26 +00:00
Adam Nemet	47c4e4e46d	[Test] Remove substitution for clang clang should not be used in the llvm tests. The topic was discussed in this thread: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20140428/214905.html llvm-svn: 207976	2014-05-05 17:17:27 +00:00
Rafael Espindola	82ad91915e	Modify test to not use -disable-cfi. llvm-svn: 207974	2014-05-05 16:47:07 +00:00
Rafael Espindola	665bd05095	Move test to the ARM64 directory. llvm-svn: 207972	2014-05-05 16:14:37 +00:00
Rafael Espindola	f463b63448	Convert a CodeGen test into a MC test. llvm-svn: 207971	2014-05-05 15:34:13 +00:00
Michael Zolotukhin	4e030e8fb4	Fix test from r207966 and add a comment there. llvm-svn: 207969	2014-05-05 14:46:53 +00:00
Michael Zolotukhin	0c380a30d8	Add regression test for r207692. llvm-svn: 207966	2014-05-05 14:05:25 +00:00
Saleem Abdulrasool	e8a7afef86	CodeGen: correct memset emittance for WoA Windows on ARM does not conform to AEABI. However, memset would be emitted using the AEABI signature, resulting in inverted parameters. Handle this special case appropriately. llvm-svn: 207943	2014-05-04 23:13:21 +00:00
Saleem Abdulrasool	9c4716e4b6	CodeGen: strengthen WoA AEABI avoidance tests Add additional test cases for WoA AEABI avoidance checking. llvm-svn: 207942	2014-05-04 23:13:18 +00:00
Saleem Abdulrasool	729c7a08fb	MC: support FK_SecRel_4 for Windows on ARM Add handling for FK_SecRel_4 (4-byte section relative relocations). These are used by the generation of DWARF debug information (the abbreviations use section relative relocations). This will also be used in generation of CodeView line tables. llvm-svn: 207941	2014-05-04 23:13:15 +00:00
Benjamin Kramer	9130cb8547	LoopUnroll: If we're doing partial unrolling, use the PartialThreshold to limit unrolling. Otherwise we use the same threshold as for complete unrolling, which is way too high. This made us unroll any loop smaller than 150 instructions by 8 times, but only if someone specified -march=core2 or better, which happens to be the default on darwin. llvm-svn: 207940	2014-05-04 19:12:38 +00:00
Arnold Schwaighofer	cd566c423a	SLPVectorizer: Bring back the insertelement patch (r205965) with fixes We can't assume a vectorized tree is rooted in an instruction. The IRBuilder could have constant folded it. When we rebuild the build_vector (the series of InsertElement instructions) use the last original InsertElement instruction. The vectorized tree root is guaranteed to be before it. Also, we can't assume that the n-th InsertElement inserts the n-th element into a vector. This reverts r207746 which reverted the revert of the revert of r205018 or so. Fixes the test case in PR19621. llvm-svn: 207939	2014-05-04 17:10:15 +00:00
Elena Demikhovsky	e73333a50f	AVX-512: minor change in rndscale intrinsic llvm-svn: 207937	2014-05-04 13:35:37 +00:00
Saleem Abdulrasool	82b69fa105	X86: repair export compatibility with MinGW/cygwin Both MinGW and cygwin (i686) construct export directives without the global leader prefix. This is mostly due to the fact that they use GNU ld which does not correctly handle the export directive. This apparently has been broken for a while. However, this was recently reported as being broken by mingwandroid and diorcety of the msys2 project. Remove the global leader prefix if targeting MinGW or cygwin, otherwise, retain the global leader prefix. Add an explicit test for cygwin's behaviour of export directives. llvm-svn: 207926	2014-05-04 00:03:48 +00:00
Rafael Espindola	3d082fa507	Fix pr19645. The fix itself is fairly simple: move getAccessVariant to MCValue so that we replace the old weak expression evaluation with the far more general EvaluateAsRelocatable. This then requires that EvaluateAsRelocatable stop when it finds a non trivial reference kind. And that in turn requires the ELF writer to look harder for weak references. Last but not least, this found a case where we were being bug by bug compatible with gas and accepting an invalid input. I reported pr19647 to track it. llvm-svn: 207920	2014-05-03 19:57:04 +00:00
Joey Gouly	b0afd1b929	[ARM64] Correctly select ANDWri in FastISel. http://reviews.llvm.org/D3598 llvm-svn: 207917	2014-05-03 17:27:06 +00:00
Karthik Bhat	ddd0cb5ecf	Vectorize intrinsic math function calls in SLPVectorizer. This patch adds support to recognize and vectorize intrinsic math functions in SLPVectorizer. Review: http://reviews.llvm.org/D3560 and http://reviews.llvm.org/D3559 llvm-svn: 207901	2014-05-03 09:59:54 +00:00
Adam Nemet	6a56c37b95	[LSR] Add llc testcase for r207271/r207569. See PR19608 for the details but to summarize it was easy to modify the .ll file to get the desired def-use ordering. llvm-svn: 207887	2014-05-02 23:49:01 +00:00
Chandler Carruth	271635da0d	[sanitizers] Propagate the sanitizer options through to the lit context. This makes it really easy to debug leaks FYI: ASAN_OPTIONS=detect_leaks=1 ./bin/llvm-lit -v <path to test> llvm-svn: 207874	2014-05-02 21:47:35 +00:00
Justin Bogner	c475e1bc77	llvm-cov: Fix handling of line zero appearing in a line table Reading line tables in llvm-cov was pretty broken, but would happen to work as long as no line in the table was 0. It's not clear to me whether a line of zero should show up in these tables, but deciding to read a string in the middle of the line table is certainly the wrong thing to do if it does. I've also added some comments, as trying to figure out what this block of code was doing was fairly unpleasant. llvm-svn: 207866	2014-05-02 20:01:24 +00:00
Daniel Sanders	6ef0a2f1be	[tablegen] !strconcat accepts more than two arguments but this wasn't documented or tested. Summary: * Updated the documentation * Added a test for >2 arguments * Added a check for the lexical concatenation * Made the existing test a bit stricter. Reviewers: t.p.northover Reviewed By: t.p.northover Subscribers: t.p.northover, llvm-commits Differential Revision: http://reviews.llvm.org/D3485 llvm-svn: 207865	2014-05-02 19:25:52 +00:00
Nico Weber	4b2acde21a	Teach GlobalDCE how to remove empty global_ctor entries. This moves most of GlobalOpt's constructor optimization code out of GlobalOpt into Transforms/Utils/CDtorUtils.{h,cpp}. The public interface is a single function OptimizeGlobalCtorsList() that takes a predicate returning which constructors to remove. GlobalOpt calls this with a function that statically evaluates all constructors, just like it did before. This part of the change is behavior-preserving. Also add a call to this from GlobalDCE with a filter that removes global constructors that contain a "ret" instruction and nothing else – this fixes PR19590. llvm-svn: 207856	2014-05-02 18:35:25 +00:00
Akira Hatanaka	f76388dd7e	[GVN] Pass the phi-translated address of a load instead of the untranslated address to AnalyzeLoadFromClobberingLoad. This fixes a bug in load-PRE where PRE is applied to a load that is not partially redundant. <rdar://problem/16638765>. llvm-svn: 207853	2014-05-02 17:59:17 +00:00
Saleem Abdulrasool	734bca04ff	MC: place .file records into the correct section .file records are supposed to have a section identifier of 65534 (IMAGE_SCN_DEBUG) rather than 0. This is spelt out clearly within the PE/COFF specification. Fix this minor oversight with the implementation for support for .file records. llvm-svn: 207851	2014-05-02 17:45:24 +00:00
Tim Northover	820e041a3c	DAGCombine: prevent formation of illegal ConstantFP nodes. llvm-svn: 207850	2014-05-02 17:25:02 +00:00
Tom Stellard	3dbf1f8df0	R600: Expand vector sin and cos. v2: move code to AMDGPUISelLowering.cpp squash with tests (both EG and SI) Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 207845	2014-05-02 15:41:47 +00:00
Tom Stellard	605e116e8e	R600: Expand TruncStore i64 -> {i16,i8} llvm-svn: 207844	2014-05-02 15:41:46 +00:00
Tim Northover	d7360900a8	AArch64/ARM64: add patterns for post-indexed ST1 ops. llvm-svn: 207840	2014-05-02 14:54:27 +00:00
Tim Northover	d0b07e133b	AArch64/ARM64: support indexed loads/stores on vector types. While post-indexed LD1/ST1 instructions do exist for vector loads, this patch makes use of the more flexible addressing-modes in LDR/STR instructions. llvm-svn: 207838	2014-05-02 14:54:15 +00:00
Benjamin Kramer	42d262f410	Allow SelectionDAG::FoldConstantArithmetic to work when it's called with a vector VT but scalar values. llvm-svn: 207835	2014-05-02 12:35:22 +00:00
Nick Lewycky	718ada97bc	Fold strlen(expr ? "str1" : "str2") to x ? len1 : len2. This fires about 330 times in a bootstrap of clang. llvm-svn: 207828	2014-05-02 04:11:45 +00:00
Michael J. Spencer	1f10c5ea94	[IR] Make {extract,insert}element accept an index of any integer type. Given the following C code llvm currently generates suboptimal code for x86-64: __m128 bss4( const __m128 ptr, size_t i, size_t j ) { float f = ptr[i][j]; return (__m128) { f, f, f, f }; } ================================================= define <4 x float> @_Z4bss4PKDv4_fmm(<4 x float> nocapture readonly %ptr, i64 %i, i64 %j) #0 { %a1 = getelementptr inbounds <4 x float>* %ptr, i64 %i %a2 = load <4 x float>* %a1, align 16, !tbaa !1 %a3 = trunc i64 %j to i32 %a4 = extractelement <4 x float> %a2, i32 %a3 %a5 = insertelement <4 x float> undef, float %a4, i32 0 %a6 = insertelement <4 x float> %a5, float %a4, i32 1 %a7 = insertelement <4 x float> %a6, float %a4, i32 2 %a8 = insertelement <4 x float> %a7, float %a4, i32 3 ret <4 x float> %a8 } ================================================= shlq $4, %rsi addq %rdi, %rsi movslq %edx, %rax vbroadcastss (%rsi,%rax,4), %xmm0 retq ================================================= The movslq is unneeded, but is present because of the trunc to i32 and then sext back to i64 that the backend adds for vbroadcastss. We can't remove it because it changes the meaning. The IR that clang generates is already suboptimal. What clang really should emit is: %a4 = extractelement <4 x float> %a2, i64 %j This patch makes that legal. A separate patch will teach clang to do it. Differential Revision: http://reviews.llvm.org/D3519 llvm-svn: 207801	2014-05-01 22:12:39 +00:00
Reed Kotler	bab3f23da6	Add basic functionality for assignment of ints. This creates a lot of core infrastructure in which to add, with little effort, quite a bit more to mips fast-isel Test Plan: simplestore.ll Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3527 llvm-svn: 207790	2014-05-01 20:39:21 +00:00
Rafael Espindola	ea9f9d4030	Don't propagate StorageClass and ComplexType to aliases. This matches gas' behaviour on COFF. I think that this yak is now sufficiently shaved for aliases with offset to work. llvm-svn: 207786	2014-05-01 19:02:03 +00:00
Eli Bendersky	a108a65df2	Add an optimization that does CSE in a group of similar GEPs. This optimization merges the common part of a group of GEPs, so we can compute each pointer address by adding a simple offset to the common part. The optimization is currently only enabled for the NVPTX backend, where it has a large payoff on some benchmarks. Review: http://reviews.llvm.org/D3462 Patch by Jingyue Wu. llvm-svn: 207783	2014-05-01 18:38:36 +00:00
David Blaikie	748be6c376	DebugInfo: Correct the attribute type kind. Post commit review feedback from Paul Robinson regarding r207777. llvm-svn: 207782	2014-05-01 18:31:21 +00:00
David Blaikie	0f82c225b8	PR19623: Implement typedefs of void. This is the LLVM portion that will allow Clang and other frontends to emit typedefs of void by providing a null type for the typedef's underlying type. llvm-svn: 207777	2014-05-01 17:56:13 +00:00
Matt Arsenault	06028dd7be	R600/SI: Fix verifier error with pseudo store instructions. Use i32 instead of specifying SReg_32. When this is the pseudo INDIRECT_BASE_ADDR, this would give a bogus verifier error. llvm-svn: 207770	2014-05-01 16:37:52 +00:00
Rafael Espindola	575f79a409	Compute the correct section for zed = foo + 1 in COFF. This fixes pr19147. There are a few more related issues to fix, but the testcase in the bug now passes. llvm-svn: 207763	2014-05-01 13:37:57 +00:00
Bradley Smith	3567cc1b42	[ARM64] Prefer generation of bzero on Darwin only llvm-svn: 207760	2014-05-01 13:11:59 +00:00
Rafael Espindola	4a04294882	Don't force symbols to be globals in .thumb_set. We currently force symbols to be globals in .thumb_set. The intent seems to be that given .thumb_set foo, bar we emit an undefined symbol to bar if it is never defined. The side effect is that we mark bar as global, even if it is defined, which gas does not. Producing an undefined reference to bar is a general difference from MC and gas. For example, given a = b gas will produce an undefined reference to b, MC will not. I would be surprised if any code depends on this, but if it does, we should fix the general difference, not special case .thumb_set. llvm-svn: 207757	2014-05-01 12:45:43 +00:00
Tim Northover	05017b1f8c	AArch64/ARM64: rewrite test to use FileCheck & add ARM64 lines llvm-svn: 207754	2014-05-01 12:30:01 +00:00
Tim Northover	4ec135fa2e	AArch64/ARM64: port basic disassembly tests to ARM64. llvm-svn: 207753	2014-05-01 12:29:56 +00:00
Tim Northover	534acbdf73	AArch64/ARM64: print BFM instructions as BFI or BFXIL The canonical form of the BFM instruction is always one of the more explicit extract or insert operations, which makes reading output much easier. llvm-svn: 207752	2014-05-01 12:29:38 +00:00
Richard Barton	3db1d580b3	Correction to assert statement to allow 32-bit unsigned numbers with the top bit set. This fixes an ARM assembler crash - regression test added. llvm-svn: 207747	2014-05-01 11:37:44 +00:00
Chandler Carruth	18c2fbb143	Revert r205965, which essentially reverts r205018 for the second time. =[ Turns out that this was the root cause of PR19621. We found a crasher only recently (likely due to improvements elsewhere in the SLP vectorizer) but the reduced test case failed all the way back to here. I've confirmed that reverting this patch both fixes the reduced test case in PR19621 and the actual source file that led to it, so it seems to really be rooted here. I've replied to the commit thread with discussion of my (feeble) attempts to debug this. Didn't make it very far, so reverting now that we have a good test case so that things can get back to healthy while the debugging carries on. llvm-svn: 207746	2014-05-01 11:24:11 +00:00
Simon Atanasyan	c48c58437d	[llvm-readobj] Add support for Mips specific ELF header e_flags. llvm-svn: 207744	2014-05-01 11:07:19 +00:00
Bradley Smith	f57d5ca234	[ARM64] Conditionalize CPU specific system registers on subtarget features llvm-svn: 207742	2014-05-01 10:25:36 +00:00
Matheus Almeida	d92a3fa212	[mips] Move expansion of .cpsetup to target streamer. Summary: There are two functional changes: 1) The directive is not expanded for the ASM->ASM code path. 2) If PIC is not set, there's no expansion for the ASM->OBJ code path (same behaviour as GAS). Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3482 llvm-svn: 207741	2014-05-01 10:24:46 +00:00
Daniel Sanders	88fbbcaa30	[mips] Removed two-operand alias for sllv, sr[al]v, rotrv, dsllv, dsr[al]v, and drotrv GAS doesn't actually accept these particular cases. The mnemonic without the trailing 'v' still supports two-operand aliases. llvm-svn: 207740	2014-05-01 10:08:36 +00:00
Oliver Stannard	7eacbd5a71	Record the DWARF version in MCContext Record the DWARF version in MCContext, and use it when emitting the dwarf version into the debug info. llvm-svn: 207739	2014-05-01 08:46:02 +00:00
Rafael Espindola	ff68cb7f4c	Start fixing pr19147. This makes the coff writer compute the correct symbol value for the test in pr19147. The section is still incorrect, that will be fixed in a followup patch. llvm-svn: 207728	2014-05-01 00:10:17 +00:00
David Blaikie	899ae61fee	Revert "Emit DW_AT_object_pointer once, on the declaration, for each function." Breaks GDB buildbot (http://lab.llvm.org:8011/builders/clang-x86_64-ubuntu-gdb-75/builds/14517) GCC emits DW_AT_object_pointer /everywhere/ (declaration, abstract definition, inlined subroutine), but it looks like GCC relies on it being somewhere other than the declaration, at least. I'll experiment further & can hopefully still remove it from the inlined_subroutine. This reverts commit r207705. llvm-svn: 207719	2014-04-30 22:58:19 +00:00
David Blaikie	44078b3260	DebugInfo: Omit DW_AT_artificial on DW_TAG_formal_parameters in DW_TAG_inlined_subroutines. They just don't need to be there - they're inherited from the abstract definition. In theory I would like them to be inherited from the declaration, but the DWARF standard doesn't quite say that... we can probably do it anyway but I'm less confident about that so I'll leave it for a separate commit. llvm-svn: 207717	2014-04-30 22:41:33 +00:00
James Molloy	cbb9791e3b	Move a testcase from ELF to ARM64, incorrectly placed in r207627. llvm-svn: 207706	2014-04-30 21:31:11 +00:00
David Blaikie	3b2a53a437	Emit DW_AT_object_pointer once, on the declaration, for each function. This effectively reverts r164326, but adds some comments and justification and ensures we /don't/ emit the DW_AT_object_pointer on the (abstract and concrete) definitions. (while still preserving it on standalone definitions involving ObjC Blocks) This does increase the size of member function declarations from 7 to 11 bytes, unfortunately, but still seems like the Right Thing to do so that callers that see only the declaration still have the information about the object pointer. That said, I don't know what, if any, DWARF consumers don't have a heuristic to guess this in the case of normal C++ member functions - perhaps we can remove it entirely. llvm-svn: 207705	2014-04-30 21:29:41 +00:00
Alexey Samsonov	c717e7803c	Don't expect to find fpcmp and PerfectShuffle when running lit tests llvm-svn: 207704	2014-04-30 21:26:35 +00:00
Weiming Zhao	7f6daf1799	[ARM64] Prevent bit extraction from being adjusted by a following shift. For a pattern like ((x >> C1) & Mask) << C2, the DAG combiner may convert it into (x >> (C1-C2)) & (Mask << C2), which makes pattern matching of ubfx more difficult. For example: Given %shr = lshr i64 %x, 4 %and = and i64 %shr, 15 %arrayidx = getelementptr inbounds [8 x [64 x i64]]* @arr, i64 0, i64 2, i64 %and %0 = load i64* %arrayidx With current shift folding, it takes 3 instrs to compute base address: lsr x8, x0, #1 and x8, x8, #0x78 add x8, x9, x8 If using ubfx, it only needs 2 instrs: ubfx x8, x0, #4, #4 add x8, x9, x8, lsl #3 This fixes bug 19589 llvm-svn: 207702	2014-04-30 21:07:24 +00:00
James Molloy	3ef43692b4	Add a testcase for r207627. llvm-svn: 207697	2014-04-30 20:06:26 +00:00
Hans Wennborg	59f0cba30f	Use the new StringTableBuilder in yaml2elf http://reviews.llvm.org/D3574 llvm-svn: 207694	2014-04-30 19:38:09 +00:00
Michael Zolotukhin	1f4a960ccf	[X86] Never hoist the shift value of a shift instruction. There is no need to check if we want to hoist the immediate value of an shift instruction. Simply return TCC_Free right away. This change is like r206101, but for X86. rdar://problem/16190769 llvm-svn: 207692	2014-04-30 19:17:32 +00:00
Carlo Kok	307625c974	[IPO/MergeFunctions] changes so it doesn't try to bitcast a struct return type but instead recreates it with insert/extract value. llvm-svn: 207679	2014-04-30 17:53:04 +00:00
David Majnemer	91db08bfe4	IR: Conservatively verify inalloca arguments Summary: Try to spot obvious mismatches with inalloca use. Reviewers: rnk Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3572 llvm-svn: 207676	2014-04-30 17:22:00 +00:00
Matheus Almeida	e844872830	[mips] Add instruction alias (negu). Summary: negu $reg is equivalent to negu $reg, $reg. Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3510 llvm-svn: 207673	2014-04-30 16:53:49 +00:00
Matheus Almeida	b7be52343d	[mips] Add instruction alias (sltu). Summary: The pattern sltu $r1, $r2, $imm is found in handwritten assembly which is just a shorthand version of sltui $r1, $r2, $imm. Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3508 llvm-svn: 207671	2014-04-30 16:29:56 +00:00
Hans Wennborg	83e6e1e926	ELFObjectWriter: deduplicate suffixes in strtab We already do this for shstrtab, so might as well do it for strtab. This extracts the string table building code into a separate class. The idea is to use it for other object formats too. I mostly wanted to do this for the general principle, but it does save a little bit on object file size. I tried this on a clang bootstrap and saved 0.54% on the sum of object file sizes (1.14 MB out of 212 MB for a release build). Differential Revision: http://reviews.llvm.org/D3533 llvm-svn: 207670	2014-04-30 16:25:02 +00:00
Tim Northover	a8c577e454	ARM64: print fp immediates without using scientific notation. llvm-svn: 207669	2014-04-30 16:13:34 +00:00
Tim Northover	7346f062b6	AArch64/ARM64: implement remaining TLS relocations (purely MC). llvm-svn: 207668	2014-04-30 16:13:26 +00:00
Tim Northover	b8fb7f4193	AArch64/ARM64: add specific diagnostic for MRS/MSR and enable tests. llvm-svn: 207667	2014-04-30 16:13:20 +00:00
Tim Northover	3c9a9401d5	AArch64/ARM64: accept and print floating-point immediate 0 as "#0.0" It's been decided that in the future, the floating-point immediate in instructions like "fcmeq v0.2s, v1.2s, #0.0" will be canonically "0.0", which has been implemented on AArch64 already but not ARM64. This fixes that issue. llvm-svn: 207666	2014-04-30 16:13:07 +00:00
Matheus Almeida	56df6ff2c5	[mips] Add instruction alias (dsll and dsrl). Summary: The pattern dsll/dsrl $rd, $rt, $rs is found in handwritten assembly which is just a shorthand version of dsllv/dsrlv $rd, $rt, $rs. Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3486 llvm-svn: 207664	2014-04-30 16:00:49 +00:00
Rafael Espindola	b36f6189a5	Relax the test a bit. It is not relevant where the symbol and section names are stored, just their values. llvm-svn: 207662	2014-04-30 15:32:21 +00:00
Tom Stellard	1bd80725b3	R600/SI: Use VALU instructions for copying i1 values We can't use SALU instructions for this since they ignore the EXEC mask and are always executed. This fixes several OpenCV tests. llvm-svn: 207661	2014-04-30 15:31:33 +00:00
Tom Stellard	0c354f25c9	R600/SI: Teach moveToVALU how to handle some SMRD instructions llvm-svn: 207660	2014-04-30 15:31:29 +00:00
Chad Rosier	864e35db0a	[ARM64][fast-isel] Fast-isel doesn't know how to handle f128. llvm-svn: 207659	2014-04-30 15:29:57 +00:00
Rafael Espindola	194924e64b	Rename the test, it is testing the symver directive. llvm-svn: 207658	2014-04-30 15:27:44 +00:00
Matheus Almeida	312ac02491	[mips] Add instruction alias (sll and srl). Summary: The pattern sll/srl $rd, $rt, $rs is found in handwritten assembly which is just a shorthand version of sllv/srlv $rd, $rt, $rs. Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3483 llvm-svn: 207657	2014-04-30 15:23:04 +00:00
Sasa Stankovic	7b061a42b1	[mips] Fix MipsLongBranch pass to work when the offset from the branch to the target cannot be determined accurately. This is the case for NaCl where the sandboxing instructions are added in MC layer, after the MipsLongBranch pass. It is also the case when the code has inline assembly. Instead of calculating offset in the MipsLongBranch pass, use %hi(sym1 - sym2) and %lo(sym1 - sym2) expressions that are resolved during the fixup. This patch also deletes microMIPS test file test/CodeGen/Mips/micromips-long-branch.ll and implements microMIPS CHECKs in a much simpler way in a file test/CodeGen/Mips/longbranch.ll, together with MIPS32 and MIPS64. llvm-svn: 207656	2014-04-30 15:06:25 +00:00
Matheus Almeida	bbd5e85e21	[mips] Update tests with encoding information for slt, slti, sltiu and sltu. Summary: Also renamed non-portable register names (e.g. $t2) so that we don't end up with a different encoding for what appears to be an equivalent instruction. Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3505 llvm-svn: 207655	2014-04-30 14:52:57 +00:00
Tim Northover	3ffee2340e	ARM64: enable AArch64's basic-a64-instructions test llvm-svn: 207650	2014-04-30 13:37:10 +00:00
Tim Northover	0ac99404f0	ARM64: print lsr instead of lsrv for variable shifts (etc) The canonical syntax for shifts by a variable amount does not end with 'v', but that syntax should be supported as an alias (presumably for legacy reasons). llvm-svn: 207649	2014-04-30 13:37:07 +00:00
Tim Northover	7030f05b4f	ARM64: use 32-bit operations for uxtb & uxth Testing will be enabled shortly with basic-a64-instructions.s llvm-svn: 207648	2014-04-30 13:37:02 +00:00
Tim Northover	a307769b15	AArch64/ARM64: copy support for bCC instead of b.CC across. llvm-svn: 207646	2014-04-30 13:36:56 +00:00
Tim Northover	20ad359b77	AArch64/ARM64: use HS instead of CS & LO instead of CC. On instructions using the NZCV register, a couple of conditions have dual representations: HS/CS and LO/CC (meaning unsigned-higher-or-same/carry-set and unsigned-lower/carry-clear). The first of these is more descriptive in most circumstances, so we should print it. llvm-svn: 207644	2014-04-30 13:14:03 +00:00
Daniel Sanders	e296a0fce5	[mips][msa] Fix vector insertions where the index is variable Summary: This isn't supported directly so we rotate the vector by the desired number of elements, insert to element zero, then rotate back. The i64 case generates rather poor code on MIPS32. There is an obvious optimisation to be made in future (do both insert.w's inside a shared rotate/unrotate sequence) but for now it's sufficient to select valid code instead of aborting. Depends on D3536 Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://reviews.llvm.org/D3537 llvm-svn: 207640	2014-04-30 12:09:32 +00:00
Tim Northover	f9941a9dc6	ARM64: accept ELF-relocated load/store insts without a #. E.g. we print "ldr x0, [x0, :lo12:symbol]" so we need to accept that syntax too. llvm-svn: 207639	2014-04-30 12:00:20 +00:00
Matheus Almeida	525bc4f708	[mips] Add support for .cpload. Summary: This directive is used for setting up $gp in the beginning of a function. It expands to three instructions if PIC is enabled: lui $gp, %hi(_gp_disp) addiu $gp, $gp, %lo(_gp_disp) addu $gp, $gp, $reg _gp_disp is a special symbol that the linker sets to the distance between the lui instruction and the context pointer (_gp). Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3480 llvm-svn: 207637	2014-04-30 11:28:42 +00:00
Matheus Almeida	c0284d118f	[mips] Emit all three relocation operations for each relocation entry on Mips64 big-endian systems. Summary: The N64 ABI allows up to three operations to be specified per relocation record independently of the endianness. Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3529 llvm-svn: 207636	2014-04-30 11:21:10 +00:00
Tim Northover	970c4a8d35	ARM64: use hex immediates for movz/movk instructions Since these are mostly used in "lsl #16", "lsl #32", "lsl #48" combinations to piece together an immediate in 16-bit chunks, hex is probably the most appropriate format. llvm-svn: 207635	2014-04-30 11:19:40 +00:00
Tim Northover	4b2f8a990e	ARM64: hexify printing various immediate operands This is mostly aimed at the NEON logical operations and MOVI/MVNI (since they accept weird shifts which are more naturally understandable in hex notation). Also changes BRK/HINT etc, which is probably a neutral change, but easier than the alternative. llvm-svn: 207634	2014-04-30 11:19:28 +00:00
Tim Northover	cfd6e66544	ARM64: print canonical syntax for add/sub (imm) instructions. Since these instructions only accept a 12-bit immediate, possibly shifted left by 12, the canonical syntax used by the architecture reference manual is "#N {, lsl #12 }". We should accept an immediate that has already been shifted, (e.g. Also, print a comment giving the full addend since it can be helpful. llvm-svn: 207633	2014-04-30 11:19:15 +00:00
James Molloy	7c39df37b2	[ARM64] Ensure arm64_be is dealt with when emitting debug info. This is a partial port of r204816 (cpirker "Elf support for MC-JIT runtime dynamic linker") from AArch64 to ARM64. llvm-svn: 207625	2014-04-30 10:15:35 +00:00
Tim Northover	41cec5c3cb	ARM64: make sure FastISel uses a GPR64 source in 64-bit extensions. llvm-svn: 207620	2014-04-30 09:32:01 +00:00
Saleem Abdulrasool	25947c318b	ARM: support stack probe emission for Windows on ARM This introduces the stack lowering emission of the stack probe function for Windows on ARM. The stack on Windows on ARM is a dynamically paged stack where any page allocation which crosses a page boundary of the following guard page will cause a page fault. This page fault must be handled by the kernel to ensure that the page is faulted in. If this does not occur and a write accesses any memory beyond that, the page fault will go unserviced, resulting in an abnormal program termination. The watermark for the stack probe appears to be at 4080 bytes (for accommodating the stack guard canaries and stack alignment) when SSP is enabled. Otherwise, the stack probe is emitted on the page size boundary of 4096 bytes. llvm-svn: 207615	2014-04-30 07:05:07 +00:00
Saleem Abdulrasool	0aca1c30c6	ARM: print COFF function header for Windows on ARM Emit the COFF header when printing out the function. This is important as the header contains two important pieces of information: the storage class for the symbol and the symbol type information. This bit of information is required for the linker to correctly identify the type of symbol that it is dealing with. llvm-svn: 207613	2014-04-30 06:14:25 +00:00
Saleem Abdulrasool	f8222631a5	ARM: partially handle 32-bit relocations for WoA IMAGE_REL_ARM_MOV32T relocations require that the movw/movt pair-wise relocation is not split up and reordered. When expanding the mov32imm pseudo-instruction, create a bundle if the machine operand is referencing an address. This helps ensure that the relocatable address load is not reordered by subsequent passes. Unfortunately, this only partially handles the case as the Constant Island Pass occurs after the instructions are unbundled and does not properly handle bundles. That is a more fundamental issue with the pass itself and beyond the scope of this change. llvm-svn: 207608	2014-04-30 04:54:58 +00:00
Reid Kleckner	fb69308568	Implement X86 code generation for musttail Currently, musttail codegen is relying on sibcall optimization, and reporting a fatal error if it fails. Sibcall optimization fails when stack arguments need to be modified, which is insufficient for musttail. The logic for moving arguments in memory safely is already implemented for GuaranteedTailCallOpt. This change merely arranges for musttail calls to use it. No functional change for GuaranteedTailCallOpt. Reviewers: espindola Differential Revision: http://reviews.llvm.org/D3493 llvm-svn: 207598	2014-04-29 23:55:41 +00:00
Tom Stellard	919bb6b83f	R600/SI: Custom lower SI_IF and SI_ELSE to avoid machine verifier errors SI_IF and SI_ELSE are terminators which also produce a value. For these instructions ISel always inserts a COPY to move their value to another basic block. This COPY ends up between SI_(IF\|ELSE) and the S_BRANCH* instruction at the end of the block. This breaks MachineBasicBlock::getFirstTerminator() and also the machine verifier which assumes that terminators are grouped together at the end of blocks. To solve this we coalesce the copy away right after ISel to make sure there are no instructions in between terminators at the end of blocks. llvm-svn: 207591	2014-04-29 23:12:53 +00:00
Tom Stellard	58ac7440e6	R600/SI: Only select SALU instructions in the entry or exit block SALU instructions ignore control flow, so it is not always safe to use them within branches. This is a partial solution to this problem until we can come up with something better. llvm-svn: 207590	2014-04-29 23:12:48 +00:00
Tom Stellard	676f571999	R600: optimize the UDIVREM 64 algorithm This is a squash of several optimization commits: - calculate DIV_Lo and DIV_Hi separately - use BFE_U32 if we are operating on 32bit values - use precomputed constants instead of shifting in UDIVREM - skip the first 32 iterations of udivrem v2: Check whether BFE is supported before using it Patch by: Jan Vesely Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 207589	2014-04-29 23:12:46 +00:00
Rafael Espindola	85f3610222	Also handle ConstantAggregateZero when optimizing vpermilvar*. llvm-svn: 207582	2014-04-29 22:20:40 +00:00
Rafael Espindola	eb7bdbd0ce	Two fixes to the vpermilvar optimization. The instcombine logic to handle vpermilvar's pd and 256 variants was incorrect. The _256 variants have indexes into the individual 128 bit lanes and in all cases it also has to mask out unused bits. llvm-svn: 207577	2014-04-29 20:41:54 +00:00
Diego Novillo	cd64780d18	Fix vectorization remarks. This patch changes the vectorization remarks to also inform when vectorization is possible but not beneficial. Added tests to exercise some loop remarks. llvm-svn: 207574	2014-04-29 20:06:10 +00:00
Yi Jiang	1a3f18b161	Continue slp vectorization even if the BB already has a vectorized store radar://16641956 llvm-svn: 207572	2014-04-29 19:37:20 +00:00
Reed Kotler	67077b3032	Add Simple return instruction to Mips fast-isel Reviewers: dsanders Reviewed by: dsanders Differential Revision: http://reviews.llvm.org/D3430 llvm-svn: 207565	2014-04-29 17:57:50 +00:00
Tilmann Scheller	4418dda5ef	[ARM64] Disable regression tests for the old JIT. Since the ARM64 backend doesn't implement support for the old JIT those tests are failing when the regression tests are run on an AArch64 host. llvm-svn: 207530	2014-04-29 15:02:40 +00:00
Daniel Sanders	6857800b67	[mips][msa] Use CHECK-LABEL in basic_operations*.ll Differential Revision: http://reviews.llvm.org/D3536 llvm-svn: 207529	2014-04-29 14:28:58 +00:00
Joerg Sonnenberger	dd18d5b0f6	Parse and create GOT_PREL relocations. llvm-svn: 207526	2014-04-29 13:42:02 +00:00
Daniel Sanders	b3268e71e2	[mips][msa] Fix element extraction where the index is variable. Summary: This isn't supported directly so we splat the vector element and extract the most convenient copy. Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://reviews.llvm.org/D3530 llvm-svn: 207524	2014-04-29 13:31:37 +00:00
Rafael Espindola	b60c829a2a	Centralize the handling of the thumb bit. This patch centralizes the handling of the thumb bit around MCStreamer::isThumbFunc and makes isThumbFunc handle aliases. This fixes a corner case, but the main advantage is having just one way to check if a MCSymbol is thumb or not. This should still be refactored to be ARM only, but at least now it is just one predicate that has to be refactored instead of 3 (isThumbFunc, ELF_Other_ThumbFunc, and SF_ThumbFunc). llvm-svn: 207522	2014-04-29 12:46:50 +00:00
Tim Northover	aacce57d61	ARM: fix test after change to indirect symbol emission. llvm-svn: 207519	2014-04-29 10:13:10 +00:00
Tim Northover	9e7782dcf3	X86: emit hidden stubs into a proper non_lazy_symbol_pointer section. rdar://problem/16660411 llvm-svn: 207518	2014-04-29 10:06:10 +00:00
Tim Northover	2372301bcf	ARM: emit hidden stubs into a proper non_lazy_symbol_pointer section. rdar://problem/16660411 llvm-svn: 207517	2014-04-29 10:06:05 +00:00
Benjamin Kramer	e1ab3f062e	AArch64: Mark vector long multiplication as expand. There are no patterns for this. This was already fixed for ARM64 but I forgot to apply it to AArch64 too. llvm-svn: 207515	2014-04-29 09:37:54 +00:00
Elena Demikhovsky	299cf511c4	AVX-512: optimized a shuffle pattern to VINSERTI64x4. Added intrinsics for VPERMT2PS/PD/D/Q instructions. llvm-svn: 207513	2014-04-29 09:09:15 +00:00
Zinovy Nis	d373fec199	[OPENMP][LV][D3423] Respect Hints.Force meta-data for loops in LoopVectorizer llvm-svn: 207512	2014-04-29 08:55:11 +00:00
Hao Liu	6db3410071	[ARM64] Fix a bug with incorrect operand order in an EXT instruction, which was introduced by r207485. llvm-svn: 207500	2014-04-29 07:51:19 +00:00
Hao Liu	cf37110920	[ARM64]Fix a bug when lowering shuffle vector to an EXT instruction. E.g. Mask like <-1, -1, 1, ...> will generate incorrect EXT index. llvm-svn: 207485	2014-04-29 01:50:36 +00:00
Chandler Carruth	c71b2c3c7f	Revert r207271 for now. This commit introduced a test case that ran clang directly from the LLVM test suite! That doesn't work. I've followed up on the review thread to try and get a viable solution sorted out, but trying to get the tree clean here. llvm-svn: 207462	2014-04-28 23:07:49 +00:00
Rafael Espindola	bc91d7e25a	Add an option for evaluating past symbols. When evaluating an assembly expression for a relocation, we want to stop at MCSymbols that are in the symbol table, even if they are variables. This is needed since the semantics may require that the relocation use them. That is not the case when computing the value of a symbol in the symbol table. There are no relocations in this case and we have to keep going until we hit a section or find out that the expression doesn't have an assembly time value. llvm-svn: 207445	2014-04-28 20:53:11 +00:00
David Blaikie	d8f0ac7b4a	DwarfDebug: Omit DW_AT_object_pointer on inlined_subroutines While refactoring out constructScopeDIE into two functions I realized we were emitting DW_AT_object_pointer in the inlined subroutine when we didn't need to (GCC doesn't, and the abstract subprogram definition has the information already). So here's the refactoring and the bug fix. This is one step of refactoring to remove some subtle memory ownership semantics. It turns out the original constructScopeDIE returned ownership in its return value in some cases and not in others. The split into two functions now separates those two semantics - further cleanup (unique_ptr, etc) will follow. llvm-svn: 207441	2014-04-28 20:27:02 +00:00
Duncan P. N. Exon Smith	c5a3139ebd	Reapply "blockfreq: Approximate irreducible control flow" This reverts commit r207287, reapplying r207286. I'm hoping that declaring an explicit struct and instantiating `addBlockEdges()` directly works around the GCC crash from r207286. This is a lot more boilerplate, though. llvm-svn: 207438	2014-04-28 20:02:29 +00:00
Hans Wennborg	e36e116826	InstCombine: don't drop 'inalloca' in PromoteCastOfAllocation (PR19569) llvm-svn: 207426	2014-04-28 17:40:03 +00:00
Chad Rosier	0def8e2652	[ARM64] Fix an issue where we were always assuming a copy was coming from a D subregister. llvm-svn: 207423	2014-04-28 16:21:50 +00:00
Rafael Espindola	3b5ee55804	Don't include an invalid symbol in the symbol table. The symbol table itself has no relocations, so it is not possible to represent things like "a = undefined + 1". With the patch we just omit these variables. That matches the behaviour of the gnu assembler. llvm-svn: 207419	2014-04-28 13:39:57 +00:00
Rafael Espindola	407f5be3cc	List the entire symbol table in this test. This will allow us to extend this test to show that other symbols don't show up in the symbol table. llvm-svn: 207418	2014-04-28 13:26:35 +00:00
Rafael Espindola	9645090181	Produce an error instead of a crash in an expr we cannot represent. llvm-svn: 207414	2014-04-28 12:40:50 +00:00
Tim Northover	7b839f833d	ARM64: diagnose use of v16-v31 in certain indexed NEON instructions. Someone couldn't bear to have a completely orthogonal set of floating-point registers, so we've got some instructions that only accept v0-v15 (coming in ARMv9, V128_prime: you're allowed v2, v3, v5, v7, ...). Anyway, we were permitting even the out of range registers during assembly (CodeGen handled it correctly). This adds a diagnostic. llvm-svn: 207412	2014-04-28 11:27:43 +00:00
Chandler Carruth	e01fd5f63a	[inliner] Significantly improve the compile time in cases like PR19499 by avoiding inlining massive switches merely because they have no instructions in them. These switches still show up where we fail to form lookup tables, and in those cases they are actually going to cause a very significant code size hit anyways, so inlining them is not the right call. The right way to fix any performance regressions stemming from this is to enhance the switch-to-lookup-table logic to fire in more places. This makes PR19499 about 5x less bad. It uncovers a second compile time problem in that test case that is unrelated (surprisingly!). llvm-svn: 207403	2014-04-28 08:52:44 +00:00
Hao Liu	9a342778b9	[ARM64] Fix a bug where UQSHL/SQSHL with a constant i64 shift amount could not be selected. llvm-svn: 207399	2014-04-28 07:34:27 +00:00
Chandler Carruth	0ef74f571c	Update tests to use the new format of printing a TimeValue. It's a bit odd to have the output of 'llvm-ar tv' depend on the format of TimeValue::str(), but that's what we have today. If anyone needs the output to remain compatible with GNU ar or old versions of llvm-ar, just shout and I'll switch the code to manually format its times. Note that there isn't a portable format -- Mac and GNU have different formats at least (thanks Rafael!) so... llvm-svn: 207387	2014-04-28 01:24:32 +00:00
Rafael Espindola	466d66358d	Add emitThumbSet to the arm target streamer. This fixes the asm printer implementation and lets the parser be unaware of what .thumb_set is. llvm-svn: 207381	2014-04-27 20:23:58 +00:00
Benjamin Kramer	ce4b3fee72	X86TTI: Adjust sdiv cost now that we can lower it on plain SSE2. Includes a fix for a horrible typo that caused all SDIV costs to be slightly off :) llvm-svn: 207371	2014-04-27 18:47:54 +00:00

24428 Commits