llvm-project

Commit Graph

Author	SHA1	Message	Date
Elena Demikhovsky	40864b690b	AVX-512 set: added mask operations, lowering BUILD_VECTOR for i1 vector types. Added intrinsics and tests. llvm-svn: 187717	2013-08-05 08:52:21 +00:00
Reed Kotler	9c285b300d	Add the saving of S2. This is needed for some of the floating point helper functions. This can be optimized out later when the remaining parts of the helper function work is moved into the Mips16HardFloat pass. For now it forces us to use the 32 bit save/restore instructions instead of the 16 bit ones. llvm-svn: 187712	2013-08-04 23:56:53 +00:00
Bob Wilson	b9549baf7f	Remove "lto_on_osx" xfails, now that -rdynamic works on Darwin. Note that this will require a recent version of the linker for Darwin builds with LTO to pass these tests. llvm-svn: 187711	2013-08-04 23:55:24 +00:00
Benjamin Kramer	5bc180c14f	X86: Turn fp selects into mask operations. double test(double a, double b, double c, double d) { return a<b ? c : d; } before: _test: ucomisd %xmm0, %xmm1 ja LBB0_2 movaps %xmm3, %xmm2 LBB0_2: movaps %xmm2, %xmm0 after: _test: cmpltsd %xmm1, %xmm0 andpd %xmm0, %xmm2 andnpd %xmm3, %xmm0 orpd %xmm2, %xmm0 Small speedup on Benchmarks/SmallPT llvm-svn: 187706	2013-08-04 12:05:16 +00:00
Elena Demikhovsky	cd46691728	AVX-512 set: added VEXTRACTPS instruction llvm-svn: 187705	2013-08-04 10:46:07 +00:00
Tim Northover	adb550068a	X86: specify CPU on new test to fix atom buildbot Apparently Atoms use lea for stack adjustment, which we weren't looking for. llvm-svn: 187704	2013-08-04 10:00:45 +00:00
Tim Northover	ecc018c7b7	X86: correct tail return address calculation Due to the weird and wondeful usual arithmetic conversions, some calculations involving negative values were getting performed in uint32_t and then promoted to int64_t, which is really not a good idea. Patch by Katsuhiro Ueno. llvm-svn: 187703	2013-08-04 09:35:57 +00:00
Reed Kotler	30cedf65ef	Clean up code for Mips16 large frame handling. llvm-svn: 187701	2013-08-04 01:13:25 +00:00
Hal Finkel	b176acb6b7	Fix PPC64 64-bit GPR inline asm constraint matching Internally, the PowerPC backend names the 32-bit GPRs R[0-9]+, and names the 64-bit parent GPRs X[0-9]+. When matching inline assembly constraints with explicit register names, on PPC64 when an i64 MVT has been requested, we need to follow gcc's convention of using r[0-9]+ to refer to the 64-bit (parent) registers. At some point, we'll probably want to arrange things so that the generic code in TargetLowering uses the AsmName fields declared in *RegisterInfo.td in order to match these inline asm register constraints. If we do that, this change can be reverted. llvm-svn: 187693	2013-08-03 12:25:10 +00:00
Akira Hatanaka	7be35cb1bf	[mips] Expand vector truncating stores and extending loads. llvm-svn: 187667	2013-08-02 19:23:33 +00:00
Joey Gouly	5d0564d2e6	[ARMv8] Add an assembler warning for the deprecated 'setend' instruction. llvm-svn: 187666	2013-08-02 19:18:12 +00:00
Nadav Rotem	5defea90e6	SLPVectorizer: Fix PR16777. PHInodes may use multiple extracted values that come from different blocks. Thanks Alexey Samsonov. llvm-svn: 187663	2013-08-02 18:40:24 +00:00
Renato Golin	0178a25fc5	Fixes ARM LNT bot from SLP change in O3 This patch fixes the multiple breakages on ARM test-suite after the SLP vectorizer was introduced by default on O3. The problem was an illegal vector type on ARMTTI::getCmpSelInstrCost() <3 x i1> which is not simple. The guard protects this code from breaking (cause of the problems) but doesn't fix the issue that is generating the odd vector in the first place, which also needs to be investigated. llvm-svn: 187658	2013-08-02 17:10:04 +00:00
Carlo Kok	4382da983a	Bugfix for making the DWARF debug strings and labels to code emitted as secrel32 instead of long opcodes (only for coff). This makes them debuggable with GDB (with fix for 64bits msvc) llvm-svn: 187656	2013-08-02 16:14:15 +00:00
Tim Northover	cf708c3284	Fix handling of CHECK-DAG combined with CHECK-NOT Patch by Daniel Sanders. llvm-svn: 187651	2013-08-02 11:32:50 +00:00
NAKAMURA Takumi	6fda3b4b86	Revert r187597, "Bugfix for making the DWARF debug strings and labels to code emitted as secrel32 instead of long opcodes (only for coff). This makes them debuggable with GDB." It broke x86_64-win32 builder in llvm/test/DebugInfo. llvm-svn: 187642	2013-08-02 03:46:05 +00:00
Eric Christopher	cdc78961d3	Temporarily revert "Debug Info Finder\|Verifier: handle DbgLoc attached to instructions." in an attempt to bring back some bots. This reverts commit r187609. llvm-svn: 187638	2013-08-02 00:49:44 +00:00
Carlo Kok	3e8c33cff5	fix for LLVM debug info on llvm-mips-linux where the label name uses % instead of L as a prefix. llvm-svn: 187623	2013-08-01 22:15:34 +00:00
Bill Wendling	a5c536e1ee	Use function attributes to indicate that we don't want to realign the stack. Function attributes are the future! So just query whether we want to realign the stack directly from the function instead of through a random target options structure. llvm-svn: 187618	2013-08-01 21:42:05 +00:00
Reed Kotler	83f879ddb2	Fix some issues with Mips16 floating when certain intrinsics are present. This is actually an LLVM bug in the way it generates signatures for these when soft float is enabled. For example, floor ends up having the signature of int64(int64). The signature part is not the same as where the actual parameter types are recorded, and those ARE of course int64(int64) when soft float is enabled. (Yes, Mips16 hard float uses soft float but with different runtime rounes but then has to interoperate with Mips32 using normal floating point). This logic will eventually be moved to the Mips16HardFloat pass so it's not worth sorting out these issues in LLVM since nobody but Mips16 cares about these signatures, as far as I know, and even I won't eventually either. llvm-svn: 187613	2013-08-01 21:17:53 +00:00
Carlo Kok	aad6a6a3e0	ARM/Hexagon testcases can't compile x86 only testcase. Reverting change to testcase & fixing check for all. llvm-svn: 187610	2013-08-01 20:53:57 +00:00
Manman Ren	4c065e779c	Debug Info Finder\|Verifier: handle DbgLoc attached to instructions. Also remove checking of llvm.dbg.sp since it is not used in generating dwarf. Current state of Finder: DebugInfoFinder tries to list all debug info MDNodes used in a module. To list debug info MDNodes used by an instruction, DebugInfoFinder provides processDeclare, processValue and processLocation to handle DbgDeclareInst, DbgValueInst and DbgLoc attached to instructions. processModule will go through all DICompileUnits in llvm.dbg.cu and list debug info MDNodes used by the CUs. TODO: 1> Finder has a list of CUs, SPs, Types, Scopes and global variables. We need to add a list of variables that are used by DbgDeclareInst and DbgValueInst. 2> MDString fields should be null or isa<MDString> and MDNode fields should be null or isa<MDNode>. We currently use empty string or int 0 to represent null. 3> Go though Verify functions and make sure that they check field types. 4> Clean up existing testing cases to remove llvm.dbg.sp and make sure each testing case has a llvm.dbg.cu. llvm-svn: 187609	2013-08-01 20:52:39 +00:00
David Blaikie	a1ae0e6ecb	DebugInfo: Emit definitions for types with no members. The absence of members was a poor/incorrect proxy for "is definition". llvm-svn: 187607	2013-08-01 20:30:22 +00:00
Carlo Kok	d0b09c42a3	change the inlinefnlocalvar testcase so it uses a triple that's not coff (doesn't seem to matter for the testcase itself, what it tests isn't triple specific), as coff has a slightly different way of emitting what it checks for. llvm-svn: 187604	2013-08-01 20:17:37 +00:00
Bob Wilson	dd52d58680	Temporarily xfail a test that breaks on OS X when building with LTO. This is another case where internalize hides a symbol that is needed by a loadable module. I am currently investigating a proper fix but this patch will get our buildbot to pass in the meantime. <rdar://problem/14578094> llvm-svn: 187601	2013-08-01 19:29:26 +00:00
Carlo Kok	afcc62024e	Bugfix for making the DWARF debug strings and labels to code emitted as secrel32 instead of long opcodes (only for coff). This makes them debuggable with GDB. fixes Bug 16249 - LLVM generates broken debug info on Windows llvm-svn: 187597	2013-08-01 18:38:14 +00:00
Tom Stellard	0344cdfe39	R600: Add 64-bit float load/store support * Added R600_Reg64 class * Added T#Index#.XY registers definition * Added v2i32 register reads from parameter and global space * Added f32 and i32 elements extraction from v2f32 and v2i32 * Added v2i32 -> v2f32 conversions Tom Stellard: - Mark vec2 operations as expand. The addition of a vec2 register class made them all legal. Patch by: Dmitry Cherkassov Signed-off-by: Dmitry Cherkassov <dcherkassov@gmail.com> llvm-svn: 187582	2013-08-01 15:23:42 +00:00
Tom Stellard	53698938a4	R600: Use 64-bit alignment for 64-bit kernel arguments llvm-svn: 187581	2013-08-01 15:23:31 +00:00
Tom Stellard	98f675a994	R600/SI: Custom lower i64 ZERO_EXTEND llvm-svn: 187580	2013-08-01 15:23:26 +00:00
Elena Demikhovsky	b1266b5447	EVEX and compressed displacement encoding for AVX512 llvm-svn: 187576	2013-08-01 13:34:06 +00:00
Richard Sandiford	fd7f4ae6d4	[SystemZ] Reuse CC results for integer comparisons with zero This also fixes a bug in the predication of LR to LOCR: I'd forgotten that with these in-place instruction builds, the implicit operands need to be added manually. I think this was latent until now, but is tested by int-cmp-45.c. It also adds a CC valid mask to STOC, again tested by int-cmp-45.c. llvm-svn: 187573	2013-08-01 10:39:40 +00:00
Richard Sandiford	a075708abe	[SystemZ] Prefer comparisons with zero Convert >= 1 to > 0, etc. Using comparison with zero isn't a win on its own, but it exposes more opportunities for CC reuse (the next patch). llvm-svn: 187571	2013-08-01 10:29:45 +00:00
Vladimir Medic	deaa618cdc	Add tests for Mips DSP instructions. llvm-svn: 187570	2013-08-01 09:35:25 +00:00
Tim Northover	40e9efd725	AArch64: add initial NEON support Patch by Ana Pazos. - Completed implementation of instruction formats: AdvSIMD three same AdvSIMD modified immediate AdvSIMD scalar pairwise - Completed implementation of instruction classes (some of the instructions in these classes belong to yet unfinished instruction formats): Vector Arithmetic Vector Immediate Vector Pairwise Arithmetic - Initial implementation of instruction formats: AdvSIMD scalar two-reg misc AdvSIMD scalar three same - Intial implementation of instruction class: Scalar Arithmetic - Initial clang changes to support arm v8 intrinsics. Note: no clang changes for scalar intrinsics function name mangling yet. - Comprehensive test cases for added instructions To verify auto codegen, encoding, decoding, diagnosis, intrinsics. llvm-svn: 187567	2013-08-01 09:20:35 +00:00
Robert Lytton	4be00f8ad1	XCore target: Fix Vararg handling llvm-svn: 187565	2013-08-01 08:29:44 +00:00
Robert Lytton	4e60a3f4e3	XCore target: Add byval handling llvm-svn: 187563	2013-08-01 08:18:55 +00:00
Robert Lytton	b4787a159d	Xcore target Fix emitArrayBound() calling OutStreamer.Emit*() multiple times when trying to print a single line llvm-svn: 187562	2013-08-01 07:52:05 +00:00
Reed Kotler	302ae6b002	Fix some misc. issues with Mips16 fp stubs. 1) They should never be inlined. 2) A naming inconsistency with gcc mips16 3) Stubs should not have the global attribute llvm-svn: 187555	2013-08-01 02:26:31 +00:00
Kevin Enderby	78f9572f39	Added the B9.3.19 SUBS PC, LR, #imm (Thumb2) system instruction. While the .td entry is nice and all, it takes a pretty gross hack in ARMAsmParser::ParseInstruction() because of handling of other "subs" instructions to get it to match. Ran it by Jim Grosbach and he said it was about what he expected to make this work given the existing code. rdar://14214063 llvm-svn: 187530	2013-07-31 21:05:30 +00:00
Tom Stellard	ca69a53bae	Revert "R600: Non vector only instruction can be scheduled on trans unit" This reverts commit 98ce62780ea7185ba710868bf83c8077e8d7f6d6. llvm-svn: 187526	2013-07-31 20:43:27 +00:00
Vincent Lejeune	bb3f931123	R600: Avoid more than 4 literals in the same instruction group at scheduling llvm-svn: 187515	2013-07-31 19:32:07 +00:00
Vincent Lejeune	df18804e26	R600: Non vector only instruction can be scheduled on trans unit llvm-svn: 187514	2013-07-31 19:31:56 +00:00
Matt Arsenault	24b49c411c	Reject bitcasts between address spaces with different sizes llvm-svn: 187506	2013-07-31 17:49:08 +00:00
Richard Sandiford	791bea4182	[SystemZ] Implement isLegalAddressingMode() The loop optimizers were assuming that scales > 1 were OK. I think this is actually a bug in TargetLoweringBase::isLegalAddressingMode(), since it seems to be trying to reject anything that isn't r+i or r+r, but it has no default case for scales other than 0, 1 or 2. Implementing the hook for z means that z can no longer test any change there though. llvm-svn: 187497	2013-07-31 12:58:26 +00:00
Richard Sandiford	ee8343822e	[SystemZ] Be more careful about inverting CC masks (conditional loads) Extend r187495 to conditional loads. I split this out because the easiest way seemed to be to force a particular operand order in SystemZISelDAGToDAG.cpp. llvm-svn: 187496	2013-07-31 12:38:08 +00:00
Richard Sandiford	3d768e334b	[SystemZ] Be more careful about inverting CC masks System z branches have a mask to select which of the 4 CC values should cause the branch to be taken. We can invert a branch by inverting the mask. However, not all instructions can produce all 4 CC values, so inverting the branch like this can lead to some oddities. For example, integer comparisons only produce a CC of 0 (equal), 1 (less) or 2 (greater). If an integer EQ is reversed to NE before instruction selection, the branch will test for 1 or 2. If instead the branch is reversed after instruction selection (by inverting the mask), it will test for 1, 2 or 3. Both are correct, but the second isn't really canonical. This patch therefore keeps track of which CC values are possible and uses this when inverting a mask. Although this is mostly cosmestic, it fixes undefined behavior for the CIJNLH in branch-08.ll. Another fix would have been to mask out bit 0 when generating the fused compare and branch, but the point of this patch is that we shouldn't need to do that in the first place. The patch also makes it easier to reuse CC results from other instructions. llvm-svn: 187495	2013-07-31 12:30:20 +00:00
Richard Sandiford	8a757bba10	[SystemZ] Move compare-and-branch generation even later r187116 moved compare-and-branch generation from the instruction-selection pass to the peephole optimizer (via optimizeCompare). It turns out that even this is a bit too early. Fused compare-and-branch instructions don't interact well with predication, where a CC result is needed. They also make it harder to reuse the CC side-effects of earlier instructions (not yet implemented, but the subject of a later patch). Another problem was that the AnalyzeBranch family of routines weren't handling compares and branches, so we weren't able to reverse the fused form in cases where we would reverse a separate branch. This could have been fixed by extending AnalyzeBranch, but given the other problems, I've instead moved the fusing to the long-branch pass, which is also responsible for the opposite transformation: splitting out-of-range compares and branches into separate compares and long branches. I've added a test for the AnalyzeBranch problem. A test for the predication problem is included in the next patch, which fixes a bug in the choice of CC mask. llvm-svn: 187494	2013-07-31 12:11:07 +00:00
Richard Sandiford	6a06ba36ba	[SystemZ] Postpone NI->RISBG conversion to convertToThreeAddress() r186399 aggressively used the RISBG instruction for immediate ANDs, both because it can handle some values that AND IMMEDIATE can't, and because it allows the destination register to be different from the source. I realized later while implementing the distinct-ops support that it would be better to leave the choice up to convertToThreeAddress() instead. The AND IMMEDIATE form is shorter and is less likely to be cracked. This is a problem for 32-bit ANDs because we assume that all 32-bit operations will leave the high word untouched, whereas RISBG used in this way will either clear the high word or copy it from the source register. The patch uses the z196 instruction RISBLG for this instead. This means that z10 will be restricted to NILL, NILH and NILF for 32-bit ANDs, but I think that should be OK for now. Although we're using z10 as the base architecture, the optimization work is going to be focused more on z196 and zEC12. llvm-svn: 187492	2013-07-31 11:36:35 +00:00
Elena Demikhovsky	67b05fc0b3	Added INSERT and EXTRACT intructions from AVX-512 ISA. All insertf/extractf functions replaced with insert/extract since we have insertf and inserti forms. Added lowering for INSERT_VECTOR_ELT / EXTRACT_VECTOR_ELT for 512-bit vectors. Added lowering for EXTRACT/INSERT subvector for 512-bit vectors. Added a test. llvm-svn: 187491	2013-07-31 11:35:14 +00:00
Richard Sandiford	6cf80b3ec0	[SystemZ] Add RISBLG and RISBHG instruction definitions The next patch will make use of RISBLG for codegen. llvm-svn: 187490	2013-07-31 11:17:35 +00:00

1 2 3 4 5 ...

20297 Commits