llvm-project

Commit Graph

Author	SHA1	Message	Date
Matt Arsenault	23755997e4	R600: Remove unused define llvm-svn: 221543	2014-11-07 20:45:00 +00:00
Daniel Sanders	c43cda84ff	[mips] Promote i32 arguments to i64 for the N32/N64 ABI and fix <64-bit structs... Summary: ... and after all that refactoring, it's possible to distinguish softfloat floating point values from integers so this patch no longer breaks softfloat to do it. Remove direct handling of i32's in the N32/N64 ABI by promoting them to i64. This more closely reflects the ABI documentation and also fixes problems with stack arguments on big-endian targets. We now rely on signext/zeroext annotations (already generated by clang) and the Assert[SZ]ext nodes to avoid the introduction of unnecessary sign/zero extends. It was not possible to convert three tests to use signext/zeroext. These tests are bswap.ll, ctlz-v.ll, ctlz-v.ll. It's not possible to put signext on a vector type so we just accept the sign extends here for now. These tests don't pass the vectors the same way clang does (clang puts multiple elements in the same argument, these map 1 element to 1 argument) so we don't need to worry too much about it. With this patch, all known N32/N64 bugs should be fixed and we now pass the first 10,000 tests generated by ABITest.py. Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6117 llvm-svn: 221534	2014-11-07 16:54:21 +00:00
Daniel Sanders	b315c8c762	[mips] Removed the remainder of MipsCC. NFC. Summary: One of the calls to AllocateStack (the one in LowerCall) doesn't look like it should be there but it was there before and removing it breaks the frame size calculation. Reviewers: vmedic, theraven Reviewed By: theraven Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6116 llvm-svn: 221529	2014-11-07 15:33:08 +00:00
Daniel Sanders	2c6f4b430b	[mips] Remove MipsCC::reservedArgArea() in favour of MipsABIInfo::GetCalleeAllocdArgSizeInBytes(). NFC. Summary: Reviewers: theraven, vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6115 llvm-svn: 221528	2014-11-07 15:03:53 +00:00
NAKAMURA Takumi	0ebd071450	MipsCCState.h: Use LLVM_DELETED_FUNCTION for msc17. llvm-svn: 221527	2014-11-07 14:56:31 +00:00
Daniel Sanders	0456c15c58	[mips] Move MipsCCState to a separate file and clang-formatted it. Summary: Depends on D6113 Reviewers: theraven, vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6114 llvm-svn: 221525	2014-11-07 14:24:31 +00:00
Daniel Sanders	892cf8af46	[mips] Fix unused variable warnings introduced in r221521 llvm-svn: 221522	2014-11-07 12:43:01 +00:00
Daniel Sanders	d7eba31508	[mips] Remove remaining use of MipsCC::intArgRegs() in favour of MipsABIInfo::GetByValArgRegs() and MipsABIInfo::GetVarArgRegs() Summary: Depends on D6112 Reviewers: theraven, vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6113 llvm-svn: 221521	2014-11-07 12:21:37 +00:00
Daniel Sanders	4f1bedaa47	[mips] Remove MipsCC::getRegVT(). NFC Summary: It's no longer used. Reviewers: vmedic, theraven Reviewed By: theraven Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6112 llvm-svn: 221519	2014-11-07 12:02:59 +00:00
Daniel Sanders	cfad1e3fca	[mips] Remove MipsCC::analyzeCallOperands in favour of CCState::AnalyzeCallOperands. NFC Summary: In addition to the usual f128 workaround, it was also necessary to provide a means of accessing ArgListEntry::IsFixed. Reviewers: theraven, vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6111 llvm-svn: 221518	2014-11-07 11:43:49 +00:00
Daniel Sanders	41a64c407f	[mips] Move SpecialCallingConv to MipsCCState and use it from tablegen-erated code. NFC Summary: In the long run, it should probably become a calling convention in its own right but for now just move it out of MipsISelLowering::analyzeCallOperands() so that we can drop this function in favour of CCState::AnalyzeCallOperands(). Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6085 llvm-svn: 221517	2014-11-07 11:10:48 +00:00
Daniel Sanders	f3096a1c8d	[mips] Removed IsVarArg from MipsISelLowering::analyzeCallOperands(). NFC. Summary: CCState objects already carry this information in their isVarArg() method. Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6084 llvm-svn: 221516	2014-11-07 10:45:16 +00:00
Ahmed Bougacha	72001cf287	[AArch64] Keep flags on condition vreg when instantiating a CB branch. Reversing a CB* instruction used to drop the flags on the condition. On the included testcase, this led to a read from an undefined vreg. Using addOperand keeps the flags, here <undef>. Differential Revision: http://reviews.llvm.org/D6159 llvm-svn: 221507	2014-11-07 02:50:00 +00:00
Simon Pilgrim	615ab8e721	[X86][SSE] Vector integer/float conversion memory folding (cvttps2dq / cvttpd2dq) Fixed an issue with the (v)cvttps2dq and (v)cvttpd2dq instructions being incorrectly put in the 2 source operand folding tables instead of the 1 source operand and added the missing SSE/AVX versions. Also added missing (v)cvtps2dq and (v)cvtpd2dq instructions to the folding tables. Differential Revision: http://reviews.llvm.org/D6001 llvm-svn: 221489	2014-11-06 22:15:41 +00:00
Ahmed Bougacha	b5367eeea3	[X86] Add VFMADDSUB cases for the 213->231 custom inserter. Also add tests for vfmadd/vfmsub. llvm-svn: 221488	2014-11-06 22:04:15 +00:00
Ahmed Bougacha	9152361d73	[X86] Add missing FMA3 VFMADDSUB in the emitter. Also reuse the fma4 intrinsic test to cover fma3 instructions too. llvm-svn: 221487	2014-11-06 21:58:11 +00:00
Colin LeMahieu	2c769209a1	[Hexagon] Adding basic Hexagon ELF object emitter. llvm-svn: 221465	2014-11-06 17:05:51 +00:00
Eli Bendersky	799c564236	Clean up NVPTXLowerStructArgs.cpp. NFC * Remove unnecessary const_casts and C-style casts * Simplify attribute access code * Simplify ArrayRef creation * 80-col and clang-format llvm-svn: 221464	2014-11-06 17:05:49 +00:00
Daniel Sanders	2373af3475	[mips] Removed IsSoftFloat from MipsISelLowering::analyzeCallOperands(). NFC Summary: It isn't used anymore. Depends on D6081 Reviewers: vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6083 llvm-svn: 221463	2014-11-06 16:48:57 +00:00
Daniel Sanders	b70e27ca7b	[mips] Removed MipsISelLowering::analyzeFormalArguments() in favour of CCState::AnalyzeFormalArguments() Summary: As with returns, we must be able to identify f128 arguments despite them being lowered away. We do this with a pre-analyze step that builds a vector and then we use this vector from the tablegen-erated code. Reviewers: vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6081 llvm-svn: 221461	2014-11-06 16:36:30 +00:00
Andrea Di Biagio	7ecd22ca4a	[X86] When commuting SSE immediate blend, make sure that the new blend mask is a valid imm8. Example: define <4 x i32> @test(<4 x i32> %a, <4 x i32> %b) { %shuffle = shufflevector <4 x i32> %a, <4 x i32> %b, <4 x i32> <i32 4, i32 5, i32 6, i32 3> ret <4 x i32> %shuffle } Before llc (-mattr=+sse4.1), produced the following assembly instruction: pblendw $4294967103, %xmm1, %xmm0 After pblendw $63, %xmm1, %xmm0 llvm-svn: 221455	2014-11-06 14:36:45 +00:00
Aaron Ballman	e77ffe35bf	Fixing some -Wcast-qual warnings; NFC. llvm-svn: 221454	2014-11-06 14:32:30 +00:00
Toma Tabacu	27cab751ca	[mips] Tolerate the use of the %z inline asm operand modifier with non-immediates. Summary: Currently, we give an error if %z is used with non-immediates, instead of continuing as if the %z isn't there. For example, you use the %z operand modifier along with the "Jr" constraints ("r" makes the operand a register, and "J" makes it an immediate, but only if its value is 0). In this case, you want the compiler to print "$0" if the inline asm input operand turns out to be an immediate zero and you want it to print the register containing the operand, if it's not. We give an error in the latter case, and we shouldn't (GCC also doesn't). Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6023 llvm-svn: 221453	2014-11-06 14:25:42 +00:00
Sasa Stankovic	b38db1eff8	[mips] Add the following MIPS options that control gp-relative addressing of small data items: -mgpopt, -mlocal-sdata, -mextern-sdata. Implement gp-relative addressing for constants. Differential Revision: http://reviews.llvm.org/D4903 llvm-svn: 221450	2014-11-06 13:20:12 +00:00
Toma Tabacu	dde4c464dd	[mips] Improve error/warning messages and testing for the .cpload assembler directive. Summary: Improved warning message when using .cpload inside a reorder section and added an error message for using .cpload with Mips16 enabled. Modified the tests to fit with the changes mentioned above, added a test-case for the N32 ABI in cpload.s and did some reformatting to make the tests easier to read. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5465 llvm-svn: 221447	2014-11-06 10:02:45 +00:00
David Majnemer	03d2c51cf2	X86, MC: Tidy up some whitespace in GetRelocType No functionality change intended. llvm-svn: 221443	2014-11-06 08:10:37 +00:00
Quentin Colombet	dbe33e7aa4	[X86] Lower VSELECT into SHRUNKBLEND when we shrink the bits used into the condition to match a blend. This prevents optimizations that work on VSELECT to perform invalid transformations. Indeed, the optimized condition does not match the vector boolean content that is expected and bad things may happen. This patch yields the exact same code on the whole test-suite + specs (-O3 and -O3 -march=core-avx2), it improves one test case (vector-blend.ll) and fixes a bug reduced in vselect-avx.ll. <rdar://problem/18819506> llvm-svn: 221429	2014-11-06 02:25:03 +00:00
Simon Pilgrim	1fc483d991	[X86][SSE] Vector integer to float conversion memory folding Added missing memory folding for the (V)CVTDQ2PS instructions - we can safely fold these (but not the (V)CVTDQ2PD versions which have a register/memory size discrepancy in the source operand). I've added a test case demonstrating that stack folding now works. Differential Revision: http://reviews.llvm.org/D5981 llvm-svn: 221407	2014-11-05 22:28:25 +00:00
Matt Arsenault	f2676a5afc	R600/SI: Fix omod display for VOP3b llvm-svn: 221387	2014-11-05 19:35:00 +00:00
Derek Schuff	a54222045e	[x86 fast-isel] Materialize allocas with the correct-sized lea for ILP32 Summary: X86FastISel::fastMaterializeAlloca was incorrectly conditioning its opcode selection on subtarget bitness rather than pointer size. Differential Revision: http://reviews.llvm.org/D6136 llvm-svn: 221386	2014-11-05 19:27:21 +00:00
Matt Arsenault	f3cd4512ac	R600/SI: Move all rsrc building functions to SIISelLowering llvm-svn: 221383	2014-11-05 19:01:19 +00:00
Matt Arsenault	485defe58c	R600/SI: Remove SI_ADDR64_RSRC llvm-svn: 221382	2014-11-05 19:01:17 +00:00
Justin Holewinski	3d140fcfd1	[NVPTX] Add NVPTXLowerStructArgs pass This works around the limitation that PTX does not allow .param space loads/stores with arbitrary pointers. If a function has a by-val struct ptr arg, say foo(%struct.x byval %d), then add the following instructions to the first basic block : %temp = alloca %struct.x, align 8 %tt1 = bitcast %struct.x %d to i8 * %tt2 = llvm.nvvm.cvt.gen.to.param %tt2 %tempd = bitcast i8 addrspace(101) * to %struct.x addrspace(101) * %tv = load %struct.x addrspace(101) * %tempd store %struct.x %tv, %struct.x * %temp, align 8 The above code allocates some space in the stack and copies the incoming struct from param space to local space. Then replace all occurrences of %d by %temp. Fixes PR21465. llvm-svn: 221377	2014-11-05 18:19:30 +00:00
Duncan P. N. Exon Smith	c5754a65e6	IR: MDNode => Value: NamedMDNode::getOperator() Change `NamedMDNode::getOperator()` from returning `MDNode *` to returning `Value *`. To reduce boilerplate at some call sites, add a `getOperatorAsMDNode()` for named metadata that's expected to only return `MDNode` -- for now, that's everything, but debug node named metadata (such as llvm.dbg.cu and llvm.dbg.sp) will soon change. This is part of PR21433. Note that there's a follow-up patch to clang for the API change. llvm-svn: 221375	2014-11-05 18:16:03 +00:00
Tilmann Scheller	30c5ca25a5	[ARM] Remove more dead code. Dead code identified by the Clang static analyzer. llvm-svn: 221372	2014-11-05 17:45:04 +00:00
Zoran Jovanovic	06c9d55123	[mips][microMIPS] Implement CodeGen support for ANDI16 instruction llvm-svn: 221371	2014-11-05 17:43:00 +00:00
Colin LeMahieu	816ef086f6	[Hexagon] [NFC] Alphabetizing cmake files. llvm-svn: 221370	2014-11-05 17:38:48 +00:00
Zoran Jovanovic	9f99723d92	[mips][microMIPS] Implement CodeGen support for SLL16 and SRL16 instructions llvm-svn: 221369	2014-11-05 17:38:31 +00:00
Tilmann Scheller	c339992338	[ARM] Remove another redundant assignment. Found by the Clang static analyzer. llvm-svn: 221368	2014-11-05 17:34:04 +00:00
Zoran Jovanovic	8853171b46	[mips][microMIPS] Implement ANDI16 instruction llvm-svn: 221367	2014-11-05 17:31:00 +00:00
Tilmann Scheller	219ad28076	[ARM] Remove redundant assignment. Found by the Clang static analyzer. llvm-svn: 221366	2014-11-05 17:28:19 +00:00
Tilmann Scheller	f2572c5097	[ARM] Remove dead code identified by the Clang static analyzer. llvm-svn: 221358	2014-11-05 17:10:43 +00:00
Zoran Jovanovic	9c654830f7	[mips][microMIPS] Mark symbols as microMIPS if necessary Differential Revision: http://reviews.llvm.org/D6039 llvm-svn: 221355	2014-11-05 16:35:20 +00:00
Zoran Jovanovic	a87308c84c	Reverted revisions 221351, 221352 and 221353. llvm-svn: 221354	2014-11-05 16:19:59 +00:00
Zoran Jovanovic	3038500f3b	[mips][microMIPS] Implement CodeGen support for ANDI16 instruction Differential Revision: http://reviews.llvm.org/D5797 llvm-svn: 221353	2014-11-05 15:54:05 +00:00
Zoran Jovanovic	f4f5f1e272	[mips][microMIPS] Implement CodeGen support for SLL16 and SRL16 instructions Differential Revision: http://reviews.llvm.org/D5933 llvm-svn: 221352	2014-11-05 15:46:53 +00:00
Zoran Jovanovic	e548bb0634	[mips][microMIPS] Implement ANDI16 instruction Differential Revision: http://reviews.llvm.org/D5163 llvm-svn: 221351	2014-11-05 15:39:41 +00:00
Tom Stellard	326d6ece94	R600/SI: Change all instruction assembly names to lowercase. This matches the format produced by the AMD proprietary driver. //==================================================================// // Shell script for converting .ll test cases: (Pass the .ll files you want to convert to this script as arguments). //==================================================================// ; This was necessary on my system so that A-Z in sed would match only ; upper case. I'm not sure why. export LC_ALL='C' TEST_FILES="$" MATCHES=`grep -v Patterns SIInstructions.td \| grep -o '"[A-Z0-9_]\+["e]' \| grep -o '[A-Z0-9_]\+' \| sort -r` for f in $TEST_FILES; do # Check that there are SI tests: grep -q -e 'verde' -e 'bonaire' -e 'SI' -e 'tahiti' $f if [ $? -eq 0 ]; then for match in $MATCHES; do sed -i -e "s/$[ :]$match$/\L\1/" $f done # Try to get check lines with partial instruction names sed -i 's/$;[ ]SI[A-Z\\-]: $$[A-Z_0-9]\+$/\1\L\2/' $f fi done sed -i -e 's/bb0_1/BB0_1/g' ../../../test/CodeGen/R600/infinite-loop.ll sed -i -e 's/SI-NOT: bfe/SI-NOT: {{[^@]}}bfe/g'../../../test/CodeGen/R600/llvm.AMDGPU.bfe.32.ll ../../../test/CodeGen/R600/sext-in-reg.ll sed -i -e 's/exp_IEEE/EXP_IEEE/g' ../../../test/CodeGen/R600/llvm.exp2.ll sed -i -e 's/numVgprs/NumVgprs/g' ../../../test/CodeGen/R600/register-count-comments.ll sed -i 's/$; CHECK[-NOT]*: $$[A-Z_0-9]\+$/\1\L\2/' ../../../test/CodeGen/R600/select64.ll ../../../test/CodeGen/R600/sgpr-copy.ll //==================================================================// // Shell script for converting .td files (run this last) //==================================================================// export LC_ALL='C' sed -i -e '/Patterns/!s/$"[A-Z0-9_]\+[ "e]$/\L\1/g' SIInstructions.td sed -i -e 's/"EXP/"exp/g' SIInstrInfo.td llvm-svn: 221350	2014-11-05 14:50:53 +00:00
Andrea Di Biagio	ce46b97b48	[X86] Teach method 'isVectorClearMaskLegal' how to check for legal blend masks. This patch improves the folding of vector AND nodes into blend operations for targets that feature SSE4.1. A vector AND node where one of the operands is a constant build_vector with elements that are either zero or all-ones can be converted into a blend. This allows for example to simplify the following code: define <4 x i32> @test(<4 x i32> %A, <4 x i32> %B) { %1 = and <4 x i32> %A, <i32 0, i32 0, i32 0, i32 -1> %2 = and <4 x i32> %B, <i32 -1, i32 -1, i32 -1, i32 0> %3 = or <4 x i32> %1, %2 ret <4 x i32> %3 } Before this patch llc (-mcpu=corei7) generated: andps LCPI1_0(%rip), %xmm0, %xmm0 andps LCPI1_1(%rip), %xmm1, %xmm1 orps %xmm1, %xmm0, %xmm0 retq With this patch we generate a single 'vpblendw'. llvm-svn: 221343	2014-11-05 13:04:14 +00:00
Oliver Stannard	9e89d8cc5c	[ARM] Honor FeatureD16 in the assembler and disassembler Some ARM FPUs only have 16 double-precision registers, rather than the normal 32. LLVM represents this with the D16 target feature. This is currently used by CodeGen to avoid using high registers when they are not available, but the assembler and disassembler do not. I fix this in the assembler and disassembler rather than the InstrInfo.td files, as the latter would require a large number of changes everywhere one of the floating-point instructions is referenced in the backend. This solution is similar to the one used for co-processor numbers and MSR masks. llvm-svn: 221341	2014-11-05 12:06:39 +00:00

30642 Commits