llvm-project

Commit Graph

Author	SHA1	Message	Date
Bob Wilson	0f8a02830a	Fix indentation. llvm-svn: 99705	2010-03-27 04:01:23 +00:00
Bob Wilson	cf603fb1c5	Add a format argument to the N3V and N3VX classes, removing the N3Vf class. llvm-svn: 99704	2010-03-27 03:56:52 +00:00
Chris Lattner	07943af506	eliminate the last of the parallel's! llvm-svn: 99700	2010-03-27 02:47:14 +00:00
Eric Christopher	81c03447fc	When we promote a load of an argument make sure to take the alignment of the previous load - it's usually important. For example, we don't want to blindly turn an unaligned load into an aligned one. llvm-svn: 99699	2010-03-27 01:54:00 +00:00
Bill Wendling	6888e798d3	Forgot the part where we handle the ".llvm.eh.catch.all.value". llvm-svn: 99697	2010-03-27 01:24:30 +00:00
Bill Wendling	ec8b44adeb	Return if we changed anything or not. llvm-svn: 99695	2010-03-27 01:22:38 +00:00
Bill Wendling	f2c1f40e95	If a selector has a call to ".llvm.eh.catch.all.value" that we haven't converted, then use the initializer, since using the name itself won't work. llvm-svn: 99692	2010-03-27 01:19:12 +00:00
Johnny Chen	6094cdab9f	Add NVMulSLFrm to represent "3-register multiply with scalar" operations and set it as the format for the appropriate N3VSL<> classes. These instructions require special handling of the M:Vm field which encodes the restricted Dm and the lane index within Dm. Examples are A8.6.325 VMLA, VMLAL, VMLS, VMLSL (by scalar): vmlal.s32 q3, d2, d10[0] llvm-svn: 99690	2010-03-27 01:03:13 +00:00
Chris Lattner	c5e20d9031	eliminate almost all the rest of the x86-32 parallels. llvm-svn: 99686	2010-03-27 00:45:04 +00:00
Jim Grosbach	44313db557	Thumb2 storeFrom/LoadToStackSlot() need to handle tGPR regs directly, not pass through to the generic version. The generic functions use STR/LDR, but T2 needs the t2STR/t2LDR instead so we get the addressing mode correct. llvm-svn: 99678	2010-03-27 00:09:12 +00:00
Chris Lattner	35a069b3a5	improve portability to minix, patch by Kees van Reeuwijk for PR6704 llvm-svn: 99677	2010-03-26 23:54:15 +00:00
Johnny Chen	93acfbf441	Remove the duplicate multiclass N3VSh_QHSD and use N3VInt_QHSD which is modified to now take a format argument. N3VDInt<> and N3VQInt<> are modified to take a format argument as well. llvm-svn: 99676	2010-03-26 23:49:07 +00:00
Bill Wendling	d1aa77c37d	If we mark clean-ups as clean-ups, then it could break when inlining through an 'invoke' instruction. You will get a situation like this: bb: %ehptr = eh.exception() %sel = eh.selector(%ehptr, @per, 0); ... bb2: invoke _Unwind_Resume_or_Rethrow(%ehptr) %normal unwind to %lpad lpad: ... The unwinder will see the %sel call as a clean-up and, if it doesn't have a catch further up the call stack, it will skip running it. But there is another catch up the stack -- the catch for the %lpad. However, we can't see that. This is fixed in code-gen, where we detect this situation, and convert the "clean-up" selector call into a "catch-all" selector call. This gives us the correct semantics. llvm-svn: 99671	2010-03-26 23:41:30 +00:00
Johnny Chen	0b57de3c4c	Add NVExtFrm to represent NEON Vector Extract Instructions, that uses Inst{11-8} to encode the byte location of the extracted result in the concatenation of the operands, from the least significant end. Modify VEXTd and VEXTq classes to use the format. llvm-svn: 99659	2010-03-26 22:28:56 +00:00
Chris Lattner	574907ed88	remove a constructor implementation that isn't declared in the header. How can both clang and gcc accept this? PR6703 llvm-svn: 99658	2010-03-26 22:17:24 +00:00
Anton Korobeynikov	2f072e95e6	Add few missed libcalls and correct names for others. llvm-svn: 99656	2010-03-26 21:32:14 +00:00
Johnny Chen	2cf04957c2	Add N3RegVShFrm to represent 3-Register Vector Shift Instructions, which do not follow the N3RegFrm's operand order of D:Vd N:Vn M:Vm. The operand order of N3RegVShFrm is D:Vd M:Vm N:Vn (notice that M:Vm is the first src operand). Add a parent class N3Vf which requires passing a Format argument and which the N3V class is modified to inherit from. N3V class represents the "normal" 3-Register NEON Instructions with N3RegFrm. Also add a multiclass N3VSh_QHSD to represent clusters of NEON 3-Register Shift Instructions and replace 8 invocations with it. llvm-svn: 99655	2010-03-26 21:26:28 +00:00
Dale Johannesen	6096d5a279	Debug info shouldn't affect kills. llvm-svn: 99637	2010-03-26 19:21:26 +00:00
Jim Grosbach	bf59859b2b	vldm/vstm can only do up to 16 double-word registers at a time. Radar 7797856 llvm-svn: 99630	2010-03-26 18:41:09 +00:00
Johnny Chen	8fc94d6362	Add N3RegFrm to represent "NEON 3 vector register format" instructions. Examples are VABA (Vector Absolute Difference and Accumulate), VABAL (Vector Absolute Difference and Accumulate Long), and VABD (Vector Absolute Difference). llvm-svn: 99628	2010-03-26 18:32:20 +00:00
Evan Cheng	3365fb1412	Do not sibcall if stack needs to be dynamically aligned. llvm-svn: 99620	2010-03-26 16:26:03 +00:00
Evan Cheng	00a620c61e	Allow trivial sibcall of vararg callee when no arguments are being passed. llvm-svn: 99598	2010-03-26 02:13:13 +00:00
Evan Cheng	eb50ac5ccc	LiveVariables should clear kill / dead markers first. This allows us to remove a hack in the scheduler. llvm-svn: 99597	2010-03-26 02:12:24 +00:00
Johnny Chen	5d4e917d9f	Add N2RegVShLFrm and N2RegVShRFrm formats so that the disassembler can easily dispatch to the appropriate routines to handle the different interpretations of the shift amount encoded in the imm6 field. The Vd, Vm fields are interpreted the same between the two, though. See, for example, A8.6.367 VQSHL, VQSHLU (immediate) for N2RegVShLFrm format and A8.6.368 VQSHRN, VQSHRUN for N2RegVShRFrm format. llvm-svn: 99590	2010-03-26 01:07:59 +00:00
Jeffrey Yasskin	bfd38abbed	Avoid leaking argv and env arrays from lli. llvm-svn: 99589	2010-03-26 00:59:12 +00:00
Dan Gohman	d42e09d91e	Ignore debug intrinsics in yet more places. llvm-svn: 99580	2010-03-26 00:33:27 +00:00
Evan Cheng	7b4a1a221b	Try trivial remat before the coalescer gives up on a vr / physreg coalescing for fear of tying up a physical register. llvm-svn: 99575	2010-03-26 00:07:25 +00:00
Dale Johannesen	5d99d7fe79	Handle DEBUG_VALUE in this pass. llvm-svn: 99573	2010-03-26 00:02:44 +00:00
Jim Grosbach	71fcb4fedd	switch the flag for using NEON for SP floating point to a subtarget 'feature'. Re-commit. This time complete with testsuite updates. llvm-svn: 99570	2010-03-25 23:47:34 +00:00
Jim Grosbach	42bb89c7d9	need to fix 'make check' tests first. revert for a moment. llvm-svn: 99569	2010-03-25 23:34:05 +00:00
Jim Grosbach	7fce4e39aa	switch the flag for using NEON for SP floating point to a subtarget 'feature' llvm-svn: 99568	2010-03-25 23:32:19 +00:00
Gabor Greif	6c6b2fd2b2	rename pred_const_iterator to const_pred_iterator for consistency's sake llvm-svn: 99567	2010-03-25 23:25:28 +00:00
Johnny Chen	a3617ec88a	Removed instruction class NI from ARMInstrFormats.td. It doesn't seem to be used anywhere. llvm-svn: 99566	2010-03-25 23:11:56 +00:00
Jim Grosbach	a43386ba8f	switch the use-vml[as] instructions flag to a subtarget 'feature' llvm-svn: 99565	2010-03-25 23:11:16 +00:00
Gabor Greif	c78d720f02	rename use_const_iterator to const_use_iterator for consistency's sake llvm-svn: 99564	2010-03-25 23:06:16 +00:00
Daniel Dunbar	d821f4ac60	llvm-mc: Add a -mc-relax-all option, which relaxes every fixup. We always need exactly two passes in that case, and don't ever need to recompute any layout, so this is a nice baseline for relaxation performance. llvm-svn: 99563	2010-03-25 22:49:09 +00:00
Johnny Chen	91d2774416	Add NVDupLnFrm and change NVDupLane class to use that format. llvm-svn: 99557	2010-03-25 21:49:12 +00:00
Jim Grosbach	4b3b2ef65c	ARM cortex-a8 doesn't do vmla/vmls well. disable them by default for that cpu llvm-svn: 99549	2010-03-25 20:48:50 +00:00
Johnny Chen	d82f9002e4	Add NVCVTFrm (NEON Convert with fractional bits immediate) and modify N2VImm to expect a Format arg. N2VCvtD/N2VCvtQ are modified to use the NVCVTFrm format. llvm-svn: 99548	2010-03-25 20:39:04 +00:00
Evan Cheng	510bda2064	Code clean up. llvm-svn: 99544	2010-03-25 19:46:11 +00:00
Daniel Dunbar	6432bd744e	MC: Stop restarting layout on every relaxation. - Still O(N^2), just a faster form, and now its the MCAsmLayout's fault. On the .s I am tuning against (combine.s from 403.gcc): -- ddunbar@lordcrumb:MC$ diff stats-before.txt stats-after.txt 5,10c5,10 < 1728 assembler - Number of assembler layout and relaxation steps < 7707 assembler - Number of emitted assembler fragments < 120588 assembler - Number of emitted object file bytes < 2233448 assembler - Number of evaluated fixups < 1727 assembler - Number of relaxed instructions < 6723845 mcexpr - Number of MCExpr evaluations --- > 3 assembler - Number of assembler layout and relaxation steps > 7707 assembler - Number of emitted assembler fragments > 120588 assembler - Number of emitted object file bytes > 14796 assembler - Number of evaluated fixups > 1727 assembler - Number of relaxed instructions > 67889 mcexpr - Number of MCExpr evaluations -- Feel free to LOL at the -before numbers, if you like. I am a little surprised we make more than 2 relaxation passes. It's pretty trivial for us to do relaxation out-of-order if that would give a speedup. llvm-svn: 99543	2010-03-25 19:35:56 +00:00
Daniel Dunbar	d919276bc0	Fix -Asserts warning, again. llvm-svn: 99542	2010-03-25 19:35:53 +00:00
Jakob Stoklund Olesen	3758ff917e	Tag SSE2 integer instructions as SSEPackedInt. llvm-svn: 99540	2010-03-25 18:52:04 +00:00
Jakob Stoklund Olesen	f8d7eda663	Teach TableGen to understand X.Y notation in the TSFlagsFields strings. Remove much horribleness from X86InstrFormats as a result. Similar simplifications are probably possible for other targets. llvm-svn: 99539	2010-03-25 18:52:01 +00:00
Chris Lattner	fc4ec25363	fix a valgrind error on copy-constructor-synthesis.cpp, which is caused when the custom insertion hook deletes the instruction, then we try to set dead flags on it. Neither the code that I added nor the code that was there before was safe. llvm-svn: 99538	2010-03-25 18:49:10 +00:00
Evan Cheng	a1d0a02713	Remove an unused option. llvm-svn: 99537	2010-03-25 18:37:23 +00:00
Daniel Dunbar	0ba6a671d4	MC: Simplify main section layout process by moving alignment into LayoutSection. llvm-svn: 99529	2010-03-25 18:16:42 +00:00
Daniel Dunbar	25d114b2b2	MC: Sink Section address assignment into LayoutSection. llvm-svn: 99528	2010-03-25 18:16:38 +00:00
Jakob Stoklund Olesen	49e121d5e4	Add a late SSEDomainFix pass that twiddles SSE instructions to avoid domain crossings. On Nehalem and newer CPUs there is a 2 cycle latency penalty on using a register in a different domain than where it was defined. Some instructions have equvivalents for different domains, like por/orps/orpd. The SSEDomainFix pass tries to minimize the number of domain crossings by changing between equvivalent opcodes where possible. This is a work in progress, in particular the pass doesn't do anything yet. SSE instructions are tagged with their execution domain in TableGen using the last two bits of TSFlags. Note that not all instructions are tagged correctly. Life just isn't that simple. The SSE execution domain issue is very similar to the ARM NEON/VFP pipeline issue handled by NEONMoveFixPass. This pass may become target independent to handle both. llvm-svn: 99524	2010-03-25 17:25:00 +00:00
Johnny Chen	45ab3f3ccf	Added a new instruction class NVDupLane to be inherited by VDUPLND and VDUPLNQ, instead of the current N2V. Format of NVDupLane instances are set to NEONFrm currently. llvm-svn: 99518	2010-03-25 17:01:27 +00:00

1 2 3 4 5 ...

37259 Commits