llvm-project

Commit Graph

Author	SHA1	Message	Date
Eric Christopher	eb47a2a1e5	Make this code a little less magic number laden. llvm-svn: 131456	2011-05-17 07:47:55 +00:00
Chris Lattner	1e81f57bf0	add a note llvm-svn: 131455	2011-05-17 07:22:33 +00:00
Eli Friedman	7335e8a720	Back out r131444 and r131438; they're breaking nightly tests. I'll look into it more tomorrow. llvm-svn: 131451	2011-05-17 02:36:59 +00:00
Eli Friedman	83ba150f3a	Add x86 fast-isel for calls returning first-class aggregates. rdar://9435872. llvm-svn: 131438	2011-05-17 00:13:47 +00:00
Jim Grosbach	4e983166bc	Kill some dead code. llvm-svn: 131431	2011-05-16 22:24:07 +00:00
Eli Friedman	d4a3609d30	Remove dead code. Fix associated test to use FileCheck. llvm-svn: 131424	2011-05-16 21:28:22 +00:00
Eli Friedman	a4d4a0162d	Make fast-isel work correctly s/uadd.with.overflow intrinsics. llvm-svn: 131420	2011-05-16 21:06:17 +00:00
Rafael Espindola	e90c1cb221	sets bit 0 of the function address of thumb function in .symtab ("T is 1 if the target symbol S has type STT_FUNC and the symbol addresses a Thumb instruction ;it is 0 otherwise." from "ELF for the ARM Architecture" 4.7.1.2) Patch by Koan-Sin Tan! llvm-svn: 131406	2011-05-16 16:17:21 +00:00
Eli Friedman	8f1e11cde9	Fix a FIXME by moving the fast-isel implementation of the objectsize intrinsic from the x86 code to the generic code. llvm-svn: 131332	2011-05-14 00:47:51 +00:00
Rafael Espindola	df9db7ed92	Don't produce a vmovntdq if we don't have AVX support. llvm-svn: 131330	2011-05-14 00:30:01 +00:00
Eli Friedman	f080a57b81	Zap useless code; this hasn't done anything useful since fast-isel switched to being bottom-up (a very long time ago). llvm-svn: 131329	2011-05-14 00:19:32 +00:00
Julien Lerouge	7e11f9e26d	Fix a source of non determinism in FindUsedTypes, use a SetVector instead of a set. rdar://9423996 llvm-svn: 131283	2011-05-13 05:20:42 +00:00
Akira Hatanaka	e50a3d16e9	Fix setting of isCommutable flag. llvm-svn: 131233	2011-05-12 17:42:08 +00:00
Eric Christopher	2a9dbbbb12	Turn this into a table, this will make more sense shortly. Part of rdar://8470697 llvm-svn: 131200	2011-05-11 21:44:58 +00:00
Owen Anderson	b745623b71	Fix encoding of Thumb BLX register instructions. Patch by Koan-Sin Tan. llvm-svn: 131189	2011-05-11 17:00:48 +00:00
Nadav Rotem	8f971c27fb	Add custom lowering of X86 vector SRA/SRL/SHL when the shift amount is a splat vector. llvm-svn: 131179	2011-05-11 08:12:09 +00:00
Bill Wendling	50117f8186	Give the 'eh.sjlj.dispatchsetup' intrinsic call the value coming from the setjmp intrinsic call. This prevents it from being reordered so that it appears before the setjmp intrinsic (thus making it completely useless). <rdar://problem/9409683> llvm-svn: 131174	2011-05-11 01:11:55 +00:00
Eric Christopher	4a34e61e53	Optimize atomic lock or that doesn't use the result value. Next up: xor and and. Part of rdar://8470697 llvm-svn: 131171	2011-05-10 23:57:45 +00:00
Eric Christopher	e33464663f	Refactor lock versions of binary operators to be a little less cut and paste. llvm-svn: 131139	2011-05-10 18:36:16 +00:00
Jason W Kim	62f1ab79ec	First cut at getting debugging support for ARM/MC/ELF/.o DWARF stuff also gets fixed up by ELFARMAsmBackend::ApplyFixup(), but the offset is not guaranteed to be mod 4 == 0 as in text/data. llvm-svn: 131137	2011-05-10 18:07:25 +00:00
Justin Holewinski	72d74e4606	PTX: add PTX 2.3 setting in PTX sub-target. Patch by Wei-Ren Chen llvm-svn: 131123	2011-05-10 12:32:11 +00:00
Eric Christopher	5dc19f916c	Fix td file comments for Mips. Patch by Liu <proljc@gmail.com>! llvm-svn: 131086	2011-05-09 18:16:46 +00:00
Mon P Wang	92ff16b7bb	Fixed MC encoding for index_align for VLD1/VST1 (single element from one lane) for size 32 llvm-svn: 131085	2011-05-09 17:47:27 +00:00
Benjamin Kramer	d724a590e5	X86: Add a bunch of peeps for add and sub of SETB. "b + ((a < b) ? 1 : 0)" compiles into cmpl %esi, %edi adcl $0, %esi instead of cmpl %esi, %edi sbbl %eax, %eax andl $1, %eax addl %esi, %eax This saves a register, a false dependency on %eax (Intel's CPUs still don't ignore it) and it's shorter. llvm-svn: 131070	2011-05-08 18:36:07 +00:00
Jakob Stoklund Olesen	c132174169	Eliminate the ARM sub-register indexes that are not needed by the sources. Tablegen will invent its own names for these indexes, and the register file is a bit simpler. llvm-svn: 131059	2011-05-07 21:22:42 +00:00
Eric Christopher	1e3db02bda	Fix the non-MC encoding of pkhbt and pkhtb. Patch by Stephen Hines. llvm-svn: 131045	2011-05-07 04:37:27 +00:00
Akira Hatanaka	cbb7fa68ed	1. Keep lines in 80 columns. 2. Remove unused function. 3. Correct indentation. llvm-svn: 131028	2011-05-06 22:11:29 +00:00
Eli Friedman	2518f8376d	Make the logic for determining function alignment more explicit. No functionality change. llvm-svn: 131012	2011-05-06 20:34:06 +00:00
Rafael Espindola	a716096677	Dead code elimination. llvm-svn: 130984	2011-05-06 14:56:22 +00:00
Justin Holewinski	11d70b6b32	PTX: add PTX 2.3 language target Patch by Wei-Ren Chen llvm-svn: 130980	2011-05-06 11:40:36 +00:00
Rafael Espindola	bc8e3f8c45	Move PPC Linux to CFI. llvm-svn: 130951	2011-05-05 21:34:33 +00:00
Eli Friedman	f1e2b50a30	PR9848: pandn is not commutative. No test because I can't think of any way to write one that won't break quickly. llvm-svn: 130932	2011-05-05 17:45:31 +00:00
Bill Wendling	a48b1375df	Remove a flag that would set the ".eh" symbol as .globl. MachO was the only one who used this flag, and it now emits CFI and doesn't emit this anymore. All other targets left this flag "false". <rdar://problem/8486371> llvm-svn: 130918	2011-05-05 06:49:15 +00:00
Jakob Stoklund Olesen	808dca12f8	Fix X86RegisterInfo::getMatchingSuperRegClass for sub_8bit_hi. It is OK for B to be any GR8_ABCD_H superclass, the returned register class doesn't have to map surjectively onto B. llvm-svn: 130892	2011-05-04 23:54:54 +00:00
Jakob Stoklund Olesen	093a94cdae	Implement SystemZRegisterInfo::getMatchingSuperRegClass to enable cross-class joins. llvm-svn: 130857	2011-05-04 19:02:04 +00:00
Devang Patel	39ecf816c5	Do not emit location expression size twice. llvm-svn: 130854	2011-05-04 19:00:57 +00:00
Rafael Espindola	2998e6ce46	Fix cmake build. llvm-svn: 130850	2011-05-04 18:46:56 +00:00
Akira Hatanaka	23e8ecf125	Prevent instructions using $gp from being placed between a jalr and the instruction that restores the clobbered $gp. llvm-svn: 130847	2011-05-04 17:54:27 +00:00
Jakob Stoklund Olesen	839beb4124	Implement MSP430RegisterInfo::getMatchingSuperRegClass to enable cross-class coalescing. llvm-svn: 130814	2011-05-04 01:01:36 +00:00
Jakob Stoklund Olesen	f8be385c9b	Mark ultra-super-registers QQQQ as call-clobbered instead of the D sub-registers. LiveVariables doesn't understand that clobbering D0 and D1 completely overwrites Q0, so if Q0 is live-in to a function, its live range will extend beyond a function call that only clobbers D0 and D1. This shows up in the ARM/2009-11-01-NeonMoves test case. LiveVariables should probably implement the much stricter rules for physreg liveness that RAFast imposes - a physreg is killed by the first use of any alias. llvm-svn: 130801	2011-05-03 22:31:24 +00:00
Bill Wendling	db0996c822	Replace the "movnt" intrinsics with a native store + nontemporal metadata bit. <rdar://problem/8460511> llvm-svn: 130791	2011-05-03 21:11:17 +00:00
Akira Hatanaka	22fc723818	Fix function MipsRegisterInfo::getRegisterNumbering. llvm-svn: 130774	2011-05-03 18:41:54 +00:00
Bob Wilson	09585e1c5d	Temporarily disable use of divmod compiler-rt functions for iOS. llvm-svn: 130766	2011-05-03 17:33:22 +00:00
Bruno Cardoso Lopes	86c6e7057d	Fold ARM coprocessor intrinsics patterns into the instructions defs whenever it's possible. llvm-svn: 130764	2011-05-03 17:29:29 +00:00
Bruno Cardoso Lopes	168c9005b5	Add a few ARM coprocessor intrinsics. Testcases included llvm-svn: 130763	2011-05-03 17:29:22 +00:00
Benjamin Kramer	9c373c1c7a	Remove unused variables caught by GCC's -Wunused-but-set-variable. llvm-svn: 130755	2011-05-03 16:00:27 +00:00
Michael J. Spencer	9973738b65	Add pentium{3,4}m cpus. Patch by Alexander Best! llvm-svn: 130749	2011-05-03 03:42:50 +00:00
Eric Christopher	d2aa241378	xmm0 is an implicit parameter in this and so shouldn't be in the string template. Fixes rdar://8493866 llvm-svn: 130747	2011-05-03 01:28:32 +00:00
Dan Gohman	6136e94897	Add an unfolded offset field to LSR's Formula record. This is used to model constants which can be added to base registers via add-immediate instructions which don't require an additional register to materialize the immediate. llvm-svn: 130743	2011-05-03 00:46:49 +00:00
Eric Christopher	39b56b4b9f	Apparently the check for direct calls is unnecessary. llvm-svn: 130716	2011-05-02 20:16:33 +00:00
Rafael Espindola	5164e6e8b2	Add 130690 back. llvm-svn: 130693	2011-05-02 15:58:16 +00:00
Rafael Espindola	a392475865	Revert while I debug the tests that use march but not mtriple. llvm-svn: 130691	2011-05-02 15:42:31 +00:00
Rafael Espindola	c2aad4f2a3	Move ppc OS X to cfi too. I am building it on an old ppc mini, but it will take some time. llvm-svn: 130690	2011-05-02 15:00:52 +00:00
Rafael Espindola	fc8223670a	Add r130623 back now that ELF has been fixed to work with -fno-dwarf2-cfi-asm. llvm-svn: 130658	2011-05-01 15:44:13 +00:00
Chandler Carruth	ddc91b25e3	Remove an unused variable from this function introduced in r130637, likely a result of copy/paste. llvm-svn: 130640	2011-05-01 06:14:10 +00:00
Rafael Espindola	750cb61553	GCC uses a different encoding of pointers in the FDE when using -fno-dwarf2-cfi-asm. Implement the same behavior. llvm-svn: 130637	2011-05-01 04:49:54 +00:00
Rafael Espindola	95215a76cd	I forgot these files in the previous commit. llvm-svn: 130635	2011-05-01 04:19:24 +00:00
Rafael Espindola	fd05785324	Simplify the handling of pcrel relocations on ELF. Now we do the right thing for all symbol differences and can drop the old EmitPCRelSymbolValue method. This also make getExprForFDESymbol on ELF equal to the one on MachO, and it can be made non-virtual. llvm-svn: 130634	2011-05-01 03:50:49 +00:00
Rafael Espindola	b7c2286055	Revert the previous patch while I figure out how to make llvm-gcc less agressive about disabling cfi on linux :-( llvm-svn: 130626	2011-04-30 23:03:44 +00:00
Jakob Stoklund Olesen	2348cdd67f	X86AsmPrinter doesn't know how to handle the X86II::MO_GOT_ABSOLUTE_ADDRESS flag after folding ADD32ri to ADD32mi, so don't do that. This only happens when the greedy register allocator gets itself in trouble and spills %vreg9 here: 16L %vreg9<def> = MOVPC32r 0, %ESP<imp-use>; GR32:%vreg9 48L %vreg9<def> = ADD32ri %vreg9, <es:_GLOBAL_OFFSET_TABLE_>[TF=1], %EFLAGS<imp-def,dead>; GR32:%vreg9 That should never happen, the live range should be split instead. llvm-svn: 130625	2011-04-30 23:00:05 +00:00
Rafael Espindola	5265bc483e	Enable CFI on OS X. Currently the output should be almost identical to the one produced by CodeGen to make the transition easier. The only two differences I know of are: * Some files get an extra advance loc of size 0. This will be fixed when relaxations are enabled. * The optimization of declaring an EH symbol as an external variable is not implemented. This is a subset of adding the nounwind attribute, so we if really this at -O0 we should probably do it at the IL level. llvm-svn: 130623	2011-04-30 22:29:54 +00:00
Rafael Espindola	a3181d12c6	Add all the plumbing needed for MC to expand cfi to the old tables in the final assembly. It is the same technique used when targeting assemblers that don't support .loc. llvm-svn: 130587	2011-04-30 03:44:37 +00:00
Eric Christopher	e02e07c3a6	80-col. llvm-svn: 130558	2011-04-29 23:12:01 +00:00
Eli Friedman	468dfabce0	Zap a couple now-unused functions. llvm-svn: 130557	2011-04-29 22:56:48 +00:00
Eli Friedman	328bad02fa	Switch to ImmLeaf (which can be used by FastISel) for a few more common ARM/Thumb2 patterns. llvm-svn: 130552	2011-04-29 22:48:03 +00:00
Eric Christopher	7708746c33	Add FastEmitInst_ii for the arm fast isel generator. It doesn't use it, but if it ever did it needs the def machinery. llvm-svn: 130549	2011-04-29 22:07:50 +00:00
Eric Christopher	26b8ac4b8c	Some cleanup and optimize fallthrough more. llvm-svn: 130546	2011-04-29 21:56:31 +00:00
Eli Friedman	86caced370	Re-committing r130454, which does not in fact break anything. Fix a rather obscure crash caused by ARM fast-isel generating code which redefines a register. rdar://problem/9338332 . llvm-svn: 130539	2011-04-29 21:22:56 +00:00
Eric Christopher	8d46b47787	Add trunc->branch support, this won't help with clang's i8->i1 truncations for bools, but is a start. llvm-svn: 130534	2011-04-29 20:02:39 +00:00
Daniel Dunbar	dc3e4cc5ed	MCExpr: Add FindAssociatedSection, which attempts to mirror the 'as' semantics that associate sections with expressions. llvm-svn: 130517	2011-04-29 18:00:03 +00:00
Andrew Trick	e794e17524	Teach Thumb2 isel to fold and->rotr ==> ROR. Generalization of Nate Begeman's patch! llvm-svn: 130502	2011-04-29 14:18:15 +00:00
Benjamin Kramer	6708499b6d	This is done. llvm-svn: 130499	2011-04-29 14:09:57 +00:00
Chris Lattner	011eae7512	clean up after Sean's r127646 patch. llvm-svn: 130475	2011-04-29 05:40:18 +00:00
Chris Lattner	1d0c25756e	use the MachineInstrBuilder operator-> to simplify some code. There are probably more instances of this floating around. llvm-svn: 130474	2011-04-29 05:24:29 +00:00
Eric Christopher	9ade5e2495	Update comments and checks to match reality. llvm-svn: 130464	2011-04-29 00:07:20 +00:00
Eric Christopher	501d2e2c14	Whitespace. llvm-svn: 130463	2011-04-29 00:03:10 +00:00
Eli Friedman	517728b1ae	Revert r130454; apparently this doesn't actually work. llvm-svn: 130462	2011-04-28 23:55:14 +00:00
Eli Friedman	e4ecd42926	Fix a rather obscure crash caused by ARM fast-isel generating code which redefines a register. rdar://problem/9338332 . llvm-svn: 130454	2011-04-28 23:03:25 +00:00
Daniel Dunbar	a86188bf8e	Target/X86/MC: Add an option for disabling arith relaxation, for my own testing purposes. llvm-svn: 130438	2011-04-28 21:23:31 +00:00
Eli Friedman	7cd5101ad3	fast-isel sret calls, try 2. We actually do need to do something on x86-32. rdar://problem/9303592 . llvm-svn: 130429	2011-04-28 20:19:12 +00:00
Eli Friedman	d5a80ca3c8	Revert r130348; causing buildbot issues on x86-32. llvm-svn: 130412	2011-04-28 18:06:10 +00:00
Eric Christopher	4f012fd0a1	Be more layout aware here and swap the successor and branch condition if it means we get a fallthrough. llvm-svn: 130404	2011-04-28 16:52:09 +00:00
Rafael Espindola	c5dac4df2e	Add a getExprForPersonalitySymbol method to MCAsmInfo. Use it when converting the symbol passed to .cfi_personality into bytes is the file. llvm-svn: 130400	2011-04-28 16:09:09 +00:00
Eric Christopher	a98cd22d96	Let the immediate leaf pattern take transforms and switch the signed immediate patterns in arm to using the pattern. Handles rdar://9299434 llvm-svn: 130386	2011-04-28 05:49:04 +00:00
Chris Lattner	2a75c72e1c	move PR9803 to this readme. llvm-svn: 130385	2011-04-28 05:33:16 +00:00
Devang Patel	3e021533cd	Teach dwarf writer to handle complex address expression for .debug_loc entries. This fixes clang generated blocks' variables' debug info. Radar 9279956. llvm-svn: 130373	2011-04-28 02:22:40 +00:00
Justin Holewinski	45c30df303	PTX: support for select_cc and fixes for setcc - expansion of SELECT_CC into SETCC - force SETCC result type to i1 - custom selection for handling i1 using SETCC Patch by Dan Bailey llvm-svn: 130358	2011-04-28 00:19:56 +00:00
Justin Holewinski	498f703d84	PTX: support for select - selection of SELP instruction - new selp.ll test Patch by Dan Bailey llvm-svn: 130357	2011-04-28 00:19:55 +00:00
Justin Holewinski	e7d272e68b	PTX: mov fix and rounding correction for cvt - fix typo in MOV - correct fp rounding on CVT - new cvt.ll test Patch by Dan Bailey llvm-svn: 130356	2011-04-28 00:19:54 +00:00
Justin Holewinski	f6880019d1	PTX: support for fneg - selection of FNEG instruction - new fneg.ll test Patch by Dan Bailey llvm-svn: 130355	2011-04-28 00:19:53 +00:00
Justin Holewinski	99e03f1943	PTX: support for zext loads and trunc stores - expansion of EXTLOAD and TRUNCSTORE instructions Patch by Dan Bailey llvm-svn: 130354	2011-04-28 00:19:52 +00:00
Justin Holewinski	18e6ac83ea	PTX: support for bitwise operations on predicates - selection of bitwise preds (AND, OR, XOR) - new bitwise.ll test Patch by Dan Bailey llvm-svn: 130353	2011-04-28 00:19:51 +00:00
Justin Holewinski	638f0eb116	PTX: patch to AsmPrinter - immediate value cast as long not int - handles initializer for constant array Patch by Dan Bailey llvm-svn: 130352	2011-04-28 00:19:50 +00:00
Eli Friedman	8bd572fc58	fast-isel sret. We actually don't need to do anything special on x86. :) rdar://problem/9303592 . llvm-svn: 130348	2011-04-27 23:58:52 +00:00
Rafael Espindola	ce83fc3463	Remove unnecessary argument. llvm-svn: 130343	2011-04-27 23:17:57 +00:00
Rafael Espindola	08704349da	Rename getPersonalityPICSymbol to getCFIPersonalitySymbol, document it, and give it a bit more responsibility. Also implement it for MachO. If hacked to use cfi, 32 bit MachO will produce .cfi_personality 155, L___gxx_personality_v0$non_lazy_ptr and 64 bit will produce .cfi_presonality ___gxx_personality_v0 The general idea is that .cfi_personality gets passed the final symbol. It is up to codegen to produce it if using indirect representation (like 32 bit MachO), but it is up to MC to decide which relocations to create. llvm-svn: 130341	2011-04-27 23:08:15 +00:00
Eli Friedman	406c471b69	Make the fast-isel code for literal 0.0 a bit shorter/faster, since 0.0 is common. rdar://problem/9303592 . llvm-svn: 130338	2011-04-27 22:41:55 +00:00
Kevin Enderby	886894cb70	Fix a bug in the case that there is no add or subtract symbol and the offset value is zero so it does not add a NULL expr operand. llvm-svn: 130330	2011-04-27 21:02:27 +00:00
Devang Patel	e3745fdcf3	Revert r130178. It turned out to be not the optimal path to emit complex location expressions. llvm-svn: 130326	2011-04-27 20:29:27 +00:00
Eli Friedman	bcc6914146	Refactor out code to fast-isel a memcpy operation with a small constant length. (I'm planning to use this to implement byval.) llvm-svn: 130274	2011-04-27 01:45:07 +00:00
Eli Friedman	0eea0293d9	Fix an edge case involving branches in fast-isel on x86. rdar://problem/9303306 . llvm-svn: 130272	2011-04-27 01:34:27 +00:00
Chris Lattner	1b06c71668	Transform: "icmp eq (trunc (lshr(X, cst1)), cst" to "icmp (and X, mask), cst" when X has multiple uses. This is useful for exposing secondary optimizations, but the X86 backend isn't ready for this when X has a single use. For example, this can disable load folding. This is inching towards resolving PR6627. llvm-svn: 130238	2011-04-26 20:18:20 +00:00
Jim Grosbach	d4b733e4d8	ARM and Thumb2 support for atomic MIN/MAX/UMIN/UMAX loads. rdar://9326019 llvm-svn: 130234	2011-04-26 19:44:18 +00:00
Jakob Stoklund Olesen	803a200077	Add a TRI::getLargestLegalSuperClass hook to provide an upper limit on register class inflation. The hook will be used by the register allocator when recomputing register classes after removing constraints. Thumb1 code doesn't allow anything larger than tGPR, and x86 needs to ensure that the spill size doesn't change. llvm-svn: 130228	2011-04-26 18:52:33 +00:00
Rafael Espindola	80cb3cb1d6	Print all the moves at a given label instead of just the first one. Remove previous DwarfCFI hack. llvm-svn: 130187	2011-04-26 03:58:56 +00:00
Devang Patel	cae2fbd6fc	Let dwarf writer allocate extra space in the debug location expression. This space, if requested, will be used for complex addresses of the Blocks' variables. llvm-svn: 130178	2011-04-26 00:12:46 +00:00
Chris Lattner	6e29892430	add a missed bitfield instcombine. llvm-svn: 130137	2011-04-25 18:44:26 +00:00
Akira Hatanaka	0e7ee666b7	Lower BlockAddress node when relocation-model is static. llvm-svn: 130131	2011-04-25 17:10:45 +00:00
Chandler Carruth	9b73c8e293	Remove some hard coded CR-LFs. Some of these were the entire files, one of these was just one line of a file. Explicitly set the eol-style property on the files to try and ensure this fix stays. llvm-svn: 130125	2011-04-25 07:11:23 +00:00
Duncan Sands	56ca6292dc	Fix comment typo. Noticed by Liu. llvm-svn: 130120	2011-04-25 06:21:43 +00:00
Sebastian Redl	5519ff9d4e	Fix Target/ARM/Thumb1FrameLowering.h header guard. llvm-svn: 130097	2011-04-24 15:47:01 +00:00
Jay Foad	1a180156b6	Remove unused STL header includes. llvm-svn: 130068	2011-04-23 19:53:52 +00:00
Benjamin Kramer	3db054650b	Silence an overzealous uninitialized variable warning from GCC. llvm-svn: 130053	2011-04-23 08:21:06 +00:00
Andrew Trick	0ed5778a1e	Thumb2 and ARM add/subtract with carry fixes. Fixes Thumb2 ADCS and SBCS lowering: <rdar://problem/9275821>. t2ADCS/t2SBCS are now pseudo instructions, consistent with ARM, so the assembly printer correctly prints the 's' suffix. Fixes Thumb2 adde -> SBC matching to check for live/dead carry flags. Fixes the internal ARM machine opcode mnemonic for ADCS/SBCS. Fixes ARM SBC lowering to check for live carry (potential bug). llvm-svn: 130048	2011-04-23 03:55:32 +00:00
Andrew Trick	1a1f8d4640	whitespace llvm-svn: 130046	2011-04-23 03:24:11 +00:00
Johnny Chen	57c892860e	Disassembly of A8.6.59 LDR (literal) Encoding T1 (16-bit thumb instruction) should print out ldr, not ldr.n. rdar://problem/9267772 llvm-svn: 130008	2011-04-22 19:12:43 +00:00
Benjamin Kramer	341c11da3b	DAGCombine: fold "(zext x) == C" into "x == (trunc C)" if the trunc is lossless. On x86 this allows to fold a load into the cmp, greatly reducing register pressure. movzbl (%rdi), %eax cmpl $47, %eax -> cmpb $47, (%rdi) This shaves 8k off gcc.o on i386. I'll leave applying the patch in README.txt to Chris :) llvm-svn: 130005	2011-04-22 18:47:44 +00:00
Devang Patel	3c39ec2933	Add asserts. llvm-svn: 129995	2011-04-22 16:44:29 +00:00
Benjamin Kramer	4c81624735	X86: Try to use a smaller encoding by transforming (X << C1) & C2 into (X & (C2 >> C1)) & C1. (Part of PR5039) This tends to happen a lot with bitfield code generated by clang. A simple example for x86_64 is uint64_t foo(uint64_t x) { return (x&1) << 42; } which used to compile into bloated code: shlq $42, %rdi ## encoding: [0x48,0xc1,0xe7,0x2a] movabsq $4398046511104, %rax ## encoding: [0x48,0xb8,0x00,0x00,0x00,0x00,0x00,0x04,0x00,0x00] andq %rdi, %rax ## encoding: [0x48,0x21,0xf8] ret ## encoding: [0xc3] with this patch we can fold the immediate into the and: andq $1, %rdi ## encoding: [0x48,0x83,0xe7,0x01] movq %rdi, %rax ## encoding: [0x48,0x89,0xf8] shlq $42, %rax ## encoding: [0x48,0xc1,0xe0,0x2a] ret ## encoding: [0xc3] It's possible to save another byte by using 'andl' instead of 'andq' but I currently see no way of doing that without making this code even more complicated. See the TODOs in the code. llvm-svn: 129990	2011-04-22 15:30:40 +00:00
Evan Cheng	c0d2004e3c	In Thumb2 mode, lower frame indix references to: add <rd>, sp, #<imm8> ldr <rd>, [sp, #<imm8>] When the offset from sp is multiple of 4 and in range of 0-1020. This saves code size by utilizing 16-bit instructions. rdar://9321541 llvm-svn: 129971	2011-04-22 01:42:52 +00:00
Rafael Espindola	5395f44fe8	Compute the size of the FDE encoding instead of hard coding it. Update X8664_ELFTargetObjectFile::getFDEEncoding to match reality. llvm-svn: 129959	2011-04-22 00:08:43 +00:00
Rafael Espindola	6aea59268a	Remove unused argument. llvm-svn: 129955	2011-04-21 23:39:26 +00:00
Devang Patel	94ad6ac13c	Fix DWARF description of Q registers. llvm-svn: 129952	2011-04-21 23:22:35 +00:00
Devang Patel	3712c14be9	Fix DWARF description of S registers. llvm-svn: 129947	2011-04-21 22:48:26 +00:00
Devang Patel	46bda61a81	As per ARM docs, register Dx is described as DW_OP_regx(256+x) in DWARF. llvm-svn: 129922	2011-04-21 17:51:06 +00:00
Justin Holewinski	d74d88a861	PTX: Expand useable register space llvm-svn: 129913	2011-04-21 16:08:02 +00:00
Che-Liang Chiou	14c48e5d66	ptx: fix parameter ordering This patch depends on the prior fix r129908 that changes to use std::find, rather than std::binary_search, on unordered array. Patch by Dan Bailey llvm-svn: 129909	2011-04-21 10:56:58 +00:00
Che-Liang Chiou	cdc51569ee	ptx: PTXMachineFunctionInfo no longer sort registers and so should not use std::binary_search llvm-svn: 129908	2011-04-21 10:16:20 +00:00
Evan Cheng	5f1ba4cd2d	Remove -use-divmod-libcall. Let targets opt in when they are available. llvm-svn: 129884	2011-04-20 22:20:12 +00:00
Eli Friedman	c93d399eed	Revert r129846; it's breaking a buildbot. See http://google1.osuosl.org:8011/builders/llvm-x86_64-linux-checks/builds/825/steps/test.llvm.stage2/logs/st.ll llvm-svn: 129869	2011-04-20 19:00:08 +00:00
Jakob Stoklund Olesen	0e34c1dfac	Prefer cheap registers for busy live ranges. On the x86-64 and thumb2 targets, some registers are more expensive to encode than others in the same register class. Add a CostPerUse field to the TableGen register description, and make it available from TRI->getCostPerUse. This represents the cost of a REX prefix or a 32-bit instruction encoding required by choosing a high register. Teach the greedy register allocator to prefer cheap registers for busy live ranges (as indicated by spill weight). llvm-svn: 129864	2011-04-20 18:19:48 +00:00
Stuart Hastings	7850af6ea0	Excise unintended hunk in 129858. <rdar://problem/7662569> llvm-svn: 129862	2011-04-20 18:09:26 +00:00
Stuart Hastings	45fe3c38c5	ARM byval support. Will be enabled by another patch to the FE. <rdar://problem/7662569> llvm-svn: 129858	2011-04-20 16:47:52 +00:00
Justin Holewinski	7d8895e767	PTX: Add intrinsics to list of built-in intrinsics, which allows them to be used by Clang. To help Clang integration, the PTX target has been split into two targets: ptx32 and ptx64, depending on the desired pointer size. - Add GCCBuiltin class to all intrinsics - Split PTX target into ptx32 and ptx64 llvm-svn: 129851	2011-04-20 15:37:17 +00:00
Che-Liang Chiou	6586f84685	ptx: add integer div and rem instruction Patched by Dan Bailey llvm-svn: 129848	2011-04-20 09:28:55 +00:00
Che-Liang Chiou	5a952b3c67	ptx: add floating-point comparison to setp Patched by Dan Bailey llvm-svn: 129847	2011-04-20 09:28:20 +00:00
Che-Liang Chiou	49160f9a71	ptx: fix parameter ordering Patched by Dan Bailey llvm-svn: 129846	2011-04-20 09:27:19 +00:00
Nick Lewycky	4dae63e35b	This should always be signed chars, so use int8_t. This fixes a miscompile when llvm is built with unsigned chars where an immediate such as 0xff would be zero extended to 64-bits, turning "cmp $0xff,%eax" into "cmp $0xffffffffffffffff,%eax". llvm-svn: 129845	2011-04-20 03:19:42 +00:00
Rafael Espindola	e473aaf540	Remove unused arguments. llvm-svn: 129844	2011-04-20 03:08:09 +00:00
Daniel Dunbar	cd01ed5bd6	ADT/Triple: Renambe isOSX... methods to isMacOSX for consistency with the OS triple component. llvm-svn: 129838	2011-04-20 00:14:25 +00:00
Johnny Chen	dc62e59776	Fix typo in the comment. llvm-svn: 129837	2011-04-19 23:58:52 +00:00
Daniel Dunbar	2b9b0e3748	ADT/Triple: Move a variety of clients to using isOSDarwin() and isOSWindows() predicates. llvm-svn: 129816	2011-04-19 21:14:45 +00:00
Daniel Dunbar	100455a3c8	Target/X86: Eliminate uses of getDarwinVers(). llvm-svn: 129813	2011-04-19 21:04:12 +00:00
Daniel Dunbar	44b530369d	Target/X86: Add getTargetTriple() accessor. llvm-svn: 129812	2011-04-19 21:01:47 +00:00
Daniel Dunbar	e3de896b5e	Target/PPC: Kill off DarwinVers, which is now dead. llvm-svn: 129811	2011-04-19 20:59:24 +00:00
Daniel Dunbar	f954a0f028	Target/PPC: Eliminate a use of getDarwinVers(). llvm-svn: 129810	2011-04-19 20:57:03 +00:00
Daniel Dunbar	a37aab2515	Target/PPC: Add a TargetTriple field. llvm-svn: 129809	2011-04-19 20:54:28 +00:00
Daniel Dunbar	9483bb6bf3	Target: Eliminate a use of getDarwinMajorNumber(). llvm-svn: 129803	2011-04-19 20:44:08 +00:00
Eric Christopher	c721b0db6d	Remove some duplicate op action entries and reorganize. llvm-svn: 129781	2011-04-19 18:49:19 +00:00
Bob Wilson	0858c3aaed	This patch combines several changes from Evan Cheng for rdar://8659675. Making use of VFP / NEON floating point multiply-accumulate / subtraction is difficult on current ARM implementations for a few reasons. 1. Even though a single vmla has latency that is one cycle shorter than a pair of vmul + vadd, a RAW hazard during the first (4? on Cortex-a8) can cause additional pipeline stall. So it's frequently better to single codegen vmul + vadd. 2. A vmla folowed by a vmul, vmadd, or vsub causes the second fp instruction to stall for 4 cycles. We need to schedule them apart. 3. A vmla followed vmla is a special case. Obvious issuing back to back RAW vmla + vmla is very bad. But this isn't ideal either: vmul vadd vmla Instead, we want to expand the second vmla: vmla vmul vadd Even with the 4 cycle vmul stall, the second sequence is still 2 cycles faster. Up to now, isel simply avoid codegen'ing fp vmla / vmls. This works well enough but it isn't the optimial solution. This patch attempts to make it possible to use vmla / vmls in cases where it is profitable. A. Add missing isel predicates which cause vmla to be codegen'ed. B. Make sure the fmul in (fadd (fmul)) has a single use. We don't want to compute a fmul and a fmla. C. Add additional isel checks for vmla, avoid cases where vmla is feeding into fp instructions (except for the #3 exceptional case). D. Add ARM hazard recognizer to model the vmla / vmls hazards. E. Add a special pre-regalloc case to expand vmla / vmls when it's likely the vmla / vmls will trigger one of the special hazards. Enable these fp vmlx codegen changes for Cortex-A9. llvm-svn: 129775	2011-04-19 18:11:57 +00:00
Bob Wilson	d04a83f8f2	Add -mcpu=cortex-a9-mp. It's cortex-a9 with MP extension. rdar://8648637. llvm-svn: 129774	2011-04-19 18:11:52 +00:00
Bob Wilson	a2881ee8a4	Avoid some 's' 16-bit instruction which partially update CPSR (and add false dependency) when it isn't dependent on last CPSR defining instruction. rdar://8928208 llvm-svn: 129773	2011-04-19 18:11:49 +00:00
Bob Wilson	df612ba006	Avoid write-after-write issue hazards for Cortex-A9. Add a avoidWriteAfterWrite() target hook to identify register classes that suffer from write-after-write hazards. For those register classes, try to avoid writing the same register in two consecutive instructions. This is currently disabled by default. We should not spill to avoid hazards! The command line flag -avoid-waw-hazard can be used to enable waw avoidance. llvm-svn: 129772	2011-04-19 18:11:45 +00:00
Bob Wilson	3e5944d96b	Some single-precision VFP instructions can execute in either the VPF or Neon pipelines, at least on Cortex-A9. llvm-svn: 129771	2011-04-19 18:11:38 +00:00
Bob Wilson	f33715e554	Improvements for the Cortex-A9 scheduling itineraries. llvm-svn: 129770	2011-04-19 18:11:36 +00:00
Eli Friedman	ee92a6b332	Add support for FastISel'ing varargs calls. llvm-svn: 129765	2011-04-19 17:22:22 +00:00
Chris Lattner	91328b317b	Implement support for x86 fastisel of small fixed-sized memcpys, which are generated en-mass for C++ PODs. On my c++ test file, this cuts the fast isel rejects by 10x and shrinks the generated .s file by 5% llvm-svn: 129755	2011-04-19 05:52:03 +00:00
Chris Lattner	34a08c2344	tidy up llvm-svn: 129753	2011-04-19 05:15:59 +00:00
Chris Lattner	5f4b783426	Implement support for fast isel of calls of i1 arguments, even though they are illegal, when they are a truncate from something else. This eliminates fully half of all the fastisel rejections on a test c++ file I'm working with, which should make a substantial improvement for -O0 compile of c++ code. This fixed rdar://9297003 - fast isel bails out on all functions taking bools llvm-svn: 129752	2011-04-19 05:09:50 +00:00
Chris Lattner	d7f7c93914	Handle i1/i8/i16 constant integer arguments to calls by prepromoting them. Before we would bail out on i1 arguments all together, now we just bail on non-constant ones. Also, we used to emit extraneous code. e.g. test12 was: movb $0, %al movzbl %al, %edi callq _test12 and test13 was: movb $0, %al xorl %edi, %edi movb %al, 7(%rsp) callq _test13f Now we get: movl $0, %edi callq _test12 and: movl $0, %edi callq _test13f llvm-svn: 129751	2011-04-19 04:42:38 +00:00
Chris Lattner	c59290a34c	be layout aware, to produce: testb $1, %al je LBB0_2 ## BB#1: ## %if.then movb $0, %al instead of: testb $1, %al jne LBB0_1 jmp LBB0_2 LBB0_1: ## %if.then movb $0, %al how 'bout that. llvm-svn: 129749	2011-04-19 04:26:32 +00:00
Chris Lattner	2c8a4c3b1b	fix rdar://9297006 - fast isel bails out on trunc to i1 -> bools cry, a common cause of fast isel rejects on c++ code. llvm-svn: 129748	2011-04-19 04:22:17 +00:00
Evan Cheng	7d6cd4902e	Change A9 scheduling itineraries VLD* / VST* entries default to "aligned". That is, it assumes addresses are 64-bit aligned (which should be the more common case). If the alignment is found not to be aligned, then getOperandLatency() would adjust the operand latency computation by one to compensate for it. rdar://9294833 llvm-svn: 129742	2011-04-19 01:21:49 +00:00
Evan Cheng	4079133796	Do not lose mem_operands while lowering VLD / VST intrinsics. llvm-svn: 129738	2011-04-19 00:04:03 +00:00
Jim Grosbach	ddac5dd269	Trim a few unneeded includes. llvm-svn: 129723	2011-04-18 21:35:54 +00:00
Eric Christopher	2e3fbaab39	Invert the meaning of printAliasInstr's return value. It now returns true on success and false on failure. Update callers. llvm-svn: 129722	2011-04-18 21:28:11 +00:00
Sean Callanan	5d73033e0f	Small fix to the ARM AsmParser to ensure that a superclass variable is instantiated properly. llvm-svn: 129713	2011-04-18 20:20:44 +00:00
Chris Lattner	80254a53cc	Add a new bit that ImmLeaf's can opt into, which allows them to duck out of the generated FastISel. X86 doesn't need to generate code to match ADD16ri8 since ADD16ri will do just fine. This is a small codesize win in the generated instruction selector. llvm-svn: 129692	2011-04-18 06:36:55 +00:00
Chris Lattner	c479e0631f	switch the rest of the x86 immediate patterns over to ImmLeaf, simplifying them and exposing more information to tblgen. It would be nice if other target authors adopted this as well, particularly arm since it has fastisel. llvm-svn: 129676	2011-04-17 22:12:55 +00:00
Chris Lattner	2ff8c1a25f	now that predicates have a decent abstraction layer on them, introduce a new kind of predicate: one that is specific to imm nodes. The predicate function specified here just checks an int64_t directly instead of messing around with SDNode's. The virtue of this is that it means that fastisel and other things can reason about these predicates. llvm-svn: 129675	2011-04-17 22:05:17 +00:00
Chris Lattner	514e292b72	Rework our internal representation of node predicates to expose more structure and fix some fixmes. We now have a TreePredicateFn class that handles all of the decoding of these things. This is an internal cleanup that has no impact on the code generated by tblgen. llvm-svn: 129670	2011-04-17 21:38:24 +00:00
Chris Lattner	b53ccb8e36	1. merge fast-isel-shift-imm.ll into fast-isel-x86-64.ll 2. implement rdar://9289501 - fast isel should fold trivial multiplies to shifts 3. teach tblgen to handle shift immediates that are different sizes than the shifted operands, eliminating some code from the X86 fast isel backend. 4. Have FastISel::SelectBinaryOp use (the poorly named) FastEmit_ri_ function instead of FastEmit_ri to simplify code. llvm-svn: 129666	2011-04-17 20:23:29 +00:00
Chris Lattner	eb729d48ff	fix an x86 fast isel issue where we'd completely give up on folding an address when we have a global variable base an an index. Instead, just give up on folding the global variable. Before we'd geenrate: _test: ## @test ## BB#0: movq _rtx_length@GOTPCREL(%rip), %rax leaq (%rax), %rax addq %rdi, %rax movzbl (%rax), %eax ret now we generate: _test: ## @test ## BB#0: movq _rtx_length@GOTPCREL(%rip), %rax movzbl (%rax,%rdi), %eax ret The difference is even more significant when there is a scale involved. This fixes rdar://9289558 - total fail with addr mode formation at -O0/x86-64 llvm-svn: 129664	2011-04-17 17:47:38 +00:00
Chris Lattner	4832660b4d	fix an oversight which caused us to compile the testcase (and other less trivial things) into a dummy lea. Before we generated: _test: ## @test movq _G@GOTPCREL(%rip), %rax leaq (%rax), %rax ret now we produce: _test: ## @test movq _G@GOTPCREL(%rip), %rax ret This is part of rdar://9289558 llvm-svn: 129662	2011-04-17 17:12:08 +00:00
Chris Lattner	4b026b962a	tidy up and reduce indentation. llvm-svn: 129661	2011-04-17 17:05:12 +00:00
Eli Friedman	55f7bf3289	Remove working entry from README. llvm-svn: 129654	2011-04-17 02:36:27 +00:00
Francois Pichet	47f86e6c60	MSVC needs the return 0 to compile. llvm-svn: 129640	2011-04-16 13:59:23 +00:00
Rafael Espindola	a83b177035	Put each personality function in a section. This fixes the gnu ld warning: error in foo.o; no .eh_frame_hdr table will be created. llvm-svn: 129635	2011-04-16 03:51:21 +00:00
Stuart Hastings	ebddfe60a0	Correct result when a branch condition is live across a block boundary. <rdar://problem/8933028> llvm-svn: 129634	2011-04-16 03:31:26 +00:00
Johnny Chen	48592ee5af	Thumb2 BFC was insufficiently encoded. rdar://problem/9292717 llvm-svn: 129619	2011-04-15 22:52:15 +00:00
Johnny Chen	761e1e3512	A8.6.315 VLD3 (single 3-element structure to all lanes) The a bit must be encoded as 0. rdar://problem/9292625 llvm-svn: 129618	2011-04-15 22:49:08 +00:00
Akira Hatanaka	e24891251c	Reverse unnecessary changes made in r129606 and r129608. There is no change in functionality. llvm-svn: 129612	2011-04-15 21:51:11 +00:00
Cameron Zwarich	9c65e4d69c	Add ORR and EOR to the CMP peephole optimizer. It's hard to get isel to generate a case involving EOR, so I only added a test for ORR. llvm-svn: 129610	2011-04-15 21:24:38 +00:00
Akira Hatanaka	d56f2d910b	Fix lines that exceed 80 columns. There is no change in functionality. llvm-svn: 129608	2011-04-15 21:06:38 +00:00
Akira Hatanaka	aef55c8801	Fix lines that have incorrect indentation or exceed 80 columns. There is no change in functionality. llvm-svn: 129606	2011-04-15 21:00:26 +00:00
Cameron Zwarich	0829b3065a	The AND instruction leaves the V flag unmodified, so it falls victim to the same problem as all of the other instructions we fold with CMPs. llvm-svn: 129602	2011-04-15 20:45:00 +00:00
Rafael Espindola	7583dbdc88	Fix cmake build. llvm-svn: 129601	2011-04-15 20:34:45 +00:00
Cameron Zwarich	93eae1571c	Add missing register forms of instructions to the ARM CMP-folding code. This fixes <rdar://problem/9287901>. llvm-svn: 129599	2011-04-15 20:28:28 +00:00
Akira Hatanaka	279169771b	Add pass that expands pseudo instructions into target instructions after register allocation. Define pseudos that get expanded into mtc1 or mfc1 instructions. llvm-svn: 129594	2011-04-15 19:52:08 +00:00
Evan Cheng	a2e61292f0	Increase SubtargetFeatureKV Value and Implies fields to 64 bits since some targets are getting very close to 32 subtarget features. Also teach tablegen to error when there are more than 64 features to guard against undefined behavior. rdar://9282332 llvm-svn: 129590	2011-04-15 19:35:46 +00:00
Rafael Espindola	a01cdb0e37	Add 129518 back with a fix for when we are producing eh just because of debug info. Change ELF systems to use CFI for producing the EH tables. This reduces the size of the clang binary in Debug builds from 690MB to 679MB. llvm-svn: 129571	2011-04-15 15:11:06 +00:00
Chris Lattner	0ab5e2cded	Fix a ton of comment typos found by codespell. Patch by Luis Felipe Strano Moraes! llvm-svn: 129558	2011-04-15 05:18:47 +00:00
NAKAMURA Takumi	b5e3e9dd27	Revert r129518, "Change ELF systems to use CFI for producing the EH tables. This reduces the" It broke several builds. llvm-svn: 129557	2011-04-15 03:35:57 +00:00
Evan Cheng	12bb05b75b	Fix another fcopysign lowering bug. If src is f64 and destination is f32, don't forget to right shift the source by 32 first. rdar://9287902 llvm-svn: 129556	2011-04-15 01:31:00 +00:00
Johnny Chen	681fef5986	For t2BFI, both Inst{26} and Inst{5} "should" be 0. Ref: I.1 Instruction encoding diagrams and pseudocode llvm-svn: 129552	2011-04-15 00:35:08 +00:00
Michael J. Spencer	30088ba110	Add 3DNow! intrinsics. llvm-svn: 129551	2011-04-15 00:32:41 +00:00
Johnny Chen	421316178e	The ARM disassembler did not handle the alignment correctly for VLDDUP instructions (single element or n-element structure to all lanes). llvm-svn: 129550	2011-04-15 00:10:45 +00:00
Evan Cheng	44887f9c7e	Follow up on r127913. Fix Thumb revsh isel. rdar://9286766 llvm-svn: 129548	2011-04-14 23:27:44 +00:00
Johnny Chen	4251b151b1	Add sanity checkings for Thumb2 Load/Store Register Exclusive family of operations. llvm-svn: 129531	2011-04-14 19:13:28 +00:00
Chris Lattner	6f195469b1	move PR9661 out to here. llvm-svn: 129527	2011-04-14 18:47:18 +00:00

... 2 3 4 5 6 ...

17886 Commits