llvm-project

Commit Graph

Author	SHA1	Message	Date
Akira Hatanaka	6871031be9	[mips] Add instruction selection patterns for blez and bgez. llvm-svn: 182396	2013-05-21 17:13:47 +00:00
Justin Holewinski	48f4ad3fc0	[NVPTX] Add @llvm.nvvm.sqrt.f() intrinsic llvm-svn: 182394	2013-05-21 16:51:30 +00:00
Jyotsna Verma	1b056e422c	Hexagon: SelectionDAG should not use MVT::Other to check the legality of BR_CC. llvm-svn: 182390	2013-05-21 15:54:32 +00:00
Justin Holewinski	fff1f5f5e2	Drop @llvm.annotation and @llvm.ptr.annotation intrinsics during codegen. The intrinsic calls are dropped, but the annotated value is propagated. Fixes PR 15253 Original patch by Zeng Bin! llvm-svn: 182387	2013-05-21 14:37:16 +00:00
Hal Finkel	c5211291f1	Fix PPC branch selection for counter-based branches Although I had added some support for the BDZ/BDNZ branches into the selector (in r158204), I had not correctly adjusted the condition at the top of the loop. As a result, these branches were still essentially unsupported. This fixes PR16086. Unfortunately, any test case would be very large (because it would need to force the loop backedge to exceed the range of the 16-bit immediate). llvm-svn: 182385	2013-05-21 14:21:09 +00:00
Elena Demikhovsky	0dd4025ae9	removed commented lines llvm-svn: 182377	2013-05-21 13:27:44 +00:00
Evgeniy Stepanov	ebd7f8e7ef	[msan] A no-op implementation of VarArg handling. This stuff is used on platforms where MSan does not have a proper VarArg implementation (anything other than x86_64 at the moment). llvm-svn: 182375	2013-05-21 12:27:47 +00:00
Elena Demikhovsky	fad029202f	Removed SSEPacked domain from all forms (AVX, SSE, signed, unsigned) scalar compare instructions, like COMISS, COMISD. No functional changes. llvm-svn: 182371	2013-05-21 12:04:22 +00:00
Benjamin Kramer	18ef6b22b9	X86: When emulating unsigned PCMPGTQ with PCMPGTD, fix the sign bit for the smaller type. Otherwise we'll get a mix of signed and unsigned compares. Fixes PR15977. llvm-svn: 182364	2013-05-21 09:58:54 +00:00
Benjamin Kramer	8aaf197990	DAGCombine: Avoid an edge case where it tried to create an i0 type for (x & 0) == 0. Fixes PR16083. llvm-svn: 182357	2013-05-21 08:51:09 +00:00
Richard Sandiford	3b105a063f	Fix indentation llvm-svn: 182356	2013-05-21 08:48:24 +00:00
Eric Christopher	db142d4e1e	Add cmake bits for md5. llvm-svn: 182349	2013-05-21 01:30:38 +00:00
Eric Christopher	e1dc3c45e6	Add an md5 library derived from a public domain implementation for dwarf4 type signature computation. llvm-svn: 182348	2013-05-21 01:28:35 +00:00
Manman Ren	9d4c735885	Dwarf: use a single line table to generate assembly when .loc is used. This is to fix PR15408 where an undefined symbol Lline_table_start1 is used. Since we do not generate the debug_line section when .loc is used, Lline_table_start1 is not emitted and we can't refer to it when calculating at_stmt_list for a compile unit. llvm-svn: 182344	2013-05-21 00:57:22 +00:00
Reed Kotler	0fed8d4ef7	Add some additional functions to the list of helper functions for pic calls. These need to be there so we don't try and use helper functions when we call those. As part of this, make sure that we properly exclude helper functions in pic mode when indirect calls are involved. llvm-svn: 182343	2013-05-21 00:50:30 +00:00
David Blaikie	e63d5d1633	PR14606: Debug Info for namespace aliases/DW_TAG_imported_module This resolves the last of the PR14606 failures in the GDB 7.5 test suite by implementing an optional name field for DW_TAG_imported_modules/DIImportedEntities and using that to implement C++ namespace aliases (eg: "namespace X = Y;"). llvm-svn: 182328	2013-05-20 22:50:35 +00:00
Bill Wendling	eda5418e89	The DWARF EH pass doesn't need the TargetMachine, only the TargetLoweringBase like the other EH passes. llvm-svn: 182321	2013-05-20 21:54:18 +00:00
Bill Wendling	47447589c9	No need to store the TargetMachine variable in this class. llvm-svn: 182317	2013-05-20 21:28:28 +00:00
Bill Wendling	5f4740390e	Remove unused #include. llvm-svn: 182315	2013-05-20 20:59:12 +00:00
Hal Finkel	a969df84ab	Rename LoopSimplify.h to LoopUtils.h As discussed, LoopUtils.h is a better name. llvm-svn: 182314	2013-05-20 20:46:30 +00:00
Akira Hatanaka	5de4416962	[mips] Add (setne $lhs, 0) instruction selection pattern. llvm-svn: 182307	2013-05-20 18:18:07 +00:00
Akira Hatanaka	1cb024207f	[mips] Trap on integer division by zero. By default, a teq instruction is inserted after integer divide. No divide-by-zero checks are performed if option "-mnocheck-zero-division" is used. llvm-svn: 182306	2013-05-20 18:07:43 +00:00
Hal Finkel	e6d7c285b3	Remove copied preheader insertion logic from PPCCTRLoops Now that the preheader insertion logic in LoopSimplify is externally exposed, use it, and remove the copy-and-pasted version. No functionality change intended. llvm-svn: 182300	2013-05-20 16:47:10 +00:00
Hal Finkel	a12d82b421	Expose InsertPreheaderForLoop from LoopSimplify to other passes Other passes, PPC counter-loop formation for example, also need to add loop preheaders outside of the regular loop simplification pass. This makes InsertPreheaderForLoop a global function so that it can be used by other passes. No functionality change intended. llvm-svn: 182299	2013-05-20 16:47:07 +00:00
Justin Holewinski	4c47d87ba6	[NVPTX] Fix mis-use of CurrentFnSym in NVPTXAsmPrinter. This was causing a symbol name error in the output PTX. llvm-svn: 182298	2013-05-20 16:42:18 +00:00
Justin Holewinski	18f3a1ffe6	[NVPTX] Add programmatic interface to NVVMReflect pass llvm-svn: 182297	2013-05-20 16:42:16 +00:00
Hal Finkel	0859ef29d5	Rename PPC MTCTRse to MTCTRloop As the pairing of this instruction form with the bdnz/bdz branches is now enforced by the verification pass, make it clear from the name that these are used only for counter-based loops. No functionality change intended. llvm-svn: 182296	2013-05-20 16:08:37 +00:00
Hal Finkel	8ca3884147	Add a PPCCTRLoops verification pass When asserts are enabled, this adds a verification pass for PPC counter-loop formation. Unfortunately, without sacrificing code quality, there is no better way of forming counter-based loops except at the (late) IR level. This means that we need to recognize, at the IR level, anything which might turn into a function call (or indirect branch). Because this is currently a finite set of things, and because SelectionDAG lowering is basic-block local, this can be done. Nevertheless, it is fragile, and failure results in a miscompile. This verification pass checks that all (reachable) counter-based branches are dominated by a loop mtctr instruction, and that no instructions in between clobber the counter register. If these conditions are not satisfied, then an ICE will be triggered. In short, this is to help us sleep better at night. llvm-svn: 182295	2013-05-20 16:08:17 +00:00
Benjamin Kramer	927ca942ce	R600: Fix bug detected by GCC warning. R600TextureIntrinsicsReplacer.cpp:232: warning: the address of ‘ArgsType’ will always evaluate as ‘true’ This doesn't have any effect on the output as a vararg intrinsic behaves the same way as a non-vararg one. llvm-svn: 182293	2013-05-20 15:58:43 +00:00
Tom Stellard	f1ee716446	R600/SI: Use a multiclass for MUBUF_Load_Helper This will simplify the instructions and also the pattern definitions. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 182288	2013-05-20 15:02:31 +00:00
Tom Stellard	b8458f88d6	R600/SI: Add a pattern for S_LOAD_DWORDX2_* instructions Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 182287	2013-05-20 15:02:28 +00:00
Tom Stellard	d2eebf001e	R600/SI: Add pattern for rotr Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 182286	2013-05-20 15:02:24 +00:00
Tom Stellard	5643c4ac72	R600: Swap the legality of rotl and rotr The hardware supports rotr and not rotl. llvm-svn: 182285	2013-05-20 15:02:19 +00:00
Tom Stellard	1cfd7a50bb	R600/SI: Add patterns for 64-bit shift operations Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 182284	2013-05-20 15:02:12 +00:00
Tom Stellard	459a79a81c	R600/SI: Use the same names for VOP3 operands and encoding fields This makes it possible to reorder the operands without breaking the encoding. Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 182283	2013-05-20 15:02:08 +00:00
Tom Stellard	b35efba4d9	R600/SI: Make fitsRegClass() operands const Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 182282	2013-05-20 15:02:01 +00:00
Mihai Popa	f41e3f56a5	VSTn instructions have a number of encoding constraints which are not implemented. I have added these using wrapper methods around the original custom decoder (incidentally - this is a huge poorly written method that should be cleaned up. I have left it as is since the changes would be much to hard to review). llvm-svn: 182281	2013-05-20 14:57:05 +00:00
Mihai Popa	dcf0922720	Q registers are encoded in fields of the same length as D registers. As Q registers are half as many, the ARM reference manual mandates the least significant bit to be zeroed out. Failure to do so should result in an undefined instruction. With this change test/MC/Disassembler/ARM/invalid-VQADD-arm.txt is passing (removed XFAIL). llvm-svn: 182279	2013-05-20 14:42:43 +00:00
Richard Sandiford	312425f32d	[SystemZ] Add long branch pass Before this change, the SystemZ backend would use BRCL for all branches and only consider shortening them to BRC when generating an object file. E.g. a branch on equal would use the JGE alias of BRCL in assembly output, but might be shortened to the JE alias of BRC in ELF output. This was a useful first step, but it had two problems: (1) The z assembler isn't traditionally supposed to perform branch shortening or branch relaxation. We followed this rule by not relaxing branches in assembler input, but that meant that generating assembly code and then assembling it would not produce the same result as going directly to object code; the former would give long branches everywhere, whereas the latter would use short branches where possible. (2) Other useful branches, like COMPARE AND BRANCH, do not have long forms. We would need to do something else before supporting them. (Although COMPARE AND BRANCH does not change the condition codes, the plan is to model COMPARE AND BRANCH as a CC-clobbering instruction during codegen, so that we can safely lower it to a separate compare and long branch where necessary. This is not a valid transformation for the assembler proper to make.) This patch therefore moves branch relaxation to a pre-emit pass. For now, calls are still shortened from BRASL to BRAS by the assembler, although this too is not really the traditional behaviour. The first test takes about 1.5s to run, and there are likely to be more tests in this vein once further branch types are added. The feeling on IRC was that 1.5s is a bit much for a single test, so I've restricted it to SystemZ hosts for now. The patch exposes (and fixes) some typos in the main CodeGen/SystemZ tests. A later patch will remove the {{g}}s from that directory. llvm-svn: 182274	2013-05-20 14:23:08 +00:00
Justin Holewinski	01f89f0428	[NVPTX] Add GenericToNVVM IR converter to better handle idiomatic LLVM IR inputs This converter currently only handles global variables in address space 0. For these variables, they are promoted to address space 1 (global memory), and all uses are updated to point to the result of a cvta.global instruction on the new variable. The motivation for this is address space 0 global variables are illegal since we cannot declare variables in the generic address space. Instead, we place the variables in address space 1 and explicitly convert the pointer to address space 0. This is primarily intended to help new users who expect to be able to place global variables in the default address space. llvm-svn: 182254	2013-05-20 12:13:32 +00:00
Justin Holewinski	700b6fa934	[NVPTX] Fix i1 kernel parameters and global variables. ABI rules say we need to use .u8 for i1 parameters for kernels. llvm-svn: 182253	2013-05-20 12:13:28 +00:00
Stepan Dyatkovskiy	d0e34a200f	PR15868 fix. Introduction: In case when stack alignment is 8 and GPRs parameter part size is not N8: we add padding to GPRs part, so part's last byte must be recovered at address K8-1. We need to do it, since remained (stack) part of parameter starts from address K8, and we need to "attach" "GPRs head" without gaps to it: Stack: \|---- 8 bytes block ----\| \|---- 8 bytes block ----\| \|---- 8 bytes... [ [padding] [GPRs head] ] [ ------ Tail passed via stack ------ ... FIX: Note, once we added padding we need to correct all* Arg offsets that are going after padded one. That's why we need this fix: Arg offsets were never corrected before this patch. See new test-cases included in patch. We also don't need to insert padding for byval parameters that are stored in GPRs only. We need pad only last byval parameter and only in case it outsides GPRs and stack alignment = 8. Though, stack area, allocated for recovered byval params, must satisfy "Size mod 8 = 0" restriction. This patch reduces stack usage for some cases: We can reduce ArgRegsSaveArea since inner N*4 bytes sized byval params my be "packed" with alignment 4 in some cases. llvm-svn: 182237	2013-05-20 08:01:34 +00:00
Jakob Stoklund Olesen	f927800325	Also expand 64-bit bitcasts. llvm-svn: 182229	2013-05-20 01:01:43 +00:00
Jakob Stoklund Olesen	c7bc5fbc5c	Implement spill and fill of I64Regs. llvm-svn: 182228	2013-05-20 00:53:25 +00:00
Jakob Stoklund Olesen	751e9b8407	Mark i64 SETCC as expand so it is turned into a SELECT_CC. llvm-svn: 182227	2013-05-20 00:28:36 +00:00
Benjamin Kramer	8bad66e586	Replace some bit operations with simpler ones. No functionality change. llvm-svn: 182226	2013-05-19 22:01:57 +00:00
Jakob Stoklund Olesen	86c5469d26	Don't use %g0 to materialize 0 directly. The wired physreg doesn't work on tied operands like on MOVXCC. Add a README note to fix this later. llvm-svn: 182225	2013-05-19 21:47:13 +00:00
Jakob Stoklund Olesen	92ebf1153e	Select i64 values with %icc conditions. llvm-svn: 182224	2013-05-19 20:38:21 +00:00
Bob Wilson	111b0b6da4	Remove declaration of __clear_cache for __APPLE__. <rdar://problem/13924072> This fixes a bootstrapping problem with builds for Apple ARM targets. Clang had the wrong prototype for __clear_cache with ARM targets. Rafael fixed that in clang svn r181784 and r181810, but without those changes, we can't build this code for ARM because clang reports an error about the declaration in Memory.inc not matching the builtin declaration. Some of our buildbots need to use an older compiler that doesn't have the clang fix. Since __clear_cache is never used here when __APPLE__ is defined, I'm just conditionalizing the declaration to match that. I also moved the declaration of sys_icache_invalidate inside the conditional for __APPLE__ while I was at it. llvm-svn: 182223	2013-05-19 20:33:51 +00:00
Jakob Stoklund Olesen	7ca944b9db	Add floating point selects on %xcc predicates. llvm-svn: 182222	2013-05-19 20:33:11 +00:00

1 2 3 4 5 ...

61382 Commits