llvm-project

Commit Graph

Author	SHA1	Message	Date
Tim Northover	2e02ed253a	ARM: remove unused v(add\|sub)hn and vqdml[as]l intrinsics. Clang is now generating cleaner IR, so this removes the old variants which should be completely unused. llvm-svn: 189481	2013-08-28 14:33:33 +00:00
Tim Northover	8854ba7837	ARM: add patterns for vqdmlal with separate vqdmull and vqadds The vqdmlal and vqdmlls instructions are really just a fused pair consisting of a vqdmull.sN and a vqadd.sN. This adds patterns to LLVM so that we can switch Clang's CodeGen over to generating these instead of the special vqdmlal intrinsics. llvm-svn: 189480	2013-08-28 12:15:16 +00:00
Daniel Sanders	ce09d07824	[mips][msa] Added bnz.df, bnz.v, bz.df, and bz.v These intrinsics are legalized to V(ALL\|ANY)_(NON)?ZERO nodes, are matched as SN?Z_[BHWDV]_PSEUDO pseudo's, and emitted as a branch/mov sequence to evaluate to 0 or 1. Note: The resulting code is sub-optimal since it doesnt seem to be possible to feed the result of an intrinsic directly into a brcond. At the moment it uses (SETCC (VALL_ZERO $ws), 0, SETEQ) and similar which unnecessarily evaluates the boolean twice. llvm-svn: 189478	2013-08-28 12:14:50 +00:00
Daniel Sanders	e6ed5b72f1	[mips][msa] Added load/store intrinsics. llvm-svn: 189476	2013-08-28 12:04:29 +00:00
Alexey Samsonov	9b7e2b555c	80 cols llvm-svn: 189473	2013-08-28 11:25:12 +00:00
Elena Demikhovsky	9a5ed9c3bd	AVX-512: added SQRT, VRSQRT14, VCOMISS, VUCOMISS, VRCP14, VPABS llvm-svn: 189472	2013-08-28 11:21:58 +00:00
Daniel Sanders	ba9c8505fb	[mips][msa] Added move.v llvm-svn: 189471	2013-08-28 10:44:47 +00:00
Richard Sandiford	35b9be298a	[SystemZ] Add support for TMHH, TMHL, TMLH and TMLL For now just handles simple comparisons of an ANDed value with zero. The CC value provides enough information to do any comparison for a 2-bit mask, and some nonzero comparisons with more populated masks, but that's all future work. llvm-svn: 189469	2013-08-28 10:31:43 +00:00
Daniel Sanders	f9aa1d1902	[mips][msa] Added cfcmsa, and ctcmsa The MSA control registers have been added as reserved registers, and are only used via ISD::Copy(To\|From)Reg. The intrinsics are lowered into these nodes. llvm-svn: 189468	2013-08-28 10:26:24 +00:00
Daniel Sanders	0dc0dd464b	[mips][msa] Added f[cs]af, f[cs]or, f[cs]ueq, f[cs]ul[et], f[cs]une, fsun, ftrunc_[su], hadd_[su], hsub_[su], sr[al]r, sr[al]ri llvm-svn: 189467	2013-08-28 10:12:09 +00:00
Richard Sandiford	be133a8757	[SystemZ] Extend memcmp support to all constant lengths This uses the infrastructure added for memcpy and memmove in r189331. llvm-svn: 189458	2013-08-28 09:01:51 +00:00
Alexey Samsonov	a3a037df63	Fix use of uninitialized value added in r189400 (found by MemorySanitizer) llvm-svn: 189456	2013-08-28 08:30:47 +00:00
Ted Kremenek	b33f944f4e	Revert r189442 "Change default # of digits for APFloat::toString" This is breaking numerous Clang tests on the buildbot. llvm-svn: 189447	2013-08-28 06:21:46 +00:00
Eli Friedman	14cede2829	Change default # of digits for APFloat::toString The previous default was almost, but not quite enough digits to represent a floating-point value in a manner which preserves the representation when it's read back in. The larger default is much less confusing. I spent some time looking into printing exactly the right number of digits if a precision isn't specified, but it's kind of complicated, and I'm not really sure I understand what APFloat::toString is supposed to output for FormatPrecision != 0 (or maybe the current API specification is just silly, not sure which). I have a WIP patch if anyone is interested. llvm-svn: 189442	2013-08-28 05:23:51 +00:00
Eric Christopher	62caa709fe	Remove support for the .debug_inlined section. No known software in use supports it. llvm-svn: 189439	2013-08-28 04:04:28 +00:00
NAKAMURA Takumi	19675898da	X86JITInfo.cpp: Apply x64 version of X86CompilationCallback() to Cygwin64. For now, (defined(X86_64_JIT) && defined(__CYGWIN__)) satisfies Cygwin64. llvm-svn: 189437	2013-08-28 03:04:09 +00:00
NAKAMURA Takumi	9ea7c6d463	X86Subtarget.h: Recognize x86_64-cygwin. In the LLVM side, x86_64-cygwin is almost as same as x86_64-mingw32. llvm-svn: 189436	2013-08-28 03:04:02 +00:00
Argyrios Kyrtzidis	aae63a0ce6	[BumpPtrAllocator] Move DefaultSlabAllocator to a member of BumpPtrAllocator, instead of a static variable. The problem with having DefaultSlabAllocator being a global static is that it is undefined if BumpPtrAllocator will be usable during global initialization because it is not guaranteed that DefaultSlabAllocator will be initialized before BumpPtrAllocator is created and used. llvm-svn: 189433	2013-08-28 01:02:21 +00:00
Akira Hatanaka	9bfa2e2e7f	[mips] Use ptr_rc to simplify definitions of base+index load/store instructions. Also, fix predicates. llvm-svn: 189432	2013-08-28 00:55:15 +00:00
Akira Hatanaka	37e9b0dbb2	[mips] Clean up definitions of move word from/to coprocessor instructions. No functionality change. llvm-svn: 189431	2013-08-28 00:42:50 +00:00
Akira Hatanaka	62005d69a7	[mips] Set isAllocatable and CoveredBySubRegs. llvm-svn: 189430	2013-08-28 00:34:17 +00:00
Eric Christopher	e9fd605b41	Add a TODO here. llvm-svn: 189428	2013-08-28 00:13:08 +00:00
Eric Christopher	d033d6fb88	Add support for DW_FORM_dataN and DW_FORM_udata to the DIE hashing algorithm. Update the split dwarf hashing testcase accordingly - this should be the last time that the hash of an empty file changes. llvm-svn: 189427	2013-08-28 00:10:38 +00:00
Rui Ueyama	c3779ff83b	Revert "Option parsing: support case-insensitive option matching." as it broke Windows buildbot. This reverts r189416. llvm-svn: 189424	2013-08-28 00:02:06 +00:00
Eric Christopher	9d1daa87e7	Use DW_FORM_sdata for signed constant values and udata on occasion when we can. Migrate from using blocks when we're adding just a single attribute and floating point values are an unsigned, not signed, bag of bits. Update all test cases accordingly. llvm-svn: 189419	2013-08-27 23:49:04 +00:00
Rui Ueyama	7159bd9dcb	Option parsing: support case-insensitive option matching. Link.exe's command line options are case-insensitive. This patch adds a new attribute to OptTable to let the option parser to compare options, ignoring case. Command lines are generally case-insensitive on Windows. CL.exe is an exception. So this new attribute should be useful for other commands running on Windows. Differential Revision: http://llvm-reviews.chandlerc.com/D1485 llvm-svn: 189416	2013-08-27 23:47:01 +00:00
Manman Ren	547467b82d	DIBuilder: take an optional StringRef to pass in unique identifier. createClassType, createStructType, createUnionType, createEnumerationType, and createForwardDecl will take an optional StringRef to pass in the unique identifier. llvm-svn: 189410	2013-08-27 23:06:40 +00:00
Peter Collingbourne	28a10aff48	DataFlowSanitizer: Implement trampolines for function pointers passed to custom functions. Differential Revision: http://llvm-reviews.chandlerc.com/D1503 llvm-svn: 189408	2013-08-27 22:09:06 +00:00
David Majnemer	aa34d79ab5	[ms-inline asm] Support offsets after segment registers Summary: MASM let's you do stuff like 'MOV FS:20, EAX' and 'MOV EAX, FS:20' Reviewers: craig.topper, rnk Reviewed By: rnk CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1470 llvm-svn: 189407	2013-08-27 21:56:17 +00:00
Joerg Sonnenberger	b822af4721	Given target assembler parsers a chance to handle variant expressions first. Use this to turn the PPC modifiers into PPC specific expressions, allowing them to work on constants. llvm-svn: 189400	2013-08-27 20:23:19 +00:00
Jack Carter	4e07b95daf	Changed comment llvm-svn: 189396	2013-08-27 19:45:28 +00:00
Nadav Rotem	6b41f7cc4c	Refactor 'vectorizeLoop' no functionality change. This patch merges LoopVectorize of InnerLoopVectorizer and InnerLoopUnroller by adding checks for VF=1. This helps in erasing the Unroller code that is almost identical to the InnerLoopVectorizer code. llvm-svn: 189391	2013-08-27 18:52:47 +00:00
Joey Gouly	e6d165ccb4	[ARMv8] Add MC support for the new load/store acquire/release instructions. llvm-svn: 189388	2013-08-27 17:38:16 +00:00
Elena Demikhovsky	93eeb47d49	AVX-512: added conversion instructions. llvm-svn: 189349	2013-08-27 13:54:04 +00:00
Tim Northover	819bfb5a25	DAGCombiner: make sure or/shl/srl really has zero high bits before forming bswap We want to convert code like (or (srl N, 8), (shl N, 8)) into (srl (bswap N), const), but this is only valid if the bits above 16 on the source pattern are 0, the checks we were doing on this were slightly wrong before. llvm-svn: 189348	2013-08-27 13:46:45 +00:00
Joey Gouly	a710d810f5	[ARMv8] Add some negative tests for the recent VFP/NEON instructions. Fix two issues I found while writing these tests. llvm-svn: 189341	2013-08-27 11:24:16 +00:00
Tim Northover	449d390f40	ARM: add natural patterns for vaddhl and vsubhl. These instructions aren't particularly complicated and it's well worth having patterns for some reasonably useful LLVM IR that will match them. Soon we should be able to switch Clang over to producing this natural version. llvm-svn: 189335	2013-08-27 10:31:36 +00:00
Daniel Sanders	b8bce4d935	[mips][msa] Added spill/reload support llvm-svn: 189332	2013-08-27 10:04:21 +00:00
Richard Sandiford	5e318f0bfe	[SystemZ] Extend memcpy and memset support to all constant lengths Lengths up to a certain threshold (currently 6 * 256) use a series of MVCs. Lengths above that threshold use a loop to handle X*256 bytes followed by a single MVC to handle the excess (if any). This loop will also be needed in future when support for variable lengths is added. Because the same tablegen classes are used to define MVC and CLC, the patch also has the side-effect of defining a pseudo loop instruction for CLC. That instruction isn't used yet (and wouldn't be handled correctly if it were). I'm planning to use it soon though. llvm-svn: 189331	2013-08-27 09:54:29 +00:00
Daniel Sanders	70835f6025	[mips][msa] Added bitconverts for vector types for big and little-endian llvm-svn: 189330	2013-08-27 09:40:30 +00:00
Alexey Samsonov	e3ba81bf19	Add support for DebugFission to DWARF parser Summary: 1) Make llvm-symbolizer properly symbolize files with split debug info (by using stanalone .dwo files). 2) Make DWARFCompileUnit parse and store corresponding .dwo file, if necessary. 3) Make bits of DWARF parsing more CompileUnit-oriented. Reviewers: echristo Reviewed By: echristo CC: bkramer, llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1164 llvm-svn: 189329	2013-08-27 09:20:22 +00:00
Elena Demikhovsky	12f24673e0	AVX-512: Added FMA instructions. llvm-svn: 189326	2013-08-27 08:39:25 +00:00
Sylvestre Ledru	fb992ab385	Fix the build issue under ia64. Close bug #5715 Thanks to Luca Falavigna for the help and most of the patch. llvm-svn: 189324	2013-08-27 06:49:46 +00:00
Charles Davis	1827bd8a6c	Revert "Fix the build broken by r189315." and "Move everything depending on Object/MachOFormat.h over to Support/MachO.h." This reverts commits r189319 and r189315. r189315 broke some tests on what I believe are big-endian platforms. llvm-svn: 189321	2013-08-27 05:38:30 +00:00
Charles Davis	0c6f71b40d	Move everything depending on Object/MachOFormat.h over to Support/MachO.h. llvm-svn: 189315	2013-08-27 05:00:43 +00:00
Charles Davis	74ec8b0942	Support/MachO: Add a bunch of defines. Right now we have two headers for the Mach-O format. I'd like to get rid of one. Since the other object formats are all in Support, I chose to keep the Mach-O header in Support, and discard the other one. llvm-svn: 189314	2013-08-27 05:00:13 +00:00
Michael Gottesman	eab9a7fa7c	Fixed typo. Noticed by Stephen Checkoway <s@pahtak.org>. llvm-svn: 189312	2013-08-27 04:43:03 +00:00
Kai Nacke	1b7e4866f4	Fix wrong code offset for unwind code SET_FPREG. The code offset for unwind code SET_FPREG is wrong because it is set to constant 0. The fix is to do the same as for the other unwind codes: emit a label and later the absolute difference between the label and the begin of the prologue. Also enables the failing test case MC/COFF/seh.s Reviewed by Jim Grosbach, Charles Davis and Nico Rieck. llvm-svn: 189309	2013-08-27 04:16:16 +00:00
Owen Anderson	a0260f848d	Remove an over-zealous assertion. A pointer type could be illegal if the target is prepared to custom-legalize pointer operands. This assertion was evaluated before the target would have a chance to do so, making it impossible. llvm-svn: 189299	2013-08-27 00:28:23 +00:00
Eric Christopher	ca68bbf5c0	Formatting. llvm-svn: 189296	2013-08-26 23:58:22 +00:00
Eric Christopher	6b16b43ef9	Make the lifetime of the DICompileUnit we're constructing from the MDNode more clear as just for a single argument. llvm-svn: 189294	2013-08-26 23:57:03 +00:00
Eric Christopher	6fdf324f44	Have the skeleton compile unit construction method take the CU it is constructing from as an input and keep the same unique identifier. We can use this to connect items which must stay in the .o file (e.g. pubnames and pubtypes) to the skeleton cu rather than having duplicate unique numbers for the sections and needing to do lookups based on MDNode. llvm-svn: 189293	2013-08-26 23:50:43 +00:00
Eric Christopher	6d13fe007f	Remove duplicate set of CompilationDir. llvm-svn: 189292	2013-08-26 23:50:40 +00:00
Eric Christopher	bfceb2fe8f	Remove the language parameter and variable from the compile unit. We can get it via the MDNode that's passed in. Save that instead. llvm-svn: 189291	2013-08-26 23:50:38 +00:00
Matt Arsenault	5faa669b66	Fix lint assert on integer vector division llvm-svn: 189290	2013-08-26 23:29:33 +00:00
Eric Christopher	4d36ca009f	Treat the pubtypes section similarly to the pubnames section and emit it by default under linux or when we're trying to keep compatibility with old gdb versions. Fix testcase for option name change. llvm-svn: 189289	2013-08-26 23:24:35 +00:00
Eric Christopher	bf1ea3c727	Only emit the section sym if we're emitting the section. llvm-svn: 189288	2013-08-26 23:24:31 +00:00
Matt Arsenault	ed9f76d37b	Fix inserting instructions before last in bundle. The builder inserts from before the insert point, not after, so this would insert before the last instruction in the bundle instead of after it. I'm not sure if this can actually be a problem with any of the current insertions. llvm-svn: 189285	2013-08-26 23:08:37 +00:00
Manman Ren	0ed70aeb85	Debug Info: add an identifier field to DICompositeType. DICompositeType will have an identifier field at position 14. For now, the field is set to null in DIBuilder. For DICompositeTypes where the template argument field (the 13th field) was optional, modify DIBuilder to make sure the template argument field is set. Now DICompositeType has 15 fields. Update DIBuilder to use NULL instead of "i32 0" for null value of a MDNode. Update verifier to check that DICompositeType has 15 fields and the last field is null or a MDString. Update testing cases to include an extra field for DICompositeType. The identifier field will be used by type uniquing so a front end can genearte a DICompositeType with a unique identifer. llvm-svn: 189282	2013-08-26 22:39:55 +00:00
Nadav Rotem	bdc9ff4498	LoopVectorize: Implement partial loop unrolling when vectorization is not profitable. This patch enables unrolling of loops when vectorization is legal but not profitable. We add a new class InnerLoopUnroller, that extends InnerLoopVectorizer and replaces some of the vector-specific logic with scalars. This patch does not introduce any runtime regressions and improves the following workloads: SingleSource/Benchmarks/Shootout/matrix -22.64% SingleSource/Benchmarks/Shootout-C++/matrix -13.06% External/SPEC/CINT2006/464_h264ref/464_h264ref -3.99% SingleSource/Benchmarks/Adobe-C++/simple_types_constant_folding -1.95% llvm-svn: 189281	2013-08-26 22:33:26 +00:00
Eric Christopher	5297df025c	Fix thinko. llvm-svn: 189279	2013-08-26 20:58:35 +00:00
Jim Grosbach	667b147dba	ARM: Constrain regclass for TSTri instruction. Get the register class right for the TST instruction. This keeps the machine verifier happy, enabling us to turn it on for another test. rdar://12594152 llvm-svn: 189274	2013-08-26 20:22:05 +00:00
Bill Schmidt	8c3976eca1	Dummy code to silence warning from 4189266 llvm-svn: 189272	2013-08-26 20:11:46 +00:00
Jim Grosbach	5f71aab12e	ARM: FastISel verifier error cleanup. Constant pool and global value reference instructions need more restricted register classes than plain GPR. rdar://12594152 llvm-svn: 189270	2013-08-26 20:07:29 +00:00
Jim Grosbach	08aa534239	ARM: Fix ELF global base reg intialization. The create machine code wasn't properly in SSA, which the machine verifier properly complains about. Now that fast-isel is closer to verifier clean, errors like this show up more clearly. Additionally, the Thumb pseudo tPICADD was used for both ARM and Thumb mode functions, which is obviously wrong. Fix that along the way. Test case is part of the following commit which will finish making an additional fast-isel test verifier clean an enable it for the regression test suite. This commit is separate since its not just a verifier cleanup, but an actual correctness issue. rdar://12594152 (for the fast-isel verifier aspects) llvm-svn: 189269	2013-08-26 20:07:25 +00:00
Bill Schmidt	d89f678cfd	[PowerPC] More fast-isel chunks (returns and integer extends) Incremental improvement to fast-isel for PPC64. This allows us to select on ret, sext, and zext. Filling in sext/zext improves some of the existing logic in handling compare-immediates that needed extends. A simplified return convention for fast-isel is also added to the PPC64 calling conventions. All call/return processing for DAG selection is handled with custom code, so there isn't an existing CC to rely on here. The include of PPCGenCallingConv.inc causes compiler warnings due to the 32-bit calling conventions that are not used, so the dummy function "usePPC32CCs()" is added here to silence those. Test cases for the return and extend logic are added. llvm-svn: 189266	2013-08-26 19:42:51 +00:00
Yi Jiang	7107d41574	test commit. Remove blank line llvm-svn: 189265	2013-08-26 18:57:55 +00:00
Matt Arsenault	bcd8c577d7	Fix unused variable in release build llvm-svn: 189264	2013-08-26 18:38:29 +00:00
Matt Arsenault	8f21c838c0	Constify functions llvm-svn: 189234	2013-08-26 17:56:38 +00:00
Matt Arsenault	39274be65f	Vectorize starting from insertelements building a vector llvm-svn: 189233	2013-08-26 17:56:35 +00:00
Tom Stellard	838e2344ec	SelectionDAG: Remove unnecessary uses of TargetLowering::getPointerTy() If we have a binary operation like ISD:ADD, we can set the result type equal to the result type of one of its operands rather than using TargetLowering::getPointerTy(). Also, any use of DAG.getIntPtrConstant(C) as an operand for a binary operation can be replaced with: DAG.getConstant(C, OtherOperand.getValueType()); llvm-svn: 189227	2013-08-26 15:06:10 +00:00
Tom Stellard	35bb18c2a7	R600: Add support for vector local memory loads llvm-svn: 189226	2013-08-26 15:06:04 +00:00
Tom Stellard	c6f4a29ed5	R600: Add support for i8 and i16 local memory loads llvm-svn: 189225	2013-08-26 15:05:59 +00:00
Tom Stellard	7da047c9fb	SelectionDAG: Use correct pointer size when splitting vector stores llvm-svn: 189224	2013-08-26 15:05:55 +00:00
Tom Stellard	f3d166aa1e	R600: Add support for i8 and i16 local memory stores llvm-svn: 189223	2013-08-26 15:05:49 +00:00
Tom Stellard	2ffc330673	R600: Add support for v4i32 and v2i32 local stores llvm-svn: 189222	2013-08-26 15:05:44 +00:00
Tom Stellard	fd155828ed	SelectionDAG: Use correct pointer size when lowering function arguments v2 This adds minimal support to the SelectionDAG for handling address spaces with different pointer sizes. The SelectionDAG should now correctly lower pointer function arguments to the correct size as well as generate the correct code when lowering getelementptr. This patch also updates the R600 DataLayout to use 32-bit pointers for the local address space. v2: - Add more helper functions to TargetLoweringBase - Use CHECK-LABEL for tests llvm-svn: 189221	2013-08-26 15:05:36 +00:00
Elena Demikhovsky	0a2b6290f1	AVX-512: Added shuffle instructions - VPSHUFD, VPERMILPS, VMOVDDUP, VMOVLHPS, VMOVHLPS, VSHUFPS, VALIGN single and double forms. llvm-svn: 189215	2013-08-26 12:45:35 +00:00
Vladimir Medic	8277c1874a	This patch implements trap instructions for mips. The test cases are added. llvm-svn: 189213	2013-08-26 10:02:40 +00:00
Craig Topper	6269f49505	Make sure x86 instructions using ssmem/sdmem operand types are only able to parse memory operands of the proper size in Intel syntax. Primarily affects some of sse cvt instructions. llvm-svn: 189206	2013-08-26 00:39:04 +00:00
Craig Topper	5500d83793	Remove some unnecessary PredicateMethod overrides. Add RenderMethod overrides to remove forwarding in the X86AsmParser code itself. No functional change. llvm-svn: 189205	2013-08-26 00:13:09 +00:00
Craig Topper	8c26c424c6	Put some of the AVX-512 parsing stuff in a more consistent place with the existing functions. llvm-svn: 189204	2013-08-25 23:18:05 +00:00
Bill Schmidt	0300813d6a	[PowerPC] Add fast-isel branch and compare selection. First chunk of actual fast-isel selection code. This handles direct and indirect branches, as well as feeding compares for direct branches. PPCFastISel::PPCEmitIntExt() is just roughed in and will be expanded in a future patch. This also corrects a problem with selection for constant pool entries in JIT mode or with small code model. llvm-svn: 189202	2013-08-25 22:33:42 +00:00
Craig Topper	1885417372	First round of fixes for the x86 fixes for the x86 move accumulator from/to memory offset instructions. -Assembly parser now properly check the size of the memory operation specified in intel syntax. So 'mov word ptr [5], al' is no longer accepted. -x86-32 disassembly of these instructions no longer sign extends the 32-bit address immediate based on size. -Intel syntax printing prints the ptr size and places brackets around the address immediate. Known remaining issues with these instructions: -Segment override prefix is not supported. PR16962 and PR16961. -Immediate size should be changed by address size prefix. llvm-svn: 189201	2013-08-25 22:23:38 +00:00
Venkatraman Govindaraju	35e0c382d5	[Sparc] Add long double (f128) instructions to sparc backend. llvm-svn: 189198	2013-08-25 18:30:06 +00:00
Venkatraman Govindaraju	12d8089b8e	[Sparc] Added V9's extra floating point registers and their aliases. llvm-svn: 189195	2013-08-25 17:03:02 +00:00
Elena Demikhovsky	f8f478b19d	AVX-512: added UNPACK instructions and tests for all-zero/all-ones vectors llvm-svn: 189189	2013-08-25 12:54:30 +00:00
Chandler Carruth	ba689b3315	Fix a bug where we would corrupt the offset when evaluating a non-constant GEP. I don't have any test case that demonstrates this, Nadav (indirectly) pointed this out in code review. I'm not sure how possible it is to contrive a test case for the current users of this code that triggers the bad issue sadly. llvm-svn: 189188	2013-08-25 10:46:39 +00:00
David Majnemer	b78df507c8	AsmPrinter: Get rid of llvm$workaround$fake$stub$ We currently emit labels with the prefix Lllvm$workaround$fake$stub$ if the target's MCAsmInfo has getLinkOnceDirective() mapped to something interesting. This was apparently a work around introduced in r31033 for binutils that we don't need anymore. llvm-svn: 189187	2013-08-25 09:18:19 +00:00
Reed Kotler	7d0fb7ebd5	Start to add the LLVM builtins to the mips16 exclusion lists for fp. I need to add the rest of these to the list or else to delay putting out the actual stub until later in code generation when I know if the external function ever got emitted Resubmit this patch. The target triple needs to be added to the test so that clang does not tell the backend the wrong target when the host is BSD. There is a clang bug in here somewhere that I need to track down. At Mips this has been filed internally as a bug. llvm-svn: 189186	2013-08-25 02:40:25 +00:00
Craig Topper	0714d51cba	Add hasSideEffects/mayLoad/mayStore flags to the X86 moffs8/moffs16/moffs32/moffs64 versions of move. llvm-svn: 189182	2013-08-24 20:31:14 +00:00
Matt Arsenault	8405888af1	Check if in set on insertion instead of separately llvm-svn: 189179	2013-08-24 19:55:38 +00:00
Craig Topper	092e2fe426	Remove trailing whitespace. llvm-svn: 189178	2013-08-24 19:50:11 +00:00
Shuxin Yang	b64ab41936	Revert 189161 llvm-svn: 189176	2013-08-24 17:53:16 +00:00
Jakub Staszak	07f383f87a	Remove trailing spaces. llvm-svn: 189173	2013-08-24 14:16:00 +00:00
Benjamin Kramer	b12cf01908	Add a function object to compare the first or second component of a std::pair. Replace instances of this scattered around the code base. llvm-svn: 189169	2013-08-24 12:54:27 +00:00
Benjamin Kramer	260de74e48	Simplify code. No functionality change. llvm-svn: 189168	2013-08-24 12:15:54 +00:00
Benjamin Kramer	892daba8d3	DwarfDebug: Delete orphaned children. Leak found by valgrind. llvm-svn: 189167	2013-08-24 11:55:49 +00:00
Dmitri Gribenko	292c9200fc	Added const qualifier to StringRef::edit_distance member function Patch by Ismail Pazarbasi. llvm-svn: 189162	2013-08-24 01:50:41 +00:00
Reed Kotler	e531cbaa86	Start to add the builtind to the mips16 exclusion lists for fp. I need to add the rest of these to the list or else to delay putting out the actual stub until later in code generation when I know if the external function ever got emitted. llvm-svn: 189161	2013-08-24 01:24:44 +00:00
Justin Holewinski	aaa8b6e355	[NVPTX] Re-enable assembly printing support for inline assembly This support was removed by accident during the MC conversion llvm-svn: 189160	2013-08-24 01:17:23 +00:00
Manman Ren	00335ea34e	DebugInfoFinder: handle imported entities of a CU. llvm-svn: 189158	2013-08-24 00:32:12 +00:00
Rafael Espindola	94a2c5642d	Rename features to match what gcc and clang use. There is no advantage in being different and using the same names simplifies clang a bit. llvm-svn: 189141	2013-08-23 20:21:34 +00:00
Peter Collingbourne	a96296f3ab	DataFlowSanitizer: correctly combine labels in the case where they are equal. llvm-svn: 189133	2013-08-23 18:45:06 +00:00
Manman Ren	5477cfb592	DebugInfoFinder: handle template params of a DISubprogram. llvm-svn: 189131	2013-08-23 18:36:18 +00:00
Andrew Trick	475a9911ca	PrintVRegOrUnit llvm-svn: 189124	2013-08-23 17:48:53 +00:00
Andrew Trick	e4c1ba762d	Rename to RegPressure API parameters RegUnits. llvm-svn: 189123	2013-08-23 17:48:51 +00:00
Andrew Trick	01bc216482	Simplify RegPressure helpers. llvm-svn: 189122	2013-08-23 17:48:48 +00:00
Andrew Trick	86a7061e5d	Add a convenient PSetIterator for visiting pressure sets affected by a register. llvm-svn: 189121	2013-08-23 17:48:46 +00:00
Andrew Trick	c01b00400d	Adds cyclic critical path computation and heuristics, temporarily disabled. Estimate the cyclic critical path within a single block loop. If the acyclic critical path is longer, then the loop will exhaust OOO resources after some number of iterations. If lag between the acyclic critical path and cyclic critical path is longer the the time it takes to issue those loop iterations, then aggressively schedule for latency. llvm-svn: 189120	2013-08-23 17:48:43 +00:00
Andrew Trick	8dd26f002f	MI Sched: record local vreg uses. This will be used to compute the cyclic critical path and to update precomputed per-node pressure differences. In the longer term, it could also be used to speed up LiveInterval update by avoiding visiting all global vreg users. llvm-svn: 189118	2013-08-23 17:48:39 +00:00
Andrew Trick	a53e101627	mi-sched: Don't call MBB.size() in initSUnits. The driver already has instr count. This fixes a pathological compile time problem with very large blocks and lots of scheduling boundaries. llvm-svn: 189116	2013-08-23 17:48:33 +00:00
Jim Cownie	b09bb1ce19	Checking commit access; added one space llvm-svn: 189111	2013-08-23 15:51:37 +00:00
Joey Gouly	c7cda1c59e	[ARM] Fix another ARM FastISel -verify-machineinstrs issue. llvm-svn: 189109	2013-08-23 15:20:56 +00:00
Evgeniy Stepanov	d42863cc1f	[msan] Fix handling of va_arg overflow area on x86_64. The code was erroneously reading overflow area shadow from the TLS slot, bypassing the local copy. Reading shadow directly from TLS is wrong, because it can be overwritten by a nested vararg call, if that happens before va_start. llvm-svn: 189104	2013-08-23 12:11:00 +00:00
Joey Gouly	e3dd684aad	[ARMv8] Add CodeGen for VMAXNM/VMINNM. llvm-svn: 189103	2013-08-23 12:01:13 +00:00
Andrea Di Biagio	377496bbad	Add function attribute 'optnone'. This function attribute indicates that the function is not optimized by any optimization or code generator passes with the exception of interprocedural optimization passes. llvm-svn: 189101	2013-08-23 11:53:55 +00:00
Richard Sandiford	03481334b5	[SystemZ] Add basic prefetch support Just the instructions and intrinsics for now. llvm-svn: 189100	2013-08-23 11:36:42 +00:00
Richard Sandiford	24e597b8c5	[SystemZ] Try reversing comparisons whose first operand is in memory This allows us to make more use of the many compare reg,mem instructions. llvm-svn: 189099	2013-08-23 11:27:19 +00:00
Richard Sandiford	a481f58542	[SystemZ] Prefer LHI;ST... over LAY;MV... If we had a store of an integer to memory, and the integer and store size were suitable for a form of MV..., we used MV... no matter what. We could then have sequences like: lay %r2, 0(%r3,%r4) mvi 0(%r2), 4 In these cases it seems better to force the constant into a register and use a normal store: lhi %r2, 4 stc %r2, 0(%r3, %r4) since %r2 is more likely to be hoisted and is easier to rematerialize. llvm-svn: 189098	2013-08-23 11:18:53 +00:00
Richard Sandiford	37cd6cfba2	Turn MipsOptimizeMathLibCalls into a target-independent scalar transform ...so that it can be used for z too. Most of the code is the same. The only real change is to use TargetTransformInfo to test when a sqrt instruction is available. The pass is opt-in because at the moment it only handles sqrt. llvm-svn: 189097	2013-08-23 10:27:02 +00:00
Tim Northover	1f1b2756a4	ARM: make sure ARM-mode pseudo-inst requires IsARM I'd forgotten that "Requires" blocks override rather than add to the constraints, so my pseudo-instruction was being selected in Thumb mode leading to nonsense instructions. rdar://problem/14817358 llvm-svn: 189096	2013-08-23 10:16:39 +00:00
Daniel Sanders	3c9a0ad444	[mips][msa] Split MSA128 regset into size-specific sets containing the same registers. llvm-svn: 189095	2013-08-23 10:10:13 +00:00
Alexey Samsonov	6dae24df16	80 cols llvm-svn: 189091	2013-08-23 07:42:51 +00:00
Alexey Samsonov	a9debbfb01	Make DWARFCompileUnit non-copyable Summary: This is a part of D1164. DWARFCompileUnit is not that lightweight to copy it around, and we want it to own corresponding .dwo compile unit eventually. Reviewers: echristo Reviewed By: echristo CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1298 llvm-svn: 189089	2013-08-23 06:56:01 +00:00
Jakob Stoklund Olesen	0c00704f27	Use register masks on SPARC call instructions. llvm-svn: 189085	2013-08-23 02:33:47 +00:00
Jakob Stoklund Olesen	a8960a1f7c	Add an OtherPreserved field to the CalleeSaved TableGen class. This field specifies registers that are preserved across function calls, but that should not be included in the generates SaveList array. This can be used ot generate regmasks for architectures that save registers through other means, like SPARC's register windows. llvm-svn: 189084	2013-08-23 02:25:47 +00:00
Michael Gottesman	823aaffd37	Update StripDeadDebugInfo to use DebugInfoFinder so that it is no longer stale to the point of not working and more resilient to debug info changes. The current version of StripDeadDebugInfo became stale and no longer actually worked since it was expecting an older version of debug info. This patch updates it to use DebugInfoFinder and the modern DebugInfo classes as much as possible to make it more redundent to such changes. Additionally, the only place where that was avoided (the code where we replace the old sets with the new), I call verify on the DIContextUnit implying that if the format changes and my live set changes no longer make sense an assert will be hit. In order to ensure that that occurs I have included a test case. The actual stripping of the dead debug info follows the same strategy as was used before in this class: find the live set and replace the old set in the given compile unit (which may contain dead global variables/functions) with the new live one. llvm-svn: 189078	2013-08-23 00:23:24 +00:00
Michael Gottesman	20f25eb958	[stack protector] Work around an issue with the BMOVPCB_CALL instruction on ARM by disabling does not return on __stack_chk_fail. This is to fix the bots while I look to see if there is something I can do here. rdar://14811848 llvm-svn: 189076	2013-08-22 23:45:24 +00:00
Bill Wendling	fe88aea706	Check only if we have this attribute. If it's not an attribute, then it's assumed false. llvm-svn: 189063	2013-08-22 21:16:14 +00:00
Tom Stellard	15e4811455	R600/SI: Fix another case of illegal VGPR to SGPR copy This fixes a crash in Unigine Tropics. https://bugs.freedesktop.org/show_bug.cgi?id=68389 llvm-svn: 189057	2013-08-22 20:21:02 +00:00
Peter Collingbourne	34f0c313e2	DataFlowSanitizer: Replace non-instrumented aliases of instrumented functions, and vice versa, with wrappers. Differential Revision: http://llvm-reviews.chandlerc.com/D1442 llvm-svn: 189054	2013-08-22 20:08:15 +00:00
Peter Collingbourne	761a4fc475	DataFlowSanitizer: Factor the wrapper builder out to buildWrapperFunction. Differential Revision: http://llvm-reviews.chandlerc.com/D1441 llvm-svn: 189053	2013-08-22 20:08:11 +00:00
Peter Collingbourne	59b1262d01	DataFlowSanitizer: Prefix the name of each instrumented function with "dfs$". DFSan changes the ABI of each function in the module. This makes it possible for a function with the native ABI to be called with the instrumented ABI, or vice versa, thus possibly invoking undefined behavior. A simple way of statically detecting instances of this problem is to prepend the prefix "dfs$" to the name of each instrumented-ABI function. This will not catch every such problem; in particular function pointers passed across the instrumented-native barrier cannot be used on the other side. These problems could potentially be caught dynamically. Differential Revision: http://llvm-reviews.chandlerc.com/D1373 llvm-svn: 189052	2013-08-22 20:08:08 +00:00
Joey Gouly	881eab53be	[ARMv8] Add CodeGen support for VSEL. This uses the ARMcmov pattern that Tim cleaned up in r188995. Thanks to Simon Tatham for his floating point help! llvm-svn: 189024	2013-08-22 15:29:11 +00:00
NAKAMURA Takumi	7b5d4f97a0	[Win32] mapped_file_region: Fix a bug in CreateFileMapping() that Size must contain Offset when Offset >= 65536. llvm-svn: 189021	2013-08-22 15:14:53 +00:00
NAKAMURA Takumi	edf7615332	Whitespace. llvm-svn: 189020	2013-08-22 15:14:45 +00:00
Mihai Popa	5500c0ff89	Fix ARM vcvt encoding when the number of fractional bits is zero. The instruction to convert between floating point and fixed point representations takes an immediate operand for the number of fractional bits of the fixed point value. ARMARM specifies that when that number of bits is zero, the assembler should encode floating point/integer conversion instructions. This patch adds the necessary instruction aliases to achieve this behaviour. llvm-svn: 189009	2013-08-22 13:16:07 +00:00
Chandler Carruth	1c34afcb61	Teach the SLP vectorizer the correct way to check for consecutive access using GEPs. Previously, it used a number of different heuristics for analyzing the GEPs. Several of these were conservatively correct, but failed to fall back to SCEV even when SCEV might have given a reasonable answer. One was simply incorrect in how it was formulated. There was good code already to recursively evaluate the constant offsets in GEPs, look through pointer casts, etc. I gathered this into a form code like the SLP code can use in a previous commit, which allows all of this code to become quite simple. There is some performance (compile time) concern here at first glance as we're directly attempting to walk both pointers constant GEP chains. However, a couple of thoughts: 1) The very common cases where there is a dynamic pointer, and a second pointer at a constant offset (usually a stride) from it, this code will actually not do any unnecessary work. 2) InstCombine and other passes work very hard to collapse constant GEPs, so it will be rare that we iterate here for a long time. That said, if there remain performance problems here, there are some obvious things that can improve the situation immensely. Doing a vectorizer-pass-wide memoizer for each individual layer of pointer values, their base values, and the constant offset is likely to be able to completely remove redundant work and strictly limit the scaling of the work to scrape these GEPs. Since this optimization was not done on the prior version (which would still benefit from it), I've not done it here. But if folks have benchmarks that slow down it should be straight forward for them to add. I've added a test case, but I'm not really confident of the amount of testing done for different access patterns, strides, and pointer manipulation. llvm-svn: 189007	2013-08-22 12:45:17 +00:00
Joey Gouly	e1de9e9c33	[ARM] Constrain some register classes in EmitAtomicBinary64 so that we pass these tests with -verify-machineinstrs. llvm-svn: 189006	2013-08-22 12:19:24 +00:00
Elena Demikhovsky	c35219e3ee	AVX-512: Added masked SHIFT commands, more encoding tests llvm-svn: 189005	2013-08-22 12:18:28 +00:00
Logan Chien	2361f51e82	Fix ARM FastISel PIC function call. The function call to external function should come with PLT relocation type if the PIC relocation model is used. llvm-svn: 189002	2013-08-22 12:08:04 +00:00
Chandler Carruth	989e630871	Add a new helper method to Value to strip in-bounds constant offsets of pointers, but accumulate the offset into an APInt in the process of stripping it. This is a pretty handy thing to have, such as when trying to determine if two pointers are at some constant relative offset. I'll be committing a patch shortly to use it for exactly that purpose. llvm-svn: 189000	2013-08-22 11:25:11 +00:00
NAKAMURA Takumi	26c8ea657f	MemoryBuffer.cpp: Consider if PageSize were not 4096 in shouldUseMmap(). Follow-up to r188903. The AllocationGranularity can be 65536 on Win32, even on Cygwin. llvm-svn: 188998	2013-08-22 10:23:52 +00:00
Tim Northover	421804420d	ARM: use TableGen patterns to select CMOV operations. Back in the mists of time (2008), it seems TableGen couldn't handle the patterns necessary to match ARM's CMOV node that we convert select operations to, so we wrote a lot of fairly hairy C++ to do it for us. TableGen can deal with it now: there were a few minor differences to CodeGen (see tests), but nothing obviously worse that I could see, so we should probably address anything that does come up in a localised manner. llvm-svn: 188995	2013-08-22 09:57:11 +00:00
Tim Northover	2ddeeed096	ARM: respect tied 64-bit inlineasm operands when printing The code for 'Q' and 'R' operand modifiers needs to look through tied operands to discover the register class. llvm-svn: 188990	2013-08-22 06:51:04 +00:00
Michael Gottesman	1adac3582d	[stackprotector] When finding the split point to splice off the end of a parentmbb into a successmbb, include any DBG_VALUE MI. Fix for PR16954. llvm-svn: 188987	2013-08-22 05:40:50 +00:00
Matt Arsenault	f599d97449	Teach LoopVectorize about address space sizes llvm-svn: 188980	2013-08-22 02:42:55 +00:00
Jim Grosbach	6a7a727174	ARM: R9 is not safe to use for tcGPR. Indirect tail-calls shouldn't use R9 for the branch destination, as it's not reliably a call-clobbered register. rdar://14793425 llvm-svn: 188967	2013-08-22 00:14:24 +00:00
Michael Gottesman	0dc00645a2	Fixed typo. llvm-svn: 188957	2013-08-21 22:53:54 +00:00
Michael Gottesman	0900993c3c	Removed trailing whitespace. llvm-svn: 188956	2013-08-21 22:53:29 +00:00
Tom Stellard	1b2c2d8414	SelectionDAG: Make sure stores are always added to the LegalizedNodes list When truncated vector stores were being custom lowered in VectorLegalizer::LegalizeOp(), the old (illegal) and new (legal) node pair was not being added to LegalizedNodes list. Instead of the legalized result being passed to VectorLegalizer::TranslateLegalizeResult(), the result was being passed back into VectorLegalizer::LegalizeOp(), which ended up adding a (new, new) pair to the list instead. This was causing an assertion failure when a custom lowered truncated vector store was the last instruction a basic block and the VectorLegalizer was unable to find it in the LegalizedNodes list when updating the DAG root. llvm-svn: 188953	2013-08-21 22:42:58 +00:00
Tom Stellard	f6d8023ca4	R600: Remove unnecessary casts Spotted by Bill Wendling. llvm-svn: 188942	2013-08-21 22:14:17 +00:00
Yunzhong Gao	05efa23294	No functionality change. Replace "(255 & value)" with "(0xFF & value)" to improve clarity. llvm-svn: 188941	2013-08-21 22:11:15 +00:00
Juergen Ributzka	3db39dc1ae	Teach BaseIndexOffset::match to identify base pointers in loops. The small utility function that pattern matches Base + Index + Offset patterns for loads and stores fails to recognize the base pointer for loads/stores from/into an array at offset 0 inside a loop. As a result DAGCombiner::MergeConsecutiveStores was not able to merge all stores. This commit fixes the issue by adding an additional pattern match and also a test case. Reviewer: Nadav llvm-svn: 188936	2013-08-21 21:53:38 +00:00
Bill Wendling	570d3020e3	Reorder headers according to lint. llvm-svn: 188932	2013-08-21 21:14:19 +00:00
Bill Wendling	0cb8c0b1c2	Remove use of forbidden 'iostream' header. Also obsessively reorder the headers to be in something closer to alphabetical order. llvm-svn: 188928	2013-08-21 20:36:42 +00:00
Matt Arsenault	745101d666	Teach InstCombine about address spaces llvm-svn: 188926	2013-08-21 19:53:10 +00:00
Ahmed Bougacha	4020363746	MC CFG: Remap enough for data too, analoguous to r188873. llvm-svn: 188925	2013-08-21 19:40:28 +00:00
Ahmed Bougacha	47c2a75e8d	Style cleanup following David's review for r188876. llvm-svn: 188924	2013-08-21 19:40:25 +00:00
Matt Arsenault	745832dcc9	Use attribute helper function llvm-svn: 188916	2013-08-21 18:54:50 +00:00
Matt Arsenault	3c71dabd88	Fix typo llvm-svn: 188915	2013-08-21 18:54:47 +00:00
Hao Liu	546bcd2f50	A minor change for an obvous problem caused by r188451: def imm0_63 : Operand<i32>, ImmLeaf<i32, [{ return Imm >= 0 && Imm < 63;}]>{ As it seems Imm <63 should be Imm <= 63. ImmLeaf is used in pattern match, but there is already a function check the shift amount range, so just remove ImmLeaf. Also add a test to check 63. llvm-svn: 188911	2013-08-21 17:47:53 +00:00
NAKAMURA Takumi	3bbbe2e826	Unix/Process.inc: Revert r72332, "Work around a page size issue on Cygwin." Offset in mmap(3) should be aligned to gepagesize(), 64k, or mmap(3) would fail. TODO: Invetigate places where 4096 would be required as pagesize, or 4096 would satisfy. llvm-svn: 188903	2013-08-21 13:47:12 +00:00
Mihai Popa	ae1112bae5	Make "mov" work for all Thumb2 MOV encodings According to the ARM specification, "mov" is a valid mnemonic for all Thumb2 MOV encodings. To achieve this, the patch adds one instruction alias with a special range condition to avoid collision with the Thumb1 MOV. llvm-svn: 188901	2013-08-21 13:14:58 +00:00
Elena Demikhovsky	33d447a2d6	AVX-512: Added SHIFT instructions. llvm-svn: 188899	2013-08-21 09:36:02 +00:00
Richard Sandiford	7d86e47d04	[SystemZ] Define remainig *MUL_LOHI patterns The initial port used MLG(R) for i64 UMUL_LOHI but left the other three combinations as not-legal-or-custom. Although 32x32->{32,32} multiplications exist, they're not as quick as doing a normal 64-bit multiplication, so it didn't seem like i32 SMUL_LOHI and UMUL_LOHI would be useful. There's also no direct instruction for i64 SMUL_LOHI, so it needs to be implemented in terms of UMUL_LOHI. However, not defining these patterns means that we don't convert division by a constant into multiplication, so this patch fills in the other cases. The new i64 SMUL_LOHI sequence is simpler than the one that we used previously for 64x64->128 multiplication, so int-mul-08.ll now tests the full sequence. llvm-svn: 188898	2013-08-21 09:34:56 +00:00
Daniel Sanders	41194e3f9e	[mips][msa] Matheus Almeida pointed out a silly mistake in r188893. Fixed it. I accidentally changed the encoding of the MSA registers to zero instead of 0 to 31. This change restores the encoding the registers had prior to r188893. This didn't show up in the existing tests because direct-object emission isn't implemented yet for MSA. llvm-svn: 188896	2013-08-21 09:09:52 +00:00
Richard Sandiford	af5f66ac9e	[SystemZ] Use FI[EDX]BRA for codegen llvm-svn: 188895	2013-08-21 09:04:20 +00:00
Richard Sandiford	8e92c389e4	[SystemZ] Add FI[EDX]BRA These are extensions of the existing FI[EDX]BR instructions, but use a spare bit to suppress inexact conditions. llvm-svn: 188894	2013-08-21 08:58:08 +00:00
Daniel Sanders	ec12322a28	[mips][msa] Define registers using foreach No functional change llvm-svn: 188893	2013-08-21 08:48:25 +00:00
Ahmed Bougacha	1792647942	MC CFG: Add YAML MCModule representation to enable MC CFG testing. Like yaml ObjectFiles, this will be very useful for testing the MC CFG implementation (mostly MCObjectDisassembler), by matching the output with YAML, and for potential users of the MC CFG, by using it as an input. There isn't much to the actual format, it is just a serialization of the MCModule class. Of note: - Basic block references (pred/succ, ..) are represented by the BB's start address. - Just as in the MC CFG, instructions are MCInsts with a size. - Operands have a prefix representing the type (only register and immediate supported here). - Instruction opcodes are represented by their names; enum values aren't stable, enum names mostly are: usually, a change to a name would need lots of changes in the backend anyway. Same with registers. All in all, an example is better than 1000 words, here goes: A simple binary: Disassembly of section __TEXT,__text: _main: 100000f9c: 48 8b 46 08 movq 8(%rsi), %rax 100000fa0: 0f be 00 movsbl (%rax), %eax 100000fa3: 3b 04 25 48 00 00 00 cmpl 72, %eax 100000faa: 0f 8c 07 00 00 00 jl 7 <.Lend> 100000fb0: 2b 04 25 48 00 00 00 subl 72, %eax .Lend: 100000fb7: c3 ret And the (pretty verbose) generated YAML: --- Atoms: - StartAddress: 0x0000000100000F9C Size: 20 Type: Text Content: - Inst: MOV64rm Size: 4 Ops: [ RRAX, RRSI, I1, R, I8, R ] - Inst: MOVSX32rm8 Size: 3 Ops: [ REAX, RRAX, I1, R, I0, R ] - Inst: CMP32rm Size: 7 Ops: [ REAX, R, I1, R, I72, R ] - Inst: JL_4 Size: 6 Ops: [ I7 ] - StartAddress: 0x0000000100000FB0 Size: 7 Type: Text Content: - Inst: SUB32rm Size: 7 Ops: [ REAX, REAX, R, I1, R, I72, R ] - StartAddress: 0x0000000100000FB7 Size: 1 Type: Text Content: - Inst: RET Size: 1 Ops: [ ] Functions: - Name: __text BasicBlocks: - Address: 0x0000000100000F9C Preds: [ ] Succs: [ 0x0000000100000FB7, 0x0000000100000FB0 ] <snip> ... llvm-svn: 188890	2013-08-21 07:29:02 +00:00
Ahmed Bougacha	69a7562335	MC CFG: Support disassembly at arbitrary addresses in MCObjectDisassembler. llvm-svn: 188889	2013-08-21 07:28:55 +00:00
Ahmed Bougacha	518cc6f811	MC CFG: Use data structures more appropriate than std::set. llvm-svn: 188888	2013-08-21 07:28:51 +00:00
Ahmed Bougacha	58ed11341b	MC CFG: Add an MCObjectSymbolizer in the MCObjectDisassembler. Used to detect calls to function symbol stubs (future commit). llvm-svn: 188887	2013-08-21 07:28:48 +00:00
Ahmed Bougacha	b09d140f6b	MC CFG: Add MCObjectDisassembler Mach-O implementation. Supports: - entrypoint, using LC_MAIN. - static ctors/dtors, using __mod_{init,exit}_func - translation between effective and object load address, using dyld's VM address slide. llvm-svn: 188886	2013-08-21 07:28:44 +00:00
Ahmed Bougacha	2eb593682a	MC CFG: Add "dynamic disassembly" support to MCObjectDisassembler. It can now disassemble code in situations where the effective load address is different than the load address declared in the object file. This happens for PIC, hence "dynamic". llvm-svn: 188884	2013-08-21 07:28:37 +00:00
Ahmed Bougacha	57bc9677cd	MC CFG: When disassembly is impossible, fallback to data bytes. This is the behavior of sequential disassemblers (llvm-objdump, ...), when there is no instruction size hint (fixed-length, ...) While there, also do some minor cleanup. llvm-svn: 188883	2013-08-21 07:28:32 +00:00
Ahmed Bougacha	a376353346	MC CFG: Add MCObjectDisassembler support for entrypoint + static ctors. For now, this isn't implemented for any format. llvm-svn: 188882	2013-08-21 07:28:29 +00:00
Ahmed Bougacha	ff12d02d51	MC CFG: Split MCBasicBlocks to mirror atom splitting. When an MCTextAtom is split, all MCBasicBlocks backed by it are automatically split, with a fallthrough between both blocks, and the successors moved to the second block. llvm-svn: 188881	2013-08-21 07:28:24 +00:00
Ahmed Bougacha	d3fc5b9648	MC CFG: Add a few needed methods, mainly MCModule::findFirstAtomAfter. While there, do some minor cleanup. llvm-svn: 188880	2013-08-21 07:28:17 +00:00
Ahmed Bougacha	ffeecb5c80	MC: ObjectSymbolizer can now recognize external function stubs. Only implemented in the Mach-O ObjectSymbolizer. The testcase sadly introduces a new binary. llvm-svn: 188879	2013-08-21 07:28:13 +00:00
Ahmed Bougacha	382a6d7562	MC: Refactor ObjectSymbolizer to make relocation/section info generation lazy. llvm-svn: 188878	2013-08-21 07:28:07 +00:00
Ahmed Bougacha	3012ac5387	MC CFG: Add more MCFunction container methods (find, empty). llvm-svn: 188876	2013-08-21 07:27:59 +00:00
Ahmed Bougacha	7bfc7da6e8	MC CFG: Keep pointer to parent MCModule in created MCFunctions. Also, drive-by cleaning around createFunction. llvm-svn: 188875	2013-08-21 07:27:55 +00:00
Ahmed Bougacha	d6351e76d5	MC CFG: Don't insert preds/succs again. llvm-svn: 188874	2013-08-21 07:27:50 +00:00
Ahmed Bougacha	c43aa4e88c	MC CFG: Remap enough for the inserted instruction. llvm-svn: 188873	2013-08-21 07:27:47 +00:00
David Majnemer	ed89b5c6e7	DebugInfo: Do not use the DWARF Version for the .debug_pubnames or .debug_pubtypes version field Summary: LLVM would generate DWARF with version 3 in the .debug_pubname and .debug_pubtypes version fields. This would lead SGI dwarfdump to fail parsing the DWARF with (in the instance of .debug_pubnames) would exit with: dwarfdump ERROR: dwarf_get_globals: DW_DLE_PUBNAMES_VERSION_ERROR (123) This fixes PR16950. Reviewers: echristo, dblaikie Reviewed By: echristo CC: cfe-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1454 llvm-svn: 188869	2013-08-21 06:13:34 +00:00
Craig Topper	77df9cdd0b	Synchronize VEX JIT encoding code with the MCJIT version. Fix a bug in the MCJIT code where CurOp was being incremented even if the operand it was pointing at wasn't used. Maybe only matters if there are any EVEX_K instructions that aren't VEX_4V. llvm-svn: 188868	2013-08-21 05:57:45 +00:00
Nadav Rotem	7efc04cb40	In LLVM FMA3 operands are dst, src1, src2, src3, however dst is not encoded as it is always src1. This was causing the encoding of the operands to be off by one. Patch by Chris Bieneman. llvm-svn: 188866	2013-08-21 05:03:10 +00:00
Craig Topper	5c94bb8551	Rename mattr names for AVX-512 to from avx-512 -> avx512f, avx-512-pfi -> av512pf, avx-512-cdi -> avx512cd, avx-512-eri->avx512er. This matches better with official docs and what gcc patches appearto be using. I didn't touch the has* functions or the feature flag names to avoid change the td and lowering file while commits are still happening. llvm-svn: 188859	2013-08-21 03:57:57 +00:00
NAKAMURA Takumi	de8880a23d	X86TargetMachine.cpp: Clarify to emit GOT in i686-{cygming\|win32}-elf for mcjit. I suppose all "lli -use-mcjit i686-*" should require GOT, (and to fail.) llvm-svn: 188856	2013-08-21 02:37:25 +00:00
Jakub Staszak	84a0ae74b0	Move #includes from .h to .cpp file. llvm-svn: 188852	2013-08-21 01:20:11 +00:00
Akira Hatanaka	39f915b58a	[micromips] Print instruction alias "not" if the last operand of a nor is zero. llvm-svn: 188851	2013-08-21 01:18:46 +00:00
Bill Wendling	707f601fa5	Move registering the execution of a basic block to the beginning rather than the end. There are situations which can affect the correctness (or at least expectation) of the gcov output. For instance, if a call to __gcov_flush() occurs within a block before the execution count is registered and then the program aborts in some way, then that block will not be marked as executed. This is not normally what the user expects. If we move the code that's registering when a block is executed to the beginning, we can catch these types of situations. PR16893 llvm-svn: 188849	2013-08-20 23:52:00 +00:00
Akira Hatanaka	9a1fb6b9fc	[mips] Add support for mfhc1 and mthc1. llvm-svn: 188848	2013-08-20 23:47:25 +00:00
Akira Hatanaka	bfb6624797	[mips] Add support for calling convention CC_MipsO32_FP64, which is used when the size of floating point registers is 64-bit. Test case will be added when support for mfhc1 and mthc1 is added. llvm-svn: 188847	2013-08-20 23:38:40 +00:00
Akira Hatanaka	8dd951bc9f	[mips] Remove predicates that were incorrectly or unnecessarily added. llvm-svn: 188845	2013-08-20 23:21:55 +00:00
Jakub Staszak	d184e2decc	Add some constantness. llvm-svn: 188844	2013-08-20 23:04:15 +00:00
Akira Hatanaka	14e31a2fe7	[mips] Define register class FGRH32 for the high half of the 64-bit floating point registers. We will need this register class later when we add definitions for instructions mfhc1 and mthc1. Also, remove sub-register indices sub_fpeven and sub_fpodd and use sub_lo and sub_hi instead. llvm-svn: 188842	2013-08-20 22:58:56 +00:00
Arnold Schwaighofer	e1f3ab69d1	SLPVectorizer: Fix invalid iterator errors Update iterator when the SLP vectorizer changes the instructions in the basic block by restarting the traversal of the basic block. Patch by Yi Jiang! Fixes PR 16899. llvm-svn: 188832	2013-08-20 21:21:45 +00:00
Matt Arsenault	7a960a8455	Teach ConstantFolding about pointer address spaces llvm-svn: 188831	2013-08-20 21:20:04 +00:00
Akira Hatanaka	6781fc1648	[mips] Resolve register classes dynamically using ptr_rc to reduce the number of load/store instructions defined. Previously, we were defining load/store instructions for each pointer size (32 and 64-bit), but now we need just one definition. llvm-svn: 188830	2013-08-20 21:08:22 +00:00
Reed Kotler	d8f3362557	Add an option which permits the user to specify using a bitmask, that various functions be compiled as mips32, without having to add attributes. This is useful in certain situations where you don't want to have to edit the function attributes in the source. For now it's only an option used for the compiler developers when debugging the mips16 port. llvm-svn: 188826	2013-08-20 20:53:09 +00:00
Akira Hatanaka	a43b56d9af	[mips] Guard micromips instructions with predicate InMicroMips. Also, fix assembler predicate HasStdEnd so that it is false when the target is micromips. llvm-svn: 188824	2013-08-20 20:46:51 +00:00
Jim Grosbach	71a78f962b	ARM: Fix fast-isel copy/paste-o. Update testcase to be more careful about checking register values. While regexes are general goodness for these sorts of testcases, in this example, the registers are constrained by the calling convention, so we can and should check their explicit values. rdar://14779513 llvm-svn: 188819	2013-08-20 19:12:42 +00:00
Vladimir Medic	9bad0d33b6	Fix style issues in AsmParser.cpp llvm-svn: 188798	2013-08-20 13:33:18 +00:00
Elena Demikhovsky	540d582594	AVX-512: Added more patterns for VMOVSS, VMOVSD, VMOVD, VMOVQ llvm-svn: 188786	2013-08-20 11:00:29 +00:00
Daniel Sanders	4260527f5f	[mips][msa] Removed fcge, fcgt, fsge, fsgt These instructions were present in a draft spec but were removed before publication. llvm-svn: 188782	2013-08-20 09:41:47 +00:00
Richard Sandiford	2bf7b8cc4e	[SystemZ] Update README We now use MVST, CLST and SRST for the obvious cases. llvm-svn: 188781	2013-08-20 09:40:35 +00:00
Richard Sandiford	6f6d55161b	[SystemZ] Use SRST to optimize memchr SystemZTargetLowering::emitStringWrapper() previously loaded the character into R0 before the loop and made R0 live on entry. I'd forgotten that allocatable registers weren't allowed to be live across blocks at this stage, and it confused LiveVariables enough to cause a miscompilation of f3 in memchr-02.ll. This patch instead loads R0 in the loop and leaves LICM to hoist it after RA. This is actually what I'd tried originally, but I went for the manual optimisation after noticing that R0 often wasn't being hoisted. This bug forced me to go back and look at why, now fixed as r188774. We should also try to optimize null checks so that they test the CC result of the SRST directly. The select between null and the SRST GPR result could then usually be deleted as dead. llvm-svn: 188779	2013-08-20 09:38:48 +00:00
Benjamin Kramer	5a71250113	memcmp is not a valid way to compare structs with padding in them. llvm-svn: 188778	2013-08-20 09:27:31 +00:00
Daniel Sanders	f2a0f1d133	[mips][msa] Added insve llvm-svn: 188777	2013-08-20 09:22:54 +00:00
Richard Sandiford	96aa93d5f1	Fix overly pessimistic shortcut in post-RA MachineLICM Post-RA LICM keeps three sets of registers: PhysRegDefs, PhysRegClobbers and TermRegs. When it sees a definition of R it adds all aliases of R to the corresponding set, so that when it needs to test for membership it only needs to test a single register, rather than worrying about aliases there too. E.g. the final candidate loop just has: unsigned Def = Candidates[i].Def; if (!PhysRegClobbers.test(Def) && ...) { to test whether register Def is multiply defined. However, there was also a shortcut in ProcessMI to make sure we didn't add candidates if we already knew that they would fail the final test. This shortcut was more pessimistic than the final one because it checked whether _any alias_ of the defined register was multiply defined. This is too conservative for targets that define register pairs. E.g. on z, R0 and R1 are sometimes used as a pair, so there is a 128-bit register that aliases both R0 and R1. If a loop used R0 and R1 independently, and the definition of R0 came first, we would be able to hoist the R0 assignment (because that used the final test quoted above) but not the R1 assignment (because that meant we had two definitions of the paired R0/R1 register and would fail the shortcut in ProcessMI). This patch just uses the same check for the ProcessMI shortcut as we use in the final candidate loop. llvm-svn: 188774	2013-08-20 09:11:13 +00:00
Tim Northover	f79c3a5aef	ARM: implement some simple f64 materializations. Previously we used a const-pool load for virtually all 64-bit floating values. Actually, we can get quite a few common values (including 0.0, 1.0) via "vmov" instructions of one stripe or another. llvm-svn: 188773	2013-08-20 08:57:11 +00:00
Michael Gottesman	dc985ef0af	[stackprotector] Small cleanup. llvm-svn: 188772	2013-08-20 08:56:28 +00:00
Michael Gottesman	76c44be14a	[stackprotector] Small Bit of computation hoisting. llvm-svn: 188771	2013-08-20 08:56:26 +00:00
Michael Gottesman	1977d15e02	[stackprotector] Added significantly longer comment to FindPotentialTailCall to make clear its relationship to llvm::isInTailCallPosition. llvm-svn: 188770	2013-08-20 08:56:23 +00:00
Michael Gottesman	62c5d714a1	Removed trailing whitespace. llvm-svn: 188769	2013-08-20 08:46:16 +00:00
Michael Gottesman	56e246b1a1	[stackprotector] Removed stale TODO. llvm-svn: 188768	2013-08-20 08:46:13 +00:00
Daniel Sanders	869bdad93a	[mips][msa] Added and.v, bmnz.v, bmz.v, bsel.v, nor.v, or.v, xor.v llvm-svn: 188767	2013-08-20 08:38:21 +00:00
Michael Gottesman	5e57068b7a	[stackprotector] Added support for emitting the llvm intrinsic stack protector check. rdar://13935163 llvm-svn: 188766	2013-08-20 08:36:53 +00:00
Michael Gottesman	ce0e4c263b	[stackprotector] Refactor out the end of isInTailCallPosition into the function returnTypeIsEligibleForTailCall. This allows me to use returnTypeIsEligibleForTailCall in the stack protector pass. rdar://13935163 llvm-svn: 188765	2013-08-20 08:36:50 +00:00
Michael Gottesman	f7e1203d95	Remove unused variables that crept in. llvm-svn: 188761	2013-08-20 07:17:27 +00:00
Michael Gottesman	b27f0f1f6b	Teach selectiondag how to handle the stackprotectorcheck intrinsic. Previously, generation of stack protectors was done exclusively in the pre-SelectionDAG Codegen LLVM IR Pass "Stack Protector". This necessitated splitting basic blocks at the IR level to create the success/failure basic blocks in the tail of the basic block in question. As a result of this, calls that would have qualified for the sibling call optimization were no longer eligible for optimization since said calls were no longer right in the "tail position" (i.e. the immediate predecessor of a ReturnInst instruction). Then it was noticed that since the sibling call optimization causes the callee to reuse the caller's stack, if we could delay the generation of the stack protector check until later in CodeGen after the sibling call decision was made, we get both the tail call optimization and the stack protector check! A few goals in solving this problem were: 1. Preserve the architecture independence of stack protector generation. 2. Preserve the normal IR level stack protector check for platforms like OpenBSD for which we support platform specific stack protector generation. The main problem that guided the present solution is that one can not solve this problem in an architecture independent manner at the IR level only. This is because: 1. The decision on whether or not to perform a sibling call on certain platforms (for instance i386) requires lower level information related to available registers that can not be known at the IR level. 2. Even if the previous point were not true, the decision on whether to perform a tail call is done in LowerCallTo in SelectionDAG which occurs after the Stack Protector Pass. As a result, one would need to put the relevant callinst into the stack protector check success basic block (where the return inst is placed) and then move it back later at SelectionDAG/MI time before the stack protector check if the tail call optimization failed. The MI level option was nixed immediately since it would require platform specific pattern matching. The SelectionDAG level option was nixed because SelectionDAG only processes one IR level basic block at a time implying one could not create a DAG Combine to move the callinst. To get around this problem a few things were realized: 1. While one can not handle multiple IR level basic blocks at the SelectionDAG Level, one can generate multiple machine basic blocks for one IR level basic block. This is how we handle bit tests and switches. 2. At the MI level, tail calls are represented via a special return MIInst called "tcreturn". Thus if we know the basic block in which we wish to insert the stack protector check, we get the correct behavior by always inserting the stack protector check right before the return statement. This is a "magical transformation" since no matter where the stack protector check intrinsic is, we always insert the stack protector check code at the end of the BB. Given the aforementioned constraints, the following solution was devised: 1. On platforms that do not support SelectionDAG stack protector check generation, allow for the normal IR level stack protector check generation to continue. 2. On platforms that do support SelectionDAG stack protector check generation: a. Use the IR level stack protector pass to decide if a stack protector is required/which BB we insert the stack protector check in by reusing the logic already therein. If we wish to generate a stack protector check in a basic block, we place a special IR intrinsic called llvm.stackprotectorcheck right before the BB's returninst or if there is a callinst that could potentially be sibling call optimized, before the call inst. b. Then when a BB with said intrinsic is processed, we codegen the BB normally via SelectBasicBlock. In said process, when we visit the stack protector check, we do not actually emit anything into the BB. Instead, we just initialize the stack protector descriptor class (which involves stashing information/creating the success mbbb and the failure mbb if we have not created one for this function yet) and export the guard variable that we are going to compare. c. After we finish selecting the basic block, in FinishBasicBlock if the StackProtectorDescriptor attached to the SelectionDAGBuilder is initialized, we first find a splice point in the parent basic block before the terminator and then splice the terminator of said basic block into the success basic block. Then we code-gen a new tail for the parent basic block consisting of the two loads, the comparison, and finally two branches to the success/failure basic blocks. We conclude by code-gening the failure basic block if we have not code-gened it already (all stack protector checks we generate in the same function, use the same failure basic block). llvm-svn: 188755	2013-08-20 07:00:16 +00:00
Craig Topper	7a8cf01090	Fix formatting. No functional change. llvm-svn: 188746	2013-08-20 05:23:59 +00:00
Craig Topper	e13a066c94	Add AVX-512 and related features to the CPUID detection code. llvm-svn: 188745	2013-08-20 05:22:42 +00:00
Craig Topper	fd2b389263	Move AVX and non-AVX replication inside a couple multiclasses to avoid repeating each instruction for both individually. llvm-svn: 188743	2013-08-20 04:24:14 +00:00
Craig Topper	998a39aeed	Add an error check for a typo I accidentally made in a td file that caused an assert to fire. llvm-svn: 188742	2013-08-20 04:22:09 +00:00
Bill Schmidt	f381afc906	[PowerPC] More refactoring prior to real PPC emitPrologue/Epilogue changes. (Patch committed on behalf of Mark Minich, whose log entry follows.) This is a continuation of the refactorings performed in svn rev 188573 (see that rev's comments for more detail). This is my stage 2 refactoring: I combined the emitPrologue() & emitEpilogue() PPC32 & PPC64 code into a single flow, simplifying a lot of the code since in essence the PPC32 & PPC64 code generation logic is the same, only the instruction forms are different (in most cases). This simplification is necessary because my functional changes (yet to come) add significant complexity, and without the simplification of my stage 2 refactoring, the overall complexity of both emitPrologue() & emitEpilogue() would have become almost intractable for most mortal programmers (like me). This submission was intended to be a pure refactoring (no functional changes whatsoever). However, in the process of combining the PPC32 & PPC64 flows, I spotted a difference that I believe is a bug (see svn rev 186478 line 863, or svn rev 188573 line 888): This line appears to be restoring the BP with the original FP content, not the original BP content. When I merged the 32-bit and 64-bit code, I used the corresponding code from the 64-bit flow, which I believe uses the correct offset (BPOffset) for this operation. llvm-svn: 188741	2013-08-20 03:12:23 +00:00
Venkatraman Govindaraju	f625773bca	[Sparc] Use HWEncoding instead of unused Num field in Sparc register definitions. Also, correct the definitions of RETL and RET instructions. llvm-svn: 188738	2013-08-20 01:26:14 +00:00
Hal Finkel	0c5c01aa4a	Add a llvm.copysign intrinsic This adds a llvm.copysign intrinsic; We already have Libfunc recognition for copysign (which is turned into the FCOPYSIGN SDAG node). In order to autovectorize calls to copysign in the loop vectorizer, we need a corresponding intrinsic as well. In addition to the expected changes to the language reference, the loop vectorizer, BasicTTI, and the SDAG builder (the intrinsic is transformed into an FCOPYSIGN node, just like the function call), this also adds FCOPYSIGN to a few lists in LegalizeVector{Ops,Types} so that vector copysigns can be expanded. In TargetLoweringBase::initActions, I've made the default action for FCOPYSIGN be Expand for vector types. This seems correct for all in-tree targets, and I think is the right thing to do because, previously, there was no way to generate vector-values FCOPYSIGN nodes (and most targets don't specify an action for vector-typed FCOPYSIGN). llvm-svn: 188728	2013-08-19 23:35:46 +00:00
Hal Finkel	1cf48ab811	Don't form PPC CTR-based loops around a copysignl call copysign/copysignf never become function calls (because the SDAG expansion code does not lower to the corresponding function call, but rather directly implements the associated logic), but copysignl almost always is lowered into a call to the requested libm functon (and, thus, might clobber CTR). llvm-svn: 188727	2013-08-19 23:35:24 +00:00
Andrew Kaylor	4612fed911	Adding PIC support for ELF on x86_64 platforms llvm-svn: 188726	2013-08-19 23:27:43 +00:00
Peter Collingbourne	f708c87078	Introduce non-const overloads for GlobalAlias::{get,resolve}AliasedGlobal. llvm-svn: 188725	2013-08-19 23:13:33 +00:00
Jakub Staszak	b4eb6adebb	Use pop_back_val() instead of both back() and pop_back(). llvm-svn: 188723	2013-08-19 22:47:55 +00:00
Matt Arsenault	d79f7d9ea1	Teach InstCombine visitGetElementPtr about address spaces llvm-svn: 188721	2013-08-19 22:17:40 +00:00
Matt Arsenault	98f34e3abe	Cleanup visitGetElementPtr to make address space change easier llvm-svn: 188720	2013-08-19 22:17:34 +00:00
Matt Arsenault	94a028aa43	commonPointerCast cleanups to make address space change easier llvm-svn: 188719	2013-08-19 22:17:18 +00:00
Matt Arsenault	74742a1bb0	Fix assert with GEP ptr vector indexing structs Also fix it calculating the wrong value. The struct index is not a ConstantInt, so it was being interpreted as an array index. llvm-svn: 188713	2013-08-19 21:43:16 +00:00
Eric Christopher	574b5c8885	Use less verbose code and update comments. llvm-svn: 188711	2013-08-19 21:41:38 +00:00
Matt Arsenault	5aeae18e9d	Revert non-test parts of r188507 Re-add the inboundsless tests I didn't add originally llvm-svn: 188710	2013-08-19 21:40:31 +00:00
Eric Christopher	7da24888dd	Turn on pubnames by default on linux. Until gdb supports the new accelerator tables we should add the pubnames section so that gdb_index can be generated from gold at link time. On darwin we already emit the accelerator tables and so don't need to worry about pubnames. llvm-svn: 188708	2013-08-19 21:07:38 +00:00
Paul Redmond	62f840f46a	Improve the widening of integral binary vector operations - split WidenVecRes_Binary into WidenVecRes_Binary and WidenVecRes_BinaryCanTrap - WidenVecRes_BinaryCanTrap preserves the original behaviour for operations that can trap - WidenVecRes_Binary simply widens the operation and improves codegen for 3-element vectors by allowing widening and promotion on x86 (matches the behaviour of unary and ternary operation widening) - use WidenVecRes_Binary for operations on integers. Reviewed by: nrotem llvm-svn: 188699	2013-08-19 20:01:35 +00:00
Andrew Kaylor	5f3a9989a6	Adding comments to document RuntimeDyld relocation handling llvm-svn: 188697	2013-08-19 19:38:06 +00:00
Akira Hatanaka	ff7beb1754	[mips] Fix instruction definitions that were incorrectly marked as code-gen-only. llvm-svn: 188690	2013-08-19 19:08:03 +00:00
Peter Collingbourne	aac65a313d	Introduce SpecialCaseList::isIn overload for GlobalAliases. Differential Revision: http://llvm-reviews.chandlerc.com/D1437 llvm-svn: 188688	2013-08-19 19:00:35 +00:00
Mihai Popa	4a9df8a768	Thumb2 add immediate alias for SP The Thumb2 add immediate is in fact defined for SP. The manual is misleading as it points to a different section for add immediate with SP, however the encoding is the same as for add immediate with register only with the SP operand hard coded. As such add immediate with SP and add immediate with register can safely be treated as the same instruction. All the patch does is adjust a register constraint on an instruction alias. llvm-svn: 188676	2013-08-19 15:02:25 +00:00
Elena Demikhovsky	1490c5eb5b	AVX-512: added arithmetic and logical operations. ADD, SUB, MUL integer and FP types. OR, AND, XOR. Added embeded broadcast form for these instructions. llvm-svn: 188673	2013-08-19 13:26:14 +00:00
Richard Sandiford	784a580312	[SystemZ] Add negative integer absolute (load negative) For now this matches the equivalent of (neg (abs ...)), which did hit a few times in projects/test-suite. We should probably also match cases where absolute-like selects are used with reversed arguments. llvm-svn: 188671	2013-08-19 12:56:58 +00:00
Richard Sandiford	4b89705490	[SystemZ] Add integer absolute (load positive) llvm-svn: 188670	2013-08-19 12:48:54 +00:00
Richard Sandiford	709bda66b9	[SystemZ] Add support for sibling calls This first cut is pretty conservative. The final argument register (R6) is call-saved, so we would need to make sure that the R6 argument to a sibling call is the same as the R6 argument to the calling function, which seems worth keeping as a separate patch. Saying that integer truncations are free means that we no longer use the extending instructions LGF and LLGF for spills in int-conv-09.ll and int-conv-10.ll. Instead we treat the registers as 64 bits wide and truncate them to 32-bits where necessary. I think it's unlikely we'd use LGF and LLGF for spills in other situations for the same reason, so I'm removing the tests rather than replacing them. The associated code is generic and applies to many more instructions than just LGF and LLGF, so there is no corresponding code removal. llvm-svn: 188669	2013-08-19 12:42:31 +00:00
Michael Kuperstein	4bb3f8f2e4	Adds missing TLI check for library simplification of * pow(x, 0.5) -> fabs(sqrt(x)) * pow(2.0, x) -> exp2(x) llvm-svn: 188656	2013-08-19 06:55:47 +00:00
Hal Finkel	e4eb78188c	Add ExpandFloatOp_FCOPYSIGN to handle ppcf128-related expansions We had previously been asserting when faced with a FCOPYSIGN f64, ppcf128 node because there was no way to expand the FCOPYSIGN node. Because ppcf128 is the sum of two doubles, and the first double must have the larger magnitude, we can take the sign from the first double. As a result, in addition to fixing the crash, this is also an optimization. llvm-svn: 188655	2013-08-19 06:55:37 +00:00
Hal Finkel	dbc78e1f73	Add the PPC fcpsgn instruction Modern PPC cores support a floating-point copysign instruction, and we can use this to lower the FCOPYSIGN node (which is created from calls to the libm copysign function). A couple of extra patterns are necessary because the operand types of FCOPYSIGN need not agree. llvm-svn: 188653	2013-08-19 05:01:02 +00:00
David Blaikie	175b0b9a3b	llvm-dwarfdump: Do not include address offsets for attributes, only for tags This reduces the noise in diffs making it more likely that, at least for LLVM revision-over-revision, diffs will actually yield usable results. This is consistent with objdump's DWARF dumping behavior. llvm-svn: 188650	2013-08-19 03:36:23 +00:00
David Blaikie	715528be0b	DebugInfo: don't emit zero-length names for parameters We check this in many/all other cases, just missed this one it seems. Perhaps it'd be worth unifying this so we never emit zero-length DW_AT_names. llvm-svn: 188649	2013-08-19 03:34:03 +00:00
Peter Collingbourne	03c3324ccd	Remove SpecialCaseList::findCategory. It turned out that I didn't need this for DFSan. llvm-svn: 188646	2013-08-19 00:24:20 +00:00
Tim Northover	55349a29c6	ARM: make sure we keep inline asm operands tied. When patching inlineasm nodes to use GPRPair for 64-bit values, we were dropping the information that two operands were tied, which effectively broke the live-interval of vregs affected. llvm-svn: 188643	2013-08-18 18:06:03 +00:00
Elena Demikhovsky	3ce8dbbac2	AVX-512: Added VMOVD, VMOVQ, VMOVSS, VMOVSD instructions. llvm-svn: 188637	2013-08-18 13:08:57 +00:00
Craig Topper	e6861c9ce5	Make more of the lowering helpers static. Also use MVT instead of EVT in a couple places. llvm-svn: 188629	2013-08-18 08:53:01 +00:00
Dmitri Gribenko	8b2a3d1fea	Remove unused stdio.h includes llvm-svn: 188626	2013-08-18 08:29:51 +00:00
Chandler Carruth	67ff8b7185	Go through the really awkward dance required to delete the memory allocated by setupterm. Without this, some folks are seeing leaked memory whenever this routine is called more than once. Thanks to Craig Topper for the report. llvm-svn: 188615	2013-08-18 01:20:32 +00:00
Hal Finkel	3f5279cc26	Fix SCEVExpander creating distinct duplicate PHI entries This fixes SCEVExpander so that it does not create multiple distinct induction variables for duplicate PHI entries. Specifically, given some code like this: do.body6: ; preds = %do.body6, %do.body6, %if.then5 %end.0 = phi i8* [ undef, %if.then5 ], [ %incdec.ptr, %do.body6 ], [ %incdec.ptr, %do.body6 ] ... Note that it is legal to have multiple entries for a basic block so long as the associated value is the same. So the above input is okay, but expanding an AddRec in this loop could produce code like this: do.body6: ; preds = %do.body6, %do.body6, %if.then5 %indvar = phi i64 [ %indvar.next, %do.body6 ], [ %indvar.next1, %do.body6 ], [ 0, %if.then5 ] %end.0 = phi i8* [ undef, %if.then5 ], [ %incdec.ptr, %do.body6 ], [ %incdec.ptr, %do.body6 ] ... %indvar.next = add i64 %indvar, 1 %indvar.next1 = add i64 %indvar, 1 And this is not legal because there are two PHI entries for %do.body6 each with a distinct value. Unfortunately, I don't have an in-tree test case. llvm-svn: 188614	2013-08-18 00:16:23 +00:00
Joerg Sonnenberger	8e3050db51	PR 16899: Do not modify the basic block using the iterator, but keep the next value. This avoids crashes due to invalidation. Patch by Joey Gouly. llvm-svn: 188605	2013-08-17 11:04:47 +00:00
Tom Stellard	59ed08b238	R600: Fix possible use of an uninitialized variable Spotted by Nick Lewycky! llvm-svn: 188599	2013-08-17 00:06:51 +00:00
Tom Stellard	b249b75726	R600: Expand vector FRINT ops Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 188598	2013-08-16 23:51:33 +00:00
Tom Stellard	ad3aff246c	R600: Expand vector FFLOOR ops Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 188597	2013-08-16 23:51:29 +00:00
Tom Stellard	a92ff87929	R600: Expand vector float operations for both SI and R600 Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 188596	2013-08-16 23:51:24 +00:00
Jim Grosbach	d786679049	ARM: Properly constrain comparison fastisel register classes. Ongoing 'make the verifier happy' improvements to ARM fast-isel. rdar://12594152 llvm-svn: 188595	2013-08-16 23:37:40 +00:00
Jim Grosbach	3fa749102a	ARM: Fast-isel register class constrain for extends. Properly constrain the operand register class for instructions used in [sz]ext expansion. Update more tests to use the verifier now that we're getting the register classes correct. rdar://12594152 llvm-svn: 188594	2013-08-16 23:37:36 +00:00
Jim Grosbach	06c2a68125	ARM: Fix more fast-isel verifier failures. Teach the generic instruction selection helper functions to constrain the register classes of their input operands. For non-physical register references, the generic code needs to be careful not to mess that up when replacing references to result registers. As the comment indicates for MachineRegisterInfo::replaceRegWith(), it's important to call constrainRegClass() first. rdar://12594152 llvm-svn: 188593	2013-08-16 23:37:31 +00:00
Jim Grosbach	d69f3ed947	ARM: Clean up fast-isel machine verifier errors. Lots of machine verifier errors result from using a plain GPR regclass for incoming argument copies. A more restrictive rGPR class is more appropriate since it more accurately represents what's happening, plus it lines up better with isel later on so the verifier is happier. Reduces the number of ARM fast-isel tests not running with the verifier enabled by over half. rdar://12594152 llvm-svn: 188592	2013-08-16 23:37:23 +00:00
Reed Kotler	0eae85fb1f	Fix a subtle difference between running clang vs llc for mips16. This regards how mips16 is viewed. It's not really a target type but there has always been a target for it in the td files. It's more properly -mcpu=mips32 -mattr=+mips16 . This is how clang treats it but we have always had the -mcpu=mips16 which I probably should delete now but it will require updating all the .ll test cases for mips16. In this case it changed how we decide if we have a count bits instruction and whether instruction lowering should then expand ctlz. Now that we have dual mode compilation, -mattr=+mips16 really just indicates the inital processor mode that we are compiling for. (It is also possible to have -mcpu=64 -mattr=+mips16 but as far as I know, nobody has even built such a processor, though there is an architecture manual for this). llvm-svn: 188586	2013-08-16 23:05:18 +00:00
Reid Kleckner	bf4f9ebb9f	Actually, use GNU inline asm for cpuid with clang Clang doesn't support the MSVC __cpuid intrinsic yet, and fixing that is blocked on some fairly complicated issues. llvm-svn: 188584	2013-08-16 22:42:42 +00:00
David Blaikie	d4e106e39d	DebugInfo: Allow the addition of other (such as static data) members to a record type after construction Plus a type cleanup & minor fix to enumerate members of declarations. llvm-svn: 188577	2013-08-16 20:42:14 +00:00
Bill Schmidt	8893a3d159	[PowerPC] Preparatory refactoring for making prologue and epilogue safe on PPC32 SVR4 ABI [Patch and following text by Mark Minich; committing on his behalf.] There are FIXME's in PowerPC/PPCFrameLowering.cpp, method PPCFrameLowering::emitPrologue() related to "negative offsets of R1" on PPC32 SVR4. They're true, but the real issue is that on PPC32 SVR4 (and any ABI without a Red Zone), no spills may be made until after the stackframe is claimed, which also includes the LR spill which is at a positive offset. The same problem exists in emitEpilogue(), though there's no FIXME for it. I intend to fix this issue, making LLVM-compiled code finally safe for use on SVR4/EABI/e500 32-bit platforms (including in particular, OS-free embedded systems & kernel code, where interrupts may share the same stack as user code). In preparation for making these changes, to make the diffs for the functional changes less cluttered, I am providing the non-functional refactorings in two stages: Stage 1 does some minor fluffy refactorings to pull multiple method calls up into a single bool, creating named bools for repeated uses of obscure logic, moving some code up earlier because either stage 2 or my final version will require it earlier, and rewording/adding some comments. My stage 1 changes can be characterized as primarily fluffy cleanup, the purpose of which may be unclear until the stage 2 or final changes are made. My stage 2 refactorings combine the separate PPC32 & PPC64 logic, which is currently performed by largely duplicate code, into a single flow, with the differences handled by a group of constants initialized early in the methods. This submission is for my stage 1 changes. There should be no functional changes whatsoever; this is a pure refactoring. llvm-svn: 188573	2013-08-16 20:05:04 +00:00
Richard Mitton	ad6d349fbc	Fixed RuntimeDyldELF absolute relocations. If an ELF relocation is pointed at an absolute address, it will have a symbol ID of zero. RuntimeDyldELF::processRelocationRef was not previously handling this case, and was instead trying to handle it as a section-relative fixup. I think this is the right fix here, but my elf-fu is poor on some of the more exotic platforms, so I'd appreciate it if anyone with greater knowledge could verify this. llvm-svn: 188572	2013-08-16 18:54:26 +00:00
Aaron Ballman	b16cf535dd	Switching to using a helper function instead of manually converting the string to UTF-8. llvm-svn: 188566	2013-08-16 17:53:28 +00:00
Aaron Ballman	d9fd87bdf9	Removing unused functionality. llvm-svn: 188565	2013-08-16 17:33:57 +00:00
Jim Grosbach	d0de8ace8a	InstCombine: Use isAllOnesValue() instead of explicit -1. llvm-svn: 188563	2013-08-16 17:03:36 +00:00
Michel Danzer	8522270d7e	R600/SI: Add pattern for xor of i1 Fixes two recent piglit regressions with radeonsi. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 188559	2013-08-16 16:19:31 +00:00
Michel Danzer	20680b1cc5	R600/SI: Fix broken encoding of DS_WRITE_B32 The logic in SIInsertWaits::getHwCounts() only really made sense for SMRD instructions, and trying to shoehorn it into handling DS_WRITE_B32 caused it to corrupt the encoding of that by clobbering the first operand with the second one. Undo that damage and only apply the SMRD logic to that. Fixes some derivates related piglit regressions with radeonsi. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 188558	2013-08-16 16:19:24 +00:00
Daniel Sanders	6b32f892f2	Reverted test commit (r188556) llvm-svn: 188557	2013-08-16 15:27:12 +00:00
Daniel Sanders	7a2c9bc894	Test commit. Just a blank line llvm-svn: 188556	2013-08-16 15:26:36 +00:00
Benjamin Kramer	a8eecee121	R600: Allocate memoperand in the MachienFunction so it doesn't leak. llvm-svn: 188555	2013-08-16 14:48:09 +00:00
Aaron Ballman	dcd57573d4	Updating function comments; no functional changes intended. llvm-svn: 188554	2013-08-16 14:33:07 +00:00
Benjamin Kramer	309206667d	When initializing the PIC global base register on ARM/ELF add pc to fix the address. This unbreaks PIC with fast isel on ELF targets (PR16717). The output matches what GCC and SDag do for PIC but may not cover all of the many flavors of PIC that exist. llvm-svn: 188551	2013-08-16 12:52:08 +00:00
Mihai Popa	46c1bcb4e9	Add support for Thumb2 literal loads with negative zero offset Thumb2 literal loads use an offset encoding which allows for negative zero. This fixes parsing and encoding so that #-0 is correctly processed. The parser represents #-0 as INT32_MIN. llvm-svn: 188549	2013-08-16 12:03:00 +00:00
Mihai Popa	cf276b2c88	Fix Thumb2 aliasing complementary instructions taking modified immediates There are many Thumb instructions which take 12-bit immediates encoded in a special 8-byte value + 4-byte rotator form. Not all numbers are represented, and it's legal to transform an assembly instruction to be able to encode the immediate. For example: AND and BIC are complementary instructions; one can switch the AND to a BIC as long as the immediate is complemented. The intent is to switch one instruction into its complementary one when the immediate cannot be encoded in the form requested in the original assembly and when the complementary immediate is encodable. The patch addresses two issues: 1. definition of t2SOImmNot immediate - it has to check that the orignal value is not encoded naturally 2. t2AND and t2BIC instruction aliases which should use the Thumb2 SOImm operand rather than the ARM one. llvm-svn: 188548	2013-08-16 11:55:44 +00:00
Richard Sandiford	0dec06a28c	[SystemZ] Use SRST to implement strlen and strnlen It would also make sense to use it for memchr; I'm working on that now. llvm-svn: 188547	2013-08-16 11:41:43 +00:00
Richard Sandiford	bb83a50f57	[SystemZ] Use MVST to implement strcpy and stpcpy llvm-svn: 188546	2013-08-16 11:29:37 +00:00
Richard Sandiford	ca23271010	[SystemZ] Use CLST to implement strcmp llvm-svn: 188544	2013-08-16 11:21:54 +00:00
Richard Sandiford	e3827751e2	[SystemZ] Fix handling of 64-bit memcmp results Generalize r188163 to cope with return types other than MVT::i32, just as the existing visitMemCmpCall code did. I've split this out into a subroutine so that it can be used for other upcoming patches. I also noticed that I'd used the wrong API to record the out chain. It's a load that uses DAG.getRoot() rather than getRoot(), so the out chain should go on PendingLoads. I don't have a testcase for that because we don't do any interesting scheduling on z yet. llvm-svn: 188540	2013-08-16 10:55:47 +00:00
Richard Sandiford	a59012577c	[SystemZ] Fix sign of integer memcmp result r188163 used CLC to implement memcmp. Code that compares the result directly against zero can test the CC value produced by CLC, but code that needs an integer result must use IPM. The sequence I'd used was: ipm <reg> sll <reg>, 2 sra <reg>, 30 but I'd forgotten that this inverts the order, so that CC==1 ("less") becomes an integer greater than zero, and CC==2 ("greater") becomes an integer less than zero. This sequence should only be used if the CLC arguments are reversed to compensate. The problem then is that the branch condition must also be reversed when testing the CLC result directly. Rather than do that, I went for a different sequence that works with the natural CLC order: ipm <reg> srl <reg>, 28 rll <reg>, <reg>, 31 One advantage of this is that it doesn't clobber CC. A disadvantage is that any sign extension to 64 bits must be done separately, rather than being folded into the shifts. llvm-svn: 188538	2013-08-16 10:22:54 +00:00
Vladimir Medic	2df9ee6ec8	This patch implements wait instruction for mips. Examples are added in test files. llvm-svn: 188537	2013-08-16 10:17:03 +00:00
Craig Topper	8c929627d9	Don't use v16i32 for load pattern matching. All 512-bit loads are cated to v8i64. llvm-svn: 188534	2013-08-16 06:07:34 +00:00
Tom Stellard	dba25713a6	Revert "R600/SI: Fix incorrect encoding of DS_WRITE_B32 instructions" This reverts commit a6a39ced095c2f453624ce62c4aead25db41a18f. This is the wrong version of this fix. llvm-svn: 188523	2013-08-16 01:18:43 +00:00
Tom Stellard	82bef57f20	R600/SI: Fix incorrect encoding of DS_WRITE_B32 instructions The SIInsertWaits pass was overwriting the first operand (gds bit) of DS_WRITE_B32 with the second operand (value to write). This meant that any time the value to write was stored in an odd number VGPR, the gds bit would be set causing the instruction to write to GDS instead of LDS. llvm-svn: 188522	2013-08-16 01:12:20 +00:00
Tom Stellard	b03edeca67	R600: Add support for global vector loads with element types less than 32-bits Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 188521	2013-08-16 01:12:16 +00:00
Tom Stellard	fbab827e2a	R600: Add support for global vector stores with elements less than 32-bits Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 188520	2013-08-16 01:12:11 +00:00
Tom Stellard	d3ee8c103a	R600: Add support for i16 and i8 global stores Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 188519	2013-08-16 01:12:06 +00:00
Tom Stellard	6d1379e180	R600: Add support for v4i32 stores on Cayman Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 188518	2013-08-16 01:12:00 +00:00
Tom Stellard	16da74c205	R600: Enable folding of inline literals into REQ_SEQUENCE instructions Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 188517	2013-08-16 01:11:55 +00:00
Tom Stellard	676c16d088	R600: Add IsExport bit to TableGen instruction definitions Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 188516	2013-08-16 01:11:51 +00:00
Tom Stellard	ac00f9df79	R600: Change the RAT instruction assembly names so they match the docs Tested-by: Aaron Watry <awatry@gmail.com> llvm-svn: 188515	2013-08-16 01:11:46 +00:00
Jim Grosbach	20e3b9ac30	InstCombine: Simplify if(x!=0 && x!=-1). When both constants are positive or both constants are negative, InstCombine already simplifies comparisons like this, but when it's exactly zero and -1, the operand sorting ends up reversed and the pattern fails to match. Handle that special case. Follow up for rdar://14689217 llvm-svn: 188512	2013-08-16 00:15:20 +00:00
Aaron Ballman	0e63e53da1	Tighten up the yamilizer so it stops eliding empty sequences if the embedded empty sequence is the first key/value in a map which is itself in a sequence. Patch with help from Nick Kledzik. llvm-svn: 188508	2013-08-15 23:17:53 +00:00
Matt Arsenault	1de76773bc	Don't do FoldCmpLoadFromIndexedGlobal for non inbounds GEPs This path wasn't tested before without a datalayout, so add some more tests and re-run with and without one. llvm-svn: 188507	2013-08-15 23:11:07 +00:00
Matt Arsenault	5cae894a13	Fix spelling llvm-svn: 188506	2013-08-15 23:11:03 +00:00
Lang Hames	8a71d53448	Support X86_64_GOTLoad relocations in RuntimeDyldMachO by treating them the same way as X86_64_GOT relocations. The 'Load' part of GOTLoad is just an optimization hint for the linker anyway, and can be safely ignored. This patch also fixes some minor issues with the relocations introduced while processing an X86_64_GOT[Load]: the addend for the GOT entry should always be zero, and the addend for the replacement relocation at the original offset should be the same as the addend of the relocation being replaced. I haven't come up with a good way of testing this yet, but I'm working on it. This fixes <rdar://problem/14651564>. llvm-svn: 188499	2013-08-15 22:31:40 +00:00
Yunzhong Gao	c0c2b16932	Fixing a corner-case bug in strchr and strrchr lib call optimizations where the input character is not converted to char before comparing with zero. The patch was discussed in this thread: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20130812/184069.html llvm-svn: 188489	2013-08-15 20:58:59 +00:00
Renato Golin	ca570633c5	make arm-use-movt available for all ARM Before this patch this flag is IOS specific, but is also useful for bare project like bootloaders / kernels etc, since movw / movt prevents simple relocation. Therefore make this flag more commonly available. note: this patch depends on a similiar rename in clang Patch by Jeroen Hofstee. llvm-svn: 188487	2013-08-15 20:54:38 +00:00
Renato Golin	0a41d9ae7f	make arm-reserve-r9 available for all ARM r9 is defined as a platform-specific register in the ARM EABI. It can be reserved for a special purpose or be used as a general purpose register. Add support for reserving r9 for all ARM, while leaving the IOS usage unchanged. Patch by Jeroen Hofstee. llvm-svn: 188485	2013-08-15 20:45:13 +00:00
Bill Wendling	33fae6935a	Make a few more things const. llvm-svn: 188484	2013-08-15 20:25:44 +00:00
Bill Wendling	2d092f05b4	Use a reference instead of making an unnecessary copy. Also use 'const'. llvm-svn: 188483	2013-08-15 20:21:49 +00:00
Peter Collingbourne	444c59e270	DataFlowSanitizer: Add a debugging feature to help us track nonzero labels. Summary: When the -dfsan-debug-nonzero-labels parameter is supplied, the code is instrumented such that when a call parameter, return value or load produces a nonzero label, the function __dfsan_nonzero_label is called. The idea is that a debugger breakpoint can be set on this function in a nominally label-free program to help identify any bugs in the instrumentation pass causing labels to be introduced. Reviewers: eugenis CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1405 llvm-svn: 188472	2013-08-15 18:51:12 +00:00
Bill Wendling	2851907cdb	Constify the function parameters. llvm-svn: 188469	2013-08-15 18:46:14 +00:00
Mihai Popa	d79f00ba68	This fixes three issues related to Thumb literal loads: 1. The offset range for Thumb1 PC relative loads is [0..1020] and not [-1024..1020] 2. Thumb2 PC relative loads may define the PC, so the restriction placed on target register is removed 3. Removes unneeded alias between "ldr.n" and t1LDRpci. ".n" is actually stripped by both tablegen and the ASM parser, so this alias rule really does nothing llvm-svn: 188466	2013-08-15 15:43:06 +00:00
Jack Carter	d12e837f05	[Mips][msa] Added the simple builtins (madd_q to xori) Includes: madd_q, maddr_q, maddv, max_[asu], maxi_[su], min_[asu], mini_[su], mod_[su], msub_q, msubr_q, msubv, mul_q, mulr_q, mulv, nloc, nlzc, nori, ori, pckev, pckod, pcnt, sat_[su], shf, sld, sldi, sll, slli, splat, splati, sr[al], sr[al]i, subs_[su], subss_u, subus_s, subv, subvi, vshf, xori Patch by Daniel Sanders llvm-svn: 188460	2013-08-15 14:22:07 +00:00
Jack Carter	b95ee69163	[Mips][msa] Added the simple builtins (fadd to ftq) Includes: fadd, fceq, fcg[et], fclass, fcl[et], fcne, fcun, fdiv, fexdo, fexp2, fexup[lr], ffint_[su], ffql, ffqr, fill, flog2, fmadd, fmax, fmax_a, fmin, fmin_a, fmsub, fmul, frint, frcp, frsqrt, fseq, fsge, fsgt, fsle, fslt, fsne, fsqr, fsub, ftint_s, ftq Patch by Daniel Sanders llvm-svn: 188458	2013-08-15 13:45:36 +00:00
Jack Carter	babdcc8c2c	[Mips][msa] Added the simple builtins (add_a to dpsub[su], ilvev to ldi) Includes: add_a, adds_[asu], addv, addvi, andi.b, asub_[su].[bhwd], aver?_[su]_[bhwd], bclr, bclri, bins[lr], bins[lr]i, bmnzi, bmzi, bneg, bnegi, bseli, bset, bseti, c(eq\|ne), c(eq\|ne)i, cl[et]_[su], cl[et]i_[su], copy_[su].[bhw], div_[su], dotp_[su], dpadd_[su], dpsub_[su], ilvev, ilvl, ilvod, ilvr, insv, insve, ldi Patch by Daniel Sanders llvm-svn: 188457	2013-08-15 12:24:57 +00:00
Craig Topper	8dbc7e9d35	Revert r188449 as it turns out we're just missing the instructions that need the v16i32/v16f32 matching. llvm-svn: 188454	2013-08-15 08:38:25 +00:00
Hao Liu	cd8b02dce3	Clang and AArch64 backend patches to support shll/shl and vmovl instructions and ACLE functions llvm-svn: 188451	2013-08-15 08:26:11 +00:00
Craig Topper	2ffd06528d	Don't let isPermImmMask handle v16i32 since VPERMI doesn't match on that type. Remove 128-bit vector handling from isPermImmMask too, it's covered by isPSHUFDMask. llvm-svn: 188449	2013-08-15 07:30:51 +00:00
Alexey Samsonov	3186eb3efd	Tentative fix for global-buffer-overflow caused by r188426. Found by AddressSanitizer llvm-svn: 188448	2013-08-15 07:11:34 +00:00
Craig Topper	83e042a21b	Use MVT instead of EVT in X86ISelDAGToDAG since all the types should be legal. llvm-svn: 188446	2013-08-15 05:57:07 +00:00
Craig Topper	6f4dd2dacf	Use MVT in place of EVT in more X86 operation lowering functions. llvm-svn: 188445	2013-08-15 05:33:45 +00:00
Craig Topper	d9c2783d8f	Replace getValueType().getSimpleVT() with getSimpleValueType(). llvm-svn: 188442	2013-08-15 02:44:19 +00:00
Craig Topper	5671010cbb	Replace getValueType().getSimpleVT() with getSimpleValueType(). Also remove one weird cast from MVT->EVT just to call getSimpleVT(). llvm-svn: 188441	2013-08-15 02:33:50 +00:00
Mark Lacey	9d8103de7a	Auto-compute live intervals on demand. When new virtual registers are created during splitting/spilling, defer creation of the live interval until we need to use the live interval. Along with the recent commits to notify LiveRangeEdit when new virtual registers are created, this makes it possible for functions like TargetInstrInfo::loadRegFromStackSlot() and TargetInstrInfo::storeRegToStackSlot() to create multiple virtual registers as part of the process of generating loads/stores for different register classes, and then have the live intervals for those new registers computed when they are needed. llvm-svn: 188437	2013-08-14 23:50:16 +00:00
Mark Lacey	f367cd9239	Notify LiveRangeEdit of new virtual registers. Add a delegate class to MachineRegisterInfo with a single virtual function, MRI_NoteNewVirtualRegister(). Update LiveRangeEdit to inherit from this delegate class and override the definition of the callback with an implementation that tracks the newly created virtual registers. llvm-svn: 188435	2013-08-14 23:50:09 +00:00
Mark Lacey	f9ea88546f	Track new virtual registers by register number. Track new virtual registers by register number, rather than by the live interval created for them. This is the first step in separating the creation of new virtual registers and new live intervals. Eventually live intervals will be created and populated on demand after the virtual registers have been created and used in instructions. llvm-svn: 188434	2013-08-14 23:50:04 +00:00
Tom Stellard	d86003e31f	R600/SI: Improve legalization of vector operations This should fix hangs in the OpenCL piglit tests. llvm-svn: 188431	2013-08-14 23:25:00 +00:00
Tom Stellard	6785065ace	R600/SI: Replace v1i32 type with i32 in imageload and sample intrinsics llvm-svn: 188430	2013-08-14 23:24:53 +00:00
Tom Stellard	9fa1791a1b	R600/SI: Convert v16i8 resource descriptors to i128 Now that compute support is better on SI, we can't continue using v16i8 for descriptors since this is also a legal type in OpenCL. This patch fixes numerous hangs with the piglit OpenCL test and since we now use a target specific DAG node for LOAD_CONSTANT with the correct MemOperandFlags, this should also fix: https://bugs.freedesktop.org/show_bug.cgi?id=66805 llvm-svn: 188429	2013-08-14 23:24:45 +00:00
Tom Stellard	8e5da41374	R600/SI: Lower BUILD_VECTOR to REG_SEQUENCE v2 Using REG_SEQUENCE for BUILD_VECTOR rather than a series of INSERT_SUBREG instructions should make it easier for the register allocator to coalasce unnecessary copies. v2: - Use an SGPR register class if all the operands of BUILD_VECTOR are SGPRs. llvm-svn: 188427	2013-08-14 23:24:32 +00:00
Tom Stellard	df94dc3917	R600/SI: Choose the correct MOV instruction for copying immediates The instruction selector will now try to infer the destination register so it can decided whether to use V_MOV_B32 or S_MOV_B32 when copying immediates. llvm-svn: 188426	2013-08-14 23:24:24 +00:00
Tom Stellard	16a9a205c8	R600/SI: Assign a register class to the $vaddr operand for MIMG instructions The previous code declared the operand as unknown:$vaddr, which made it possible for scalar registers to be used instead of vector registers. llvm-svn: 188425	2013-08-14 23:24:17 +00:00
David Blaikie	d0d6fcc923	DebugInfo: Prefer references over pointers, pass by const reference for a type that will grow in the future llvm-svn: 188422	2013-08-14 22:23:05 +00:00
Tom Stellard	3494b7ee42	R600/SI: Handle MSAA texture targets Patch by: Marek Olšák Signed-off-by: Marek Olšák <marek.olsak@amd.com> llvm-svn: 188421	2013-08-14 22:22:14 +00:00
Tom Stellard	20ee94f152	R600/SI: Allow conversion between v32i8 and v8i32 Patch by: Marek Olšák Signed-off-by: Marek Olšák <marek.olsak@amd.com> llvm-svn: 188420	2013-08-14 22:22:09 +00:00
Tom Stellard	a36f077159	R600/SI: Fix an obvious typo Patch by: Marek Olšák Signed-off-by: Marek Olšák <marek.olsak@amd.com> llvm-svn: 188419	2013-08-14 22:22:03 +00:00
Tom Stellard	73c31d541e	R600/SI: Add pattern for fp_to_uint This fixes the F2U opcode for the Mesa driver. Patch by: Marek Olšák Signed-off-by: Marek Olšák <marek.olsak@amd.com> llvm-svn: 188418	2013-08-14 22:21:57 +00:00
Mark Lacey	a2626555f1	Fix small typo: s/succ/Succ/ llvm-svn: 188415	2013-08-14 22:11:42 +00:00
Peter Collingbourne	9d31d6f329	DataFlowSanitizer: Instrumentation for memset. Differential Revision: http://llvm-reviews.chandlerc.com/D1395 llvm-svn: 188412	2013-08-14 20:51:38 +00:00
Hal Finkel	b3ca00d2a3	Actually fix PPC64 64-bit GPR inline asm constraint matching This is a follow-up to r187693, correcting that code to request the correct register class. The previous version, with the wrong register class, was not really correcting the constraints, but rather was removing them. Coincidentally, this fixed the failing test case in r187693, but obviously created other problems. llvm-svn: 188407	2013-08-14 20:05:04 +00:00
Peter Collingbourne	68162e7512	DataFlowSanitizer: greylist is now ABI list. This replaces the old incomplete greylist functionality with an ABI list, which can provide more detailed information about the ABI and semantics of specific functions. The pass treats every function in the "uninstrumented" category in the ABI list file as conforming to the "native" (i.e. unsanitized) ABI. Unless the ABI list contains additional categories for those functions, a call to one of those functions will produce a warning message, as the labelling behaviour of the function is unknown. The other supported categories are "functional", "discard" and "custom". - "discard" -- This function does not write to (user-accessible) memory, and its return value is unlabelled. - "functional" -- This function does not write to (user-accessible) memory, and the label of its return value is the union of the label of its arguments. - "custom" -- Instead of calling the function, a custom wrapper __dfsw_F is called, where F is the name of the function. This function may wrap the original function or provide its own implementation. Differential Revision: http://llvm-reviews.chandlerc.com/D1345 llvm-svn: 188402	2013-08-14 18:54:12 +00:00
Reid Kleckner	be85cb9098	Use the MSVC __cpuid intrinsic instead of inline asm This works around PR16830 in LLVM when self-hosting clang on Windows. llvm-svn: 188397	2013-08-14 18:21:51 +00:00
Jakob Stoklund Olesen	4417c7b265	Remove unnecessary parameter to RenumberValues. Patch by Matthias Braun! llvm-svn: 188393	2013-08-14 17:28:52 +00:00

... 5 6 7 8 9 ...

63917 Commits