llvm-project

Commit Graph

Author	SHA1	Message	Date
Tom Stellard	69f86d199a	R600: Remove some dead code from the AMDILCFGStructurizer Reviewed-by: Vincent Lejeune<vljn at ovi.com> llvm-svn: 192812	2013-10-16 17:05:56 +00:00
Andrew Kaylor	f2b68f6754	Adding oprofile support for MCJIT. Patch by Dmitry Stogov llvm-svn: 192809	2013-10-16 16:32:47 +00:00
Chad Rosier	f2b254558f	Fix comment. llvm-svn: 192805	2013-10-16 16:22:15 +00:00
Rafael Espindola	40a3d01849	Assert on duplicate registration. Don't depend on function pointer equality. Before this patch we would assert when building llvm as multiple shared libraries (cmake's BUILD_SHARED_LIBS). The problem was the line if (T.AsmStreamerCtorFn == Target::createDefaultAsmStreamer) which returns false because of -fvisibility-inlines-hidden. It is easy to fix just this one case, but I decided to try to also make the registration more strict. It looks like the old logic for ignoring followup registration was just a temporary hack that outlived its usefulness. This patch converts the ifs to asserts, fixes the few cases that were registering twice and makes sure all the asserts compare with null. Thanks to Joerg for reporting the problem and reviewing the patch. llvm-svn: 192803	2013-10-16 16:21:40 +00:00
Chad Rosier	178b1cefc7	[AArch64] Add support for NEON scalar signed saturating accumulate of unsigned value and unsigned saturating accumulate of signed value instructions. llvm-svn: 192800	2013-10-16 16:09:02 +00:00
Arnold Schwaighofer	5078ea2bd9	SLPVectorizer: Don't vectorize volatile memory operations radar://15231682 llvm-svn: 192799	2013-10-16 16:09:00 +00:00
Benjamin Kramer	00eb07b791	DAGCombiner: Don't fold xor into not if getNOT would introduce an illegal constant. This happens e.g. with <2 x i64> -1 on x86_32. It cannot be generated directly because i64 is illegal. It would be nice if getNOT would handle this transparently, but I don't see a way to generate a legal constant there right now. Fixes PR17487. llvm-svn: 192795	2013-10-16 14:16:19 +00:00
Kostya Serebryany	d3d23bec66	[asan] Optimize accesses to global arrays with constant index Summary: Given a global array G[N], which is declared in this CU and has static initializer avoid instrumenting accesses like G[i], where 'i' is a constant and 0<=i<N. Also add a bit of stats. This eliminates ~1% of instrumentations on SPEC2006 and also partially helps when asan is being run together with coverage. Reviewers: samsonov Reviewed By: samsonov CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D1947 llvm-svn: 192794	2013-10-16 14:06:14 +00:00
Richard Sandiford	3e382972d9	[SystemZ] Handle extensions in RxSBG optimizations The input to an RxSBG operation can be narrower as long as the upper bits are don't care. This fixes a FIXME added in r192783. llvm-svn: 192790	2013-10-16 13:35:13 +00:00
Richard Sandiford	f722a8e30e	[SystemZ] Improve handling of SETCC We previously used the default expansion to SELECT_CC, which in turn would expand to "LHI; BRC; LHI". In most cases it's better to use an IPM-based sequence instead. llvm-svn: 192784	2013-10-16 11:10:55 +00:00
Richard Sandiford	374a0e50c4	Handle (shl (anyext (shr ...))) in SimplifyDemandedBits This is really an extension of the current (shl (shr ...)) -> shl optimization. The main difference is that certain upper bits must also not be demanded. The motivating examples are the first two in the testcase, which occur in llvmpipe output. llvm-svn: 192783	2013-10-16 10:26:19 +00:00
Bill Wendling	91e6f6e198	Add a 'deleteModule' method to the Linker class. This deletes the Module ivar instead of having the LTO code generator do it. It also sets the pointer to 'NULL', so that if it's used again it will abort quickly. llvm-svn: 192778	2013-10-16 08:59:57 +00:00
NAKAMURA Takumi	272416fda9	Revert r192758 (and r192759), "MC: Better handling of tricky symbol and section names" GNU AS didn't like quotes in symbol names. Error: junk at end of line, first unrecognized character is `"' .def "@feat.00"; "@feat.00" = 1 Reproduced on Cygwin's 2.23.52.20130309 and mingw32's 2.20.1.20100303. llvm-svn: 192775	2013-10-16 08:22:49 +00:00
Craig Topper	c2ccbaffa3	Really fix build warning/error that I think r192756 was trying to fix. llvm-svn: 192773	2013-10-16 06:50:36 +00:00
Will Dietz	70b66f0df2	TypeFinder: prefer iterative algorithm to keep stack usage low. Introduce subtype_reverse_iterator to maintain the numbering assigned during the recursive type walk. llvm-svn: 192770	2013-10-16 04:10:06 +00:00
Rui Ueyama	50a86e125a	Fix a bug in Windows resource file detection. The magic bytes should not include the trailing NUL byte. llvm-svn: 192769	2013-10-16 03:29:49 +00:00
Rafael Espindola	0018a59d01	Add support for metadata representing .ident directives. llvm-svn: 192764	2013-10-16 01:49:05 +00:00
Eric Christopher	d2b497b522	Fix a pair of bugs in the emission of pubname tables: 1) Make sure we emit static member variables by checking at the end of createGlobalVariableDIE rather than piecemeal in the function. (As a note, createGlobalVariableDIE needs rewriting.) 2) Make sure we use the definition rather than declaration DIE for two things: a) determining linkage for gnu pubnames, and b) as the address of the DIE for global variables. (As a note, createGlobalVariableDIE really needs rewriting.) Adjust the testcase to make sure we're checking the correct DIEs. llvm-svn: 192761	2013-10-16 01:37:49 +00:00
Rafael Espindola	43c4e24fad	Add a MCAsmInfoELF class and factor some code into it. We had a MCAsmInfoCOFF, but no common class for all the ELF MCAsmInfos before. llvm-svn: 192760	2013-10-16 01:34:32 +00:00
Hans Wennborg	d34cf14339	MC: Better handling of tricky symbol and section names Because of win32 mangling, we produce symbol and section names with funny characters in them, most notably @ characters. MC would choke on trying to parse its own assembly output. This patch addresses that by: - Making @ trigger quoting of symbol names - Also quote section names in the same way - Just parse section names like other identifiers (to allow for quotes) - Don't assume @ signifies a symbol variant if it is in a string. Differential Revision: http://llvm-reviews.chandlerc.com/D1945 llvm-svn: 192758	2013-10-16 01:20:40 +00:00
Rafael Espindola	5645bade1b	Move .ident handling to MCStreamer. No functionality change, but exposes the API so that codegen can use it too. Patch by Katya Romanova. llvm-svn: 192757	2013-10-16 01:05:45 +00:00
Andrew Kaylor	b01a3bec88	Fixing build warning/error llvm-svn: 192756	2013-10-16 01:01:15 +00:00
David Blaikie	94ded5f39e	Simplify zero initialization of DIEAttrs variable. llvm-svn: 192755	2013-10-16 00:47:21 +00:00
Andrew Kaylor	877b931a41	Adding padding to the .eh_frame section in RuntimeDyld llvm-svn: 192754	2013-10-16 00:32:24 +00:00
Andrew Kaylor	c442a76c60	Adding support for deregistering EH frames with MCJIT. Patch by Yaron Keren llvm-svn: 192753	2013-10-16 00:14:21 +00:00
Matt Arsenault	226580656b	Fix typo llvm-svn: 192752	2013-10-15 23:44:48 +00:00
Matt Arsenault	df90c02e68	Fix missing C++ mode thing in header llvm-svn: 192751	2013-10-15 23:44:45 +00:00
Andrew Trick	e97d8d6dde	Enable MI Sched for x86. This changes the SelectionDAG scheduling preference to source order. Soon, the SelectionDAG scheduler can be bypassed saving a nice chunk of compile time. Performance differences that result from this change are often a consequence of register coalescing. The register coalescer is far from perfect. Bugs can be filed for deficiencies. On x86 SandyBridge/Haswell, the source order schedule is often preserved, particularly for small blocks. Register pressure is generally improved over the SD scheduler's ILP mode. However, we are still able to handle large blocks that require latency hiding, unlike the SD scheduler's BURR mode. MI scheduler also attempts to discover the critical path in single-block loops and adjust heuristics accordingly. The MI scheduler relies on the new machine model. This is currently unimplemented for AVX, so we may not be generating the best code yet. Unit tests are updated so they don't depend on SD scheduling heuristics. llvm-svn: 192750	2013-10-15 23:33:07 +00:00
Eric Christopher	a6c38a32a9	Make sure we're not attempting to construct a subprogram DIE twice and just look up the value. Fix the one case where we were trying to create a subprogram DIE and we should already have had one. Reflow formatting in collectDeadVariables while fixing. llvm-svn: 192749	2013-10-15 23:31:38 +00:00
Eric Christopher	5cb56322b8	Add an assert that we have a scope that matters for methods and remove a call to getNonCompileUnitScope as a method shouldn't be in the compile unit scope. llvm-svn: 192748	2013-10-15 23:31:36 +00:00
Eric Christopher	98f9c23614	Clean up, formatting, comments. No functional change. llvm-svn: 192747	2013-10-15 23:31:31 +00:00
Vincent Lejeune	5d6c2c318b	R600/SI: Remove some leftover MI dump call llvm-svn: 192743	2013-10-15 22:48:51 +00:00
Rui Ueyama	fc149a69cf	Path: Recognize Windows compiled resource file. Some background: One can pass compiled resource files (.res files) directly to the linker on Windows. If a resource file is given, the linker will run the "cvtres" command in the background to convert the resource file to a COFF file to link it. What I'm trying to do with this patch is to make the linker recognize the resource file by file magic, so that it can run the cvtres command. Differential Revision: http://llvm-reviews.chandlerc.com/D1943 llvm-svn: 192742	2013-10-15 22:45:38 +00:00
Andrew Kaylor	2ba21c5b1e	Separating ELF and MachO stub info functions for RuntimeDyld llvm-svn: 192737	2013-10-15 21:32:56 +00:00
Chad Rosier	9d51708677	[AArch64] Add support for NEON scalar signed saturating absolute value and scalar signed saturating negate instructions. llvm-svn: 192733	2013-10-15 21:18:44 +00:00
Andrew Kaylor	33c5b1bbe9	Fixing some host==target assumptions in RuntimeDyld llvm-svn: 192732	2013-10-15 20:44:55 +00:00
Adrian Prantl	5bf1d0093b	Remove some dead code. (DarwinGDBCompat was retired in r189903). llvm-svn: 192731	2013-10-15 20:26:37 +00:00
Manman Ren	fd956dbae0	Struct byval: fix a copy-paste error for thumb2. PR17309 llvm-svn: 192730	2013-10-15 19:42:32 +00:00
Michael Liao	ad71659def	Fix PR17546 - The type of the index used in extract_vector_elt or insert_vector_elt is supposed to be TLI.getVectorIdxTy(), which is the pointer type on most targets. It's better to truncate (or zero-extend in case it's changed later) it to the mask element type to guarantee they match instead of asserting that. llvm-svn: 192722	2013-10-15 17:51:58 +00:00
Michael Liao	8ba068211d	Fix PR16807 - Lower signed division by constant powers-of-2 to target-independent DAG operators instead of target-dependent ones to support them better on targets where vector types are legal but shift operators on those types are illegal. E.g., on AVX, PSRAW is only available on <8 x i16> though <16 x i16> is a legal type. llvm-svn: 192721	2013-10-15 17:51:02 +00:00
Benjamin Kramer	c97850be76	LoopVectorize: Properly reflect PODness in comments. llvm-svn: 192717	2013-10-15 16:19:54 +00:00
Pekka Jaaskelainen	eb4a6e7c28	Guard the debug temp variable with NDEBUG to avoid warning/error with NDEBUG defined. llvm-svn: 192709	2013-10-15 14:40:46 +00:00
Pekka Jaaskelainen	eb08e2e0c8	Do not assert when trying to add a meta data operand with MachineInstr::addOperand(). llvm-svn: 192707	2013-10-15 14:18:10 +00:00
Daniel Sanders	1dfddc73dc	[mips][msa] Added support for build_vector for v4f32 and v2f64. llvm-svn: 192699	2013-10-15 13:14:41 +00:00
Anders Waldenborg	0c3b653922	Revert "Add AllTargetsBindings sublibrary" as it breaks cmake build on (at least) windows and darwin. llvm-svn: 192697	2013-10-15 13:04:27 +00:00
Anders Waldenborg	1d9cb434b3	Add AllTargetsBindings sublibrary instead of having static inlines in the llvm-c headers. This new library will be linked in when using the "all-targets" component and contains the LLVMInitializeAll* functions. This means that those functions will exist as real symbols in the shared library, and can therefore be called from bindings that use ffi with the shared library. llvm-svn: 192690	2013-10-15 12:08:59 +00:00
Richard Sandiford	6af6ff1e15	[SystemZ] Use A(G)SI when spilling the target of a constant addition llvm-svn: 192681	2013-10-15 08:42:59 +00:00
Job Noorman	e9a1d4c274	Fix MSP430 calling convention to match MSPGCC llvm-svn: 192678	2013-10-15 08:19:39 +00:00
Craig Topper	ef9e993eaa	Remove x86_sse42_crc32_64_8 intrinsic. It has no functional difference from x86_sse42_crc32_32_8 and was not mapped to a clang builtin. I'm not even sure why this form of the instruction is even called out explicitly in the docs. Also add AutoUpgrade support to convert it into the other intrinsic with appropriate trunc and zext. llvm-svn: 192672	2013-10-15 05:20:47 +00:00
Andrew Trick	3a99693c5a	Improve on r192635, ExeDepsFix for avx, and add a test case. rdar:15221834 False AVX register dependencies cause 5x slowdown on flops-5/6 and significant slowdown on several others. This was blocking the switch to MI-Sched. llvm-svn: 192669	2013-10-15 03:39:43 +00:00
Akira Hatanaka	06aff571a3	[mips] Define a pseudo instruction which writes to both the lower and higher parts of the accumulators and gets expanded post-RA. llvm-svn: 192667	2013-10-15 01:48:30 +00:00
Akira Hatanaka	ec67c90216	[mips] Use predicates to guard instructions using accumulator registers instead of relying on AddedComplexity. llvm-svn: 192665	2013-10-15 01:21:37 +00:00
Akira Hatanaka	d98c99fd01	[mips] Rename isel nodes. llvm-svn: 192663	2013-10-15 01:12:50 +00:00
Akira Hatanaka	86c3c794fa	[mips] Transfer kill flag to the newly created operand. llvm-svn: 192662	2013-10-15 01:06:30 +00:00
Akira Hatanaka	8368b3b3df	[mips] Set HI/LO registers' HWEncoding field. llvm-svn: 192661	2013-10-15 01:00:00 +00:00
Akira Hatanaka	8f31b2fd3b	[mips] Delete unnecessary code. llvm-svn: 192660	2013-10-15 00:48:42 +00:00
Michael Gottesman	53c885c37a	Update comment list of GLOBALVAR modifiers in BitcodeWriter to include externally_initialized. Thanks to Shuxin Yang for catching this. llvm-svn: 192637	2013-10-14 22:36:51 +00:00
Quentin Colombet	778dba1dd8	[X86][FastISel] During X86 fastisel, the address of indirect call was resolved through bitcast, ptrtoint, and inttoptr instructions. This is valid only if the related instructions are in that same basic block, otherwise we may reference variables that were not live across basic blocks resulting in undefined virtual registers. The bug was exposed when both SDISel and FastISel were used within the same function, i.e., one basic block is issued with FastISel and another with SDISel, as demonstrated with the testcase. <rdar://problem/15192473> llvm-svn: 192636	2013-10-14 22:32:09 +00:00
Andrew Trick	b6d56be69d	Fix the ExecutionDepsFix pass to handle AVX instructions. This pass is needed to break false dependencies. Without it, unlucky register assignment can result in wild (5x) swings in performance. This pass was trying to handle AVX but not getting it right. AVX doesn't have partial register defs, it has unused register reads in which the high bits of a source operand are copied into the unused bits of the dest. Fixing this requires conservative liveness analysis. This is awkward because the pass already has its own pseudo-liveness. However, proper liveness is expensive, and we would like to use a generic utility to compute it. The fix only invokes liveness on-demand. It is rare to detect a case that needs undef-read dependence breaking, but when it happens, it can be needed many times within a very large block. I think the existing heuristic which uses a register window of 16 is too conservative for loop-carried false dependencies. If the loop is a reduction, the out-of-order engine may be able to execute several loop iterations in parallel. However, I'll leave this tuning exercise for next time. llvm-svn: 192635	2013-10-14 22:19:03 +00:00
Andrew Trick	e2f7cc4cf3	LiveRegUnits: Use *MBB for consistency and convenience. llvm-svn: 192634	2013-10-14 22:18:59 +00:00
Andrew Trick	8460a3bfa1	whitespace llvm-svn: 192633	2013-10-14 22:18:56 +00:00
Eric Christopher	740025745b	Revert part of a fix from 2010, changes since then: a) x86-64 TLS has been documented b) the code path should use movq for the correct relocation to be generated. I've also added a fixme for the test case that we should improve the code generated, it should look something like what is documented in the tls abi document. llvm-svn: 192631	2013-10-14 21:52:26 +00:00
Eric Christopher	755711e510	Reformat this routine slightly. llvm-svn: 192630	2013-10-14 21:52:23 +00:00
Eric Christopher	584d71c6cb	Remove some extraneous whitespace. llvm-svn: 192629	2013-10-14 21:52:18 +00:00
Andrew Trick	3f4d6c6538	LiveRegUnits::removeRegsInMask safety. Clobbering is exclusive not inclusive on register units. For liveness, we need to consider all the preserved registers. e.g. A regmask that clobbers YMM0 may preserve XMM0. Units are only clobbered when all super-registers are clobbered. llvm-svn: 192623	2013-10-14 20:45:19 +00:00
Andrew Trick	276dd453f0	Use a SparseSet in LiveRegUnits. Some clients may add block live ins and may track liveness over a large scope. This guarantees an efficient implementation in all cases with no memory allocation/deallocation, independent of the number of target registers. It could be slightly less convenient but is fine in the expected case. llvm-svn: 192622	2013-10-14 20:45:17 +00:00
Andrew Trick	0aed0cfc44	Move LiveRegUnits implementation into .cpp. Comment and format. llvm-svn: 192621	2013-10-14 20:45:14 +00:00
Andrew Trick	ff3585c51c	Convert LiveRegUnits methods to the current convention (it's new code). llvm-svn: 192619	2013-10-14 20:45:09 +00:00
Manman Ren	c6b6392794	Debug Info: static member DIE creation. Clean up creation of static member DIEs. We can create static member DIEs from two places, so we call getOrCreateStaticMemberDIE from the two places. getOrCreateStaticMemberDIE will get or create the context DIE first, then it will check if the DIE already exists, if not, we create the static member DIE and add it to the context. Creation of static member DIEs is handled in a similar way as subprogram DIEs. llvm-svn: 192618	2013-10-14 20:33:57 +00:00
David Blaikie	6004dbc9fa	Fix indenting. That wasn't confusing /at all/... llvm-svn: 192617	2013-10-14 20:15:04 +00:00
Will Dietz	5cb7f4e3f2	MachineSink: Fix and tweak critical-edge breaking heuristic. Per original comment, the intention of this loop is to go ahead and break the critical edge (in order to sink this instruction) if there's reason to believe doing so might "unblock" the sinking of additional instructions that define registers used by this one. The idea is that if we have a few instructions to sink "together" breaking the edge might be worthwhile. This commit makes a few small changes to help better realize this goal: First, modify the loop to ignore registers defined by this instruction. We don't sink definitions of physical registers, and sinking an SSA definition isn't going to unblock an upstream instruction. Second, ignore uses of physical registers. Instructions that define physical registers are rejected for sinking, and so moving this one won't enable moving any defining instructions. As an added bonus, while virtual register use-def chains are generally small due to SSA goodness, iteration over the uses and definitions (used by hasOneNonDBGUse) for physical registers like EFLAGS can be rather expensive in practice. (This is the original reason for looking at this) Finally, to keep things simple continue to only consider this trick for registers that have a single use (via hasOneNonDBGUse), but to avoid spuriously breaking critical edges only do so if the definition resides in the same MBB and therefore this one directly blocks it from being sunk as well. If sinking them together is meant to be, let the iterative nature of this pass sink the definition into this block first. Update tests to accommodate this change, add new testcase where sinking avoids pipeline stalls. llvm-svn: 192608	2013-10-14 16:57:17 +00:00
Rafael Espindola	8c1d78ad51	Remove lib/Transforms/Instrumentation/ProfilingUtils.* They were leftover from the old profiling support. Patch by Alastair Murray. llvm-svn: 192605	2013-10-14 16:46:46 +00:00
Rafael Espindola	9770bde505	Remove the now unused strong phi elimination pass. llvm-svn: 192604	2013-10-14 16:39:04 +00:00
Chris Lattner	94fc4bed1f	Basic blocks typically have few predecessors. Use a SmallDenseMap to avoid a heap allocation when this is the case. llvm-svn: 192602	2013-10-14 16:05:55 +00:00
Evgeniy Stepanov	be83d8f693	[msan] Instrument x86._cvt intrinsics. Currently MSan checks that arguments of cvt intrinsics are fully initialized. That's too much to ask: some of them only operate on lower half, or even quarter, of the input register. llvm-svn: 192599	2013-10-14 15:16:25 +00:00
Chad Rosier	d1f40d760a	[AArch64] Add support for NEON scalar integer compare instructions. llvm-svn: 192596	2013-10-14 14:37:20 +00:00
Bernard Ogden	53169762d0	Add Cortex-A57 support llvm-svn: 192591	2013-10-14 13:17:07 +00:00
Bernard Ogden	4400cde89a	Add subtarget feature support for Cortex-A53 Some previous implicit defaults have changed, for example FP and NEON are now on by default. llvm-svn: 192590	2013-10-14 13:16:57 +00:00
Matheus Almeida	2102188cfc	[mips][msa] Direct Object Emission support for BIT instructions. List of instructions: bclri.{b,h,w,d} binsli.{b,h,w,d} binsri.{b,h,w,d} bnegi.{b,h,w,d} bseti.{b,h,w,d} sat_s.{b,h,w,d} sat_u.{b,h,w,d} slli.{b,h,w,d} srai.{b,h,w,d} srari.{b,h,w,d} srli.{b,h,w,d} srlri.{b,h,w,d} llvm-svn: 192589	2013-10-14 13:07:39 +00:00
Matheus Almeida	5be0cd8720	[mips][msa] Direct Object Emission support for VEC instructions. List of instructions: and.v, bmnz.v, bmz.v, bsel.v, nor.v, or.v, xor.v. llvm-svn: 192588	2013-10-14 12:57:18 +00:00
Matheus Almeida	7d65ea76ea	[mips][msa] Direct Object Emission of INSVE.{b,h,w,d}. llvm-svn: 192587	2013-10-14 12:38:17 +00:00
Matheus Almeida	bc189eb318	[mips][msa] Direct Object Emission for the majority of the ELM instructions. List of instructions: copy_s.{b,h,w} copy_u.{b,h,w} sldi.{b,h,w,d} splati.{b,h,w,d} llvm-svn: 192586	2013-10-14 12:22:43 +00:00
Matheus Almeida	b74293dc55	[mips][msa] Direct Object Emission of INSERT.{B,H,W} instruction. INSERT is the first type of MSA instruction that requires a change to the way MSA registers are parsed. This happens because MSA registers may be suffixed by an index in the form of an immediate or a general purpose register. The changes to parseMSARegs reflect that requirement. llvm-svn: 192582	2013-10-14 11:49:30 +00:00
Evgeniy Stepanov	9b5517b127	[msan] Fix handling of scalar select of vectors. llvm-svn: 192575	2013-10-14 09:52:09 +00:00
Elena Demikhovsky	82a46ebe0a	Fixed a bug in dynamic memory allocation on the stack. The alignment of allocated space was wrong, see Bugzilla 17345. Done by Zvi Rackover <zvi.rackover@intel.com>. llvm-svn: 192573	2013-10-14 07:26:51 +00:00
Craig Topper	d7abdb6f12	Create classes to reduce the size of the tablegen entries for the CRC32 instructions. llvm-svn: 192568	2013-10-14 05:19:58 +00:00
Craig Topper	a422b09ae3	Allow pinsrw/pinsrb/pextrb/pextrw/movmskps/movmskpd/pmovmskb/extractps instructions to parse either GR32 or GR64 without resorting to duplicating instructions. llvm-svn: 192567	2013-10-14 04:55:01 +00:00
Craig Topper	4432208884	Add disassembler support for SSE4.1 register/register form of PEXTRW. There is a shorter encoding that was part of SSE2, but a memory form was added in SSE4.1. This is the register form of that encoding. llvm-svn: 192566	2013-10-14 01:42:32 +00:00
Craig Topper	7158745e55	Mark MOVMSKPS/MOVMSKPD/VPINSRWrr64i as AsmParserOnly to remove them from the disassembler tables. Add PINSRWrr64i to complement the AVX version. llvm-svn: 192565	2013-10-14 01:21:22 +00:00
David Majnemer	93fdc3fabf	Windows: Fix a typo in an assert llvm-svn: 192564	2013-10-14 01:17:32 +00:00
Craig Topper	c4a5a3f65d	Don't use 64-bit versions of MOVMSKPD in CodeGen. The instructions only produce a 1-bit result so we can just use SUBREG_TO_REG to extend the 32-bit versions. llvm-svn: 192562	2013-10-14 00:24:33 +00:00
David Majnemer	7af18578f8	Windows: Don't bother with pinning Kernel32.dll We don't delay load it so it shouldn't be going anywhere. llvm-svn: 192561	2013-10-14 00:06:58 +00:00
Will Dietz	5357df6290	MC: Don't assume incoming StringRef's are null terminated. This can happen when processing command line arguments, which are often stored as std::string's and later turned into StringRef's via std::string::data(). Unfortunately this is not guaranteed to return a null-terminated string until C++11, causing breakage on platforms that don't do this. llvm-svn: 192558	2013-10-13 22:09:26 +00:00
Vincent Lejeune	d6cbede9c5	R600: improve dump of S_WAITCNT llvm-svn: 192557	2013-10-13 17:56:28 +00:00
Vincent Lejeune	4ee6dd6136	R600/SI: Add SinkingPass before ISel llvm-svn: 192556	2013-10-13 17:56:21 +00:00
Vincent Lejeune	d623644d17	R600/SI: Support byval arguments llvm-svn: 192555	2013-10-13 17:56:16 +00:00
Vincent Lejeune	fa58a5fb60	R600: Use masked read sel for texture instructions llvm-svn: 192554	2013-10-13 17:56:10 +00:00
Vincent Lejeune	301beb80d4	R600: fix swizzle export llvm-svn: 192553	2013-10-13 17:56:04 +00:00
Vincent Lejeune	533352f696	R600: Clear the VPM bit of export instructions. Setting this bit apparently makes no difference, but the docs recommend leaving it cleared. llvm-svn: 192552	2013-10-13 17:55:57 +00:00
David Majnemer	a5732844a6	Windows: Use GetModuleHandleEx instead of LoadLibrary We were using an anti-pattern of: - LoadLibrary - GetProcAddress - FreeLibrary This is problematic for several reasons: - We are holding on to pointers into a library we just unloaded. - Calling LoadLibrary results in an increase in the reference count of the library in question and any libraries that it depends on and so-on and so-forth. This is none too quick. Instead, use GetModuleHandleEx with GET_MODULE_HANDLE_EX_FLAG_PIN. This is done because we didn't bring the reference for the library into existence and therefore shouldn't count on it being around later. llvm-svn: 192550	2013-10-13 10:34:21 +00:00
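
The r192721 entry above describes lowering signed division by a constant power of two to target-independent operators. As a rough illustration of the shift-and-add idea behind that kind of lowering, here is a minimal scalar sketch in C++. It is not the code from the commit; the function name sdiv_pow2 is hypothetical, and it assumes that >> on a negative int32_t is an arithmetic shift (guaranteed from C++20, and the behavior of mainstream compilers before that).

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: signed division of x by 2^k (1 <= k <= 31) using only
// shifts and an add, rounding toward zero like C++ integer division.
int32_t sdiv_pow2(int32_t x, unsigned k) {
    assert(k >= 1 && k <= 31);
    // Arithmetic shift: 0 for non-negative x, -1 (all ones) for negative x.
    int32_t sign = x >> 31;
    // Logical shift of the sign mask: 2^k - 1 if x is negative, else 0.
    uint32_t bias = static_cast<uint32_t>(sign) >> (32 - k);
    // Adding the bias makes the final arithmetic shift round toward zero.
    int32_t biased = static_cast<int32_t>(static_cast<uint32_t>(x) + bias);
    return biased >> k;
}
```

Without the bias, an arithmetic shift alone would round negative values toward negative infinity, so -7 shifted right by 1 would give -4 rather than the -3 that division requires.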


64695 Commits