llvm-project

Commit Graph

Author	SHA1	Message	Date
Zoran Jovanovic	f4d4d789f7	Use instr mapping for microMIPS in llvm-mc. llvm-svn: 194792	2013-11-15 08:07:34 +00:00
Bob Wilson	da4147c743	Reapply "[asan] Poor man's coverage that works with ASan" I was able to successfully run a bootstrapped LTO build of clang with r194701, so this change does not seem to be the cause of our failing buildbots. llvm-svn: 194789	2013-11-15 07:16:09 +00:00
Andrew Trick	4f0794fd47	Platform proof a test case. llvm-svn: 194788	2013-11-15 05:52:56 +00:00
Matt Arsenault	a9e95abcbf	Add instcombine visitor for addrspacecast llvm-svn: 194786	2013-11-15 05:45:08 +00:00
Matt Arsenault	c5559bb14b	Add target hook to prevent folding some bitcasted loads. This is to avoid this transformation in some cases: fold (conv (load x)) -> (load (conv*)x) On architectures that don't natively support some vector loads efficiently casting the load to a smaller vector of larger types and loading is more efficient. Patch by Micah Villmow. llvm-svn: 194783	2013-11-15 04:42:23 +00:00
Peter Zotov	6519801a6e	[OCaml] Add REQUIRES: native, object-emission to the Target test While the test would work with any compiled in target with object emission support, it's nontrivial to formulate this condition in lit, so a conservative restriction is used instead. llvm-svn: 194781	2013-11-15 03:43:51 +00:00
Bob Wilson	ae73587c4b	Revert "[asan] Poor man's coverage that works with ASan" This reverts commit 194701. Apple's bootstrapped LTO builds have been failing, and this change (along with compiler-rt 194702-194704) is the only thing on the blamelist. I will either reappy these changes or help debug the problem, depending on whether this fixes the buildbots. llvm-svn: 194780	2013-11-15 03:28:22 +00:00
Peter Zotov	9c0f67f13f	[OCaml] Use native target in testsuite instead of hardcoding X86 llvm-svn: 194778	2013-11-15 03:19:08 +00:00
Peter Zotov	0c7f2977ca	[OCaml] Add Target and TargetMachine bindings to Llvm_target llvm-svn: 194774	2013-11-15 02:51:57 +00:00
Peter Zotov	8a1a3bfc05	[OCaml] Refactor Llvm_target interface This commit brings the module structure, argument order and primitive names in Llvm_target in order with the rest of the bindings, in preparation for adding TargetMachine API. llvm-svn: 194773	2013-11-15 02:51:44 +00:00
Reed Kotler	09e59155ef	Make all the conditional Mips 16 branches get initially set for the short form. Constant islands will expand them if they are out of range. Since there is not direct object emitter at this time, it does not have any material affect because the assembler sorts this out. But we need to know for the actual constant island work. We track the difference by putting # 16 inst in the comments. llvm-svn: 194766	2013-11-15 02:21:52 +00:00
Matt Arsenault	b03bd4d96b	Add addrspacecast instruction. Patch by Michele Scandale! llvm-svn: 194760	2013-11-15 01:34:59 +00:00
Tom Stellard	8f9fc20751	R600: Fix scheduling of instructions that use the LDS output queue The LDS output queue is accessed via the OQAP register. The OQAP register cannot be live across clauses, so if value is written to the output queue, it must be retrieved before the end of the clause. With the machine scheduler, we cannot statisfy this constraint, because it lacks proper alias analysis and it will mark some LDS accesses as having a chain dependency on vertex fetches. Since vertex fetches require a new clauses, the dependency may end up spiltting OQAP uses and defs so the end up in different clauses. See the lds-output-queue.ll test for a more detailed explanation. To work around this issue, we now combine the LDS read and the OQAP copy into one instruction and expand it after register allocation. This patch also adds some checks to the EmitClauseMarker pass, so that it doesn't end a clause with a value still in the output queue and removes AR.X and OQAP handling from the scheduler (AR.X uses and defs were already being expanded post-RA, so the scheduler will never see them). Reviewed-by: Vincent Lejeune <vljn at ovi.com> llvm-svn: 194755	2013-11-15 00:12:45 +00:00
Eric Christopher	ad59015bf7	Simplify testcase. llvm-svn: 194748	2013-11-14 23:43:10 +00:00
Rui Ueyama	829c4392e1	Recognize 0x0000 as a COFF file magic. Summary: Some machine-type-neutral object files containing only undefined symbols actually do exist in the Windows standard library. Need to recognize them as COFF files. Reviewers: Bigcheese CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2164 llvm-svn: 194734	2013-11-14 22:09:08 +00:00
Tim Northover	28adfbb0d1	ARM: produce friendly error for invalid inline asm We used to perform an invalid operation on an MVT and crash, which wasn't much fun. Patch by Oliver Stannard. llvm-svn: 194714	2013-11-14 17:15:39 +00:00
Rafael Espindola	f04bb72b61	Add a triple and switch test to FileCheck. On windows we don't print .weak for function definitions, so count was only finding 1 'weak'. llvm-svn: 194713	2013-11-14 17:12:32 +00:00
NAKAMURA Takumi	dc23e8b819	llvm-cov.test: Remove XFAIL:arm. Seems this is passing since my tweaks. llvm-svn: 194712	2013-11-14 17:08:26 +00:00
Rafael Espindola	4929301af4	Error if we see an alias to a declaration. In ELF and COFF an alias is just another offset in a section. There is no way to represent an alias to something in another file. In MachO, the spec has the N_INDR type which should allow for exactly that, but is not currently implemented. Given that it is specified but not implemented, we error in codegen to avoid miscompiling but don't reject aliases to declarations in the verifier to leave the option open of implementing it. In the past we have used alias to declarations as a way of implementing weakref, which is why it exists in some old tests which this patch updates. llvm-svn: 194705	2013-11-14 13:58:06 +00:00
Kostya Serebryany	6da3f74061	[asan] Poor man's coverage that works with ASan llvm-svn: 194701	2013-11-14 13:27:41 +00:00
Evgeniy Stepanov	b22018abed	[msan] Use CHECK-DAG instead of CHECK where order of instructions does not matter. This may fix hexagon-elf bots. llvm-svn: 194700	2013-11-14 12:46:12 +00:00
Evgeniy Stepanov	585813e33d	[msan] Fast path optimization for wrap-indirect-calls feature of MemorySanitizer. Indirect call wrapping helps MSanDR (dynamic instrumentation companion tool for MSan) to catch all cases where execution leaves a compiler-instrumented module by allowing the tool to rewrite targets of indirect calls. This change is an optimization that skips wrapping for calls when target is inside the current module. This relies on the linker providing symbols at the begin and end of the module code (or code + data, does not really matter). Gold linker provides such symbols by default. GNU (BFD) linker needs a link flag: -Wl,--defsym=__executable_start=0. More info: https://code.google.com/p/memory-sanitizer/wiki/MSanDR#Native_exec llvm-svn: 194697	2013-11-14 12:29:04 +00:00
NAKAMURA Takumi	8b2f92a374	llvm-cov.test: Tweak win32 hosts not confused by \r\n in llvm-cov's stdout. "diff -b" -- Ignore space changes. llvm-svn: 194694	2013-11-14 11:45:10 +00:00
Elena Demikhovsky	0a74b7da35	AVX-512: Handled extractelement from mask vector; Added VMOSHDUP/VMOVSLDUP shuffle instructions. llvm-svn: 194691	2013-11-14 11:29:27 +00:00
Matt Arsenault	bc63770800	R600/SI: Add testcase for problem I ran into with the older version of the moveToVALU changes. llvm-svn: 194682	2013-11-14 07:57:29 +00:00
Andrew Trick	561f2218e0	Minor extension to llvm.experimental.patchpoint: don't require a call. If a null call target is provided, don't emit a dummy call. This allows the runtime to reserve as little nop space as it needs without the requirement of emitting a call. llvm-svn: 194676	2013-11-14 06:54:10 +00:00
Kevin Qin	6e0547dfc9	Add test case for AArch64 NEON instruction set misc. llvm-svn: 194673	2013-11-14 06:45:17 +00:00
Rafael Espindola	fe4e088dfb	Don't mangle \n and " There is nothing special about quotes and newlines from the object file point of view, only the assembler has to worry about expanding the \n and \". This patch then removes the special handling from the Mangler. llvm-svn: 194667	2013-11-14 06:05:49 +00:00
Kevin Qin	aec95baf1a	Implement aarch64 neon instruction class SIMD misc. llvm-svn: 194656	2013-11-14 02:44:13 +00:00
NAKAMURA Takumi	87826253ea	Suppress llvm-cov.test on Win32, with REQUIRES: shell "cd" is unsupported in lit internal runner. llvm-svn: 194652	2013-11-14 02:05:41 +00:00
Jiangning Liu	bb60ccf355	Implement AArch64 NEON instruction set AdvSIMD (table). llvm-svn: 194648	2013-11-14 01:57:32 +00:00
Yunzhong Gao	5cbcf56a7e	Fixing a heisenbug where the memory dependence analysis behaves differently with and without -g. Adding a test case to make sure that the threshold used in the memory dependence analysis is respected. The test case also checks that debug intrinsics are not counted towards this threshold. Differential Revision: http://llvm-reviews.chandlerc.com/D2141 llvm-svn: 194646	2013-11-14 01:10:52 +00:00
Yuchen Wu	d738beec44	llvm-cov: Removed StringMap holding GCOVLines. According to the hazy gcov documentation, it appeared to be technically possible for lines within a block to belong to different source files. However, upon further investigation, gcov does not actually support multiple source files for a single block. This change removes a level of separation between blocks and lines by replacing the StringMap of GCOVLines with a SmallVector of ints representing line numbers. This also means that the GCOVLines class is no longer needed. This paves the way for supporting the "-a" option, which will output block information. llvm-svn: 194637	2013-11-14 00:32:00 +00:00
Yuchen Wu	e28da84c96	llvm-cov: Replaced asserts with proper error handling. Unified the interface for read functions. They all return a boolean indicating if the read from file succeeded. Functions that previously returned the read value now store it into a variable that is passed in by reference instead. Callers will need to check the return value to detect if an error occurred. Also added a new test which ensures that no assertions occur when file contains invalid data. llvm-cov should return with error code 1 upon failure. llvm-svn: 194635	2013-11-14 00:07:15 +00:00
Reed Kotler	4b7afe5523	Take care of long short branch immediate instructions for mips16 in constant islands. llvm-svn: 194630	2013-11-13 23:52:18 +00:00
Tom Stellard	81d871dee3	R600/SI: Add support for private address space load/store Private address space is emulated using the register file with MOVRELS and MOVRELD instructions. llvm-svn: 194626	2013-11-13 23:36:50 +00:00
Tom Stellard	8216602a0b	R600/SI: Prefer SALU instructions for bit shift operations All shift operations will be selected as SALU instructions and then if necessary lowered to VALU instructions in the SIFixSGPRCopies pass. This allows us to do more operations on the SALU which will improve performance and is also required for implementing private memory using indirect addressing, since the private memory pointers must stay in the scalar registers. This patch includes some fixes from Matt Arsenault. llvm-svn: 194625	2013-11-13 23:36:37 +00:00
Yuchen Wu	c60ae7e1fa	llvm-cov: Changed XFAIL targets to be more generic. llvm-svn: 194622	2013-11-13 23:33:17 +00:00
Yuchen Wu	aae88013c7	Added basic unit test for llvm-cov. This test compares the output of llvm-cov against a coverage file generated by gcov. Currently, llvm-cov does not work on certain platforms (namely big-endian architectures such as PowerPC, among others). These platforms are marked as XFAIL for now, but will be fixed later. llvm-svn: 194616	2013-11-13 22:50:15 +00:00
Chad Rosier	d3ae5f895e	[AArch64] Add support for legacy AArch32 NEON scalar shift by immediate instructions. This patch does not include the shift right and accumulate instructions. A number of non-overloaded intrinsics have been remove in favor of their overloaded counterparts. llvm-svn: 194598	2013-11-13 20:05:37 +00:00
Weiming Zhao	0da5cc0765	Enable generating legacy IT block for AArch32 By default, the behavior of IT block generation will be determinated dynamically base on the arch (armv8 vs armv7). This patch adds backend options: -arm-restrict-it and -arm-no-restrict-it. The former one restricts the generation of IT blocks (the same behavior as thumbv8) for both arches. The later one allows the generation of legacy IT block (the same behavior as ARMv7 Thumb2) for both arches. Clang will support -mrestrict-it and -mno-restrict-it, which is compatible with GCC. llvm-svn: 194592	2013-11-13 18:29:49 +00:00
Richard Sandiford	09de091cbe	[SystemZ] Add the general form of BCR At the moment this is just the MC support. llvm-svn: 194585	2013-11-13 16:57:53 +00:00
Alexey Samsonov	a7181a1b35	FileCheck: fix matching of one check-prefix is a prefix of another Summary: Fix a case when "FileCheck --check-prefix=CHECK --check-prefix=CHECKER" would silently ignore check-lines of the form: CHECKER: foo Reviewers: dsanders Reviewed By: dsanders CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2168 llvm-svn: 194577	2013-11-13 14:12:52 +00:00
Rafael Espindola	fdc88137f4	Remove AllowQuotesInName and friends from MCAsmInfo. Accepting quotes is a property of an assembler, not of an object file. For example, ELF can support any names for sections and symbols, but the gnu assembler only accepts quotes in some contexts and llvm-mc in a few more. LLVM should not produce different symbols based on a guess about which assembler will be reading the code it is printing. llvm-svn: 194575	2013-11-13 14:01:59 +00:00
Vladimir Medic	e10c1125df	Fix bug in .gpword directive parsing. llvm-svn: 194570	2013-11-13 13:18:04 +00:00
Zoran Jovanovic	ccb70caa13	Support for microMIPS trap instruction with immediate operands. llvm-svn: 194569	2013-11-13 13:15:03 +00:00
Diego Novillo	8d6568b56b	SampleProfileLoader pass. Initial setup. This adds a new scalar pass that reads a file with samples generated by 'perf' during runtime. The samples read from the profile are incorporated and emmited as IR metadata reflecting that profile. The profile file is assumed to have been generated by an external profile source. The profile information is converted into IR metadata, which is later used by the analysis routines to estimate block frequencies, edge weights and other related data. External profile information files have no fixed format, each profiler is free to define its own. This includes both the on-disk representation of the profile and the kind of profile information stored in the file. A common kind of profile is based on sampling (e.g., perf), which essentially counts how many times each line of the program has been executed during the run. The SampleProfileLoader pass is organized as a scalar transformation. On startup, it reads the file given in -sample-profile-file to determine what kind of profile it contains. This file is assumed to contain profile information for the whole application. The profile data in the file is read and incorporated into the internal state of the corresponding profiler. To facilitate testing, I've organized the profilers to support two file formats: text and native. The native format is whatever on-disk representation the profiler wants to support, I think this will mostly be bitcode files, but it could be anything the profiler wants to support. To do this, every profiler must implement the SampleProfile::loadNative() function. The text format is mostly meant for debugging. Records are separated by newlines, but each profiler is free to interpret records as it sees fit. Profilers must implement the SampleProfile::loadText() function. Finally, the pass will call SampleProfile::emitAnnotations() for each function in the current translation unit. This function needs to translate the loaded profile into IR metadata, which the analyzer will later be able to use. This patch implements the first steps towards the above design. I've implemented a sample-based flat profiler. The format of the profile is fairly simplistic. Each sampled function contains a list of relative line locations (from the start of the function) together with a count representing how many samples were collected at that line during execution. I generate this profile using perf and a separate converter tool. Currently, I have only implemented a text format for these profiles. I am interested in initial feedback to the whole approach before I send the other parts of the implementation for review. This patch implements: - The SampleProfileLoader pass. - The base ExternalProfile class with the core interface. - A SampleProfile sub-class using the above interface. The profiler generates branch weight metadata on every branch instructions that matches the profiles. - A text loader class to assist the implementation of SampleProfile::loadText(). - Basic unit tests for the pass. Additionally, the patch uses profile information to compute branch weights based on instruction samples. This patch converts instruction samples into branch weights. It does a fairly simplistic conversion: Given a multi-way branch instruction, it calculates the weight of each branch based on the maximum sample count gathered from each target basic block. Note that this assignment of branch weights is somewhat lossy and can be misleading. If a basic block has more than one incoming branch, all the incoming branches will get the same weight. In reality, it may be that only one of them is the most heavily taken branch. I will adjust this assignment in subsequent patches. llvm-svn: 194566	2013-11-13 12:22:21 +00:00
Alexey Samsonov	21a340fa99	FileCheck: fix a bug with multiple --check-prefix options. Summary: This fixes a subtle bug in new FileCheck feature added in r194343. When we search for the first satisfying check-prefix, we should actually return the first encounter of some check-prefix as a substring, even if it's not a part of valid check-line. Otherwise "FileCheck --check-prefix=FOO --check-prefix=BAR" with check file: FOO not a vaild check-line FOO: foo BAR: bar incorrectly accepted file: fog bar as it skipped the first two encounters of FOO, matching only BAR: line. Reviewers: arsenm, dsanders Reviewed By: dsanders CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D2166 llvm-svn: 194565	2013-11-13 11:56:22 +00:00
Robert Lytton	a83c0482dd	XCore target: implement exception handling llvm-svn: 194564	2013-11-13 10:19:31 +00:00
Vladimir Medic	77ffd7af4d	This patch fixes a bug in floating point operands parsing, when instruction alias uses default register operand. llvm-svn: 194562	2013-11-13 09:48:53 +00:00
NAKAMURA Takumi	db5d18d245	Add XFAIL:arm again on 4 MCJIT tests, since r194558. AArch64 has been left removed. They are failing on clang-native-arm-cortex-a9. Please tweak MCJIT/lit.local.cfg, if this didn't satisfy bots. llvm-svn: 194561	2013-11-13 07:43:10 +00:00
NAKAMURA Takumi	b71b7baa2f	Remove XFAIL:aarch64,arm from 4 tests in test/ExecutionEngine/MCJIT. They are reported as XPASSing. llvm-svn: 194558	2013-11-13 06:28:00 +00:00
Reed Kotler	5c8ae09537	Allow the code which returns the length for inline assembler to know specifically about the .space directive. This allows us to force large blocks of code to appear in test cases for things like constant islands without having to make giant test cases to force things like long branches to take effect. llvm-svn: 194555	2013-11-13 04:37:52 +00:00
Andrew Trick	5469ae8f21	Add a test case to verify that misusing anyregcc crashes as expected. llvm-svn: 194553	2013-11-13 03:46:19 +00:00
Matt Arsenault	00a0d6f672	R600: Fix selection failure on EXTLOAD llvm-svn: 194547	2013-11-13 02:39:07 +00:00
Juergen Ributzka	34c652d34d	SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too. This patch reapplies r193676 with an additional fix for the Hexagon backend. The SystemZ backend has already been fixed by r194148. The Type Legalizer recognizes that VSELECT needs to be split, because the type is to wide for the given target. The same does not always apply to SETCC, because less space is required to encode the result of a comparison. As a result VSELECT is split and SETCC is unrolled into scalar comparisons. This commit fixes the issue by checking for VSELECT-SETCC patterns in the DAG Combiner. If a matching pattern is found, then the result mask of SETCC is promoted to the expected vector mask type for the given target. Now the type legalizer will split both VSELECT and SETCC. This allows the following X86 DAG Combine code to sucessfully detect the MIN/MAX pattern. This fixes PR16695, PR17002, and <rdar://problem/14594431>. Reviewed by Nadav llvm-svn: 194542	2013-11-13 01:57:54 +00:00
Andrew Trick	0ef482ef02	Cleanup the stackmap operand folding code and fix a corner case. I still don't know how to refer to the fixed operands symbolically. I plan to look into it. llvm-svn: 194529	2013-11-12 22:58:39 +00:00
Sebastian Pop	a1cc34b981	improve dependence analysis testcases print the name of the function on which the dependence analysis is performed such that changes to the testcase are easier to review. llvm-svn: 194528	2013-11-12 22:47:30 +00:00
Sebastian Pop	c62c679c1b	delinearization of arrays llvm-svn: 194527	2013-11-12 22:47:20 +00:00
Nadav Rotem	0ed2fdb5af	Fold (iszero(A&K1) \| iszero(A&K2)) -> (A&(K1\|K2)) != (K1\|K2) if we know that K1 and K2 are 'one-hot' (only one bit is on). llvm-svn: 194525	2013-11-12 22:38:59 +00:00
Nadav Rotem	53d32211b7	FoldBranchToCommonDest merges branches into a single branch with or/and of the condition. It has a heuristics for estimating when some of the dependencies are processed by out-of-order processors. This patch adds another rule to the heuristics that says that if the "BonusInstruction" that we speculatively execute is used by the condition of the second branch then it is okay to hoist it. This change exposes more opportunities for other passes to transform the code. It does not matter that much that we if-convert the code because the selectiondag builder splits or/and branches into multiple branches when profitable. llvm-svn: 194524	2013-11-12 22:37:16 +00:00
Akira Hatanaka	d6c9f6ebbe	[mips] Fix a bug in function CC_MipsO32_FP64. The second double precision argument was not being passed in $f14. llvm-svn: 194522	2013-11-12 22:16:18 +00:00
Akira Hatanaka	c8e4bd156b	[mips] Run test case with command line option -mattr=+fp64. llvm-svn: 194519	2013-11-12 22:06:45 +00:00
Justin Bogner	b10a520c8f	Protect user-supplied runtime library functions in LTO Add user-supplied C runtime and compiler-rt library functions to llvm.compiler.used to protect them from premature optimization by passes like -globalopt and -ipsccp. Calls to (seemingly unused) runtime library functions can be added by -instcombine and instruction lowering. Patch by Duncan Exon Smith, thanks! Fixes <rdar://problem/14740087> llvm-svn: 194514	2013-11-12 21:44:01 +00:00
Tim Northover	8eaf1543e5	ARM: diagnose invalid system LDM/STM The system LDM and STM instructions can't usually writeback to the base register. The one exception is when an LDM is actually an exception-return (i.e. contains PC in the register list). (There's already a test that "ldm sp!, {r0-r3, pc}^" works, which is why there is no positive test). rdar://problem/15223374 llvm-svn: 194512	2013-11-12 21:32:41 +00:00
Akira Hatanaka	937ce7c143	[mips] Fix and re-enable a test case that has been disabled for a long time. llvm-svn: 194510	2013-11-12 21:03:57 +00:00
Peter Zotov	7b321f832f	[OCaml] Dynamically link LLVM on --enable-shared builds This commit significantly speeds up both bytecode and native builds of LLVM clients (from ~20 second to sub-second link time), and allows to invoke LLVM functions from OCaml toplevel. The behavior for --disable-shared builds is unchanged. llvm-svn: 194509	2013-11-12 20:55:49 +00:00
Rafael Espindola	dd8757abbc	Corruptly merge constants with explicit and implicit alignments. Constant merge can merge a constant with implicit alignment with one that has explicit alignment. Before this change it was assuming that the explicit alignment was higher than the implicit one, causing the result to be under aligned in some cases. Fixes pr17815. Patch by Chris Smowton! llvm-svn: 194506	2013-11-12 20:21:43 +00:00
Chad Rosier	1eb0ecf8ce	[AArch64] Implemented AdvSIMD scalar x indexed element format and AdvSIMD scalar copy in MC layer. Added the MC layer tests. Fixed triple setting in test cases. Patch by Ana Pazos <apazos@codeaurora.org>. llvm-svn: 194501	2013-11-12 19:13:08 +00:00
Andrew Trick	3112a5e4c0	Simplify operand folding when rematerializing a load. We already know how to fold a reload from a frameindex without analyzing the load instruction. Generalize this to handle any frameindex load. This streamlines the logic for rematerializing loads from stack arguments. As a side effect, it allows stackmaps to record a stack argument location without spilling it. Verified no effect on codegen for llvm test-suite. llvm-svn: 194497	2013-11-12 18:06:12 +00:00
Daniel Sanders	8b59af15ed	[mips][msa] Enable inlinse assembly for MSA. Like GCC, this re-uses the 'f' constraint and a new 'w' print-modifier: asm ("ldi.w %w0, 1", "=f"(result)); Unlike GCC, the 'w' print-modifer is not _required_ to produce the intended output. This is a consequence of differences in the internal handling of the registers in each compiler. To be source-compatible between the compilers, users must use the 'w' print-modifier. MSA registers (including control registers) are supported in clobber lists. llvm-svn: 194476	2013-11-12 12:56:01 +00:00
Benjamin Kramer	7c30260ab3	SimplifyCFG: Use existing constant folding logic when forming switch tables. Both simpler and more powerful than the hand-rolled folding logic. llvm-svn: 194475	2013-11-12 12:24:36 +00:00
Daniel Sanders	3f6eb546d3	[mips][msa] Added support for matching bclr, and bclri from normal IR (i.e. not intrinsics) llvm-svn: 194471	2013-11-12 10:45:18 +00:00
Bradley Smith	9aa8ac9f23	[ARM] Add support for FP_HP_extension build attribute llvm-svn: 194470	2013-11-12 10:38:05 +00:00
Daniel Sanders	a5bc99f164	[mips][msa] Added support for matching bset, bseti, bneg, and bnegi from normal IR (i.e. not intrinsics) llvm-svn: 194469	2013-11-12 10:31:49 +00:00
Daniel Sanders	44657ef6e5	[mips][msa] Change constant used in ori tests to avoid conflict with bseti (also xori to avoid bnegi) Upcoming commit(s) are going to add support for bseti and bnegi. This would cause some existing tests to (correctly) change behaviour and emit a different instruction. This patch prevents this by changing the constant used in ori and xori tests so that they will not be matchable by the bseti and bnegi patterns when these instructions are matchable from normal IR. llvm-svn: 194467	2013-11-12 10:14:18 +00:00
Robert Lytton	494591b87f	XCore target: fix bug in aligning 'byval i8*' on the stack llvm-svn: 194466	2013-11-12 10:11:35 +00:00
Robert Lytton	f7f0c5e326	XCore target test for hidden declaration llvm-svn: 194465	2013-11-12 10:11:30 +00:00
Robert Lytton	61d9149c73	Add XCore support for ATOMIC_FENCE. ATOMIC_FENCE is lowered to a compiler barrier which is codegen only. There is no need to emit an instructions since the XCore provides sequential consistency. Original patch by Richard Osborne llvm-svn: 194464	2013-11-12 10:11:26 +00:00
Robert Lytton	ed835b6fd4	XCore target: return error for unsupported alignment llvm-svn: 194463	2013-11-12 10:11:05 +00:00
Yuchen Wu	b9a29f2782	Revert "Added basic unit test for llvm-cov." This reverts commit r194451. Not sure why the tests are failing on the buildbot. They run fine on my local machine. Could it possibly be because of the endianness of the architectures? The GCNO and GCDA files are little-endian encoded, and llvm-cov expects it to remain that way. Is this a safe assumption? llvm-svn: 194454	2013-11-12 05:57:06 +00:00
Yuchen Wu	062f24c973	llvm-cov: Added call to update run/program counts. Also updated test files that were generated from this change. llvm-svn: 194453	2013-11-12 04:59:08 +00:00
Yuchen Wu	b470652431	Added basic unit test for llvm-cov. This test compares the output of llvm-cov against a coverage file generated by gcov. Since the source file must be in the current directory when reading GCNO files, the test will first cd into the Inputs directory. llvm-svn: 194451	2013-11-12 04:52:53 +00:00
Matt Arsenault	72b31eee0b	R600/SI: Change formatting of printed registers. Print the range of registers used with a single letter prefix. This better matches what the shader compiler produces and is overall less obnoxious than concatenating all of the subregister names together. Instead of SGPR0, it will print s0. Instead of SGPR0_SGPR1, it will print s[0:1] and so on. There doesn't appear to be a straightforward way to get the actual register info in the InstPrinter, so this parses the generated name to print with the new syntax. The required test changes are pretty nasty, and register matching regexes are now worse. Since there isn't a way to add to a variable in FileCheck, some of the tests now don't check the exact number of registers used, but I don't think that will be a real problem. llvm-svn: 194443	2013-11-12 02:35:51 +00:00
Reed Kotler	f0e6968e2f	Change the default branch instruction to be the 16 bit variety for mips16. This has no material effect at this time since we don't have a direct object emitter for mips16 and the assembler can't tell them apart. I place a comment "16 bit inst" for those so that I can tell them apart in the output. The constant island pass has only been minimally changed to allow this. More complete branch work is forthcoming but this is the first step. llvm-svn: 194442	2013-11-12 02:27:12 +00:00
Matt Arsenault	dbf9f311b0	R600/SI: Add test that fails due to requiring i64 mul for pointers llvm-svn: 194433	2013-11-11 23:31:02 +00:00
Andrew Trick	a28099fdd4	Fix the recently added anyregcc convention to handle spilled operands. Fixes <rdar://15432754> [JS] Assertion: "Folded a def to a non-store!" The primary purpose of anyregcc is to prevent a patchpoint's call arguments and return value from being spilled. They must be available in a register, although the calling convention does not pin the register. It's up to the front end to avoid using this convention for calls with more arguments than allocatable registers. llvm-svn: 194428	2013-11-11 22:40:25 +00:00
Vincent Lejeune	f143af3fe9	R600: Use function inputs to represent data stored in gpr llvm-svn: 194425	2013-11-11 22:10:24 +00:00
Shuxin Yang	3168ab3376	Fix PR17952. The symptom is that an assertion is triggered. The assertion was added by me to detect the situation when value is propagated from dead blocks. (We can certainly get rid of assertion; it is safe to do so, because propagating value from dead block to alive join node is certainly ok.) The root cause of this bug is : edge-splitting is conducted on the fly, the edge being split could be a dead edge, therefore the block that split the critial edge needs to be flagged "dead" as well. There are 3 ways to fix this bug: 1) Get rid of the assertion as I mentioned eariler 2) When an dead edge is split, flag the inserted block "dead". 3) proactively split the critical edges connecting dead and live blocks when new dead blocks are revealed. This fix go for 3) with additional 2 LOC. Testing case was added by Rafael the other day. llvm-svn: 194424	2013-11-11 22:00:23 +00:00
Akira Hatanaka	8f1caeb0e1	[mips] Partially revert r193641. Stack alignment should not be determined by the floating point register mode. llvm-svn: 194423	2013-11-11 21:49:03 +00:00
Simon Atanasyan	5c8377f32c	Add support for DT_VERxxx and DT_MIPS_xxx .dynamic section entries to the llvm-readobj. The patch reviewed by Michael Spencer. http://llvm-reviews.chandlerc.com/D2113 llvm-svn: 194421	2013-11-11 20:51:48 +00:00
Artyom Skrobov	eff45103b3	[ARM] Add support for MVFR2 which is new in ARMv8 llvm-svn: 194416	2013-11-11 19:56:13 +00:00
Justin Holewinski	124e93de93	[NVPTX] Properly handle bitcast ConstantExpr when checking for the alignment of function parameters llvm-svn: 194410	2013-11-11 19:28:19 +00:00
Justin Holewinski	4f5bc9b33a	[NVPTX] Fix logic error in loading vector parameters of more than 4 components llvm-svn: 194409	2013-11-11 19:28:16 +00:00
Chad Rosier	d3684a0566	[AArch64] The shift right/left and insert immediate builtins expect 3 source operands, a vector, an element to insert, and a shift amount. llvm-svn: 194406	2013-11-11 19:11:11 +00:00
Chad Rosier	35575e737c	[AArch64] Add support for NEON scalar floating-point convert to fixed-point instructions. llvm-svn: 194394	2013-11-11 18:04:07 +00:00
Daniel Sanders	a1840d2f88	Vector forms of SHL, SRA, and SRL can be constant folded using SimplifyVBinOp too Reviewers: dsanders Reviewed By: dsanders CC: llvm-commits, nadav Differential Revision: http://llvm-reviews.chandlerc.com/D1958 llvm-svn: 194393	2013-11-11 17:23:41 +00:00
Matheus Almeida	c051a40506	[mips][msa] CHECK-DAG-ize MSA 3r-a.ll test. No functional changes. llvm-svn: 194391	2013-11-11 16:46:20 +00:00
Matheus Almeida	ce207fa078	[mips][msa] CHECK-DAG-ize MSA 2rf_int_float.ll test. No functional changes. llvm-svn: 194390	2013-11-11 16:38:55 +00:00
Matheus Almeida	fed22ad33b	[mips][msa] CHECK-DAG-ize MSA 2rf_float_int.ll test. No functional changes. llvm-svn: 194389	2013-11-11 16:31:46 +00:00
Matheus Almeida	c596839e67	[mips][msa] CHECK-DAG-ize MSA 2rf.ll test. No functional changes. llvm-svn: 194387	2013-11-11 16:24:53 +00:00
Matheus Almeida	9826d07a2f	[mips][msa] CHECK-DAG-ize MSA 2r.ll test. No functional changes. llvm-svn: 194386	2013-11-11 16:16:53 +00:00
Rafael Espindola	9d34018954	Add a testcase for pr17852. llvm-svn: 194385	2013-11-11 15:37:52 +00:00
Hal Finkel	c6a243987d	Add PPC option for full register names in asm On non-Darwin PPC systems, we currently strip off the register name prefix prior to instruction printing. So instead of something like this: mr r3, r4 we print this: mr 3, 4 The first form is the default on Darwin, and is understood by binutils, but not yet understood by our integrated assembler. Once our integrated-as understands full register names as well, this temporary option will be replaced by tying this functionality to the verbose-asm option. The numeric-only form is compatible with legacy assemblers and tools, and is also gcc's default on most PPC systems. On the other hand, it is harder to read, and there are some analysis tools that expect full register names. llvm-svn: 194384	2013-11-11 14:58:40 +00:00
Peter Zotov	18636a8777	[OCaml] Add missing Llvm_target functions llvm-svn: 194382	2013-11-11 14:47:28 +00:00
Peter Zotov	dfa957746c	[OCaml] Accept context explicitly in Llvm_target functions Llvm_target.intptr_type used to implicitly use global context. As none of other functions in OCaml bindings do, it is changed to accept context explicitly. llvm-svn: 194381	2013-11-11 14:47:20 +00:00
Peter Zotov	d52cf17584	[OCaml] Make Llvm_target.DataLayout.t automatically managed This breaks the API by removing Llvm_target.DataLayout.dispose. llvm-svn: 194380	2013-11-11 14:47:11 +00:00
Evgeniy Stepanov	560e089355	[msan] Propagate origin for insertvalue, extractvalue. llvm-svn: 194374	2013-11-11 13:37:10 +00:00
NAKAMURA Takumi	5c0be2f67a	Mark 36 tests as XFAIL:vg_leak in llvm/test/TableGen. In historical reason, tblgen is not strictly required to be free from memory leaks. For now, I mark them as XFAIL, they could be fixed, though. llvm-svn: 194353	2013-11-10 14:26:08 +00:00
NAKAMURA Takumi	cae86ce38b	Remove 6 of XFAIL(s) in llvm/test/TableGen, since r193736. They have been XPASSing. llvm-svn: 194352	2013-11-10 14:25:44 +00:00
Bill Wendling	fed6c220ec	Revert "Resurrect r191017 " GVN proceeds in the presence of dead code" plus a fix to PR17307 & 17308." This causes PR17852. This reverts commit d93e8a06b2ca09ab18f390cd514b7443e2e571f7. Conflicts: test/Transforms/GVN/cond_br2.ll llvm-svn: 194348	2013-11-10 07:34:34 +00:00
Nadav Rotem	5ba1c6ced8	SimplifyCFG has a heuristics for out-of-order processors that decides when it is worthwhile to merge branches. It tries to estimate if the operands of the instruction that we want to hoist are ready. This commit marks function arguments as 'ready' because they require no calculation. This boosts libquantum and a few other workloads from the testsuite. llvm-svn: 194346	2013-11-10 04:13:31 +00:00
Matt Arsenault	ba035bce21	Resolve TODO in test now that filecheck has multiple check prefixes. llvm-svn: 194344	2013-11-10 02:16:47 +00:00
Matt Arsenault	13df462691	Allow multiple check prefixes in FileCheck. This is useful if you want to run multiple variations of a single test, and the majority of check lines should be the same. llvm-svn: 194343	2013-11-10 02:04:09 +00:00
Matt Arsenault	5bcefabcda	Teach MergeFunctions about address spaces llvm-svn: 194342	2013-11-10 01:44:37 +00:00
Matt Arsenault	0fb71e545c	Use variable for register name in test llvm-svn: 194338	2013-11-10 00:57:17 +00:00
Reed Kotler	45c5927c5c	Mostly finish up constant islands port for Mips for load constants. Still need to finish the branch part. Still lots more review of the code, clean up and testing. llvm-svn: 194337	2013-11-10 00:09:26 +00:00
Akira Hatanaka	d1c58ed8a7	[mips] Make sure there is a chain edge dependency between loads that read formal arguments on the stack and stores created afterwards. We need this to ensure tail call optimized function calls do not write over the argument area of the stack before it is read out. llvm-svn: 194309	2013-11-09 02:38:51 +00:00
Juergen Ributzka	87ed906b2e	[Stackmap] Materialize the jump address within the patchpoint noop slide. This patch moves the jump address materialization inside the noop slide. This enables patching of the materialization itself or its complete removal. This patch also adds the ability to define scratch registers that can be used safely by the code called from the patchpoint intrinsic. At least one scratch register is required, because that one is used for the materialization of the jump address. This patch depends on D2009. Differential Revision: http://llvm-reviews.chandlerc.com/D2074 Reviewed by Andy llvm-svn: 194306	2013-11-09 01:51:33 +00:00
Juergen Ributzka	9969d3e6e8	[Stackmap] Add AnyReg calling convention support for patchpoint intrinsic. The idea of the AnyReg Calling Convention is to provide the call arguments in registers, but not to force them to be placed in a paticular order into a specified set of registers. Instead it is up tp the register allocator to assign any register as it sees fit. The same applies to the return value (if applicable). Differential Revision: http://llvm-reviews.chandlerc.com/D2009 Reviewed by Andy llvm-svn: 194293	2013-11-08 23:28:16 +00:00
Jim Grosbach	2fca51d3b4	X86: Assembly files with .cfi_cfa_def shouldn't hit llvm_unreachable() On darwin, when trying to create compact unwind info, a .cfi_cfa_def directive would case an llvm_unreachable() to be hit. Back off when we see this directive and generate the regular DWARF style eh_frame. rdar://15406518 llvm-svn: 194285	2013-11-08 22:33:06 +00:00
Quentin Colombet	b06a0ed4b0	[VirtRegMap] Fix for PR17825. Do not ignore noreturn definitions when setting isPhysRegUsed if the unwind information is required. Indeed, the runtime may need a correct stack to be able to unwind the call. llvm-svn: 194271	2013-11-08 18:14:17 +00:00
Tim Northover	93bcc66e73	ARM: fold prologue/epilogue sp updates into push/pop for code size ARM prologues usually look like: push {r7, lr} sub sp, sp, #4 If code size is extremely important, this can be optimised to the single instruction: push {r6, r7, lr} where we don't actually care about the contents of r6, but pushing it subtracts 4 from sp as a side effect. This should implement such a conversion, predicated on the "minsize" function attribute (-Oz) since I've yet to find any code it actually makes faster. llvm-svn: 194264	2013-11-08 17:18:07 +00:00
Artyom Skrobov	202ff08f97	[ARM] Handling for coprocessor instructions that are undefined starting from ARMv8 (Thumb encodings) llvm-svn: 194263	2013-11-08 16:25:50 +00:00
Artyom Skrobov	d2116a4ef7	[ARM] Handling for coprocessor instructions that are undefined starting from ARMv8 (ARM encodings) llvm-svn: 194262	2013-11-08 16:17:14 +00:00
Artyom Skrobov	e686cec7d4	[ARM] Handling for coprocessor instructions that are undefined starting from ARMv8 (ARM encodings) llvm-svn: 194261	2013-11-08 16:16:30 +00:00
Zoran Jovanovic	2914d2d980	Test for microMIPS trap instructions. llvm-svn: 194258	2013-11-08 14:55:31 +00:00
NAKAMURA Takumi	0d82bac470	llvm-ar: Let opening a directory failed in llvm-ar. Linux cannot open directories with open(2), although cygwin and *bsd can. Motivation: The test, Object/directory.ll, had been failing with --target=cygwin on Linux. XFAIL was improper for host issues. llvm-svn: 194257	2013-11-08 12:35:56 +00:00
Matheus Almeida	a3bac16950	[mips][msa] Update encoding of LDI instruction. The encoding was updated in MSA r1.07. llvm-svn: 194255	2013-11-08 10:43:11 +00:00
Artyom Skrobov	8653443902	[ARM] In ARMAsmParser, MatchCoprocessorOperandName() permitted p10 and p11 as operands for coprocessor instructions, resulting in encodings that clash with FP/NEON instruction encodings llvm-svn: 194253	2013-11-08 09:16:31 +00:00
David Majnemer	bd4fef4a89	IR: Do not canonicalize constant GEPs into an out-of-bounds array access Summary: Consider a GEP of: i8* getelementptr ({ [2 x i8], i32, i8, [3 x i8] }* @main.c, i32 0, i32 0, i64 0) If we proceeded to GEP the aforementioned object by 8, would form a GEP of: i8* getelementptr ({ [2 x i8], i32, i8, [3 x i8] }* @main.c, i32 0, i32 0, i64 8) Note that we would go through the first array member, causing an out-of-bounds accesses. This is problematic because we might get fooled if we are trying to evaluate loads using this GEP, for example, based off of an object with a constant initializer where the array is zero. This fixes PR17732. Reviewers: nicholas, chandlerc, void Reviewed By: void CC: llvm-commits, echristo, void, aemerson Differential Revision: http://llvm-reviews.chandlerc.com/D2093 llvm-svn: 194220	2013-11-07 22:15:53 +00:00
Zoran Jovanovic	c18b6d1083	Support for microMIPS trap instructions 1. llvm-svn: 194205	2013-11-07 14:35:24 +00:00
Vincent Lejeune	4f3751f2af	R600: Fix LowerUDIVREM llvm-svn: 194153	2013-11-06 17:36:04 +00:00
Benjamin Kramer	9e9773d46d	Add test case for PR12377, it was fixed by r194116. llvm-svn: 194147	2013-11-06 11:55:41 +00:00
Vladimir Medic	4c29985cd0	Implement gpword directive for mips, test case added. Stype changes using clang-format are also included. llvm-svn: 194145	2013-11-06 11:27:05 +00:00
Peter Zotov	578267fb73	[OCaml] Impement Llvm_irreader, bindings to LLVM assembly parser llvm-svn: 194138	2013-11-06 09:21:25 +00:00
Peter Zotov	d10ae6c527	[OCaml] Implement Llvm.string_of_llvalue llvm-svn: 194136	2013-11-06 09:21:08 +00:00
Jiangning Liu	f4226f1d7b	Implement AArch64 Neon instruction set Perm. llvm-svn: 194123	2013-11-06 03:35:27 +00:00
Jiangning Liu	a50e22ca4f	Implement AArch64 Neon instruction set Bitwise Extract. llvm-svn: 194118	2013-11-06 02:25:49 +00:00
Andrew Trick	34e2f0c4ea	Rewrite SCEV's backedge taken count computation. Patch by Michele Scandale! Rewrite of the functions used to compute the backedge taken count of a loop on LT and GT comparisons. I decided to split the handling of LT and GT cases becasue the trick "a > b == -a < -b" in some cases prevents the trip count computation due to the multiplication by -1 on the two operands of the comparison. This issue comes from the conservative computation of value range of SCEVs: taking the negative SCEV of an expression that have a small positive range (e.g. [0,31]), we would have a SCEV with a fullset as value range. Indeed, in the new rewritten function I tried to better handle the maximum backedge taken count computation when MAX/MIN expression are used to handle the cases where no entry guard is found. Some test have been modified in order to check the new value correctly (I manually check them and reasoning on possible overflow the new values seem correct). I finally added a new test case related to the multiplication by -1 issue on GT comparisons. llvm-svn: 194116	2013-11-06 02:08:26 +00:00
Andrew Trick	6664df12fb	Slightly change the way stackmap and patchpoint intrinsics are lowered. MorphNodeTo is not safe to call during DAG building. It eagerly deletes dependent DAG nodes which invalidates the NodeMap. We could expose a safe interface for morphing nodes, but I don't think it's worth it. Just create a new MachineNode and replaceAllUsesWith. My understaning of the SD design has been that we want to support early target opcode selection. That isn't very well supported, but generally works. It seems reasonable to rely on this feature even if it isn't widely used. llvm-svn: 194102	2013-11-05 22:44:04 +00:00
Tim Northover	f02287db27	ARM: permit bare dmb/dsb/isb aliases on Cortex-M0 Cortex-M0 supports these 32-bit instructions despite being Thumb1 only (mostly). We knew about that but not that the aliases without the default "sy" operand were also permitted. llvm-svn: 194094	2013-11-05 21:36:02 +00:00
Jiangning Liu	d7c52676f6	Implement AArch64 Neon Crypto instruction classes AES, SHA, and 3 SHA. llvm-svn: 194085	2013-11-05 17:42:05 +00:00
Michael Gottesman	24b2f6fdda	[objc-arc] Convert the one directional retain/release relation assert to a conditional check + fail. Due to the previously added overflow checks, we can have a retain/release relation that is one directional. This occurs specifically when we run into an additive overflow causing us to drop state in only one direction. If that occurs, we should bail and not optimize that retain/release instead of asserting. Apologies for the size of the testcase. It is necessary to cause the additive cfg overflow to trigger. rdar://15377890 llvm-svn: 194083	2013-11-05 16:02:40 +00:00
Alp Toker	a2f1b8d238	Provide a test input for opt This was only working previously due to a quirk in the way lit concatenates script commands. llvm-svn: 194078	2013-11-05 13:57:34 +00:00
Peter Zotov	28f6876ecc	[OCaml] (PR16318) Add missing argument to Llvm.const_intcast llvm-svn: 194065	2013-11-05 11:56:20 +00:00
Peter Zotov	ce7a91b277	[OCaml] (PR11717) Make declare_qualified_global respect address argument Original patch by Jonathan Ragan-Kelley llvm-svn: 194064	2013-11-05 11:56:13 +00:00
Reed Kotler	0f007fc4ce	Fix r194019 as requested by Eric Christopher. Submit the basic port of the rest of ARM constant islands code to Mips. Two test cases are added which reflect the next level of functionality: constants getting moved to water areas that are out of range from the initial placement at the end of the function and basic blocks being split to create water when none exists that can be used. There is a bunch of this code that is not complete and has been marked with IN_PROGRESS. I will finish cleaning this all up during the next week or two and submit the rest of the test cases. I have elminated some code for dealing with inline assembly because to me it unecessarily complicates things and some of the newer features of llvm like function attributies and builtin assembler give me better tools to solve the alignment issues created there. Also, for Mips16 I even have the option of not doing constant islands in the present of inline assembler if I chose. When everything has been completed I will summarize the port and notify people that are knowledgable regarding the ARM Constant Islands code so they can review it in it's entirety if they wish. llvm-svn: 194053	2013-11-05 08:14:14 +00:00
Hao Liu	d6b40b51c7	Implement AArch64 post-index vector load/store multiple N-element structure class SIMD(lselem-post). Including following 14 instructions: 4 ld1 insts: post-index load multiple 1-element structure to sequential 1/2/3/4 registers. ld2/ld3/ld4: post-index load multiple N-element structure to sequential N registers (N=2,3,4). 4 st1 insts: post-index store multiple 1-element structure from sequential 1/2/3/4 registers. st2/st3/st4: post-index store multiple N-element structure from sequential N registers (N = 2,3,4). llvm-svn: 194043	2013-11-05 03:39:32 +00:00
Kevin Qin	97f6aaa8ad	Implemented aarch64 neon intrinsic vcopy_lane with float type. llvm-svn: 194041	2013-11-05 02:03:59 +00:00
Yuchen Wu	f3e653e9a6	Revert "Added basic unit test for llvm-cov." This reverts commit 9cacd131c22b888303cb88e9a3235b2d7b2f19a1. llvm-svn: 194039	2013-11-05 01:56:26 +00:00
Yuchen Wu	0b8e9a1480	Added basic unit test for llvm-cov. This test compares the output of llvm-cov against a coverage file generated by gcov. llvm-svn: 194038	2013-11-05 01:56:23 +00:00
NAKAMURA Takumi	5267613e3a	Revert r194019 to r194021, "Submit the basic port of the rest of ARM constant islands code to Mips." It broke -Asserts build. llvm-svn: 194026	2013-11-04 23:14:36 +00:00
Tim Northover	ace0bd4d33	AArch64: use default asm operand printing when modifier inapplicable If an inline assembly operand has multiple constraints (e.g. "Ir" for immediate or register) and an operand modifier (E.g. "w" for "print register as wN") then we need to decide behaviour when the modifier doesn't apply to the constraint. Previousely produced some combination of an assertion failure and a fatal error. GCC's behaviour appears to be to ignore the modifier and print the operand in the default way. This patch should implement that. llvm-svn: 194024	2013-11-04 23:04:07 +00:00
Reed Kotler	3fe68871da	Add the test case that goes with the previous submission for constant islands. I forgot to add it to svn on that patch. Ooops. llvm-svn: 194020	2013-11-04 22:13:41 +00:00
Eric Christopher	542c8d934d	Check for both styles of clobbers, those produced by dragonegg and those produced by clang for the inline asm bswap conversion. Modified from a patch by Chris Smowton. llvm-svn: 194016	2013-11-04 21:41:21 +00:00
Matt Arsenault	a8e894405c	Fix another constant folding address space place I missed. This fixes an assertion failure with a different sized address space. llvm-svn: 194014	2013-11-04 20:46:52 +00:00
Matt Arsenault	243140f2fd	Scalarize select vector arguments when extracted. When the elements are extracted from a select on vectors or a vector select, do the select on the extracted scalars from the input if there is only one use. llvm-svn: 194013	2013-11-04 20:36:06 +00:00
Cameron McInally	d80f7d34de	Add support for AVX512 masked vector blend intrinsics. llvm-svn: 194006	2013-11-04 19:14:56 +00:00
Manman Ren	289ef7d992	Rename testing case to use - instead of _. llvm-svn: 194001	2013-11-04 18:52:06 +00:00
Rafael Espindola	48da4f4691	Change BitcodeReader to use error_code instead of bool + string. In order to create an ObjectFile implementation that uses bitcode files, we need to propagate the bitcode errors to the ObjectFile interface, so we need to convert it to use the same error handling as ObjectFile: error_code. llvm-svn: 193996	2013-11-04 16:16:24 +00:00
Zoran Jovanovic	8a80aa76c8	Support for microMIPS branch instructions. llvm-svn: 193992	2013-11-04 14:53:22 +00:00
Peter Zotov	7fc270a171	[OCaml] implement Llvm_passmgr_builder, bindings for PassManagerBuilder llvm-svn: 193968	2013-11-04 01:39:42 +00:00
Peter Zotov	0f22bab63f	[OCaml] Implement missing LLVMCore APIs llvm-svn: 193966	2013-11-04 01:39:26 +00:00
Elena Demikhovsky	dacddb0bab	AVX-512: added VPCONFLICT instruction and intrinsics, added EVEX_KZ to tablegen llvm-svn: 193959	2013-11-03 13:46:31 +00:00
Venkatraman Govindaraju	5ae77f7564	[SparcV9] Handle i64 <-> float conversions in sparcv9 mode. llvm-svn: 193957	2013-11-03 12:28:40 +00:00
David Majnemer	120f4a06fd	Revert "Inliner: Handle readonly attribute per argument when adding memcpy" This reverts commit r193356, it caused PR17781. A reduced test case covering this regression has been added to the test suite. llvm-svn: 193955	2013-11-03 12:22:13 +00:00
Peter Zotov	311548cdad	[OCaml] Implement Llvm.MemoryBuffer.{of_string,as_string} llvm-svn: 193953	2013-11-03 08:27:45 +00:00
Peter Zotov	45451cf62a	[OCaml] Implement Llvm_linker, bindings for the IR linker llvm-svn: 193951	2013-11-03 08:27:32 +00:00
Peter Zotov	cbae39416f	[OCaml] Implement Llvm_vectorize bindings llvm-svn: 193950	2013-11-03 08:27:22 +00:00
Peter Zotov	5186033b0b	[OCaml] Refactor Llvm_target tests Llvm_target tests did not check for return values. This actually caused them to miss a bug. llvm-svn: 193949	2013-11-03 08:27:13 +00:00
Venkatraman Govindaraju	f1d807ee13	[Sparc] Expand FP_TO_UINT, UINT_TO_FP for fp128. llvm-svn: 193947	2013-11-03 08:00:19 +00:00
Peter Zotov	3e0c21ed53	[OCaml] Llvm_scalar_opts: add missing transforms llvm-svn: 193946	2013-11-03 07:54:17 +00:00
Peter Zotov	e4deac7b4a	[OCaml] Llvm_ipo: add missing transforms llvm-svn: 193945	2013-11-03 07:54:08 +00:00
Bob Wilson	d8d92d90fa	Convert calls to __sinpi and __cospi into __sincospi_stret This adds an SimplifyLibCalls case which converts the special __sinpi and __cospi (float & double variants) into a __sincospi_stret where appropriate to remove duplicated work. Patch by Tim Northover llvm-svn: 193943	2013-11-03 06:48:38 +00:00
Bob Wilson	e7dde0c061	Enable optimization of sin / cos pair into call to __sincos_stret for iOS7+. rdar://12856873 Patch by Evan Cheng, with a fix for rdar://13209539 by Tilmann Scheller llvm-svn: 193942	2013-11-03 06:14:38 +00:00
Venkatraman Govindaraju	5615aca219	[SparcV9] Add ctpop instruction for i64. Also, expand ctlz, cttz and bswap. llvm-svn: 193941	2013-11-03 05:59:07 +00:00
Rafael Espindola	99a3ba7674	A better fix that also works on ppc: add a target tripple. llvm-svn: 193915	2013-11-02 06:00:09 +00:00
Rafael Espindola	3cd286643d	Fix this test to pass on darwin now that llvm-nm is working. llvm-svn: 193914	2013-11-02 05:29:22 +00:00
Rafael Espindola	a135632af0	Fix llvm-nm to mach OS X's nm on some tests. There is still a long way to go for llvm-nm, but at least we now match nm's letter output in the cases we test for. llvm-svn: 193912	2013-11-02 05:03:24 +00:00
Michael Liao	b638d05ecb	Fix PR17764 - When selecting BLEND from vselect, the operands need swapping as due to the difference between vselect and SSE/AVX's BLEND insn llvm-svn: 193900	2013-11-02 00:10:02 +00:00
David Blaikie	ba8125dfd0	DebugInfo: regenerate test case from Clang to adjust for fixes/improvements I hit some problems with future work due to the member subprogram of 'a_b's type having a subprogram (an implicit default ctor, !52 in the pre-commit source) with no name. Clang now generates a name for such a function but in this case doesn't even emit debug info for it as it is unused (Clang never emits the body of the ctor, instead just emitting memset if needed). llvm-svn: 193892	2013-11-01 22:29:28 +00:00
Arnold Schwaighofer	a846a7f8f0	LoopVectorizer: Perform redundancy elimination on induction variables When the loop vectorizer was part of the SCC inliner pass manager gvn would run after the loop vectorizer followed by instcombine. This way redundancy (multiple uses) were removed and instcombine could perform scalarization on the induction variables. Having moved the loop vectorizer to later we no longer run any form of redundancy elimination before we perform instcombine. This caused vectorized induction variables to survive that did not before. On a recent iMac this helps linpack back from 6000Mflops to 7000Mflops. This should also help lpbench and paq8p. I ran a Release (without Asserts) build over the test-suite and did not see any negative impact on compile time. radar://15339680 llvm-svn: 193891	2013-11-01 22:18:19 +00:00
David Blaikie	d0d458665a	DebugInfo: Improve readability of test case added in r193878 The point is to ensure that the attribute in question (DW_AT_data_member_location) is associated with the prior tag, so ensure that we don't see another tag starting between the intended tag and the desired attribute. llvm-svn: 193884	2013-11-01 20:59:53 +00:00
David Blaikie	f0bc1ec767	DebugInfo: add a test case for data member locations (coverage for r193835) llvm-svn: 193878	2013-11-01 18:25:55 +00:00
David Blaikie	c5f888c909	Fix a test case broken by r193872 llvm-svn: 193876	2013-11-01 18:18:16 +00:00
Manman Ren	1d0b6bb2ef	Add comments. llvm-svn: 193874	2013-11-01 18:06:25 +00:00
David Blaikie	2ede02f6d0	DebugInfo: Make pubnames header printing similar to unit header printing In a failed attempt to allow the gnu-public-names.ll test case to not hardcode the size of the unit that the pubnames section referred to I've at least managed to have unit headers and pubnames headers print out in a similar style. This failed to achieve the desired goal because the header in a unit specifies the length of the unit without the length element of the header whereas the length in the pubnames includes this element, so the numbers are off by 4 bytes. I don't know of any arithmetic powers in FileCheck so the test case can't simply say "CU_LENGTH + 4". llvm-svn: 193872	2013-11-01 17:53:30 +00:00
Benjamin Kramer	1fbcdca9e3	LoopVectorize: Look for consecutive acces in GEPs with trailing zero indices If we have a pointer to a single-element struct we can still build wide loads and stores to it (if there is no padding). llvm-svn: 193860	2013-11-01 14:09:50 +00:00
Bradley Smith	2521975a42	[ARM] Add Virtualization subtarget feature and more build attributes in this area Add a Virtualization ARM subtarget feature along with adding proper build attribute emission for Tag_Virtualization_use (encodes Virtualization and TrustZone) and Tag_MPextension_use. Also rework test/CodeGen/ARM/2010-10-19-mc-elf-objheader.ll testcase to something that is more maintainable. This changes the focus of this testcase away from testing CPU defaults (which is tested elsewhere), onto specifically testing that attributes are encoded correctly. llvm-svn: 193859	2013-11-01 13:27:35 +00:00
Bradley Smith	c848beba5e	[ARM] Fix Tag_ABI_HardFP_use build attribute Fix Tag_ABI_HardFP_use build attribute to handle single precision FP, replace deprecated Tag_ABI_HardFP_use value of 3 with 0 and also add some tests for Tag_ABI_VFP_args. llvm-svn: 193856	2013-11-01 11:21:16 +00:00
Hal Finkel	4d94930bcb	Consider (x == -1) unlikely in BranchProbabilityInfo This adds another heuristic to BPI, similar to the existing heuristic that considers (x == 0) unlikely to be true. As suggested in the PACT'98 paper by Deitrich, Cheng, and Hwu, -1 is often used to indicate an invalid index, and equality comparisons with -1 are also unlikely to succeed. Local experimentation supports this hypothesis: This yields a 1-2% speedup in the test-suite sqlite benchmark on the PPC A2 core, with no significant regressions. llvm-svn: 193855	2013-11-01 10:58:22 +00:00
Arnold Schwaighofer	70a4665f55	LoopVectorizer: If dependency checks fail try runtime checks When a dependence check fails we can still try to vectorize loops with runtime array bounds checks. This helps linpack to vectorize a loop in dgefa. And we are back to 2x of the scalar performance on a corei7-avx. radar://15339680 llvm-svn: 193853	2013-11-01 03:05:07 +00:00
Rafael Espindola	d7a0e60e8f	Use \01 to disable the mangler. Should fix the 32 bit windows bots. llvm-svn: 193846	2013-11-01 01:14:20 +00:00
David Blaikie	71d34a2eef	DebugInfo: Emit member variable locations as data instead of expressions in blocks Drive by space optimization. Also makes the DIEs more regular which might speed up DWARF parsing. llvm-svn: 193835	2013-11-01 00:25:45 +00:00
Andrew Trick	f990411256	These test cases for experimental features are a bit too darwin-specific still. Use a triple. llvm-svn: 193820	2013-10-31 22:46:51 +00:00
Chad Rosier	74b65cd811	[AArch64] Add support for NEON scalar fixed-point convert to floating-point instructions. llvm-svn: 193816	2013-10-31 22:36:59 +00:00
Andrew Trick	a3a11dedca	Add new calling convention for WebKit Java Script. llvm-svn: 193812	2013-10-31 22:12:01 +00:00
Andrew Trick	153ebe6d2a	Add support for stack map generation in the X86 backend. Originally implemented by Lang Hames. llvm-svn: 193811	2013-10-31 22:11:56 +00:00
Rafael Espindola	57afdc7f09	Relax check line to match what llvm-nm prints for COFF. llvm-svn: 193810	2013-10-31 22:07:46 +00:00
Manman Ren	87a2adc7fe	Do not convert "call asm" to "invoke asm" in Inliner. Given that backend does not handle "invoke asm" correctly ("invoke asm" will be handled by SelectionDAGBuilder::visitInlineAsm, which does not have the right setup for LPadToCallSiteMap) and we already made the assumption that inline asm does not throw in InstCombiner::visitCallSite, we are going to make the same assumption in Inliner to make sure we don't convert "call asm" to "invoke asm". If it becomes necessary to add support for "invoke asm" later on, we will need to modify the backend as well as remove the assumptions that inline asm does not throw. Fix rdar://15317907 llvm-svn: 193808	2013-10-31 21:56:03 +00:00
Rafael Espindola	775ef460c9	XFAIL on ppc64 too. llvm-svn: 193804	2013-10-31 21:27:02 +00:00
Rafael Espindola	cb5bd5e508	XFAIL this for now. llvm-svn: 193802	2013-10-31 21:22:43 +00:00
Rafael Espindola	282a47037b	Use LTO_SYMBOL_SCOPE_DEFAULT_CAN_BE_HIDDEN instead of the "dso list". There are two ways one could implement hiding of linkonce_odr symbols in LTO: * LLVM tells the linker which symbols can be hidden if not used from native files. * The linker tells LLVM which symbols are not used from other object files, but will be put in the dso symbol table if present. GOLD's API is the second option. It was implemented almost 1:1 in llvm by passing the list down to internalize. LLVM already had partial support for the first option. It is also very similar to how ld64 handles hiding these symbols when not doing LTO. This patch then * removes the APIs for the DSO list. * marks LTO_SYMBOL_SCOPE_DEFAULT_CAN_BE_HIDDEN all linkonce_odr unnamed_addr global values and other linkonce_odr whose address is not used. * makes the gold plugin responsible for handling the API mismatch. llvm-svn: 193800	2013-10-31 20:51:58 +00:00
Chad Rosier	77ada678ed	[AArch64] Add diagnostic tests for NEON scalar shift immediate instructions (see: r193790). llvm-svn: 193798	2013-10-31 20:11:32 +00:00
Chad Rosier	20e1f20d69	[AArch64] Add support for NEON scalar shift immediate instructions. llvm-svn: 193790	2013-10-31 19:28:44 +00:00
Roman Divacky	2262cfaf19	SparcV9 doesnt have rem instruction either. llvm-svn: 193789	2013-10-31 19:22:33 +00:00
Reid Kleckner	775f29573a	Use a larger invalid attribute bitcode number That way the test won't start faililng when someone adds a new attribute and wants to use the next logical enum (38) for bitcode. The new bitcode file tries to use the number 48 as an attribute instead. llvm-svn: 193787	2013-10-31 19:12:36 +00:00
Matt Arsenault	b78b1b2330	Add FileCheck tests for @LINE llvm-svn: 193782	2013-10-31 18:18:09 +00:00
Petar Jovanovic	1f8578dca6	[mips] XFAIL several MCJIT remote tests Two of the tests are new test cases (cross-module-a.ll, multi-module-a.ll) not yet supported on MIPS, while XFAIL for the other two tests was accidentally removed in r193570 and this change reverts those lines. llvm-svn: 193781	2013-10-31 18:10:25 +00:00
Manman Ren	4dbdc9021d	Debug Info: remove duplication of DIEs when a DIE can be shared across CUs. We add a map in DwarfDebug to map MDNodes that are shareable across CUs to the corresponding DIEs: MDTypeNodeToDieMap. These DIEs can be shared across CUs, that is why we keep the maps in DwarfDebug instead of CompileUnit. We make the assumption that if a DIE is not added to an owner yet, we assume it belongs to the current CU. Since DIEs for the type system are added to their owners immediately after creation, and other DIEs belong to the current CU, the assumption should be true. A testing case is added to show that we only create a single DIE for a type MDNode and we use ref_addr to refer to the type DIE. We also add a testing case to show ref_addr relocations for non-darwin platforms. llvm-svn: 193779	2013-10-31 17:54:35 +00:00
Roman Divacky	8d72f4a06f	Merge and filecheckize. llvm-svn: 193778	2013-10-31 17:50:45 +00:00
Andrew Trick	2af716afbe	Add Verifier test case for variable argument intrinsics. llvm-svn: 193768	2013-10-31 17:18:17 +00:00
Andrew Trick	a2efd99bdf	Enable variable arguments support for intrinsics. llvm-svn: 193766	2013-10-31 17:18:11 +00:00
Cameron McInally	394d557f41	Add AVX512 unmasked integer broadcast intrinsics and support. llvm-svn: 193748	2013-10-31 13:56:31 +00:00
Elena Demikhovsky	496656900e	AVX-512: Implemented CMOV for 512-bit vectors llvm-svn: 193747	2013-10-31 13:15:32 +00:00
Richard Sandiford	f834ea19db	[SystemZ] Automatically detect zEC12 and z196 hosts As on other hosts, the CPU identification instruction is priveleged, so we need to look through /proc/cpuinfo. I copied the PowerPC way of handling "generic". Several tests were implicitly assuming z10 and so failed on z196. llvm-svn: 193742	2013-10-31 12:14:17 +00:00
Amara Emerson	f80f95fcc7	[AArch64] Make the use of FP instructions optional, but enabled by default. This adds a new subtarget feature called FPARMv8 (implied by NEON), and predicates the support of the FP instructions and registers on this feature. llvm-svn: 193739	2013-10-31 09:32:11 +00:00
NAKAMURA Takumi	160cef8ddc	llvm/test/Bitcode/invalid.ll: Tweak expresion to mach "llvm-dis.EXE:" llvm-svn: 193738	2013-10-31 06:21:00 +00:00
Rafael Espindola	26b43cac18	Fix a use after free on invalid input. llvm-svn: 193737	2013-10-31 04:20:23 +00:00
Jim Grosbach	7236678687	Legalize: Improve legalization of long vector extends. When an extend more than doubles the size of the elements (e.g., a zext from v16i8 to v16i32), the normal legalization method of splitting the vectors will run into problems as by the time the destination vector is legal, the source vector is illegal. The end result is the operation often becoming scalarized, with the typical horrible performance. For example, on x86_64, the simple input of: define void @bar(<16 x i8> %a, <16 x i32>* %p) nounwind { %tmp = zext <16 x i8> %a to <16 x i32> store <16 x i32> %tmp, <16 x i32>*%p ret void } Generates: .section __TEXT,__text,regular,pure_instructions .section __TEXT,__const .align 5 LCPI0_0: .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .long 255 ## 0xff .section __TEXT,__text,regular,pure_instructions .globl _bar .align 4, 0x90 _bar: vpunpckhbw %xmm0, %xmm0, %xmm1 vpunpckhwd %xmm0, %xmm1, %xmm2 vpmovzxwd %xmm1, %xmm1 vinsertf128 $1, %xmm2, %ymm1, %ymm1 vmovaps LCPI0_0(%rip), %ymm2 vandps %ymm2, %ymm1, %ymm1 vpmovzxbw %xmm0, %xmm3 vpunpckhwd %xmm0, %xmm3, %xmm3 vpmovzxbd %xmm0, %xmm0 vinsertf128 $1, %xmm3, %ymm0, %ymm0 vandps %ymm2, %ymm0, %ymm0 vmovaps %ymm0, (%rdi) vmovaps %ymm1, 32(%rdi) vzeroupper ret So instead we can check if there are legal types that enable us to split more cleverly when the input vector is already legal such that we don't turn it into an illegal type. If the extend is such that it's more than doubling the size of the input we check if - the number of vector elements is even, - the source type is legal, - the type of a split source is illegal, - the type of an extended (by doubling element size) source is legal, and - the type of that extended source when split is legal. If the conditions are met, instead of just splitting both the destination and the source types, we create an extend that only goes up one "step" (doubling the element width), and the continue legalizing the rest of the operation normally. The result is that this operates as a new, more effecient, termination condition for the loop of "split the operation until the destination type is legal." With this change, the above example now compiles to: _bar: vpxor %xmm1, %xmm1, %xmm1 vpunpcklbw %xmm1, %xmm0, %xmm2 vpunpckhwd %xmm1, %xmm2, %xmm3 vpunpcklwd %xmm1, %xmm2, %xmm2 vinsertf128 $1, %xmm3, %ymm2, %ymm2 vpunpckhbw %xmm1, %xmm0, %xmm0 vpunpckhwd %xmm1, %xmm0, %xmm3 vpunpcklwd %xmm1, %xmm0, %xmm0 vinsertf128 $1, %xmm3, %ymm0, %ymm0 vmovaps %ymm0, 32(%rdi) vmovaps %ymm2, (%rdi) vzeroupper ret This generalizes a custom lowering that was added a while back to the ARM backend. That lowering is no longer necessary, and is removed. The testcases for it, however, provide excellent ARM tests for this change and so remain. rdar://14735100 llvm-svn: 193727	2013-10-31 00:20:48 +00:00
Matt Arsenault	2ba54c3d90	Fix CodeGen for unaligned loads with address spaces llvm-svn: 193721	2013-10-30 23:30:05 +00:00
Matt Arsenault	38b8ecf378	Teach scalarrepl about address spaces llvm-svn: 193720	2013-10-30 22:54:58 +00:00
Rafael Espindola	6f1b2852fc	Produce .weak_def_can_be_hidden for some linkonce_odr values With this patch llvm produces a weak_def_can_be_hidden for linkonce_odr if they are also unnamed_addr or don't have their address taken. There is not a lot of documentation about .weak_def_can_be_hidden, but from the old discussion about linkonce_odr_auto_hide and the name of the directive this looks correct: these symbols can be hidden. Testing this with the ld64 in Xcode 5 linking clang reduces the number of exported symbols from 21053 to 19049. llvm-svn: 193718	2013-10-30 22:08:11 +00:00
Will Dietz	b67a714d37	Add DebugInfo testcase for high_pc encoded as constant, fixed in r193555. llvm-svn: 193711	2013-10-30 20:27:17 +00:00
Matt Arsenault	614ea99da7	Fix GVN creating bitcast between address spaces llvm-svn: 193710	2013-10-30 19:05:41 +00:00
Tom Roeder	04d88fba3e	This commit adds some (but not all) of the x86-64 relocations that are not currently supported in the ELF object writer, along with a simple test case. llvm-svn: 193709	2013-10-30 18:47:25 +00:00
Artyom Skrobov	c1be9c16bc	[ARM] NEON instructions were erroneously decoded from certain invalid encodings llvm-svn: 193705	2013-10-30 18:10:09 +00:00
Tom Stellard	c947d8ca64	R600: Custom lower f32 = uint_to_fp i64 llvm-svn: 193701	2013-10-30 17:22:05 +00:00
Daniel Sanders	d5f554f0bb	[mips][msa] Correct definition of bins[lr] and CHECK-DAG-ize related tests llvm-svn: 193695	2013-10-30 15:45:42 +00:00
Daniel Sanders	ab94b537d7	[mips][msa] Added support for matching bmnz, bmnzi, bmz, and bmzi from normal IR (i.e. not intrinsics) Also corrected the definition of the intrinsics for these instructions (the result register is also the first operand), and added intrinsics for bsel and bseli to clang (they already existed in the backend). These four operations are mostly equivalent to bsel, and bseli (the difference is which operand is tied to the result). As a result some of the tests changed as described below. bitwise.ll: - bsel.v test adapted so that the mask is unknown at compile-time. This stops it emitting bmnzi.b instead of the intended bsel.v. - The bseli.b test now tests the right thing. Namely the case when one of the values is an uimm8, rather than when the condition is a uimm8 (which is covered by bmnzi.b) compare.ll: - bsel.v tests now (correctly) emits bmnz.v instead of bsel.v because this is the same operation (see MSA.txt). i8.ll - CHECK-DAG-ized test. - bmzi.b test now (correctly) emits equivalent bmnzi.b with swapped operands because this is the same operation (see MSA.txt). - bseli.b still emits bseli.b though because the immediate makes it distinguishable from bmnzi.b. vec.ll: - CHECK-DAG-ized test. - bmz.v tests now (correctly) emits bmnz.v with swapped operands (see MSA.txt). - bsel.v tests now (correctly) emits bmnz.v with swapped operands (see MSA.txt). llvm-svn: 193693	2013-10-30 15:20:38 +00:00
Chad Rosier	be020d0309	[AArch64] Add support for NEON scalar floating-point compare instructions. llvm-svn: 193691	2013-10-30 15:19:37 +00:00
Daniel Sanders	d74b130cc9	[mips][msa] Added support for matching bins[lr]i.[bhwd] from normal IR (i.e. not intrinsics) This required correcting the definition of the bins[lr]i intrinsics because the result is also the first operand. It also required removing the (arbitrary) check for 32-bit immediates in MipsSEDAGToDAGISel::selectVSplat(). Currently using binsli.d with 2 bits set in the mask doesn't select binsli.d because the constant is legalized into a ConstantPool. Similar things can happen with binsri.d with more than 10 bits set in the mask. The resulting code when this happens is correct but not optimal. llvm-svn: 193687	2013-10-30 14:45:14 +00:00
Daniel Sanders	53fe6c4d56	[mips][msa] Combine binsri-like DAG of AND and OR into equivalent VSELECT (or (and $a, $mask), (and $b, $inverse_mask)) => (vselect $mask, $a, $b). where $mask is a constant splat. This allows bitwise operations to make use of bsel. It's also a stepping stone towards matching bins[lr], and bins[lr]i from normal IR. Two sets of similar tests have been added in this commit. The bsel_* functions test the case where binsri cannot be used. The binsr_*_i functions will start to use the binsri instruction in the next commit. llvm-svn: 193682	2013-10-30 13:51:01 +00:00
Daniel Sanders	e7ef0c817b	[mips][msa] Added support for matching splat.[bhw] from normal IR (i.e. not intrinsics) splat.d is implemented but this subtest is currently disabled. This is because it is difficult to match the appropriate IR on MIPS32. There is a patch under review that should help with this so I hope to enable the subtest soon. llvm-svn: 193680	2013-10-30 13:07:44 +00:00
Juergen Ributzka	3bd686d493	Revert "SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too." Now Hexagon and SystemZ are not happy with it :-( llvm-svn: 193677	2013-10-30 06:36:19 +00:00
Juergen Ributzka	6ad05d6b95	SelectionDAG: Teach the legalizer to split SETCC if VSELECT needs splitting too. The Type Legalizer recognizes that VSELECT needs to be split, because the type is to wide for the given target. The same does not always apply to SETCC, because less space is required to encode the result of a comparison. As a result VSELECT is split and SETCC is unrolled into scalar comparisons. This commit fixes the issue by checking for VSELECT-SETCC patterns in the DAG Combiner. If a matching pattern is found, then the result mask of SETCC is promoted to the expected vector mask type for the given target. This mask has usually the same size as the VSELECT return type (except for Intel KNL). Now the type legalizer will split both VSELECT and SETCC. This allows the following X86 DAG Combine code to sucessfully detect the MIN/MAX pattern. This fixes PR16695, PR17002, and <rdar://problem/14594431>. Reviewed by Nadav llvm-svn: 193676	2013-10-30 05:48:18 +00:00
Manman Ren	f4c339e04a	Debug Info: instead of calling addToContextOwner which constructs the context after the DIE creation, we construct the context first. Ensure that we create the context before we create a type so that we can add the newly created type to the parent. Remove last use of addToContextOwner now that it's not needed. We use createAndAddDIE to wrap around "new DIE(". Now all shareable DIEs should be added to their parents right after the creation. Reviewed off-list by Eric, Thanks. llvm-svn: 193657	2013-10-29 22:49:29 +00:00
Akira Hatanaka	6b2d841975	[mips] Align the stack to 16-bytes for mfp64. llvm-svn: 193641	2013-10-29 19:29:03 +00:00
Manman Ren	75cc7658e1	Debug Info: clean up testing case. Add a tag before the name attribute for readability. Use CHECK-NEXT instead of CHECK-NOT followed by a CHECK. Add new lines to separate checking of different DIEs. llvm-svn: 193629	2013-10-29 17:27:14 +00:00
Weiming Zhao	acf48d75e5	add test cases for frameaddr and returnaddr for aarch64 llvm-svn: 193626	2013-10-29 17:01:29 +00:00
Zoran Jovanovic	507e084a18	Support for microMIPS jump instructions llvm-svn: 193623	2013-10-29 16:38:59 +00:00
Tom Stellard	6e1ee476ab	R600/SI: Add compute support for CI v2 v2: - Fix LDS size calculation Reviewed-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 193621	2013-10-29 16:37:28 +00:00
Tom Stellard	e118b8becd	R600: Expand vector FSQRT ops llvm-svn: 193620	2013-10-29 16:37:20 +00:00
Bernard Ogden	fce246f0c6	Test cleanup for v8 instructions Add some missing tests, factor out a test not specific to v8 into its own file. llvm-svn: 193611	2013-10-29 14:16:09 +00:00
Bernard Ogden	ee87e85505	ARM: Add subtarget feature for CRC Adds a subtarget feature for the CRC instructions (optional in v8-A) to the ARM (32-bit) backend. Differential Revision: http://llvm-reviews.chandlerc.com/D2036 llvm-svn: 193599	2013-10-29 09:47:35 +00:00
Tim Northover	d29ddf6713	AArch64: add 'a' inline asm operand modifier This is used in the Linux kernel, and effectively just means "print an address". llvm-svn: 193593	2013-10-29 08:22:33 +00:00
Manman Ren	f6b936bc06	Debug Info: instead of calling addToContextOwner which constructs the context after the DIE creation, we construct the context first. This touches creation of namespaces and global variables. The purpose is to handle all DIE creations similarly: constructs the context first, then creates the DIE and immediately adds the DIE to its parent. We use createAndAddDIE to wrap around "new DIE(". llvm-svn: 193589	2013-10-29 05:49:41 +00:00
NAKAMURA Takumi	16c7184ba4	Add llvm/test/Transforms/SLPVectorizer/ARM/lit.local.cfg. Tests there require ARM in targets. llvm-svn: 193580	2013-10-29 02:46:00 +00:00
Alp Toker	6a03374526	Fix "existant" typos llvm-svn: 193579	2013-10-29 02:35:28 +00:00

... 3 4 5 6 7 ...

21837 Commits