llvm-project

Commit Graph

Author	SHA1	Message	Date
Saleem Abdulrasool	8988c2a524	ARM: correct handling of features in arch_extension The subtarget information is the ultimate source of truth for the feature set that is enabled at this point. We would previously not propagate the feature information to the subtarget. While this worked for the most part (features would be enabled/disabled as requested), if another operation that changed the feature bits was encountered (such as a mode switch via a .arm or .thumb directive), we would end up resetting the behaviour of the architectural extensions. Handling this properly requires a slightly more complicated handling. We need to check if the feature is now being toggled. If so, only then do we toggle the features. In return, we no longer have to calculate the feature bits ourselves. The test changes are mostly to the diagnosis, which is now more uniform (a nice side effect!). Add an additional test to ensure that we handle this case properly. Thanks to Nico Weber for alerting me to this issue! llvm-svn: 214057	2014-07-27 19:07:09 +00:00
Saleem Abdulrasool	45cf67b8e9	ARM: convert loop to range based Convert a loop to use range based iteration. Rename structure members to help naming, and make structure definition anonymous. NFC. llvm-svn: 214056	2014-07-27 19:07:05 +00:00
Matt Arsenault	6f2a526101	Add alignment value to allowsUnalignedMemoryAccess Rename to allowsMisalignedMemoryAccess. On R600, 8 and 16 byte accesses are mostly OK with 4-byte alignment, and don't need to be split into multiple accesses. Vector loads with an alignment of the element type are not uncommon in OpenCL code. llvm-svn: 214055	2014-07-27 17:46:40 +00:00
Tim Northover	2c46beb0d1	AArch64: fix conversion of 'J' inline asm constraints. 'J' represents a negative number suitable for an add/sub alias instruction, but while preparing it to become an int64_t we were mangling the sign extension. So "i32 -1" became 0xffffffffLL, for example. Should fix one half of PR20456. llvm-svn: 214052	2014-07-27 07:10:29 +00:00
Chandler Carruth	64a7c828cb	[x86] Sink a variable only used by asserts into the asserts. Should fix some -Werror bots, sorry for the noise. llvm-svn: 214043	2014-07-27 01:45:49 +00:00
Chandler Carruth	80c5bfd843	[x86] Add a much more powerful framework for combining x86 shuffle instructions in the legalized DAG, and leverage it to combine long sequences of instructions to PSHUFB. Eventually, the other x86-instruction-specific shuffle combines will probably all be driven out of this routine. But the real motivation is to detect after we have fully legalized and optimized a shuffle to the minimal number of x86 instructions whether it is profitable to replace the chain with a fully generic PSHUFB instruction even though doing so requires either a load from a constant pool or tying up a register with the mask. While the Intel manuals claim it should be used when it replaces 5 or more instructions (!!!!) my experience is that it is actually very fast on modern chips, and so I've gon with a much more aggressive model of replacing any sequence of 3 or more instructions. I've also taught it to do some basic canonicalization to special-purpose instructions which have smaller encodings than their generic counterparts. There are still quite a few FIXMEs here, and I've not yet implemented support for lowering blends with PSHUFB (where its power really shines due to being able to zero out lanes), but this starts implementing real PSHUFB support even when using the new, fancy shuffle lowering. =] llvm-svn: 214042	2014-07-27 01:15:58 +00:00
Chandler Carruth	3ea985b375	[ADT] Add a remarkbly useful little helper routine to ArrayRef for checking whether the ArrayRef is equal to an explicit list of arguments. This is particularly easy to implement even without variadic templates because ArrayRef happens to be homogeneously typed. As a consequence we can use a "clever" wrapper type and default arguments to capture in a single method many arguments as well as how many arguments the user specified. Thanks to Dave Blaikie for helping me pull together this little helper. Suggestions for how to improve or generalize it are of course welcome. I'll be using it immediately in my follow-up patch. =D llvm-svn: 214041	2014-07-27 01:11:19 +00:00
Matt Arsenault	24aa028cfa	R600/SI: Fix broken test. There was no check prefix for the instruction lines. Match what is emitted though, although I'm pretty sure it is incorrect. llvm-svn: 214035	2014-07-26 21:21:42 +00:00
Joey Gouly	ec981058aa	Fix the failing test 'vector-idiv.ll'. On Darwin the comment character is ##. llvm-svn: 214028	2014-07-26 10:58:14 +00:00
Matt Arsenault	a5789bb4e1	R600: Move intrinsic lowering to separate functions llvm-svn: 214023	2014-07-26 06:23:37 +00:00
Chandler Carruth	5a85c7beb8	[SDAG] Add an assert that we don't mess up the number of values when replacing nodes in the legalizer. This caught a number of bugs for me during development. llvm-svn: 214022	2014-07-26 05:53:16 +00:00
Chandler Carruth	98655fa4d8	[SDAG] Simplify the code for handling single-value nodes and add a missing transfer of debug information (without which tests fail). llvm-svn: 214021	2014-07-26 05:52:51 +00:00
Chandler Carruth	411fb407f8	[SDAG] When performing post-legalize DAG combining, run the legalizer over each node in the worklist prior to combining. This allows the combiner to produce new nodes which need to go back through legalization. This is particularly useful when generating operands to target specific nodes in a post-legalize DAG combine where the operands are significantly easier to express as pre-legalized operations. My immediate use case will be PSHUFB formation where we need to build a constant shuffle mask with a build_vector node. This also refactors the relevant functionality in the legalizer to support this, and updates relevant tests. I've spoken to the R600 folks and these changes look like improvements to them. The avx512 change needs to be investigated, I suspect there is a disagreement between the legalizer and the DAG combiner there, but it seems a minor issue so leaving it to be re-evaluated after this patch. Differential Revision: http://reviews.llvm.org/D4564 llvm-svn: 214020	2014-07-26 05:49:40 +00:00
Nick Lewycky	d7c726c5e9	Fix broken assert. llvm-svn: 214019	2014-07-26 05:44:15 +00:00
NAKAMURA Takumi	1fa7769ba9	X86ShuffleDecode.cpp: Silence a warning. [-Wunused-variable] llvm-svn: 214016	2014-07-26 04:53:05 +00:00
NAKAMURA Takumi	f2df3f59fb	llvm/test/CodeGen/X86/vector-idiv.ll: Fix for -Asserts. llvm-svn: 214015	2014-07-26 04:47:01 +00:00
Chandler Carruth	5896698e2e	[x86] Fix PR20355 (for real). There are many layers to this bug. The tale starts with r212808 which attempted to fix inversion of the low and high bits when lowering MUL_LOHI. Sadly, that commit did not include any positive test cases, and just removed some operations from a test case where the actual logic being changed isn't fully visible from the test. What this commit did was two things. First, it reversed the low and high results in the formation of the MERGE_VALUES node for the multiple results. This is entirely correct. Second it changed the shuffles for extracting the low and high components from the i64 results of the multiplies to extract them assuming a big-endian-style encoding of the multiply results. This second change is wrong. There is no big-endian encoding in x86, the results of the multiplies are normal v2i64s: when cast to v4i32, the low i32s are at offsets 0 and 2, and the high i32s are at offsets 1 and 3. However, the first change wasn't enough to actually fix the bug, which is (I assume) why the second change was also made. There was another bug in the MERGE_VALUES formation: we weren't using a VTList, and so were getting a single result node! When grabbing the second result from the node, we got... well.. colud be anything. I think this appeared to invert things, but had to be causing other problems as well. Fortunately, I fixed the MERGE_VALUES issue in r213931, so we should have been fine, right? NOOOPE! Because the core bug was never addressed, the test in vector-idiv failed when I fixed the MERGE_VALUES node. Because there are essentially no docs for this node, I had to guess at how to fix it and tried swapping the operands, restoring the order of the original code before r212808. While this "fixed" the test case (in that we produced the write instructions) we were still extracting the wrong elements of the i64s, and thus PR20355 was still broken. This commit essentially reverts the big-endian-style extraction part of r212808 and goes back to the original masks which were correct. Now that the MERGE_VALUES node formation is also correct, everything works. I've also included a more detailed test from PR20355 to make sure this stays fixed. llvm-svn: 214011	2014-07-26 03:46:57 +00:00
Chandler Carruth	591c16a967	[x86] Finish switching from CHECK to ALL. This was mistakenly included in r214007 and then reverted when I backed that (very misguided) patch out. This recovers the test case cleanup which was good. llvm-svn: 214010	2014-07-26 03:46:54 +00:00
Chandler Carruth	f6406ac5d6	[x86] Revert r214007: Fix PR20355 ... The clever way to implement signed multiplication with unsigned is already implemented and tested and working correctly. The bug is somewhere else. Re-investigating. This will teach me to not scroll far enough to read the code that did what I thought needed to be done. llvm-svn: 214009	2014-07-26 02:14:54 +00:00
Chandler Carruth	1bf4d19172	[x86] Fix PR20355 (and dups) by not using unsigned multiplication when signed multiplication is requested. While there is not a difference in the low half of the result, the high half (used specifically to implement the signed division by these constants) certainly is used. The test case I've nuked was actively asserting wrong code. There is a delightful solution to doing signed multiplication even when we don't have it that Richard Smith has crafted, but I'll add the machinery back and implement that in a follow-up patch. This at least restores correctness. llvm-svn: 214007	2014-07-26 01:52:13 +00:00
Chandler Carruth	80adc64066	[x86] Add coverage for PMUL* instruction testing on SSE2 as well as SSE4.1. llvm-svn: 214001	2014-07-26 01:11:10 +00:00
Richard Smith	bda99b5856	[modules] Work around mislayering of MC / Object. llvm-svn: 214000	2014-07-26 01:10:32 +00:00
NAKAMURA Takumi	8b2e7bfac1	Update X86/Utils/LLVMBuild.txt corresponding to r213986. "Core" has been introduced. llvm-svn: 213995	2014-07-26 00:45:43 +00:00
NAKAMURA Takumi	b7f6584d25	IR/UseTest.cpp: Avoid std::to_string() to appease mingw32 bot. llvm-svn: 213994	2014-07-26 00:45:30 +00:00
Chandler Carruth	8709cb4a6b	[x86] More cleanup for this test -- simplify the command line. llvm-svn: 213991	2014-07-26 00:21:52 +00:00
Chandler Carruth	0e469609f3	[x86] Fix unused variable warning in no-asserts build. llvm-svn: 213989	2014-07-26 00:04:41 +00:00
Chandler Carruth	6da2d97a32	[x86] FileCheck-ize this test. llvm-svn: 213988	2014-07-25 23:59:20 +00:00
Chandler Carruth	185cc18d42	[x86] Teach the X86 backend to print shuffle comments for PSHUFB instructions which happen to have a constant mask. Currently, this only handles a very narrow set of cases, but those happen to be the cases that I care about for testing shuffles sanely. This is a bit trickier than other shuffle instructions because we're decoding constants out of the constant pool. The current MC layer makes it completely impossible to inspect a constant pool entry, so we have to do it at the MI level and attach the comment to the streamer on its way out. So no joy for disassembling, but it does make test cases and asm dumps much nicer. Sorry for no test cases, but it didn't really seem that valuable to go trolling through existing old test cases and updating them. I'll have lots of testing of this in the upcoming patch for SSSE3 emission in the new vector shuffle lowering code paths. llvm-svn: 213986	2014-07-25 23:47:11 +00:00
Matt Arsenault	c824458e81	R600/SI: Allow partial unrolling and increase thresholds. llvm-svn: 213985	2014-07-25 23:02:42 +00:00
Eric Christopher	ac4b69e40b	Move R600 subtarget dependent variables onto the subtarget. No functional change. llvm-svn: 213982	2014-07-25 22:22:39 +00:00
Alex Lorenz	b2ebf2a08b	coverage: remove empty mapping regions This patch removes the empty coverage mapping regions. Those regions were produced by clang's old mapping region generation algorithm, but the new algorithm doesn't generate them. llvm-svn: 213981	2014-07-25 22:22:24 +00:00
Hal Finkel	f5867a79c5	Canonicalization for @llvm.assume Adds simple logical canonicalization of assumption intrinsics to instcombine, currently: - invariant(a && b) -> invariant(a); invariant(b) - invariant(!(a \|\| b)) -> invariant(!a); invariant(!b) llvm-svn: 213977	2014-07-25 21:45:17 +00:00
Nico Weber	a822d94f57	Wrap to 80 columns, no behavior change. llvm-svn: 213975	2014-07-25 21:37:41 +00:00
NAKAMURA Takumi	c82ee2f1ad	llvm-uselistorder: Fix up LINK_COMPONENTS. llvm-svn: 213974	2014-07-25 21:33:18 +00:00
Hal Finkel	930469107d	Add @llvm.assume, lowering, and some basic properties This is the first commit in a series that add an @llvm.assume intrinsic which can be used to provide the optimizer with a condition it may assume to be true (when the control flow would hit the intrinsic call). Some basic properties are added here: - llvm.invariant(true) is dead. - llvm.invariant(false) is unreachable (this directly corresponds to the documented behavior of MSVC's __assume(0)), so is llvm.invariant(undef). The intrinsic is tagged as writing arbitrarily, in order to maintain control dependencies. BasicAA has been updated, however, to return NoModRef for any particular location-based query so that we don't unnecessarily block code motion. llvm-svn: 213973	2014-07-25 21:13:35 +00:00
Akira Hatanaka	ba3af24c25	[stack protector] Add test cases for thumb and thumb2. <rdar://problem/12475629> llvm-svn: 213970	2014-07-25 19:47:46 +00:00
Akira Hatanaka	e5b6e0d231	[stack protector] Fix a potential security bug in stack protector where the address of the stack guard was being spilled to the stack. Previously the address of the stack guard would get spilled to the stack if it was impossible to keep it in a register. This patch introduces a new target independent node and pseudo instruction which gets expanded post-RA to a sequence of instructions that load the stack guard value. Register allocator can now just remat the value when it can't keep it in a register. <rdar://problem/12475629> llvm-svn: 213967	2014-07-25 19:31:34 +00:00
Brad Smith	a74e3f0c51	Fix arc4random detection. Patch by Pascal Stumpf. llvm-svn: 213966	2014-07-25 19:28:44 +00:00
Rafael Espindola	78cfa0c72e	Remove dead code. llvm-svn: 213963	2014-07-25 19:06:39 +00:00
Hal Finkel	7c8ae53506	[PowerPC] Support TLS on PPC32/ELF Patch by Justin Hibbits! llvm-svn: 213960	2014-07-25 17:47:22 +00:00
Juergen Ributzka	5d6c43e294	[FastISel][AArch64] Add support for frameaddress intrinsic. This commit implements the frameaddress intrinsic for the AArch64 architecture in FastISel. There were two test cases that pretty much tested the same, so I combined them to a single test case. Fixes <rdar://problem/17811834> llvm-svn: 213959	2014-07-25 17:47:14 +00:00
Duncan P. N. Exon Smith	4b4d8ecde1	Move -verify-use-list-order into llvm-uselistorder Ugh. Turns out not even transformation passes link in how to read IR. I sincerely believe the buildbots will finally agree with my system after this though. (I don't really understand why all of this has been working on my system, but not on all the buildbots.) Create a new tool called llvm-uselistorder to use for verifying use-list order. For now, just dump everything from the (now defunct) -verify-use-list-order pass into the tool. This might be a better way to test use-list order anyway. Part of PR5680. llvm-svn: 213957	2014-07-25 17:13:03 +00:00
David Blaikie	29459ae83c	Reapply "DebugInfo: Don't put fission type units in comdat sections." This recommits r208930, r208933, and r208975 (by reverting r209338) and reverts r209529 (the FIXME to readd this functionality once the tools were fixed) now that DWP has been fixed to cope with a single section for all fission type units. Original commit message: "Since type units in the dwo file are handled by a debug aware tool, they don't need to leverage the ELF comdat grouping to implement deduplication. Avoid creating all the .group sections for these as a space optimization." llvm-svn: 213956	2014-07-25 17:11:58 +00:00
Hal Finkel	869b0a1fd4	Claim AA generally as code owner As per nominations from Chandler and Arnold. llvm-svn: 213955	2014-07-25 16:45:10 +00:00
Hans Wennborg	82f490c0ba	Fix MSVC2012 build error in UseListOrder.cpp I think the compiler got confused by the nested DEBUG macros. It was failing with: UseListOrder.cpp(80) : error C2059: syntax error : '}' llvm-svn: 213954	2014-07-25 16:22:13 +00:00
Duncan P. N. Exon Smith	15eb0ab28d	Bitcode: Don't optimize constants when preserving use-list order `ValueEnumerator::OptimizeConstants()` creates forward references within the constant pools, which makes predicting constants' use-list order difficult. For now, just disable the optimization. This can be re-enabled in the future in one of two ways: - Enable a limited version of this optimization that doesn't create forward references. One idea is to categorize constants by their "height" and make that the top-level sort. - Enable it entirely. This requires predicting how may times each constant will be recreated as its operands' and operands' operands' (etc.) forward references get resolved. This is part of PR5680. llvm-svn: 213953	2014-07-25 16:13:16 +00:00
David Blaikie	2f04011435	Recommit r212203: Don't try to construct debug LexicalScopes hierarchy for functions that do not have top level debug information. Reverted by Eric Christopher (Thanks!) in r212203 after Bob Wilson reported LTO issues. Duncan Exon Smith and Aditya Nandakumar helped provide a reduced reproduction, though the failure wasn't too hard to guess, and even easier with the example to confirm. The assertion that the subprogram metadata associated with an llvm::Function matches the scope data referenced by the DbgLocs on the instructions in that function is not valid under LTO. In LTO, a C++ inline function might exist in multiple CUs and the subprogram metadata nodes will refer to the same llvm::Function. In this case, depending on the order of the CUs, the first intance of the subprogram metadata may not be the one referenced by the instructions in that function and the assertion will fail. A test case (test/DebugInfo/cross-cu-linkonce-distinct.ll) is added, the assertion removed and a comment added to explain this situation. This was then reverted again in r213581 as it caused PR20367. The root cause of this was the early exit in LiveDebugVariables meant that spurious DBG_VALUE intrinsics that referenced dead variables were not removed, causing an assertion/crash later on. The fix is to have LiveDebugVariables strip all DBG_VALUE intrinsics in functions without debug info as they're not needed anyway. Test case added to cover this situation (that occurs when a debug-having function is inlined into a nodebug function) in test/DebugInfo/X86/nodebug_with_debug_loc.ll Original commit message: If a function isn't actually in a CU's subprogram list in the debug info metadata, ignore all the DebugLocs and don't try to build scopes, track variables, etc. While this is possibly a minor optimization, it's also a correctness fix for an incoming patch that will add assertions to LexicalScopes and the debug info verifier to ensure that all scope chains lead to debug info for the current function. Fix up a few test cases that had broken/incomplete debug info that could violate this constraint. Add a test case where this occurs by design (inlining a debug-info-having function in an attribute nodebug function - we want this to work because /if/ the nodebug function is then inlined into a debug-info-having function, it should be fine (and will work fine - we just stitch the scopes up as usual), but should the inlining not happen we need to not assert fail either). llvm-svn: 213952	2014-07-25 16:10:16 +00:00
David Blaikie	48af9c3527	DebugInfo: Fix up some test cases to have more correct debug info metadata. * Add CUs to the named CU node * Add missing DW_TAG_subprogram nodes * Add llvm::Functions to the DW_TAG_subprogram nodes This cleans up the tests so that they don't break under a soon-to-be-made change that is more strict about such things. llvm-svn: 213951	2014-07-25 16:05:18 +00:00
Hal Finkel	df14364f30	Add code owner of scoped-noalias metadata Add myself as the code owner for the scoped-noalias metadata I've developed. llvm-svn: 213950	2014-07-25 15:54:55 +00:00
Hal Finkel	ff0bcb60c9	Convert noalias parameter attributes into noalias metadata during inlining This functionality is currently turned off by default. Part of the motivation for introducing scoped-noalias metadata is to enable the preservation of noalias parameter attribute information after inlining. Sometimes this can be inferred from the code in the caller after inlining, but often we simply lose valuable information. The overall process if fairly simple: 1. Create a new unqiue scope domain. 2. For each (used) noalias parameter, create a new alias scope. 3. For each pointer, collect the underlying objects. Add a noalias scope for each noalias parameter from which we're not derived (and has not been captured prior to that point). 4. Add an alias.scope for each noalias parameter from which we might be derived (or has been captured before that point). Note that the capture checks apply only if one of the underlying objects is not an identified function-local object. llvm-svn: 213949	2014-07-25 15:50:08 +00:00
Hal Finkel	029cde639c	Simplify and improve scoped-noalias metadata semantics In the process of fixing the noalias parameter -> metadata conversion process that will take place during inlining (which will be committed soon, but not turned on by default), I have come to realize that the semantics provided by yesterday's commit are not really what we want. Here's why: void foo(noalias a, noalias b, noalias c, bool x) { q = x ? a : b; c = q; } Generically, we know that c does not alias with a and with b (so there is an 'and' in what we know we're not), and we know that q might be derived from a or from *b (so there is an 'or' in what we know that we are). So we do not want the semantics currently, where any noalias scope matching any alias.scope causes a NoAlias return. What we want to know is that the noalias scopes form a superset of the alias.scope list (meaning that all the things we know we're not is a superset of all of things the other instruction might be). Making that change, however, introduces a composibility problem. If we inline once, adding the noalias metadata, and then inline again adding more, and we append new scopes onto the noalias and alias.scope lists each time. But, this means that we could change what was a NoAlias result previously into a MayAlias result because we appended an additional scope onto one of the alias.scope lists. So, instead of giving scopes the ability to have parents (which I had borrowed from the TBAA implementation, but seems increasingly unlikely to be useful in practice), I've given them domains. The subset/superset condition now applies within each domain independently, and we only need it to hold in one domain. Each time we inline, we add the new scopes in a new scope domain, and everything now composes nicely. In addition, this simplifies the implementation. llvm-svn: 213948	2014-07-25 15:50:02 +00:00
Duncan P. N. Exon Smith	20a005f27a	Try to fix a layering violation introduced by r213945 The dragonegg buildbot (and others?) started failing after r213945/r213946 because `llvm-as` wasn't linking in the bitcode reader. I think moving the verify functions to the same file as the verify pass should fix the build. Adding a command-line option for maintaining use-list order in assembly as a drive-by to prevent warnings about unused static functions. llvm-svn: 213947	2014-07-25 15:41:49 +00:00
Duncan P. N. Exon Smith	f62acab1c0	Fix -Werror build after r213945 llvm-svn: 213946	2014-07-25 15:00:02 +00:00
Duncan P. N. Exon Smith	6b6fdc992a	IPO: Add use-list-order verifier Add a -verify-use-list-order pass, which shuffles use-list order, writes to bitcode, reads back, and verifies that the (shuffled) order matches. - The utility functions live in lib/IR/UseListOrder.cpp. - Moved (and renamed) the command-line option to enable writing use-lists, so that this pass can return early if the use-list orders aren't being serialized. It's not clear that this pass is the right direction long-term (perhaps a separate tool instead?), but short-term it's a great way to test the use-list order prototype. I've added an XFAIL-ed testcase that I'm hoping to get working pretty quickly. This is part of PR5680. llvm-svn: 213945	2014-07-25 14:49:26 +00:00
Amara Emerson	115d2df8a4	[ARM] Emit ABI_PCS_R9_use build attribute. Patch by Ben Foster! Differential Revision: http://reviews.llvm.org/D4657 llvm-svn: 213944	2014-07-25 14:03:14 +00:00
Benjamin Kramer	1f8930e3d3	Run sort_includes.py on the AArch64 backend. No functionality change. llvm-svn: 213938	2014-07-25 11:42:14 +00:00
Chandler Carruth	da490d2ec1	[cmake] Use the external project machinery for libcxxabi so that it can be disabled in CMake or relocated if desired. llvm-svn: 213936	2014-07-25 10:27:40 +00:00
NAKAMURA Takumi	b4553da481	llvm/test/CodeGen/ARM/inlineasm-global.ll: Add explicit triple to appease targeting *-win32. llvm-svn: 213933	2014-07-25 09:55:01 +00:00
NAKAMURA Takumi	dd620a242a	llvm/test/CodeGen/ARM/inlineasm-global.ll: Avoid specifing source file on llc. It sometimes confuses FileCheck. Consider the case that path contains 'stmib'. :) llvm-svn: 213932	2014-07-25 09:54:49 +00:00
Chandler Carruth	3de980d2ff	[SDAG] Enable the new assert for out-of-range result numbers in SDValues, fixing the two bugs left in the regression suite. The key for both of these was the use a single value type rather than a VTList which caused an unintentionally single-result merge-value node. Fix this by getting the appropriate VTList in place. Doing this exposed that the comments in x86's code abouth how MUL_LOHI operands are handle is wrong. The bug with the use of out-of-range result numbers was hiding the bug about the order of operands here (as best i can tell). There are more places where the code appears to get this backwards still... llvm-svn: 213931	2014-07-25 09:19:23 +00:00
Chandler Carruth	eae2d28cc9	[SDAG] Don't insert the VRBase into a mapping from SDValues when the def doesn't actually correspond to an SDValue at all. Fixes most of the remaining asserts on out-of-range SDValue result numbers. llvm-svn: 213930	2014-07-25 09:19:18 +00:00
Matt Arsenault	197a1e26e3	Store nodes only have 1 result. llvm-svn: 213928	2014-07-25 07:56:42 +00:00
Chandler Carruth	94bd553eb8	[SDAG] Start plumbing an assert into SDValues that we don't form one with a result number outside the range of results for the node. I don't know how we managed to not really check this very basic invariant for so long, but the code is very broken at this point. I have over 270 test failures with the assert enabled. I'm committing it disabled so that others can join in the cleanup effort and reproduce the issues. I've also included one of the obvious fixes that I already found. More fixes to come. llvm-svn: 213926	2014-07-25 07:23:23 +00:00
Akira Hatanaka	16e47ff42e	[ARM] In thumb mode, emit directive ".code 16" before file level inline assembly instructions. This is necessary to ensure ARM assembler switches to Thumb mode before it starts assembling the file level inline assembly instructions at the beginning of a .s file. <rdar://problem/17757232> llvm-svn: 213924	2014-07-25 05:12:49 +00:00
Lang Hames	98c3c0f38a	[X86] Add comments to clarify some non-obvious lines in the stackmap-nops.ll testcases. Based on code review from Philip Reames. Thanks Philip! llvm-svn: 213923	2014-07-25 04:50:08 +00:00
David Majnemer	bf32f773cc	llvm-vtabledump: use a std::map instead of a StringMap for VBTables StringMap doesn't guarantee any particular iteration order, this is suboptimal when comparing llvm-vtabledump's output for two object files. llvm-svn: 213921	2014-07-25 04:30:11 +00:00
Ehsan Akhgari	29b61ce770	Fix a warning in CoverageMappingReader.cpp llvm-svn: 213920	2014-07-25 02:51:57 +00:00
Lang Hames	5432649be7	[X86] Clarify some stackmap shadow optimization code as based on review feedback from Eric Christopher. No functional change. llvm-svn: 213917	2014-07-25 02:29:19 +00:00
Bill Schmidt	c9fa5dd618	[PATCH][PPC64LE] Correct little-endian usage of vmrgh* and vmrgl. Because the PowerPC vmrgh and vmrgl* instructions have a built-in big-endian bias, it is necessary to swap their inputs in little-endian mode when using them to implement a vector shuffle. This was previously missed in the vector LE implementation. There was already logic to distinguish between unary and "normal" vmrg* vector shuffles, so this patch extends that logic to use a third option: "swapped" vmrg* vector shuffles that are used for little endian in place of the "normal" ones. I've updated the vec-shuffle-le.ll test to check for the expected register ordering on the generated instructions. This bug was discovered when testing the LE and ELFv2 patches for safety if they were backported to 3.4. A different vectorization decision was made in 3.4 than on mainline trunk, and that exposed the problem. I've verified this fix takes care of that issue. llvm-svn: 213915	2014-07-25 01:55:55 +00:00
Alex Lorenz	a20a5d50ba	Add code coverage mapping data, reader, and writer. This patch implements the data structures, the reader and the writers for the new code coverage mapping system. The new code coverage mapping system uses the instrumentation based profiling to provide code coverage analysis. llvm-svn: 213910	2014-07-24 23:57:54 +00:00
Alex Lorenz	817e485470	Add code coverage mapping data, reader, and writer. This patch implements the data structures, the reader and the writers for the new code coverage mapping system. The new code coverage mapping system uses the instrumentation based profiling to provide code coverage analysis. llvm-svn: 213909	2014-07-24 23:55:56 +00:00
Kevin Enderby	08e1bbd645	Add an implementation for llvm-nm’s -print-file-name option (aka -o and -A). The -print-file-name option in llvm-nm is to precede each symbol with the object file it came from. While code for the parsing of this option and its aliases existed there was no code to implement it. llvm-svn: 213906	2014-07-24 23:31:52 +00:00
David Majnemer	ab131e86ce	Opportunistically fix the builders A builder complained that it couldn't find llvm-vtabledump, this is probably because it wasn't a dependency of the 'test' target. llvm-svn: 213905	2014-07-24 23:26:54 +00:00
David Majnemer	72ab1a5aee	llvm-vtabledump: A vtable dumper This tool's job is to dump the vtables inside object files. It is currently limited to MS ABI vf- and vb-tables but it will eventually support Itanium-style v-tables as well. Differential Revision: http://reviews.llvm.org/D4584 llvm-svn: 213903	2014-07-24 23:14:40 +00:00
Mark Heffernan	8ec1474f7f	After unrolling a loop with llvm.loop.unroll.count metadata (unroll factor hint) the loop unroller replaces the llvm.loop.unroll.count metadata with llvm.loop.unroll.disable metadata to prevent any subsequent unrolling passes from unrolling more than the hint indicates. This patch fixes an issue where loop unrolling could be disabled for other loops as well which share the same llvm.loop metadata. llvm-svn: 213900	2014-07-24 22:36:40 +00:00
Joerg Sonnenberger	b5459e6e22	Don't use 128bit functions on PPC32. llvm-svn: 213899	2014-07-24 22:20:10 +00:00
Chandler Carruth	9f4530b95d	[SDAG] Introduce a combined set to the DAG combiner which tracks nodes which have successfully round-tripped through the combine phase, and use this to ensure all operands to DAG nodes are visited by the combiner, even if they are only added during the combine phase. This is critical to have the combiner reach nodes that are introduced during combining. Previously these would sometimes be visited and sometimes not be visited based on whether they happened to end up on the worklist or not. Now we always run them through the combiner. This fixes quite a few bad codegen test cases lurking in the suite while also being more principled. Among these, the TLS codegeneration is particularly exciting for programs that have this in the critical path like TSan-instrumented binaries (although I think they engineer to use a different TLS that is faster anyways). I've tried to check for compile-time regressions here by running llc over a merged (but not LTO-ed) clang bitcode file and observed at most a 3% slowdown in llc. Given that this is essentially a worst case (none of opt or clang are running at this phase) I think this is tolerable. The actual LTO case should be even less costly, and the cost in normal compilation should be negligible. With this combining logic, it is possible to re-legalize as we combine which is necessary to implement PSHUFB formation on x86 as a post-legalize DAG combine (my ultimate goal). Differential Revision: http://reviews.llvm.org/D4638 llvm-svn: 213898	2014-07-24 22:15:28 +00:00
Chandler Carruth	80b869461e	[x86] Make vector legalization of extloads work more like the "normal" vector operation legalization with support for custom target lowering and fallback to expand when it fails, and use this to implement sext and anyext load lowering for x86 in a more principled way. Previously, the x86 backend relied on a target DAG combine to "combine away" sextload and extload nodes prior to legalization, or would expand them during legalization with terrible code. This is particularly problematic because the DAG combine relies on running over non-canonical DAG nodes at just the right time to match several common and important patterns. It used a combine rather than lowering because we didn't have good lowering support, and to expose some tricks being employed to more combine phases. With this change it becomes a proper lowering operation, the backend marks that it can lower these nodes, and I've added support for handling the canonical forms that don't have direct legal representations such as sextload of a v4i8 -> v4i64 on AVX1. With this change, our test cases for this behavior continue to pass even after the DAG combiner beigns running more systematically over every node. There is some noise caused by this in the test suite where we actually use vector extends instead of subregister extraction. This doesn't really seem like the right thing to do, but is unlikely to be a critical regression. We do regress in one case where by lowering to the target-specific patterns early we were able to combine away extraneous legal math nodes. However, this regression is completely addressed by switching to a widening based legalization which is what I'm working toward anyways, so I've just switched the test to that mode. Differential Revision: http://reviews.llvm.org/D4654 llvm-svn: 213897	2014-07-24 22:09:56 +00:00
Saleem Abdulrasool	8dc8fb18d8	Target: invert condition for Windows The Microsoft ABI and MSVCRT are considered the canonical C runtime and ABI. The long double routines are not part of this environment. However, cygwin and MinGW both provide supplementary implementations. Change the condition to reflect this reality. llvm-svn: 213896	2014-07-24 22:09:06 +00:00
Manman Ren	4d189fb9a6	Feedback from Hans on r213815. No functionaility change. llvm-svn: 213895	2014-07-24 21:13:20 +00:00
Hans Wennborg	e34a71aa91	Windows: Don't wildcard expand /? or -? Even if there's a file called c:\a, we want /? to be preserved as an option, not expanded to a filename. llvm-svn: 213894	2014-07-24 21:09:45 +00:00
Lang Hames	f49bc3f1b1	[X86] Optimize stackmap shadows on X86. This patch minimizes the number of nops that must be emitted on X86 to satisfy stackmap shadow constraints. To minimize the number of nops inserted, the X86AsmPrinter now records the size of the most recent stackmap's shadow in the StackMapShadowTracker class, and tracks the number of instruction bytes emitted since the that stackmap instruction was encountered. Padding is emitted (if it is required at all) immediately before the next stackmap/patchpoint instruction, or at the end of the basic block. This optimization should reduce code-size and improve performance for people using the llvm stackmap intrinsic on X86. <rdar://problem/14959522> llvm-svn: 213892	2014-07-24 20:40:55 +00:00
Reid Kleckner	9a412d13c1	Replace an assertion with a fatal error Frontends are responsible for putting inalloca on parameters that would be passed in memory and not registers. llvm-svn: 213891	2014-07-24 19:53:33 +00:00
Joerg Sonnenberger	6637d4e2e7	Use the same .eh_frame encoding for 32bit PPC as on i386. llvm-svn: 213890	2014-07-24 19:25:16 +00:00
Manman Ren	29a2005596	Try to fix the bots again by moving test to X86 directory. llvm-svn: 213884	2014-07-24 17:57:09 +00:00
Saleem Abdulrasool	c61ed0474e	X86: correct library call setup for Windows itanium This target is identical to the Windows MSVC (and follows Microsoft ABI for C). Correct the library call setup for this target. The same set of library calls are missing on this environment. llvm-svn: 213883	2014-07-24 17:46:36 +00:00
Matt Arsenault	83592a2d32	R600: Add FMA instructions for Evergreen llvm-svn: 213882	2014-07-24 17:41:01 +00:00
Manman Ren	a8bc9a4c36	Try to fix the bots. If this does not work, I am going to move it to X86 directory. llvm-svn: 213880	2014-07-24 17:18:33 +00:00
Saleem Abdulrasool	34610e33ae	X86: silence sign comparison warning GCC 4.8 detected a signed compare [-Wsign-compare]. Add a cast for the destination index. Add an assert to catch a potential overflow however unlikely it may be. llvm-svn: 213878	2014-07-24 17:12:06 +00:00
Matt Arsenault	83e60581c3	R600: Add new functions for splitting vector loads and stores. These will be used in future patches and shouldn't change anything yet. llvm-svn: 213877	2014-07-24 17:10:35 +00:00
Nico Weber	155dccd1eb	Let the integrated assembler understand .exitm, PR20426. llvm-svn: 213876	2014-07-24 17:08:39 +00:00
Nico Weber	2a8f922b1c	Remove unused field MacroInstantiation::TheMacro. No behavior change. llvm-svn: 213874	2014-07-24 16:29:04 +00:00
Nico Weber	404012b7dc	Let the integrated assembler understand .warning, PR20428. llvm-svn: 213873	2014-07-24 16:26:06 +00:00
Joerg Sonnenberger	c7dbc13e77	Include relative path for header outside the current directory. llvm-svn: 213872	2014-07-24 16:04:46 +00:00
Rafael Espindola	8c4c0213fd	Remove dead code. Every user has been switched to using EngineBuilder. llvm-svn: 213871	2014-07-24 16:02:28 +00:00
Tim Northover	7324e845a4	AArch64: refactor ReconstructShuffle function Quite a bit of cruft had accumulated as we realised the various different cases it had to handle and squeezed them in where possible. This refactoring mostly flattens the logic and special-cases. The result is slightly longer, but I think clearer. Should be no functionality change. llvm-svn: 213867	2014-07-24 15:39:55 +00:00
Duncan P. N. Exon Smith	857fd660d8	Fix r213824 on windows llvm-svn: 213866	2014-07-24 15:16:23 +00:00
Hal Finkel	9414665a3b	Add scoped-noalias metadata This commit adds scoped noalias metadata. The primary motivations for this feature are: 1. To preserve noalias function attribute information when inlining 2. To provide the ability to model block-scope C99 restrict pointers Neither of these two abilities are added here, only the necessary infrastructure. In fact, there should be no change to existing functionality, only the addition of new features. The logic that converts noalias function parameters into this metadata during inlining will come in a follow-up commit. What is added here is the ability to generally specify noalias memory-access sets. Regarding the metadata, alias-analysis scopes are defined similar to TBAA nodes: !scope0 = metadata !{ metadata !"scope of foo()" } !scope1 = metadata !{ metadata !"scope 1", metadata !scope0 } !scope2 = metadata !{ metadata !"scope 2", metadata !scope0 } !scope3 = metadata !{ metadata !"scope 2.1", metadata !scope2 } !scope4 = metadata !{ metadata !"scope 2.2", metadata !scope2 } Loads and stores can be tagged with an alias-analysis scope, and also, with a noalias tag for a specific scope: ... = load %ptr1, !alias.scope !{ !scope1 } ... = load %ptr2, !alias.scope !{ !scope1, !scope2 }, !noalias !{ !scope1 } When evaluating an aliasing query, if one of the instructions is associated with an alias.scope id that is identical to the noalias scope associated with the other instruction, or is a descendant (in the scope hierarchy) of the noalias scope associated with the other instruction, then the two memory accesses are assumed not to alias. Note that is the first element of the scope metadata is a string, then it can be combined accross functions and translation units. The string can be replaced by a self-reference to create globally unqiue scope identifiers. [Note: This overview is slightly stylized, since the metadata nodes really need to just be numbers (!0 instead of !scope0), and the scope lists are also global unnamed metadata.] Existing noalias metadata in a callee is "cloned" for use by the inlined code. This is necessary because the aliasing scopes are unique to each call site (because of possible control dependencies on the aliasing properties). For example, consider a function: foo(noalias a, noalias b) { a = b; } that gets inlined into bar() { ... if (...) foo(a1, b1); ... if (...) foo(a2, b2); } -- now just because we know that a1 does not alias with b1 at the first call site, and a2 does not alias with b2 at the second call site, we cannot let inlining these functons have the metadata imply that a1 does not alias with b2. llvm-svn: 213864	2014-07-24 14:25:39 +00:00
Aaron Ballman	99e0ea0aa8	Fixing an MSVC conversion warning about implicitly converting the shift results to 64-bits. No functional change intended. llvm-svn: 213863	2014-07-24 14:24:59 +00:00
Chandler Carruth	a9efa1ac3e	[Target] Teach the query interfaces for lowering of extloads and truncstores to support EVTs and return expand for non-simple ones. This makes them more consistent with the isLegal... query style methods and makes using them simpler in many scenarios. No functionality actually changed. llvm-svn: 213860	2014-07-24 12:20:53 +00:00
Hal Finkel	cc39b67530	AA metadata refactoring (introduce AAMDNodes) In order to enable the preservation of noalias function parameter information after inlining, and the representation of block-level __restrict__ pointer information (etc.), additional kinds of aliasing metadata will be introduced. This metadata needs to be carried around in AliasAnalysis::Location objects (and MMOs at the SDAG level), and so we need to generalize the current scheme (which is hard-coded to just one TBAA MDNode). This commit introduces only the necessary refactoring to allow for the introduction of other aliasing metadata types, but does not actually introduce any (that will come in a follow-up commit). What it does introduce is a new AAMDNodes structure to hold all of the aliasing metadata nodes associated with a particular memory-accessing instruction, and uses that structure instead of the raw MDNode in AliasAnalysis::Location, etc. No functionality change intended. llvm-svn: 213859	2014-07-24 12:16:19 +00:00
NAKAMURA Takumi	8d745ca7cc	Prune redundant libdeps. llvm-svn: 213857	2014-07-24 11:45:27 +00:00
NAKAMURA Takumi	98d18be5fe	Prune dependency to MC from each target disassembler. llvm-svn: 213856	2014-07-24 11:45:11 +00:00
NAKAMURA Takumi	5cc4606378	[CMake] tools/lto: Prune redundant libdep(s). llvm-svn: 213855	2014-07-24 11:44:44 +00:00
NAKAMURA Takumi	f4d666f54c	[CMake] LineEditorTests: Add Support to link_components. Even if LLVMSupport is added in add_unittests, LLVMSupport may be here as consistency. llvm-svn: 213854	2014-07-24 11:44:33 +00:00
Tilmann Scheller	96ef72e54a	[ARM] Make the assembler reject unpredictable pre/post-indexed ARM STRH instructions. The ARM ARM prohibits STRH instructions with writeback into the source register. With this commit this constraint is now enforced and we stop assembling STRH instructions with unpredictable behavior. llvm-svn: 213850	2014-07-24 09:55:46 +00:00
Daniel Sanders	bdcfab117c	[mips] Fix ll and sc instructions Summary: The ll and sc instructions for r6 and non-r6 are misplaced. This patch fixes that. Patch by Jyun-Yan You Differential Revision: http://reviews.llvm.org/D4578 llvm-svn: 213847	2014-07-24 09:47:14 +00:00
Matt Arsenault	9acb978105	R600: Match rcp node on pre-SI llvm-svn: 213844	2014-07-24 06:59:24 +00:00
Matt Arsenault	0daeb63f03	R600: Fix LowerSDIV24 Use ComputeNumSignBits instead of checking for i8 / i16 which only worked when AMDIL was lying about having legal i8 / i16. If an integer is known to fit in 24-bits, we can do division faster with float ops. llvm-svn: 213843	2014-07-24 06:59:20 +00:00
Rafael Espindola	7fd11896a8	Remove unused substitution. llvm-svn: 213839	2014-07-24 04:09:04 +00:00
Duncan P. N. Exon Smith	c2c410d7ec	IR: Fix comment from r213824 llvm-svn: 213836	2014-07-24 02:56:59 +00:00
NAKAMURA Takumi	43b9a9b2c8	Remove a stray semicolon. [-Wpedantic] llvm-svn: 213833	2014-07-24 02:11:24 +00:00
NAKAMURA Takumi	9c3bd7618a	Update library dependencies. llvm-svn: 213832	2014-07-24 02:10:42 +00:00
Matt Arsenault	034d666bb7	R600: Implement enableClusterLoads() llvm-svn: 213831	2014-07-24 02:10:17 +00:00
Kevin Qin	9a2a2c502b	[AArch64] Fix a bug generating incorrect instruction when building small vector. This bug is introduced by r211144. The element of operand may be smaller than the element of result, but previous commit can only handle the contrary condition. This commit is to handle this scenario and generate optimized codes like ZIP1. llvm-svn: 213830	2014-07-24 02:05:42 +00:00
Jiangning Liu	451f30e89f	[AArch64] Disable some optimization cases for type conversion from sint to fp, because those optimization cases are micro-architecture dependent and only make sense for Cyclone. A new predicate Cyclone is introduced in .td file. llvm-svn: 213827	2014-07-24 01:29:59 +00:00
Filipe Cabecinhas	933cccf3fa	Fixed PR20411 - bug in getINSERTPS() When we had a vector_shuffle where we had an input from each vector, we could miscompile it because we were assuming the input from V2 wouldn't be moved from where it was on the vector. Added a test case. llvm-svn: 213826	2014-07-24 01:28:21 +00:00
Duncan P. N. Exon Smith	1698de2879	IR: Add Value::sortUseList() Add `Value::sortUseList()`, templated on the comparison function to use. The sort is an iterative merge sort that uses a binomial vector of already-merged lists to limit the size overhead to `O(1)`. This is part of PR5680. llvm-svn: 213824	2014-07-24 00:53:19 +00:00
Reid Kleckner	2de13f620d	Add a VS "14" msbuild toolset This allows people to try clang inside MSBuild with the VS "14" CTP releases. Fixes PR20341. Patch by Marcel Raad! llvm-svn: 213819	2014-07-23 23:49:16 +00:00
Manman Ren	edc60376ed	SimplifyCFG: fix a bug in switch to table conversion We use gep to access the global array "switch.table", and the table index should be treated as unsigned. When the highest bit is 1, this commit zero-extends the index to an integer type with larger size. For a switch on i2, we used to generate: %switch.tableidx = sub i2 %0, -2 getelementptr inbounds [4 x i64]* @switch.table, i32 0, i2 %switch.tableidx It is incorrect when %switch.tableidx is 2 or 3. The fix is to generate %switch.tableidx = sub i2 %0, -2 %switch.tableidx.zext = zext i2 %switch.tableidx to i3 getelementptr inbounds [4 x i64]* @switch.table, i32 0, i3 %switch.tableidx.zext rdar://17735071 llvm-svn: 213815	2014-07-23 23:13:23 +00:00
Rafael Espindola	45bcf8a59c	Fix the build when building with only the ARM backend. llvm-svn: 213814	2014-07-23 22:54:28 +00:00
Rafael Espindola	8d5432c2f7	Document what backwards compatibility we provide for bitcode. llvm-svn: 213813	2014-07-23 22:43:22 +00:00
NAKAMURA Takumi	c0eb860c21	Let llvm/test/CodeGen/X86/avx512*-mask-op.ll(s) aware of Win32 x64 calling convention. llvm-svn: 213812	2014-07-23 22:38:25 +00:00
Eric Christopher	f19d12ba3c	Fix indenting. llvm-svn: 213811	2014-07-23 22:34:13 +00:00
Chandler Carruth	9f2a54c579	[x86] Rip out some broken test cases for avx512 i1 store support. It isn't reasonable to test storing things using undef pointers -- storing through those is at best "good luck" and really should be transformed to "unreachable". Random changes in the combiner can randomly break these tests for no good reason. I'm following up on the original commit regarding the right long-term strategy here. llvm-svn: 213810	2014-07-23 22:29:19 +00:00
Eric Christopher	6d0e40bfbf	Reorganize and simplify local variables. llvm-svn: 213809	2014-07-23 22:27:10 +00:00
Rafael Espindola	5addace56d	Finish inverting the MC -> Object dependency. There were still some disassembler bits in lib/MC, but their use of Object was only visible in the includes they used, not in the symbols. llvm-svn: 213808	2014-07-23 22:26:07 +00:00
Juergen Ributzka	fa154f03d1	[RuntimeDyld][AArch64] Update relocation tests and also add a simple GOT test. llvm-svn: 213807	2014-07-23 22:23:17 +00:00
Eric Christopher	9d9167950e	Remove the query for TargetMachine and TargetInstrInfo since we're already inside TargetInstrInfo. llvm-svn: 213806	2014-07-23 22:12:03 +00:00
David Blaikie	8e9cfa5497	ArgPromo+DebugInfo: Handle updating debug info over multiple applications of argument promotion. While the subprogram map cache used by Dead Argument Elimination works there, I made a mistake when reusing it for Argument Promotion in r212128 because ArgPromo may transform functions more than once whereas DAE transforms each function only once, removing all the dead arguments in one go. To address this, ensure that the map is updated after each argument promotion. In retrospect it might be a little wasteful to create a map of all subprograms when only handling a single CGSCC, but the alternative is walking the debug info for each function in the CGSCC that gets updated. It's not clear to me what the right tradeoff is there, but since the current tradeoff seems to be working OK (and the code to keep things updated is very cheap), let's stick with that for now. llvm-svn: 213805	2014-07-23 22:09:29 +00:00
David Blaikie	f997c6f90b	Test debug info in arg promotion with an actual promotion case, rather than a degenerate arg promotion that's actually DAE performed by ArgPromo Also the debug location I had here was bogus, describing the location of the call site as in the callee - and unnecessary, so just drop it. llvm-svn: 213803	2014-07-23 21:30:59 +00:00
Jim Grosbach	d3c7942f4a	Use an explicit triple in testcase. Make the test work better on non-darwin hosts. Hopefully. llvm-svn: 213801	2014-07-23 20:46:32 +00:00
Jim Grosbach	724e438c62	[X86,AArch64] Extend vcmp w/ unary op combine to work w/ more constants. The transform to constant fold unary operations with an AND across a vector comparison applies when the constant is not a splat of a scalar as well. llvm-svn: 213800	2014-07-23 20:41:43 +00:00
Jim Grosbach	8f6f0858ec	X86: restrict combine to when type sizes are safe. The folding of unary operations through a vector compare and mask operation is only safe if the unary operation result is of the same size as its input. For example, it's not safe for [su]itofp from v4i32 to v4f64. llvm-svn: 213799	2014-07-23 20:41:38 +00:00
Jim Grosbach	19dd3088c0	DAG: fp->int conversion for non-splat constants. Constant fold the lanes of the input constant build_vector individually so we correctly handle when the vector elements are not all the same constant value. PR20394 llvm-svn: 213798	2014-07-23 20:41:31 +00:00
Justin Holewinski	4d6f783281	[NVPTX] Add some extra tests for mul.wide to test non-power-of-two source types llvm-svn: 213794	2014-07-23 20:23:49 +00:00
Justin Holewinski	2cb5e181d1	[NVPTX] Silence a GCC warning found by the buildbots The cast to NVPTXTargetLowering was missing a 'const', but let's just access the right pointer through the subtarget anyway. llvm-svn: 213793	2014-07-23 20:23:47 +00:00
Mark Heffernan	9e112443b6	Do not add unroll disable metadata after unrolling pass for loops with #pragma clang loop unroll(full). llvm-svn: 213789	2014-07-23 20:05:44 +00:00
Juergen Ributzka	1b014504ab	[FastISel][AArch64] Fix return type in FastLowerCall. I used the wrong method to obtain the return type inside FinishCall. This fix simply uses the return type from FastLowerCall, which we already determined to be a valid type. Reduced test case from Chad. Thanks. llvm-svn: 213788	2014-07-23 20:03:13 +00:00
Justin Holewinski	ecca715b3c	[NVPTX] mul.wide generation works for any smaller integer source types, not just the next smaller power of two llvm-svn: 213784	2014-07-23 18:46:03 +00:00
Robert Khasanov	11d08548fd	[SKX] Added missed test files for rev 213757 llvm-svn: 213780	2014-07-23 18:17:49 +00:00
Saleem Abdulrasool	df2c3e89b5	AsmParser: remove deprecated LLIR support linker_private and linker_private_weak were deprecated in 3.5. Remove support for them now that the 3.5 branch has been created. llvm-svn: 213777	2014-07-23 18:09:31 +00:00
Saleem Abdulrasool	ac94ec025c	ExecutionEngine: remove a stray semicolon Detected via GCC 4.8 [-Wpedantic]. llvm-svn: 213776	2014-07-23 18:09:28 +00:00
Robert Khasanov	7a96f01ca1	[SKX] Fix lowercase "error:" in rev 213757 llvm-svn: 213774	2014-07-23 17:42:13 +00:00
Justin Holewinski	511664dc76	[NVPTX] Make sure we do not generate MULWIDE ISD nodes when optimizations are disabled With optimizations disabled, we disable the isel patterns for mul.wide; but we were still generating MULWIDE ISD nodes. Now, we only try to generate MULWIDE ISD nodes in DAGCombine if the optimization level is not zero. llvm-svn: 213773	2014-07-23 17:40:45 +00:00
Mark Heffernan	e6b4ba1c41	In unroll pragma syntax and loop hint metadata, change "enable" forms to a new form using the string "full". llvm-svn: 213772	2014-07-23 17:31:37 +00:00
Alex Lorenz	4c7ceab219	test commit: remove trailing space llvm-svn: 213770	2014-07-23 17:18:05 +00:00
Chad Rosier	17020f96c7	[AArch64] Lower sdiv x, pow2 using add + select + shift. The target-independent DAGcombiner will generate: asr w1, X, #31 w1 = splat sign bit. add X, X, w1, lsr #28 X = X + 0 or pow2-1 asr w0, X, asr #4 w0 = X/pow2 However, the add + shifts is expensive, so generate: add w0, X, 15 w0 = X + pow2-1 cmp X, wzr X - 0 csel X, w0, X, lt X = (X < 0) ? X + pow2-1 : X; asr w0, X, asr 4 w0 = X/pow2 llvm-svn: 213758	2014-07-23 14:57:52 +00:00
Robert Khasanov	74acbb7767	[SKX] Enabling mask instructions: encoding, lowering KMOVB, KMOVW, KMOVD, KMOVQ, KNOTB, KNOTW, KNOTD, KNOTQ Reviewed by Elena Demikhovsky <elena.demikhovsky@intel.com> llvm-svn: 213757	2014-07-23 14:49:42 +00:00
Tim Northover	14ff2df05c	ARM: spot SBFX-compatbile code expressed with sign_extend_inreg We were assuming all SBFX-like operations would have the shl/asr form, but often when the field being extracted is an i8 or i16, we end up with a SIGN_EXTEND_INREG acting on a shift instead. Simple enough to check for though. llvm-svn: 213754	2014-07-23 13:59:12 +00:00

1 2 3 4 5 ...

106056 Commits