llvm-project

Commit Graph

Author	SHA1	Message	Date
Stepan Dyatkovskiy	8baf17fc5f	PR18929: According to ARM assembler language hash symbol is optional before immediates. For example, see here for more details: http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.dui0473j/dom1359731154529.html llvm-svn: 205157	2014-03-30 17:09:54 +00:00
Hal Finkel	90adf0fe06	Make use of previously generated stores in SelectionDAGLegalize::ExpandExtractFromVectorThroughStack When expanding EXTRACT_VECTOR_ELT and EXTRACT_SUBVECTOR using SelectionDAGLegalize::ExpandExtractFromVectorThroughStack, we store the entire vector and then load the piece we want. This is fine in isolation, but generating a new store (and corresponding stack slot) for each extraction ends up producing code of poor quality. When we scalarize a vector operation (using SelectionDAG::UnrollVectorOp for example) we generate one EXTRACT_VECTOR_ELT for each element in the vector. This used to generate one stored copy of the vector for each element in the vector. Now we search the uses of the vector for a suitable store before generating a new one, which results in much more efficient scalarization code. llvm-svn: 205153	2014-03-30 15:10:18 +00:00
NAKAMURA Takumi	411d35e25e	llvm/test/MC/ELF/nocompression.s: Loosen an expression to match "llvm-mc.EXE". llvm-svn: 205148	2014-03-30 14:04:00 +00:00
Hal Finkel	5c0d1454d6	[PowerPC] Handle VSX v2i64 SIGN_EXTEND_INREG sitofp from v2i32 to v2f64 ends up generating a SIGN_EXTEND_INREG v2i64 node (and similarly for v2i16 and v2i8). Even though there are no sign-extension (or algebraic shifts) for v2i64 types, we can handle v2i32 sign extensions by converting two and from v2i64. The small trick necessary here is to shift the i32 elements into the right lanes before the i32 -> f64 step. This is because of the big Endian nature of the system, we need the i32 portion in the high word of the i64 elements. For v2i16 and v2i8 we can do the same, but we first use the default Altivec shift-based expansion from v2i16 or v2i8 to v2i32 (by casting to v4i32) and then apply the above procedure. llvm-svn: 205146	2014-03-30 13:22:59 +00:00
Chandler Carruth	9df0fd4018	[Allocator] Lift the slab size and size threshold into template parameters rather than runtime parameters. There is only one user of these parameters and they are compile time for that user. Making these compile time seems to better reflect their intended usage as well. llvm-svn: 205143	2014-03-30 12:07:07 +00:00
Chandler Carruth	a05a221e63	[Allocator] Simplify unittests by using the default size parameters in more places. llvm-svn: 205141	2014-03-30 11:36:32 +00:00
Chandler Carruth	f95623b790	[Allocator] Stop forward-declaring BumpPtrAllocator in a few places. This is a necessary step to lifting some of its configuration into template parameters rather than runtime parameters. llvm-svn: 205140	2014-03-30 11:36:29 +00:00
Chandler Carruth	a48ecb7639	Don't mark the declarations of the TSan annotation functions as weak. That causes references to them to be weak references which can collapse to null if no definition is provided. We call these functions unconditionally, so a definition must be provided. Make the definitions provided in the .cpp file weak by re-declaring them as weak just prior to defining them. This should keep compilers which cannot attach the weak attribute to the definition happy while actually resolving the symbols correctly during the link. You might ask yourself upon reading this commit log: how did any of this work before? Well, fun story. It turns out we have some code in Support (BumpPtrAllocator) which both uses virtual dispatch and has out-of-line vtables used by that virtual dispatch. If you move the virtual dispatch into its header in just the right way, the optimizer gets to devirtualize, and remove all references to the vtable. Then the sad part: the references to this one vtable were the only strong symbol uses in the support library for llvm-tblgen AFAICT. At least, after doing something just like this, these symbols stopped getting their weak definition and random calls to them would segfault instead. Yay software. llvm-svn: 205137	2014-03-30 11:20:25 +00:00
Chandler Carruth	81f7061065	[ARM64] Fix a heap-use-after-free spotted by ASan. StringRef::lower() returns a std::string. Better yet, we can now stop thinking about what it returns and write 'auto'. It does the right thing. =] llvm-svn: 205135	2014-03-30 09:08:07 +00:00
Tim Northover	bf679cec67	ARM64: uncopy/paste helper function It was doing functional but highly suspect operations on bools due to the more limited shifting operands supported by memory instructions. Should fix some MSVC warnings. llvm-svn: 205134	2014-03-30 08:30:28 +00:00
Tim Northover	6b3258f087	ARM64: remove unused variables llvm-svn: 205133	2014-03-30 07:35:48 +00:00
Tim Northover	af6bfb21cd	ARM64: remove -m32/-m64 mapping with ARM. This is causing the ARM build-bots to fail since they only include the ARM backend and can't create an ARM64 target. llvm-svn: 205132	2014-03-30 07:25:23 +00:00
Tim Northover	3e52557212	ARM64: override all the things. Actually, mostly only those in the top-level directory that already had a "virtual" attached. But it's the thought that counts and it's been a long day. llvm-svn: 205131	2014-03-30 07:25:18 +00:00
Saleem Abdulrasool	f80b49b5d2	Support: correct Windows normalisation If the environment is unknown and no object file is provided, then assume an "MSVC" environment, otherwise, set the environment to the object file format. In the case that we have a known environment but a non-native file format for Windows (COFF) which is used for MCJIT, then append the custom file format to the triple as an additional component. This fixes the MCJIT tests on Windows. llvm-svn: 205130	2014-03-30 07:19:31 +00:00
NAKAMURA Takumi	d423225159	Suppress llvm/test/CodeGen/ARM64 for targeting pecoff. ARM64 is unaware of that. FIXME: Could we support them? llvm-svn: 205126	2014-03-30 05:01:17 +00:00
NAKAMURA Takumi	4cf1a3be82	llvm/test/Transforms/LoopStrengthReduce/ARM64/lsr-*.ll: Add explicit triple arm64-unknown for targeting pecoff. llvm-svn: 205125	2014-03-30 05:01:04 +00:00
NAKAMURA Takumi	09717bd1c4	X86Subtarget.h: isTargetWindows() should tell whether he is targeting msvc. FYI, !isWindowsGNUEnvironment() is insufficient. It missed cygwin. FIXME: The name "isTargetWindows" should be fixed. llvm-svn: 205124	2014-03-30 04:35:00 +00:00
Lang Hames	c339840666	[MC] Remove an unused (and broken) variant of the setupForSymbolicDisassembly method in MCDisassembler. llvm-svn: 205123	2014-03-30 04:27:33 +00:00
Lang Hames	652b0a4f3b	[PBQP] Move invalid graph nodeId/edgeId methods into base class. llvm-svn: 205122	2014-03-30 03:47:00 +00:00
Rafael Espindola	5e66a7e699	Add a missing break. Patch by Tobias Güntner. I tried to write a test, but the only difference is the Changed value that gets returned. It can be tested with "opt -debug-pass=Executions -functionattrs, but that doesn't seem worth it. llvm-svn: 205121	2014-03-30 03:26:17 +00:00
Saleem Abdulrasool	ceec2cba64	Support: normalize the default triple on Unix This will fix cross-compiling buildbots (e.g. cygwin). This is in the same vein as SVN r205070. Apply this to fix the cross-compiling scenario, even though the preferred solution is to update the build system to normalize the embedded triple rather than perform this at runtime every time. This is meant to tide us over until that approach is fleshed out and applied. llvm-svn: 205120	2014-03-30 03:22:37 +00:00
Rafael Espindola	986b14c507	Remove dead declarations. Patch by Tobias Güntner. llvm-svn: 205119	2014-03-30 02:33:01 +00:00
Benjamin Kramer	4d951abf1f	Remove outdated comment. llvm-svn: 205117	2014-03-29 20:16:23 +00:00
Dmitri Gribenko	1fd72104ad	Fix a few -Wdocumentation warnings llvm-svn: 205116	2014-03-29 19:40:32 +00:00
Benjamin Kramer	3ad660a515	Detemplatize LOHDirective. The ARM64 backend uses it only as a container to keep an MCLOHType and Arguments around so give it its own little copy. The other functionality isn't used and we had a crazy method specialization hack in place to keep it working. Unfortunately that was incompatible with MSVC. Also range-ify a couple of loops while at it. llvm-svn: 205114	2014-03-29 19:21:20 +00:00
Benjamin Kramer	61e595be4d	ARM64: Remove unused helper function, make others static. llvm-svn: 205112	2014-03-29 18:00:49 +00:00
Benjamin Kramer	48e7e85d29	tblgen: Twinify PrintFatalError. No functionality change. llvm-svn: 205110	2014-03-29 17:17:15 +00:00
Tim Northover	4e55afed7e	TableGen: don't save a StringRef to a local std::string. This caused a failure in some Windows builds. llvm-svn: 205109	2014-03-29 16:59:27 +00:00
Benjamin Kramer	fd719b9551	Avoid storing Twines. While there nested ifs into a helper function. No functionality change. llvm-svn: 205108	2014-03-29 16:54:29 +00:00
Hal Finkel	777c9dd90a	[PowerPC] Handle v2i64 comparisons v2i64 is a legal type under VSX, however we don't have native vector comparisons. We can handle eq/ne by casting it to an Altivec type, but everything else must be expanded. llvm-svn: 205106	2014-03-29 16:04:40 +00:00
Tim Northover	adbd34e045	ARM64: format register strings without creating a local Twine. It was causing horrible failures on some build-bots. llvm-svn: 205105	2014-03-29 15:35:57 +00:00
Logan Chien	8faefa2d31	llvm-mc: Fix build breakage caused by r205050. When LLVM is not built with zlib, nocompression.s will test for the error message. But this test case will cause breakage because the exit code is non-zero. This commit fix this issue by adding "not" to the command. llvm-svn: 205102	2014-03-29 15:10:22 +00:00
Hal Finkel	e8fba98735	[PowerPC] VSX instruction latency corrections The vector divide and sqrt instructions have high latencies, and the scalar comparisons are like all of the others. On the P7, permutations take an extra cycle over purely-simple vector ops. llvm-svn: 205096	2014-03-29 13:20:31 +00:00
Stepan Dyatkovskiy	df657cc1d5	Recommitted fix for PR18931, with extended tests set. Issue subject: Crash using integrated assembler with immediate arithmetic Fix description: Expressions like 'cmp r0, #(l1 - l2) >> 3' could not be evaluated on asm parsing stage, since it is impossible to resolve labels on this stage. In the end of stage we still have expression (MCExpr). Then, when we want to encode it, we expect it to be an immediate, but it still an expression. Patch introduces a Fixup (MCFixup instance), that is processed after main encoding stage. llvm-svn: 205094	2014-03-29 13:12:40 +00:00
Tim Northover	2125374ecf	ARM64: use 64-bit constant even on 32-bit machines Another existing bot failure so no tests. llvm-svn: 205093	2014-03-29 11:51:49 +00:00
Tim Northover	2011df293d	ARM64: change format specifier to work on 32-bit targets Existing tests were failing. llvm-svn: 205092	2014-03-29 11:47:07 +00:00
Chandler Carruth	7b7a67c5c8	[ARM64] Fix 'assert("...")' to be 'assert(0 && "...")'. Otherwise, it is no assert at all. ;] Some of these should probably be switched to llvm_unreachable, but I didn't want to perturb the behavior in this patch. Found by -Wstring-conversion, which I'll try to turn on in CMake builds at least as it is finding useful things. llvm-svn: 205091	2014-03-29 11:07:40 +00:00
Tim Northover	00ed9964c6	ARM64: initial backend import This adds a second implementation of the AArch64 architecture to LLVM, accessible in parallel via the "arm64" triple. The plan over the coming weeks & months is to merge the two into a single backend, during which time thorough code review should naturally occur. Everything will be easier with the target in-tree though, hence this commit. llvm-svn: 205090	2014-03-29 10:18:08 +00:00
Tim Northover	3e38d290c8	TableGen: avoid dereferencing nullptr variable ARM64 ended up reaching odder parts of TableGen alias generation than current backends and caused a segfault. llvm-svn: 205089	2014-03-29 09:03:22 +00:00
Tim Northover	753eca0f78	CodeGen: add sensible defaults for the ISD::FROUND operation Some exotic types didn't know how to handle FROUND, which ARM64 uses. llvm-svn: 205088	2014-03-29 09:03:18 +00:00
Tim Northover	d1c6f51730	MC-exceptions: add support for compact-unwind without .eh_frame ARM64 has compact-unwind information, but doesn't necessarily want to emit .eh_frame directives as well. This teaches MC about such a situation so that it will skip .eh_frame info when compact unwind has been successfully produced. For functions incompatible with compact unwind, the normal information is still written. llvm-svn: 205087	2014-03-29 09:03:13 +00:00
Tim Northover	cea0abb60a	CodeGenPrep: wrangle IR to exploit AArch64 tbz/tbnz inst. Given IR like: %bit = and %val, #imm-with-1-bit-set %tst = icmp %bit, 0 br i1 %tst, label %true, label %false some targets can emit just a single instruction (tbz/tbnz in the AArch64 case). However, with ISel acting at the basic-block level, all three instructions need to be together for this to be possible. This adds another transformation to CodeGenPrep to expose these opportunities, if targets opt in via the hook. llvm-svn: 205086	2014-03-29 08:22:29 +00:00
Tim Northover	0999cbd0b9	MC: add a RefKind field to MCValue This is principally to allow neater mapping of fixups to relocations in ARM64 ELF. Without this, there isn't enough information available to GetRelocType, leading to many more fixup_arm64_... enumerators. llvm-svn: 205085	2014-03-29 08:22:20 +00:00
Tim Northover	53d3251851	MachO: Add linker-optimisation hint framework to MC. Another part of the ARM64 backend (so tests will be following soon). This is currently used by the linker to relax adrp/ldr pairs into nops where possible, though could well be more broadly applicable. llvm-svn: 205084	2014-03-29 07:34:53 +00:00
Tim Northover	5627670e84	MachO: actually set linker-private prefix at MC level. This was accidentally omitted from r205081. llvm-svn: 205083	2014-03-29 07:33:24 +00:00
Tim Northover	c3988b4aa3	MachO: allow each section to have a linker-private symbol The upcoming ARM64 backend doesn't have section-relative relocations, so we give each section its own symbol to provide this functionality. Of course, it doesn't need to appear in the final executable, so linker-private is the best kind for this purpose. llvm-svn: 205081	2014-03-29 07:05:06 +00:00
Tim Northover	9086f061f0	Make GetCPISymbol a virtual method. ARM64 for iOS is going to want to emit these symbols in a linker-private style for efficiency, but other targets probably don't want that behaviour. llvm-svn: 205080	2014-03-29 07:04:59 +00:00
Tim Northover	4516de3412	Intrinsics: add LLVMHalfElementsVectorType constraint This is like the LLVMMatchType, except the verifier checks that the second argument is a vector with the same base type and half the number of elements. This will be used by the ARM64 backend. llvm-svn: 205079	2014-03-29 07:04:54 +00:00
Rafael Espindola	5904e12bfa	Completely rewrite ELFObjectWriter::RecordRelocation. I started trying to fix a small issue, but this code has seen a small fix too many. The old code was fairly convoluted. Some of the issues it had: * It failed to check if a symbol difference was in the some section when converting a relocation to pcrel. * It failed to check if the relocation was already pcrel. * The pcrel value computation was wrong in some cases (relocation-pc.s) * It was missing quiet a few cases where it should not convert symbol relocations to section relocations, leaving the backends to patch it up. * It would not propagate the fact that it had changed a relocation to pcrel, requiring a quiet nasty work around in ARM. * It was missing comments. llvm-svn: 205076	2014-03-29 06:26:49 +00:00
Hal Finkel	19be506a5e	[PowerPC] Add subregister classes for f64 VSX values We had stored both f64 values and v2f64, etc. values in the VSX registers. This worked, but was suboptimal because we would always spill 16-byte values even through we almost always had scalar 8-byte values. This resulted in an increase in stack-size use, extra memory bandwidth, etc. To fix this, I've added 64-bit subregisters of the Altivec registers, and combined those with the existing scalar floating-point registers to form a class of VSX scalar floating-point registers. The ABI code has also been enhanced to use this register class and some other necessary improvements have been made. llvm-svn: 205075	2014-03-29 05:29:01 +00:00
Saleem Abdulrasool	37511ecea8	Windows: canonicalise the default windows triple Canonicalise the default triple that is used on Windows. This should hopefully fix the MSVC buildbots. llvm-svn: 205070	2014-03-29 01:08:53 +00:00
Akira Hatanaka	9afbb8c2b1	[x86] Fix printing of register operands with q modifier. Emit 32-bit register names instead of 64-bit register names if the target does not have 64-bit general purpose registers. <rdar://problem/14653996> llvm-svn: 205067	2014-03-28 23:28:07 +00:00
David Blaikie	dca7c7c5f1	Debug Compression: Avoid compression debug_frame for now Turns out debug_frame does use multiple fragments, so it doesn't compress correctly with the current approach. Disable compressing it for now while I figure out what's the best solution for it. llvm-svn: 205059	2014-03-28 21:48:31 +00:00
David Majnemer	02f2188bb9	X86: Disable IsLegalToCallImmediateAddr for Win32 WinCOFF cannot form PC relative relocations to support absolute MCValues. We should reenable this once WinCOFF supports emission of IMAGE_REL_I386_REL32 relocations. This fixes PR19272. llvm-svn: 205058	2014-03-28 21:40:47 +00:00
David Blaikie	9c3857cb5e	Add missing include (for r205050) llvm-svn: 205053	2014-03-28 21:00:25 +00:00
David Blaikie	9b620b451a	llvm-mc: error when -compress-debug-sections is requested and zlib is not linked This is a bit of a stab in the dark, since I have zlib on my machine. Just going to bounce it off the bots & see if it sticks. Do we have some convention for negative REQUIRES: checks? Or do I just need to add a feature like I've done here? llvm-svn: 205050	2014-03-28 20:45:24 +00:00
Hal Finkel	2583b06310	[PowerPC] Fix VSX permutation isel Not only did I invert the indices when I wrote the code, but I also did the same thing when I wrote the regression test. Oops. llvm-svn: 205046	2014-03-28 20:24:55 +00:00
Rafael Espindola	950667a331	Convert one last llc -filetype=obj test. Unfortunately this one fails deep inside the mips backend, so xfail it. llvm-svn: 205042	2014-03-28 19:58:24 +00:00
Hal Finkel	7811c6188e	[PowerPC] v2[fi]64 need to be explicitly passed in VSX registers v2[fi]64 values need to be explicitly passed in VSX registers. This is because the code in TRI that finds the minimal register class given a register and a value type will assert if given an Altivec register and a non-Altivec type. llvm-svn: 205041	2014-03-28 19:58:11 +00:00
Rafael Espindola	b7dda8ebbc	Convert llc -filetype=obj test. llvm-svn: 205040	2014-03-28 19:41:33 +00:00
Rafael Espindola	249626a29f	Convert llc -filetype=obj test. llvm-svn: 205039	2014-03-28 19:38:20 +00:00
Rafael Espindola	da52f8c28c	Remove bogus test. It was using "lc -filetype=obj" just to pass the result to "llvm-objdupm -disassemble" and then filecheck assembly. The CHECK-NOT would never match anyway since it was missing $. llvm-svn: 205036	2014-03-28 19:26:05 +00:00
Rafael Espindola	8e18d3891e	Convert another llc -filetype=obj test. llvm-svn: 205033	2014-03-28 19:19:28 +00:00
Justin Bogner	96ba627007	Support: Functions for writing endian specific data to streams. This adds a new header, EndianStream.h, which supplies an adaptor for writing endian specific data to a raw_ostream. llvm-svn: 205032	2014-03-28 19:14:43 +00:00
Rafael Espindola	c44c26b4e1	Map ELf flags back to more specific section kinds. With that, convert another llc -filetype=obj test. llvm-svn: 205031	2014-03-28 19:14:08 +00:00
Rafael Espindola	b59fb7347a	Parse .gpdword and convert another llc -filetype=obj test. llvm-svn: 205028	2014-03-28 18:50:26 +00:00
Rafael Espindola	c9a688ab78	convert another llc -filetype=obj test. llvm-svn: 205027	2014-03-28 18:34:31 +00:00
Rafael Espindola	441f4acd9f	Convert "llc -filetype=obj" test into llvm-mc tests. llvm-svn: 205026	2014-03-28 18:30:07 +00:00
Arnold Schwaighofer	c9d58e8d32	SLPVectorizer: Take credit for free extractelement instructions Extract element instructions that will be removed when vectorzing lower the cost. Patch by Arch D. Robison! llvm-svn: 205020	2014-03-28 17:21:32 +00:00
Arnold Schwaighofer	b0d3bcdd32	SLPVectorizer: Fix typos Patch by Arch D. Robison! llvm-svn: 205019	2014-03-28 17:21:27 +00:00
Arnold Schwaighofer	b190cb30c3	SLPVectorizer: Ignore users that are insertelements we can reschedule them Patch by Arch D. Robison! llvm-svn: 205018	2014-03-28 17:21:22 +00:00
Mark Seaborn	f8388a7cb6	Exception handling docs: Clarify how the llvm.eh.* intrinsics are used The non-SJLJ and SJLJ intrinsics are generated by the frontend and backend respectively. Differential Revision: http://llvm-reviews.chandlerc.com/D3010 llvm-svn: 205017	2014-03-28 17:08:57 +00:00
David Blaikie	ff9a069a32	Only test compression when linked with zlib. I'll implement error handling and a negative test in both llvm-mc and Clang soon. llvm-svn: 205016	2014-03-28 17:04:53 +00:00
Rafael Espindola	d7610a5d67	Add const to a method I missed in the previous commit. llvm-svn: 205014	2014-03-28 16:14:12 +00:00
Rafael Espindola	3e3de5e353	Add const. llvm-svn: 205013	2014-03-28 16:06:09 +00:00
Erik Verbruggen	5e1bac3a38	Revert "InstCombine: merge constants in both operands of icmp." This reverts commit r204912, and follow-up commit r204948. This introduced a performance regression, and the fix is not completely clear yet. llvm-svn: 205010	2014-03-28 14:50:57 +00:00
Erik Verbruggen	2074ebd8af	Revert "GVN: merge overflow intrinsics with non-overflow instructions." This reverts commit r203553, and follow-up commits r203558 and r203574. I will follow this up on the mailinglist to do it in a way that won't cause subtle PRE bugs. llvm-svn: 205009	2014-03-28 14:42:34 +00:00
Christian Pirker	2a11160956	Add ARM big endian Target (armeb, thumbeb) Reviewed at http://llvm-reviews.chandlerc.com/D3095 llvm-svn: 205007	2014-03-28 14:35:30 +00:00
Tim Northover	24f46618b2	R600: avoid calling std::next on an iterator that might be end() This was causing my llc to go into an infinite loop on CodeGen/R600/address-space.ll (just triggered recently by some allocator changes). llvm-svn: 205005	2014-03-28 13:52:56 +00:00
Tim Northover	aa3cf1e691	Intrinsics: expand semantics of LLVMExtendedVectorType (& trunc) These are used in the ARM backends to aid type-checking on patterns involving intrinsics. By making sure one argument is an extended/truncated version of another. However, there's no reason to limit them to just vectors types. For example AArch64 has the instruction "uqshrn sD, dN, #imm" which would naturally use an intrinsic taking an i64 and returning an i32. llvm-svn: 205003	2014-03-28 12:31:39 +00:00
Chandler Carruth	1788325fda	[Allocator Cleanup] Sink the private data members and methods to the bottom of the interface to make it easier to scan and find the public API. No functionality changed. llvm-svn: 204996	2014-03-28 09:18:42 +00:00
Chandler Carruth	2c540f62bf	[Allocator Cleanup] Move generic pointer alignment helper out of an out-of-line private static method and into the collection of inline alignment helpers in MathExtras.h. llvm-svn: 204995	2014-03-28 09:08:14 +00:00
Chandler Carruth	3b56b9cf90	[Allocator Cleanup] Make the growth of the "slab" size of the BumpPtrAllocator significantly less strange by making it a simple function of the number of slabs allocated rather than by making it a recurrance. I think the previous behavior was essentially that the size of the slabs would be doubled after the first 128 were allocated, and then doubled again each time 64 more were allocated, but only if every allocation packed perfectly into the slab size. If not, the wasted space wouldn't be counted toward increasing the size, but allocations over the size threshold would. And since the allocations over the size threshold might be much larger than the slab size, this could have somewhat surprising consequences where we rapidly grow the slab size. This currently requires adding state to the allocator to track the number of slabs currently allocated, but that isn't too bad. I'm planning further changes to the allocator that will make this state fall out even more naturally. It still doesn't fully decouple the growth rate from the allocations which are over the size threshold. That fix is coming later. This specific fix will allow making the entire thing into a more stateless device and lifting the parameters into template parameters rather than runtime parameters. llvm-svn: 204993	2014-03-28 08:53:25 +00:00
Chandler Carruth	ead0f76443	[cleanup] Hoist the initialization and constants for slab sizes to the top of the default jit memory manager. This will allow them to be used as template parameters rather than runtime parameters in a subsequent commit. llvm-svn: 204992	2014-03-28 08:53:08 +00:00
David Blaikie	cacce82c4d	PBQP: Minor cleanups to r204857 * Use assignment instead of swap (since the original value is being destroyed anyway) * Rename "updateAdjEdgeId" to "setAdjEdgeId" llvm-svn: 204983	2014-03-27 23:42:21 +00:00
Adrian Prantl	79c8e8f046	C++11: convert verbose loops to range-based loops. llvm-svn: 204981	2014-03-27 23:30:04 +00:00
Hal Finkel	c6fc9b8960	[PowerPC] Use a small cleanup pass to remove VSX self copies As explained in r204976, because of how the allocation of VSX registers interacts with the call-lowering code, we sometimes end up generating self VSX copies. Specifically, things like this: %VSL2<def> = COPY %F2, %VSL2<imp-use,kill> (where %F2 is really a sub-register of %VSL2, and so this copy is a nop) This adds a small cleanup pass to remove these prior to post-RA scheduling. llvm-svn: 204980	2014-03-27 23:12:31 +00:00
Manman Ren	ed0de1368d	Provide a target override for the cost of using a callee-saved register for the first time. Thanks Andy for the discussion. rdar://16162005 llvm-svn: 204979	2014-03-27 23:10:04 +00:00
Saleem Abdulrasool	edbdd2e5df	Canonicalise Windows target triple spellings Construct a uniform Windows target triple nomenclature which is congruent to the Linux counterpart. The old triples are normalised to the new canonical form. This cleans up the long-standing issue of odd naming for various Windows environments. There are four different environments on Windows: MSVC: The MS ABI, MSVCRT environment as defined by Microsoft GNU: The MinGW32/MinGW32-W64 environment which uses MSVCRT and auxiliary libraries Itanium: The MSVCRT environment + libc++ built with Itanium ABI Cygnus: The Cygwin environment which uses custom libraries for everything The following spellings are now written as: i686-pc-win32 => i686-pc-windows-msvc i686-pc-mingw32 => i686-pc-windows-gnu i686-pc-cygwin => i686-pc-windows-cygnus This should be sufficiently flexible to allow us to target other windows environments in the future as necessary. llvm-svn: 204977	2014-03-27 22:50:05 +00:00
Hal Finkel	9dcb3583d5	[PowerPC] Don't remove self VSX copies in PPCInstrInfo::copyPhysReg Because of how the allocation of VSX registers interacts with the call-lowering code, we sometimes end up generating self VSX copies. Specifically, things like this: %VSL2<def> = COPY %F2, %VSL2<imp-use,kill> (where %F2 is really a sub-register of %VSL2, and so this copy is a nop) The problem is that ExpandPostRAPseudos always assumes that some instruction has been inserted, and adds implicit defs to it. This is a problem if no copy was inserted because it can cause subtle problems during post-RA scheduling. These self copies will have to be removed some other way. llvm-svn: 204976	2014-03-27 22:46:28 +00:00
Lang Hames	de76f4a39f	Temporarily remove assert while I dig in to issues that it's causing for LLDB. <rdar://problem/16349536> llvm-svn: 204975	2014-03-27 22:45:42 +00:00
Rui Ueyama	48d9138c69	Revert "[C++11] Do not check __GXX_EXPERIMENTAL_CXX0X__." This reverts commit r204964 because it disabled "= delete", "constexpr" and "explicit" on GCC. llvm-svn: 204973	2014-03-27 22:36:06 +00:00
Quentin Colombet	85b904d875	[X86][Vector Cost Model] Add a comment to explain the workaround in my previous commit (r204884). <rdar://problem/16381225> llvm-svn: 204972	2014-03-27 22:27:41 +00:00
Hal Finkel	82569b6366	[PowerPC] Fix v2f64 vector extract and related patterns First, v2f64 vector extract had not been declared legal (and so the existing patterns were not being used). Second, the patterns for that, and for scalar_to_vector, should really be a regclass copy, not a subregister operation, because the VSX registers directly hold both the vector and scalar data. llvm-svn: 204971	2014-03-27 22:22:48 +00:00
Rui Ueyama	fffa311e9e	[C++11] Do not check __GXX_EXPERIMENTAL_CXX0X__. Summary: Checking the experimental flag for C++0x is no longer needed. Differential Revision: http://llvm-reviews.chandlerc.com/D3206 llvm-svn: 204964	2014-03-27 21:56:29 +00:00
Hal Finkel	ad801b7459	[PowerPC] Expand v2i64 shifts These operations need to be expanded during legalization so that isel does not crash. In theory, we might be able to custom lower some of these. That, however, would need to be follow-up work. llvm-svn: 204963	2014-03-27 21:26:33 +00:00
Manman Ren	9dee449ee3	Register Allocator: refactoring and add comments. No functionality change. Thanks Andy for reviewing. rdar://16162005 llvm-svn: 204962	2014-03-27 21:21:57 +00:00
Rafael Espindola	c03f44ca8a	Remove another unused argument. llvm-svn: 204961	2014-03-27 20:49:35 +00:00
Hans Wennborg	a3c3b8104d	Win installer: provide a pretty icon llvm-svn: 204960	2014-03-27 20:48:37 +00:00
David Blaikie	7400a97952	DebugInfo: Support for compressed debug info sections 1) When creating a .debug_* section and instead create a .zdebug_ section. 2) When creating a fragment in a .zdebug_* section, make it a compressed fragment. 3) When computing the size of a compressed section, compress the data and use the size of the compressed data. 4) Emit the compressed bytes. Also, check that only if a section has a compressed fragment, then that is the only fragment in the section. Assert-fail if the fragment's data is modified after it is compressed. Initial review on llvm-commits by Eric Christopher and Rafael Espindola. llvm-svn: 204958	2014-03-27 20:45:58 +00:00
David Blaikie	70bd1fd22f	DebugInfo: TargetOptions/MCAsmInfo support for compressed debug info sections llvm-svn: 204957	2014-03-27 20:45:41 +00:00
Rafael Espindola	9ab380122a	Remove unused argument. llvm-svn: 204956	2014-03-27 20:41:17 +00:00
Reid Kleckner	3bdf9bc48b	InstCombine: Don't combine constants on unsigned icmps Fixes a miscompile introduced in r204912. It would miscompile code like (unsigned)(a + -49) <= 5U. The transform would turn this into (unsigned)a < 55U, which would return true for values in [0, 49], when it should not. llvm-svn: 204948	2014-03-27 17:49:27 +00:00
Matt Arsenault	b517c8128e	R600: Implement isZExtFree. This allows 64-bit operations that are truncated to be reduced to 32-bit ones. llvm-svn: 204946	2014-03-27 17:23:31 +00:00
Matt Arsenault	d125d74a73	R600/SI: Fix unreachable with a sext_in_reg to an illegal type. llvm-svn: 204945	2014-03-27 17:23:24 +00:00
Daniel Sanders	5e94e68f7b	[mips] Some uses of isMips64()/hasMips64() are really tests for 64-bit GPR's Summary: No functional change since these predicates are (currently) synonymous. Extracted from a patch by David Chisnall His work was sponsored by: DARPA, AFRL Differential Revision: http://llvm-reviews.chandlerc.com/D3202 llvm-svn: 204943	2014-03-27 16:42:17 +00:00
Logan Chien	30eb9f47c6	[AArch64] Lower SHL_PARTS, SRA_PARTS and SRL_PARTS Lower SHL_PARTS, SRA_PARTS and SRL_PARTS to perform 128-bit integer shift Patch by GuanHong Liu. llvm-svn: 204940	2014-03-27 16:28:09 +00:00
Rafael Espindola	24a669d225	Prevent alias from pointing to weak aliases. This adds back r204781. Original message: Aliases are just another name for a position in a file. As such, the regular symbol resolutions are not applied. For example, given define void @my_func() { ret void } @my_alias = alias weak void ()* @my_func @my_alias2 = alias void ()* @my_alias We produce without this patch: .weak my_alias my_alias = my_func .globl my_alias2 my_alias2 = my_alias That is, in the resulting ELF file my_alias, my_func and my_alias are just 3 names pointing to offset 0 of .text. That is not the semantics of IR linking. For example, linking in a @my_alias = alias void ()* @other_func would require the strong my_alias to override the weak one and my_alias2 would end up pointing to other_func. There is no way to represent that with aliases being just another name, so the best solution seems to be to just disallow it, converting a miscompile into an error. llvm-svn: 204934	2014-03-27 15:26:56 +00:00
Daniel Sanders	64cf5a4eb2	[mips] Attempting to use register $32 should be an error instead of an assertion. Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3201 llvm-svn: 204932	2014-03-27 15:00:44 +00:00
Aaron Ballman	be648a3c16	The forward declare should be a struct instead of a class (to be consistent with the definition, as well as to silence an MSVC C4099 warning). llvm-svn: 204928	2014-03-27 14:10:00 +00:00
Daniel Sanders	5bce5f6245	[mips] Add support for .cpsetup Summary: Patch by Robert N. M. Watson His work was sponsored by: DARPA, AFRL Small corrections by myself. CC: theraven, matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3199 llvm-svn: 204924	2014-03-27 13:52:53 +00:00
Daniel Sanders	bd0e39079a	[mips] The decision between GOT_DISP and GOT16 for global addresses depends on ABI rather than MIPS64 Summary: No functional change (for supported use cases) Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3191 llvm-svn: 204922	2014-03-27 12:49:34 +00:00
Zoran Jovanovic	ada38ef61b	Split the file MipsAsmBackend.cpp in Split the file MipsAsmBackend.cpp and Split the file MipsAsmBackend.h. Differential Revision: http://llvm-reviews.chandlerc.com/D3134 llvm-svn: 204921	2014-03-27 12:38:40 +00:00
Karthik Bhat	82540e9ef8	All new elements except the last one initialized to NULL. Ideally, once parsing is complete, all elements should be non-NULL. To safe-guard BitcodeReader, this patch adds null check for all access to these list. Patch by Dinesh Dwivedi! llvm-svn: 204920	2014-03-27 12:08:23 +00:00
Matheus Almeida	a805e85cb2	[mips] Remove unused private field. llvm-svn: 204919	2014-03-27 12:02:48 +00:00
Matheus Almeida	61218ba798	[mips] NaCl should now use the custom MipsELFStreamer (recently added) in spite of MCELFStreamer. This is so that changes to MipsELFStreamer will automatically propagate through its subclasses. No functional changes (MipsELFStreamer has the same functionality of MCELFStreamer at the moment). Differential Revision: http://llvm-reviews.chandlerc.com/D3130 llvm-svn: 204918	2014-03-27 11:52:20 +00:00
Matheus Almeida	dac77fb389	[mips] Implement custom MCELFStreamer. This allows us to insert some hooks before emitting data into an actual object file. For example, we can capture the register usage for a translation unit by overriding the EmitInstruction method. The register usage information is needed to generate .reginfo and .Mips.options ELF sections. No functional changes. Differential Revision: http://llvm-reviews.chandlerc.com/D3129 llvm-svn: 204917	2014-03-27 11:39:03 +00:00
NAKAMURA Takumi	cb5ebf66df	Untabify. llvm-svn: 204916	2014-03-27 11:38:28 +00:00
NAKAMURA Takumi	cce8a58202	SmallVector<3> may be used here. llvm-svn: 204915	2014-03-27 11:33:11 +00:00
NAKAMURA Takumi	be8556d17a	IRTests/InstructionsTest.cpp: Avoid initializer list. llvm-svn: 204914	2014-03-27 11:32:41 +00:00
Erik Verbruggen	59a1219846	InstCombine: merge constants in both operands of icmp. Transform: icmp X+Cst2, Cst into: icmp X, Cst-Cst2 when Cst-Cst2 does not overflow, and the add has nsw. llvm-svn: 204912	2014-03-27 11:16:05 +00:00
Daniel Sanders	d897b564ca	[mips] Stop caching the result of hasMips64(), isABI_O32(), isABI_N32(), and isABI_N64() from MipsSubTarget in MipsTargetLowering Summary: The short name is quite convenient so provide an accessor for them instead. No functional change Depends on D3177 Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3178 llvm-svn: 204911	2014-03-27 10:46:12 +00:00
Chandler Carruth	f17da19f34	[cleanup] Run clang-format over these routines to remove formatting differences from subsequent diffs, and ease review. Going to be performing some major surgery to simplify this stuff. llvm-svn: 204908	2014-03-27 09:56:23 +00:00
Chandler Carruth	e961abaada	[cleanup] Modernize doxygen comments for the BumpPtrAllocator and rewrite some of them to be more clear. The terminology being used in our allocators is making me really sad. We call things slab allocators that aren't at all slab allocators. It is quite confusing. llvm-svn: 204907	2014-03-27 09:53:31 +00:00
Elena Demikhovsky	bb2f6b72d3	AVX-512: Implemented masking for integer arithmetic & logic instructions. By Robert Khasanov rob.khasanov@gmail.com llvm-svn: 204906	2014-03-27 09:45:08 +00:00
Timur Iskhodzhanov	b714601f8f	Add a PR reference llvm-svn: 204904	2014-03-27 08:52:14 +00:00
Timur Iskhodzhanov	7eaa8257fc	Make the recent COFF debug info tests more readable llvm-svn: 204902	2014-03-27 08:46:44 +00:00
Stepan Dyatkovskiy	e8747e30ef	Rejected r204899 and r204900 due to remaining test failures on cmake-llvm-x86_64-linux buildbot. llvm-svn: 204901	2014-03-27 08:38:18 +00:00
Stepan Dyatkovskiy	4920628815	Fixed test for r204899 (pr18931 fix) llvm-svn: 204900	2014-03-27 08:20:26 +00:00
Stepan Dyatkovskiy	3530003008	Fix for pr18931: Crash using integrated assembler with immediate arithmetic Fix description: Expressions like 'cmp r0, #(l1 - l2) >> 3' could not be evaluated on asm parsing stage, since it is impossible to resolve labels on this stage. In the end of stage we still have expression (MCExpr). Then, when we want to encode it, we expect it to be an immediate, but it still an expression. Patch introduces a Fixup (MCFixup instance), that is processed after main encoding stage. llvm-svn: 204899	2014-03-27 07:49:39 +00:00
Jiangning Liu	1d3f2c7c82	ARM: raise error message when complex SO expressions can't really be solved as a constant at compilation time. llvm-svn: 204898	2014-03-27 07:42:58 +00:00
Lang Hames	219e520661	Add missing #include <cassert> to MCSymbolizer.h. llvm-svn: 204894	2014-03-27 02:58:32 +00:00
Lang Hames	f03e953a7f	Assert that MCSymbolizer is constructed with a valid (or at least non-null) RelocationInfo argument. llvm-svn: 204893	2014-03-27 02:49:18 +00:00
Lang Hames	2768d26a62	Move MCSymbolizer's constructor into header. It's trivial - there's no need for it to be out-of-line. llvm-svn: 204892	2014-03-27 02:42:52 +00:00
Lang Hames	eb37092342	Update MCSymbolizer and its subclasses' constructors to reflect the fact that they take ownership of the RelocationInfo they're constructed with. llvm-svn: 204891	2014-03-27 02:39:01 +00:00
Reid Kleckner	2f8e3001e0	inalloca: Really fix the docs llvm-svn: 204890	2014-03-27 01:38:48 +00:00
Reid Kleckner	a4b6a6257d	Remove unneeded stale type. llvm-svn: 204889	2014-03-27 01:34:51 +00:00
Reid Kleckner	24e3f7cceb	inalloca: Fix incorrect example IR and remove LangRef warning The LangRef warning wasn't formatting the way I intended it to anyway. Surprisingly inalloca appears to work, even when optimizations are enabled. We generate very bad code for it, but we can self-host and run lots of big tests. llvm-svn: 204888	2014-03-27 01:32:22 +00:00
Lang Hames	69247821a2	Remove forward declaration for Target class - Target is already defined here. No functional change. llvm-svn: 204885	2014-03-27 01:05:49 +00:00
Quentin Colombet	3914bf516b	[X86][Vectorizer Cost Model] Correct vectorization cost model for v2i64->v2f64 and v4i64->v4f64. The new costs match what we did for SSE2 and reflect the reality of our codegen. <rdar://problem/16381225> llvm-svn: 204884	2014-03-27 00:52:16 +00:00
Rafael Espindola	a041ef1bd8	Correctly propagates st_size. This also finally removes a bogus call to AliasedSymbol. llvm-svn: 204883	2014-03-27 00:28:24 +00:00
Jim Grosbach	6373e70f81	add 'requires asserts' to test that needs it llvm-svn: 204882	2014-03-27 00:20:42 +00:00
Justin Bogner	66ddccb160	llvm-cov: When reading strings in gcov data, skip leading zeros It seems that gcov, when faced with a string that is apparently zero length, just keeps reading words until it finds a length it likes better. I'm not really sure why this is, but it's simple enough to make llvm-cov follow suit. llvm-svn: 204881	2014-03-27 00:06:36 +00:00
Jim Grosbach	72fbde84b8	X86: Correct vectorization cost model for v8f32->v8i8. Fix the cost model to reflect the reality of our codegen. rdar://16370633 llvm-svn: 204880	2014-03-27 00:04:11 +00:00
Nick Lewycky	77d5fb40c8	Treat lifetime.start'd memory like we treat freshly alloca'd memory. Patch by Björn Steinbrink! llvm-svn: 204876	2014-03-26 23:45:15 +00:00
Eric Christopher	7790cf88e0	Reorder arguments on test command line to make it easier to cut and paste. llvm-svn: 204875	2014-03-26 23:10:28 +00:00
Hal Finkel	df3e34d944	[PowerPC] Generate VSX permutations for v2[fi]64 vectors llvm-svn: 204873	2014-03-26 22:58:37 +00:00
Justin Bogner	dddfc296a7	llvm-cov: Move XFAIL after the body of the test llvm-cov tests are sensitive to line number changes, so putting this at the end will limit churn when we fix the XFAIL. llvm-svn: 204871	2014-03-26 22:51:39 +00:00
Justin Bogner	b220a129ca	llvm-cov: Disable test on big endian machines llvm-svn: 204868	2014-03-26 22:36:48 +00:00
Reid Kleckner	23798a9731	CloneFunction: Clone all attributes, including the CC Summary: Tested with a unit test because we don't appear to have any transforms that use this other than ASan, I think. Fixes PR17935. Reviewers: nicholas CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D3194 llvm-svn: 204866	2014-03-26 22:26:35 +00:00
Ekaterina Romanova	b9aea9383a	This is a fix for PR# 19051. I noticed code gen differences due to code motion when running tests with and without the debug info at O2. The problem is in branch folding. A loop wanted to skip the debug info, but actually it didn't do so. llvm-svn: 204865	2014-03-26 22:15:28 +00:00
Manman Ren	14aa891976	Add comments. Addressing review comments from Evan on r204690. llvm-svn: 204864	2014-03-26 22:14:09 +00:00
Justin Bogner	95e0a70581	llvm-cov: Handle functions with no line number Functions may in an instrumented binary but not in the original source when they're inserted by the compiler or the runtime. These functions aren't meaningful to the user, so teach llvm-cov to skip over them instead of crashing. llvm-svn: 204863	2014-03-26 22:03:06 +00:00
Kevin Enderby	5611398b6b	Fix a problem with the ARM assembler incorrectly matching a vector list parameter that is using all lanes "{d0[], d2[]}" but can match and instruction with a ”{d0, d2}" parameter. I’m finishing up a fix for proper checking of the unsupported alignments on vld/vst instructions and ran into this. Thus I don’t have a test case at this time. And adding all code that will demonstrate the bug would obscure the very simple one line fix. So if you would indulge me on not having a test case at this time I’ll instead offer up a detailed explanation of what is going on in this commit message. This instruction: vld2.8 {d0[], d2[]}, [r4:64] is not legal as the alignment can only be 16 when the size is 8. Per this documentation: A8.8.325 VLD2 (single 2-element structure to all lanes) <align> The alignment. It can be one of: 16 2-byte alignment, available only if <size> is 8, encoded as a = 1. 32 4-byte alignment, available only if <size> is 16, encoded as a = 1. 64 8-byte alignment, available only if <size> is 32, encoded as a = 1. omitted Standard alignment, see Unaligned data access on page A3-108. So when code is added to the llvm integrated assembler to not match that instruction because of the alignment it then goes on to try to match other instructions and comes across this: vld2.8 {d0, d2}, [r4:64] and and matches it. This is because of the method ARMOperand::isVecListDPairSpaced() is missing the check of the Kind. In this case the Kind is k_VectorListAllLanes . While the name of the method may suggest that this is OK it really should check that the Kind is k_VectorList. As the method ARMOperand::isDoubleSpacedVectorAllLanes() is what was used to match {d0[], d2[]} and correctly checks the Kind: bool isDoubleSpacedVectorAllLanes() const { return Kind == k_VectorListAllLanes && VectorList.isDoubleSpaced; } where the original ARMOperand::isVecListDPairSpaced() does not check the Kind: bool isVecListDPairSpaced() const { if (isSingleSpacedVectorList()) return false; return (ARMMCRegisterClasses[ARM::DPairSpcRegClassID] .contains(VectorList.RegNum)); } Jim Grosbach has reviewed the change and said: Yep, that sounds right. … And by "right" I mean, "wow, that's a nasty latent bug I'm really, really glad to see fixed." :) rdar://16436683 llvm-svn: 204861	2014-03-26 21:54:11 +00:00
Eli Bendersky	8474162f2c	Add a unit test for Invoke iteration, similar to the one for Call The tests are refactored to use the same fixture. llvm-svn: 204860	2014-03-26 21:46:24 +00:00
Arnold Schwaighofer	1a444489e9	PR15967 Fix in basicaa for faulty returning no alias. This commit consist of two parts. The first part fix the PR15967. The wrong conclusion was made when the MaxLookup limit was reached. The fix introduce a out parameter (MaxLookupReached) to DecomposeGEPExpression that the function aliasGEP can act upon. The second part is introducing the constant MaxLookupSearchDepth to make sure that DecomposeGEPExpression and GetUnderlyingObject use the same search depth. This is a small cleanup to clarify the original algorithm. Patch by Karl-Johan Karlsson! llvm-svn: 204859	2014-03-26 21:30:19 +00:00
Lang Hames	5391ac4759	Simplify PBQP graph removeAdjEdgeId implementation. llvm-svn: 204857	2014-03-26 21:21:53 +00:00
Eli Bendersky	c35c4b3ddb	Fix bot breakage in InstructionsTest. Makes sure the Call dies before the Function llvm-svn: 204856	2014-03-26 21:11:34 +00:00
Eli Bendersky	84aa5e555f	Fix problem with r204836 In CallInst, op_end() points at the callee, which we don't want to iterate over when just iterating over arguments. Now take this into account when returning a iterator_range from arg_operands. Similar reasoning for InvokeInst. Also adds a unit test to verify this actually works as expected. llvm-svn: 204851	2014-03-26 20:41:15 +00:00
Hal Finkel	6e28e6aaaf	[PowerPC] VSX loads and stores support unaligned access I've not yet updated PPCTTI because I'm not sure what the actual relative cost is compared to the aligned uses. llvm-svn: 204848	2014-03-26 19:39:09 +00:00
Kevin Enderby	8108f38437	Fix the ARM VST4 (single 4-element structure from one lane) size 16 double-spaced registers instruction printing. This: vld4.16 {d17[1], d19[1], d21[1], d23[1]}, [r7]! was being printed as: vld4.16 {d17[1], d18[1], d19[1], d20[1]}, [r7]! rdar://16435096 llvm-svn: 204847	2014-03-26 19:35:40 +00:00
Lang Hames	d107f16a02	Remove PBQP-cost dimension sanity assertion in PBQP::Graph::addConstructedEdge. We're already effectively checking sanity for that in PBQP::Graph::addEdge. llvm-svn: 204844	2014-03-26 19:22:51 +00:00
Hal Finkel	7279f4b00d	[PowerPC] Use v2f64 <-> v2i64 VSX conversion instructions llvm-svn: 204843	2014-03-26 19:13:54 +00:00
Lang Hames	ff85ba1264	Change the PBQP graph adjacency list structure from std::set to std::vector. The edge data structure (EdgeEntry) now holds the indices of its entries in the adjacency lists of the nodes it connects. This trades a little ugliness for faster insertion/removal, which is now O(1) with a cheap constant factor. All of this is implementation detail within the PBQP graph, the external API remains unchanged. Individual register allocations are likely to change, since the adjacency lists will now be ordered differently (or rather, will now be unordered). This shouldn't affect the average quality of allocations however. llvm-svn: 204841	2014-03-26 18:58:00 +00:00
Matt Arsenault	90b733a3cf	R600: Add a testcase for sext_in_reg I missed. This sext_inreg i32 in i64 case was already handled, but not enabled. llvm-svn: 204840	2014-03-26 18:31:06 +00:00
Hal Finkel	ea76a44584	[PowerPC] Remove some dead VSX v4f32 store patterns These patterns are dead (because v4f32 stores are currently promoted to v4i32 and stored using Altivec instructions), and also are likely not correct (because they'd store the vector elements in the opposite order from that assumed by the rest of the Altivec code). llvm-svn: 204839	2014-03-26 18:26:36 +00:00
Hal Finkel	9281c9a38b	[PowerPC] Use VSX vector load/stores for v2[fi]64 These instructions have access to the complete VSX register file. In addition, they "swap" the order of the elements so that element 0 (the scalar part) comes first in memory and element 1 follows at a higher address. llvm-svn: 204838	2014-03-26 18:26:30 +00:00
Juergen Ributzka	6ff29a7b2f	[MCJIT] Check if there have been errors during RuntimeDyld execution. llvm-svn: 204837	2014-03-26 18:19:27 +00:00
Eli Bendersky	6d6a2bba63	Enable range-for iteration over call/invoke arguments. Similar to r204835 llvm-svn: 204836	2014-03-26 18:18:02 +00:00
Eli Bendersky	0c3cccef51	Add args() iteartor adapter to Function, for range-for loops. This patch is in similar vein to what done earlier to Module::globals/aliases etc. It allows to iterate over function arguments like this: for (Argument Arg : F.args()) { ... } llvm-svn: 204835	2014-03-26 18:04:27 +00:00
Jim Grosbach	ed2cd39b81	Fix for incorrect address sinking in the presence of potential overflows. In some cases it is possible for CGP to attempt to reuse a base address from another basic block. In those cases we have to be sure that all the address math was either done at the same bit width, or that none of it overflowed before it was extended. Patch by Louis Gerbarg <lgg@apple.com> rdar://16307442 llvm-svn: 204833	2014-03-26 17:27:01 +00:00
Hans Wennborg	d683a22dd2	Revert "X86 memcpy lowering: use "rep movs" even when esi is used as base pointer" (r204174) > For functions where esi is used as base pointer, we would previously fall ba > from lowering memcpy with "rep movs" because that clobbers esi. > > With this patch, we just store esi in another physical register, and restore > it afterwards. This adds a little bit of register preassure, but the more > efficient memcpy should be worth it. > > Differential Revision: http://llvm-reviews.chandlerc.com/D2968 This didn't work. I was ending up with code like this: lea edi,[esi+38h] mov ecx,0Fh mov edx,esi mov esi,ebx rep movs dword ptr es:[edi],dword ptr [esi] lea ecx,[esi+74h] <-- Ooops, we're now using esi before restoring it from edx. add ebx,3Ch mov esi,edx I guess if we want to do this we need stronger glue or something, or doing the expansion much later. llvm-svn: 204829	2014-03-26 16:30:54 +00:00
Hal Finkel	a6c8b51212	[PowerPC] Add v2i64 as a legal VSX type v2i64 needs to be a legal VSX type because it is the SetCC result type from v2f64 comparisons. We need to expand all non-arithmetic v2i64 operations. This fixes the lowering for v2f64 VSELECT. llvm-svn: 204828	2014-03-26 16:12:58 +00:00
Matheus Almeida	ea06727f03	[mips] Use TwoOperandAliasConstraint for ArithLogicR instructions. This enables TableGen to generate an additional two operand matcher for our ArithLogicR class of instructions (constituted by 3 register operands). E.g.: and $1, $2 <=> and $1, $1, $2 llvm-svn: 204826	2014-03-26 16:09:43 +00:00
Matheus Almeida	ab5633b70c	[mips] Add support to the '.dword' directive. The '.dword' directive accepts a list of expressions and emits them in 8-byte chunks in successive locations. llvm-svn: 204822	2014-03-26 15:44:18 +00:00
Joerg Sonnenberger	94321ec003	Clarify that select is only non-branching on the IR-level, it often ends up as jump table or other forms of branches on the machine level. llvm-svn: 204819	2014-03-26 15:30:21 +00:00
Matheus Almeida	3e2a702aa2	[mips] Rename function in MipsAsmParser. parseDirectiveWord is a generic function that parses an expression which means there's no need for it to have such an specific name. Renaming it to parseDataDirective so that it can also be used to handle .dword directives[1]. [1]To be added in a follow up commit. No functional changes. llvm-svn: 204818	2014-03-26 15:24:36 +00:00
Matheus Almeida	3b9c63d29b	[mips] Add support to '.set mips64'. The '.set mips64' directive enables the feature Mips:FeatureMips64 from assembly. Note that it doesn't modify the ELF header as opposed to the use of -mips64 from the command-line. The reason for this is that we want to be as compatible as possible with existing assemblers like GAS. llvm-svn: 204817	2014-03-26 15:14:32 +00:00
Christian Pirker	99974c7242	AArch64_BE Elf support for MC-JIT runtime dynamic linker llvm-svn: 204816	2014-03-26 14:57:32 +00:00
Matheus Almeida	a2cd009c51	[mips] Add support to '.set mips64r2'. The '.set mips64r2' directive enables the feature Mips:FeatureMips64r2 from assembly. Note that it doesn't modify the ELF header as opposed to the use of -mips64r2 from the command-line. The reason for this is that we want to be as compatible as possible with existing assemblers like GAS. llvm-svn: 204815	2014-03-26 14:52:22 +00:00
Christian Pirker	3aa0e6a1f9	AArch64_BE function argument passing for ARM ABI llvm-svn: 204814	2014-03-26 14:51:22 +00:00
Tim Northover	1ff5f29fb5	ARM: add intrinsics for the v8 ldaex/stlex We've already got versions without the barriers, so this just adds IR-level support for generating the new v8 ones. rdar://problem/16227836 llvm-svn: 204813	2014-03-26 14:39:31 +00:00
Joerg Sonnenberger	03014d6291	Clarify llvm.clear_cache description. llvm-svn: 204812	2014-03-26 14:35:21 +00:00
Matheus Almeida	fe1e39dcba	[mips] Hoist common functionality into a new function. Given that we support multiple directives that enable a particular feature (e.g. '.set mips16'), it's best to hoist that code into a new function so that we don't repeat the same pattern w.r.t parsing and handling error cases. No functional changes. llvm-svn: 204811	2014-03-26 14:26:27 +00:00
Renato Golin	93010e687f	Change @llvm.clear_cache default to call rt-lib After some discussion on IRC, emitting a call to the library function seems like a better default, since it will move from a compiler internal error to a linker error, that the user can work around until LLVM is fixed. I'm also adding a note on the responsibility of the user to confirm that the cache was cleared on platforms where nothing is done. llvm-svn: 204806	2014-03-26 14:01:32 +00:00
Daniel Sanders	6dd7251599	[mips] The decision to use MO_GOT_PAGE and MO_GOT_OFST depends on the ABI being N32 or N64 not the arch being MIPS64 Summary: No functional change (in supported use cases) Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3177 llvm-svn: 204805	2014-03-26 13:59:42 +00:00
Cameron McInally	4532596b8f	Fix AVX512 Gather and Scatter execution domains. llvm-svn: 204804	2014-03-26 13:50:50 +00:00
Matheus Almeida	f79b281421	[mips] Add support for '.option pic2'. The directive '.option pic2' enables PIC from assembly source. At the moment none of the macros/directives check the PIC bit but that's going to be fixed relatively soon. For example, the expansion of macros like 'la' depend on the relocation model. llvm-svn: 204803	2014-03-26 13:40:29 +00:00
Renato Golin	c0a3c1d66b	Add @llvm.clear_cache builtin Implementing the LLVM part of the call to __builtin___clear_cache which translates into an intrinsic @llvm.clear_cache and is lowered by each target, either to a call to __clear_cache or nothing at all incase the caches are unified. Updating LangRef and adding some tests for the implemented architectures. Other archs will have to implement the method in case this builtin has to be compiled for it, since the default behaviour is to bail unimplemented. A Clang patch is required for the builtin to be lowered into the llvm intrinsic. This will be done next. llvm-svn: 204802	2014-03-26 12:52:28 +00:00
Hal Finkel	732f0f73a7	[PowerPC] Lower VSELECT using xxsel when VSX is available With VSX there is a real vector select instruction, and so we should use it. Note that VSELECT will still scalarize for v2f64 because the corresponding SetCC result type (v2i64) is not currently a legal type. llvm-svn: 204801	2014-03-26 12:49:28 +00:00
Daniel Sanders	6dad838f3a	[mips] Add tests for t0-t3 for N32/N64 These are aliases of t4-t7 and are provided for compatibility with both the original ABI documentation (using t4-t7) and GNU As (using t0-t3) llvm-svn: 204797	2014-03-26 11:46:34 +00:00
Daniel Sanders	a4b0c74765	[mips] The register names depend on the ABI being N32/N64 rather than the arch being mips64 Summary: Added test cases for O32 and N32 on MIPS64. Reviewers: matheusalmeida Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3175 llvm-svn: 204796	2014-03-26 11:39:07 +00:00
Timur Iskhodzhanov	b5b7a61646	Follow-up to r204790: don't try to emit line tables if there are no functions with DI in the TU llvm-svn: 204795	2014-03-26 11:24:36 +00:00
Daniel Sanders	85f482b02f	[mips] $s8 is an alias for $fp in all ABI's, not just N32/N64. llvm-svn: 204793	2014-03-26 11:05:24 +00:00
Daniel Sanders	91d4407cd8	[mips] Move the CHECK lines in mips*-register-names.s to make it more obvious which CHECK matches with which insn This reveals a small mistake in mips-register-names.s ($sp is tested twice and $s8 is not tested) which will be fixed in a follow-up commit. llvm-svn: 204792	2014-03-26 10:54:30 +00:00
Timur Iskhodzhanov	6a35c15589	Add tests for r204790 llvm-svn: 204791	2014-03-26 09:51:45 +00:00
Timur Iskhodzhanov	8499a12259	Fix PR19239 - Add support for generating debug info for functions without lexical scopes and/or debug info at all llvm-svn: 204790	2014-03-26 09:50:36 +00:00
Timur Iskhodzhanov	e32ef937eb	Use -LABEL checks in the COFF debug info tests llvm-svn: 204788	2014-03-26 08:45:02 +00:00
Rafael Espindola	65481d7b97	Revert "Prevent alias from pointing to weak aliases." This reverts commit r204781. I will follow up to with msan folks to see what is what they were trying to do with aliases to weak aliases. llvm-svn: 204784	2014-03-26 06:14:40 +00:00
Hal Finkel	bd4de9d478	[PowerPC] Generate logical vector VSX instructions These instructions are essentially the same as their Altivec counterparts, but have access to the larger VSX register file. llvm-svn: 204782	2014-03-26 04:55:40 +00:00
Rafael Espindola	3b712a84a9	Prevent alias from pointing to weak aliases. Aliases are just another name for a position in a file. As such, the regular symbol resolutions are not applied. For example, given define void @my_func() { ret void } @my_alias = alias weak void ()* @my_func @my_alias2 = alias void ()* @my_alias We produce without this patch: .weak my_alias my_alias = my_func .globl my_alias2 my_alias2 = my_alias That is, in the resulting ELF file my_alias, my_func and my_alias are just 3 names pointing to offset 0 of .text. That is not the semantics of IR linking. For example, linking in a @my_alias = alias void ()* @other_func would require the strong my_alias to override the weak one and my_alias2 would end up pointing to other_func. There is no way to represent that with aliases being just another name, so the best solution seems to be to just disallow it, converting a miscompile into an error. llvm-svn: 204781	2014-03-26 04:48:47 +00:00
David Blaikie	62dd7df612	DebugInfo: Add fission-related sections to COFF Allows this test to pass on COFF platforms so we don't need to restrict this test to a single target anymore. llvm-svn: 204780	2014-03-26 03:05:10 +00:00
Rafael Espindola	85a8491a93	Correctly detect if a symbol uses a reserved section index or not. The logic was incorrect for variables, causing them to end up in the wrong section if the section had an index >= 0xff00. llvm-svn: 204771	2014-03-26 00:16:43 +00:00
Quentin Colombet	6f12ae0d5c	[X86] Add broadcast instructions to the table used by ExeDepsFix pass. Adds the different broadcast instructions to the ReplaceableInstrsAVX2 table. That way the ExeDepsFix pass can take better decisions when AVX2 broadcasts are across domain (int <-> float). In particular, prior to this patch we were generating: vpbroadcastd LCPI1_0(%rip), %ymm2 vpand %ymm2, %ymm0, %ymm0 vmaxps %ymm1, %ymm0, %ymm0 ## <- domain change penalty Now, we generate the following nice sequence where everything is in the float domain: vbroadcastss LCPI1_0(%rip), %ymm2 vandps %ymm2, %ymm0, %ymm0 vmaxps %ymm1, %ymm0, %ymm0 <rdar://problem/16354675> llvm-svn: 204770	2014-03-26 00:10:22 +00:00
Rafael Espindola	10be0837ac	Create .symtab_shndxr only when needed. We need .symtab_shndxr if and only if a symbol references a section with an index >= 0xff00. The old code was trying to figure out if the section was needed ahead of time, making it a fairly dependent on the code actually writing the table. It was also somewhat conservative and would create the section in cases where it was not needed. If I remember correctly, the old structure was there so that the sections were created in the same order gas creates them. That was valuable when MC's support for ELF was new and we tested with elf-dump.py. This patch refactors the symbol table creation to another class and makes it obvious that .symtab_shndxr is really only created when we are about to output a reference to a section index >= 0xff00. While here, also improve the tests to use macros. One file is one section short of needing .symtab_shndxr, the second one has just the right number. llvm-svn: 204769	2014-03-25 23:44:25 +00:00
Hal Finkel	174e590966	[PowerPC] Select between VSX A-type and M-type FMA instructions just before RA The VSX instruction set has two types of FMA instructions: A-type (where the addend is taken from the output register) and M-type (where one of the product operands is taken from the output register). This adds a small pass that runs just after MI scheduling (and, thus, just before register allocation) that mutates A-type instructions (that are created during isel) into M-type instructions when: 1. This will eliminate an otherwise-necessary copy of the addend 2. One of the product operands is killed by the instruction The "right" moment to make this decision is in between scheduling and register allocation, because only there do we know whether or not one of the product operands is killed by any particular instruction. Unfortunately, this also makes the implementation somewhat complicated, because the MIs are not in SSA form and we need to preserve the LiveIntervals analysis. As a simple example, if we have: %vreg5<def> = COPY %vreg9; VSLRC:%vreg5,%vreg9 %vreg5<def,tied1> = XSMADDADP %vreg5<tied0>, %vreg17, %vreg16, %RM<imp-use>; VSLRC:%vreg5,%vreg17,%vreg16 ... %vreg9<def,tied1> = XSMADDADP %vreg9<tied0>, %vreg17, %vreg19, %RM<imp-use>; VSLRC:%vreg9,%vreg17,%vreg19 ... We can eliminate the copy by changing from the A-type to the M-type instruction. This means: %vreg5<def,tied1> = XSMADDADP %vreg5<tied0>, %vreg17, %vreg16, %RM<imp-use>; VSLRC:%vreg5,%vreg17,%vreg16 is replaced by: %vreg16<def,tied1> = XSMADDMDP %vreg16<tied0>, %vreg18, %vreg9, %RM<imp-use>; VSLRC:%vreg16,%vreg18,%vreg9 and we remove: %vreg5<def> = COPY %vreg9; VSLRC:%vreg5,%vreg9 llvm-svn: 204768	2014-03-25 23:29:21 +00:00
NAKAMURA Takumi	3a485fba1f	llvm/test/DebugInfo/empty.ll: Suppress crash for targeting pecoff while investigating. llvm-svn: 204766	2014-03-25 23:16:44 +00:00
Rafael Espindola	0ce0971afa	Use Endian.h to simplify this code a bit. While at it, factor some logic into FragmentWriter. This will allow more code to be factored out of the fairly large ELFObjectWriter. llvm-svn: 204765	2014-03-25 22:43:53 +00:00
Meador Inge	0d34006a81	[configure/make] Propagate names of build host tools when making BuildTools When cross-compiling LLVM itself the configure/make scripts get confused when creating the needed build host tools. For example, building and configuring like: CC_FOR_BUILD='i686-pc-linux-gnu-gcc' CXX_FOR_BUILD='i686-pc-linux-gnu-g++' CXX='i686-mingw32-g++' CC='i686-mingw32-gcc' LD='i686-mingw32-ld' /scratch /meadori/llvm-trunk/src/trunk/configure --host=i686-mingw32 CC_FOR_BUILD='i686-pc-linux-gnu-gcc' CXX_FOR_BUILD='i686-pc-linux-gnu-g++' CXX='i686-mingw32-g++' CC='i686-mingw32-gcc' LD='i686-mingw32-ld' make causes the following build break: checking whether the C compiler works... configure: error: cannot run C compiled programs. If you meant to cross compile, use `--host'. See `config.log' for more details. The 'config.log' shows that i686-mingw32-gcc is being used to create executables for the build host. This patch fixes the problem by propogating the names of the build host tools via BUILD_* when configuring/making BuildTools. Original patch by Ekaterina Sanina. llvm-svn: 204760	2014-03-25 21:45:41 +00:00
Juergen Ributzka	7be410f5d5	[Constant Hoisting] Make the constant candidate map local to the collectConstantCandidates method. llvm-svn: 204758	2014-03-25 21:21:10 +00:00
Hal Finkel	6c32ff31d0	[PowerPC] Correct commutable indices for VSX FMA instructions Although the first two operands are the ones that can be swapped, the tied input operand is listed before them, so we need to adjust for that. I have a test case for this, but it goes along with an upcoming commit (so it will come soon). llvm-svn: 204748	2014-03-25 19:26:43 +00:00
Hal Finkel	25e0454f10	[PowerPC] Add a TableGen relation for A-type and M-type VSX FMA instructions TableGen will create a lookup table for the A-type FMA instructions providing their corresponding M-form opcodes. This will be used by upcoming commits. llvm-svn: 204746	2014-03-25 18:55:11 +00:00
Matt Arsenault	0c274feedf	R600: Move computeMaskedBitsForTargetNode out of AMDILISelLowering.cpp Remove handling of select_cc, since it makes no sense to be there. This now does nothing, but I'll be adding some handling of other target nodes soon. llvm-svn: 204743	2014-03-25 18:18:27 +00:00
Duncan P. N. Exon Smith	3dbe10503a	blockfreq: Implement Pass::releaseMemory() Implement Pass::releaseMemory() in BlockFrequencyInfo and MachineBlockFrequencyInfo. Just delete the private implementation when not in use. Switch to a std::unique_ptr to make the logic more clear. <rdar://problem/14292693> llvm-svn: 204741	2014-03-25 18:01:38 +00:00
Duncan P. N. Exon Smith	936aef9238	blockfreq: Use const in MachineBlockFrequencyInfo <rdar://problem/14292693> llvm-svn: 204740	2014-03-25 18:01:32 +00:00
Juergen Ributzka	631c4914b2	[X86TTI] Make constant base pointers for getElementPtr opaque. If getElementPtr uses a constant as base pointer, then make the constant opaque. This prevents constant folding it with the offset. The offset can usually be encoded in the load/store instruction itself and the base address doesn't have to be rematerialized several times. llvm-svn: 204739	2014-03-25 18:01:25 +00:00
Juergen Ributzka	5eef98cf7a	[Stackmaps][X86TTI] Fix think-o in getIntImmCost calculation. The cost for the first four stackmap operands was always TCC_Free. This is only true for the first two operands. All other operands are TCC_Free if they are within 64bit. llvm-svn: 204738	2014-03-25 18:01:23 +00:00
Juergen Ributzka	e2e16844f5	[DAG] Keep the opaque constant flag when performing unary constant folding operations. Usually opaque constants shouldn't be folded, unless they are simple unary operations that don't create new constants. Although this shouldn't drop the opaque constant flag. This commit fixes this. Related to <rdar://problem/14774662> llvm-svn: 204737	2014-03-25 18:01:20 +00:00
Adam Nemet	4beef4c90d	[X86] Generate VPSHUFB for in-place v16i16 shuffles This used to resort to splitting the 256-bit operation into two 128-bit shuffles and then recombining the results. Fixes <rdar://problem/16167303> llvm-svn: 204735	2014-03-25 17:47:06 +00:00
Adam Nemet	ac6d6383a3	[X86] Factor out new helper getPSHUFB I found three implementations of this. This splits it out into a new function and uses it from the three places. My plan is to add a fourth use when lowering a vector_shuffle:v16i16. Compared the assembly output of test/CodeGen/X86 before and after. The only change is due to how the first PSHUFB was generated in LowerVECTOR_SHUFFLEv8i16. If the shuffle mask specified undef (i.e. -1), the old implementation would write -1 * 2 and -1 * 2 + 1 (254 and 255) in the control mask. Now we write 0x80. These are of course interchangeable since bit 7 decides if a constant zero is written in the result byte. The other instances of this code use 0x80 consistently. Related to <rdar://problem/16167303> llvm-svn: 204734	2014-03-25 17:47:03 +00:00
Richard Osborne	0af4aa9a19	[InstCombine] Don't fold bitcast into store if it would need addrspacecast Summary: Previously the code didn't check if the before and after types for the store were pointers to different address spaces. This resulted in instcombine using a bitcast to convert between pointers to different address spaces, causing an assertion due to the invalid cast. It is not be appropriate to use addrspacecast this case because it is not guaranteed to be a no-op cast. Instead bail out and do not do the transformation. CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D3117 llvm-svn: 204733	2014-03-25 17:21:41 +00:00
Richard Osborne	9805ec457d	Reuse earlier variables to make it clear the types involved in the cast. No functionality change. llvm-svn: 204732	2014-03-25 17:21:35 +00:00
Benjamin Kramer	b88c97f02e	Add missing slash to make the doxygen output less confusing. PR19187. llvm-svn: 204731	2014-03-25 17:20:28 +00:00
Matt Arsenault	86673ba836	R600: Add failing testcase for <3 x i32> stores. This is supposed to have the same store size and alignment as <4 x i32>, but currently is split into a 64-bit and 32-bit store. llvm-svn: 204729	2014-03-25 16:50:55 +00:00
Benjamin Kramer	e75eaca32f	ScalarEvolution: Compute exit counts for loops with a power-of-2 step. If we have a loop of the form for (unsigned n = 0; n != (k & -32); n += 32) {} then we know that n is always divisible by 32 and the loop must terminate. Even if we have a condition where the loop counter will overflow it'll always hold this invariant. PR19183. Our loop vectorizer creates this pattern and it's also occasionally formed by loop counters derived from pointers. llvm-svn: 204728	2014-03-25 16:25:12 +00:00
Matt Arsenault	b22426c510	Fix creating illegal setcc cond codes. If GT/UGT or LT/ULT were set to expand, a comparison with a constant would replace it with the illegal cond code. There are several more places later in this function that will have the same basic problem. Theoretically R600 should hit this problem for a test, but for some reason it doesn't. llvm-svn: 204727	2014-03-25 16:09:21 +00:00
Evgeniy Stepanov	86f318e8b4	[msan] Relax the test some more. This may or may not fix the bots. R204720 did not. llvm-svn: 204721	2014-03-25 14:32:05 +00:00
Evgeniy Stepanov	d2b07ddfac	[msan] Make some tests less strict. This may or may not fix the bots. llvm-svn: 204720	2014-03-25 14:15:14 +00:00
Rafael Espindola	663ac28603	Fix these tests on windows. It is impossible to create a hard link to a non existing file, so create a dummy file, create the link an delete the dummy file. On windows one cannot remove the current directory, so chdir first. llvm-svn: 204719	2014-03-25 13:19:03 +00:00
Evgeniy Stepanov	fc742acc8c	[msan] More precise instrumentation of select IR. Some bits of select result may be initialized even if select condition is not. https://code.google.com/p/memory-sanitizer/issues/detail?id=50 llvm-svn: 204716	2014-03-25 13:08:34 +00:00
Daniel Sanders	71a89d92f6	[mips] '.set at=$0' should be equivalent to '.set noat' Differential Revision: http://llvm-reviews.chandlerc.com/D3171 llvm-svn: 204714	2014-03-25 13:01:06 +00:00
Cameron McInally	45dc489403	Fix AVX2 Gather execution domains. llvm-svn: 204713	2014-03-25 12:36:38 +00:00
Daniel Sanders	b1d7e53a26	[mips] Correct testcase for .set at=$reg and emit the new warnings for numeric registers too. Summary: Remove the XFAIL added in my previous commit and correct the test such that it correctly tests the expansion of the assembler temporary. Also added a test to check that $at is always $1 when written by the user. Corrected the new assembler temporary warnings so that they are emitted for numeric registers too. Differential Revision: http://llvm-reviews.chandlerc.com/D3169 llvm-svn: 204711	2014-03-25 11:16:03 +00:00
Daniel Sanders	e231ae9e3a	[mips] Fix assembler temporary expansion and add associated warnings about the use of $at. Summary: The assembler temporary is normally $at ($1) but can be reassigned using '.set at=$reg'. Regardless of which register is nominated as the assembler temporary, $at remains $1 when written by the user. Adds warnings under the following conditions: * The register nominated as the assembler temporary is used by the user. * '.set noat' is in effect and $at is used by the user. Both of these only work for named registers. I have a follow up commit that makes it work for numeric registers as well. XFAIL set-at-directive.s since it incorrectly tests that $at is redefined by '.set at=$reg'. Testcases will follow in a separate commit. Patch by David Chisnall His work was sponsored by: DARPA, AFRL Differential Revision: http://llvm-reviews.chandlerc.com/D3167 llvm-svn: 204710	2014-03-25 10:57:07 +00:00
Yaron Keren	e485511e8e	Remove cmake module support for Visual C++ 2010 (MSVC10) but keep the MSVC11 (Visual C++ 2012) support. llvm-svn: 204706	2014-03-25 09:34:20 +00:00
Erik Verbruggen	e706b88304	Simplify loop that worked around bugs in old GCC/Xcode. GCC 4.0.1 and Xcode 2 are no longer supported for building llvm/clang. llvm-svn: 204705	2014-03-25 09:06:18 +00:00
Yaron Keren	24fdbe5676	Disable Visual C++ warning 4722 about aborting a destructor, it has no value for us. llvm-svn: 204704	2014-03-25 08:42:49 +00:00
David Majnemer	273bff4713	WinCOFF: Add support for -fdata-sections This is a pretty straight forward translation for COFF, we just need to stick the data in a COMDAT section marked as IMAGE_COMDAT_SELECT_NODUPLICATES. N.B. We must be careful to avoid sticking entities with private linkage in COMDAT groups. COFF is pretty hostile to the renaming of entities so we must be careful to disallow GlobalVariables with unstable names. llvm-svn: 204703	2014-03-25 06:14:26 +00:00
David Blaikie	3ffe4dd67f	DebugInfo: Add GNU_addr_base and GNU_ranges_base only when there are addresses or ranges Based on code review feedback from Eric in r204672. llvm-svn: 204702	2014-03-25 05:34:24 +00:00
Saleem Abdulrasool	1425622ad8	test: fix CHECK lines Thanks to gix for pointing out that the CHECK-LABEL lines were incorrect! llvm-svn: 204700	2014-03-25 03:39:39 +00:00
Andrew Trick	c8ac7ea261	SLP vectorizer: Don't hoist vector extracts of phis. Extracts coming from phis were being hoisted, while all others were sunk to their uses. This was inconsistent and didn't seem to serve a purpose. Changing all extracts to be sunk to uses is a prerequisite for adding block frequency to the SLP vectorizer's cost model. I benchmarked the change in isolation (without block frequency). I only saw noise on x86 and some potentially significant improvements on ARM. No major regressions is good enough for me. llvm-svn: 204699	2014-03-25 02:18:47 +00:00
David Blaikie	9c550ac4e7	DebugInfo: Support debug_loc under fission Implement debug_loc.dwo, as well as llvm-dwarfdump support for dumping this section. Outlined in the DWARF5 spec and http://gcc.gnu.org/wiki/DebugFission the debug_loc.dwo section has more variation than the standard debug_loc, allowing 3 different forms of entry (plus the end of list entry). GCC seems to, and Clang certainly, only use one form, so I've just implemented dumping support for that for now. It wasn't immediately obvious that there was a good refactoring to share the implementation of dumping support between debug_loc and debug_loc.dwo, so they're separate for now - ideas welcome or I may come back to it at some point. As per a comment in the code, we could choose different forms that may reduce the number of debug_addr entries we emit, but that will require further study. llvm-svn: 204697	2014-03-25 01:44:02 +00:00
David Blaikie	2d33d6a4c2	DebugInfo: Remove unnecessary zero-size check This seems excessive - switching section isn't expensive (or if it is we're already being wasteful, since we emitted the debug_loc section symbol earlier anyway) and otherwise there's no work that happens in this function when the list is empty. llvm-svn: 204696	2014-03-25 01:43:56 +00:00
Justin Bogner	8ce3b5fcaa	Support: Functions for consuming endian specific data from a buffer. This adds a function to Endian.h that reads from and updates a pointer into a buffer with endian specific data. This is more convenient for stream-like reading of data than endian::read. llvm-svn: 204693	2014-03-25 01:04:44 +00:00
Manman Ren	78cf02a07b	Register Allocator: check other options before using a CSR for the first time. When register allocator's stage is RS_Spill, we choose spill over using the CSR for the first time, if the spill cost is lower than CSRCost. When register allocator's stage is < RS_Split, we choose pre-splitting over using the CSR for the first time, if the cost of splitting is lower than CSRCost. CSRCost is set with command-line option "regalloc-csr-first-time-cost". The default value is 0 to generate the same codes as before this commit. With a value of 15 (1 << 14 is the entry frequency), I measured performance gain of 3% on 253.perlbmk and 1.7% on 197.parser, with instrumented PGO, on an arm device. rdar://16162005 llvm-svn: 204690	2014-03-25 00:16:25 +00:00
Kevin Enderby	89299400ac	Fix crashes when assembler directives are used that are not for Mach-O object files by generating an error instead. rdar://16335232 llvm-svn: 204687	2014-03-25 00:05:50 +00:00
Manman Ren	9db66b3d34	Register Allocator: refactoring (no functionality change). Factor out two functions calculateRegionSplitCost and doRegionSplit from tryRegionSplit. These two functions will be used in coming patches. rdar://16162005 llvm-svn: 204684	2014-03-24 23:23:42 +00:00
David Blaikie	84d8e18f2b	DebugInfo: Simplify debug loc list handling by keeping separate lists Rather than using a flat list with "empty" entries (ala the actual on-disk format), keep separate lists for each variable. llvm-svn: 204680	2014-03-24 22:38:38 +00:00
David Blaikie	34ec5d07e1	DwarfDebug: Simplify debug_loc merging No functional change intended. Merging up-front rather than delaying this task until later. This just seems simpler and more efficient (avoiding growing the debug loc list only to have to skip over those post-merged entries, etc). llvm-svn: 204679	2014-03-24 22:27:06 +00:00
Adrian Prantl	c95ec91e2a	Get rid of an unnecessary use of the * and & operators. llvm-svn: 204673	2014-03-24 21:33:01 +00:00

... 3 4 5 6 7 ...

101851 Commits