llvm-project

Commit Graph

Author	SHA1	Message	Date
Weiming Zhao	58eb5ab326	Report error for non-zero data in .bss User may initialize a var with non-zero value and specify .bss section. E.g. : int a __attribute__((section(".bss"))) = 2; This patch converts an assertion to error report for better user experience. Differential Revision: http://reviews.llvm.org/D4199 llvm-svn: 211455	2014-06-22 00:33:44 +00:00
Stepan Dyatkovskiy	f4af855930	MergeFunctions Pass, FnSet has been replaced with FnTree. Patch activates new implementation. So from now, merging process should take time O(Nlog(N)). Where N size of module (we are free to measure it in functions or in instructions). Internally FnTree represents binary tree. So every lookup operation takes O(log(N)) time. It is still not the last patch in series, we also have to clean-up pass from old code, and update pass comments. This patch belongs to patch series that improves MergeFunctions performance time from O(NN) to O(N*log(N)). llvm-svn: 211445	2014-06-21 20:54:36 +00:00
Stepan Dyatkovskiy	71038cadd4	MergeFunctions Pass, removed unused methods from old implementation. Patch removed next old FunctionComparator methods: * enumerate * isEquivalentOperation * isEquivalentGEP * isEquivalentType This patch belongs to patch series that improves MergeFunctions performance time from O(NN) to O(Nlog(N)). llvm-svn: 211444	2014-06-21 20:13:24 +00:00
Stepan Dyatkovskiy	0b58801b69	MergeFunctions, doSanityCheck: fixed body comments. llvm-svn: 211443	2014-06-21 19:07:51 +00:00
Stepan Dyatkovskiy	a77f3d8587	MergeFunctions Pass, introduced sanity check, that checks order relation, introduced among functions set. This patch belongs to patch series that improves MergeFunctions performance time from O(NN) to O(Nlog(N)). llvm-svn: 211442	2014-06-21 18:58:11 +00:00
Stepan Dyatkovskiy	17ee5ac20d	MergeFunctions Pass, introduced total ordering among top-level comparison methods. Patch changes return type of FunctionComparator::compare() and FunctionComparator::compare(const BasicBlock, const BasicBlock) methods from bool (equal or not) to {-1, 0, 1} (less, equal, great). This patch belongs to patch series that improves MergeFunctions performance time from O(NN) to O(Nlog(N)). llvm-svn: 211437	2014-06-21 17:55:51 +00:00
Benjamin Kramer	0bf086f80f	LoopUnrollRuntime: Check for overflow in the trip count calculation. Fixes PR19823. llvm-svn: 211436	2014-06-21 13:46:25 +00:00
Benjamin Kramer	b7f5fb5751	Legalizer: Add support for splitting insert_subvectors. We handle this by spilling the whole thing to the stack and doing the insertion as a store. PR19492. This happens in real code because the vectorizer creates v2i128 when AVX is enabled. llvm-svn: 211435	2014-06-21 12:56:42 +00:00
Benjamin Kramer	8dd637aa04	SCEVExpander: Fold constant PHIs harder. The logic below only understands proper IVs. PR20093. llvm-svn: 211433	2014-06-21 11:47:18 +00:00
Richard Trieu	c1485223a6	Add back functionality removed in r210497. Instead of asserting, output a message stating that a null pointer was found. llvm-svn: 211430	2014-06-21 02:43:02 +00:00
Andrea Di Biagio	e5015d8aba	[X86] Add ISel patterns to select SSE3/AVX ADDSUB instructions. This patch adds ISel patterns to select SSE3/AVX ADDSUB instructions from a sequence of "vadd + vsub + blend". Example: /// typedef float float4 __attribute__((ext_vector_type(4))); float4 foo(float4 A, float4 B) { float4 X = A - B; float4 Y = A + B; return (float4){X[0], Y[1], X[2], Y[3]}; } /// Before this patch, (with flag -mcpu=corei7) llc produced the following assembly sequence: movaps %xmm0, %xmm2 addps %xmm1, %xmm2 subps %xmm1, %xmm0 blendps $10, %xmm2, %xmm0 With this patch, we now get a single addsubps %xmm1, %xmm0 llvm-svn: 211427	2014-06-21 01:31:15 +00:00
Zachary Turner	d119fa028a	Fix the MinGW builder. Apparently std::call_once and std::recursive_mutex are not available on MinGW and breaks the builder. Revert to using a function local static and sys::Mutex just to get the tree green until we figure out a better solution. llvm-svn: 211424	2014-06-21 00:24:51 +00:00
Rafael Espindola	b4076b290e	Always use a temp symbol for CIE. Fixes pr19185. llvm-svn: 211423	2014-06-20 23:54:32 +00:00
Rafael Espindola	c3510c74f7	Use compact unwind for the iOS simulator. Another step in fixing pr19185. llvm-svn: 211416	2014-06-20 22:40:55 +00:00
Rafael Espindola	becdf63f7d	Use a helper function and clang-format. No functionality change. llvm-svn: 211415	2014-06-20 22:37:01 +00:00
Duncan P. N. Exon Smith	03c2bfc2ef	Support: ScaledNumber: Fix inconsistent test names llvm-svn: 211414	2014-06-20 22:36:09 +00:00
Duncan P. N. Exon Smith	818a8176ea	Support: Write ScaledNumbers::getLg{,Floor,Ceiling}() llvm-svn: 211413	2014-06-20 22:33:40 +00:00
Rafael Espindola	df100c337c	Delete dead code. The compact unwind info is only used by code that knows it is supported. llvm-svn: 211412	2014-06-20 22:30:31 +00:00
Duncan P. N. Exon Smith	411840d963	Support: Write ScaledNumber::getQuotient() and getProduct() llvm-svn: 211409	2014-06-20 21:47:47 +00:00
Duncan P. N. Exon Smith	0a594f8cbd	Support: Cleanup ScaledNumber::getAdjusted() doc llvm-svn: 211407	2014-06-20 21:44:36 +00:00
Duncan P. N. Exon Smith	d4ea631fec	Support: Mark end of namespaces This convinces clang-format to leave a newline. llvm-svn: 211406	2014-06-20 21:43:20 +00:00
Kevin Enderby	26646108c9	Fix some double printing of filenames for archives in llvm-nm when the tool is given multiple files. Also fix the same issue with Mach-O universal files. And fix the newline spacing to separate the output in these cases. llvm-svn: 211405	2014-06-20 21:29:27 +00:00
Rafael Espindola	b4357fc293	Don't produce eh_frame relocations when targeting the IOS simulator. First step for fixing pr19185. llvm-svn: 211404	2014-06-20 21:15:27 +00:00
Zachary Turner	c04b892f93	Revert "Replace Execution Engine's mutex with std::recursive_mutex." This reverts commit 1f502bd9d7d2c1f98ad93a09ffe435e11a95aedd, due to GCC / MinGW's lack of support for C++11 threading. It's possible this will go back in after we come up with a reasonable solution. llvm-svn: 211401	2014-06-20 21:07:14 +00:00
Reid Kleckner	4a01230db4	Generate native unwind info on Win64 This patch enables LLVM to emit Win64-native unwind info rather than DWARF CFI. It handles all corner cases (I hope), including stack realignment. Because the unwind info is not flexible enough to describe stack frames with a gap of unknown size in the middle, such as the one caused by stack realignment, I modified register spilling code to place all spills into the fixed frame slots, so that they can be accessed relative to the frame pointer. Patch by Vadim Chugunov! Reviewed By: rnk Differential Revision: http://reviews.llvm.org/D4081 llvm-svn: 211399	2014-06-20 20:35:47 +00:00
David Blaikie	7c8d13911a	Fix some -Wsign-compare fallout from changing container count member functions to return unsigned instead of bool. llvm-svn: 211393	2014-06-20 19:54:13 +00:00
Stepan Dyatkovskiy	6baeb8805c	Commited patch from Björn Steinbrink: Summary: Different range metadata can lead to different optimizations in later passes, possibly breaking the semantics of the merged function. So range metadata must be taken into consideration when comparing Load instructions. Thanks! llvm-svn: 211391	2014-06-20 19:11:56 +00:00
Adam Nemet	f67d999ebb	[Make] Fix dependencies for td.expanded Depend on all the .td files not just the main one. llvm-svn: 211390	2014-06-20 19:00:41 +00:00
Ulrich Weigand	32626014a6	[RuntimeDyld] Fix ppc64 stub relocations on little-endian When RuntimeDyldELF creates stub functions, it needs to install relocations that will resolve to the final address of the target routine. Since those are 16-bit relocs, they need to be applied to the least-significant halfword of the instruction. On big-endian ppc64, this means that addresses have to be adjusted by 2, which is what the code currently does. However, on a little-endian system, the address must not be adjusted; the least-significant halfword is the first one. This patch updates the RuntimeDyldELF code to take the target byte order into account. llvm-svn: 211384	2014-06-20 18:17:56 +00:00
Kevin Enderby	4eff6cdd2e	Fix a warning about the use of const being ignored with a cast. llvm-svn: 211383	2014-06-20 18:07:34 +00:00
Ulrich Weigand	dbc8e1ae28	[RuntimeDyld] Support more PPC64 relocations This adds support for several missing PPC64 relocations in the straight-forward manner to RuntimeDyldELF.cpp. Note that this actually fixes a failure of a large-model test case on PowerPC, allowing the XFAIL to be removed. llvm-svn: 211382	2014-06-20 17:51:47 +00:00
Tom Stellard	ae4c9e7bc3	R600/SI: Add patterns for ctpop inside a branch llvm-svn: 211378	2014-06-20 17:06:11 +00:00
Tom Stellard	9c603ebca4	R600/SI: Add a pattern for f32 ftrunc llvm-svn: 211377	2014-06-20 17:06:09 +00:00
Tom Stellard	a79e9f0f6d	R600: Expand vector flog2 llvm-svn: 211376	2014-06-20 17:06:07 +00:00
Tom Stellard	5222a88653	R600: Expand vector fexp2 llvm-svn: 211375	2014-06-20 17:06:05 +00:00
Tom Stellard	de16a2e59f	R600/SI: SI Control Flow Annotation bug fixed Mixing of AddAvailableValue and GetValueAtEndOfBlock methods of SSAUpdater leaded to the endless loop generation when the nested loops annotated. This fixes a bug in the OCL_ML/KNN OpenCV test. The test case is too complex for FileCheck and would be very fragile. Patch by: Elena Denisova llvm-svn: 211374	2014-06-20 17:06:02 +00:00
Tom Stellard	c9dedb8e29	R600/SI: Add a VALU pattern for i64 xor llvm-svn: 211373	2014-06-20 17:05:57 +00:00
Ulrich Weigand	59c6ab20d6	[PowerPC] Fix small argument stack slot offset for LE When small arguments (structures < 8 bytes or "float") are passed in a stack slot in the ppc64 SVR4 ABI, they must reside in the least significant part of that slot. On BE, this means that an offset needs to be added to the stack address of the parameter, but on LE, the least significant part of the slot has the same address as the slot itself. This changes the PowerPC back-end ABI code to only add the small argument stack slot offset for BE. It also adds test cases to verify the correct behavior on both BE and LE. llvm-svn: 211368	2014-06-20 16:34:05 +00:00
Rafael Espindola	e5bb30d9a7	Move test so that it is skipped if the ARM target is not enabled. llvm-svn: 211366	2014-06-20 15:30:38 +00:00
Rafael Espindola	1fc003e6c5	Allow a target to create a null streamer. Targets can assume that a target streamer is present, so they have to be able to construct a null streamer in order to set the target streamer in it to. Fixes a crash when using the null streamer with arm. llvm-svn: 211358	2014-06-20 13:11:28 +00:00
Yaron Keren	3eb83a0d67	Code in LoopStrengthReduce.cpp depends on SmallBitVector::size() being size_t and not unsigned. llvm-svn: 211356	2014-06-20 12:57:44 +00:00
Yaron Keren	c2a363aa33	Reverting size_type for the containers from size_type to unsigned. Various places in LLVM assume that container size and count are unsigned and do not use the container size_type. Therefore they break compilation (or possibly executation) for LP64 systems where size_t is 64 bit while unsigned is still 32 bit. If we'll ever that many items in the container size_type could be made size_t for a specific containers after reviweing its other uses. llvm-svn: 211353	2014-06-20 12:20:56 +00:00
Yaron Keren	d1109c874a	Attempting to fix the 64 bit bots. llvm-svn: 211351	2014-06-20 10:52:57 +00:00
Yaron Keren	6d3194f7d5	The count() function for STL datatypes returns unsigned, even where it's only 1/0 result like std::set. Some of the LLVM ADT already return unsigned count(), while others still return bool count(). In continuation to r197879, this patch modifies DenseMap, DenseSet, ScopedHashTable, ValueMap:: count() to return size_type instead of bool, 1 instead of true and 0 instead of false. size_type is typedef-ed locally within each class to size_t. http://reviews.llvm.org/D4018 Reviewed by dblaikie. llvm-svn: 211350	2014-06-20 10:26:56 +00:00
Oliver Stannard	5dc2934ba2	Emit the ARM build attributes ABI_PCS_wchar_t and ABI_enum_size. Emit the ARM build attributes ABI_PCS_wchar_t and ABI_enum_size based on module flags metadata. llvm-svn: 211349	2014-06-20 10:08:11 +00:00
Zoran Jovanovic	6a29b55a5a	ps][mips64r6] Added LSA/DLSA instructions Differential Revision: http://reviews.llvm.org/D3897 llvm-svn: 211346	2014-06-20 09:28:09 +00:00
Matt Arsenault	f5e2997aff	R600: Trivial subtarget feature cleanups. Remove an unused AMDIL leftover, correct extra periods appearing in the help menu. llvm-svn: 211341	2014-06-20 06:50:05 +00:00
Justin Bogner	6f07046808	ArgList: use MakeArgList overloads in subclasses and clean up some calls. llvm-svn: 211340	2014-06-20 04:36:29 +00:00
Karthik Bhat	e03a25da70	Add Support to Recognize and Vectorize NON SIMD instructions in SLPVectorizer. This patch adds support to recognize patterns such as fadd,fsub,fadd,fsub.../add,sub,add,sub... and vectorizes them as vector shuffles if they are profitable. These patterns of vector shuffle can later be converted to instructions such as addsubpd etc on X86. Thanks to Arnold and Hal for the reviews. http://reviews.llvm.org/D4015 llvm-svn: 211339	2014-06-20 04:32:48 +00:00
Duncan P. N. Exon Smith	2800a3770d	Support: Clean up getRounded() tests llvm-svn: 211337	2014-06-20 02:31:07 +00:00
Duncan P. N. Exon Smith	e9e44cd189	Support: Write ScaledNumbers::getAdjusted() llvm-svn: 211336	2014-06-20 02:31:03 +00:00
Rafael Espindola	bfb8b9152b	Small clanups: Use static instead of anonymous namespace. Delete write only variables. llvm-svn: 211335	2014-06-20 01:37:35 +00:00
Hans Wennborg	cfe341f5d0	Fix .cpp files claiming to be header files llvm-svn: 211334	2014-06-20 01:36:00 +00:00
Duncan P. N. Exon Smith	9c62dd583b	Support: Write ScaledNumbers::getRounded() Start extracting helper functions out of -block-freq's `UnsignedFloat` into `Support/ScaledNumber.h` with the eventual goal of moving and renaming the class to `ScaledNumber`. The bike shed about names is still being painted, but I'm going with this for now. llvm-svn: 211333	2014-06-20 01:30:43 +00:00
Chandler Carruth	8366cebeb5	[x86] Make the x86 PACKSSWB, PACKSSDW, PACKUSWB, and PACKUSDW instructions available as synthetic SDNodes PACKSS and PACKUS that will select to the correct instruction variants based on the return type. This allows us to use these rather important instructions when lowering vector shuffles. Also moves the relevant instruction definitions to be split out from the fully generic multiclasses to allow them to match these new SDNodes in the same way that the UNPCK instructions do. No functionality should actually be changed here. llvm-svn: 211332	2014-06-20 01:05:28 +00:00
Hans Wennborg	4dc895164a	Don't build switch lookup tables for dllimport or TLS variables We would previously put dllimport variables in switch lookup tables, which doesn't work because the address cannot be used in a constant initializer. This is basically the same problem that we have in PR19955. Putting TLS variables in switch tables also desn't work, because the address of such a variable is not constant. Differential Revision: http://reviews.llvm.org/D4220 llvm-svn: 211331	2014-06-20 00:38:12 +00:00
Rafael Espindola	393b2b594f	Revert "Add StringMap::insert(pair) consistent with the standard associative container concept." This reverts commit r211309. It looks like it broke some bots: http://lab.llvm.org:8011/builders/clang-x86_64-ubuntu-gdb-75/builds/15563/steps/compile/logs/stdio llvm-svn: 211328	2014-06-20 00:23:03 +00:00
Kevin Enderby	14a96ac343	Added the -m option as an alias for -format=darwin to llvm-nm and llvm-size which is what the darwin tools use for the Mach-O format output. llvm-svn: 211326	2014-06-20 00:04:16 +00:00
Rafael Espindola	562e0d8023	The gold plugin doesn't need disassemblers. Back in r128440 tools/LTO started exporting the disassembler interface. It was never clear why, but whatever the reason I am pretty sure it doesn't hold for tools/gold. llvm-svn: 211325	2014-06-19 23:06:53 +00:00
Rafael Espindola	c273aac3a1	Set gold plugin options in a sane order. This fixes the processing of --plugin-opt=-jump-table-type=arity. Nice properties: * We call InitTargetOptionsFromCodeGenFlags once. * We call parseCodeGenDebugOptions once. * It works :-) llvm-svn: 211322	2014-06-19 22:54:47 +00:00
Kevin Enderby	1e1b992ad7	Fix the output of llvm-nm for Mach-O files to use the characters ‘d’ and ‘b’ for data and bss symbols instead of the generic ’s’ for a symbol in a section. llvm-svn: 211321	2014-06-19 22:49:21 +00:00
Rafael Espindola	b201bfcbce	Simplify. No functionality change. Thanks to Alp Toker for noticing it. llvm-svn: 211320	2014-06-19 22:33:23 +00:00
Rafael Espindola	70d3c20b0f	Use the assignment operator. No functionality change. llvm-svn: 211319	2014-06-19 22:27:46 +00:00
Rafael Espindola	a0d30a9977	Reduce indentation. No functionality change. llvm-svn: 211318	2014-06-19 22:20:07 +00:00
Rafael Espindola	a064b0c476	Set missing options in LTOCodeGenerator::setTargetOptions. Patch by Tom Roeder, I just added the test. llvm-svn: 211317	2014-06-19 22:14:12 +00:00
Kevin Enderby	1983fcf86c	Change the output of llvm-nm and llvm-size for Mach-O universal files (aka fat files) to print “ (for architecture XYZ)” for fat files with more than one architecture to be like what the darwin tools do for fat files. Also clean up the Mach-O printing of archive membernames in llvm-nm to use the darwin form of "libx.a(foo.o)". llvm-svn: 211316	2014-06-19 22:03:18 +00:00
Rafael Espindola	6b244b1348	Use lib/LTO directly in the gold plugin. The tools/lto API is not the best choice for implementing a gold plugin. Among other issues: * It is an stable ABI. Old errors stay and we have to be really careful before adding new features. * It has to support two fairly different linkers: gold and ld64. * We end up with a plugin that depends on a shared lib, something quiet unusual in LLVM land. * It hides LLVM. For some features in the gold plugin it would be really nice to be able to just get a Module or a GlobalValue. This change is intended to be a very direct translation from the C API. It will just enable other fixes and cleanups. Tested with a LTO bootstrap on linux. llvm-svn: 211315	2014-06-19 21:14:13 +00:00
Eric Christopher	c40e5edbbc	Add a new subtarget hook for whether or not we'd like to enable the atomic load linked expander pass to run for a particular subtarget. This requires a check of the subtarget and so save the TargetMachine rather than only TargetLoweringInfo and update all callers. llvm-svn: 211314	2014-06-19 21:03:04 +00:00
Zachary Turner	5165b37c63	Include Threading.h instead of forward declaring a function. Previously this led to a circular header dependency, but a recent change has since removed this dependency, so the correct fix is to simply include the header rather than forward declare. llvm-svn: 211311	2014-06-19 20:20:03 +00:00
David Blaikie	37700dc057	Add StringMap::insert(pair) consistent with the standard associative container concept. Patch by Agustín Bergé. llvm-svn: 211309	2014-06-19 20:08:56 +00:00
Eric Christopher	b0a78ca11a	Since we're using DW_AT_string rather than DW_AT_strp for debug_info for assembly files we can't depend on the offset within the section after a string since it could be different between producers etc. Relax these tests accordingly. llvm-svn: 211308	2014-06-19 20:00:13 +00:00
Eric Christopher	d29430dae9	Fix up a few formatting issues. llvm-svn: 211307	2014-06-19 20:00:09 +00:00
Rafael Espindola	64a86e5fc2	Remove an incorrect fixme. dynamic-no-pic is just another output type. If gnu ld gets support for MachO, it should also add something like LDPO_DYN_NO_PIC to the plugin interface. llvm-svn: 211305	2014-06-19 19:45:25 +00:00
Alp Toker	1d099d9339	Fix typos llvm-svn: 211304	2014-06-19 19:41:26 +00:00
Justin Bogner	cd45f963e2	Support: Add llvm::sys::fs::copy_file A function to copy one file's contents to another. llvm-svn: 211302	2014-06-19 19:35:39 +00:00
David Greene	03b1c3f438	Remove bogus configure check Configure creates makefiles, so it doesn't make sense to check for them to see if we can configure. llvm-svn: 211301	2014-06-19 19:31:11 +00:00
David Greene	6367738990	Add option to keep flavor out of the install directory Sometimes we want to install things in "standard" locations and the flavor directories interfere with that. Add an option to keep them out of the install path. llvm-svn: 211300	2014-06-19 19:31:09 +00:00
David Greene	9ccdb1700c	Turn of -Werror by default Don't build with -Werror unless asked to. llvm-svn: 211299	2014-06-19 19:31:05 +00:00
Eric Christopher	be5184c44d	Fix this test a little harder - use llc_dwarf to make sure we don't try to execute it on windows. llvm-svn: 211298	2014-06-19 19:26:42 +00:00
Alp Toker	ec9b42a907	Remove unused includes following r211294 llvm-svn: 211297	2014-06-19 19:25:49 +00:00
Rafael Espindola	77c50d2394	Use the c++ APIs. No functionality change. llvm-svn: 211294	2014-06-19 19:11:22 +00:00
Eric Christopher	1f5faf7f0a	Relax this test a bit, we don't need the full contents of the frame section to match, just the version for this test. llvm-svn: 211293	2014-06-19 18:36:15 +00:00
David Blaikie	df4d5efc7c	Remove use of removed function, llvm_stop_multithreading llvm-svn: 211291	2014-06-19 18:26:28 +00:00
David Blaikie	9786757510	Remove circular header reference in Threading.h/Mutex.h llvm-svn: 211290	2014-06-19 18:26:26 +00:00
Zachary Turner	21fdc93272	Fix build on non-Windows platforms. llvm-svn: 211288	2014-06-19 18:25:06 +00:00
Zachary Turner	9c9710eaf4	Remove support for LLVM runtime multi-threading. After a number of previous small iterations, the functions llvm_start_multithreaded() and llvm_stop_multithreaded() have been reduced essentially to no-ops. This change removes them entirely. Reviewed by: rnk, dblaikie Differential Revision: http://reviews.llvm.org/D4216 llvm-svn: 211287	2014-06-19 18:18:23 +00:00
David Blaikie	de8e12a49a	DebugInfo: Fission: Ensure the address pool entries for location lists are emitted. The address pool was being emitted before location lists. The latter could add more entries to the pool which would be lost/never emitted. llvm-svn: 211284	2014-06-19 17:59:14 +00:00
Alp Toker	660839f210	MCNullStreamer: assign file IDs to resolve crashes and errors Use the MCStreamer base implementations for file ID tracking instead of overriding them as no-ops. Avoids assertions when streaming Dwarf debug info, and fixes ASM parsing of loc and file directives. llvm-svn: 211282	2014-06-19 17:15:36 +00:00
Jingyue Wu	37fcb5919d	[ValueTracking] Extend range metadata to call/invoke Summary: With this patch, range metadata can be added to call/invoke including IntrinsicInst. Previously, it could only be added to load. Rename computeKnownBitsLoad to computeKnownBitsFromRangeMetadata because range metadata is not only used by load. Update the language reference to reflect this change. Test Plan: Add several tests in range-2.ll to confirm the verifier is happy with having range metadata on call/invoke. Add two tests in AddOverFlow.ll to confirm annotating range metadata to call/invoke can benefit InstCombine. Reviewers: meheff, nlewycky, reames, hfinkel, eliben Reviewed By: eliben Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D4187 llvm-svn: 211281	2014-06-19 16:50:16 +00:00
Oliver Stannard	d306c3cec2	Tests for r211273 llvm-svn: 211279	2014-06-19 16:35:19 +00:00
Zachary Turner	6ad2444d5b	Kill the LLVM global lock. This patch removes the LLVM global lock, and updates all existing users of the global lock to use their own mutex. None of the existing users of the global lock were protecting code that was mutually exclusive with any of the other users of the global lock, so its purpose was not being met. Reviewed by: rnk Differential Revision: http://reviews.llvm.org/D4142 llvm-svn: 211277	2014-06-19 16:17:42 +00:00
Oliver Stannard	8b27308617	Emit DWARF info for all code section in an assembly file Currently, when using llvm as an assembler, DWARF debug information is only generated for the .text section. This patch modifies this so that DWARF info is emitted for all executable sections. llvm-svn: 211273	2014-06-19 15:52:37 +00:00
Oliver Stannard	f7693f4c1f	Emit DWARF3 call frame information when DWARF3+ debug info is requested Currently, llvm always emits a DWARF CIE with a version of 1, even when emitting DWARF 3 or 4, which both support CIE version 3. This patch makes it emit the newer CIE version when we are emitting DWARF 3 or 4. This will not reduce compatibility, as we already emit other DWARF3/4 features, and is worth doing as the DWARF3 spec removed some ambiguities in the interpretation of call frame information. It also fixes a minor bug where the "return address" field of the CIE was encoded as a ULEB128, which is only valid when the CIE version is 3. There are no test changes for this, because (as far as I can tell) none of the platforms that we test have a return address register with a DWARF register number >127. llvm-svn: 211272	2014-06-19 15:39:33 +00:00
Matheus Almeida	4f7ef8c6ef	[mips] Implementation of dli. Patch by David Chisnall His work was sponsored by: DARPA, AFRL Some small modifications to the original patch: we now error if it's not possible to expand an instruction (mips-expansions-bad.s has some examples). Added some comments to the expansions. llvm-svn: 211271	2014-06-19 15:08:04 +00:00
Matheus Almeida	3813d57929	[mips] Small update to the logic behind the expansion of assembly pseudo instructions. Summary: The functions that do the expansion now return false on success and true otherwise. This is so we can catch some errors during the expansion (e.g.: immediate too large). The next patch adds some test cases. Reviewers: vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D4214 llvm-svn: 211269	2014-06-19 14:39:14 +00:00
Dinesh Dwivedi	8bb5fb0661	Updated comments as suggested by Rafael. Thanks. llvm-svn: 211268	2014-06-19 14:11:53 +00:00
Dinesh Dwivedi	562fd7534c	Added instruction combine to transform few more negative values addition to subtraction (Part 1) This patch enables transforms for following patterns. (x + (~(y & c) + 1) --> x - (y & c) (x + (~((y >> z) & c) + 1) --> x - ((y>>z) & c) Differential Revision: http://reviews.llvm.org/D3733 llvm-svn: 211266	2014-06-19 10:36:52 +00:00
Andrea Di Biagio	54b0949af9	[X86] Teach how to combine horizontal binop even in the presence of undefs. Before this change, the backend was unable to fold a build_vector dag node with UNDEF operands into a single horizontal add/sub. This patch teaches how to combine a build_vector with UNDEF operands into a horizontal add/sub when possible. The algorithm conservatively avoids to combine a build_vector with only a single non-UNDEF operand. Added test haddsub-undef.ll to verify that we correctly fold horizontal binop even in the presence of UNDEFs. llvm-svn: 211265	2014-06-19 10:29:41 +00:00
Dinesh Dwivedi	b62e52e1b5	Refactored and updated SimplifyUsingDistributiveLaws() to * Find factorization opportunities using identity values. * Find factorization opportunities by treating shl(X, C) as mul (X, shl(C)) * Keep NSW flag while simplifying instruction using factorization. This fixes PR19263. Differential Revision: http://reviews.llvm.org/D3799 llvm-svn: 211261	2014-06-19 08:29:18 +00:00
Alp Toker	fb39de3be7	CommandLine: bail out when options get multiply registered These errors are strictly unrecoverable and indicate serious issues such as conflicting option names or an incorrectly linked LLVM distribution. With this change, the errors actually get detected so tests don't pass silently. llvm-svn: 211260	2014-06-19 07:25:25 +00:00
Alp Toker	0b346e6be7	Remove OwningPtr.h and associated tests llvm::OwningPtr is superseded by std::unique_ptr. llvm-svn: 211259	2014-06-19 07:25:18 +00:00
David Majnemer	6cf6c05322	InstCombine: Stop two transforms dueling InstCombineMulDivRem has: // Canonicalize (X+C1)CI -> XCI+C1CI. InstCombineAddSub has: // WX + YZ --> W (X+Z) iff W == Y These two transforms could fight with each other if C1CI would not fold away to something simpler than a ConstantExpr mul. The InstCombineMulDivRem transform only acted on ConstantInts until r199602 when it was changed to operate on all Constants in order to let it fire on ConstantVectors. To fix this, make this transform more careful by checking to see if we actually folded away C1CI. This fixes PR20079. llvm-svn: 211258	2014-06-19 07:14:33 +00:00
Eric Christopher	4c5bff36ad	Move -dwarf-version to an MC level command line option so it's used by all of the MC level tools and codegen. Fix up all uses in the compiler to use this and set it on the context accordingly. llvm-svn: 211257	2014-06-19 06:22:08 +00:00
Eric Christopher	07634e2a5b	Remove unnecessary include. llvm-svn: 211256	2014-06-19 06:22:05 +00:00
Eric Christopher	23c6d1f41a	80-column fixups. llvm-svn: 211255	2014-06-19 06:22:01 +00:00
Craig Topper	35b2f75733	Convert some assert(0) to llvm_unreachable or fold an 'if' condition into the assert. llvm-svn: 211254	2014-06-19 06:10:58 +00:00
Matt Arsenault	8e34ecb797	R600: Add a few tests I forgot to add. These belong with r210827 llvm-svn: 211253	2014-06-19 04:24:43 +00:00
Nick Lewycky	8561a49c27	Move optimization of some cases of (A & C1)\|(B & C2) from instcombine to instsimplify. Patch by Rahul Jain, plus some last minute changes by me -- you can blame me for any bugs. llvm-svn: 211252	2014-06-19 03:51:46 +00:00
Nick Lewycky	c961030ac2	Make instsimplify's analysis of icmp eq/ne use computeKnownBits to determine whether the icmp is always true or false. Patch by Suyog Sarda! llvm-svn: 211251	2014-06-19 03:35:49 +00:00
Nick Lewycky	802df52424	Remove redundant code in InstCombineShift, no functionality change because instsimplify already does this and instcombine calls instsimplify a few lines above. Patch by Suyog Sarda! llvm-svn: 211250	2014-06-19 03:28:28 +00:00
David Majnemer	6a5b812c7b	MS asm: Properly handle quoted symbol names We would get confused by '@' characters in symbol names, we would mistake the text following them for the variant kind. When an identifier a string, the variant kind will never show up inside of it. Instead, check to see if there is a variant following the string. This fixes PR19965. llvm-svn: 211249	2014-06-19 01:25:43 +00:00
Matt Arsenault	a0050b0961	R600/SI: Add intrinsics for various math instructions. These will be used for custom lowering and for library implementations of various math functions, so it's useful to expose these as builtins. llvm-svn: 211247	2014-06-19 01:19:19 +00:00
David Blaikie	d3d6de2703	Fix breakage from r211244 by using LLVM_EXPLICIT to avoid using explicit operators under MSVC where they're not supported. llvm-svn: 211246	2014-06-19 01:09:49 +00:00
Nikola Smiljanic	89e561a63e	PR10140 - StringPool's PooledStringPtr has non-const operator== causing bad OR-result. Mark conversion operator explicit and const qualify comparison operators. llvm-svn: 211244	2014-06-19 00:26:49 +00:00
Eric Christopher	3d19f1388f	Move ARMJITInfo off of the TargetMachine and down onto the subtarget. This required untangling a mess of headers that included around. This a recommit of r210953 with a fix for the removed accessor for JITInfo. llvm-svn: 211233	2014-06-18 22:48:09 +00:00
Matt Arsenault	2b0fa433a0	Use stdint macros for specifying size of constants llvm-svn: 211231	2014-06-18 22:11:03 +00:00
Kevin Enderby	4b8fc281d4	Teach llvm-size to know about Mach-O universal files (aka fat files) and fat files containing archives. Also fix a bug in MachOUniversalBinary::ObjectForArch::ObjectForArch() where it needed a >= when comparing the Index with the number of objects in a fat file. As the index starts at 0. llvm-svn: 211230	2014-06-18 22:04:40 +00:00
Matt Arsenault	692bd5ec2f	R600: Handle fnearbyint The difference from rint isn't really relevant here, so treat them as equivalent. OpenCL doesn't have nearbyint, so this is sort of pointless other than for completeness. llvm-svn: 211229	2014-06-18 22:03:45 +00:00
Marek Olsak	51b8e7b2e7	R600/SI: add gather4 and getlod intrinsics (v3) This contains all the previous patches + getlod support on top of it. It doesn't use SDNodes anymore, so it's quite small. It also adds v16i8 to SReg_128, which is used for the sampler descriptor. Reviewed-by: Tom Stellard llvm-svn: 211228	2014-06-18 22:00:29 +00:00
Matt Arsenault	b55c68f171	Use LL suffix for literal that should be 64-bits. This hopefully fixes Windows llvm-svn: 211225	2014-06-18 21:40:43 +00:00
Rafael Espindola	1fe2c5ab27	Add a symbols() range and use a range loop. llvm-svn: 211222	2014-06-18 21:14:57 +00:00
Rafael Espindola	794112a91f	Simplify code. We can delete the objects earlier now that we are copying the names to a buffer. llvm-svn: 211221	2014-06-18 21:08:17 +00:00
Saleem Abdulrasool	71ede29e9c	MC: do not add comment string to the AsmToken in AsmLexer::LexLineComment Fixes macros with varargs if the macro instantiation has a trailing comment. Patch by Janne Grunau! llvm-svn: 211219	2014-06-18 20:57:32 +00:00
Saleem Abdulrasool	763e2cb6e5	MCAsmParser: full support for gas' '.if{cond} expression' directives Patch by Janne Grunau! llvm-svn: 211218	2014-06-18 20:57:28 +00:00
Zachary Turner	62ce4e88fd	Replace Execution Engine's mutex with std::recursive_mutex. This change has a bit of a trickle down effect due to the fact that there are a number of derived implementations of ExecutionEngine, and that the mutex is not tightly encapsulated so is used by other classes directly. Reviewed by: rnk Differential Revision: http://reviews.llvm.org/D4196 llvm-svn: 211214	2014-06-18 20:17:35 +00:00
Rafael Espindola	8fb3111248	Revert a C API difference that I incorrectly introduced. LLVMGetBitcodeModuleInContext should not take ownership on error. I will try to localize this odd api requirement, but this should get the bots green. llvm-svn: 211213	2014-06-18 20:07:35 +00:00
Rafael Espindola	ccf10727b0	Make getBaseObject static. Thanks to David Majnemer for noticing. llvm-svn: 211208	2014-06-18 19:08:47 +00:00
Rafael Espindola	8af5cb2c28	Change IRObjectFile to parse the bitcode lazily. The main point of this class is to provide a cheap object interface to a bitcode file, so it has to be as lazy as possible. llvm-svn: 211207	2014-06-18 19:05:24 +00:00
Rafael Espindola	a1ea4ccc06	Remove BitcodeReader::setBufferOwned. We do have use cases for the bitcode reader owning the buffer or not, but we always know which one we have when we construct it. It might be possible to simplify this further, but this is a step in the right direction. llvm-svn: 211205	2014-06-18 18:55:41 +00:00
Diego Novillo	b2ad56effc	Simply test for available locations in optimization remarks. When emitting optimization remarks, we test for the presence of instruction locations by testing for a valid llvm.dbg.cu annotation. This is slightly inefficient because we can simply ask whether the debug location we have is known or not. Additionally, if my current plan works, I will need to remove the llvm.dbg.cu annotation from the IL (or prevent it from being generated) when -Rpass is used without -g. In those cases, we'll want to generate line tables but we will want to prevent code generation from emitting DWARF code for them. Tested on x86_64. llvm-svn: 211204	2014-06-18 18:46:58 +00:00
Ulrich Weigand	f460d69ada	[PowerPC] Remove unnecessary load of r12 in indirect call When looking at the 64-bit SVR4 indirect call sequence, I noticed an unnecessary load of r12. And indeed the code says: // R12 must contain the address of an indirect callee. But this is not correct; in the 64-bit SVR4 (ELFv1) ABI, there is no need to load r12 at this point. It seems this code and comment is a remnant of code originally shared with the Darwin ABI ... This patch simply removes the unnecessary load. llvm-svn: 211203	2014-06-18 18:33:36 +00:00
Rafael Espindola	f9ba889c61	Update to the latest registered ELF e_machine names and values. Patch by John Wolf! llvm-svn: 211202	2014-06-18 18:30:15 +00:00
Rafael Espindola	cd2de416eb	Run clang-format in a small chunk of code I am about to change. llvm-svn: 211201	2014-06-18 18:26:53 +00:00
Justin Bogner	989574ca99	ProfileData: Fix copy-paste type in RawInstrProfReader These deleted definitions had the wrong types. Patch by Alex L! llvm-svn: 211199	2014-06-18 18:20:44 +00:00
Weiming Zhao	8c89973462	[ARM] [MC] Refactor the constant pool classes ARMTargetStreamer implements ConstantPool and AssmeblerConstantPools to keep track of assembler-generated constant pools that are used for ldr-pseudo. When implementing ldr-pseudo for AArch64, these two classes can be reused. So this patch factors them out from ARM target to the general MC lib. llvm-svn: 211198	2014-06-18 18:17:25 +00:00
Ed Maste	f054c7ed68	ADT: correct typo in comment llvm-svn: 211196	2014-06-18 18:08:55 +00:00
Jan Vesely	85f0dbce5c	R600: Expand vector fceil Move fp64 fceil tests to fceil64.ll v2: rebase Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 211194	2014-06-18 17:57:29 +00:00
Ulrich Weigand	ad0cb91ed9	[PowerPC] Simplify and improve loading into TOC register During an indirect function call sequence on the 64-bit SVR4 ABI, generate code must load and then restore the TOC register. This does not use a regular LOAD instruction since the TOC register r2 is marked as reserved. Instead, the are two special instruction patterns: let RST = 2, DS = 2 in def LDinto_toc: DSForm_1a<58, 0, (outs), (ins g8rc:$reg), "ld 2, 8($reg)", IIC_LdStLD, [(PPCload_toc i64:$reg)]>, isPPC64; let RST = 2, DS = 10, RA = 1 in def LDtoc_restore : DSForm_1a<58, 0, (outs), (ins), "ld 2, 40(1)", IIC_LdStLD, [(PPCtoc_restore)]>, isPPC64; Note that these not only restrict the destination of the load to r2, but they also restrict the source of the load to particular address combinations. The latter is a problem when we want to support the ELFv2 ABI, since there the TOC save slot is no longer at 40(1). This patch replaces those two instructions with a single instruction pattern that only hard-codes r2 as destination, but supports generic addresses as source. This will allow supporting the ELFv2 ABI, and also helps generate more efficient code for calls to absolute addresses (allowing simplification of the ppc64-calls.ll test case). llvm-svn: 211193	2014-06-18 17:52:49 +00:00
Matt Arsenault	d22626f6bb	Work around ridiculous warning. Apparently C++ doesn't really have hex floating point constants. llvm-svn: 211192	2014-06-18 17:45:58 +00:00
Ulrich Weigand	e581920d12	[PowerPC] Add back test case for absolute calls (removed in r211174) As requested by Hal Finkel, this adds back a test for calls to a known-constant function pointer value, and verifies that the 64-bit SVR4 indirect function call sequence is used. llvm-svn: 211190	2014-06-18 17:28:56 +00:00
Arnold Schwaighofer	fc308f5c9f	Add a triple so that right syntax is choosen on mac osx systems llvm-svn: 211188	2014-06-18 17:20:49 +00:00
Matt Arsenault	43160e7af2	R600/SI: Add intrinsics for brev instructions llvm-svn: 211187	2014-06-18 17:13:57 +00:00
Matt Arsenault	dbc9aae1fb	R600/SI: Prettier operand printing for 64-bit ops. Copy what is done for 32-bit already so the order is about the same. llvm-svn: 211186	2014-06-18 17:13:51 +00:00
Matheus Almeida	784f797d4c	[mips] SYNC $stype instruction was added in Mips32 but SYNC with an implied operand ($stype = 0) is valid since Mips2. llvm-svn: 211185	2014-06-18 17:10:30 +00:00
Rafael Espindola	24d8b84838	Fix a memory leak in the error path. llvm-svn: 211184	2014-06-18 17:07:15 +00:00
Matt Arsenault	4601093267	R600: Implement f64 ftrunc, ffloor and fceil. CI has instructions for these, so this fixes them for older hardware. llvm-svn: 211183	2014-06-18 17:05:30 +00:00
Matt Arsenault	e8208ec95b	R600: Custom lower f64 frint for pre-CI llvm-svn: 211182	2014-06-18 17:05:26 +00:00
Matt Arsenault	7aeb813b2a	R600/SI: Temporary fix for f64 fneg This should be a source modifier, but this unblocks most of my math patches. llvm-svn: 211181	2014-06-18 17:05:22 +00:00
Matt Arsenault	520e7c44c1	R600/SI: Comparisons set vcc. llvm-svn: 211178	2014-06-18 16:53:48 +00:00
Adam Nemet	efd0785d82	[X86] AVX512: Add non-temporal stores Note that I followed the AVX2 convention here and didn't add LLVM intrinsics for stores. These can be generated with the nontemporal hint on LLVM IR stores (see new test). The GCC builtins are lowered directly into nontemporal stores. <rdar://problem/17082571> llvm-svn: 211176	2014-06-18 16:51:10 +00:00
Adam Nemet	ded81a810c	[X86] AVX512: Specify compressed displacement for vmovntdqa Use the max 64-bit element size with EVEX_CD8. This should work since element size is ignored for a full-vector access (FVM). llvm-svn: 211175	2014-06-18 16:51:07 +00:00
Ulrich Weigand	9aa09ef30f	[PowerPC] Do not use BLA with the 64-bit SVR4 ABI The PowerPC back-end uses BLA to implement calls to functions at known-constant addresses, which is apparently used for certain system routines on Darwin. However, with the 64-bit SVR4 ABI, this is actually incorrect. An immediate function pointer value on this platform is not directly usable as a target address for BLA: - in the ELFv1 ABI, the function pointer value refers to the function descriptor, not the code address - in the ELFv2 ABI, the function pointer value refers to the global entry point, but BL(A) would only be correct when calling the local entry point This bug didn't show up since using immediate function pointer values is not usually done in the 64-bit SVR4 ABI in the first place. However, I ran into this issue with a certain use case of LLVM as JIT, where immediate function pointer values were uses to implement callbacks from JITted code to helpers in statically compiled code. Fixed by simply not using BLA with the 64-bit SVR4 ABI. llvm-svn: 211174	2014-06-18 16:14:04 +00:00
Ulrich Weigand	dbb3e3e64f	Do not XFAIL test/tools/llvm-cov tests on powerpc64le All tests in test/tools/llvm-cov fail on big-endian targets and are supposed to be XFAILed there. However, including "powerpc64" in the XFAIL line is now incorrect, since that matches both powerpc64- and powerpc64le- targets, and the tests pass on the latter. Update the XFAIL lines to use powerpc64- instead (like mips64-). llvm-svn: 211172	2014-06-18 15:52:18 +00:00
Ulrich Weigand	7c3f0dc7e4	[PowerPC] Fix emitting instruction pairs on LE My patch r204634 to emit instructions in little-endian format failed to handle those special cases where we emit a pair of instructions from a single LLVM MC instructions (like the bl; nop pairs used to implement the call sequence). In those cases, we still need to emit the "first" instruction (the one in the more significant word) first, on both big and little endian, and not swap them. llvm-svn: 211171	2014-06-18 15:37:07 +00:00
Ulrich Weigand	457e606d1f	Support LE in RelocVisitor::visitELF_PPC64_* Since we now support both LE and BE PPC64 variants, use of getAddend64BE is no longer correct. Use the generic getELFRelocationAddend instead, as was already done for Mips. llvm-svn: 211170	2014-06-18 15:15:49 +00:00
Matheus Almeida	78f8b7b652	[mips] Fix expansion of memory operation if destination register is not a GPR. Summary: The assembler tries to reuse the destination register for memory operations whenever it can but it's not possible to do so if the destination register is not a GPR. Example: ldc1 $f0, sym should expand to: lui $at, %hi(sym) ldc1 $f0, %lo(sym)($at) It's entirely wrong to expand to: lui $f0, %hi(sym) ldc1 $f0, %lo(sym)($f0) Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D4173 llvm-svn: 211169	2014-06-18 14:49:56 +00:00
Matheus Almeida	7de68e77aa	[mips] Report correct location when "erroring" about the use of $at when it's not available. Summary: This removes the FIXMEs from test/MC/Mips/mips-noat.s. Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D4172 llvm-svn: 211168	2014-06-18 14:46:05 +00:00
Zoran Jovanovic	5c14b06940	[mips][mips64r6] Add BLTC and BLTUC instructions Differential Revision: http://reviews.llvm.org/D3923 llvm-svn: 211167	2014-06-18 14:36:00 +00:00
Matheus Almeida	29e254f849	[mips] Access $at only if necessary. Summary: This patch doesn't really change the logic behind expandMemInst but it allows us to assemble .S files that use .set noat with some macros. For example: .set noat lw $k0, offset($k1) Can expand to: lui $k0, %hi(offset) addu $k0, $k0, $k1 lw $k0, %lo(offset)($k0) with no need to access $at. Reviewers: dsanders, vmedic Reviewed By: dsanders, vmedic Differential Revision: http://reviews.llvm.org/D4159 llvm-svn: 211165	2014-06-18 14:15:42 +00:00
Cameron McInally	f10a7c963b	Add pattern for unsigned v4i32->v4f64 convert on AVX512. llvm-svn: 211164	2014-06-18 14:04:37 +00:00
Matheus Almeida	ee73cc5894	[mips] Update MipsAsmParser so that it's possible to handle immediates that start with the binary operator NOT (~). Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D4158 llvm-svn: 211163	2014-06-18 13:55:18 +00:00
Matheus Almeida	c3c18956de	[mips] Implement alias for 'and' and 'or' instructions for all ISAs. Summary: Examples: and $2, 4 <=> andi $2, $2, 4 or $2, 4 <=> ori $2, $2, 4 Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D4155 llvm-svn: 211161	2014-06-18 13:30:57 +00:00
Matheus Almeida	7e81576246	[mips] Remove the last usage of parseRegister from MipsAsmParser. Summary: Added negative test case so that we can be sure we handle erroneous situations while parsing the .cpsetup directive. Reviewers: dsanders Reviewed By: dsanders Differential Revision: http://reviews.llvm.org/D3681 llvm-svn: 211160	2014-06-18 13:08:59 +00:00
Jan Vesely	ecf5133a2b	R600: Implement 64bit SRA v2: Use capitalized variable name Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 211159	2014-06-18 12:27:17 +00:00
Jan Vesely	900ff2e74b	R600: Implement 64bit SRL v2: use C++ style comment Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 211158	2014-06-18 12:27:15 +00:00
Jan Vesely	25f362766e	R600: Implement 64bit SHL v2: Use c++ style comment Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 211157	2014-06-18 12:27:13 +00:00
Evgeniy Stepanov	4ea1647e8b	[msan] Handle X86 .psad. and .pmadd. intrinsics. llvm-svn: 211156	2014-06-18 12:02:29 +00:00
Tim Northover	d82ed2e581	DAG: move sret demotion into most basic LowerCallTo implementation. It looks like there are two versions of LowerCallTo here: the SelectionDAGBuilder one is designed to operate on LLVM IR, and the TargetLowering one in the case where everything is at DAG level. Previously, only the SelectionDAGBuilder variant could handle demoting an impossible return to sret semantics (before delegating to the TargetLowering version), but this functionality is also useful for certain libcalls (e.g. 128-bit operations on 32-bit x86). So this commit moves the sret handling down a level. rdar://problem/17242889 llvm-svn: 211155	2014-06-18 11:52:44 +00:00
Simon Atanasyan	c217c4047a	[llvm-readobj] Fix member functions name style. llvm-svn: 211152	2014-06-18 09:24:01 +00:00
Simon Atanasyan	a4ba8ec792	[llvm-readobj] Fix compile error. llvm-svn: 211151	2014-06-18 09:23:55 +00:00
Simon Atanasyan	80433900cc	[llvm-readobj][ELF] New `-mips-plt-got` command line option to output MIPS GOT section. Patch reviewed by Rafael Espindola. llvm-svn: 211150	2014-06-18 08:47:09 +00:00
JF Bastien	acf5bc16e3	Revert "Random Number Generator (llvm)" This reverts commit cccba093090d127e0b6d17473b14c264c14c5259. It causes build breakage. llvm-svn: 211146	2014-06-18 06:33:23 +00:00
JF Bastien	f8ad92da5c	Random Number Generator (llvm) Summary: Provides an abstraction for a random number generator (RNG) that produces a stream of pseudo-random numbers. The current implementation uses C++11 facilities and is therefore not cryptographically secure. The RNG is salted with the text of the current command line invocation. In addition, a user may specify a seed (reproducible builds). In clang, the seed can be set via -frandom-seed=X In the back end, the seed can be set via -rng-seed=X This is the llvm part of the patch. clang part: D3391 Reviewers: ahomescu, rinon, nicholas, jfb Reviewed By: jfb Subscribers: jfb, perl Differential Revision: http://reviews.llvm.org/D3390 llvm-svn: 211145	2014-06-18 06:23:25 +00:00
Kevin Qin	f0ec9aff2a	[AArch64] Fix a pattern match failure caused by creating improper CONCAT_VECTOR. ReconstructShuffle() may wrongly creat a CONCAT_VECTOR trying to concat 2 of v2i32 into v4i16. This commit is to fix this issue and try to generate UZP1 instead of lots of MOV and INS. Patch is initalized by Kevin Qin, and refactored by Tim Northover. llvm-svn: 211144	2014-06-18 05:54:42 +00:00
Craig Topper	2a30d7889f	Replace some assert(0)'s with llvm_unreachable. llvm-svn: 211141	2014-06-18 05:05:13 +00:00
Eric Christopher	f6e456dd63	Add the coverage cflags to the link step as well to make sure that we link in the support libraries. llvm-svn: 211131	2014-06-17 23:27:32 +00:00
Louis Gerbarg	343f5cdfad	Allow X86FastIsel to cope with 64 bit absolute relocations This patch is a follow up to r211040 & r211052. Rather than bailing out of fast isel this patch will generate an alternate instruction (movabsq) instead of the leaq. While this will always have enough room to handle the 64 bit displacment it is generally over kill for internal symbols (most displacements will be within 32 bits) but since we have no way of communicating the code model to the the assmebler in order to avoid flagging an absolute leal/leaq as illegal when using a symbolic displacement. llvm-svn: 211130	2014-06-17 23:22:41 +00:00
Juergen Ributzka	aa60209311	[FastISel][X86] Optimize predicates and fold CMP instructions. This optimizes predicates for certain compares, such as fcmp oeq %x, %x to fcmp ord %x, %x. The latter one is more efficient to generate. The same optimization is applied to conditional branches. llvm-svn: 211126	2014-06-17 21:55:43 +00:00
Zachary Turner	a9380b3efd	Remove more occurrences of the unused-mutex-parameter pattern. This pattern loses some of its usefulness when the mutex type is statically polymorphic as opposed to runtime polymorphic, as swapping out the mutex type requires changing a significant number of function parameters, and templatizing the function parameter requires the methods to be defined in the headers. Furthermore, if LLVM is compiled with threads disabled then there may even be no mutex to acquire anyway, so it should not be up to individual APIs to know whether or not acquiring a mutex is required to use those APIs to begin with. It should be up to the user of the API. llvm-svn: 211125	2014-06-17 21:54:18 +00:00
Tom Stellard	092f332ef2	R600/SI: Make sure target flags are set on pseudo VOP3 instructions llvm-svn: 211120	2014-06-17 19:34:46 +00:00
Hans Wennborg	a3d77e7348	lit: simplify population of the actual_inputs array Add all inputs to the array, except those starting with @, which are treated as response files and expanded. llvm-svn: 211119	2014-06-17 18:17:46 +00:00
Rafael Espindola	58cb745f31	Merge lib/Support/WindowsError.cpp into ib/Support/ErrorHandling.cpp. The OSX ranlib warns on files with no symbols, and lib/Support/WindowsError.cpp was empty when building on non-windows. llvm-svn: 211118	2014-06-17 18:06:45 +00:00
Kevin Enderby	246a460d1f	Add "-format darwin" to llvm-size to be like darwin's size(1) -m output, and and the -l option for the long format. Also when the object is a Mach-O file and the format is berkeley produce output like darwin’s default size(1) summary berkeley derived output. Like System V format, there are also some small changes in how and where the file names and archive member names are printed for darwin and Mach-O. Like the changes to llvm-nm these are the first steps in seeing if it is possible to make llvm-size produce the same output as darwin's size(1). llvm-svn: 211117	2014-06-17 17:54:13 +00:00
Matt Arsenault	295b86e81d	R600/SI: Match cttz_zero_undef llvm-svn: 211116	2014-06-17 17:36:27 +00:00
Matt Arsenault	8579601050	R600/SI: Match ctlz_zero_undef llvm-svn: 211115	2014-06-17 17:36:24 +00:00
Will Schmidt	12090677de	mark the old jit tests as unsupported for powerpc64 (for cmake) mark the old JIT tests as unsupported for powerpc64 - CMake style. This follows the style used for hexagon/arm64/aarch64. The equivalent tests still run under the supported MCJIT/* llvm-svn: 211111	2014-06-17 17:04:42 +00:00
Tom Stellard	880a80ad07	R600: Use LDS and vectors for private memory llvm-svn: 211110	2014-06-17 16:53:14 +00:00
Tom Stellard	85ad429f1f	R600/SI: Add a pattern for llvm.AMDGPU.barrier.global llvm-svn: 211109	2014-06-17 16:53:09 +00:00
Tom Stellard	aad4659470	SelectionDAG: Expand i64 = FP_TO_SINT i32 llvm-svn: 211108	2014-06-17 16:53:07 +00:00
Tom Stellard	8942276a2a	R600/SI: Re-initialize the m0 register after using it for indirect addressing We need to store a value greater than or equal to the number of LDS bytes allocated by the shader in the m0 register in order for LDS instructions to work correctly. We always initialize m0 at the beginning of a shader, but this register is also used for indirect addressing offsets, so we need to re-initialize it any time we use indirect addressing. llvm-svn: 211107	2014-06-17 16:53:04 +00:00
Juergen Ributzka	e35705675f	[FastISel][X86] Fix previous refactoring commit (r211077) Overlooked that fcmp_une uses an "or" instead of an "and" for combining the flags. llvm-svn: 211104	2014-06-17 14:47:45 +00:00
Dinesh Dwivedi	657105e582	Fixed jump threading going to infinite loop. This patch add code to remove unreachable blocks from function as they may cause jump threading to stuck in infinite loop. Differential Revision: http://reviews.llvm.org/D3991 llvm-svn: 211103	2014-06-17 14:34:19 +00:00
James Molloy	f1653b5260	Move SetTheory from utils/TableGen into lib/TableGen so Clang can use it. llvm-svn: 211100	2014-06-17 13:10:38 +00:00
James Molloy	c1fd09ba2c	Fix memory leak of RegScavenger accidentally added in r211037. llvm-svn: 211097	2014-06-17 12:31:41 +00:00
Tim Northover	d5531f72dc	AArch64: estimate inline asm length during branch relaxation To make sure branches are in range, we need to do a better job of estimating the length of an inline assembly block than "it's probably 1 instruction, who'd write asm with more than that?". Fortunately there's already a (highly suspect, see how many ways you can think of to break it!) callback for this purpose, which is used by the other targets. rdar://problem/17277590 llvm-svn: 211095	2014-06-17 11:31:42 +00:00
Evgeniy Stepanov	5d97293e26	[msan] Fix a comment. llvm-svn: 211094	2014-06-17 11:26:00 +00:00
Dmitri Gribenko	ebdd0a52e1	ConvertUTF tests: remove uses of initializer lists to restore compatibility with MSVC llvm-svn: 211093	2014-06-17 09:33:24 +00:00
Evgeniy Stepanov	df187feae4	[msan] Fix handling of multiplication by a constant with a number of trailing zeroes. Multiplication by an integer with a number of trailing zero bits leaves the same number of lower bits of the result initialized to zero. This change makes MSan take this into account in the case of multiplication by a compile-time constant. We don't handle the general, non-constant, case because (a) it's not going to be cheap (computation-wise); (b) multiplication by a partially uninitialized value in user code is a bad idea anyway. Constant case must be handled because it appears from LLVM optimization of a completely valid user code, as the test case in compiler-rt demonstrates. llvm-svn: 211092	2014-06-17 09:23:12 +00:00
Justin Bogner	f7f2cd35dc	Support: Inject LLVM_VERSION_INFO into the Support library Mimic r116632 in passing LLVM_VERSION_INFO from the Makefile build system to the build. This improves the -version output of tools that use llvm::cl under the configure+make system. llvm-svn: 211091	2014-06-17 06:52:47 +00:00
Justin Bogner	581b592414	tools: Add a space between package version and LLVM_VERSION_INFO This reads a little strangely. Add a space to clean it up. llvm-svn: 211090	2014-06-17 06:52:41 +00:00
Rafael Espindola	087d6274ae	Convert a few loops to use ranges. llvm-svn: 211089	2014-06-17 03:00:40 +00:00
Jordan Rose	57ffdb07fd	Add an overload for SourceMgr::PrintMessage that takes an existing diagnostic. llvm-svn: 211087	2014-06-17 02:15:40 +00:00
Jordan Rose	b4cfd0070d	Modernize doc comments for SourceMgr. No functionality change. llvm-svn: 211086	2014-06-17 02:15:36 +00:00
Jingyue Wu	33bd53df7f	[InstCombine] mark ADD with nuw if no unsigned overflow Summary: As a starting step, we only use one simple heuristic: if the sign bits of both a and b are zero, we can prove "add a, b" do not unsigned overflow, and thus convert it to "add nuw a, b". Updated all affected tests and added two new tests (@zero_sign_bit and @zero_sign_bit2) in AddOverflow.ll Test Plan: make check-all Reviewers: eliben, rafael, meheff, chandlerc Reviewed By: chandlerc Subscribers: chandlerc, llvm-commits Differential Revision: http://reviews.llvm.org/D4144 llvm-svn: 211084	2014-06-17 00:42:07 +00:00
Zachary Turner	b07f1e1fdd	Fix build breakage caused by change to ValueMapTest. llvm-svn: 211083	2014-06-17 00:38:40 +00:00
Duncan P. N. Exon Smith	73686d305a	SROA: Only split loads on byte boundaries r199771 accidently broke the logic that makes sure that SROA only splits load on byte boundaries. If such a split happens, some bits get lost when reassembling loads of wider types, causing data corruption. Move the width check up to reject such splits early, avoiding the corruption. Fixes PR19250. Patch by: Björn Steinbrink <bsteinbr@gmail.com> llvm-svn: 211082	2014-06-17 00:19:35 +00:00
Zachary Turner	814a49385c	Expose ValueMap's mutex type as a typedef instead of a sys::Mutex. This enables static polymorphism of the mutex type, which is necessary in order to replace the standard mutex implementation with a different type. llvm-svn: 211080	2014-06-17 00:17:38 +00:00
Juergen Ributzka	2da1bbc113	[FastISel][X86] Refactor the code to get the X86 condition from a helper function. NFC. Make use of helper functions to simplify the branch and compare instruction selection in FastISel. Also add test cases for compare and conditonal branch. llvm-svn: 211077	2014-06-16 23:58:24 +00:00
Eli Bendersky	ff90324599	Teach LoopUnrollPass to respect loop unrolling hints in metadata. [This is resubmitting r210721, which was reverted due to suspected breakage which turned out to be unrelated]. Some extra review comments were addressed. See D4090 and D4147 for more details. The Clang change that produces this metadata was committed in r210667 Patch by Mark Heffernan. llvm-svn: 211076	2014-06-16 23:53:02 +00:00
Zachary Turner	ccbf3d01f0	Revert r211066, 211067, 211068, 211069, 211070. These were committed accidentally from the wrong branch before having a review sign-off. llvm-svn: 211072	2014-06-16 22:49:41 +00:00
Zachary Turner	ff7d1f4af7	Cleanup more unreferenced MutexGuard parameters on functions. These parameters are intended to serve as sort of a contract that you cannot access the functions outside of a mutex. However, the entire JIT class cannot be accessed outside of a mutex anyway, and all methods acquire a lock as soon as they are entered. Since the containing class already is not intended to be thread-safe, it only serves to add code clutter. llvm-svn: 211071	2014-06-16 22:41:08 +00:00
Zachary Turner	0ab833c322	Programmer's Manual changes. llvm-svn: 211070	2014-06-16 22:40:48 +00:00
Zachary Turner	89ae856c46	Kill the LLVM global lock. llvm-svn: 211069	2014-06-16 22:40:42 +00:00
Zachary Turner	d4f7dfe7f2	Remove some code churn. llvm-svn: 211068	2014-06-16 22:40:29 +00:00
Zachary Turner	0f2c641f86	Remove some more code out into a separate CL. llvm-svn: 211067	2014-06-16 22:40:17 +00:00
Zachary Turner	b344f057d0	Users of the llvm global mutex must now acquire it manually. This allows the mutex to be acquired in a guarded, RAII fashion. llvm-svn: 211066	2014-06-16 22:39:38 +00:00
Reed Kotler	9fe3bfd087	Add load/store functionality Summary: This patches allows non conversions like i1=i2; where both are global ints. In addition, arithmetic and other things start to work since fast-isel will use existing patterns for non fast-isel from tablegen files where applicable. In addition i8, i16 will work in this limited context for assignment without the need for sign extension (zero or signed). It does not matter how i8 or i16 are loaded (zero or sign extended) since only the 8 or 16 relevant bits are used and clang will ask for sign extension before using them in arithmetic. This is all made more complete in forthcoming patches. for example: int i, j=1, k=3; void foo() { i = j + k; } Keep in mind that this pass is not enabled right now and is an experimental pass It can only be enabled with a hidden option to llvm of -mips-fast-isel. Test Plan: Run test-suite, loadstore2.ll and I will run some executable tests. Reviewers: dsanders Subscribers: mcrosier Differential Revision: http://reviews.llvm.org/D3856 llvm-svn: 211061	2014-06-16 22:05:47 +00:00
Jim Grosbach	cc71514d3a	AArch64: Add backend intrinsic for rbit. Define an intrinsic for the frontend to use and pattern match it to the RBIT instruction. rdar://9283021 llvm-svn: 211058	2014-06-16 21:55:35 +00:00
Jim Grosbach	07393ba31b	ARM: intrinsic support for rbit. We already have an ARMISD node. Create an intrinsic to map to it so we can add support for the frontend __rbit() intrinsic. rdar://9283021 llvm-svn: 211057	2014-06-16 21:55:30 +00:00
Bill Schmidt	5d82f09b53	[PPC64] Fix PR19893 - improve code generation for local function addresses Rafael opened http://llvm.org/bugs/show_bug.cgi?id=19893 to track non-optimal code generation for forming a function address that is local to the compile unit. The existing code was treating both local and non-local functions identically. This patch fixes the problem by properly identifying local functions and generating the proper addis/addi code. I also noticed that Rafael's earlier changes to correct the surrounding code in PPCISelLowering.cpp were also needed for fast instruction selection in PPCFastISel.cpp, so this patch fixes that code as well. The existing test/CodeGen/PowerPC/func-addr.ll is modified to test the new code generation. I've added a -O0 run line to test the fast-isel code as well. Tested on powerpc64[le]-unknown-linux-gnu with no regressions. llvm-svn: 211056	2014-06-16 21:36:02 +00:00
Eric Christopher	daca3cc54a	Since the DataLayout is always found off of the subtarget go ahead and query the base target machine implementation for it. llvm-svn: 211055	2014-06-16 21:18:27 +00:00
Zachary Turner	2f825df60b	Clean up some unnecessary mutex guards. These were being used as unreferenced parameters to enforce that the methods must not be called without holding a mutex, but all of the methods in question were internal, and the methods were only exposed through an interface whose entire purpose was to serialize access to these structures, so expecting the methods to be accessed under a mutex is reasonable enough. Reviewed by: blaikie Differential Revision: http://reviews.llvm.org/D4162 llvm-svn: 211054	2014-06-16 20:54:28 +00:00
Louis Gerbarg	dcf00251ea	Improve comments for r211040 Added comment to clarify why we r211040 choose to bail out of fast isel instead of generating a more complicated relocation, and fix mislabelled register in the comments of the asan test case. llvm-svn: 211052	2014-06-16 20:31:50 +00:00
Hans Wennborg	f9484b24b3	Revert "lit: warn when passed invalid pathname" (r210597) It was pointed out that this breaks the "virtual test discovery" mechanism, which allows for narming tests in the test exec root. Reverting until I can figure out how to fix this. llvm-svn: 211048	2014-06-16 20:18:41 +00:00
Tim Northover	b45c3b74b4	ARM: implement correct atomic operations on v7M ARM v7M has ldrex/strex but not ldrexd/strexd. This means 32-bit operations should work as normal, but 64-bit ones are almost certainly doomed. Patch by Phoebe Buckheister. llvm-svn: 211042	2014-06-16 18:49:36 +00:00
Louis Gerbarg	a5360c4cd8	Fix illegal relocations in X86FastISel On x86_86 the lea instruction can only use a 32 bit immediate value. When the code is compiled statically the RIP register is not used, meaning the immediate is all that can be used for the relocation, which is not sufficient in the case of targets more than +/- 2GB away. This patch bails out of fast isel in those cases and reverts to DAG which does the right thing. Test case included. llvm-svn: 211040	2014-06-16 17:35:40 +00:00
Jim Grosbach	fff5663d48	LowerSwitch: track bounding range for the condition tree. When LowerSwitch transforms a switch instruction into a tree of ifs it is actually performing a binary search into the various case ranges, to see if the current value falls into one cases range of values. So, if we have a program with something like this: switch (a) { case 0: do0(); break; case 1: do1(); break; case 2: do2(); break; default: break; } the code produced is something like this: if (a < 1) { if (a == 0) { do0(); } } else { if (a < 2) { if (a == 1) { do1(); } } else { if (a == 2) { do2(); } } } This code is inefficient because the check (a == 1) to execute do1() is not needed. The reason is that because we already checked that (a >= 1) initially by checking that also (a < 2) we basically already inferred that (a == 1) without the need of an extra basic block spawned to check if actually (a == 1). The patch addresses this problem by keeping track of already checked bounds in the LowerSwitch algorithm, so that when the time arrives to produce a Leaf Block that checks the equality with the case value / range the algorithm can decide if that block is really needed depending on the already checked bounds . For example, the above with "a = 1" would work like this: the bounds start as LB: NONE , UB: NONE as (a < 1) is emitted the bounds for the else path become LB: 1 UB: NONE. This happens because by failing the test (a < 1) we know that the value "a" cannot be smaller than 1 if we enter the else branch. After the emitting the check (a < 2) the bounds in the if branch become LB: 1 UB: 1. This is because by checking that "a" is smaller than 2 then the upper bound becomes 2 - 1 = 1. When it is time to emit the leaf block for "case 1:" we notice that 1 can be squeezed exactly in between the LB and UB, which means that if we arrived to that block there is no need to emit a block that checks if (a == 1). Patch by: Marcello Maggioni <hayarms@gmail.com> llvm-svn: 211038	2014-06-16 16:55:20 +00:00
James Molloy	f6419cfb14	Refactor the disabling of Thumb-1 LDM/STM generation Originally I switched the LD/ST optimizer off in TargetMachine as it was previously, but Eric has suggested he'd prefer that it be short-circuited in the pass itself. No functionality change. llvm-svn: 211037	2014-06-16 16:42:53 +00:00
Rafael Espindola	95cf2f25fe	Fix pr17056. This makes llvm-nm ignore members that are not sufficiently aligned for lib/Object to handle. These archives are invalid. GNU AR is able to handle this, but in general just warns about broken archive members. We should probably start warning too, but for now just make sure llvm-nm exits with an 0. llvm-svn: 211036	2014-06-16 16:41:00 +00:00
Rafael Espindola	ae460027a4	Convert the Archive API to use ErrorOr. Now that we have c++11, even things like ErrorOr<std::unique_ptr<...>> are easy to use. No intended functionality change. llvm-svn: 211033	2014-06-16 16:08:36 +00:00
Tilmann Scheller	9252057a07	[AArch64] Remove dead code. Both function declarations lack a callee and an implementation. llvm-svn: 211029	2014-06-16 15:15:41 +00:00
Cameron McInally	0d0489cea6	Hook up vector int_ctlz for AVX512. llvm-svn: 211024	2014-06-16 14:12:28 +00:00
Daniel Sanders	a84989a22d	[mips][mips64r6] ssnop is deprecated on MIPS32r6/MIPS64r6 Summary: Depends on D4120 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: zoran.jovanovic, vmedic Differential Revision: http://reviews.llvm.org/D4121 llvm-svn: 211021	2014-06-16 13:25:35 +00:00
Daniel Sanders	00463119a5	[mips][mips64r6] cl[oz], and dcl[oz] are re-encoded in MIPS32r6/MIPS64r6 Summary: There is no change to the restrictions, just the result register is stored once in the encoding rather than twice. The rt field is zero in MIPS32r6/MIPS64r6. Depends on D4119 Reviewers: zoran.jovanovic, jkolek, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D4120 llvm-svn: 211019	2014-06-16 13:18:59 +00:00
Daniel Sanders	6a803f6162	[mips][mips64r6] ll, sc, lld, and scd are re-encoded on MIPS32r6/MIPS64r6. Summary: The linked-load, store-conditional operations have been re-encoded such that have a 9-bit offset instead of the 16-bit offset they have prior to MIPS32r6/MIPS64r6. While implementing this, I noticed that the atomic load/store pseudos always emit a sign extension using sll and sra. I have improved this to use seb/seh when they are available (MIPS32r2/MIPS64r2 and above). Depends on D4118 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D4119 llvm-svn: 211018	2014-06-16 13:13:03 +00:00
Dmitri Gribenko	caee8cbd57	Support/ConvertUTF: restore compatibility with MSVC, which only implements C89 llvm-svn: 211016	2014-06-16 11:22:33 +00:00
Dmitri Gribenko	1089db0ee6	Support/ConvertUTF: implement U+FFFD insertion according to the recommendation given in the Unicode spec That is, replace every maximal subpart of an ill-formed subsequence with one U+FFFD. llvm-svn: 211015	2014-06-16 11:09:46 +00:00
James Molloy	1e3b5a49e1	[AArch64] Fix a fencepost error in lowering for llvm.aarch64.neon.uqshl. Patch by Jiangning Liu! llvm-svn: 211014	2014-06-16 10:39:21 +00:00
Daniel Sanders	ddb7aa6aaa	[mips] Merge most of the big/little endian checks in atomic.ll Summary: There is very little difference between the big and little endian cases in test/CodeGen/Mips/atomic.ll. Merge them together using multiple FileCheck prefixes. Depends on D4117 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D4118 llvm-svn: 211013	2014-06-16 10:25:17 +00:00
Daniel Sanders	5e6f54e07b	[mips][mips64r6] [ls][wd]c2 were re-encoded with 11-bit signed immediates rather than 16-bit in MIPS32r6/MIPS64r6 Summary: The error message for the invalid.s cases isn't very helpful. It happens because there is an instruction with a wider immediate that would have matched if the NotMips32r6 predicate were true. I have some WIP to improve the message but it affects most error messages for removed/re-encoded instructions on MIPS32r6/MIPS64r6 and should therefore be a separate commit. Depens on D4115 Reviewers: zoran.jovanovic, jkolek, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D4117 llvm-svn: 211012	2014-06-16 10:00:45 +00:00
Christian Pirker	2cc1cf0d4b	ARMEB: Fix trunc store for vector types Reviewed at http://reviews.llvm.org/D4135 llvm-svn: 211010	2014-06-16 09:17:30 +00:00
Jingyue Wu	baabe5091c	Canonicalize addrspacecast ConstExpr between different pointer types As a follow-up to r210375 which canonicalizes addrspacecast instructions, this patch canonicalizes addrspacecast constant expressions. Given clang uses ConstantExpr::getAddrSpaceCast to emit addrspacecast cosntant expressions, this patch is also a step towards having the frontend emit canonicalized addrspacecasts. Piggyback a minor refactor in InstCombineCasts.cpp Update three affected tests in addrspacecast-alias.ll, access-non-generic.ll and constant-fold-gep.ll and added one new test in constant-fold-address-space-pointer.ll llvm-svn: 211004	2014-06-15 21:40:57 +00:00
Matt Arsenault	2a60de548a	Fix copy paste error llvm-svn: 211003	2014-06-15 21:22:52 +00:00
Matt Arsenault	f302c941d8	R600: Add a rotr testcase I forgot to add llvm-svn: 211002	2014-06-15 21:09:00 +00:00
Matt Arsenault	717c1d0319	R600: Remove a few more things from AMDILISelLowering Try to keep all the setOperationActions for integer ops together. llvm-svn: 211001	2014-06-15 21:08:58 +00:00
Matt Arsenault	b5dff9ab50	R600: Fix assert on vector sdiv llvm-svn: 211000	2014-06-15 21:08:54 +00:00
Matt Arsenault	14d4645e46	R600: Move / cleanup more leftover AMDIL stuff. llvm-svn: 210998	2014-06-15 20:23:38 +00:00
Matt Arsenault	1578aa78d4	R600: Move division custom lowering out of AMDILISelLowering llvm-svn: 210997	2014-06-15 20:08:02 +00:00
Eric Christopher	f6db93ab81	Temporarily revert r210953 in an attempt to bring the ARM buildbots back. llvm-svn: 210996	2014-06-15 19:55:14 +00:00
Matt Arsenault	cf9a9a148e	R600: Report that integer division is expensive. Divides by weird constants now emit much better code. llvm-svn: 210995	2014-06-15 19:48:16 +00:00
Matt Arsenault	66ee0816da	R600: Remove dead code llvm-svn: 210994	2014-06-15 19:48:13 +00:00
David Blaikie	b9597a8e57	PR20038: DebugInfo missing DIEs for some concrete variables. I haven't nailed this down entirely, but this is about as small of a test case as I can seem to construct and adequately demonstrates the crasher. I'll continue investigating the root cause/fix(es). llvm-svn: 210993	2014-06-15 19:34:26 +00:00
Manuel Klimek	b671e78606	Add specialization of FoldingSetTrait for std::pair. llvm-svn: 210990	2014-06-15 14:42:25 +00:00
Tim Northover	65277a2bc0	LegalizeDAG: make sure cast is unsigned before using FP_TO_UINT. It's valid to use FP_TO_SINT when asking for a smaller type (e.g. all "unsigned int16" values fit into a "signed int32"), but the reverse isn't true. Unfortunately, I'm not actually aware of any architecture with asymmetric FP_TO_SINT and FP_TO_UINT handling and the logic happens to work in the symmetric case, so I can't actually write a test for this. llvm-svn: 210986	2014-06-15 09:27:20 +00:00
Tim Northover	dbecc3b3fc	AArch64: improve handling & modelling of FP_TO_XINT nodes. There's probably no acatual change in behaviour here, just updating the LowerFP_TO_INT function to be more similar to the reverse implementation and updating costs to current CodeGen. llvm-svn: 210985	2014-06-15 09:27:15 +00:00
Tim Northover	ef0d760cd9	AArch64: improve vector [su]itofp handling. This somehow got missed in the AArch64 merge, so should fix a performance regression since 3.4. llvm-svn: 210984	2014-06-15 09:27:06 +00:00
NAKAMURA Takumi	e876f5b61e	Don't expect tests always crashing. Add "REQUIRES:asserts". llvm-svn: 210983	2014-06-15 01:01:11 +00:00
Artyom Skrobov	c7b4253cfb	Replacing the private implementations of SwapValue with calls to sys::swapByteOrder() llvm-svn: 210980	2014-06-14 13:49:57 +00:00
Artyom Skrobov	9aea8432c5	Using llvm::sys::swapByteOrder() for the common case of byte-swapping a value in place llvm-svn: 210978	2014-06-14 13:18:07 +00:00
Artyom Skrobov	e2d6008d2e	Adding llvm::sys::swapByteOrder() for the common use-case of byte-swapping a value in place llvm-svn: 210976	2014-06-14 12:52:55 +00:00
Artyom Skrobov	ef5e867f16	Renaming SwapByteOrder() to getSwappedBytes() The next commit will add swapByteOrder(), acting in-place llvm-svn: 210973	2014-06-14 11:36:01 +00:00
Matt Arsenault	5eb038a9f2	R600: Add failing testcases. These are reduced from assert in the OpenCV CvtColor8u.BGR5652GRAY test. llvm-svn: 210969	2014-06-14 04:26:09 +00:00
Matt Arsenault	b2e8744eeb	Fix typo llvm-svn: 210968	2014-06-14 04:26:07 +00:00
Matt Arsenault	e682a19a1c	R600: Fix asserts related to constant initializers This would assert if a constant address space was extern and therefore didn't have an initializer. If the initializer was undef, it would hit the unreachable unhandled initializer case. An extern global should never really occur since we don't have machine linking, but bugpoint likes to remove initializers. llvm-svn: 210967	2014-06-14 04:26:05 +00:00
Matt Arsenault	41aa27c96b	R600: Use address space enum instead of value llvm-svn: 210966	2014-06-14 04:26:01 +00:00
Nick Lewycky	b06a796051	Remove extra whitespace in function declaration. No functionality change. llvm-svn: 210965	2014-06-14 03:48:29 +00:00
David Blaikie	6f9e867c45	DebugInfo: Remove some extra handling of abstract variables and instead rely solely on the delayed handling introduced in r210946 Now that we handle finding abstract variables at the end of the module, remove the upfront handling and just ensure the abstract variable is built when necessary. In theory we could have a split implementation, where inlined variables are immediately constructed referencing the abstract definition, and concrete variables are delayed - but let's go with one solution for now unless there's a reason not to. llvm-svn: 210961	2014-06-13 23:52:55 +00:00
Eric Christopher	fb0c26c696	Remove InstrItineraryData off of the TargetMachine - it's already on the subtarget and just forward the accessor. llvm-svn: 210955	2014-06-13 23:11:13 +00:00
Eric Christopher	a0cdc005dd	Move ARMJITInfo off of the TargetMachine and down onto the subtarget. This required untangling a mess of headers that included around. llvm-svn: 210953	2014-06-13 23:04:46 +00:00
Jiangning Liu	96e92c1d75	Move GlobalMerge from Transform to CodeGen. This patch is to move GlobalMerge pass from Transform/Scalar to CodeGen, because GlobalMerge depends on TargetMachine. In the mean time, the macro INITIALIZE_TM_PASS is also moved to CodeGen/Passes.h. With this fix we can avoid making libScalarOpts depend on libCodeGen. llvm-svn: 210951	2014-06-13 22:57:59 +00:00
Eric Christopher	f047bfd115	The hazard recognizer only needs a subtarget, not a target machine so make it take one. Fix up all users accordingly. llvm-svn: 210948	2014-06-13 22:38:52 +00:00
Eric Christopher	170ebcf07f	Fix typo. llvm-svn: 210947	2014-06-13 22:38:48 +00:00
David Blaikie	e847f132f7	DebugInfo: Reference abstract definitions from variables in concrete definitions that preceed their first inline definition. Rather than relying on abstract variables looked up at the time the concrete variable is created, look them up at the end of the module to ensure they're referenced even if they're created after the concrete definition. This completes/matches the work done in r209677 to handle this for the subprograms themselves. llvm-svn: 210946	2014-06-13 22:35:44 +00:00
Alexey Samsonov	aa90998c87	[DWARF parser] Use distinction between DW_AT_ranges_base and DW_AT_GNU_ranges_base instead of DWARF version llvm-svn: 210945	2014-06-13 22:31:03 +00:00
David Blaikie	be7c677008	DwarfDebug::getExistingAbstractVariable: constify an existing reference parameter that didn't need to be mutated. llvm-svn: 210944	2014-06-13 22:29:31 +00:00
David Blaikie	eb1a27239c	DebugInfo: Following up to r209677, refactor local variable emission to delay the choice between emitting the definition attributes or using DW_AT_abstract_definition This doesn't fix the abstract variable handling yet, but it introduces a similar delay mechanism as was added for subprograms, causing DW_AT_location to be reordered to the beginning of the attribute list for local variables, and fixes all the test fallout for that. A subsequent commit will remove the abstract variable handling in DbgVariable and just do the abstract variable lookup at module end to ensure that abstract variables introduced after their concrete counterparts are appropriately referenced by the concrete variable. llvm-svn: 210943	2014-06-13 22:18:23 +00:00
David Blaikie	04ed1d1b68	DebugInfo: Refactor some tests to allow DW_AT_name to not be the first attribute in a local variable. In an effort to fix concrete variables referencing abstract origins where the concrete variable preceeds the first inlined usage, the addition of attributes such as name, file, etc will be delayed until the end of the module (to wait to see if any inlined instances have occurred, thus necessitating an abstract definition that the concrete definition should also reference). These test cases don't actually need to care about this ordering of attributes, so update them to be more resilient to such changes coming in the near future. llvm-svn: 210940	2014-06-13 21:52:33 +00:00
David Blaikie	be315b63e2	test/DebugInfo/X86/dbg-value-isel.s: correct lexical block descriptor to match schema This silently broke a long time ago when I unified some aspects of the debug info schema. I'm just cleaning these up if/when they become a problem. llvm-svn: 210939	2014-06-13 21:52:28 +00:00
Zachary Turner	586fd74c30	Make the error-handling functions thread-safe. Prior to this change, error handling functions must be installed and removed only inside of an llvm_[start/stop]_multithreading pair. This change allows error handling functions to be installed any time, and from any thread. Reviewed by: chandlerc Differential Revision: http://reviews.llvm.org/D4140 llvm-svn: 210937	2014-06-13 21:20:44 +00:00
Alexey Samsonov	e595e1ade0	Remove top-level Clang -fsanitize= flags for optional ASan features. Init-order and use-after-return modes can currently be enabled by runtime flags. use-after-scope mode is not really working at the moment. The only problem I see is that users won't be able to disable extra instrumentation for init-order and use-after-scope by a top-level Clang flag. But this instrumentation was implicitly enabled for quite a while and we didn't hear from users hurt by it. llvm-svn: 210924	2014-06-13 17:53:44 +00:00
Tim Northover	51472bc600	X86: lower ATOMIC_CMP_SWAP_WITH_SUCCESS directly Lowering this new node allows us to fold the almost universal comparison for success before it's even formed. Instead we can create a copy from EFLAGS and an X86ISD::SETCC operation since all "cmpxchg" instructions set the zero-flag to the correct value. rdar://problem/13201607 llvm-svn: 210923	2014-06-13 17:29:39 +00:00
Matt Arsenault	fd8c24ede8	R600: Cleanup some old AMDIL stuff. Move / delete some of the more obviously wrong setOperationAction calls. Most of these are setting Expand for types that aren't legal which is the default anyway. Leave stuff that might require more thought on whether it's junk or not as it is. No functionality change. llvm-svn: 210922	2014-06-13 17:20:53 +00:00
Rafael Espindola	2a826e40fa	Finishing touch for the std::error_code transition. While std::error_code itself seems to work OK in all platforms, there are few annoying differences with regards to the std::errc enumeration. This patch adds a simple llvm enumeration, which will hopefully avoid build breakages in other platforms and surprises as we get more uses of std::error_code. llvm-svn: 210920	2014-06-13 17:20:48 +00:00
Tim Northover	20b9f739eb	Atomics: make use of the "cmpxchg weak" instruction. This also simplifies the IR we create slightly: instead of working out where success & failure should go manually, it turns out we can just always jump to a success/failure block created for the purpose. Later phases will sort out the mess without much difficulty. llvm-svn: 210917	2014-06-13 16:45:52 +00:00
Tim Northover	d039abdeeb	Atomics: switch direction of cmpxchg comparison This has two benefits: it makes the result more suitable for direct insertaion into the struct to emulate the new cmpxchg, and it means the name we give the instruction matches its actual effect better. llvm-svn: 210916	2014-06-13 16:45:36 +00:00
Tom Stellard	bc5b5370de	R600: Remove AMDIL instruction and register definitions Most of these are no longer used any more. llvm-svn: 210915	2014-06-13 16:38:59 +00:00
Tobias Grosser	9190e0dd62	opt: Initialize asm printers Without initializing the assembly printers a shared library build of opt is linked with these libraries whereas for a static build these libraries are dead code eliminated. This is unfortunate for plugins in case they want to use them, as they neither can rely on opt to provide this functionality nor can they link the printers in themselves as this breaks with a shared object build of opt. This patch calls InitializeAllAsmPrinters() from opt, which increases the static binary size from 50MB -> 52MB on my system (all backends compiled) and causes no measurable increase in the time needed to run 'make check'. llvm-svn: 210914	2014-06-13 16:12:08 +00:00
Rafael Espindola	54f1997979	Remove unused and odd code. This code was never being used and any use of it would look fairly strange. For example, it would try to map a object_error::parse_failed to std::errc::invalid_argument. llvm-svn: 210912	2014-06-13 15:36:17 +00:00
Rafael Espindola	efb3822ac5	Remove broken include. Looks like I got some git merge wrong. llvm-svn: 210911	2014-06-13 15:21:50 +00:00
Rafael Espindola	dbe027a9ed	Fix KillTheDoctor after r210725. We don't map these windows errors to generic ones since errc::timed_out is not defined on mingw. Just use the raw windows error value. llvm-svn: 210910	2014-06-13 15:01:11 +00:00
Tim Northover	6bf04e4512	SCCP: update for cmpxchg returning { iN, i1 } now. I accidentally missed this one since its use looked OK locally. llvm-svn: 210909	2014-06-13 14:54:09 +00:00
Zoran Jovanovic	a5acdcf924	[mips][mips64r6] Relocation R_MIPS_PC18_S3 Differential Revision: http://reviews.llvm.org/D3890 llvm-svn: 210908	2014-06-13 14:26:47 +00:00
Tim Northover	675a0965ed	Docs: remove extra {} around result types. It makes the types look like they're single-element structures. And when we have instructions that do result in a struct, that can get confusing rather quickly. llvm-svn: 210905	2014-06-13 14:24:23 +00:00
Tim Northover	1dcc9f90ed	Docs: fix grammar error in description llvm-svn: 210904	2014-06-13 14:24:16 +00:00
Tim Northover	420a216817	IR: add "cmpxchg weak" variant to support permitted failure. This commit adds a weak variant of the cmpxchg operation, as described in C++11. A cmpxchg instruction with this modifier is permitted to fail to store, even if the comparison indicated it should. As a result, cmpxchg instructions must return a flag indicating success in addition to their original iN value loaded. Thus, for uniformity all cmpxchg instructions now return "{ iN, i1 }". The second flag is 1 when the store succeeded. At the DAG level, a new ATOMIC_CMP_SWAP_WITH_SUCCESS node has been added as the natural representation for the new cmpxchg instructions. It is a strong cmpxchg. By default this gets Expanded to the existing ATOMIC_CMP_SWAP during Legalization, so existing backends should see no change in behaviour. If they wish to deal with the enhanced node instead, they can call setOperationAction on it. Beware: as a node with 2 results, it cannot be selected from TableGen. Currently, no use is made of the extra information provided in this patch. Test updates are almost entirely adapting the input IR to the new scheme. Summary for out of tree users: ------------------------------ + Legacy Bitcode files are upgraded during read. + Legacy assembly IR files will be invalid. + Front-ends must adapt to different type for "cmpxchg". + Backends should be unaffected by default. llvm-svn: 210903	2014-06-13 14:24:07 +00:00
Cameron McInally	ed5f645bf3	Fix bad copy-and-paste from r210652. AVX512 masked leading zero intrinsics. llvm-svn: 210901	2014-06-13 13:20:01 +00:00
Daniel Sanders	c171f65a87	[mips] Add cache and pref instructions Summary: cache and pref were added in MIPS-III, and MIPS32 but were re-encoded in MIPS32r6/MIPS64r6 to use a 9-bit offset rather than the 16-bit offset available to earlier cores. Resolved the decoding conflict between pref and lwc3. Depends on D4115 Reviewers: zoran.jovanovic, jkolek, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D4116 llvm-svn: 210900	2014-06-13 13:15:59 +00:00
Daniel Sanders	af8b32e176	[mips][mips64r6] bc1any[24] are not available on MIPS32r6/MIPS64r6 Summary: These MIPS-3D instructions have never been implemented in LLVM so we only add testcases. Reviewers: zoran.jovanovic, jkolek, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D4115 llvm-svn: 210899	2014-06-13 13:08:38 +00:00
Daniel Sanders	86cb398b9d	[mips][mips64r6] b(ge\|lt)zal are not available on MIPS32r6/MIPS64r6 and bal is a normal instruction Summary: b(ge\|lt)zal have been removed in MIPS32r6/MIPS64r6. However, bal (an alias for 'bgezal $zero, $offset') still remains with the same encoding it had prior to MIPS32r6/MIPS64r6. Updated the MipsNaCLELFStreamer, and MipsLongBranch to correctly handle the MIPS32r6/MIPS64r6 BAL instruction in addition to the existing BAL_BR pseudo. No changes were required to the CodeGen test that looks for BAL (test/CodeGen/Mips/longbranch.ll) since the new instruction has the same syntax. Depends on D4113 Reviewers: jkolek, zoran.jovanovic, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D4114 llvm-svn: 210898	2014-06-13 13:02:52 +00:00
Daniel Sanders	e898236bc2	[mips][mips64r6] daddi is not available on MIPS64r6 Summary: It's not emitted by the code generator so we only need assembler tests. Also added missing daddi aliases from dsub mnemonics, and removed a couple duplicate dsub tests. Depends on D4112 Reviewers: zoran.jovanovic, jkolek, vmedic Reviewed By: vmedic Differential Revision: http://reviews.llvm.org/D4113 llvm-svn: 210897	2014-06-13 12:49:06 +00:00

... 4 5 6 7 8 ...

104995 Commits