llvm-project

Commit Graph

Author	SHA1	Message	Date
Dimitry Andric	4a5f8a19c7	Let test-release.sh checkout subprojects directly into the target tree, instead of using symlinks Summary: In the past I have run into several problems with the way `test-release.sh` creates all the subproject directories as siblings, and then uses symlinks to stitch them all together. In some scenarios this leads to clang not being able to find header files, etc. This patch changes the script so it directly exports into the correct target locations for each subproject. Reviewers: hans Subscribers: emaste, llvm-commits Differential Revision: http://reviews.llvm.org/D16420 llvm-svn: 258436	2016-01-21 21:57:49 +00:00
David L Kreitzer	4d7257dfa1	Fix for two constant propagation problems in GVN with the assume intrinsic instruction. Patch by Yuanrui Zhang. Differential Revision: http://reviews.llvm.org/D16100 llvm-svn: 258435	2016-01-21 21:32:35 +00:00
Kevin Enderby	1f472eace5	Fix MachOObjectFile::getSymbolSection() to not call report_fatal_error() but to return object_error::parse_failed. Then made the code in llvm-nm do for Mach-O files what is done in the darwin native tools which is to print "(?,?)" or just "s" for bad section indexes. Also added a test to show it prints the bad section index of "42" when printing the fields as raw hex. llvm-svn: 258434	2016-01-21 21:13:27 +00:00
Sanjay Patel	fcc7c1a0ba	[LibCallSimplifier] don't get fooled by a fake fmin() This is similar to the bug/fix: https://llvm.org/bugs/show_bug.cgi?id=26211 http://reviews.llvm.org/rL258325 The fmin() test case reveals another bug caused by sloppy code duplication. It will crash without this patch because fp128 is a valid floating-point type, but we would think that we had matched a function that used doubles. The new helper function can be used to replace similar checks that are used in several other places in this file. llvm-svn: 258428	2016-01-21 20:19:54 +00:00
Rong Xu	950af1558f	Fix buildbot failure due to r258420 Include the needed headfile to fix the buildbot failure due to r258420 [PGO] Passmanagerbuilder change that enable IR level PGO instrumentation. llvm-svn: 258423	2016-01-21 19:06:24 +00:00
David Majnemer	3af5bf30e3	[InstCombine] Simplify (x >> y) <= x This commit extends the patterns recognised by InstSimplify to also handle (x >> y) <= x in the same way as (x /u y) <= x. The missing optimisation was found investigating why LLVM did not optimise away bound checks in a binary search: https://github.com/rust-lang/rust/pull/30917 Patch by Andrea Canciani! Differential Revision: http://reviews.llvm.org/D16402 llvm-svn: 258422	2016-01-21 18:55:54 +00:00
Chad Rosier	406808e344	Partially revert "Add command line options to force function/loop alignments." This partially reverts r256571 in favor of the solution in r258409. llvm-svn: 258421	2016-01-21 18:49:15 +00:00
Rong Xu	34abbfb78e	[PGO] Passmanagerbuilder change that enable IR level PGO instrumentation This patch includes the passmanagerbuilder change that enables IR level PGO instrumentation. It adds two passmanagerbuilder options: -profile-generate=<profile_filename> and -profile-use=<profile_filename>. The new options are primarily for debug purpose. Reviewers: davidxl, silvas Differential Revision: http://reviews.llvm.org/D15828 llvm-svn: 258420	2016-01-21 18:28:59 +00:00
Adam Nemet	af761104ba	[TTI] Add getCacheLineSize Summary: And use it in PPCLoopDataPrefetch.cpp. @hfinkel, please let me know if your preference would be to preserve the ppc-loop-prefetch-cache-line option in order to be able to override the value of TTI::getCacheLineSize for PPC. Reviewers: hfinkel Subscribers: hulx2000, mcrosier, mssimpso, hfinkel, llvm-commits Differential Revision: http://reviews.llvm.org/D16306 llvm-svn: 258419	2016-01-21 18:28:36 +00:00
Rong Xu	ed9fec7365	[PGO] IR level instrumentation of indirect call value profiling This patch adds the instrumentation for indirect call value profiling. It finds all the indirect call-sites and generates instrprof_value_profile intrinsic calls. A new opt level option -disable-vp is introduced to disable this instrumentation. Reviewers: davidxl, betulb, vsk Differential Revision: http://reviews.llvm.org/D16016 llvm-svn: 258417	2016-01-21 18:11:44 +00:00
Sanjay Patel	4e971da272	make helper functions static; NFCI llvm-svn: 258416	2016-01-21 18:01:57 +00:00
Manuel Jacob	f3ee254bc2	Undo r258163 "Move part of an if condition into an assertion. NFC." This undoes the change made in r258163. The assertion fails if `Ptr` is of a vector type. The previous code doesn't look completely correct either, so I'll investigate this more. llvm-svn: 258411	2016-01-21 17:36:14 +00:00
Philip Reames	82e0f15f86	Fix a type in a comment Thanks to Sean Silva for pointing it out. llvm-svn: 258410	2016-01-21 17:32:12 +00:00
Geoff Berry	10494aca05	[BlockPlacement] Add option to align all non-fall-through blocks. Summary: This option is being added for testing purposes. Reviewers: mcrosier Subscribers: mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D16410 llvm-svn: 258409	2016-01-21 17:25:52 +00:00
Matthew Simpson	486bace5cc	Revert "[SLP] Truncate expressions to minimum required bit width" This reverts commit r258404. llvm-svn: 258408	2016-01-21 17:17:20 +00:00
Teresa Johnson	f5aa64f25f	Use early return to simplify code (NFC) Follow on to r258405. llvm-svn: 258407	2016-01-21 17:16:53 +00:00
Vedant Kumar	61035fa3cb	[GCOV] Avoid emitting profile arcs for module and skeleton CUs Do not emit profile arc files and note files for module and skeleton CU's. Our users report seeing unexpected .gcda and .gcno files in their projects when using gcov-style profiling with modules or frameworks. The unwanted files come from these modules. This is not very helpful for end-users. Further, we've seen reports of instrumented programs crashing while writing these files out (due to I/O failures). rdar://problem/22838296 Reviewed-by: aprantl Differential Revision: http://reviews.llvm.org/D15997 llvm-svn: 258406	2016-01-21 17:04:42 +00:00
Teresa Johnson	6f508afce1	[ThinLTO] Avoid unnecesary hash lookups during metadata linking (NFC) Replace sequences of count() followed by operator[] with either find() or insert(), depending on the context. llvm-svn: 258405	2016-01-21 16:46:40 +00:00
Matthew Simpson	cb17d72170	[SLP] Truncate expressions to minimum required bit width This change attempts to produce vectorized integer expressions in bit widths that are narrower than their scalar counterparts. The need for demotion arises especially on architectures in which the small integer types (e.g., i8 and i16) are not legal for scalar operations but can still be used in vectors. Like similar work done within the loop vectorizer, we rely on InstCombine to perform the actual type-shrinking. We use the DemandedBits analysis and ComputeNumSignBits from ValueTracking to determine the minimum required bit width of an expression. Differential revision: http://reviews.llvm.org/D15815 llvm-svn: 258404	2016-01-21 16:31:55 +00:00
Scott Egerton	2455701117	[mips] Allowed dla instructions on 32-bit architectures. Summary: This is now the same as the behaviour of the GNU assembler. This was done as it is required in order to build the Linux kernel with the integrated assembler enabled. Reviewers: dsanders, vkalintiris Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D13594 llvm-svn: 258400	2016-01-21 15:11:01 +00:00
Teresa Johnson	e0373a6796	Revert obsolete llvm-link -preserve-modules option/test This testing mode is now obsolete with the change to linkInModule to take a std::unique_ptr to Module. llvm-svn: 258399	2016-01-21 14:28:52 +00:00
Igor Breger	7a000f5bb2	AVX512: Masked move intrinsic implementation. Implemented intrinsic for the follow instructions (reg move) : VMOVDQU8/16, VMOVDQA32/64, VMOVAPS/PD. Differential Revision: http://reviews.llvm.org/D16316 llvm-svn: 258398	2016-01-21 14:18:11 +00:00
Michael Zuckerman	21a30a42a9	[AVX512] Adding VPERMT2B and VPERMI2B Intrinsics Differential Revision: http://reviews.llvm.org/D16398 llvm-svn: 258397	2016-01-21 13:36:01 +00:00
Krzysztof Parzyszek	14f9535eec	PR26172: unnecessary indirection in HexagonCopyToCombine.cpp llvm-svn: 258395	2016-01-21 12:45:17 +00:00
Marina Yatsina	ff262fa807	[X86] - Removing warning on legal cases caused by commit r258132 There's an overloading of the "movsd" and "cmpsd" instructions, e.g. movsd can be either "Move Data from String to String" or "Move or Merge Scalar Double-Precision Floating-Point Value". The former should produce warnings when parsing a memory operand that is not ESI/EDI, but the latter should not. Fixed the code to produce warnings only after making sure we're dealing with the first case. Expanded the tests of the produced warnings + fixed RUN line of the test so that it would check both stdout and stderr Differential Revision: http://reviews.llvm.org/D16359 llvm-svn: 258393	2016-01-21 11:37:06 +00:00
Manuel Jacob	e902459c4b	Change ConstantFoldInstOperands to take Instruction instead of opcode and type. NFC. Summary: The previous form, taking opcode and type, is moved to an internal helper and the new form, taking an instruction, is a wrapper around this helper. Although this is a slight cleanup on its own, the main motivation is to refactor the constant folding API to ease migration to opaque pointers. This will be follow-up work. Reviewers: eddyb Subscribers: dblaikie, llvm-commits Differential Revision: http://reviews.llvm.org/D16383 llvm-svn: 258391	2016-01-21 06:33:22 +00:00
Manuel Jacob	925d029461	Introduce ConstantFoldCastOperand function and migrate some callers of ConstantFoldInstOperands to use it. NFC. Summary: Although this is a slight cleanup on its own, the main motivation is to refactor the constant folding API to ease migration to opaque pointers. This will be follow-up work. Reviewers: eddyb Subscribers: zzheng, dblaikie, llvm-commits Differential Revision: http://reviews.llvm.org/D16380 llvm-svn: 258390	2016-01-21 06:31:08 +00:00
Manuel Jacob	a61ca37b6d	Introduce ConstantFoldBinaryOpOperands function and migrate some callers of ConstantFoldInstOperands to use it. NFC. Summary: Although this is a slight cleanup on its own, the main motivation is to refactor the constant folding API to ease migration to opaque pointers. This will be follow-up work. Reviewers: eddyb Subscribers: dblaikie, llvm-commits Differential Revision: http://reviews.llvm.org/D16378 llvm-svn: 258389	2016-01-21 06:26:35 +00:00
Tom Stellard	de008d338c	AMDGPU/SI: Pass whether to use the SI scheduler via Target Attribute Summary: Currently the SI scheduler can be selected via command line option, but it turned out it would be better if it was selectable via a Target Attribute. This patch adds "si-scheduler" attribute to the backend. Reviewers: tstellarAMD, echristo Subscribers: echristo, arsenm Differential Revision: http://reviews.llvm.org/D16192 llvm-svn: 258386	2016-01-21 04:28:34 +00:00
Xinliang David Li	b4fc4cbee6	re-submit test case (withright format-version) llvm-svn: 258384	2016-01-21 02:35:59 +00:00
Andrew Wilkins	7ab4dc76c4	llvm-go: call llvm-config with components Summary: Add components back into calls to llvm-config, which was accidentally removed in r258283. Reviewers: pcc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16392 llvm-svn: 258383	2016-01-21 02:33:39 +00:00
David Majnemer	8cc30787b0	Rename MCLineEntry to MCDwarfLineEntry MCLineEntry gives the impression that it is generic MC machinery. However, it is specific to DWARF. llvm-svn: 258381	2016-01-21 01:59:03 +00:00
Kostya Serebryany	2f13f223c7	[libFuzzer] don't use std::vector in one more hot path llvm-svn: 258380	2016-01-21 01:52:14 +00:00
Andrew Wilkins	a7a8ab71aa	[GlobalISel] make library an optional component Summary: Mark the LLVMGlobalISel library as optional in LLVMBuild.txt, since the library is only built if LLVM_BUILD_GLOBAL_ISEL is set. Without doing this, llvm-config includes the library in the list of components regardless of whether it's built, and then will error out when asked for the library names/paths. Reviewers: qcolombet Subscribers: joker.eph, llvm-commits, vkalintiris Differential Revision: http://reviews.llvm.org/D16386 llvm-svn: 258379	2016-01-21 01:41:03 +00:00
Quentin Colombet	25d8e21949	[GlobalISel] Move generic opcodes description to their own file. Differential Revision: http://reviews.llvm.org/D16384 llvm-svn: 258378	2016-01-21 01:37:18 +00:00
Xinliang David Li	d75eaf2b17	Revert 258376 -- wrong version llvm-svn: 258377	2016-01-21 01:21:00 +00:00
Xinliang David Li	bc05e4e4c0	[Coverage] Add a test case for comdat The binary contains two (merged) covmap sections which have duplicate CovMapRecords from comdat (template instantation). This test makes sure the reader reads it properly. It also tests that the coverage data from different instantiations of the same template function are properly merged in show output. llvm-svn: 258376	2016-01-21 00:57:42 +00:00
Mike Aizatsky	e313f8f8ff	[libfuzzer] use %p for printing addresses llvm-svn: 258370	2016-01-21 00:02:09 +00:00
Rafael Espindola	394524d940	Remove redundant argument. It is already a member variable. llvm-svn: 258369	2016-01-21 00:00:53 +00:00
Reid Kleckner	400f39308c	[readobj] Print CodeOffset first, it's easier to read llvm-svn: 258368	2016-01-20 23:21:14 +00:00
Dan Gohman	760bef5e50	[SelectionDAG] Fix constant offset folding to avoid commuting non-commutative operators. This fixes a miscompile in MultiSource/Benchmarks/MiBench/consumer-lame introduced in r258296. llvm-svn: 258366	2016-01-20 23:16:59 +00:00
Chad Rosier	816a1ab9d9	MachineScheduler: Add a command line option to disable post scheduler. llvm-svn: 258364	2016-01-20 23:08:32 +00:00
Chad Rosier	6338d7c390	MachineScheduler: Honor optnone functions in the pre-ra scheduler. llvm-svn: 258363	2016-01-20 22:38:25 +00:00
Rafael Espindola	55a7ae5cc7	Simplify the logic. NFC. Found while reviewing the change for PR26152. llvm-svn: 258362	2016-01-20 22:38:23 +00:00
Manuel Jacob	4e3b446ae8	Run clang-format over ConstantFolding.h, fixing inconsistent indentation. NFC. llvm-svn: 258361	2016-01-20 22:27:06 +00:00
Sanjay Patel	cd4377c74d	don't repeat function names in comments; NFC llvm-svn: 258360	2016-01-20 22:24:38 +00:00
David Blaikie	8ecf9938b2	Orc: Simplify lambda by using std::set's initializer_list ctor llvm-svn: 258359	2016-01-20 22:24:26 +00:00
Lang Hames	f129d6fb50	[Orc] Try to turn Orc execution unit tests back on for Linux. The fix in r258324 (plus r258354) should allow Orc execution tests to run on Linux. llvm-svn: 258358	2016-01-20 22:16:14 +00:00
George Burgess IV	1030d68e48	Fix typo in an error string. NFC. llvm-svn: 258357	2016-01-20 22:15:23 +00:00
Evgeniy Stepanov	9fb70f53ce	Fix PR26152. Fix the condition for when the new global takes over the name of the existing one to be the negation of the condition for the new global to get internal linkage. llvm-svn: 258355	2016-01-20 22:05:50 +00:00
Evgeniy Stepanov	b640415f9b	Fix build warning. error: field 'CCMgr' will be initialized after field 'IndirectStubsMgr' [-Werror,-Wreorder] : DL(TM.createDataLayout()), CCMgr(std::move(CCMgr)), llvm-svn: 258354	2016-01-20 22:02:07 +00:00
Tom Stellard	d1efda8e9e	AMDGPU/SI: Promote i1 SETCC operations Summary: While working on uniform branching, I've hit a few cases where we emit i1 SETCC operations. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D16233 llvm-svn: 258352	2016-01-20 21:48:24 +00:00
Matt Arsenault	7836f895fe	AMDGPU: Fix old comments that mention AMDIL llvm-svn: 258350	2016-01-20 21:22:21 +00:00
Matt Arsenault	7ba334a7d9	AMDGPU: Remove AMDGPU.trunc intrinsic llvm-svn: 258348	2016-01-20 21:05:53 +00:00
Matt Arsenault	15fbe49daf	AMDGPU: Remove AMDIL.fraction intrinsic llvm-svn: 258347	2016-01-20 21:05:49 +00:00
Matt Arsenault	7cccd2672e	AMDGPU: Remove AMDIL.round.nearest intrinsic llvm-svn: 258346	2016-01-20 21:05:40 +00:00
Quentin Colombet	105cf2b179	[GlobalISel] Add the proper cmake plumbing. This patch adds the necessary plumbing to cmake to build the sources related to GlobalISel. To build the sources related to GlobalISel, we need to add -DBUILD_GLOBAL_ISEL=ON. By default, this is OFF, thus GlobalISel sources will not impact people that do not explicitly opt-in. Differential Revision: http://reviews.llvm.org/D15983 llvm-svn: 258344	2016-01-20 20:58:56 +00:00
Matt Arsenault	1c9e4ef0df	AMDGPU: Remove abs intrinsic llvm-svn: 258343	2016-01-20 20:58:29 +00:00
Matt Arsenault	f7e6e89718	AMDGPU: Remove min/max intrinsics This removes support for mesa 11.0.x llvm-svn: 258342	2016-01-20 20:50:19 +00:00
Sanjoy Das	a34ce95b60	Add a "gc-transition" operand bundle Summary: This adds a new kind of operand bundle to LLVM denoted by the `"gc-transition"` tag. Inputs to `"gc-transition"` operand bundle are lowered into the "transition args" section of `gc.statepoint` by `RewriteStatepointsForGC`. This removes the last bit of functionality that was unsupported in the deopt bundle based code path in `RewriteStatepointsForGC`. Reviewers: pgavlin, JosephTremoulet, reames Subscribers: sanjoy, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D16342 llvm-svn: 258338	2016-01-20 19:50:25 +00:00
Simon Atanasyan	2d0d8530e3	[llvm-readobj][ELF] Teach llvm-readobj to show arch specific ELF section's flags Some architecture specific ELF section flags might have the same value (for example SHF_X86_64_LARGE and SHF_HEX_GPREL) and we have to check machine architectures to select an appropriate set of possible flags. The patch selects architecture specific flags into separate arrays `ElfxxxSectionFlags` and combines `ElfSectionFlags` and `ElfxxxSectionFlags` before pass to the `StreamWriter::printFlags()` method. Differential Revision: http://reviews.llvm.org/D16269 llvm-svn: 258334	2016-01-20 19:15:18 +00:00
Quentin Colombet	2d7fa7065f	[GlobalISel] Add a generic machine opcode for ADD. The selection process being split into separate passes, we need generic opcodes to translate the LLVM IR to target independent code. This patch adds an opcode for addition: G_ADD. Differential Revision: http://reviews.llvm.org/D15472 llvm-svn: 258333	2016-01-20 19:14:55 +00:00
Sanjay Patel	f44bd38092	fix typo; NFC llvm-svn: 258332	2016-01-20 18:59:48 +00:00
Sanjay Patel	545a456235	fix formatting; NFC llvm-svn: 258330	2016-01-20 18:59:16 +00:00
Rafael Espindola	b718237dfc	Accept subtractions involving a weak symbol. When a symbol S shows up in an expression in assembly there are two possible interpretations * The expression is referring to the value of S in this file. * The expression is referring to the value after symbol resolution. In the first case the assembler can reason about the value and try to produce a relocation. In the second case, that is only possible if the symbol cannot be preempted. Assemblers are not very consistent about which interpretation gets used. This changes MC to agree with GAS in the case of an expression of the form "Sym - WeakSym". llvm-svn: 258329	2016-01-20 18:57:48 +00:00
Sanjay Patel	bd2dc67142	[LibCallSimplifier] don't get fooled by a fake sqrt() The test case will crash without this patch because the subsequent call to hasUnsafeAlgebra() assumes that the call instruction is an FPMathOperator (ie, returns an FP type). This part of the function signature check was omitted for the sqrt() case, but seems to be in place for all other transforms. Before: http://reviews.llvm.org/rL257400 ...we would have needlessly continued execution in optimizeSqrt(), but the bug was harmless because we'd eventually fail some other check and return without damage. This should fix: https://llvm.org/bugs/show_bug.cgi?id=26211 Differential Revision: http://reviews.llvm.org/D16198 llvm-svn: 258325	2016-01-20 17:41:14 +00:00
Lang Hames	6c3e790e78	[Orc] Fix a use-after-move bug in the Orc C-bindings stack. llvm-svn: 258324	2016-01-20 17:39:52 +00:00
Sanjay Patel	1c600c6e83	80-cols; NFC llvm-svn: 258323	2016-01-20 16:41:43 +00:00
Keith Walker	8c44bf1b89	Write AArch64 big endian data fixup entries as BE. There was support for writing the AArch64 big endian data fixup entries in the .eh_frame section in BE. This is changed to write all such fixup entries in BE with no restriction on the section. This is similar to the existing support for fixup entries for ARM. A test is added to check the length field in the .debug_line section as this is an example of where such a fixup occurs. Differential Revision: http://reviews.llvm.org/D16064 llvm-svn: 258320	2016-01-20 15:59:14 +00:00
Tom Stellard	77a177722f	Correctly initialize SIAnnotateControlFlow Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D16304 llvm-svn: 258319	2016-01-20 15:48:27 +00:00
Michael Zuckerman	65c40afb03	[AVX512] Adding VPERMB Intrinsics Differential Revision: http://reviews.llvm.org/D16296 llvm-svn: 258316	2016-01-20 15:24:56 +00:00
Marina Yatsina	701938d64e	Fixing bug in rL258132: [X86] Adding support for missing variations of X86 string related instructions There was a bug in my rL258132 because there's an overloading of the "movsd" and "cmpsd" instructions, e.g. movsd can be either "Move Data from String to String" (the case I wanted to handle) or "Move or Merge Scalar Double-Precision Floating-Point Value" (the case that causes the asserts). Added code for escaping the unfamiliar scenarios and falling back to old behviour. Also changed the asserts to llvm_unreachable. llvm-svn: 258312	2016-01-20 14:03:47 +00:00
Krzysztof Parzyszek	2451c4835a	Proper handling of diamond-like cases in if-conversion If converter was somewhat careless about "diamond" cases, where there was no join block, or in other words, where the true/false blocks did not have analyzable branches. In such cases, it was possible for it to remove (needed) branches, resulting in a loss of entire basic blocks. Differential Revision: http://reviews.llvm.org/D16156 llvm-svn: 258310	2016-01-20 13:14:52 +00:00
Igor Breger	d3341f5021	AVX512: Store (MOVNTPD, MOVNTPS, MOVNTDQ) using non-temporal hint intrinsic implementation. Differential Revision: http://reviews.llvm.org/D16350 llvm-svn: 258309	2016-01-20 13:11:47 +00:00
Oliver Stannard	f7696f8267	[AArch64] Fix two bugs in the .inst directive The AArch64 .inst directive was implemented using EmitIntValue, which resulted in both $x and $d (code and data) mapping symbols being emitted at the same address. This fixes it to only emit the $x mapping symbol. EmitIntValue also emits the value in big-endian order when targeting big-endian systems, but instructions are always emitted in little-endian order for AArch64. Differential Revision: http://reviews.llvm.org/D16349 llvm-svn: 258308	2016-01-20 12:54:31 +00:00
Dylan McKay	cc018c1713	[AVR] Defnined calling conventions. NFC. llvm-svn: 258300	2016-01-20 09:30:01 +00:00
Petr Pavlu	eba3039238	[LTO] Fix error reporting when a file passed to libLTO is invalid or non-existent This addresses PR26060 where function lto_module_create() could return nullptr but lto_get_error_message() returned an empty string. The error() call after LTOModule::createFromFile() in llvm-lto is then removed because any error from this function should go through the diagnostic handler in llvm-lto which will exit the program. The error() call was added because this previously did not happen when the file was non-existent. This is fixed by the patch. (The situation that llvm-lto reports an error when the input file does not exist is tested by llvm/tools/llvm-lto/error.ll). Differential Revision: http://reviews.llvm.org/D16106 llvm-svn: 258298	2016-01-20 09:03:42 +00:00
Ivan Krasin	3b1c260d22	[Verifier] Fix performance regression for LTO builds Summary: Fix a significant performance regression by introducing GlobalValueVisited field and reusing the map. This is a follow up to r257823 that slowed down linking Chrome with LTO by 2.5x. If you revert this commit, please, also revert r257823. BUG=https://llvm.org/bugs/show_bug.cgi?id=26214 Reviewers: pcc, loladiro, joker.eph Subscribers: krasin1, joker.eph, loladiro, pcc Differential Revision: http://reviews.llvm.org/D16338 llvm-svn: 258297	2016-01-20 08:41:22 +00:00
Dan Gohman	edf98c5682	[SelectionDAG] Fold more offsets into GlobalAddresses SelectionDAG previously missed opportunities to fold constants into GlobalAddresses in several areas. For example, given `(add (add GA, c1), y)`, it would often reassociate to `(add (add GA, y), c1)`, missing the opportunity to create `(add GA+c, y)`. This isn't often visible on targets such as X86 which effectively reassociate adds in their complex address-mode folding logic, however it is currently visible on WebAssembly since it currently has very simple address mode folding code that doesn't reassociate anything. This patch fixes this by making SelectionDAG fold offsets into GlobalAddresses at the same times that it folds constants together, so that it doesn't miss any opportunities to perform such folding. Differential Revision: http://reviews.llvm.org/D16090 llvm-svn: 258296	2016-01-20 07:03:08 +00:00
Dan Gohman	e5d3c15d7d	[WebAssembly] Tighten up some regexes in some tests. llvm-svn: 258295	2016-01-20 05:55:09 +00:00
Dan Gohman	8394756937	[WebAssembly] Minor code cleanups. NFC. llvm-svn: 258294	2016-01-20 05:54:22 +00:00
Dan Gohman	26cf4f3689	[WebAssembly] Remove the Relooper code, as it is not currently being used. llvm-svn: 258293	2016-01-20 05:50:29 +00:00
Lang Hames	3c43dc27ab	[Orc] 'this' qualify more lambda-captured members. More workaround attempts for GCC ICEs. llvm-svn: 258288	2016-01-20 05:10:59 +00:00
Lang Hames	5959df89e9	[Orc] More qualifications of lambda-captured member variables to fix GCC ICEs. llvm-svn: 258286	2016-01-20 04:32:05 +00:00
Dan Gohman	7e64917fd1	[WebAssembly] Don't stackify stores across instructions with side effects. llvm-svn: 258285	2016-01-20 04:21:16 +00:00
Andrew Wilkins	dfd6088c3f	tools/llvm-config: improve shared library support Summary: This is a re-commit of r257003, which was reverted, along with the fixes from http://reviews.llvm.org/D15986. r252532 added support for reporting the monolithic library when LLVM_BUILD_LLVM_DYLIB is used. This would only be done if the individual components were not found, and the dynamic library is found. This diff extends this as follows: - If LLVM_LINK_LLVM_DYLIB is set, then prefer the shared library, even if all component libraries exist. - Two flags, --link-shared and --link-static are introduced to provide explicit guidance. If --link-shared is passed and the shared library does not exist, an error results. Additionally, changed the expected shared library names from (e.g.) LLVM-3.8.0 to LLVM-3.8. The former exists only in an installation (and then only in CMake builds I think?), and not in the build tree; this breaks usage of llvm-config during builds, e.g. by llvm-go. Reviewers: DiamondLovesYou, beanz Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15986 llvm-svn: 258283	2016-01-20 04:03:09 +00:00
Lang Hames	efa5f6c170	[Orc] Qualify captured variable to work around GCC ICE. llvm-svn: 258278	2016-01-20 03:12:40 +00:00
Xinliang David Li	da656fe50e	Fix a bug in test llvm-svn: 258276	2016-01-20 02:49:53 +00:00
Joseph Tremoulet	b41632bf0f	[Inliner/WinEH] Honor implicit nounwinds Summary: Funclet EH tables require that a given funclet have only one unwind destination for exceptional exits. The verifier will therefore reject e.g. two cleanuprets with different unwind dests for the same cleanup, or two invokes exiting the same funclet but to different unwind dests. Because catchswitch has no 'nounwind' variant, and because IR producers are not required to annotate calls which will not unwind as 'nounwind', it is legal to nest a call or an "unwind to caller" catchswitch within a funclet pad that has an unwind destination other than caller; it is undefined behavior for such a call or catchswitch to unwind. Normally when inlining an invoke, calls in the inlined sequence are rewritten to invokes that unwind to the callsite invoke's unwind destination, and "unwind to caller" catchswitches in the inlined sequence are rewritten to unwind to the callsite invoke's unwind destination. However, if such a call or "unwind to caller" catchswitch is located in a callee funclet that has another exceptional exit with an unwind destination within the callee, applying the normal transformation would give that callee funclet multiple unwind destinations for its exceptional exits. There would be no way for EH table generation to determine which is the "true" exit, and the verifier would reject the function accordingly. Add logic to the inliner to detect these cases and leave such calls and "unwind to caller" catchswitches as calls and "unwind to caller" catchswitches in the inlined sequence. This fixes PR26147. Reviewers: rnk, andrew.w.kaylor, majnemer Subscribers: alexcrichton, llvm-commits Differential Revision: http://reviews.llvm.org/D16319 llvm-svn: 258273	2016-01-20 02:15:15 +00:00
Xinliang David Li	59411db520	[PGO] Add a new interface to be used by Indirect Call Promotion llvm-svn: 258271	2016-01-20 01:26:34 +00:00
Eduard Burtescu	23c4d83aa3	[NFC] Replace several manual GEP loops with gep_type_iterator. Reviewers: dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16335 llvm-svn: 258262	2016-01-20 00:26:52 +00:00
Xinliang David Li	440cd7027b	Function name change /NFC llvm-svn: 258260	2016-01-20 00:24:36 +00:00
Matthias Braun	d4f6409dff	MachineScheduler: Allow independent scheduling of sub register defs Note that this is disabled by default and still requires a patch to handleMove() which is not upstreamed yet. If the TrackLaneMasks policy/strategy is enabled the MachineScheduler will build a schedule graph where definitions of independent subregisters are no longer serialised. Implementation comments: - Without lane mask tracking a sub register def also counts as a use (except for the first one with the read-undef flag set), with lane mask tracking enabled this is no longer the case. - Pressure Diffs where previously maintained per definition of a vreg with the help of the SSA information contained in the LiveIntervals. With lanemask tracking enabled we cannot do this anymore and instead change the pressure diffs for all uses of the vreg as it becomes live/dead. For this changed style to work correctly we ignore uses of instructions that define the same register again: They won't affect register pressure. - With lanemask tracking we remove all read-undef flags from sub register defs when building the graph and re-add them later when all vreg lanes have become dead. Differential Revision: http://reviews.llvm.org/D14969 llvm-svn: 258259	2016-01-20 00:23:32 +00:00
Matthias Braun	5d458617aa	RegisterPressure: Make liveness tracking subregister aware Differential Revision: http://reviews.llvm.org/D14968 llvm-svn: 258258	2016-01-20 00:23:26 +00:00
Matthias Braun	3907fded1b	LiveInterval: Add utility class to rename independent subregister usage This renaming is necessary to avoid a subregister aware scheduler accidentally creating liveness "holes" which are rejected by the MachineVerifier. Explanation as found in this patch: Helper class that can divide MachineOperands of a virtual register into equivalence classes of connected components. MachineOperands belong to the same equivalence class when they are part of the same SubRange segment or adjacent segments (adjacent in control flow); Different subranges affected by the same MachineOperand belong to the same equivalence class. Example: vreg0:sub0 = ... vreg0:sub1 = ... vreg0:sub2 = ... ... xxx = op vreg0:sub1 vreg0:sub1 = ... store vreg0:sub0_sub1 The example contains 3 different equivalence classes: - One for the (dead) vreg0:sub2 definition - One containing the first vreg0:sub1 definition and its use, but not the second definition! - The remaining class contains all other operands involving vreg0. We provide a utility function here to rename disjunct classes to different virtual registers. Differential Revision: http://reviews.llvm.org/D16126 llvm-svn: 258257	2016-01-20 00:23:21 +00:00
Tom Stellard	2e045bbc5f	AMDGPU/SI: Prevent the DAGCombiner from creating setcc with i1 inputs Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15035 llvm-svn: 258256	2016-01-20 00:13:22 +00:00
Sanjoy Das	16901a3e20	[MachineSink] Don't break ImplicitNulls Summary: This teaches MachineSink to not sink instructions that might break the implicit null check optimization that runs later. This should not affect frontends that do not use implicit null checks. Reviewers: aadg, reames, hfinkel, atrick Subscribers: majnemer, llvm-commits Differential Revision: http://reviews.llvm.org/D14632 llvm-svn: 258254	2016-01-20 00:06:14 +00:00
Davide Italiano	648f4e32ba	Reinstate the second part of a comment. NFC. Reported by: Filipe Cabecinhas Pointy-hat to: me llvm-svn: 258223	2016-01-19 23:39:28 +00:00
Quentin Colombet	4cf56917ea	[X86] Do not run shrink-wrapping on function with split-stack attribute or HiPE calling convention. The implementation of the related callbacks in the x86 backend for such functions are not ready to deal with a prologue block that is not the entry block of the function. This fixes PR26107, but the longer term solution would be to fix those callbacks. llvm-svn: 258221	2016-01-19 23:29:03 +00:00
Sanjay Patel	582857c95c	add tests to show missing memset/malloc optimizations (PR25892) llvm-svn: 258218	2016-01-19 23:07:10 +00:00
David Majnemer	ce10842036	[MC, COFF] Add .reloc support for WinCOFF This adds rudimentary support for a few relocations that we will use for the CodeView debug format. llvm-svn: 258216	2016-01-19 23:05:27 +00:00
Simon Pilgrim	4b919b2ab3	[X86][SSE] Add VZEXT_MOVL target shuffle decoding. Add support for decoding VZEXT_MOVL target shuffle masks, allowing it to be used as a source in target shuffle combines. llvm-svn: 258215	2016-01-19 23:04:56 +00:00
Nico Weber	963a5f4262	Reenable -Wexpansion-to-defined. I think I fixed all instances of this in the codebase (r258202, 258200, 258190). Also, the suppression didn't have an effect on bots using make anyways, and it looks like many bots still use configure/make bots. llvm-svn: 258210	2016-01-19 22:46:33 +00:00
Lang Hames	951f73a2de	[Orc] Oops - lambda capture changed in r258206 was correct. Fully qualify reference to Finalized in the body of the lambda instead to work around GCC ICE. llvm-svn: 258208	2016-01-19 22:32:58 +00:00
Quentin Colombet	2c49e2e664	[MachineFunction] Constify getter. NFC. llvm-svn: 258207	2016-01-19 22:31:12 +00:00
Lang Hames	97ce2bcefe	[Orc] Add missing capture to lambda. llvm-svn: 258206	2016-01-19 22:31:01 +00:00
Simon Pilgrim	e74653b67a	[X86][SSE] Add INSERTPS target shuffle combines. As vector shuffles can only reference two inputs many (V)INSERTPS patterns end up being split over two targets shuffles. This patch adds combines to attempt to combine (V)INSERTPS nodes with input/output nodes that are just zeroing out these additional vector elements. Differential Revision: http://reviews.llvm.org/D16072 llvm-svn: 258205	2016-01-19 22:24:12 +00:00
Lang Hames	df1ce15ef2	[Orc] Qualify call to make_unique to avoid ambiguity with std::make_unique. This should fix some of the bot failures associated with r258185. llvm-svn: 258204	2016-01-19 22:22:43 +00:00
Lang Hames	00b7bef269	[Orc] #undef a MACRO after I'm done with it. Suggested by Philip Reames in review of r257951. Thanks Philip! llvm-svn: 258203	2016-01-19 22:20:21 +00:00
Chad Rosier	5c72966ea3	[AArch64] Remove a bunch of useless FIXME comments. llvm-svn: 258193	2016-01-19 21:47:24 +00:00
Dan Gohman	cff798386e	[WebAssembly] Remove an unused data member. NFC. llvm-svn: 258192	2016-01-19 21:31:41 +00:00
Chad Rosier	b11c82d3e2	[AArch64] Remove more dead code after r258093. llvm-svn: 258191	2016-01-19 21:27:05 +00:00
Nico Weber	4e41694538	Fix undefined behavior in llvm's local changes to googletest. r100895 landed an llvm-only change to add minix support to googletest. It did that by putting "defined()" in a macro, which has undefined behavior. Slightly reshuffle things to remove that undefined behavior. Also mention in README.LLVM that minix support is a local change. llvm-svn: 258190	2016-01-19 21:22:36 +00:00
Xinliang David Li	0a83b1b994	Fix a coverage reading bug function record pointer is not advanced when duplicate entry is found. Test case to be added. llvm-svn: 258188	2016-01-19 21:18:12 +00:00
Lang Hames	bf4e1981e6	[Orc] Fix a stale comment. llvm-svn: 258187	2016-01-19 21:13:54 +00:00
Lang Hames	2fe7acb773	[Orc] Refactor ObjectLinkingLayer::addObjectSet to defer loading objects until they're needed. Prior to this patch objects were loaded (via RuntimeDyld::loadObject) when they were added to the ObjectLinkingLayer, but were not relocated and finalized until a symbol address was requested. In the interim, another object could be loaded and finalized with the same memory manager, causing relocation/finalization of the first object to fail (as the first finalization call may have marked the allocated memory for the first object read-only). By deferring the loadObject call (and subsequent memory allocations) until an object file is needed we can avoid prematurely finalizing memory. llvm-svn: 258185	2016-01-19 21:06:38 +00:00
Sanjoy Das	29a4b5dc0d	[SCEV] Fix PR26207 In some cases, the max backedge taken count can be more conservative than the exact backedge taken count (for instance, because ScalarEvolution::getRange is not control-flow sensitive whereas computeExitLimitFromICmp can be). In these cases, computeExitLimitFromCond (specifically the bit that deals with `and` and `or` instructions) can create an ExitLimit instance with a `SCEVCouldNotCompute` max backedge count expression, but a computable exact backedge count expression. This violates an implicit SCEV assumption: a computable exact BE count should imply a computable max BE count. This change - Makes the above implicit invariant explicit by adding an assert to ExitLimit's constructor - Changes `computeExitLimitFromCond` to be more robust around conservative max backedge counts llvm-svn: 258184	2016-01-19 20:53:51 +00:00
Sanjoy Das	0ff078736f	[SCEV] Use range-for; NFC llvm-svn: 258183	2016-01-19 20:53:46 +00:00
JF Bastien	17999f20fa	WebAssembly: mark known failure caused by r258125 The following test program triggers the assertion: https://github.com/gcc-mirror/gcc/blob/master/gcc/testsuite/gcc.c-torture/execute/20030916-1.c llvm-svn: 258182	2016-01-19 20:53:12 +00:00
Nico Weber	e18e076bd5	Fix bootstrap -Werror builds after clang r258128 llvm-svn: 258181	2016-01-19 20:52:17 +00:00
Kostya Serebryany	311f27c0a8	[libFuzzer] use std::mt19937 for generating random numbers by default. Fix MyStoll to handle negative values. Use std::any_of instead of std::find_if llvm-svn: 258178	2016-01-19 20:33:57 +00:00
Sanjay Patel	d4af297df1	getParent()->getParent() == getModule() ; NFC llvm-svn: 258176	2016-01-19 19:58:49 +00:00
Sanjay Patel	d3112a5bcc	function names start with a lowercase letter; NFC Note: There are no uses of these functions outside of SimplifyLibCalls, so they could be static functions in that file. llvm-svn: 258172	2016-01-19 19:46:10 +00:00
Hans Wennborg	b83a8ddfe8	test-release.sh: Use CMake also for Darwin This didn't work for 3.7, but hopefully it should work now. llvm-svn: 258168	2016-01-19 19:21:58 +00:00
Sanjay Patel	b50325e276	fix formatting; NFC llvm-svn: 258167	2016-01-19 19:17:47 +00:00
Sanjay Patel	4e86036733	don't repeat documentation comments in implementation file; NFC llvm-svn: 258166	2016-01-19 19:16:10 +00:00
Sanjay Patel	251cf1336a	don't repeat function names in documentation comments; NFC llvm-svn: 258164	2016-01-19 19:10:10 +00:00
Manuel Jacob	3f49f654a2	Move part of an if condition into an assertion. NFC. llvm-svn: 258163	2016-01-19 19:04:49 +00:00
Michael Zuckerman	4582bdab12	[AVX512] Adding VPERMT2B and VPERMI2B instruction . Differential Revision: http://reviews.llvm.org/D16297 llvm-svn: 258161	2016-01-19 18:47:02 +00:00
Philip Reames	1a196f7daf	Revert 258157 According the build bots, clang is using the Registry class somewhere as well. Will reapply with appropriate clang changes at a later point. llvm-svn: 258159	2016-01-19 18:41:10 +00:00
Sanjay Patel	d1f4f03f5e	[LibCallSimplifier] use instruction-level fast-math-flags to shrink calls This is a continuation of adding FMF to call instructions: http://reviews.llvm.org/rL255555 llvm-svn: 258158	2016-01-19 18:38:52 +00:00
Philip Reames	0f6650e8e8	[GC] Registry initialization and linkage interactions The Registry class constructs a linked list of nodes whose storage is inside static variables and nodes are added via static initializers. The trick is that those static initializers are in both the LLVM code base, and some random plugin that might get loaded in at runtime. The existing code tries to use C++ templates and their ODR rules to get a single definition of the registry for each type, but, experimentally, this doesn't quite work as designed. (Well, the entire structure doesn't. It might not actually be an ODR problem.) Previously, when I tried moving the GCStrategy class (along with it's registry) from CodeGen to IR, I ran into a problem where asking the GCStrategyRegistry a question would return inconsistent results depending on whether you asked from CodeGen (where the static initializers still were) or Transforms. My best guess is that this is a result of either a) an order of initialization error, or b) we ended up with two copies of the registry being created. I remember at the time having convinced myself it was probably (b), but I don't have any of my notes around from that investigation any more. See http://reviews.llvm.org/rL226311 for the original patch in question. This patch tries to remove the possibility of (b) above. (a) was already fixed in change 258109. Differential Revision: http://reviews.llvm.org/D16170 llvm-svn: 258157	2016-01-19 18:34:27 +00:00
Rong Xu	294572f116	[PGO] Create the profile data variable before the lowering This patch creates the profile data variable before lowering the profile intrinsics. Reviewers: davidxl, silvas Differential Revision: http://reviews.llvm.org/D16015 llvm-svn: 258156	2016-01-19 18:29:54 +00:00
Philip Reames	1ec08ac7e4	Add clarifying comments defining what a Loop is Our loop construct is not a way to identify cycles in the CFG. This wasn't immediately obvious from the header, so clarify that fact. The motivation for this was that I just fixed a out of tree bug due to a mistaken assumption (on my part) on what a Loop actually was. While it was fresh in my mind, I wanted to document the key point. llvm-svn: 258154	2016-01-19 18:26:01 +00:00
Sanjay Patel	81a63cd11f	[LibCallSimplifier] use instruction-level fast-math-flags to transform pow(x, [small integer]) calls This is a continuation of adding FMF to call instructions: http://reviews.llvm.org/rL255555 As with D15937, the intent of the patch is to preserve the current behavior of the transform except that we use the pow call's 'fast' attribute as a trigger rather than a function-level attribute. The TODO comment notes a potential follow-on patch that would propagate FMF to the new instructions. Differential Revision: http://reviews.llvm.org/D16122 llvm-svn: 258153	2016-01-19 18:15:12 +00:00
Chris Ray	b541a3488f	NFC Test Commit whitespace change in a comment Changed whitespace so comments line up. llvm-svn: 258151	2016-01-19 18:01:20 +00:00
Rafael Espindola	a39d305ded	Use larger write sizes for MCFillFragment. This brings the pr26208 testcase down to 3.2 seconds. Not checking it in since it does create a 4GB .o file. llvm-svn: 258149	2016-01-19 17:47:48 +00:00
Geoff Berry	5c6e076eb2	[cmake] Fix add_version_info_from_vcs git svn version bug. Summary: add_version_info_from_vcs was setting SVN_REVISION to the last fetched svn revision when using git svn instead of the svn revision corresponding to HEAD. This leads to conflicts with the definition of SVN_REVISION in SVNVersion.inc generated by GetSVN.cmake when HEAD is not the most recently fetched svn revision. Use 'git svn info' to determine SVN_REVISION when git svn is being used instead (as is done in GetSVN.cmake). Reviewers: beanz Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16299 llvm-svn: 258148	2016-01-19 17:36:02 +00:00
Sanjay Patel	142c49bc42	remove outdated comment; NFC llvm-svn: 258147	2016-01-19 17:29:22 +00:00
Eduard Burtescu	19eb03106d	[opaque pointer types] [NFC] GEP: replace get(Pointer)ElementType uses with get{Source,Result}ElementType. Summary: GEPOperator: provide getResultElementType alongside getSourceElementType. This is made possible by adding a result element type field to GetElementPtrConstantExpr, which GetElementPtrInst already has. GEP: replace get(Pointer)ElementType uses with get{Source,Result}ElementType. Reviewers: mjacob, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16275 llvm-svn: 258145	2016-01-19 17:28:00 +00:00
Michael Zuckerman	d9cac592f4	[AVX512] Adding VPERMB instruction Differential Revision: http://reviews.llvm.org/D16294 llvm-svn: 258144	2016-01-19 17:07:43 +00:00
Dan Gohman	b6fd39a3a7	[WebAssembly] Rematerialize constants rather than hold them live in registers. Teach the register stackifier to rematerialize constants that have multiple uses instead of leaving them in registers. In the WebAssembly encoding, it's the same code size to materialize most constants as it is to read a value from a register. llvm-svn: 258142	2016-01-19 16:59:23 +00:00
Rafael Espindola	1a7e8b4bc1	Simplify MCFillFragment. The value size was always 1 or 0, so we don't need to store it. In a no asserts build this takes the testcase of pr26208 from 11 to 10 seconds. llvm-svn: 258141	2016-01-19 16:57:08 +00:00
Dan Gohman	7126859e64	[WebAssembly] Change a FIXME to a TODO in a comment. llvm-svn: 258139	2016-01-19 16:52:50 +00:00
Dan Gohman	d1b53909b2	[WebAssembly] Re-enable this test, now that interactions with the coalescer are resolved. llvm-svn: 258138	2016-01-19 16:52:09 +00:00
Chad Rosier	401a4ab8d8	Typo. llvm-svn: 258137	2016-01-19 16:50:45 +00:00
Marina Yatsina	d9658d16fd	[X86] Add support for "xlat m8" According to x86 spec "xlat m8" is a legal instruction and it is equivalent to "xlatb". Differential Revision: http://reviews.llvm.org/D15150 llvm-svn: 258135	2016-01-19 16:35:38 +00:00
Manuel Jacob	c784e6acd9	Fix constant folding of constant vector GEPs with undef or null as pointer argument. Reviewers: eddyb Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16321 llvm-svn: 258134	2016-01-19 16:34:31 +00:00
Marina Yatsina	b9f4f62cfe	[X86] Adding support for missing variations of X86 string related instructions The following are legal according to X86 spec: ins mem, DX outs DX, mem lods mem stos mem scas mem cmps mem, mem movs mem, mem Differential Revision: http://reviews.llvm.org/D14827 llvm-svn: 258132	2016-01-19 15:37:56 +00:00
Manuel Jacob	6a4761e384	Rename Variable `Ptr` to `PtrTy`. NFC. llvm-svn: 258130	2016-01-19 15:21:15 +00:00
Rafael Espindola	5568c83a60	Handle 64 bit offsets. No tests since llvm-mc takes 14 seconds on it. I will try to improve it and then test. Part of pr26208. llvm-svn: 258129	2016-01-19 15:19:08 +00:00
Dan Gohman	b13c91f159	[WebAssembly] Disable some WebAssembly-specific optimization passes at -O0. llvm-svn: 258127	2016-01-19 14:55:02 +00:00
Dan Gohman	3196650bf3	[WebAssembly] Use the templated form of MachineFunction::getSubtarget(). NFC. llvm-svn: 258126	2016-01-19 14:53:19 +00:00
Dan Gohman	0553299586	[WebAssembly] Re-enable loop idiom recognition for memcpy et al. llvm-svn: 258125	2016-01-19 14:49:23 +00:00
Asaf Badouh	d4a0d9a78c	[X86][AVX512]fix dag & add intrinsics for fixupimm cover all width and types (pd/ps/sd/ss) of fixupimm instruction and inrtinsics Differential Revision: http://reviews.llvm.org/D16313 llvm-svn: 258124	2016-01-19 14:21:39 +00:00
Andrew Wilkins	2a3810e8f7	docs: address post-commit review Rewording/expansion of CMake options suggested by Dan Liew. See http://reviews.llvm.org/D16208. llvm-svn: 258112	2016-01-19 05:43:21 +00:00
Philip Reames	b336bca07e	[GC] Lower vectors-of-pointers directly by default This commit changes the default on our lowering of vectors-of-pointers from splitting in RS4GC to reporting them in the final stack map. All of the changes to do so are already in place and tested. Assuming no problems are unearthed in the next week, we will be deleting the old code entirely next Monday. llvm-svn: 258111	2016-01-19 04:18:24 +00:00
Philip Reames	3195500297	[GC] Consolidate all built in GCs into a single file [NFC] Combine a bunch of small files into a single, still rather small, file. The primary purpose of this is to get all of the static initializers into a single file so as to have a well defined order of initialization. llvm-svn: 258109	2016-01-19 03:57:18 +00:00
Kelvin Li	510498c0d3	parseArch() supports more variations of arch names for PowerPC builds llvm-svn: 258103	2016-01-19 00:04:41 +00:00
Tobias Edler von Koch	3f4f6f3ed6	Add a change accidentally left out from r258100 Also remove an executable bit introduced by r258083. llvm-svn: 258101	2016-01-18 23:35:24 +00:00
Tobias Edler von Koch	8ecaf69291	[LTO] Restore original linkage of externals prior to splitting Summary: This is a companion patch for http://reviews.llvm.org/D16124. Internalized symbols increase the size of strongly-connected components in SCC-based module splitting and thus reduce the amount of parallelism. This patch records the original linkage of non-local symbols prior to internalization and then restores it just before splitting/CodeGen. This is also useful for cases where the linker requires symbols to remain external, for instance, so they can be placed according to linker script rules. It's currently under its own flag (-restore-globals) but should eventually share a common flag with D16124. Reviewers: joker.eph, pcc Subscribers: slarin, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D16229 llvm-svn: 258100	2016-01-18 23:24:54 +00:00
Simon Pilgrim	c4d519d340	Fixed MSVC warning that not all control paths return a value. llvm-svn: 258099	2016-01-18 22:54:46 +00:00
Matt Arsenault	33e3ecee0c	AMDGPU: Reduce 64-bit SRAs llvm-svn: 258096	2016-01-18 22:09:04 +00:00
Matt Arsenault	6e3a45193a	AMDGPU: Split 64-bit and of constant up This breaks the tests that were meant for testing 64-bit inline immediates, so move those to shl where they won't be broken up. This should be repeated for the other related bit ops. llvm-svn: 258095	2016-01-18 22:01:13 +00:00
Simon Pilgrim	77d86d1c08	[X86][AVX2] Ensure integer execution domain for integer blend tests llvm-svn: 258094	2016-01-18 21:58:21 +00:00
Chad Rosier	234bf6fe5c	[AArch64] Remove unused arguments. NFC. AFAICT, these have been unused since the initial backend import. llvm-svn: 258093	2016-01-18 21:56:40 +00:00
Matt Arsenault	3cbbc10488	AMDGPU: Generalize shl combine Reduce 64-bit shl with constant > 32. We already special cased this for the == 32 case, but this also works for any >= 32 constant. llvm-svn: 258092	2016-01-18 21:55:14 +00:00
Simon Pilgrim	3ca2f21f50	[X86][SSE] Regenerate vector blend commutation tests llvm-svn: 258091	2016-01-18 21:46:46 +00:00
Matt Arsenault	80edab99ff	AMDGPU: Reduce 64-bit lshr by constant to 32-bit 64-bit shifts are very slow on some subtargets. llvm-svn: 258090	2016-01-18 21:43:36 +00:00
Davide Italiano	f0caa3eaab	[Support/ELF] Remove field erroneously added in r258025. Although glibc defines it, this is currently of no use for my primary use-case (dumping DT_* keys correctly). Its semantic is not described anywhere I can find, so better leave it out for now. Thanks to Rafael for pointing out in his post-commit review! llvm-svn: 258089	2016-01-18 21:20:02 +00:00
Adam Nemet	d8968f0945	[LAA] Include function name in debug output llvm-svn: 258088	2016-01-18 21:16:33 +00:00
Davide Italiano	5e82324fe4	[JIT] Add small-code model test for ELF. The coverage is almost non-existent, hopefully more will come after this. Differential Revision: http://reviews.llvm.org/D16096 llvm-svn: 258087	2016-01-18 21:14:12 +00:00
Matt Arsenault	4085e8fcef	AMDGPU: Cleanup sra test llvm-svn: 258086	2016-01-18 21:13:56 +00:00
Matt Arsenault	e83690c1cc	AMDGPU: Add subtarget feature for instruction rates llvm-svn: 258085	2016-01-18 21:13:50 +00:00
Simon Pilgrim	99c6c29c0c	Fixed MSVC Win64 warning of implicit conversion of 32-bit shift to 64-bits. llvm-svn: 258084	2016-01-18 21:11:19 +00:00
Sergei Larin	d19d4d30d8	Add to the split module utility an SCC based method which allows not to globalize any local variables. Summary: Currently llvm::SplitModule as the first step globalizes all local objects, which might not be desirable in some scenarios. This change adds a new flag to llvm::SplitModule that uses SCC approach to search for a balanced partition without the need to externalize symbols. Such partition might not be possible or fully balanced for a given number of partitions, and is a function of the module properties (global/local dependencies within the module). Joint development Tobias Edler von Koch (tobias@codeaurora.org) and Sergei Larin (slarin@codeaurora.org) Subscribers: llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D16124 llvm-svn: 258083	2016-01-18 21:07:13 +00:00
Rafael Espindola	df9e61b599	Delete dead code. llvm-svn: 258082	2016-01-18 21:01:50 +00:00
Simon Pilgrim	3e5fb61978	[X86][AVX2] Broadcast subvectors AVX2 can only broadcast from the zero'th element of a vector, but if the broadcastable element is the zero'th element of a 128-bit subvector its advantageous to extract the subvector, broadcast from that and avoid the loading of shuffle mask data that would be needed for VPERMPS/VPERMD. The only exception being when the source type is 4f64 or 4i64 which can directly use the immediate shuffle VPERMPD/VPERMQ directly. Differential Revision: http://reviews.llvm.org/D16050 llvm-svn: 258081	2016-01-18 20:59:04 +00:00
Rafael Espindola	a79078c3ce	Use new function name. NFC. llvm-svn: 258079	2016-01-18 20:55:24 +00:00
Krzysztof Parzyszek	7aae9b3782	[Hexagon] Recognize more copy-equivalents in RDF optimizations llvm-svn: 258076	2016-01-18 20:45:51 +00:00
Krzysztof Parzyszek	adc64b7df0	[RDF] Improvements to copy propagation - Allow any instruction to define equality between registers. - Keep the DFG updated. llvm-svn: 258075	2016-01-18 20:43:57 +00:00
Krzysztof Parzyszek	e6b0662092	[RDF] Improve compile-time performance of dead code elimination llvm-svn: 258074	2016-01-18 20:42:47 +00:00
Krzysztof Parzyszek	69e670d5f9	[RDF] Allow unlinking ref nodes from data-flow chains only llvm-svn: 258073	2016-01-18 20:41:34 +00:00
Craig Topper	5e46adb09a	[TableGen] Use FoldingSets instead of DenseMaps to unique UnOpInit, BinOpInit and TernOpInit. This remove the memory needed to store the key for the DenseMap. NFC llvm-svn: 258071	2016-01-18 20:36:06 +00:00
Craig Topper	7dcb1a5c89	[TableGen] Fix an assert I missed in r258063. llvm-svn: 258068	2016-01-18 19:59:05 +00:00
Tom Stellard	ccdc5391ea	TargetLowering: Improve handling of (setcc ([sz]ext x) 0, cc) in SimplifySetCC Summary: When SimplifySetCC sees a setcc node that compares the result of a value extension operation with a constant, it tries to simplify the setcc node by eliminating the extension and shrinking the constant. If shrinking the inputs to setcc is deemed not desirable by the target (e.g. the target does not want a setcc comparing i1 values), then it is still possible to optimize this sequence in some cases. This patch adds the following combines to SimplifySetCC when shrinking setcc inputs is not desirable: (setcc ([sz]ext (setcc x, y, cc)), 0, setne) -> (setcc (x, y, cc)) (setcc ([sz]ext (setcc x, y, cc)), 0, seteq) -> (setcc (x, Y, !cc)) There are no tests for this yet, but once AMDGPU correctly implements TargetLowering::isTypeDesirableForOp(), this new combine will be exercised by the existing CodeGen/AMDGPU/setcc-opt.ll test. Reviewers: resistor, arsenm Subscribers: jroelofs, arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15034 llvm-svn: 258067	2016-01-18 19:55:21 +00:00
Craig Topper	0e41d0b963	[TableGen] Merge the SuperClass Record and SMRange vector into a single vector. This removes the state needed to manage the extra vector thus reducing the size of the Record class. NFC llvm-svn: 258065	2016-01-18 19:52:37 +00:00
Craig Topper	d4d3ebd937	[TableGen] Reorder fields in Record class to optimize memory usage. NFC llvm-svn: 258064	2016-01-18 19:52:29 +00:00
Craig Topper	fbfd578056	[TableGen] Allocate the Init pointer array for BitsInit/ListInit after the BitsInit/ListInit object itself. Saves a bit of memory. NFC llvm-svn: 258063	2016-01-18 19:52:24 +00:00
Sanjay Patel	c2ceb8b2d8	combine clauses with same output ; NFCI llvm-svn: 258062	2016-01-18 19:17:58 +00:00
Simon Atanasyan	e03126aea4	[llvm-readobj][ELF] s/dyn_rela_/dyn_rel_/ No functional changes. Follow up to r258001. These template functions might return both REL and RELA relocations. The 'rel' noun looks less ambiguous. llvm-svn: 258060	2016-01-18 18:52:04 +00:00
Sanjay Patel	7b7eec11c0	use m_OneUse ; NFCI llvm-svn: 258059	2016-01-18 18:36:38 +00:00
Sanjay Patel	3b8dcc731e	fix variable names, typos ; NFC llvm-svn: 258058	2016-01-18 18:28:09 +00:00
Sanjay Patel	d09b44a752	fix typo; NFC llvm-svn: 258057	2016-01-18 17:50:23 +00:00
Igor Breger	239fda676c	AVX512: Masked store intrinsic implementation. Implemented intrinsic for the follow instructions (store) : VMOVDQU8/16/32/64, VMOVDQA32/64, VMOVAPS/PD, VMOVUPS/PD. Differential Revision: http://reviews.llvm.org/D16271 llvm-svn: 258047	2016-01-18 13:52:57 +00:00
Elena Demikhovsky	9242ea87d6	Added Cannonlake processor to X86 Target Differential Revision: http://reviews.llvm.org/D16289 llvm-svn: 258046	2016-01-18 13:00:31 +00:00
Igor Breger	dd6522c653	AVX512 : Change v8i1 bitconvert GR8 pattern, remove unnecessary movzbl instruction. code example , previous implementation. movzbl %dil, %eax kmovw %eax, %k0 new code kmovw %edi, %k0 Differential Revision: http://reviews.llvm.org/D16287 llvm-svn: 258045	2016-01-18 12:02:45 +00:00
Oliver Stannard	9f68749eba	[ARM] Operands for PKHTB alias should be swapped When the shift immediate is zero, PKHTB is an alias for PKHBT, but the order of the input operands needs to be swapped. Differential Revision: http://reviews.llvm.org/D16288 llvm-svn: 258044	2016-01-18 11:56:35 +00:00
Michael Zuckerman	9c47e0681c	[AVX512] adding AVXVBMI feature flag Fixing wrong typo (avx515) → (avx512) Review over the shoulder by asaf . Differential Revision: http://reviews.llvm.org/D16190 llvm-svn: 258041	2016-01-18 11:12:47 +00:00
Xinliang David Li	42a13308a1	[Coverage] move a local var to be BinaryCoverageReader's member The symtab is logically referenced beyond the call to the create method. This changes makes sure its lifetime matches that of the reader. llvm-svn: 258036	2016-01-18 06:48:01 +00:00

... 2 3 4 5 6 ...

126590 Commits