llvm-project

Commit Graph

Author	SHA1	Message	Date
Xinliang David Li	ed966771da	[PGO] Implement ValueProfiling Closure interfaces for runtime value profile data This is one of the many steps to commonize value profiling support between profile runtime and compiler/llvm tools. After this change, profiler runtime now can share the same C APIs to do VP serialization/deseriazation with LLVM host tools (and produces value data in identical format between indexed and raw profile). It is not yet enabled in profiler runtime yet. Also added a unit test case to test runtime profile data serialization/deserialization interfaces implemented using common closure code. llvm-svn: 254110	2015-11-25 23:31:18 +00:00
Evgeniy Stepanov	9842d61ca4	[safestack] Fix alignment of dynamic allocas. Fixes PR25588. llvm-svn: 254109	2015-11-25 22:52:30 +00:00
Richard Diamond	a62513c5dc	Fix a use-after-free in `llvm-config`. Summary: This could happen if `GetComponentNames` is true, because `Name` from `VisitComponent` would reference a stack instance of `std::string` in `ComputeLibsForComponents`. Reviewers: beanz Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14913 llvm-svn: 254108	2015-11-25 22:49:48 +00:00
Dan Gohman	a774d719a0	[WebAssembly] Fix inline asm support for i64 operands. llvm-svn: 254106	2015-11-25 22:28:50 +00:00
Dan Gohman	d9b4218831	[WebAssembly] Fold setne and seteq comparisons into selects. llvm-svn: 254104	2015-11-25 22:13:48 +00:00
Kostya Serebryany	2d0ef14f5d	[libFuzzer] add a flag -exact_artifact_path llvm-svn: 254100	2015-11-25 21:40:46 +00:00
Krzysztof Parzyszek	70a134d29f	[Hexagon] Treat transfers of FP immediates are pseudo instructions This is a temporary fix to address ICE on 2005-10-21-longlonggtu.ll. The proper fix will be to use A2_tfrsi, but it will need more work to teach all users of A2_tfrsi to also expect a floating-point operand. llvm-svn: 254099	2015-11-25 21:40:03 +00:00
Dan Gohman	5941bde03c	[WebAssembly] Add some comments. NFC. llvm-svn: 254096	2015-11-25 21:32:06 +00:00
Marek Olsak	7ed6b2f414	AMDGPU/SI: select S_ABS_I32 when possible (v2) v2: added more tests, moved the SALU->VALU conversion to a separate function It looks like it's not possible to get subregisters in the S_ABS lowering code, and I don't feel like guessing without testing what the correct code would look like. llvm-svn: 254095	2015-11-25 21:22:45 +00:00
Dan Gohman	80e34e0a18	[WebAssembly] Fix WebAssembly register numbering for registers added late. If virtual registers are created late, mappings to WebAssembly registers need to be added explicitly. This patch adds a function to do so and teaches WebAssemblyPeephole to use it. This fixes an out-of-bounds access on the WARegs vector. llvm-svn: 254094	2015-11-25 21:13:02 +00:00
Davide Italiano	dd04fee8a6	[SCCP] More informative message if we don't know how to handle a terminator. llvm-svn: 254093	2015-11-25 21:03:36 +00:00
Matt Arsenault	49affb8462	AMDGPU: Check feature attributes in SIMachineFunctionInfo llvm-svn: 254091	2015-11-25 20:55:12 +00:00
Krzysztof Parzyszek	207c13f254	Add hexagonv55 and hexagonv60 as recognized CPUs, make v60 the default llvm-svn: 254089	2015-11-25 20:30:59 +00:00
Matt Arsenault	d179481857	AMDGPU: Add some tests for promotion of v2i64 scalar_to_vector llvm-svn: 254087	2015-11-25 20:01:03 +00:00
Matt Arsenault	61001bbc03	AMDGPU: Make v2i64/v2f64 legal types. They can be loaded and stored, so count them as legal. This is mostly to fix a number of common cases for load/store merging. llvm-svn: 254086	2015-11-25 19:58:34 +00:00
Artyom Skrobov	314ee04268	Expose isXxxConstant() functions from SelectionDAGNodes.h (NFC) Summary: Many target lowerings copy-paste the code to test SDValues for known constants. This code can instead be shared in SelectionDAG.cpp, and reused in the targets. Reviewers: MatzeB, andreadb, tstellarAMD Subscribers: arsenm, jyknight, llvm-commits Differential Revision: http://reviews.llvm.org/D14945 llvm-svn: 254085	2015-11-25 19:41:11 +00:00
Dan Gohman	fb3e0594e4	[WebAssembly] Use a physical register to describe ARGUMENT liveness. Instead of trying to move ARGUMENT instructions back up to the top after they've been scheduled or sunk down, use a fake physical register to create a liveness constraint that prevents ARGUMENT instructions from moving down in the first place. This is still not entirely ideal, however it is more robust than letting them move and moving them back. llvm-svn: 254084	2015-11-25 19:36:19 +00:00
Xinliang David Li	e809231b22	[PGO] Regroup functions in better order (NFC) llvm-svn: 254080	2015-11-25 19:13:00 +00:00
Dan Gohman	9c54d3b4c6	[WebAssembly] Clean up several FIXME comments. llvm-svn: 254079	2015-11-25 18:13:18 +00:00
Dan Gohman	1270b0a91d	[WebAssembly] Make several tests more strict. llvm-svn: 254077	2015-11-25 17:33:15 +00:00
Dan Gohman	81719f8555	[WebAssembly] Support for register stackifying with load and store instructions. llvm-svn: 254076	2015-11-25 16:55:01 +00:00
Dan Gohman	2c8fe6a428	[WebAssembly] Codegen support for ISD::ExternalSymbol llvm-svn: 254075	2015-11-25 16:44:29 +00:00
Dan Gohman	fd4a88c376	[WebAssembly] Add 'final' to some classes. NFC. llvm-svn: 254073	2015-11-25 16:29:24 +00:00
Dan Gohman	04c0401f28	[WebAssembly] Whitespace consistency. NFC. llvm-svn: 254071	2015-11-25 16:26:14 +00:00
Sanjay Patel	25150784ae	fix typo; NFC llvm-svn: 254069	2015-11-25 15:33:36 +00:00
Hal Finkel	005f840959	[PowerPC] Don't generate mfocrf on the e500mc The e500mc does not actually support the mfocrf instruction; update the processor definitions to reflect that fact. Patch by Tom Rix (with some test-case cleanup by me). llvm-svn: 254064	2015-11-25 10:14:31 +00:00
Eric Christopher	f83a2c2db2	Accept any stack offset, including none, here. llvm-svn: 254062	2015-11-25 09:21:36 +00:00
Eric Christopher	4675c439aa	Fix some places where we were assuming that memory type had been legalized to a simple type when lowering a truncating store of a vector type. In this case for an EVT we'll return Expand as we should in all of the cases anyhow. The testcase triggered at the one in VectorLegalizer::LegalizeOp, inspection found the rest. llvm-svn: 254061	2015-11-25 09:11:53 +00:00
Simon Pilgrim	c85c49c665	[X86][AVX] Regenerate Splat OptSize tests Tidied up triple and regenerate tests using update_llc_test_checks.py llvm-svn: 254060	2015-11-25 09:06:17 +00:00
Elena Demikhovsky	f07df9fcac	AVX-512: Fixed a bug in VPERMT2* intrinsic. It was wrong order of operands (from intrinsic to DAG node). I added more strict type specification for instruction selection. Differential Revision: http://reviews.llvm.org/D14942 llvm-svn: 254059	2015-11-25 08:17:56 +00:00
Xinliang David Li	f47cf5505f	[PGO] Convert InstrProfRecord based serialization methods to use common C methods 1. Convert serialization methods using InstrProfRecord as source into C (impl) interfaces using Closure. 2. Reimplement InstrProfRecord serialization method to use new C interface as dummy wrapper. Now it is ready to implement wrapper for runtime value profile data. (The new code need better source location -- but not changed in this patch to minimize diffs. ) llvm-svn: 254057	2015-11-25 06:23:38 +00:00
Xinliang David Li	ac5b860633	[PGO] convert a subset of C++ interfaces into C (for sharing) (NFC) llvm-svn: 254056	2015-11-25 04:29:24 +00:00
Xinliang David Li	4f18bef998	Move member functions closer to others of the same class (NFC) llvm-svn: 254055	2015-11-25 03:24:37 +00:00
Peter Collingbourne	463ff6d823	AsmParser: Make the code for parsing unnamed aliases more closely resemble that for unnamed globals. This fixes parsing of forward references to unnamed aliases. While here, remove an unnecessary isa check. llvm-svn: 254054	2015-11-25 02:54:07 +00:00
Xinliang David Li	9f68d876a0	Add missing documentation. (NFC) llvm-svn: 254051	2015-11-25 01:13:44 +00:00
Matthias Braun	4f85838320	Doxygen: Use mathjax to create formulas. The main motivation is to not require a latex installation when building the documentation. I would also expect a better image quality and the ability to copy&paste from formulas with a javascript based solution for displaying the math. Differential Revision: http://reviews.llvm.org/D14960 llvm-svn: 254048	2015-11-25 00:50:47 +00:00
Sanjoy Das	c521c7bea5	[OperandBundles] Extract duplicated code into a helper function, NFC llvm-svn: 254047	2015-11-25 00:42:24 +00:00
Sanjoy Das	7629346193	[InstCombine] Don't drop operand bundles Reviewers: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14857 llvm-svn: 254046	2015-11-25 00:42:19 +00:00
Xinliang David Li	4945b16708	Fix function naming (NFC) llvm-svn: 254045	2015-11-25 00:08:49 +00:00
Hans Wennborg	e412b71f95	Revert r253528: "[X86] Enable shrink-wrapping by default." This caused PR25607 and also caused Chromium to crash on start-up. (Also had to update test/CodeGen/X86/avx-splat.ll, which was committed after shrink wrapping was enabled.) llvm-svn: 254044	2015-11-25 00:05:13 +00:00
Kaelyn Takata	d0955312d9	Fix an asan error where NumElements > 32 for at least one case in test/CodeGen/X86/avg.ll. llvm-svn: 254043	2015-11-25 00:03:29 +00:00
Rong Xu	55fa418a90	Revert r254021 llvm-svn: 254042	2015-11-24 23:57:51 +00:00
Rong Xu	25c106b347	[PGO] Revert revision r254021,r254028,r254035 Revert the above revision due to multiple issues. llvm-svn: 254040	2015-11-24 23:49:08 +00:00
Xinliang David Li	28b700373e	[PGO] Add mapper callback to interfaces retrieving value data for site (NFC) This allows cleaner implementation and merging retrieving/mapping in one pass. llvm-svn: 254038	2015-11-24 23:36:52 +00:00
Teresa Johnson	3930361969	[ThinLTO] Add option to limit importing based on instruction count Add a simple initial heuristic to control importing based on the number of instructions recorded in the function's summary. Add option to control the limit, and test using option. llvm-svn: 254036	2015-11-24 22:55:46 +00:00
Rong Xu	88cb57aba9	[PGO] Relax test cases in PGO instrumentation Fix buildbot failure for clang-x86_64-linux-selfhost-modules. http://lab.llvm.org:8011/builders/clang-x86_64-linux-selfhost-modules/builds/8866 The failing test cases are newly added from r254021. It seems the IR has a different order in this platform. In this patch, I temporarily relax the test case to make the build green. I'll have a complete fix (more robust way to test) soon. llvm-svn: 254035	2015-11-24 22:50:34 +00:00
Diego Novillo	0b6985a3c6	SamplePGO - Add test for hot/cold inlined functions. When the original binary is executed and sampled, the resulting profile contains information on the original inline stack. We currently follow the original inline plan if we notice that the inlined callsite has more than 0 samples to it. A better way is to determine whether the callsite is actually worth inlining. If the callsite accumulates a small fraction of the samples spent in the parent function, then we don't want to bother inlining it (as it means that the callsite is actually cold). This patch introduces a threshold expressed in percentage of samples in relation to the parent function. If the callsite uses less than N% of the total samples used by its parent, the original inline decision is not re-applied. I've set the threshold to the very arbitrary value of 5%. I'm yet to do any actual experiments to see what's a good value. I wanted to separate the basic mechanism from the tuning. llvm-svn: 254034	2015-11-24 22:38:37 +00:00
Simon Pilgrim	c1225c28e1	[X86][SSE] Regenerate PMUL tests Tidied up triple and regenerate tests using update_llc_test_checks.py llvm-svn: 254029	2015-11-24 22:09:31 +00:00
Rong Xu	4dd22b8d2b	[PGO] Fix build errors in x86_64-darwin Fix buildbot failure for x86_64-darwin due to r254021 llvm-svn: 254028	2015-11-24 21:55:50 +00:00
Evgeniy Stepanov	b05d380451	[msan] Relax origin-alignment test. Change origin-alignment test to test only the alignment of the origin store, and not the exact instruction sequence used to compute the address. This makes the test less fragile and, in particular, lets it pass both with the old and new MSan ABIs. llvm-svn: 254027	2015-11-24 21:44:16 +00:00
Rong Xu	1b665ca707	[PGO] MST based PGO instrumentation infrastructure This patch implements a minimum spanning tree (MST) based instrumentation for PGO. The use of MST guarantees minimum number of CFG edges getting instrumented. An addition optimization is to instrument the less executed edges to further reduce the instrumentation overhead. The patch contains both the instrumentation and the use of the profile to set the branch weights. Differential Revision: http://reviews.llvm.org/D12781 llvm-svn: 254021	2015-11-24 21:31:25 +00:00
Teresa Johnson	d450da3281	[ThinLTO] Refactor function body scan during importing into helper (NFC) llvm-svn: 254020	2015-11-24 21:15:19 +00:00
Xinliang David Li	2fc2515519	Fix sphinx-build error when building documentation. Consolidate the description of -binary/-text option description to avoid duplicate ID error by sphinux-build. llvm-svn: 254018	2015-11-24 20:48:25 +00:00
Sanjoy Das	990914d64c	[RuntimeDyld] Fix a class of arithmetic errors introduced in r253918 r253918 had refactored expressions like "A - B.Address + C" to "A - B.getAddressWithOffset(C)". This is incorrect, since the latter really computes "A - B.Address - C". None of the tests I can run locally on x86 broke due to this bug, but it is the current suspect for breakage on the AArch64 buildbots. llvm-svn: 254017	2015-11-24 20:37:01 +00:00
Simon Pilgrim	1b4fecb098	[X86][FMA] Optimize FNEG(FMA) Patterns X86 needs to use its own FMA opcodes, preventing the standard FNEG(FMA) pattern table recognition method used by other platforms. This patch adds support for lowering FNEG(FMA(X,Y,Z)) into a single suitably negated FMA instruction. Fix for PR24364 Differential Revision: http://reviews.llvm.org/D14906 llvm-svn: 254016	2015-11-24 20:31:46 +00:00
Matthias Braun	147110da84	LiveVariables should not clobber MachineOperand::IsDead, ::IsKill on reserved physical registers Patch by Nick Johnson <Nicholas.Paul.Johnson@deshawresearch.com> Differential Revision: http://reviews.llvm.org/D14875 llvm-svn: 254012	2015-11-24 20:06:56 +00:00
Teresa Johnson	130de7af7f	[ThinLTO] Enable iterative importing in FunctionImport pass Analyze imported function bodies and add any new external calls to the worklist for importing. Currently no controls on the importing so this will end up importing everything possible in the call tree below the importing module. Basic profitability checks coming next. Update test to check for iteratively inlined functions. llvm-svn: 254011	2015-11-24 19:55:04 +00:00
Cong Hou	db6220f84d	[X86] Fix several issues related to X86's psadbw instruction. This patch fixes the following issues: 1. Fix the return type of X86psadbw: it should not be the same type of inputs. For vNi8 inputs the output should be vMi64, where M = N/8. 2. Fix the return type of int_x86_avx512_psad_bw_512 accordingly. 3. Fix the definiton of PSADBW, VPSADBW, and VPSADBWY accordingly. 4. Adjust the return type when building a DAG node of X86ISD::PSADBW type. 5. Update related tests. Differential revision: http://reviews.llvm.org/D14897 llvm-svn: 254010	2015-11-24 19:51:26 +00:00
Teresa Johnson	b098f0c133	[ThinLTO] Handle previously imported and promoted locals in module linker The new function import pass exposed an issue when we import references to local values on multiple importing passes. They are renamed on each import pass, and we need to ensure that the already promoted and renamed references existing in the dest module are correctly identified and updated so that they aren't spuriously renamed again (due to a perceived conflict with the newly linked reference). llvm-svn: 254009	2015-11-24 19:46:58 +00:00
Xinliang David Li	6a829f78f9	[PGO] Introduce value profile data closure type. The closure is designed to abstact away two types of value profile data: - InstrProfRecord which is the primary data structure used to represent profile data in host tools (reader, writer, and profile-use) - value profile runtime data structure suitable to be used by C runtime library. Both sources of data need to serialize to disk/memory-buffer in common format: ValueProfData. The abstraction allows compiler-rt's raw profiler writer to share the same code with indexed profile writer. llvm-svn: 254008	2015-11-24 19:21:15 +00:00
Weiming Zhao	45d4cb9a14	[Utils] Put includes in correct order. NFC. Summary: Followed the guidelines in: http://llvm.org/docs/CodingStandards.html#include-style However, I noticed that uppercase named headers come before lowercase ones throughout the codebase. So kept them as is. Patch by Mandeep Singh Grang <mgrang@codeaurora.org> Reviewers: majnemer, davide, jmolloy, atrick Subscribers: sanjoy Differential Revision: http://reviews.llvm.org/D14939 llvm-svn: 254005	2015-11-24 18:57:06 +00:00
Xinliang David Li	759dc628c0	[PGO] Small interface change to be profile rt ready Convert two C++ static member functions to be C APIs. This is one of the many steps to get ready to share VP writer code with profiler runtime. llvm-svn: 253999	2015-11-24 18:15:46 +00:00
Sanjay Patel	968e91aea0	[InstCombine] fix propagation of fast-math-flags Noticed while working on D4583: http://reviews.llvm.org/D4583 llvm-svn: 253997	2015-11-24 17:51:20 +00:00
Sanjay Patel	739f2ce93a	use convenience function for copying IR flags; NFCI llvm-svn: 253996	2015-11-24 17:16:33 +00:00
Xinliang David Li	1b85d4c961	Minor refactor to make VP writing more efficient llvm-svn: 253994	2015-11-24 17:03:24 +00:00
Rafael Espindola	383d1f6aa6	Make this test a bit more strict. It now tests with files in both orders. llvm-svn: 253993	2015-11-24 16:43:53 +00:00
Krzysztof Parzyszek	b8bb90b744	Add vector types for intrinsics Author: Ron Lieberman <ronl@codeaurora.org> llvm-svn: 253992	2015-11-24 16:28:14 +00:00
Teresa Johnson	17626654fd	[ThinLTO] Fix FunctionImport alias checking and test Skip imports for weak_any aliases as well. Fix the test to check non-import of weak aliases and functions, and import of normal alias. llvm-svn: 253991	2015-11-24 16:10:43 +00:00
Krzysztof Parzyszek	47c1baeb1f	Add names for the new vector types in CodeGenTarget.cpp llvm-svn: 253989	2015-11-24 15:50:22 +00:00
Sanjay Patel	a0d354541d	[x86] remove duplicate movq instruction defs (PR25554) We had duplicated definitions for the same hardware '[v]movq' instructions. For example with SSE: def MOVZQI2PQIrr : RS2I<0x6E, MRMSrcReg, (outs VR128:$dst), (ins GR64:$src), "mov{d\|q}\t{$src, $dst\|$dst, $src}", // X86-64 only [(set VR128:$dst, (v2i64 (X86vzmovl (v2i64 (scalar_to_vector GR64:$src)))))], IIC_SSE_MOVDQ>; def MOV64toPQIrr : RS2I<0x6E, MRMSrcReg, (outs VR128:$dst), (ins GR64:$src), "mov{d\|q}\t{$src, $dst\|$dst, $src}", [(set VR128:$dst, (v2i64 (scalar_to_vector GR64:$src)))], IIC_SSE_MOVDQ>, Sched<[WriteMove]>; As shown in the test case and PR25554: https://llvm.org/bugs/show_bug.cgi?id=25554 This causes us to miss reusing an operand because later passes don't know these 'movq' are the same instruction. This patch deletes one pair of these defs. Sadly, this won't fix the original test case in the bug report. Something else is still broken. Differential Revision: http://reviews.llvm.org/D14941 llvm-svn: 253988	2015-11-24 15:44:35 +00:00
Krzysztof Parzyszek	aa93575b7e	[Hexagon] Add missing include of <cctype> Lack thereof breaks Windows builds due to the use of std::isspace in HexagonInstrInfo.cpp. llvm-svn: 253987	2015-11-24 15:11:13 +00:00
Krzysztof Parzyszek	b9a1c3a32c	[Hexagon] Bring HexagonInstrInfo up to date llvm-svn: 253986	2015-11-24 14:55:26 +00:00
Rafael Espindola	23117e5a7b	Add an already passing test. This tests that a declaration can resolve to an alias. I broke this locally while prototyping a change and it looks like a nice test to have. llvm-svn: 253984	2015-11-24 14:15:50 +00:00
Krzysztof Parzyszek	d4b566d50b	Add new vector types for 512-, 1024- and 2048-bit vectors Those types are needed to implement instructions for Hexagon Vector Extensions (HVX): 16x32, 16x64, 32x16, 32x32, 32x64, 64x8, 64x16, 64x32, 128x8, 128x16, 256x8, 512x1, and 1024x1. llvm-svn: 253978	2015-11-24 13:07:35 +00:00
Matt Arsenault	ff05da806c	AMDGPU: Split LDS vector loads If properly aligned this could allow using ds_read_b64. llvm-svn: 253975	2015-11-24 12:18:54 +00:00
Matt Arsenault	4d801cd357	AMDGPU: Split x8 and x16 vector loads instead of scalarize The one regression in the builtin tests is in the read2 test which now (again) has many extra copies, but this should be solved once the pass is replaced with a DAG combine. llvm-svn: 253974	2015-11-24 12:05:03 +00:00
Ismail Donmez	65487e2d7e	Fix build after r253954 llvm-svn: 253969	2015-11-24 09:48:09 +00:00
Pavel Labath	e3af02695d	Fix non-PIC build after 253959 CMAKE_EXE_LINKER_FLAGS is a string. Appending a flag using list(APPEND) introduces an extra semicolon which breaks stuff. Change this to append the value in the same way that everyone else seems to be doing. llvm-svn: 253968	2015-11-24 09:46:01 +00:00
Cong Hou	1938f2eb98	Let SelectionDAG start to use probability-based interface to add successors. The patch in http://reviews.llvm.org/D13745 is broken into four parts: 1. New interfaces without functional changes. 2. Use new interfaces in SelectionDAG, while in other passes treat probabilities as weights. 3. Use new interfaces in all other passes. 4. Remove old interfaces. This the second patch above. In this patch SelectionDAG starts to use probability-based interfaces in MBB to add successors but other MC passes are still using weight-based interfaces. Therefore, we need to maintain correct weight list in MBB even when probability-based interfaces are used. This is done by updating weight list in probability-based interfaces by treating the numerator of probabilities as weights. This change affects many test cases that check successor weight values. I will update those test cases once this patch looks good to you. Differential revision: http://reviews.llvm.org/D14361 llvm-svn: 253965	2015-11-24 08:51:23 +00:00
Craig Topper	5712d46114	[TableGen] Use std::remove_if instead of manually coded loops that call erase multiple times. NFC llvm-svn: 253964	2015-11-24 08:20:47 +00:00
Craig Topper	16f1cbd1e4	[TableGen] Use the other version of EnforceVectorEltTypeIs inside the TypeSet version of EnforceVectorEltTypeIs to reduce duplicated code. NFC llvm-svn: 253963	2015-11-24 08:20:45 +00:00
Craig Topper	dbfcc10e44	[TableGen] Fix formatting and use logical OR. NFC llvm-svn: 253962	2015-11-24 08:20:44 +00:00
Craig Topper	fef745c36a	[TableGen] Use std::set_intersection to merge TypeSets. NFC llvm-svn: 253961	2015-11-24 08:20:42 +00:00
Craig Topper	4856c81b46	[TableGen] Use SmallVector::assign instead of a resize and replace element. llvm-svn: 253960	2015-11-24 08:20:41 +00:00
Chris Bieneman	7494dc5e55	[CMake] When disabling PIC, also pass -fno-pie when linking if it is supported. Building clang with -fno-pie generates slightly faster code. In my not-very-rigorous testing I saw about a 4% speed up using the clang test-suite sources. llvm-svn: 253959	2015-11-24 08:04:59 +00:00
Craig Topper	d324d75102	Revert change that accidentally snuck into r253955. llvm-svn: 253956	2015-11-24 06:24:06 +00:00
Craig Topper	030418802a	[TableGen] Use array_pod_sort. NFC llvm-svn: 253955	2015-11-24 06:22:43 +00:00
Mehdi Amini	42418aba58	Add a FunctionImporter helper to perform summary-based cross-module function importing Summary: This is a helper to perform cross-module import for ThinLTO. Right now it is importing naively every possible called functions. Reviewers: tejohnson Subscribers: dexonsmith, llvm-commits Differential Revision: http://reviews.llvm.org/D14914 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253954	2015-11-24 06:07:49 +00:00
Mehdi Amini	1d704cdedf	Add findFunctionInfoList() accessor to FunctionInfoIndex. Summary: This allows to query for a function in the map without creating an entry, allowing to use a const FunctionInfoIndex. Reviewers: tejohnson Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14912 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253953	2015-11-24 06:07:42 +00:00
Cong Hou	bed60d35ed	[X86][SSE] Detect AVG pattern during instruction combine for SSE2/AVX2/AVX512BW. This patch detects the AVG pattern in vectorized code, which is simply c = (a + b + 1) / 2, where a, b, and c have the same type which are vectors of either unsigned i8 or unsigned i16. In the IR, i8/i16 will be promoted to i32 before any arithmetic operations. The following IR shows such an example: %1 = zext <N x i8> %a to <N x i32> %2 = zext <N x i8> %b to <N x i32> %3 = add nuw nsw <N x i32> %1, <i32 1 x N> %4 = add nuw nsw <N x i32> %3, %2 %5 = lshr <N x i32> %N, <i32 1 x N> %6 = trunc <N x i32> %5 to <N x i8> and with this patch it will be converted to a X86ISD::AVG instruction. The pattern recognition is done when combining instructions just before type legalization during instruction selection. We do it here because after type legalization, it is much more difficult to do pattern recognition based on many instructions that are doing type conversions. Therefore, for target-specific instructions (like X86ISD::AVG), we need to take care of type legalization by ourselves. However, as X86ISD::AVG behaves similarly to ISD::ADD, I am wondering if there is a way to legalize operands and result types of X86ISD::AVG together with ISD::ADD. It seems that the current design doesn't support this idea. Tests are added for SSE2, AVX2, and AVX512BW and both i8 and i16 types of variant vector sizes. Differential revision: http://reviews.llvm.org/D14761 llvm-svn: 253952	2015-11-24 05:44:19 +00:00
Davide Italiano	c304a0ddc1	[DIE] Make DIE.h NDEBUG conditional-free. Switch dump()/print() method definitions to LLVM_DUMP_METHOD instead. llvm-svn: 253945	2015-11-24 02:21:43 +00:00
Chris Bieneman	914742bb80	[CMake] export_executable_symbols also needs to add -rdynamic to the linker flags on Darwin Without -rdynamic LLVM built with LTO fails to pass "check" due to loadable modules failing. llvm-svn: 253944	2015-11-24 00:58:58 +00:00
Xinliang David Li	ff1a0bb254	Use make_unique [NFC] llvm-svn: 253942	2015-11-24 00:32:00 +00:00
Xinliang David Li	b46ad0a3c8	Remove trailing space in comments llvm-svn: 253941	2015-11-24 00:31:41 +00:00
Sanjay Patel	8ca4a5b9e5	minimize test case but still show the bug llvm-svn: 253940	2015-11-24 00:11:48 +00:00
Chris Bieneman	4cb7ab67c9	NFC. Fixing my consistently incorrect spelling. llvm-svn: 253936	2015-11-23 23:34:09 +00:00
Sanjay Patel	16fcf25eb9	added comment (using freshly updated update_llc_test_checks.py) llvm-svn: 253935	2015-11-23 23:22:05 +00:00
Sanjay Patel	d6e0cb01b1	[x86] add test to show suboptimal codegen (PR25554) llvm-svn: 253934	2015-11-23 23:18:20 +00:00
Sanjoy Das	5abfbb9246	[RuntimeDyld] Avoid unused-private-field warning; NFC Fixes the no asserts -Werror,-Wunused-private-field build. llvm-svn: 253933	2015-11-23 22:59:36 +00:00
Dan Gohman	192dddc595	[WebAssembly] Don't print the types of memory_size and grow_memory This matches the current spec, for now. llvm-svn: 253931	2015-11-23 22:37:29 +00:00

1 2 3 4 5 ...

124277 Commits