llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjay Patel	25150784ae	fix typo; NFC llvm-svn: 254069	2015-11-25 15:33:36 +00:00
Hal Finkel	005f840959	[PowerPC] Don't generate mfocrf on the e500mc The e500mc does not actually support the mfocrf instruction; update the processor definitions to reflect that fact. Patch by Tom Rix (with some test-case cleanup by me). llvm-svn: 254064	2015-11-25 10:14:31 +00:00
Eric Christopher	4675c439aa	Fix some places where we were assuming that memory type had been legalized to a simple type when lowering a truncating store of a vector type. In this case for an EVT we'll return Expand as we should in all of the cases anyhow. The testcase triggered at the one in VectorLegalizer::LegalizeOp, inspection found the rest. llvm-svn: 254061	2015-11-25 09:11:53 +00:00
Elena Demikhovsky	f07df9fcac	AVX-512: Fixed a bug in VPERMT2* intrinsic. It was wrong order of operands (from intrinsic to DAG node). I added more strict type specification for instruction selection. Differential Revision: http://reviews.llvm.org/D14942 llvm-svn: 254059	2015-11-25 08:17:56 +00:00
Xinliang David Li	f47cf5505f	[PGO] Convert InstrProfRecord based serialization methods to use common C methods 1. Convert serialization methods using InstrProfRecord as source into C (impl) interfaces using Closure. 2. Reimplement InstrProfRecord serialization method to use new C interface as dummy wrapper. Now it is ready to implement wrapper for runtime value profile data. (The new code need better source location -- but not changed in this patch to minimize diffs. ) llvm-svn: 254057	2015-11-25 06:23:38 +00:00
Xinliang David Li	ac5b860633	[PGO] convert a subset of C++ interfaces into C (for sharing) (NFC) llvm-svn: 254056	2015-11-25 04:29:24 +00:00
Xinliang David Li	4f18bef998	Move member functions closer to others of the same class (NFC) llvm-svn: 254055	2015-11-25 03:24:37 +00:00
Peter Collingbourne	463ff6d823	AsmParser: Make the code for parsing unnamed aliases more closely resemble that for unnamed globals. This fixes parsing of forward references to unnamed aliases. While here, remove an unnecessary isa check. llvm-svn: 254054	2015-11-25 02:54:07 +00:00
Sanjoy Das	c521c7bea5	[OperandBundles] Extract duplicated code into a helper function, NFC llvm-svn: 254047	2015-11-25 00:42:24 +00:00
Sanjoy Das	7629346193	[InstCombine] Don't drop operand bundles Reviewers: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14857 llvm-svn: 254046	2015-11-25 00:42:19 +00:00
Xinliang David Li	4945b16708	Fix function naming (NFC) llvm-svn: 254045	2015-11-25 00:08:49 +00:00
Hans Wennborg	e412b71f95	Revert r253528: "[X86] Enable shrink-wrapping by default." This caused PR25607 and also caused Chromium to crash on start-up. (Also had to update test/CodeGen/X86/avx-splat.ll, which was committed after shrink wrapping was enabled.) llvm-svn: 254044	2015-11-25 00:05:13 +00:00
Kaelyn Takata	d0955312d9	Fix an asan error where NumElements > 32 for at least one case in test/CodeGen/X86/avg.ll. llvm-svn: 254043	2015-11-25 00:03:29 +00:00
Rong Xu	25c106b347	[PGO] Revert revision r254021,r254028,r254035 Revert the above revision due to multiple issues. llvm-svn: 254040	2015-11-24 23:49:08 +00:00
Xinliang David Li	28b700373e	[PGO] Add mapper callback to interfaces retrieving value data for site (NFC) This allows cleaner implementation and merging retrieving/mapping in one pass. llvm-svn: 254038	2015-11-24 23:36:52 +00:00
Teresa Johnson	3930361969	[ThinLTO] Add option to limit importing based on instruction count Add a simple initial heuristic to control importing based on the number of instructions recorded in the function's summary. Add option to control the limit, and test using option. llvm-svn: 254036	2015-11-24 22:55:46 +00:00
Diego Novillo	0b6985a3c6	SamplePGO - Add test for hot/cold inlined functions. When the original binary is executed and sampled, the resulting profile contains information on the original inline stack. We currently follow the original inline plan if we notice that the inlined callsite has more than 0 samples to it. A better way is to determine whether the callsite is actually worth inlining. If the callsite accumulates a small fraction of the samples spent in the parent function, then we don't want to bother inlining it (as it means that the callsite is actually cold). This patch introduces a threshold expressed in percentage of samples in relation to the parent function. If the callsite uses less than N% of the total samples used by its parent, the original inline decision is not re-applied. I've set the threshold to the very arbitrary value of 5%. I'm yet to do any actual experiments to see what's a good value. I wanted to separate the basic mechanism from the tuning. llvm-svn: 254034	2015-11-24 22:38:37 +00:00
Rong Xu	4dd22b8d2b	[PGO] Fix build errors in x86_64-darwin Fix buildbot failure for x86_64-darwin due to r254021 llvm-svn: 254028	2015-11-24 21:55:50 +00:00
Rong Xu	1b665ca707	[PGO] MST based PGO instrumentation infrastructure This patch implements a minimum spanning tree (MST) based instrumentation for PGO. The use of MST guarantees minimum number of CFG edges getting instrumented. An addition optimization is to instrument the less executed edges to further reduce the instrumentation overhead. The patch contains both the instrumentation and the use of the profile to set the branch weights. Differential Revision: http://reviews.llvm.org/D12781 llvm-svn: 254021	2015-11-24 21:31:25 +00:00
Teresa Johnson	d450da3281	[ThinLTO] Refactor function body scan during importing into helper (NFC) llvm-svn: 254020	2015-11-24 21:15:19 +00:00
Sanjoy Das	990914d64c	[RuntimeDyld] Fix a class of arithmetic errors introduced in r253918 r253918 had refactored expressions like "A - B.Address + C" to "A - B.getAddressWithOffset(C)". This is incorrect, since the latter really computes "A - B.Address - C". None of the tests I can run locally on x86 broke due to this bug, but it is the current suspect for breakage on the AArch64 buildbots. llvm-svn: 254017	2015-11-24 20:37:01 +00:00
Simon Pilgrim	1b4fecb098	[X86][FMA] Optimize FNEG(FMA) Patterns X86 needs to use its own FMA opcodes, preventing the standard FNEG(FMA) pattern table recognition method used by other platforms. This patch adds support for lowering FNEG(FMA(X,Y,Z)) into a single suitably negated FMA instruction. Fix for PR24364 Differential Revision: http://reviews.llvm.org/D14906 llvm-svn: 254016	2015-11-24 20:31:46 +00:00
Matthias Braun	147110da84	LiveVariables should not clobber MachineOperand::IsDead, ::IsKill on reserved physical registers Patch by Nick Johnson <Nicholas.Paul.Johnson@deshawresearch.com> Differential Revision: http://reviews.llvm.org/D14875 llvm-svn: 254012	2015-11-24 20:06:56 +00:00
Teresa Johnson	130de7af7f	[ThinLTO] Enable iterative importing in FunctionImport pass Analyze imported function bodies and add any new external calls to the worklist for importing. Currently no controls on the importing so this will end up importing everything possible in the call tree below the importing module. Basic profitability checks coming next. Update test to check for iteratively inlined functions. llvm-svn: 254011	2015-11-24 19:55:04 +00:00
Cong Hou	db6220f84d	[X86] Fix several issues related to X86's psadbw instruction. This patch fixes the following issues: 1. Fix the return type of X86psadbw: it should not be the same type of inputs. For vNi8 inputs the output should be vMi64, where M = N/8. 2. Fix the return type of int_x86_avx512_psad_bw_512 accordingly. 3. Fix the definiton of PSADBW, VPSADBW, and VPSADBWY accordingly. 4. Adjust the return type when building a DAG node of X86ISD::PSADBW type. 5. Update related tests. Differential revision: http://reviews.llvm.org/D14897 llvm-svn: 254010	2015-11-24 19:51:26 +00:00
Teresa Johnson	b098f0c133	[ThinLTO] Handle previously imported and promoted locals in module linker The new function import pass exposed an issue when we import references to local values on multiple importing passes. They are renamed on each import pass, and we need to ensure that the already promoted and renamed references existing in the dest module are correctly identified and updated so that they aren't spuriously renamed again (due to a perceived conflict with the newly linked reference). llvm-svn: 254009	2015-11-24 19:46:58 +00:00
Weiming Zhao	45d4cb9a14	[Utils] Put includes in correct order. NFC. Summary: Followed the guidelines in: http://llvm.org/docs/CodingStandards.html#include-style However, I noticed that uppercase named headers come before lowercase ones throughout the codebase. So kept them as is. Patch by Mandeep Singh Grang <mgrang@codeaurora.org> Reviewers: majnemer, davide, jmolloy, atrick Subscribers: sanjoy Differential Revision: http://reviews.llvm.org/D14939 llvm-svn: 254005	2015-11-24 18:57:06 +00:00
Xinliang David Li	759dc628c0	[PGO] Small interface change to be profile rt ready Convert two C++ static member functions to be C APIs. This is one of the many steps to get ready to share VP writer code with profiler runtime. llvm-svn: 253999	2015-11-24 18:15:46 +00:00
Sanjay Patel	968e91aea0	[InstCombine] fix propagation of fast-math-flags Noticed while working on D4583: http://reviews.llvm.org/D4583 llvm-svn: 253997	2015-11-24 17:51:20 +00:00
Sanjay Patel	739f2ce93a	use convenience function for copying IR flags; NFCI llvm-svn: 253996	2015-11-24 17:16:33 +00:00
Xinliang David Li	1b85d4c961	Minor refactor to make VP writing more efficient llvm-svn: 253994	2015-11-24 17:03:24 +00:00
Krzysztof Parzyszek	b8bb90b744	Add vector types for intrinsics Author: Ron Lieberman <ronl@codeaurora.org> llvm-svn: 253992	2015-11-24 16:28:14 +00:00
Teresa Johnson	17626654fd	[ThinLTO] Fix FunctionImport alias checking and test Skip imports for weak_any aliases as well. Fix the test to check non-import of weak aliases and functions, and import of normal alias. llvm-svn: 253991	2015-11-24 16:10:43 +00:00
Sanjay Patel	a0d354541d	[x86] remove duplicate movq instruction defs (PR25554) We had duplicated definitions for the same hardware '[v]movq' instructions. For example with SSE: def MOVZQI2PQIrr : RS2I<0x6E, MRMSrcReg, (outs VR128:$dst), (ins GR64:$src), "mov{d\|q}\t{$src, $dst\|$dst, $src}", // X86-64 only [(set VR128:$dst, (v2i64 (X86vzmovl (v2i64 (scalar_to_vector GR64:$src)))))], IIC_SSE_MOVDQ>; def MOV64toPQIrr : RS2I<0x6E, MRMSrcReg, (outs VR128:$dst), (ins GR64:$src), "mov{d\|q}\t{$src, $dst\|$dst, $src}", [(set VR128:$dst, (v2i64 (scalar_to_vector GR64:$src)))], IIC_SSE_MOVDQ>, Sched<[WriteMove]>; As shown in the test case and PR25554: https://llvm.org/bugs/show_bug.cgi?id=25554 This causes us to miss reusing an operand because later passes don't know these 'movq' are the same instruction. This patch deletes one pair of these defs. Sadly, this won't fix the original test case in the bug report. Something else is still broken. Differential Revision: http://reviews.llvm.org/D14941 llvm-svn: 253988	2015-11-24 15:44:35 +00:00
Krzysztof Parzyszek	aa93575b7e	[Hexagon] Add missing include of <cctype> Lack thereof breaks Windows builds due to the use of std::isspace in HexagonInstrInfo.cpp. llvm-svn: 253987	2015-11-24 15:11:13 +00:00
Krzysztof Parzyszek	b9a1c3a32c	[Hexagon] Bring HexagonInstrInfo up to date llvm-svn: 253986	2015-11-24 14:55:26 +00:00
Krzysztof Parzyszek	d4b566d50b	Add new vector types for 512-, 1024- and 2048-bit vectors Those types are needed to implement instructions for Hexagon Vector Extensions (HVX): 16x32, 16x64, 32x16, 32x32, 32x64, 64x8, 64x16, 64x32, 128x8, 128x16, 256x8, 512x1, and 1024x1. llvm-svn: 253978	2015-11-24 13:07:35 +00:00
Matt Arsenault	ff05da806c	AMDGPU: Split LDS vector loads If properly aligned this could allow using ds_read_b64. llvm-svn: 253975	2015-11-24 12:18:54 +00:00
Matt Arsenault	4d801cd357	AMDGPU: Split x8 and x16 vector loads instead of scalarize The one regression in the builtin tests is in the read2 test which now (again) has many extra copies, but this should be solved once the pass is replaced with a DAG combine. llvm-svn: 253974	2015-11-24 12:05:03 +00:00
Ismail Donmez	65487e2d7e	Fix build after r253954 llvm-svn: 253969	2015-11-24 09:48:09 +00:00
Cong Hou	1938f2eb98	Let SelectionDAG start to use probability-based interface to add successors. The patch in http://reviews.llvm.org/D13745 is broken into four parts: 1. New interfaces without functional changes. 2. Use new interfaces in SelectionDAG, while in other passes treat probabilities as weights. 3. Use new interfaces in all other passes. 4. Remove old interfaces. This the second patch above. In this patch SelectionDAG starts to use probability-based interfaces in MBB to add successors but other MC passes are still using weight-based interfaces. Therefore, we need to maintain correct weight list in MBB even when probability-based interfaces are used. This is done by updating weight list in probability-based interfaces by treating the numerator of probabilities as weights. This change affects many test cases that check successor weight values. I will update those test cases once this patch looks good to you. Differential revision: http://reviews.llvm.org/D14361 llvm-svn: 253965	2015-11-24 08:51:23 +00:00
Mehdi Amini	42418aba58	Add a FunctionImporter helper to perform summary-based cross-module function importing Summary: This is a helper to perform cross-module import for ThinLTO. Right now it is importing naively every possible called functions. Reviewers: tejohnson Subscribers: dexonsmith, llvm-commits Differential Revision: http://reviews.llvm.org/D14914 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253954	2015-11-24 06:07:49 +00:00
Cong Hou	bed60d35ed	[X86][SSE] Detect AVG pattern during instruction combine for SSE2/AVX2/AVX512BW. This patch detects the AVG pattern in vectorized code, which is simply c = (a + b + 1) / 2, where a, b, and c have the same type which are vectors of either unsigned i8 or unsigned i16. In the IR, i8/i16 will be promoted to i32 before any arithmetic operations. The following IR shows such an example: %1 = zext <N x i8> %a to <N x i32> %2 = zext <N x i8> %b to <N x i32> %3 = add nuw nsw <N x i32> %1, <i32 1 x N> %4 = add nuw nsw <N x i32> %3, %2 %5 = lshr <N x i32> %N, <i32 1 x N> %6 = trunc <N x i32> %5 to <N x i8> and with this patch it will be converted to a X86ISD::AVG instruction. The pattern recognition is done when combining instructions just before type legalization during instruction selection. We do it here because after type legalization, it is much more difficult to do pattern recognition based on many instructions that are doing type conversions. Therefore, for target-specific instructions (like X86ISD::AVG), we need to take care of type legalization by ourselves. However, as X86ISD::AVG behaves similarly to ISD::ADD, I am wondering if there is a way to legalize operands and result types of X86ISD::AVG together with ISD::ADD. It seems that the current design doesn't support this idea. Tests are added for SSE2, AVX2, and AVX512BW and both i8 and i16 types of variant vector sizes. Differential revision: http://reviews.llvm.org/D14761 llvm-svn: 253952	2015-11-24 05:44:19 +00:00
Davide Italiano	c304a0ddc1	[DIE] Make DIE.h NDEBUG conditional-free. Switch dump()/print() method definitions to LLVM_DUMP_METHOD instead. llvm-svn: 253945	2015-11-24 02:21:43 +00:00
Sanjoy Das	5abfbb9246	[RuntimeDyld] Avoid unused-private-field warning; NFC Fixes the no asserts -Werror,-Wunused-private-field build. llvm-svn: 253933	2015-11-23 22:59:36 +00:00
Dan Gohman	192dddc595	[WebAssembly] Don't print the types of memory_size and grow_memory This matches the current spec, for now. llvm-svn: 253931	2015-11-23 22:37:29 +00:00
Xinliang David Li	c667683d2e	[PGO] In llvm-profdata text dump, add comment lines as annotations llvm-svn: 253930	2015-11-23 22:31:22 +00:00
Krzysztof Parzyszek	d5d083ccd4	Revert r253923. Per Eric's request. llvm-svn: 253928	2015-11-23 22:19:57 +00:00
Andy Ayers	9f7501896e	findDeadCallerSavedReg needs to pay attention to calling convention Caller saved regs differ between SysV and Win64. Use the tail call available set to scavenge from. Refactor register info to create new helper to get at tail call GPRs. Added a new test case for windows. Fixed up a number of X64 tests since now RCX is preferred over RDX on SysV. Differential Revision: http://reviews.llvm.org/D14878 llvm-svn: 253927	2015-11-23 22:17:44 +00:00
Dan Gohman	2f16f25391	[WebAssembly] Don't special-case call operand order. With the '=' suffix now indicating which operands are output operands, it's no longer as important to distinguish between a call's inputs and its outputs using operand ordering, so we can go back to printing them in the normal order. llvm-svn: 253925	2015-11-23 22:04:06 +00:00
Krzysztof Parzyszek	f358bfff17	Add new vector types for 512-, 1024- and 2048-bit vectors Those types are needed to implement instructions for Hexagon Vector Extensions (HVX): 16x32, 16x64, 32x16, 32x32, 32x64, 64x8, 64x16, 64x32, 128x8, 128x16, 256x8, 512x1, and 1024x1. llvm-svn: 253923	2015-11-23 22:00:17 +00:00
Dan Gohman	700515fa92	[WebAssembly] Suffix output operands with '='. This distinguishes input operands from output operands. This is something of a syntactic experiment to see whether the mild amount of clutter this adds is outweighed by the extra information it conveys to the reader. llvm-svn: 253922	2015-11-23 21:55:57 +00:00
Sanjoy Das	d5658b0896	[RuntimeDyld] Don't allocate unnecessary stub buffer space Summary: For relocation types that are known to not require stub functions, there is no need to allocate extra space for the stub functions. Reviewers: lhames, reames, maksfb Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14676 llvm-svn: 253920	2015-11-23 21:47:51 +00:00
Sanjoy Das	8082592ac9	[RuntimeDyld] Add bounds checking to SectionEntry::advanceStubOffset Summary: Change SectionEntry to keep track of the size of its underlying allocation, and use that to bounds check advanceStubOffset. Reviewers: lhames, andrew.w.kaylor, reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14675 llvm-svn: 253919	2015-11-23 21:47:46 +00:00
Sanjoy Das	277776a520	[RuntimeDyld] Add accessors to `SectionEntry`; NFC Summary: Remove naked access to the data members in `SectionEntry` and route accesses through accessor functions. This makes it obvious how the instances of the class are used, and will also facilitate adding bounds checking to `advanceStubOffset` in a later change. Reviewers: lhames, loladiro, andrew.w.kaylor Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14674 llvm-svn: 253918	2015-11-23 21:47:41 +00:00
Dan Gohman	7054ac1b8b	[WebAssembly] Model the return value of store instructions in wasm. llvm-svn: 253916	2015-11-23 21:16:35 +00:00
Chad Rosier	a15b4b6af2	[LIR] Put includes in correct order. NFC. llvm-svn: 253915	2015-11-23 21:09:13 +00:00
Xinliang David Li	6f7c19a494	[PGO] Add --text option for llvm-profdata show\|merge commands The new option is similar to the SampleProfile dump option. - dump raw/indexed format into text profile format - merge the profile and output into text profile format. Note that Value Profiling data text format is not yet designed. That functionality will be added later. Differential Revision: http://reviews.llvm.org/D14894 llvm-svn: 253913	2015-11-23 20:47:38 +00:00
Diego Novillo	243ea6a7d6	SamplePGO - Add coverage tracking for samples. The existing coverage tracker counts the number of records that were used from the input profile. An alternative view of coverage is to check how many available samples were applied. This way, if the profile contains several records with few samples, it doesn't really matter much that they were not applied. The more interesting records to apply are the ones that contribute many samples. llvm-svn: 253912	2015-11-23 20:12:21 +00:00
Andrew Kaylor	0615a0e65d	[WinEH] Fix a case where GVN could incorrectly PRE a load into an EH pad. Differential Revision: http://reviews.llvm.org/D14842 llvm-svn: 253908	2015-11-23 19:51:41 +00:00
Dan Gohman	aa0a4bd05b	[WebAssembly] Don't use set_local instructions explicitly. The current approach to using get_local and set_local is to use them implicitly, as register uses and defs. Introduce new copy instructions which are themselves no-ops except for the get_local and set_local that they imply, so that we use get_local and set_local consistently. llvm-svn: 253905	2015-11-23 19:30:43 +00:00
Teresa Johnson	6b92316811	[ThinLTO] Deduplicate function index loading into shared helper (NFC) Add a shared helper routine to read the function index from a file and create/return the function index object. Use it in llvm-link and llvm-lto. llvm-svn: 253903	2015-11-23 19:19:11 +00:00
Andrew Kaylor	d0430e8580	[WinEH] Fix problem where CodeGenPrepare incorrectly sinks a bitcast into an EH pad. Differential Revision: http://reviews.llvm.org/D14842 llvm-svn: 253902	2015-11-23 19:16:15 +00:00
Dan Gohman	f6857223c9	[WebAssembly] Always print loop end labels WebAssembly is currently using labels to end scopes, so for example a loop scope looks like this: BB0_0: loop BB0_1 ... BB0_1: with BB0_0 being the label of the first block not in the loop. This requires that the label be printed even when it's only reachable via fallthrough. To arrange this, insert a no-op LOOP_END instruction in such cases at the end of the loop. llvm-svn: 253901	2015-11-23 19:12:37 +00:00
Xinliang David Li	c7c1f8581a	[PGO] Introduce alignment macro for instr-prof control data(NFC) llvm-svn: 253893	2015-11-23 18:02:59 +00:00
Dan Gohman	e425c32224	[WebAssembly] Remove incomplete MCCodeEmitter bits. These are parts of a separate patch that I accidentally included in r253878. llvm-svn: 253892	2015-11-23 18:00:04 +00:00
Paul Robinson	af19bc3a9c	Add Windows error code and tidy formatting for system errors. Differential Revision: http://reviews.llvm.org/D14892 llvm-svn: 253888	2015-11-23 17:34:20 +00:00
Dan Gohman	53828fd777	[WebAssembly] Emit .param, .result, and .local through MC. This eliminates one of the main remaining uses of EmitRawText. llvm-svn: 253878	2015-11-23 16:50:18 +00:00
Diego Novillo	1ca881c4bb	SamplePGO - Clear coverage tracking when clearing per-function data. llvm-svn: 253877	2015-11-23 16:30:17 +00:00
Dan Gohman	3280793234	[WebAssembly] Use dominator information to improve BLOCK placement Always starting blocks at the top of their containing loops works, but creates unnecessarily deep nesting because it makes all blocks in a loop overlap. Refine the BLOCK placement algorithm to start blocks at nearest common dominating points instead, which significantly shrinks them and reduces overlapping. llvm-svn: 253876	2015-11-23 16:19:56 +00:00
Daniel Sanders	2b561336d9	[mips] .ent and .end should also set the type and size of the symbol respectively. Reviewers: vkalintiris Subscribers: llvm-commits, seanbruno, emaste, vkalintiris, dsanders Differential Revision: http://reviews.llvm.org/D14221 llvm-svn: 253875	2015-11-23 16:08:03 +00:00
Diego Novillo	39ab68f39b	SamplePGO - Use newly introduced local variable. NFC. llvm-svn: 253868	2015-11-23 15:24:13 +00:00
Krzysztof Parzyszek	29d23f9f4c	[Hexagon] Update instruction formats llvm-svn: 253867	2015-11-23 14:09:26 +00:00
Martell Malone	a6b867eb0d	ARM: address WoA division overflow crash Disable custom handling of signed 32-bit and 64-bit integer divide. Add test cases for both 32-bit and 64-bit integer overflow crashes. llvm-svn: 253865	2015-11-23 13:11:39 +00:00
Craig Topper	2241dfd2dc	[Mips] Remove an unnecessary wrapping of a predicate with std::ptr_fun. NFC llvm-svn: 253855	2015-11-23 07:19:06 +00:00
Davide Italiano	6f93df8105	[Analysis/CallGraph] Switch dump() definitions over to LLVM_DUMP_METHOD. llvm-svn: 253842	2015-11-23 02:58:42 +00:00
Davide Italiano	945d05f6a0	[LoopStrengthReduce] Mark dump() definitions as LLVM_DUMP_METHOD. llvm-svn: 253841	2015-11-23 02:47:30 +00:00
Mehdi Amini	8220e8a830	Add const qualifier for FunctionInfoIndex in ModuleLinker and linkInModule() (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253840	2015-11-23 01:59:16 +00:00
Sanjoy Das	0194743fad	[SCEV] Use C++11'isms llvm-svn: 253837	2015-11-22 21:20:13 +00:00
Benjamin Kramer	0969a2a74c	[MDBuilder] Simplify code using initializer lists. NFC. llvm-svn: 253826	2015-11-22 18:03:17 +00:00
Simon Pilgrim	1dfe53e180	Remove duplicate getValueType() calls. NFCI. llvm-svn: 253823	2015-11-22 16:49:38 +00:00
Krzysztof Parzyszek	6753f33388	Avoid dependency between TableGen and CodeGen Duplicate a few common definitions between DFAPacketizer.cpp and DFAPacketizerEmitter.cpp to avoid including files from CodeGen in TableGen. llvm-svn: 253820	2015-11-22 15:20:19 +00:00
Elena Demikhovsky	0fd11526e2	AVX-512: Optimized INSERT_SUBVECTOR for i1 vector types ISERT_SUBVECTOR for i1 vectors may be done with shifts, when we insert into the lower part, or into the upper part, on into all-zero vector. CONCAT_VECTORS uses ISERT_SUBVECTOR. Differential Revision: http://reviews.llvm.org/D14815 llvm-svn: 253819	2015-11-22 13:57:38 +00:00
Xinliang David Li	924e05843d	[PGO] move names of runtime sections definitions to InstrProfData.inc In profile runtime implementation for Darwin, Linux and FreeBSD, the names of sections holding profile control/counter/naming data need to be known by the runtime in order to locate the start/end of the data. Moving the name definitions to the common file to specify the connection. llvm-svn: 253814	2015-11-22 05:42:31 +00:00
Xinliang David Li	c76732396b	[PGO] Define value profiling updater API signature in InstrProfData.inc (NFC) llvm-svn: 253805	2015-11-22 00:22:07 +00:00
Rafael Espindola	d1beb07d39	Have a single way for creating unique value names. We had two code paths. One would create names like "foo.1" and the other names like "foo1". For globals it is important to use "foo.1" to help C++ name demangling. For locals there is no strong reason to go one way or the other so I kept the most common mangling (foo1). llvm-svn: 253804	2015-11-22 00:16:24 +00:00
Sanjay Patel	8066d906f1	fix formatting; NFC llvm-svn: 253802	2015-11-22 00:03:16 +00:00
Sanjoy Das	b37c4c414b	[SCEVExpander] Use C++isms; NFC llvm-svn: 253801	2015-11-21 23:20:10 +00:00
Teresa Johnson	6290dbc0f7	[ThinLTO] Handle bitcode without function summary sections gracefully Summary: Several fixes to the handling of bitcode files without function summary sections so that they are skipped during ThinLTO processing in llvm-lto and the gold plugin when appropriate instead of aborting. 1 Don't assert when trying to add a FunctionInfo that doesn't have a summary attached. 2 Skip FunctionInfo structures that don't have attached function summary sections when trying to create the combined function summary. 3 In both llvm-lto and gold-plugin, check whether a bitcode file has a function summary section before trying to parse the index, and skip the bitcode file if it does not. 4 Fix hasFunctionSummaryInMemBuffer in BitcodeReader, which had a bug where we returned to early while looking for the summary section. Also added llvm-lto and gold-plugin based tests for cases where we don't have function summaries in the bitcode file. I verified that either the first couple fixes described above are enough to avoid the crashes, or fixes 1,3,4. But have combined them all here for added robustness. Reviewers: joker.eph Subscribers: llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D14903 llvm-svn: 253796	2015-11-21 21:55:48 +00:00
Krzysztof Parzyszek	b46557292c	Hexagon V60/HVX DFA scheduler support Extended DFA tablegen to: - added "-debug-only dfa-emitter" support to llvm-tblgen - defined CVI_PIPE* resources for the V60 vector coprocessor - allow specification of multiple required resources - supports ANDs of ORs - e.g. [SLOT2, SLOT3], [CVI_MPY0, CVI_MPY1] means: (SLOT2 OR SLOT3) AND (CVI_MPY0 OR CVI_MPY1) - added support for combo resources - allows specifying ORs of ANDs - e.g. [CVI_XLSHF, CVI_MPY01] means: (CVI_XLANE AND CVI_SHIFT) OR (CVI_MPY0 AND CVI_MPY1) - increased DFA input size from 32-bit to 64-bit - allows for a maximum of 4 AND'ed terms of 16 resources - supported expressions now include: expression => term [AND term] [AND term] [AND term] term => resource [OR resource]* resource => one_resource \| combo_resource combo_resource => (one_resource [AND one_resource]*) Author: Dan Palermo <dpalermo@codeaurora.org> kparzysz: Verified AMDGPU codegen to be unchanged on all llc tests, except those dealing with instruction encodings. Reapply the previous patch, this time without circular dependencies. llvm-svn: 253793	2015-11-21 20:00:45 +00:00
Craig Topper	a5ea5289ff	Use modulo operator instead of multiplying result of a divide and subtracting from the original dividend. NFC. llvm-svn: 253792	2015-11-21 17:44:42 +00:00
Krzysztof Parzyszek	4ca21fc1aa	Revert r253790: it breaks all builds for some reason. llvm-svn: 253791	2015-11-21 17:38:33 +00:00
Krzysztof Parzyszek	220a9bc018	Hexagon V60/HVX DFA scheduler support Extended DFA tablegen to: - added "-debug-only dfa-emitter" support to llvm-tblgen - defined CVI_PIPE* resources for the V60 vector coprocessor - allow specification of multiple required resources - supports ANDs of ORs - e.g. [SLOT2, SLOT3], [CVI_MPY0, CVI_MPY1] means: (SLOT2 OR SLOT3) AND (CVI_MPY0 OR CVI_MPY1) - added support for combo resources - allows specifying ORs of ANDs - e.g. [CVI_XLSHF, CVI_MPY01] means: (CVI_XLANE AND CVI_SHIFT) OR (CVI_MPY0 AND CVI_MPY1) - increased DFA input size from 32-bit to 64-bit - allows for a maximum of 4 AND'ed terms of 16 resources - supported expressions now include: expression => term [AND term] [AND term] [AND term] term => resource [OR resource]* resource => one_resource \| combo_resource combo_resource => (one_resource [AND one_resource]*) Author: Dan Palermo <dpalermo@codeaurora.org> kparzysz: Verified AMDGPU codegen to be unchanged on all llc tests, except those dealing with instruction encodings. llvm-svn: 253790	2015-11-21 17:23:52 +00:00
Sanjay Patel	04df583a42	use ternary ops; NFC llvm-svn: 253787	2015-11-21 16:51:19 +00:00
Sanjay Patel	1f3fa2133a	remove unnecessary temp variables; NFC llvm-svn: 253786	2015-11-21 16:37:09 +00:00
Sanjay Patel	5a7bdc9632	fix typo; NFC llvm-svn: 253785	2015-11-21 16:16:29 +00:00
Jonas Paulsson	8f0d2b7f1f	[DAGCombiner] Bugfix for lost chain depenedency. When MergeConsecutiveStores() combines two loads and two stores into wider loads and stores, the chain users of both of the original loads must be transfered to the new load, because it may be that a chain user only depends on one of the loads. New test case: test/CodeGen/SystemZ/dag-combine-01.ll Reviewed by James Y Knight. Bugzilla: https://llvm.org/bugs/show_bug.cgi?id=25310#c6 llvm-svn: 253779	2015-11-21 13:25:07 +00:00
Simon Pilgrim	d5a154424b	[X86][AVX512] Added AVX512 VMOVLHPS/VMOVHLPS shuffle decode comments. llvm-svn: 253777	2015-11-21 13:04:42 +00:00
Simon Pilgrim	96cbce61b2	[X86][SSE] Legal XMM Register Class ordering for SSE1 It turns out we have a number of places that just grab the first type attached to a register class for various reasons. This is fine unless for some reason that type isn't legal on the current target, such as for SSE1 which doesn't support v16i8/v8i16/v4i32/v2i64 - all of which were included before 4f32 in the class. Given that this is such a rare situation I've just re-ordered the types and placed the float types first. Fix for PR16133 Differential Revision: http://reviews.llvm.org/D14787 llvm-svn: 253773	2015-11-21 12:38:34 +00:00
Weiming Zhao	8d5c08f591	[SimplifyLibCalls] Removed some TODOs which are already implemented. NFC. Summary: D14302 implements tan(atan(x)) -> x D14045 implements pow(exp(x), y) -> exp(x*y) Patch by Mandeep Singh Grang <mgrang@codeaurora.org> Reviewers: majnemer, davide Differential Revision: http://reviews.llvm.org/D14882 llvm-svn: 253768	2015-11-21 06:10:20 +00:00
Teresa Johnson	16e2a9eeb6	Move new assert to correct location This assert was meant to execute at the end of parseMetadata, but we return early and never reach the end of the function. Caught by a compile-time warning since the function doesn't return a value from that location. llvm-svn: 253762	2015-11-21 03:51:23 +00:00
Kostya Serebryany	b569368a5a	[libFuzzer] don't crash when reporting a leak in test_single_input mode llvm-svn: 253761	2015-11-21 03:46:43 +00:00
Matthias Braun	5a1857b6eb	ARMLoadStoreOptimizer: Cleanup isMemoryOp(); NFC llvm-svn: 253757	2015-11-21 02:09:49 +00:00
Vinicius Tinti	67cf33d9ab	Test commit llvm-svn: 253737	2015-11-20 23:20:12 +00:00
Rong Xu	a1f61fe841	Add some constantness to GetSuccessorNumber(). llvm-svn: 253733	2015-11-20 23:02:06 +00:00
Eric Christopher	25bf4a8617	Power8 and later support fusing addis/addi and addis/ld instruction pairs that use the same register to execute as a single instruction. No Functional Change Patch by Kyle Butt! llvm-svn: 253724	2015-11-20 22:38:20 +00:00
Owen Anderson	8e85130bb9	Fix another infinite loop in Reassociate caused by Constant::isZero(). Not all zero vectors are ConstantDataVector's. llvm-svn: 253723	2015-11-20 22:34:48 +00:00
Geoff Berry	5256fcada0	[CodeGenPrepare] Create more extloads and fewer ands Summary: Add and instructions immediately after loads that only have their low bits used, assuming that the (and (load x) c) will be matched as a extload and the ands/truncs fed by the extload will be removed by isel. Reviewers: mcrosier, qcolombet, ab Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14584 llvm-svn: 253722	2015-11-20 22:34:39 +00:00
Arnaud A. de Grandmaison	4e89e9f846	[ShrinkWrap] Teach ShrinkWrap to handle targets requiring a register scavenger. The included test only checks for a compiler crash for now. Several people are facing this issue, so we first resolve the crash, and will increase shrinkwrap's coverage later in a follow-up patch. llvm-svn: 253718	2015-11-20 21:54:27 +00:00
Diego Novillo	5fb49e5c5f	SamplePGO - Do not count never-executed inlined functions when computing coverage. If a function was originally inlined but not actually hot at runtime, its samples will not be counted inside the parent function. This throws off the coverage calculation because it expects to find more used records than it should. Fixed by ignoring functions that will not be inlined into the parent. Currently, this is inlined functions with 0 samples. In subsequent patches, I'll change this to mean "cold" functions. llvm-svn: 253716	2015-11-20 21:46:38 +00:00
Jun Bum Lim	80ec0d3f5a	[AArch64]Merge narrow zero stores to a wider store This change merges adjacent zero stores into a wider single store. For example : strh wzr, [x0] strh wzr, [x0, #2] becomes str wzr, [x0] This will fix PR25410. llvm-svn: 253711	2015-11-20 21:14:07 +00:00
Eric Christopher	c180836722	Weak non-function symbols were being accessed directly, which is incorrect, as the chosen representative of the weak symbol may not live with the code in question. Always indirect the access through the TOC instead. Patch by Kyle Butt! llvm-svn: 253708	2015-11-20 20:51:31 +00:00
Krzysztof Parzyszek	6c5ca95814	[Hexagon] Fix the return value from HexagonGenInsert::runOnMachineFunction llvm-svn: 253705	2015-11-20 20:46:23 +00:00
Reid Kleckner	437b1b3ea5	Fix the Windows build, include <tuple> for std::tie llvm-svn: 253698	2015-11-20 19:29:40 +00:00
Tilmann Scheller	925b193eed	Revert "[FunctionAttrs] Remove redundant assignment." This reverts r253661. Turns out that the assignment is not redundant (despite the Clang static analyzer claiming the opposite). The variable is being used by the lambda function AddUsersToWorklistIfCapturing(). llvm-svn: 253696	2015-11-20 19:17:10 +00:00
Nathan Slingerland	a731829788	[llvm-profdata] Add merge() to InstrProfRecord Summary: This change refactors two aspects of InstrProfRecord: 1) Add a merge() method to InstrProfRecord (previously InstrProfWriter combineInstrProfRecords()) in order to better encapsulate this functionality and to make the InstrProfRecord and SampleRecord APIs more consistent. 2) Make InstrProfRecord mergeValueProfData() a private method since it is only ever called internally by merge(). Reviewers: dnovillo, bogner, davidxl Subscribers: silvas, vsk, llvm-commits Differential Revision: http://reviews.llvm.org/D14786 llvm-svn: 253695	2015-11-20 19:12:43 +00:00
Artyom Skrobov	7f0fc9ccb7	Avoid duplicate entry for cortex-a7 in the TargetParser (NFC) Reviewers: t.p.northover, rengolin Subscribers: aemerson, rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D14757 llvm-svn: 253676	2015-11-20 16:46:14 +00:00
Artyom Skrobov	91f339ab3f	Handle ARMv6-J as an alias, instead of fake architecture Summary: This follows D14577 to treat ARMv6-J as an alias for ARMv6, instead of an architecture in its own right. The functional change is that the default CPU when targeting ARMv6-J changes from arm1136j-s to arm1136jf-s, which is currently used as the default CPU for ARMv6; both are, in fact, ARMv6-J CPUs. The J-bit (Jazelle support) is irrelevant to LLVM, and it doesn't affect code generation, attributes, optimizations, or anything else, apart from selecting the default CPU. Reviewers: rengolin, logan, compnerd Subscribers: aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D14755 llvm-svn: 253675	2015-11-20 16:46:09 +00:00
Diego Novillo	df544a098a	SamplePGO - Add line offset and discriminator information to sample reports. While debugging some sampling coverage problems, I found this useful: When applying samples from a profile, it helps to also know what line offset and discriminator the sample belongs to. This makes it easy to correlate against the input profile. llvm-svn: 253670	2015-11-20 15:39:42 +00:00
Teresa Johnson	d4d3dfd8ef	[ThinLTO] Add MODULE_CODE_METADATA_VALUES record Summary: This is split out from the ThinLTO metadata mapping patch http://reviews.llvm.org/D14752. To avoid needing to parse the module level metadata during function importing, a new module-level record is added which holds the number of module-level metadata values. This is required because metadata value ids are assigned implicitly during parsing, and the function-level metadata ids start after the module-level metadata ids. I made a change to this version of the code compared to D14752 in order to add more consistent and thorough assertion checking of the new record value. We now unconditionally use the record value to initialize the MDValueList size, and handle it the same in parseMetadata for all module level metadata cases (lazy loading or not). Reviewers: dexonsmith, joker.eph Subscribers: davidxl, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D14825 llvm-svn: 253668	2015-11-20 14:51:27 +00:00
Tilmann Scheller	4cd1d51a4d	[Hexagon] Remove redundant assignment. Identified by the Clang static analyzer. llvm-svn: 253664	2015-11-20 13:27:30 +00:00
Daniel Sanders	b700203c8b	Partially revert r253662: some unrelated work was accidentally committed with it. Sorry. llvm-svn: 253663	2015-11-20 13:16:35 +00:00
Daniel Sanders	be9db3c00a	Revert the revert 253497 and 253539 - These commits aren't the cause of the clang-cmake-mips failures. Sorry for the noise. llvm-svn: 253662	2015-11-20 13:13:53 +00:00
Tilmann Scheller	1e929f97f6	[FunctionAttrs] Remove redundant assignment. Identified by the Clang static analyzer. llvm-svn: 253661	2015-11-20 12:51:58 +00:00
Tilmann Scheller	bfd7ce01ea	[Hexagon] Remove redundant local variable. Identified by the Clang static analyzer. llvm-svn: 253660	2015-11-20 12:10:17 +00:00
Owen Anderson	630077ef55	Fix a pair of issues that caused an infinite loop in reassociate. Terrifyingly, one of them is a mishandling of floating point vectors in Constant::isZero(). How exactly this issue survived this long is beyond me. llvm-svn: 253655	2015-11-20 08:16:13 +00:00
Craig Topper	e325e3806f	Use range-based for loops. NFC llvm-svn: 253652	2015-11-20 07:18:48 +00:00
Hrvoje Varga	b65518c15c	[mips][microMIPS] Implement MUL[_S].PH, MULEQ_S.W.PHL, MULEQ_S.W.PHR, MULEU_S.PH.QBL, MULEU_S.PH.QBR, MULQ_RS.PH, MULQ_RS.W, MULQ_S.PH and MULQ_S.W instructions Differential Revision: http://reviews.llvm.org/D14280 llvm-svn: 253651	2015-11-20 07:14:52 +00:00
Dan Gohman	d9625276a7	[WebAssembly] Remove the AsmPrinter code for printing physical registers. WebAssembly does not have physical registers, so even if LLVM uses physical registers like SP, they'll need to be lowered to virtual registers before AsmPrinter time. llvm-svn: 253644	2015-11-20 03:13:31 +00:00
Dan Gohman	dfa81d8e22	[WebAssembly] Add a few open tasks to the target README.txt. llvm-svn: 253643	2015-11-20 03:08:27 +00:00
Dan Gohman	bb7ce8e408	[WebAssembly] Rename SWITCH to TABLESWITCH to match the current wording in the spec. llvm-svn: 253642	2015-11-20 03:02:49 +00:00
Dan Gohman	2dfc3b8be5	[WebAssembly] Remove done items from the README.txt. llvm-svn: 253640	2015-11-20 02:51:38 +00:00
Dan Gohman	7bafa0eaef	[WebAssembly] Add asserts that the expression stack is used in stack order. llvm-svn: 253638	2015-11-20 02:33:24 +00:00
Dan Gohman	b0992dafb3	[WebAssemby] Enforce FIFO ordering for instructions using stackified registers. llvm-svn: 253634	2015-11-20 02:19:12 +00:00
Peter Collingbourne	c85f4ced4d	ScalarEvolution: do not set nuw when creating exprs of form <expr> + <all-ones>. The nuw constraint will not be satisfied unless <expr> == 0. This bug has been around since r102234 (in 2010!), but was uncovered by r251052, which introduced more aggressive optimization of nuw scev expressions. Differential Revision: http://reviews.llvm.org/D14850 llvm-svn: 253627	2015-11-20 01:26:13 +00:00
Eric Christopher	eb027124af	Split the argument unscheduling loop in the WebAssembly register coloring pass. Turn the logic into "look for an insert point and then move things past the insert point". No functional change intended. llvm-svn: 253626	2015-11-20 00:34:54 +00:00
Tobias Edler von Koch	4d45090659	[LTO] Add option to emit assembly from LTOCodeGenerator This adds a new API, LTOCodeGenerator::setFileType, to choose the output file format for LTO CodeGen. A corresponding change to use this new API from llvm-lto and a test case is coming in a separate commit. Differential Revision: http://reviews.llvm.org/D14554 llvm-svn: 253622	2015-11-19 23:59:24 +00:00
Eric Christopher	8c3dbcab1d	Fix a [-Werror,-Wcovered-switch-default] warning by removing the unnecessary default case. llvm-svn: 253621	2015-11-19 23:45:42 +00:00
Reid Kleckner	cc2f6c35a3	[WinEH] Disable most forms of demotion Now that the register allocator knows about the barriers on funclet entry and exit, testing has shown that this is unnecessary. We still demote PHIs on unsplittable blocks due to the differences between the IR CFG and the Machine CFG. llvm-svn: 253619	2015-11-19 23:23:33 +00:00
Dan Gohman	3192ddfeba	[WebAssembly] Implement isCheapToSpeculateCtlz and isCheapToSpeculateCttz. This unbreaks test/CodeGen/WebAssembly/i32.ll and test/CodeGen/WebAssembly/i64.ll after r224899. llvm-svn: 253617	2015-11-19 23:04:59 +00:00
Diego Novillo	379cc5e71b	SamplePGO - Tweak debugging output for function samples. NFC. llvm-svn: 253612	2015-11-19 22:18:30 +00:00
Simon Pilgrim	a9912617c8	[X86][SSE4A] Fix issue with EXTRQI shuffles not starting at the correct start index. Found during stress testing. llvm-svn: 253611	2015-11-19 22:13:56 +00:00
Reid Kleckner	ebee6129cd	Fix UMRs in Mips disassembler on invalid instruction streams The Insn and Size local variables were used without initialization. llvm-svn: 253607	2015-11-19 21:51:55 +00:00
Simon Pilgrim	ae0140d6ec	[X86] Use existing MachineInstrBuilder::addDisp to create offseted pointer. NFC. Minor code duplication tidyup to D13988 llvm-svn: 253606	2015-11-19 21:50:57 +00:00
Davide Italiano	c807f487f7	Follow up to r253591. Turn into an assertion. Reported by: David Blaikie. llvm-svn: 253605	2015-11-19 21:50:08 +00:00
Chad Rosier	1cd3da15e8	[LIR] Update some comments. NFC. llvm-svn: 253603	2015-11-19 21:33:07 +00:00
Krzysztof Parzyszek	df537b97b1	Expand subregisters in MachineFrameInfo::getPristineRegs http://reviews.llvm.org/D14719 llvm-svn: 253600	2015-11-19 21:18:52 +00:00
Dehao Chen	014fb55711	Fix the debug build breakage that getDiscriminator is called by mistake. llvm-svn: 253597	2015-11-19 20:29:27 +00:00
Michael Zolotukhin	6c11c04db3	Revert r253253 and r253126: "Don't recompute LCSSA after loop-unrolling when possible." The change exposed a bug in IndVarSimplify (PR25578), which led to a failure (PR25538). When the bug is fixed, this patch can be reapplied. The tests are kept in tree, as they're useful anyway, and will not break with this revert. llvm-svn: 253596	2015-11-19 20:28:32 +00:00
Dehao Chen	23e2278e27	Reimplement discriminator assignment algorithm. Summary: The new algorithm is more efficient (O(n), n is number of basic blocks). And it is guaranteed to cover all cases of multiple BB mapped to same line. Reviewers: dblaikie, davidxl, dnovillo Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14738 llvm-svn: 253594	2015-11-19 19:53:05 +00:00
Davide Italiano	193c4edffb	[AddressSanitizer] assert(false) -> llvm_unreachable and remove return. llvm-svn: 253591	2015-11-19 19:28:23 +00:00
Jun Bum Lim	c12c2790e1	[AArch64] Refactoring aarch64-ldst-opt. NCF. Summary : * Rename isSmallTypeLdMerge() to isNarrowLoad(). * Rename NumSmallTypeMerged to NumNarrowTypePromoted. * Use Subtarget defined as a member variable. llvm-svn: 253587	2015-11-19 18:41:27 +00:00
Chad Rosier	3ecc8d8d83	[LIR] Fix 80-column from previous commit. llvm-svn: 253586	2015-11-19 18:25:11 +00:00
Chad Rosier	fddc01f393	[LIR] Sink checks into function to enable future refactoring. NFC. The purpose of this change is help delineate the memset and memcpy optimizations with the overall goal of resolving PR25520. llvm-svn: 253585	2015-11-19 18:22:21 +00:00
James Molloy	1d695a09dd	[GlobalOpt] Localize some globals that have non-instruction users We currently bail out of global localization if the global has non-instruction users. However, often these can be simple bitcasts or constant-GEPs, which we can easily turn into instructions before localizing. Be a bit more aggressive. llvm-svn: 253584	2015-11-19 18:04:33 +00:00
Sanjay Patel	2fe7728233	update comment and error message; NFC 'notail' was added in: http://reviews.llvm.org/rL252368 llvm-svn: 253580	2015-11-19 17:35:55 +00:00
Chad Rosier	85c21f0a6e	[LIR] Use the more appropriate method. NFC. llvm-svn: 253578	2015-11-19 17:27:28 +00:00
Jun Bum Lim	4c35ccac91	[AArch64]Extend merging narrow loads into a wider load This change extends r251438 to handle more narrow load promotions including byte type, unscaled, and signed. For example, this change will convert : ldursh w1, [x0, #-2] ldurh w2, [x0, #-4] into ldur w2, [x0, #-4] asr w1, w2, #16 and w2, w2, #0xffff llvm-svn: 253577	2015-11-19 17:21:41 +00:00
Sanjay Patel	4699b8ab6a	[CGP] despeculate expensive cttz/ctlz intrinsics This is another step towards allowing SimplifyCFG to speculate harder, but then have CGP clean things up if the target doesn't like it. Previous patches in this series: http://reviews.llvm.org/D12882 http://reviews.llvm.org/D13297 D13297 should catch most expensive ops, but speculation of cttz/ctlz requires special handling because of weirdness in the intrinsic definition for handling a zero input (that definition can probably be blamed on x86). For example, if we have the usual speculated-by-select expensive op pattern like this: %tobool = icmp eq i64 %A, 0 %0 = tail call i64 @llvm.cttz.i64(i64 %A, i1 true) ; is_zero_undef == true %cond = select i1 %tobool, i64 64, i64 %0 ret i64 %cond There's an instcombine that will turn it into: %0 = tail call i64 @llvm.cttz.i64(i64 %A, i1 false) ; is_zero_undef == false This CGP patch is looking for that case and despeculating it back into: entry: %tobool = icmp eq i64 %A, 0 br i1 %tobool, label %cond.end, label %cond.true cond.true: %0 = tail call i64 @llvm.cttz.i64(i64 %A, i1 true) ; is_zero_undef == true br label %cond.end cond.end: %cond = phi i64 [ %0, %cond.true ], [ 64, %entry ] ret i64 %cond This unfortunately may lead to poorer codegen (see the changes in the existing x86 test), but if we increase speculation in SimplifyCFG (the next step in this patch series), then we should avoid those kinds of cases in the first place. The need for this patch was originally mentioned here: http://reviews.llvm.org/D7506 with follow-up here: http://reviews.llvm.org/D7554 Differential Revision: http://reviews.llvm.org/D14630 llvm-svn: 253573	2015-11-19 16:37:10 +00:00
Hans Wennborg	dcc2500452	X86: More efficient legalization of wide integer compares In particular, this makes the code for 64-bit compares on 32-bit targets much more efficient. Example: define i32 @test_slt(i64 %a, i64 %b) { entry: %cmp = icmp slt i64 %a, %b br i1 %cmp, label %bb1, label %bb2 bb1: ret i32 1 bb2: ret i32 2 } Before this patch: test_slt: movl 4(%esp), %eax movl 8(%esp), %ecx cmpl 12(%esp), %eax setae %al cmpl 16(%esp), %ecx setge %cl je .LBB2_2 movb %cl, %al .LBB2_2: testb %al, %al jne .LBB2_4 movl $1, %eax retl .LBB2_4: movl $2, %eax retl After this patch: test_slt: movl 4(%esp), %eax movl 8(%esp), %ecx cmpl 12(%esp), %eax sbbl 16(%esp), %ecx jge .LBB1_2 movl $1, %eax retl .LBB1_2: movl $2, %eax retl Differential Revision: http://reviews.llvm.org/D14496 llvm-svn: 253572	2015-11-19 16:35:08 +00:00
NAKAMURA Takumi	768579c409	TargetParser.cpp: Fixup -- StringRef::startswith() is better here. NFC. llvm-svn: 253570	2015-11-19 15:42:52 +00:00
Diego Novillo	ef548d2918	SamplePGO - Sort samples by source location when emitting as text. When dumping function samples or writing them out as text format, it helps if the samples are emitted sorted by source location. The sorting of the maps is a bit slow, so we only do it on demand. llvm-svn: 253568	2015-11-19 15:33:08 +00:00
NAKAMURA Takumi	b6b254582f	llvm/lib/Support/TargetParser.cpp: Rework llvm::ARM::getArchExtFeature() to avoid abuse of Twine in r253470. llvm-svn: 253566	2015-11-19 15:03:11 +00:00
Chad Rosier	33efdf810f	[LV] Add a helper function, isReductionVariable. NFC. llvm-svn: 253565	2015-11-19 14:19:06 +00:00
Zoran Jovanovic	00f998b440	[mips] Expansion of ROL and ROR macros Author: obucina Reviewers: dsanders Subscribers: dsanders, llvm-commits Differential Revision: http://reviews.llvm.org/D10611 llvm-svn: 253564	2015-11-19 14:15:03 +00:00
Elena Demikhovsky	7c2c9fd243	AVX-512: Fixed COPY_TO_REGCLASS for mask registers Copying one mask register to another under BW should be done with kmovq instruction, otherwise we can loose some bits. Copying 8 bits under DQ may be done with kmovb. Differential Revision: http://reviews.llvm.org/D14812 llvm-svn: 253563	2015-11-19 13:13:00 +00:00
Simon Pilgrim	846b64e17a	[X86][AVX] Fix lowering of X86ISD::VZEXT_MOVL for 128-bit -> 256-bit extension The lowering patterns for X86ISD::VZEXT_MOVL for 128-bit to 256-bit vectors were just copying the lower xmm instead of actually masking off the first scalar using a blend. Fix for PR25320. Differential Revision: http://reviews.llvm.org/D14151 llvm-svn: 253561	2015-11-19 12:18:37 +00:00
Alexey Bataev	b7b82bf33e	Alternative to long nops for X86 CPUs, by Andrey Turetsky Make X86AsmBackend generate smarter nops instead of a bunch of 0x90 for code alignment for CPUs which don't support long nop instructions. Differential Revision: http://reviews.llvm.org/D14178 llvm-svn: 253557	2015-11-19 11:44:35 +00:00
James Molloy	0ecdbe7d6b	[FunctionAttrs] Provide a mechanism for adding function attributes from the command line This provides a way to force a function to have certain attributes from the command line. This can be useful when debugging or doing workload exploration, where manually editing IR is tedious or not possible (due to build systems etc). The syntax is -force-attribute=function_name:attribute_name All function attributes are parsed except alignstack as it requires an argument. llvm-svn: 253550	2015-11-19 08:49:57 +00:00
Igor Breger	1f78296869	AVX512: Implemented encoding, intrinsics and DAG lowering for VMOVDDUP instructions. Differential Revision: http://reviews.llvm.org/D14702 llvm-svn: 253548	2015-11-19 08:26:56 +00:00
Igor Breger	4424aaa28e	AVX512: Implemented encoding for the vmovss.s and vmovsd.s instructions. Differential Revision: http://reviews.llvm.org/D14771 llvm-svn: 253547	2015-11-19 07:58:33 +00:00
Igor Breger	81b79de54c	AVX512: Implemented encoding for the follow instructions. vmovapd.s, vmovaps.s, vmovdqa32.s, vmovdqa64.s, vmovdqu16.s, vmovdqu32.s, vmovdqu64.s, vmovdqu8.s, vmovupd.s, vmovups.s Differential Revision: http://reviews.llvm.org/D14768 llvm-svn: 253546	2015-11-19 07:43:43 +00:00
Elena Demikhovsky	1ca72e1846	Pointers in Masked Load, Store, Gather, Scatter intrinsics The masked intrinsics support all integer and floating point data types. I added the pointer type to this list. Added tests for CodeGen and for Loop Vectorizer. Updated the Language Reference. Differential Revision: http://reviews.llvm.org/D14150 llvm-svn: 253544	2015-11-19 07:17:16 +00:00
Pete Cooper	67cf9a723b	Revert "Change memcpy/memset/memmove to have dest and source alignments." This reverts commit r253511. This likely broke the bots in http://lab.llvm.org:8011/builders/clang-ppc64-elf-linux2/builds/20202 http://bb.pgr.jp/builders/clang-3stage-i686-linux/builds/3787 llvm-svn: 253543	2015-11-19 05:56:52 +00:00
Mehdi Amini	354f520fbc	Do not require a Context to extract the FunctionIndex from Bitcode (NFC) The LLVMContext was only used for Diagnostic. Pass a DiagnosticHandler instead. Differential Revision: http://reviews.llvm.org/D14794 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253540	2015-11-19 05:52:29 +00:00
Weiming Zhao	b69babd01e	Fix bug 25440: GVN assertion after coercing loads Optimizations like LoadPRE in GVN will insert new instructions. If the insertion point is in a already processed BB, they should get a value number explicitly. If the insertion point is after current instruction, then just leave it. However, current GVN framework has no support for it. In this patch, we just bail out if a VN can't be found. Dfferential Revision: http://reviews.llvm.org/D14670 A test/Transforms/GVN/pr25440.ll M lib/Transforms/Scalar/GVN.cpp llvm-svn: 253536	2015-11-19 02:45:18 +00:00
Quentin Colombet	46d5c71135	[X86] Enable shrink-wrapping by default. Differential Revision: http://reviews.llvm.org/D14156 rdar://problem/21118279 llvm-svn: 253528	2015-11-19 00:38:00 +00:00
Cong Hou	7b2ae9abba	Fix several long lines (>80) in LoopVectorize.cpp. NFC. llvm-svn: 253527	2015-11-19 00:32:30 +00:00
Davide Italiano	c5cedd195a	[SimplifyLibCalls] New trick: pow(x, 0.5) -> sqrt(x) under -ffast-math. Differential Revision: http://reviews.llvm.org/D14466 llvm-svn: 253521	2015-11-18 23:21:32 +00:00
Quentin Colombet	f6645cce91	[AArch64] Enable shrink-wrapping by default. Differential Revision: http://reviews.llvm.org/D14360 rdar://problem/20820748 llvm-svn: 253520	2015-11-18 23:12:20 +00:00
Mehdi Amini	adb4057a15	Fix returned value for GVN: could return "false" even after modifying the IR This bug would manifest in some very specific cases where all the following conditions are fullfilled: - GVN didn't remove block - The regular GVN iteration didn't change the IR - PRE is enabled - PRE will not split critical edge - The last instruction processed by PRE didn't change the IR Because the CallGraph PassManager relies on this returned value to decide if it needs to recompute a node after the execution of Function passes, not returning the right value can lead to unexpected results. Fix for: https://llvm.org/bugs/show_bug.cgi?id=24715 Patch by Wenxiang Qiu <vincentqiuuu@gmail.com> From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253518	2015-11-18 22:49:49 +00:00
Xinliang David Li	cfb1456572	Minor cleanups (from review feedback) 1. remove uneeded header inclusion 2. use reinterpret_cast instead of c ctyle 3. other format change llvm-svn: 253515	2015-11-18 22:42:27 +00:00
Davide Italiano	455ea11d13	[BuildLibCalls] EmitStrNLen() is dead code. Garbage collect. llvm-svn: 253514	2015-11-18 22:29:38 +00:00
Pete Cooper	72bc23ef02	Change memcpy/memset/memmove to have dest and source alignments. Note, this was reviewed (and more details are in) http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20151109/312083.html These intrinsics currently have an explicit alignment argument which is required to be a constant integer. It represents the alignment of the source and dest, and so must be the minimum of those. This change allows source and dest to each have their own alignments by using the alignment attribute on their arguments. The alignment argument itself is removed. There are a few places in the code for which the code needs to be checked by an expert as to whether using only src/dest alignment is safe. For those places, they currently take the minimum of src/dest alignments which matches the current behaviour. For example, code which used to read: call void @llvm.memcpy.p0i8.p0i8.i32(i8* %dest, i8* %src, i32 500, i32 8, i1 false) will now read: call void @llvm.memcpy.p0i8.p0i8.i32(i8* align 8 %dest, i8* align 8 %src, i32 500, i1 false) For out of tree owners, I was able to strip alignment from calls using sed by replacing: (call.llvm\.memset.)i32\ [0-9]\,\ i1 false\) with: $1i1 false) and similarly for memmove and memcpy. I then added back in alignment to test cases which needed it. A similar commit will be made to clang which actually has many differences in alignment as now IRBuilder can generate different source/dest alignments on calls. In IRBuilder itself, a new argument was added. Instead of calling: CreateMemCpy(Dst, Src, getInt64(Size), DstAlign, / isVolatile / false) you now call CreateMemCpy(Dst, Src, getInt64(Size), DstAlign, SrcAlign, / isVolatile */ false) There is a temporary class (IntegerAlignment) which takes the source alignment and rejects implicit conversion from bool. This is to prevent isVolatile here from passing its default parameter to the source alignment. Note, changes in future can now be made to codegen. I didn't change anything here, but this change should enable better memcpy code sequences. Reviewed by Hal Finkel. llvm-svn: 253511	2015-11-18 22:17:24 +00:00
Simon Pilgrim	c1a46b729b	[DAGCombiner] Vector constant folding for comparisons This patch adds support for vector constant folding of integer/float comparisons. This requires FoldConstantVectorArithmetic to support scalar constant operands (in this case ISD::CONDCASE). In future we should be able to support other scalar constant types as necessary (and possibly start calling FoldConstantVectorArithmetic for all node creations) Differential Revision: http://reviews.llvm.org/D14683 llvm-svn: 253504	2015-11-18 21:17:19 +00:00
Tim Northover	747ae9a7de	ARM: make sure backend is consistent about exception handling method. It turns out we decide whether to use SjLj exceptions or some alternative in two separate places in the backend, and they disagreed with each other. This led to inconsistent code and is generally a terrible idea. So make them consistent and add an assert that they do match (unfortunately MCAsmInfo isn't available in opt, so it can't be used to initialise the CodeGen version directly). llvm-svn: 253502	2015-11-18 21:10:39 +00:00
Mike Aizatsky	c7810baaa6	Disable gvn non-local speculative loads under asan. Summary: Fix for https://llvm.org/bugs/show_bug.cgi?id=25550 Differential Revision: http://reviews.llvm.org/D14763 llvm-svn: 253498	2015-11-18 20:43:00 +00:00
Betul Buyukkurt	6fac1741c9	[PGO] Value profiling support This change introduces an instrumentation intrinsic instruction for value profiling purposes, the lowering of the instrumentation intrinsic and raw reader updates. The raw profile data files for llvm-profdata testing are updated. llvm-svn: 253484	2015-11-18 18:14:55 +00:00
Matthew Simpson	343af07aa9	[Aarch64] Add cost for missing extensions. This patch adds a cost estimate for some missing sign and zero extensions. The costs were determined by counting the number of shift instructions generated without context for each new extension. Differential Revision: http://reviews.llvm.org/D14730 llvm-svn: 253482	2015-11-18 18:03:06 +00:00
Dan Gohman	94ef41ff1d	[WebAssembly] Add more whitespace characters to prettify the assembly output. llvm-svn: 253472	2015-11-18 17:05:35 +00:00
Bradley Smith	7b0a7d8d1e	[ARM] Add +feature names to TargetParser extensions table llvm-svn: 253470	2015-11-18 16:32:12 +00:00
Dan Gohman	1f29c68042	[WebAssembly] Add some spaces to the assembly output to vertically align operands. llvm-svn: 253468	2015-11-18 16:25:38 +00:00
Dan Gohman	4ba4816b97	[WebAssembly] Enable register coloring and register stackifying. This also takes the push/pop syntax another step forward, introducing stack slot numbers to make it easier to see how expressions are connected. For example, the value pushed in $push7 is popped in $pop7. And, this begins an experiment with making get_local and set_local implicit when an operation directly uses or defines a register. This greatly reduces clutter. If this experiment succeeds, it may make sense to do this for const instructions as well. And, this introduces more special code for ARGUMENTS; hopefully this code will soon be obviated by proper support for live-in virtual registers. llvm-svn: 253465	2015-11-18 16:12:01 +00:00
Manuel Klimek	272d3f17fc	Fix bug where WinCOFFObjectWriter would assume starting from an empty output. Starting on an input stream that is not at offset 0 would trigger the assert in WinCOFFObjectWriter.cpp:1065: assert(getStream().tell() <= (*i)->Header.PointerToRawData && "Section::PointerToRawData is insane!"); llvm-svn: 253464	2015-11-18 15:24:17 +00:00
Jonas Paulsson	af722f8287	[SelectionDAGBuilder] Make sure DemoteReg ends up in right reg-class. The virtual register containing the address for returned value on stack should in the DAG be represented with a CopyFromReg node and not a Register node. Otherwise, InstrEmitter will not make sure that it ends up in the right register class for the target instruction. SystemZ needs this, becuause the reg class for address registers is a subset of the general 64 bit register class. test/SystemZ/CodeGen/args-07.ll and args-04.ll updated to run with -verify-machineinstrs. Reviewed by Hal Finkel. llvm-svn: 253461	2015-11-18 14:59:00 +00:00
Igor Laevsky	7310c68e85	Revert "Revert "Strip metadata when speculatively hoisting instructions (r252604)" Failing clang test is now fixed by the r253458. llvm-svn: 253459	2015-11-18 14:50:18 +00:00
James Molloy	9ad4f22538	[LTO] Add an early run of functionattrs Because we internalize early, we can potentially mark a bunch of functions as norecurse. Do this before globalopt. llvm-svn: 253451	2015-11-18 11:24:42 +00:00
Asaf Badouh	0d957b8b09	[X86][AVX512CD] add mask broadcast intrinsics Differential Revision: http://reviews.llvm.org/D14573 llvm-svn: 253450	2015-11-18 09:42:45 +00:00
Igor Breger	5574730454	AVX512: Implemented encoding for vpextrw.s instruction. Differential Revision: http://reviews.llvm.org/D14766 llvm-svn: 253447	2015-11-18 08:46:16 +00:00
Sanjoy Das	f79d3449c5	[OperandBundles] Tighten OperandBundleDef's interface; NFC llvm-svn: 253446	2015-11-18 08:30:07 +00:00
Hrvoje Varga	78409019d9	[mips][microMIPS] Implement DPS.W.PH, DPSQ_S.W.PH, DPSQ_SA.L.W, DPSQX_S.W.PH, DPSQX_SA.W.PH, DPSU.H.QBL, DPSU.H.QBR and DPSX.W.PH instructions Differential Revision: http://reviews.llvm.org/D14058 llvm-svn: 253443	2015-11-18 07:41:35 +00:00
Craig Topper	66059c9f4d	Replace dyn_cast with isa in places that weren't using the returned value for more than a boolean check. NFC. llvm-svn: 253441	2015-11-18 07:07:59 +00:00
Rafael Espindola	55512f9b25	Default SetVector to use a DenseSet. We use to have an odd difference among MapVector and SetVector. The map used a DenseMop, but the set used a SmallSet, which in turn uses a std::set. I have changed SetVector to use a DenseSet. If you were depending on the old behaviour you can pass an explicit set type or use SmallSetVector. The common cases for needing to do it are: * Optimizing for small sets. * Sets for types not supported by DenseSet. llvm-svn: 253439	2015-11-18 06:52:18 +00:00
Sanjoy Das	2d16145acf	Teach the inliner to track deoptimization state Summary: This change teaches LLVM's inliner to track and suitably adjust deoptimization state (tracked via deoptimization operand bundles) as it inlines through call sites. The operation is described in more detail in the LangRef changes. Reviewers: reames, majnemer, chandlerc, dexonsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14552 llvm-svn: 253438	2015-11-18 06:23:38 +00:00
Rafael Espindola	449711cb36	Stop producing .data.rel sections. If a section is rw, it is irrelevant if the dynamic linker will write to it or not. It looks like llvm implemented this because gcc was doing it. It looks like gcc implemented this in the hope that it would put all the relocated items close together and speed up the dynamic linker. There are two problem with this: * It doesn't work. Both bfd and gold will map .data.rel to .data and concatenate the input sections in the order they are seen. * If we want a feature like that, it can be implemented directly in the linker since it knowns where the dynamic relocations are. llvm-svn: 253436	2015-11-18 06:02:15 +00:00
Cong Hou	136bc65ec8	Remove a redundant assertion in MachineBasicBlock.cpp. NFC. llvm-svn: 253426	2015-11-18 01:55:56 +00:00
Cong Hou	11c1420173	Remove redundant code in MachineBasicBlock.cpp. NFC. llvm-svn: 253425	2015-11-18 01:45:10 +00:00
Kostya Serebryany	4d62322213	[libFuzzer] remove default initializer as a workaround for https://gcc.gnu.org/bugzilla/show_bug.cgi?id=68399 . Don't need it anyway. llvm-svn: 253419	2015-11-18 01:08:30 +00:00
Cong Hou	41cf1a5dfb	Improving edge probabilities computation when choosing the best successor in machine block placement. When looking for the best successor from the outer loop for a block belonging to an inner loop, the edge probability computation can be improved so that edges in the inner loop are ignored. For example, suppose we are building chains for the non-loop part of the following code, and looking for B1's best successor. Assume the true body is very hot, then B3 should be the best candidate. However, because of the existence of the back edge from B1 to B0, the probability from B1 to B3 can be very small, preventing B3 to be its successor. In this patch, when computing the probability of the edge from B1 to B3, the weight on the back edge B1->B0 is ignored, so that B1->B3 will have 100% probability. if (...) do { B0; ... // some branches B1; } while(...); else B2; B3; Differential revision: http://reviews.llvm.org/D10825 llvm-svn: 253414	2015-11-18 00:52:52 +00:00
Quentin Colombet	8cb95b8e51	[ARM] Enable shrink-wrapping by default. Differential Revision: http://reviews.llvm.org/D14357 rdar://problem/21942589 llvm-svn: 253411	2015-11-18 00:40:54 +00:00
David Blaikie	6196aa06c9	Generalize ownership/passing semantics to allow dsymutil to own abbreviations via unique_ptr While still allowing CodeGen/AsmPrinter in llvm to own them using a bump ptr allocator. (might be nice to replace the pointers there with something that at least automatically calls their dtors, if that's necessary/useful, rather than having it done explicitly (I think a typed BumpPtrAllocator already does this, or maybe a unique_ptr with a custom deleter, etc)) llvm-svn: 253409	2015-11-18 00:34:10 +00:00
Sanjay Patel	77f4486950	[InstCombine] refactor optimizeIntToFloatBitCast() ; NFCI The logic for handling the pattern without a shift is identical to the logic for handling the pattern with a shift if you set the shift amount to zero for the former. This should make it easier to see that we probably don't even need optimizeIntToFloatBitCast(). If we call something like foldVecTruncToExtElt() from visitTrunc(), we'll solve PR25543: https://llvm.org/bugs/show_bug.cgi?id=25543 llvm-svn: 253403	2015-11-18 00:00:04 +00:00
Simon Pilgrim	2da4178737	[X86][AVX512] Added AVX512 SHUFP/VPERMILP shuffle decode comments. llvm-svn: 253396	2015-11-17 23:29:49 +00:00
Xinliang David Li	99556877ae	[PGO] Move value profile data definitions out of IndexedInstrProf Move the data structure defintions out of the namespace. The defs will be shared by raw format. [NFC] llvm-svn: 253394	2015-11-17 23:00:40 +00:00
David Blaikie	4689ef5943	Fix null dereference committed in r253277 llvm-svn: 253393	2015-11-17 22:39:26 +00:00
David Blaikie	35c2eebfe4	dwarfdump: support indexed string dumping in dwp based on the STR_OFFSETS component of the index llvm-svn: 253392	2015-11-17 22:39:23 +00:00
Simon Pilgrim	8483df6e24	[X86][AVX512] Added support for AVX512 UNPCK shuffle decode comments. llvm-svn: 253391	2015-11-17 22:35:45 +00:00
Nathan Slingerland	e6e30d5e88	[llvm-profdata] Improve error messaging when merging mismatched profile data Summary: This change tries to make the root cause of instrumented profile data merge failures clearer. Previous: $ llvm-profdata merge test_0.profraw test_1.profraw -o test_merged.profdata test_1.profraw: foo: Function count mismatch test_1.profraw: bar: Function count mismatch test_1.profraw: baz: Function count mismatch ... Changed: $ llvm-profdata merge test_0.profraw test_1.profraw -o test_merged.profdata test_1.profraw: foo: Function basic block count change detected (counter mismatch) Make sure that all profile data to be merged is generated from the same binary. test_1.profraw: bar: Function basic block count change detected (counter mismatch) test_1.profraw: baz: Function basic block count change detected (counter mismatch) ... Reviewers: dnovillo, davidxl, bogner Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14739 llvm-svn: 253384	2015-11-17 22:08:53 +00:00
Reid Kleckner	c20276d0b2	[WinEH] Move WinEHFuncInfo from MachineModuleInfo to MachineFunction Summary: Now that there is a one-to-one mapping from MachineFunction to WinEHFuncInfo, we don't need to use a DenseMap to select the right WinEHFuncInfo for the current funclet. The main challenge here is that X86WinEHStatePass is an IR pass that doesn't have access to the MachineFunction. I gave it its own WinEHFuncInfo object that it uses to calculate state numbers, which it then throws away. As long as nobody creates or removes EH pads between this pass and SDAG construction, we will get the same state numbers. The other thing X86WinEHStatePass does is to mark the EH registration node. Instead of communicating which alloca was the registration through WinEHFuncInfo, I added the llvm.x86.seh.ehregnode intrinsic. This intrinsic generates no code and simply marks the alloca in use. Reviewers: JCTremoulet Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14668 llvm-svn: 253378	2015-11-17 21:10:25 +00:00
David Blaikie	c4e2bed738	dwarfdump: Reference the appropriate line table segment when dumping dwp files Also improves .dwo type unit dumping which didn't handle this either. llvm-svn: 253377	2015-11-17 21:08:05 +00:00
Andrew Kaylor	de642cef2c	[EH] Keep filter clauses for types that have been caught. The instruction combiner previously removed types from filter clauses in Landing Pad instructions if the type had previously been seen in a catch clause. This is incorrect and prevents unexpected exception handlers from rethrowing the caught type. Differential Revision: http://reviews.llvm.org/D14669 llvm-svn: 253370	2015-11-17 20:13:04 +00:00
Ulrich Weigand	36b8626b00	[RuntimeDyld] Fix resolving R_PPC64_REL24 relocations When resolving R_PPC64_REL24, code used to check for an address delta that fits in 24 bits, while the instructions that take this relocation actually can process address deltas that fit into 26 bits (as those instructions have a 24 bit field, but implicitly append two zero bits at the end since all instruction addresses are a multiple of 4). This means that code would signal overflow once a single object's text section exceeds 8 MB, while we can actually support up to 32 MB. Partially fixes PR25540. llvm-svn: 253369	2015-11-17 20:08:31 +00:00
Yunzhong Gao	8e348cc732	Switch lto codegen to using diagnostic handlers. This patch removes the std::string& argument from a number of C++ LTO API calls and instead makes them use the installed diagnostic handler. This would also improve consistency of diagnostic handling infrastructure: if an LTO client used lto_codegen_set_diagnostic_handler() to install a custom error handler, we do not want some error messages to go through the custom error handler, and some other error messages to go into sLastErrorString. llvm-svn: 253367	2015-11-17 19:48:12 +00:00
George Burgess IV	2ae15e0609	Specify explicit storage type for AllocType. NFC. llvm-svn: 253366	2015-11-17 19:48:06 +00:00
Elena Demikhovsky	3ec9e15ad4	Vector of pointers in function attributes calculation While setting function attributes we check all instructions that may access memory. For a call instruction we check all arguments. The special check is required for pointers. I added vector-of-pointers to the call arguments types that should be checked. Differential Revision: http://reviews.llvm.org/D14693 llvm-svn: 253363	2015-11-17 19:30:51 +00:00
Diego Novillo	ba920be4a2	SamplePGO - Move debug/dump function bodies out of header files. NFC. No point polluting the header declarations with debugging code. llvm-svn: 253361	2015-11-17 19:04:46 +00:00
David Blaikie	ff43d69ddf	StringRef-ify some Option APIs Patch by Eugene Kosov! Differential Revision: http://reviews.llvm.org/D14711 llvm-svn: 253360	2015-11-17 19:00:52 +00:00
Sanjay Patel	1de794aa3a	fix typos; NFC llvm-svn: 253359	2015-11-17 18:46:56 +00:00
Sanjay Patel	f09d1bfced	use local variables; NFCI llvm-svn: 253356	2015-11-17 18:37:23 +00:00
Charlie Turner	7968b981bf	[ARM] Don't pessimize i32 vselect. The underlying issues surrounding codegen for 32-bit vselects have been resolved. The pessimistic costs for 64-bit vselects remain due to the bad scalarization that is still happening there. I tested this on A57 in T32, A32 and A64 modes. I saw no regressions, and some improvements. From my benchmarks, I saw these improvements in A57 (T32) spec.cpu2000.ref.177_mesa 5.95% lnt.SingleSource/Benchmarks/Shootout/strcat 12.93% lnt.MultiSource/Benchmarks/MiBench/telecomm-CRC32/telecomm-CRC32 11.89% I also measured A57 A32, A53 T32 and A9 T32 and found no performance regressions. I see much bigger wins in third-party benchmarks with this change Differential Revision: http://reviews.llvm.org/D14743 llvm-svn: 253349	2015-11-17 17:25:15 +00:00
Sanjay Patel	431e1143ec	function names start with a lower case letter; NFC llvm-svn: 253348	2015-11-17 17:24:08 +00:00
Pawel Bylica	a90e745109	[Support] Tweak path::system_temp_directory() on Windows. Summary: This patch changes the behavior of path::system_temp_directory() on Windows to be closer to GetTempPath Windows API call. Enforces path separator to be the native one, makes path absolute, etc. GetTempPath is not used directly because of limitations/implementation bugs on Windows 7. Windows specific unit tests are added. Most of them runs in separated process with modified environment variables. This change fixes FileSystemTest.CreateDir unittest that had been failing when run from Unix-like shell on Windows (Unix-like path separator (/) used in env variables). Reviewers: chapuni, rafael, aaron.ballman Subscribers: rafael, llvm-commits Differential Revision: http://reviews.llvm.org/D14231 llvm-svn: 253345	2015-11-17 16:54:32 +00:00
Ahmed Bougacha	88ddeae8bd	[AArch64] Promote f16 SELECT_CC CC operands when op is legal. SELECT_CC has the nasty property of having operands with unrelated types. So if you do something like: f32 = select_cc f16, f16, f32, f32, cc You'd only look for the action for <select_cc, f32>, but never f16. If the types are all legal, but the op isn't (as for f16 on AArch64, or for f128 on x86_64/AArch64?), then you get into trouble. For f128, we have softenSetCCOperands to handle this case. Similarly, for f16, we can directly promote the CC operands. llvm-svn: 253344	2015-11-17 16:45:40 +00:00
Davide Italiano	7f9f835cfb	[JIT/Memory] Fix up semantic of setExecutable(). setExecutable() should do everything that's needed to make the memory executable on host, i.e. unconditionally set permissions + invalidate instruction cache. llvm-rtdyld will be updated in my next commit. Discusseed with: Lang Hames (as part of D13631). llvm-svn: 253341	2015-11-17 16:34:28 +00:00
Pat Gavlin	c8ea157811	Lower statepoints with multi-def targets. Statepoint lowering currently expects that the target method of a statepoint only defines a single value. This precludes using statepoints with ABIs that return values in multiple registers (e.g. the SysV AMD64 ABI). This change adds support for lowering statepoints with mutli-def targets. llvm-svn: 253339	2015-11-17 16:04:21 +00:00
Dan Gohman	7aa4abac24	Use TargetRegisterInfo for printing MachineOperand register comments Several places in AsmPrinter.cpp print comments describing MachineOperand registers using MCRegisterInfo, which uses MCOperand-oriented names. This doesn't work for targets that use virtual registers exclusively, as WebAssembly does, since virtual registers are represented and printed differently. This patch preserves what seems to be the spirit of r229978, avoiding the use of TM.getSubtargetImpl(), while still using MachineOperand-oriented printing for MachineOperands. Differential Revision: http://reviews.llvm.org/D14709 llvm-svn: 253338	2015-11-17 16:01:28 +00:00
Chad Rosier	6066dc69f1	Typo. llvm-svn: 253336	2015-11-17 13:58:10 +00:00
Bradley Smith	982a8888b8	[ARM] Default to ARMv4t in favour of adding Other to ARMArch llvm-svn: 253335	2015-11-17 13:38:29 +00:00
Charlie Turner	b4613c6973	[ARM] Match VABDL from log2 shuffles. Differential Revision: http://reviews.llvm.org/D14664 llvm-svn: 253334	2015-11-17 13:21:35 +00:00
Zlatko Buljan	72a7f9c1f5	[mips][microMIPS] Implement EXTP, EXTPDP, EXTPDPV, EXTPV, EXTR[_RS].W, EXTR_S.H, EXTRV[_RS].W and EXTRV_S.H instructions Differential Revision: http://reviews.llvm.org/D14174 llvm-svn: 253332	2015-11-17 12:54:15 +00:00
Bradley Smith	4320205484	[ARM] Properly initialize ARMArch in the ARM subtarget llvm-svn: 253331	2015-11-17 11:57:33 +00:00
Zlatko Buljan	246b21f66a	[mips][microMIPS] Implement SUBQ[_S].PH, SUBQ_S.W, SUBQH[_R].PH, SUBQH[_R].W, SUBU[_S].PH, SUBU[_S].QB and SUBUH[_R].QB instructions Differential Revision: http://reviews.llvm.org/D14114 llvm-svn: 253329	2015-11-17 10:11:22 +00:00
Oliver Stannard	9be59af3ab	[Assembler] Make fatal assembler errors non-fatal Currently, if the assembler encounters an error after parsing (such as an out-of-range fixup), it reports this as a fatal error, and so stops after the first error. However, for most of these there is an obvious way to recover after emitting the error, such as emitting the fixup with a value of zero. This means that we can report on all of the errors in a file, not just the first one. MCContext::reportError records the fact that an error was encountered, so we won't actually emit an object file with the incorrect contents. Differential Revision: http://reviews.llvm.org/D14717 llvm-svn: 253328	2015-11-17 10:00:43 +00:00
Oliver Stannard	07b43d39a8	[Assembler] Allow non-fatal errors after parsing This adds reportError to MCContext, which can be used as an alternative to reportFatalError when the assembler wants to try to continue processing the rest of the file after the error is reported, so that all of the errors ina file can be reported. It records the fact that an error was encountered, so we can avoid emitting an object file if any errors occurred. This patch doesn't add any uses of this function (a later patch will convert most uses of reportFatalError to use it), but there is a small functional change: we use the SourceManager to print the error message, even if we have a null SMLoc. This means that we get a SourceManager-style message, with the file and line information shown as <unknown>, rather than the "LLVM ERROR" style used by report_fatal_error. llvm-svn: 253327	2015-11-17 09:58:07 +00:00
Zlatko Buljan	3e0588d033	[mips][microMIPS] Implement PRECEQ.W.PHL, PRECEQ.W.PHR, PRECEQU.PH.QBL, PRECEQU.PH.QBLA, PRECEQU.PH.QBR, PRECEQU.PH.QBRA, PRECEU.PH.QBL, PRECEU.PH.QBLA, PRECEU.PH.QBR and PRECEU.PH.QBRA instructions Differential Revision: http://reviews.llvm.org/D14279 llvm-svn: 253326	2015-11-17 09:43:29 +00:00
Jay Foad	b64f0a5a1a	Fix typos in comments. llvm-svn: 253324	2015-11-17 08:54:53 +00:00
David Majnemer	6727c015dc	[AliasAnalysis] CatchPad and CatchRet can modify escaped memory CatchPad and CatchRet behave a lot like function calls: they can potentially modify any memory which has been escaped. llvm-svn: 253323	2015-11-17 08:15:14 +00:00
David Majnemer	0345b0fa9e	Fix a typo in BasicAliasAnalysis llvm-svn: 253322	2015-11-17 08:15:08 +00:00
Xinliang David Li	b8c3ad1d05	Fix unaligned memory read issue exposed by ubsan Indexed profile data as designed today does not guarantee counter data to be well aligned, so reading needs to use the slower form (with memcpy). This is less than ideal and should be improved in the future (i.e., with fixed length function key instead of variable length name key). llvm-svn: 253309	2015-11-17 03:47:21 +00:00
Rafael Espindola	65e4902156	Drop prelink support. The way prelink used to work was * The compiler decides if a given section only has relocations that are know to point to the same DSO. If so, it names it .data.rel.ro.local<something>. * The static linker puts all of these together. * The prelinker program assigns addresses to each library and resolves the local relocations. There are many problems with this: * It is incompatible with address space randomization. * The information passed by the compiler is redundant. The linker knows if a given relocation is in the same DSO or not. If could sort by that if so desired. * There are newer ways of speeding up DSO (gnu hash for example). * Even if we want to implement this again in the compiler, the previous implementation is pretty broken. It talks about relocations that are "resolved by the static linker". If they are resolved, there are none left for the prelinker. What one needs to track is if an expression will require only dynamic relocations that point to the same DSO. At this point it looks like the prelinker is an historical curiosity. For example, fedora has retired it because it failed to build for two releases (http://pkgs.fedoraproject.org/cgit/prelink.git/commit/?id=eb43100a8331d91c801ee3dcdb0a0bb9babfdc1f) This patch removes support for it. That is, it stops printing the ".local" sections. llvm-svn: 253280	2015-11-17 00:51:23 +00:00

... 3 4 5 6 7 ...

85026 Commits