llvm-project

Author	SHA1	Message	Date
JF Bastien	61ad8b3907	Fix SCEV r256338. llvm-svn: 256344	2015-12-23 18:18:53 +00:00
Sanjoy Das	2fbfb25ad6	[SCEV] Fix getLoopBackedgeTakenCounts The way `getLoopBackedgeTakenCounts` is written right now isn't correct. It will try to compute and store the BE counts of a Loop #{child loop} number of times (which may be zero). llvm-svn: 256338	2015-12-23 17:48:14 +00:00
Chad Rosier	fba65d2fd3	[LIR] General refactoring to simplify code and ease future code review. Move several checks into isLegalStores. Also, delineate between those stores that are memset-able and those that are memcpy-able. http://reviews.llvm.org/D15683 Patch by Haicheng Wu <haicheng@codeaurora.org>! llvm-svn: 256336	2015-12-23 17:29:33 +00:00
Philip Reames	42bd26f29d	[MachineLICM] Fix handling of memoperands As far as I can tell, the correct interpretation of an empty memoperands list is that we didn't have sufficient room to store information about the MachineInstr, NOT that the MachineInstr doesn't access any particular bit of memory. This appears to be fairly consistent in a number of places, but I'm not 100% sure of this interpretation. I'd really appreciate someone more knowledgeable confirming my reading of the code. This patch fixes two latent bugs in MachineLICM - given the above assumption - and adds comments to document the meaning and required handling. I don't have test cases; these were noticed by inspection. Differential Revision: http://reviews.llvm.org/D15730 llvm-svn: 256335	2015-12-23 17:05:57 +00:00
Simon Pilgrim	17377bdd45	[X86][AVX] Only shuffle the lower half of vectors if the upper half is undefined First step towards making better use of AVX's implicit zeroing of the upper half of a 256-bit vector by instructions that only act on the lower 128-bit vector - discussed on D14151. As well as the fact that 128-bit shuffle instructions are generally more capable, this can be performant for older CPUs with 128-bit ALUs (e.g. Jaguar, Sandy Bridge) that must treat 256-bit vectors as multiple micro-ops. Moved the similar subvector extraction shuffle combines from PerformShuffleCombine256 to lowerVectorShuffle as well. Note: I've avoided combining shuffles that reference elements from the upper halves of the input vectors - this may be reviewed in future work as well (AVX1 would probably always gain, but AVX2 does have some cross-lane shuffle instructions). Differential Revision: http://reviews.llvm.org/D15477 llvm-svn: 256332	2015-12-23 13:10:07 +00:00
David Majnemer	2bc2538470	[OperandBundles] Have GlobalsModRef play nice with operand bundles A call site's use of a Value might not correspond to an argument operand but to a bundle operand. llvm-svn: 256329	2015-12-23 09:58:46 +00:00
David Majnemer	63ad9e0543	[OperandBundles] Have TailCallElim play nice with operand bundles A call site's use of a Value might not correspond to an argument operand but to a bundle operand. This fixes PR25928. llvm-svn: 256328	2015-12-23 09:58:43 +00:00
David Majnemer	02f4787e45	[OperandBundles] Have InstCombine play nice with operand bundles Don't assume a call's use corresponds to an argument operand, it might correspond to a bundle operand. llvm-svn: 256327	2015-12-23 09:58:41 +00:00
David Majnemer	464be3724a	[OperandBundles] Have DeadArgElim play nice with operand bundles A call site's use of a Value might not correspond to an argument operand but to a bundle operand. llvm-svn: 256326	2015-12-23 09:58:36 +00:00
Igor Breger	7b46b4e798	AVX512BW: Enable packed word shift for 512bit vector. Enable lowering scalar immediate shift v64i8. Fix predicate for AVX1/2 shifts. Differential Revision: http://reviews.llvm.org/D15713 llvm-svn: 256324	2015-12-23 08:06:50 +00:00
David Majnemer	c640f863e0	[WinEH] Don't visit the same catchswitch twice We visited the same catchswitch twice because it was both the child of another funclet and the predecessor of a cleanuppad. Instead, change the numbering algorithm to only recurse if the unwind destination of the inner funclet agrees with the unwind destination of the catchswitch. This fixes PR25926. llvm-svn: 256317	2015-12-23 03:59:04 +00:00
Paul Robinson	22d0d31a72	Form reform for MCDwarf. MCDwarf emits a canned abbreviation table, but was not emitting proper forms for DWARF version 4, which is the default after r249655. Differential Revision: http://reviews.llvm.org/D15732 llvm-svn: 256313	2015-12-23 01:57:31 +00:00
Philip Reames	ee8f055327	[GC] Make GCStrategy::isGCManagedPointer a type predicate not a value predicate [NFC] Reasons: 1) The existing form was a form of false generality. None of the implemented GCStrategies use anything other than a type. It's becoming more and more clear we're going to need some type of strong GC pointer in the type system and we shouldn't pretend otherwise at this point. 2) The API was awkward when applied to vectors-of-pointers. The old one could have been made to work, but calling isGCManagedPointer(Ty->getScalarType()) is much cleaner than the Value alternatives. 3) The rewriting implementation effectively assumes the type based predicate as well. We should be consistent. llvm-svn: 256312	2015-12-23 01:42:15 +00:00
Dan Gohman	08d58bcf6a	[WebAssembly] Add a TODO comment for a possible future optimization. llvm-svn: 256306	2015-12-23 00:22:04 +00:00
Manuel Jacob	a4efd8ac2e	[RS4GC] Fix base pair printing for constants. Previously, "%" + name of the value was printed for each derived and base pointer. This is correct for instructions, but wrong for e.g. globals. llvm-svn: 256305	2015-12-23 00:19:45 +00:00
Akira Hatanaka	1cb242eb13	Provide a way to specify inliner's attribute compatibility and merging. This reapplies r256277 with two changes: - In emitFnAttrCompatCheck, change FuncName's type to std::string to fix a use-after-free bug. - Remove an unnecessary install-local target in lib/IR/Makefile. Original commit message for r252949: Provide a way to specify inliner's attribute compatibility and merging rules using table-gen. NFC. This commit adds new classes CompatRule and MergeRule to Attributes.td, which are used to generate code to check attribute compatibility and merge attributes of the caller and callee. rdar://problem/19836465 llvm-svn: 256304	2015-12-22 23:57:37 +00:00
Cong Hou	6a2c71af0b	[BPI] Fix two potential divide-by-zero operations that are introduced in r256263. llvm-svn: 256303	2015-12-22 23:45:55 +00:00
Dan Gohman	a2b2cdc813	[WebAssembly] Trim unneeded #includes. NFC. llvm-svn: 256301	2015-12-22 23:45:21 +00:00
Dan Gohman	cc38ba1954	[WebAssembly] Minor code simplification. NFC. llvm-svn: 256300	2015-12-22 23:39:16 +00:00
Changpeng Fang	b41574a961	AMDGPU/SI: Use flat for global load/store when targeting HSA Summary: For some reason, executing an MUBUF instruction with the addr64 bit set and a zero base pointer in the resource descriptor causes the memory operation to be dropped when the shader is executed using the HSA runtime. This kind of MUBUF instruction is commonly used when the pointer is stored in VGPRs. The base pointer field in the resource descriptor is set to zero and the pointer is stored in the vaddr field. This patch resolves the issue by only using flat instructions for global memory operations when targeting HSA. This is an overly conservative fix as all other configurations of MUBUF instructions appear to work. NOTE: re-commit by fixing a failure in Codegen/AMDGPU/llvm.dbg.value.ll Reviewers: tstellarAMD Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15543 llvm-svn: 256282	2015-12-22 20:55:23 +00:00
Rafael Espindola	10d9a033db	Also add unnamed_addr to functions. llvm-svn: 256281	2015-12-22 20:43:30 +00:00
Akira Hatanaka	9c05cc5670	Revert r256277 and r256279. Some of the bots failed again. llvm-svn: 256280	2015-12-22 20:29:09 +00:00
Akira Hatanaka	3f1bf25db1	Add a .td file I forgot to add in r256277. llvm-svn: 256279	2015-12-22 20:06:50 +00:00
Akira Hatanaka	a61deb249b	Provide a way to specify inliner's attribute compatibility and merging. This reapplies r252990 and r252949. I've added member function getKind to the Attr classes which returns the enum or string of the attribute. Original commit message for r252949: Provide a way to specify inliner's attribute compatibility and merging rules using table-gen. NFC. This commit adds new classes CompatRule and MergeRule to Attributes.td, which are used to generate code to check attribute compatibility and merge attributes of the caller and callee. rdar://problem/19836465 llvm-svn: 256277	2015-12-22 20:00:05 +00:00
Rafael Espindola	5349d87a69	Delete dead GlobalAliases. llvm-svn: 256276	2015-12-22 19:50:22 +00:00
Rafael Espindola	4b0d24c00a	Revert "AMDGPU/SI: Use flat for global load/store when targeting HSA" This reverts commit r256273. It broke CodeGen/AMDGPU/llvm.dbg.value.ll llvm-svn: 256275	2015-12-22 19:46:44 +00:00
Rafael Espindola	2cc46b3701	Merge duplicated code. The code for deleting dead global variables and functions was duplicated. This is in preparation for also deleting dead global aliases. llvm-svn: 256274	2015-12-22 19:38:07 +00:00
Changpeng Fang	9b8a9be058	AMDGPU/SI: Use flat for global load/store when targeting HSA Summary: For some reason, executing an MUBUF instruction with the addr64 bit set and a zero base pointer in the resource descriptor causes the memory operation to be dropped when the shader is executed using the HSA runtime. This kind of MUBUF instruction is commonly used when the pointer is stored in VGPRs. The base pointer field in the resource descriptor is set to zero and the pointer is stored in the vaddr field. This patch resolves the issue by only using flat instructions for global memory operations when targeting HSA. This is an overly conservative fix as all other configurations of MUBUF instructions appear to work. Reviewers: tstellarAMD Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15543 llvm-svn: 256273	2015-12-22 19:32:28 +00:00
Rafael Espindola	9f0bebc3da	Use early continue to reduce indentation. llvm-svn: 256272	2015-12-22 19:26:18 +00:00
Rafael Espindola	e4ed0e56ce	Simplify iterator management. NFC. Not passing an iterator to processGlobal will allow it to work with other GlobalValues. llvm-svn: 256271	2015-12-22 19:16:50 +00:00
Cong Hou	e93b8e1539	[BPI] Replace weights by probabilities in BPI. This patch removes all weight-related interfaces from BPI and replaces them by probability versions. With this patch, we won't use edge weight anymore in either IR or MC passes. Edge probability is a better representation in terms of CFG update and validation. Differential revision: http://reviews.llvm.org/D15519 llvm-svn: 256263	2015-12-22 18:56:14 +00:00
Manuel Jacob	4e4f60ded0	Remove deprecated llvm.experimental.gc.result.{int,float,ptr} intrinsics. Summary: These were deprecated 11 months ago when a generic llvm.experimental.gc.result intrinsic, which works for all types, was added. Reviewers: sanjoy, reames Subscribers: sanjoy, chenli, llvm-commits Differential Revision: http://reviews.llvm.org/D15719 llvm-svn: 256262	2015-12-22 18:44:45 +00:00
Vedant Kumar	d167586a28	[Support] Allow multiple paired calls to {start,stop}Timer() Differential Revision: http://reviews.llvm.org/D15619 Reviewed-by: rafael llvm-svn: 256258	2015-12-22 17:36:17 +00:00
Manuel Jacob	990dfa6fe5	[RS4GC] Fix crash in the case that a live variable has a constant base. Summary: Previously, RS4GC crashed in CreateGCRelocates() because it assumed that every base is also in the array of live variables, which isn't true if a live variable has a constant base. This change fixes the crash by making sure CreateGCRelocates() won't try to relocate a live variable with a constant base. This would be unnecessary anyway because anything with a constant base won't move. Reviewers: reames Subscribers: llvm-commits, sanjoy Differential Revision: http://reviews.llvm.org/D15556 llvm-svn: 256252	2015-12-22 16:50:44 +00:00
Jun Bum Lim	6755c3bc5f	[AArch64] Promote loads from stores This is a recommit of r256004 which was reverted in r256160. The issue was the incorrect promotion for half and byte loads transformed into mov instructions. This fix will replace half and byte type loads only with bit field extracts. Original commit message: This change promotes load instructions which directly read from stores by replacing them with mov instructions. If the store is wider than the load, the load will be replaced with a bitfield extract. For example: STRWui %W1, %X0, 1 %W0 = LDRHHui %X0, 3 becomes STRWui %W1, %X0, 1 %W0 = UBFMWri %W1, 16, 31 llvm-svn: 256249	2015-12-22 16:36:16 +00:00
Chad Rosier	a108010385	Typo. NFC. llvm-svn: 256242	2015-12-22 15:06:47 +00:00
Asaf Badouh	13ffa4bf7c	[X86][AVX512] Add rcp14 and rsqrt14 intrinsics Differential Revision: http://reviews.llvm.org/D15414 llvm-svn: 256237	2015-12-22 11:40:04 +00:00
Keno Fischer	4eccf11373	[ASMPrinter] Fix missing handling of DW_OP_bit_piece In r256077, I added printing for DIExpressions in DEBUG_VALUE comments, but neglected to handle DW_OP_bit_piece operands. Thanks to Mikael Holmen and Joerg Sonnenberger for spotting this. llvm-svn: 256236	2015-12-22 07:14:50 +00:00
Kostya Serebryany	b0fb6e8508	[libFuzzer] add AFL-style dictionary for C++, remove the old file with tokens llvm-svn: 256229	2015-12-22 01:50:51 +00:00
David Majnemer	ff1d084aa2	[MC] Don't use the architecture to govern which object file format to use InitMCObjectFileInfo was trying to override the triple in awkward ways. For example, a triple specifying COFF but not Windows was forced as ELF. This makes it easy for internal invariants to get violated, such as those which triggered PR25912. This fixes PR25912. llvm-svn: 256226	2015-12-22 01:39:04 +00:00
Teresa Johnson	d213aa469e	Handle empty Subprogram list when linking metadata. Use an iterator that handles an empty subprogram list. Fixes PR25915. llvm-svn: 256224	2015-12-22 01:17:19 +00:00
Easwaran Raman	bdb6f1dcc3	Determine callee's hotness and adjust threshold based on that. NFC. This uses the same criteria used in CFE's CodeGenPGO to identify hot and cold callees and uses values of inlinehint-threshold and inlinecold-threshold respectively as the thresholds for such callees. Differential Revision: http://reviews.llvm.org/D15245 llvm-svn: 256222	2015-12-22 00:32:35 +00:00
Evgeniy Stepanov	8827f2db85	[safestack] Add option for non-TLS unsafe stack pointer. This patch adds an option, -safe-stack-no-tls, for using normal storage instead of thread-local storage for the unsafe stack pointer. This can be useful when SafeStack is applied to an operating system kernel. http://reviews.llvm.org/D15673 Patch by Michael LeMay. llvm-svn: 256221	2015-12-22 00:13:11 +00:00
Xinliang David Li	5fe0455563	[PGO] Fix another comdat related issue for COFF The linker requires that a comdat section must be associated with another comdat section that precedes it. This means the comdat section's name needs to use the profile name var's name. Patch tested by Johan Engelen. llvm-svn: 256220	2015-12-22 00:11:15 +00:00
Vedant Kumar	11dc6dc71e	[Support] Timer: Use emplace_back() and range-based loops (NFC) llvm-svn: 256217	2015-12-21 23:41:38 +00:00
Vedant Kumar	3f79e32593	[Support] Timer: simplify the init() method llvm-svn: 256215	2015-12-21 23:27:44 +00:00
Dylan McKay	751a449e2f	[AVR] Added configuration file and machine function information class This commit adds the 'AVRMachineFunctionInfo' class, which simply stores basic properties about generated machine functions. llvm-svn: 256213	2015-12-21 23:13:15 +00:00
Eric Christopher	213a5daab7	Fix line endings after r256155. NFC. llvm-svn: 256211	2015-12-21 23:04:27 +00:00
Evgeniy Stepanov	fda72c52a2	[cfi] Fix LowerBitSets on 32-bit targets. This code attempts to truncate IntPtrTy to i32, which may be the same type. llvm-svn: 256205	2015-12-21 22:14:04 +00:00
David Majnemer	03e2cc3007	[MC, COFF] Support link /incremental conditionally Today, we always take into account the possibility that object files produced by MC may be consumed by an incremental linker. This results in us initializing fields which vary with time (TimeDateStamp) which harms hermetic builds (e.g. verifying a self-host went well) and produces sub-optimal code because we cannot assume anything about the relative position of functions within a section (call sites can get redirected through incremental linker thunks). Let's provide an MCTargetOption which controls this behavior so that we can disable this functionality if we know a priori that the build will not rely on /incremental. llvm-svn: 256203	2015-12-21 22:09:27 +00:00
Jun Bum Lim	a23e5f7516	Enhance BranchProbabilityInfo::calcUnreachableHeuristics for InvokeInst This is a recommit of r256028 with minor fixes in unittests: CodeGen/Mips/eh.ll CodeGen/Mips/insn-zero-size-bb.ll Original commit message: When identifying blocks post-dominated by an unreachable-terminated block in BranchProbabilityInfo, consider only the edge to the normal destination block if the terminator is InvokeInst and let calcInvokeHeuristics() decide edge weights for the InvokeInst. llvm-svn: 256202	2015-12-21 22:00:51 +00:00
Xinliang David Li	ab361efee7	Resubmit r256193 with test fix: assertion failure analyzed llvm-svn: 256201	2015-12-21 21:52:27 +00:00
Xinliang David Li	13da1f149e	Revert r256193: build bot failure triggered llvm-svn: 256198	2015-12-21 21:00:33 +00:00
Cong Hou	8df93ce455	[X86][SSE] Transform truncations between vectors of integers into X86ISD::PACKUS/PACKSS operations during DAG combine. This patch transforms truncation between vectors of integers into X86ISD::PACKUS/PACKSS operations during DAG combine. We don't do it in lowering phase because after type legalization, the original truncation will be turned into a BUILD_VECTOR with each element that is extracted from a vector and then truncated, and from them it is difficult to do this optimization. This greatly improves the performance of truncations on some specific types. Cost table is updated accordingly. Differential revision: http://reviews.llvm.org/D14588 llvm-svn: 256194	2015-12-21 20:42:43 +00:00
Xinliang David Li	6c494cd0df	[PGO] Fix profile var comdat generation problem with COFF When targeting COFF, a comdat section is required to have a global obj with the same name as the comdat (except for comdats whose select kind is associative). This fix makes sure that the comdat is keyed on the data variable for COFF. Also improved test coverage for this. llvm-svn: 256193	2015-12-21 20:41:20 +00:00
Michael Zolotukhin	0c97988e54	[ValueTracking] Properly handle non-sized types in isAligned function. Reviewers: apilipenko, reames, sanjoy, hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15597 llvm-svn: 256192	2015-12-21 20:38:18 +00:00
Adrian Prantl	ce8581389b	Fix PR24563 (LiveDebugVariables unconditionally propagates all DBG_VALUEs) LiveDebugVariables unconditionally propagates all DBG_VALUEs down the dominator tree, which happens to work fine if there already is another DBG_VALUE or the DBG_VALUE happens to describe a single-assignment vreg but is otherwise wrong if the DBG_VALUE is coming from only one of the predecessors. In r255759 we introduced a proper data flow analysis scheduled after LiveDebugVariables that correctly propagates DBG_VALUEs across basic block boundaries. With the new pass in place, the incorrect propagation in LiveDebugVariables can be retired without losing any of the benefits where LiveDebugVariables happened to do the right thing. llvm-svn: 256188	2015-12-21 20:03:00 +00:00
Adrian Prantl	5d9acc2443	Teach ARMLoadStoreOptimizer to ignore DBG_VALUE instructions when merging instructions. As noted in PR24563. rdar://problem/23963293 llvm-svn: 256183	2015-12-21 19:25:03 +00:00
Tom Stellard	2b65ed306d	AMDGPU/SI: Fix encoding for FLAT_SCRATCH registers on VI Summary: These registers have different encodings on CI and VI, so we add pseudo FLAT_SCRATCH registers to be used before MC, and subtarget specific registers to be used by the MC layer. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15661 llvm-svn: 256178	2015-12-21 18:44:27 +00:00
Tom Stellard	9da8620cdb	AMDGPU/SI: Change assembly name for flat scratch registers to flat_scratch This matches what the assembler accepts. llvm-svn: 256177	2015-12-21 18:44:21 +00:00
Matthew Simpson	11c4de6054	[AArch64] Add additional extract-extend patterns for smov This patch adds to the target description two additional patterns for matching extract-extend operations to SMOV. The patterns catch the v16i8-to-i64 and v8i16-to-i64 cases. The existing patterns miss these cases because the extracted elements must first be legalized to i32, resulting in any_extend nodes. This was originally implemented as a DAG combine (r255895), but was reverted due to failing out-of-tree tests. llvm-svn: 256176	2015-12-21 18:31:25 +00:00
Chad Rosier	353d71914a	Remove extra whitespace. NFC. llvm-svn: 256173	2015-12-21 18:08:05 +00:00
Teresa Johnson	4f04d85fa6	[ThinLTO] Rename variable to reflect bulk importing change (NFC) llvm-svn: 256171	2015-12-21 17:33:24 +00:00
Dan Gohman	d544e0c100	[WebAssembly] Convert a regular for loop to a range-based for loop. llvm-svn: 256169	2015-12-21 17:22:02 +00:00
Dan Gohman	d9b4cdb68d	[WebAssembly] Clean up comments and fix a missing #include dependency. llvm-svn: 256168	2015-12-21 17:19:31 +00:00
Dan Gohman	979b766fef	[WebAssembly] Remove an unneeded empty destructor. llvm-svn: 256167	2015-12-21 17:12:40 +00:00
Dan Gohman	d587aa5917	[WebAssembly] Enclose the operand variables for load and store instructions in braces. This allows the AsmMatcherEmitter to properly tokenize the AsmStrings for load and store instructions. This is a step towards asm parsing. llvm-svn: 256166	2015-12-21 16:58:49 +00:00
Dan Gohman	a783f10c16	[WebAssembly] Mark the ARGUMENT pseudo-instructions as CodeGenOnly. llvm-svn: 256165	2015-12-21 16:53:29 +00:00
Dan Gohman	dd20c70b61	[WebAssembly] Add some comments and make some minor source cleanups. llvm-svn: 256164	2015-12-21 16:50:41 +00:00
Dan Gohman	216e0c2ffe	Teach MCOperand::print how to print FPImm operands. llvm-svn: 256163	2015-12-21 16:47:10 +00:00
Teresa Johnson	4034d55158	Remove unused functions from ModuleLinker (NFC) Remove a couple ModuleLinker methods and a related static function that are no longer used after the linker split. llvm-svn: 256162	2015-12-21 15:49:59 +00:00
Teresa Johnson	3470295967	Remove overly strict new assert in BitcodeReader. This fixes a bug introduced by the ThinLTO metadata linking patch r255909. The assert is overly-strict and while useful in development of the patch, doesn't seem interesting to keep. Fixes PR25907. llvm-svn: 256161	2015-12-21 15:38:13 +00:00
Jun Bum Lim	4bb171c8da	Revert "[AArch64] Promote loads from stores" This reverts commit r256004 due to a failure in cortex-a53. llvm-svn: 256160	2015-12-21 15:36:49 +00:00
Chad Rosier	94274fb1ad	[LIR] Refactor code to enable future patch. NFC. llvm-svn: 256159	2015-12-21 14:49:32 +00:00
Chad Rosier	d016574df8	[AArch64] Enable PostRAScheduler for AArch64 generic build. Disable post-ra scheduler for perturbed tests to appease the bots and to preserve the history of the tests. http://reviews.llvm.org/D15652 llvm-svn: 256158	2015-12-21 14:43:45 +00:00
Igor Breger	44b60a3687	AVX512BW: Enable AND/OR/XOR vector byte/word packed operations by promoting to qword, which is natively supported. llvm-svn: 256157	2015-12-21 14:40:36 +00:00
Amjad Aboud	60b5e1b6c0	Implemented Support of IA interrupt and exception handlers: http://lists.llvm.org/pipermail/cfe-dev/2015-September/045171.html Differential Revision: http://reviews.llvm.org/D15567 llvm-svn: 256155	2015-12-21 14:07:14 +00:00
Zlatko Buljan	5da2f6cd03	[mips][microMIPS] Implement DERET and DI instructions and check size operand for EXT and DEXT* instructions Differential Revision: http://reviews.llvm.org/D15570 llvm-svn: 256152	2015-12-21 13:08:58 +00:00
David Majnemer	18663f8787	[MC, COFF] Unbreak support for COFF timestamps Support for COFF timestamps was unintentionally broken in r246905 when it was conditionally available depending on whether or not LLVM was configured with LLVM_ENABLE_TIMESTAMPS. However, Config/config.h was never included which essentially broke the feature. Due to lax testing, the breakage was never identified until we observed strange failures during incremental links of Chromium. This issue is resolved by simply including Config/config.h in WinCOFFObjectWriter and teaching lit that the MC/COFF/timestamp.s test is conditionally supported depending on LLVM_ENABLE_TIMESTAMPS. With this in place, we can strengthen the test to ensure that it will not accidentally get broken in the future. This fixes PR25891. llvm-svn: 256137	2015-12-21 08:03:07 +00:00
NAKAMURA Takumi	9ec6a826dd	[Cygwin] Enable TLS as emutls. It resolves clang selfhosting with std::once() for Cygwin. FIXME: It may be EmulatedTLS-generic also for X86-Android. FIXME: Pass EmulatedTLS to LLVM CodeGen from Clang with -femulated-tls. llvm-svn: 256134	2015-12-21 02:37:23 +00:00
Manuel Jacob	8050a49737	[RS4GC] Add an assert which fails if there is a (yet unsupported) addrspacecast. The slightly strange indentation comes from clang-format. llvm-svn: 256132	2015-12-21 01:26:46 +00:00
Craig Topper	eafbd57ebc	[InstCombine] Fix indentation. NFC. llvm-svn: 256131	2015-12-21 01:02:28 +00:00
Dylan McKay	f061e9b7b2	[AVR] Added AVRCallingConv.td llvm-svn: 256130	2015-12-20 23:17:44 +00:00
Craig Topper	ca66fc5473	[X86] Use range-based for loop. NFC llvm-svn: 256127	2015-12-20 18:41:57 +00:00
Craig Topper	074e845260	[X86] Prevent constant hoisting for a couple compare immediates that the selection DAG knows how to optimize into a shift. This allows "icmp ugt %a, 4294967295" and "icmp uge %a, 4294967296" to be optimized into right shifts by 32 which can fold the immediate into the shift instruction. These patterns show up with some regularity in real code. Unfortunately, since getImmCost can't see the icmp predicate we can't tell if we're only catching these specific cases. llvm-svn: 256126	2015-12-20 18:41:54 +00:00
Dylan McKay	029346f438	Add AVR.td and AVRRegisterInfo.td Summary: This adds the core AVR TableGen file, along with the register descriptions. Lines in AVR.td which require other TableGen files which haven't been committed yet are commented out. This is a fairly trivial patch, and should only require a quick review. I kept the line width smaller than 80 columns, but there are a few exceptions because I'm not sure how to split a string over several lines. Reviewers: stoklund Subscribers: dylanmckay, agnat Differential Revision: http://reviews.llvm.org/D14684 llvm-svn: 256120	2015-12-20 12:16:20 +00:00
Xinliang David Li	6005728843	Fix a latent UAF bug in profwriter llvm-svn: 256116	2015-12-20 08:46:18 +00:00
Weiming Zhao	613c6862fa	Fix mapping of @llvm.arm.ssat/usat intrinsics to ssat/usat instructions for Thumb2 Summary: r250697 fixed the mapping for ARM mode. We have to do the same for Thumb2 otherwise the same llvm.arm.ssat() will generate different saturating amount for ARM and Thumb. r250697: http://reviews.llvm.org/rL250697 Reviewers: rmaprath Subscribers: aemerson, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D15653 llvm-svn: 256115	2015-12-20 06:41:44 +00:00
Xinliang David Li	a716cc5c33	[PGO] Improve Indexed Profile Reader efficiency With the support of value profiling added, the Indexed prof reader gets less efficient. The prof reader initialization used to be just reading the file header, but with VP support added, initialization needs to walk through all profile keys of the ondisk hash table, resulting in very poor locality and a large memory increase (keys are stored together with the profile data in the mapped profile buffer). Even worse, when the reader is used by the compiler (not the llvm-profdata tool), the penalty becomes very high as compilation of each single module requires touching the profile data buffer for the whole program. In this patch, the icall target values (MD5hash) are no longer eagerly converted back to name strings when the data is read into memory. A new interface is added to the profile reader so that InstrProfSymtab can be lazily created for the Indexed profile reader on demand. Creation of the symtab is intended to be used by the llvm-profdata tool for symbolic dumping of VP data. It can be used with the compiler (for legacy out of tree uses) too, but this is not recommended due to the compile time and memory reasons mentioned above. Some other cleanups are also included: Function Addr to md5 map is now consolidated into InstrProfSymtab. InstrProfStringtab is no longer used and eliminated. llvm-svn: 256114	2015-12-20 06:22:13 +00:00
Xinliang David Li	5c24da5d8e	Minor clean up -- move large single use method out of header(NFC) llvm-svn: 256113	2015-12-20 05:15:45 +00:00
Sanjoy Das	ab0626e35f	Nonnull elements in OperandBundleCallSites are not all Instructions `CloneAndPruneIntoFromInst` sometimes RAUW's dead instructions with `undef` before erasing them (to avoid deleting instructions that still have uses). This changes the `WeakVH` in `OperandBundleCallSites` to hold an `undef`, and we need to guard against this eventuality in `llvm::InlineFunction`. llvm-svn: 256110	2015-12-19 22:40:28 +00:00
Rafael Espindola	30941d264b	Delete APIs that have been deprecated since 2010. llvm-svn: 256107	2015-12-19 21:42:07 +00:00
Rafael Espindola	e01e363fd9	Assert that we have all use/users in the getters. An error that is pretty easy to make is to use the lazy bitcode reader and then do something like if (V.use_empty()) The problem is that uses in unmaterialized functions are not accounted for. This patch adds asserts that all uses are known. llvm-svn: 256105	2015-12-19 20:03:23 +00:00
Manuel Jacob	5b90b147d4	Remove unnecessary casts. NFC. llvm-svn: 256101	2015-12-19 18:38:42 +00:00
Matt Arsenault	d206d6cc54	SelectionDAG: Cleanup integer bin op promotion functions. SDIV and UDIV had special handling, but this is the same handling that min/max need. llvm-svn: 256098	2015-12-19 17:18:43 +00:00
Vedant Kumar	3a63fb316c	Re-reapply "[IR] Move optional data in llvm::Function into a hungoff uselist" Make personality functions, prefix data, and prologue data hungoff operands of Function. This is based on the email thread "[RFC] Clean up the way we store optional Function data" on llvm-dev. Thanks to sanjoyd, majnemer, rnk, loladiro, and dexonsmith for feedback! Includes a fix to scrub value subclass data in dropAllReferences. Does not use binary literals. Differential Revision: http://reviews.llvm.org/D13829 llvm-svn: 256095	2015-12-19 08:52:49 +00:00
Vedant Kumar	44dd9871e8	Revert "Reapply "[IR] Move optional data in llvm::Function into a hungoff uselist"" This reverts commit r256093. This broke lld-x86_64-win7 because of -Werror,-Wc++1y-extensions. llvm-svn: 256094	2015-12-19 08:48:43 +00:00
Vedant Kumar	d481752e68	Reapply "[IR] Move optional data in llvm::Function into a hungoff uselist" Make personality functions, prefix data, and prologue data hungoff operands of Function. This is based on the email thread "[RFC] Clean up the way we store optional Function data" on llvm-dev. Thanks to sanjoyd, majnemer, rnk, loladiro, and dexonsmith for feedback! Includes a fix to scrub value subclass data in dropAllReferences. Differential Revision: http://reviews.llvm.org/D13829 llvm-svn: 256093	2015-12-19 08:29:51 +00:00
Vedant Kumar	e069c4b6d1	Revert "[IR] Move optional data in llvm::Function into a hungoff uselist" This reverts commit r256090. This broke llvm-clang-lld-x86_64-debian-fast. llvm-svn: 256091	2015-12-19 07:30:44 +00:00
Vedant Kumar	be7525d4fa	[IR] Move optional data in llvm::Function into a hungoff uselist Make personality functions, prefix data, and prologue data hungoff operands of Function. This is based on the email thread "[RFC] Clean up the way we store optional Function data" on llvm-dev. Thanks to sanjoyd, majnemer, rnk, loladiro, and dexonsmith for feedback! Differential Revision: http://reviews.llvm.org/D13829 llvm-svn: 256090	2015-12-19 07:08:56 +00:00
Kostya Serebryany	550e9c80a6	[libFuzzer] deprecate -save_minimized_corpus, -merge can be used instead llvm-svn: 256086	2015-12-19 03:42:16 +00:00
Kostya Serebryany	bf65644c97	[libFuzzer] split the tests to run them in parallel, remove one redundant test llvm-svn: 256085	2015-12-19 03:35:30 +00:00
Tom Stellard	ffc1a5aef7	AMDGPU/SI: Fix implementation of isSourceOfDivergence() for graphics shaders Summary: The analysis of shader inputs was completely wrong. We were passing the wrong index to AttributeSet::hasAttribute() and the logic for which inputs were in SGPRs was wrong too. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15608 llvm-svn: 256082	2015-12-19 02:54:15 +00:00
Kostya Serebryany	27ab2d759f	[libFuzzer] make CrossOver just one of the other mutations llvm-svn: 256081	2015-12-19 02:49:09 +00:00
Philip Reames	5d54689bca	[RS4GC] Remove an overly strong assertion As shown by the included test case, it's reasonable to end up with constant references during base pointer calculation. The code actually handled this case just fine, we only had the assert to help isolate problems under the belief that constant references shouldn't be present in IR generated by managed frontends. This turned out to be wrong on two fronts: 1) Manuel Jacob is working on a language with constant references, and 2) we found a case where the optimizer does create them in practice. llvm-svn: 256079	2015-12-19 02:38:22 +00:00
Keno Fischer	00cbf9a69a	Clean up the processing of dbg.value in various places Summary: First up is instcombine, where in the dbg.declare -> dbg.value conversion, the llvm.dbg.value needs to be called on the actual loaded value, rather than the address (since the whole point of this transformation is to be able to get rid of the alloca). Further, now that that's cleaned up, we can remove a hack in the backend, that would add an implicit OP_deref if the argument to dbg.value was an alloca. This stems from before the existence of DIExpression and is no longer necessary since the deref can be expressed explicitly. Now, in order to make sure that the tests pass with this change, we need to correct the printing of DEBUG_VALUE comments to take into account the expression, which wasn't taken into account before. Unfortunately, for both these changes, there were a number of incorrect test cases (mostly the wrong number of DW_OP_derefs, but also a couple where the test itself was broken more badly). aprantl and I have gone through and adjusted these test cases in order to make them pass with these fixes and in some cases to make sure they're actually testing what they are meant to test. Reviewers: aprantl Subscribers: dsanders Differential Revision: http://reviews.llvm.org/D14186 llvm-svn: 256077	2015-12-19 02:02:44 +00:00
Matt Arsenault	2aed6ca1d3	AMDGPU: Switch barrier intrinsics to using convergent noduplicate prevents unrolling of small loops that happen to have barriers in them. If a loop has a barrier in it, it is OK to duplicate it for the unroll. llvm-svn: 256075	2015-12-19 01:46:41 +00:00
Matt Arsenault	10a509292c	Fix broken type legalization of min/max This was using an anyext when promoting the type when zext/sext is required. llvm-svn: 256074	2015-12-19 01:39:48 +00:00
Nicolai Haehnle	6bcf8b2890	AMDGPU/SI: use S_MOV_B64 for larger copies in copyPhysReg Reviewers: arsenm, tstellarAMD Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15629 llvm-svn: 256073	2015-12-19 01:36:26 +00:00
Nicolai Haehnle	dd58705af6	AMDGPU: fix overlapping copies in copyPhysReg Summary: When copying aggregate registers within the same register class, there may be an overlap between source and destination that forces us to do the copy backwards. Do the simplest possible thing that guarantees the correct order of moves when there are overlaps, and does whatever when there is no overlap. (The last part forces some trivial adjustments to test cases.) Together with r255906, this fixes a VM fault in Unreal Elemental Demo. While at it, change the generation of kill and def flags to something that looks more reasonable. This method is used very late during compilation, so it probably doesn't matter in practice, and to be honest, I don't know if this change is actually correct because the semantics in connection with aggregate registers vs. sub-registers are not clear to me. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=93264 Reviewers: arsenm, tstellarAMD Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15622 llvm-svn: 256072	2015-12-19 01:16:06 +00:00
Kostya Serebryany	14c50288cc	[libFuzzer] print successful mutation sequences llvm-svn: 256071	2015-12-19 01:09:49 +00:00
Rafael Espindola	2339ffed97	Deprecate a few C APIs. This deprecates: * LLVMParseBitcode * LLVMParseBitcodeInContext * LLVMGetBitcodeModuleInContext * LLVMGetBitcodeModule They are replaced with the functions with a 2 suffix which do not record a diagnostic. llvm-svn: 256065	2015-12-18 23:46:42 +00:00
Xinliang David Li	020f22d810	[PGO] Cleanup: Move large member functions out of line (NFC) llvm-svn: 256058	2015-12-18 23:06:37 +00:00
Xinliang David Li	49ee76d082	[PGO] Simplify computehash interface (NFC) llvm-svn: 256047	2015-12-18 22:22:12 +00:00
Alexey Samsonov	1eaae4c3b1	[Symbolize] Improve the ownership of parsed objects. This code changes the way Symbolize handles parsed binaries: now parsed OwningBinary<Binary> is not broken into (binary, memory buffer) pair, and is just stored as-is in a cache. ObjectFile components of Mach-O universal binaries are also stored explicitly in a separate cache. Additionally, this change: * simplifies the code that parses/caches binaries: it's now done in a single place, not three different functions. * makes flush() method behave as expected, and actually clear the cached parsed binaries and objects. * fixes a dangling pointer issue described in http://reviews.llvm.org/D15638 llvm-svn: 256041	2015-12-18 22:02:14 +00:00
Cong Hou	fd0d62b87e	Use getEdgeProbability() instead of getEdgeWeight() in BFI and remove getEdgeWeight() interfaces from MBPI. This patch removes all getEdgeWeight() interfaces from CodeGen directory. As getEdgeProbability() is a little more expensive than getEdgeWeight(), I will compose a patch soon in which BPI only stores probabilities instead of edge weights so that getEdgeProbability() will have O(1) time. Differential revision: http://reviews.llvm.org/D15489 llvm-svn: 256039	2015-12-18 21:53:24 +00:00
Jingyue Wu	3f422280f5	[DivergenceAnalysis] fix a bug in computing influence regions Fixes PR25864 llvm-svn: 256036	2015-12-18 21:44:26 +00:00
Jingyue Wu	ba3ca76ed2	[NaryReassociate] allow candidate to have a different type Summary: If Candidate may have a different type from GEP, we should bitcast or pointer cast it to GEP's type so that the later RAUW doesn't complain. Added a test in nary-gep.ll Reviewers: tra, meheff Subscribers: mcrosier, llvm-commits, jholewinski Differential Revision: http://reviews.llvm.org/D15618 llvm-svn: 256035	2015-12-18 21:36:30 +00:00
Rafael Espindola	708a91a103	Revert "Enhance BranchProbabilityInfo::calcUnreachableHeuristics for InvokeInst" This reverts commit r256028. It broke: LLVM :: CodeGen/Mips/eh.ll LLVM :: CodeGen/Mips/insn-zero-size-bb.ll llvm-svn: 256032	2015-12-18 21:23:32 +00:00
Rafael Espindola	79753a07a6	Remove redundant argument. NFC. llvm-svn: 256031	2015-12-18 21:18:57 +00:00
Jun Bum Lim	51a247065e	Enhance BranchProbabilityInfo::calcUnreachableHeuristics for InvokeInst When identifying blocks post-dominated by an unreachable-terminated block in BranchProbabilityInfo, consider only the edge to the normal destination block if the terminator is InvokeInst and let calcInvokeHeuristics() decide edge weights for the InvokeInst. llvm-svn: 256028	2015-12-18 20:53:47 +00:00
Krzysztof Parzyszek	21dc8bdd9e	[Hexagon] Add PIC support llvm-svn: 256025	2015-12-18 20:19:30 +00:00
Rafael Espindola	c4a03483f4	Drop materializeAllPermanently. This inlines materializeAll into the only caller (materializeAllPermanently) and renames materializeAllPermanently to just materializeAll. llvm-svn: 256024	2015-12-18 20:13:39 +00:00
Changpeng Fang	c9963936e7	AMDGPU/SI: Test commit Summary: This is just my first commit. Test! Reviewers: none Subscribers: none Differential Revision: none llvm-svn: 256022	2015-12-18 20:04:28 +00:00
Changpeng Fang	ef735b74c1	Revert "AMDGPU/SI: Test commit" This reverts commit a493cb636e0152ad28210934a47c6c44b1437193. llvm-svn: 256021	2015-12-18 20:04:26 +00:00
Changpeng Fang	7fdf674c2e	AMDGPU/SI: Test commit Summary: This is just my first commit. Test! Reviewers: none Subscribers: none Differential Revision: none llvm-svn: 256020	2015-12-18 19:57:41 +00:00
Rafael Espindola	18c63b0f18	Drop support for dematerializing. It was only used on lib/Linker and the use was "dead" since it was used on a function the IRMover had just moved. llvm-svn: 256019	2015-12-18 19:57:26 +00:00
Pete Cooper	98052537f0	Revert "Improve DWARFDebugFrame::parse to also handle __eh_frame." This reverts commit r256008. It's breaking multiple buildbots, although it works for me locally. llvm-svn: 256013	2015-12-18 19:45:38 +00:00
Teresa Johnson	bef543635a	Rename variables to reflect linker split (NFC) Renamed variables to be more reflective of whether they are an instance of Linker, IRLinker or ModuleLinker. Also fix a stale comment. llvm-svn: 256011	2015-12-18 19:28:59 +00:00
Eric Christopher	9a8b5e7ece	Convert Arg, ArgList, and Option to dump() to dbgs() rather than errs(). Also add print() functions. Patch by Justin Lebar! llvm-svn: 256010	2015-12-18 18:55:26 +00:00
Eric Christopher	42b56eefd8	Add a dump method for ArgList. Patch by Justin Lebar! llvm-svn: 256009	2015-12-18 18:55:22 +00:00
Pete Cooper	6c97f4c7d7	Improve DWARFDebugFrame::parse to also handle __eh_frame. LLVM MC has single methods which can handle the output of EH frame and DWARF CIE's and FDE's. This code improves DWARFDebugFrame::parse to do the same for parsing. This also allows llvm-objdump to support the --dwarf=frames option which objdump supports. This option dumps the .eh_frame section using the new code in DWARFDebugFrame::parse. http://reviews.llvm.org/D15535 Reviewed by Rafael Espindola. llvm-svn: 256008	2015-12-18 18:51:08 +00:00
Krzysztof Parzyszek	a45c0e0d4e	Recognize strings for Hexagon-specific variant kinds llvm-svn: 256007	2015-12-18 18:47:27 +00:00
Andrew Kaylor	123048d26a	[WinEH] Update LCSSA to handle catchswitch with handlers inside and outside a loop Differential Revision: http://reviews.llvm.org/D15630 llvm-svn: 256005	2015-12-18 18:12:35 +00:00
Jun Bum Lim	3509d64c24	[AArch64] Promote loads from stores This change promotes load instructions which directly read from stores by replacing them with mov instructions. If the store is wider than the load, the load will be replaced with a bitfield extract. For example : STRWui %W1, %X0, 1 %W0 = LDRHHui %X0, 3 becomes STRWui %W1, %X0, 1 %W0 = UBFMWri %W1, 16, 31 llvm-svn: 256004	2015-12-18 18:08:30 +00:00
Teresa Johnson	0e7c82cb69	[ThinLTO/LTO] Don't link in unneeded metadata Summary: Third patch split out from http://reviews.llvm.org/D14752. Only map in needed DISubroutine metadata (imported or otherwise linked in functions and other DISubroutine referenced by inlined instructions). This is supported for ThinLTO, LTO and llvm-link --only-needed, with associated tests for each one. Depends on D14838. Reviewers: dexonsmith, joker.eph Subscribers: davidxl, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D14843 llvm-svn: 256003	2015-12-18 17:51:37 +00:00
Rafael Espindola	7a36355b21	Handle archives with paths in the names. We always create archives with just the filename as the member name, but other archives can put a more complicated path in there. This patch handles it by computing just the filename as we do when adding a new member. If storing the path is important for some reason, we should probably have an orthogonal option for doing that and do it for both old and new members. Fixes pr25877. llvm-svn: 256001	2015-12-18 16:07:17 +00:00
Rafael Espindola	d7f9c250df	clang-format to reduce diff in another patch. llvm-svn: 255999	2015-12-18 14:06:34 +00:00
Rafael Espindola	f382b8836a	Fix error handling in LLVMGetBitcodeModuleInContext. It was not setting OutMessage. llvm-svn: 255998	2015-12-18 13:58:05 +00:00
Vaivaswatha Nagaraj	ed237938da	GlobalsAA: Take advantage of ArgMemOnly, InaccessibleMemOnly and InaccessibleMemOrArgMemOnly attributes Summary: 1. Modify AnalyzeCallGraph() to retain function info for external functions if the function has [InaccessibleMemOr]ArgMemOnly flags. 2. When analyzing the use of a global as a function parameter at a call site, mark the callee also as modifying the global appropriately. 3. Add additional test cases. Depends on D15499 Reviewers: hfinkel, jmolloy Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15605 llvm-svn: 255994	2015-12-18 11:02:52 +00:00
Zlatko Buljan	252cca555f	[mips][microMIPS][DSP] Implement PACKRL.PH, PICK.PH, PICK.QB, SHILO, SHILOV and WRDSP instructions Differential Revision: http://reviews.llvm.org/D14429 llvm-svn: 255991	2015-12-18 08:59:37 +00:00
Philip Reames	dd0948a1b6	[RS4GC] Use a value handle to help isolate errors quickly Inspired by the bug reported in 25846. Whatever we end up doing about that one, the value handle change is a generally good one since it will help catch this type of mistake more quickly. Patch by: Manuel Jacob llvm-svn: 255984	2015-12-18 03:53:28 +00:00
Vedant Kumar	2892a4a302	Revert "[Option] Introduce Arg::print(raw_ostream&) and use llvm::dbgs" This reverts commit r255977. This is part of http://reviews.llvm.org/D15634. llvm-svn: 255978	2015-12-18 02:30:45 +00:00
Vedant Kumar	a1e51fd968	[Option] Introduce Arg::print(raw_ostream&) and use llvm::dbgs llvm-svn: 255977	2015-12-18 02:27:52 +00:00
Eric Christopher	a6b96004b5	Reorganize the C API headers to improve build times. Type specific declarations have been moved to Type.h and error handling routines have been moved to ErrorHandling.h. Both are included in Core.h so nothing should change for projects directly including the headers, but transitive dependencies may be affected. llvm-svn: 255965	2015-12-18 01:46:52 +00:00
Eric Christopher	8c2adf6b49	Remove unused class variables. llvm-svn: 255939	2015-12-17 23:43:40 +00:00
Hans Wennborg	a6a2e512cf	[X86] Use push-pop for materializing small constants under 'minsize' Use the 3-byte (4 with REX prefix) push-pop sequence for materializing small constants. This is smaller than using a mov (5, 6 or 7 bytes depending on size and REX prefix), but it's likely to be slower, so only used for 'minsize'. This is a follow-up to r255656. Differential Revision: http://reviews.llvm.org/D15549 llvm-svn: 255936	2015-12-17 23:18:39 +00:00
Philip Reames	d7a6cc859a	[InstCombine] Extend peephole DSE to handle unordered atomics This extends the same line of reasoning used in EarlyCSE w/http://reviews.llvm.org/D15352 to the DSE implementation in InstCombine. Key points: * We only remove unordered or simple stores. * The loads producing values consumed by dead stores don't influence whether the store is dead. Differential Revision: http://reviews.llvm.org/D15354 llvm-svn: 255932	2015-12-17 22:19:27 +00:00
JF Bastien	d1fb58538f	Polish atomic pointers Summary: I didn't realize that we already allowed atomic load/store of pointers, it was added in 2012 by r162146. This patch updates the documentation and tightens the verifier by using DataLayout to make sure that the stored size is byte-sized and power-of-two. DataLayout is also used for integers, and while I'm here I updated the corresponding code for cmpxchg and rmw. See the following discussion for context and upcoming changes to add floating-point and vector atomics: https://groups.google.com/forum/#!topic/llvm-dev/Nh0P_E3CRoo/discussion Reviewers: reames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15512 llvm-svn: 255931	2015-12-17 22:09:19 +00:00
Matthew Simpson	13dddb0799	Revert "[AArch64] Add DAG combine for extract extend pattern" This reverts commit r255895. The patch breaks internal tests. Reverting until a fix is ready. llvm-svn: 255928	2015-12-17 21:29:47 +00:00
Rafael Espindola	776e458d81	Drop functions that have been deprecated since 2010. These functions were deprecated in r97608. llvm-svn: 255927	2015-12-17 21:16:12 +00:00
Dave Bartolomeo	ea039c121b	Test commit llvm-svn: 255926	2015-12-17 20:54:16 +00:00
Dan Gohman	670a60ed52	[WebAssembly] Switch WebAssemblyMCAsmInfo.h from MCAsmInfo to MCAsmInfoELF. llvm-svn: 255925	2015-12-17 20:50:45 +00:00
Sanjoy Das	0de2feceb1	[SCEV] Add and use SCEVConstant::getAPInt; NFCI llvm-svn: 255921	2015-12-17 20:28:46 +00:00
Weiming Zhao	24fbef55f9	[InstCombine] Adding "\n" to debug output. NFC. Summary: [InstCombine] Adding '\n' to debug output. NFC. Patch by Zhaoshi Zheng <zhaoshiz@codeaurora.org> Reviewers: apazos, majnemer, weimingz Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15403 llvm-svn: 255920	2015-12-17 19:53:41 +00:00
Philip Reames	15145fb7b1	[EarlyCSE] DSE of atomic unordered stores The rules for removing trivially dead stores are a lot less complicated than loads. Since we know the later store post dominates the former and the former dominates the later, unless the former has side effects other than the actual store, we can remove it. One slightly surprising thing is that we can freely remove atomic stores, even if the later one isn't atomic. There's no guarantee the atomic one was ever visible. For the moment, we don't handle DSE of ordered atomic stores. We could extend the same chain of reasoning to them, but the catch is we'd then have to model the ordering effect without a store instruction. Since our fences are stronger than our operation orderings, simply using a fence isn't an obvious win. This arguably calls for a refinement in our fence specification, but that's (much) later work. Differential Revision: http://reviews.llvm.org/D15352 llvm-svn: 255914	2015-12-17 18:50:50 +00:00
Teresa Johnson	e5a6191732	[ThinLTO] Metadata linking for imported functions Summary: Second patch split out from http://reviews.llvm.org/D14752. Maps metadata as a post-pass from each module when importing complete, suturing up final metadata to the temporary metadata left on the imported instructions. This entails saving the mapping from bitcode value id to temporary metadata in the importing pass, and from bitcode value id to final metadata during the metadata linking postpass. Depends on D14825. Reviewers: dexonsmith, joker.eph Subscribers: davidxl, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D14838 llvm-svn: 255909	2015-12-17 17:14:09 +00:00
Tom Stellard	caaa3aa07c	AMDGPU/SI: Reserve appropriate number of sgprs for flat scratch init. Reviewers: tstellarAMD Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15583 Patch by: Changpeng Fang llvm-svn: 255908	2015-12-17 17:05:09 +00:00
Nicolai Haehnle	87323da6eb	AMDGPU: Fix off-by-one in SIRegisterInfo::eliminateFrameIndex Summary: The method insertNOPs expected the number of wait states to be passed as parameter, while eliminateFrameIndex passed the immediate argument for the S_NOP, leading to an off-by-one error. Rename the method to make the meaning of its parameter clearer. The number of 4 / 5 wait states (which is what the method has always _tried_ to do according to the comment) is correct according to the hardware docs. I stumbled upon this while trying to track down the cause of https://bugs.freedesktop.org/show_bug.cgi?id=93264. While clearly needed, this patch unfortunately does not fix that bug... Reviewers: arsenm, tstellarAMD Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15542 llvm-svn: 255906	2015-12-17 16:46:42 +00:00
Andy Gibbs	33a0eb740e	Revert r254592 (virtual dtor in SCEVPredicate). Clang has better diagnostics in this case. It is not necessary therefore to change the destructor to avoid what is effectively an invalid warning in gcc. Instead, better handle the warning flags given to the compiler. llvm-svn: 255905	2015-12-17 16:43:53 +00:00
Teresa Johnson	4a9bf5872c	Mark a couple ModuleLinker member functions as const (NFC) llvm-svn: 255903	2015-12-17 16:34:53 +00:00
Rafael Espindola	f44db24e1f	Avoid explicit relocation sorting most of the time. These days relocations are created and stored in a deterministic way. The order they are created is also suitable for the .o file, so we don't need an explicit sort. The last remaining exception is MIPS. llvm-svn: 255902	2015-12-17 16:22:06 +00:00
Rafael Espindola	9e1cae510f	Revert "[AArch64] Enable PostRAScheduler for AArch64 generic build" This reverts commit r255896. It broke the tests. llvm-svn: 255899	2015-12-17 15:12:26 +00:00
Rafael Espindola	d0e16522c7	Always sort by offset first. NFC. Every target changing sortRelocs was first calling the parent implementation. Just run that first. llvm-svn: 255898	2015-12-17 15:08:24 +00:00
Diego Novillo	8561841875	Fix unused variable warning in release builds. NFC. llvm-svn: 255897	2015-12-17 14:58:34 +00:00
MinSeong Kim	d05e9fd194	[AArch64] Enable PostRAScheduler for AArch64 generic build This patch enables PostRAScheduler specifically for AArch64 generic build, which is beneficial from the performance perspective. Speedups up to 2 to 7% for some benchmarks on A57 and A53 are observed. Also benchmarks from LLVM test-suite did not regress. Differential Revision: http://reviews.llvm.org/D15557 llvm-svn: 255896	2015-12-17 14:51:22 +00:00
Matthew Simpson	4355e404d5	[AArch64] Add DAG combine for extract extend pattern This patch adds a DAG combine for (any_extend (extract_vector_elt v, i)) -> (extract_vector_elt v, i). The combine enables us to better match some SMOV patterns. Differential Revision: http://reviews.llvm.org/D15515 llvm-svn: 255895	2015-12-17 14:30:55 +00:00
Rafael Espindola	850ba46dd6	Simplify. NFC. llvm-svn: 255894	2015-12-17 14:19:52 +00:00
Alexey Bataev	7b72b658cc	[X86] Add option for enabling LEA optimization pass, by Andrey Turetsky Add option to enable/disable LEA optimization pass. By default the pass is disabled. Differential Revision: http://reviews.llvm.org/D15573 llvm-svn: 255881	2015-12-17 07:34:39 +00:00
Dan Gohman	5bf22fc84a	[WebAssembly] Convert WebAssemblyTargetObjectFile to TargetLoweringObjectFileELF llvm-svn: 255877	2015-12-17 04:55:44 +00:00
Matthias Braun	454192917b	AArch64: Simplify emitEpilogue() and related code; NFC This is in preparation to an upcoming patch. llvm-svn: 255872	2015-12-17 03:18:47 +00:00
Dan Gohman	05ac43fec3	[WebAssembly] Experimental ELF writer support This creates the initial infrastructure for writing ELF output files. It doesn't yet have any implementation for encoding instructions. Differential Revision: http://reviews.llvm.org/D15555 llvm-svn: 255869	2015-12-17 01:39:00 +00:00
Cong Hou	b9e8d483b5	Fix PR25838. This is a quick fix to PR25838. The issue comes from the restriction that we cannot normalize probabilities containing both known and unknown ones. A patch that removes this restriction is under the review now: http://reviews.llvm.org/D15548 llvm-svn: 255867	2015-12-17 01:29:08 +00:00
Xinliang David Li	50de45dcc1	[PGO] InstrPGO and coverage code refactoring (NFC) Introduce a new class InstrProfSymtab to abstract the PGO symbol table for prof and coverage reader. The symtab is used to look up a function's PGO name using function keys. The first user of the class is CoverageMapping Reader. More will follow. llvm-svn: 255862	2015-12-17 00:53:37 +00:00
JF Bastien	eefff9ccc5	WebAssembly: update expected torture test failures We now have 240 expected failures. llvm-svn: 255858	2015-12-17 00:12:06 +00:00
Rafael Espindola	c49ac5e7c2	Use std::unique_ptr. NFC. llvm-svn: 255852	2015-12-16 23:49:14 +00:00
Dan Gohman	4172953813	[WebAssembly] Fix legalization of shift operators on large integer types. llvm-svn: 255847	2015-12-16 23:25:51 +00:00
Derek Schuff	8bb5f2927a	[WebAssembly] Implement eliminateCallFramePseudo Summary: Implement eliminateCallFramePseudo to handle ADJCALLSTACKUP/DOWN pseudo-instructions. Add a test calling a vararg function which causes non-0 adjustments. This revealed an issue with RegisterCoalescer wherein it eliminates a COPY from SP32 to a vreg but fails to update the live ranges of EXPR_STACK, causing a machineinstr verifier failure (so this test is commented out). Also add a dynamic alloca test, which causes a callseq_end dag node with a 0 (instead of undef) second argument to be generated. We currently fail to select that, so adjust the ADJCALLSTACKUP tablegen code to handle it. Differential Revision: http://reviews.llvm.org/D15587 llvm-svn: 255844	2015-12-16 23:21:30 +00:00
Rafael Espindola	434e956181	Change linkInModule to take a std::unique_ptr. Passing in a std::unique_ptr should help find errors when the module is used after being linked into another module. llvm-svn: 255842	2015-12-16 23:16:33 +00:00
Eric Christopher	bfba572425	Fix funciton->function typo. llvm-svn: 255841	2015-12-16 23:10:53 +00:00
Rafael Espindola	3f210fc0c8	Drop an unnecessary use of writev. It looks like the code this patch deletes is based on a misunderstanding of what guarantees writev provides. In particular, writev with 1 iovec is not "more atomic" than a write. Testing on OS X shows that both write and writev from multiple processes can be intermixed. llvm-svn: 255837	2015-12-16 22:59:06 +00:00
Ahmed Bougacha	66834ec6e1	[AArch64] Simplify some TRI/TII getters. NFC. We don't need static_casts when we use the right Subtarget. llvm-svn: 255836	2015-12-16 22:54:06 +00:00
Rafael Espindola	b94ab5ffbd	Simplify memory management with std::unique_ptr. llvm-svn: 255831	2015-12-16 22:28:34 +00:00
Ahmed Bougacha	cecb6b0865	[CodeGen] Make MachineInstrBuilder::copyImplicitOps const. NFC. This matches the other MIB methods, none of which modify the builder. Without this, we can't chain copyImplicitOps. Also reformat the few users, in PPCEarlyReturn. llvm-svn: 255828	2015-12-16 22:15:30 +00:00
Nathan Slingerland	48dd080c77	[PGO] Handle and report overflow during profile merge for all types of data Summary: Surface counter overflow when merging profile data. Merging still occurs on overflow but counts saturate to the maximum representable value. Overflow is reported to the user. Reviewers: davidxl, dnovillo, silvas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15547 llvm-svn: 255825	2015-12-16 21:45:43 +00:00
Manman Ren	cbe4f9417d	CXX_FAST_TLS calling convention: performance improvement for AArch64. The access function has a short entry and a short exit, the initialization block is only run the first time. To improve the performance, we want to have a short frame at the entry and exit. We explicitly handle most of the CSRs via copies. Only the CSRs that are not handled via copies will be in CSR_SaveList. Frame lowering and prologue/epilogue insertion will generate a short frame in the entry and exit according to CSR_SaveList. The majority of the CSRs will be handled by register allocator. Register allocator will try to spill and reload them in the initialization block. We add CSRsViaCopy, it will be explicitly handled during lowering. 1> we first set FunctionLoweringInfo->SplitCSR if conditions are met (the target supports it for the given machine function and the function has only return exits). We also call TLI->initializeSplitCSR to perform initialization. 2> we call TLI->insertCopiesSplitCSR to insert copies from CSRsViaCopy to virtual registers at beginning of the entry block and copies from virtual registers to CSRsViaCopy at beginning of the exit blocks. 3> we also need to make sure the explicit copies will not be eliminated. The target independent portion was committed as r255353. rdar://problem/23557469 Differential Revision: http://reviews.llvm.org/D15341 llvm-svn: 255821	2015-12-16 21:04:19 +00:00
Manman Ren	3e3edc91f9	CXX_FAST_TLS calling convention: target independent portion. Update supportSplitCSR's interface to take machine function instead of the calling convention. Review comments for http://reviews.llvm.org/D15341 llvm-svn: 255818	2015-12-16 20:45:48 +00:00
Derek Schuff	993d35b4aa	Remove now-unused include llvm-svn: 255817	2015-12-16 20:43:10 +00:00
Derek Schuff	83717cc297	Iterate over phys regs instead llvm-svn: 255816	2015-12-16 20:43:08 +00:00
Derek Schuff	45cd5a79b2	[WebAssembly] Print an extra local decl when the user stack pointer is used Differential Revision: http://reviews.llvm.org/D15546 llvm-svn: 255815	2015-12-16 20:43:06 +00:00
Krzysztof Parzyszek	4f9164d9b3	[Hexagon] Misc fixes to r255807 llvm-svn: 255811	2015-12-16 20:07:04 +00:00
Paul Robinson	6c27a2c40e	Set debugger tuning from TargetOptions (NFC) Differential Revision: http://reviews.llvm.org/D15427 llvm-svn: 255810	2015-12-16 19:58:30 +00:00
Krzysztof Parzyszek	56bbf54b43	[Hexagon] Update the Hexagon packetizer llvm-svn: 255807	2015-12-16 19:36:12 +00:00
Reid Kleckner	187d33ee74	Revert "[ARM] Add ARMv8.2-A FP16 scalar instructions" This reverts commit r255762. llvm-svn: 255806	2015-12-16 19:21:03 +00:00
Dan Gohman	b3aa1ecab0	[WebAssembly] Fix the CFG Stackifier to handle unoptimized branches If a branch both branches to and falls through to the same block, treat it as an explicit branch. llvm-svn: 255803	2015-12-16 19:06:41 +00:00
Justin Bogner	883a3ea67f	LPM: Make callers of LPM.deleteLoopFromQueue update LoopInfo directly. NFC As of r255720, the loop pass manager will DTRT when passes update the loop info for removed loops, so they no longer need to reach into LPPassManager APIs to do this kind of transformation. This change very nearly removes the need for the LPPassManager to even be passed into loop passes - the only remaining pass that uses the LPM argument is LoopUnswitch. llvm-svn: 255797	2015-12-16 18:40:20 +00:00
Matt Arsenault	e05ff15186	AMDGPU: Override getCFInstrCost The default cost was 0 with the assumption that it is predictable. llvm-svn: 255796	2015-12-16 18:37:19 +00:00
Tom Stellard	5ce530608f	MachineScheduler: Add a target hook for deciding which RegPressure sets to increase Summary: This patch adds a function called getRegPressureSetScore() to TargetRegisterInfo. The MachineScheduler uses this when comparing instructions that increase the register pressure of different sets to determine which set is safer to increase. This hook is useful for GPU targets where the number of registers in the class is not the best metric for determining which pressure set is safer to increase. Future work may include adding more parameters to this function, like for example, the current pressure level of the set or the amount that the pressure will be increased/decreased. Reviewers: qcolombet, escha, arsenm, atrick, MatzeB Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14806 llvm-svn: 255795	2015-12-16 18:31:01 +00:00
Charlie Turner	5b8895b496	[SLPVectorizer] Ensure dominated reduction values. When considering incoming values as part of a reduction phi, ensure the incoming value is dominated by said phi. Failing to ensure this property causes miscompiles. Fixes PR25787. Many thanks to Mattias Eriksson for reporting, reducing and analyzing the problem for me. Differential Revision: http://reviews.llvm.org/D15580 llvm-svn: 255792	2015-12-16 18:23:44 +00:00
Dan Gohman	e2831b4e27	[WebAssembly] Use the new offset syntax for memory operands in inline asm. llvm-svn: 255788	2015-12-16 18:14:49 +00:00
Ulrich Weigand	88a7a2eac7	[SystemZ] Sort relocs to avoid code corruption by linker optimization The SystemZ linkers provide an optimization to transform a general- or local-dynamic TLS sequence into an initial-exec sequence if possible. To do that, the compiler generates a function call to __tls_get_offset, which is a brasl instruction annotated with two relocations: - a R_390_PLT32DBL to install __tls_get_offset as branch target - a R_390_TLS_GDCALL / R_390_TLS_LDCALL to inform the linker that the TLS optimization should be performed if possible If the optimization is performed, the brasl is replaced by an ld load instruction. However, both relocs are processed independently by the linker. Therefore it is crucial that the R_390_PLT32DBL is processed first (installing the branch target for the brasl) and the R_390_TLS_GDCALL is processed second (replacing the whole brasl with an ld). If the relocs are swapped, the linker will first replace the brasl with an ld, and then install the __tls_get_offset branch target offset. Since ld has a different layout than brasl, this may even result in a completely different (or invalid) instruction; in any case, the resulting code is corrupted. Unfortunately, the way the MC common code sorts relocations causes these two to always end up the wrong way around, resulting in wrong code generation by the linker and crashes. This patch overrides the sortRelocs routine to detect this particular pair of relocs and enforce the required order. llvm-svn: 255787	2015-12-16 18:12:40 +00:00
Ulrich Weigand	47f3649374	[SystemZ] Fix assertion failure in adjustSubwordCmp When comparing a zero-extended value against a constant small enough to be in range of the inner type, it doesn't matter whether a signed or unsigned compare operation (for the outer type) is being used. This is why the code in adjustSubwordCmp had this assertion: assert(C.ICmpType == SystemZICMP::Any && "Signedness shouldn't matter here."); assuming the caller had already detected that fact. However, it turns out that there are cases, in particular with always-true or always-false conditions that have not been eliminated when compiling at -O0, where this is not true. Instead of failing an assertion if C.ICmpType is not SystemZICMP::Any here, we can simply set it safely to SystemZICMP::Any, however. llvm-svn: 255786	2015-12-16 18:04:06 +00:00
Tobias Edler von Koch	b51460cf86	[Hexagon] Make memcpy lowering thread-safe This removes an unpleasant hack involving a global variable for special lowering of certain memcpy calls. These are now lowered as intended in EmitTargetCodeForMemcpy in the same way that other targets do it. llvm-svn: 255785	2015-12-16 17:29:37 +00:00
Dan Gohman	30a42bf585	[WebAssembly] Support more kinds of inline asm operands llvm-svn: 255782	2015-12-16 17:15:17 +00:00
Krzysztof Parzyszek	2005d7dc01	[Packetizer] Add a check whether an instruction should be packetized now Add a function VLIWPacketizerList::shouldAddToPacket, which will allow specific implementations to decide if it is profitable to add given instruction to the current packet. llvm-svn: 255780	2015-12-16 16:38:16 +00:00
Vaivaswatha Nagaraj	fb3f4907c0	Add InaccessibleMemOnly and inaccessibleMemOrArgMemOnly attributes Summary: This patch introduces two new function attributes InaccessibleMemOnly: This attribute indicates that the function may only access memory that is not accessible by the program/IR being compiled. This is a weaker form of ReadNone. inaccessibleMemOrArgMemOnly: This attribute indicates that the function may only access memory that is either not accessible by the program/IR being compiled, or is pointed to by its pointer arguments. This is a weaker form of ArgMemOnly Test cases have been updated. This revision uses this (`d001932f3a`) as reference. Reviewers: jmolloy, hfinkel Subscribers: reames, joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D15499 llvm-svn: 255778	2015-12-16 16:16:19 +00:00
James Molloy	3d21dcf3ed	[SimplifyCFG] Don't create unnecessary PHIs In conditional store merging, we were creating PHIs when we didn't need to. If the value to be predicated isn't defined in the block we're predicating, then it doesn't need a PHI at all (because we only deal with triangles and diamonds, any value not in the predicated BB must dominate the predicated BB). This fixes a large code size increase in some benchmarks in a popular embedded benchmark suite. Now with a fix (and fixed tests) for the conformance issue seen in Chromium. llvm-svn: 255767	2015-12-16 14:12:44 +00:00
Oliver Stannard	2de8c16913	[ARM] Add ARMv8.2-A FP16 vector instructions ARMv8.2-A adds 16-bit floating point versions of all existing SIMD floating-point instructions. This is an optional extension, so all of these instructions require the FeatureFullFP16 subtarget feature. Note that VFP without SIMD is not a valid combination for any version of ARMv8-A, but I have ensured that these instructions all depend on both FeatureNEON and FeatureFullFP16 for consistency. Differential Revision: http://reviews.llvm.org/D15039 llvm-svn: 255764	2015-12-16 12:37:39 +00:00
Oliver Stannard	48568cbe18	[ARM] Add ARMv8.2-A FP16 scalar instructions ARMv8.2-A adds 16-bit floating point versions of all existing VFP floating-point instructions. This is an optional extension, so all of these instructions require the FeatureFullFP16 subtarget feature. The assembly for these instructions uses S registers (AArch32 does not have H registers), but the instructions have ".f16" type specifiers rather than ".f32" or ".f64". The top 16 bits of each source register are ignored, and the top 16 bits of the destination register are set to zero. These instructions are mostly the same as the 32- and 64-bit versions, but they use coprocessor 9 rather than 10 and 11. Two new instructions, VMOVX and VINS, have been added to allow packing and extracting two 16-bit floats stored in the top and bottom halves of an S register. New fixup kinds have been added for the PC-relative load and store instructions, but no ELF relocations have been added as they have a range of 512 bytes. Differential Revision: http://reviews.llvm.org/D15038 llvm-svn: 255762	2015-12-16 11:35:44 +00:00
Michael Kuperstein	e75e6e2a23	[X86] Improve shift combining This folds (ashr (shl a, [56,48,32,24,16]), SarConst) into (shl, (sext (a), [56,48,32,24,16] - SarConst)) or into (lshr, (sext (a), SarConst - [56,48,32,24,16])) depending on sign of (SarConst - [56,48,32,24,16]) sexts in X86 are MOVs. The MOVs have the same code size as above SHIFTs (only SHIFT by 1 has lower code size). However the MOVs have 2 advantages to SHIFTs on x86: 1. MOVs can write to a register that differs from source. 2. MOVs accept memory operands. This fixes PR24373. Patch by: evgeny.v.stupachenko@intel.com Differential Revision: http://reviews.llvm.org/D13161 llvm-svn: 255761	2015-12-16 11:22:37 +00:00
Keno Fischer	94f181a45f	[SectionMemoryManager] Make better use of virtual memory Summary: On Windows, the allocation granularity can be significantly larger than a page (64K), so with many small objects, just clearing the FreeMem list rapidly leaks quite a bit of virtual memory space (if not rss). Fix that by only removing those parts of the FreeMem blocks that overlap pages for which we are applying memory permissions, rather than dropping the FreeMem blocks entirely. Reviewers: lhames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15202 llvm-svn: 255760	2015-12-16 11:13:23 +00:00
Vikram TV	859ad29b52	Recommit LiveDebugValues pass after fixing a couple of minor issues. llvm-svn: 255759	2015-12-16 11:09:48 +00:00
Cong Hou	08ec3d91bb	Minor change to TailDuplication.cpp to turn on normalization when removing successor llvm-svn: 255752	2015-12-16 06:03:30 +00:00
George Burgess IV	500d3039d7	Minor cleanup of Attribute code. NFC. llvm-svn: 255751	2015-12-16 05:21:02 +00:00
Chen Li	3e8330a1fe	[SelectionDAGBuilder] Adds support for landingpads of token type Summary: This patch adds a check in visitLandingPad to see if landingpad's result type is token type. If so, do not create DAG nodes for its exception pointer and selector value. This patch enables the back end to handle landingpads of token type. Reviewers: JosephTremoulet, majnemer, rnk Subscribers: sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D15405 llvm-svn: 255749	2015-12-16 04:48:42 +00:00
Peter Collingbourne	16c1978760	Fuzzer: Fix library dependencies. Newer versions of libstdc++ (4.9+), as well as libc++, depend directly on libpthread from the standard library headers, so libfuzzer needs to declare a standard library dependency. llvm-svn: 255745	2015-12-16 02:14:57 +00:00
Philip Reames	23319014a9	Speculative fix for windows build llvm-svn: 255743	2015-12-16 01:24:05 +00:00
Philip Reames	ae1f265bf1	[EarlyCSE] DSE of stores which write back loaded values Extend EarlyCSE with an additional style of dead store elimination. If we write back a value just read from that memory location, we can eliminate the store under the assumption that the value hasn't changed. I'm implementing this mostly because I noticed the omission when looking at the code. It seemed strange to have InstCombine have a peephole which was more powerful than EarlyCSE. :) Differential Revision: http://reviews.llvm.org/D15397 llvm-svn: 255739	2015-12-16 01:01:30 +00:00
Philip Reames	61a24ab6cc	[IR] Add support for floating point atomic loads and stores This patch allows atomic loads and stores of floating point to be specified in the IR and adds an adapter to allow them to be lowered via existing backend support for bitcast-to-equivalent-integer idiom. Previously, the only way to specify an atomic float operation was to bitcast the pointer to a i32, load the value as an i32, then bitcast to a float. At its most basic, this patch simply moves this expansion step to the point we start lowering to the backend. This patch does not add canonicalization rules to convert the bitcast idioms to the appropriate atomic loads. I plan to do that in the future, but for now, let's simply add the support. I'd like to get instruction selection working through at least one backend (x86-64) without the bitcast conversion before canonicalizing into this form. Similarly, I haven't yet added the target hooks to opt out of the lowering step I added to AtomicExpand. I figured it would make more sense to add those once at least one backend (x86) was ready to actually opt out. As you can see from the included tests, the generated code quality is not great. I plan on submitting some patches to fix this, but help from others along that line would be very welcome. I'm not super familiar with the backend and my ramp up time may be material. Differential Revision: http://reviews.llvm.org/D15471 llvm-svn: 255737	2015-12-16 00:49:36 +00:00
Justin Bogner	e0fde5c6d0	Fix typo in r255720 llvm-svn: 255724	2015-12-16 00:17:34 +00:00
Wolfgang Pieb	60b7ca6713	Test commit: fixed spelling error in comment. llvm-svn: 255721	2015-12-16 00:08:18 +00:00
Justin Bogner	6e9810c8ef	LPM: Simplify how passes mark loops for deletion. NFC When a pass removes a loop it currently has to reach up into the LPPassManager's internals to update the state of the iteration over loops. This reverse dependency results in a pretty awkward interplay of the LPPassManager and its Passes. Here, we change this to instead keep track of when a loop has become "unlooped" in the Loop objects themselves, then the LPPassManager can check this and manipulate its own state directly. This opens the door to allow most of the loop passes to work without a backreference to the LPPassManager. I've kept passes calling the LPPassManager::deleteLoopFromQueue API now so I could put an assert in to prove that this is NFC, but a later pass will update passes just to preserve the LoopInfo directly and stop referencing the LPPassManager completely. llvm-svn: 255720	2015-12-16 00:01:02 +00:00
Richard Trieu	8f3118f449	Remove one of the void casts used to suppress unused variable warning. llvm-svn: 255709	2015-12-15 23:47:17 +00:00
Reid Kleckner	7850c9f5ca	[WinEH] Make llvm.x86.seh.recoverfp work on x64 It adjusts from RSP-after-prologue to RBP, which is what SEH filters need to do before they can use llvm.localrecover. Fixes SEH filter captures, which were broken in r250088. Issue reported by Alex Crichton. llvm-svn: 255707	2015-12-15 23:40:58 +00:00
Evgeniy Stepanov	4059eeca82	Suppress unused variable warning in the no-asserts build. llvm-svn: 255706	2015-12-15 23:30:29 +00:00
Richard Trieu	fc69e7d65b	Cast variable to void to resolve unused variable warning in non-asserts builds. llvm-svn: 255704	2015-12-15 23:25:34 +00:00
Hans Wennborg	7036e503d7	Fix "Not having LAHF/SAHF" assert. It wants to assert that the subtarget is 64-bit, not the register. llvm-svn: 255703	2015-12-15 23:21:46 +00:00
Tom Stellard	7750f4ed9e	AMDGPU/SI: Set the code object work group segment size when targeting HSA Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15493 llvm-svn: 255702	2015-12-15 23:15:25 +00:00
Sanjay Patel	271efcdf20	[x86] inline calls to fmaxf / llvm.maxnum.f32 using maxss (PR24475) This patch improves on the suggested codegen from PR24475: https://llvm.org/bugs/show_bug.cgi?id=24475 but only for the fmaxf() case to start, so we can sort out any bugs before extending to fmin, f64, and vectors. The fmax / maxnum definitions provide us flexibility for signed zeros, so the only thing we have to worry about in this replacement sequence is NaN handling. Note 1: It may be better to implement this as lowerFMAXNUM(), but that exposes a problem: SelectionDAGBuilder::visitSelect() transforms compare/select instructions into FMAXNUM nodes if we declare FMAXNUM legal or custom. Perhaps that should be checking for NaN inputs or global unsafe-math before transforming? As it stands, that bypasses a big set of optimizations that the x86 backend already has in PerformSELECTCombine(). Note 2: The v2f32 test reveals another bug; the vector is extended to v4f32, so we have completely unnecessary operations happening on undef elements of the vector. Differential Revision: http://reviews.llvm.org/D15294 llvm-svn: 255700	2015-12-15 23:11:43 +00:00
James Y Knight	99fcb721b2	[Sparc] Tweak r255668: Use llvm_unreachable. llvm-svn: 255698	2015-12-15 23:07:16 +00:00
Evgeniy Stepanov	67849d56c3	Cross-DSO control flow integrity (LLVM part). An LTO pass that generates a __cfi_check() function that validates a call based on a hash of the call-site-known type and the target pointer. llvm-svn: 255693	2015-12-15 23:00:08 +00:00
Tom Stellard	a495307e5e	AMDGPU/SI: Set the code objects private segment size when targeting HSA. Summary: I'm not sure how things worked before without this. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15492 llvm-svn: 255692	2015-12-15 22:55:30 +00:00
Cong Hou	a73ffa2206	[LoopVectorizer] Refine loop vectorizer's register usage calculator by ignoring specific instructions. (This is the third attempt to check in this patch, and the first two are r255454 and r255460. The once failed test file reg-usage.ll is now moved to test/Transform/LoopVectorize/X86 directory with target datalayout and target triple indicated.) LoopVectorizationCostModel::calculateRegisterUsage() is used to estimate the register usage for specific VFs. However, it takes into account many instructions that won't be vectorized, such as induction variables, GetElementPtr instruction, etc. This makes the loop vectorizer too conservative when choosing VF. In this patch, the induction variables that won't be vectorized plus GetElementPtr instruction will be added to ValuesToIgnore set so that their register usage won't be considered any more. Differential revision: http://reviews.llvm.org/D15177 llvm-svn: 255691	2015-12-15 22:45:09 +00:00
Tom Stellard	29dd05e92f	AMDGPU/SI: Emit constant variables in the .hsatext section when targeting HSA Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15426 llvm-svn: 255689	2015-12-15 22:39:36 +00:00
Dan Gohman	4b9d7916ee	[WebAssembly] Implement instruction selection for constant offsets in addresses. Add instruction patterns for matching load and store instructions with constant offsets in addresses. The code is fairly redundant due to the need to replicate everything between imm, tglobaldadr, and texternalsym, but this appears to be common tablegen practice. The main alternative appears to be to introduce matching functions with C++ code, but sticking with purely generated matchers seems better for now. Also note that this doesn't yet support offsets from getelementptr, which will be the most common case; that will depend on a change in target-independent code in order to set the NoUnsignedWrap flag, which I'll submit separately. Until then, the testcase uses ptrtoint+add+inttoptr with a nuw on the add. Also implement isLegalAddressingMode with an approximation of this. Differential Revision: http://reviews.llvm.org/D15538 llvm-svn: 255681	2015-12-15 22:01:29 +00:00
Xinliang David Li	38b9a32fcd	Initialize all bytes in vp data (msan error) llvm-svn: 255680	2015-12-15 21:57:08 +00:00
Reid Kleckner	d7045faa10	[WinEH] Remove unused intrinsic llvm.x86.seh.restoreframe We can clean this up now that we have the X86 CATCHRET instruction to restore the FP, SP, and BP. llvm-svn: 255677	2015-12-15 21:41:34 +00:00
David Majnemer	3bb88c0210	[WinEH] Use operand bundles to describe call sites SimplifyCFG allows tail merging with code which terminates in unreachable which, in turn, makes it possible for an invoke to end up in a funclet which it was not originally part of. Using operand bundles on invokes allows us to determine whether or not an invoke was part of a funclet in the source program. Furthermore, it allows us to unambiguously answer questions about the legality of inlining into call sites which the personality may have trouble with. Differential Revision: http://reviews.llvm.org/D15517 llvm-svn: 255674	2015-12-15 21:27:27 +00:00
Tom Stellard	a6f24c6565	AMDGPU/SI: Select constant loads with non-uniform addresses to MUBUF instructions Summary: We were previously selecting all constant loads to SMRD instructions and legalizing the SMRDs with non-uniform addresses during the SIFixSGPRCopesPass. This new solution is simpler and also generates much better code, because the instruction selector is able to take advantage of all the MUBUF addressing modes that our legalization pass wasn't able to. We also no longer need to generate v_add_* instructions when we have a uniform pointer and a non-uniform offset, as this is now folded into the MUBUF instruction during instruction selection. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15425 llvm-svn: 255672	2015-12-15 20:55:55 +00:00
Xinliang David Li	4ec401406e	Coverage code refactoring /NFC llvm-svn: 255670	2015-12-15 19:44:45 +00:00
Justin Bogner	843fb204b7	LPM: Stop threading `Pass *` through all of the loop utility APIs. NFC A large number of loop utility functions take a `Pass *` and reach into it to find out which analyses to preserve. There are a number of problems with this: - The APIs have access to pretty well any Pass state they want, so it's hard to tell what they may or may not do. - Other APIs have copied these and pass around a `Pass *` even though they don't even use it. Some of these just hand a nullptr to the API since the callers don't even have a pass available. - Passes in the new pass manager don't work like the current ones, so the APIs can't be used as is there. Instead, we should explicitly thread the analysis results that we actually care about through these APIs. This is both simpler and more reusable. llvm-svn: 255669	2015-12-15 19:40:57 +00:00
James Y Knight	33beb24318	[Sparc] Fix handling of double incoming arguments on sparc little-endian. On SparcV8, doubles get passed in two 32-bit integer registers. The call code was already handling endianness correctly, but the incoming argument code was not -- it got the two halves in opposite order. Also remove some dead code in LowerFormalArguments_32 to handle less-than-32bit values, which can't actually happen. Finally, add some test cases for the 32-bit calling convention, cribbed from the 64abi.ll test, and run for both big and little-endian. llvm-svn: 255668	2015-12-15 19:23:12 +00:00
Michael Kuperstein	53946bf8c6	[X86] MOVPC32r should only emit CFI adjustments when needed We only want to emit CFI adjustments when actually using DWARF. This fixes PR25828. Differential Revision: http://reviews.llvm.org/D15522 llvm-svn: 255664	2015-12-15 18:50:32 +00:00
Tom Stellard	dbe374b2c5	AMDGPU/SI: Implement AMDGPUTargetTransformInfo::isSourceOfDivergence() Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15476 llvm-svn: 255661	2015-12-15 18:04:38 +00:00
Sanjay Patel	38a022623a	[SimplifyCFG] allow speculation of exactly one expensive instruction (PR24818) This is the last general step to allow more IR-level speculation with a safety harness in place in CodeGenPrepare. The intent is to restore the behavior enabled by: http://reviews.llvm.org/rL228826 but prevent bad performance such as: https://llvm.org/bugs/show_bug.cgi?id=24818 Earlier patches in this sequence: D12882 (disable SimplifyCFG speculation for expensive instructions) D13297 (have CGP despeculate expensive ops) D14630 (have CGP despeculate special versions of cttz/ctlz) As shown in the test cases, we only have two instructions currently affected: ctz for some x86 and fdiv generally. Allowing exactly one expensive instruction is a bit of a hack, but it lines up with what is currently implemented in CGP. If we make the despeculation more general in CGP, we can make the speculation here more liberal. A follow-up patch will adjust the cost for sqrt and possibly other typically expensive math intrinsics (currently everything is cheap by default). GPU targets would likely want to override those expensive default costs (just as they probably should already override the cost of div/rem) because just about any math is cheaper than control-flow on those targets. Differential Revision: http://reviews.llvm.org/D15213 llvm-svn: 255660	2015-12-15 17:38:29 +00:00
Nathan Slingerland	7f5b47ddd4	[llvm-profdata] Add support for weighted merge of profile data (2nd try) Summary: This change adds support for specifying a weight when merging profile data with the llvm-profdata tool. Weights are specified by using the --weighted-input=<weight>,<filename> option. Input files not specified with this option (normal positional list after options) are given a default weight of 1. Adding support for arbitrary weighting of input profile data allows for relative importance to be placed on the input data from multiple training runs. Both sampled and instrumented profiles are supported. Reviewers: davidxl, dnovillo, bogner, silvas Subscribers: silvas, davidxl, llvm-commits Differential Revision: http://reviews.llvm.org/D15306 llvm-svn: 255659	2015-12-15 17:37:09 +00:00
Nicolai Hahnle	78fd4f087b	AMDGPU: mark ldexp LibCalls as unavailable Summary: The LibCallSimplifier will turn llvm.exp2.* intrinsics into ldexp* libcalls which do not make sense with the AMDGPU backend. In the long run, we'll want an llvm.ldexp.* intrinsic to properly make use of this optimization, but this works around the problem for now. See also: http://reviews.llvm.org/D14327 (suggested llvm.ldexp.* implementation) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=92709 Reviewers: arsenm, tstellarAMD Differential Revision: http://reviews.llvm.org/D14990 llvm-svn: 255658	2015-12-15 17:24:15 +00:00
Tom Stellard	8f307217c3	AMDGPU/SI: Fix bitcast between v2f32 and f64 The radeonsi fp64 support can hit these now that some redundant bitcasts are folded. Patch by: Michel Dänzer Signed-off-by: Michel Dänzer <michel.daenzer@amd.com> llvm-svn: 255657	2015-12-15 17:11:17 +00:00
Hans Wennborg	08d5905bac	[X86] Smaller code for materializing 32-bit 1 and -1 constants "movl $-1, %eax" is 5 bytes, "xorl %eax, %eax; decl %eax" is 3 bytes. This commit makes LLVM use the latter when optimizing for size. Differential Revision: http://reviews.llvm.org/D14971 llvm-svn: 255656	2015-12-15 17:10:28 +00:00
JF Bastien	dac806c783	WebAssembly: update expected torture test failures We now have 252 expected failures. llvm-svn: 255654	2015-12-15 17:07:07 +00:00


85863 Commits