This will avoid a link-time error, as the default constructor of RegisterBankInfo is
the only one available when GlobalISel is not built.
llvm-svn: 265549
This recommits the patch below after fixing an invalid memory reference that
happened when the DenseMap grew and moved memory. I verified it fixed the bootstrap
problem on x86_64-linux-gnu, but I cannot verify whether it fixes
the bootstrap error on clang-ppc64be-linux. I will watch the build-bot
results closely.
Replace analyzeSiblingValues with a new algorithm to fix its compile
time issue. The patch solves PR17409 and its duplicates.
analyzeSiblingValues is an N x N complexity algorithm, where N is
the number of siblings generated by register splitting. Although it
causes a significant compile time issue when N is large, it is also
important for performance since it removes redundant spills and
enables rematerialization.
To solve the compile time issue, the patch removes analyzeSiblingValues
and replaces it with lower cost alternatives containing two parts. The
first part creates a new spill hoisting method in postOptimization of
register allocation. It does spill hoisting once, after all the spills
are generated, instead of inside every instance of selectOrSplit. The
second part queries the defining expression of the original register for
rematerialization and keeps it always available during register allocation
even if it is already dead. It deletes those dead instructions only in
postOptimization. With the two parts in the patch, we can remove
analyzeSiblingValues without sacrificing performance.
Differential Revision: http://reviews.llvm.org/D15302
llvm-svn: 265547
Summary:
When the backedge taken condition is computed from an icmp, SCEV can
deduce the backedge taken count only if one of the sides of the icmp
is an AddRecExpr. However, due to sign/zero extensions, we sometimes
end up with something that is not an AddRecExpr.
However, we can use SCEV predicates to produce a 'guarded' expression.
This change adds a method to SCEV to get this expression, and the
SCEV predicate associated with it.
In HowManyGreaterThans and HowManyLessThans we will now add a SCEV
predicate associated with the guarded backedge taken count when the
analyzed SCEV expression is not an AddRecExpr. Note that we only do
this as an alternative to returning a 'CouldNotCompute'.
We use this new feature in Loop Access Analysis and LoopVectorize to analyze
and transform more loops.
Reviewers: anemet, mzolotukhin, hfinkel, sanjoy
Subscribers: flyingforyou, mcrosier, atrick, mssimpso, sanjoy, mzolotukhin, llvm-commits
Differential Revision: http://reviews.llvm.org/D17201
llvm-svn: 265535
AArch64InstrInfo::optimizeCompareInstr has a bug which causes generation of incorrect code (PR#27158).
The patch refactors the function to simplify reviewing the fix of the bug.
1. The function name 'modifiesConditionCode' is changed to 'areCFlagsAccessedBetweenInstrs'
to reflect that the function can check modifying accesses, reading accesses, or both.
2. Function 'AArch64InstrInfo::optimizeCompareInstr'
- Documented the function
- Cmp_NZCV is renamed to DeadNZCVIdx to reflect that it is the operand index of the dead NZCV
- The code for the case of substituting CmpInstr is put into separate
functions, the main one being 'substituteCmpInstr'.
Differential Revision: http://reviews.llvm.org/D18609
llvm-svn: 265531
To quote the langref "Unlike sqrt in libm, however, llvm.sqrt has
undefined behavior for negative numbers other than -0.0 (which allows
for better optimization, because there is no need to worry about errno
being set). llvm.sqrt(-0.0) is defined to return -0.0 like IEEE sqrt."
This means that it's unsafe to replace sqrt with llvm.sqrt unless the
call is annotated with nnan.
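For illustration, a minimal IR sketch (hypothetical functions, not from the patch's tests) of when the rewrite is and is not allowed:
```
declare double @sqrt(double)
declare double @llvm.sqrt.f64(double)

define double @ok_to_replace(double %x) {
  ; nnan excludes the negative-input case, so rewriting to @llvm.sqrt.f64 is safe
  %r = call nnan double @sqrt(double %x)
  ret double %r
}

define double @keep_libm_call(double %x) {
  ; without nnan, %x may be negative and llvm.sqrt would have undefined behavior
  %r = call double @sqrt(double %x)
  ret double %r
}
```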
Thanks to Hal Finkel for pointing this out!
llvm-svn: 265521
Instead of copying arguments from the source function to the
destination, steal them. This has a few advantages.
- The ValueMap doesn't need to be seeded with (or cleared of)
Arguments.
- Often the destination function won't have created any arguments yet,
so this avoids malloc traffic.
- Argument names don't need to be copied.
Because argument lists are lazy, this required a new
Function::stealArgumentListFrom helper.
llvm-svn: 265519
r265273 added Mapper::mapBlockAddress, which delays mapping a
blockaddress value until the function has a body. The condition was
backwards, and should be checking Function::empty instead of
GlobalValue::isDeclaration.
llvm-svn: 265508
Instead of crashing, give a nice error. As a drive-by, fix the location
associated with the errors for unresolved metadata (the location was off
by one token).
llvm-svn: 265507
This patch enables sibling call optimization on the ppc64 ELFv1/ELFv2 ABIs and
adds a couple of test cases. The patch also passed the llvm/clang bootstrap
test and spec2006 build/run/result validation.
Original issue: https://llvm.org/bugs/show_bug.cgi?id=25617
Great thanks to Tom (tjablin) for his help; he contributed a lot to this patch.
Thanks to Hal and Kit for their invaluable opinions!
Reviewers: hfinkel kbarton
http://reviews.llvm.org/D16315
llvm-svn: 265506
This patch implements the following Book II and Book III instructions:
- copy copy_first cp_abort paste paste. paste_last
- msgsync
- slbieg slbsync
- stop
Total 10 instructions
Reviewers: nemanjai hfinkel tjablin amehsan kbarton
llvm-svn: 265504
While preserving the return value for @llvm.experimental.deoptimize at
the IR level is useful during mid-level optimization, doing so at the
machine instruction level requires generating some extra code and a
return that is non-ideal. This change has LLVM lower
```
%val = call @llvm.experimental.deoptimize
ret %val
```
to effectively
```
call @__llvm_deoptimize()
unreachable
```
instead.
llvm-svn: 265502
As part of the TRI argument of addRegBankCoverage we already have access to
the TargetRegisterClass through the ID of that register class.
Therefore, there is no point in needing a TargetRegisterClass instance;
the ID is enough to get to it.
llvm-svn: 265487
Don't emit a gc.result for a statepoint lowered from
@llvm.experimental.deoptimize since the call into __llvm_deoptimize is
effectively noreturn. Instead follow the corresponding gc.statepoint
with an "unreachable".
llvm-svn: 265485
Bionic has a defined thread-local location for the stack protector
cookie. Emit a direct load instead of going through __stack_chk_guard.
llvm-svn: 265481
Prior to this patch, CFLAA wouldn't tag arguments/globals properly if
it didn't find any "interesting" edges on them. This means that, if all
you do is store constants to a global or argument, we would never
actually treat it as a global/argument.
Test case:
define void @foo(i32* %A, i32* %B) #0 {
entry:
store i32 0, i32* %A, align 4
store i32 0, i32* %B, align 4
ret void
}
CFLAA would say that %A can't alias %B, because neither pointer was
used in an interesting way. This patch makes us note whether something
is an argument, global, ... regardless of how interesting CFLAA thinks
its uses are.
(For the record, using a value in an interesting way means loading
from it, using it in a GEP, ...)
llvm-svn: 265474
Add a common parent class for ConstantArray, ConstantVector, and
ConstantStruct called ConstantAggregate. These are the aggregate
subclasses of Constant that take operands.
This is mainly a cleanup, adding a common `isa` target and removing
duplicated code. However, it also simplifies caching which constants
point transitively at `GlobalValue` (a possible future direction).
llvm-svn: 265466
Change the default constructor to create an invalid object.
The target will have to properly initialize the register banks before
using them.
llvm-svn: 265460
This commit completely rewrites Mapper::mapMetadata (the implementation
of llvm::MapMetadata) using an iterative algorithm. The guts of the new
algorithm are in MDNodeMapper::map, the entry function in a new class.
Previously, Mapper::mapMetadata performed a recursive exploration of the
graph with eager "just in case there's a reason" malloc traffic.
The new algorithm has these benefits:
- New nodes and temporaries are not created eagerly.
- Uniquing cycles are not duplicated (see new unit test).
- No recursion.
Given a node to map, it does this:
1. Use a worklist to perform a post-order traversal of the transitively
referenced unmapped nodes.
2. Track which nodes will change operands, and which will have new
addresses in the mapped scheme. Propagate the changes through the
POT until fixed point, to pick up uniquing cycles that need to
change.
3. Map all the distinct nodes without touching their operands. If
RF_MoveDistinctMetadata, they get mapped to themselves; otherwise,
they get mapped to clones.
4. Map the uniqued nodes (bottom-up), lazily creating temporaries for
forward references as needed.
5. Remap the operands of the distinct nodes.
Mehdi helped me out by profiling this with -flto=thin. On his workload
(importing/etc. for opt.cpp), MapMetadata sped up by 15%, contributed
about 50% less to persistent memory, and made about 100x fewer calls to
malloc. The speedup is less than I'd hoped. The profile mainly blames
DenseMap lookups; perhaps there's a way to reduce them (e.g., by
disallowing remapping of MDString).
It would be nice to break the strange remaining recursion on the Value
side: MapValue => materializeInitFor => RemapInstruction => MapValue. I
think we could do this by having materializeInitFor return a worklist of
things to be remapped.
llvm-svn: 265456
Some Include What You Use suggestions were used too.
Use anonymous namespaces in source files.
Differential revision: http://reviews.llvm.org/D18778
llvm-svn: 265454
We only generate LOCKed versions of add/sub when the result is unused.
It often happens that the result is used, but only by a comparison. We
can optimize those out by reusing EFLAGS, which lets us use the proper
instructions, instead of having to fall back to LXADD.
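For illustration, a hypothetical IR pattern of the kind this targets (the atomic RMW result is only consumed by a comparison):
```
define i1 @dec_and_test(i32* %p) {
  ; the only use of the updated value is the compare against zero, so the
  ; EFLAGS set by a locked SUB can be reused instead of LXADD + CMP
  %old = atomicrmw sub i32* %p, i32 1 seq_cst
  %new = sub i32 %old, 1
  %z = icmp eq i32 %new, 0
  ret i1 %z
}
```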
Instead of doing this as an MI peephole (as we do for the other
non-LOCKed (really, non-MR) forms), do it in ISel. It becomes quite
tricky later.
This also makes it eventually possible to stop expanding and/or/xor
if the only user is an icmp (also see D18141).
This uses the LOCK ISD opcodes added by r262244.
Differential Revision: http://reviews.llvm.org/D17633
llvm-svn: 265450
At IR level, the swifterror argument is an input argument with type
ErrorObject**. For targets that support swifterror, we want to optimize it
to behave as an inout value with type ErrorObject*; it will be passed in a
fixed physical register.
The main idea is to track the virtual registers for each swifterror value. We
define swifterror values as AllocaInsts with swifterror attribute or a function
argument with swifterror attribute.
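As a rough sketch (hypothetical types and function names), the two forms of swifterror values look like this:
```
%swift.error = type opaque

declare void @may_fail(%swift.error** swifterror %err)

define void @caller() {
entry:
  ; swifterror alloca: tracked via virtual registers instead of a stack slot
  %err = alloca swifterror %swift.error*, align 8
  store %swift.error* null, %swift.error** %err
  ; swifterror argument: passed in a fixed physical register on supporting targets
  call void @may_fail(%swift.error** swifterror %err)
  ret void
}
```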
In SelectionDAGISel.cpp, we set up swifterror values (SwiftErrorVals) before
handling the basic blocks.
When iterating over all basic blocks in RPO, before actually visiting the basic
block, we call mergeIncomingSwiftErrors to merge incoming swifterror values when
there are multiple predecessors or to simply propagate them. There, we create a
virtual register for each swifterror value in the entry block. For predecessors
that are not yet visited, we create virtual registers to hold the swifterror
values at the end of the predecessor. The assignments are saved in
SwiftErrorWorklist and will be materialized at the end of visiting the basic
block.
When visiting a load from a swifterror value, we copy from the current virtual
register assignment. When visiting a store to a swifterror value, we create a
virtual register to hold the swifterror value and update SwiftErrorMap to
track the current virtual register assignment.
Differential Revision: http://reviews.llvm.org/D18108
llvm-svn: 265433
Summary: LanaiSetflagAluCombiner could previously combine instructions across basic blocks even when it was not legal to do so. Make the LanaiSetflagAluCombiner more conservative to avoid this.
Reviewers: eliben
Subscribers: joker.eph, llvm-commits
Differential Revision: http://reviews.llvm.org/D18746
llvm-svn: 265411
Removed the SDNode argument passed to the AI_smul and AI_smla multiclass
definitions, as it is always mul.
Differential Revision: http://reviews.llvm.org/D18791
llvm-svn: 265409
Presently, CodeGenPrepare deletes all nearly empty (only phi and branch)
basic blocks. This pass can delete loop preheaders which frequently creates
critical edges. A preheader can be a convenient place to spill registers to
the stack. If the entrance to a loop body is a critical edge, then spills
may occur in the loop body rather than immediately before it. This patch
protects loop preheaders from deletion in CodeGenPrepare even if they are
nearly empty.
Since the patch alters the CFG, it affects a large number of test cases.
In most cases, the changes are merely cosmetic (basic blocks have different
names or instruction orders change slightly). I am somewhat concerned about
the test/CodeGen/Mips/brdelayslot.ll test case. If the loop preheader is not
deleted, then the MIPS backend does not take advantage of a branch delay
slot. Consequently, I would like some close review by a MIPS expert.
The patch also partially subsumes D16893 from George Burgess IV. George
correctly notes that CodeGenPrepare does not actually preserve the dominator
tree. I think the dominator tree was usually not valid when CodeGenPrepare
ran, but I am using LoopInfo to mark preheaders, so the dominator tree is
now always valid before CodeGenPrepare.
Author: Tom Jablin (tjablin)
Reviewers: hfinkel george.burgess.iv vkalintiris dsanders kbarton cycheng
http://reviews.llvm.org/D16984
llvm-svn: 265397
This patch adds support for compact jumps similar to the previous compact
branch support for MIPSR6. Unlike compact branches, compact jumps do not
have a forbidden slot.
As MipsInstrInfo::getEquivalentCompactForm can determine the correct
expansion for jumps and branches for both microMIPS and MIPSR6, remove the
unnecessary distinction in the delay slot filler.
Reviewers: vkalintiris
Subscribers: llvm-commits, dsanders
llvm-svn: 265390
Refactor common code that queries the ModuleSummaryIndex for a value's
GlobalValueInfo struct into getGlobalValueInfo helper methods, which
will also be used by D18763.
llvm-svn: 265370
Summary:
I encountered this issue when constant folding during inlining tried to
fold away a bitcast of a double to an x86_mmx, which is not an integral
type. The test case exposes the same issue with a smaller code snippet
during early CSE.
Subscribers: llvm-commits
Differential Revision: http://reviews.llvm.org/D18528
llvm-svn: 265367
We were using array_pod_sort on an array of type 'Attribute', which
wraps a pointer to AttributeImpl. For the most part this didn't matter
because the printing code prints enum attributes in a defined order, but
integer attributes such as 'align' and 'dereferenceable' were not
ordered.
Furthermore, AttributeImpl::operator< was broken for integer attributes.
An integer attribute is a kind and an integer value, and both pieces
need to be compared.
By fixing the comparison operator, we can go back to std::sort, and
things look good now. This should fix clang arm-swiftcall.c test
failures on Windows.
llvm-svn: 265361
There is no problem with the code today, but the fix will avoid a crash
in test/CodeGen/AMDGPU/subreg-coalescer-undef-use.ll once the
DetectDeadLanes pass is added.
llvm-svn: 265351
Remove a default parameter value being passed unnecessarily, which
also reduces the changes required when this parameter is changed in
D18763.
Document the remaining non-default bool value passed for another
parameter.
llvm-svn: 265348
I noticed that this isn't covered by our existing tests and spent some
time trying to come up with an example it actually hits. I tried hand
rolling something based on the explanation in the comment, but couldn't
get anything that didn't abort tail duplication earlier for one reason
or another.
Then, I tried cranking tail-dup-size up so this would fire more often,
and ran a bootstrap of clang and the nightly test suite - those
don't hit this either.
This reverts r132816 and replaces it with an assert.
llvm-svn: 265347
The original commit miscompiled things on 32-bit Windows, e.g. a Clang
bootstrap. It turns out that mergeSPUpdates() was a bit too generous in
what it interpreted as a stack adjustment, causing the following code:
addl $12, %esp
leal -4(%ebp), %esp
To be "optimized" into simply:
addl $8, %esp
This commit tightens up mergeSPUpdates() and includes a new test
(test14 in movtopush.ll) for this situation.
llvm-svn: 265345
That commit looks wonderful and awesome. Sadly, it greatly exacerbates
PR17409 and effectively regresses build time for a lot of (very large)
code when compiled with ASan or MSan.
We thought this could be fixed forward by landing D15302 which at last
fixes that PR, but some issues were discovered and it looks like that
got reverted, so reverting this as well temporarily. As soon as the fix
for PR17409 lands and sticks, we should re-land this patch as it won't
trigger more significant test cases hitting that bug.
Many thanks to Quentin and Wei here as they're doing all the awesome
hard work!!!
llvm-svn: 265331
Direct callees that are cast to other function prototypes
show up in the Call/Invoke instructions as ConstantExprs.
Currently, llvm::CallSite's getCalledFunction() fails
to return the callees in such expressions as direct calls.
Value profiling should avoid instrumenting such cases. Mostly NFC.
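A hypothetical example of the pattern in question:
```
declare i32 @callee(i32)

define void @caller() {
  ; the call is direct, but getCalledFunction() only sees a ConstantExpr bitcast
  call void bitcast (i32 (i32)* @callee to void ()*)()
  ret void
}
```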
llvm-svn: 265330
We can only perform a tail call to a callee that preserves all the
registers that the caller needs to preserve.
This situation happens with calling conventions like preserve_mostcc or
cxx_fast_tls. It was explicitly handled for fast_tls and failing for
preserve_most. This patch generalizes the check to any calling
convention.
Related to rdar://24207743
Differential Revision: http://reviews.llvm.org/D18680
llvm-svn: 265329
Summary:
Useful for debugging since we lose this correlation after the permodule
summary/VST is read and until we later materialize source modules in the
function importer.
Reviewers: joker.eph
Subscribers: llvm-commits, joker.eph
Differential Revision: http://reviews.llvm.org/D18555
llvm-svn: 265327
Summary:
To aid in debugging, dump out the correlation between value names and
GUID for each source module when it is materialized. This will make it
easier to comprehend the earlier summary-based function importing debug
trace which only has access to and prints the GUIDs.
Reviewers: joker.eph
Subscribers: llvm-commits, joker.eph
Differential Revision: http://reviews.llvm.org/D18556
llvm-svn: 265326
A seg-fault occurs due to a dereference of a null pointer, which is
the value returned by getConstantPart. This function returns
null if the constant part is not found. The code that calls this
function needs to check for the null return value.
Differential Revision: http://reviews.llvm.org/D18718
llvm-svn: 265319
Use the MachineFunctionProperty mechanism to indicate whether a MachineFunction
is in SSA form instead of a custom method on MachineRegisterInfo. NFC
Differential Revision: http://reviews.llvm.org/D18574
llvm-svn: 265318
Summary:
This adds the same checks that were added in r264593 to all
target-specific passes that run after register allocation.
Reviewers: qcolombet
Subscribers: jyknight, dsanders, llvm-commits
Differential Revision: http://reviews.llvm.org/D18525
llvm-svn: 265313
Replace analyzeSiblingValues with a new algorithm to fix its compile
time issue. The patch solves PR17409 and its duplicates.
analyzeSiblingValues is an N x N complexity algorithm, where N is
the number of siblings generated by register splitting. Although it
causes a significant compile time issue when N is large, it is also
important for performance since it removes redundant spills and
enables rematerialization.
To solve the compile time issue, the patch removes analyzeSiblingValues
and replaces it with lower cost alternatives containing two parts. The
first part creates a new spill hoisting method in postOptimization of
register allocation. It does spill hoisting once, after all the spills
are generated, instead of inside every instance of selectOrSplit. The
second part queries the defining expression of the original register for
rematerialization and keeps it always available during register allocation
even if it is already dead. It deletes those dead instructions only in
postOptimization. With the two parts in the patch, we can remove
analyzeSiblingValues without sacrificing performance.
Differential Revision: http://reviews.llvm.org/D15302
llvm-svn: 265309
Summary:
At this point we should be able to enable IAS by default for O32 without
breaking check-all, or recursion.
Reviewers: vkalintiris
Subscribers: dsanders, llvm-commits
Differential Revision: http://reviews.llvm.org/D18439
llvm-svn: 265302
This adds MC support for fused compare + indirect branch instructions,
i.e. CRB, CGRB, CLRB, CLGRB, CIB, CGIB, CLIB, CLGIB. They aren't actually
generated yet -- this is preparation for their use for conditional
returns in the next iteration of D17339.
Author: koriakin
Differential Revision: http://reviews.llvm.org/D18742
llvm-svn: 265296
A cross-thread sequentially consistent fence should be lowered into
z/Architecture's BCR serialization instruction, instead of causing a
fatal error in the back-end.
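A minimal example (hypothetical function) of the kind of input that previously hit the fatal error:
```
define void @publish(i32* %p) {
  store i32 1, i32* %p, align 4
  ; cross-thread seq_cst fence, now lowered to a BCR serialization instruction
  fence seq_cst
  ret void
}
```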
Author: bryanpkc
Differential Revision: http://reviews.llvm.org/D18644
llvm-svn: 265292
Enable the SystemZ back-end to lower FRAMEADDR and RETURNADDR, which
previously would cause the back-end to crash. Currently, only a
frame count of zero is supported.
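For example (a sketch, hypothetical function name), the intrinsics that now lower instead of crashing:
```
declare i8* @llvm.frameaddress(i32)
declare i8* @llvm.returnaddress(i32)

define i8* @current_return_address() {
  ; only a depth (frame count) of 0 is currently supported
  %ra = call i8* @llvm.returnaddress(i32 0)
  ret i8* %ra
}
```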
Author: bryanpkc
Differential Revision: http://reviews.llvm.org/D18514
llvm-svn: 265291
Implemented truncstore for KNL and skylake-avx512.
Covered vectors from v2i1 to v64i1. We save the value in bits (not in bytes) - v32i1 is saved in 4 bytes.
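A hypothetical example of one of the covered cases, a v32i1 truncating store (32 bits, i.e. 4 bytes, in memory):
```
define void @store_mask(<32 x i8> %x, <32 x i1>* %p) {
  ; the trunc + store pair becomes a truncating store of 32 one-bit elements
  %m = trunc <32 x i8> %x to <32 x i1>
  store <32 x i1> %m, <32 x i1>* %p
  ret void
}
```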
Differential Revision: http://reviews.llvm.org/D18740
llvm-svn: 265283
Remove a few old FIXMEs from the original commit of the Metadata/Value
split in r223802. These are commented out assertions to the effect that
calls between mapValue and mapMetadata never return nullptr.
(The only behaviour change is that Mapper::mapSimpleMetadata memoizes
the nullptr return.)
When I originally rewrote the mapping code, I thought we could be
stricter in the new metadata hierarchy and never return nullptr when
RF_NullMapMissingGlobalValues was off. It's still not entirely clear to
me why these assertions failed (a few months ago, I had a theory that I
forgot to write down, but that's helping no one).
Understood or not, I no longer see how these commented-out assertions
would be useful. I'm relegating them to the annals of source control
before making significant changes to ValueMapper.cpp.
llvm-svn: 265282
RAUW support on MDNode usually requires an extra allocation for
ReplaceableMetadataImpl. This is only strictly necessary if there are
tracking references to the MDNode. Make the construction of
ReplaceableMetadataImpl lazy, so that we don't get allocations if we
don't need them.
Since MDNode::isResolved now checks MDNode::isTemporary and
MDNode::NumUnresolved instead of whether a ReplaceableMetadataImpl is
allocated, the internal changes are intrusive (at various internal
checkpoints, isResolved now has a different answer).
However, there should be no real functionality change here; just
slightly lazier allocation behaviour. The external semantics should be
identical.
llvm-svn: 265279
This adds an assertion to maintain the property from r265273. When
Mapper::mapSimpleMetadata calls Mapper::mapValue, it should not find its
way back to mapMetadataImpl. This guarantees that mapSimpleMetadata is
not involved in any recursion.
Since Mapper::mapValue calls out to arbitrary materializers, we need to
save a bit on the ValueMap to make this assertion effective.
There should be no functionality change here. This co-recursion should
already have been impossible.
llvm-svn: 265276
The main change is to delay materializing GlobalValue initializers from
Mapper::mapValue until Mapper::~Mapper. This effectively removes all
recursion from mapSimpleMetadata, as promised in r265270.
mapSimpleMetadata calls mapValue for ConstantAsMetadata nodes to
find the mapped constant, and now it shouldn't be possible for mapValue
to indirectly re-invoke mapMetadata. I'll add an assertion to that
effect in a follow-up (separated so that the assertion can easily be
reverted independently, if it comes to that).
This a step toward a broader goal: converting Mapper::mapMetadataImpl
from a recursive to an iterative algorithm.
When a BlockAddress points at a BasicBlock inside an unmaterialized
function body, we need to delay it until the function body is
materialized in Mapper::~Mapper. This commit creates a temporary
BasicBlock and returns a new BlockAddress, then RAUWs the BasicBlock
once it is known. This situation should be extremely rare since a
BlockAddress is usually used from within the function it's referencing
(and BlockAddress itself is rare).
There should be no observable functionality change.
llvm-svn: 265273
Split out a helper for mapping metadata without operands. This is any
metadata that is not an MDNode, and any MDNode where the answer is known
without looking at operands.
Through some weird twists, this function is co-recursive:
mapSimpleMetadata
=> MapValue
=> materializeInitFor
=> linkFunctionBody
=> RemapInstructions
=> MapMetadata
=> mapSimpleMetadata
I plan to break the recursion in a follow-up.
llvm-svn: 265270
Add support for lowering with the MOVMSK instruction to extract vector element signbits to a GPR.
This is an early step towards more optimal handling of vector comparison results.
Differential Revision: http://reviews.llvm.org/D18741
llvm-svn: 265266
Sinking comparisons in CGP can undo the job of hoisting them done
earlier by LICM, and soft-FP makes this an expensive mistake.
A common pattern that produces floating point comparisons uniform
over a loop is an explicit check for division by zero. If the divisor
is hoisted out of the loop, the comparison can also be, but hoisting
the function that unwinds is never legal, since it may cause side
effects in the loop body prior to the unwinding to not be executed.
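A sketch of the pattern (hypothetical names): the fcmp on the loop-invariant divisor is hoisted by LICM, while the call that unwinds stays inside the loop:
```
declare void @raise_div_by_zero()

define void @scale(double* %out, double* %in, double %d, i64 %n) {
entry:
  %div.by.zero = fcmp oeq double %d, 0.000000e+00   ; hoisted out of the loop
  br label %loop

loop:
  %i = phi i64 [ 0, %entry ], [ %i.next, %cont ]
  ; CGP should not sink the hoisted fcmp back down to this use
  br i1 %div.by.zero, label %raise, label %cont

raise:
  call void @raise_div_by_zero()   ; may unwind; hoisting this call would be illegal
  ret void

cont:
  %src = getelementptr double, double* %in, i64 %i
  %v = load double, double* %src, align 8
  %q = fdiv double %v, %d
  %dst = getelementptr double, double* %out, i64 %i
  store double %q, double* %dst, align 8
  %i.next = add i64 %i, 1
  %done = icmp eq i64 %i.next, %n
  br i1 %done, label %exit, label %loop

exit:
  ret void
}
```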
Differential Revision: http://reviews.llvm.org/D18744
llvm-svn: 265264
Tidied up comments, stripped trailing whitespace, split apart nodes that aren't related.
No change in ordering although there is definitely some scope for it.
llvm-svn: 265263
Floating point intrinsics in LLVM are generally not speculatively
executed, since most of them are defined to behave the same as libm
functions, which set errno.
However, the only error that can happen when executing ceil, floor,
nearbyint, rint and round libm functions per POSIX.1-2001 is -ERANGE,
and that requires the maximum value of the exponent to be smaller
than the number of mantissa bits, which is not the case with any of
the floating point types supported by LLVM.
The trunc and copysign functions never set errno per POSIX.1-2001.
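For instance, a hypothetical shape of a transform this enables (speculating the intrinsic across a branch):
```
declare double @llvm.ceil.f64(double)

define double @round_up_if(double %x, i1 %c) {
entry:
  br i1 %c, label %then, label %exit

then:
  ; ceil cannot set errno for any supported FP type, so this call may be speculated
  %r = call double @llvm.ceil.f64(double %x)
  br label %exit

exit:
  %res = phi double [ %r, %then ], [ %x, %entry ]
  ret double %res
}
```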
Differential Revision: http://reviews.llvm.org/D18643
llvm-svn: 265262
We already skip optimizations if the return value
of printf() is used, so CI->use_empty() is always
true.
Differential Revision: http://reviews.llvm.org/D18656
llvm-svn: 265253
Summary:
* Fix to stop delay slot filler from inserting SP modifying instructions in the newly expanded call/return instructions.
* In LowerSymbol the outermost type was not LanaiMCExpr if there was a binary expression
* Remove printExpr in LanaiInstPrinter
Subscribers: joker.eph, llvm-commits
Differential Revision: http://reviews.llvm.org/D18734
llvm-svn: 265251
Add support for the AArch64 .cpu directive. This is a slightly involved
directive since the parameter is actually a variable encoded string. The
general structure is:
<cpu>[[+-]<feature>]*
We now map some of the supported feature string names to the internal
representation of feature flags. If we encounter one which we do not support,
we bail out, as we cannot validate the assembly any longer.
Resolves PR27010.
llvm-svn: 265240
Split the loop through compile units in mapUnneededSubprograms in two.
First, visit imported entities to ensure that we've visited all needed
subprograms. Second, visit subprograms, and drop the ones we don't
need.
Hypothetically this protects against a subprogram from one compile unit
being referenced from an imported entity in a different compile unit. I
don't think that's valid IR (a debug info expert could confirm), but I
think the refactor makes the code more clear.
llvm-svn: 265233
IRLinker::mapUnneededSubprograms has to be sure that any "needed"
subprograms get linked in. Rather than traversing through imported
entities using llvm::getSubprogram, call MapMetadata. The latter
memoizes the result in the ValueMap (sharing work with
IRLinker::linkNamedMDNodes proper), and makes the local SmallPtrSet
redundant.
llvm-svn: 265231
Instead of checking on the fly during MapMetadata whether a subprogram is
needed, seed the ValueMap with `nullptr` up-front.
There is a small hypothetical functionality change. Previously, calling
MapMetadataOp on a node whose "scope:" chain led to an unneeded
subprogram would return nullptr. However, if that were ever called,
then the subprogram would be needed; a situation that the IRMover is
supposed to avoid a priori!
Besides cleaning up the code a little, this restores a nice property:
MapMetadataOp returns the same as MapMetadata.
llvm-svn: 265229
Support seeding a ValueMap with nullptr for Metadata entries, a
situation I didn't consider in the Metadata/Value split.
I added a ValueMapper::getMappedMD accessor that returns an
Optional<Metadata*> with the mapped (possibly null) metadata. IRMover
needs to use this to avoid modifying the map when it's checking for
unneeded subprograms. I updated a call from bugpoint since I find the
new code clearer.
llvm-svn: 265228
Whenever metadata is only referenced by a single function, emit the
metadata just in that function block. This should improve lazy-loading
by reducing the amount of metadata in the global block.
For now, this should catch all DILocations, and anything else that
happens to be referenced only by a single function.
It's also a first step toward a couple of possible future directions
(which this commit does *not* implement):
1. Some debug info metadata is only referenced from compile units and
individual functions. If we can drop the link from the compile
unit, this optimization will get more powerful.
2. Any uniqued metadata that isn't referenced globally can in theory be
emitted in every function block that references it (trading off
bitcode size and full-parse time vs. lazy-load time).
Note: this assumes the new BitcodeReader error checking from r265223.
The metadata stored in function blocks gets purged after parsing each
function, which means unresolved forward references will get lost.
Since all the global metadata should have already been resolved by the
time we get to the function metadata blocks we just need to check for
that case. (If for some reason we need to handle bitcode that fails the
checks in r265223, the fix is to store about-to-be-dropped unresolved
nodes in MetadataList::shrinkTo until they can be handled successfully by
a future call to MetadataList::tryToResolveCycles.)
llvm-svn: 265226
Further unify the handling of function-local metadata with global
metadata, by exposing the same interface in ValueEnumerator. Both
contexts use the same accessors:
- getMDStrings(): get the strings for this block.
- getNonMDStrings(): get the non-strings for this block.
A future commit will start adding strings to the function-block.
llvm-svn: 265224
A follow-up commit will start using function metadata blocks more
heavily. This commit adds some error checking to confirm that metadata
is fully resolved before (and after) materializing each function.
This is valid even when reading very old bitcode from before the
metadata/value split. The global metadata block always came before the
function blocks. However, in case somehow this causes a regression
(i.e., an old LLVM did produce such bitcode after all) I'm committing
separately.
llvm-svn: 265223
Summary: This should make the code more readable, especially all the map declarations.
Reviewers: tejohnson
Subscribers: llvm-commits
Differential Revision: http://reviews.llvm.org/D18721
From: Mehdi Amini <mehdi.amini@apple.com>
llvm-svn: 265215
Incremental LTO will use a cache to store object files.
This patch handles the pruning part of the cache, exposing
a few knobs:
- Pruning interval: the implementation keeps a "timestamp" file in the
directory and will scan it only after a given interval since the
last modification of the timestamp file. This is for performance
purposes; we don't want to scan the folder continuously.
- Entry expiration: this is the time after which a file that hasn't
been used is removed from the cache.
- Maximum size: expressed as a percentage of the available disk space,
it helps avoid blowing up the disk space.
http://reviews.llvm.org/D18422
From: Mehdi Amini <mehdi.amini@apple.com>
llvm-svn: 265209
Use a helper function to find all the direct call sites in a function.
Also split the code into a separate file, as this will be used by the
indirect-call-promotion transformation.
Differential Revision: http://reviews.llvm.org/D18704
llvm-svn: 265199
This fixes various symbolization test failures for me when I build with a
hermetic VS2015 without having run the 2015 installer.
http://reviews.llvm.org/D18707
llvm-svn: 265193
These functions can be dropped by the compiler if they are no longer
referenced in the current module. However, there is a chance that
another module is still referencing them because of the import.
Multiple solutions can be used:
- Always import LinkOnce when a caller is imported. This ensures that
every module with a call to a LinkOnce has the definition and will
be able to emit it if it emits the call.
- Turn the LinkOnce into Weak, so that it is always emitted.
- Turn all LinkOnce into available_externally and come back after all
modules are codegen'ed to emit only one copy of the linkonce, when
there is still a reference to it.
This patch implements the second option, with an optimization that
only *one* module will turn the LinkOnce into Weak, while the others
will turn it into available_externally, so that there is exactly one
copy emitted for the whole compilation.
http://reviews.llvm.org/D18346
From: Mehdi Amini <mehdi.amini@apple.com>
llvm-svn: 265190
A ``swifterror`` attribute can be applied to a function parameter or an
AllocaInst.
This commit does not include any target-specific change. The target-specific
optimization will come as a follow-up patch.
Differential Revision: http://reviews.llvm.org/D18092
llvm-svn: 265189
ThreadModel::Single is already handled by ARMPassConfig adding
LowerAtomicPass to the pass list, which lowers all atomics to non-atomic
ops and deletes fences.
So by the time we get to ISel, there are no atomic fences left, and they
don't need special handling.
llvm-svn: 265178