llvm-project

Commit Graph

Author	SHA1	Message	Date
Andrea Di Biagio	b14ae8692d	[CodeGenPrepare] Removed duplicate logic. SimplifyCFG already knows how to speculate calls to cttz/ctlz. SimplifyCFG now knows how to speculate calls to intrinsic cttz/ctlz that are 'cheap' for the target. Therefore, some of the logic in CodeGenPrepare that was originally added at revision 224899 can now be removed. This patch is basically a no functional change. It removes the duplicated logic in CodeGenPrepare and converts all the existing target specific tests for cttz/ctlz into SimplifyCFG tests. Differential Revision: http://reviews.llvm.org/D7608 llvm-svn: 229105	2015-02-13 14:15:48 +00:00
Arnaud A. de Grandmaison	a7c90d8487	[PBQP] Conservativelly allocatable nodes can be spilled and give a better solution Although such nodes are allocatable, the cost of spilling may be less than allocating to register, so spilling the node may provide a better solution. The assert does not account for this case, so remove it for now. llvm-svn: 229103	2015-02-13 12:04:42 +00:00
James Molloy	1b6207e6eb	[SimplifyCFG] Be more aggressive Up the phi node folding threshold from a cheap "1" to a meagre "2". Update tests for extra added selects and slight code churn. llvm-svn: 229099	2015-02-13 10:48:30 +00:00
Toma Tabacu	16a74499af	[mips] Improve support for the .set at/noat assembler directives. Summary: Made the following changes: Added calls to emitDirectiveSetNoAt() and emitDirectiveSetAt(). Added special emit function for .set at=$reg, emitDirectiveSetAtWithArg(unsigned RegNo). Improved parsing error checks for .set at. Refactored parser code for .set at. Improved testing of both directives. Improved code readability and comments. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7176 llvm-svn: 229097	2015-02-13 10:30:57 +00:00
Chandler Carruth	30d69c2e36	[PM] Remove the old 'PassManager.h' header file at the top level of LLVM's include tree and the use of using declarations to hide the 'legacy' namespace for the old pass manager. This undoes the primary modules-hostile change I made to keep out-of-tree targets building. I sent an email inquiring about whether this would be reasonable to do at this phase and people seemed fine with it, so making it a reality. This should allow us to start bootstrapping with modules to a certain extent along with making it easier to mix and match headers in general. The updates to any code for users of LLVM are very mechanical. Switch from including "llvm/PassManager.h" to "llvm/IR/LegacyPassManager.h". Qualify the types which now produce compile errors with "legacy::". The most common ones are "PassManager", "PassManagerBase", and "FunctionPassManager". llvm-svn: 229094	2015-02-13 10:01:29 +00:00
Chandler Carruth	71f308adb7	Re-sort #include lines using my handy dandy ./utils/sort_includes.py script. This is in preparation for changes to lots of include lines. llvm-svn: 229088	2015-02-13 09:09:03 +00:00
Chandler Carruth	d99f427e31	Revert a series of commits starting at r228886 which is triggering some regressions for LLDB on Linux. Rafael indicated on lldb-dev that we should just go ahead and revert these but that he wasn't at a computer. The patches backed out are as follows: r228980: Add support for having multiple sections with the name and ... r228889: Invert the section relocation map. r228888: Use the existing SymbolTableIndex intsead of doing a lookup. r228886: Create the Section -> Rel Section map when it is first needed. These patches look pretty nice to me, so hoping its not too hard to get them re-instated. =D llvm-svn: 229080	2015-02-13 07:52:39 +00:00
Craig Topper	916708f152	[X86] Add support for parsing and printing the mnemonic aliases for the XOP VPCOM instructions. llvm-svn: 229078	2015-02-13 07:42:25 +00:00
Craig Topper	e32546dd29	[X86] Fix XOP vpcom intrinsic autoupgrade to map 'true' and 'false' to the correct immediates. Seems they were swapped. llvm-svn: 229077	2015-02-13 07:42:15 +00:00
Zachary Turner	a952c49c20	llvm-pdbdump: Add more comprehensive dumping of symbol types. In particular this patch adds the ability to dump complete function signature information including argument types as correctly formatted strings. A side effect of this is that almost all symbol and meta types are now formatted. llvm-svn: 229076	2015-02-13 07:40:03 +00:00
Mehdi Amini	383d7ae0bd	InstCombine: cleanup redundant dyn_cast<> (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 229075	2015-02-13 07:38:04 +00:00
Craig Topper	007a713ebf	Fix a typo in a comment. NFC llvm-svn: 229071	2015-02-13 06:07:29 +00:00
Craig Topper	4e0700f365	[X86] Remove int_x86_sse2_psll_dq_bs and int_x86_sse2_psrl_dq_bs intrinsics. The builtins aren't used by clang. llvm-svn: 229069	2015-02-13 06:07:24 +00:00
Chandler Carruth	1fbc316534	[unroll] Concede defeat and disable the unroll analyzer for now. The issues with the new unroll analyzer are more fundamental than code cleanup, algorithm, or data structure changes. I've sent an email to the original commit thread with details and a proposal for how to redesign things. I'm disabling this for now so that we don't spend time debugging issues with it in its current state. llvm-svn: 229064	2015-02-13 05:31:46 +00:00
Michael Liao	d266b928ae	[InstCombine] Fix a bug when combining `icmp` from `ptrtoint` - First, there's a crash when we try to combine that pointers into `icmp` directly by creating a `bitcast`, which is invalid if that two pointers are from different address spaces. - It's not always appropriate to cast one pointer to another if they are from different address spaces as that is not no-op cast. Instead, we only combine `icmp` from `ptrtoint` if that two pointers are of the same address space. llvm-svn: 229063	2015-02-13 04:51:26 +00:00
Chandler Carruth	6c03dff7cc	[unroll] Merge the simplification and DCE estimation methods on the UnrollAnalyzer. Now they share a single worklist and have less implicit state between them. There was no real benefit to separating these two things out. I'm going to subsequently refactor things to share even more code. llvm-svn: 229062	2015-02-13 04:39:05 +00:00
Chandler Carruth	d9591d8922	[unroll] Remove pointless dyn_cast<>s to Instruction - the users of an instruction must by definition be instructions. llvm-svn: 229061	2015-02-13 04:33:21 +00:00
Chandler Carruth	5457e20d27	[unroll] Don't check the loop set for whether an instruction is contained in it each time we try to add it to the worklist, just check this when pulling it off the worklist. That way we do it at most once per instruction with the cost of the worklist set we would need to pay anyways. llvm-svn: 229060	2015-02-13 04:30:44 +00:00
Chandler Carruth	e5c30e4e10	[unroll] Change the other worklist in the unroll analyzer to be a set vector. In addition to dramatically reducing the work required for contrived example loops, this also has to correct some serious latent bugs in the cost computation. Previously, we might add an instruction onto the worklist once for every load which it used and was simplified. Then we would visit it many times and accumulate "savings" each time. I mean, fortunately this couldn't matter for things like calls with 100s of operands, but even for binary operators this code seems like it must be double counting the savings. I just noticed this by inspection and due to the runtime problems it can introduce, I don't have any test cases for cases where the cost produced by this routine is unacceptable. llvm-svn: 229059	2015-02-13 04:27:50 +00:00
Chandler Carruth	7824bc9241	[unroll] Replace a boolean, for loop, condition, and break with std::all_of and a lambda. Much cleaner, no functionality changed. llvm-svn: 229058	2015-02-13 04:18:14 +00:00
Chandler Carruth	06d537cdd6	[unroll] Directly query for dead instructions. In the unroll analyzer, it is checking each user to see if that user will become dead. However, it first checked if that user was missing from the simplified values map, and then if was also missing from the dead instructions set. We add everything from the simplified values map to the dead instructions set, so the first step is completely subsumed by the second. Moreover, the first step requires inserting something into the simplified value map which isn't what we want at all. This also replaces a dyn_cast with a cast as an instruction cannot be used by a non-instruction. llvm-svn: 229057	2015-02-13 04:14:05 +00:00
Chandler Carruth	82cb30f10c	[unroll] Replace a linear time check for no uses with a constant time check. Also hoist this into the enqueue process as it is faster even than testing the worklist set, we should just directly filter these out much like we filter out constants and such. llvm-svn: 229056	2015-02-13 04:06:08 +00:00
Chandler Carruth	3b057b3216	[unroll] Rather than an operand set, use a setvector for the worklist. We don't just want to handle duplicate operands within an instruction, but also duplicates across operands of different instructions. I should have gone straight to this, but I had convinced myself that it wasn't going to be necessary briefly. I've come to my senses after chatting more with Nick, and am now happier here. llvm-svn: 229054	2015-02-13 03:57:40 +00:00
Chandler Carruth	17a0496b5a	[unroll] Extract the code to enqueue operansd for the worklist in the unroll analysis into a lambda and call it. That's much simpler than duplicating all the code. llvm-svn: 229053	2015-02-13 03:49:41 +00:00
Chandler Carruth	8c86375a10	[unroll] Use a small set to de-duplicate operands prior to putting them into the worklist. This avoids allocating lots of worklist memory for them when there are large numbers of repeated operands. llvm-svn: 229052	2015-02-13 03:48:38 +00:00
Chandler Carruth	93063e6191	[unroll] Make the unroll cost analysis terminate deterministically and reasonably quickly. I don't have a reduced test case, but for a version of FFMPEG, this makes the loop unroller start finishing at all (after over 15 minutes of running, it hadn't terminated for me, no idea if it was a true infloop or just exponential work). The key thing here is to check the DeadInstructions set when pulling things off the worklist. Without this, we would re-walk the user list of already dead instructions again and again and again. Consider phi nodes with many, many operands and other patterns. The other important aspect of this is that because we would keep re-visiting instructions that were already known dead, we kept adding their cost savings to this! This would cause our cost savings to be insanely inflated from this. While I was here, I also rotated the operand walk out of the worklist loop to make the code easier to read. There is still work to be done to minimize worklist traffic because we don't de-duplicate operands. This means we may add the same instruction onto the worklist 1000s of times if it shows up in 1000s of operansd to a PHI node for example. Still, with this patch, the ffmpeg testcase I have finishes quickly and I can't measure the runtime impact of the unroll analysis any more. I'll probably try to do a few more cleanups to this code, but not sure how much cleanup I can justify right now. llvm-svn: 229038	2015-02-13 03:40:58 +00:00
Duncan P. N. Exon Smith	b4aa16f2bc	IR: Drop never-used defaults for DIBuilder::createTemplate*(), NFC No caller specifies anything different; these parameters are dead code and probably always have been. The new hierarchy doesn't bother with the fields at all (see r228607 and r228652). llvm-svn: 229037	2015-02-13 03:35:29 +00:00
Matt Arsenault	63bef0d177	R600/SI: Remove unnecessary check for fpimm llvm-svn: 229034	2015-02-13 02:47:22 +00:00
Chandler Carruth	dd6029fc6e	[unroll] Make range based for loops a bit more explicit and more readable. The biggest thing that was causing me problems is recognizing the references vs. poniters here. I also found that for maps naming the loop variable as KeyValue helps make it obvious why you don't actually use it directly. Finally, using 'auto' instead of 'User *' doesn't seem like a good tradeoff. Much like with the other cases, I like to know its a pointer, and 'User' is just as long and tells the reader a lot more. llvm-svn: 229033	2015-02-13 02:45:17 +00:00
Chandler Carruth	87fdafc7b2	[IC] Fix a bug with the instcombine canonicalizing of loads and propagating of metadata. We were propagating !nonnull metadata even when the newly formed load is no longer of a pointer type. This is clearly broken and results in LLVM failing the verifier and aborting. This patch just restricts the propagation of !nonnull metadata to when we actually have a pointer type. This bug report and the initial version of this patch was provided by Charles Davis! Many thanks for finding this! We still need to add logic to round-trip the metadata correctly if we combine from pointer types to integer types and then back by using range metadata for the integer type loads. But this is the minimal and safe version of the patch, which is important so we can backport it into 3.6. llvm-svn: 229029	2015-02-13 02:30:01 +00:00
Chandler Carruth	415f41258f	[unroll] Avoid the "Insn" abbreviation of Instruction. This is quite hard to type and read for me, and is inconsistent with the other abbreviation in the base class "Inst". For most of these (where they are used widely) I prefer just spelling it out as Instruction. I've changed two of the short-lived variables to use "Inst" to match the base class. llvm-svn: 229028	2015-02-13 02:17:39 +00:00
Chandler Carruth	302a133b1e	[unroll] Tidy up the integer we use to accumululate the number of instructions optimized. NFC, just separating this out from the functionality changing commit. llvm-svn: 229026	2015-02-13 02:10:56 +00:00
Duncan P. N. Exon Smith	1c93116489	AsmWriter/Bitcode: MDImportedEntity llvm-svn: 229025	2015-02-13 01:46:02 +00:00
Duncan P. N. Exon Smith	d45ce96c38	AsmWriter/Bitcode: MDObjCProperty llvm-svn: 229024	2015-02-13 01:43:22 +00:00
Duncan P. N. Exon Smith	0c5c0124ac	AsmWriter/Bitcode: MDExpression llvm-svn: 229023	2015-02-13 01:42:09 +00:00
Duncan P. N. Exon Smith	72fe2d0b79	AsmWriter/Bitcode: MDLocalVariable llvm-svn: 229022	2015-02-13 01:39:44 +00:00
Duncan P. N. Exon Smith	c8f810a017	AsmWriter/Bitcode: MDGlobalVariable llvm-svn: 229020	2015-02-13 01:35:40 +00:00
Duncan P. N. Exon Smith	2847f3805e	AsmWriter/Bitcode: MDTemplate{Type,Value}Parameter llvm-svn: 229019	2015-02-13 01:34:32 +00:00
Duncan P. N. Exon Smith	e146000565	AsmWriter/Bitcode: MDNamespace llvm-svn: 229018	2015-02-13 01:32:09 +00:00
Duncan P. N. Exon Smith	06a0702e40	AsmWriter/Bitcode: MDLexicalBlockFile llvm-svn: 229017	2015-02-13 01:30:42 +00:00
Duncan P. N. Exon Smith	a96d409997	AsmWriter/Bitcode: MDLexicalBlock llvm-svn: 229016	2015-02-13 01:29:28 +00:00
Duncan P. N. Exon Smith	890533e987	AsmWriter: MDSubprogram: Recognize DW_VIRTUALITY in 'virtuality' llvm-svn: 229015	2015-02-13 01:28:16 +00:00
Duncan P. N. Exon Smith	19fc5ed7db	AsmWriter/Bitcode: MDSubprogram llvm-svn: 229014	2015-02-13 01:26:47 +00:00
Duncan P. N. Exon Smith	c1f1acc751	AsmWriter/Bitcode: MDCompileUnit llvm-svn: 229013	2015-02-13 01:25:10 +00:00
Zachary Turner	2a5c0a27b6	Improve llvm-pdbdump output display. This patch adds a number of improvements to llvm-pdbdump. 1) Dumping of the entire global scope, and not only those symbols that live in individual compilands. 2) Prepend class name to member functions and data 3) Improved display of bitfields. 4) Support for dumping more kinds of data symbols. llvm-svn: 229012	2015-02-13 01:23:51 +00:00
Duncan P. N. Exon Smith	54e2bc6c9b	AsmWriter/Bitcode: MDSubroutineType llvm-svn: 229011	2015-02-13 01:22:59 +00:00
Duncan P. N. Exon Smith	aece2dc3f5	AsmWriter: MDCompositeType: Recognize DW_LANG in 'runtimeLang' llvm-svn: 229010	2015-02-13 01:21:25 +00:00
Duncan P. N. Exon Smith	171d077ae4	AsmWriter/Bitcode: MDDerivedType and MDCompositeType llvm-svn: 229009	2015-02-13 01:20:38 +00:00
Duncan P. N. Exon Smith	f14b9c7cc1	AsmWriter/Bitcode: MDFile llvm-svn: 229007	2015-02-13 01:19:14 +00:00
Duncan P. N. Exon Smith	cd6636c3bf	AsmWriter: MDBasicType: Recognize DW_ATE in 'encoding' llvm-svn: 229006	2015-02-13 01:17:35 +00:00
Duncan P. N. Exon Smith	09e03f38d6	AsmWriter/Bitcode: MDBasicType llvm-svn: 229005	2015-02-13 01:14:58 +00:00
Duncan P. N. Exon Smith	8775476419	AsmWriter/Bitcode: MDEnumerator llvm-svn: 229004	2015-02-13 01:14:11 +00:00
Duncan P. N. Exon Smith	c7363f1147	AsmWriter/Bitcode: MDSubrange llvm-svn: 229003	2015-02-13 01:10:38 +00:00
Duncan P. N. Exon Smith	193a4fdafd	IR: Add MDExpression::ExprOperand Port `DIExpression::Operand` over to `MDExpression::ExprOperand`. The logic is needed directly in `MDExpression` to support printing in assembly. llvm-svn: 229002	2015-02-13 01:07:46 +00:00
Duncan P. N. Exon Smith	3b631d291e	Support: Add dwarf::getOperationEncoding() llvm-svn: 229001	2015-02-13 01:05:00 +00:00
Duncan P. N. Exon Smith	8f46ee61c1	Support: Rewrite LocationAtom and OperationEncodingString(), NFC Use `Dwarf.def` more. llvm-svn: 229000	2015-02-13 01:04:08 +00:00
Akira Hatanaka	c43df5187c	[LinkModules] Change the way ModuleLinker merges triples. This commit makes the following changes: - Stop issuing a warning when the triples' string representations do not match exactly if the Triple objects generated from the strings compare equal. - On Apple platforms, choose the triple that has the larger minimum version number. rdar://problem/16743513 Differential Revision: http://reviews.llvm.org/D7591 llvm-svn: 228999	2015-02-13 00:40:41 +00:00
Eric Christopher	dc3a8a4a66	PPCFrameLowering's FramePointerOffset can be computed at initialization time. Do so. llvm-svn: 228998	2015-02-13 00:39:38 +00:00
Eric Christopher	736d39e189	The TOC save offset can be computed at compile time, do so and propagate changes. llvm-svn: 228997	2015-02-13 00:39:36 +00:00
Eric Christopher	f71609b5dd	The return save offset can be computed at initialization time - do so and save the value. llvm-svn: 228996	2015-02-13 00:39:27 +00:00
Chandler Carruth	10a9926ab5	[unroll] Don't use a map from pointer to bool. Use a set. This is much more efficient. In particular, the query with the user instruction has to insert a false for every missing instruction into the set. This is just a cleanup a long the way to fixing the underlying algorithm problems here. llvm-svn: 228994	2015-02-13 00:29:39 +00:00
Michael Zolotukhin	1b48019751	Prevent division by 0. When we try to estimate number of potentially removed instructions in loop unroller, we analyze first N iterations and then scale the computed number by TripCount/N. We should bail out early if N is 0. llvm-svn: 228988	2015-02-13 00:17:03 +00:00
Chandler Carruth	186ad60815	[unroll] Update the new analysis logic from r228265 to use modern coding conventions for function names consistently. Some were already using this but not all. llvm-svn: 228987	2015-02-13 00:00:24 +00:00
Rafael Espindola	b6a812ebb1	Add support for having multiple sections with the same name and comdat. Using this in combination with -ffunction-sections allows LLVM to output a .o file with mulitple sections named .text. This saves space by avoiding long unique names of the form .text.<C++ mangled name>. llvm-svn: 228980	2015-02-12 23:29:51 +00:00
David Majnemer	a12fcb790f	X86: Don't crash if we can't decode the pshufb mask Constant pool entries are uniqued by their contents regardless of their type. This means that a pshufb can have a shuffle mask which isn't a simple array of bytes. The code path which attempts to decode the mask didn't check for failure, causing PR22559. llvm-svn: 228979	2015-02-12 23:26:26 +00:00
Rafael Espindola	e4bcad4754	Learn that __DATA,__objc_classrefs is not atomized via symbols. This should hopefully fix objc on AArch64. llvm-svn: 228976	2015-02-12 23:11:59 +00:00
Olivier Sallenave	05e69157b6	Change max interleave factor to 12 for POWER7 and POWER8. llvm-svn: 228973	2015-02-12 22:57:58 +00:00
Hal Finkel	271e9f2870	[SDAG] Don't try to use FP_EXTEND/FP_ROUND for int<->fp promotions The PowerPC backend has long promoted some floating-point vector operations (such as select) to integer vector operations. Unfortunately, this behavior was broken by r216555. When using FP_EXTEND/FP_ROUND for promotions, we must check that both the old and new types are floating-point types. Otherwise, we must use BITCAST as we did prior to r216555 for everything. llvm-svn: 228969	2015-02-12 22:43:52 +00:00
Duncan P. N. Exon Smith	b93569d182	IR: Stop abusing DW_TAG_base_type for compile unit arrays The sub-arrays for compile units have for a long time been initialized to distinct temporary nodes with the `DW_TAG_base_type` tag, with no other operands. These invalid `DIBasicType`s are later replaced with appropriate arrays. This seems like a poor man's assertion that the arrays do eventually get replaced. These days, temporaries in the graph will cause assertions when writing bitcode or assembly, so this isn't necessary. Use temporary empty tuples instead. Note that the whole idea of using temporaries and then replacing them later is wasteful here. We never actually want to merge compile units by uniquing based on content. Compile units should use `getDistinct()` instead of `get()`, and then their operands can be freely replaced later on. llvm-svn: 228967	2015-02-12 21:52:11 +00:00
Rafael Espindola	3105fd8335	Remove mostly unused setters. Most of the code was setting the TargetOptions directly. llvm-svn: 228961	2015-02-12 21:16:34 +00:00
Zachary Turner	c074de041b	Add concrete type overloads to PDBSymbol::findChildren(). Frequently you only want to iterate over children of a specific type (e.g. functions). Previously you would get back a generic interface that allowed iteration over the base symbol type, which you would have to dyn_cast<> each one of. With this patch, we allow the user to specify the concrete type as a template parameter, and it will return an iterator which returns instances of the concrete type directly. llvm-svn: 228960	2015-02-12 21:09:24 +00:00
Reed Kotler	aa150ed780	Add bulk of returning of values to Mips fast-isel Summary: Implement the bulk of returning values in Mips fast-isel Test Plan: reatabi.ll Passes test-suite at -O0,-O2 and with mips32r2 and mips32r1. Reviewers: dsanders Reviewed By: dsanders Subscribers: llvm-commits, aemerson, rfuhler Differential Revision: http://reviews.llvm.org/D5920 llvm-svn: 228958	2015-02-12 21:05:12 +00:00
Bjorn Steinbrink	6f972a13f6	Fix a crash in the assumption cache when inlining indirect function calls Summary: Instances of the AssumptionCache are per function, so we can't re-use the same AssumptionCache instance when recursing in the CallAnalyzer to analyze a different function. Instead we have to pass the AssumptionCacheTracker to the CallAnalyzer so it can get the right AssumptionCache on demand. Reviewers: hfinkel Subscribers: llvm-commits, hans Differential Revision: http://reviews.llvm.org/D7533 llvm-svn: 228957	2015-02-12 21:04:22 +00:00
Benjamin Kramer	443c7967ea	InstCombine: Allow folding of xor into icmp by changing the predicate for vectors The loop vectorizer can create this pattern. llvm-svn: 228954	2015-02-12 20:26:46 +00:00
Simon Pilgrim	295eaad2b3	Relaxed over-zealous alignment requirement for VEX-encoded AES instructions llvm-svn: 228953	2015-02-12 20:01:03 +00:00
Rafael Espindola	203c5b9f39	On ELF, put PIC jump tables in a non executable section. Fixes PR22558. llvm-svn: 228939	2015-02-12 17:46:49 +00:00
Rafael Espindola	29786d4c16	Put each jump table in an independent section if the function is too. This allows the linker to GC both, fixing pr22557. llvm-svn: 228937	2015-02-12 17:16:46 +00:00
Benjamin Kramer	40957cc2ce	Fix accidental bit flip. llvm-svn: 228936	2015-02-12 16:30:00 +00:00
Benjamin Kramer	71e1eb5ab4	CoverageMapping: Bitvectorize code. No functionality change. llvm-svn: 228934	2015-02-12 16:18:07 +00:00
James Molloy	e805ad95dc	[LoopRerolling] Be more forgiving with instruction order. We can't solve the full subgraph isomorphism problem. But we can allow obvious cases, where for example two instructions of different types are out of order. Due to them having different types/opcodes, there is no ambiguity. llvm-svn: 228931	2015-02-12 15:54:14 +00:00
Benjamin Kramer	5f6a907288	MathExtras: Bring Count(Trailing\|Leading)Ones and CountPopulation in line with countTrailingZeros Update all callers. llvm-svn: 228930	2015-02-12 15:35:40 +00:00
Tim Northover	be0fda3c33	Triple: refactor redundant code. Should be no functional change, since most of the logic removed was completely pointless (after some previous refactoring) and the rest duplicated elsewhere. Patch by Kamil Rytarowski. llvm-svn: 228926	2015-02-12 15:12:13 +00:00
Michael Kuperstein	f4d1aca568	[X86] Call frame optimization - allow stack-relative movs to be folded into a push Since we track esp precisely, there's no reason not to allow this. llvm-svn: 228924	2015-02-12 14:17:35 +00:00
Asiri Rathnayake	e045e378ad	ARM: Fix another regression introduced in r223113 The changes in r223113 (ARM modified-immediate syntax) have broken instructions like: mov r0, #~0xffffff00 The problem is that I've added a spurious range check on the immediate operand to ensure that it lies between INT32_MIN and UINT32_MAX. While this range check is correct in theory, it causes problems because the operand is stored in an int64_t (by MC). So valid 32-bit constants like \#~0xffffff00 become out of range. The solution is to simply remove this range check. It is not possible to validate the range of the immediate operand with the current setup because: 1) The operand is stored in an int64_t by MC, 2) The immediate can be of the forms #imm, #-imm, #~imm or even #((~imm)) etc. So we just chop the value to 32 bits and use it. Also noted that the original range check was note tested by any of the unit tests. I've added a new test to cover #~imm kind of operands. Change-Id: I411e90d84312a2eff01b732bb238af536c4a7599 llvm-svn: 228920	2015-02-12 13:37:28 +00:00
Dmitry Vyukov	2e8d82e607	tsan: do not instrument not captured values I've built some tests in WebRTC with and without this change. With this change number of __tsan_read/write calls is reduced by 20-40%, binary size decreases by 5-10% and execution time drops by ~5%. For example: $ ls -l old/modules_unittests new/modules_unittests -rwxr-x--- 1 dvyukov 41708976 Jan 20 18:35 old/modules_unittests -rwxr-x--- 1 dvyukov 38294008 Jan 20 18:29 new/modules_unittests $ objdump -d old/modules_unittests \| egrep "callq.__tsan_(read\|write\|unaligned)" \| wc -l 239871 $ objdump -d new/modules_unittests \| egrep "callq.__tsan_(read\|write\|unaligned)" \| wc -l 148365 http://reviews.llvm.org/D7069 llvm-svn: 228917	2015-02-12 09:55:28 +00:00
Elena Demikhovsky	d2cb3c8876	AVX-512: Fixed the "test" operation for i1 type Using KORTESTW for comparison i1 value with zero was wrong since the instruction tests 16 bits. KORTESTW may be used with KSHIFTL+KSHIFTR that clean the 15 upper bits. I removed (X86cmp i1, 0) pattern and zero-extend i1 to i8 and then use TESTB. There are some cases where i1 is in the mask register and the upper bits are already zeroed. Then KORTESTW is the better solution, but it is subject for optimization. Meanwhile, I'm fixing the correctness issue. llvm-svn: 228916	2015-02-12 08:40:34 +00:00
Michael Kuperstein	db95d04be4	[X86] A heuristic to estimate the size impact for converting stack-relative parameter movs to pushes This gives a rough estimate of whether using pushes instead of movs is profitable, in terms of size. We go over all calls in the MachineFunction and compute: a) For each callsite that can not use pushes, the penalty of not having a reserved call frame. b) For each callsite that can use pushes, the gain of actually replacing the movs with pushes (and the potential penalty of having to readjust the stack). Differential Revision: http://reviews.llvm.org/D7561 llvm-svn: 228915	2015-02-12 08:36:35 +00:00
Ahmed Bougacha	24433a7005	[CodeGen] Don't blindly combine (fp_round (fp_round x)) to (fp_round x). We used to do this DAG combine, but it's not always correct: If the first fp_round isn't a value preserving truncation, it might introduce a tie in the second fp_round, that wouldn't occur in the single-step fp_round we want to fold to. In other words, double rounding isn't the same as rounding. Differential Revision: http://reviews.llvm.org/D7571 llvm-svn: 228911	2015-02-12 06:15:29 +00:00
George Burgess IV	33305e7280	Fixed a bug where CFLAA would crash the compiler. We would crash if we couldn't locate a Function that either Location's Value belonged to. Now we just print out a debug message and return conservatively. llvm-svn: 228901	2015-02-12 03:07:07 +00:00
Chandler Carruth	63aaa98d94	[slp] Fix a nasty bug in the SLP vectorizer that Joerg pointed out. Apparently some code finally started to tickle this after my canonicalization changes to instcombine. The bug stems from trying to form a vector type out of scalars that aren't compatible at all. In this example, from x86_mmx values. The code in the vectorizer that checks for reasonable types whas checking for aggregates or vectors, but there are lots of other types that should just never reach the vectorizer. Debugging this was made more confusing by the lie in an assert in VectorType::get() -- it isn't that the types are primitive. The types must be integer, pointer, or floating point types. No other types are allowed. I've improved the assert and added a helper to the vectorizer to handle the element type validity checks. It now re-uses the VectorType static function and then further excludes weird target-specific types that we probably shouldn't be touching here (x86_fp80 and ppc_fp128). Neither of these are really reachable anyways (neither 80-bit nor 128-bit things will get vectorized) but it seems better to just eagerly exclude such nonesense. I've added a test case, but while it definitely covers two of the paths through this code there may be more paths that would benefit from test coverage. I'm not familiar enough with the SLP vectorizer to synthesize test cases for all of these, but was able to update the code itself by inspection. llvm-svn: 228899	2015-02-12 02:30:56 +00:00
Hal Finkel	7a0516ea66	[PowerPC] Mark jumps as expensive (using using CR bits) On PowerPC, which has a full set of logical operations on (its multiple sets of) condition-register bits, it is not profitable to break of complex conditions feeding a jump into multiple jumps. We can turn off this feature of CGP/SDAGBuilder by marking jumps as "expensive". P7 test-suite speedups (no regressions): MultiSource/Benchmarks/FreeBench/pcompress2/pcompress2 -0.626647% +/- 0.323583% MultiSource/Benchmarks/Olden/power/power -18.2821% +/- 8.06481% llvm-svn: 228895	2015-02-12 01:02:52 +00:00
Zachary Turner	36f807c860	Revert "Change Path::filename_pos() to skip the drive letter." This reverts commit 228874. For some reason users reported seeing Clang taking up 25+GB of memory and bringing down machines with this change. Reverting until we figure it out. llvm-svn: 228890	2015-02-12 00:05:49 +00:00
Rafael Espindola	bbcdb9da19	Invert the section relocation map. It now points from rel section to section. Use it to set sh_info, avoiding a brittle name lookup. llvm-svn: 228889	2015-02-11 23:38:33 +00:00
Rafael Espindola	62118a1fe3	Use the existing SymbolTableIndex instead of doing a lookup. NFC. llvm-svn: 228888	2015-02-11 23:33:46 +00:00
Rafael Espindola	fbfbdc4377	Create the Seciton -> Rel Section map when it is first needed. NFC. Saves a walk over every section. llvm-svn: 228886	2015-02-11 23:17:48 +00:00
Tim Northover	02438033e8	DeadArgElim: aggregate Return assessment properly. I mistakenly thought the liveness of each "RetVal(F, i)" depended only on F. It actually depends on the index too, which means we need to be careful about how the results are combined before return. In particular if a single Use returns Live, that counts for the entire object, at the granularity we're considering. llvm-svn: 228885	2015-02-11 23:13:11 +00:00
Rafael Espindola	ef6baea74e	Remove unused argument. NFC. llvm-svn: 228884	2015-02-11 23:11:18 +00:00
David Majnemer	ab2b25bc97	Unbreak buildbots The next offset should be updated as well. llvm-svn: 228883	2015-02-11 22:51:55 +00:00
Rafael Espindola	fbd0ddf082	Don't recompute the entire section map just to add 3 entries. NFC. llvm-svn: 228881	2015-02-11 22:41:26 +00:00
David Majnemer	3df3c61e91	MC, COFF: Align section contents to a four byte boundary llvm-svn: 228879	2015-02-11 22:22:30 +00:00
Zachary Turner	3e76643a95	Change Path::filename_pos() to skip the drive letter. For Windows, filename_pos() tries to find the filename by searching for separators after the last :. Instead, it should really check for the only location that a : is valid, which is in the second character, and search for separators after that. llvm-svn: 228874	2015-02-11 21:16:35 +00:00
Rafael Espindola	d966522377	Remove unused argument. NFC. llvm-svn: 228873	2015-02-11 21:08:00 +00:00
Mehdi Amini	9730116bd6	Reassociate: cannot negate a INT_MIN value Summary: When trying to canonicalize negative constants out of multiplication expressions, we need to check that the constant is not INT_MIN which cannot be negated. Reviewers: mcrosier Reviewed By: mcrosier Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7286 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 228872	2015-02-11 19:54:44 +00:00
Tom Stellard	0648588e7d	R600/SI: Disable subreg liveness This is temporary while we try to fix a crash in the register coalescer. llvm-svn: 228861	2015-02-11 18:24:53 +00:00
Adrian Prantl	18a25b016e	Allow DIBuilder::replaceVTableHolder() to work with temporary nodes, tested via the clang test CodeGenCXX/vtable-holder-self-reference.cpp . llvm-svn: 228854	2015-02-11 17:45:10 +00:00
Adrian Prantl	9a8049238e	Add a trackIfUnresolved to DIBuilder::createInheritance(), tested via the clang test CodeGenCXX/vtable-holder-self-reference.cpp . llvm-svn: 228853	2015-02-11 17:45:08 +00:00
Adrian Prantl	534a81a9ec	Generalize DIBuilder's createReplaceableForwardDecl() to a more flexible createReplaceableCompositeType() that allows to create non-forward-declared temporary nodes. Paired commit with CFE. llvm-svn: 228852	2015-02-11 17:45:05 +00:00
Tom Stellard	de5b7b180a	R600: Split AMDGPUPassConfig into R600PassConfig and GCNPassConfig llvm-svn: 228850	2015-02-11 17:11:51 +00:00
Tom Stellard	c65b36061a	R600: Create an R600TargetMachine for pre-gcn GPUs No functinality change. R600TargetMachine inherits from AMDGPUTargetMachine. llvm-svn: 228849	2015-02-11 17:11:50 +00:00
Jonas Paulsson	bf8d0cc699	Fix SelectionDAG compile time issue with alias analysis. Add new token factor node and its users to worklist if alias analysis is turned on, in DAGCombiner::visitTokenFactor(). Alias analysis may cause a lot of new token factors to be inserted into the DAG, and they need to be optimized to avoid significant slow-downs. Reviewed by Hal Finkel. llvm-svn: 228841	2015-02-11 16:10:31 +00:00
Rafael Espindola	25d2c20c0c	Don't repeat name in comment and clang-format a function. llvm-svn: 228831	2015-02-11 14:44:17 +00:00
James Molloy	7c336576a5	[SimplifyCFG] Swap to using TargetTransformInfo for cost analysis. We're already using TTI in SimplifyCFG, so remove the hard-baked "cheapness" heuristic and use TTI directly. Generally NFC intended, but we're using a slightly different heuristic now so there is a slight test churn. Test changes: * combine-comparisons-by-cse.ll: Removed unneeded branch check. * 2014-08-04-muls-it.ll: Test now doesn't branch but emits muleq. * coalesce-subregs.ll: Superfluous block check. * 2008-01-02-hoist-fp-add.ll: fadd is safe to speculate. Change to udiv. * PhiBlockMerge.ll: Superfluous CFG checking code. Main checks still present. * select-gep.ll: A variable GEP is not expensive, just TCC_Basic, according to the TTI. llvm-svn: 228826	2015-02-11 12:15:41 +00:00
Daniel Sanders	a19216c8f4	[mips] Merge disassemblers into a single implementation. Summary: Currently we have Mips32 and Mips64 disassemblers and this causes the target triple to affect the disassembly despite all the relevant information being in the ELF header. These implementations do not need to be separate. This patch merges them together such that the appropriate tables are checked for the subtarget (e.g. Mips64 is checked when GP64 is enabled). Reviewers: vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7498 llvm-svn: 228825	2015-02-11 11:28:56 +00:00
James Molloy	f147359376	[LoopReroll] Introduce the concept of DAGRootSets. A DAGRootSet models an induction variable being used in a rerollable loop. For example: x[i3+0] = y1 x[i3+1] = y2 x[i3+2] = y3 Base instruction -> i3 +---+----+ / \| \ ST[y1] +1 +2 <-- Roots \| \| ST[y2] ST[y3] There may be multiple DAGRootSets, for example: x[i2+0] = ... (1) x[i2+1] = ... (1) x[i2+4] = ... (2) x[i2+5] = ... (2) x[(i+1234)2+5678] = ... (3) x[(i+1234)2+5679] = ... (3) This concept is similar to the "Scale" member used previously, but allows multiple independent sets of roots based off the same induction variable. llvm-svn: 228821	2015-02-11 09:19:47 +00:00
David Majnemer	fad5a31160	AsmParser: Validate alloca's type An alloca's type should be weird things like metadata. llvm-svn: 228820	2015-02-11 09:13:11 +00:00
David Majnemer	04578fcfa5	DataLayout: Report when the preferred alignment is less than the ABI llvm-svn: 228819	2015-02-11 09:13:09 +00:00
David Majnemer	d7677e7a8d	Verifier: Check for null operands in !llvm.module.flags llvm-svn: 228818	2015-02-11 09:13:06 +00:00
Michael Kuperstein	1921d3d6f3	[X86] Split information collection from actual transformation in call frame optimization This splits collecting information from actually performing the transformation, so that we can add a heuristic in between the two. NFC. Differential Revision: http://reviews.llvm.org/D7497 llvm-svn: 228817	2015-02-11 08:53:55 +00:00
Arnaud A. de Grandmaison	de79026d5e	[PBQP] Cautiously update edge costs in the solver The NodeMetadata are maintained in an incremental way. When an edge between 2 nodes has its cost updated, in the course of graph reduction for example, the NodeMetadata need first to have the old edge cost removed, then the new edge cost added. Only once the NodeMetadata have been fully updated, it becomes safe to consider promoting the nodes to the ConservativelyAllocatable or OptimallyReducible sets. Previously, this promotion was occuring right after the removing the old cost, and this was breaking the assumption that a ConservativelyAllocatable should not be spilled. This patch also adds asserts to: - enforces the invariant that a node's reduction can not be downgraded, - only not provably allocatable or optimally reducible nodes can be spilled. llvm-svn: 228816	2015-02-11 08:25:36 +00:00
David Majnemer	9fd8cdc009	Verifier: Make sure !llvm.ident's operand isn't null llvm-svn: 228815	2015-02-11 08:23:20 +00:00
David Majnemer	300745351f	AsmParser: Don't crash when insertvalue has bad operands llvm-svn: 228813	2015-02-11 07:43:58 +00:00
David Majnemer	19b51054af	AsmParser: Switch some vectors to maps This speeds up parsing .ll files with metadata nodes with large IDs. llvm-svn: 228812	2015-02-11 07:43:56 +00:00
Peter Collingbourne	d20eff0ea6	Fix build for CMake < 2.8.12. llvm-svn: 228810	2015-02-11 05:58:57 +00:00
Zachary Turner	3bd47cee78	Use ADDITIONAL_HEADER_DIRS in all LLVM CMake projects. This allows IDEs to recognize the entire set of header files for each of the core LLVM projects. Differential Revision: http://reviews.llvm.org/D7526 Reviewed By: Chris Bieneman llvm-svn: 228798	2015-02-11 03:28:02 +00:00
Justin Bogner	d24e185784	InstrProf: Lower coverage mappings by setting their sections appropriately Add handling for __llvm_coverage_mapping to the InstrProfiling pass. We need to make sure the constant and any profile names it refers to are in the correct sections, which is easier and cleaner to do here where we have to know about profiling sections anyway. This is really tricky to test without a frontend, so I'm committing the test for the fix in clang. If anyone knows a good way to test this within LLVM, please let me know. Fixes PR22531. llvm-svn: 228793	2015-02-11 02:52:44 +00:00
Andrew Kaylor	7ad134a746	Temporary workaround to fix MSVC 2012 build problems llvm-svn: 228788	2015-02-11 02:16:34 +00:00
Reid Kleckner	96d011315a	Don't promote asynch EH invokes of nounwind functions to calls If the landingpad of the invoke is using a personality function that catches asynch exceptions, then it can catch a trap. Also add some landingpads to invalid LLVM IR test cases that lack them. Over-the-shoulder reviewed by David Majnemer. llvm-svn: 228782	2015-02-11 01:23:16 +00:00
Tom Stellard	94b7231740	R600/SI: Store immediate offsets > 12-bits in soffset This will save us from having to extend these offsets to 64-bits and storing them in a pair of vgprs. llvm-svn: 228776	2015-02-11 00:34:35 +00:00
Tom Stellard	c53861ab84	R600/SI: Add soffset operand to mubuf addr64 instruction We were previously hard-coding soffset to 0. llvm-svn: 228775	2015-02-11 00:34:32 +00:00
Zachary Turner	df3cc51f06	Fix some warnings due to -Wcovered-switch-default. llvm-svn: 228773	2015-02-11 00:13:39 +00:00
Zachary Turner	be6d1e49b0	Convert std::make_unique<> to llvm::make_unique<>. llvm-svn: 228768	2015-02-10 23:46:48 +00:00
Petar Jovanovic	d9f52043b1	Fix makeLibCall argument (signed) in SoftenFloatRes_XINT_TO_FP function The isSigned argument of makeLibCall function was hard-coded to false (unsigned). This caused zero extension on MIPS64 soft float. As the result SingleSource/Benchmarks/Stanford/FloatMM test and SingleSource/UnitTests/2005-07-17-INT-To-FP test failed. The solution was to use the proper argument. Patch by Strahinja Petrovic. Differential Revision: http://reviews.llvm.org/D7292 llvm-svn: 228765	2015-02-10 23:30:14 +00:00
Adrian Prantl	ca7e470221	Debug Info: Support variables that are described by more than one MMI table entry. This happens when SROA splits up an alloca and the resulting allocas cannot be lowered to SSA values because their address is passed to a function. Fixes PR22502. llvm-svn: 228764	2015-02-10 23:18:28 +00:00
Adrian Prantl	d49691f779	Fix indentation. llvm-svn: 228763	2015-02-10 23:18:15 +00:00
David Majnemer	7679300d93	EarlyCSE: It isn't safe to CSE across synchronization boundaries This fixes PR22514. llvm-svn: 228760	2015-02-10 23:09:43 +00:00
Zachary Turner	a5549178f1	Rewrite llvm-pdbdump in terms of LLVMDebugInfoPDB. This makes llvm-pdbdump available on all platforms, although it will currently fail to create a dumper if there is no PDB reader implementation for the current platform. It implements dumping of compilands and children, which is less information than was previously available, but it has to be rewritten from scratch using the new set of interfaces, so the rest of the functionality will be added back in subsequent commits. llvm-svn: 228755	2015-02-10 22:43:25 +00:00
David Majnemer	ca19485f08	X86: @llvm.frameaddress should defer to SelectionDAG for Win CFI llvm-svn: 228754	2015-02-10 22:00:34 +00:00
Simon Atanasyan	0ca59894aa	[Object] Reformat the code with clang-format No functional changes. llvm-svn: 228751	2015-02-10 21:38:25 +00:00
David Majnemer	13d0b11d7b	X86: Make @llvm.frameaddress work correctly with Windows unwind codes Simply loading or storing the frame pointer is not sufficient for Windows targets. Instead, create a synthetic frame object that we will lower later. References to this synthetic object will be replaced with the correct reference to the frame address. llvm-svn: 228748	2015-02-10 21:22:05 +00:00
Zachary Turner	cffff26b68	Provide DIA implementation of DebugInfoPDB. This implements DebugInfoPDB when the DIA SDK is present on the system. Specifically, this means that the following conditions are met: 1) You are building on Windows. 2) You are building with MSVC. 3) Visual Studio did not corrupt the installation of DIA due to a known issue with side-by-side installations of VS2012 and VS2013. If all of these conditions are true, you will be able to pass a value of PDB_Reader::DIA to PDB::createPdbReader(). There are no tests for this yet, as any test will be in the form of a lit test which tests the llvm-pdbdump.exe, which still needs to be rewritten in terms of this library. llvm-svn: 228747	2015-02-10 21:17:52 +00:00
Eric Christopher	f3e79e8714	Reformat (and remove some tabs) to make debugging this code a little easier to step through. llvm-svn: 228746	2015-02-10 21:15:06 +00:00
Andrew Kaylor	78b53dbcc1	Adding support for llvm.eh.begincatch and llvm.eh.endcatch intrinsics and beginning the documentation of native Windows exception handling. Differential Revision: http://reviews.llvm.org/D7398 llvm-svn: 228733	2015-02-10 19:52:43 +00:00
Tim Northover	43c0d2db50	DeadArgElim: arguments affect all returned sub-values by default. Unless we meet an insertvalue on a path from some value to a return, that value will be live if any of the return's components are live, so all of those components must be added to the MaybeLiveUses. Previously we were deleting arguments if sub-value 0 turned out to be dead. llvm-svn: 228731	2015-02-10 19:49:18 +00:00
Bill Schmidt	67f36bd0d8	Fix up r228725, missed change in PPCSubtarget definition llvm-svn: 228728	2015-02-10 19:31:55 +00:00
Duncan P. N. Exon Smith	4ee4a98eaa	IR: Add MDNode::replaceWithPermanent() Add new API for converting temporaries that may self-reference. Self-referencing nodes are not allowed to be uniqued, so sending them into `replaceWithUniqued()` is dangerous (and this commit adds assertions that prevent it). `replaceWithPermanent()` has similar semantics to `get()` followed by calls to `replaceOperandWith()`. In particular, if there's a self-reference, it returns a distinct node; otherwise, it returns a uniqued one. Like `replaceWithUniqued()` and `replaceWithDistinct()` (well, it calls out to them) it mutates the temporary node in place if possible, only calling `replaceAllUsesWith()` on a uniquing collision. llvm-svn: 228726	2015-02-10 19:13:46 +00:00
Bill Schmidt	82f1c775a0	[PowerPC] Fix reverted patch r227976 to avoid register assignment issues See full discussion in http://reviews.llvm.org/D7491. We now hide the add-immediate and call instructions together in a separate pseudo-op, which is tagged to define GPR3 and clobber the call-killed registers. The PPCTLSDynamicCall pass prior to RA now expands this op into the two separate addi and call ops, with explicit definitions of GPR3 on both instructions, and explicit clobbers on the call instruction. The pass is now marked as requiring and preserving the LiveIntervals and SlotIndexes analyses, and fixes these up after the replacement sequences are introduced. Self-hosting has been verified on LE P8 and BE P7 with various optimization levels, etc. It has also been verified with the --no-tls-optimize flag workaround removed. llvm-svn: 228725	2015-02-10 19:09:05 +00:00
David Majnemer	a7d908eb2b	X86: Emit Win64 SaveXMM opcodes at the right offset in the right order Walk the instructions marked FrameSetup and consider any stores of XMM registers to the stack as needing a SaveXMM opcode. This fixes PR22521. Differential Revision: http://reviews.llvm.org/D7527 llvm-svn: 228724	2015-02-10 19:01:47 +00:00
Hal Finkel	57c6ac5e41	[PowerPC] Support the (old) cntlz instruction alias Some old assembly code uses the cntlz alias for cntlzw, binutils supports this, and we should too. Fixes PR22519. llvm-svn: 228719	2015-02-10 18:45:02 +00:00
Colin LeMahieu	404d5b242d	[Hexagon] Adding vector load with post-increment instructions. Adding decoder function for 64bit control register class. llvm-svn: 228708	2015-02-10 16:59:36 +00:00
Zoran Jovanovic	416886793f	[mips][microMIPS] Implement movep instruction Differential Revision: http://reviews.llvm.org/D7465 llvm-svn: 228703	2015-02-10 16:36:20 +00:00

1 2 3 4 5 ...

76887 Commits