llvm-project

Commit Graph

Author	SHA1	Message	Date
Reid Kleckner	98a48afa5d	Revert "[SimplifyCFG] Rewrite SinkThenElseCodeToEnd" This reverts commit r279229. It breaks intrinsic function calls in diamonds. llvm-svn: 279313	2016-08-19 20:22:39 +00:00
Duncan P. N. Exon Smith	11cb5385a9	Reapply "ADT: Tidy up ilist_traits static asserts, NFC" This spiritually reapplies r279012 (reverted in r279052) without the r278974 parts. The differences: - Only the HasGetNext trait exists here, so I've only cleaned up (and tested) it. I still added HasObsoleteCustomization since I know this will be expanding when r278974 is reapplied. - I changed the unit tests to use static_assert to catch problems earlier in the build. - I added negative tests for the type traits. Original commit message follows. ---- Change the ilist traits to use decltype instead of sizeof, and add HasObsoleteCustomization so that additions to this list don't need to be added in two places. I suspect this will now work with MSVC, since the trait tested in r278991 seems to work. If for some reason it continues to fail on Windows I'll follow up by adding back the #ifndef _MSC_VER. llvm-svn: 279312	2016-08-19 20:17:23 +00:00
Tim Northover	b16734fbaa	GlobalISel: translate floating-point constants llvm-svn: 279311	2016-08-19 20:09:15 +00:00
Tim Northover	d3761cd165	GlobalISel: translate float/int conversion instructions. llvm-svn: 279310	2016-08-19 20:09:11 +00:00
Tim Northover	5a28c3642f	GlobalISel: support translating select instructions. llvm-svn: 279309	2016-08-19 20:09:07 +00:00
Tim Northover	b604622bba	GlobalISel: fix insert/extract to work on ConstantExprs too. No tests yet unfortunately (ConstantFolding reduces all supported constants to ConstantInts before we get to translation). Soon. llvm-svn: 279308	2016-08-19 20:09:03 +00:00
Tim Northover	96f981268f	GlobalISel: fix stale comment llvm-svn: 279307	2016-08-19 20:09:01 +00:00
Tim Northover	bbbfb1cfb8	GlobalISel: translate insertvalue instructions. This adds a G_INSERT instruction, which technically makes G_SEQUENCE redundant (it's equivalent to a G_INSERT into an IMPLICIT_DEF). We'll leave G_SEQUENCE for now though: it's likely to be far more common as it's a fundamental part of legalization, so avoiding the mess and bloat of the extra IMPLICIT_DEFs is probably worthwhile. llvm-svn: 279306	2016-08-19 20:08:55 +00:00
Tom Stellard	68726a5359	MachineScheduler: Add constructor functions for the DAGMutations Summary: This way they can be re-used by target-specific schedulers. Reviewers: atrick, MatzeB, kparzysz Subscribers: kparzysz, llvm-commits, MatzeB Differential Revision: https://reviews.llvm.org/D23678 llvm-svn: 279305	2016-08-19 19:59:18 +00:00
Krzysztof Parzyszek	021151d6c1	[Hexagon] Add RUN line to test llvm-svn: 279304	2016-08-19 19:36:35 +00:00
Krzysztof Parzyszek	fb4c4178a2	[Hexagon] Fix subesthetic indentation llvm-svn: 279303	2016-08-19 19:29:15 +00:00
Krzysztof Parzyszek	505eb498bd	[Hexagon] Allow i1 values for 'r' constraint in inline-asm llvm-svn: 279302	2016-08-19 19:17:28 +00:00
Simon Pilgrim	054e7d2ec1	[CostModel][X86] Added sub, or, and, fadd and fsub costs and missing 512-bit mul costs llvm-svn: 279301	2016-08-19 19:07:10 +00:00
Sanjay Patel	7a104615c5	[InstCombine] remove an icmp fold that is already handled by InstSimplify Specifically, this is done near the end of "SimplifyICmpInst" using computeKnownBits() as the broader solution. There are even vector tests (yay!) for this in test/Transforms/InstSimplify/compare.ll. I considered putting an assert here instead of just deleting, but then we could assert every possible fold in InstSimplify in InstCombine, so...less is more? llvm-svn: 279300	2016-08-19 19:03:07 +00:00
Richard Smith	46d396041b	Add missing #include found by modules build. llvm-svn: 279298	2016-08-19 18:57:17 +00:00
Krzysztof Parzyszek	8849a51370	[Hexagon] Do not cache alloca instructions during isel They can be deleted or replicated, so the cache may become outdated. They only need to be visited once during frame lowering, so just scan the function instead. llvm-svn: 279297	2016-08-19 18:46:13 +00:00
Chandler Carruth	9b35e6d746	[PM] Re-instate r279227 and r279228 with a fix to the way the templating was done to hopefully appease MSVC. As an upside, this also implements the suggestion Sanjoy made in code review, so two for one! =] I'll be watching the bots to see if there are still issues. llvm-svn: 279295	2016-08-19 18:36:06 +00:00
Tim Northover	26b76f2c59	GlobalISel: improve representation of G_SEQUENCE and G_EXTRACT First, make sure all types involved are represented, rather than being implicit from the register width. Second, canonicalize all types to scalar. These operations just act in bits and don't care about vectors. Also standardize spelling of Indices in the MachineIRBuilder (NFC here). llvm-svn: 279294	2016-08-19 18:32:14 +00:00
Simon Pilgrim	fbfa3ee4f6	[CostModel][X86] Added some AVX512 and 512-bit vector cost tests llvm-svn: 279291	2016-08-19 18:24:10 +00:00
Kyle Butt	5b10483618	Revert "IfConversion: Rescan diamonds." This reverts commit bfd62a4b4465dd21811bf615c3b04c30ddb09f7b. llvm-svn: 279289	2016-08-19 18:17:06 +00:00
Kyle Butt	ce0196de3f	Revert "CodeGen: If Convert blocks that would form a diamond when tail-merged." This reverts commit 0fda93481c4231c06b838ef476c0c404c51ff875. llvm-svn: 279288	2016-08-19 18:17:04 +00:00
Tim Northover	2fa5fa391f	GlobalISel: allow extractvalue to extract an aggregate. llvm-svn: 279287	2016-08-19 18:09:41 +00:00
Krzysztof Parzyszek	3d9946eb23	[Hexagon] Fixes for new-value jump formation - Recognize C2_cmpgtui, S2_tstbit_i, and S4_ntstbit_i. - Avoid creating new-value instructions with both source operands equal. llvm-svn: 279286	2016-08-19 17:54:49 +00:00
Tim Northover	6f80b08c64	GlobalISel: support translation of extractvalue instructions. llvm-svn: 279285	2016-08-19 17:47:05 +00:00
Simon Pilgrim	e309d2d0c3	[CostModel][X86] Add fdiv + frem cost tests llvm-svn: 279283	2016-08-19 17:39:00 +00:00
Sanjay Patel	e38e79c3e6	[InstCombine] use local variables to reduce code in foldICmpShlConstant; NFC llvm-svn: 279282	2016-08-19 17:34:05 +00:00
Krzysztof Parzyszek	5a7bef9c14	[Hexagon] Fix a few omissions in HexagonInstrInfo llvm-svn: 279280	2016-08-19 17:20:57 +00:00
Sanjay Patel	38b7506f75	[InstCombine] rename variables in foldICmpShlConstant(); NFC llvm-svn: 279279	2016-08-19 17:20:37 +00:00
Tim Northover	91c8173093	GlobalISel: support overflow arithmetic intrinsics. Unsigned addition and subtraction can reuse the instructions created to legalize large width operations (i.e. both produce and consume a carry flag). Signed operations and multiplies get a dedicated op-with-overflow instruction. Once this is produced the two values are combined into a struct register (which will almost always be merged with a corresponding G_EXTRACT as part of legalization). llvm-svn: 279278	2016-08-19 17:17:06 +00:00
Vitaly Buka	170dede75d	Revert "[asan] Optimize store size in FunctionStackPoisoner::poisonRedZones" This reverts commit r279178. Speculative revert in hope to fix asan crash on arm. llvm-svn: 279277	2016-08-19 17:15:38 +00:00
Vitaly Buka	c8f4d69c82	Revert "[asan] Fix size of shadow incorrectly calculated in r279178" This reverts commit r279222. Speculative revert in hope to fix asan crash on arm. llvm-svn: 279276	2016-08-19 17:15:33 +00:00
Lang Hames	6e9f0309e9	[RuntimeDyld] Revert r279182 and 279201 -- they broke some ARM bots. llvm-svn: 279275	2016-08-19 17:06:39 +00:00
Michael Kuperstein	41898f0396	[AliasSetTracker] Degrade AliasSetTracker when may-alias sets get too large. Repeated inserts into AliasSetTracker have quadratic behavior - inserting a pointer into AST is linear, since it requires walking over all "may" alias sets and running an alias check vs. every pointer in the set. We can avoid this by tracking the total number of pointers in "may" sets, and when that number exceeds a threshold, declare the tracker "saturated". This lumps all pointers into a single "may" set that aliases every other pointer. (This is a stop-gap solution until we migrate to MemorySSA) This fixes PR28832. Differential Revision: https://reviews.llvm.org/D23432 llvm-svn: 279274	2016-08-19 17:05:22 +00:00
Simon Pilgrim	d7a3782ae4	[X86][SSE] Generalised combining to VZEXT_MOVL to any vector size This doesn't change tests codegen as we already combined to blend+zero which is what we lower VZEXT_MOVL to on SSE41+ targets, but it does put us in a better position when we improve shuffling for optsize. llvm-svn: 279273	2016-08-19 17:02:00 +00:00
Krzysztof Parzyszek	639545b4d8	[Hexagon] Enforce LLSC packetization rules Ensure that load locked and store conditional instructions are only packetized with ALU32 instructions. Patch by Ben Craig. llvm-svn: 279272	2016-08-19 16:57:05 +00:00
Reid Kleckner	a871d3872a	Fix regression in InstCombine introduced by r278944 The intended transform is: // Simplify icmp eq (or (ptrtoint P), (ptrtoint Q)), 0 // -> and (icmp eq P, null), (icmp eq Q, null). P and Q are both pointer types, but may have different types. We need two calls to getNullValue() to make the icmps. llvm-svn: 279271	2016-08-19 16:53:18 +00:00
Tom Stellard	9d7ac684a9	MachineScheduler: Make some GenericScheduler member variables protected Summary: We will need these in AMDGPU's new SchedStrategy implmentation. Reviewers: MatzeB, atrick Subscribers: llvm-commits, MatzeB Differential Revision: https://reviews.llvm.org/D23679 llvm-svn: 279270	2016-08-19 16:44:32 +00:00
Krzysztof Parzyszek	b7640d4df0	[Hexagon] Minor updates to register definitions llvm-svn: 279269	2016-08-19 16:40:19 +00:00
David Majnemer	5554edabef	[CloneFunction] Don't remove unrelated nodes from the CGSSC CGSCC use a WeakVH to track call sites. RAUW a call within a function can result in that WeakVH getting confused about whether or not the call site is still around. llvm-svn: 279268	2016-08-19 16:37:40 +00:00
Krzysztof Parzyszek	9335bf0ec5	[Hexagon] Fix incorrect generation of S4_subi_asl_ri Patch by Jyotsna Verma. llvm-svn: 279267	2016-08-19 16:35:05 +00:00
Sanjay Patel	a867afe094	[InstCombine] use m_APInt to allow icmp (shl 1, Y), C folds for splat constant vectors llvm-svn: 279266	2016-08-19 16:12:16 +00:00
Krzysztof Parzyszek	dddb097a1f	[Hexagon] Add missing pattern for C4_cmplte llvm-svn: 279265	2016-08-19 16:11:33 +00:00
Sanjay Patel	57b12d3876	[InstCombine] use m_APInt to allow icmp X, C folds for splat constant vectors Of course, we really need to refactor and fix all of the cmp predicates, but this one is interesting because without it, we later perform an information-losing transform of icmp (shl 1, Y), C, and we can't recover the better fold. llvm-svn: 279263	2016-08-19 15:40:44 +00:00
Mehdi Amini	9989f80ae8	[LTO] Remove dead-code: collectUsedGlobalVariables has been moved to Thin and LTO specifc path (NFC) llvm-svn: 279261	2016-08-19 15:35:44 +00:00
Sanjay Patel	78111a7617	[InstCombine] add tests for missing vector icmp folds llvm-svn: 279259	2016-08-19 15:27:28 +00:00
Sanjay Patel	14cdf1968f	[InstCombine] add missing tests for basic icmp folds These are implicitly included as part of larger test cases, but they don't exist stand-alone (and don't happen for vectors...). llvm-svn: 279257	2016-08-19 15:21:45 +00:00
Krzysztof Parzyszek	0b8672269c	[Hexagon] Make p0 an explicit operand in VA1_clr* subinstructions, NFC llvm-svn: 279255	2016-08-19 15:17:19 +00:00
Krzysztof Parzyszek	6ce82951c3	[Hexagon] Add explicit default constructor for HexagonSelectionDAGInfo llvm-svn: 279254	2016-08-19 15:13:54 +00:00
Krzysztof Parzyszek	7d200668e4	Unxfail passing tests on Hexagon llvm-svn: 279252	2016-08-19 15:07:58 +00:00
Krzysztof Parzyszek	0ba9754584	[Hexagon] Allow tail-call optimization when mixing C and fast calling conv Patch by Arnold Schwaighofer. llvm-svn: 279251	2016-08-19 15:02:18 +00:00
Krzysztof Parzyszek	66dd6797e8	[Hexagon] Check for empty live interval Patch by Brendon Cahoon. llvm-svn: 279249	2016-08-19 14:29:43 +00:00
Krzysztof Parzyszek	db019ae801	[Hexagon] Consider zext/sext of a load to i32 to be free llvm-svn: 279248	2016-08-19 14:22:07 +00:00
Anton Korobeynikov	b38195c1a8	Revert r279242 - it's failing the tests llvm-svn: 279247	2016-08-19 14:18:34 +00:00
Krzysztof Parzyszek	a243adfd27	[Hexagon] Handle J2_jumptpt and J2_jumpfpt instructions llvm-svn: 279246	2016-08-19 14:14:09 +00:00
Krzysztof Parzyszek	067debe0a0	[Hexagon] Fix indentation, NFC llvm-svn: 279245	2016-08-19 14:12:51 +00:00
Krzysztof Parzyszek	9273ecc176	[Hexagon] Remove unnecessary llvm::, NFC llvm-svn: 279244	2016-08-19 14:10:57 +00:00
Krzysztof Parzyszek	75e74ee699	[Hexagon] Rename the HEXAGON_MC namespace to Hexagon_MC, NFC llvm-svn: 279243	2016-08-19 14:09:47 +00:00
Anton Korobeynikov	2aae31a945	Fix PR27500: on MSP430 the branch destination offset is measured in words, not bytes. In addition, the branch instructions will have proper BB destinations, not offsets, like before. Patch by Vadzim Dambrouski! Differential Revision: https://reviews.llvm.org/D20162 llvm-svn: 279242	2016-08-19 14:07:10 +00:00
Krzysztof Parzyszek	6421b934ec	[Hexagon] Mark PS_jumpret as pseudo-instruction, expand it into J2_jumpr llvm-svn: 279241	2016-08-19 14:04:45 +00:00
Krzysztof Parzyszek	bd8ef4b8ce	[Hexagon] Improvements to handling and generation of FP instructions Improved handling of fma, floating point min/max, additional load/store instructions for floating point types. Patch by Jyotsna Verma. llvm-svn: 279239	2016-08-19 13:34:31 +00:00
Benjamin Kramer	96fcf5df03	[LoopVectorize] Don't copy std::vector in for-range loop. llvm-svn: 279233	2016-08-19 12:44:24 +00:00
Chandler Carruth	b8824a5d3f	[PM] Revert r279227 and r279228 until I can find someone to help me solve completely opaque MSVC build errors. It complains about lots of stuff with this change without givin nearly enough information to even try to fix. llvm-svn: 279231	2016-08-19 10:51:55 +00:00
Simon Pilgrim	f1b8fdc074	[X86][SSE] Add support for matching commuted insertps patterns INSERTPS doesn't fit well with our shuffle mask canonicalization, so we need to attempt both the original mask and the commuted mask to more likely get a match llvm-svn: 279230	2016-08-19 10:31:53 +00:00
James Molloy	11a1936b70	[SimplifyCFG] Rewrite SinkThenElseCodeToEnd The new version has several advantages: 1) IMSHO it's more readable and neater 2) It handles loads and stores properly 3) It can handle any number of incoming blocks rather than just two. I'll be taking advantage of this in a followup patch. With this change we can now finally sink load-modify-store idioms such as: if (a) return b += 3; else return b += 4; => %z = load i32, i32* %y %.sink = select i1 %a, i32 5, i32 7 %b = add i32 %z, %.sink store i32 %b, i32* %y ret i32 %b When this works for switches it'll be even more powerful. llvm-svn: 279229	2016-08-19 10:10:27 +00:00
Chandler Carruth	5dbc90a8f1	[PM] Fix a compile error with GCC. NFC. llvm-svn: 279228	2016-08-19 09:53:10 +00:00
Chandler Carruth	db1759ace1	[PM] Make the the new pass manager support fully generic extra arguments to run methods, both for transform passes and analysis passes. This also allows the analysis manager to use a different set of extra arguments from the pass manager where useful. Consider passes over analysis produced units of IR like SCCs of the call graph or loops. Passes of this nature will often want to refer to the analysis result that was used to compute their IR units (the call graph or LoopInfo). And for transformations, they may want to communicate special update information to the outer pass manager. With this change, it becomes possible to have a run method for a loop pass that looks more like: PreservedAnalyses run(Loop &L, AnalysisManager<Loop, LoopInfo> &AM, LoopInfo &LI, LoopUpdateRecord &UR); And to query the analysis manager like: AM.getResult<MyLoopAnalysis>(L, LI); This makes accessing the known-available analyses convenient and clear, and it makes passing customized data structures around easy. My initial use case is going to be in updating the pass manager layers when the analysis units of IR change. But there are more use cases here such as having a layer that lets inner passes signal whether certain additional passes should be run because of particular simplifications made. Two desires for this have come up in the past: triggering additional optimization after successfully unrolling loops, and triggering additional inlining after collapsing indirect calls to direct calls. Despite adding this layer of generic extensibility, the only change to existing, simple usage are for places where we forward declare the AnalysisManager template. We really shouldn't be doing this because of the fragility exposed here, but currently it makes coping with the legacy PM code easier. Differential Revision: http://reviews.llvm.org/D21462 llvm-svn: 279227	2016-08-19 09:45:16 +00:00
Chandler Carruth	6b6375b1d0	[PM] Try to work-around what appears to be an MSVC SFINAE issue with r279217 where it fails to select the path that other compilers select. The workaround won't be as careful to produce an error when an analysis result is incorrect, but we can rely on non-MSVC builds to catch such errors it seems and MSVC doesn't seem to support the alternative techniques. Hoping this brings the windows bots back to life. If not, will have to revert all of this. llvm-svn: 279225	2016-08-19 09:26:00 +00:00
James Molloy	7ee640f9b6	[CodeGen] Fix a trivial type conversion bug dating back to pre-2008 The heuristic above this code is incredibly suspect, but disregarding that it mutates the cast opcode so we need to check the mutated opcode later to see if we need to emit an AssertSext or AssertZext node. Fixes PR29041. llvm-svn: 279223	2016-08-19 08:38:50 +00:00
Vitaly Buka	b81960a6c8	[asan] Fix size of shadow incorrectly calculated in r279178 Summary: r279178 generates 8 times more stores than necessary. Reviewers: eugenis Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D23708 llvm-svn: 279222	2016-08-19 08:33:53 +00:00
Chandler Carruth	6d6310dd4a	[PM] NFC refactoring: remove the AnalysisManagerBase class, folding it into the AnalysisManager class template. Back when I first added this base class there were separate analysis managers and some plausible reason why it would be a useful factoring of common code between them. However, after a lot of refactoring cleaning, we now have entirely shared code. The base class was just an arbitrary division between code in one class template and a separate class template. It didn't add anything and forced lots of indirection through "derived_this" for no real gain. We can always factor a base CRTP class out with common code if there is ever some other analysis manager that wants to share a subset of logic. But for now, folding things into the primary template is a non-trivial simplification with no down sides I see. It shortens the code considerably, removes an unhelpful abstraction, and will make subsequent patches dramatically less complex which enhance the analysis manager infrastructure to effectively cope with invalidation. llvm-svn: 279221	2016-08-19 08:31:47 +00:00
Vassil Vassilev	8fa30f2829	[modules] Add missing include. llvm-svn: 279219	2016-08-19 08:30:42 +00:00
Chandler Carruth	92d3c7e8e2	[PM] Redesign how the new PM detects whether an analysis result provides its own invalidate method. Previously, the technique would assume that if a result didn't have an invalidate method that didn't exactly match the expected signature it didn't have one at all. This is in fact not the case. And we had analyses with incorrect signatures for the invalidate method in the tree that would be erroneously invalidated in certain cases! Yikes. Moreover a result might legitimately want to have multiple overloads for the invalidate method, and if one changes or a new one is needed we again really want a compiler error. For example in the tree we had not added the overload for a function IR unit to the invalidate routine for TLI. Doh. So a new techique for the SFINAE detection here: if the result has any member spelled "invalidate" we turn off the synthesis of a default version. We don't care if it is a member function or a member variable or how many overloads there are. Once a result has something by that name it must provide suitable overloads for the contexts in which it is used. This seems much more resilient and durable. Huge props to Richard Smith who helped me figure out how on earth we could even do this in C++. It took quite some doing. The technique is remarkably clean however, and merely requires that the analysis results are not final classes. I think that's a requirement we can live with even if it is a bit odd. I've fixed the two bad in-tree analysis results. And this will make my next change which changes the API for invalidate much easier to validate as correct. llvm-svn: 279217	2016-08-19 07:49:23 +00:00
Chandler Carruth	b7be5b6479	[PM] Rework the new PM support for building the ModuleSummaryIndex to directly produce the index as the value type result. This requires making the index movable which is straightforward. It greatly simplifies things by allowing us to completely avoid the builder API and the layers of abstraction inherent there. Instead both pass managers can directly construct these when run by value. They still won't be constructed truly eagerly thanks to the optional in the legacy PM. The code that directly builds the index can also just share a direct function. A notable change here is that the result type of the analysis for the new PM is no longer a reference type. This was really problematic when making changes to how we handle result types to make our interface requirements much more strict and precise. But I think this is an overall improvement. Differential Revision: https://reviews.llvm.org/D23701 llvm-svn: 279216	2016-08-19 07:49:19 +00:00
NAKAMURA Takumi	a535636759	Fix tests in llvm/test/tools/gold/X86 to satisfy r279014. They would unexpectedly pass if test/tools/gold/X86/Output had outputs of previous tests. llvm-svn: 279214	2016-08-19 06:44:44 +00:00
Xinliang David Li	63248ab888	[Profile] Fix edge count read bug Use uint64_t to avoid value truncation before scaling. llvm-svn: 279213	2016-08-19 06:31:45 +00:00
Mehdi Amini	18b91112af	[LTO] Move callback member from base class to the derived where it is used (NFC) llvm-svn: 279212	2016-08-19 06:10:03 +00:00
Mehdi Amini	cc1fe9b9d6	Constify some path in the bitcode writer (NFC) llvm-svn: 279211	2016-08-19 06:06:18 +00:00
Mehdi Amini	026ddbb4d6	[LTO] Add a move to inialize member in ctor initialization list (NFC) llvm-svn: 279210	2016-08-19 05:56:37 +00:00
Xinliang David Li	2c9336823c	[Profile] Simple code refactoring for reuse /NFC llvm-svn: 279209	2016-08-19 05:31:33 +00:00
Dean Michael Berris	1dd1ca9727	[XRay] Synthesize a reference to the xray_instr_map Without the synthesized reference to a symbol in the xray_instr_map, linker section garbage collection will helpfully remove the whole xray_instr_map section from the final executable (or archive). This will cause the runtime to not be able to identify the sleds and hot-patch the calls/jumps into the runtime trampolines. This change adds a reference from the text section at the end of the function to keep around the associated xray_instr_map section as well. We also make sure that we catch this reference in the test. Reviewers: chandlerc, echristo, majnemer, mehdi_amini Subscribers: mehdi_amini, llvm-commits, dberris Differential Revision: https://reviews.llvm.org/D23398 llvm-svn: 279204	2016-08-19 04:44:30 +00:00
Lang Hames	e2ca3b65fc	[RuntimeDyld][MCJIT] Un-XFAIL some tests that were fixed by r279182. llvm-svn: 279201	2016-08-19 03:12:16 +00:00
Matthias Braun	fdc4c6b426	Revert "RegScavenging: Add scavengeRegisterBackwards()" The ppc64 multistage bot fails on this. This reverts commit r279124. Also Revert "CodeGen: Add/Factor out LiveRegUnits class; NFCI" because it depends on the previous change This reverts commit r279171. llvm-svn: 279199	2016-08-19 03:03:24 +00:00
Chandler Carruth	e8529c28f1	[ADT] Add the worlds simplest STL extra. Or at least close to it. This is a little class template that just builds an inheritance chain of empty classes. Despite how simple this is, it can be used to really nicely create ranked overload sets. I've added a unittest as much to document this as test it. You can pass an object of this type as an argument to a function overload set an it will call the first viable and enabled candidate at or below the rank of the object. I'm planning to use this in a subsequent commit to more clearly rank overload candidates used for SFINAE. All credit for this technique and both lines of code here to Richard Smith who was helping me rewrite the SFINAE check in question to much more effectively capture the intended set of checks. llvm-svn: 279197	2016-08-19 02:07:51 +00:00
Lang Hames	b65f16c8e5	[RuntimeDyld] Add support for ELF R_ARM_REL32 and R_ARM_GOT_PREL. Patch by William Dillon. Thanks William! This patch adds support for the R_ARM_REL32 and R_ARM_GOT_PREL ELF ARM relocations to RuntimeDyld, which should allow JITing of code that produces these relocations. No test case: Unfortunately RuntimeDyldELF's GOT building mechanism (which uses a separate section for GOT entries) isn't compatible with RuntimeDyldChecker. The correct fix for this is to fix RuntimeDyldELF's GOT support (it's fundamentally broken at the moment: separate sections aren't guaranteed to be in range of a GOT entry load), but that's a non-trivial job. llvm-svn: 279182	2016-08-19 01:15:39 +00:00
Vitaly Buka	aa654292bd	[asan] Optimize store size in FunctionStackPoisoner::poisonRedZones Summary: Reduce store size to avoid leading and trailing zeros. Reviewers: kcc, eugenis Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D23648 llvm-svn: 279178	2016-08-18 23:51:15 +00:00
Andrew Kaylor	81901d658f	Include X86CallFrameOptimization in the opt-bisect process. Differential Revision: https://reviews.llvm.org/D23683 llvm-svn: 279175	2016-08-18 22:49:51 +00:00
Saleem Abdulrasool	dab786fb78	AArch64: remove extraneous padding The structs BarrierOp, PrefetchOp, PSBHintOp are in AArch64AsmParser.cpp (inside anonymous namespace). This diff changes the order of fields and removes the excessive padding (8 bytes). Patch by Alexander Shaposhnikov! llvm-svn: 279173	2016-08-18 22:35:06 +00:00
Chris Bieneman	9f1fd7371e	[CMake] Add variables for tracking which runtimes are included This allows sub-projects to have conditionals based on the presence of other projects. llvm-svn: 279172	2016-08-18 22:18:11 +00:00
Matthias Braun	91f95f0201	CodeGen: Add/Factor out LiveRegUnits class; NFCI This is a set of register units intended to track register liveness, it is similar in spirit to LivePhysRegs. You can also think of this as the liveness tracking parts of the RegisterScavenger factored out into an own class. This was proposed in http://llvm.org/PR27609 Differential Revision: http://reviews.llvm.org/D21916 llvm-svn: 279171	2016-08-18 22:11:28 +00:00
Jacques Pienaar	bfa5ea0818	Fix link quotes on AArch64's CompilerWriterInfo section. Reviewers: t.p.northover Subscribers: t.p.northover, aemerson, rengolin Differential Revision: https://reviews.llvm.org/D23697 llvm-svn: 279169	2016-08-18 22:10:06 +00:00
Kyle Butt	780b517d6b	CodeGen: If Convert blocks that would form a diamond when tail-merged. The following function currently relies on tail-merging for if conversion to succeed. The common tail of cond_true and cond_false is extracted, and this then forms a diamond pattern that can be successfully if converted. If this block does not get extracted, either because tail-merging is disabled or the threshold is higher, we should still recognize this pattern and if-convert it. Fixed a regression in the original commit. Need to un-reverse branches after reversing them, or other conversions go awry. Regression on self-hosting bots with no obvious explanation. Tidied up range handling to be more obviously correct, but there was no smoking gun. define i32 @t2(i32 %a, i32 %b) nounwind { entry: %tmp1434 = icmp eq i32 %a, %b ; <i1> [#uses=1] br i1 %tmp1434, label %bb17, label %bb.outer bb.outer: ; preds = %cond_false, %entry %b_addr.021.0.ph = phi i32 [ %b, %entry ], [ %tmp10, %cond_false ] %a_addr.026.0.ph = phi i32 [ %a, %entry ], [ %a_addr.026.0, %cond_false ] br label %bb bb: ; preds = %cond_true, %bb.outer %indvar = phi i32 [ 0, %bb.outer ], [ %indvar.next, %cond_true ] %tmp. = sub i32 0, %b_addr.021.0.ph %tmp.40 = mul i32 %indvar, %tmp. %a_addr.026.0 = add i32 %tmp.40, %a_addr.026.0.ph %tmp3 = icmp sgt i32 %a_addr.026.0, %b_addr.021.0.ph br i1 %tmp3, label %cond_true, label %cond_false cond_true: ; preds = %bb %tmp7 = sub i32 %a_addr.026.0, %b_addr.021.0.ph %tmp1437 = icmp eq i32 %tmp7, %b_addr.021.0.ph %indvar.next = add i32 %indvar, 1 br i1 %tmp1437, label %bb17, label %bb cond_false: ; preds = %bb %tmp10 = sub i32 %b_addr.021.0.ph, %a_addr.026.0 %tmp14 = icmp eq i32 %a_addr.026.0, %tmp10 br i1 %tmp14, label %bb17, label %bb.outer bb17: ; preds = %cond_false, %cond_true, %entry %a_addr.026.1 = phi i32 [ %a, %entry ], [ %tmp7, %cond_true ], [ %a_addr.026.0, %cond_false ] ret i32 %a_addr.026.1 } Without tail-merging or diamond-tail if conversion: LBB1_1: @ %bb @ =>This Inner Loop Header: Depth=1 cmp r0, r1 ble LBB1_3 @ BB#2: @ %cond_true @ in Loop: Header=BB1_1 Depth=1 subs r0, r0, r1 cmp r1, r0 it ne cmpne r0, r1 bgt LBB1_4 LBB1_3: @ %cond_false @ in Loop: Header=BB1_1 Depth=1 subs r1, r1, r0 cmp r1, r0 bne LBB1_1 LBB1_4: @ %bb17 bx lr With diamond-tail if conversion, but without tail-merging: @ BB#0: @ %entry cmp r0, r1 it eq bxeq lr LBB1_1: @ %bb @ =>This Inner Loop Header: Depth=1 cmp r0, r1 ite le suble r1, r1, r0 subgt r0, r0, r1 cmp r1, r0 bne LBB1_1 @ BB#2: @ %bb17 bx lr llvm-svn: 279168	2016-08-18 22:09:27 +00:00
Kyle Butt	491afad8f6	IfConversion: Rescan diamonds. The cost of predicating a diamond is only the instructions that are not shared between the two branches. Additionally If a predicate clobbering instruction occurs in the shared portion of the branches (e.g. a cond move), it may still be possible to if convert the sub-cfg. This change handles these two facts by rescanning the non-shared portion of a diamond sub-cfg to recalculate both the predication cost and whether both blocks are pred-clobbering. llvm-svn: 279167	2016-08-18 22:09:25 +00:00
Kyle Butt	d76755ec95	IfConversion: Handle inclusive ranges more carefully. This may affect calculations for thresholds, but is not a significant change in behavior. The problem was that an inclusive range must have an additonal flag to showr that it is empty, because otherwise begin == end implies that the range has one element, and it may not be possible to move past on either side. llvm-svn: 279166	2016-08-18 22:09:23 +00:00
Hemant Kulkarni	e77a0a9a3b	llvm-objdump: Add Hexagon printer changes for -S/-l options Differential Revision: https://reviews.llvm.org/D23521 llvm-svn: 279161	2016-08-18 21:50:13 +00:00
Chris Bieneman	1bb239c091	[CMake] Create convenience targets for runtime projects Each runtime project has a top-level target that is the name of the runtime (minus the "lib" prefix if applicable). This creates top-level targets mapping to runtime projects. llvm-svn: 279160	2016-08-18 21:47:18 +00:00
Zhan Jun Liau	cf2f4b3251	[SystemZ] Use valid base/index regs for inline asm Summary: Inline asm memory constraints can have the base or index register be assigned to %r0 right now. Make sure that we assign only ADDR64 registers to the base and index. Reviewers: uweigand Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D23367 llvm-svn: 279157	2016-08-18 21:44:15 +00:00
Tim Shen	36bde4f81d	[Analysis] Change several Analysis pieces to use NodeRef. NFC. Reviewers: dblaikie, grosser Subscribers: mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D23625 llvm-svn: 279156	2016-08-18 21:41:24 +00:00
Chris Bieneman	997ee2b8cf	[CMake] Make llvm-config implicit dependency for subprojects The subproject interface being used for runtime libraries expects that llvm-config is passed into the subproject for consumption. We currently do this for every subproject, so we should expect that all LLVM ExternalProjects depend on llvm-config for the time being. Eventually I'd like to see the sub-projects using LLVMConfig.cmake instead of the llvm-config binary, but that will take time to roll out. llvm-svn: 279155	2016-08-18 21:41:21 +00:00
Chris Bieneman	a7c87a9430	[CMake] Minor fix to regex in r279152 The third version component is optional in Xcode's version spew, so we need to make it optional in the regex. llvm-svn: 279153	2016-08-18 21:36:36 +00:00
Chris Bieneman	11eed999c4	[CMake] Support for generating Xcode 8 compatible toolchains Xcode 8 requires toolchain compatibility version 2. This allows us to select the correct compatibility version based on the installed version of Xcode. llvm-svn: 279152	2016-08-18 21:32:48 +00:00
Sanjay Patel	98cd99dfc6	[InstCombine] add helper function for folds of icmp (shl 1, Y), C; NFCI Clean up the existing code by: 1. Renaming variables 2. Adding local variables 3. Making it vector-safe This is still guarded by a ConstantInt check, so no functional change is intended. But this should be ready to go: if we move the ConstantInt check down, all of these folds should do the right thing for vector types. llvm-svn: 279150	2016-08-18 21:28:30 +00:00
Jacques Pienaar	2b25799bcc	[lanai] Add ISA document to CompilerWritersInfo Summary: Add Lanai ISA document to CompilerWritersInfo. Reviewers: eliben Subscribers: aemerson, llvm-commits Differential Revision: https://reviews.llvm.org/D23693 llvm-svn: 279149	2016-08-18 21:25:17 +00:00
Tom Stellard	a1619cd9aa	AMDGPU/SI: Fix a test in wqm.ll to always use s_cbranch_vcc* Summary: We need to use floating-point compares to ensure that s_cbranch_vcc* instructions are always generated. With integer compares, future optimizations could cause s_cbranch_scc* to be generated instead. Reviewers: arsenm, nhaehnle Subscribers: llvm-commits, kzhuravl Differential Revision: https://reviews.llvm.org/D23401 llvm-svn: 279148	2016-08-18 21:21:53 +00:00
Kostya Serebryany	32661f9d66	[libFuzzer] add more __attribute__((visibility("default"))) llvm-svn: 279143	2016-08-18 20:52:52 +00:00
Amaury Sechet	763c59dc9a	Make cltz and cttz zero undef when the operand cannot be zero in InstCombine Summary: Also add popcount(n) == bitsize(n) -> n == -1 transformation. Reviewers: majnemer, spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D23134 llvm-svn: 279141	2016-08-18 20:43:50 +00:00
Sanjay Patel	40e8ca46ad	[InstCombine] use m_APInt to allow icmp (trunc X, Y), C folds for splat constant vectors This is a sibling of: https://reviews.llvm.org/rL278859 https://reviews.llvm.org/rL278935 https://reviews.llvm.org/rL278945 https://reviews.llvm.org/rL279066 https://reviews.llvm.org/rL279077 https://reviews.llvm.org/rL279101 llvm-svn: 279133	2016-08-18 20:28:54 +00:00
Sanjay Patel	5f4ce4e23d	[InstCombine] clean up foldICmpTruncConstant(); NFCI 1. Fix variable names 2. Add local variables to reduce code llvm-svn: 279132	2016-08-18 20:25:16 +00:00
Michael Kuperstein	2bc3d4d46c	[SelectionDAG] Rename fextend -> fpextend, fround -> fpround, frnd -> fround The names of the tablegen defs now match the names of the ISD nodes. This makes the world a slightly saner place, as previously "fround" matched ISD::FP_ROUND and not ISD::FROUND. Differential Revision: https://reviews.llvm.org/D23597 llvm-svn: 279129	2016-08-18 20:08:15 +00:00
Wei Ding	52bb661dec	AMDGPU : Fix QSAD and MQSAD instructions' incorrect data type. Differential Revision: http://reviews.llvm.org/D23689 llvm-svn: 279126	2016-08-18 19:51:14 +00:00
Matthew Simpson	11db6b6b8c	[SLP] Initialize VectorizedValue when gathering We abort building vectorizable trees in some cases (e.g., if the maximum recursion depth is reached, if the region size is too large, etc.). If this happens for a reduction, we can be left with a root entry that needs to be gathered. For these cases, we need make sure we actually set VectorizedValue to the resulting vector. This patch ensures we properly set VectorizedValue, and it also ensures the insertelement sequence generated for the gathers is inserted at the correct location. Reference: https://llvm.org/bugs/show_bug.cgi?id=28330 Differential Revison: https://reviews.llvm.org/D23410 llvm-svn: 279125	2016-08-18 19:50:32 +00:00
Matthias Braun	075d0c23d5	RegScavenging: Add scavengeRegisterBackwards() Re-apply r276044 with off-by-1 instruction fix for the reload placement. This is a variant of scavengeRegister() that works for enterBasicBlockEnd()/backward(). The benefit of the backward mode is that it is not affected by incomplete kill flags. This patch also changes PrologEpilogInserter::doScavengeFrameVirtualRegs() to use the register scavenger in backwards mode. Differential Revision: http://reviews.llvm.org/D21885 llvm-svn: 279124	2016-08-18 19:47:59 +00:00
Kyle Butt	64e428147f	Branch Folding: Accept explicit threshold for tail merge size. This is prep work for allowing the threshold to be different during layout, and to enforce a single threshold between merging and duplicating during layout. No observable change intended. llvm-svn: 279117	2016-08-18 18:57:29 +00:00
Pete Cooper	a8db71e840	Add a version of Intrinsic::getName which is more efficient when there are no overloads. When running 'opt -O2 verify-uselistorder-nodbg.lto.bc', there are 33m allocations. 8.2m come from std::string allocations in Intrinsic::getName(). Turns out this method only returns a std::string because it needs to handle overloads, but that is not the common case. This adds an overload of getName which just returns a StringRef when there are no overloads and so saves on the allocations. llvm-svn: 279113	2016-08-18 18:30:54 +00:00
Simon Pilgrim	99fd9c5f56	[X86][SSE] Missed insertps shuffle patterns llvm-svn: 279111	2016-08-18 18:19:28 +00:00
Chris Bieneman	69f289cf1d	[CMake] Silence message on multi-configuration generators The Xcode and Visual Studio generators always log "-- No build type selected, default to Debug". This is because CMake doesn't initialize "CMAKE_CONFIGURATION_TYPES" until the generator's EnableLanguage call gets hit. The first place EnableLanguage gets hit in our configuration is in the project() call. Since CMAKE_BUILD_TYPE isn't used until after we call project() it is safe to just move this check down a bit. llvm-svn: 279110	2016-08-18 18:17:28 +00:00
Vitaly Buka	0596387ad3	[asan] Extend test Summary: PR27453 Reviewers: eugenis Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D23647 llvm-svn: 279109	2016-08-18 18:17:19 +00:00
Valery Pykhtin	609c2f8137	[AMDGPU] add s_incperflevel/s_decperflevel intrinsics. Differential revision: https://reviews.llvm.org/D23666 llvm-svn: 279106	2016-08-18 18:06:20 +00:00
Elliot Colp	687691aeac	Fix SystemZ compilation abort caused by negative AND mask Normally, when an AND with a constant is lowered to NILL, the constant value is truncated to 16 bits. However, since r274066, ANDs whose results are used in a shift are caught by a different pattern that does not truncate. The instruction printer expects a 16-bit unsigned immediate operand for NILL, so this results in an abort. This patch adds code to manually truncate the constant in this situation. The rest of the bits are then set, so we will detect a case for NILL "naturally" rather than using peephole optimizations. Differential Revision: http://reviews.llvm.org/D21854 llvm-svn: 279105	2016-08-18 18:04:26 +00:00
Duncan P. N. Exon Smith	84c2da47f9	AArch64: Don't call getIterator() on iterators Remove an unnecessary round-trip: iterator => operator->() => getIterator() In some cases, the iterator is end(), so the dereference of operator-> is invalid (UB). The testcase only crashes with r278974 (currently reverted to investigate this), which adds an assertion for invalid dereferences of ilist nodes. Fixes PR29035. llvm-svn: 279104	2016-08-18 17:58:09 +00:00
Eugene Zelenko	61a72d8850	[LLVM] Fix some Clang-tidy modernize-use-using and Include What You Use warnings Differential revision: https://reviews.llvm.org/D23675 llvm-svn: 279102	2016-08-18 17:56:27 +00:00
Sanjay Patel	fa5ca2bf46	[InstCombine] use m_APInt to allow icmp (udiv X, Y), C folds for splat constant vectors This is a sibling of: https://reviews.llvm.org/rL278859 https://reviews.llvm.org/rL278935 https://reviews.llvm.org/rL278945 https://reviews.llvm.org/rL279066 https://reviews.llvm.org/rL279077 llvm-svn: 279101	2016-08-18 17:55:59 +00:00
Dan Gohman	c9623db884	[WebAssembly] Disable the store-results optimization. The WebAssemly spec removing the return value from store instructions, so remove the associated optimization from LLVM. This patch leaves the store instruction operands in place for now, so stores now always write to "$drop"; these will be removed in a seperate patch. llvm-svn: 279100	2016-08-18 17:51:27 +00:00
Chandler Carruth	e2f36bcb84	[Assumptions] Make collecting ephemeral values not quadratic in the number of assume intrinsics. The classical way to have a cache-friendly vector style container when we need queue semantics for BFS instead of stack semantics for DFS is to use an ever-growing vector and an index. Erasing from the front requires O(size) work, and unless we expect the worklist to grow very large, its probably cheaper to just grow and race down the list. But that makes it more bad that we're putting the assume intrinsics in this at all. We end up looking at the (by definition empty) use list to see if they're ephemeral (when we've already put them in that set), etc. Instead, directly populate the worklist with the operands when we mark the assume intrinsics as ephemeral. Also, test the visited set before putting things into the worklist so we don't accumulate the same value in the list 100s of times. It would be nice to use a set-vector for this but I think its useful to test the set earlier to avoid repeatedly querying whether the same instruction is safe to speculate. Hopefully with these changes the number of values pushed onto the worklist is smaller, and we avoid quadratic work by letting it grow as necessary. Differential Revision: https://reviews.llvm.org/D23396 llvm-svn: 279099	2016-08-18 17:51:24 +00:00
Vedant Kumar	c948d182e1	Fix -Wpessimizing-move error, NFC llvm-svn: 279095	2016-08-18 17:39:53 +00:00
Sanjay Patel	12a4105647	[InstCombine] clean up foldICmpUDivConstant; NFC 1. Better variable names 2. Remove unnecessary check of ConstantInt llvm-svn: 279094	2016-08-18 17:37:26 +00:00
Duncan P. N. Exon Smith	9d748f9499	Reapply "ADT: Remove references in has_rbegin for reverse()" This reverts commit r279086, reapplying r279084. I'm not sure what I ran before, because the compile failure for ADTTests reproduced locally. The problem is that TestRev is calling BidirectionalVector::rbegin() when the BidirectionalVector is const, but rbegin() is always non-const. I've updated BidirectionalVector::rbegin() to be callable from const. Original commit message follows. -- As a follow-up to r278991, add some tests that check that decltype(reverse(R).begin()) == decltype(R.rbegin()), and get them passing by adding std::remove_reference to has_rbegin. I'm using static_assert instead of EXPECT_TRUE (and updated the other has_rbegin check from r278991 in the same way) since I figure that's more helpful. llvm-svn: 279091	2016-08-18 17:15:25 +00:00
Zachary Turner	ac5763eca4	Resubmit "Write the TPI stream from a PDB to Yaml." The original patch was breaking some buildbots due to an incorrect ordering of function definitions which caused some compilers to recognize a definition but others to not. llvm-svn: 279089	2016-08-18 16:49:29 +00:00
Saleem Abdulrasool	c6bf547564	llvm-objdump: add coff import library symbol listing support This adds behaviour similar to binutils' objdump which can show symbols in an import library. Differences from that stem around the fact that we do not create section symbols nor the all import import descriptor symbol reference. However, this does mean that the tool can serve as a possible replacement for the existing tool. llvm-svn: 279088	2016-08-18 16:39:19 +00:00
Duncan P. N. Exon Smith	5195d3fc0e	Revert "ADT: Remove references in has_rbegin for reverse()" This reverts commit r279084, since it failed on a bot: http://bb.pgr.jp/builders/cmake-llvm-x86_64-linux/builds/41733 llvm-svn: 279086	2016-08-18 16:27:41 +00:00
Duncan P. N. Exon Smith	b28eb332d9	ADT: Remove references in has_rbegin for reverse() As a follow-up to r278991, add some tests that check that decltype(reverse(R).begin()) == decltype(R.rbegin()), and get them passing by adding std::remove_reference to has_rbegin. I'm using static_assert instead of EXPECT_TRUE (and updated the other has_rbegin check from r278991 in the same way) since I figure that's more helpful. llvm-svn: 279084	2016-08-18 16:22:54 +00:00
Artur Pilipenko	615b820af6	CVP. Turn marking adds as no wrap (introduced by r278107) off by default It causes a regression on our internal benchmark. Introduce cvp-dont-process flag and set it off by default while investigating the regression. llvm-svn: 279082	2016-08-18 16:08:35 +00:00
Ahmed Bougacha	33e19fe1c4	[AArch64][GlobalISel] Select floating-point binary ops. There is no FREM instruction, but the others are straightforward. llvm-svn: 279081	2016-08-18 16:05:11 +00:00
Ahmed Bougacha	71d033a17f	[GlobalISel] Add floating-point binary ops. llvm-svn: 279080	2016-08-18 16:05:06 +00:00
Davide Italiano	d1279df752	[IRCE] Switch over to LLVM_DUMP_METHOD. NFCI. llvm-svn: 279079	2016-08-18 15:55:49 +00:00
Richard Barton	5808bd656a	[ARM] Correct ARMv8-A optional extension definitions in TargetParser The ARMv8-A descriptions in the ARM and AArch64 TargetParsers are incorrect architecturally and mismatched to the backend descriptions. RAS is an optional extension to ARMv8-A and ARMv8.1-A and mandatory in ARMv8.2-A. Correct the ARMTargetParser descriptions which had this as enabled by default in the earlier versions. The FP16 and SPE extensions are optional in ARMv8.2-A and the backend defaults them as off. They are not available as extensions to earlier ARMv8-A versions. Correct the AArch64TargetParser which had these as enabled by default in all ARMv8-A definitions. These macros are only used to define preprocessor macros. There are no macros yet as ACLE has not caught up with ARMv8.2-A so not possible to add a test. Differential Revision: https://reviews.llvm.org/D23500 llvm-svn: 279078	2016-08-18 15:50:11 +00:00
Sanjay Patel	6347807f87	[InstCombine] use m_APInt to allow icmp (mul X, Y), C folds for splat constant vectors This is a sibling of: https://reviews.llvm.org/rL278859 https://reviews.llvm.org/rL278935 https://reviews.llvm.org/rL278945 https://reviews.llvm.org/rL279066 llvm-svn: 279077	2016-08-18 15:44:44 +00:00
Derek Schuff	ccdceda128	[WebAssembly] Refactor WebAssemblyLowerEmscriptenException pass for setjmp/longjmp This patch changes the code structure of WebAssemblyLowerEmscriptenException pass to support both exception handling and setjmp/longjmp. It also changes the name of the pass and the source file. 1. Change the file/pass name to WebAssemblyLowerEmscriptenExceptions -> WebAssemblyLowerEmscriptenEHSjLj to make it clear that it supports both EH and SjLj 2. List function / global variable names at the top so they can be changed easily 3. Some cosmetic changes Patch by Heejin Ahn Differential Revision: https://reviews.llvm.org/D23588 llvm-svn: 279075	2016-08-18 15:27:25 +00:00
Ahmed Bougacha	1d0560b14d	[AArch64][GlobalISel] Select G_SDIV/G_UDIV. There is no REM instruction; that will require an expansion. It's not obvious that should be done in select, rather than as a (custom?) legalization. llvm-svn: 279074	2016-08-18 15:17:13 +00:00
Ahmed Bougacha	13db94540c	[GlobalISel] Add support for DIV/REM. llvm-svn: 279073	2016-08-18 15:17:01 +00:00
Sanjay Patel	5b112845da	[InstCombine] use APInt in isSignTest instead of ConstantInt; NFC This will enable vector splat folding, but NFC until the callers have their ConstantInt restrictions removed. llvm-svn: 279072	2016-08-18 14:59:14 +00:00
Saleem Abdulrasool	3780b3a9eb	llvm-readobj: handle import libraries with -coff-exports `link -dump -exports` lists exported symbols from import libraries as well as normal dlls. Ensure that we can handle import libraries as well in llvm-readobj. llvm-svn: 279069	2016-08-18 14:32:11 +00:00
Sanjay Patel	7d37b221a2	fix typo; NFC llvm-svn: 279068	2016-08-18 14:17:34 +00:00
Krzysztof Parzyszek	b1b0372337	[Hexagon] Create vcombine in HexagonCopyToCombine llvm-svn: 279067	2016-08-18 14:12:34 +00:00
Sanjay Patel	4c5e60d95c	[InstCombine] use m_APInt to allow icmp (xor X, Y), C folds for splat constant vectors This is a sibling of: https://reviews.llvm.org/rL278859 https://reviews.llvm.org/rL278935 https://reviews.llvm.org/rL278945 llvm-svn: 279066	2016-08-18 14:10:48 +00:00
Simon Pilgrim	ab7c46eccf	[X86][SSE] Add SSE1 tests to make sure we don't merge loads on illegal types llvm-svn: 279065	2016-08-18 13:41:26 +00:00
Simon Dardis	ea3431598e	[mips] Correct tail call encoding for MIPSR6 r277708 enabled tails calls for MIPS but used the 'jr' instruction when the jump target was held in a register. For MIPSR6, 'jalr $zero, $reg' should have been used. Additionally, add missing patterns for external and global symbols for tail calls. Reviewers: dsanders, vkalintiris Differential Review: https://reviews.llvm.org/D23301 llvm-svn: 279064	2016-08-18 13:22:43 +00:00
Chad Rosier	83f6bbc154	[Reassociate] Add test for PR28367. llvm-svn: 279063	2016-08-18 13:22:37 +00:00
Alex Bradbury	3447ca3f08	(Trivial) TargetPassConfig: assert when TargetMachine has no MCAsmInfo Summary: This is a pretty trivial, but I thought it was worth just checking that nobody feels it's completely the wrong thing to be doing. The motivation is that when starting a new backend, you often start with a minimal stub, pretty much just FooTargetMachine and FooTargetInfo. Once that's built, you might naturally try `llc -march=foo myinput.ll` and it seems more developer-friendly if this ends up asserting due to the lack of MCAsmInfo with an informative message rather than just segfaulting. Reviewers: MatzeB, chandlerc Subscribers: bogner, llvm-commits Differential Revision: https://reviews.llvm.org/D23443 llvm-svn: 279061	2016-08-18 13:08:58 +00:00
Simon Pilgrim	916485c765	Remove trailing whitespace llvm-svn: 279054	2016-08-18 11:22:22 +00:00
Diana Picus	9405ae704b	Revert "ADT: Remove UB in ilist (and use a circular linked list)" This reverts commit r278974 which broke some of our bots (e.g. clang-cmake-aarch64-42vma, clang-cmake-aarch64-full). llvm-svn: 279053	2016-08-18 11:17:53 +00:00

1 2 3 4 5 ...

136989 Commits