llvm-project

Commit Graph

Author	SHA1	Message	Date
Peter Collingbourne	d786047583	Try using llvm::errc, should hopefully fix version mismatch problem on clang-s390x-linux bot. llvm-svn: 298285	2017-03-20 18:19:41 +00:00
Krzysztof Parzyszek	8490251de3	[Hexagon] Recognize polynomial-modulo loop idiom again Regain the ability to recognize loops calculating polynomial modulo operation. This ability has been lost due to some changes in the preceding optimizations. Add code to preprocess the IR to a form that the pattern matching code can recognize. llvm-svn: 298282	2017-03-20 18:12:58 +00:00
Konstantin Zhuravlyov	2534bc07f4	[AMDGPU] Run always inliner early in opt Differential Revision: https://reviews.llvm.org/D31141 llvm-svn: 298281	2017-03-20 18:06:45 +00:00
Daniel Berlin	fa42a23cfc	Add missing updated test from VN coercion changes. Instructions were renamed. NFC llvm-svn: 298280	2017-03-20 18:04:19 +00:00
David Blaikie	795dc94614	Fix UB found by -Wtautological-undefined-compare llvm-svn: 298279	2017-03-20 18:01:07 +00:00
Jonathan Roelofs	8d1ba8943f	Mention that the Sphinx docs build wants version 1.5 or later `misc.highlighting_failure` support was added to `suppress_warnings` in that version, and the warnings-as-errors docs build relies on it. llvm-svn: 298277	2017-03-20 17:49:36 +00:00
Reid Kleckner	8819c73878	[WinEH] Adjust decision to emit SEH moves for leaf functions Move the check for "MF->hasWinCFI()" up into the calculation of the shouldEmitMoves boolean, rather than putting it in the early returning if. This ensures that endFunction doesn't try to emit .seh_* directives for leaf functions. llvm-svn: 298276	2017-03-20 17:45:59 +00:00
Tim Northover	89268b183f	GlobalISel: allow quad-precision values to be dumped. Otherwise the fallback path fails with an assertion on AAPCS AArch64 targets, when "long double" is encountered. llvm-svn: 298273	2017-03-20 16:52:08 +00:00
Peter Collingbourne	25a17ba4c7	Support, LTO: When pruning a directory, ignore files matching a prefix. This is a safeguard against data loss if the user specifies a directory that is not a cache directory. Teach the existing cache pruning clients to create files with appropriate names. Differential Revision: https://reviews.llvm.org/D31109 llvm-svn: 298271	2017-03-20 16:41:57 +00:00
Dehao Chen	e593049fb0	Updates branch_weights annotation for call instructions during inlining. Summary: Inliner should update the branch_weights annotation to scale it to proper value. Reviewers: davidxl, eraman Reviewed By: eraman Subscribers: zzheng, llvm-commits Differential Revision: https://reviews.llvm.org/D30767 llvm-svn: 298270	2017-03-20 16:40:44 +00:00
Adrian Prantl	6d80a262d5	Use isa<> instead of dyn_cast<> (NFC). llvm-svn: 298268	2017-03-20 16:39:41 +00:00
David Blaikie	820a833125	Avoid potential ADL ambiguity between llvm::make_unique and std::make_unique (when building as C++14+) Patch by Moritz Kiefer! llvm-svn: 298266	2017-03-20 16:35:33 +00:00
Dmitry Preobrazhensky	1e124e1825	[AMDGPU][MC] Fix for Bugs 28201, 28199, 28170 + LIT tests This fix enables sp3 abs modifier with constants Reviewers: artem.tamazov Differential Revision: https://reviews.llvm.org/D30825 llvm-svn: 298265	2017-03-20 16:33:20 +00:00
Craig Topper	d92d2fc763	[InstCombine] Print a debug message when we constant fold an operand during worklist creation InstCombine tries to constant fold instruction operands during worklist building, but we don't print that we're doing this. We also set a change flag here that causes us to rebuild and rerun the worklist one more time even if processing the worklist itself created no additional changes. So in the log I saw two inst combine runs that visited all instructions without printing that anything was changed. I may be submitting another patch to remove the change flag unless I can find some reason why we should be doing that. Differential Revision: https://reviews.llvm.org/D31091 llvm-svn: 298264	2017-03-20 16:31:14 +00:00
Jessica Paquette	02cbfb2926	[Outliner] ACTUALLY remove the errs output I don't know how to type. This fixes the last commit which would have made all of the overflows legal, and kept the screaming. llvm-svn: 298263	2017-03-20 16:25:04 +00:00
Daniel Berlin	12883b1673	Templatize parts of VNCoercion, and add constant-only versions of the functions to be used in NewGVN. NFCI. Summary: This is ground work for the changes to enable coercion in NewGVN. GVN doesn't care if they end up constant because it eliminates as it goes. NewGVN cares. IRBuilder and ConstantFolder deliberately present the same interface, so we use this to our advantage to templatize our functions to make them either constant only or not. Reviewers: davide Subscribers: llvm-commits, Prazek Differential Revision: https://reviews.llvm.org/D30928 llvm-svn: 298262	2017-03-20 16:08:29 +00:00
Jessica Paquette	5d59a4ee19	[Outliner] Remove output for offset range check Forgot to remove some output before committing last time. (Instruction fixups don't actually overflow anywhere in the test suite so far, so I missed it). To prevent the outliner from screaming "Overflow!" in the event that that does happen, this commit removes that output. llvm-svn: 298260	2017-03-20 15:51:45 +00:00
Daniel Sanders	b96f40dd03	[tablegen][globalisel] Capture instructions into locals and related infrastructure for multiple instructions matches. Summary: Prepare the way for nested instruction matching support by having actions like CopyRenderer look up operands in the RuleMatcher rather than a specific InstructionMatcher. This allows actions to reference any operand from any matched instruction. It works by checking the 'shape' of the match and capturing each matched instruction to a local variable. If the shape is wrong (not enough operands, leaf nodes where non-leafs are expected, etc.), then the rule exits early without checking the predicates. Once we've captured the instructions, we then test the predicates as before (except using the local variables). If the match is successful, then we render the new instruction as before using the local variables. It's not noticable in this patch but by the time we support multiple instruction matching, this patch will also cause a significant improvement to readability of the emitted code since MRI.getVRegDef(I->getOperand(0).getReg()) will simply be MI1 after emitCxxCaptureStmts(). This isn't quite NFC because I've also fixed a bug that I'm surprised we haven't encountered yet. It now checks there are at least the expected number of operands before accessing them with getOperand(). Depends on D30531 Reviewers: t.p.northover, qcolombet, aditya_nandakumar, ab, rovka Reviewed By: rovka Subscribers: dberris, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D30535 llvm-svn: 298257	2017-03-20 15:20:42 +00:00
Dmitry Preobrazhensky	40af9c35d3	[AMDGPU][MC] Fix for Bugs 28200, 28202 + LIT tests Fixed several related issues with VOP3 fp modifiers. Reviewers: artem.tamazov Differential Revision: https://reviews.llvm.org/D30821 llvm-svn: 298255	2017-03-20 14:50:35 +00:00
Diana Picus	d79253a9f7	[GlobalISel] Use the correct calling conv for calls This commit adds a parameter that lets us pass in the calling convention of the call to CallLowering::lowerCall. This allows us to handle situations where the calling convetion of the callee is different from that of the caller. Differential Revision: https://reviews.llvm.org/D31039 llvm-svn: 298254	2017-03-20 14:40:18 +00:00
Simon Pilgrim	00b34996b4	Use MutableArrayRef for APFloat::convertToInteger As discussed on D31074, use MutableArrayRef for destination integer buffers to help assert before stack overflows happen. llvm-svn: 298253	2017-03-20 14:40:12 +00:00
Maxim Ostapenko	a525a2ba75	[sancov] Fix broken links and displaced coloring in coverage-report-server.py This patch fixes two issues: * Fixed relative links to source files * Enumeration of lines in source files starts from 1 instead of 0 to align with .symcov files generated by sancov -symbolize Patch by Dmitiriy Nikiforov. Differential Revision: https://reviews.llvm.org/D31038 llvm-svn: 298250	2017-03-20 14:06:04 +00:00
Simon Pilgrim	610ad9b53f	Strip trailing whitespace llvm-svn: 298249	2017-03-20 13:55:35 +00:00
Simon Pilgrim	9abf2424cb	Strip trailing whitespace llvm-svn: 298248	2017-03-20 13:54:44 +00:00
Simon Pilgrim	a28495929b	Strip trailing whitespace llvm-svn: 298247	2017-03-20 13:53:59 +00:00
Konstantin Zhuravlyov	8a67eb144f	Revert "[AMDGPU] Run always inliner early in opt" This reverts commit r297958, it breaks device-libs build. llvm-svn: 298239	2017-03-20 09:26:08 +00:00
Craig Topper	f4fa291d83	[IR] Move a few static functions in Instruction class inline. They just check for certain opcodes and opcode enums are available in Instruction.h. llvm-svn: 298237	2017-03-20 06:40:39 +00:00
Michael Zolotukhin	c69955c6f3	[ConstantRange] Add setSizeSmallerThanOf method. Summary: ConstantRange class currently has a method getSetSize, which is mostly used to compare set sizes of two constant ranges (there is only one spot where it's used in a slightly different scenario). This patch introduces setSizeSmallerThanOf method, which does such comparison in a more efficient way. In the original method we have to extend our types to (BitWidth+1), which can result it using slow case of APInt, extra memory allocations, etc. The change is supposed to not change any functionality, but it slightly improves compile time. Here is compile time improvements that I observed on CTMark: * tramp3d-v4 -2.02% * pairlocalalign -1.82% * lencod -1.67% Reviewers: sanjoy, atrick, pete Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31104 llvm-svn: 298236	2017-03-20 06:33:07 +00:00
Craig Topper	b5c2bfa869	[IR] Remove some unneeded includes from Operator.h and fix cpp files that were transitively depending on it. NFC llvm-svn: 298235	2017-03-20 05:08:41 +00:00
Craig Topper	c98f8835b5	[IR] Add missing copyright header. llvm-svn: 298234	2017-03-20 05:08:38 +00:00
Craig Topper	0085ffb940	[APInt] Don't initialize VAL to 0 in APInt constructors. Push it down to the initSlowCase and other init methods. I'm not sure if zeroing VAL before writing pVal is really necessary, but at least one other place did it in code. But by taking the store out of line, this reduces the opt binary by about 20k on my local x86-64 build. llvm-svn: 298233	2017-03-20 01:29:52 +00:00
Xin Tong	cbf04d95e6	Remove unnecessary IDom check Summary: This Idom check seems unnecessary. The immediate children of a node on the Dominator Tree should always be the IDom of its immediate children in this case. Reviewers: hfinkel, majnemer, dberlin Reviewed By: dberlin Subscribers: dberlin, davide, llvm-commits Differential Revision: https://reviews.llvm.org/D26954 llvm-svn: 298232	2017-03-20 00:30:19 +00:00
Craig Topper	ff9749f759	[InstCombine] Remove duplicate code in SimplifyDemandedUseBits for URem. NFC llvm-svn: 298231	2017-03-19 21:45:57 +00:00
Craig Topper	5992c8d1dc	[AVX-512] Handle kor/kand/kandn/kxor/kxnor/knot intrinsics at lowering time instead of isel Summary: Currently we handle these intrinsics at isel with special patterns. But as they just map to normal logic operations, we should just handle them at lowering. This will expose them to DAG combine optimizations. Right now the kor-sequence test generates a bunch of regclass copies between GR16 and VK16 that the peephole optimizer and/or register coallescing are removing to keep everything in the mask domain. By handling the logic op intrinsics earlier, these copies become bitcasts in the DAG and get removed by DAG combine which seems more robust. This should help enable my plan to stop copying between K registers and GR8/GR16. The peephole optimizer can't remove a chain of copies between K and GR32 with insert_subreg/extract_subreg present in the chain so the kor-sequence test break. But this patch should dodge the problem entirely. Reviewers: zvi, delena, RKSimon, igorb Reviewed By: igorb Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31056 llvm-svn: 298228	2017-03-19 17:11:09 +00:00
Craig Topper	ff2283ec0e	[InstCombine] Use update_test_checks.py to regenerate a test. NFC llvm-svn: 298227	2017-03-19 17:04:52 +00:00
Simon Pilgrim	8424df7dea	Fix constant folding of fp2int to large integers We make the assumption in most of our constant folding code that a fp2int will target an integer of 128-bits or less, calling the APFloat::convertToInteger with only uint64_t[2] of raw bits for the result. Fuzz testing (PR24662) showed that we don't handle other cases at all, resulting in stack overflows and all sorts of crashes. This patch uses the APSInt version of APFloat::convertToInteger instead to better handle such cases. Differential Revision: https://reviews.llvm.org/D31074 llvm-svn: 298226	2017-03-19 16:50:25 +00:00
Simon Pilgrim	5fa1b9a12f	Fix MSVC warning: "switch statement contains 'default' but no 'case' labels". NFCI. llvm-svn: 298225	2017-03-19 16:39:04 +00:00
Ahmed Bougacha	931904d777	[GlobalISel] Don't select trivially dead instructions. Folding instructions when selecting can cause them to become dead. Don't select these dead instructions (if they don't have other side effects, and don't define physical registers). Preserve existing tests by adding COPYs. In some tests, the G_CONSTANT vregs never get constrained to a class: the only use of the vreg was folded into another instruction, so the G_CONSTANT, now dead, never gets selected. llvm-svn: 298224	2017-03-19 16:13:00 +00:00
Ahmed Bougacha	48bcd22ce8	[GlobalISel][AArch64] Add DBG_VALUE select test. NFC. llvm-svn: 298223	2017-03-19 16:12:53 +00:00
Ahmed Bougacha	dcd416a4b9	[GlobalISel][AArch64] Split out cast select tests. NFC. And remove some redundant bitcast tests. Also split the test functions themselves: it makes it obvious to see what's tested where and what isn't, it makes the tests much easier to read and manually update, and, most importantly, it makes them almost trivial to update using tooling. Yes, it's obnoxiously verbose, but said tooling helps upgrade to better MIR syntax whenever available. llvm-svn: 298222	2017-03-19 16:12:51 +00:00
Ahmed Bougacha	7f2d17331c	[GlobalISel] Move method definition to the proper file. NFC. llvm-svn: 298221	2017-03-19 16:12:48 +00:00
Ahmed Bougacha	531e275aa8	[CodeGen] Update hasSideEffects comment. NFC. We used to have 3 side effect flags, but as of r222809, we only have hasSideEffects. Change the comment to reflect that. llvm-svn: 298220	2017-03-19 16:12:45 +00:00
Xin Tong	bcb17ecf04	Correct a rebase mistake. Left out AA in jumpthreading SimplifyPartiallyRedundantLoad llvm-svn: 298219	2017-03-19 15:41:46 +00:00
Xin Tong	967e313078	Remove unused arguments. NFCI llvm-svn: 298218	2017-03-19 15:31:16 +00:00
Xin Tong	d67fb1b66e	[JumpThreading] Perform phi-translation in SimplifyPartiallyRedundantLoad. Summary: In case we are loading on a phi-load in SimplifyPartiallyRedundantLoad. Try to phi translate it into incoming values in the predecessors before we search for available loads. This needs https://reviews.llvm.org/D30524 Reviewers: davide, sanjoy, efriedma, dberlin, rengolin Reviewed By: dberlin Subscribers: junbuml, llvm-commits Differential Revision: https://reviews.llvm.org/D30543 llvm-svn: 298217	2017-03-19 15:30:53 +00:00
Xin Tong	aef0fcb191	Extract FindAvailablePtrLoadStore out of FindAvailableLoadedValue. NFCI Summary: Extract FindAvailablePtrLoadStore out of FindAvailableLoadedValue. Prepare for upcoming change which will do phi-translation for load on phi pointer in jump threading SimplifyPartiallyRedundantLoad. This is in preparation for https://reviews.llvm.org/D30543 Reviewers: efriedma, sanjoy, davide, dberlin Reviewed By: davide Subscribers: junbuml, davide, llvm-commits Differential Revision: https://reviews.llvm.org/D30524 llvm-svn: 298216	2017-03-19 15:27:52 +00:00
Teresa Johnson	9b4b8c8d7b	Enable stripping of multiple DILocation on !llvm.loop metadata Summary: I found that stripDebugInfo was still leaving significant amounts of debug info due to !llvm.loop that contained DILocation after stripping. The support for stripping debug info on !llvm.loop added in r293377 only removes a single DILocation. Enhance that to remove all DILocation from !llvm.loop. Reviewers: hfinkel, aprantl, dsanders Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31117 llvm-svn: 298213	2017-03-19 13:54:57 +00:00
Oren Ben Simhon	75537b6566	[MIR] Test assumes x64 windows calling convention upon printing/parsing MIR output/input. llvm-svn: 298212	2017-03-19 13:23:20 +00:00
Benjamin Kramer	6520f83ba4	[MIR] Add triple to test that assumes it runs on windows. llvm-svn: 298211	2017-03-19 13:04:35 +00:00
Oren Ben Simhon	9ce0ec5dbc	CalleeSavedRegister was removed from MIR and is recalculated upon MIR parsing. llvm-svn: 298210	2017-03-19 11:18:09 +00:00
Oren Ben Simhon	a96fdbf233	Moving the test to x86 because other architectures do not suport regcall calling convention. llvm-svn: 298209	2017-03-19 08:53:42 +00:00
Oren Ben Simhon	0ef61ec32a	[MIR] Support Customed Register Mask and CSRs The MIR printer dumps a string that describe the register mask of a function. A static predefined list of register masks matches a static list of strings. However when the register mask is not from the static predefined list, there is no descriptor string and the printer fails. This patch adds support to custom register mask printing and dumping. Also the list of callee saved registers (describing the registers that must be preserved for the caller) might be dynamic. As such this data needs to be dumped and parsed back to the Machine Register Info. Differential Revision: https://reviews.llvm.org/D30971 llvm-svn: 298207	2017-03-19 08:14:18 +00:00
Craig Topper	3a86a04404	[InstCombine] Use setHighBits/setLowBits/setBitsFrom in place of getLowBitsSet/getHighBitsSet. llvm-svn: 298204	2017-03-19 05:49:16 +00:00
Brian Gesiak	1640e68728	[Analysis] bitreverse(undef) returns undef Summary: The reverse of an artbitrary bitpattern is also an arbitrary bitpattern. Reviewers: trentxintong, arsenm, majnemer Reviewed By: majnemer Subscribers: majnemer, wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D31118 llvm-svn: 298201	2017-03-19 04:40:42 +00:00
Daniel Berlin	46b72e6de6	NewGVN: Now that we have a better verifier, we can prove that we can erase the predicateuser set each time we mark it touched llvm-svn: 298199	2017-03-19 00:07:32 +00:00
Daniel Berlin	d43f0ee7e1	NewGVN: Remove dead code (for now) llvm-svn: 298198	2017-03-19 00:07:27 +00:00
Craig Topper	d55e153b87	[GVN] Fix accidental double storage of the function BasicBlock list in iterateOnFunction Summary: iterateOnFunction creates a ReversePostOrderTraversal object which does a post order traversal in its constructor and stores the results in an internal vector. Iteration over it just reads from the internal vector in reverse order. The GVN code seems to be unaware of this and iterates over ReversePostOrderTraversal object and makes a copy of the vector into a local vector. (I think at one point in time we used a DFS here instead which would have required the local vector). The net affect of this is that we have two vectors containing the basic block list. As I didn't want to expose the implementation detail of ReversePostOrderTraversal's constructor to GVN, I've changed the code to do an explicit post order traversal storing into the local vector and then reverse iterate over that. I've also removed the reserve(256) since the ReversePostOrderTraversal wasn't doing that. I can add it back if we thinks it important. Though it seemed weird that it wasn't based on the size of the function. Reviewers: davide, anemet, dberlin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31084 llvm-svn: 298191	2017-03-18 18:24:41 +00:00
Craig Topper	7cfd4a9d7a	[ValueTracking] Remove deadish code from computeKnownBitsAddSub. The code assigned to KnownZero, but later code unconditionally assigned over it. I'm pretty sure the later code can handle the same cases and more equally well. llvm-svn: 298190	2017-03-18 18:21:46 +00:00
Daniel Berlin	06329a98e3	NewGVN: Greatly enhance the ability of the NewGVN verifier to detect issues, subsuming previous verifier. llvm-svn: 298188	2017-03-18 15:41:40 +00:00
Daniel Berlin	41b39169e2	NewGVN: Fix PHI evaluation bug exposed by new verifier. We were checking whether the incoming block was reachable instead of whether the specific edge was reachable llvm-svn: 298187	2017-03-18 15:41:36 +00:00
Daniel Berlin	182771f620	DebugCounters: Add API for setting/unsetting programatically. This is required so we can re-set the counter state for verifiers, etc. llvm-svn: 298186	2017-03-18 15:41:13 +00:00
Matthias Braun	e6ff30b696	ExecutionDepsFix: Let targets specialize the pass; NFC Let targets specialize the pass with the register class so we can get a parameterless default constructor and can put the pass into the pass registry to enable testing with -run-pass=. llvm-svn: 298184	2017-03-18 05:08:58 +00:00
Matthias Braun	e9f8209e87	ExecutionDepsFix: Normalize names; NFC Normalize ExeDepsFix, execution-fix, ExecutionDependencyFix and ExecutionDepsFix to the last one. llvm-svn: 298183	2017-03-18 05:05:40 +00:00
Matthias Braun	6e67052912	CodeGen.cpp: Sort alphabetically; NFC llvm-svn: 298182	2017-03-18 05:05:32 +00:00
Matthias Braun	9dd6069be1	InitializePasses.h: Cleanup; NFC - Sort alphabetically - Normalize spaces llvm-svn: 298181	2017-03-18 05:05:29 +00:00
Craig Topper	3eb0d80de0	[ValueTracking] Add APInt::setSignBit and use it to replace ORing with getSignBit which will malloc if the bit width is larger than 64. llvm-svn: 298180	2017-03-18 04:01:29 +00:00
Nirav Dave	ac6081cb67	Make library calls sensitive to regparm module flag (Fixes PR3997). Reviewers: mkuper, rnk Subscribers: mehdi_amini, jyknight, aemerson, llvm-commits, rengolin Differential Revision: https://reviews.llvm.org/D27050 llvm-svn: 298179	2017-03-18 00:44:07 +00:00
Nirav Dave	6de2c77944	Capitalize ArgListEntry fields. NFC. llvm-svn: 298178	2017-03-18 00:43:57 +00:00
Bruno Cardoso Lopes	8fef5556bc	[LockFileManager] Reduce lock timeout Go back to behavior pre-r231309 and reduce the timeout from 8 to ~1.5 min now that we have (a) PCMCache mechanism (r298165) and (b) timeout that doesn't cause a failure, but actually build the module (r298175). rdar://problem/30297862 llvm-svn: 298176	2017-03-18 00:32:34 +00:00
Stanislav Mekhanoshin	8e45acfc38	[AMDGPU] Add address space based alias analysis pass This is direct port of HSAILAliasAnalysis pass, just cleaned for style and renamed. Differential Revision: https://reviews.llvm.org/D31103 llvm-svn: 298172	2017-03-17 23:56:58 +00:00
Craig Topper	0f5063c754	[BuildLibCalls] emitPutChar should infer function attributes for putchar When InstCombine calls into SimplifyLibCalls and it createa putChar calls, we don't infer the attributes. And since SimplifyLibCalls doesn't use InstCombine's IRBuilder the calls doesn't end up in the worklist on this iteration of InstCombine. So it gets picked up on the next iteration where it causes an IR change. This of course causes InstCombine to run another iteration. So this patch just gets the attributes right the first time. We already did this for puts and some other libcalls. Differential Revision: https://reviews.llvm.org/D31094 llvm-svn: 298171	2017-03-17 23:48:02 +00:00
Sanjay Patel	0429b1a431	[x86] regenerate checks; NFC llvm-svn: 298166	2017-03-17 23:04:18 +00:00
Sanjay Patel	77e6ebe748	[x86] regenerate checks; NFC llvm-svn: 298164	2017-03-17 22:47:21 +00:00
Evgeniy Stepanov	4d490de232	Fix docs-llvm-html build. llvm-svn: 298163	2017-03-17 22:31:13 +00:00
Jessica Paquette	ea8cc09be0	[Outliner] Add outliner for AArch64 This commit adds the necessary target hooks for outlining in AArch64. It also refactors the switch statement used in `getMemOpBaseRegImmOfsWidth` into a more general function, `getMemOpInfo`. This allows the outliner to share that code without copying and pasting it. The AArch64 outliner can be run using -mllvm -enable-machine-outliner, as with the X86-64 outliner. The test for this pass verifies that the outliner does, in fact outline functions, fixes up the stack accesses properly, and can correctly generate a tail call. In the future, this test should be replaced with a MIR test, so that we can properly test immediate offset overflows in fixed-up instructions. llvm-svn: 298162	2017-03-17 22:26:55 +00:00
Eli Friedman	f7b060bd3e	[SCEV] Use const Loop L instead of Loop L. NFC Use const pointer in the trip count and trip multiple calculations. Patch by Huihui Zhang <huihuiz@codeaurora.org> llvm-svn: 298161	2017-03-17 22:19:52 +00:00
Evgeniy Stepanov	c5aa6b9411	[asan] Fix dead stripping of globals on Linux. Use a combination of !associated, comdat, @llvm.compiler.used and custom sections to allow dead stripping of globals and their asan metadata. Sometimes. Currently this works on LLD, which supports SHF_LINK_ORDER with sh_link pointing to the associated section. This also works on BFD, which seems to treat comdats as all-or-nothing with respect to linker GC. There is a weird quirk where the "first" global in each link is never GC-ed because of the section symbols. At this moment it does not work on Gold (as in the globals are never stripped). Differential Revision: https://reviews.llvm.org/D30121 llvm-svn: 298158	2017-03-17 22:17:29 +00:00
Evgeniy Stepanov	51c962f72e	Add !associated metadata. This is an ELF-specific thing that adds SHF_LINK_ORDER to the global's section pointing to the metadata argument's section. The effect of that is a reverse dependency between sections for the linker GC. !associated does not change the behavior of global-dce. The global may also need to be added to llvm.compiler.used. Since SHF_LINK_ORDER is per-section, !associated effectively enables fdata-sections for the affected globals, the same as comdats do. Differential Revision: https://reviews.llvm.org/D29104 llvm-svn: 298157	2017-03-17 22:17:24 +00:00
Eli Friedman	46ddab3810	[SelectionDAG] Remove redundant stores more aggressively. Handle TokenFactors more aggressively in SDValue::reachesChainWithoutSideEffects. This isn't really a very effective change anymore because of other changes to chain handling, but it's a cheap check, and the expanded comments are still useful. It might be possible to loosen the hasOneUse() requirement with a deeper analysis, but a naive implementation of that check would be expensive. Differential Revision: https://reviews.llvm.org/D29845 llvm-svn: 298156	2017-03-17 22:15:50 +00:00
Peter Collingbourne	6bfe4d369b	LTO: Work around libstdc++ version mismatch bug, see D31063 review thread. llvm-svn: 298127	2017-03-17 21:49:09 +00:00
Matt Arsenault	59ece95f6c	AMDGPU: Fix broken condition in hazard recognizer Fixes bug 32248. llvm-svn: 298125	2017-03-17 21:36:28 +00:00
Vassil Vassilev	c70a8f2df1	[coverity] Fix uninit variable. Patch by John Harvey! llvm-svn: 298122	2017-03-17 20:58:08 +00:00
Matt Arsenault	e70d5dcf3e	AMDGPU: Fix handling of constant phi input loop conditions If the loop condition was an i1 phi with a constantexpr input, this would add a loop intrinsic fed by a phi dependent on a call to if.break in the same block. Insert the call in the loop header. llvm-svn: 298121	2017-03-17 20:52:21 +00:00
Rong Xu	8e06e80b87	[PGO] Change the internal options description. nfc. llvm-svn: 298120	2017-03-17 20:51:44 +00:00
Matt Arsenault	c5b641ac02	AMDGPU: Cleanup control flow intrinsics Move backend internal intrinsics along with the rest of the normal intrinsics, and use the Intrinsic::getDeclaration API instead of manually constructing the type list. It's surprising this was working before. fdiv.fast had the wrong number of parameters. The control flow intrinsic declaration attributes were not being applied, and their types were inconsistent. The actual IR use types did not match the declaration, and were closer to the types used for the patterns. The brcond lowering was changing the types, so introduce new nodes for those. llvm-svn: 298119	2017-03-17 20:41:45 +00:00
Sanjay Patel	455703a0c6	[x86] clean up setcc with negated operand transform and add missing test; NFCI llvm-svn: 298118	2017-03-17 20:29:40 +00:00
Reid Kleckner	edf1cbb580	[X86] Emit fewer instructions to allocate >16GB stack frames Summary: Use this code pattern when RAX is live, instead of emitting up to 2 billion adjustments: pushq %rax movabsq +-$Offset+-8, %rax addq %rsp, %rax xchg %rax, (%rsp) movq (%rsp), %rsp Try to clean this code up a bit while I'm here. In particular, hoist the logic that handles the entire adjustment with `movabsq $imm, %rax` out of the loop. This negates the offset in the prologue and uses ADD because X86 only has a two operand subtract which always subtracts from the destination register, which can no longer be RSP. Fixes PR31962 Reviewers: majnemer, sdardis Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30052 llvm-svn: 298116	2017-03-17 20:25:49 +00:00
Rong Xu	661ffe104e	[PGO] Add omitted test cases. llvm-svn: 298115	2017-03-17 20:05:13 +00:00
Jun Bum Lim	4230101def	[CodeGenPrep]Restructure promoting Ext to form ExtLoad Summary: Instead of just looking for a load which is mergable with Ext to form ExtLoad, trying to promote Exts as long as the cost is acceptable. This change is not a NFC as it continue promoting Exts even after finding a load during promotions; the change in arm64-codegen-prepare-extload.ll described in 2.b might show the case. This change was motivated from D26524. Based on this change, I will move the transformation performed in aarch64-type-promotion into CGP. Reviewers: jmolloy, qcolombet, mcrosier, javed.absar Reviewed By: qcolombet Subscribers: rengolin, llvm-commits, aemerson Differential Revision: https://reviews.llvm.org/D27853 llvm-svn: 298114	2017-03-17 19:05:21 +00:00
Rong Xu	e60343d6b0	[PGO] Value profile for size of memory intrinsic calls This patch annotates the valuesites profile to memory intrinsics. Differential Revision: http://reviews.llvm.org/D31002 llvm-svn: 298110	2017-03-17 18:07:26 +00:00
Vedant Kumar	c2a250c35b	[Bitcode] Add compatibility test for the 4.0 release Fork off compatibility.ll for the 4.0 release. The *.bc file in this commit was produced using a Release build of the release_40 branch. llvm-svn: 298109	2017-03-17 17:53:26 +00:00
Simon Pilgrim	5a68d401c7	[SelectionDAG] Add SelectionDAG.computeKnownBits test support for ISD::ABS llvm-svn: 298108	2017-03-17 17:45:36 +00:00
Sanjay Patel	25bd713d33	[x86] avoid adc/sbb assert when both sides of add are zexted (PR32316) As noted in the comment, we might want to account for this case, but I didn't look at what that would mean for the asm. I'm also not sure why this only reproduces with avx512, but I'm putting a conservative fix in for now to avoid the crash. Also, if both sides of an add are zexted, shouldn't we shrink that add? https://bugs.llvm.org/show_bug.cgi?id=32316 llvm-svn: 298107	2017-03-17 17:27:31 +00:00
Reid Kleckner	98e56430b9	Fix wasm build after arg_begin iterator type change llvm-svn: 298106	2017-03-17 17:24:03 +00:00
Reid Kleckner	56d028d974	Store Arguments in a flat array instead of an iplist This saves two pointers from Argument and eliminates some extra allocations. Arguments cannot be inserted or removed from a Function because that would require changing its Type, which LLVM does not allow. Instead, passes that change prototypes, like DeadArgElim, create a new Function and copy over argument names and attributes. The primary benefit of iplist is O(1) random insertion and removal. We just don't need that for arguments, so don't use it. Reviewed By: chandlerc Subscribers: dlj, inglorion, llvm-commits Differential Revision: https://reviews.llvm.org/D31058 llvm-svn: 298105	2017-03-17 17:16:39 +00:00
Stanislav Mekhanoshin	ee2dd785f6	Only unswitch loops with uniform conditions Loop unswitching can be extremely harmful for a SIMT target. In case if hoisted condition is not uniform a SIMT machine will execute both clones of a loop sequentially. Therefor LoopUnswitch checks if the condition is non-divergent. Since DivergenceAnalysis adds an expensive PostDominatorTree analysis not needed for non-SIMT targets a new option is added to avoid unneded analysis initialization. The method getAnalysisUsage is called when TargetTransformInfo is not yet available and we cannot use it here. For that reason a new field DivergentTarget is added to PassManagerBuilder to control the behavior and set this field from a target. Differential Revision: https://reviews.llvm.org/D30796 llvm-svn: 298104	2017-03-17 17:13:41 +00:00
Simon Pilgrim	d06b025c9c	[X86] Add SelectionDAG.computeKnownBits test showing inability to handle ISD::ABS We have to be careful as abs(INT_MIN) == INT_MIN. llvm-svn: 298103	2017-03-17 16:58:15 +00:00
Chad Rosier	a69dcb6b66	[AArch64] Use alias analysis in the load/store optimization pass. This allows the optimization to rearrange loads and stores more aggressively. Differential Revision: http://reviews.llvm.org/D30903 llvm-svn: 298092	2017-03-17 14:19:55 +00:00
Oliver Stannard	8761e9bb43	[Asm] Don't list '@<type>' in diag when '@' is a comment This fixes https://bugs.llvm.org//show_bug.cgi?id=31280 Differential revision: https://reviews.llvm.org/D31026 llvm-svn: 298067	2017-03-17 11:10:17 +00:00
Andre Vieira	913ffeb5ba	[ARM] Fix triple format in test branch disassemble test Fixing triple format in the tests added for the branch label fix for Thumb Targets. Also recommitting previously approved patch, see https://reviews.llvm.org/D30943. Reviewed by: samparker Differential Revision: https://reviews.llvm.org/D30987 llvm-svn: 298056	2017-03-17 09:37:10 +00:00
Craig Topper	a8d4097445	[AVX-512] Make VEX encoded FMA instructions available when AVX512 is enabled regardless of whether +fma was added on the command line. We weren't able to handle isel of the 128/256-bit FMA instructions when AVX512F was enabled but VLX and FMA weren't. I didn't mask FeatureAVX512 imply FeatureFMA as I wasn't sure I wanted disabling FMA to also disable AVX512. Instead we just can't prevent FMA instructions if AVX512 is enabled. Another option would be to promote 128/256-bit to 512-bit, do the operation and extract it. But that requires a lot of extra isel patterns. Since no CPUs exist that support AVX512, but not FMA just using the VEX instructions seems better. llvm-svn: 298051	2017-03-17 07:37:31 +00:00
Craig Topper	02cd0bfa46	[X86] Remove unused predicate. NFC llvm-svn: 298050	2017-03-17 07:37:27 +00:00
Jonas Paulsson	f496bd9a59	[SystemZ] New CodeGen tests for vector compare / select. New SystemZ tests for the improved codegen of vector compare and select, including cases with a logical combination of two compares. Review: Ulrich Weigand. https://reviews.llvm.org/D29489 llvm-svn: 298049	2017-03-17 07:11:46 +00:00
Jonas Paulsson	f20386de62	Add SystemZ to utils/update_llc_test_checks.py. Extend script for auto-generating CHECK lines so that it works for SystemZ. This is a pre-commit for the new tests resulting from https://reviews.llvm.org/D29489 llvm-svn: 298048	2017-03-17 07:11:42 +00:00
Jonas Paulsson	8a7bd24c82	[SystemZ] Add use of super-reg in splitMove() If one of the subregs of the 128 bit reg is undefined when splitMove() splits a store into two instructions, a use of an undefined physical register results. To remedy this, an implicit use of the super register is added onto both new instructions, along with propagated kill and undef flags. This was discovered with llvm-stress, and that test case is attached as test/CodeGen/SystemZ/splitMove_undefReg_mverifier.ll Thanks to Matthias Braun for helping with a nice explanation. Review: Ulrich Weigand llvm-svn: 298047	2017-03-17 06:47:08 +00:00
Craig Topper	6a1290a0fd	[AVX-512] Give priority to EVEX encoded scalar FMA instructions when we have FMA, AVX512 and no VLX. We were giving priority if VLX was enabled. llvm-svn: 298046	2017-03-17 06:10:37 +00:00
Craig Topper	c1338f21ed	[X86] Use update_llc_test_checks.py to regenerate a test and add command lines to demonstrate that we don't pick EVEX encoded instruction when AVX512 and FMA3 are both enabled. This bug only exists on the scalar llvm.fma instrinsics. Looks like we don't test the llvm.fma intrinsics very thoroughly. In fact I don't see any tests for the vector versions. llvm-svn: 298045	2017-03-17 06:00:01 +00:00
Craig Topper	30c89eeeb6	[X86] Use update_llc_test_checks.py to regenerate a test. llvm-svn: 298044	2017-03-17 05:59:57 +00:00
Craig Topper	e4d5aa7efc	[X86] Cleanup the AddedComplexity values on move immediate instructions. NFC This makes the values a little more consistent between similar instruction and reduces the values some. This results in better grouping in the isel table saving a few bytes. llvm-svn: 298043	2017-03-17 05:59:54 +00:00
Kostya Serebryany	7acabdc497	[libFuzzer] inline the code of __sanitizer_cov_trace_pc_guard into it llvm-svn: 298032	2017-03-17 01:45:15 +00:00
Kostya Serebryany	f7e610eda1	[libFuzzer] Experimenting with dictionary minimization. Summary: Tracking issue: https://github.com/google/oss-fuzz/issues/331 Reviewers: kcc Reviewed By: kcc Differential Revision: https://reviews.llvm.org/D30940 llvm-svn: 298031	2017-03-17 01:40:09 +00:00
Kostya Serebryany	a52c8d0daf	[libFuzzer] add a test with two different bugs llvm-svn: 298030	2017-03-17 01:33:16 +00:00
Sanjoy Das	c4e4dcdf64	[RSForGC] Handle vector GEPs We were not handling getelemenptr instructions of vector type before. Since getelemenptr instructions for vector types follow the same rule as getelementptr instructions for non-vector types, we can just handle them in the same way. llvm-svn: 298028	2017-03-17 00:55:53 +00:00
Zachary Turner	2d9c082033	Revert "Make NativeExeSymbol a concrete subclass of NativeRawSymbol [PDB]" For some reason this is causing ANSI color codes to be printed even when run through FileCheck. llvm-svn: 298026	2017-03-17 00:46:42 +00:00
Matthias Braun	f0b68d3fbc	SplitKit: Correctly implement partial subregister copies - This fixes a bug where subregister incompatible with the vregs register class where used. - Implement the case where multiple copies are necessary to cover a given lanemask. Differential Revision: https://reviews.llvm.org/D30438 llvm-svn: 298025	2017-03-17 00:41:39 +00:00
Matthias Braun	fa289ec7f0	VirtRegMap: Correctly deal with bundles when deleting identity copies. This fixes two problems when VirtRegMap encounters bundles: - When substituting a vreg subregister def with an actual register the internal read flag must be cleared. - Removing an identity COPY from a bundle needs to use removeFromBundle() and a newly introduced function to update SlotIndexes. No testcase here, because none of the in-tree targets trigger this, however an upcoming commit of mine will need this and the testcase there will trigger this. Differential Revision: https://reviews.llvm.org/D30925 llvm-svn: 298024	2017-03-17 00:41:33 +00:00
Eric Christopher	53da761570	Remove LessPreciseFPMADOption from TargetOptions along with all of the associated command line options and functions - it's currently unused in all of llvm and clang other than being set and reset. llvm-svn: 298023	2017-03-17 00:38:03 +00:00
Eli Friedman	da228fee0c	[ARM] Use alias analysis in ARMPreAllocLoadStoreOpt. This allows the optimization to rearrange loads and stores more aggressively. This doesn't really affect performance, but it helps codesize. Differential Revision: https://reviews.llvm.org/D30839 llvm-svn: 298021	2017-03-17 00:34:26 +00:00
Peter Collingbourne	128423f99b	LTO: Fix a potential race condition in the caching API. After the call to sys::fs::exists succeeds, indicating a cache hit, we call AddFile and the client will open the file using the supplied path. If the client is using cache pruning, there is a potential race between the pruner and the client. To avoid this, change the caching API so that it provides a MemoryBuffer to the client, and have clients use that MemoryBuffer where possible. This scheme won't work with the gold plugin because the plugin API expects a file path. So we have the gold plugin use the buffer identifier as a path and live with the race for now. (Note that the gold plugin isn't actually affected by the problem at the moment because it doesn't support cache pruning.) This effectively reverts r279883 modulo the change to use the existing path in the gold plugin. Differential Revision: https://reviews.llvm.org/D31063 llvm-svn: 298020	2017-03-17 00:34:07 +00:00
Zachary Turner	3735006063	SmallString doesn't have implicit conversion from const char*. llvm-svn: 298019	2017-03-17 00:28:23 +00:00
Zachary Turner	8fb09e3ea7	Don't rely on an implicit std::tuple constructor. Apparently it doesn't have one, so using an initializer list doesn't work correctly. llvm-svn: 298018	2017-03-17 00:16:21 +00:00
Zachary Turner	2ed2aa75bf	[pdb] Fix an uninitialized read, and add a test for it. This was originally reported in pr32249, uncovered by PTVS-Studio. There was no code coverage for this path because it was difficult to construct odd-case PDB files that were not generated by cl. Now that we can write construct minimal PDB files from YAML, it's easy to construct fragments that generate whatever we want. In this patch I add a test that creates 2 type records. One with a unique name, and one without. I verify that we can go from PDB to Yaml with no errors. In a future patch I'd like to add something like llvm-pdbdump raw -lookup-type that will just dump one record and nothing else, which should make it a bit cleaner to find this kind of thing. llvm-svn: 298017	2017-03-17 00:15:55 +00:00
Zachary Turner	42cb87f401	[PDB] It is not an error getting the "Invalid" Annotation opcode. The linker can insert invalid opcodes to indicate padding bytes, and we should not fail in this case. llvm-svn: 298016	2017-03-17 00:15:27 +00:00
Jacques Pienaar	da9352c173	clean Lanai namespace Summary: This patch cleans the namespace of the Lanai target. Reviewers: jpienaar Reviewed By: jpienaar Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30955 llvm-svn: 298015	2017-03-16 23:22:10 +00:00
Zachary Turner	78d15af1fa	Fix unit test. llvm-svn: 298014	2017-03-16 23:19:40 +00:00
Reid Kleckner	45707d4d5a	Remove getArgumentList() in favor of arg_begin(), args(), etc Users often call getArgumentList().size(), which is a linear way to get the number of function arguments. arg_size(), on the other hand, is constant time. In general, the fact that arguments are stored in an iplist is an implementation detail, so I've removed it from the Function interface and moved all other users to the argument container APIs (arg_begin(), arg_end(), args(), arg_size()). Reviewed By: chandlerc Differential Revision: https://reviews.llvm.org/D31052 llvm-svn: 298010	2017-03-16 22:59:15 +00:00
Reid Kleckner	c9a392b9dd	Remove dead F parameter from Argument constructor When Function creates its argument list, it does the ilist push_back itself. No other caller passes in a parent function, so this is dead, and it uses the soon-to-be-deleted getArgumentList accessor. llvm-svn: 298009	2017-03-16 22:58:56 +00:00
Zachary Turner	1875334f32	Fix linux build. llvm-svn: 298007	2017-03-16 22:34:18 +00:00
Adrian McCarthy	21b54cf632	Make NativeExeSymbol a concrete subclass of NativeRawSymbol [PDB] This moves exe symbol-specific method implementations out of NativeRawSymbol into a concrete subclass. Also adds implementations for hasCTypes and hasPrivateSymbols and a simple test to ensure the native reader can access the summary information for the executable from the PDB. Differential Revision: https://reviews.llvm.org/D31059 llvm-svn: 298005	2017-03-16 22:28:39 +00:00
Zachary Turner	5c5091fcb7	[Support] Support both Windows and Posix paths on both platforms. Previously which path syntax we supported dependend on what platform we were compiling LLVM on. While this is normally desirable, there are situations where we need to be able to handle a path that we know was generated on a remote host. Remote debugging, for example, or parsing debug info. 99% of the code in LLVM for handling paths was platform agnostic and literally just a few branches were gated behind pre-processor checks, so this changes those sites to use runtime checks instead, and adds a flag to every path API that allows one to override the host native syntax. Differential Revision: https://reviews.llvm.org/D30858 llvm-svn: 298004	2017-03-16 22:28:04 +00:00
Reid Kleckner	eb54909c8c	Make Argument::getArgNo() constant time, not O(#args) getArgNo is actually hot in LLVM, because its how we check for attributes on arguments: bool Argument::hasNonNullAttr() const { if (!getType()->isPointerTy()) return false; if (getParent()->getAttributes(). hasAttribute(getArgNo()+1, Attribute::NonNull)) return true; It actually shows up as the 23rd hottest leaf function in a 13s sample of LTO of llc. This grows Argument by four bytes, but I have another pending patch to shrink it by removing its ilist_node base. Reviewed By: chandlerc Subscribers: inglorion, llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D31057 llvm-svn: 298003	2017-03-16 22:25:45 +00:00
Kyle Butt	d609d6ebf0	CodeGen: BlockPlacement: Adjust test case so it covers rL297925. NFC I had ajusted the test case before when testing a chain of length 2, and then reverted it with rL296845 when I switched to 3 triangles. After running benchmarks and examining generated code at length 2 I forgot to put the test back. llvm-svn: 298000	2017-03-16 21:33:29 +00:00
Rong Xu	60faea19f8	Resubmit r297897: [PGO] Value profile for size of memory intrinsic calls R297897 inadvertently enabled annotation for memop profiling. This new patch fixed it. llvm-svn: 297996	2017-03-16 21:15:48 +00:00
Adrian Prantl	47ea6478ed	Salvage debug info from instructions about to be deleted [Reapplies r297971 and punting on finding a better API for findDbgValues()] This patch improves debug info quality in InstCombine by looking at values that are about to be deleted, checking whether there are any dbg.value instrinsics referring to them, and potentially encoding the semantics of the deleted instruction into the dbg.value's DIExpression. In the example in the testcase (which was extracted from XNU) there is a sequence of %4 = load %struct.entry, %struct.entry* %next2, align 8, !dbg !41 %5 = bitcast %struct.entry* %4 to i8, !dbg !42 %add.ptr4 = getelementptr inbounds i8, i8 %5, i64 -8, !dbg !43 %6 = bitcast i8* %add.ptr4 to %struct.entry, !dbg !44 call void @llvm.dbg.value(metadata %struct.entry %6, i64 0, metadata !20, metadata !21), !dbg 34 When these instructions are eliminated by instcombine one after another, we can still salvage the otherwise dead debug info: - Bitcasts have no effect, so have the dbg.value point to operand(0) - Loads can be expressed via a DW_OP_deref - Constant gep instructions can be replaced by DWARF expression arithmetic The API introduced by this patch is not specific to instcombine and can be useful in other places, too. rdar://problem/30725338 Differential Revision: https://reviews.llvm.org/D30919 llvm-svn: 297994	2017-03-16 21:14:09 +00:00
Michael Kuperstein	2da2bfa088	[LoopUnroll] Don't peel loops where the latch isn't the exiting block Peeling assumed this doesn't happen, but didn't check it. This fixes PR32178. Differential Revision: https://reviews.llvm.org/D30757 llvm-svn: 297993	2017-03-16 21:07:48 +00:00
Michael Zolotukhin	99de88d1f3	[SCEV] Compute affine range in another way to avoid bitwidth extending. Summary: This approach has two major advantages over the existing one: 1. We don't need to extend bitwidth in our computations. Extending bitwidth is a big issue for compile time as we often end up working with APInts wider than 64bit, which is a slow case for APInt. 2. When we zero extend a wrapped range, we lose some information (we replace the range with [0, 1 << src bit width)). Thus, avoiding such extensions better preserves information. Correctness testing: I ran 'ninja check' with assertions that the new implementation of getRangeForAffineAR gives the same results as the old one (this functionality is not present in this patch). There were several failures - I inspected them manually and found out that they all are caused by the fact that we're returning more accurate results now (see bullet (2) above). Without such assertions 'ninja check' works just fine, as well as SPEC2006. Compile time testing: CTMark/Os: - mafft/pairlocalalign -16.98% - tramp3d-v4/tramp3d-v4 -12.72% - lencod/lencod -11.51% - Bullet/bullet -4.36% - ClamAV/clamscan -3.66% - 7zip/7zip-benchmark -3.19% - sqlite3/sqlite3 -2.95% - SPASS/SPASS -2.74% - Average -5.81% Performance testing: The changes are expected to be neutral for runtime performance. Reviewers: sanjoy, atrick, pete Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30477 llvm-svn: 297992	2017-03-16 21:07:38 +00:00
Derek Schuff	b879539aac	[WebAssembly] Fix some broken type encodings in wasm binary A recent change switch the in-memory wasm value types to be signed integers, but I missing a few cases where these were being writing to the binary. Differential Revision: https://reviews.llvm.org/D31014 Patch by Sam Clegg llvm-svn: 297991	2017-03-16 20:49:48 +00:00
Zachary Turner	c9500616d8	Silence -Wcovered-switch-default warning. llvm-svn: 297990	2017-03-16 20:45:11 +00:00
Sanjay Patel	6105bb5eaf	[InstCombine] avoid breaking up bitcasted vector min/max patterns (PR32306) As the related tests show, we're not canonicalizing to this form for scalars or vectors yet, but this solves the immediate problem in: https://bugs.llvm.org/show_bug.cgi?id=32306 llvm-svn: 297989	2017-03-16 20:42:45 +00:00
Jonathan Roelofs	7fcc647a5c	Clarify a comment about mapped_iterator. NFC llvm-svn: 297988	2017-03-16 20:37:59 +00:00
Peter Collingbourne	00f200f385	ArchiveWriter: Remove unused variables. NFC. llvm-svn: 297987	2017-03-16 20:35:07 +00:00
Sanjay Patel	634e622069	[InstCombine] add tests for PR32306 and missed min/max canonicalization; NFC llvm-svn: 297986	2017-03-16 20:31:38 +00:00
Reid Kleckner	1ef2777aa4	Create msbuild only when using MSVC Summary: I could be wrong but it seems to have no use for MinGW. Related diff: https://reviews.llvm.org/D29772 Reviewers: chandlerc, rnk Reviewed By: rnk Patch by: Mateusz Mikuła Subscribers: rnk, llvm-commits, mgorny Differential Revision: https://reviews.llvm.org/D29952 llvm-svn: 297985	2017-03-16 20:24:14 +00:00
Zachary Turner	05d5e6136f	[PDB] Add support for parsing Flags from PDB Stream. This was discovered when running `llvm-pdbdump diff` against two files, the second of which was generated by running the first one through pdb2yaml and then yaml2pdb. The second one was missing some bytes from the PDB Stream, and tracking this down showed that at the end of the PDB Stream were some additional bytes that we were ignoring. Looking back to the reference code, these seem to specify some additional flags that indicate whether the PDB supports various optional features. This patch adds support for reading, writing, and round-tripping these flags through YAML and the raw dumper, and updates the tests accordingly. llvm-svn: 297984	2017-03-16 20:19:11 +00:00
Zachary Turner	02278ce09f	[llvm-pdbdump] Add support for diffing the PDB Stream. In doing so I discovered that we completely ignore some bytes of the PDB Stream after we "finish" loading it. These bytes seem to specify some additional information about what kind of data is present in the PDB. A subsequent patch will add code to read in those fields and store their values. llvm-svn: 297983	2017-03-16 20:18:41 +00:00
Adrian Prantl	fa9e84eb6d	Revert commit r297971 because of issues reported by msan. llvm-svn: 297982	2017-03-16 20:11:54 +00:00
Matthias Braun	e959544517	TargetInstrInfo: Provide default implementation of isTailCall(). In fact this default implementation should be the only implementation, keep it virtual for now to accomodate targets that don't model flags correctly. Differential Revision: https://reviews.llvm.org/D30747 llvm-svn: 297980	2017-03-16 20:02:30 +00:00
Daniel Sanders	16846764d0	[globalisel] Correct one more simple immediate that should be a ConstantInt. llvm-svn: 297979	2017-03-16 19:59:19 +00:00
Craig Topper	96efc5c227	[AVX-512] Add tests for kandn, kor, kxor, and kxnor intrinsics. llvm-svn: 297978	2017-03-16 19:58:06 +00:00
Zachary Turner	a46533b338	[llvm-pdbdump] clang-format Diff.cpp Looks like this file did not have clang-format run on it when its initial revision was committed. llvm-svn: 297977	2017-03-16 19:52:15 +00:00
Simon Pilgrim	fbfb19b1d7	Remove redundant conditions (PR31753). NFCI. llvm-svn: 297976	2017-03-16 19:52:00 +00:00
Adrian Prantl	4a7781aa38	Fix unused variable warnings. llvm-svn: 297973	2017-03-16 18:33:01 +00:00
Adrian Prantl	4377314a98	Salvage debug info from instructions about to be deleted This patch improves debug info quality in InstCombine by looking at values that are about to be deleted, checking whether there are any dbg.value instrinsics referring to them, and potentially encoding the semantics of the deleted instruction into the dbg.value's DIExpression. In the example in the testcase (which was extracted from XNU) there is a sequence of %4 = load %struct.entry, %struct.entry* %next2, align 8, !dbg !41 %5 = bitcast %struct.entry* %4 to i8, !dbg !42 %add.ptr4 = getelementptr inbounds i8, i8 %5, i64 -8, !dbg !43 %6 = bitcast i8* %add.ptr4 to %struct.entry, !dbg !44 call void @llvm.dbg.value(metadata %struct.entry %6, i64 0, metadata !20, metadata !21), !dbg 34 When these instructions are eliminated by instcombine one after another, we can still salvage the otherwise dead debug info: - Bitcasts have no effect, so have the dbg.value point to operand(0) - Loads can be expressed via a DW_OP_deref - Constant gep instructions can be replaced by DWARF expression arithmetic The API introduced by this patch is not specific to instcombine and can be useful in other places, too. rdar://problem/30725338 Differential Revision: https://reviews.llvm.org/D30919 llvm-svn: 297971	2017-03-16 18:22:52 +00:00
Peter Collingbourne	cb46a6baf7	LTO: Create temporary cache files in the cache directory instead of $TMPDIR. This fixes a race condition where another linker process can observe a partially written file if we copy it from another file system, and allows the link to be independent of the amount of free disk space in $TMPDIR. Differential Revision: https://reviews.llvm.org/D31045 llvm-svn: 297970	2017-03-16 18:20:06 +00:00
Adrian Prantl	dc855221af	Attempt to fix bot failure on Windows. Looks like this expression was accidentally using 32-bit arithmetic. llvm-svn: 297969	2017-03-16 18:06:04 +00:00
Daniel Sanders	0e64202871	[globalisel] Correct G_CONSTANT path of selectArithImmed() Earlier stages of GlobalISel always use ConstantInt in G_CONSTANT so that's what we should check for. This fixes a crash introduced in r297782. llvm-svn: 297968	2017-03-16 18:04:50 +00:00
Adrian Prantl	3621309e8d	Rearrange fields. NFC. llvm-svn: 297967	2017-03-16 17:42:47 +00:00
Adrian Prantl	a63b8e8227	Rename methods in DwarfExpression to adhere to the LLVM coding guidelines. NFC. llvm-svn: 297966	2017-03-16 17:42:45 +00:00
Adrian Prantl	981f03e6a2	PR32288: More efficient encoding for DWARF expr subregister access. Citing http://bugs.llvm.org/show_bug.cgi?id=32288 The DWARF generated by LLVM includes this location: 0x55 0x93 0x04 DW_OP_reg5 DW_OP_piece(4) When GCC's DWARF is simply 0x55 (DW_OP_reg5) without the DW_OP_piece. I believe it's reasonable to assume the DWARF consumer knows which part of a register logically holds the value (low bytes, high bytes, how many bytes, etc) for a primitive value like an integer. This patch gets rid of the redundant DW_OP_piece when a subregister is at offset 0. It also adds previously missing subregister masking when a subregister is followed by another operation. (This reapplies r297960 with two additional testcase updates). rdar://problem/31069390 https://reviews.llvm.org/D31010 llvm-svn: 297965	2017-03-16 17:14:56 +00:00
Reid Kleckner	9f3585dad3	[cmake] Refactor warning flag logic to use Unix warnings with clang-cl Summary: clang-cl understands the GCC-style -W[no-]foo flags, and for the most part ignores MSVC -wd flags. So, let's pass the curated set of warning flags we use on Unix on Windows. We can also stop passing /W4 -wd*, which for the most part corresponds to -Wall -Wextra with a bunch of flags that we mostly ignore. I had to disable -Wnon-virtual-dtor on Windows, because it fires on every COM class ever. I filed PR32286 to fix this. So far I've only found two instances of -Wstring-conversion in the WinASan code, which I'll fix. Other than that we seem clean. Reviewers: hans Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D30992 llvm-svn: 297964	2017-03-16 17:05:16 +00:00
Reid Kleckner	1275a2dec8	[IR] Inline some Function accessors I checked that all of these out-of-line methods previously compiled to simple loads and bittests, so they are pretty good candidates for inlining. In particular, arg_size() and arg_empty() are popular and are just two loads, so they seem worth inlining. llvm-svn: 297963	2017-03-16 16:57:31 +00:00
Adrian Prantl	c5b3351750	Revert "PR32288: More efficient encoding for DWARF expr subregister access." This reverts commit 2bf453116889a576956892ea9683db4fcd96e30e while investigating buildbot breakage. llvm-svn: 297962	2017-03-16 16:38:22 +00:00
Adrian Prantl	8508e87998	PR32288: More efficient encoding for DWARF expr subregister access. Citing http://bugs.llvm.org/show_bug.cgi?id=32288 The DWARF generated by LLVM includes this location: 0x55 0x93 0x04 DW_OP_reg5 DW_OP_piece(4) When GCC's DWARF is simply 0x55 (DW_OP_reg5) without the DW_OP_piece. I believe it's reasonable to assume the DWARF consumer knows which part of a register logically holds the value (low bytes, high bytes, how many bytes, etc) for a primitive value like an integer. This patch gets rid of the redundant DW_OP_piece when a subregister is at offset 0. It also adds previously missing subregister masking when a subregister is followed by another operation. rdar://problem/31069390 https://reviews.llvm.org/D31010 llvm-svn: 297960	2017-03-16 16:34:14 +00:00
Hiroshi Inoue	138a3faa3e	Test commit. llvm-svn: 297959	2017-03-16 16:30:06 +00:00
Stanislav Mekhanoshin	f80507979d	[AMDGPU] Run always inliner early in opt We can mark functions to always inline early in the opt. Since we do not have call support this early inlining creates opportunities for inter-procedural optimizations which would not occur otherwise. Differential Revision: https://reviews.llvm.org/D31016 llvm-svn: 297958	2017-03-16 16:11:46 +00:00
Simon Pilgrim	e5d7e6f5e3	[X86] Add PR22338 test case llvm-svn: 297957	2017-03-16 15:10:42 +00:00
Aditya Kumar	24f6ad51bb	Fix: Refactor SimplifyCFG:canSinkInstructions [NFC] Differential Revision: https://reviews.llvm.org/D30116 llvm-svn: 297955	2017-03-16 14:09:18 +00:00
Jonas Paulsson	facc4c5c7b	[BasicTTIImpl] Bugfix in getIntrinsicInstrCost() Don't call getScalarizationOverhead(RetTy, true, false) if RetTy is void type. Review: Hal Finkel https://reviews.llvm.org/D31024 llvm-svn: 297954	2017-03-16 14:05:34 +00:00
Bjorn Pettersson	c98dabb1a0	[InstCombine] Liberate assert in InstCombiner::visitZExt Summary: The call to canEvaluateZExtd in InstCombiner::visitZExt may return with BitsToClear == SrcTy->getScalarSizeInBits(), but there is an assert that BitsToClear should be smaller than SrcTy->getScalarSizeInBits(). I have a test case that triggers the assert, but it only happens for my downstream target. I've not been able to trigger it for any upstream target. The assert triggered for a piece of code such as this %shr1 = lshr i16 undef, 15 ... %shr2 = lshr i16 %shr1, 1 %conv = zext i16 %shr2 to i32 Normally the lshr instructions are constant folded before we visit the zext (that is why it is so hard to reproduce). The original pattern, before instcombine, is of course a lot more complicated in my test case. The shift count in the second lshr is for example determined by the outcome of a PHI instruction. It seems like other rewrites by instcombine leads up to the pattern above. And then the zext is pulled from the worklist, and visited (hitting the assert), before we detect that the lshr instrucions can be constant folded. Anyway, since the canEvaluateZExtd may return with BitsToClear equal to SrcTy->getScalarSizeInBits(), and since the rewrite that converts the expression type to avoid a zero extend works also for the case where SrcBitsKept ends up being zero, then it should be OK to liberate the assert to assert(BitsToClear <= SrcTy->getScalarSizeInBits() && "Unreasonable BitsToClear"); Reviewers: hfinkel Reviewed By: hfinkel Subscribers: hfinkel, llvm-commits Differential Revision: https://reviews.llvm.org/D30993 llvm-svn: 297952	2017-03-16 13:22:01 +00:00
Eric Liu	971de62291	Revert "[PGO] Value profile for size of memory intrinsic calls" This commit reverts r297897 and r297909. llvm-svn: 297951	2017-03-16 13:16:35 +00:00
James Henderson	566fdf4a2a	[Support] Add support for getting file system permissions on Windows and implement sys::fs::set/getPermissions to work with them This change adds support for functions to set and get file permissions, in a similar manner to the C++17 permissions() function in <filesystem>. The setter uses chmod on Unix systems and SetFileAttributes on Windows, setting the permissions as passed in. The getter simply uses the existing status() function. Prior to this change, status() would always return an unknown value for the permissions on a Windows file, making it impossible to test the new function on Windows. I have therefore added support for this as well. On Linux, prior to this change, the permissions included the file type, which should actually be accessed via a different member of the file_status class. Note that on Windows, only the *_write permission bits have any affect - if any are set, the file is writable, and if not, the file is read-only. This is in common with what MSDN describes for their behaviour of std::filesystem::permissions(), and also what boost::filesystem does. The motivation behind this change is so that we can easily test behaviour on read-only files in LLVM unit tests, but I am sure that others may find it useful in some situations. Reviewers: zturner, amccarth, aaron.ballman Differential Revision: https://reviews.llvm.org/D30736 llvm-svn: 297945	2017-03-16 11:22:09 +00:00
Chandler Carruth	814e0df1c5	[PM/Inliner] Fix a bug in r297374 where we would leave stale calls in the work queue and crash when trying to visit them after deleting the function containing those calls. llvm-svn: 297940	2017-03-16 10:45:42 +00:00
Chandler Carruth	6ef42cc6bb	[PM/Inliner] Add a test case that encapsulates the core issue addressed in r297374. I've extracted a small version of this from the C++ metaprogram Richard came up with to exercise these kinds of issues and written comments to describe both how to reproduce a fresh version of the test case and what likely failure modes are. The test case is still a bit brittle as it depends on the particular inline cost modeling and SCC visitation order, but it definitely would have caught the bug right away when developing things so it seems a really valuable test case to have. llvm-svn: 297935	2017-03-16 10:13:55 +00:00
Oren Ben Simhon	da59ffae91	Fixing typos. llvm-svn: 297932	2017-03-16 08:15:52 +00:00
Jonas Paulsson	84319bfc40	[SelectionDAG] Optimize VSELECT->SETCC of incompatible or illegal types. Don't scalarize VSELECT->SETCC when operands/results needs to be widened, or when the type of the SETCC operands are different from those of the VSELECT. (VSELECT SETCC) and (VSELECT (AND/OR/XOR (SETCC,SETCC))) are handled. The previous splitting of VSELECT->SETCC in DAGCombiner::visitVSELECT() is no longer needed and has been removed. Updated tests: test/CodeGen/ARM/vuzp.ll test/CodeGen/NVPTX/f16x2-instructions.ll test/CodeGen/X86/2011-10-19-widen_vselect.ll test/CodeGen/X86/2011-10-21-widen-cmp.ll test/CodeGen/X86/psubus.ll test/CodeGen/X86/vselect-pcmp.ll Review: Eli Friedman, Simon Pilgrim https://reviews.llvm.org/D29489 llvm-svn: 297930	2017-03-16 07:17:12 +00:00
Tobias Grosser	115c022282	[ADCE] Remove redundent code [NFC] Summary: In commit r289548 ([ADCE] Add code to remove dead branches) a redundant loop nest was accidentally introduced, which implements exactly the same functionality as has already been available right after. This redundancy has been found when inspecting the ADCE code in the context of our recent discussions on post-dominator modeling. This redundant code was also eliminated by r296535 (which sparked the discussion), but only as part of a larger semantic change of the post-dominance modeling. As this redundency in [ADCE] is really just an oversight completely independent of the post-dominance changes under discussion, we remove this redundancy independently. Reviewers: dberlin, david2050 Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31023 llvm-svn: 297929	2017-03-16 03:59:23 +00:00
Peter Collingbourne	255c6e1bae	Fix pessimising moves. llvm-svn: 297928	2017-03-16 03:54:38 +00:00
Peter Collingbourne	d1eac7bc36	Support: Add a cache pruning policy parser. The idea is that the policy string fully specifies the policy and is portable between clients. Differential Revision: https://reviews.llvm.org/D31020 llvm-svn: 297927	2017-03-16 03:42:00 +00:00
Colin LeMahieu	553ec588ae	[Hexagon] Fixing test from 297920 to add Hexagon triple. llvm-svn: 297926	2017-03-16 01:52:10 +00:00
Kyle Butt	08655997eb	CodeGen: BlockPlacement: Reduce TriangleChainCount to 2 This produces a 1% speedup on an important internal Google benchmark (protocol buffers), with no other regressions in google or in the llvm test-suite. Only 5 targets in the entire llvm test-suite are affected, and on those 5 targets the size increase is 0.027% llvm-svn: 297925	2017-03-16 01:32:29 +00:00
David Blaikie	6c5bbae7b3	Check that line table entries don't spill over between sections even if they're empty at the end of a section llvm-svn: 297922	2017-03-16 00:52:18 +00:00
David Blaikie	618d3429f2	Simplify/make more explicit (by making less explicit in some ways) some function calls llvm-svn: 297921	2017-03-16 00:43:19 +00:00
Colin LeMahieu	ddebad956e	[Hexagon] Updating inline saturate lanes for v62 version. llvm-svn: 297920	2017-03-16 00:35:28 +00:00
Kostya Serebryany	63689ca031	[libFuzzer] add a dummy LLVMFuzzerMutate to afl/afl_driver.cpp (https://github.com/google/oss-fuzz/issues/460 ) llvm-svn: 297919	2017-03-15 23:54:40 +00:00
Kostya Serebryany	6d5412596c	[libFuzzer] call __sanitizer_print_memory_profile with two params llvm-svn: 297916	2017-03-15 23:27:53 +00:00
Simon Pilgrim	cee3fc61cb	Remove redundant condition (PR32263). NFCI. llvm-svn: 297915	2017-03-15 23:27:43 +00:00
Matt Arsenault	7dc01c96ae	AMDGPU: Allow sinking of addressing modes for atomic_inc/dec llvm-svn: 297913	2017-03-15 23:15:12 +00:00
Vitaly Buka	fbf031f27b	Revert "[libFuzzer] Use ChooseUnitIdxToMutate to pick item for crossover." Fails LLVMFuzzer.LLVMFuzzer.value-profile-strncmp.test. This reverts commit r297891. llvm-svn: 297912	2017-03-15 23:08:18 +00:00
Vitaly Buka	ca6ecd213a	Revert "Revert "[PGO] Minor cleanup for count instruction in SelectInstVisitor."" Previously reverted wrong revision. This reverts commit r297910. llvm-svn: 297911	2017-03-15 23:07:41 +00:00
Vitaly Buka	de85ad895d	Revert "[PGO] Minor cleanup for count instruction in SelectInstVisitor." Fails LLVMFuzzer.LLVMFuzzer.value-profile-strncmp.test This reverts commit r297892. llvm-svn: 297910	2017-03-15 23:06:22 +00:00
Rong Xu	8acf76b6b9	Fix build failure from r297897. llvm-svn: 297909	2017-03-15 23:00:19 +00:00
Peter Collingbourne	cead56fb22	Support: Simplify the CachePruning API. NFCI. Change the function that implements the pruning into a free function that takes the policy as a struct argument. Differential Revision: https://reviews.llvm.org/D31009 llvm-svn: 297907	2017-03-15 22:54:18 +00:00
Simon Pilgrim	a7d1da833c	Removed unnecessary semicolon. NFCI. llvm-svn: 297906	2017-03-15 22:50:47 +00:00
Craig Topper	6c66bbca4a	[StackColoring] Remove unused header file for post-order traversal. Update comment that indicated we were using it when we really use a depth-first search. NFC llvm-svn: 297904	2017-03-15 22:40:26 +00:00
Matt Arsenault	02d915be90	CodeGenPrepare: Sink addressing modes for atomics llvm-svn: 297903	2017-03-15 22:35:20 +00:00
Zachary Turner	ed261f69dc	Try to fix build break due to template argument deduction. llvm-svn: 297902	2017-03-15 22:32:59 +00:00
Zachary Turner	f1220084f6	[llvm-pdbdump] Add support for diffing the String Table. llvm-svn: 297901	2017-03-15 22:19:30 +00:00
Zachary Turner	ea4e60754e	[pdb] Write the module info and symbol record streams. Previously we did not have support for writing detailed module information for each module, as well as the symbol records. This patch adds support for this, and in doing so enables the ability to construct minimal PDBs from just a few lines of YAML. A test is added to illustrate this functionality. llvm-svn: 297900	2017-03-15 22:18:53 +00:00
Eric Christopher	17ce8a2f5e	Fix up grammar in a comment. llvm-svn: 297898	2017-03-15 21:50:46 +00:00
Rong Xu	4ed52798ce	[PGO] Value profile for size of memory intrinsic calls This patch adds the value profile support to profile the size parameter of memory intrinsic calls: memcpy, memcmp, and memmov. Differential Revision: http://reviews.llvm.org/D28965 llvm-svn: 297897	2017-03-15 21:47:27 +00:00
Zvi Rackover	9ad1b235d5	Second attempt for fix Hexagon buildbot by moving test to under X86/ llvm-svn: 297893	2017-03-15 21:13:45 +00:00
Rong Xu	d709b0fe95	[PGO] Minor cleanup for count instruction in SelectInstVisitor. Summary: NSIs can be double-counted by different operations in SelectInstVisitor. Sink the the update to VM_counting mode only. Also reset the value for each counting operation. Reviewers: davidxl Reviewed By: davidxl Subscribers: xur, llvm-commits Differential Revision: https://reviews.llvm.org/D30999 llvm-svn: 297892	2017-03-15 21:05:24 +00:00
Vitaly Buka	b99247e608	[libFuzzer] Use ChooseUnitIdxToMutate to pick item for crossover. Differential Revision: https://reviews.llvm.org/D30683 llvm-svn: 297891	2017-03-15 20:59:27 +00:00
David Blaikie	718e8ad239	Add GDB pretty-printer for llvm::Twine type Patch by Simon Marchi! Differential Revision: https://reviews.llvm.org/D30994 llvm-svn: 297889	2017-03-15 20:51:44 +00:00
Zvi Rackover	de43c859f5	Limit test's triple in attempt to fix broken buildbot Regression test for a target-independent bug keeps failing in the Hexagon backend due to what appears an unrelated issue. llvm-svn: 297888	2017-03-15 20:29:58 +00:00
Adrian McCarthy	ad6d60a46b	NFC: Corrects comments that were supposed to go in with earlier commit. llvm-svn: 297887	2017-03-15 20:29:06 +00:00
Sanjay Patel	f1e1fba1b0	[EarlyCSE] reduce indent; NFCI llvm-svn: 297886	2017-03-15 20:25:05 +00:00
Daniel Sanders	bdfebb8121	[tablegen][globalisel] Trivial changes to reduce size of following patch. NFC. Summary: * Move namespace { * Trivial: Typo * RuleMatcher: Separate class and definition * Trivial: const findNodeEquiv Reviewers: t.p.northover, qcolombet, aditya_nandakumar, ab, rovka Reviewed By: rovka Subscribers: dberris, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D30531 llvm-svn: 297884	2017-03-15 20:18:38 +00:00
Adrian McCarthy	65d2688842	Introduce NativeEnumModules and NativeCompilandSymbol Together, these allow lldb-pdbdump to list all the modules from a PDB using a native reader (rather than DIA). Note that I'll probably be specializing NativeRawSymbol in a subsequent patch. Differential Revision: https://reviews.llvm.org/D30956 llvm-svn: 297883	2017-03-15 20:17:58 +00:00
Zvi Rackover	48cdde0e59	[DAGCombine] Bail out if can't create a vector with at least two elements Summary: Fixes pr32278 Reviewers: igorb, craig.topper, RKSimon, spatel, hfinkel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30978 llvm-svn: 297878	2017-03-15 19:48:36 +00:00
Derek Schuff	13f080fbeb	[WebAssembly] Update format of 'names' section. This change updates to the format of the 'names' sectionin the generated wasm binary to match the latest changesto the design and 'wabt'. Differential Revision: https://reviews.llvm.org/D30950 Patch by Sam Clegg llvm-svn: 297877	2017-03-15 19:36:02 +00:00
Simon Pilgrim	06c70adcf0	[X86] Add missing BITREVERSE costs for SSE2 vectors and i8/i16/i32/i64 scalars Prep work for PR31810 llvm-svn: 297876	2017-03-15 19:34:55 +00:00
Ahmed Bougacha	2fb8030748	[GlobalISel] Avoid translating synthetic constants to new G_CONSTANTS. Currently, we create a G_CONSTANT for every "synthetic" integer constant operand (for instance, for the G_GEP offset). Instead, share the G_CONSTANTs we might have created by going through the ValueToVReg machinery. When we're emitting synthetic constants, we do need to get Constants from the context. One could argue that we shouldn't modify the context at all (for instance, this means that we're going to use a tad more memory if the constant wasn't used elsewhere), but constants are mostly harmless. We currently do this for extractvalue and all. For constant fcmp, this does mean we'll emit an extra COPY, which is not necessarily more optimal than an extra materialized constant. But that preserves the current intended design of uniqued G_CONSTANTs, and the rematerialization problem exists elsewhere and should be resolved with a single coherent solution. llvm-svn: 297875	2017-03-15 19:21:11 +00:00
Ahmed Bougacha	62cd73d989	[GlobalISel][AArch64] Select ADDXri. We're now able to select ADDWri thanks to the new complex pattern support. Extend that to ADDXri. llvm-svn: 297874	2017-03-15 19:20:59 +00:00
Matt Arsenault	86e02ce2dc	AMDGPU: Fix unnecessary ands when packing f16 vectors computeKnownBits didn't handle fp_to_fp16 to report the high bits as 0. ARM maps the generic node to an instruction that does not modify the high bits of the register, so introduce a target node where the high bits are known 0. llvm-svn: 297873	2017-03-15 19:04:26 +00:00
Tim Northover	0d98b03b9f	ARM: avoid clobbering register in v6 jump-table expansion. If we got unlucky with register allocation and actual constpool placement, we could end up producing a tTBB_JT with an index that's already been clobbered. Technically, we might be able to fix this situation up with a MOV, but I think the constant islands pass is complex enough without having to deal with more weird edge-cases. llvm-svn: 297871	2017-03-15 18:38:13 +00:00
Rong Xu	a3bbf96eba	[PGO] Refactor the code for value profile annotation This patch refactors the code for value profile annotation to facilitate of adding other kind of value profiles. Differential Revision: http://reviews.llvm.org/D30989 llvm-svn: 297870	2017-03-15 18:23:39 +00:00
Ahmed Bougacha	07f247b6c2	[GlobalISel] Insert translated switch icmp blocks after switch parent. Now that we preserve the IR layout, we would end up with all the newly synthesized switch comparison blocks at the end of the function. Instead, use a hopefully more reasonable layout, with the comparison blocks immediately following the switch comparison blocks. llvm-svn: 297869	2017-03-15 18:22:37 +00:00
Ahmed Bougacha	a61c214f51	[GlobalISel] Preserve IR block layout. It makes the output function layout more predictable; the layout has an effect on performance, we don't want it to be at the mercy of the translator's visitation order and such. The predictable output is also easier to digest. getOrCreateBB isn't appropriately named anymore, as it never needs to create anything. Rename it and extract the MBB creation logic out of it. A couple tests were sensitive to the order. Update them. llvm-svn: 297868	2017-03-15 18:22:33 +00:00
Ahmed Bougacha	d8319e99d8	[GlobalISel] Use ASCII quote/apostrophe in comment. NFC. llvm-svn: 297867	2017-03-15 18:22:24 +00:00
Matt Arsenault	0e6e018054	AMDGPU: Minor SIAnnotateControlFlow cleanups Newline fixes, early return, range loops. llvm-svn: 297865	2017-03-15 18:00:12 +00:00
Zachary Turner	84efd4deb6	[YAML] When outputting, provide the ability to write default values. Previously, if you attempted to write a key/value pair and the value was equal to the key's default value, we would not output the value. Sometimes it is useful to be able to see this value in the output anyway. llvm-svn: 297864	2017-03-15 17:47:39 +00:00
Reid Kleckner	9d53b6ab48	Move some LAST_* enum sentinels out of their enums These are not valid values of the enum, so this will improve clang -Wcovered-switch-default diagnostics. It also fixes some -Wbitfield-enum-conversion warnings. llvm-svn: 297863	2017-03-15 17:43:40 +00:00
Craig Topper	bcb6093610	[CodeGen] Use APInt::setLowBits/setHighBits/setBitsFrom in more places This patch replaces ORs with getHighBits/getLowBits etc. with setLowBits/setHighBits/setBitsFrom. In a few of the places we weren't ORing, but the KnownZero/KnownOne vectors were already initialized to zero. We exploit this in most places already there were just some that were inconsistent. Differential Revision: https://reviews.llvm.org/D30965 llvm-svn: 297860	2017-03-15 16:53:53 +00:00
Ahmed Bougacha	1a6deeefe0	[GlobalISel][AArch64] Add back constant select tests. NFC. More of r297856. llvm-svn: 297859	2017-03-15 16:51:41 +00:00
Ahmed Bougacha	d691cf731c	[GlobalISel][AArch64] Use appropriate test function names. NFC. These FP tests are on FPR, not GPR. Don't lie in the name. llvm-svn: 297857	2017-03-15 16:29:40 +00:00
Ahmed Bougacha	170778f0db	[GlobalISel][AArch64] Split out select tests. NFC. The test has grown enough to be annoying to navigate. While there, Remove unnecessary RUNs, and cleanup a couple comments. llvm-svn: 297856	2017-03-15 16:29:37 +00:00
Ahmed Bougacha	2b7f1377aa	[GlobalISel] Remove dead member. NFC. llvm-svn: 297855	2017-03-15 16:29:32 +00:00
Peter Collingbourne	11c03c3e03	Re-apply r296730, "cmake: Configure the ThinLTO cache directory when using ELF lld or gold." All known cache-related bugs observed when self hosting have been fixed (r296907 and r297853). llvm-svn: 297854	2017-03-15 16:28:43 +00:00
Peter Collingbourne	d44a01aae6	CodeGen: Use the source filename as the argument to .file, rather than the module ID. Using the module ID here is wrong for a couple of reasons: 1) The module ID is not persisted, so we can end up with different object file contents given the same input file (for example if the same file is accessed via different paths). 2) With ThinLTO the module ID field may contain the path to a bitcode file, which is incorrect, as the .file argument is supposed to contain the path to a source file. Differential Revision: https://reviews.llvm.org/D30584 llvm-svn: 297853	2017-03-15 16:24:52 +00:00
Simon Pilgrim	018eedd9a5	[SelectionDAG] Support BUILD_VECTOR implicit truncation in SelectionDAG::ComputeNumSignBits (PR32273) llvm-svn: 297852	2017-03-15 16:22:24 +00:00
Nemanja Ivanovic	ffcf0fb1cc	[PowerPC][Altivec] Add mfvrd and mffprd extended mnemonic mfvrd and mffprd are both alias to mfvrsd. This patch enables correct parsing of the aliases, but we still emit a mfvrsd. Committing on behalf of brunoalr (Bruno Rosa). Differential Revision: https://reviews.llvm.org/D29177 llvm-svn: 297849	2017-03-15 16:04:53 +00:00
Simon Pilgrim	a5f332edd1	[SelectionDAG][AArch64] Add test case showing incorrect SelectionDAG::ComputeNumSignBits BUILD_VECTOR handling Reduced from a mixture of PR32273 and David Green's test cases showing SelectionDAG::ComputeNumSignBits not correctly handling BUILD_VECTOR implicit truncation of inputs. llvm-svn: 297847	2017-03-15 15:40:34 +00:00
Sanjay Patel	fa929a2134	Cyle -> Cycle; NFCI llvm-svn: 297846	2017-03-15 15:37:42 +00:00
Eric Liu	8c7d28b2f1	Revert "Refactor SimplifyCFG:canSinkInstructions [NFC]" This reverts commit r297839, which breaks Transforms/SimplifyCFG/sink-common-code.ll llvm-svn: 297845	2017-03-15 15:29:42 +00:00
Artyom Skrobov	e72e1ba434	Revert "[Thumb1] Fix the bug when adding/subtracting -2147483648" This reverts r297820 which apparently fails on A15 hosts. llvm-svn: 297842	2017-03-15 14:50:43 +00:00
Simon Pilgrim	6778b8f715	Reverted unintended commit llvm-svn: 297841	2017-03-15 14:47:30 +00:00
Simon Pilgrim	3804a12fc3	Fix Wint-in-bool-context warning (PR32248) llvm-svn: 297840	2017-03-15 14:38:19 +00:00
Aditya Kumar	ee55bf3e34	Refactor SimplifyCFG:canSinkInstructions [NFC] llvm-svn: 297839	2017-03-15 14:26:45 +00:00
Sam Parker	db20d48336	Reverting r297821 due to breaking lld test. llvm-svn: 297838	2017-03-15 14:06:42 +00:00
Sanjay Patel	8633d96836	[Target] fix typo; NFC llvm-svn: 297836	2017-03-15 14:01:28 +00:00
Eric Liu	8f49635500	Add 'REQUIRES: asserts' to pr32278.ll introduced in r297822 llvm-svn: 297835	2017-03-15 13:37:20 +00:00
Simon Pilgrim	493f4462bf	[X86][SSE] Fixed shuffle MOVSS/MOVSD combining of all zeroable inputs Turns out it can happen, so the assertion was too harsh Found during fuzz testing llvm-svn: 297833	2017-03-15 13:16:46 +00:00
Petar Jovanovic	b71386a4a4	[Mips] Add support to match more patterns for DEXT and CINS This patch adds support for recognizing more patterns to match to DEXT and CINS instructions. It finds cases where multiple instructions could be replaced with a single DEXT or CINS instruction. For example, for the following: define i64 @dext_and32(i64 zeroext %a) { entry: %and = and i64 %a, 4294967295 ret i64 %and } instead of generating: 0000000000000088 <dext_and32>: 88: 64010001 daddiu at,zero,1 8c: 0001083c dsll32 at,at,0x0 90: 6421ffff daddiu at,at,-1 94: 03e00008 jr ra 98: 00811024 and v0,a0,at 9c: 00000000 nop the following gets generated: 0000000000000068 <dext_and32>: 68: 03e00008 jr ra 6c: 7c82f803 dext v0,a0,0x0,0x20 Cases that are covered: DEXT: 1. and $src, mask where mask > 0xffff 2. zext $src zero extend from i32 to i64 CINS: 1. and (shl $src, pos), mask 2. shl (and $src, mask), pos 3. zext (shl $src, pos) zero extend from i32 to i64 Patch by Violeta Vukobrat. Differential Revision: https://reviews.llvm.org/D30464 llvm-svn: 297832	2017-03-15 13:10:08 +00:00
Simon Pilgrim	a0b0b74b9a	Align cost model columns. NFCI. llvm-svn: 297824	2017-03-15 11:57:42 +00:00
Zvi Rackover	4aacd5d3c4	Fix malformed XFAIL in previous commit llvm-svn: 297823	2017-03-15 11:44:14 +00:00
Zvi Rackover	81f7b88910	[DAGCombine] Add reproducer for pr32278 llvm-svn: 297822	2017-03-15 11:34:51 +00:00
Sam Parker	274472f7c5	[ARM] Fix for branch label disassembly for Thumb Different MCInstrAnalysis classes for arm and thumb mode, each with their own evaluateBranch implementation. I added a test case and fixed the coff-relocations test to use '<label>:' rather than '<label>' in the CHECK-LABEL entries, since the ones without the colon would match branch targets. Might be worth noticing that llvm-objdump does not lookup the relocation and thus assigns it a target depending on the encoded immediate which #0, so it thinks it branches to the next instruction. Committed on behalf of Andre Vieira (avieira). Differential Revision: https://reviews.llvm.org/D30943 llvm-svn: 297821	2017-03-15 10:21:23 +00:00
Artyom Skrobov	3fa5fd1dd2	[Thumb1] Fix the bug when adding/subtracting -2147483648 Differential Revision: https://reviews.llvm.org/D30829 llvm-svn: 297820	2017-03-15 10:19:16 +00:00
Nuno Lopes	ae455c562d	fix gcc -Wmisleading-indentation [NFC] llvm-svn: 297816	2017-03-15 09:33:33 +00:00

... 3 4 5 6 7 ...

146655 Commits