llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	8bda035121	[X86][AVX] Tests showing missing SHUFPD + ZERO lowering This lowers to SHUFPD if the input is zeroinitializer but not with a demanded elts optimized build vector. llvm-svn: 298370	2017-03-21 13:30:40 +00:00
Valery Pykhtin	fd4c410f4d	[AMDGPU] Iterative scheduling infrastructure + minimal registry scheduler Differential revision: https://reviews.llvm.org/D31046 llvm-svn: 298368	2017-03-21 13:15:46 +00:00
Volkan Keles	044e003203	[GlobalISel] Fix shufflevector tests clang-lld-x86_64-2stage fails because of the order of the instructions. `CHECK-DAG` directives should fix the problem. llvm-svn: 298367	2017-03-21 13:12:59 +00:00
Sam Kolton	f60ad58dad	[ADMGPU] SDWA peephole optimization pass. Summary: First iteration of SDWA peephole. This pass tries to combine several instruction into one SDWA instruction. E.g. it converts: ''' V_LSHRREV_B32_e32 %vreg0, 16, %vreg1 V_ADD_I32_e32 %vreg2, %vreg0, %vreg3 V_LSHLREV_B32_e32 %vreg4, 16, %vreg2 ''' Into: ''' V_ADD_I32_sdwa %vreg4, %vreg1, %vreg3 dst_sel:WORD_1 dst_unused:UNUSED_PAD src0_sel:WORD_1 src1_sel:DWORD ''' Pass structure: 1. Iterate over machine instruction in basic block and try to apply "SDWA patterns" to each of them. SDWA patterns match machine instruction into either source or destination SDWA operand. E.g. ''' V_LSHRREV_B32_e32 %vreg0, 16, %vreg1''' is matched to source SDWA operand '''%vreg1 src_sel:WORD_1'''. 2. Iterate over found SDWA operands and find instruction that could be potentially coverted into SDWA. E.g. for source SDWA operand potential instruction are all instruction in this basic block that uses '''%vreg0''' 3. Iterate over all potential instructions and check if they can be converted into SDWA. 4. Convert instructions to SDWA. This review contains basic implementation of SDWA peephole pass. This pass requires additional testing fot both correctness and performance (no performance testing done). There are several ways this pass can be improved: 1. Make this pass work on whole function not only basic block. As I can see this can be done right now without changes to pass. 2. Introduce more SDWA patterns 3. Introduce mnemonics to limit when SDWA patterns should apply Reviewers: vpykhtin, alex-t, arsenm, rampitec Subscribers: wdng, nhaehnle, mgorny Differential Revision: https://reviews.llvm.org/D30038 llvm-svn: 298365	2017-03-21 12:51:34 +00:00
Andrea Di Biagio	7937be7dd3	[DebugInfo][X86] Teach Optimize LEAs pass to handle debug values This patch fixes an issue in the Optimize LEAs pass where redundant LEAs were not removed because they were being used by debug values. The debug values are now ignored when determining whether LEAs are redundant. For now the debug values for the redundant LEAs are marked as undefined, effectively lost. The intention is for a follow up patch which will attempt to preserve the debug values where possible. Patch by Andrew Ng. Differential Revision: https://reviews.llvm.org/D30835 llvm-svn: 298360	2017-03-21 11:36:21 +00:00
Artur Pilipenko	4cc6130f52	NFC. InstCombiner::visitFAdd extract LHSIntVal/RHSIntVal local variables llvm-svn: 298359	2017-03-21 11:32:15 +00:00
Volkan Keles	47debaeef0	[GlobalISel] Move isTriviallyDead to Utils. NFC. Make it accessible by the targets to avoid code duplication. llvm-svn: 298358	2017-03-21 10:47:35 +00:00
Jonas Paulsson	54c7680e1f	[DAGTypeLegalizer] Handle widening truncate to vector of i1. Previously, PromoteIntRes_TRUNCATE() did not handle the case where the operand needs widening, which resulted in llvm_unreachable(). This patch adds the needed handling, along with a test case. Review: Eli Friedman, Simon Pilgrim. https://reviews.llvm.org/D31077 llvm-svn: 298357	2017-03-21 10:24:14 +00:00
David Green	da21170c49	[ConstantFolding] Fix to prevent constant folding having to repeatedly scan operands. NFCI After the loop unroll threshold was increased in r295538, very large constant expressions can be created. This prevents them from having to be recursively scanned, leading to a compile time blow-up. Differential Revision: https://reviews.llvm.org/D30689 llvm-svn: 298356	2017-03-21 10:17:39 +00:00
Volkan Keles	75bdc7690e	[GlobalISel] Translate shufflevector Reviewers: qcolombet, aditya_nandakumar, t.p.northover, javed.absar, ab, dsanders Reviewed By: javed.absar Subscribers: dberris, rovka, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D30962 llvm-svn: 298347	2017-03-21 08:44:13 +00:00
Craig Topper	a9611ea4c0	[APFloat] Tag the fltSemantic getter functions with LLVM_READNONE. This gives about an 8k reduction in the size of the opt binary on my local x86-64 build. llvm-svn: 298344	2017-03-21 07:18:00 +00:00
Craig Topper	1fb691ef33	[APInt] Add LLVM_READONLY to some methods. llvm-svn: 298342	2017-03-21 06:19:46 +00:00
Jonas Paulsson	bd65421f08	[SystemZ] Don't drop MO flags in foldMemoryOperandImpl() The def operand of the new LG/LD should have the old def operands flags and subreg index. New test: test/CodeGen/SystemZ/fold-memory-op-impl.ll Review: Ulrich Weigand llvm-svn: 298341	2017-03-21 05:49:40 +00:00
Serge Pavlov	23be94599d	Fix evaluation of LLVM_DEFINITIONS CMake variable LLVM_DEFINITIONS collects preprocessor definitions provided for host compiler that builds llvm components. A function add_llvm_definitions was introduced in AddLLVMDefinitions.cmake to keep track of these definitions and was intended to be a replacement for CMake command add_definitions. Actually in many cases add_definitions is still used and the content of LLVM_DEFINITIONS is not actual now. On the other hand the current version of CMake allows getting set of definitions in a more convenient way. This fix implements evaluation of the variable by reading corresponding cmake property. Differential Revision: https://reviews.llvm.org/D31125 llvm-svn: 298336	2017-03-21 04:03:24 +00:00
Vitaly Buka	c12716e742	Revert "[Hexagon] Recognize polynomial-modulo loop idiom again" Fix memory leaks on check-llvm tests detected by Asan. This reverts commit r298282. llvm-svn: 298329	2017-03-21 00:59:51 +00:00
Eli Friedman	76732acc23	[ARM] Revert r297443 and r297820. The glueless lowering of addc/adde in Thumb1 has known serious miscompiles (see https://reviews.llvm.org/D31081), and r297820 causes an infinite loop for certain constructs. It's not clear when they will be fixed, so let's just take them out of the tree for now. (I resolved a small conflict with r297453.) llvm-svn: 298328	2017-03-21 00:26:39 +00:00
Zachary Turner	5821a3bf36	[Support] Fill the file_status struct with link count. Differential Revision: https://reviews.llvm.org/D31110 llvm-svn: 298326	2017-03-20 23:55:20 +00:00
Zachary Turner	82a0c97b32	Add a function to MD5 a file's contents. In doing so, clean up the MD5 interface a little. Most existing users only care about the lower 8 bytes of an MD5, but for some users that care about the upper and lower, there wasn't a good interface. Furthermore, consumers of the MD5 checksum were required to handle endianness details on their own, so it seems reasonable to abstract this into a nicer interface that just gives you the right value. Differential Revision: https://reviews.llvm.org/D31105 llvm-svn: 298322	2017-03-20 23:33:18 +00:00
Vadzim Dambrouski	ba789cbd3d	[ARM] Fix PR32130: Handle promotion of zero sized constants. The special case of zero sized values was previously not handled correctly. This patch handles this by not promoting if the size is zero. Patch by Tim Neumann. Differential Revision: https://reviews.llvm.org/D31116 llvm-svn: 298320	2017-03-20 22:59:57 +00:00
Sanjay Patel	f238902f52	[x86] add tests for setcc of i128/i256; NFC llvm-svn: 298317	2017-03-20 22:15:40 +00:00
Matt Arsenault	6b00d40900	InstCombine: Check source value precision when reducing cast intrinsic Missed this check when porting from the libcall version. llvm-svn: 298312	2017-03-20 21:59:24 +00:00
Tim Northover	4340d64f91	GlobalISel: add implicit defs & uses when mutating an instruction. Otherwise a scheduler might do bad things to the code we produce. llvm-svn: 298311	2017-03-20 21:58:23 +00:00
Adrian Prantl	956484b7b5	Replace uses of DwarfExpression::addMachineReg* with addMachineRegExpression and mark the methods as protected. Besides reducing the surface area of DwarfExpression, this is in preparation for an upcoming bugfix in the DwarfExpression implementation, for which it will be necessary to defer emitting register operations until the rest of the expression is known. NFC llvm-svn: 298309	2017-03-20 21:35:09 +00:00
Adrian Prantl	52884b7be8	Make implementation details in DwarfExpression protected. (NFC) llvm-svn: 298308	2017-03-20 21:34:19 +00:00
Evgeniy Stepanov	e829eecc05	[Fuchsia] Use %gs for ABI slots under -mcmodel=kernel Make x86_64-fuchsia targets under -mcmodel=kernel use %gs rather than %fs to access ABI slots for stack-protector and safe-stack Patch by Roland McGrath. Differential Revision: https://reviews.llvm.org/D30870 llvm-svn: 298302	2017-03-20 20:35:37 +00:00
Eli Friedman	b1578d3612	[SCEV] Fix trip multiple calculation If loop bound containing calculations like min(a,b), the Scalar Evolution API getSmallConstantTripMultiple returns 4294967295 "-1" as the trip multiple. The problem is that, SCEV use -1 * umax to represent umin. The multiple constant -1 was returned, and the logic of guarding against huge trip counts was skipped. Because -1 has 32 active bits. The fix attempt to factor more general cases. First try to get the greatest power of two divisor of trip count expression. In case overflow happens, the trip count expression is still divisible by the greatest power of two divisor returned. Returns 1 if not divisible by 2. Patch by Huihui Zhang <huihuiz@codeaurora.org> Differential Revision: https://reviews.llvm.org/D30840 llvm-svn: 298301	2017-03-20 20:25:46 +00:00
David L. Jones	d61548471c	[X86] Clean up test/CodeGen/X86/2006-03-01-InstrSchedBug.ll Summary: - Migrated from grep to FileCheck. - Re-indented, removed boilerplate comments. - Added 'entry' label at beginning of basic block. Patch by Jorge Gorbe! Reviewed By: RKSimon Subscribers: RKSimon, jgorbe, llvm-commits Differential Revision: https://reviews.llvm.org/D30317 llvm-svn: 298298	2017-03-20 20:10:30 +00:00
Zachary Turner	b7dbd87d5e	Explicitly add move constructor/assignment operators. These are needed due to some obscure rules in the standard about how std::vector selects between copy and move constructors, which can cause a conforming implementation to attempt to select the copy constructor of RuleMatcher, which will fail since std::unique_ptr<> isn't copyable. llvm-svn: 298294	2017-03-20 19:56:52 +00:00
Nirav Dave	f5f0864ac2	Add test case for merging of chained stores of mismatched type. llvm-svn: 298293	2017-03-20 19:48:22 +00:00
Kevin Enderby	a8d256cb36	Add the rest of the error checking for Mach-O dyld compact bind entry errors and test cases for each of the error checks. To do this more plumbing was needed so that the segment indexes and segment offsets can be checked. Basically what was done was the SegInfo from llvm-objdump’s MachODump.cpp was moved into libObject for Mach-O objects as BindRebaseSegInfo and it is only created when an iterator for bind or rebase entries are created. This commit really only adds the error checking and test cases for the bind table entires and the checking for the lazy bind and weak bind entries are still to be fully done as well as the rebase entires. Though some of the plumbing for those are added with this commit. Those other error checks and test cases will be added in follow on commits. Note, the two llvm_unreachable() calls should now actually be unreachable with the error checks in place and would take a logic bug in the error checking code to be reached if the segment indexes and segment offsets are used from a checked bind entry. Comments have been added to the methods that require the arguments to have been checked prior to calling. llvm-svn: 298292	2017-03-20 19:46:55 +00:00
Evgeniy Stepanov	c440572715	Revert r298158. Revert "[asan] Fix dead stripping of globals on Linux." OOM in gold linker. llvm-svn: 298288	2017-03-20 18:45:34 +00:00
Simon Pilgrim	008f9d2e76	Fixed include order. NFCI. llvm-svn: 298286	2017-03-20 18:33:23 +00:00
Peter Collingbourne	d786047583	Try using llvm::errc, should hopefully fix version mismatch problem on clang-s390x-linux bot. llvm-svn: 298285	2017-03-20 18:19:41 +00:00
Krzysztof Parzyszek	8490251de3	[Hexagon] Recognize polynomial-modulo loop idiom again Regain the ability to recognize loops calculating polynomial modulo operation. This ability has been lost due to some changes in the preceding optimizations. Add code to preprocess the IR to a form that the pattern matching code can recognize. llvm-svn: 298282	2017-03-20 18:12:58 +00:00
Konstantin Zhuravlyov	2534bc07f4	[AMDGPU] Run always inliner early in opt Differential Revision: https://reviews.llvm.org/D31141 llvm-svn: 298281	2017-03-20 18:06:45 +00:00
Daniel Berlin	fa42a23cfc	Add missing updated test from VN coercion changes. Instructions were renamed. NFC llvm-svn: 298280	2017-03-20 18:04:19 +00:00
David Blaikie	795dc94614	Fix UB found by -Wtautological-undefined-compare llvm-svn: 298279	2017-03-20 18:01:07 +00:00
Jonathan Roelofs	8d1ba8943f	Mention that the Sphinx docs build wants version 1.5 or later `misc.highlighting_failure` support was added to `suppress_warnings` in that version, and the warnings-as-errors docs build relies on it. llvm-svn: 298277	2017-03-20 17:49:36 +00:00
Reid Kleckner	8819c73878	[WinEH] Adjust decision to emit SEH moves for leaf functions Move the check for "MF->hasWinCFI()" up into the calculation of the shouldEmitMoves boolean, rather than putting it in the early returning if. This ensures that endFunction doesn't try to emit .seh_* directives for leaf functions. llvm-svn: 298276	2017-03-20 17:45:59 +00:00
Tim Northover	89268b183f	GlobalISel: allow quad-precision values to be dumped. Otherwise the fallback path fails with an assertion on AAPCS AArch64 targets, when "long double" is encountered. llvm-svn: 298273	2017-03-20 16:52:08 +00:00
Peter Collingbourne	25a17ba4c7	Support, LTO: When pruning a directory, ignore files matching a prefix. This is a safeguard against data loss if the user specifies a directory that is not a cache directory. Teach the existing cache pruning clients to create files with appropriate names. Differential Revision: https://reviews.llvm.org/D31109 llvm-svn: 298271	2017-03-20 16:41:57 +00:00
Dehao Chen	e593049fb0	Updates branch_weights annotation for call instructions during inlining. Summary: Inliner should update the branch_weights annotation to scale it to proper value. Reviewers: davidxl, eraman Reviewed By: eraman Subscribers: zzheng, llvm-commits Differential Revision: https://reviews.llvm.org/D30767 llvm-svn: 298270	2017-03-20 16:40:44 +00:00
Adrian Prantl	6d80a262d5	Use isa<> instead of dyn_cast<> (NFC). llvm-svn: 298268	2017-03-20 16:39:41 +00:00
David Blaikie	820a833125	Avoid potential ADL ambiguity between llvm::make_unique and std::make_unique (when building as C++14+) Patch by Moritz Kiefer! llvm-svn: 298266	2017-03-20 16:35:33 +00:00
Dmitry Preobrazhensky	1e124e1825	[AMDGPU][MC] Fix for Bugs 28201, 28199, 28170 + LIT tests This fix enables sp3 abs modifier with constants Reviewers: artem.tamazov Differential Revision: https://reviews.llvm.org/D30825 llvm-svn: 298265	2017-03-20 16:33:20 +00:00
Craig Topper	d92d2fc763	[InstCombine] Print a debug message when we constant fold an operand during worklist creation InstCombine tries to constant fold instruction operands during worklist building, but we don't print that we're doing this. We also set a change flag here that causes us to rebuild and rerun the worklist one more time even if processing the worklist itself created no additional changes. So in the log I saw two inst combine runs that visited all instructions without printing that anything was changed. I may be submitting another patch to remove the change flag unless I can find some reason why we should be doing that. Differential Revision: https://reviews.llvm.org/D31091 llvm-svn: 298264	2017-03-20 16:31:14 +00:00
Jessica Paquette	02cbfb2926	[Outliner] ACTUALLY remove the errs output I don't know how to type. This fixes the last commit which would have made all of the overflows legal, and kept the screaming. llvm-svn: 298263	2017-03-20 16:25:04 +00:00
Daniel Berlin	12883b1673	Templatize parts of VNCoercion, and add constant-only versions of the functions to be used in NewGVN. NFCI. Summary: This is ground work for the changes to enable coercion in NewGVN. GVN doesn't care if they end up constant because it eliminates as it goes. NewGVN cares. IRBuilder and ConstantFolder deliberately present the same interface, so we use this to our advantage to templatize our functions to make them either constant only or not. Reviewers: davide Subscribers: llvm-commits, Prazek Differential Revision: https://reviews.llvm.org/D30928 llvm-svn: 298262	2017-03-20 16:08:29 +00:00
Jessica Paquette	5d59a4ee19	[Outliner] Remove output for offset range check Forgot to remove some output before committing last time. (Instruction fixups don't actually overflow anywhere in the test suite so far, so I missed it). To prevent the outliner from screaming "Overflow!" in the event that that does happen, this commit removes that output. llvm-svn: 298260	2017-03-20 15:51:45 +00:00
Daniel Sanders	b96f40dd03	[tablegen][globalisel] Capture instructions into locals and related infrastructure for multiple instructions matches. Summary: Prepare the way for nested instruction matching support by having actions like CopyRenderer look up operands in the RuleMatcher rather than a specific InstructionMatcher. This allows actions to reference any operand from any matched instruction. It works by checking the 'shape' of the match and capturing each matched instruction to a local variable. If the shape is wrong (not enough operands, leaf nodes where non-leafs are expected, etc.), then the rule exits early without checking the predicates. Once we've captured the instructions, we then test the predicates as before (except using the local variables). If the match is successful, then we render the new instruction as before using the local variables. It's not noticable in this patch but by the time we support multiple instruction matching, this patch will also cause a significant improvement to readability of the emitted code since MRI.getVRegDef(I->getOperand(0).getReg()) will simply be MI1 after emitCxxCaptureStmts(). This isn't quite NFC because I've also fixed a bug that I'm surprised we haven't encountered yet. It now checks there are at least the expected number of operands before accessing them with getOperand(). Depends on D30531 Reviewers: t.p.northover, qcolombet, aditya_nandakumar, ab, rovka Reviewed By: rovka Subscribers: dberris, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D30535 llvm-svn: 298257	2017-03-20 15:20:42 +00:00

1 2 3 4 5 ...

146487 Commits