llvm-project

Commit Graph

Author	SHA1	Message	Date
Stanislav Mekhanoshin	5680b0ca9f	[AMDGPU] fcanonicalize elimination optimization We are using multiplication by 1.0 to flush denormals and quiet sNaNs. It is possible to omit this multiplication if the source of the fcanonicalize instruction is known to be flushed/quieted, i.e. if it comes from another instruction known to do the normalization and we are using IEEE mode to quiet sNaNs. Differential Revision: https://reviews.llvm.org/D35218 llvm-svn: 307848	2017-07-12 21:20:28 +00:00
Anna Thomas	8e431a9851	[LoopUnrollRuntime] NFC: Refactored safety checks of unrolling multi-exit loop Refactored the code and separated out a function `canSafelyUnrollMultiExitLoop` to reduce redundant checks and make it easier to add profitability heuristics later. Added tests to runtime unrolling to make sure that unrolling for multi-exit loops is not done unless the option -unroll-runtime-multi-exit is true. llvm-svn: 307843	2017-07-12 20:55:43 +00:00
Michael Kuperstein	fdb46b2fb4	[LV] Don't allow outside uses of IVs if the SCEV is predicated on loop conditions. This fixes PR33706. Differential Revision: https://reviews.llvm.org/D35227 llvm-svn: 307837	2017-07-12 19:53:55 +00:00
Simon Dardis	e171a913d6	[mips][mt][6/7] Add support for mftr, mttr instructions. Unlike many other instructions, these instructions have aliases which take coprocessor registers, gpr register, accumulator (and dsp accumulator) registers, floating point registers, floating point control registers and coprocessor 2 data and control operands. For the moment, these aliases are treated as pseudo instructions which are expanded into the underlying instruction. As a result, disassembling these instructions shows the underlying instruction and not the alias. Reviewers: slthakur, atanasyan Differential Revision: https://reviews.llvm.org/D35253 llvm-svn: 307836	2017-07-12 19:47:45 +00:00
Jakub Kuderski	b323f4f173	[LoopRotate] Fix DomTree update logic for unreachable nodes. Fix PR33701. Summary: LoopRotate manually updates the DomTree by iterating over all predecessors of a basic block and computing the Nearest Common Dominator. When a predecessor happens to be unreachable, `DT.findNearestCommonDominator` returns nullptr. This patch teaches LoopRotate to handle this case and fixes [[ https://bugs.llvm.org/show_bug.cgi?id=33701 \| PR33701 ]]. In the future, LoopRotate should be taught to use the new incremental API for updating the DomTree. Reviewers: dberlin, davide, uabelho, grosser Subscribers: efriedma, mzolotukhin Differential Revision: https://reviews.llvm.org/D35074 llvm-svn: 307828	2017-07-12 18:42:16 +00:00
Sanjay Patel	4450e73b5e	[x86] improve SBB optimizations for SETB/SETA with subtract This is another step towards removing a combine that turns sext into select of constants and preparing the backend for an IR future where select is the canonical form. Earlier commits in this area: https://reviews.llvm.org/rL306040 https://reviews.llvm.org/rL306072 https://reviews.llvm.org/rL307404 (https://reviews.llvm.org/D34652) https://reviews.llvm.org/rL307471 llvm-svn: 307821	2017-07-12 17:56:46 +00:00
Sanjay Patel	6d6c06879c	[x86] add tests for improving sbb transforms; NFC We're subtracting X from X the hard way... llvm-svn: 307819	2017-07-12 17:44:50 +00:00
Justin Bogner	4fc696635d	GlobalISel: Handle selection of G_IMPLICIT_DEF in AArch64 A generic variant of IMPLICIT_DEF was added in r306875, but this survives to selection and hits a `Cannot Select`. Add handling that converts the note to a regular IMPLICIT_DEF. llvm-svn: 307817	2017-07-12 17:32:32 +00:00
George Burgess IV	6f92d2dd24	Add a test for r307754 As promised in D35003. Uses -codegenprepare instead of -instcombine since we hit the same buggy path anyway, and CGP lets us keep this test really simple (instcombine likes turning the alloca T, N into alloca [N x T], which hides the bug this is testing for). llvm-svn: 307811	2017-07-12 16:30:37 +00:00
Simon Dardis	76eb647e1e	[mips][mt][5/7] Add support for fork and yield instructions. Reviewers: slthakur, atanasyan Differential Revision: https://reviews.llvm.org/D35252 llvm-svn: 307808	2017-07-12 16:23:57 +00:00
Rafael Espindola	1e6b49e144	Add back a CHECK line. I accidentally removed it in r307730. Thanks to Martin Storsjö for noticing! llvm-svn: 307801	2017-07-12 16:14:00 +00:00
Evandro Menezes	14ba3d7730	[CodeGen] Add dependency printer Add SDep printer to make debugging sessions more productive. Differential revision: https://reviews.llvm.org/D35144 llvm-svn: 307799	2017-07-12 15:30:59 +00:00
Davide Italiano	a63981aaa9	[X86/FastIsel] Fall-back to SelectionDAG when lowering soft-floats. FastIsel can't handle them, so we would end up crashing during register class selection. Fixes PR26522. Differential Revision: https://reviews.llvm.org/D35272 llvm-svn: 307797	2017-07-12 15:26:06 +00:00
Daniel Neilson	57226ef33c	Add element atomic memmove intrinsic Summary: Continuing the work from https://reviews.llvm.org/D33240, this change introduces an element unordered-atomic memmove intrinsic. This intrinsic is essentially memmove with the implementation requirement that all loads/stores used for the copy are done with unordered-atomic loads/stores of a given element size. Reviewers: eli.friedman, reames, mkazantsev, skatkov Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34884 llvm-svn: 307796	2017-07-12 15:25:26 +00:00
Simon Dardis	2de1ddbd9c	[mips][mt][4/7] Add IAS support for dvpe, evpe instructions. Reviewers: slthakur, atanasyan Differential Revision: https://reviews.llvm.org/D35251 llvm-svn: 307793	2017-07-12 14:48:27 +00:00
Simon Pilgrim	8dfbc772d7	[X86][SSE] Fix file check prefix warning breaking buildbots llvm-svn: 307790	2017-07-12 13:41:13 +00:00
Kamil Rytarowski	cce21c1dfe	Make shell redirection construct portable Summary: NetBSD shell sh(1) does not support ">& /dev/null" construct. This is bashism. The portable and POSIX solution is to use: "> /dev/null 2>&1". This change fixes 22 Unexpected Failures on NetBSD/amd64 for the "check-llvm" target. Sponsored by <The NetBSD Foundation> Reviewers: joerg, dim, rnk Reviewed By: joerg, rnk Subscribers: rnk, davide, llvm-commits Differential Revision: https://reviews.llvm.org/D35277 llvm-svn: 307789	2017-07-12 13:24:46 +00:00
John Brawn	97cc283117	[ARM] Adjust ifcvt heuristic for the diamond ifcvt case When we have a diamond ifcvt the fallthrough block will have a branch at the end of it that disappears when predicated, so discount it from the predication cost. Differential Revision: https://reviews.llvm.org/D34952 llvm-svn: 307788	2017-07-12 13:23:10 +00:00
Simon Pilgrim	ebbb969d21	[X86][SSE] Add 512-bit (iX bitcast(vXi1)) test cases Improves test coverage for pre-AVX512 targets as well llvm-svn: 307783	2017-07-12 12:44:10 +00:00
Simon Dardis	7323f7ac63	[mips][mt] Add missing files from last commit llvm-svn: 307779	2017-07-12 12:33:40 +00:00
Florian Hahn	745266b2a7	[Linker] Add directives to support mixing ARM/Thumb module-level inline asm. Summary: By prepending `.text .thumb .balign 2` to the module-level inline assembly from a Thumb module, the assembler will generate the assembly from that module as Thumb, even if the destination module uses an ARM triple. Similar directives are used for module-level inline assembly in ARM modules. The alignment and instruction set are reset based on the target triple before emitting the first function label. Reviewers: olista01, tejohnson, echristo, t.p.northover, rafael Reviewed By: echristo Subscribers: aemerson, javed.absar, eraman, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D34622 llvm-svn: 307772	2017-07-12 11:52:28 +00:00
Diana Picus	21014df5e0	[ARM] GlobalISel: Select s64 G_FCMP Very similar to how we select s32 G_FCMP, the only thing that is different is the exact opcodes that we use. llvm-svn: 307763	2017-07-12 09:01:54 +00:00
Michael Zuckerman	fce5c67920	[X86][LLVM]Expanding Supports lowerInterleavedStore() in X86InterleavedAccess. Adding base test for AVX512 llvm-svn: 307761	2017-07-12 08:01:44 +00:00
Matthias Braun	053b084263	Specify complete target triple in test This should fix the problems on the greendragon build. llvm-svn: 307747	2017-07-12 01:16:50 +00:00
Peter Collingbourne	cacac6a104	LowerTypeTests: When importing functions skip definitions where the summary contains a decl. This normally indicates mixed CFI + non-CFI compilation, and will result in us treating the function in the same way as a function defined outside of the LTO unit. Part of PR33752. Differential Revision: https://reviews.llvm.org/D35281 llvm-svn: 307744	2017-07-12 00:39:12 +00:00
Sam Clegg	9c07f94a1f	[WebAssembly] Expose the offset of each data segment Summary: This allows tools like lld that process relocations to apply data relocations correctly. This information is required because relocations are stored as section offsets. Subscribers: jfb, dschuff, jgravelle-google, aheejin Differential Revision: https://reviews.llvm.org/D35234 llvm-svn: 307741	2017-07-12 00:24:54 +00:00
Reid Kleckner	8d8888ff42	[codeview] Change readobj symbol dumping format Avoid duplicating DictScope with hand-written names everywhere. Print the S_-prefixed symbol kind for every record. This should make it easier to search for certain kinds of records when debugging PDB linking. llvm-svn: 307732	2017-07-11 23:41:41 +00:00
Rafael Espindola	1beb702ba2	Fully fix the movw/movt addend. The issue is not if the value is pcrel. It is whether we have a relocation or not. If we have a relocation, the static linker will select the upper bits. If we don't have a relocation, we have to do it. llvm-svn: 307730	2017-07-11 23:18:25 +00:00
Davide Italiano	b8ad3eebca	[IPO] Temporarily rollback r307215. [GlobalOpt] Remove unreachable blocks before optimizing a function. While the change is presumably correct, it exposes a latent bug in DI which breaks one of the CFI checks. I'll analyze it further and try to understand what's going on. llvm-svn: 307729	2017-07-11 23:10:17 +00:00
Konstantin Zhuravlyov	bb80d3e1d3	Enhance synchscope representation OpenCL 2.0 introduces the notion of memory scopes in atomic operations to global and local memory. These scopes restrict how synchronization is achieved, which can result in improved performance. This change extends existing notion of synchronization scopes in LLVM to support arbitrary scopes expressed as target-specific strings, in addition to the already defined scopes (single thread, system). The LLVM IR and MIR syntax for expressing synchronization scopes has changed to use syncscope("<scope>"), where <scope> can be "singlethread" (this replaces singlethread keyword), or a target-specific name. As before, if the scope is not specified, it defaults to CrossThread/System scope. Implementation details: - Mapping from synchronization scope name/string to synchronization scope id is stored in LLVM context; - CrossThread/System and SingleThread scopes are pre-defined to efficiently check for known scopes without comparing strings; - Synchronization scope names are stored in SYNC_SCOPE_NAMES_BLOCK in the bitcode. Differential Revision: https://reviews.llvm.org/D21723 llvm-svn: 307722	2017-07-11 22:23:00 +00:00
Sanjay Patel	7c026cb1af	[x86] auto-generate full checks; NFC llvm-svn: 307718	2017-07-11 22:04:36 +00:00
Simon Dardis	805f1e03b8	[mips][mt][2/7] Implement .module and .set directives for the MT ASE. This patch implements the .module and .set directives for the MT ASE, notably that .module sets the relevant flags in .MIPS.abiflags and .set doesn't. Reviewers: slthakur, atanasyan Differential Revision: https://reviews.llvm.org/D35249 llvm-svn: 307716	2017-07-11 21:28:36 +00:00
Martin Storsjo	0e83e85f63	[ARM, ELF] Don't shift movt relocation offsets For ELF, a movw+movt pair is handled as two separate relocations. If an offset should be applied to the symbol address, this offset is stored as an immediate in the instruction (as opposed to stored as an offset in the relocation itself). Even though the actual value stored in the movt immediate after linking is the top half of the value, we need to store the unshifted offset prior to linking. When the relocation is made during linking, the offset gets added to the target symbol value, and the upper half of the value is stored in the instruction. This makes sure that movw+movt with offset symbols get properly handled, in case the offset addition in the lower half should be carried over to the upper half. This makes the output from the additions to the test case match the output from GNU binutils. For COFF and MachO, the movw/movt relocations are handled as a pair, and the overflow from the lower half gets carried over to the movt, so they should keep the shifted offset just as before. Differential Revision: https://reviews.llvm.org/D35242 llvm-svn: 307713	2017-07-11 21:07:10 +00:00
Xinliang David Li	801b5319c5	[ProfileData] Add new option to dump topn hottest functions Differential Revision: http://reviews.llvm.org/D35155 llvm-svn: 307702	2017-07-11 20:30:43 +00:00
Davide Italiano	ee1c82112e	[NewGVN] Check for congruency of memory accesses. This is fine as nothing in the code relies on leader and memory leader being the same for a given congruency class. Ack'ed by Dan. Fixes PR33720. llvm-svn: 307699	2017-07-11 19:49:12 +00:00
Michael Zuckerman	1fe5628aa0	reverting 307677. llvm-svn: 307698	2017-07-11 19:46:11 +00:00
Tony Jiang	892f8c42dc	[PPC] Fix one test case regression for patch https://reviews.llvm.org/D34337 . llvm-svn: 307691	2017-07-11 19:07:10 +00:00
Evgeniy Stepanov	3d5ea713f7	[msan] Only check shadow memory for operands that are sized. Fixes PR33347: https://bugs.llvm.org/show_bug.cgi?id=33347. Differential Revision: https://reviews.llvm.org/D35160 Patch by Matt Morehouse. llvm-svn: 307684	2017-07-11 18:13:52 +00:00
Simon Dardis	ae719c5a17	[mips][mt][1/7] Add the MT ASE as a subtarget feature. Preparatory work for adding the MIPS MT (multi-threading) ASE instructions. Reviewers: slthakur, atanasyan Differential Revision: https://reviews.llvm.org/D35247 llvm-svn: 307679	2017-07-11 18:03:20 +00:00
Michael Zuckerman	4b6d01a008	[X86][LLVM]Expanding Supports lowerInterleavedStore() in X86InterleavedAccess. Base test for AVX512: adding a new base test to trunk before committing the change to the test llvm-svn: 307677	2017-07-11 17:17:49 +00:00
Anna Thomas	5526a33f4f	[LoopUnrollRuntime] Avoid multi-exit nested loop with epilog generation The loop structure for the outer loop does not contain the epilog preheader when we try to unroll inner loop with multiple exits and epilog code is generated. For now, we just bail out in such cases. Added a test case that shows the problem. Without this bailout, we would trip on assert saying LCSSA form is incorrect for outer loop. llvm-svn: 307676	2017-07-11 17:16:33 +00:00
Krzysztof Parzyszek	f67cd8259d	[Hexagon] Do not rely on callee-saved info in hasFP llvm-svn: 307675	2017-07-11 17:11:54 +00:00
Tony Jiang	d5acad053b	[PPC] Fix two bugs in frame lowering. 1. The available program storage region of the red zone to compilers is 288 bytes rather than 244 bytes. 2. The formula for negative number alignment calculation should be y = x & ~(n-1) rather than y = (x + (n-1)) & ~(n-1). Differential Revision: https://reviews.llvm.org/D34337 llvm-svn: 307672	2017-07-11 16:42:20 +00:00
Krzysztof Parzyszek	c86e2ef3f5	[Hexagon] Add support for nontemporal loads and stores on HVX Patch by Michael Wu. Differential Revision: https://reviews.llvm.org/D35104 llvm-svn: 307671	2017-07-11 16:39:33 +00:00
Diana Picus	1e33c9c166	[ARM] GlobalISel: Tighten G_FCMP selection test. NFC Use CHECK-NEXT for the comparison sequence, to make sure we don't get any unexpected instructions in the middle of our flag manipulation efforts. llvm-svn: 307656	2017-07-11 12:34:33 +00:00
George Rimar	0493e436ee	[DWARF] - Add testcase for checking message about broken relocations. Addresses comments for r306677, which fixed error message itself. llvm-svn: 307655	2017-07-11 12:29:07 +00:00
Guy Blank	509d1b2a5a	[X86][AVX512] regenerate avx512-insert-extract.ll llvm-svn: 307654	2017-07-11 11:51:49 +00:00
Diana Picus	069da27f49	[ARM] GlobalISel: Add reg mapping for s64 G_FCMP Map the result into GPR and the operands into FPR. llvm-svn: 307653	2017-07-11 11:47:45 +00:00
Diana Picus	84baba20db	[ARM] GlobalISel: Tighten legalizer tests. NFC Make sure that all the legalizer tests where the original instruction needs to be removed check for the removal. We do this by adding CHECK-NOT lines before and after the replacement sequence. This won't catch pathological cases where the instruction remains somewhere in the middle of the instruction sequence that's supposed to replace it, but hopefully that won't occur in practice (since ideally we'd be setting the insert point for the new instruction sequence either before or after the original instruction and not fiddle with it while building the sequence). llvm-svn: 307647	2017-07-11 10:52:08 +00:00
Daniel Sanders	57938df813	[globalisel][tablegen] Fix a multi-insn match bug where ComplexPattern is used on multiple insns. In each rule, each use of ComplexPattern is assigned an element in the Renderers array. The matcher then collects renderer functions in this array and they are used to render instructions. This works well for a single instruction but a bug in the allocation mechanism causes the elements to be assigned on a per-instruction basis rather than a per-rule basis. So in the case of: (set GPR32:$dst, (Op complex:$src1, complex:$src2)) tablegen currently assigns elements 0 and 1 to $src1 and $src2 respectively, but for: (set GPR32:$dst, (Op complex:$src1, (Op complex:$src2))) it currently assigns both $src1 and $src2 the same element (0). This results in one complex operand being rendered twice and the other being forgotten. This patch corrects the allocation such that $src1 and $src2 are still allocated different elements in this case. llvm-svn: 307646	2017-07-11 10:40:18 +00:00
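
The renderer allocation bug described in commit 57938df813 can be illustrated with a small Python model. This is only a sketch of the idea, not the TableGen implementation: the `Complex` and `Insn` classes and both allocator functions are hypothetical names invented for this example. It contrasts a counter that restarts for every instruction with a counter shared across the whole rule.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Union


@dataclass
class Complex:
    """Hypothetical stand-in for a ComplexPattern operand such as complex:$src1."""
    name: str


@dataclass
class Insn:
    """Hypothetical stand-in for one instruction in a match pattern."""
    opcode: str
    operands: List[Union["Insn", Complex]] = field(default_factory=list)


def allocate_per_insn(root: Insn) -> Dict[str, int]:
    """Model of the buggy scheme: the element counter restarts per instruction."""
    assigned: Dict[str, int] = {}

    def visit(insn: Insn) -> None:
        next_id = 0  # reset for every instruction, so nested operands can collide
        for op in insn.operands:
            if isinstance(op, Complex):
                assigned[op.name] = next_id
                next_id += 1
            else:
                visit(op)

    visit(root)
    return assigned


def allocate_per_rule(root: Insn) -> Dict[str, int]:
    """Model of the corrected scheme: one counter shared by the whole rule."""
    assigned: Dict[str, int] = {}
    next_id = 0

    def visit(insn: Insn) -> None:
        nonlocal next_id
        for op in insn.operands:
            if isinstance(op, Complex):
                assigned[op.name] = next_id
                next_id += 1
            else:
                visit(op)

    visit(root)
    return assigned


# (Op complex:$src1, complex:$src2)
flat = Insn("Op", [Complex("src1"), Complex("src2")])
# (Op complex:$src1, (Op complex:$src2))
nested = Insn("Op", [Complex("src1"), Insn("Op", [Complex("src2")])])

print(allocate_per_insn(flat))    # distinct elements in the single-instruction case
print(allocate_per_insn(nested))  # both operands share element 0
print(allocate_per_rule(nested))  # distinct elements again
```

In this model the per-instruction allocator handles the flat pattern correctly and only collides on the nested one, which matches the commit's description of when the bug shows up.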

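
The carry problem that commit 0e83e85f63 describes for ELF movw+movt pairs can be shown with plain integer arithmetic. The sketch below is a simplified model, not the linker's or assembler's code: the function names and the example symbol address are assumptions chosen for illustration, and it assumes the linker adds the stored offset to the symbol value before selecting each 16-bit half, as the commit message states.

```python
def link_movw(symbol: int, stored_offset: int) -> int:
    """Simplified model: low 16 bits of (symbol + offset stored in the movw)."""
    return (symbol + stored_offset) & 0xFFFF


def link_movt(symbol: int, stored_offset: int) -> int:
    """Simplified model: high 16 bits of (symbol + offset stored in the movt)."""
    return ((symbol + stored_offset) >> 16) & 0xFFFF


symbol = 0x0001FFF0   # hypothetical symbol address
offset = 0x20         # offset applied to the symbol; the low-half sum carries

low = link_movw(symbol, offset)

# Storing the unshifted offset in the movt lets the carry from the low half
# propagate into the upper half when the linker does the addition.
high_unshifted = link_movt(symbol, offset)

# Storing the pre-shifted offset (offset >> 16) loses that carry.
high_shifted = link_movt(symbol, offset >> 16)

print(hex((high_unshifted << 16) | low))  # 0x20010, equal to symbol + offset
print(hex((high_shifted << 16) | low))    # 0x10010, upper half is off by one
```

For COFF and MachO the commit notes that the relocations are handled as a pair, so this model does not apply to them.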

46048 Commits
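
The second fix in commit d5acad053b ([PPC] Fix two bugs in frame lowering) concerns which alignment formula to use for negative numbers. The following Python sketch shows only the arithmetic of the two formulas quoted in the commit message. The function names are hypothetical and the sketch is not the PPC frame lowering code. It assumes `n` is a power of two and relies on Python integers behaving like two's complement values under bitwise operations.

```python
def align_down(x: int, n: int) -> int:
    """y = x & ~(n-1): rounds toward negative infinity to a multiple of n."""
    if n <= 0 or n & (n - 1) != 0:
        raise ValueError("n must be a positive power of two")
    return x & ~(n - 1)


def align_up(x: int, n: int) -> int:
    """y = (x + (n-1)) & ~(n-1): rounds toward positive infinity."""
    if n <= 0 or n & (n - 1) != 0:
        raise ValueError("n must be a positive power of two")
    return (x + (n - 1)) & ~(n - 1)


# For a negative offset, rounding up moves it toward zero, so the aligned
# value no longer covers the original 20 bytes below the base.
print(align_up(-20, 16))    # -16
# Rounding down keeps at least the original magnitude.
print(align_down(-20, 16))  # -32
```

For positive values the situation is reversed, which is presumably why the round-up formula is the familiar one and easy to misapply to negative offsets.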