llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	ca57e53ded	[SelectionDAG] Add support for vector demandedelts in SRA opcodes llvm-svn: 286461	2016-11-10 15:05:09 +00:00
Simon Pilgrim	37c9034bd6	[DAGCombiner] Correctly extract the ConstOrConstSplat shift value for SHL nodes We were failing to extract a constant splat shift value if the shifted value was being masked. The (shl (and (setcc) N01CV) N1CV) -> (and (setcc) N01CV<<N1CV) combine was unnecessarily preventing this. llvm-svn: 286454	2016-11-10 14:35:09 +00:00
Simon Pilgrim	3bf99c056a	[SelectionDAG] Add support for vector demandedelts in SHL/SRL opcodes llvm-svn: 286448	2016-11-10 13:52:42 +00:00
Oliver Stannard	18ca2adf2d	[ARM] Thumb2 LDR (literal) should accept PC as the destination The version of this instruction with the .w suffix already correctly accepts this, but the alias without the .w did not. Differential Revision: https://reviews.llvm.org/D26499 llvm-svn: 286446	2016-11-10 13:20:41 +00:00
Sanjoy Das	3d75b62ffe	[SCEVExpander] Hoist unsigned divisions when safe That is, when the divisor is a constant non-zero. llvm-svn: 286438	2016-11-10 07:56:12 +00:00
Sanjoy Das	e30a281449	[SCEVExpander] Don't hoist divisions Fixes PR30942. llvm-svn: 286437	2016-11-10 07:56:09 +00:00
Craig Topper	bd298c37d1	[AVX-512] Allow legacy cvtpd2dq intrinsics to select EVEX encoded instruction when available. llvm-svn: 286435	2016-11-10 07:47:17 +00:00
Craig Topper	e0845d8e8c	[AVX-512][X86] Convert avx_cvtt_ps2dq_256 and sse2_cvttps2dq intrinsics to ISD::FP_TO_SINT in the intrinsics table and delete patterns. While nearby also move CVTDQ2PS patterns into their instructions. This allows these intrinsics to also use EVEX instructions. llvm-svn: 286434	2016-11-10 07:24:52 +00:00
Craig Topper	f37b9b9b5f	[X86] Convert int_x86_avx_cvtt_pd2dq_256 to fp_to_sint using the intrinsics table. Removes extra patterns and allows legacy intrinsic to select EVEX encoded instructions when available. llvm-svn: 286433	2016-11-10 06:45:39 +00:00
Craig Topper	2afed2c790	[X86] Move some custom patterns into the currently empty pattern of their corresponding instructions. NFC llvm-svn: 286432	2016-11-10 06:45:37 +00:00
Craig Topper	1d2e74f030	[X86] Remove some patterns still referencing int_x86_sse2_cvttpd2dq that should have been removed in r286344. NFC llvm-svn: 286431	2016-11-10 06:45:34 +00:00
Sanjoy Das	0ae390abce	[SCEV] Eta reduce some lambdas; NFC llvm-svn: 286429	2016-11-10 06:33:54 +00:00
Sanjay Patel	4e1b5a53c7	[InstCombine] avoid infinite loop from shuffle-extract-insert sequence (PR30923) Removing the limitation in visitInsertElementInst() causes several regressions because we're not prepared to fold sequences of shuffles or inserts and extracts separated by shuffles. Fixing that appears to be a difficult mission because we are purposely trying to avoid creating shuffles with arbitrary shuffle masks because some targets may choke on those. https://llvm.org/bugs/show_bug.cgi?id=30923 llvm-svn: 286423	2016-11-10 00:15:14 +00:00
Peter Collingbourne	32ab3a817d	Re-apply r286384, "X86: Introduce the "relocImm" ComplexPattern, which represents a relocatable immediate.", with a fix for 32-bit x86. Teach X86InstrInfo::analyzeCompare() not to crash on CMP and SUB instructions that take a global address operand. llvm-svn: 286420	2016-11-09 23:53:43 +00:00
Dehao Chen	38a666d6e5	Add isHotBB helper function to ProfileSummaryInfo Summary: This will unify all BB hotness checks. Reviewers: eraman, davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D26353 llvm-svn: 286415	2016-11-09 23:36:02 +00:00
Eli Friedman	ddbf83ea14	Preserve assumption cache in loop-rotate. No testcase included because I can't figure out how to reduce it. (It's easy to write a testcase where rotation clones an assume, but that doesn't actually seem to trigger the crash in opt on its own; maybe an issue with the laziness?) Differential Revision: https://reviews.llvm.org/D26434 llvm-svn: 286410	2016-11-09 23:05:01 +00:00
Tim Northover	09dd2496b7	GlobalISel: fix typo. NFC llvm-svn: 286408	2016-11-09 22:40:02 +00:00
Tim Northover	a9105be437	GlobalISel: translate invoke and landingpad instructions Pretty bare-bones support for exception handling (no weird MSVC stuff, no SjLj etc), but it should get things going. llvm-svn: 286407	2016-11-09 22:39:54 +00:00
Mehdi Amini	67d1a41226	Make BitcodeReader::parseIdentificationBlock() robust to EOF This method is particular: it iterates at the top-level and does not have an enclosing block. llvm-svn: 286394	2016-11-09 21:26:49 +00:00
Evgeny Stupachenko	c2698cd903	Minor unroll pass refactoring. Summary: Unrolled Loop Size calculations moved to a function. Constant representing number of optimized instructions when "back edge" becomes "fall through" replaced with variable. Some comments added. Reviewers: mzolotukhin Differential Revision: http://reviews.llvm.org/D21719 From: Evgeny Stupachenko <evstupac@gmail.com> llvm-svn: 286389	2016-11-09 19:56:39 +00:00
Sanjoy Das	26f28a2836	[Verifier] clang-format a section; NFC Suggested in D26438 since I'm touching related code. llvm-svn: 286388	2016-11-09 19:36:39 +00:00
Sanjoy Das	6b46a0d1e8	[SCEV] Refactor out a useful pattern; NFC llvm-svn: 286386	2016-11-09 18:22:43 +00:00
Peter Collingbourne	a9cadeddd4	Revert r286384, "X86: Introduce the "relocImm" ComplexPattern, which represents a relocatable immediate." Suspected to be the cause of a sanitizer-windows bot failure: Assertion failed: isImm() && "Wrong MachineOperand accessor", file C:\b\slave\sanitizer-windows\llvm\include\llvm/CodeGen/MachineOperand.h, line 420 llvm-svn: 286385	2016-11-09 18:17:50 +00:00
Peter Collingbourne	4c15db45e4	X86: Introduce the "relocImm" ComplexPattern, which represents a relocatable immediate. A relocatable immediate is either an immediate operand or an operand that can be relocated by the linker to an immediate, such as a regular symbol in non-PIC code. Start using relocImm for 32-bit and 64-bit MOV instructions, and for operands of type "imm32_su". Remove a number of now-redundant patterns. Differential Revision: https://reviews.llvm.org/D25812 llvm-svn: 286384	2016-11-09 17:51:58 +00:00
Krzysztof Parzyszek	f817efbbb0	[Hexagon] Silence "sometimes uninitialized" warning in HexagonCopyToCombine llvm-svn: 286383	2016-11-09 17:50:46 +00:00
Peter Collingbourne	7f00d0a125	Bitcode: Change the materializer interface to return llvm::Error. Differential Revision: https://reviews.llvm.org/D26439 llvm-svn: 286382	2016-11-09 17:49:19 +00:00
Krzysztof Parzyszek	a540997ce4	[Hexagon] Separate Hexagon subreg indices for different register classes For pairs of 32-bit registers: isub_lo, isub_hi. For pairs of vector registers: vsub_lo, vsub_hi. Add generic subreg indices: ps_sub_lo, ps_sub_hi, and a function HexagonRegisterInfo::getHexagonSubRegIndex(RegClass, GenericSubreg) that returns the appropriate subreg index for RegClass. llvm-svn: 286377	2016-11-09 16:19:08 +00:00
Krzysztof Parzyszek	601d7eb11a	[Hexagon] Eliminate Insert4 pseudo-instruction, use combines instead llvm-svn: 286368	2016-11-09 14:16:29 +00:00
Jonas Paulsson	e127fe7083	[SystemZ] A few fixes in scheduler files. Review: U Weigand llvm-svn: 286362	2016-11-09 12:47:57 +00:00
Pavel Labath	c207bec388	Remove TimeValue usage from Scalar/SROA.cpp. NFC. llvm-svn: 286361	2016-11-09 12:07:12 +00:00
Pavel Labath	775bbc3736	Zero-initialize chrono duration objects The default duration constructor does not zero-initialize the object, we need to do that manually. llvm-svn: 286359	2016-11-09 11:43:57 +00:00
Jonas Paulsson	28f29487b9	[MachineScheduler] Comments fixing. The name/comment of the third argument to the ScheduleDAGMI constructor is RemoveKillFlags and not IsPostRA. Only the comments are changed. Review: A Trick llvm-svn: 286350	2016-11-09 09:59:27 +00:00
Alexandros Lamprineas	0ee3ec2fe4	[ARM] Loop Strength Reduction crashes when targeting ARM or Thumb. Scalar Evolution asserts when not all the operands of an Add Recurrence Expression are loop invariants. Loop Strength Reduction should only create affine Add Recurrences, so that both the start and the step of the expression are loop invariants. Differential Revision: https://reviews.llvm.org/D26185 llvm-svn: 286347	2016-11-09 08:53:07 +00:00
Craig Topper	f334ac19ad	[AVX-512] Add lowering to cvttpd2udq/cvttps2udq for fptoui v2f64/2f32 to 2i32 This patch adds support for fptoui to 2i32 from both 2f64 and 2f32, building on Simon's change for the signed version in r284459 and using AVX-512 instructions. If we don't have VLX support we need to use a 512-bit operation for v2f64->v2i32 and extract the result. It also recognises that cvttpd2udq zeroes the upper 64-bits of the xmm result. Differential Revision: https://reviews.llvm.org/D26331 llvm-svn: 286345	2016-11-09 07:48:51 +00:00
Craig Topper	731bf9c5d6	[X86] Lower AVX512 and SSE intrinsics for CVTTPD2DQ to X86ISD::CVTTPD2DQ. Summary: This allows the SSE intrinsic to use the EVEX instruction when available. It also fixes EVEX to not use a weird (v4i32 (fp_to_sint v2f64)) node and it merges some isel patterns. This also fixes some cases that weren't combining vzmovl with cvttpd2dq to remove extra moves. Reviewers: delena, zvi, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D26330 llvm-svn: 286344	2016-11-09 07:31:32 +00:00
Craig Topper	28e3dfc02b	[AVX-512] Use alignedstore256 in patterns that look for stores of the lower 256-bits of a 512-bit vector to use a 256-bit aligned store. Previously we were only checking for 16 byte alignment instead of 32 byte alignment. Fixes PR30947. llvm-svn: 286342	2016-11-09 05:31:57 +00:00
Craig Topper	5c842be9a0	[AVX-512] Make VBMI instruction set enabling imply that the BWI instruction set is also enabled. Summary: This is needed to make the v64i8 and v32i16 types legal for the 512-bit VBMI instructions. Fixes PR30912. Reviewers: delena, zvi Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D26322 llvm-svn: 286339	2016-11-09 04:50:48 +00:00
Mehdi Amini	b6a11a7879	Revert "[ThinLTO] Prevent exporting of locals used/defined in module level asm" This reverts commit r286297. Introduces a dependency from libAnalysis to libObject, which I missed during the review. llvm-svn: 286329	2016-11-09 01:45:13 +00:00
Peter Collingbourne	7576cb0fa7	Bitcode: Remove the remnants of the BitcodeDiagnosticInfo class. The BitcodeReader no longer produces BitcodeDiagnosticInfo diagnostics. The only remaining reference was in the gold plugin; the code there has been dead since we stopped producing InvalidBitcodeSignature error codes in r225562. While at it remove the InvalidBitcodeSignature error code. llvm-svn: 286326	2016-11-09 01:09:11 +00:00
Dehao Chen	947dbe1254	Enable Loop Sink pass for functions that has profile. Summary: For functions with profile data, we are confident that loop sink will be optimal in sinking code. Reviewers: davidxl, hfinkel Subscribers: mehdi_amini, mzolotukhin, llvm-commits Differential Revision: https://reviews.llvm.org/D26155 llvm-svn: 286325	2016-11-09 00:58:19 +00:00
Peter Collingbourne	58f7f0759f	Bitcode: Change the BitcodeReader to use llvm::Error internally. Differential Revision: https://reviews.llvm.org/D26430 llvm-svn: 286323	2016-11-09 00:51:04 +00:00
Sanjay Patel	e104554412	[ValueTracking] recognize obfuscated variants of umin/umax The smallest tests that expose this are codegen tests (because SelectionDAGBuilder::visitSelect() uses matchSelectPattern to create UMAX/UMIN nodes), but it's also possible to see the effects in IR alone with folds of min/max pairs. If these were written as unsigned compares in IR, InstCombine canonicalizes the unsigned compares to signed compares. Ie, running the optimizer pessimizes the codegen for this case without this patch: define <4 x i32> @umax_vec(<4 x i32> %x) { %cmp = icmp ugt <4 x i32> %x, <i32 2147483647, i32 2147483647, i32 2147483647, i32 2147483647> %sel = select <4 x i1> %cmp, <4 x i32> %x, <4 x i32> <i32 2147483647, i32 2147483647, i32 2147483647, i32 2147483647> ret <4 x i32> %sel } $ ./opt umax.ll -S | ./llc -o - -mattr=avx vpmaxud LCPI0_0(%rip), %xmm0, %xmm0 $ ./opt -instcombine umax.ll -S | ./llc -o - -mattr=avx vpxor %xmm1, %xmm1, %xmm1 vpcmpgtd %xmm0, %xmm1, %xmm1 vmovaps LCPI0_0(%rip), %xmm2 ## xmm2 = [2147483647,2147483647,2147483647,2147483647] vblendvps %xmm1, %xmm0, %xmm2, %xmm0 Differential Revision: https://reviews.llvm.org/D26096 llvm-svn: 286318	2016-11-09 00:24:44 +00:00
Greg Clayton	bde0a1632b	Added the ability to dump hex bytes easily into a raw_ostream. Unit tests were added to verify this functionality keeps working correctly. Example output for raw hex bytes: llvm::ArrayRef<uint8_t> Bytes = ...; llvm::outs() << format_hex_bytes(Bytes); 554889e5 4881ec70 04000048 8d051002 00004c8d 05fd0100 004c8b0d d0020000 Example output for raw hex bytes with offsets: llvm::outs() << format_hex_bytes(Bytes, 0x100000d10); 0x0000000100000d10: 554889e5 4881ec70 04000048 8d051002 0x0000000100000d20: 00004c8d 05fd0100 004c8b0d d0020000 Example output for raw hex bytes with ASCII with offsets: llvm::outs() << format_hex_bytes_with_ascii(Bytes, 0x100000d10); 0x0000000100000d10: 554889e5 4881ec70 04000048 8d051002 |UH.?H.?p...H....| 0x0000000100000d20: 00004c8d 05fd0100 004c8b0d d0020000 |..L..?...L..?...| The default groups bytes into 4 byte groups, but this can be changed to 1 byte: llvm::outs() << format_hex_bytes(Bytes, 0x100000d10, 16 /*NumPerLine*/, 1 /*ByteGroupSize*/); 0x0000000100000d10: 55 48 89 e5 48 81 ec 70 04 00 00 48 8d 05 10 02 0x0000000100000d20: 00 00 4c 8d 05 fd 01 00 00 4c 8b 0d d0 02 00 00 llvm::outs() << format_hex_bytes(Bytes, 0x100000d10, 16 /*NumPerLine*/, 2 /*ByteGroupSize*/); 0x0000000100000d10: 5548 89e5 4881 ec70 0400 0048 8d05 1002 0x0000000100000d20: 0000 4c8d 05fd 0100 004c 8b0d d002 0000 llvm::outs() << format_hex_bytes(Bytes, 0x100000d10, 8 /*NumPerLine*/, 1 /*ByteGroupSize*/); 0x0000000100000d10: 55 48 89 e5 48 81 ec 70 0x0000000100000d18: 04 00 00 48 8d 05 10 02 0x0000000100000d20: 00 00 4c 8d 05 fd 01 00 0x0000000100000d28: 00 4c 8b 0d d0 02 00 00 https://reviews.llvm.org/D26405 llvm-svn: 286316	2016-11-09 00:15:54 +00:00
Sanjay Patel	4e9d6cd354	[InstCombine] fix profitability equation for max-of-nots transform As the test change shows, we can increase the critical path by adding a 'not' instruction, so make sure that we're actually removing an instruction if we do this transform. This transform could also cause us to miss folds of min/max pairs. llvm-svn: 286315	2016-11-09 00:13:11 +00:00
Sanjay Patel	99dc5feff1	[InstCombine] reduce indentation; NFC llvm-svn: 286314	2016-11-08 23:49:15 +00:00
Zachary Turner	44728f4014	Fix some size_t / uint32_t ambiguity errors. llvm-svn: 286305	2016-11-08 22:30:11 +00:00
Zachary Turner	4efa0a4201	[CodeView] Hook up CodeViewRecordIO to type serialization path. Previously support had been added for using CodeViewRecordIO to read (deserialize) CodeView type records. This patch adds support for writing those same records. With this patch, reading and writing of CodeView type records finally uses a single codepath. Differential Revision: https://reviews.llvm.org/D26253 llvm-svn: 286304	2016-11-08 22:24:53 +00:00
Adrian Prantl	3502f2089c	Emit the DW_AT_type for a C++ static member definition if it is more specific than the one in its DW_AT_specification. If a static member is an array, the translation unit containing the member definition may have a more specific type (including its length) than TUs only seeing the class declaration. This patch adds a DW_AT_type to the member's DW_TAG_variable in addition to the DW_AT_specification in these cases. The member type in the DW_AT_specification still shows the more generic type (without the length) to avoid defeating type uniquing. The DWARF standard discourages “duplicating” a DW_AT_type in a member variable definition but doesn’t explicitly forbid it. Having the more specific type (with the array length) available is what allows the debugger to print the contents of a static array member variable. https://reviews.llvm.org/D26368 rdar://problem/28706946 llvm-svn: 286302	2016-11-08 22:11:38 +00:00
David L. Jones	e09ae201f2	GlobalISel: make sure debugging variables are appropriately elided in release builds. Summary: There are two variables here that break. This change constrains both of them to debug builds (via DEBUG() or #ifndef NDEBUG). Reviewers: bkramer, t.p.northover Subscribers: mehdi_amini, vkalintiris Differential Revision: https://reviews.llvm.org/D26421 llvm-svn: 286300	2016-11-08 22:03:23 +00:00
Teresa Johnson	6955feebf3	[ThinLTO] Prevent exporting of locals used/defined in module level asm Summary: This patch uses the same approach added for inline asm in r285513 to similarly prevent promotion/renaming of locals used or defined in module level asm. All static global values defined in normal IR and used in module level asm should be included on either the llvm.used or llvm.compiler.used global. The former were already being flagged as NoRename in the summary, and I've simply added llvm.compiler.used values to this handling. Module level asm may also contain defs of values. We need to prevent export of any refs to local values defined in module level asm (e.g. a ref in normal IR), since that also requires renaming/promotion of the local. To do that, the summary index builder looks at all values in the module level asm string that are not marked Weak or Global, which is exactly the set of locals that are defined. A summary is created for each of these local defs and flagged as NoRename. This required adding handling to the BitcodeWriter to look at GV declarations to see if they have a summary (rather than skipping them all). Finally, added an assert to IRObjectFile::CollectAsmUndefinedRefs to ensure that an MCAsmParser is available, otherwise the module asm parse would silently fail. Initialized the asm parser in the opt tool for use in testing this fix. Fixes PR30610. Reviewers: mehdi_amini Subscribers: johanengelen, krasin, llvm-commits Differential Revision: https://reviews.llvm.org/D26146 llvm-svn: 286297	2016-11-08 21:53:35 +00:00

96564 Commits