llvm-project

Commit Graph

Author	SHA1	Message	Date
Zvi Rackover	7569436f81	[DAGCombine] A shuffle of a splat is always the splat itself Summary: Add a simplification: shuffle (splat-shuffle), undef, M --> splat-shuffle Fixes pr32449 Patch by Sanjay Patel Reviewers: eli.friedman, RKSimon, spatel Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31426 llvm-svn: 299047	2017-03-30 01:42:57 +00:00
Eric Christopher	9ea300f08d	If the DIUnit has flags passed on it then have DW_AT_producer be a combination of DICompileUnit::Producer and Flags. The darwin behavior is unchanged and will continue to use DW_AT_APPLE_flags. Patch by Zhizhou Yang llvm-svn: 299038	2017-03-29 23:34:27 +00:00
Reid Kleckner	acd9a6f09d	[codeview] Fix buggy BeginIndexMapSize assertion This assert is just trying to test that processing each record adds exactly one entry to the index map. The assert logic was wrong when the first record in the type stream was a field list. I've simplified the code by moving the LF_FIELDLIST-specific logic into the callback for that record type. llvm-svn: 299035	2017-03-29 22:51:22 +00:00
Sanjay Patel	b8a728f993	[CodeGen] clean up and add tests for scalar and-of-setcc; NFC https://bugs.llvm.org/show_bug.cgi?id=32401 llvm-svn: 299034	2017-03-29 21:58:52 +00:00
Adrian McCarthy	4d93d66ddd	Re-land: "Make NativeExeSymbol a concrete subclass of NativeRawSymbol [PDB]" This should work on all platforms now that r299006 has landed. Tested locally on Windows and Linux. This moves exe symbol-specific method implementations out of NativeRawSymbol into a concrete subclass. Also adds implementations for hasCTypes and hasPrivateSymbols and a simple test to ensure the native reader can access the summary information for the executable from the PDB. Original Differential Revision: https://reviews.llvm.org/D31059 llvm-svn: 299019	2017-03-29 19:27:08 +00:00
Rafael Espindola	b26bc7fddc	Add ifunc support to ModuleSymbolTable. Do that by creating a global_values, which is similar to global_objects, but also iterates over aliases and ifuncs. llvm-svn: 299018	2017-03-29 19:26:26 +00:00
Matthew Simpson	c8f0aeccda	[InstCombine] Correct the check for vector GEPs Some of the GEP combines (e.g., descaling) can't handle vector GEPs. We have an existing check that attempts to bail out if given a vector GEP. However, the check only tests the GEP's pointer operand. A GEP results in a vector of pointers if at least one of its operands is vector-typed (e.g., its pointer operand could be a scalar, but its index could be a vector). We should just check the type of the GEP itself. This should fix PR32414. Reference: https://bugs.llvm.org/show_bug.cgi?id=32414 Differential Revision: https://reviews.llvm.org/D31470 llvm-svn: 299017	2017-03-29 18:23:08 +00:00
Simon Pilgrim	2845189bd1	[X86][AVX2] Prevent unary interleaving patterns from calling lowerVectorShuffleAsSplitOrBlend (PR32453) llvm-svn: 298993	2017-03-29 13:00:00 +00:00
Simon Pilgrim	be22cff6fd	[X86][MMX] Added generic sitofp test to compare against existing cvtdq2ps test. llvm-svn: 298989	2017-03-29 10:47:18 +00:00
Craig Topper	d9f51350b8	[AVX-512] Remove explicit KMOVWrk from isel patterns. COPY_TO_REGCLASS to GR32 is enough. llvm-svn: 298985	2017-03-29 07:31:56 +00:00
Craig Topper	d284606327	[AVX-512] Remove explicit KMOVWrk/KMOVWKr instructions from patterns where we can just use COPY_TO_REGCLASS instead. This will result in a KMOVW or KMOVD being emitted during register allocation. And in at least some cases this might allow the register coalescer to remove the copy all together. llvm-svn: 298984	2017-03-29 06:55:28 +00:00
Dean Michael Berris	60c2487874	[XRay] Update FDR log reader to be aware of buffer sizes per thread. Summary: It is problematic for this reader that it expects to read data from several threads, but the header or message format does not define framing. Since the buffers are reused, we can't rely on skipping zeroed out data as a synchronization method either. There is an argument that this is not version compatible with the format the reader expected previously. I argue that since the writer wrote garbage past the end of buffer record, there is no currently working reader to compromise. The corresponding writer change is posted to D31384. Reviewers: dberris, pelikan Reviewed By: dberris Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31385 llvm-svn: 298983	2017-03-29 06:10:12 +00:00
Dean Michael Berris	f454301b56	[XRay][tools] Handle "no subcommand" case for llvm-xray Summary: Currently the llvm-xray commandline tool fails to handle the case for when no subcommand is provided in a graceful manner. This fixes that to print the help message explaining the subcommands and the available options. Reviewers: pcc, pelikan Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31409 llvm-svn: 298975	2017-03-29 04:55:45 +00:00
Adam Nemet	92a5cf4366	[SDAG] Remove -enable-fmf-dag This is no longer needed as spotted by Sanjay in https://reviews.llvm.org/D31165. llvm-svn: 298963	2017-03-28 23:46:14 +00:00
Craig Topper	a795be60c1	[AVX-512] Add test case that was supposed to go with r298957. llvm-svn: 298959	2017-03-28 23:29:35 +00:00
Guozhi Wei	f8d40181c9	[PPC] In PPCBoolRetToInt change the bool value to i64 if the target is ppc64 In PPCBoolRetToInt bool value is changed to i32 type. On ppc64 it may introduce an extra zero extension for the return value. This patch changes the integer type to i64 to avoid the zero extension on ppc64. This patch fixed PR32442. Differential Revision: https://reviews.llvm.org/D31407 llvm-svn: 298955	2017-03-28 22:55:01 +00:00
Eric Christopher	69b191c628	Add a similar test for tailcall optimization as in r270287 for aarch64. llvm-svn: 298952	2017-03-28 22:37:43 +00:00
Stanislav Mekhanoshin	baf31ac7c8	[AMDGPU] Boost unroll threshold for loops reading local memory This is less important than increase threshold for private memory, but still brings performance improvements in a wide range of tests. Unrolling more for local memory serves three purposes: it allows to combine ds operations if offset becomes static, saves registers used for offsets in case of static offsets, and allows better lds latency hiding. Differential Revision: https://reviews.llvm.org/D31412 llvm-svn: 298948	2017-03-28 22:13:51 +00:00
Simon Pilgrim	c7c5aa47cf	[X86][MMX] Match MMX fp_to_sint conversions from XMM registers We currently perform the various fp_to_sint XMM conversion and then transfer to the MMX register (on 32-bit via the stack). This patch improves support for MOVDQ2Q XMM to MMX transfers and adds the XMM->MMX fp_to_sint direct conversion patterns. The SSE2 specifications are the same as for XMM->XMM and XMM->MMX rounding/exceptions/etc. Differential Revision: https://reviews.llvm.org/D30868 llvm-svn: 298943	2017-03-28 21:32:11 +00:00
Adam Nemet	cd847a8f30	[IR] Add AllowContract to FastMathFlags -ffp-contract=fast does not currently work with LTO because it's passed as a TargetOption to the backend rather than in the IR. This adds it to FastMathFlags. This is toward fixing PR25721 Differential Revision: https://reviews.llvm.org/D31164 llvm-svn: 298939	2017-03-28 20:11:52 +00:00
Mehdi Amini	b5a46c1f45	Add support for -fno-builtin to LTO and ThinLTO to libLTO Reviewers: tejohnson, pcc Subscribers: Prazek, dexonsmith, llvm-commits Differential Revision: https://reviews.llvm.org/D30791 llvm-svn: 298936	2017-03-28 18:55:44 +00:00
Stanislav Mekhanoshin	9053f22eeb	[AMDGPU] Split -amdgpu-early-inline-all option Previously it was covered by the internalization. It turns out we cannot run internalizer in FE, it break separate compilation tests. Thus early inliner gets its own option. Differential Revision: https://reviews.llvm.org/D31429 llvm-svn: 298935	2017-03-28 18:23:24 +00:00
Sanjay Patel	f01a1dad7f	[x86] use VPMOVMSK to replace memcmp libcalls for 32-byte equality Follow-up to: https://reviews.llvm.org/rL298775 llvm-svn: 298933	2017-03-28 17:23:49 +00:00
Weiming Zhao	da4d12a8e5	Revert "Dont emit Mapping symbols for sections that contain only data." It breaks some lld tests. This reverts commit 3a50eea6d9732ab40e9a7aebe6be777b53a8b35c. llvm-svn: 298932	2017-03-28 17:15:11 +00:00
Nirav Dave	472b5efc8b	[SDAG] Deal with deleted node in PromoteIntShiftOp Deal with case that initial node is deleted during dag-combine leading to an assertional failure in promoteIntShiftOp. Fixes PR32420. Reviewers: spatel, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31403 llvm-svn: 298931	2017-03-28 17:09:49 +00:00
Zvi Rackover	a4c354951b	Add reproducer test for pr32449. NFC. llvm-svn: 298930	2017-03-28 16:45:23 +00:00
Simon Pilgrim	3e2aa7f40e	[X86][AVX2] Add support for combining v16i16 shuffles to VPBLENDW llvm-svn: 298929	2017-03-28 16:40:38 +00:00
Craig Topper	058f2f6d72	[AVX-512] Fix accidental uses of AH/BH/CH/DH after copies to/from mask registers We've had several bugs(PR32256, PR32241) recently that resulted from usages of AH/BH/CH/DH either before or after a copy to/from a mask register. This ultimately occurs because we create COPY_TO_REGCLASS with VK1 and GR8. Then in CopyToFromAsymmetricReg in X86InstrInfo we find a 32-bit super register for the GR8 to emit the KMOV with. But as these tests are demonstrating, its possible for the GR8 register to be a high register and we end up doing an accidental extra or insert from bits 15:8. I think the best way forward is to stop making copies directly between mask registers and GR8/GR16. Instead I think we should restrict to only copies between mask registers and GR32/GR64 and use EXTRACT_SUBREG/INSERT_SUBREG to handle the conversion from GR32 to GR16/8 or vice versa. Unfortunately, this complicates fastisel a bit more now to create the subreg extracts where we used to create GR8 copies. We can probably make a helper function to bring down the repitition. This does result in KMOVD being used for copies when BWI is available because we don't know the original mask register size. This caused a lot of deltas on tests because we have to split the checks for KMOVD vs KMOVW based on BWI. Differential Revision: https://reviews.llvm.org/D30968 llvm-svn: 298928	2017-03-28 16:35:29 +00:00
Sanjay Patel	5d39a98612	[x86] add separate check prefix for SSE; NFC We want to check each test on each target, so we need another prefix when SSE and AVX diverge (as they will if we handle 32-byte and higher). llvm-svn: 298926	2017-03-28 15:55:50 +00:00
Nirav Dave	5b414ebe63	[SDAG] Avoid deleted SDNodes PromoteIntBinOp Reorder work in PromoteIntBinOp to prevent stale (deleted) nodes from being used. Fixes PR32340 and PR32345. Reviewers: hfinkel, dbabokin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31148 llvm-svn: 298923	2017-03-28 15:41:12 +00:00
Nirav Dave	9b5563c52c	[SDAG] Fix Stale SDNode usage in visitAND Reorder CombineTo Calls to prevent potential use of deleted node. Fixes PR32372. Reviewers: jnspaulsson, RKSimon, uweigand, jonpa Reviewed By: jonpa Subscribers: jonpa, llvm-commits Differential Revision: https://reviews.llvm.org/D31346 llvm-svn: 298920	2017-03-28 14:11:20 +00:00
Sanjay Patel	e4f11334fa	[x86] add AVX2 run to show 256-bit opportunity; NFC llvm-svn: 298918	2017-03-28 13:46:50 +00:00
Sanne Wouda	d4658ee634	[AArch64] [Assembler] option to disable negative immediate conversions Summary: Similar to the ARM target in https://reviews.llvm.org/rL298380, this patch adds identical infrastructure for disabling negative immediate conversions, and converts the existing aliases to the new infrastucture. Reviewers: rengolin, javed.absar, olista01, SjoerdMeijer, samparker Reviewed By: samparker Subscribers: samparker, aemerson, llvm-commits Differential Revision: https://reviews.llvm.org/D31243 llvm-svn: 298908	2017-03-28 10:02:56 +00:00
Igor Breger	f580fce2c3	[GlobalISel][X86] support G_FRAME_INDEX instruction selection. Summary: G_LOAD/G_STORE, add alternative RegisterBank mapping. For G_LOAD, Fast and Greedy mode choose the same RegisterBank mapping (GprRegBank ) for the G_GLOAD + G_FADD , can't get rid of cross register bank copy GprRegBank->VecRegBank. Reviewers: zvi, rovka, qcolombet, ab Reviewed By: zvi Subscribers: llvm-commits, dberris, kristof.beyls, eladcohen, guyblank Differential Revision: https://reviews.llvm.org/D30979 llvm-svn: 298907	2017-03-28 09:35:06 +00:00
Anna Thomas	ba04f4e925	rename instcombine test file. NFC llvm-svn: 298904	2017-03-28 08:34:07 +00:00
Weiming Zhao	320848458b	Dont emit Mapping symbols for sections that contain only data. Summary: Dont emit mapping symbols for sections that contain only data. Patched by Shankar Easwaran <shankare@codeaurora.org> Reviewers: rengolin, peter.smith, weimingz, kparzysz, t.p.northover Reviewed By: t.p.northover Subscribers: t.p.northover, llvm-commits Differential Revision: https://reviews.llvm.org/D30724 llvm-svn: 298901	2017-03-28 05:40:36 +00:00
Alex Shlyapnikov	bbd5cc63d7	Revert "[asan] Delay creation of asan ctor." Speculative revert. Some libfuzzer tests are affected. This reverts commit r298731. llvm-svn: 298890	2017-03-27 23:11:50 +00:00
Alex Shlyapnikov	09171aa31f	Revert "[asan] Put ctor/dtor in comdat." Speculative revert, some libfuzzer tests are affected. This reverts commit r298756. llvm-svn: 298889	2017-03-27 23:11:47 +00:00
Renato Golin	be2f7d9d61	[ARM] Mark falky test unsupported until we find the cause llvm-svn: 298887	2017-03-27 22:38:43 +00:00
Javed Absar	3d59437093	Improve machine schedulers for in-order processors This patch enables schedulers to specify instructions that cannot be issued with any other instructions. It also fixes BeginGroup/EndGroup. Reviewed by: Andrew Trick Differential Revision: https://reviews.llvm.org/D30744 llvm-svn: 298885	2017-03-27 20:46:37 +00:00
Kevin Enderby	6c1d2b4cb2	Add the error handling for Mach-O dyld compact lazy bind, weak bind and rebase entry errors and test cases for each of the error checks. Also verified with Nick Kledzik that a BIND_OPCODE_SET_ADDEND_SLEB opcode is legal in a lazy bind table, so code that had that as an error check was removed. With MachORebaseEntry and MachOBindEntry classes now returning an llvm::Error in all cases for malformed input the variables Malformed and logic to set use them is no longer needed and has been removed from those classes. Also in a few places, removed the redundant Done assignment to true when also calling moveToEnd() as it does that assignment. This only leaves the dyld compact export entries left to have error handling yet to be added for the dyld compact info. llvm-svn: 298883	2017-03-27 20:09:23 +00:00
Matthew Simpson	b8ff4a4a70	[LV] Transform truncations of non-primary induction variables The vectorizer tries to replace truncations of induction variables with new induction variables having the smaller type. After r295063, this optimization was applied to all integer induction variables, including non-primary ones. When optimizing the truncation of a non-primary induction variable, we still need to transform the new induction so that it has the correct start value. This should fix PR32419. Reference: https://bugs.llvm.org/show_bug.cgi?id=32419 llvm-svn: 298882	2017-03-27 20:07:38 +00:00
Ahmed Bougacha	f75782f9dc	[GlobalISel][AArch64] Fold FI into LDR/STR ui addressing mode. A majority of loads and stores at O0 access an alloca. It's trivial to fold the G_FRAME_INDEX into the instruction; do it. llvm-svn: 298864	2017-03-27 17:31:56 +00:00
Ahmed Bougacha	8a654085d0	[GlobalISel][AArch64] Fold G_GEP into LDR/STR ui addressing mode. We're not to the point of supporting the load/store patterns yet (because they extensively use PatFrags). But in the meantime, we can implement some of the simplest addressing modes. llvm-svn: 298863	2017-03-27 17:31:52 +00:00
Ahmed Bougacha	85a66a6d9f	[GlobalISel][AArch64] Select store of zero to WZR/XZR. These occur very frequently, and are quite trivial to catch. llvm-svn: 298862	2017-03-27 17:31:48 +00:00
Ahmed Bougacha	641cb203b6	[GlobalISel][AArch64] Select CBZ. CBZ/CBNZ represent a substantial portion of all conditional branches. Look through G_ICMP to select them. We can't use tablegen yet because the existing patterns match an AArch64ISD node. llvm-svn: 298856	2017-03-27 16:35:31 +00:00
Ahmed Bougacha	c1cbcee170	[GlobalISel][AArch64] Use proper constant types in test. NFC. llvm-svn: 298854	2017-03-27 16:35:23 +00:00
Dmitry Preobrazhensky	c512d44845	[AMDGPU][MC] Fix for Bug 28207 + LIT tests Enabled clamp and omod for v_cvt_* opcodes which have src0 of an integer type Reviewers: vpykhtin, arsenm Differential Revision: https://reviews.llvm.org/D31327 llvm-svn: 298852	2017-03-27 15:57:17 +00:00
Chad Rosier	862a41270f	[AArch64] Mark mrs of TPIDR_EL0 (thread pointer) as not having side effects. Among other things, this allows Machine LICM to hoist a costly 'mrs' instruction from within a loop. Differential Revision: http://reviews.llvm.org/D31151 llvm-svn: 298851	2017-03-27 15:52:38 +00:00
Anna Thomas	f57ae33381	[InstCombine] Avoid incorrect folding of select into phi nodes when incoming element is a vector type Summary: We are incorrectly folding selects into phi nodes when the incoming value of a phi node is a constant vector. This optimization is done in `FoldOpIntoPhi` when the select condition is a phi node with constant incoming values. Without the fix, we are miscompiling (i.e. incorrectly folding the select into the phi node) when the vector contains non-zero elements. This patch fixes the miscompile and we will correctly fold based on the select vector operand (see added test cases). Reviewers: majnemer, sanjoy, spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31189 llvm-svn: 298845	2017-03-27 13:52:51 +00:00

1 2 3 4 5 ...

43857 Commits