llvm-project

Commit Graph

Author	SHA1	Message	Date
Bjorn Pettersson	839775a277	[Debug info] Handle endianness when moving debug info for split integer values Summary: Take the target's endianness into account when splitting the debug information in DAGTypeLegalizer::SetExpandedInteger. This patch fixes so that, for big-endian targets, the fragment expression corresponding to the high part of a split integer value is placed at offset 0, in order to correctly represent the memory address order. I have attached a PPC32 reproducer where the resulting DWARF pieces for a 64-bit integer were incorrectly reversed. Patch by: dstenb Reviewers: JDevlieghere, aprantl, dblaikie Reviewed By: JDevlieghere, aprantl, dblaikie Subscribers: nemanjai Differential Revision: https://reviews.llvm.org/D38172 llvm-svn: 314666	2017-10-02 12:46:32 +00:00
Simon Pilgrim	e2e27aff9b	[X86][SSE] Add createPackShuffleMask helper function. NFCI. llvm-svn: 314658	2017-10-02 10:12:51 +00:00
Simon Pilgrim	c04c7443ea	[X86][SSE] matchBinaryVectorShuffle - add support for different src/dst value shuffle types Preparation for support for combining to PACKSS/PACKUS llvm-svn: 314656	2017-10-02 09:45:08 +00:00
Hiroshi Inoue	dcedd66b00	[PowerPC] support ZERO_EXTEND in tryBitPermutation This patch add a support of ISD::ZERO_EXTEND in PPCDAGToDAGISel::tryBitPermutation to increase the opportunity to use rotate-and-mask by reordering ZEXT and ANDI. Since tryBitPermutation stops analyzing nodes if it hits a ZEXT node while traversing SDNodes, we want to avoid ZEXT between two nodes that can be folded into a rotate-and-mask instruction. For example, we allow these nodes t9: i32 = add t7, Constant:i32<1> t11: i32 = and t9, Constant:i32<255> t12: i64 = zero_extend t11 t14: i64 = shl t12, Constant:i64<2> to be folded into a rotate-and-mask instruction. Such case often happens in array accesses with logical AND operation in the index, e.g. array[i & 0xFF]; Differential Revision: https://reviews.llvm.org/D37514 llvm-svn: 314655	2017-10-02 09:24:00 +00:00
Simon Pilgrim	3bbbf31590	Fix typo in comment. NFCI. llvm-svn: 314653	2017-10-02 09:10:50 +00:00
Simon Pilgrim	e575651370	[X86] Cleanup uses of computeKnownBits by using MaskedValueIsZero helper instead. NFCI. llvm-svn: 314652	2017-10-02 09:08:45 +00:00
Michael Zuckerman	e4084f6bdb	[X86][LLVM]Expanding Supports lowerInterleaved{store\|load}() in X86InterleavedAccess (VF64 stride 3-4) I continue to support different VF interleaved and in this pass for this patch, I added the vf64 stride3 support for both load and store. I also added support fot the stride4 store. Reviewers: 1. zvi 2. dorit 3. igorb 4. guyblank Differential Revision: https://reviews.llvm.org/D37687 Change-Id: I3d238efedf217d1768b348d710de1efa2f19d27b llvm-svn: 314651	2017-10-02 07:35:25 +00:00
Craig Topper	d37625859a	[X86] Fix copy pasto in X86FastISel::fastEmitInst_rrrr. The 4th operand was not being constrained and the third operand was being constrained twice. llvm-svn: 314648	2017-10-02 05:46:53 +00:00
Craig Topper	bb7866162c	[X86] Use a bool flag instead of assigning an unsigned to two different values that we only use in an equality comparison. llvm-svn: 314647	2017-10-02 05:46:52 +00:00
Craig Topper	c05c390a7c	[X86] Use _NOREX MOVZX instructions for some patterns even in 32-bit mode. This unifies the patterns between both modes. This should be effectively NFC since all the available registers in 32-bit mode statisfy this constraint. llvm-svn: 314643	2017-10-02 00:44:50 +00:00
Ron Lieberman	9bcdd80b66	[Hexagon] Check vector elements for equivalence in the HexagonVectorLoopCarriedReuse pass If the two instructions being compared for equivalence have corresponding operands that are integer constants, then check their values to determine equivalence. Patch by Suyog Sarda! llvm-svn: 314642	2017-10-02 00:34:07 +00:00
Ron Lieberman	f90493d220	[Hexagon] Patch to Extract i1 element from vector of i1 This patch extracts 1 element from vector consisting of elements of size 1 bit at given index. llvm-svn: 314641	2017-10-02 00:16:15 +00:00
Craig Topper	6e025a3ecc	[InstCombine] Use APInt for all the math in foldICmpDivConstant Summary: This currently uses ConstantExpr to do its math, but as noted in a TODO it can all be done directly on APInt. Reviewers: spatel, majnemer Reviewed By: majnemer Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38440 llvm-svn: 314640	2017-10-01 23:53:54 +00:00
Craig Topper	c20b46da2f	[X86] Change register&memory TEST instructions from MRMSrcMem to MRMDstMem Summary: Intel documentation shows the memory operand as the first operand. But we currently treat it as the second operand. Conceptually the order doesn't matter since it doesn't write memory. We have aliases to parse with the operands in either order and the isel matching is commutable. For the register&register form order does matter for the assembly parser. PR22995 was previously filed and fixed by changing the register&register form from MRMSrcReg to MRMDestReg to match gas. Ideally the memory form should match by using MRMDestMem. I believe this supercedes D38025 which was trying to switch the register&register form back to pre-PR22995. Reviewers: aymanmus, RKSimon, zvi Reviewed By: aymanmus Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38120 llvm-svn: 314639	2017-10-01 23:53:53 +00:00
Craig Topper	00230604d3	[X86] Remove a couple unnecessary COPY_TO_REGCLASS from some output patterns where the instruction already produces the correct register class. llvm-svn: 314638	2017-10-01 23:53:50 +00:00
Simon Pilgrim	df23a2700d	[X86][SSE] Add faux shuffle combining support for PACKUS llvm-svn: 314631	2017-10-01 18:43:48 +00:00
Simon Pilgrim	4f255ad6a0	[X86][AVX2] Simplify PACKUS combine test Trying to use a AND mask is tricky as after legalization its nigh impossible for computeKnownBits to do anything with it llvm-svn: 314630	2017-10-01 18:17:39 +00:00
Simon Pilgrim	836fa6dcfd	[X86][SSE] Improve shuffle combining of PACKSS instructions. Support unary packing and fix the faux shuffle mask for vectors larger than 128 bits. llvm-svn: 314629	2017-10-01 17:54:55 +00:00
Simon Pilgrim	d25c200cd6	[X86][SSE] Add shuffle combining tests with PACKSS/PACKUS llvm-svn: 314628	2017-10-01 17:30:44 +00:00
Sanjay Patel	c7076a3ba9	[x86] formatting; NFC llvm-svn: 314627	2017-10-01 14:39:10 +00:00
Jina Nahias	98c7f91e54	pre-commit adding test for broadcastm pattern Differential Revision: https://reviews.llvm.org/D38312 Change-Id: Ifbc4189549f2f59995019a86f85f989c04e4d37d llvm-svn: 314626	2017-10-01 14:25:21 +00:00
Daniel Jasper	3c9c60c727	Revert r314579: "Recommi r314561 after fixing over-debug assertion". And follow-up r314585. Leads to segfaults. I'll forward reproduction instructions to the patch author. Also, for a recommit, still add the original patch description. Otherwise, it becomes really tedious to find out what a patch actually does. The fact that it is a recommit with a fix is somewhat secondary. llvm-svn: 314622	2017-10-01 09:53:53 +00:00
Michael Zuckerman	1746895490	Adding test for interleved, case stride 4 vf64 store<NFC>. Change-Id: I9ea62aac81b763c83d26613dca6fcd846997a017 llvm-svn: 314621	2017-10-01 09:37:38 +00:00
Michal Gorny	d6a4c79b14	[lit] Fix running lit tests in unconfigured source dir Fix llvm_tools_dir attribute access not to fail when the variable is not present. This directory is not really necessary to run lit tests, and the code already accounts for it being None. The reference was added in r313407, and it breaks the stand-alone lit package in Gentoo. Differential Revision: https://reviews.llvm.org/D38442 llvm-svn: 314620	2017-10-01 07:13:25 +00:00
Dehao Chen	d26dae0d34	Separate the logic when handling indirect calls in SamplePGO ThinLTO compile phase and other phases. Summary: In SamplePGO ThinLTO compile phase, we will not invoke ICP as it may introduce confusion to the 2nd annotation. This patch extracted that logic and makes it clearer before profile annotation. In the mean time, we need to make function importing process both inlined callsites as well as not promoted indirect callsites. Reviewers: tejohnson Reviewed By: tejohnson Subscribers: sanjoy, mehdi_amini, llvm-commits, inglorion Differential Revision: https://reviews.llvm.org/D38094 llvm-svn: 314619	2017-10-01 05:24:51 +00:00
Xin Tong	bffac0eb81	Fix typo. NFC llvm-svn: 314615	2017-10-01 00:10:52 +00:00
Xin Tong	c063c3f09d	Revert "Fix typo [NFC]" This reverts commit e60b5028619be1c81bd039d63a0627dac32d38f9. Incorrectly include changes that are not typo fix. llvm-svn: 314614	2017-10-01 00:09:53 +00:00
Xin Tong	efec219e1b	Fix typo [NFC] llvm-svn: 314613	2017-10-01 00:07:24 +00:00
Daniel Berlin	d36c27bedb	NewGVN: Fix PR 34473, by not using ExactlyEqualsExpression for finding phi of ops users. llvm-svn: 314612	2017-09-30 23:51:55 +00:00
Daniel Berlin	c1305af09b	NewGVN: Evaluate phi of ops expressions before creating phi node llvm-svn: 314611	2017-09-30 23:51:54 +00:00
Daniel Berlin	9b926e90d3	NewGVN: Allow dependent PHI of ops llvm-svn: 314610	2017-09-30 23:51:53 +00:00
Daniel Berlin	de6958ee85	NewGVN: Make OpIsSafeForPhiOfOps non-recursive llvm-svn: 314609	2017-09-30 23:51:04 +00:00
Simon Pilgrim	1fffcc4580	Regenerate mul combine tests to update broadcast comment. llvm-svn: 314607	2017-09-30 22:27:46 +00:00
Dehao Chen	4f5d830343	Refactor the SamplePGO profile annotation logic to extract inlineCallInstruction. (NFC) llvm-svn: 314601	2017-09-30 20:46:15 +00:00
Simon Pilgrim	a8dd6f4f30	[X86][SSE] Fold (VSRAI (VSHLI X, C1), C1) --> X iff NumSignBits(X) > C1 Remove sign extend in register style pattern if the sign is already extended enough llvm-svn: 314599	2017-09-30 17:57:34 +00:00
Craig Topper	619569841a	[AVX-512] Add patterns to make fp compare instructions commutable during isel. llvm-svn: 314598	2017-09-30 17:02:39 +00:00
Simon Pilgrim	5bd43bce07	[X86][SSE] Add vector truncation cases inspired by PR34773 We should be using PACKSS/PACKUS more aggressively when we know the state of the upper bits llvm-svn: 314597	2017-09-30 16:14:59 +00:00
Michael Zuckerman	b92b6d424f	Code refactoring for the interleaved code <NFC> Change-Id: I7831c9febad8e14278a5bc87584a0053dc837be1 llvm-svn: 314596	2017-09-30 14:55:03 +00:00
Gadi Haber	c3b33f0f0d	[X86][SKX] Added codegen regression test for avx512 instructions scheduling.NFC. NFC. Added code gen regression tests for avx512 instructions scheduling called avx512-schedule.ll and avx512-shuffle-schedule.ll. This patch is in preparation of a larger patch of adding all SKX instruction scheduling and therefore the scheduling for the avx512 instructions are still missing. Reviewers: zvi, delena, RKSimon, igorb Differential Revision: https://reviews.llvm.org/D38035 Change-Id: I792762763127a921b9e13684b58af03646536533 llvm-svn: 314594	2017-09-30 14:30:23 +00:00
Daniel Jasper	0a51ec29c9	Revert r314435: "[JumpThreading] Preserve DT and LVI across the pass" Causes a segfault on a builtbot (and in our internal bootstrapping of Clang). See Eli's response on the commit thread. llvm-svn: 314589	2017-09-30 11:57:19 +00:00
Xinliang David Li	b8aac3ac19	Fix buildbot failure -- tighten type check for matching phi llvm-svn: 314585	2017-09-30 05:27:46 +00:00
Craig Topper	d92ade96f4	[X86] Support v64i8 mulhu/mulhs Implemented by splitting into two v32i8 mulhu/mulhs and concatenating the results. Differential Revision: https://reviews.llvm.org/D38307 llvm-svn: 314584	2017-09-30 04:21:46 +00:00
Xinliang David Li	3409d9c07f	Recommi r314561 after fixing over-debug assertion llvm-svn: 314579	2017-09-30 00:46:32 +00:00
Marek Sokolowski	7f7745c038	[llvm-rc] Serialize DIALOG(EX) to .res files (serialization, pt 4). This is now able to serialize DIALOG and DIALOGEX resources to .res files. It still can't parse dialog-specific CAPTION, FONT, and STYLE optional statement - these will be added in the following patch. A limited set of controls is included. However, more can be easily added by extending SupportedCtls map defined in ResourceScriptStmt.cpp. Differential Revision: https://reviews.llvm.org/D37862 llvm-svn: 314578	2017-09-30 00:38:52 +00:00
Adrian Prantl	17d0bb9611	typos llvm-svn: 314577	2017-09-30 00:31:15 +00:00
Adrian Prantl	61913a1ffa	llvm-dwarfdump: implement the --name lookup option. llvm-svn: 314576	2017-09-30 00:22:25 +00:00
Adrian Prantl	a01c38b7a3	Fix 80 column violations llvm-svn: 314575	2017-09-30 00:22:24 +00:00
Adrian Prantl	fa1636137b	Add comments llvm-svn: 314574	2017-09-30 00:22:21 +00:00
Stanislav Mekhanoshin	1d8cf2be89	[AMDGPU] Set fast-math flags on functions given the options We have a single library build without relaxation options. When inlined library functions remove fast math attributes from the functions they are integrated into. This patch sets relaxation attributes on the functions after linking provided corresponding relaxation options are given. Math instructions inside the inlined functions remain to have no fast flags, but inlining does not prevent fast math transformations of a surrounding caller code anymore. Differential Revision: https://reviews.llvm.org/D38325 llvm-svn: 314568	2017-09-29 23:40:19 +00:00
Yaxun Liu	b33607e5a1	CodeGen: Fix pointer info in expandUnalignedLoad/Store Currently expandUnalignedLoad/Store uses place holder pointer info for temporary memory operand in stack, which does not have correct address space. This causes unaligned private double16 load/store to be lowered to flat_load instead of buffer_load for amdgcn target. This fixes failures of OpenCL conformance test basic/vload_private/vstore_private on target amdgcn---amdgizcl. Differential Revision: https://reviews.llvm.org/D35361 llvm-svn: 314566	2017-09-29 23:31:14 +00:00
Adrian Prantl	71128ee717	fix 80 column violation. llvm-svn: 314564	2017-09-29 22:46:22 +00:00
Xinliang David Li	455dec098b	Revert 314561 due to debug build assertion failure llvm-svn: 314563	2017-09-29 22:30:34 +00:00
Marek Sokolowski	42f494d6a6	[llvm-rc] Serialize MENU resources to .res files (serialization, pt 3). This allows MENU resources to be serialized. MENU resource statement doc: msdn.microsoft.com/en-us/library/windows/desktop/aa381025.aspx POPUP sub-statement doc: msdn.microsoft.com/en-us/library/windows/desktop/aa381030.aspx MENUITEM sub-statement doc: msdn.microsoft.com/en-us/library/windows/desktop/aa381024.aspx MENUHEADER structure: msdn.microsoft.com/en-us/library/windows/desktop/ms648018.aspx (and NORMALMENUITEM, POPUPMENUITEM structs). Thanks for Nico Weber for his original work in this area. Differential Revision: https://reviews.llvm.org/D37828 llvm-svn: 314562	2017-09-29 22:25:05 +00:00
Xinliang David Li	5b9d96825b	Eliminate PHI (int typed) which has only one use by intptr This patch will eliminate redundant intptr/ptrtoint that pessimizes analyses such as SCEV, AA and will make optimization passes such as auto-vectorization more powerful. Differential revision: http://reviews.llvm.org/D37832 llvm-svn: 314561	2017-09-29 22:10:15 +00:00
Alex Shlyapnikov	e76aa3b0b2	Revert "Use the basic cost if a GEP is not used as addressing mode" This reverts commit r314517. This commit crashes sanitizer bots, for example: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux/builds/4167 Stack snippet: ... /mnt/b/sanitizer-buildbot1/sanitizer-x86_64-linux/build/llvm/include/llvm/Support/Casting.h:255:0 llvm::TargetTransformInfoImplCRTPBase<llvm::X86TTIImpl>::getGEPCost(llvm::GEPOperator const, llvm::ArrayRef<llvm::Value const>) /mnt/b/sanitizer-buildbot1/sanitizer-x86_64-linux/build/llvm/include/llvm/Analysis/TargetTransformInfoImpl.h:742:0 llvm::TargetTransformInfoImplCRTPBase<llvm::X86TTIImpl>::getUserCost(llvm::User const, llvm::ArrayRef<llvm::Value const>) /mnt/b/sanitizer-buildbot1/sanitizer-x86_64-linux/build/llvm/include/llvm/Analysis/TargetTransformInfoImpl.h:782:0 /mnt/b/sanitizer-buildbot1/sanitizer-x86_64-linux/build/llvm/lib/Analysis/TargetTransformInfo.cpp:116:0 /mnt/b/sanitizer-buildbot1/sanitizer-x86_64-linux/build/llvm/include/llvm/ADT/SmallVector.h:116:0 /mnt/b/sanitizer-buildbot1/sanitizer-x86_64-linux/build/llvm/include/llvm/ADT/SmallVector.h:343:0 /mnt/b/sanitizer-buildbot1/sanitizer-x86_64-linux/build/llvm/include/llvm/ADT/SmallVector.h:864:0 /mnt/b/sanitizer-buildbot1/sanitizer-x86_64-linux/build/llvm/include/llvm/Analysis/TargetTransformInfo.h:285:0 ... llvm-svn: 314560	2017-09-29 22:04:45 +00:00
Eugene Zelenko	4f81cdd818	[CodeGen] Fix some Clang-tidy modernize-use-using and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 314559	2017-09-29 21:55:49 +00:00
Brian Gesiak	615a3bbdad	Revert "[CMake] Remove `CMAKE_.*_OUTPUT_DIRECTORY` (NFCI)" Summary: It appears polly makes use of the `CMAKE_RUNTIME_OUTPUT_DIRECTORY` variable when configuring its lit test suite. Reverting this for now. llvm-svn: 314551	2017-09-29 19:50:41 +00:00
Brian Gesiak	cccbed8450	[CMake] Remove `CMAKE_._OUTPUT_DIRECTORY` (NFCI) Summary: Three `CMAKE_._OUTPUT_DIRECTORY` variables used to be set in CMake and referenced in various other parts of the project. However, in r198205 chapuni added a note to "don't set them anymore", and any remaining references to them were subsequently removed in r198316 and r199592. Now that the variables are no longer used anywhere, remove them, along with the comments advising against using them any longer. Test Plan: I ran `check-all` and confirmed the tests built and passed. Reviewers: beanz, chapuni Reviewed By: beanz Subscribers: mgorny Differential Revision: https://reviews.llvm.org/D38389 llvm-svn: 314550	2017-09-29 19:34:57 +00:00
Marek Sokolowski	22fccd6408	[llvm-rc] Serialize ACCELERATORS to .res files (serialization, pt 2). This allows llvm-rc to serialize ACCELERATORS resources. Additionally, as this is the first type of resource to support basic optional resource statements (LANGUAGE, CHARACTERISTICS, VERSION), ACCELERATORS statement documentation: msdn.microsoft.com/en-us/library/windows/desktop/aa380610.aspx Accelerator table structure documentation: msdn.microsoft.com/en-us/library/windows/desktop/ms648010.aspx Optional resource statement fields are described in: msdn.microsoft.com/en-us/library/windows/desktop/ms648027.aspx Thanks for Nico Weber for his original work in this area. Differential Revision: https://reviews.llvm.org/D37824 llvm-svn: 314549	2017-09-29 19:07:44 +00:00
Matthew Simpson	f4bb480b62	[LV] Use correct insertion point when type shrinking reductions When type shrinking reductions, we should insert the truncations and extends at the end of the loop latch block. Previously, these instructions were inserted at the end of the loop header block. The difference is only a problem for loops with predicated instructions (e.g., conditional stores and instructions that may divide by zero). For these instructions, we create new basic blocks inside the vectorized loop, which cause the loop header and latch to no longer be the same block. This should fix PR34687. Reference: https://bugs.llvm.org/show_bug.cgi?id=34687 llvm-svn: 314542	2017-09-29 18:07:39 +00:00
Marek Sokolowski	c75a087c7a	[llvm-rc] Refactoring needed for ACCELERATORS and MENU resources. This is a part of llvm-rc serialization patch set (serialization, pt 1.5). This: * Unifies the internal representation of flags in ACCELERATORS and MENU with the corresponding representation in .res files (noticed in https://reviews.llvm.org/D37828#inline-329828). * Creates an RCResource subclass, OptStatementsRCResource, describing resource statements that can declare resource-local optional statements (proposed in https://reviews.llvm.org/D37824#inline-329775). These modifications don't fit to any of the current patches, so I'm submitting them as a separate patch. Differential Revision: https://reviews.llvm.org/D37841 llvm-svn: 314541	2017-09-29 17:46:32 +00:00
Sanjoy Das	d06dd61292	Use LLVM_ENABLE_ABI_BREAKING_CHECKS correctly llvm-svn: 314539	2017-09-29 17:17:54 +00:00
Marek Sokolowski	8f19343a78	[llvm-rc] Serialize HTML resources to .res files (serialization, pt 1). This allows to process HTML resources defined in .rc scripts and output them to resulting .res files. Additionally, some infrastructure allowing to output these files is created. This is the first resource type we can operate on. Thanks to Nico Weber for his original work in this area. Differential Revision: reviews.llvm.org/D37283 llvm-svn: 314538	2017-09-29 17:14:09 +00:00
Adam Nemet	3a762d9b0e	Display relative hotness with two decimal digits after the decimal point I've seen cases where tiny inlined functions have such a high execution count that most everything would show up with a relative of hotness of 0%. Since the inlined functions effectively disappear you need to tune in the lower range, thus we need more precision. llvm-svn: 314537	2017-09-29 16:56:54 +00:00
Simon Pilgrim	1ad9ea3ae2	Fix Wmismatched-tags warning. InlineAsmIdentifierInfo was declared a class in some places and a struct in others. Partial reversion of rL314508 llvm-svn: 314536	2017-09-29 16:52:27 +00:00
Francis Ricci	a7bf226529	[test] Enable LeakSanitizer on 64-bit Darwin ASan llvm builds Summary: Also disables leak checking on lto tests, due to many leaks reported in the system's ld64. Reviewers: kcc, pcc, bogner, kubamracek Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D37781 llvm-svn: 314535	2017-09-29 16:51:50 +00:00
Sam Clegg	63ebb81386	[WebAssembly] Allow each data segment to specify its own alignment Also, add a flags field as we will almost certainly be needing that soon too. Differential Revision: https://reviews.llvm.org/D38296 llvm-svn: 314534	2017-09-29 16:50:08 +00:00
Hongbin Zheng	c8abdf5f25	[SimplifyIndVar] Do not fail when we constant fold an IV user to ConstantPointerNull The type of a SCEVConstant may not match the corresponding LLVM Value. In this case, we skip the constant folding for now. TODO: Replace ConstantInt Zero by ConstantPointerNull llvm-svn: 314531	2017-09-29 16:32:12 +00:00
Nicolai Haehnle	c2e79c2dfc	AMDGPU: fix bad test exposed by r314522 The test attempts to use -1 as carry-in for v_addc_*. Before writing r314522, I did actually test this on real hardware, and found that it doesn't work. So r314522 is correct in restricting the carry-in operand: just remove those tests to make things pass again. llvm-svn: 314530	2017-09-29 16:07:05 +00:00
Teresa Johnson	0d0ba25470	[ThinLTO] Use decimal suffix for promoted values to match demanglers Summary: Demanglers such as libiberty know how to strip suffixes of the form \.[a-zA-Z]+\.\d+, but our current promoted value suffixes are .llvm.${modulehash}, where the module hash is in hex. Change the module hash to decimal to allow demanglers to handle this. Reviewers: danielcdh Subscribers: llvm-commits, inglorion Differential Revision: https://reviews.llvm.org/D38405 llvm-svn: 314527	2017-09-29 15:55:42 +00:00
Jonas Devlieghere	a15f25d325	[dwarfdump][NFC] Consistent printing of address ranges This implement the insertion operator for DWARF address ranges so they are consistently printed as [LowPC, HighPC). While a dump method might have felt more consistent, it is used exclusively for printing error messages in the verifier and never used for actual dumping. Hence this approach is more intuitive and creates less clutter at the call sites. Differential revision: https://reviews.llvm.org/D38395 llvm-svn: 314523	2017-09-29 15:41:22 +00:00
Nicolai Haehnle	ce4ddd06da	AMDGPU: VALU carry-in and v_cndmask condition cannot be EXEC The hardware will only forward EXEC_LO; the high 32 bits will be zero. Additionally, inline constants do not work. At least, v_addc_u32_e64 v0, vcc, v0, v1, -1 which could conceivably be used to combine (v0 + v1 + 1) into a single instruction, acts as if all carry-in bits are zero. The llvm.amdgcn.ps.live test is adjusted; it would be nice to combine s_mov_b64 s[0:1], exec v_cndmask_b32_e64 v0, v1, v2, s[0:1] into v_mov_b32 v0, v3 but it's not particularly high priority. Fixes dEQP-GLES31.functional.shaders.helper_invocation.value.* llvm-svn: 314522	2017-09-29 15:37:31 +00:00
Jun Bum Lim	0e16a59e83	Use the basic cost if a GEP is not used as addressing mode Summary: Currently, getGEPCost() returns TCC_FREE whenever a GEP is a legal addressing mode in the target. However, since it doesn't check its actual users, it will return FREE even in cases where the GEP cannot be folded away as a part of actual addressing mode. For example, if an user of the GEP is a call instruction taking the GEP as a parameter, then the GEP may not be folded in isel. Reviewers: hfinkel, efriedma, mcrosier, jingyue, haicheng Reviewed By: hfinkel Subscribers: javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D38085 llvm-svn: 314517	2017-09-29 14:50:16 +00:00
Jonas Paulsson	c9e363ac69	[SystemZ] implement shouldCoalesce() Implement shouldCoalesce() to help regalloc avoid running out of GR128 registers. If a COPY involving a subreg of a GR128 is coalesced, the live range of the GR128 virtual register will be extended. If this happens where there are enough phys-reg clobbers present, regalloc will run out of registers (if there is not a single GR128 allocatable register available). This patch tries to allow coalescing only when it can prove that this will be safe by checking the (local) interval in question. Review: Ulrich Weigand, Quentin Colombet https://reviews.llvm.org/D37899 https://bugs.llvm.org/show_bug.cgi?id=34610 llvm-svn: 314516	2017-09-29 14:31:39 +00:00
Simon Pilgrim	b6cf279214	Fix spelling in comments. NFCI. llvm-svn: 314515	2017-09-29 14:13:47 +00:00
Amara Emerson	7d6c55f8aa	[X86] Improve codegen for inverted overflow checking intrinsics. Adds a new combine for: xor(setcc cc, val), 1 --> setcc (invert(cc), val) Differential Revision: https://reviews.llvm.org/D38161 llvm-svn: 314514	2017-09-29 13:53:44 +00:00
Sam Parker	963da5b119	[ARM] v8.3-a complex number support New instructions are added to AArch32 and AArch64 to aid floating-point multiplication and addition of complex numbers, where the complex numbers are packed in a vector register as a pair of elements. The Imaginary part of the number is placed in the more significant element, and the Real part of the number is placed in the less significant element. This patch adds assembler for the ARM target. Differential Revision: https://reviews.llvm.org/D36789 llvm-svn: 314511	2017-09-29 13:11:33 +00:00
Michael Zuckerman	0b5db55b96	Small modification <NFC> Change-Id: I360abccee12cae29bd2ac4f8399c9ecc92eb7f13 llvm-svn: 314510	2017-09-29 12:45:54 +00:00
Simon Pilgrim	dbcad23e50	Fix Wmismatched-tags warning. InlineAsmIdentifierInfo was declared a class in some places and a class in others. llvm-svn: 314508	2017-09-29 11:42:05 +00:00
Aleksandar Beserminji	29341b88ac	[mips] Reordering callseq* nodes to be linear Fix nested callseq* nodes by moving callseq_start after the arguments calculation to temporary registers, so that callseq* nodes in resulting DAG are linear. Recommitting r314497. This version does not contain test which fails when compiler is not build in debug mode. Differential Revision: https://reviews.llvm.org/D37328 llvm-svn: 314507	2017-09-29 11:05:02 +00:00
Aleksandar Beserminji	a0a01e7172	Revert "[mips] Reordering callseq* nodes to be linear" Added test relies on the compiler being built in debug mode, which may not be the case. This reverts commit r314497. llvm-svn: 314506	2017-09-29 10:52:03 +00:00
Simon Dardis	f21d8d6ad5	[mips] Add missing license info, formatting changes. NFCI Add missing license information to MicroMipsInstrFPU.td and fix most of the formatting errors present. Others will be addressed in a follow up commits. llvm-svn: 314505	2017-09-29 10:08:06 +00:00
Simon Pilgrim	2b96841d1d	[X86][SSE] Added more tests for vector multiplications as utility for D37896 Added additional tests for vector multiplications with multipliers that are: * powers of 2 displaced by 1, * product of a power of 2 displaced by one with another power of 2. Patch by @pacxx (Michael Haidl) Differential Revision: https://reviews.llvm.org/D38350 llvm-svn: 314504	2017-09-29 10:02:01 +00:00
Aleksandar Beserminji	0168ef26ec	[mips] Add test cases for dext/dins family of instructions Add missing test cases for dext, dextm, dextu, dins, dinsm and dinsu instructions. Differential Revision: https://reviews.llvm.org/D37741 llvm-svn: 314503	2017-09-29 09:53:24 +00:00
Tim Renouf	ef1ae8ffac	[AMDGPU] calling conventions for AMDPAL OS type Summary: This commit adds comments on how the AMDPAL OS type overloads the existing AMDGPU_ calling conventions used by Mesa, and adds a couple of new ones. Reviewers: arsenm, nhaehnle, dstuttard Subscribers: mehdi_amini, kzhuravl, wdng, yaxunl, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D37752 llvm-svn: 314502	2017-09-29 09:51:22 +00:00
Tim Renouf	132291589f	[AMDGPU] AMDPAL scratch buffer support Summary: Added support for scratch (including spilling) for OS type amdpal: generates code to set up the scratch descriptor if it is needed. With amdpal, the scratch resource descriptor is loaded from offset 0 of the global information table. The low 32 bits of the address of the global information table is passed in s0. Added amdgpu-git-ptr-high function attribute to hard-wire the high 32 bits of the address of the global information table. If the function attribute is not specified, or is 0xffffffff, then the backend generates code to use the high 32 bits of pc. The documentation for the AMDPAL ABI will be added in a later commit. Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, t-tye Differential Revision: https://reviews.llvm.org/D37483 llvm-svn: 314501	2017-09-29 09:49:35 +00:00
Tim Renouf	9f7ead3334	[Triple] Add AMDPAL operating system type Summary: This operating system type represents the AMDGPU PAL runtime, and will be required by the AMDGPU backend in order to generate correct code for this runtime. Currently it generates the same code as not specifying an OS at all. That will change in future commits. Patch from Tim Corringham. Subscribers: arsenm, nhaehnle Differential Revision: https://reviews.llvm.org/D37380 llvm-svn: 314500	2017-09-29 09:48:12 +00:00
Jonas Devlieghere	19fc4d941f	[dwarfdump][NFC] Consistent errors and warnings with --verify This patch introduces 3 helper functions: error(), warn() and note() to make printing during verification more consistent. When supported, the respective prefixes are printed in color using the same color scheme as clang. Differential revision: https://reviews.llvm.org/D38368 llvm-svn: 314498	2017-09-29 09:33:31 +00:00
Aleksandar Beserminji	502dcb035a	[mips] Reordering callseq* nodes to be linear Fix nested callseq* nodes by moving callseq_start after the arguments calculation to temporary registers, so that callseq* nodes in resulting DAG are linear. Differential Revision: https://reviews.llvm.org/D37328 llvm-svn: 314497	2017-09-29 09:32:14 +00:00
Coby Tayree	c3d24118e8	[X86][MS-InlineAsm] Extended support for variables / identifiers on memory / immediate expressions Allow the proper recognition of Enum values and global variables inside ms inline-asm memory / immediate expressions, as they require some additional overhead and treated incorrect if doesn't early recognized. supersedes D33278, D35774 Differential Revision: https://reviews.llvm.org/D37412 llvm-svn: 314493	2017-09-29 07:02:46 +00:00
Adam Nemet	9d57dc6fb1	Make find_opt_files vararg This is slightly less verbose for the common case of a single build directory and more intuitive when using this API directly from the interpreter. llvm-svn: 314491	2017-09-29 05:20:53 +00:00
Lang Hames	13cda49c96	[ORC] Replace decltype with a concrete type to make MSVC happy. This should fix some build failures on windows bots due to r314486. llvm-svn: 314490	2017-09-29 05:03:43 +00:00
Brian Gesiak	16b86e7d18	[CMake] Fix typo "Wraning" (NFC) Summary: The typo was added in https://reviews.llvm.org/rL247151. It should be "warning", not "wraning". llvm-svn: 314486	2017-09-29 02:48:07 +00:00
Saleem Abdulrasool	46ee7330bb	llvm-readobj: fix a few typos (NFC) Correct the spelling of multiple in a couple of sites. Patch by Alex Langford! llvm-svn: 314485	2017-09-29 02:45:44 +00:00
Sanjoy Das	0ac5ba5ade	Revert "[BypassSlowDivision] Improve our handling of divisions by constants" This reverts commit r314253. It causes a miscompile on P100 in an internal benchmark. Reverting while I investigate. llvm-svn: 314482	2017-09-29 00:54:16 +00:00
Adrian Prantl	f51e78017d	llvm-dwarfdump: support .apple-namespaces in --find llvm-svn: 314481	2017-09-29 00:52:33 +00:00
Marek Sokolowski	4a765da3e9	[llvm-rc] Import all make_unique invocations from llvm namespace. Previous patch fixed one of LLVM buildbots (lld-x86_64-win7). However, some others have already been failing because of make_unique compilation error (llvm-clang-x86_64-expensive-checks-win). llvm-svn: 314480	2017-09-29 00:33:57 +00:00
Adrian Prantl	714ee4d536	llvm-dwarfdump: add support for .apple_types in --find llvm-svn: 314479	2017-09-29 00:33:22 +00:00
Marek Sokolowski	b5f39a05a3	[llvm-rc] Add user-defined resources parsing ability. [8/8] This allows llvm-rc to parse user-defined resources (ref: msdn.microsoft.com/en-us/library/windows/desktop/aa381054.aspx). These statements either import files, or put the specified raw data in the resulting resource file. Thanks to Nico Weber for his original work in this area. Differential Revision: https://reviews.llvm.org/D37033 llvm-svn: 314478	2017-09-29 00:14:18 +00:00
Marek Sokolowski	7e89ee7fdc	[llvm-rc] Add integer expressions parsing ability. [7/8] This allows the ints to be written as integer expressions evaluating to unsigned 16-bit/32-bit integers. All the expressions may use the following operators: + - & \| ~, and parentheses. Minus token - can be also unary. There is no precedence of the operators other than the unary operators binding stronger than their binary counterparts. Differential Revision: https://reviews.llvm.org/D37022 llvm-svn: 314477	2017-09-28 23:53:25 +00:00
Jessica Paquette	919991690c	[MachineOutliner][NFC] Simplify logic in pruneCandidates This commit yanks out the repeated sections of code in pruneCandidates into two lambdas: ShouldSkipCandidate and Prune. This simplifies the logic in pruneCandidates significantly, and reduces the chance of introducing bugs by folding all of the shared logic into one place. llvm-svn: 314475	2017-09-28 23:39:36 +00:00
Craig Topper	6255c7b675	[X86] Don't select (cmp (and, imm), 0) to testw Summary: X86ISelDAGToDAG tries to analyze ANDs compared with 0 to optimize to narrower immediates using subregisters. I don't think we should be optimizing to 16-bit test instructions. It goes against our normal behavior of promoting i16 operations to i32. It only saves one byte due to the need to add a 0x66 prefix. I think it would also be subject to a length changing prefix penalty in the decoders on Intel CPUs. Reviewers: RKSimon, zvi, spatel Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38273 llvm-svn: 314474	2017-09-28 23:35:36 +00:00
Marek Sokolowski	99ead70fea	[llvm-rc] Fix-up for r314468 (argument-dependent lookup in make_unique). llvm-svn: 314472	2017-09-28 23:12:53 +00:00
Matthias Braun	51687912a4	ARM: Fix cases where CSI Restored bit is not cleared LR is an untypical callee saved register in that it is restored into a different register (PC) and thus does not live-out of the return block. This case requires the `Restored` flag in CalleeSavedInfo to be cleared. This fixes a number of cases where this wasn't handled correctly yet. llvm-svn: 314471	2017-09-28 23:12:06 +00:00
Yonghong Song	ef29a84d48	bpf: fix a bug for disassembling ld_pseudo inst Signed-off-by: Yonghong Song <yhs@fb.com> llvm-svn: 314469	2017-09-28 22:47:34 +00:00
Marek Sokolowski	fb74cb1edf	[llvm-rc] Add VERSIONINFO parsing ability. [6/8] This extends the set of llvm-rc parser's available resources by another one, VERSIONINFO. Ref: msdn.microsoft.com/en-us/library/windows/desktop/aa381058.aspx Thanks to Nico Weber for his original work in this area. Differential Revision: https://reviews.llvm.org/D37021 llvm-svn: 314468	2017-09-28 22:41:38 +00:00
Eugene Zelenko	3b87336a0c	[Hexagon] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 314467	2017-09-28 22:27:31 +00:00
Sanjay Patel	4664d77316	[x86] add tests for possible insertelement to shuffle transform; NFC See PR34716 and D38316 for more discussion. llvm-svn: 314466	2017-09-28 22:27:25 +00:00
Ulrich Weigand	df86855f61	[SystemZ] Fix fall-out from r314428 The expensive-checks build bot found a problem with the r314428 commit: if CC is live after a ATOMIC_CMP_SWAPW instruction, it needs to be marked as live-in to the block after the loop the pseudo gets expanded to. This actually fixes a code-gen bug as well, since if the CC isn't live, the CR and JLH are merged to a CRJLH which doesn't actually set the condition code any more. llvm-svn: 314465	2017-09-28 22:08:25 +00:00
Craig Topper	ed19350293	[X86] Make use of vpmovwb when possible in LowerMULH If we have BWI, we can truncate in a much simpler way by using vpmovwb. This even works without VLX by using the wider zmm->ymm truncate with a subvector extract. Differential Revision: https://reviews.llvm.org/D38375 llvm-svn: 314457	2017-09-28 20:10:34 +00:00
Evgeniy Stepanov	fa769be5e7	Fix -Werror build. /code/llvm-project/llvm/unittests/ExecutionEngine/Orc/RTDyldObjectLinkingLayerTest.cpp:260:38: error: lambda capture 'this' is not used [-Werror,-Wunused-lambda-capture] [this](decltype(ObjLayer)::ObjHandleT, llvm-svn: 314454	2017-09-28 19:43:53 +00:00
Martin Storsjo	d6218cc385	[ARM] Restore the right frame pointer register in Int_eh_sjlj_longjmp In setupEntryBlockAndCallSites in CodeGen/SjLjEHPrepare.cpp, we fetch and store the actual frame pointer, but on return via the longjmp intrinsic, it always was restored into the r7 variable. On windows, the frame pointer should be restored into r11 instead of r7. On Darwin (where sjlj exception handling is used by default), the frame pointer is always r7, both in arm and thumb mode, and likewise, on windows, the frame pointer always is r11. On linux however, if sjlj exception handling is enabled (which it isn't by default), libcxxabi and the user code can be built in differing modes using different registers as frame pointer. Therefore, when restoring registers on a platform where we don't always use the same register depending on code mode, restore both r7 and r11. Differential Revision: https://reviews.llvm.org/D38253 llvm-svn: 314451	2017-09-28 19:04:30 +00:00
Martin Storsjo	adceba59a2	[ARM] Fix SJLJ exception handling when manually chosen on a platform where it isn't default Differential Revision: https://reviews.llvm.org/D38252 llvm-svn: 314450	2017-09-28 19:04:14 +00:00
Matthias Braun	5c3e8a450e	MIR: Serialize CaleeSavedInfo Restored flag llvm-svn: 314449	2017-09-28 18:52:14 +00:00
Craig Topper	56bfbfb117	[AVX512] Add avx512bw command lines to 128-bit idiv tests. The multiply lowering on some of the tests can take advantage of the vpmovwb to simplify the truncate. llvm-svn: 314448	2017-09-28 18:45:29 +00:00
Craig Topper	3819be6cf6	[X86] Use target independent ZERO_EXTEND/SIGN_EXTEND nodes were possible in LowerMULH We aren't do any in register extends here so we should be able to just the target independent nodes directly and allow them to be lowered as necessary. llvm-svn: 314447	2017-09-28 18:45:28 +00:00
Craig Topper	fc104bfbc0	[X86] Move a setOperation action for ISD::TRUNCATE near another one in the same if. Remove one that is redundant with another subtarget features. llvm-svn: 314446	2017-09-28 18:45:27 +00:00
Adrian Prantl	2095e60851	Address further review feedback. (NFC) llvm-svn: 314443	2017-09-28 18:31:51 +00:00
Adrian Prantl	367064abe4	try and appease gcc llvm-svn: 314442	2017-09-28 18:27:00 +00:00
Adrian Prantl	99fdb9d927	llvm-dwarfdump: implement --find for .apple_names This patch implements the dwarfdump option --find=<name>. This option looks for a DIE in the accelerator tables and dumps it if found. This initial patch only adds support for .apple_names to keep the review small, adding the other sections and pubnames support should be trivial though. Differential Revision: https://reviews.llvm.org/D38282 llvm-svn: 314439	2017-09-28 18:10:52 +00:00
Lang Hames	705db63ce1	[ORC] Fix the type of RTDyldObjectLinkingLayer::NotifyLoadedFtor. Bug found by Stefan Granitz. Thanks Stefan! llvm-svn: 314436	2017-09-28 17:43:07 +00:00
Evandro Menezes	3701df55c6	[JumpThreading] Preserve DT and LVI across the pass JumpThreading now preserves dominance and lazy value information across the entire pass. The pass manager is also informed of this preservation with the goal of DT and LVI being recalculated fewer times overall during compilation. This change prepares JumpThreading for enhanced opportunities; particularly those across loop boundaries. Patch by: Brian Rzycki <b.rzycki@samsung.com>, Sebastian Pop <s.pop@samsung.com> Differential revision: https://reviews.llvm.org/D37528 llvm-svn: 314435	2017-09-28 17:24:40 +00:00
Craig Topper	ceff6da6e9	[X86] Use BWI instructions to improve lowering of v32i8 MULHU/S Summary: If we have BWI instructions we can widen to v32i16 to do the multiply instead of splitting. Reviewers: RKSimon, spatel, zvi Reviewed By: zvi Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38305 llvm-svn: 314432	2017-09-28 17:00:21 +00:00
Craig Topper	fd6b8a67fb	[X86] Remove dead code from X86ISelDAGToDAG.cpp multiply handling Summary: Lowering never creates X86ISD::UMUL for 8-bit types. X86ISD::UMUL8 is used instead. If X86ISD::UMUL 8-bit were ever used it would crash. DAGCombiner replaces UMUL_LOHI/SMUL_LOHI with a wider MUL and a shift if the type twice as wide is legal. So we should never see i8 UMUL_LOHI/SMUL_LOHI. In fact I think there was a bug in part of the i8 code. Similar is true for i16 though without the bug. Reviewers: RKSimon, spatel, zvi Reviewed By: zvi Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38276 llvm-svn: 314430	2017-09-28 16:56:36 +00:00
Craig Topper	71a8cf9f99	[X86] Use correct subvector index when combining two insert subvectors featuring zero vectors. Previously we were using one of the subvector indices twice. The included test case causes an assert without this change. Thanks to Simon Pilgrim for catching this. llvm-svn: 314429	2017-09-28 16:53:16 +00:00
Ulrich Weigand	0f1de04979	[SystemZ] Custom-expand ATOMIC_CMP_AND_SWAP_WITH_SUCCESS The SystemZ compare-and-swap instructions already provide the "success" indication via a condition-code value, so the default expansion of those operations generates an unnecessary extra comparsion. llvm-svn: 314428	2017-09-28 16:22:54 +00:00
Jonas Devlieghere	35fdaa94f7	[dwarfdump] Verify that CUs have a unit DIE. This patch adds a check to the DWARF verifier to detect CUs without a unit DIE. Differential revision: https://reviews.llvm.org/D38363 llvm-svn: 314426	2017-09-28 15:57:50 +00:00
Simon Pilgrim	2ff339303e	Use SDValue::getConstantOperandVal helper. NFCI. llvm-svn: 314425	2017-09-28 15:53:27 +00:00
Simon Dardis	c8e33c5ca1	[mips] Remove codegen support for branch likely instructions. This patch disables codegen support for branch likely instructions to address a potential bug. These branches were unselectable as they had the same patterns as the normal branches but came after them when ISel was concerned. The branch likely instructions were marked as having no delay slots when they have annulling delay slots. The delay slot filler does not currently handle annulling delay slot branches, so this would lead to wrong codegen if these branches were generated. Reviewers: atanasyan, nitesh.jain Differential Revision: https://reviews.llvm.org/D38169 llvm-svn: 314421	2017-09-28 15:24:07 +00:00
Hans Wennborg	6519562bc6	Docs: fix link to Debugger intrinsic functions llvm-svn: 314420	2017-09-28 15:16:37 +00:00
Benjamin Kramer	c965b30e54	[LoopUnroll] Fix use after poison. llvm-svn: 314418	2017-09-28 14:47:39 +00:00
Amara Emerson	bb16282fb1	[X86] Add overflow intrinsic test in preparation for D38161. This commit adds the test file before codegen changes as requested in D38161 to make it easier to see the difference. llvm-svn: 314416	2017-09-28 13:43:48 +00:00
Bjorn Pettersson	715a5efaad	[DebugInfo] Do not extend range for physreg in LiveDebugVariables Summary: A DBG_VALUE that is referring to a physical register is valid up until the next def of the register, or the end of the basic block that it belongs to. LiveDebugVariables is computing live intervals (slot index ranges) for DBG_VALUE instructions, before regalloc, in order to be able to re-insert DBG_VALUE instructions again after regalloc. When the DBG_VALUE is mapping a variable to a physical register we do not need to compute the range. We should simply re-insert the DBG_VALUE at the start position. The problem that was found, resulting in this patch, was a situation when the DBG_VALUE was the last real use of the physical register. The computeIntervals/extendDef methods extended the range to cover the whole basic block, even though the physical register very well could be allocated to some virtual register inside the basic block. So the extended range could not be trusted. This patch is a preparation for https://reviews.llvm.org/D38229, where the goal is to insert DBG_VALUE after each new definition of a variable, even if the virtual registers that the variable was connected to has been coalesced into using the same physical register (e.g. due to two address instructions). For more info see https://bugs.llvm.org/show_bug.cgi?id=34545 Reviewers: aprantl, rnk, echristo Reviewed By: aprantl Subscribers: Ka-Ka, llvm-commits Differential Revision: https://reviews.llvm.org/D38140 llvm-svn: 314414	2017-09-28 13:10:06 +00:00
Benjamin Kramer	8df9bfcd8a	[LoopInfo] Don't poison random memory regions. The second argument for Allocator::Deallocate is the number of elements, not the size of a single element. In asan mode specifying a large number of elements poisoned random memory regions, leading to crashes everywhere. llvm-svn: 314413	2017-09-28 12:53:20 +00:00
Florian Hahn	8af01573a3	[LVI] Move LVILatticeVal class to separate header file (NFC). Summary: This allows sharing the lattice value code between LVI and SCCP (D36656). It also adds a `satisfiesPredicate` function, used by D36656. Reviewers: davide, sanjoy, efriedma Reviewed By: sanjoy Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D37591 llvm-svn: 314411	2017-09-28 11:09:22 +00:00
Coby Tayree	566348f2a0	[x86][AsmParser] Allow some more MS size directives MS allows the following size directives: float/double and long as synonymous to dword/qword and dword, respectively. Differential Revision: https://reviews.llvm.org/D37190 llvm-svn: 314410	2017-09-28 11:04:08 +00:00
Sean Eveson	fa8ef35e78	[llvm-cov] Create directory structure when filtering using -name= options Before this change using any of the -name= command line options with an output directory would result in a single file (functions.txt/functions.html) containing the coverage for those specific functions. Now you get the same directory structure as when not using any -name*= options. Differential Revision: https://reviews.llvm.org/D38280 llvm-svn: 314396	2017-09-28 10:07:30 +00:00
Alex Bradbury	5518cbfc41	Teach TargetInstrInfo::getInlineAsmLength to parse .space directives with integer arguments It's currently quite difficult to test passes like branch relaxation, which requires branches with large displacement to be generated. The .space assembler directive makes it easy to create arbitrarily large basic blocks, but getInlineAsmLength is not able to parse it and so the size of the block is not correctly estimated. Other backends (AArch64, AMDGPU) introduce options just for testing that artificially restrict the ranges of branch instructions (e.g. aarch64-tbz-offset-bits). Although parsing a single form of the .space directive feels inelegant, it does allow a more direct testing approach. This patch adapts the .space parsing code from Mips16InstrInfo::getInlineAsmLength and removes it now the extra functionality is provided by the base implementation. I want to move this functionality to the generic getInlineAsmLength as 1) I need the same for RISC-V, and 2) I feel other backends will benefit from more direct testing of large branch displacements. Differential Revision: https://reviews.llvm.org/D37798 llvm-svn: 314393	2017-09-28 09:31:46 +00:00
Hiroshi Inoue	79c0bec06e	[PowerPC] eliminate partially redundant compare instruction This is a follow-on of D37211. D37211 eliminates a compare instruction if two conditional branches can be made based on the one compare instruction, e.g. if (a == 0) { ... } else if (a < 0) { ... } This patch extends this optimization to support partially redundant cases, which often happen in while loops. For example, one compare instruction is moved from the loop body into the preheader by this optimization in the following example. do { if (a == 0) dummy1(); a = func(a); } while (a > 0); Differential Revision: https://reviews.llvm.org/D38236 llvm-svn: 314390	2017-09-28 08:38:19 +00:00
Alex Bradbury	9d3f12501a	[RISCV] Add common fixups and relocations %lo(), %hi(), and %pcrel_hi() are supported and test cases have been added to ensure the appropriate fixups and relocations are generated. I've added an instruction format field which is used in RISCVMCCodeEmitter to, for instance, tell whether it should emit a lo12_i fixup or a lo12_s fixup (RISC-V has two 12-bit immediate encodings depending on the instruction type). Differential Revision: https://reviews.llvm.org/D23568 llvm-svn: 314389	2017-09-28 08:26:24 +00:00
Mikael Holmen	07f1e2e2b3	[RegAllocGreedy]: Allow recoloring of done register if it's non-tied Summary: If we have a non-allocated register, we allow us to try recoloring of an already allocated and "Done" register, even if they are of the same register class, if the non-allocated register has at least one tied def and the allocated one has none. It should be easier to recolor the non-tied register than the tied one, so it might be an improvement even if they use the same regclasses. Reviewers: qcolombet Reviewed By: qcolombet Subscribers: llvm-commits, MatzeB Differential Revision: https://reviews.llvm.org/D38309 llvm-svn: 314388	2017-09-28 08:22:35 +00:00
Alex Bradbury	52b68efdd4	[RISCV] Define RISC-V specific e_flags Add RISC-V e_flags as defined in the ABI document: https://github.com/riscv/riscv-elf-psabi-doc/blob/master/riscv-elf.md#file-header Differential Revision: https://reviews.llvm.org/D38310 Patch by Chih-Mao Chen. llvm-svn: 314386	2017-09-28 07:54:01 +00:00
Jatin Bhateja	75001c9ed8	[X86] Adding more cases to horizontal [f]add/[f]sub for avx512. Reviewers: jbhateja Reviewed By: jbhateja Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38344 llvm-svn: 314385	2017-09-28 07:40:52 +00:00
George Burgess IV	f8e11b803d	[DAGCombiner] Fix an off-by-one error in vector logic Without this, we could end up trying to get the Nth (0-indexed) element from a subvector of size N. Differential Revision: https://reviews.llvm.org/D37880 llvm-svn: 314380	2017-09-28 06:17:19 +00:00
Yonghong Song	e9165f8720	bpf: add new insns for bswap_to_le and negation This patch adds new insn, "reg = be16/be32/be64 reg", for bswap to little endian for big-endian target (bpfeb). It also adds new insn for negation "reg = -reg". Currently, for source code, e.g., b = -a LLVM still prefers to generate: b = 0 - a But "reg = -reg" format can be used in assembly code. Signed-off-by: Yonghong Song <yhs@fb.com> Acked-by: Alexei Starovoitov <ast@kernel.org> llvm-svn: 314376	2017-09-28 02:46:11 +00:00
Sanjoy Das	def1729dc4	Use a BumpPtrAllocator for Loop objects Summary: And now that we no longer have to explicitly free() the Loop instances, we can (with more ease) use the destructor of LoopBase to do what LoopBase::clear() was doing. Reviewers: chandlerc Subscribers: mehdi_amini, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D38201 llvm-svn: 314375	2017-09-28 02:45:42 +00:00
Lang Hames	cf771adfea	[ORC] Update the GlobalMappingLayer interface to fit the error-ized layer concept. Add a unit-test to make sure we don't backslide, and tweak the MockBaseLayer utility to make it easier to test this kind of thing in the future. llvm-svn: 314374	2017-09-28 02:17:35 +00:00
Rui Ueyama	5908845a7e	Fix a UBsan bot. If we do not initialize Prefix here, Prefix.data() returns a nullptr. Later, it is passed to memcpy. memcpy's behavior is undefined if src (or dst) is a nullptr even if a given size is 0. That's why this code triggered UBsan. llvm-svn: 314368	2017-09-28 00:27:39 +00:00
Eugene Zelenko	fa57bd0ced	[CodeGen] Fix some Clang-tidy modernize-use-default-member-init and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 314363	2017-09-27 23:26:01 +00:00
Justin Lebar	8ea84426c9	Check for overflows when calculating the offset in GetGEPCost. Summary: This avoids C++ UB if the GEP is weird and the calculation overflows int64_t, and it's also observable in the cost model's results. Such GEPs are almost surely not valid pointers, but LLVM nonetheless generates them sometimes. Reviewers: sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38337 llvm-svn: 314362	2017-09-27 23:16:56 +00:00
Galina Kistanova	1c6f0bb63e	Reverted r313993. This patch produces a crash and hexagon_vector_loop_carried_reuse_constant.ll test fails on Windows (llvm-clang-x86_64-expensive-checks-win build bot). llvm-svn: 314361	2017-09-27 23:09:14 +00:00
Craig Topper	0cd25942f7	Revert r314017 '[InstCombine] Simplify check for RHS being a splat constant in foldICmpUsingKnownBits by just checking Op1Min==Op1Max rather than going through m_APInt.' This reverts r314017 and similar code added in later commits. It seems to not work for pointer compares and is causing a bot failure for the last several days. llvm-svn: 314360	2017-09-27 22:57:18 +00:00
Dylan McKay	dffaaa3017	Update the description of AVR32 for the ELFDumper AVR32 is an unrelated architecture with 32-bit addressing. llvm-svn: 314359	2017-09-27 22:39:37 +00:00
Rui Ueyama	0dbb0f107e	Fix -Wunused-variable for Release build. llvm-svn: 314353	2017-09-27 22:03:15 +00:00
Sanjoy Das	4f3ebd537c	Return the LoopUnrollResult from tryToUnrollLoop; NFC I will use this in a later change. llvm-svn: 314352	2017-09-27 21:45:22 +00:00
Sanjoy Das	8e8c1bc490	LoopDeletion: use return value instead of passing in LPMUpdater; NFC I will use this refactoring in a later patch. llvm-svn: 314351	2017-09-27 21:45:21 +00:00
Sanjoy Das	3567d3d2ec	Rename LoopUnrollStatus to LoopUnrollResult; NFC A "Result" suffix is more appropriate here llvm-svn: 314350	2017-09-27 21:45:19 +00:00
Rui Ueyama	283f56ac03	Fix off-by-one error in TarWriter. The tar format originally supported up to 99 byte filename. The two extensions are proposed later: Ustar or PAX. In the UStar extension, a pathanme is split at a '/' and its "prefix" and "suffix" are stored in different locations in the tar header. Since "prefix" can be up to 155 byte, it can represent up to 254 byte filename (but exact limit depends on the location of '/' character in a pathname.) Our TarWriter first attempt to use UStar extension and then fallback to PAX extension. But there's a bug in UStar header creation. "Suffix" part must be a NUL- terminated string, but we didn't handle it correctly. As a result, if your filename just 100 characters long, the last character was droppped. This patch fixes the issue. Differential Revision: https://reviews.llvm.org/D38149 llvm-svn: 314349	2017-09-27 21:38:02 +00:00
Brian Gesiak	88f2aa12d9	[CMake] Fix typo: "in-tree" -> "in-source" (NFC) Summary: In-source builds of LLVM, in which a user invokes `cmake` from within the LLVM source directory, or invokes `cmake -B/path/to/source/dir/of/llvm`, are explicitly checked for and disallowed by LLVM's `CMakeLists.txt`. In-tree builds, on the other hand, refer to when the source directories of projects such as Clang are nested within the `llvm/tools` source directory. These are not disallowed, and are in fact a common way of building LLVM and Clang. Revise the comment to match the logic underneath it: it checks for an "in-source build", not an "in-tree build". Reviewers: beanz Reviewed By: beanz Subscribers: mgorny Differential Revision: https://reviews.llvm.org/D38317 llvm-svn: 314348	2017-09-27 21:37:33 +00:00
Don Hinton	53eb637115	Cleanup some problems with LLVM_ENABLE_DUMP in release builds, and always set LLVM_ENABLE_DUMP=ON for +Asserts builds. Differential Revision: https://reviews.llvm.org/D38306 llvm-svn: 314346	2017-09-27 21:19:56 +00:00
Rui Ueyama	23fa4de2db	Do not remove a target file in FileOutputBuffer::create(). FileOutputBuffer::create() attempts to remove a target file if the file is a regular one, which results in an unexpected result in a failure scenario. If something goes wrong and the user of FileOutputBuffer decides to not call commit(), it leaves nothing. An existing file is removed, and no new file is created. What we should do is to atomically replace an existing file with a new file using rename(), so that it wouldn't remove an existing file without creating a new one. Differential Revision: https://reviews.llvm.org/D38283 llvm-svn: 314345	2017-09-27 21:19:24 +00:00
Jessica Paquette	4cf187b5b4	[MachineOutliner] AArch64: Avoid saving + restoring LR if possible This commit allows the outliner to avoid saving and restoring the link register on AArch64 when it is dead within an entire class of candidates. This introduces changes to the way the outliner interfaces with the target. For example, the target now interfaces with the outliner using a MachineOutlinerInfo struct rather than by using getOutliningCallOverhead and getOutliningFrameOverhead. This also improves several comments on the outliner's cost model. https://reviews.llvm.org/D36721 llvm-svn: 314341	2017-09-27 20:47:39 +00:00
Craig Topper	c16a472966	Revert r314249 "Recommit r314151 "[X86] Make all the NOREX CodeGenOnly instructions into postRA pseudos like the NOREX version of TEST.""" This caused PR34751 llvm-svn: 314339	2017-09-27 20:34:17 +00:00
Craig Topper	e0d8290094	Revert r314248 "[X86] Don't emit X86::MOV8rr_NOREX from X86InstrInfo::copyPhysReg." This contributed to PR34751 llvm-svn: 314338	2017-09-27 20:34:13 +00:00
Simon Pilgrim	870007b4f8	[X86][SSE] Pull out variable shuffle mask combine logic. NFCI. Hopefully this will make it easier to vary the combine depth threshold per-target. llvm-svn: 314337	2017-09-27 20:19:53 +00:00
Than McIntosh	dee2cf67ea	[CodeGen] Emit necessary .note sections for -fsplit-stack Summary: According to https://gcc.gnu.org/wiki/SplitStacks, the linker expects a zero-sized .note.GNU-split-stack section if split-stack is used (and also .note.GNU-no-split-stack section if it also contains non-split-stack functions), so it can handle the cases where a split-stack function calls non-split-stack function. This change adds the sections if needed. Fixes PR #34670. Reviewers: thanm, rnk, luqmana Reviewed By: rnk Subscribers: llvm-commits Patch by Cherry Zhang <cherryyz@google.com> Differential Revision: https://reviews.llvm.org/D38051 llvm-svn: 314335	2017-09-27 19:34:00 +00:00
Craig Topper	7b1d503d7f	[X86] Rewrite the zero vector checks in lowerV2X128VectorShuffle to use the Zeroable APInt We already have zeroable bits in an APInt. We might as well use that instead of checking for an all zero BUILD_VECTOR. Differential Revision: https://reviews.llvm.org/D37950 llvm-svn: 314332	2017-09-27 18:56:20 +00:00
Craig Topper	05f71dd036	[X86] In combineLoopSADPattern, pad result with zeros and use full size add instead of using a smaller add and inserting. In some cases the result psadbw is smaller than the type of the add that started the match. Currently in these cases we are using a smaller add and inserting the result. If we instead combine the psadbw with zeros and use the full size add we can take advantage of implicit zeroing we get if we emit a narrower move before the add. In a future patch, I want to make isel aware that the psadbw itself already zeroed the upper bits and remove the move entirely. Differential Revision: https://reviews.llvm.org/D37453 llvm-svn: 314331	2017-09-27 18:36:45 +00:00
Alexey Bataev	022cc6c41e	[SLP] Fix crash on propagate IR flags for undef operands of min/max reductions. If both operands of the newly created SelectInst are Undefs the resulting operation is also Undef, not SelectInst. It may cause crashes when trying to propagate IR flags because function expects exactly SelectInst instruction, nothing else. llvm-svn: 314323	2017-09-27 17:42:49 +00:00
Roman Lebedev	1e053ab09a	[support] mapped_file_region: and fix the windows code too Followup for r314312 / r314313 Sorry, i really failed to fully grep all the codebase :/ llvm-svn: 314321	2017-09-27 17:24:34 +00:00
Chad Rosier	d8b4b06f5d	[InstCombine] Gating select arithmetic optimization. These changes faciliate positive behavior for arithmetic based select expressions that match its translation criteria, keeping code size gated to neutral or improved scenarios. Patch by Michael Berg <michael_c_berg@apple.com>! Differential Revision: https://reviews.llvm.org/D38263 llvm-svn: 314320	2017-09-27 17:16:51 +00:00
Geoff Berry	c032b2beb0	[AArch64][Falkor] Ignore SP based loads in HW prefetch fixups. Reviewers: mcrosier Subscribers: aemerson, rengolin, javed.absar, kristof.beyls Differential Revision: https://reviews.llvm.org/D38301 llvm-svn: 314319	2017-09-27 17:14:10 +00:00
Javed Absar	6c5605e772	[Misched] : Fix typo in comment. NFC. llvm-svn: 314316	2017-09-27 16:39:17 +00:00
Sanjay Patel	fee80d5e65	[SLP] fix typos/formatting; NFC llvm-svn: 314315	2017-09-27 16:32:56 +00:00
Sean Eveson	1439fa6236	Revert "[llvm-cov] Create directory structure when filtering using -name*= options" Test failures. llvm-svn: 314314	2017-09-27 16:20:07 +00:00
Roman Lebedev	21b013ebc1	[Support] mapped_file_region::size() returns size_t Fixup last commit, found by clang-stage1-cmake-RA-incremental bot. llvm-svn: 314313	2017-09-27 16:08:33 +00:00
Roman Lebedev	7c983671f2	[Support] mapped_file_region: store size as size_t Summary: Found when testing stage-2 build with D38101. ``` In file included from /build/llvm/lib/Support/Path.cpp:1045: /build/llvm/lib/Support/Unix/Path.inc:648:14: error: comparison 'uint64_t' (aka 'unsigned long') > 18446744073709551615 is always false [-Werror,-Wtautological-constant-compare] if (length > std::numeric_limits<size_t>::max()) { ~~~~~~ ^ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ``` `size_t` is `uint64_t` here, apparently, thus any `uint64_t` value always fits into `size_t`. Initial patch was to use some preprocessor logic to not check if the size is known to fit at compile time. But Zachary Turner suggested using this approach. Reviewers: Bigcheese, rafael, zturner, mehdi_amini Reviewed by (via email): zturner Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38132 llvm-svn: 314312	2017-09-27 15:59:16 +00:00
Sean Eveson	51b817479b	[llvm-cov] Create directory structure when filtering using -name= options Before this change using any of the -name= command line options with an output directory would result in a single file (functions.txt/functions.html) containing the coverage for those specific functions. Now you get the same directory structure as when not using any -name*= options. Differential Revision: https://reviews.llvm.org/D38280 llvm-svn: 314310	2017-09-27 15:37:40 +00:00
Sanjay Patel	0f9b4773c1	[SimplifyCFG] add a struct to house optional folds (PR34603) This was intended to be no-functional-change, but it's not - there's a test diff. So I thought I should stop here and post it as-is to see if this looks like what was expected based on the discussion in PR34603: https://bugs.llvm.org/show_bug.cgi?id=34603 Notes: 1. The test improvement occurs because the existing 'LateSimplifyCFG' marker is not carried through the recursive calls to 'SimplifyCFG()->SimplifyCFGOpt().run()->SimplifyCFG()'. The parameter isn't passed down, so we pick up the default value from the function signature after the first level. I assumed that was a bug, so I've passed 'Options' down in all of the 'SimplifyCFG' calls. 2. I split 'LateSimplifyCFG' into 2 bits: ConvertSwitchToLookupTable and KeepCanonicalLoops. This would theoretically allow us to differentiate the transforms controlled by those params independently. 3. We could stash the optional AssumptionCache pointer and 'LoopHeaders' pointer in the struct too. I just stopped here to minimize the diffs. 4. Similarly, I stopped short of messing with the pass manager layer. I have another question that could wait for the follow-up: why is the new pass manager creating the pass with LateSimplifyCFG set to true no matter where in the pipeline it's creating SimplifyCFG passes? // Create an early function pass manager to cleanup the output of the // frontend. EarlyFPM.addPass(SimplifyCFGPass()); --> /// \brief Construct a pass with the default thresholds /// and switch optimizations. SimplifyCFGPass::SimplifyCFGPass() : BonusInstThreshold(UserBonusInstThreshold), LateSimplifyCFG(true) {} <-- switches get converted to lookup tables and loops may not be in canonical form If this is unintended, then it's possible that the current behavior of dropping the 'LateSimplifyCFG' setting via recursion was masking this bug. Differential Revision: https://reviews.llvm.org/D38138 llvm-svn: 314308	2017-09-27 14:54:16 +00:00
Haicheng Wu	3ec848bc50	[InlineCost] add visitSelectInst() InlineCost can understand Select IR now. This patch finds free Select IRs and continue the propagation of SimplifiedValues, ConstantOffsetPtrs, and SROAArgValues. Differential Revision: https://reviews.llvm.org/D37198 llvm-svn: 314307	2017-09-27 14:44:56 +00:00
Gadi Haber	87337a2bb9	[X86][SKX][KNL] Updated regression tests to use -mattr instead of -mcpu flag.NFC. NFC. Updated 8 regression tests to use -mattr instead of -mcpu flag as follows: -mcpu=knl --> -mattr=+avx512f -mcpu=skx --> -mattr=+avx512f,+avx512bw,+avx512vl,+avx512dq The updates are as part of the preparation of a large commit to add all instruction scheduling for the SKX target. Reviewers: delena, zvi, RKSimon Differential Revision: https://reviews.llvm.org/D38222 Change-Id: I2381c9b5bb75ecacfca017243c22d054f6eddd14 llvm-svn: 314306	2017-09-27 14:44:15 +00:00
Zvi Rackover	eb7a0bf847	X86 Tests: Unsigned saturation subtraction tests. NFC. Summary: Adding tests for D37534. Commit on behalf of julia.koval@intel.com Reviewers: n.bozhenov, zvi, spatel, DavidKreitzer Reviewed By: zvi Differential Revision: https://reviews.llvm.org/D37510 llvm-svn: 314305	2017-09-27 14:38:05 +00:00
Krzysztof Parzyszek	d0b6ceb2a0	Typo: const MCSchedModel SchedModel -> const MCSchedModel &SchedModel llvm-svn: 314301	2017-09-27 12:48:48 +00:00
Mikael Holmen	3bcc9f0c1f	[RegAllocGreedy] Fix spelling error, "inteference" -> "interference", NFC llvm-svn: 314299	2017-09-27 11:27:50 +00:00
Hiroshi Inoue	ed1ffa49a4	[PowerPC] eliminate unconditional branch to the next instruction This patch makes analyzeBranch eliminate unconditional branch to the next instruction. After basic blocks are re-organized by optimizers, such as machine block placement, a BB may end with an unconditional branch to the next (fallthrough) BB. This patch removes such redundant branch instruction. Differential Revision: https://reviews.llvm.org/D37730 llvm-svn: 314297	2017-09-27 10:33:02 +00:00
Javed Absar	1a77bcc0d2	[Misched]: Remove double call getMicroOpFactor.NFC. Reviewed by: @MatzeB Differential Revision: https://reviews.llvm.org/D38176 llvm-svn: 314296	2017-09-27 10:31:58 +00:00
Coby Tayree	836c50cc2f	[X86][AsmParser] fix PR32035 Differential Revision: https://reviews.llvm.org/D37473 llvm-svn: 314295	2017-09-27 10:29:29 +00:00
Jonas Devlieghere	2bc4c5411f	[test] Don't verify .debug_line offsets in bitcode tests. The exact values of the .debug_line offsets should not be hard-coded in the checks for bitcode tests. Fixes: http://bb.pgr.jp/builders/test-llvm-i686-linux-RA/builds/543 llvm-svn: 314294	2017-09-27 10:23:34 +00:00
Simon Pilgrim	3b0d9e789e	[X86][AVX] Improve (i4 bitcast (v4i1 x)) handling for 256-bit vector compare results. As commented on D37849 and rL313547, AVX1 targets were missing a chance to use vmovmskpd for v4f64/v4i64 results for bool vector bitcasts llvm-svn: 314293	2017-09-27 10:10:17 +00:00
Simon Pilgrim	a932bfcc93	Use const where possible. NFCI. llvm-svn: 314292	2017-09-27 10:03:17 +00:00
Jonas Devlieghere	777731ab2b	[dwarfdump] Fix printing of .debug_line offset. Fixes 32-bit buildbots: http://bb.pgr.jp/builders/test-llvm-i686-linux-RA/builds/542 http://lab.llvm.org:8011/builders/clang-cmake-thumbv7-a15/builds/11533 http://lab.llvm.org:8011/builders/clang-cmake-armv7-a15/builds/11494 llvm-svn: 314291	2017-09-27 10:00:27 +00:00
Jonas Devlieghere	65af0f9584	[dwarfdump] Add support for -debug-line=OFFSET This patch adds support for passing an offset to -debug-line. Differential revision: https://reviews.llvm.org/D38240 llvm-svn: 314288	2017-09-27 09:33:45 +00:00
Jonas Devlieghere	622c563b5a	[dwarfdump] Add support for -debug-loc=OFFSET This patch adds support for passing an offset to -debug-loc. Differential revision: https://reviews.llvm.org/D38237 llvm-svn: 314286	2017-09-27 09:33:36 +00:00
Sean Eveson	25ea19ea86	[llvm-cov] Improve const-correctness of filters. NFC. llvm-svn: 314281	2017-09-27 08:32:36 +00:00
Sam Parker	211f47aa37	[ARM] isTruncateFree fix I implemented isTruncateFree in rL313533, this patch fixes the logic to match my comment, as the previous logic was too general. Now the only truncates that are free are i64 -> i32. Differential Revision: https://reviews.llvm.org/D38234 llvm-svn: 314280	2017-09-27 08:30:45 +00:00
Martin Pelikan	de4806d321	[XRay] initialize all members of YAMLXRayRecord for -Wmissing-field-initializers llvm-svn: 314278	2017-09-27 07:30:48 +00:00
Martin Storsjo	aa1533bf9b	[X86] Fix SJLJ struct offsets for x86_64 This is necessary, but not sufficient, for having working SJLJ exception handling on x86_64. Differential Revision: https://reviews.llvm.org/D38254 llvm-svn: 314277	2017-09-27 06:08:23 +00:00
Martin Storsjo	eccaf04e40	[X86] Remove erroneous callsite offsetting in SJLJ landing pads The callsite value is already stored indexed from 0 in the _Unwind_Context struct. When accessed via the functions _Unwind_GetIP and _Unwind_SetIP, the value is indexed from 1, but those functions handle the offseting. When reading directly from the struct here, we shouldn't subtract 1. This matches the code generated by the ARM target, where SJLJ exception handling is used by default on iOS. This makes clang-built object files for 32 bit x86 mingw work when linked with libgcc/libstdc++. Differential Revision: https://reviews.llvm.org/D38251 llvm-svn: 314276	2017-09-27 06:08:16 +00:00
Martin Storsjo	233349fe51	[X86] Correct byte offsets and data types in a comment. NFC. This matches the types of the struct members defined in lib/CodeGen/SjLjEHPrepare.cpp, and the definition of this struct in libgcc. Differential Revision: https://reviews.llvm.org/D38248 llvm-svn: 314275	2017-09-27 06:08:04 +00:00
Craig Topper	177a3923ce	[X86] Use extract128BitVector in LowerMULH so we can extract from constant build vectors. llvm-svn: 314274	2017-09-27 06:04:55 +00:00

... 2 3 4 5 6 ...

155008 Commits