llvm-project

Commit Graph

Author	SHA1	Message	Date
Craig Topper	4d93adfed5	[X86] Change XRSTOR to use PS instead of TB to match XSAVE. I don't think this changes anything functionally yet, but I plan to fix the disassembler to use this to disable matching certain instructions with 0xf3/0xf2/0x66 prefixes. llvm-svn: 316337	2017-10-23 16:11:33 +00:00
Simon Pilgrim	1dcb913be6	[X86][SSE] Remove AssertZext stage from PEXTRW/PEXTRB lowering. NFCI. Remove AssertZext and instead add PEXTRW/PEXTRB support to computeKnownBitsForTargetNode to simplify instruction selection. Differential Revision: https://reviews.llvm.org/D39169 llvm-svn: 316336	2017-10-23 16:00:57 +00:00
Nico Weber	ce2d749ed3	clang-cl: Expose --version. This is for consistency with lld-link, see https://reviews.llvm.org/D38972 Also give --version a help text so it shows up in --help / /? output (for both clang-cl and regular clang). llvm-svn: 316335	2017-10-23 15:54:44 +00:00
Andrew V. Tischenko	777308b548	Update DPPD/DPPS instruction scheduling on btver2. Differential Revision: https://reviews.llvm.org/D39046 llvm-svn: 316334	2017-10-23 15:53:30 +00:00
Craig Topper	8f182fdd8b	[X86] Add PTWRITE instruction for assembler and disassembler. llvm-svn: 316333	2017-10-23 15:53:21 +00:00
Craig Topper	5f0339d2f3	[X86] Add RDPID instruction for assembler and disassembler. llvm-svn: 316332	2017-10-23 15:53:16 +00:00
Simon Pilgrim	32da2f9245	[DAGCombine] Permit combining of shuffles of equivalent splat BUILD_VECTORs combineShuffleOfScalars is very conservative about shuffled BUILD_VECTORs that can be combined together. This patch adds one additional case - if both BUILD_VECTORs represent splats of the same scalar value but with different UNDEF elements, then we should create a single splat BUILD_VECTOR, sharing only the UNDEF elements defined by the shuffle mask. Differential Revision: https://reviews.llvm.org/D38696 llvm-svn: 316331	2017-10-23 15:48:08 +00:00
Sam McCall	f9cb007355	Support formatting formatv_objects. Summary: Support formatting formatv_objects. While here, fix documentation about member-formatters, and attempted perfect-forwarding (I think). Reviewers: zturner Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D38997 llvm-svn: 316330	2017-10-23 15:40:44 +00:00
Rui Ueyama	a4cf97bc9c	Add the --version option. Differential Revision: https://reviews.llvm.org/D38972 llvm-svn: 316329	2017-10-23 14:57:53 +00:00
Simon Pilgrim	03c8753924	[X86][SSE] Regenerate bitcast-and-setcc tests Avoid the retl/retq changes in an upcoming patch llvm-svn: 316328	2017-10-23 14:47:49 +00:00
Ilya Biryukov	b080cb155b	[clangd] Allow to pass code completion opts to ClangdServer. Reviewers: bkramer, krasimir, sammccall Reviewed By: krasimir Subscribers: klimek, cfe-commits Differential Revision: https://reviews.llvm.org/D38731 llvm-svn: 316327	2017-10-23 14:46:48 +00:00
Simon Pilgrim	e131cb0bd5	[X86][AVX2] Regenerate AVX2 intrinsics tests on 32 + 64-bit targets llvm-svn: 316326	2017-10-23 14:19:46 +00:00
Simon Pilgrim	c680c4742b	[X86][AVX] Regenerate AVX intrinsics tests on 32 + 64-bit targets llvm-svn: 316325	2017-10-23 14:17:59 +00:00
Simon Pilgrim	eae6e9dbc5	[X86][F16C] Regenerate F16C schedule tests llvm-svn: 316324	2017-10-23 14:15:24 +00:00
Ilya Biryukov	a7e3763a2b	[clangd] Updated outdated test comment. NFC. llvm-svn: 316323	2017-10-23 14:08:52 +00:00
Artur Gainullin	610df9c890	Test commit. llvm-svn: 316322	2017-10-23 13:25:49 +00:00
George Rimar	7fc298afe4	[llvm-dwarfdump] - Teach tool about few GNU call_sites constants. This teaches tool about following consants: DW_TAG_GNU_call_site, DW_TAG_GNU_call_site_parameter, DW_AT_GNU_call_site_value, DW_AT_GNU_all_call_sites. Constants documented here: https://sourceware.org/elfutils/DwarfExtensions Differential revision: https://reviews.llvm.org/D39119 llvm-svn: 316321	2017-10-23 11:24:14 +00:00
Ayman Musa	4b2bd5ff5e	[X86] Add test for opportunity to use bzhi X86 instruction instead of load+and instructions. Transformation uploaded for CR in https://reviews.llvm.org/D34141. llvm-svn: 316320	2017-10-23 10:24:19 +00:00
Andrew V. Tischenko	eff4fc0d41	Fix for Bug 30718 - Failure to disassemble certain MOV with rex.R. The issue was in illegal segment register index. Differential Revision: https://reviews.llvm.org/D38786 llvm-svn: 316319	2017-10-23 09:36:33 +00:00
Martin Storsjo	32fefef7fc	[MinGW] Omit libc++/libc++abi/libunwind from autoexporting Differential Revision: https://reviews.llvm.org/D39167 llvm-svn: 316318	2017-10-23 09:08:28 +00:00
Martin Storsjo	ddb094ad36	[COFF] Fix exporting of functions starting with underscores, etc This fixes exporting functions in the following cases: - functions starting with an underscore in def files - functions starting with an underscore, via dllexport attributes, for mingw - fastcall and vectorcall functions when declared undecorated in def files - vectorcall functions when declared decorated in def files - stdcall functions when declared decorated in def files for mingw This still exports the stdcall functions with the wrong name in the normal msvc/link.exe mode, if declared with decoration in the def file though (this is not a regression though). Exporting functions via def files including decoration is not something I believe is routinely done though, but is tested to try to match link.exe's behaviour as far as easily possible. Differential Revision: https://reviews.llvm.org/D39170 llvm-svn: 316317	2017-10-23 09:08:24 +00:00
Martin Storsjo	843cbbddeb	[COFF] Improve the check for functions that should get an extra underscore This fixes exporting functions starting with an underscore, and fully decorated fastcall/vectorcall functions. Tests will be added in the lld repo. Differential Revision: https://reviews.llvm.org/D39168 llvm-svn: 316316	2017-10-23 09:08:13 +00:00
Haojian Wu	1afddd4136	Fix a -Wpedantic warning. llvm-svn: 316315	2017-10-23 09:02:59 +00:00
Haojian Wu	54e84b3966	[rename] Don't overwrite the template argument when renaming a template function. Reviewers: ioeric Reviewed By: ioeric Subscribers: cierpuchaw, cfe-commits, klimek Differential Revision: https://reviews.llvm.org/D39120 llvm-svn: 316314	2017-10-23 08:58:50 +00:00
Sam Parker	487ab86942	[ARM] Allow unrolling of multi-block loops. Before, loop unrolling was only enabled for loops with a single block. This restriction has been removed and replaced by: - allow a maximum of two exiting blocks, - a four basic block limit for cores with a branch predictor. Differential Revision: https://reviews.llvm.org/D38952 llvm-svn: 316313	2017-10-23 08:05:14 +00:00
Ilya Biryukov	01e3bf8afd	[clangd] Report proper kinds for 'Keyword' and 'Snippet' completion items. Reviewers: rwols, malaperle, krasimir, bkramer, sammccall Reviewed By: rwols, sammccall Subscribers: klimek, cfe-commits Differential Revision: https://reviews.llvm.org/D38720 llvm-svn: 316311	2017-10-23 06:06:21 +00:00
Richard Smith	8910fe699e	For better compatibility with C++11 and C++14, emit a nondiscardable definition of a static constexpr data member if it's defined 'constexpr' out of line, not only if it's defined 'constexpr' in the class. llvm-svn: 316310	2017-10-23 03:58:34 +00:00
Craig Topper	64cb997ce1	[X86] Update a doxygen comment in the disassembler tablegen code. NFC llvm-svn: 316309	2017-10-23 03:42:35 +00:00
Craig Topper	326008c615	[X86] Fix disassembly of EVEX rounding control and SAE instructions. Fixes PR31955. llvm-svn: 316308	2017-10-23 02:26:24 +00:00
Petr Hosek	2fd533db9f	[ELF] When placing orphans, handle case when last section is dead r315292 introduced a change that's supposed to consistently ignore "dead" output sections when placing orphans. Unfortunately, that change doesn't handle the special case when the orphan section is second to last section and the last section is dead (e.g. because it's being discarded) introducing a regression in some cases. This change handles this case by using the same predicate when checking the last section. Differential Revision: https://reviews.llvm.org/D39172 llvm-svn: 316307	2017-10-23 00:51:08 +00:00
Rui Ueyama	8faafa4fb1	Add R_PPC_ADDR16_HI relocation support The support of R_PPC_ADDR16_HI improves ld compatibility and makes things on par with RuntimeDyldELF that already implements this relocation. Patch by vit9696. llvm-svn: 316306	2017-10-22 23:33:49 +00:00
Rui Ueyama	d96724db42	Remove a fast lookup table from MergeInputSection. We used to have a map from section piece offsets to section pieces as a cache for binary search. But I found that the map took quite a large amount of memory and didn't make linking faster. So, in this patch, I removed the map. This patch saves 566 MiB of RAM (2.019 GiB -> 1.453 GiB) when linking clang with debug info, and the link time is 4% faster in that test case. Thanks for Sean Silva for pointing this out. llvm-svn: 316305	2017-10-22 23:02:07 +00:00
Faisal Vali	39ff401026	[c++2a] Update cxx_status w __VA_OPT__ marked as completed in SVN. llvm-svn: 316304	2017-10-22 22:29:52 +00:00
Saleem Abdulrasool	9e802eaf60	ExecutionEngine: make COFF Thumb2 assertions non-tautological The overflow detection assertions were tautological due to truncation. Adjust them to no longer be tautological. Patch by Alex Langford! llvm-svn: 316303	2017-10-22 20:51:25 +00:00
Yichao Yu	92c11ee352	Fix invalid ptrtoint in InstCombine Summary: It's unclear if this is the only thing we can do but at least this is consistent with the check of address space agreement in `isBitCastable`. The code is used at least in both instcombine and jumpthreading though I could only find a way to trigger the invalid cast in instcombine. Reviewers: loladiro, sanjoy, majnemer Reviewed By: sanjoy Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34335 llvm-svn: 316302	2017-10-22 20:28:17 +00:00
Benjamin Kramer	24952ce5b9	Create fewer copies of StringMaps. No functionality change intended. llvm-svn: 316301	2017-10-22 20:16:28 +00:00
Martin Storsjo	c5115b9e65	Make HIDDEN_DIRECTIVE a function-like macro. NFCI. This avoids a hack for making it a no-op for windows. Also explicitly check for _WIN32 instead of assuming it. Differential Revision: https://reviews.llvm.org/D39156 llvm-svn: 316300	2017-10-22 19:39:26 +00:00
Benjamin Kramer	a7c822a238	[X86] Add missing override. NFC. llvm-svn: 316299	2017-10-22 19:16:31 +00:00
Sanjay Patel	b80daf0b48	[SimplifyCFG] delay switch condition forwarding to -latesimplifycfg As discussed in D39011: https://reviews.llvm.org/D39011 ...replacing constants with a variable is inverting the transform done by other IR passes, so we definitely don't want to do this early. In fact, it's questionable whether this transform belongs in SimplifyCFG at all. I'll look at moving this to codegen as a follow-up step. llvm-svn: 316298	2017-10-22 19:10:07 +00:00
Fangrui Song	dc168722da	[utils] Support -mtriple=powerpc64 Summary: test/CodeGen/PowerPC/pr33093.ll uses both powerpc64 (big-endian) and powerpc64le while the former was unsupported. Subscribers: nemanjai Differential Revision: https://reviews.llvm.org/D39164 llvm-svn: 316297	2017-10-22 18:43:23 +00:00
Simon Pilgrim	ce55eab936	Strip trailing whitespace. NFCI. llvm-svn: 316296	2017-10-22 18:38:57 +00:00
Marina Yatsina	f9371d821f	Add logic to greedy reg alloc to avoid bad eviction chains This fixes bugzilla 26810 https://bugs.llvm.org/show_bug.cgi?id=26810 This is intended to prevent sequences like: movl %ebp, 8(%esp) # 4-byte Spill movl %ecx, %ebp movl %ebx, %ecx movl %edi, %ebx movl %edx, %edi cltd idivl %esi movl %edi, %edx movl %ebx, %edi movl %ecx, %ebx movl %ebp, %ecx movl 16(%esp), %ebp # 4 - byte Reload Such sequences are created in 2 scenarios: Scenario #1: vreg0 is evicted from physreg0 by vreg1 Evictee vreg0 is intended for region splitting with split candidate physreg0 (the reg vreg0 was evicted from) Region splitting creates a local interval because of interference with the evictor vreg1 (normally region spliiting creates 2 interval, the "by reg" and "by stack" intervals. Local interval created when interference occurs.) one of the split intervals ends up evicting vreg2 from physreg1 Evictee vreg2 is intended for region splitting with split candidate physreg1 one of the split intervals ends up evicting vreg3 from physreg2 etc.. until someone spills Scenario #2 vreg0 is evicted from physreg0 by vreg1 vreg2 is evicted from physreg2 by vreg3 etc Evictee vreg0 is intended for region splitting with split candidate physreg1 Region splitting creates a local interval because of interference with the evictor vreg1 one of the split intervals ends up evicting back original evictor vreg1 from physreg0 (the reg vreg0 was evicted from) Another evictee vreg2 is intended for region splitting with split candidate physreg1 one of the split intervals ends up evicting vreg3 from physreg2 etc.. until someone spills As compile time was a concern, I've added a flag to control weather we do cost calculations for local intervals we expect to be created (it's on by default for X86 target, off for the rest). Differential Revision: https://reviews.llvm.org/D35816 Change-Id: Id9411ff7bbb845463d289ba2ae97737a1ee7cc39 llvm-svn: 316295	2017-10-22 17:59:38 +00:00
Craig Topper	dac20263a1	[X86] More correctly support LIG and WIG for EVEX instructions in the disassembler tables. This is similar to how we generate the VEX tables. More fixes are still needed for the instructions that use EVEX.b (broadcast and embedded rounding). llvm-svn: 316294	2017-10-22 17:22:29 +00:00
Sanjay Patel	24226504a7	[SimplifyCFG] try harder to forward switch condition to phi (PR34471) The missed canonicalization/optimization in the motivating test from PR34471 leads to very different codegen: int switcher(int x) { switch(x) { case 17: return 17; case 19: return 19; case 42: return 42; default: break; } return 0; } int comparator(int x) { if (x == 17) return 17; if (x == 19) return 19; if (x == 42) return 42; return 0; } For the first example, we use a bit-test optimization to avoid a series of compare-and-branch: https://godbolt.org/g/BivDsw Differential Revision: https://reviews.llvm.org/D39011 llvm-svn: 316293	2017-10-22 16:51:03 +00:00
Faisal Vali	81b756e6a3	[C++17] Fix PR34970 - tweak overload resolution for class template deduction-guides in line with WG21's p0620r0. In order to identify the copy deduction candidate, I considered two approaches: - attempt to determine whether an implicit guide is a copy deduction candidate by checking certain properties of its subsituted parameter during overload-resolution. - using one of the many bits (WillHaveBody) from FunctionDecl (that CXXDeductionGuideDecl inherits from) that are otherwise irrelevant for deduction guides After some brittle gymnastics w the first strategy, I settled on the second, although to avoid confusion and to give that bit a better name, i turned it into a member of an anonymous union. Given this identification 'bit', the tweak to overload resolution was a simple reordering of the deduction guide checks (in SemaOverload.cpp::isBetterOverloadCandidate), in-line with Jason Merrill's p0620r0 drafting which made it into the working paper. Concordant with that, I made sure the copy deduction candidate is always added. References: See https://bugs.llvm.org/show_bug.cgi?id=34970 See http://wg21.link/p0620r0 llvm-svn: 316292	2017-10-22 14:45:08 +00:00
Jan Vesely	7ab2d0bdcd	shared: Implement aligned vector stores (vstorea_half) Float version passes newly posted piglit tests on turks, float and double pass on carrizo. v2: scalar vstorea_half v3: fix typo Reviewer: Aaron Watry Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 316291	2017-10-22 14:21:59 +00:00
Jan Vesely	12061c7125	shared: Implement aligned vector loads (vloada_half) Passes newly posted piglits on turks and carrizo v2: add scalar vloada_half v3: fix typo Reviewer: Aaron Watry Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 316290	2017-10-22 14:21:56 +00:00
Momchil Velikov	d6a4ab3d49	[ARM] Dynamic stack alignment for 16-bit Thumb This patch implements dynamic stack (re-)alignment for 16-bit Thumb. When targeting processors, which support only the 16-bit Thumb instruction set the compiler ignores the alignment attributes of automatic variables and may silently generate incorrect code. Differential revision: https://reviews.llvm.org/D38143 llvm-svn: 316289	2017-10-22 11:56:35 +00:00
Guy Blank	92d5ce3bd4	[X86] Add a pass to convert instruction chains between domains. The pass scans the function to find instruction chains that define registers in the same domain (closures). It then calculates the cost of converting the closure to another domain. If found profitable, the instructions are converted to instructions in the other domain and the register classes are changed accordingly. This commit adds the pass infrastructure and a simple conversion from the GPR domain to the Mask domain. Differential Revision: https://reviews.llvm.org/D37251 Change-Id: Ic2cf1d76598110401168326d411128ae2580a604 llvm-svn: 316288	2017-10-22 11:43:08 +00:00
Nitesh Jain	757f74c2d3	[mips] Adds support for R_MIPS_26, HIGHER, HIGHEST relocations in RuntimeDyld. Reviewers: sdardis Subscribers: jaydeep, bhushan, llvm-commits Differential Revision: https://reviews.llvm.org/D38314 llvm-svn: 316287	2017-10-22 09:47:41 +00:00

1 2 3 4 5 ...

274542 Commits All Branches Search

274542 Commits

All Branches