llvm-project

Commit Graph

Author	SHA1	Message	Date
Yuka Takahashi	33cf63b7f2	[Bash-autocompletion] Auto complete cc1 options if -cc1 is specified Summary: We don't want to autocomplete flags whose Flags class has `NoDriverOption` when argv[1] is not `-cc1`. Another idea for this implementation is to make --autocomplete a cc1 option and handle it in clang Frontend, by porting --autocomplete handler from Driver to Frontend, so that we can handle Driver options and CC1 options in unified manner. Differential Revision: https://reviews.llvm.org/D34770 llvm-svn: 307479	2017-07-08 17:48:59 +00:00
Max Kazantsev	b9edcbcb1d	Re-enable "[IndVars] Canonicalize comparisons between non-negative values and indvars" The patch was reverted due to a bug. The bug was that if the IV is the 2nd operand of the icmp instruction, then the "Pred" variable gets swapped and differs from the instruction's predicate. In this patch we use the original predicate to do the transformation. Also added a test case that exercises this situation. Differentian Revision: https://reviews.llvm.org/D35107 llvm-svn: 307477	2017-07-08 17:17:30 +00:00
Sanjay Patel	15689aeae9	[LoopVectorize] partly revert r307475 Bots are failing because of the additional checks. llvm-svn: 307476	2017-07-08 16:34:46 +00:00
Sanjay Patel	28f53ef44d	[LoopVectorize] auto-generate complete checks; NFC I'm looking at a cmp transform in InstCombine that would affect these tests, but it's hard to know if it makes things better or worse without seeing the full IR. OTOH, maybe these tests shouldn't be running a bunch of transform passes in the first place? llvm-svn: 307475	2017-07-08 16:10:42 +00:00
Simon Pilgrim	d053634d6f	Fix -Wimplicit-fallthrough warning. NFCI. llvm-svn: 307473	2017-07-08 15:26:26 +00:00
Sanjay Patel	18ee908ca2	[x86] add SBB optimization for SETBE (ule) condition code x86 scalar select-of-constants (Cond ? C1 : C2) combining/lowering is a mess with missing optimizations. We handle some patterns, but miss logical variants. To clean that up, we should convert all select-of-constants to logic/math and enhance the combining for the expected patterns from that. Selecting 0 or -1 needs extra attention to produce the optimal code as shown here. Attempt to verify that all of these IR forms are logically equivalent: http://rise4fun.com/Alive/plxs Earlier steps in this series: rL306040 rL306072 rL307404 (D34652) As acknowledged in the earlier review, there's a possibility that some Intel uarch would prefer to produce an xor to clear the fake register operand with sbb %eax, %eax. This will likely need to be addressed in a separate pass. llvm-svn: 307471	2017-07-08 14:04:48 +00:00
Kamil Rytarowski	affab3047e	[Solaris] get rid of _RESTRICT_KYWD warning during the build Summary: (re)definition of _RESTRICT_KYWD rightfully causes a warning message during the Solaris build. This hack is not needed if build compiler is properly configured (.e.g /usr/bin/gcc) so just remove it. Reviewers: ro, mgorny, krytarowski, joerg Reviewed By: joerg Subscribers: quenelle, llvm-commits Patch by Fedor Sergeev (Oracle). Differential Revision: https://reviews.llvm.org/D35054 llvm-svn: 307469	2017-07-08 11:27:56 +00:00
Craig Topper	ffe672d200	[X86] In getHostCPUName, remove some code that changes some AMD CPU names based on features not being enabled. The CPU name is really just used for scheduler and other microarchitectural optimizations. The feature flags should be determined by getHostCPUFeatures which should always be used with getHostCPUName. Trying to alter CPU name strings to control features just isn't practical. Most of these types of things were removed from Intel CPUs a while ago. This is part of my plan to bring compiler-rt's cpu_model.c file up to date with the equivalent functionality in libgcc. A lot of the code in that file is copied from Host.cpp and we want to keep them reasonably in sync. llvm-svn: 307467	2017-07-08 06:44:36 +00:00
Craig Topper	1f9d3c0612	[X86] Correct the BDVER4 model numbers to include 0x70-0x7f. According to wikipedia and some other googling suggests these should also be considered as BDVER4. llvm-svn: 307466	2017-07-08 06:44:35 +00:00
Craig Topper	2ace153a82	[X86] Minor formatting fix. NFC llvm-svn: 307465	2017-07-08 06:44:34 +00:00
Craig Topper	c6bbe4becb	[X86] Use 'unsigned' instead of 'unsigned int' for consistency in the X86 portion of Host.cpp. llvm-svn: 307463	2017-07-08 05:16:14 +00:00
Craig Topper	bb8c799e1a	[X86] Cleanup some CPUID usage in getAvailableFeatures. We should make sure leaf 1 is available before accessing it. Same with leaf 0x80000001. llvm-svn: 307462	2017-07-08 05:16:13 +00:00
Eric Beckmann	c8dba240b1	Revert "Revert "Revert "Revert "Switch external cvtres.exe for llvm's own resource library."""" This reverts commit 147f45ff24456aea59575fa4ac16c8fa554df46a. Revert "Revert "Revert "Revert "Replace trivial use of external rc.exe by writing our own .res file."""" This reverts commit 61a90a67ed54a1f0dfeab457b65abffa129569e4. The patches were intially reverted because they were causing a failure on CrWinClangLLD. Unfortunately, this was done haphazardly and didn't compile, so the revert was reverted again quickly to fix this. One that was done, the revert of the revert was itself reverted. This allowed me to finally fix the actual bug in r307452. This patch re-enables the code path that had originally been causing the bug, now that it (should) be fixed. llvm-svn: 307460	2017-07-08 03:06:10 +00:00
Eric Christopher	8737f0650c	Remove a variable that was only used in asserts and had a duplicate copy in something we did use anyhow. llvm-svn: 307457	2017-07-08 01:03:29 +00:00
Eric Beckmann	7c865f004b	Add name offset flags, for parity with cvtres.exe. Summary: The original cvtres.exe sets the high bit when an identifier offset points to a string. Even though this is not mentioned in the spec, and in fact does not seem to cause errors with most cases, for some reason this causes a failure in Chromium where the new resource file is not verified as a new version. This patch sets this high bit flag, and also adds a test case to check that the output of our library is always identical to original cvtres. Reviewers: zturner, ruiu Subscribers: llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D35099 llvm-svn: 307452	2017-07-07 23:23:53 +00:00
Craig Topper	bb4069e439	[InstCombine] Make InstCombine's IRBuilder be passed by reference everywhere Previously the InstCombiner class contained a pointer to an IR builder that had been passed to the constructor. Sometimes this would be passed to helper functions as either a pointer or the pointer would be dereferenced to be passed by reference. This patch makes it a reference everywhere including the InstCombiner class itself so there is more inconsistency. This a large, but mechanical patch. I've done very minimal formatting changes on it despite what clang-format wanted to do. llvm-svn: 307451	2017-07-07 23:16:26 +00:00
Lei Huang	317104183d	[PowerPC] NFC : Common up definitions of isIntS16Immediate and update parameter to int16_t llvm-svn: 307442	2017-07-07 21:12:35 +00:00
David Blaikie	94b98b2caf	ProfData: Fix some unchecked Errors in unit tests The 'NoError' function was meant to be used as the input to ASSERT/EXPECT_TRUE, but it is easy to forget this (it could be annotated with nodiscard to help this) so many sites that look like they're checked are not (& silently discard the failure). Only one site actually has an Error sneaking out this way and I've replaced that one with a FIXME+consumeError. The rest of the code has been modified to use the EXPECT_THAT_ERROR macros Zach introduced a while back. Between the options available this seems OK/good/something to standardize on - though it's difficult to build a matcher that could handle checking for a specific llvm::Error result, so those remain using the custom ErrorEquals (& the nodiscard added to ensure it is not misused as it was previous to this patch). It could still be generalized a bit further (even not as far as a matcher, but at least support multiple kinds of Error, etc) & added to the general Error utility header. llvm-svn: 307440	2017-07-07 21:02:59 +00:00
Dehao Chen	64c46574b0	Increase the import-threshold for crtical functions. Summary: For interative sample-pgo, if a hot call site is inlined in the profiling binary, we should inline it in before profile annotation in the backend. Before that, the compile phase first collects all GUIDs that needs to be imported and creates virtual "hot" call edge in the summary. However, "hot" is not good enough to guarantee the callsites get inlined. This patch introduces "critical" call edge, and assign much higher importing threshold for those edges. Reviewers: tejohnson Reviewed By: tejohnson Subscribers: sanjoy, mehdi_amini, llvm-commits, eraman Differential Revision: https://reviews.llvm.org/D35096 llvm-svn: 307439	2017-07-07 21:01:00 +00:00
Dehao Chen	3a9861420c	Add sample PGO support to ThinLTO new pass manager. Summary: For SamplePGO + ThinLTO, because profile annotation is done twice at both PrepareForThinLTO pipeline and backend compiler, the following changes are needed at the PrepareForThinLTO phase to ensure the IR is not changed dramatically. Otherwise the profile annotation will be inaccurate in the backend compiler. * disable hot-caller heuristic * disable loop unrolling * disable indirect call promotion This will unblock the new PM testing for sample PGO (tools/clang/test/CodeGen/pgo-sample-thinlto-summary.c), which will be covered in another cfe patch. Reviewers: chandlerc, tejohnson, davidxl Reviewed By: tejohnson Subscribers: sanjoy, mehdi_amini, Prazek, inglorion, llvm-commits Differential Revision: https://reviews.llvm.org/D34895 llvm-svn: 307437	2017-07-07 20:53:10 +00:00
Zachary Turner	3a11fdf8ce	[PDB] More changes to bring lld PDBs to parity with MSVC. 1) Don't write a /src/headerblock stream. This appears to be written conditionally by MSVC, but it's not clear what the condition is. For now, just remove it since we dont' know what it is anyway and the particular pdb we've checked in for the test doesn't have one. 2) Write a valid timestamp for the PDB file signature. This leads to non-reproducible builds, but it matches the default behavior of link, so it should be out default as well. If we need reproducibility, we should add a separate command line option for it that is off by default. 3) Write an empty FPO stream. MSVC seems to always write an FPO stream. This change makes the stream directory match up, although we still need to make the contents of the FPO stream match. llvm-svn: 307436	2017-07-07 20:25:39 +00:00
Anna Thomas	e3872003d0	[LoopUnrollRuntime] Support multiple exit blocks unrolling when prolog remainder generated With the NFC refactoring in rL307417 (git SHA 987dd01), all the logic is in place to support multiple exit/exiting blocks when prolog remainder is generated. This patch removed the assert that multiple exit blocks unrolling is only supported when epilog remainder is generated. Also, added test runs and checks with PROLOG prefix in runtime-loop-multiple-exits.ll test cases. llvm-svn: 307435	2017-07-07 20:12:32 +00:00
Craig Topper	77235d345e	[PatternMatch] Implemenet m_SignMask using Constant::isMinSignedValue instead of doing splat detection and analyzing the resulting APInt. llvm-svn: 307433	2017-07-07 19:56:23 +00:00
Craig Topper	5a06c2264b	[PatternMatch] Implement m_AnyZero using Constant::isZeroValue instead of ORing together isNullValue and isNegativeZeroValue. NFCI llvm-svn: 307432	2017-07-07 19:56:21 +00:00
Craig Topper	2c4018064e	[PatternMatch] Implement m_One and m_AllOnes using Constant::isOneValue/isAllOnesValue instead of doing our own splat detection and checking the resulting APInt. Should result in less compiled code. llvm-svn: 307431	2017-07-07 19:56:20 +00:00
Craig Topper	45f53414ad	[APInt] Add a fastpath for the single word case of isOneValue to match isNullValue, isAllOnesValue, etc. NFCI llvm-svn: 307430	2017-07-07 19:56:18 +00:00
Sanjay Patel	4cea2ec254	[DAGCombiner] use local variable to shorten code; NFCI llvm-svn: 307429	2017-07-07 19:34:42 +00:00
Quentin Colombet	868ef847a6	[RegAllocFast] Don't insert kill flags of super-register for partial kill When reusing a register for a new definition, the fast register allocator used to insert a kill flag at the previous last use of that register to inform later passes that this register is free between the redef and the last use. However, this may be wrong when subregisters are involved. Indeed, a partially redef would have trigger a kill of the full super register, potentially wrongly marking all the other subregisters as free. Given we don't track which lanes are still live, we cannot set the kill flag in such case. Note: This bug has been latent for about 7 years (r104056). llvmg.org/PR33677 llvm-svn: 307428	2017-07-07 19:25:45 +00:00
Quentin Colombet	81551148b7	[RegAllocFast] Add the proper initialize method to use the .mir infrastructure NFC llvm-svn: 307427	2017-07-07 19:25:42 +00:00
Zachary Turner	fe71c546e7	[llvm-pdbutil] Fix build. Some platforms require an explicit specialization of std::hash for PdbRaw_FeaturesSig. Also a test involving case sensitivity needed to be fixed. For now that particular check just accepts any path even if they're completely different. Long term we should output paths in the correct case to match MSVC. llvm-svn: 307426	2017-07-07 19:00:06 +00:00
Davide Italiano	4eb210bdeb	[Local] Update the comment for removeUnreachableBlocks. It referenced a wrong function name, and didn't mention what the second argument did. This should be slightly more accurate now. llvm-svn: 307425	2017-07-07 18:54:14 +00:00
Matthias Braun	af7442abd0	FuzzerUtilDarwin.cpp: We need to pass modifiable strings to posix_spawn This fixes a bug where unmodifiable strings where passed to posix_spawn. This is an attempt to unbreak the greendragon libFuzzer bot. llvm-svn: 307424	2017-07-07 18:53:24 +00:00
Zachary Turner	448dea419c	Use windows path syntax when writing PDB module name. Without this we would just append whatever the user wrote on the command line, so if we're in C:\foo and we run lld-link bar/baz.obj, we would write C:\foo\bar/baz.obj in various places in the PDB. MSVC linker does not do this, so we shouldn't either. This fixes some differences in the diff test, so we update the test as well. Differential Revision: https://reviews.llvm.org/D35092 llvm-svn: 307423	2017-07-07 18:46:14 +00:00
Zachary Turner	c1e93e5fa4	Fix some differences between lld and MSVC generated PDBs. A couple of things were different about our generated PDBs. 1) We were outputting the wrong Version on the PDB Stream. The version we were setting was newer than what MSVC is setting. It's not clear what the implications are, but we change LLD to use PdbImplVC70, as MSVC does. 2) For the optional debug stream indices in the DBI Stream, we were outputting 0 to mean "the stream is not present". MSVC outputs uint16_t(-1), which is the "correct" way to specify that a stream is not present. So we fix that as well. 3) We were setting the PDB Stream signature to 0. This is supposed to be the result of calling time(nullptr). Although this leads to non-deterministic builds, a better way to solve that is by having a command line option explicitly for generating a reproducible build, and have the default behavior of lld-link match the default behavior of link. To test this, I'm making use of the new and improved `pdb diff` sub command. To make it suitable for writing tests against, I had to modify the diff subcommand slightly to print less verbose output. Previously it would always print \| <column> \| <value1> \| <value2> \| which is quite verbose, and the values are fragile. All we really want to know is "did we produce the same value as link?" So I added command line options to print a single character representing the result status (different, identical, equivalent), and another to hide the value display. Note that just inspecting the diff output used to write the test, you can see some things that are obviously wrong. That is just reflective of the fact that this is the state of affairs today, not that we're asserting that this is "correct". We can use this as a starting point to discover differences, fix them, and update the test. Differential Revision: https://reviews.llvm.org/D35086 llvm-svn: 307422	2017-07-07 18:45:56 +00:00
Zachary Turner	f3b4b2d89d	[llvm-pdbutil] Improve diff mode. We're getting to the point that some MS tools (e.g. DIA) can recognize our PDBs but others (e.g. link.exe) cannot. I think the way forward is to improve our tooling to help us find differences more easily. For example, if we can compile the same program with clang-cl and cl and have a tool tell us all the places where the PDBs differ, this could tell us what we're doing wrong. It's tricky though, because there are a lot of "benign" differences in a PDB. For example, if the string table in one PDB consists of "foo" followed by "bar" and in the other PDB it consists of "bar" followed by "foo", this is not necessarily a critical difference, as long as the uses of these strings also refer to the correct location. On the other hand, if the second PDB doesn't even contain the string "foo" at all, this is a critical difference. diff mode has been in llvm-pdbutil for quite a while, but because of the above challenge along with some others, it's been hard to make it useful. I think this patch addresses that. It looks for all the same things, but it now prints the output in tabular format (carefully formatted and aligned into tables and fields), and it highlights critical differences in red, non-critical differences in yellow, and identical fields in green. This makes it easy to spot the places we differ, and the general concept of outputting arbitrary fields in tabular format can be extended to provide analysis into many of the different types of information that show up in a PDB. Differential Revision: https://reviews.llvm.org/D35039 llvm-svn: 307421	2017-07-07 18:45:37 +00:00
Craig Topper	1491bdfd78	vim: add 'builtin', 'nobuiltin', 'nonnull', and 'speculatable' to the keyword list. llvm-svn: 307419	2017-07-07 18:28:45 +00:00
Gor Nishanov	8cdf648795	[cloning] Do not duplicate types when cloning functions Summary: This is an addon to the change rl304488 cloning fixes. (Originally rl304226 reverted rl304228 and reapplied rl304488 https://reviews.llvm.org/D33655) rl304488 works great when DILocalVariables that comes from the inlined function has a 'unique-ed' type, but, in the case when the variable type is distinct we will create a second DILocalVariable in the scope of the original function that was inlined. Consider cloning of the following function: ``` define private void @f() !dbg !5 { %1 = alloca i32, !dbg !11 call void @llvm.dbg.declare(metadata i32* %1, metadata !14, metadata !12), !dbg !18 ret void, !dbg !18 } !14 = !DILocalVariable(name: "inlined", scope: !15, file: !6, line: 5, type: !17) ; came from an inlined function !15 = distinct !DISubprogram(name: "inlined", linkageName: "inlined", scope: null, file: !6, line: 8, type: !7, isLocal: true, isDefinition: true, scopeLine: 9, isOptimized: false, unit: !0, variables: !16) !16 = !{!14} !17 = distinct !DICompositeType(tag: DW_TAG_structure_type, name: "some_struct", size: 32, align: 32) ``` Without this fix, when function 'f' is cloned, we will create another DILocalVariable for "inlined", due to its type being distinct. ``` define private void @f.1() !dbg !23 { %1 = alloca i32, !dbg !26 call void @llvm.dbg.declare(metadata i32* %1, metadata !28, metadata !12), !dbg !30 ret void, !dbg !30 } !14 = !DILocalVariable(name: "inlined", scope: !15, file: !6, line: 5, type: !17) !15 = distinct !DISubprogram(name: "inlined", linkageName: "inlined", scope: null, file: !6, line: 8, type: !7, isLocal: true, isDefinition: true, scopeLine: 9, isOptimized: false, unit: !0, variables: !16) !16 = !{!14} !17 = distinct !DICompositeType(tag: DW_TAG_structure_type, name: "some_struct", size: 32, align: 32) ; !28 = !DILocalVariable(name: "inlined", scope: !15, file: !6, line: 5, type: !29) ; OOPS second DILocalVariable !29 = distinct !DICompositeType(tag: DW_TAG_structure_type, name: "some_struct", size: 32, align: 32) ``` Now we have two DILocalVariable for "inlined" within the same scope. This result in assert in AsmPrinter/DwarfDebug.h:131: void llvm::DbgVariable::addMMIEntry(const llvm::DbgVariable &): Assertion `V.Var == Var && "conflicting variable"' failed. (Full example: See: https://bugs.llvm.org/show_bug.cgi?id=33492) In this change we prevent duplication of types so that when a metadata for DILocalVariable is cloned it will get uniqued to the same metadate node as an original variable. Reviewers: loladiro, dblaikie, aprantl, echristo Reviewed By: loladiro Subscribers: EricWF, llvm-commits Differential Revision: https://reviews.llvm.org/D35106 llvm-svn: 307418	2017-07-07 18:24:20 +00:00
Anna Thomas	734ab3f75c	[LoopUnrollRuntime] NFC: use the precomputed loop exit in ConnectProlog Minor refactoring to use the preexisting loop exit that's already calculated. We do not need to recompute the loop exit in ConnectProlog. Apart from avoiding redundant computation, this is required for supporting multiple loop exits when Prolog remainder loops are generated. llvm-svn: 307417	2017-07-07 18:05:28 +00:00
Tony Jiang	c260e0eb56	[PPC CodeGen] Expand the bitreverse.i32 intrinsic. Differential Revision: https://reviews.llvm.org/D33572 Fix PR: https://bugs.llvm.org/show_bug.cgi?id=33093 llvm-svn: 307413	2017-07-07 16:41:55 +00:00
Simon Pilgrim	cb07d67a5c	Fix some more -Wimplicit-fallthrough warnings. NFCI. llvm-svn: 307411	2017-07-07 16:40:06 +00:00
Matthew Simpson	12eaef75ce	[ARM] Implement interleaved access bug fix from r306334 r306334 fixed a bug in AArch64 dealing with wide interleaved accesses having pointer types. The bug also exists in ARM, so this patch copies over the fix. llvm-svn: 307409	2017-07-07 16:15:05 +00:00
Sam Kolton	10ac2fd2eb	[AMDGPU] Assembler: refactor convert methods (VOP3 and MIMG) Summary: Simplified converter methods for VOP3 and MIMG. Reviewers: dp, artem.tamazov Subscribers: arsenm, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, vpykhtin, t-tye Differential Revision: https://reviews.llvm.org/D35047 llvm-svn: 307407	2017-07-07 15:21:52 +00:00
Rafael Espindola	7413496f8d	Fix variable names. NFC. llvm-svn: 307406	2017-07-07 15:20:55 +00:00
Sanjay Patel	dd36f75733	[x86] add SBB optimization for SETAE (uge) condition code x86 scalar select-of-constants (Cond ? C1 : C2) combining/lowering is a mess with missing optimizations. We handle some patterns, but miss logical variants. To clean that up, we should convert all select-of-constants to logic/math and enhance the combining for the expected patterns from that. DAGCombiner already has the foundation to allow the transforms, so we just need to fill in the holes for x86 math op lowering. Selecting 0 or -1 needs extra attention to produce the optimal code as shown here. Attempt to verify that all of these IR forms are logically equivalent: http://rise4fun.com/Alive/plxs Earlier steps in this series: rL306040 rL306072 Differential Revision: https://reviews.llvm.org/D34652 llvm-svn: 307404	2017-07-07 14:56:20 +00:00
Sanjay Patel	1bbdf4e11a	[DemandedBits] fix formatting; NFC llvm-svn: 307403	2017-07-07 14:39:26 +00:00
Dmitry Preobrazhensky	b2d24e23ce	[AMDGPU][mc][gfx9] Added support of op_sel/op_sel_hi for V_MAD_MIX* See https://bugs.llvm.org//show_bug.cgi?id=33595 Reviewers: vpykhtin, artem.tamazov, arsenm Differential Revision: https://reviews.llvm.org/D35021 llvm-svn: 307402	2017-07-07 14:29:06 +00:00
Chad Rosier	3f02123f7c	[ValueTracking] Fix the identity case (LHS => RHS) when the LHS is false. Prior to this commit both of the added test cases were passing. However, in the latter case (test7) we were doing a lot more work to arrive at the same answer (i.e., we were using isImpliedCondMatchingOperands() to determine the implication.). llvm-svn: 307400	2017-07-07 13:55:55 +00:00
Andrew V. Tischenko	a2ab3ed0df	NFC: I simply added CHECK-LABEL to prevent false matches in the tests. llvm-svn: 307397	2017-07-07 13:41:33 +00:00
Simon Pilgrim	16ee09bf72	[Lanai] Fix -Wimplicit-fallthrough warning. NFCI. llvm-svn: 307396	2017-07-07 13:22:47 +00:00
Simon Pilgrim	087e87d595	[Hexagon] Fix some more -Wimplicit-fallthrough warnings. NFCI. llvm-svn: 307395	2017-07-07 13:21:43 +00:00
Simon Pilgrim	8b4dc53326	[AArch64] Fix -Wimplicit-fallthrough warnings. NFCI. llvm-svn: 307393	2017-07-07 13:03:28 +00:00
Anna Thomas	cace053fb5	[SafepointIRVerifier] Avoid false positives in GC verifier for compare between pointers Today the safepoint IR verifier catches some unrelocated uses of base pointers that are actually valid. With this change, we narrow down the set of false positives. Specifically, the verifier knows about compares to null and compares between 2 unrelocated pointers. Reviewed by: skatkov Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35057 llvm-svn: 307392	2017-07-07 13:02:29 +00:00
Florian Hahn	d4550baf3b	[AArch64] Use 16 bytes as preferred function alignment on Cortex-A57. Summary: This change gives a 0.89% speed on execution time, a 0.94% improvement in benchmark scores and a 0.62% increase in binary size on a Cortex-A57. These numbers are the geomean results on a wide range of benchmarks from the test-suite, SPEC2000, SPEC2006 and a range of proprietary suites. The software optimization guide for the Cortex-A57 recommends 16 byte branch alignment. Reviewers: t.p.northover, mcrosier, javed.absar, kristof.beyls, sbaranga Reviewed By: kristof.beyls Subscribers: aemerson, rengolin, llvm-commits Differential Revision: https://reviews.llvm.org/D34954 llvm-svn: 307389	2017-07-07 10:43:01 +00:00
Daniel Jasper	66a6d0146a	Fix uninitalized memory access introduced in r307350. Found by MSAN :). llvm-svn: 307383	2017-07-07 10:23:13 +00:00
Simon Pilgrim	6c1695c6f7	[PowerPC] Fix -Wimplicit-fallthrough warnings. NFCI. llvm-svn: 307382	2017-07-07 10:21:44 +00:00
Simon Pilgrim	0f5b35059d	[AMDGPU] Fix -Wimplicit-fallthrough warnings. NFCI. llvm-svn: 307381	2017-07-07 10:18:57 +00:00
Florian Hahn	e3666ec9d6	[AArch64] Use 16 bytes as preferred function alignment on Cortex-A72. Summary: This change gives a 0.34% speed on execution time, a 0.61% improvement in benchmark scores and a 0.57% increase in binary size on a Cortex-A72. These numbers are the geomean results on a wide range of benchmarks from the test-suite, SPEC2000, SPEC2006 and a range of proprietary suites. The software optimization guide for the Cortex-A72 recommends 16 byte branch alignment. Reviewers: t.p.northover, kristof.beyls, rengolin, sbaranga, mcrosier, javed.absar Reviewed By: kristof.beyls Subscribers: llvm-commits, aemerson Differential Revision: https://reviews.llvm.org/D34961 llvm-svn: 307380	2017-07-07 10:15:49 +00:00
Simon Pilgrim	ecedd8d803	[Sparc] Fix -Wimplicit-fallthrough warning. NFCI. llvm-svn: 307378	2017-07-07 10:14:46 +00:00
Alex Lorenz	215be39cab	Update the Windows version of updateTripleOSVersion to account for changes in r307372 llvm-svn: 307377	2017-07-07 10:08:52 +00:00
Simon Pilgrim	8c4069e842	[SystemZ] Fix -Wimplicit-fallthrough warnings. NFCI. llvm-svn: 307376	2017-07-07 10:07:09 +00:00
Simon Pilgrim	ce1fb22c6a	[Arm] Fix -Wimplicit-fallthrough warnings. NFCI. llvm-svn: 307375	2017-07-07 10:05:45 +00:00
Simon Pilgrim	adb80fbaf4	[Hexagon] Fix -Wimplicit-fallthrough warnings. NFCI. llvm-svn: 307374	2017-07-07 10:04:12 +00:00
Alex Lorenz	3803df3dcd	[Support] sys::getProcessTriple should return a macOS triple using the system's version of macOS sys::getProcessTriple returns LLVM_HOST_TRIPLE, whose system version might not be the actual version of the system on which the compiler running. This commit ensures that, for macOS, sys::getProcessTriple returns a triple with the system's macOS version. rdar://33177551 Differential Revision: https://reviews.llvm.org/D34446 llvm-svn: 307372	2017-07-07 09:53:47 +00:00
Florian Hahn	9872a6aaad	[AArch64] Add test case for preferred function alignment (NFC). Reviewers: evandro, joelkevinjones, mcrosier Reviewed By: joelkevinjones, mcrosier Subscribers: mcrosier, aemerson, llvm-commits, rengolin, evandro, javed.absar, joelkevinjones, kristof.beyls Differential Revision: https://reviews.llvm.org/D34951 llvm-svn: 307369	2017-07-07 09:17:53 +00:00
Diana Picus	77367378ac	[ARM] GlobalISel: Fixup r307365 Rename member DebugLoc -> DbgLoc (so it doesn't conflict with the class name). llvm-svn: 307366	2017-07-07 08:53:27 +00:00
Diana Picus	5b91653840	[ARM] GlobalISel: Select hard G_FCMP for s32 We lower to a sequence consisting of: - MOVi 0 into a register - VCMPS to do the actual comparison and set the VFP flags - FMSTAT to move the flags out of the VFP unit - MOVCCi to either use the "zero register" that we have previously set with the MOVi, or move 1 into the result register, based on the values of the flags As was the case with soft-float, for some predicates (one, ueq) we actually need two comparisons instead of just one. When that happens, we generate two VCMPS-FMSTAT-MOVCCi sequences and chain them by means of using the result of the first MOVCCi as the "zero register" for the second one. This is a bit overkill, since one comparison followed by two non-flag-setting conditional moves should be enough. In any case, the backend manages to CSE one of the comparisons away so it doesn't matter much. Note that unlike SelectionDAG and FastISel, we always use VCMPS, and not VCMPES. This makes the code a lot simpler, and it also seems correct since the LLVM Lang Ref defines simple true/false returns if the operands are QNaN's. For SNaN's, even VCMPS throws an Invalid Operand exception, so they won't be slipping through unnoticed. Implementation-wise, this introduces a template so we can share the same code that we use for handling integer comparisons, since the only differences are in the details (exact opcodes to be used etc). Hopefully this will be easy to extend to s64 G_FCMP. llvm-svn: 307365	2017-07-07 08:39:04 +00:00
Craig Topper	15e3de1c1e	[TableGen] Cleanup capturing of instruction namespace for the fast isel emitter to remove a std::string and duplicated code. NFC llvm-svn: 307363	2017-07-07 06:22:36 +00:00
Craig Topper	86a9aee88f	[TableGen] Use StringRef instead of std::string for CodeGenInstruction namespace. NFC llvm-svn: 307362	2017-07-07 06:22:35 +00:00
Craig Topper	3d0463ff82	[TableGen] Add a proper namespace to an Instruction in an AsmMatcher test. This is required after r307358. llvm-svn: 307361	2017-07-07 05:50:45 +00:00
Rafael Espindola	77e87b3444	Reduce code duplication. By addding a mapNameToDWARFSection we only need to check section names in one place. llvm-svn: 307359	2017-07-07 05:36:53 +00:00
Craig Topper	2b347eb15a	[TableGen] Fix some mismatches in the use of Namespace fields versus Target name in some of our emitters. Some of our emitters were using the name of the Target to reference things that were created by others emitters using Namespace. Apparently all targets have the same Target name as their instruction and register Namespace field? Someone on IRC had a target that didn't do this and was getting build errors. This patch is a necessary, but maybe not sufficient fix. llvm-svn: 307358	2017-07-07 05:19:25 +00:00
Zachary Turner	6c4bfba8f3	[PDB] Teach libpdb to write DBI Stream ECNames. Based strictly on the name, this seems to have something to do width edit & continue. The goal of this patch has nothing to do with supporting edit and continue though. msvc link.exe writes very basic information into this area even when not compiling with support for E&C, and so the goal here is to bring lld-link to parity. Since we cannot know what assumptions standard tools make about the content of PDB files, we need to be as close as possible. This ECNames data structure is a standard PDB string hash table. link.exe puts a single string into this hash table, which is the full path to the PDB file on disk. It then references this string from the module descriptor for the compiler generated `* Linker *` module. With this patch, lld-link will generate the exact same sequence of bytes as MSVC link for this subsection for a given object file input (as reported by `llvm-pdbutil bytes -ec`). llvm-svn: 307356	2017-07-07 05:04:36 +00:00
Lang Hames	b76aa4ac0a	[Orc] Add missing return value (left out in r307350). llvm-svn: 307354	2017-07-07 03:22:57 +00:00
Tony Tye	d5b1cbf046	Correct GFX9 processor names. Differential Revision: https://reviews.llvm.org/D33736 llvm-svn: 307353	2017-07-07 03:10:01 +00:00
Matthias Braun	eeb1516884	RegisterScavenging: Fix PR33687 When scavenging for a use in instruction MI, we will reload after that instruction and hence cannot spill uses/defs of this instruction. This fixes http://llvm.org/PR33687 llvm-svn: 307352	2017-07-07 03:02:18 +00:00
Matthias Braun	1b54aa5879	LiveRegUnits: Rename accumulateBackward()->accumulate() Contrary to the stepForward()/stepBackward() method accumulate() doesn't have a direction as defs, uses and clobbers all have the same effect. Also improve the documentation comment. llvm-svn: 307351	2017-07-07 03:02:17 +00:00
Lang Hames	4ce98662e7	[ORC] Errorize the ORC APIs. This patch updates the ORC layers and utilities to return and propagate llvm::Errors where appropriate. This is necessary to allow ORC to safely handle error cases in cross-process and remote JITing. llvm-svn: 307350	2017-07-07 02:59:13 +00:00
Yaxun Liu	b909f11a31	[InferAddressSpaces] Fix assertion about null pointer InferAddressSpaces does not check address space in collectFlatAddressExpressions, which causes values with non flat address space put into Postorder and causes assertion in cloneValueWithNewAddressSpace. This patch fixes assertion in OpenCL 2.0 conformance test generic_address_space subtest for amdgcn target. Differential Revision: https://reviews.llvm.org/D34991 llvm-svn: 307349	2017-07-07 02:40:13 +00:00
Sam Clegg	5e3d33a781	[WebAssembly] Support weak defined symbols Model weakly defined symbols as symbols that are both exports and imported and marked as weak. Local references to the symbols refer to the import but the linker can resolve this to the weak export if not strong symbol is found at link time. Differential Revision: https://reviews.llvm.org/D35029 llvm-svn: 307348	2017-07-07 02:01:29 +00:00
Sean Fertile	9cd1cdf814	Extend memcpy expansion in Transform/Utils to handle wider operand types. Adds loop expansions for known-size and unknown-sized memcpy calls, allowing the target to provide the operand types through TTI callbacks. The default values for the TTI callbacks use int8 operand types and matches the existing behaviour if they aren't overridden by the target. Differential revision: https://reviews.llvm.org/D32536 llvm-svn: 307346	2017-07-07 02:00:06 +00:00
Evgeniy Stepanov	7d3eeaaa96	Revert r307342, r307343. Revert "Copy arguments passed by value into explicit allocas for ASan." Revert "[asan] Add end-to-end tests for overflows of byval arguments." Build failure on lldb-x86_64-ubuntu-14.04-buildserver. Test failure on clang-cmake-aarch64-42vma and sanitizer-x86_64-linux-android. llvm-svn: 307345	2017-07-07 01:31:23 +00:00
Evgeniy Stepanov	2a7a4bc1c9	Copy arguments passed by value into explicit allocas for ASan. ASan determines the stack layout from alloca instructions. Since arguments marked as "byval" do not have an explicit alloca instruction, ASan does not produce red zones for them. This commit produces an explicit alloca instruction and copies the byval argument into the allocated memory so that red zones are produced. Patch by Matt Morehouse. Differential revision: https://reviews.llvm.org/D34789 llvm-svn: 307342	2017-07-07 00:48:25 +00:00
Anna Thomas	ccce853863	[SafepointIRVerifier] NFC: Refactor code for identifying exclusive base type Added a new Enum to identify if the base pointer is exclusively null or exlusively some constant or not exclusively any constant. Converted the base pointer identification method from recursive to iterative form. llvm-svn: 307340	2017-07-07 00:40:37 +00:00
George Karpenkov	22a402fb66	[lit] Modify LIT to accept environment variable LIT_FILTER to select tests. This is especially useful when lit is invoked indirectly by the build system, and additional arguments can not be easily specified. Differential Revision: https://reviews.llvm.org/D35091 llvm-svn: 307339	2017-07-07 00:22:11 +00:00
Wei Mi	7586755013	[ConstHoisting] Turn on consthoist-with-block-frequency by default. Using profile information to guide consthoisting is generally helpful for performance, so the patch turns it on by default. No compile time or perf regression were found using spec2000 and spec2006 on x86. Some significant improvement (>20%) was seen on internal benchmarks. Differential Revision: https://reviews.llvm.org/D35063 llvm-svn: 307338	2017-07-07 00:11:05 +00:00
Michael Kuperstein	20d8e4ef76	Reverting r307326 because it breaks clang tests. llvm-svn: 307334	2017-07-06 23:24:39 +00:00
Craig Topper	cb22039bee	[InstCombine] No need to pass DataLayout to helper functions if we're passing the InstCombiner object. We can just ask it for the DataLayout. NFC llvm-svn: 307333	2017-07-06 23:18:43 +00:00
Craig Topper	4853c4304b	[InstCombine] Remove unused arguments from some helper functions. NFC llvm-svn: 307332	2017-07-06 23:18:42 +00:00
Craig Topper	2bb9f0f620	[InstCombine] Change a couple helper functions to only take the IRBuilder as an argument and not the whole InstCombiner object. NFC llvm-svn: 307331	2017-07-06 23:18:41 +00:00
Wei Mi	20526b2725	[ConstHoisting] choose to hoist when frequency is the same. The patch is to adjust the strategy of frequency based consthoisting: Previously when the candidate block has the same frequency with the existing blocks containing a const, it will not hoist the const to the candidate block. For that case, now we change the strategy to hoist the const if only existing blocks have more than one block member. This is helpful for reducing code size. Differential Revision: https://reviews.llvm.org/D35084 llvm-svn: 307328	2017-07-06 22:32:27 +00:00
Michael Kuperstein	b9fc48da83	[NVPTX] Add lowering of i128 params. The patch adds support of i128 params lowering. The changes are quite trivial to support i128 as a "special case" of integer type. With this patch, we lower i128 params the same way as aggregates of size 16 bytes: .param .b8 _ [16]. Currently, NVPTX can't deal with the 128 bit integers: * in some cases because of failed assertions like ValVTs.size() == OutVals.size() && "Bad return value decomposition" * in other cases emitting PTX with .i128 or .u128 types (which are not valid [1]) [1] http://docs.nvidia.com/cuda/parallel-thread-execution/index.html#fundamental-types Differential Revision: https://reviews.llvm.org/D34555 Patch by: Denys Zariaiev (denys.zariaiev@gmail.com) llvm-svn: 307326	2017-07-06 22:18:54 +00:00
Lang Hames	65f69b0244	[ORC] Add missing <memory> include for shared_ptr. Accidentally left out of r307319. llvm-svn: 307322	2017-07-06 22:02:49 +00:00
David L. Jones	d4053fb2ec	Change remaining references to lit.util.capture to use subprocess.check_output. Summary: The capture() function was removed in r306625. This should fix PGO breakages reported by Michael Zolotukhin. Reviewers: mzolotukhin Subscribers: sanjoy, llvm-commits Differential Revision: https://reviews.llvm.org/D35088 llvm-svn: 307320	2017-07-06 21:46:47 +00:00
Lang Hames	2c0403e65c	[ORC] Update GlobalMappingLayer::addModuleSet to addModule. This layer was accidentally left out of r306166. llvm-svn: 307319	2017-07-06 21:33:48 +00:00
Rafael Espindola	8c1fc04f3f	Use @LINE in two more tests. llvm-svn: 307318	2017-07-06 21:33:23 +00:00
Martin Storsjo	68d0fcd7aa	[COFF, AArch64] Set the private label prefix to .L This fixes calls to external functions starting with a capital L, fixing errors like this: fatal error: error in backend: assembler label 'LocalFree' can not be undefined Differential Revision: https://reviews.llvm.org/D35079 llvm-svn: 307317	2017-07-06 21:08:34 +00:00
Matt Arsenault	9aa45f047f	AMDGPU: Add macro fusion schedule DAG mutation Try to increase opportunities to shrink vcc uses. llvm-svn: 307313	2017-07-06 20:57:05 +00:00
Matt Arsenault	a81198d82d	AMDGPU: Minor cleanup of shrinking logic llvm-svn: 307312	2017-07-06 20:56:59 +00:00
Matt Arsenault	60b91e0ba2	AMDGPU: Remove unnecessary IR from MIR tests llvm-svn: 307311	2017-07-06 20:56:57 +00:00
Reid Kleckner	c58b7c5973	[lit] Factor out some shell input/output redirection logic, NFC This is a very light refactoring aimed at improving readability. There is definitely still room for improvement here. llvm-svn: 307310	2017-07-06 20:40:27 +00:00
Stanislav Mekhanoshin	9d7b1c9ddb	[AMDGPU] Always use rcp + mul with fast math Regardless of relaxation options such as -cl-fast-relaxed-math we are producing rather long code for fdiv via amdgcn_fdiv_fast intrinsic. This intrinsic is used to replace fdiv with 2.5ulp metadata and does not handle denormals, thus believed to be fast. An fdiv instruction can also have fast math flag either by itself or together with fpmath metadata. Clang used with a relaxation flag always produces both metadata and fast flag: %div = fdiv fast float %v, %0, !fpmath !12 !12 = !{float 2.500000e+00} Current implementation ignores fast flag and favors metadata. An instruction with just fast flag would be lowered to a fastest rcp + mul, but that never happen on practice because of described mutual clang and BE behavior. This change allows an "fdiv fast" to be always lowered as rcp + mul. Differential Revision: https://reviews.llvm.org/D34844 llvm-svn: 307308	2017-07-06 20:34:21 +00:00
Davide Italiano	f4891d29f8	[lib/LTO] Add a comment to explain where we set the linkage in the summary. Pointed out by Teresa! llvm-svn: 307305	2017-07-06 20:04:20 +00:00
Chad Rosier	a72a9ff557	[ValueTracking] Support icmps fed by 'and' and 'or'. This patch adds support for handling some forms of ands and ors in ValueTracking's isImpliedCondition API. PR33611 https://reviews.llvm.org/D34901 llvm-svn: 307304	2017-07-06 20:00:25 +00:00
Davide Italiano	6a5fbe52fa	[LTO] Fix the interaction between linker redefined symbols and ThinLTO This is the same as r304719 but for ThinLTO. The substantial difference is that in this case we don't have whole visibility, just the summary. In the LTO case, when we got the resolution for the input file we could just see if the linker told us whether a symbol was linker redefined (using --wrap or --defsym) and switch the linkage directly for the GV. Here, we have the summary. So, we record that the linkage changed from <whatever it was> to $weakany to prevent IPOs across this symbol boundaries and actually just switch the linkage at FunctionImport time. This patch should also fixes the lld bits (as all the scaffolding for communicating if a symbol is linker redefined should be there & should be the same), but I'll make sure to add some tests there as well. Fixes PR33192. Differential Revision: https://reviews.llvm.org/D35064 llvm-svn: 307303	2017-07-06 19:58:26 +00:00
Aditya Nandakumar	1745121a45	[GISel]: Enhance the MachineIRBuilder API Allows the MachineIRBuilder APIs to directly create registers (based on LLT or TargetRegisterClass) as well as accept MachineInstrBuilders and implicitly converts to register(with getOperand(0).getReg()). Eg usage: LLT s32 = LLT::scalar(32); auto C32 = Builder.buildConstant(s32, 32); auto Tmp = Builder.buildInstr(TargetOpcode::G_SUB, s32, C32, OtherReg); auto Tmp2 = Builder.buildInstr(Opcode, DstReg, Builder.buildConstant(s32, 31)); .... Only a few methods added for now. Reviewed by Tim llvm-svn: 307302	2017-07-06 19:40:07 +00:00
Simon Pilgrim	a80cb1d7a7	[X86][SSE] Tests for bitcasting iX integers to vXi1 boolean vectors Including sign/zero extension to legal types llvm-svn: 307301	2017-07-06 19:33:10 +00:00
Rafael Espindola	5e8d5af0b5	Add @LINE to checks in a test. This makes it a lot easier to see which error failed a check. llvm-svn: 307300	2017-07-06 19:09:35 +00:00
Chris Lattner	d37992f035	remove an unused empty file. llvm-svn: 307299	2017-07-06 19:06:13 +00:00
David Blaikie	cf9d52c690	Prototype: Reduce llvm-profdata merge memory usage further The InstrProfWriter already stores the name and hash of the record in the nested maps it uses for lookup while merging - this data is duplicated in the value within the maps. Refactor the InstrProfRecord to use a nested struct for the counters themselves so that InstrProfWriter can use this nested struct alone without the name or hash duplicated there. This work is incomplete, but enough to demonstrate the value (around a 50% decrease in memory usage for a large test case (10GB -> 5GB)). Though most of that decrease is probably from removing the SoftInstrProfError as well, but I haven't implemented a replacement for it yet. (it needs to go with the counters, because the operations on the counters - merging, etc, are where the failures are - unlike the name/hash which are totally unused by those counter-related operations and thus easy to split out) Ongoing discussion about removing SoftInstrProfError as a field of the InstrProfRecord is happening on the thread that added it - including the possibility of moving back towards an earlier version of that proposed patch that passed SoftInstrProfError through the various APIs, rather than as a member of InstrProfRecord. Reviewers: davidxl Differential Revision: https://reviews.llvm.org/D34838 llvm-svn: 307298	2017-07-06 19:00:12 +00:00
Mandeep Singh Grang	fbbcea5dee	[llvm] Separate out reverse iteration flag into its own header Summary: This will ease out adding reverse iteration flags to other containers by simply including the header. Reviewers: mehdi_amini, dexonsmith, davide, dblaikie Reviewed By: dblaikie Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35042 llvm-svn: 307297	2017-07-06 18:52:16 +00:00
Craig Topper	e9bf7ebacf	[InstCombine] Remove include of DIBuilder.h and Dwarf.h as they don't appear to be necessary. llvm-svn: 307295	2017-07-06 18:47:47 +00:00
Leo Li	5499b1b8be	Modify constraints in `llvm::canReplaceOperandWithVariable` Summary: `Instruction::Switch`: only first operand can be set to a non-constant value. `Instruction::InsertValue` both the first and the second operand can be set to a non-constant value. `Instruction::Alloca` return true for non-static allocation. Reviewers: efriedma Reviewed By: efriedma Subscribers: srhines, pirama, llvm-commits Differential Revision: https://reviews.llvm.org/D34905 llvm-svn: 307294	2017-07-06 18:47:05 +00:00
Craig Topper	ca2c87653c	[Constants] Replace calls to ConstantInt::equalsInt(0)/equalsInt(1) with isZero and isOne. NFCI llvm-svn: 307293	2017-07-06 18:39:49 +00:00
Craig Topper	79ab643da8	[Constants] If we already have a ConstantInt*, prefer to use isZero/isOne/isMinusOne instead of isNullValue/isOneValue/isAllOnesValue inherited from Constant. NFCI Going through the Constant methods requires redetermining that the Constant is a ConstantInt and then calling isZero/isOne/isMinusOne. llvm-svn: 307292	2017-07-06 18:39:47 +00:00
Anna Thomas	eb6d5d1950	[LoopUnrollRuntime] Bailout when multiple exiting blocks to the unique latch exit block Currently, we do not support multiple exiting blocks to the latch exit block. However, this bailout wasn't triggered when we had a unique exit block (which is the latch exit), with multiple exiting blocks to that unique exit. Moved the bailout so that it's triggered in both cases and added testcase. llvm-svn: 307291	2017-07-06 18:39:26 +00:00
Craig Topper	47c8f66997	[InstCombine] Remove Builder argument from InstCombiner::tryFactorization. NFC Builder is already a member of the InstCombiner class so we can use it with passing it. llvm-svn: 307290	2017-07-06 18:35:52 +00:00
Simon Pilgrim	0fee3372c9	[X86][SSE] Dropped -mcpu from bitcast+setcc tests Use triple and attribute only for consistency Added SSE2/AVX tests on 256-bit vectors to test PACKSS behaviour llvm-svn: 307289	2017-07-06 18:27:34 +00:00
Simon Pilgrim	8ae7e41bea	Fix spelling in comments. NFCI. llvm-svn: 307288	2017-07-06 18:17:07 +00:00
Peter Collingbourne	c855615831	Bitcode: Include any strings added to the string table in the module hash. Differential Revision: https://reviews.llvm.org/D35037 llvm-svn: 307286	2017-07-06 17:56:01 +00:00
Adam Nemet	8d10129e59	[opt-viewer] Move under tools, install it We weren't installing opt-viewer and co before, this fixes the omission. I am also moving the tools from utils/ to tools/. I believe that this is more appropriate since these tools have matured greatly in the past year through contributions by multiple people (thanks!) so they are ready to become external tools. The tools are installed under <install>/share/opt-viewer/. I am not adding the llvm- prefix. If people feel strongly about adding that, this is probably a good time since the new location will require some mental adjustment anyway. Fixes PR33521 Differential Revision: https://reviews.llvm.org/D35048 llvm-svn: 307285	2017-07-06 17:51:15 +00:00
Reid Kleckner	3f85192930	[PDB] Fill in "Parent" and "End" fields of scope-like symbol records Summary: There are a variety of records that open scopes: function scopes, block scopes, and inlined call site scopes. These symbol records contain Parent and End fields with the offsets of other symbol records. The End field contains the offset of the matching S_END or S_INLINESITE_END record. The Parent field contains the offset of the parent record, or 0 if this is a top-level scope (i.e. a function). With this change, `llvm-pdbutil pretty -all` no longer crashes on PDBs produced by LLD. I haven't tried a real debugger yet. Reviewers: zturner, ruiu Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34898 llvm-svn: 307278	2017-07-06 16:39:32 +00:00
Craig Topper	dfd01ea9ed	[SimplifyCFG] Move a portion of an if statement that should already be implied to an assert Summary: In this code we got to Dom by following the predecessor link of BB. So it stands to reason that BB should also show up as a successor of Dom's terminator right? There isn't a way to have the CFG connect in only one direction is there? Reviewers: jmolloy, davide, mcrosier Reviewed By: mcrosier Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D35025 llvm-svn: 307276	2017-07-06 16:29:43 +00:00
Craig Topper	95e4142f94	[InstCombine] Change helper method to a file local static method. NFC llvm-svn: 307275	2017-07-06 16:24:23 +00:00
Craig Topper	fc42acef92	[InstCombine] Clarify comment to mention other transform that it does. NFC llvm-svn: 307274	2017-07-06 16:24:22 +00:00
Craig Topper	22795de20a	[InstCombine] Add single use checks to SimplifyBSwap to ensure we are really saving instructions Bswap isn't a simple operation so we need to make sure we are really removing a call to it before doing these simplifications. For the case when both LHS and RHS are bswaps I've allowed it to be moved if either LHS or RHS has a single use since that at least allows us to move it later where it might find another bswap to combine with and it decreases the use count on the other side so maybe the other user can be optimized. Differential Revision: https://reviews.llvm.org/D34974 llvm-svn: 307273	2017-07-06 16:24:21 +00:00
Craig Topper	3e1909d797	[InstCombine] Don't create extra ConstantInt objects in foldSelectICmpAnd. NFCI Instead just use APInt objects and only create a ConstantInt at the end if we need it for the Offset. llvm-svn: 307270	2017-07-06 15:58:54 +00:00
Wei Mi	90707394e3	[LSR] Narrow search space by filtering non-optimal formulae with the same ScaledReg and Scale. When the formulae search space is huge, LSR uses a series of heuristic to keep pruning the search space until the number of possible solutions are within certain limit. The big hammer of the series of heuristics is NarrowSearchSpaceByPickingWinnerRegs, which picks the register which is used by the most LSRUses and deletes the other formulae which don't use the register. This is a effective way to prune the search space, but quite often not a good way to keep the best solution. We saw cases before that the heuristic pruned the best formula candidate out of search space. To relieve the problem, we introduce a new heuristic called NarrowSearchSpaceByFilterFormulaWithSameScaledReg. The basic idea is in order to reduce the search space while keeping the best formula, we want to keep as many formulae with different Scale and ScaledReg as possible. That is because the central idea of LSR is to choose a group of loop induction variables and use those induction variables to represent LSRUses. An induction variable candidate is often represented by the Scale and ScaledReg in a formula. If we have more formulae with different ScaledReg and Scale to choose, we have better opportunity to find the best solution. That is why we believe pruning search space by only keeping the best formula with the same Scale and ScaledReg should be more effective than PickingWinnerReg. And we use two criteria to choose the best formula with the same Scale and ScaledReg. The first criteria is to select the formula using less non shared registers, and the second criteria is to select the formula with less cost got from RateFormula. The patch implements the heuristic before NarrowSearchSpaceByPickingWinnerRegs, which is the last resort. Testing shows we get 1.8% and 2% on two internal benchmarks on x86. llvm nightly testsuite performance is neutral. We also tried lsr-exp-narrow and it didn't help on the two improved internal cases we saw. Differential Revision: https://reviews.llvm.org/D34583 llvm-svn: 307269	2017-07-06 15:52:14 +00:00
Simon Pilgrim	713600747e	[X86][SSE4A] Add support for shuffle combining to INSERTQI. llvm-svn: 307268	2017-07-06 15:34:17 +00:00
Sanjay Patel	0c37b19331	[CGP, x86] update test checks; NFC This was auto-generated using an older version of the script, and that version does not work with phis, so if we enable expansion it will go bad. llvm-svn: 307267	2017-07-06 15:31:38 +00:00
Simon Pilgrim	03641df383	[X86][SSE4A] Add test showing missed opportunities to combine INSERTQI shuffle llvm-svn: 307265	2017-07-06 14:52:24 +00:00
Joel Jones	aff09bf052	Doxygen formatting. NFCI llvm-svn: 307263	2017-07-06 14:17:36 +00:00
Sanjay Patel	2a341620e7	[x86] fix over-specified triple and auto-generate checks; NFC llvm-svn: 307262	2017-07-06 14:15:15 +00:00
Mikael Holmen	9c3e2eac6a	[MachineVerifier] Add check that tied physregs aren't different. Summary: Added MachineVerifier code to check register ties more thoroughly, especially so that physical registers that are tied are the same. This may help e.g. when creating MIR files. Original patch by Jesper Antonsson Reviewers: stoklund, sanjoy, qcolombet Reviewed By: qcolombet Subscribers: qcolombet, llvm-commits Differential Revision: https://reviews.llvm.org/D34394 llvm-svn: 307259	2017-07-06 13:18:21 +00:00
Ilya Biryukov	13cde86e74	Fixes to Dockerfile scripts. - Put buildfiles into /tmp/clang-build/build, instead of /tmp/clang-build. We checkout the sources to /tmp/clang-build/src and running cmake in /tmp/clang-build was done by mistake. - Don't add an extra ';' at the start of enabled projects list. It worked either way, but looked strange. - Minor comment update. llvm-svn: 307258	2017-07-06 13:10:55 +00:00
Simon Pilgrim	7b79fbd4ea	[X86][SSE] combineX86ShuffleChain - merge duplicate creations of integer mask types llvm-svn: 307257	2017-07-06 13:09:19 +00:00
Ilya Biryukov	7f02a75a74	Made a script to build docker images easier to use. Summary: - Removed double indirection via command-line args (i.e. two `--` options of `build_docker_image.sh`). - Added a comment on how to build 2-stage clang install into the `build_docker_image.sh`, it used to be only in the `docs/Docker.rst`. Reviewers: klimek, mehdi_amini Reviewed By: klimek Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D35050 llvm-svn: 307256	2017-07-06 12:46:51 +00:00
Simon Pilgrim	77ad6d9bb2	[X86][SSE] combineX86ShuffleChain - merge duplicate 'Zeroable' element masks llvm-svn: 307255	2017-07-06 12:40:10 +00:00
Simon Pilgrim	cc0f785dca	[X86][SSE4A] Add support for shuffle combining to EXTRQ. llvm-svn: 307254	2017-07-06 12:22:58 +00:00
Simon Pilgrim	40c0ae200f	[X86][SSE4A] Add scheduling tests for SSE4A instructions llvm-svn: 307251	2017-07-06 11:26:43 +00:00
Simon Pilgrim	1dd0bd1949	[X86][SSE4A] Split EXTRQ/INSERTQ shuffle matching from lowering. NFCI. First step toward supporting shuffle combining to EXTRQ/INSERTQ. llvm-svn: 307250	2017-07-06 11:06:54 +00:00
Max Kazantsev	98838527c6	Revert "Revert "Revert "[IndVars] Canonicalize comparisons between non-negative values and indvars""" It appears that the problem is still there. Needs more analysis to understand why SaturatedMultiply test fails. llvm-svn: 307249	2017-07-06 10:47:13 +00:00
Daniel Sanders	3ae33209cc	[globalisel][tablegen] Rename and re-comment render functions to match the new MatchTables. NFC. The conversion to MatchTable left the function names and comments referring to C++ statements and expressions. Updated the names and comments to account for the fact that they're no longer unconstrained statements/expressions. llvm-svn: 307248	2017-07-06 10:37:17 +00:00
David Stuttard	7528d4bd42	[RegisterCoalescer] Fix for SubRange join unreachable Summary: During remat, some subranges might end up having invalid segments which caused problems for later coalescing. Added in a check to remove segments that are invalidated as part of the remat. See http://llvm.org/PR33524 Subscribers: MatzeB, qcolombet Differential Revision: https://reviews.llvm.org/D34391 llvm-svn: 307247	2017-07-06 10:07:57 +00:00
Daniel Sanders	9d662d2486	[globalisel][tablegen] Rename and re-comment to match the new MatchTables. NFC. The conversion to MatchTable left the function names and comments referring to C++ statements and expressions. Updated the names and comments to account for the fact that they're no longer unconstrained statements/expressions. llvm-svn: 307246	2017-07-06 10:06:12 +00:00
Diana Picus	c3a9c34761	[ARM] GlobalISel: Map s32 G_FCMP in reg bank select Map hard G_FCMP operands to FPR and the result to GPR. llvm-svn: 307245	2017-07-06 09:57:46 +00:00
Max Kazantsev	c8db20b78c	Revert "Revert "[IndVars] Canonicalize comparisons between non-negative values and indvars"" It seems that the patch was reverted by mistake. Clang testing showed failure of the MathExtras.SaturatingMultiply test, however I was unable to reproduce the issue on the fresh code base and was able to confirm that the transformation introduced by the change does not happen in the said test. This gives a strong confidence that the actual reason of the failure of the initial patch was somewhere else, and that problem now seems to be fixed. Re-submitting the change to confirm that. llvm-svn: 307244	2017-07-06 09:57:41 +00:00
Diana Picus	d0104eaae8	[ARM] GlobalISel: Legalize G_FCMP for s32 This covers both hard and soft float. Hard float is easy, since it's just Legal. Soft float is more involved, because there are several different ways to handle it based on the predicate: one and ueq need not only one, but two libcalls to get a result. Furthermore, we have large differences between the values returned by the AEABI and GNU functions. AEABI functions return a nice 1 or 0 representing true and respectively false. GNU functions generally return a value that needs to be compared against 0 (e.g. for ogt, the value returned by the libcall is > 0 for true). We could introduce redundant comparisons for AEABI as well, but they don't seem easy to remove afterwards, so we do different processing based on whether or not the result really needs to be compared against something (and just truncate if it doesn't). llvm-svn: 307243	2017-07-06 09:09:33 +00:00
George Rimar	a56702e0c6	[DWARF] - Provide default implementation for getSectionLoadAddress() method of LoadedObjectInfo It is a bit unconvinent that client should implement this method even if not use it. Patch provides default implementation. Differential revision: https://reviews.llvm.org/D35009 llvm-svn: 307242	2017-07-06 08:46:01 +00:00
Daniel Sanders	85ffd3609f	[globalisel][tablegen] Import rules containing intrinsic_wo_chain. Summary: As of this patch, 1018 out of 3938 rules are currently imported. Depends on D32275 Reviewers: qcolombet, kristof.beyls, rovka, t.p.northover, ab, aditya_nandakumar Reviewed By: qcolombet Subscribers: dberris, igorb, llvm-commits Differential Revision: https://reviews.llvm.org/D32278 llvm-svn: 307240	2017-07-06 08:12:20 +00:00
Diana Picus	cd460c89c4	[ARM] GlobalISel: Widen s1, s8, s16 G_CONSTANT Get the legalizer to widen small constants. llvm-svn: 307239	2017-07-06 08:04:16 +00:00
David Blaikie	fd8777ed88	Fix -Wunused-function by making function declarations in a header non-static Also avoids ODR violations by ensuring names used in headers find the same entity, not different, file-local entities in each translation unit. llvm-svn: 307237	2017-07-06 05:33:32 +00:00
David Blaikie	5b079d83a6	Simplify InstrProfRecord tests, eliminating named temporaries in favor of braced init args This will also simplify an API transition and class renaming coming soon. llvm-svn: 307236	2017-07-06 05:19:17 +00:00
David L. Jones	a63c3369aa	[lit] Fix unit test discovery for Visual Studio builds. Fix by Andrew Ng! The Visual Studio build can contain output for multiple configuration types ( e.g. Debug, Release & RelWithDebInfo) within the same build output directory. Therefore when discovering unit tests, the "build mode" sub directory containing the appropriate configuration is included in the search. This sub directory may not always be present, so a test for its existence is required. Reviewers: zturner, modocache, dlj Reviewed By: zturner, dlj Subscribers: grimar, bd1976llvm, gbreynoo, edd, jhenderson, llvm-commits Differential Revision: https://reviews.llvm.org/D34976 llvm-svn: 307235	2017-07-06 03:23:18 +00:00
Frederich Munch	52dfcd18d1	Avoid constructing GlobalExtensions only to find out it is empty. Summary: GlobalExtensions is dereferenced twice, once for iteration and then a check if it is empty. As a ManagedStatic this dereference forces it's construction which is unnecessary. Reviewers: efriedma, davide, mehdi_amini Reviewed By: mehdi_amini Subscribers: chapuni, llvm-commits, mehdi_amini Differential Revision: https://reviews.llvm.org/D33381 llvm-svn: 307229	2017-07-06 00:09:09 +00:00
Eric Beckmann	f6090b620e	Revert "Revert "Revert "Switch external cvtres.exe for llvm's own resource library.""" This reverts commit ae21ee0b6cacbc1efaf4d42502e71da2f0eb45c3. The initial revert was done in order to prevent ongoing errors on chromium bots such as CrWinClangLLD. However, this was done haphazardly and I didn't realize there were test and compilation failures, so this revert was reverted. Now that those have been fixed, we can revert the revert of the revert. llvm-svn: 307227	2017-07-05 23:46:06 +00:00
Eric Beckmann	81979b038f	Revert "Revert "Revert "Replace trivial use of external rc.exe by writing our own .res file.""" This reverts commit 5fecbbbe5049665d86834cf69d8f75db4f392308. The initial revert was done in order to prevent ongoing errors on chromium bots such as CrWinClangLLD. However, this was done haphazardly and I didn't realize there were test and compilation failures, so this revert was reverted. Now that those have been fixed, we can revert the revert of the revert. llvm-svn: 307226	2017-07-05 23:45:50 +00:00
Craig Topper	4584476cac	[IR] Use CmpInst::isFPPredicate/isIntPredicate in a few other places. NFC llvm-svn: 307224	2017-07-05 23:35:46 +00:00
Davide Italiano	7dd0694f96	[GlobalOpt] Remove unreachable blocks before optimizing a function. LLVM's definition of dominance allows instructions that are cyclic in unreachable blocks, e.g.: %pat = select i1 %condition, @global, i16* %pat because any instruction dominates an instruction in a block that's not reachable from entry. So, remove unreachable blocks from the function, because a) there's no point in analyzing them and b) GlobalOpt should otherwise grow some more complicated logic to break these cycles. Differential Revision: https://reviews.llvm.org/D35028 llvm-svn: 307215	2017-07-05 22:28:28 +00:00
Craig Topper	ad7137838e	[IR] Use CmpInst::isIntPredicate()/isFPPredicate in some asserts instead of doing the equivalent range check. NFC llvm-svn: 307210	2017-07-05 22:09:00 +00:00
Vadim Chugunov	e6f76558c7	Fix libcall expansion creating DAG nodes with invalid type post type legalization. If we are lowering a libcall after legalization, we'll split the return type into a pair of legal values. Patch by Jatin Bhateja and Eli Friedman. Differential Revision: https://reviews.llvm.org/D34240 llvm-svn: 307207	2017-07-05 22:01:49 +00:00
Zachary Turner	91dcab417c	Fix std::min ambiguity between uint32 and size_t. llvm-svn: 307205	2017-07-05 21:59:20 +00:00
Zachary Turner	8120ebf4c7	[llvm-pdbutil] Add the ability to truncate stream purpose names. This will be useful for aligning fields to a fixed with in subsequent patches. llvm-svn: 307204	2017-07-05 21:54:58 +00:00
Brendon Cahoon	cb8c7b912d	[DependenceAnalysis] Make sure base objects are the same when comparing GEPs The dependence analysis was returning incorrect information when using the GEPs to compute dependences. The analysis uses the GEP indices under certain conditions, but was doing it incorrectly when the base objects of the GEP are aliases, but pointing to different locations in the same array. This patch adds another check for the base objects. If the base pointer SCEVs are not equal, then the dependence analysis should fall back on the path that uses the whole SCEV for the dependence check. This fixes PR33567. Differential Revision: https://reviews.llvm.org/D34702 llvm-svn: 307203	2017-07-05 21:35:47 +00:00
Galina Kistanova	0d1990e2e7	Added more info on silent master to the doc. llvm-svn: 307200	2017-07-05 20:45:44 +00:00
Craig Topper	cc418b656a	[InstCombine] Use CmpInst::Predicate with m_Cmp instead of ICmpInst::Predicate. NFC There isn't really an ICmpInst version so we're just accessing the CmpInst version through inheritance. llvm-svn: 307199	2017-07-05 20:31:00 +00:00
Sam Clegg	9bf73c078b	[WebAssembly] Fix types for address taken functions Differential Revision: https://reviews.llvm.org/D34966 llvm-svn: 307198	2017-07-05 20:25:08 +00:00
Alexander Shaposhnikov	d968f6f423	[tablegen] Avoid creating temporary strings If a method / function returns a StringRef but the variable is of type const std::string& a temporary string is created (StringRef has a cast operator to std::string), which is a suboptimal behavior. Differential revision: https://reviews.llvm.org/D34994 Test plan: make check-all llvm-svn: 307195	2017-07-05 20:14:54 +00:00
Sam Clegg	8c4baa00de	[WebAssembly] MC: Don't generate extra types for weak alias Previously we were generating a void(void) function type for a weak alias. Update the weak-alias test case to catch this. Differential Revision: https://reviews.llvm.org/D34734 llvm-svn: 307194	2017-07-05 20:09:26 +00:00
Rafael Espindola	243288a488	Add a test for relocation addend on mips. An lld test found a bug in a llvm patch I am working on. It is better to have test coverage for that in llvm too. llvm-svn: 307192	2017-07-05 19:31:07 +00:00
Eric Beckmann	1d50926e71	Revert "Revert "Replace trivial use of external rc.exe by writing our own .res file."" This reverts commit 8c8dce3b8f15d6ebaefc35ce88f15a85c8cdbd6e. llvm-svn: 307191	2017-07-05 19:04:48 +00:00
Eric Beckmann	0eafa581a3	Revert "Revert "Switch external cvtres.exe for llvm's own resource library."" This reverts commit 165e578e47f1cd38191120aad23a9020fb5476dd. Forgot to run tests on this. llvm-svn: 307190	2017-07-05 19:04:33 +00:00
Eric Beckmann	36793a0ecf	Revert "Switch external cvtres.exe for llvm's own resource library." This reverts commit 600d52c278e123dd08bee24c1f00932b55add8de. This patch still seems to break CrWinClangLLD, reverting until I can find root problem. llvm-svn: 307189	2017-07-05 18:59:16 +00:00
Eric Beckmann	8cc9fd31e6	Revert "Replace trivial use of external rc.exe by writing our own .res file." This patch still seems to break CrWinClangLLD, reverting this once more until I can discover root problem. This reverts commit 3dbbc8ce43be50ffde2b1c655c6d3a25796fe78b. llvm-svn: 307188	2017-07-05 18:59:01 +00:00
Zachary Turner	eae44dfee9	[PDB] Add a test that verifies every known type record. We had a lot of one-off tests for this type and that type, or "every type that happens to be generated by this program I built". Eventually I got a bug report filed where we were crashing on a type that was not covered by any of these tests. So this test carefully constructs a minimal C++ program that will cause every type we support to be emitted. This ensures full coverage for type records. Differential Revision: https://reviews.llvm.org/D34915 llvm-svn: 307187	2017-07-05 18:43:25 +00:00
Quentin Colombet	f3f7d4d64b	[AMDGPU] Move GISel accessor initialization from TargetMachine to Subtarget. NFC llvm-svn: 307186	2017-07-05 18:40:56 +00:00
Sean Fertile	3cd1a0368c	[Power9] Disable removing extra swaps on P9. On power 8 we sometimes insert swaps to deal with the difference between Little-Endian and Big-Endian. The swap removal pass is supposed to clean up these swaps. On power 9 we don't need this pass since we do not need to insert the swaps in the first place. Commiting on behalf of Stefan Pintilie. Differential Revision: https://reviews.llvm.org/D34627 llvm-svn: 307185	2017-07-05 18:37:10 +00:00
Simon Pilgrim	ac78daf517	{DAGCombiner] Fold (rot x, 0) -> x llvm-svn: 307184	2017-07-05 18:27:11 +00:00
Simon Pilgrim	49123d4bb0	[X86] Test bitfield loadstore tests on i686 as well llvm-svn: 307182	2017-07-05 18:09:30 +00:00
Sean Fertile	d44cb1838f	[PowerPC] Make sure that we remove dead PHI nodes after the PPCCTRLoops pass. Commiting on behalf of Stefan Pintilie. Differential Revision: https://reviews.llvm.org/D34829 llvm-svn: 307180	2017-07-05 17:57:57 +00:00
Andrew Zhogin	45d192823e	[DAGCombiner] visitRotate patch to optimize pair of ROTR/ROTL instructions into one with combined shift operand. For two ROTR operations with shifts C1, C2; combined shift operand will be (C1 + C2) % bitsize. Differential revision: https://reviews.llvm.org/D12833 llvm-svn: 307179	2017-07-05 17:55:42 +00:00
Simon Pilgrim	55006b407b	[X86][SSE] Dropped -mcpu from bitcast+setcc mask tests Use triple and attribute only for consistency llvm-svn: 307176	2017-07-05 17:30:30 +00:00
Tony Jiang	aa5a6a1c30	[Power9] Exploit vector extract with variable index. This patch adds the exploitation for new power 9 instructions which extract variable elements from vectors: VEXTUBLX VEXTUBRX VEXTUHLX VEXTUHRX VEXTUWLX VEXTUWRX Differential Revision: https://reviews.llvm.org/D34032 Commit on behalf of Zaara Syeda (syzaara@ca.ibm.com) llvm-svn: 307174	2017-07-05 16:55:00 +00:00
Tony Jiang	9a91a18110	[Power9] Exploit vector integer extend instructions when indices aren't correct. This patch adds on to the exploitation added by https://reviews.llvm.org/D33510. This now catches build vector nodes where the inputs are coming from sign extended vector extract elements where the indices used by the vector extract are not correct. We can still use the new hardware instructions by adding a shuffle to move the elements to the correct indices. I introduced a new PPCISD node here because adding a vector_shuffle and changing the elements of the vector_extracts was getting undone by another DAG combine. Commit on behalf of Zaara Syeda (syzaara@ca.ibm.com) Differential Revision: https://reviews.llvm.org/D34009 llvm-svn: 307169	2017-07-05 16:00:38 +00:00
Daniel Sanders	d560a64e42	[globalisel][tablegen] Fix another unused variable warning introduced by r307159 llvm-svn: 307168	2017-07-05 15:34:16 +00:00
David Blaikie	8713157747	DebugInfo: Generalize LoadedObjectInfoHelper from RuntimeDyld Make it usable by any class derived (even indirectly) from LoadedObjectInfo by allowing a custom base class to be specified and perfect forwarding to the ctor. llvm-svn: 307166	2017-07-05 15:23:56 +00:00
Daniel Sanders	a6cfce6863	[globalisel][tablegen] Finish fixing compile-time regressions by merging the matcher and emitter state machines. Summary: Also, made a few minor tweaks to shave off a little more cumulative memory consumption: * All rules share a single NewMIs instead of constructing their own. Only one will end up using it. * Use MIs.resize(1) instead of MIs.clear();MIs.push_back(I) and prevent GIM_RecordInsn from changing MIs[0]. Depends on D33764 Reviewers: rovka, vitalybuka, ab, t.p.northover, qcolombet, aditya_nandakumar Reviewed By: ab Subscribers: kristof.beyls, igorb, llvm-commits Differential Revision: https://reviews.llvm.org/D33766 llvm-svn: 307159	2017-07-05 14:50:18 +00:00
Dinar Temirbulatov	b78adec638	[SLPVectorizer] Add an extra parameter to cancelScheduling function, NFCI. llvm-svn: 307158	2017-07-05 13:53:03 +00:00
David Green	b26a0a460c	[IndVarSimplify] Add AShr exact flags using induction variables ranges. This adds exact flags to AShr/LShr flags where we can statically prove it is valid using the range of induction variables. This allows further optimisations to remove extra loads. Differential Revision: https://reviews.llvm.org/D34207 llvm-svn: 307157	2017-07-05 13:25:58 +00:00
Ulrich Weigand	43579cf4a0	[SystemZ] Simplify handling of 128-bit multiply/divide instruction Several integer multiply/divide instructions require use of a register pair as input and output. This patch moves setting up the input register pair from C++ code to TableGen, simplifying the whole process and making it more easily extensible. No functional change. llvm-svn: 307155	2017-07-05 13:17:31 +00:00
Ulrich Weigand	e2a68e96f0	[SystemZ] Small cleanups to SystemZScheduleZ13.td Fixes a couple of whitespace errors, re-sorts the vector floating-point instructions to make them more easily extensible, and adds a missing pseudo instruction. No functional change. llvm-svn: 307154	2017-07-05 13:14:43 +00:00
Nirav Dave	65b7ab1be4	[Hexagon] Preclude non-memory test from being optimized away. NFC. llvm-svn: 307153	2017-07-05 13:08:03 +00:00
Tom Stellard	edd69bc778	CMake: Add LLVM_UTILS_INSTALL_DIR option Summary: This is like the LLVM_TOOLS_INSTALL_DIR option, but for the utils that are installed when the LLVM_INSTALL_UTILS. This option defaults to 'bin' to remain consistent with the current behavior, but distros may want to install these to libexec/llvm. Reviewers: beanz Reviewed By: beanz Subscribers: llvm-commits, mgorny Differential Revision: https://reviews.llvm.org/D30655 llvm-svn: 307150	2017-07-05 12:57:30 +00:00
Diana Picus	fc1675eb16	[GlobalISel] Refactor Legalizer helpers for libcalls We used to have a helper that replaced an instruction with a libcall. That turns out to be too aggressive, since sometimes we need to replace the instruction with at least two libcalls. Therefore, change our existing helper to only create the libcall and leave the instruction removal as a separate step. Also rename the helper accordingly. llvm-svn: 307149	2017-07-05 12:57:24 +00:00
Sjoerd Meijer	6d14fdf62d	[AsmParser] Mnemonic Spell Corrector This implements suggesting other mnemonics when an invalid one is specified, for example: $ echo "adXd r1,r2,#3" \| llvm-mc -triple arm <stdin>:1:1: error: invalid instruction, did you mean: add, qadd? adXd r1,r2,#3 ^ The implementation is target agnostic, but as a first step I have added it only to the ARM backend; so the ARM backend is a good example if someone wants to enable this too for another target. Differential Revision: https://reviews.llvm.org/D33128 llvm-svn: 307148	2017-07-05 12:39:13 +00:00
Daniel Sanders	805e9cb3fc	[globalisel][tablegen] Fix the misuse of STATISTICS() on release builds (like r307088) after r307133. r307133 brought back a couple instances of the same mistake that was already fixed by r307088. Fixed it again. Using NumPatternEmitted as a unique id for the tables is not valid on release builds since the counters don't count in that case. llvm-svn: 307146	2017-07-05 12:14:18 +00:00
Diana Picus	3e8851a1b4	[ARM] GlobalISel: Extract tiny helper. NFC Extract functionality for determining if the target uses AEABI. llvm-svn: 307145	2017-07-05 11:53:51 +00:00
Diana Picus	97a5d9b5a7	[MachineIRBuilder] Fix formatting. NFC. llvm-svn: 307144	2017-07-05 11:47:23 +00:00
Igor Breger	0c979d49eb	[GlobalISel][X86] For now don't handle not trivial function arguments lowering. llvm-svn: 307142	2017-07-05 11:40:35 +00:00
Diana Picus	3e40b46bf0	[MachineIRBuilder] Add buildOr helper. NFC. This isn't used anywhere yet, but I need it for a future commit. llvm-svn: 307141	2017-07-05 11:32:12 +00:00
Igor Breger	55e2f5963a	[GlobalIsel] allow x86_fp80 values to be dumped. Summary: Otherwise the fallback path fails with an assertion on x86_64 targets, when "x86_fp80" is encountered. Reviewers: t.p.northover, zvi, guyblank Reviewed By: zvi Subscribers: rovka, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D34975 llvm-svn: 307140	2017-07-05 11:11:10 +00:00
Diana Picus	05e704f453	[MachineIRBuilder] Add buildBinaryOp helper. NFC Add a helper for building simple binary ops like add, mul, sub, and. This can be used in the future for quickly adding support for or, xor. llvm-svn: 307139	2017-07-05 11:02:31 +00:00
Daniel Sanders	1745076c59	[globalisel][tablegen] Fix an unused variable warning in release builds after r307133 llvm-svn: 307138	2017-07-05 10:16:48 +00:00
Max Kazantsev	ebe56283bc	Revert "[IndVars] Canonicalize comparisons between non-negative values and indvars" This patch seems to cause failures of test MathExtras.SaturatingMultiply on multiple buildbots. Reverting until the reason of that is clarified. Differential Revision: https://reviews.llvm.org/rL307126 llvm-svn: 307135	2017-07-05 09:44:41 +00:00
Daniel Sanders	d93a35ae40	[globalisel][tablegen] Added instruction emission to the state-machine-based matcher. Summary: This further improves the compile-time regressions that will be caused by a re-commit of r303259. Also added included preliminary work in preparation for the multi-insn emitter since I needed to change the relevant part of the API for this patch anyway. Depends on D33758 Reviewers: rovka, vitalybuka, ab, t.p.northover, qcolombet, aditya_nandakumar Reviewed By: ab Subscribers: kristof.beyls, igorb, llvm-commits Differential Revision: https://reviews.llvm.org/D33764 llvm-svn: 307133	2017-07-05 09:39:33 +00:00
Max Kazantsev	80bc4a5554	[IndVars] Canonicalize comparisons between non-negative values and indvars -If there is a IndVar which is known to be non-negative, and there is a value which is also non-negative, then signed and unsigned comparisons between them produce the same result. Both of those can be seen in the same loop. To allow other optimizations to simplify them, we turn all instructions like %c = icmp slt i32 %iv, %b to %c = icmp ult i32 %iv, %b if both %iv and %b are known to be non-negative. Differential Revision: https://reviews.llvm.org/D34979 llvm-svn: 307126	2017-07-05 06:38:49 +00:00
Igor Breger	9d5571a226	[GlobalISel][X86] Allow graceful fallback for struct/array argument/return value lowering. Going to support it in follow patch. llvm-svn: 307125	2017-07-05 06:24:13 +00:00
Nemanja Ivanovic	5fd4ea36fd	Add the missing triple to the test case added as part of r307120. llvm-svn: 307122	2017-07-05 05:14:43 +00:00
Nemanja Ivanovic	845a7968bc	[PowerPC] Fix for PR33636 Remove casts to a constant when a node can be an undef. Differential Revision: https://reviews.llvm.org/D34808 llvm-svn: 307120	2017-07-05 04:51:29 +00:00
Yuka Takahashi	34a7c3baf9	[Bash-autocompletion] Show flags which has HelpText or GroupID Summary: Otherwise internal flags will be also completed. Differential Revision: https://reviews.llvm.org/D34930 llvm-svn: 307116	2017-07-05 02:36:32 +00:00
Nirav Dave	b320ef9fab	Rewrite areNonVolatileConsecutiveLoads to use BaseIndexOffset Relanding after rewriting undef.ll test to avoid host-dependant endianness. As discussed in D34087, rewrite areNonVolatileConsecutiveLoads using generic checks. Also, propagate missing local handling from there to BaseIndexOffset checks. Tests of note: * test/CodeGen/X86/build-vector* - Improved. * test/CodeGen/BPF/undef.ll - Improved store alignment allows an additional store merge * test/CodeGen/X86/clear_upper_vector_element_bits.ll - This is a case we already do not handle well. Here, the DAG is improved, but scheduling causes a code size degradation. Reviewers: RKSimon, craig.topper, spatel, andreadb, filcab Subscribers: nemanjai, llvm-commits Differential Revision: https://reviews.llvm.org/D34472 llvm-svn: 307114	2017-07-05 01:21:23 +00:00
Alexander Shaposhnikov	ed37df7ea3	[profiledata] Avoid creating a temporary vector in getNumValueData getValueSitesForKind returns ArrayRef which has a cast operator to std::vector, as a result a temporary vector is created if the type of the variable is const std::vector& that is suboptimal in this case. Differential revision: https://reviews.llvm.org/D34970 Test plan: make check-all llvm-svn: 307113	2017-07-05 01:20:52 +00:00
Anna Thomas	740f529dba	[SafepointIRVerifier] Add verifier pass for finding GC relocation bugs Original Patch and summary by Philip Reames. RewriteStatepointsForGC tries to rewrite a function in a manner where the optimizer can't end up using a pointer value after it might have been relocated by a safepoint. This pass checks the invariant that RSForGC is supposed to establish and that (if we constructed semantics correctly) later passes must preserve. This has been a really useful diagnostic tool when initially developing the rewriting scheme and has found numerous bugs. Differential Revision: https://reviews.llvm.org/D15940 Reviewed by: swaroop.sridhar, mjacob Subscribers: llvm-commits llvm-svn: 307112	2017-07-05 01:16:29 +00:00
Dylan McKay	a24aa19900	Revert "[AVR] Add the branch selection pass from the GitHub repository" This reverts commit 602ef067c1d58ecb425d061f35f2bc4c7e92f4f3. llvm-svn: 307111	2017-07-05 00:50:56 +00:00
Dylan McKay	f115c7f917	[AVR] Add the branch selection pass from the GitHub repository We should rewrite this using the generic branch relaxation pass, but for the moment having this pass is better than hitting an assertion error. llvm-svn: 307109	2017-07-05 00:41:19 +00:00
Gadi Haber	689426e3cb	NFC. Made some updates to the half.ll test under CodeGen to make it friendly to the update_llc_test_checks .py tool as follows: 1.Removing the llc flag -asm-verbose=false 2.Grouping the multiple check-prefix directives 3.Apply update_llc_test_checks.py tool on the test This change is needed to easily update scheduling changes in an upcoming patch. Reviewers: zvi, RKSimon, craig.topper Differential Revision: https://reviews.llvm.org/D34934 llvm-svn: 307108	2017-07-04 21:51:05 +00:00
Craig Topper	5e1fa83bf2	Recommit r307064, "[InstCombine] Add test cases demonstrating creation of extra bswap instrinsic calls when when optimizing bswap and bitwise ops when the bswaps have additional uses. NFC" The test check lines have now been fixed. llvm-svn: 307106	2017-07-04 20:15:24 +00:00
Andrew Zhogin	2f8be0552e	[ARM][test] Added test/CodeGen/ARM/ror.ll test. NFC precommit for D12833. llvm-svn: 307103	2017-07-04 19:50:22 +00:00
Simon Pilgrim	ac3e7f3f57	[X86][SSE4A] Add support for combining from non-v16i8 EXTRQI/INSERTQI shuffles With the improved shuffle decoding we can now combine EXTRQI/INSERTQI shuffles from non-v16i8 vector types llvm-svn: 307099	2017-07-04 18:11:02 +00:00
Simon Pilgrim	f809c5f11c	Fix signed/unsigned comparison warnings llvm-svn: 307098	2017-07-04 17:42:01 +00:00
Alexander Timofeev	982aee6a38	[AMDGPU] Switch scalarize global loads ON by default Differential revision: https://reviews.llvm.org/D34407 llvm-svn: 307097	2017-07-04 17:32:00 +00:00
Anna Thomas	ada4ddc0bc	[LoopDeletion] NFC: Add loop being analyzed debug statement llvm-svn: 307096	2017-07-04 17:00:03 +00:00
Simon Pilgrim	9f0a0bd20b	[X86][SSE4A] Generalized EXTRQI/INSERTQI shuffle decodes The existing decodes only worked for v16i8 vectors, this adds support for any 128-bit vector llvm-svn: 307095	2017-07-04 16:53:12 +00:00
Hiroshi Inoue	79f8933f23	fix trivial typos in comments; NFC llvm-svn: 307094	2017-07-04 16:35:26 +00:00
Daniel Sanders	f1530f2512	[globalisel][tablegen] Fix the modules build after r307079 Exclude InstructionSelectorImpl.h since DEBUG_TYPE may vary between includes. llvm-svn: 307093	2017-07-04 16:29:38 +00:00
Andrew Zhogin	de5d250a0b	[DAGCombiner] Intermediate variables in visitRotate promoted to the function's begin. NFC precommit for D12833. llvm-svn: 307091	2017-07-04 15:57:39 +00:00
Daniel Sanders	c60abe37f2	[globalisel][tablegen] Fix release builds after r307079 Using NumPatternEmitted as a unique id for the tables is not valid on release builds since the counters don't count in that case. Also fix an unused variable warning. llvm-svn: 307088	2017-07-04 15:31:50 +00:00
Anna Thomas	505941e7d6	[FastISel] Move gc intrinsic test to X86 directory Move from generic to X86 directory since gc intrinsics only supposed in X86 64 bit. Add target triple as well. Fixes build failure in i686-linux-RA caused by rL307084. llvm-svn: 307086	2017-07-04 15:24:08 +00:00
Alexander Kornienko	656466ed0b	Fix dangling StringRefs found by clang-tidy misc-dangling-handle check. llvm-svn: 307085	2017-07-04 15:13:02 +00:00
Anna Thomas	a66a98cc74	[FastISel][SelectionDAG]Teach fastISel about GC intrinsics Summary: We are crashing in LLC at O0 when gc intrinsics are present in the block. The reason being FastISel performs basic block ISel by modifying GC.relocates to be the first instruction in the block. This can cause us to visit the GC relocate before it's corresponding GC.statepoint is visited, which is incorrect. When we lower the statepoint, we record the base and derived pointers, along with the gc.relocates. After this we can visit the gc.relocate. This patch avoids fastISel from incorrectly creating the block with gc.relocate as the first instruction. Reviewers: qcolombet, skatkov, qikon, reames Reviewed by: skatkov Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34421 llvm-svn: 307084	2017-07-04 15:09:09 +00:00
Marek Olsak	b83f5c99ba	[AMDGPU] Fix latency of MIMG instructions Patch by cwabbott (Connor Abbott). llvm-svn: 307081	2017-07-04 14:43:38 +00:00
Ilya Biryukov	0273b81cfb	NFC. Removed mention of missing script from build_docker_image.sh. llvm-svn: 307080	2017-07-04 14:41:21 +00:00
Daniel Sanders	6ab0daade8	[globalisel][tablegen] Partially fix compile-time regressions by converting matcher to state-machine(s) Summary: Replace the matcher if-statements for each rule with a state-machine. This significantly reduces compile time, memory allocations, and cumulative memory allocation when compiling AArch64InstructionSelector.cpp.o after r303259 is recommitted. The following patches will expand on this further to fully fix the regressions. Reviewers: rovka, ab, t.p.northover, qcolombet, aditya_nandakumar Reviewed By: ab Subscribers: vitalybuka, aemerson, javed.absar, igorb, llvm-commits, kristof.beyls Differential Revision: https://reviews.llvm.org/D33758 llvm-svn: 307079	2017-07-04 14:35:06 +00:00
Anna Thomas	90f69abc8b	[LoopDeletion] NFC: Add debug statements to the optimization We have a DEBUG option for loop deletion, but no related debug messages. Added some debug messages to state why loop deletion failed. llvm-svn: 307078	2017-07-04 14:05:19 +00:00
Hiroshi Inoue	2344b7611a	fix trivial typos in comments; NFC llvm-svn: 307075	2017-07-04 13:09:29 +00:00
Simon Pilgrim	d128222f0c	[X86] Add combine tests for vector rotates Reference tests for D12833 llvm-svn: 307073	2017-07-04 12:33:53 +00:00
NAKAMURA Takumi	ff1d5aefe3	Revert r307064, "[InstCombine] Add test cases demonstrating creation of extra bswap instrinsic calls when when optimizing bswap and bitwise ops when the bswaps have additional uses. NFC" Seems confused between %tmpN and unnamed %N to give same name. llvm-svn: 307070	2017-07-04 12:13:27 +00:00
NAKAMURA Takumi	501efda909	llvm/ExecutionEngine/Orc/ObjectTransformLayer.h: Add <memory> to appease libstdc++'s std::shared_ptr. llvm-svn: 307069	2017-07-04 12:12:37 +00:00
Gadi Haber	4980790e81	NFC commit. Converting the Codegen test "extractelement-legalization-store-ordering.ll" to be "update_llc_test_checks" friendly. The changes to the test are needed for an upcoming scheduling patch. Reviewers: zvi, RKSimon Differential Revision: https://reviews.llvm.org/D34935 llvm-svn: 307066	2017-07-04 07:18:03 +00:00
Craig Topper	0f746c2793	[InstCombine] Add TODOs for a couple things that should maybe be in InstSimplify instead. NFC llvm-svn: 307065	2017-07-04 06:50:48 +00:00
Craig Topper	872d750560	[InstCombine] Add test cases demonstrating creation of extra bswap instrinsic calls when when optimizing bswap and bitwise ops when the bswaps have additional uses. NFC I assume bswap intrinsics are somewhat costly so we should be making sure we are getting rid of them not creating more. llvm-svn: 307064	2017-07-04 06:50:44 +00:00
Alexander Shaposhnikov	8b5f0c9111	[tablegen] Avoid creating a temporary vector in getInstructionCase Record::getValues returns ArrayRef which has a cast operator to std::vector, as a result a temporary vector is created if the type of the variable is const std::vector& that is suboptimal in this case. Differential revision: https://reviews.llvm.org/D34969 Test plan: make check-all llvm-svn: 307063	2017-07-04 06:16:53 +00:00
Craig Topper	ad140cfb68	[X86] Add comment string for broadcast loads from the constant pool. Summary: When broadcasting from the constant pool its useful to print out the final vector similar to what we do for normal moves from the constant pool. I changed only a couple tests that were broadcast focused. One of them had been previously hand tweaked after running the script so that it could check the constant pool declaration. But I think this patch makes that unnecessary now since we can check the comment instead. Reviewers: spatel, RKSimon, zvi Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34923 llvm-svn: 307062	2017-07-04 05:46:11 +00:00
Alexander Shaposhnikov	49fc24a8bf	[llvm] Revert "[tablegen] Avoid creating a temporary vector in getInstructionCase" Revert rL307059 because of the incorrect commit message & patch, will recommit later. llvm-svn: 307061	2017-07-04 05:37:37 +00:00
Craig Topper	a4c5caf67a	[X86] Add RDRAND feature to GLM CPU Summary: I believe this should be supported on GLM since RDSEED is. Reviewers: m_zuckerman, zvi, RKSimon Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D34828 llvm-svn: 307060	2017-07-04 05:33:19 +00:00
Alexander Shaposhnikov	680f017487	[tablegen] Avoid creating a temporary vector in getInstructionCase Record::getValues returns ArrayRef which has a cast operator to std::vector, as a result a temporary vector is created if the type of the variable is const std::vector& that was suboptimal in this case. Differential revision: https://reviews.llvm.org/D34969 Test plan: make check-all llvm-svn: 307059	2017-07-04 05:11:30 +00:00
Lang Hames	5b51816020	[Orc] Remove the memory manager argument to addModule, and de-templatize the symbol resolver argument. De-templatizing the symbol resolver is part of the ongoing simplification of ORC layer API. Removing the memory management argument (and delegating construction of memory managers for RTDyldObjectLinkingLayer to a functor passed in to the constructor) allows us to build JITs whose base object layers need not be compatible with RTDyldObjectLinkingLayer's memory mangement scheme. For example, a 'remote object layer' that sends fully relocatable objects directly to the remote does not need a memory management scheme at all (that will be handled by the remote). llvm-svn: 307058	2017-07-04 04:42:30 +00:00
Dylan McKay	b224d98594	[AVR] Fix bug which caused assertion errors for some FRMIDX instructions Previously, if a basic block ended with a FRMIDX instruction, we would end up doing something like this. *std::next(MBB.end()) Which would hit an error: "Assertion `!NodePtr->isKnownSentinel()' failed." llvm-svn: 307057	2017-07-04 04:40:06 +00:00
Dylan McKay	eef7a6a32f	[AVR] Add a missing clobber declaration to LPMW llvm-svn: 307056	2017-07-04 02:52:43 +00:00
Nirav Dave	a2810e677b	[DAG] Fixed predicate for determining when two frame indices addresses are comparable. NFCI. llvm-svn: 307055	2017-07-04 02:20:17 +00:00
NAKAMURA Takumi	e4a741376b	Revert r307026, "[AMDGPU] Switch scalarize global loads ON by default" It broke a testcase. Failing Tests (1): LLVM :: CodeGen/AMDGPU/alignbit-pat.ll llvm-svn: 307054	2017-07-04 02:14:18 +00:00

... 3 4 5 6 7 ...

151563 Commits