llvm-project

Commit Graph

Author	SHA1	Message	Date
Alina Sbirlea	6cba96ed52	[LICM/MSSA] Add promotion to scalars by building an AliasSetTracker with MemorySSA. Summary: Experimentally we found that promotion to scalars carries less benefits than sinking and hoisting in LICM. When using MemorySSA, we build an AliasSetTracker on demand in order to reuse the current infrastructure. We only build it if less than AccessCapForMSSAPromotion exist in the loop, a cap that is by default set to 250. This value ensures there are no runtime regressions, and there are small compile time gains for pathological cases. A much lower value (20) was found to yield a single regression in the llvm-test-suite and much higher benefits for compile times. Conservatively we set the current cap to a high value, but we will explore lowering it when MemorySSA is enabled by default. Reviewers: sanjoy, chandlerc Subscribers: nemanjai, jlebar, Prazek, george.burgess.iv, jfb, jsji, llvm-commits Differential Revision: https://reviews.llvm.org/D56625 llvm-svn: 353339	2019-02-06 20:25:17 +00:00
Nirav Dave	b3506bf985	[DAG] Immediately cleanup unused nodes from extend-based combines. llvm-svn: 353338	2019-02-06 20:12:03 +00:00
Michael Berg	f0d81a31b6	Move IR flag handling directly into builder calls for cases translated from Instructions in GlobalIsel Reviewers: aditya_nandakumar, volkan Reviewed By: aditya_nandakumar Subscribers: rovka, kristof.beyls, volkan, Petar.Avramovic Differential Revision: https://reviews.llvm.org/D57630 llvm-svn: 353336	2019-02-06 19:57:06 +00:00
Alina Sbirlea	910c6bef3e	[AliasSetTracker] Pass MustAlias to addPointer more often. Summary: Pass the alias info to addPointer when available. Will save an alias() call for must sets when adding a known Must or May alias. [Part of a series of cleanup patches] Reviewers: reames, mkazantsev Subscribers: sanjoy, jlebar, llvm-commits Differential Revision: https://reviews.llvm.org/D56613 llvm-svn: 353335	2019-02-06 19:55:12 +00:00
Craig Topper	1c7ee20819	[X86] Change the CPU on the test case for pr40529.ll to really show the bug. NFC llvm-svn: 353334	2019-02-06 19:50:59 +00:00
Nirav Dave	c6bfa103a5	[X86][DAG] Avoid creating dangling bitcast. combineExtractWithShuffle may leave a dangling bitcast which may prevent further optimization in later passes. Avoid constructing it unless it is used. llvm-svn: 353333	2019-02-06 19:45:47 +00:00
Sanjay Patel	29a710be6a	[x86] add tests for horizontal ops (PR38971, PR33758); NFC llvm-svn: 353332	2019-02-06 19:40:11 +00:00
Jonas Paulsson	b21dde0530	[SystemZ] Improved handling of the @llvm.ctlz intrinsic. Since SystemZ supports counting of leading zeros with the FLOGR instruction, isCheapToSpeculateCtlz() should return true, which it now does. ISD::CTLZ_ZERO_UNDEF i32 is now handled the same way as ISD::CTLZ is, which is needed since promotion to i64 is required and CTLZ_ZERO_UNDEF is only expanded to CTLZ if it is Legal or Custom. Review: Ulrich Weigand https://reviews.llvm.org/D57710 llvm-svn: 353330	2019-02-06 19:23:31 +00:00
Peter Collingbourne	02fc3c696c	build: Remove the cmake check for malloc.h. As far as I can tell, malloc.h is only being used here to provide a definition of mallinfo (malloc itself is declared in stdlib.h via cstdlib). We already have a macro for whether mallinfo is available, so switch to using that instead. Differential Revision: https://reviews.llvm.org/D57807 llvm-svn: 353329	2019-02-06 19:20:47 +00:00
Jonas Paulsson	8cda83a5db	[SystemZ] Wait with VGBM selection until after DAGCombine2. Don't lower BUILD_VECTORs to BYTE_MASK, but instead expose the BUILD_VECTORs to the DAGCombiner and select them to VGBM in Select(). This allows the DAGCombiner to understand the constant vector values. For floating point, only all-zeros vectors are now generated with VGBM, as it turned out to be somewhat complicated to handle any arbitrary constants, while in practice this is very rare and hardly needed. The SystemZ ISD opcodes z_byte_mask, z_vzero and z_vones have been removed. Review: Ulrich Weigand https://reviews.llvm.org/D57152 llvm-svn: 353325	2019-02-06 18:59:19 +00:00
Florian Hahn	169f64238f	[opt-viewer] Add --filter option to select remarks for displaying. This allows limiting the displayed remarks to the ones with names matching the filter (regular) expression. Generating html pages for a larger project with optimization remarks can result in a huge HTML documents and using --filter allows to focus on a set of interesting remarks. Reviewers: hfinkel, anemet, thegameg, serge-sans-paille Reviewed By: anemet Differential Revision: https://reviews.llvm.org/D57827 llvm-svn: 353322	2019-02-06 18:43:37 +00:00
Bjorn Pettersson	350352c8a5	[SelectionDAG] Cleanup some code comments. NFC Don't repeat the function name in some doxygen comments. (Just a minor cleanup, while testing to push from the git monorepo setup.) llvm-svn: 353317	2019-02-06 17:36:18 +00:00
Jessica Paquette	e288c526f1	[GlobalISel][NFC] Gardening: Factor out code for simple unary intrinsics There was a lot of repeated code wrt unary math intrinsics in translateKnownIntrinsic. This factors out the repeated MIRBuilder code into two functions: translateSimpleUnaryIntrinsic and getSimpleUnaryIntrinsicOpcode. This simplifies adding simple unary intrinsics, since after this, all you have to do is add the mapping to SimpleUnaryIntrinsicOpcodes. Differential Revision: https://reviews.llvm.org/D57774 llvm-svn: 353316	2019-02-06 17:25:54 +00:00
James Henderson	c836e48841	[yaml2obj]Allow number for ELF symbol type yaml2obj previously only recognised standard STT_* names, and didn't allow arbitrary numbers. This change allows the user to specify a number for the type instead. It also adds a test to verify the existing behaviour for obj2yaml for unkown symbol types. Reviewed by: grimar Differential Revision: https://reviews.llvm.org/D57822 llvm-svn: 353315	2019-02-06 17:16:33 +00:00
Sanjay Patel	68bc5fb0ad	[InstCombine] X \| C == C --> (X & ~C) == 0 We should canonicalize to one of these forms, and compare-with-zero could be more conducive to follow-on transforms. This also leads to generally better codegen as shown in PR40611: https://bugs.llvm.org/show_bug.cgi?id=40611 llvm-svn: 353313	2019-02-06 16:43:54 +00:00
Sanjay Patel	51abb86f09	[InstCombine] add tests for PR40611 and regenerate checks; NFC Lots of unrelated diffs here from the newer version of the script. llvm-svn: 353312	2019-02-06 16:17:05 +00:00
Tim Northover	474f5d9b55	AArch64: enforce even/odd register pairs for CASP instructions. ARMv8.1a CASP instructions need the first of the pair to be an even register (otherwise the encoding is unallocated). We enforced this during assembly, but not CodeGen before. llvm-svn: 353308	2019-02-06 15:26:35 +00:00
Nirav Dave	e5c37958f9	[InlineAsm][X86] Add backend support for X86 flag output parameters. Allow custom handling of inline assembly output parameters and add X86 flag parameter support. llvm-svn: 353307	2019-02-06 15:26:29 +00:00
Nirav Dave	54511076d4	[SelectionDAGBuilder] Refactor Inline Asm output check. NFCI. llvm-svn: 353305	2019-02-06 15:12:46 +00:00
Ulrich Weigand	17a0012687	[SystemZ] Do not return INT_MIN from strcmp/memcmp The IPM sequence currently generated to compute the strcmp/memcmp result will return INT_MIN for the "less than zero" case. While this is in compliance with the standard, strictly speaking, it turns out that common applications cannot handle this, e.g. because they negate a comparison result in order to implement reverse compares. This patch changes code to use a different sequence that will result in -2 for the "less than zero" case (same as GCC). However, this requires that the two source operands of the compare instructions are inverted, which breaks the optimization in removeIPMBasedCompare. Therefore, I've removed this (and all of optimizeCompareInstr), and replaced it with a mostly equivalent optimization in combineCCMask at the DAGcombine level. llvm-svn: 353304	2019-02-06 15:10:13 +00:00
Tim Northover	71025a2f3e	AArch64: annotate atomics with dropped acquire semantics when printing. A quirk of the v8.1a spec is that when the writeback regiser for an atomic read-modify-write instruction is wzr/xzr, the instruction no longer enforces acquire ordering. However, it's still written with the misleading 'a' mnemonic. So this adds an annotation when disassembling such instructions, mentioning the change. llvm-svn: 353303	2019-02-06 15:07:59 +00:00
Sanjay Patel	e84fbb67a1	[x86] vectorize cast ops in lowering to avoid register file transfers The proposal in D56796 may cross the line because we're trying to avoid vectorization transforms in generic DAG combining. So this is an alternate, later, x86-specific translation of that patch. There are several potential follow-ups to enhance this: 1. Allow extraction from non-zero element index. 2. Peek through extends of smaller width integers. 3. Support x86-specific conversion opcodes like X86ISD::CVTSI2P Differential Revision: https://reviews.llvm.org/D56864 llvm-svn: 353302	2019-02-06 14:59:39 +00:00
Andrea Di Biagio	02974728dc	[MCA] Speedup ResourceManager queries. NFCI When a resource unit R is released, the ResourceManager notifies groups that contain R. Before this patch, the logic in method ResourceManager::release() implemented a potentially slow iterative search of dependent groups on the entire set of processor resources. This patch replaces that logic with a simpler (and often faster) lookup on array `Resource2Groups`. This patch gives an average speedup of ~3-4% (observed on a release build when testing for target btver2). No functional change intended. llvm-svn: 353301	2019-02-06 14:57:28 +00:00
Nico Weber	da2bb5d5f6	gn build: Merge r353265, r353237 llvm-svn: 353298	2019-02-06 13:53:47 +00:00
Eugene Leviant	ef6eba2401	Attempt to fix buildbot after r353289 llvm-svn: 353294	2019-02-06 13:45:22 +00:00
Clement Courbet	5a6712b633	[DAGCombine][NFC] GatherAllAliases should take a LSBaseSDNode. GatherAllAliases only makes sense for LSBaseSDNode. Enforce it with static typing instead of runtime cast. llvm-svn: 353291	2019-02-06 12:36:17 +00:00
Max Kazantsev	cd48ac3661	[NFC] Simplify check in guard widening llvm-svn: 353290	2019-02-06 11:27:00 +00:00
Eugene Leviant	f324f6dcfb	[llvm-objcopy] Allow regular expressions in name comparison Differential revision: https://reviews.llvm.org/D57517 llvm-svn: 353289	2019-02-06 11:00:07 +00:00
James Henderson	b6b5b1a592	[DebugInfo]Print correct value for special opcode address increment The wrong variable was being used when printing the address increment in verbose output of .debug_line. This patch fixes this. Reviewed by: JDevlieghere Differential Revision: https://reviews.llvm.org/D57693 llvm-svn: 353288	2019-02-06 10:31:50 +00:00
James Henderson	cd1424aebb	[DebugInfo][llvm-symbolizer]Add some tests for edge cases when symbolizing This patch adds half a dozen new tests that test various edge cases in the behaviour of the symbolizer and DWARF data parsing. All of them test the current behaviour. Reviewed by: JDevlieghere, aprantl Differential Revision: https://reviews.llvm.org/D57741 llvm-svn: 353286	2019-02-06 10:13:14 +00:00
Roman Lebedev	418280100b	[yaml::BinaryRef] Slight perf tuning (for llvm-exegesis analysis mode) Summary: llvm-exegesis uses this functionality to read it's benchmark dumps. This reading of `.yaml`s takes ~60% of runtime for 14656 benchmark points (i.e. one sweep over all x86 instructions), but only 30% of time for 3x as much benchmark points. In particular, this `BinaryRef` appears to be an obvious pain point. Without patch: ``` $ perf stat -r 25 ./bin/llvm-exegesis -mode=analysis -analysis-epsilon=1.0 -benchmarks-file=/tmp/benchmarks-inverse_throughput-onefull.yaml -analysis-clusters-output-file="" -analysis-inconsistencies-output-file=/tmp/clusters-orig.html no exegesis target for x86_64-unknown-linux-gnu, using default Parsed 14656 benchmark points Printing sched class consistency analysis results to file '/tmp/clusters-orig.html' ... no exegesis target for x86_64-unknown-linux-gnu, using default Parsed 14656 benchmark points Printing sched class consistency analysis results to file '/tmp/clusters-orig.html' Performance counter stats for './bin/llvm-exegesis -mode=analysis -analysis-epsilon=1.0 -benchmarks-file=/tmp/benchmarks-inverse_throughput-onefull.yaml -analysis-clusters-output-file= -analysis-inconsistencies-output-file=/tmp/clusters-orig.html' (25 runs): 972.86 msec task-clock # 0.994 CPUs utilized ( +- 0.25% ) 30 context-switches # 30.774 M/sec ( +- 21.74% ) 0 cpu-migrations # 0.370 M/sec ( +- 67.81% ) 11873 page-faults # 12211.512 M/sec ( +- 0.00% ) 3898373408 cycles # 4009682.186 GHz ( +- 0.25% ) (83.12%) 360399748 stalled-cycles-frontend # 9.24% frontend cycles idle ( +- 0.54% ) (83.24%) 1099450483 stalled-cycles-backend # 28.20% backend cycles idle ( +- 0.59% ) (33.63%) 4910528820 instructions # 1.26 insn per cycle # 0.22 stalled cycles per insn ( +- 0.13% ) (50.21%) 1111976775 branches # 1143726625.854 M/sec ( +- 0.10% ) (66.77%) 23248474 branch-misses # 2.09% of all branches ( +- 0.19% ) (83.29%) 0.97850 +- 0.00647 seconds time elapsed ( +- 0.66% ) ``` With the patch: ``` $ perf stat -r 25 ./bin/llvm-exegesis -mode=analysis -analysis-epsilon=1.0 -benchmarks-file=/tmp/benchmarks-inverse_throughput-onefull.yaml -analysis-clusters-output-file="" -analysis-inconsistencies-output-file=/tmp/clusters-new.html no exegesis target for x86_64-unknown-linux-gnu, using default Parsed 14656 benchmark points Printing sched class consistency analysis results to file '/tmp/clusters-new.html' ... no exegesis target for x86_64-unknown-linux-gnu, using default Parsed 14656 benchmark points Printing sched class consistency analysis results to file '/tmp/clusters-new.html' Performance counter stats for './bin/llvm-exegesis -mode=analysis -analysis-epsilon=1.0 -benchmarks-file=/tmp/benchmarks-inverse_throughput-onefull.yaml -analysis-clusters-output-file= -analysis-inconsistencies-output-file=/tmp/clusters-new.html' (25 runs): 905.29 msec task-clock # 0.999 CPUs utilized ( +- 0.11% ) 15 context-switches # 16.533 M/sec ( +- 32.27% ) 0 cpu-migrations # 0.000 K/sec 11873 page-faults # 13121.789 M/sec ( +- 0.00% ) 3627759720 cycles # 4009283.100 GHz ( +- 0.11% ) (83.19%) 370401480 stalled-cycles-frontend # 10.21% frontend cycles idle ( +- 0.22% ) (83.19%) 1007114438 stalled-cycles-backend # 27.76% backend cycles idle ( +- 0.34% ) (33.62%) 4414014304 instructions # 1.22 insn per cycle # 0.23 stalled cycles per insn ( +- 0.08% ) (50.36%) 1003751700 branches # 1109314021.971 M/sec ( +- 0.07% ) (66.97%) 24611010 branch-misses # 2.45% of all branches ( +- 0.10% ) (83.41%) 0.90593 +- 0.00105 seconds time elapsed ( +- 0.12% ) ``` So this decreases the overall run time of llvm-exegesis analysis mode (on one sweep) by roughly -7%. To be noted, `BinaryRef::writeAsBinary()` change is the reason for the perf changes, usage of `llvm::isHexDigit()` instead of `isxdigit()` does not appear to have any perf impact, i have only changed it "for symmetry". `writeAsBinary()` change is correct, it produces identical de-hex-ified buffer, and the final output is thus identical: ``` $ sha512sum /tmp/clusters-* db4bbd904fe8840853b589b032c5041bc060b91bcd9c27b914b56581fbc473550eea74b852238c79963b5adf2419f379e9f5db76784048b48e3937f9f3e732bf /tmp/clusters-new.html db4bbd904fe8840853b589b032c5041bc060b91bcd9c27b914b56581fbc473550eea74b852238c79963b5adf2419f379e9f5db76784048b48e3937f9f3e732bf /tmp/clusters-orig.html ``` Reviewers: silvas, espindola, sbc100, zturner, courbet, gchatelet Reviewed By: gchatelet Subscribers: tschuett, RKSimon, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57699 llvm-svn: 353282	2019-02-06 08:57:01 +00:00
Fangrui Song	b8ee8c8517	Fix misspelled filenames in file headers of llvm/{MC,Object,CodeGen}/*.h llvm-svn: 353278	2019-02-06 08:02:46 +00:00
Max Kazantsev	36b392cbe4	[NFC] Factor out detatchment of dead blocks from their erasing llvm-svn: 353277	2019-02-06 07:56:36 +00:00
Max Kazantsev	a4ccfc1841	[LoopSimplifyCFG] Do not count dead exit blocks twice, make CFG simpler llvm-svn: 353276	2019-02-06 07:49:17 +00:00
Max Kazantsev	0d7ad3c9a3	[NFC] Revert rL353274 llvm-svn: 353275	2019-02-06 06:33:02 +00:00
Max Kazantsev	61e6ffc398	[NFC] Extend API of DeleteDeadBlock(s) to collect updates without DTU llvm-svn: 353274	2019-02-06 06:00:02 +00:00
Max Kazantsev	bad4db8b1a	[NFC] Replace readonly SmallVectorImpl with ArrayRef llvm-svn: 353273	2019-02-06 05:40:31 +00:00
Teresa Johnson	716abbeb43	[HotColdSplit] Move splitting after instrumented PGO use Summary: Follow up to D57082 which moved splitting earlier in the pipeline, in order to perform it before inlining. However, it was moved too early, before the IR is annotated with instrumented PGO data. This caused the splitting to incorrectly determine cold functions. Move it to just after PGO annotation (still before inlining), in both pass managers. Reviewers: vsk, hiraditya, sebpop Subscribers: mehdi_amini, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57805 llvm-svn: 353270	2019-02-06 04:29:39 +00:00
Petr Hosek	23fdd5a37f	[CMake] Unify scripts for generating VCS headers Previously, there were two different scripts for generating VCS headers: one used by LLVM and one used by Clang and lldb. They were both similar, but different. They were both broken in their own ways, for example the one used by Clang didn't properly handle monorepo resulting in an incorrect version information reported by Clang. This change unifies two the scripts by introducing a new script that's used from both LLVM, Clang and lldb, ensures that the new script supports both monorepo and standalone SVN and Git setups, and removes the old scripts. Differential Revision: https://reviews.llvm.org/D57063 llvm-svn: 353268	2019-02-06 03:51:00 +00:00
Philip Reames	00ae46ba52	[AliasSetTracker] Minor style tweak to avoid a variable w/two distinct live ranges [NFC] llvm-svn: 353267	2019-02-06 03:46:40 +00:00
Philip Reames	b5bb4a4ec6	[Test] Add codegen tests for unordered and monotonic integer operations llvm-svn: 353266	2019-02-06 03:19:04 +00:00
Richard Trieu	5f436fc57a	Move DomTreeUpdater from IR to Analysis DomTreeUpdater depends on headers from Analysis, but is in IR. This is a layering violation since Analysis depends on IR. Relocate this code from IR to Analysis to fix the layering violation. llvm-svn: 353265	2019-02-06 02:52:52 +00:00
Sanjay Patel	997b2aba58	[x86] add tests for extract+sitofp; NFC llvm-svn: 353249	2019-02-06 00:19:56 +00:00
Heejin Ahn	4367587fc6	[WebAssembly] Tidy up `let` statements in .td files (NFC) Summary: - Delete {} for one-line `let` statements - Don't indent within `let` blocks - Add comments after `let` block's closing braces Reviewers: tlively Subscribers: dschuff, sbc100, jgravelle-google, sunfish, jfb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57730 llvm-svn: 353248	2019-02-06 00:17:03 +00:00
Alina Sbirlea	b9c1bc6d3c	[BasicAA] Cache nonEscapingLocalObjects for alias() calls. Summary: Use a small cache for Values tested by nonEscapingLocalObject(). Since the calls to PointerMayBeCaptured are fairly expensive, this saves a good amount of compile time for anything relying heavily on BasicAA.alias() calls. This uses the same approach as the AliasCache, i.e. the cache is reset after each alias() call. The cache is not used or updated by modRefInfo calls since it's harder to know when to reset the cache. Testcases that show improvements with this patch are too large to include. Example compile time improvement: 7s to 6s. Reviewers: chandlerc, sunfish Subscribers: sanjoy, jlebar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57627 llvm-svn: 353245	2019-02-05 23:52:08 +00:00
Nico Weber	d3391bdd91	gn build: Fix clang-tidy build Not depending on //clang/lib/StaticAnalyzer/Core and //clang/lib/StaticAnalyzer/Frontend causes a linker error even if ClangSACheckers are not supported. Undefined symbols for architecture x86_64: "clang::ento::CreateAnalysisConsumer(clang::CompilerInstance&)", referenced from: clang::tidy::ClangTidyASTConsumerFactory::CreateASTConsumer( clang::CompilerInstance&, llvm::StringRef) in libclangTidy.a(libclangTidy.ClangTidy.o) Patch from Mirko Bonadei <mbonadei@webrtc.org>! Differential Revision: https://reviews.llvm.org/D57777 llvm-svn: 353244	2019-02-05 23:48:13 +00:00
Vedant Kumar	bd94b4287c	[HotColdSplit] Do not split out `resume` instructions Resumes that are not reachable from a cleanup landing pad are considered to be unreachable. It’s not safe to split them out. rdar://47808235 llvm-svn: 353242	2019-02-05 23:39:02 +00:00
David Blaikie	e49209ed88	Orc: Simplify RPC naming system by using function-local statics The existing scheme of class template static members for Name and NameMutex is a bit verbose, involves global ctors (even if they're cheap for string and mutex, still not entirely free), and (importantly/my immediate motivation here) trips over a bug in LLVM's modules implementation that's a bit involved (hmm, sounds like Mr. Smith has a fix for the modules thing - but I'm still inclined to commit this patch as general goodness). llvm-svn: 353241	2019-02-05 23:38:55 +00:00
Douglas Yung	92e3c97e8a	Fixup test on Windows with a case-insensitive filesystem due to path printing changes from r352704. llvm-svn: 353238	2019-02-05 23:27:38 +00:00
Lang Hames	3e040e05f8	[ADT] Add a fallible_iterator wrapper. A fallible iterator is one whose increment or decrement operations may fail. This would usually be supported by replacing the ++ and -- operators with methods that return error: class MyFallibleIterator { public: // ... Error inc(); Errro dec(); // ... }; The downside of this style is that it no longer conforms to the C++ iterator concept, and can not make use of standard algorithms and features such as range-based for loops. The fallible_iterator wrapper takes an iterator written in the style above and adapts it to (mostly) conform with the C++ iterator concept. It does this by providing standard ++ and -- operator implementations, returning any errors generated via a side channel (an Error reference passed into the wrapper at construction time), and immediately jumping the iterator to a known 'end' value upon error. It also marks the Error as checked any time an iterator is compared with a known end value and found to be inequal, allowing early exit from loops without redundant error checking. Usage looks like: MyFallibleIterator I = ..., E = ...; Error Err = Error::success(); for (auto &Elem : make_fallible_range(I, E, Err)) { // Loop body is only entered when safe. // Early exits from loop body permitted without checking Err. if (SomeCondition) return; } if (Err) // Handle error. Since failure causes a fallible iterator to jump to end, testing that a fallible iterator is not an end value implicitly verifies that the error is a success value, and so is equivalent to an error check. Reviewers: dblaikie, rupprecht Subscribers: mgorny, dexonsmith, kristina, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57618 llvm-svn: 353237	2019-02-05 23:17:11 +00:00
Heejin Ahn	7b7a4ef3d3	[WebAssembly] Add a comment about why v128.const test was disabled (NFC) llvm-svn: 353236	2019-02-05 23:01:41 +00:00
Sanjay Patel	cddb1e5469	[InstCombine] limit extracting shuffle transform based on uses As discussed in D53037, this can lead to worse codegen, and we don't generally expect the backend to be able to optimize arbitrary shuffles. If there's only one use of the 1st shuffle, that means it's getting removed, so that should always be safe. llvm-svn: 353235	2019-02-05 22:58:45 +00:00
Heejin Ahn	1b8df42712	[WebAssembly] Disable a v128.const test line temporarily r353131 caused failures in v128.const test for clang-ppc64be-linux-lnt and clang-s390x-linux bots. This temporarily disables that line until it is fixed. llvm-svn: 353234	2019-02-05 22:47:29 +00:00
Sanjay Patel	0272b44ea2	[InstCombine] split shuffle test to show extra use constraint; NFC As discussed in D53037, this transform can cause codegen problems if the 1st shuffle has multiple uses. llvm-svn: 353233	2019-02-05 22:46:13 +00:00
Rong Xu	ce10d5ead4	[PGO] Use a function for creating variable for profile file name. NFC. Factored out the code for creating variable for profile file name to a function. llvm-svn: 353230	2019-02-05 22:34:45 +00:00
Petar Jovanovic	b33f00f508	[elfabi] Fix the type of the variable formated for error output Change the format type of Dyn.SONameOffset to PRIx64 since it is a uint64_t. The problem was detected on mips builds, where it was printing junk values and causing test failure. Patch by Milos Stojanovic. Differential Revision: https://reviews.llvm.org/D57676 llvm-svn: 353225	2019-02-05 22:23:46 +00:00
Aditya Nandakumar	fef7619b05	[NFC][GlobalISel]: Add a convenience method to MachineInstrBuilder to simplify getOperand(i).getReg() https://reviews.llvm.org/D57608 It's a common pattern in GISel to have a MachineInstrBuilder from which we get various regs (commonly MIB->getOperand(0).getReg()). This adds a helper method and the above can be replaced with MIB.getReg(0). llvm-svn: 353223	2019-02-05 22:14:40 +00:00
Craig Topper	4562f420cd	[X86] Regenerate tests missed in r353061. NFC We now print the implicit %st register on these instruction, but since they occur at the end of the line, FileCheck didn't see they were missing. llvm-svn: 353222	2019-02-05 21:47:42 +00:00
Reid Kleckner	f38bc4fc99	[MC] Don't error on numberless .file directives on MachO Summary: Before r349976, MC ignored such directives when producing an object file and asserted when re-producing textual assembly output. I turned this assertion into a hard error in both cases in r349976, but this makes it unnecessarily difficult to write a single assembly file that supports both MachO and other object formats that support .file. A user reported this as PR40578, and we decided to go back to ignoring the directive. Fixes PR40578 Reviewers: mstorsjo Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57772 llvm-svn: 353218	2019-02-05 21:14:09 +00:00
Matt Davis	0d0e9c08a4	[llvm-readobj] Display sections that do not belong to a segment in the section-mapping Summary: The following patch adds the "None" line to the section to segment mapping dump. That line lists the sections that do not belong to any segment. I realize that this change differs from GNU readelf which does not display the latter information. I'd rather not add this "feature" under a command line option. I think that might introduce confusion, since users would have to make an additional decision as to if they want to see all of the section-to-segment map or just a subset of it. Another option is to only print the "None" line if the `--section-mapping` option is passed; however, that might also introduce some confusion, because the section-to-segment map would be different between`--program-headers` and the `--section-mapping` output. While the difference is just the "None" line, it seems that if we choose to display the segment-to-section mapping, then we should always display the whole map including the sections that do not belong to segments. ``` Section to Segment mapping: Segment Sections... 00 01 .interp 02 .interp .note.ABI-tag .gnu.hash 03 .init_array .fini_array .dynamic 04 .dynamic 05 .note.ABI-tag 06 .eh_frame_hdr 07 08 .init_array .fini_array .dynamic .got None .comment .symtab .strtab .shstrtab <--- THIS LINE ``` Reviewers: grimar, rupprecht, jhenderson, espindola Reviewed By: rupprecht Subscribers: khemant, emaste, arichardson, llvm-commits Differential Revision: https://reviews.llvm.org/D57700 llvm-svn: 353217	2019-02-05 21:01:01 +00:00
Thomas Lively	315056692d	[WebAssembly] Lower memmove to memory.copy Summary: The lowering is identical to the memcpy lowering. Reviewers: aheejin Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57727 llvm-svn: 353216	2019-02-05 20:57:40 +00:00
Evandro Menezes	e5bb58b115	[TargetLibraryInfo] Regroup run time functions for Windows (NFC) Regroup supported and unsupported functions by precision and C standard. llvm-svn: 353213	2019-02-05 20:24:21 +00:00
Matt Arsenault	5b3084e3ab	Move some llvm-mc tests where they belong llvm-svn: 353211	2019-02-05 20:12:48 +00:00
Matt Arsenault	a3d0c5adaf	GlobalISel: Verify G_GEP llvm-svn: 353209	2019-02-05 20:04:12 +00:00
Scott Linder	e2c5847414	[AMDGPU] Consider XOR in waterfall loop as a terminator Ensure the XOR in the waterfall loop for indirect addressing is considered a terminator. Differential Revision: https://reviews.llvm.org/D57703 llvm-svn: 353207	2019-02-05 19:50:32 +00:00
Alexey Bataev	f3a9150324	[DEBUG_INFO][NVPTX] Generate DW_AT_address_class to get the values in debugger. Summary: According to https://docs.nvidia.com/cuda/archive/10.0/ptx-writers-guide-to-interoperability/index.html#cuda-specific-dwarf, the compiler should emit the DW_AT_address_class attribute for all variable and parameter. It means, that DW_AT_address_class attribute should be used in the non-standard way to support compatibility with the cuda-gdb debugger. Clang is able to generate the information about the variable address class. This information is emitted as the expression sequence `DW_OP_constu <DWARF Address Space> DW_OP_swap DW_OP_xderef`. The patch tries to find all such expressions and transform them into `DW_AT_address_class <DWARF Address Space>` if target is NVPTX and the debugger is gdb. If the expression is not found, then default values are used. For the local variables <DWARF Address Space> is set to ADDR_local_space(6), for the globals <DWARF Address Space> is set to ADDR_global_space(5). The values are taken from the table in the same section 5.2. CUDA-Specific DWARF Definitions. Reviewers: echristo, probinson Subscribers: jholewinski, aprantl, llvm-commits Differential Revision: https://reviews.llvm.org/D57157 llvm-svn: 353203	2019-02-05 19:33:47 +00:00
Matt Arsenault	a3f9b71c09	AMDGPU: Fix assert on trunc from bitcast of build_vector The v2i64 argument is lowered to a bitcast of v4i32 build_vector. This would then attempt to use the i32-element as the source of the vector truncate. This really would need to collect 2 elements from the build_vector to produce the intended truncate. llvm-svn: 353202	2019-02-05 19:23:57 +00:00
Simon Pilgrim	b0afc69435	[X86][SSE] Disable ZERO_EXTEND shuffle combining rL352997 enabled ZERO_EXTEND from non-shuffle-able value types. I've disabled it for now to fix a regression identified by @asbirlea until I can fix this properly. llvm-svn: 353198	2019-02-05 19:15:48 +00:00
Petar Jovanovic	40a7f63c37	[PGO] Fix the type of the formated variable Change the format type of Value to PRIu64 since it is a uint64_t. The problem was detected on mips boards building 32-bit binaries, where it was printing junk values and causing test failure. Patch by Milos Stojanovic. Differential Revision: https://reviews.llvm.org/D57583 llvm-svn: 353194	2019-02-05 18:09:28 +00:00
Robert Widmann	d5444ccf17	[LLVM-C] Add Bindings to GlobalIFunc Summary: Adds the standard gauntlet of accessors for global indirect functions and updates the echo test. Now it would be nice to have a target abstraction so one could know if they have access to a suitable ELF linker and runtime. Reviewers: whitequark, deadalnix Reviewed By: whitequark Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D56177 llvm-svn: 353193	2019-02-05 18:05:44 +00:00
Anton Korobeynikov	b26134bf92	Enable integrated assembler on MSP430 by default. Patch by Kristina Bessonova! Differential Revision: https://reviews.llvm.org/D56787 llvm-svn: 353192	2019-02-05 18:01:45 +00:00
Oliver Stannard	78dc38ec94	[AArch64][Outliner] Don't outline BTI instructions We can't outline BTI instructions, because they need to be the very first instruction executed after an indirect call or branch. If we outline them, then an indirect call might go to the branch to the outlined function, which will fault. Differential revision: https://reviews.llvm.org/D57753 llvm-svn: 353190	2019-02-05 17:21:57 +00:00
Simon Pilgrim	822d2e35e7	[X86][AVX] Attempt to combine shuffles to subvector broadcast load llvm-svn: 353189	2019-02-05 17:02:49 +00:00
Matt Arsenault	86504fb20e	AArch64/GlobalISel: Don't clamp from 2 to 2 This is equivalent to clampMaxNumElements, but saves a check. llvm-svn: 353188	2019-02-05 16:57:18 +00:00
Sam Clegg	d9c9dc036c	[WebAssembly] Object: Remove redundant method. NFC. Differential Revision: https://reviews.llvm.org/D57719 llvm-svn: 353183	2019-02-05 16:30:21 +00:00
Simon Pilgrim	ec5a6761e5	[X86][AVX] Add PR34041 subvector broadcast test cases llvm-svn: 353182	2019-02-05 16:18:30 +00:00
Sanjay Patel	b696b771bf	[CGP] add test for unsigned subtract of 1 with overflow; NFC llvm-svn: 353179	2019-02-05 15:27:40 +00:00
Sanjay Patel	237e208f16	[AArch64][x86] add tests for unsigned subtract with overflow; NFC llvm-svn: 353178	2019-02-05 15:26:42 +00:00
Nico Weber	50be01149c	gn build: BUILD.gn files for clang-tidy and clang-apply-replacements Patch from Mirko Bonadei <mbonadei@webrtc.org>! Differential Revision: https://reviews.llvm.org/D57329 llvm-svn: 353177	2019-02-05 15:14:38 +00:00
Krasimir Georgiev	12971803c4	Fix typo in comment, NFCI llvm-svn: 353176	2019-02-05 15:00:56 +00:00
Nico Weber	697f914dff	gn build: Merge r353072 llvm-svn: 353175	2019-02-05 14:47:36 +00:00
Thomas Preud'homme	a5e233bf79	Recommit: Detect incorrect FileCheck variable CLI definition Summary: While the backend code of FileCheck relies on definition of variable from the command-line to have an equal sign '=' and a variable name before that, the frontend does not actually enforce it. This leads to FileCheck crashing when invoked with invalid syntax for the -D option. This patch adds the missing validation in the frontend. It also makes the -D option an AlwaysPrefix option to be able to detect -D=FOO as being a define without variable and -D as missing its value. Copyright: - Linaro (changes in version 2 of revision D55940) - GraphCore (changes in later versions) Reviewers: jdenny Subscribers: JonChesterfield, hiraditya, kristina, probinson, llvm-commits Differential Revision: https://reviews.llvm.org/D55940 llvm-svn: 353173	2019-02-05 14:17:28 +00:00
Thomas Preud'homme	f929a0f81b	Recommit: Add support for prefix-only CLI options Summary: Add support for options that always prefix their value, giving an error if the value is in the next argument or if the option is given a value assignment (ie. opt=val). This is the desired behavior for the -D option of FileCheck for instance. Copyright: - Linaro (changes in version 2 of revision D55940) - GraphCore (changes in later versions and introduced when creating D56549) Reviewers: jdenny Subscribers: llvm-commits, probinson, kristina, hiraditya, JonChesterfield Differential Revision: https://reviews.llvm.org/D56549 llvm-svn: 353172	2019-02-05 14:17:16 +00:00
Simon Pilgrim	8450ad17a9	[X86][SSE] Rename SimplifyDemandedVectorElts BLENDV tests I'm going to be adding SimplifyDemandedBits tests shortly. llvm-svn: 353171	2019-02-05 14:11:50 +00:00
Andrea Di Biagio	4bce783ee3	[MCA] Moved the logic that updates register dependencies from DispatchStage to RegisterFile. NFC DispatchStage should always delegate to an object of class RegisterFile the task of updating data dependencies. ReadState and WriteState objects should not be modified directly by DispatchStage. This patch also renames stage IS_AVAILABLE to IS_DISPATCHED. llvm-svn: 353170	2019-02-05 14:11:41 +00:00
Serge Guelton	cad6336675	gn build: Fix Python 3 write_vcsrevision script compatibility Trivial fix: decode was not called for all subprocess.check_output calls. Commited on behalf of Andrew Boyarshin Differential Revision: https://reviews.llvm.org/D57505 llvm-svn: 353168	2019-02-05 13:01:12 +00:00
Simon Pilgrim	62af24cc93	[X86][SSE] Add SimplifyDemandedVectorElts support for X86ISD::BLENDV llvm-svn: 353165	2019-02-05 12:27:29 +00:00
Simon Pilgrim	bd3adbb899	[X86][SSE] Add tests showing missing SimplifyDemandedVectorElts support for X86ISD::BLENDV llvm-svn: 353164	2019-02-05 12:18:34 +00:00
Andrea Di Biagio	998a925e0e	[MCA] Simplify the logic in method WriteState::addUser. NFCI In some cases, it is faster to just grow the set of 'Users' rather than performing a llvm::find_if every time a new user is added to the set. No functional change intended. llvm-svn: 353162	2019-02-05 11:36:55 +00:00
Jeremy Morse	84ca706be1	[DebugInfo][NFCI] Split salvageDebugInfo into helper functions Some use cases are appearing where salvaging is needed that does not correspond to an instruction being deleted -- for example an instruction being sunk, or a Value not being available in a block being isel'd. Enable more fine grained control over how salavging occurs by splitting the logic into helper functions, separating things that are specific to working on DbgVariableIntrinsics from those specific to interpreting IR and building DIExpressions. Differential Revision: https://reviews.llvm.org/D57696 llvm-svn: 353156	2019-02-05 11:11:28 +00:00
Hans Wennborg	7279895454	Fix format string in bindings/go/llvm/ir_test.go (PR40561) The test started failing for me recently. I don't see any changes around this code, so maybe it's my local go version that changed or something. The error seems real to me: we're trying to print an Attribute with %d. The test talks about "attribute masks" I'm not sure what that refers to, but I suppose we could print the raw pointer value, since that's what the test seems to be comparing. Differential revision: https://reviews.llvm.org/D57672 llvm-svn: 353155	2019-02-05 11:01:54 +00:00
Simon Pilgrim	9e595e3663	[X86][AVX] Attempt to share broadcasts of different widths (PR39454) If we have broadcasts of different vector widths, keep the longest vector width and extract subvectors for the shorter vectors (which should be free). Differential Revision: https://reviews.llvm.org/D57663 llvm-svn: 353154	2019-02-05 10:58:43 +00:00
Simon Pilgrim	fbb3086fc8	[CostModel][X86] Add UMUL fixed point cost tests llvm-svn: 353153	2019-02-05 10:55:38 +00:00
Florian Hahn	3b251963c3	[CGP] Add support for sinking operands to their users, if they are free. This patch improves code generation for some AArch64 ACLE intrinsics. It adds support to CGP to duplicate and sink operands to their user, if they can be folded into a target instruction, like zexts and sub into usubl. It adds a TargetLowering hook shouldSinkOperands, which looks at the operands of instructions to see if sinking is profitable. I decided to add a new target hook, as for the sinking to be profitable, at least on AArch64, we have to look at multiple operands of an instruction, instead of looking at the users of a zext for example. The sinking is done in CGP, because it works around an instruction selection limitation. If instruction selection is not limited to a single basic block, this patch should not be needed any longer. Alternatively this could be done in the LoopSink pass, which tries to undo LICM for instructions in blocks that are not executed frequently. Note that we do not force the operands to sink to have a single user, because we duplicate them before sinking. Therefore this is only desirable if they really can be done for free. Additionally we could consider the impact on live ranges later on. This should fix https://bugs.llvm.org/show_bug.cgi?id=40025. As for performance, we have internal code that uses intrinsics and can be speed up by 10% by this change. Reviewers: SjoerdMeijer, t.p.northover, samparker, efriedma, RKSimon, spatel Reviewed By: samparker Differential Revision: https://reviews.llvm.org/D57377 llvm-svn: 353152	2019-02-05 10:27:40 +00:00
Diana Picus	e24b104a11	[ARM GlobalISel] Support G_GEP for Thumb2 Same as ARM, but use a different opcode in the instruction selection. llvm-svn: 353151	2019-02-05 10:21:37 +00:00
Dan Liew	de5220ed5e	Previously if the user configured their build but then changed LLVM_ENABLED_PROJECT and reconfigured it had no effect on what projects were actually built. This was very confusing behaviour. The reason for this is that the value of the `LLVM_TOOL_<PROJECT>_BUILD` variables are already set. The problem here is that we have two sources of truth: * The projects listed in LLVM_ENABLE_PROJECTS. * The projects enabled/disabled with LLVM_TOOL_<PROJECT>_BUILD. At configure time we have no real way of knowing which source of truth the user wants so we apply the following heuristic: If the user ever sets `LLVM_ENABLE_PROJECTS` in the CMakeCache then that is used as the single source of truth and we force the `LLVM_TOOL_<PROJECT>_BUILD` CMake cache variables to have the appropriate values that match the contents of the `LLVM_ENABLE_PROJECTS`. If the user never sets `LLVM_ENABLE_PROJECTS` then they can continue to use and set the `LLVM_TOOL_<PROJECT>_BUILD` variables as the "source of truth". The problem with this approach is that if the user ever tries to use both `LLVM_ENABLE_PROJECTS` and `LLVM_TOOL_<PROJECT>_BUILD` for the same build directory then any user set value for `LLVM_TOOL_<PROJECT>_BUILD` variables will get overwriten, likely without the user noticing. Hopefully the above shouldn't matter in practice because the LLVM_TOOL_<PROJECT>_BUILD variables are not documented, but LLVM_ENABLE_PROJECTS is. We should probably deprecate the `LLVM_TOOL_<PROJECT>_BUILD` variables at some point by turning them into to regular CMake variables that don't live in the CMake cache. Differential Revision: https://reviews.llvm.org/D57535 llvm-svn: 353148	2019-02-05 08:47:28 +00:00
Hiroshi Inoue	02a2bb2f54	[NFC] fix trivial typos in comments llvm-svn: 353147	2019-02-05 08:30:48 +00:00
Clement Courbet	f3da6abf0f	[DAG][NFC] Add unit tests. In preparation for D57541. llvm-svn: 353144	2019-02-05 08:00:17 +00:00
Clement Courbet	17b51b655e	[DAG] BaseIndexOffset: FrameIndexSDNodes with the same FrameIndex compare equal. Reviewers: niravd Subscribers: arphaman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57692 llvm-svn: 353143	2019-02-05 07:36:20 +00:00
Craig Topper	f86eb00f12	[X86] Connect the default fpsr and dirflag clobbers in inline assembly to the registers we have defined for them. Summary: We don't currently map these constraints to physical register numbers so they don't make it to the MachineIR representation of inline assembly. This could have problems for proper dependency tracking in the machine schedulers though I don't have a test case that shows that. Reviewers: rnk Reviewed By: rnk Subscribers: eraman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57641 llvm-svn: 353141	2019-02-05 06:13:06 +00:00
Peter Collingbourne	6141b037a9	gn build: Upgrade to NDK r19. NDK r19 includes a sysroot that can be used directly by the compiler without creating a standalone toolchain, so we just need a handful of flags to point Clang there. Differential Revision: https://reviews.llvm.org/D57733 llvm-svn: 353139	2019-02-05 05:10:19 +00:00
Craig Topper	31259c52ff	[X86] Add test case from PR40529. NFC llvm-svn: 353138	2019-02-05 04:48:23 +00:00
Max Kazantsev	d5e595b7a6	[LSR] Check SCEV on isZero() after extend. PR40514 When LSR first adds SCEVs to BaseRegs, it only does it if `isZero()` has returned false. In the end, in invocation of `InsertFormula`, it asserts that all values there are still not zero constants. However between these two points, it makes some transformations, in particular extends them to wider type. SCEV does not give us guarantee that if `S` is not a constant zero, then `sext(S)` is also not a constant zero. It might have missed some optimizing transforms when it was calculating `S` and then made them when it took `sext`. For example, it may happen if previously optimizing transforms were limited by depth or somehow else. This patch adds a bailout when we may end up with a zero SCEV after extension. Differential Revision: https://reviews.llvm.org/D57565 Reviewed By: samparker llvm-svn: 353136	2019-02-05 04:30:37 +00:00
Teresa Johnson	b0bf530fb5	[SamplePGO] More pipeline changes when flattened profile used in ThinLTO postlink Summary: Follow on to D54819/r351476. We also don't need to perform extra InstCombine pass when we aren't loading the sample profile in the ThinLTO backend because we have a flattened sample profile. Additionally, for consistency and clarity, when we aren't reloading the sample profile, perform ICP in the same location as non-sample PGO backends. To this end I have moved the ICP invocation for non-SamplePGO ThinLTO down into buildModuleSimplificationPipeline (partly addresses the FIXME where we were previously setting this up). Reviewers: wmi Subscribers: mehdi_amini, inglorion, eraman, steven_wu, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57705 llvm-svn: 353135	2019-02-05 04:09:19 +00:00
Richard Trieu	a9354b2f33	Fix narrowing issue from r353129 llvm-svn: 353134	2019-02-05 02:26:03 +00:00
Heejin Ahn	e37ba2cb96	[WebAssembly] Fix indentation after adding IsCanonical property (NFC) llvm-svn: 353132	2019-02-05 01:59:49 +00:00
Wouter van Oortmerssen	1a91cb0402	[WebAssembly] Make disassembler always emit most canonical name. Summary: There are a few instructions that all map to the same opcode, so when disassembling, we have to pick one. That was just the first one before (the except_ref variant in the case of "call"), now it is the one marked as IsCanonical in tablegen, or failing that, the shortest name (which is typically the "canonical" one). Also introduced a canonical "end" instruction for this purpose. Reviewers: dschuff, tlively Subscribers: sbc100, jgravelle-google, aheejin, llvm-commits, sunfish Tags: #llvm Differential Revision: https://reviews.llvm.org/D57713 llvm-svn: 353131	2019-02-05 01:19:45 +00:00
Wei Mi	4901f371a2	[SamplePGO][NFC] Minor improvement to replace a temporary vector with a brace-enclosed init list. Differential Revision: https://reviews.llvm.org/D57726 llvm-svn: 353129	2019-02-05 00:57:50 +00:00
Matt Arsenault	2bf74ec8c5	GlobalISel: Fix verifier crashing on non-register operands Also correct the wording of error on subregisters. llvm-svn: 353128	2019-02-05 00:53:22 +00:00
Thomas Lively	d99af23765	[WebAssembly] memory.copy Summary: Depends on D57495. Reviewers: aheejin, dschuff Subscribers: sbc100, jgravelle-google, hiraditya, sunfish Differential Revision: https://reviews.llvm.org/D57498 llvm-svn: 353127	2019-02-05 00:49:55 +00:00
Matt Arsenault	7f09fd6b04	GlobalISel: Consolidate load/store legalization The fewerElementsVectors implementation for load/stores handles the scalar reduction case just as well, so drop the redundant code in narrowScalar. This also introduces support for narrowing irregular size breakdowns for scalars. llvm-svn: 353125	2019-02-05 00:26:12 +00:00
Craig Topper	d4e37afe45	[DAGCombiner] Discard pointer info when combining extract_vector_elt of a vector load when the index isn't constant Summary: If the index isn't constant, this transform inserts a multiply and an add on the index to calculating the base pointer for a scalar load. But we still create a memory operand with an offset of 0 and the size of the scalar access. But the access is really to an unknown offset within the original access size. This can cause the machine scheduler to incorrectly calculate dependencies between this load and other accesses. In the case we saw, there was a 32 byte vector store that was split into two 16 byte stores, one with offset 0 and one with offset 16. The size of the memory operand for both was 16. The scheduler correctly detected the alias with the offset 0 store, but not the offset 16 store. This patch discards the pointer info so we don't incorrectly detect aliasing. I wasn't sure if we could keep using the original offset and size without risking some other transform on the load changing the size. I tried to reduce a test case, but there's still a lot of memory operations needed to get the scheduler to do the bad reordering. So it looked pretty fragile to maintain. Reviewers: efriedma Reviewed By: efriedma Subscribers: arphaman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57616 llvm-svn: 353124	2019-02-05 00:22:23 +00:00
Teresa Johnson	4bdf82ce79	[SamplePGO] Minor efficiency improvement in samplePGO ICP Summary: When attaching prof metadata to promoted direct calls in SamplePGO mode, no need to construct and use a SmallVector to pass a single count to the ArrayRef parameter, we can simply use a brace-enclosed init list. This made a small but consistent improvement for a ThinLTO backend compile I was measuring. Reviewers: wmi Subscribers: mehdi_amini, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57706 llvm-svn: 353123	2019-02-05 00:18:38 +00:00
Matt Arsenault	81511e5428	GlobalISel: Implement narrowScalar for select Don't handle vector conditions. I think this can be merged in the future with fewerElementsVectorSelect, although this becomes slightly tricky with a vector condition. llvm-svn: 353122	2019-02-05 00:13:44 +00:00
Matt Arsenault	24f14993e8	GlobalISel: Combine g_extract with g_merge_values Try to use the underlying source registers. This enables legalization in more cases where some irregular operations are widened and others narrowed. This seems to make the test_combines_2 AArch64 test worse, since the MERGE_VALUES has multiple uses. Since this should be required for legalization, a hasOneUse check is probably inappropriate (or maybe should only be used if the merge is legal?). llvm-svn: 353121	2019-02-04 23:41:59 +00:00
Julian Lettner	98b9f5b4b3	[Sanitizers] UBSan unreachable incompatible with Kernel ASan Summary: This is a follow up for https://reviews.llvm.org/D57278. The previous revision should have also included Kernel ASan. rdar://problem/40723397 Subscribers: cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D57711 llvm-svn: 353120	2019-02-04 23:37:50 +00:00
Sam Clegg	ae28be3a8a	[llvm-readobj] Fix readobj test expectation broken in rL353109. NFC. llvm-svn: 353119	2019-02-04 23:36:38 +00:00
Evandro Menezes	98f356cd74	Revert "[PATCH] [TargetLibraryInfo] Update run time support for Windows" This reverts accidental commit `ff5527718d`. llvm-svn: 353118	2019-02-04 23:34:50 +00:00
Evandro Menezes	d016763774	[ADT] Refactor the Windows query functions (NFC) Increase reuse in the query functions for Windows. llvm-svn: 353117	2019-02-04 23:34:38 +00:00
Evandro Menezes	ff5527718d	[PATCH] [TargetLibraryInfo] Update run time support for Windows It seems that the run time for Windows has changed and supports more math functions than before. Since LLVM requires at least VS2015, I assume that this is the run time that would be redistributed with programs built with Clang. Thus, I based this update on the header file `math.h` that accompanies it. This patch addresses the PR40541. Unfortunately, I have no access to a Windows development environment to validate it. llvm-svn: 353114	2019-02-04 23:29:41 +00:00
Matt Arsenault	1f795e2c2a	GlobalISel: Enforce operand types for constants A number of of tests were using imm operands, not cimm. Since CSE relies on the exact ConstantInt* pointer used, and implicit conversions are generally evil, also enforce the bitsize of the types. llvm-svn: 353113	2019-02-04 23:29:31 +00:00
Matt Arsenault	f2a26339e2	GlobalISel: Verify g_select Factor the common vector element consistency check many instructions need out, although this makes the error messages worse. llvm-svn: 353112	2019-02-04 23:29:16 +00:00
Matt Arsenault	46f9c6cf0b	MachineVerifier: Move verification of G_* instructions to function llvm-svn: 353111	2019-02-04 23:29:11 +00:00
Sam Clegg	313f9f54f5	[WebAssembly] MC: Mark more function aliases as functions Aliases of functions are now marked as function symbols even if they are bitcast to some other other non-function type. This is important for WebAssembly where object and function symbols can't alias each other. Fixes PR38866 Differential Revision: https://reviews.llvm.org/D57538 llvm-svn: 353109	2019-02-04 23:07:34 +00:00
Matt Arsenault	8a59b1919c	MIR: Validate LLT types when parsing llvm-svn: 353107	2019-02-04 22:59:56 +00:00
Sanjay Patel	738e17b5df	[CGP] fix bogus test names/comments; NFC Inverted operand 0 and operand 1. llvm-svn: 353106	2019-02-04 22:37:05 +00:00
Sam Clegg	3fd2462d03	[llvm-readobj] Report more WebAssembly symbol info Differential Revision: https://reviews.llvm.org/D57695 llvm-svn: 353104	2019-02-04 22:27:46 +00:00
Sanjay Patel	1f9e23e3cc	[CGP] add tests for usubo; NFC llvm-svn: 353103	2019-02-04 22:27:08 +00:00
Matt Arsenault	3d6a49b0b9	GlobalISel: Fix not calling observer when legalizing bitcount ops This was hiding bugs from never legalizing the source type. llvm-svn: 353102	2019-02-04 22:26:33 +00:00
Matt Arsenault	cba0c6d0c9	AMDGPU: Don't rematerialize mov with implicit operands This was pulling the mov used for register indexing on gfx9 out of the loop. llvm-svn: 353101	2019-02-04 22:26:21 +00:00
Julian Lettner	29ac3a5b82	[SanitizerCoverage] Clang crashes if user declares `__sancov_lowest_stack` variable Summary: If the user declares or defines `__sancov_lowest_stack` with an unexpected type, then `getOrInsertGlobal` inserts a bitcast and the following cast fails: ``` Constant *SanCovLowestStackConstant = M.getOrInsertGlobal(SanCovLowestStackName, IntptrTy); SanCovLowestStack = cast<GlobalVariable>(SanCovLowestStackConstant); ``` This variable is a SanitizerCoverage implementation detail and the user should generally never have a need to access it, so we emit an error now. rdar://problem/44143130 Reviewers: morehouse Differential Revision: https://reviews.llvm.org/D57633 llvm-svn: 353100	2019-02-04 22:06:30 +00:00
David Major	1137fce9e9	gn build: Windows: use a more standard format for PDB filenames The current build was producing names like llvm-undname.exe.pdb, which looks unusual to me at least. This switches them to the more common llvm-undname.pdb style. Differential Revision: https://reviews.llvm.org/D57613 llvm-svn: 353099	2019-02-04 21:27:38 +00:00
David Major	d1934853a8	gn build: Revert r353094 (bad merge) llvm-svn: 353098	2019-02-04 21:25:13 +00:00
Nicolai Haehnle	a69146e67e	[InstCombine] Cleanup the TFE/LWE check in AMDGPU SimplifyDemanded Summary: The fix added in r352904 is not quite correct, or rather misleading: 1. When the texfailctrl (TFC) argument was non-constant, the fix assumed non-TFE/LWE, which is incorrect. 2. Regardless, this code path cannot even be hit for correct TFE/LWE-enabled calls, because those return a struct. Added a test case for those for completeness. Change-Id: I92d314dbc67a2670f6d7adaab765ef45f56a49cf Reviewers: hliao, dstuttard, arsenm Subscribers: kzhuravl, jvesely, wdng, yaxunl, tpr, t-tye, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57681 llvm-svn: 353097	2019-02-04 21:24:19 +00:00
Craig Topper	4ca0b850e0	[X86] Add test case for report_fatal_error added in r352699. r352699 replaced an llvm_unreachable with a report_fatal_error. This patch adds a test case for it. llvm-svn: 353096	2019-02-04 21:24:15 +00:00
Craig Topper	c45e39b35f	[CodeGen][ARC][SystemZ][WebAssembly] Use MachineInstr::isInlineAsm in more places instead of just comparing opcode. NFCI I'm looking at adding a second INLINEASM opcode for better modeling asm-goto as a terminator. Using the existing predicate will reduce teh number of places that will need to use the new opcode. llvm-svn: 353095	2019-02-04 21:24:13 +00:00
David Major	1469ff417b	gn build: Windows: use a more standard format for PDB filenames The current build was producing names like llvm-undname.exe.pdb, which looks unusual to me at least. This switches them to the more common llvm-undname.pdb style. Differential Revision: https://reviews.llvm.org/D57613 llvm-svn: 353094	2019-02-04 21:20:25 +00:00
David Major	3c659cb267	gn build: Windows: write PDBs when is_debug Without /DEBUG, the /Zi doesn't on its own create PDB files. And since ninja runs multiple compilations in parallel, we need /FS to prevent contention on PDBs. Differential Revision: https://reviews.llvm.org/D57612 llvm-svn: 353093	2019-02-04 21:13:43 +00:00
Aditya Nandakumar	9b6b9a5791	[Tablegen][DAG]: Fix build breakage when LLVM_ENABLE_DAGISEL_COV=1 LLVM_ENABLE_DAGISEL_COV can be used to instrument DAGISel tablegen selection code to show which patterns along with Complex patterns were used when selecting instructions. Unfortunately this is turned off by default and was broken but never tested. This required a simple fix (missing new line) to get it to build again. llvm-svn: 353091	2019-02-04 21:06:24 +00:00
Philip Pfaffe	0ee6a933ce	[NewPM][MSan] Add Options Handling Summary: This patch enables passing options to msan via the passes pipeline, e.e., -passes=msan<recover;kernel;track-origins=4>. Reviewers: chandlerc, fedor.sergeev, leonardchan Subscribers: hiraditya, bollu, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D57640 llvm-svn: 353090	2019-02-04 21:02:49 +00:00
Wolfgang Pieb	90d856cd5f	[DEBUGINFO] Reposting r352642: Handle restore instructions in LiveDebugValues The LiveDebugValues pass recognizes spills but not restores, which can cause large gaps in location information for some variables, depending on control flow. This patch make LiveDebugValues recognize restores and generate appropriate DBG_VALUE instructions. This patch was posted previously with r352642 and reverted in r352666 due to buildbot errors. A missing return statement was the cause for the failures. Reviewers: aprantl, NicolaPrica Differential Revision: https://reviews.llvm.org/D57271 llvm-svn: 353089	2019-02-04 20:42:45 +00:00
Scott Linder	d19d197221	[AMDGPU] Support emitting GOT relocations for function calls Differential Revision: https://reviews.llvm.org/D57416 llvm-svn: 353083	2019-02-04 20:00:07 +00:00
Michael Kruse	70560a0a2c	[WarnMissedTransforms] Do not warn about already vectorized loops. LoopVectorize adds llvm.loop.isvectorized, but leaves llvm.loop.vectorize.enable. Do not consider such a loop for user-forced vectorization since vectorization already happened -- by prioritizing llvm.loop.isvectorized except for TM_SuppressedByUser. Fixes http://llvm.org/PR40546 Differential Revision: https://reviews.llvm.org/D57542 llvm-svn: 353082	2019-02-04 19:55:59 +00:00
Matt Arsenault	22309c8701	GlobalISel: Fix CheckMachineFunction passing if ReadCheckFile files This could be tested, but the FileCheck library spams the error message to the console. llvm-svn: 353081	2019-02-04 19:53:22 +00:00
Matt Arsenault	f3a46d0ae9	GlobalISel: Allow constructing SrcOp/DstOp from MachineOperand llvm-svn: 353080	2019-02-04 19:53:19 +00:00
Matt Arsenault	d7fa13c121	GlobalISel: Fix parameter name in documentation llvm-svn: 353078	2019-02-04 19:16:58 +00:00
Matt Arsenault	8121ec26c0	GlobalISel: Fix CSE handling of buildConstant This fixes two problems with CSE done in buildConstant. First, this would hit an assert when used with a vector result type. Solve this by allowing CSE on the vector elements, but not on the result vector for now. Second, this was also performing the CSE based on the input ConstantInt pointer. The underlying buildConstant could potentially convert the constant depending on the result type, giving in a different ConstantInt*. Stop allowing the APInt and ConstantInt forms from automatically casting to the result type to avoid any similar problems in the future. llvm-svn: 353077	2019-02-04 19:15:50 +00:00
Heejin Ahn	18c56a0762	[WebAssembly] clang-tidy (NFC) Summary: This patch fixes clang-tidy warnings on wasm-only files. The list of checks used is: `-,clang-diagnostic-,llvm-,misc-,-misc-unused-parameters,readability-identifier-naming,modernize-` (LLVM's default .clang-tidy list is the same except it does not have `modernize-`. But I've seen in multiple CLs in LLVM the modernize style was recommended and code was fixed based on the style, so I added it as well.) The common fixes are: - Variable names start with an uppercase letter - Function names start with a lowercase letter - Use `auto` when you use casts so the type is evident - Use inline initialization for class member variables - Use `= default` for empty constructors / destructors - Use `using` in place of `typedef` Reviewers: sbc100, tlively, aardappel Subscribers: dschuff, sunfish, jgravelle-google, yurydelendik, kripken, MatzeB, mgorny, rupprecht, llvm-commits Differential Revision: https://reviews.llvm.org/D57500 llvm-svn: 353075	2019-02-04 19:13:39 +00:00
Jordan Rupprecht	2e862c7555	[llvm-objcopy][NFC] simplify an error return llvm-svn: 353074	2019-02-04 19:09:20 +00:00
Roman Lebedev	b7ecc9b624	[X86] X86DAGToDAGISel::matchBitExtract(): prepare 'control' in 32 bits Summary: Noticed while looking at D56052. ``` // The 'control' of BEXTR has the pattern of: // [15...8 bit][ 7...0 bit] location // [ bit count][ shift] name // I.e. 0b000000011'00000001 means (x >> 0b1) & 0b11 ``` I.e. we do not care about any of the bits aside from the low 16 bits. So there is no point in doing the `slh`,`or` in 64 bits, let's just do everything in 32 bits, and anyext if needed. We could do that in 16 even, but we intentionally don't zext to i16 (longer encoding IIRC), so i'm guessing the same applies here. Reviewers: craig.topper, andreadb, RKSimon Reviewed By: craig.topper Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D56715 llvm-svn: 353073	2019-02-04 19:04:26 +00:00

1 2 3 4 5 ...

174949 Commits