llvm-project

Commit Graph

Author	SHA1	Message	Date
Stanislav Mekhanoshin	d4b500cb08	[AMDGPU] Track occupancy in MFI Keep track of achieved occupancy in SIMachineFunctionInfo. At the moment we have a lot of duplicated or even missed code to query and maintain occupancy info. Record it in the MFI and query in a single call. Interfaces: - getOccupancy() - returns current recorded achieved occupancy. - getMinAllowedOccupancy() - returns lesser of the achieved occupancy and the lowest occupancy we are ready to tolerate. For example if a kernel is memory bound we are ready to tolerate 4 waves. - limitOccupancy() - record occupancy level if we have to lower it. - increaseOccupancy() - record occupancy if scheduler managed to increase the occupancy. MFI takes care of integrating different checks affecting occupancy, including LDS use and waves-per-eu attribute. Note that scheduler starts with not yet known register pressure, so has to record either limit or increase in occupancy after it is done. Later passes can just query a resulting value. New interface is used in the active scheduler and NFC wrt its work. Changes are also made to experimental schedulers to use it and record an occupancy after they are done. Before the change waves-per-eu was ignored by experimental schedulers and tolerance window for memory bound kernels was not used. Differential Revision: https://reviews.llvm.org/D47509 llvm-svn: 333629	2018-05-31 05:36:04 +00:00
Dean Michael Berris	d1fe506694	[XRay] Fixup: Remove unnecessary type alias Follow-up to D45758. llvm-svn: 333628	2018-05-31 05:25:47 +00:00
Dean Michael Berris	16c865b071	[XRay] Fixup: Explicitly call std::make_tuple(...) Follow-up to D45758. llvm-svn: 333627	2018-05-31 05:02:11 +00:00
Craig Topper	a6dd2faaea	[X86] Make 512-bit unmasked load/store builtins more like their 128/256-bit equivalents. Previously we were just passing -1 mask to the masked builtin. This changes it to the more generic way that the 128/256 bit use. llvm-svn: 333626	2018-05-31 05:02:08 +00:00
Dean Michael Berris	ca856e07de	[XRay] Fixup: Address some warnings breaking build Follow-up to D45758. llvm-svn: 333625	2018-05-31 04:55:11 +00:00
Dean Michael Berris	1eb8c206cd	[XRay][profiler] Part 3: Profile Collector Service Summary: This is part of the larger XRay Profiling Mode effort. This patch implements a centralised collector for `FunctionCallTrie` instances, associated per thread. It maintains a global set of trie instances which can be retrieved through the XRay API for processing in-memory buffers (when registered). Future changes will include the wiring to implement the actual profiling mode implementation. This central service provides the following functionality: * Posting a `FunctionCallTrie` associated with a thread, to the central list of tries. * Serializing all the posted `FunctionCallTrie` instances into in-memory buffers. * Resetting the global state of the serialized buffers and tries. Depends on D45757. Reviewers: echristo, pelikan, kpw Reviewed By: kpw Subscribers: llvm-commits, mgorny Differential Revision: https://reviews.llvm.org/D45758 llvm-svn: 333624	2018-05-31 04:33:52 +00:00
Jan Vesely	f5016b79a6	AMDGPU/R600: Make sure functions are cacheline aligned v2: use "ensureAlignment" make functions cache line aligned Fixes GPU hangs since r333219: "AMDGPU: Split R600 AsmPrinter code into its own class" Differential Revision: https://reviews.llvm.org/D47516 llvm-svn: 333622	2018-05-31 04:08:08 +00:00
Tobias Grosser	ce27773a8e	Update isl to isl-0.19-173-g77fe2538 Besides other changes, this update introduces functions to translate a maps and sets into lists of their elements. These lists are useful as we can define iterators for lists, which allow us to replace many uses of foreach. llvm-svn: 333621	2018-05-31 03:59:05 +00:00
Joel E. Denny	fc01dd281d	[lit] Terminate ": RUN at line N" with ";" not "&&" This fixes projects/compiler-rt/test/fuzzer/sigusr.test, which was broken by r333614. The trouble was that "&&" changes the command for which "$!" gives the pid. llvm-svn: 333620	2018-05-31 03:40:37 +00:00
Roman Tereshin	5952576de5	[GlobalISel][Legalizer] LegalizerInfo verifier: Making LegalizerInfo::verify(...) errors fatal Reviewers: aemerson, qcolombet Reviewed By: qcolombet Differential Revision: https://reviews.llvm.org/D46339 llvm-svn: 333619	2018-05-31 01:56:07 +00:00
Roman Tereshin	5a65eb75c7	[GlobalISel][AArch64] LegalizerInfo verifier: Fixing bugs exposed by LegalizerInfo::verify(...) Reviewers: aemerson, qcolombet Reviewed By: qcolombet Differential Revision: https://reviews.llvm.org/D46339 llvm-svn: 333618	2018-05-31 01:56:05 +00:00
Tim Shen	f811de484c	[X86] Fix wrong intrinsic semantic. llvm-svn: 333617	2018-05-31 01:51:07 +00:00
Kostya Serebryany	980e45fe55	[libFuzzer] add collect_data_flow.py that allows to run the data-flow tracer several times on subsets of inputs bytes, to overcome DFSan out-of-label failures llvm-svn: 333616	2018-05-31 01:27:07 +00:00
Craig Topper	cbf3929bc9	[X86] Fix some places where macro arguments to intrinsics weren't cast to _m512(i\|d)/_m256(i\|d/_m128(i\|d) first. The majority of the cases were correct. This fixes the few that weren't. I also removed some superfluous parentheses in non-macros that confused by attempts at grepping for missing casts. llvm-svn: 333615	2018-05-31 01:24:40 +00:00
Joel E. Denny	31b373963f	[lit] Report line number for failed RUN command (Relands r333584, reverted in 333592.) When debugging test failures with -vv (or -v in the case of the internal shell), this makes it easier to locate the RUN line that failed. For example, clang's test/Driver/linux-ld.c has 892 total RUN lines, and clang's test/Driver/arm-cortex-cpus.c has 424 RUN lines after concatenation for line continuations. When reading the generated shell script, this also makes it easier to locate the RUN line that produced each command. To support reporting RUN line numbers in the case of the internal shell, this patch extends the internal shell to support the null command, ":", except pipelines are not supported. To support reporting RUN line numbers in the case of windows cmd.exe as the external shell, this patch extends -vv to set "echo on" instead of "echo off" in bat files. (Support for windows cmd.exe as a lit external shell will likely be dropped later, but I found out too late.) Reviewed By: delcypher, asmith, stella.stamenova, jmorse, lebedev.ri, rnk Differential Revision: https://reviews.llvm.org/D44598 llvm-svn: 333614	2018-05-31 00:55:32 +00:00
Craig Topper	c633867944	[X86] Remove __extension__ from macro intrinsics when its not needed. I think this is a holdover from when we used to declare variables inside the macros. And then its been copy and pasted forward for years every time a new macro intrinsic gets added. Interestingly this caused some tests for IRGen to be slightly more optimized. We now return a zeroinitializer directly instead of going through a store+load. It also removed a bogus error message on another test. llvm-svn: 333613	2018-05-31 00:51:20 +00:00
George Karpenkov	7744c7f137	[analyzer] Trust _Nonnull annotations, and trust analyzer knowledge about receiver nullability Previously, the checker was using the nullability of the expression, which is nonnull IFF both receiver and method are annotated as _Nonnull. However, the receiver could be known to the analyzer to be nonnull without being explicitly marked as _Nonnull. rdar://40635584 Differential Revision: https://reviews.llvm.org/D47510 llvm-svn: 333612	2018-05-31 00:28:13 +00:00
Sanjay Patel	e5bc441791	[InstCombine] don't change the size of a select if it would mismatch its condition operands' sizes Don't always: cast (select (cmp x, y), z, C) --> select (cmp x, y), (cast z), C' This is something that came up as far back as D26556, and I lost track of it. I suspect that this transform is part of the underlying problem that is inspiring some of the recent proposals that seek to match larger patterns that include a cast op. Even if that's not true, this transform causes problems for codegen (particularly with vector types). A transform to actively match the size of cmp and select operand sizes should follow. This patch just removes the harmful canonicalization in the other direction. Differential Revision: https://reviews.llvm.org/D47163 llvm-svn: 333611	2018-05-31 00:16:58 +00:00
Sanjay Patel	ceb595b04e	[InstCombine] don't negate constant expression with fsub (PR37605) X + (-C) would be transformed back into X - C, so infinite loop: https://bugs.llvm.org/show_bug.cgi?id=37605 llvm-svn: 333610	2018-05-30 23:55:12 +00:00
Vedant Kumar	61763b65af	[Coverage] Discard the last uncompleted deferred region in a decl Discard the last uncompleted deferred region in a decl, if one exists. This prevents lines at the end of a function containing only whitespace or closing braces from being marked as uncovered, if they follow a region terminator (return/break/etc). The previous behavior was to heuristically complete deferred regions at the end of a decl. In practice this ended up being too brittle for too little gain. Users would complain that there was no way to reach full code coverage because whitespace at the end of a function would be marked uncovered. rdar://40238228 Differential Revision: https://reviews.llvm.org/D46918 llvm-svn: 333609	2018-05-30 23:35:44 +00:00
Vedant Kumar	e3c1fb8b12	[llvm-cov] Use the new PrintHTMLEscaped utility This removes some duplicate logic to escape characters in HTML output. llvm-svn: 333608	2018-05-30 23:35:14 +00:00
Rui Ueyama	1c6961d3ba	Add "(default)" to default options This improves the help message shown for `ld.lld --help`. Differential Revision: https://reviews.llvm.org/D47562 llvm-svn: 333607	2018-05-30 23:32:41 +00:00
Richard Smith	d9f2e0783a	[www] Update C++ status to cover P0620. While here, mark three-way comparison as in progress and bump "Clang 6" items from yellow to green. llvm-svn: 333606	2018-05-30 23:30:36 +00:00
Tom Stellard	c7624317d7	AMDGPU: Split AMDGPUTTI into GCNTTI and R600TTI Reviewers: arsenm, nhaehnle Reviewed By: arsenm Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D47359 llvm-svn: 333605	2018-05-30 22:55:35 +00:00
Vlad Tsyrklevich	178fdb1a3b	[LowerTypeTests] Discard extern_weak linkage for definitions Summary: Fix PR37625. It's possible for an extern_weak declaration to be emitted to the merged module when a definition exists in the ThinLTO portion of the build; discard the linkage on the declaration in that case. (otherwise we copy the linkage to the alias to the jumptable and fail) Reviewers: pcc Reviewed By: pcc Subscribers: mehdi_amini, llvm-commits, kcc Differential Revision: https://reviews.llvm.org/D47494 llvm-svn: 333604	2018-05-30 22:39:52 +00:00
Craig Topper	73d1d403e2	[X86] Use C style comments in intrinsic headers for overall consistency. Most of the origial comments used C style /* */ comments, but some C++ // comments had snuck in over time. Still need to convert all the doxygen comments. Which is much harder to do. llvm-svn: 333603	2018-05-30 22:33:21 +00:00
Peter Collingbourne	ac94ca54c5	IRGen: Rename bitsets -> type metadata. NFC. "Type metadata" is the term that we've been using for the CFI-related information on vtables for a while now. llvm-svn: 333602	2018-05-30 22:29:08 +00:00
George Burgess IV	485762ccba	[NewGVN] Fix set comparison; reflow comment Looks like we intended to compare this->Members with Other->Members here, but ended up comparing this->Members with this->Members. Oops. :) Since CongruenceClass::Members is a SmallPtrSet anyway, we can probably skip building std::sets if we're willing to write a bit more code. This appears to be no functional change (for sufficiently lax values of "no"): this equality check was only being called inside of an assert. So, worst case, we'll catch more bugs in the form of assertion failures. Thanks to d0k for noting this! llvm-svn: 333601	2018-05-30 22:24:08 +00:00
Peter Collingbourne	e2a20b1b29	AST: Remove an unused ctor. NFC. llvm-svn: 333600	2018-05-30 22:14:17 +00:00
Richard Smith	e4899c1648	PR37631: verify that a member deduction guide has the same access as its template. llvm-svn: 333599	2018-05-30 22:13:43 +00:00
Peter Collingbourne	e863297775	AST: Remove an unused function. NFC. llvm-svn: 333598	2018-05-30 22:10:07 +00:00
Roman Tereshin	8f1753e994	[GlobalISel][AArch64] LegalizerInfo verifier: Adding LegalizerInfo::verify(...) call w/o fixing bugs This is to make it clear what kind of bugs the LegalizerInfo::verifier is able to catch and test its output Reviewers: aemerson, qcolombet Reviewed By: aemerson Differential Revision: https://reviews.llvm.org/D46338 llvm-svn: 333597	2018-05-30 22:10:04 +00:00
Rui Ueyama	eea690dae5	Simplify `ld.lld --help` message. Previously, we printed out two lines of help messages for `--foo bar` and `--foo=bar` like this: --soname=<value> Set DT_SONAME --soname <value> Set DT_SONAME --sort-section=<value> Specifies sections sorting rule when linkerscript is used --sort-section <value> Specifies sections sorting rule when linkerscript is used This change eliminates duplicate lines that doesn't contain `=` for such options like this. --soname=<value> Set DT_SONAME --sort-section=<value> Specifies sections sorting rule when linkerscript is used Differential Revision: https://reviews.llvm.org/D47558 llvm-svn: 333596	2018-05-30 21:25:53 +00:00
Reid Kleckner	b54ac414d1	[asan] Remove unneeded VirtualQuery from exception handler We don't use the result of the query, and all tests pass if I remove it. During startup, ASan spends a fair amount of time in this handler, and the query is much more expensive than the call to commit the memory. llvm-svn: 333595	2018-05-30 21:21:18 +00:00
Eric Christopher	5b91350b4a	Add fopen to the list of builtins that we check and whitelist. llvm-svn: 333594	2018-05-30 21:11:45 +00:00
Craig Topper	63ec0ea7bc	[X86] Add __extension__ to a bunch of places in our intrinsic headers that fail if you run it through -pedantic -ansi. All of these are lines that create a 'compound literal' to concatenate elements together. llvm-svn: 333593	2018-05-30 21:08:27 +00:00
Joel E. Denny	71792c741e	Revert r333584: [lit] Report line number for failed RUN command It breaks test-suite. llvm-svn: 333592	2018-05-30 21:07:27 +00:00
Florian Hahn	75e87c3f2a	[TableGen] Avoid leaking TreePatternNodes by using shared_ptr. By using std::shared_ptr for TreePatternNode, we can avoid leaking them. Reviewers: craig.topper, dsanders, stoklund, tstellar, zturner Reviewed By: dsanders Differential Revision: https://reviews.llvm.org/D47463 llvm-svn: 333591	2018-05-30 21:00:18 +00:00
Jonas Devlieghere	50603518a0	[ADT] Add unit test for PrintHTMLEscaped Add unit tests for PrintHTMLEscaped which was added in r333565. llvm-svn: 333590	2018-05-30 20:47:18 +00:00
Richard Smith	2600c63d96	PR34520: after instantiating a non-templated member deduction guide, don't forget to push it into the class scope. llvm-svn: 333589	2018-05-30 20:24:10 +00:00
Daniel Neilson	936d50aeea	[IRBuilder] Add APIs for creating calls to atomic memmove and memset intrinsics. (NFC) Summary: Creating the IRBuilder methods: CreateElementUnorderedAtomicMemSet CreateElementUnorderedAtomicMemMove These mirror the methods that create calls to the regular (non-atomic) memmove and memset intrinsics. llvm-svn: 333588	2018-05-30 20:02:56 +00:00
Richard Smith	5105573041	As discussed with SG10, bump version of __cpp_deduction_guides macro to indicate support for P0620R0. llvm-svn: 333587	2018-05-30 19:54:52 +00:00
Simon Pilgrim	159bd7444e	Fix Wdocumentation warning. NFCI. llvm-svn: 333586	2018-05-30 19:50:26 +00:00
Vedant Kumar	f3b6d2930d	[lldb-test] ir-memory-map: Avoid accessing a bad iterator Do not access Probe.start() when Probe is at the end of the interval map. llvm-svn: 333585	2018-05-30 19:46:47 +00:00
Joel E. Denny	b6423479a1	[lit] Report line number for failed RUN command (Relands r330755 (reverted in r330848) with fix for PR37239.) When debugging test failures with -vv (or -v in the case of the internal shell), this makes it easier to locate the RUN line that failed. For example, clang's test/Driver/linux-ld.c has 892 total RUN lines, and clang's test/Driver/arm-cortex-cpus.c has 424 RUN lines after concatenation for line continuations. When reading the generated shell script, this also makes it easier to locate the RUN line that produced each command. To support reporting RUN line numbers in the case of the internal shell, this patch extends the internal shell to support the null command, ":", except pipelines are not supported. To support reporting RUN line numbers in the case of windows cmd.exe as the external shell, this patch extends -vv to set "echo on" instead of "echo off" in bat files. (Support for windows cmd.exe as a lit external shell will likely be dropped later, but I found out too late.) Reviewed By: delcypher, asmith, stella.stamenova, jmorse, lebedev.ri, rnk Differential Revision: https://reviews.llvm.org/D44598 llvm-svn: 333584	2018-05-30 19:42:27 +00:00
Vedant Kumar	c1cd826248	[lldb-test] Add a testing harness for the JIT's IRMemoryMap This teaches lldb-test how to launch a process, set up an IRMemoryMap, and issue memory allocations in the target process through the map. This makes it possible to test IRMemoryMap in a targeted way. This has uncovered two bugs so far. The first bug is that Malloc performs an adjustment on the pointer returned from AllocateMemory (for alignment purposes) which ultimately allows overlapping memory regions to be created. The second bug is that after most of the address space on the host side is exhausted, Malloc may return the same address multiple times. These bugs (and hopefully more!) can be uncovered and tested for with targeted lldb-test commands. At an even higher level, the motivation for addressing these bugs is that they can lead to strange user-visible failures (e.g, variables assume the wrong value during expression evaluation, or the debugger crashes). See my third comment on this swift-lldb PR for an example: https://github.com/apple/swift-lldb/pull/652 I hope lldb-test is the right place to add this testing harness. Setting up a gtest-style unit test proved too cumbersome (you need to recreate or mock way too much debugger state), as did writing end-to-end tests (it's hard to write a test that actually hits a buggy path). With lldb-test, it's easy to read/generate the test input and parse the test output. I'll attach a simple "fuzz" tester which generates failing test cases to the Phab review. Here's an example: ``` Command: malloc(size=1024, alignment=32) Malloc: address = 0xca000 Command: malloc(size=64, alignment=16) Malloc: address = 0xca400 Command: malloc(size=1024, alignment=16) Malloc: address = 0xca440 Command: malloc(size=16, alignment=8) Malloc: address = 0xca840 Command: malloc(size=2048, alignment=16) Malloc: address = 0xcb000 Command: malloc(size=64, alignment=32) Malloc: address = 0xca860 Command: malloc(size=1024, alignment=16) Malloc: address = 0xca890 Malloc error: overlapping allocation detected, previous allocation at [0xca860, 0xca8a0) ``` {F6288839} Differential Revision: https://reviews.llvm.org/D47508 llvm-svn: 333583	2018-05-30 19:39:10 +00:00
Benjamin Kramer	c8bd5449e0	[CalledValuePropagation] Just use a sorted vector instead of a set. The set properties are never used, so a vector is enough. No functionality change intended. While there add some std::moves to SparseSolver. llvm-svn: 333582	2018-05-30 19:31:11 +00:00
Peter Collingbourne	1651ac13be	llvm-objcopy: Set sh_link to 0 on unrecognized symtab-linked sections. Per discussion on the generic-abi mailing list: https://groups.google.com/forum/#!topic/generic-abi/MPr8TVtnVn4 An object file manipulation tool must either write out a symbol table with the same number of entries as the original symbol table and in the same order, or if this is impossible, refuse to operate on the object file if it has unrecognized sections that are linked to the symtab section. However, existing tools (namely GNU strip, GNU objcopy and ld.{bfd,gold,lld} -r) do not comply with this at present: they change symbol table indexes and set sh_link to 0 on the unrecognized symtab-linked sections. We intend to use the latter as a (temporary) signal that a tool has operated on a proposed new symtab-linked section and invalidated the symbol table indexes. However, llvm-objcopy currently keeps sh_link pointing to the new symtab section. This patch changes llvm-objcopy to set sh_link to 0 to match the behaviour of the other tools. Differential Revision: https://reviews.llvm.org/D47404 llvm-svn: 333581	2018-05-30 19:30:39 +00:00
Simon Pilgrim	5e9f459c62	[X86][SSE] Pulled out splat detection helper from LowerScalarVariableShift (NFCI) Created the IsSplatValue helper from the splat detection code in LowerScalarVariableShift as a first NFC step towards improving support for splat rotations, which is an extension of PR37426. llvm-svn: 333580	2018-05-30 19:16:59 +00:00
Galina Kistanova	df917811ca	Reverted r333424 as it broke multiple build bots and left unfixed for a long time llvm-svn: 333578	2018-05-30 18:51:08 +00:00

... 3 4 5 6 7 ...

290978 Commits All Branches Search

290978 Commits

All Branches