llvm-project

Commit Graph

Author	SHA1	Message	Date
Frederik Gossen	c2c65585c5	[MLIR] Fix `isValidIndex` Differential Revision: https://reviews.llvm.org/D100635	2021-04-16 14:58:54 +02:00
Hansang Bae	9b98497b44	[OpenMP] Add omp_target_is_accessible() to header files -- Added omp_target_is_accessible to the header files -- Added missing const qualifier to device memory routines Differential Revision: https://reviews.llvm.org/D100420	2021-04-16 07:54:15 -05:00
Sanjay Patel	bb907b26e2	[ValueTracking] don't recursively compute known bits using multiple llvm.assumes This is an alternative to D99759 to avoid the compile-time explosion seen in: https://llvm.org/PR49785 Another potential solution would make the exclusion logic stronger to avoid blowing up, but note that we reduced the complexity of the exclusion mechanism in D16204 because it was too costly. So I'm questioning the need for recursion/exclusion entirely - what is the optimization value vs. cost of recursively computing known bits based on assumptions? This was built into the implementation from the start with `60db058`, and we have kept adding code/cost to deal with that capability. By clearing the query's AssumptionCache inside computeKnownBitsFromAssume(), this patch retains all existing assume functionality except refining known bits based on even more assumptions. We have 1 regression test that shows a difference in optimization power. Differential Revision: https://reviews.llvm.org/D100573	2021-04-16 08:43:35 -04:00
Roman Lebedev	b06c55a698	[X86][CostModel] Fix cost model for non-power-of-two vector load/stores Sometimes LV has to produce really wide vectors, and sometimes they end up being not powers of two. As it can be seen from the diff, the cost computation is currently completely non-sensical in those cases. Instead of just scalarizing everything, split/factorize the wide vector into a number of subvectors, each one having a power-of-two elements, recurse to get the cost of op on this subvector. Also, check how we'd legalize this subvector, and if the legalized type is scalar, also account for the scalarization cost. Note that for sub-vector loads, we might be able to do better, when the vectors are properly aligned. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D100099	2021-04-16 15:30:57 +03:00
Abhina Sreeskantharajan	3be2ba0ba3	[SystemZ][z/OS][Windows] Add new functions that set Text/Binary mode for Stdin and Stdout based on OpenFlags On Windows, we want to open a file in Binary mode if OF_CRLF bit is not set. On z/OS, we want to open a file in Binary mode if the OF_Text bit is not set. This patch creates two new functions called ChangeStdinMode and ChangeStdoutMode which will take OpenFlags as an arg to determine which mode to set stdin and stdout to. This will enable patches like https://reviews.llvm.org/D100056 to not affect Windows when setting the OF_Text flag for raw_fd_streams. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D100130	2021-04-16 08:09:19 -04:00
Nigel Perks	23f8993f32	Restore lit feature object-emission. Omit DebugInfo/Generic on XCore. D73568 removed the lit feature object-emission, because it was introduced for a target which did not support the integrated assembler, and that target no longer required the feature. XCore still does not support the integrated assembler, so a build with XCore as the default target fails tests requiring object-emission. This issue was not publicly visible because there was not a buildbot for XCore as the default target. We fixed the failures downstream. We now have builder clang-xcore-ubuntu-20-x64 on the staging buildmaster, which shows the failures. We would like to make upstream build green. Omit DebugInfo/Generic on XCore to avoid annotating 70 separate files. Differential Revision: https://reviews.llvm.org/D98508	2021-04-16 13:02:14 +01:00
Frederik Gossen	3a5a610e27	[MLIR][Shape] Expose `getShapeVec` and add support for extent tensors Differential Revision: https://reviews.llvm.org/D100636	2021-04-16 13:59:20 +02:00
Nico Weber	1ede08a290	[llvm-objcopy] clang-format a line	2021-04-16 07:24:43 -04:00
Florian Hahn	31b5c2b1d2	[SimplifyCFG] Regenerate CHECK lines and add test for PR49982.	2021-04-16 12:05:54 +01:00
Caroline Concatto	394eb91854	[NFC][AArch64][SVE] Move select-sve.ll tests to sve-select.ll This patch merges the two select tests: select-sve.ll and sve-select.ll into sve-select.ll as they are both testing SELECT instruction	2021-04-16 11:59:53 +01:00
David Green	00a6045473	[ARM] Combine sub 0, csinc X, Y, CC -> csinv -X, Y, CC Combine sub 0, csinc X, Y, CC to csinv -X, Y, CC providing that the negation of X is cheap, currently just handling constants. This comes up during the splat of an i1 to a predicate, where we now generate csetm, as opposed to cset; rsb. Differential Revision: https://reviews.llvm.org/D99940	2021-04-16 11:52:31 +01:00
Fraser Cormack	ec0f7c6923	[RISCV] Rerun stack test through update_llc_test_checks.py Adjusts formatting of comments only. Just to reduce diffs in future patches.	2021-04-16 11:08:58 +01:00
Simon Pilgrim	2a1a2f5733	[CostModel][X86] Add fully aligned load/store tests As noted on D100099, if these illegal vector types are suitably aligned they should be much cheaper to load (but probably not store).	2021-04-16 10:35:40 +01:00
Pushpinder Singh	efc013ec4d	Revert "[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed" This reverts commit `7029cffc4e`.	2021-04-16 09:16:58 +00:00
LemonBoy	7c6f177477	[lld] Fix test crashing when AVR target is missing Fixes buildbot error.	2021-04-16 11:12:29 +02:00
Simon Moll	fda078bffb	[docs] Add vector predication call Add the syncup call to the table Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D100474	2021-04-16 10:49:34 +02:00
Nicolas Vasilache	b5f3a128bf	[mlir][Python][Linalg] Add support for captures in body builder. When Linalg named ops support was added, captures were omitted from the body builder. This revision adds support for captures which allows us to write FillOp in a more idiomatic fashion using the _linalg_ops_ext mixin support. This raises an issue in the generation of `_linalg_ops_gen.py` where ``` @property def result(self): return self.operation.results[0] if len(self.operation.results) > 1 else None ```. The condition should be `== 1`. This will be fixed in a separate commit. Differential Revision: https://reviews.llvm.org/D100363	2021-04-16 08:47:26 +00:00
LemonBoy	7a781fb692	[LLD][ELF][AVR] Propagate ELF flags to the linked image The `e_flags` for a ELF file targeting the AVR ISA contains two fields at the time of writing: - A 7-bit integer field specifying the ISA revision being targeted - A 1-bit flag specifying whether the object files being linked are suited for applying the relaxations at link time The linked ELF file is blessed with the arch revision shared among all the files. The behaviour in case of mismatch is purposefully different than the one implemented in libbfd: LLD will raise a fatal error while libbfd silently picks a default value of `avr2`. The relaxation-ready flag is handled as done by libbfd, in order for it to appear in the linked object every source object must be tagged with it. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D99754	2021-04-16 10:40:18 +02:00
Max Sagebaum	fd4e08aa8f	[clang-format] Inconsistent behavior regarding line break before access modifier Fixes https://llvm.org/PR41870. Checks for newlines in option Style.EmptyLineBeforeAccessModifier are now based on the formatted new lines and not on the new lines in the file. Reviewed By: HazardyKnusperkeks, curdeius Differential Revision: https://reviews.llvm.org/D99503	2021-04-16 10:39:13 +02:00
Nicolas Vasilache	8cf650c554	[mlir][linalg] Add support for WAW fusion on tensors. Differential Revision: https://reviews.llvm.org/D100603	2021-04-16 08:22:09 +00:00
Guillaume Chatelet	907b52d1a7	[libc] Fix typo	2021-04-16 08:09:28 +00:00
Guillaume Chatelet	f6b6568536	[libc] Add slice/take/drop methods to ArrayRef Add various methods from llvm::ArrayRef. Refactor implementation to remove code duplication. Differential Revision: https://reviews.llvm.org/D100569	2021-04-16 07:54:48 +00:00
Nick Desaulniers	bb7016f8f5	[Aarch64] handle "o" inline asm memory constraints This Linux kernel is making use of this inline asm constraint which is causing an ICE. PR49956 Link: https://github.com/ClangBuiltLinux/linux/issues/1348 Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D100412	2021-04-15 23:36:21 -07:00
Petr Hosek	9ac988f6a8	[libcxx] Make the GDB pretty printer test less strict This is a workaround for PR48937. GDB can sometimes print additional warnings which currently fails the test. Use re.search instead of re.match to ignore this additional output. Differential Revision: https://reviews.llvm.org/D99532	2021-04-15 23:33:22 -07:00
patacca	4170d6cdd5	[Polly][Ast] Partial refactoring of IslAst and IslAstInfo to use isl++. NFC. Polly use algorithms from the Integer Set Library (isl), which is a library written in C and which is incompatible with the rest of the LLVM as it is written in C++. Changes made: - Refactoring the following methods of class `IslAst` - `getAst()` `getRunCondition()` `buildRunCondition()` - Removed the destructor in favor of the default one - Change the type of the attribute `IslAst.RunCondition` to `isl::ast_expr` - Change the type of the attribute `IslAst.Root` to `isl::ast_node` - Change the order of attributes in class `IslAst` to reflect the data dependencies so that the destructor won't complain - Refactoring the following methods of class `IslAstInfo` - `getAst()` `getRunCondition()` Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D100265	2021-04-16 00:40:26 -05:00
Pushpinder Singh	7029cffc4e	[AMDGPU][OpenMP] Add amdgpu-arch tool to list AMD GPUs installed This patch adds new clang tool named amdgpu-arch which uses HSA to detect installed AMDGPU and report back latter's march. This tool is built only if system has HSA installed. The value printed by amdgpu-arch is used to fill -march when latter is not explicitly provided in -Xopenmp-target. Reviewed By: JonChesterfield, gregrodgers Differential Revision: https://reviews.llvm.org/D99949	2021-04-16 05:26:20 +00:00
Jim Lin	2893570e86	[RISCV] Don't emit save-restore call if function is a interrupt handler It has to save all caller-saved registers before a call in the handler. So don't emit a call that save/restore registers. Reviewed By: simoncook, luismarques, asb Differential Revision: https://reviews.llvm.org/D100532	2021-04-16 12:54:47 +08:00
Ahmed Taei	0e2f9b61fd	Fix tile-and-pad when padding doesn't span all dimension Without this tile-and-pad will never terminate if pad-fails. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D97720	2021-04-15 20:17:40 -07:00
Jason Molenda	9d4415d01d	Don't refer to allocation map entry after deallocating it debugserver's MachTask::DeallocateMemory when removing an allocate entry from our map (in resposne to an '_m' packet), copy the size from the entry before removing it from the map and then using the iterator to fix an ASAN error on the bots when running TestGdbRemoteMemoryAllocation.py rdar://76595998	2021-04-15 20:16:38 -07:00
Christopher Di Bella	0148b65372	[libcxx] adds `cpp17-.*iterator` concepts for iterator_traits The `iterator_traits` patch became too large for a concise review, so the "bloat" —as it were— was moved into this patch. Also tests most C++[98,17] iterator types to confirm backwards compatibility is successful (regex iterators are intentionally not present, but directory iterators are due to a peculiar error encountered while patching `iterator_traits`). Depends on D99461. Differential Revision: https://reviews.llvm.org/D99854	2021-04-16 03:14:42 +00:00
hsmahesha	099dcb68a6	[AMDGPU] Refactor ds_read/ds_write related select code for better readability. Part of the code related to ds_read/ds_write ISel is refactored, and the corresponding comment is re-written for better readability, which would help while implementing any future ds_read/ds_write ISel related modifications. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D100300	2021-04-16 08:24:00 +05:30
Mircea Trofin	0d06b14f59	[MLGO] Fix use of AM.invalidate post D100519 The ML inline advisors more aggressively invalidate certain analyses after each call site inlining, to more accurately capture the problem state.	2021-04-15 18:45:39 -07:00
Marcythm	f8cf3b9931	[LICM][NFC] Fix typo fixed some typos which may lead to misunderstandings in LICM.cpp Reviewed By: nikic, asbirlea Differential Revision: https://reviews.llvm.org/D100470	2021-04-16 09:42:00 +08:00
Juneyoung Lee	085423282d	[LangRef] formatting	2021-04-16 10:41:30 +09:00
Fangrui Song	acf7e55783	[Polly] Fix PM invalidate usage after D100519	2021-04-15 18:41:20 -07:00
LLVM GN Syncbot	68744bb479	[gn build] Port `3bc88eb392`	2021-04-16 01:16:51 +00:00
Jez Ng	4938b090cf	[lld-macho] Don't use arrays as template parameters MSVC from VSCode 2017 appears unhappy with it (causes an internal compiler error.) This also means that we need to avoid doing `sizeof(stubCode)` as `sizeof(int[N])` on function array parameters decays into `sizeof(int *)`. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D100605	2021-04-15 21:16:34 -04:00
Jez Ng	1acda12d00	[lld-macho] Make load relaxation work for arm64_32 arm64_32 uses 32-bit GOT loads, so we should accept those instructions in `ARM64Common::relaxGotLoad()` too. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D100229	2021-04-15 21:16:34 -04:00
Jez Ng	1460942c15	[lld-macho] Add 32-bit compact unwind support This could probably have been part of D99633, but I split it up to make things a bit more reviewable. I also fixed some bugs in the implementation that were masked through integer underflows when operating in 64-bit mode. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D99823	2021-04-15 21:16:33 -04:00
Jez Ng	3bc88eb392	[lld-macho] Add support for arm64_32 From what I can tell, it's pretty similar to arm64. The two main differences are: 1. No 64-bit relocations 2. Stub code writes to 32-bit registers instead of 64-bit Plus of course the various on-disk structures like `segment_command` are using the 32-bit instead of the 64-bit variants. Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D99822	2021-04-15 21:16:33 -04:00
Jez Ng	db7a413e51	[lld-macho] Re-root absolute input file paths if -syslibroot is specified Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D100147	2021-04-15 21:16:33 -04:00
Jez Ng	eb5b7d4497	[lld-macho] LTO: Unset VisibleToRegularObj where possible This allows LLVM's LTO to internalize symbols that are not referenced directly by regular objects. Naturally, this means we need to track which symbols are referenced by regular objects. The approach taken here is similar to LLD-COFF's: like the COFF port, we extend `SymbolTable::insert()` to set the isVisibleToRegularObj bit. (LLD-ELF relies on the Symbol constructor and `Symbol::mergeProperties()`, but the Mach-O port does not have a `mergeProperties()` equivalent.) From what I can tell, ld64 (which uses libLTO) doesn't do this optimization at all. I'm not even sure libLTO provides a way to do this. Not having ld64's behavior as a reference implementation is unfortunate; instead, I am relying on LLD-ELF/COFF's behavior as references while erring on the conservative side. In particular, LLD-MachO will only do this optimization for executables right now. We also don't attempt it when `-flat_namespace` is used -- otherwise we'd need scan the symbol table to find matches for every un-namespaced symbol reference, which is expensive. internalize.ll is based off the LLD-ELF tests `internalize-basic.ll` and `internalize-undef.ll`. Looks like @davide added some of LLD-ELF's internalize tests, so adding him as a reviewer... Reviewed By: #lld-macho, gkm Differential Revision: https://reviews.llvm.org/D99105	2021-04-15 21:16:33 -04:00
Richard Smith	f7c9de0de5	Add triple to fix test failure. This test uses `__regcall`, support for which is target-specific.	2021-04-15 18:08:35 -07:00
Juneyoung Lee	25e96dffac	[LangRef] fix unexepcted unindent errror	2021-04-16 09:58:55 +09:00
Juneyoung Lee	1bcadb0984	[LangRef] clarify the semantics of nocapture This patch clarifies the semantics of nocapture attribute. A 'Pointer Capture' subsection is added to describe the semantics of pointer capture first. For the nocapture example with two same pointer arguments, it is consistent with the semantics that Alive2 used to run lit tests. Reviewed By: nlopes Differential Revision: https://reviews.llvm.org/D97924	2021-04-16 09:48:42 +09:00
George Balatsouras	98b114d480	[dfsan] Remove hard-coded constant in release_shadow_space.c Reviewed By: stephan.yichao.zhao Differential Revision: https://reviews.llvm.org/D100608	2021-04-15 17:24:35 -07:00
Caroline Tice	042668d092	Revert "[LLDB] Use path relative to binary for finding .dwo files." This reverts commit `b241f3cb29`. Test case is breaking windows builder.	2021-04-15 17:17:44 -07:00
Joshua Haberman	8344675908	Implemented [[clang::musttail]] attribute for guaranteed tail calls. This is a Clang-only change and depends on the existing "musttail" support already implemented in LLVM. The [[clang::musttail]] attribute goes on a return statement, not a function definition. There are several constraints that the user must follow when using [[clang::musttail]], and these constraints are verified by Sema. Tail calls are supported on regular function calls, calls through a function pointer, member function calls, and even pointer to member. Future work would be to throw a warning if a users tries to pass a pointer or reference to a local variable through a musttail call. Reviewed By: rsmith Differential Revision: https://reviews.llvm.org/D99517	2021-04-15 17:12:21 -07:00
Christopher Di Bella	f280505aa0	[libcxx] adds `std::indirectly_readable_traits` to <iterator> Implements parts of: * P0896R4 The One Ranges Proposal * LWG3446 `indirectly_readable_traits` ambiguity for types with both `value_type` and `element_type` Depends on D99141. Differential Revision: https://reviews.llvm.org/D99461	2021-04-15 23:59:02 +00:00
Arthur Eubanks	9c776c2fa2	[NFC][NewPM] Remove some AnalysisManager invalidate methods These were misleading, they're more of a "clear" than an "invalidate". We shouldn't be individually clearing analysis results. Either we clear all analyses when some IR becomes invalid, or we properly go through invalidation. There was only one use of this, which can be simulated with AM.invalidate(F, PA). Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D100519	2021-04-15 16:51:26 -07:00

1 2 3 4 5 ...

385730 Commits All Branches Search

385730 Commits

All Branches