llvm-project

Commit Graph

Author	SHA1	Message	Date
Alexandre Ganea	199c397482	Revert "[clang-scan-deps] Add support for clang-cl" This reverts commit `bb26fa8c28`.	2021-04-19 17:45:18 -04:00
Sanjay Patel	152efbc19a	[PhaseOrdering] add test to show unintended code sinking; NFC See D87479 for discussion.	2021-04-19 17:30:23 -04:00
Ricky Taylor	2221185776	[M68k] Implement Disassembler This is an implementation of a disassembler for M68k. Differential Revision: https://reviews.llvm.org/D98540	2021-04-19 22:24:12 +01:00
Ricky Taylor	6de262827c	[M68k] Change printing of absolute memory references This also includes PC-relative addresses since they are still referenced as absolute addresses in assembly and converted to relative addresses by the assembler. This changes, for example: - `bra #-2` -> `bra $100` - `jsr #16` -> `jsr $10` Differential Revision: https://reviews.llvm.org/D100697	2021-04-19 22:24:12 +01:00
Alexey Bataev	8030481065	Revert "[SLP]Add detection of shuffled/perfect matching of tree entries." This reverts commit `d6fde91379` to fix compiler crashes.	2021-04-19 14:10:04 -07:00
Zequan Wu	e28435caf6	[ThinLTO] Copy UnnamedAddr when spliting module. The unnamedaddr property of a function is lost when using `-fwhole-program-vtables` and thinlto which causes size increase under linker's safe icf mode. The size increase of chrome on Linux when switching from all icf to safe icf drops from 5 MB to 3 MB after this change, and from 6 MB to 4 MB on Windows. There is a repro: ``` # a.h struct A { virtual int f(); virtual int g(); }; # a.cpp #include "a.h" int A::f() { return 10; } int A::g() { return 10; } # main.cpp #include "a.h" int g(A* a) { return a->f(); } int main(int argv, char** args) { A a; return g(&a); } $ clang++ -O2 -ffunction-sections -flto=thin -fwhole-program-vtables -fsplit-lto-unit -c main.cpp -o main.o && clang++ -Wl,--icf=safe -fuse-ld=lld -flto=thin main.o -o a.out && llvm-readobj -t a.out \| grep -A 1 -e _ZN1A1fEv -e _ZN1A1gEv Name: _ZN1A1fEv (480) Value: 0x201830 -- Name: _ZN1A1gEv (490) Value: 0x201840 ``` Differential Revision: https://reviews.llvm.org/D100498	2021-04-19 14:04:58 -07:00
Emily Shi	cc2b62a06e	[compiler-rt] assert max virtual address is <= mmap range size If these sizes do not match, asan will not work as expected. If possible, assert at compile time that the vm size is less than or equal to mmap range. If a compile time assert is not possible, check at run time (for iOS) rdar://76477969 Reviewed By: delcypher, yln Differential Revision: https://reviews.llvm.org/D100239	2021-04-19 14:01:07 -07:00
Alexey Bataev	d6fde91379	[SLP]Add detection of shuffled/perfect matching of tree entries. SLP supports perfect diamond matching for the vectorized tree entries but do not support it for gathered entries and does not support non-perfect (shuffled) matching with 1 or 2 tree entries. Patch adds support for this matching to improve cost of the vectorized tree. Differential Revision: https://reviews.llvm.org/D100495	2021-04-19 13:29:30 -07:00
David Penry	ca8eef7e3d	[CodeGen] Use ProcResGroup information in SchedBoundary When the ProcResGroup has BufferSize=0, 1. if there is a subunit in the list of write resources for the scheduling class, do not attempt to schedule the ProcResGroup. 2. if there is not a subunit in the list of write resources for the scheduling class, choose a subunit to use instead of the ProcResGroup. 3. having both the ProcResGroup and any of its subunits in the resources implied by a InstRW is not supported. Used to model parallel uses from a pool of resources. Differential Revision: https://reviews.llvm.org/D98976	2021-04-19 21:27:45 +01:00
David Penry	78a871abf7	[ARM] Use ProcResGroup in Cortex-M7 scheduling model Used to model structural hazards on FP issue, where some instructions take up 2 issue slots and others one as well as similar structural hazards on load issue, where some instructions take up two load lanes and others one. Differential Revision: https://reviews.llvm.org/D98977	2021-04-19 21:23:05 +01:00
Philip Reames	3c54762226	[funcattrs] Consistently check call site attributes This is mostly stylistic cleanup after D100226, but not entirely. When skimming the code, I found one case where we weren't accounting for attributes on the callsite at all. I'm also suspicious we had some latent bugs related to operand bundles (which are supposed to be able to override attributes on declarations), but I don't have concrete test cases for those, just suspicions. Aside: The only case left in the file which directly checks attributes on the declaration is the norecurse logic. I left that because I didn't understand it; it looks obviously wrong, so I suspect I'm misinterpreting the intended semantics of the attribute. Differential Revision: https://reviews.llvm.org/D100689	2021-04-19 13:20:50 -07:00
Stephen Kelly	782c3e23ba	[AST] Fix comparison to of SourceRanges in container Differential Revision: https://reviews.llvm.org/D100723	2021-04-19 21:19:21 +01:00
Philip Reames	01801d5274	[rs4gc] Fix a latent bug around attribute stripping for intrinsics This change fixes a latent bug which was exposed by a change currently in review (https://reviews.llvm.org/D99802#2685032). The story on this is a bit involved. Without this change, what ended up happening with the pending review was that we'd strip attributes off intrinsics, and then selectiondag would fail to lower the intrinsic. Why? Because the lowering of the intrinsic relies on the presence of the readonly attribute. We don't have a matcher to select the case where there's a glue node needed. Now, on the surface, this still seems like a codegen bug. However, here it gets fun. I was unable to reproduce this with a standalone test at all, and was pretty much struck until skatkov provided the critical detail. This reproduces only when RS4GC and codegen are run in the same process and context. Why? Because it turns out we can't roundtrip the stripped attribute through serialized IR! We'll happily print out the missing attribute, but when we parse it back, the auto-upgrade logic has a side effect of blindly overwriting attributes on intrinsics with those specified in Intrinsics.td. This makes it impossible to exercise SelectionDAG from a standalone test case. At this point, I decided to treat this an RS4GC bug as a) we don't need to strip in this case, and b) I could write a test which shows the correct behavior to ensure this doesn't break again in the future. As an aside, I'd originally set out to handle libfuncs too - since in theory they might have the same issues - but backed away quickly when I realized how the semantics of builtin, nobuiltin, and no-builtin-x all interacted. I'm utterly convinced that no part of the optimizer handles that correctly, and decided not to open that can of worms here.	2021-04-19 13:14:07 -07:00
Nikita Popov	9423f78240	[InstCombine] Fold multiuse shr eq zero The single-use case is handled implicity by converting the icmp into a mask check first. When comparing with zero in particular, we don't need the one-use restriction, as we only produce a single icmp. https://alive2.llvm.org/ce/z/MSixcm https://alive2.llvm.org/ce/z/GwpG0M	2021-04-19 22:13:11 +02:00
Nikita Popov	3d385cc90e	[InstCombine] Add tests for multiuse shr eq zero (NFC) The exact case is folded, the inexact one is not.	2021-04-19 22:13:11 +02:00
Stephen Kelly	abacaef181	[AST] Update introspection API to use const-ref for copyable types Differential Revision: https://reviews.llvm.org/D100720	2021-04-19 21:07:47 +01:00
Martin Storsjö	f9ddb81d79	[libcxx] [test] Ifdef out tests that rely on perms::none on directories for triggering errors On Windows, one can't use perms::none on a directory to trigger failures to read the directory entries. These remaining tests can't use GetWindowsInaccessibleDir() sensibly, e.g. for tests that rely on toggling accessibility back and forth during the test, or where the semantics of the dir provided by GetWindowsInaccessibleDir() doesn't allow for running the ifdeffed tests meaningfully. Differential Revision: https://reviews.llvm.org/D97538	2021-04-19 23:03:12 +03:00
Thomas Lively	e657c84fa1	[WebAssembly] Use v128.const instead of splats for constants We previously used splats instead of v128.const to materialize vector constants because V8 did not support v128.const. Now that V8 supports v128.const, we can use v128.const instead. Although this increases code size, it should also increase performance (or at least require fewer engine-side optimizations), so it is an appropriate change to make. Differential Revision: https://reviews.llvm.org/D100716	2021-04-19 12:43:59 -07:00
Martin Storsjö	6c5b0d6bea	[libcxx] Base MSVC autolinking on _LIBCPP_DISABLE_VISIBILITY_ANNOTATIONS Previously the decision of which library to try to autolink was based on _DLL, however the _DLL define (which is set by the compiler) is tied to whether using a dynamically linked CRT or not, and the choice of dynamic or static CRT is entirely orthogonal to whether libc++ is linked dynamically or statically. If _LIBCPP_DISABLE_VISIBILITY_ANNOTATIONS isn't defined, then all declarations are decorated with dllimport, and there's no doubt that the DLL version of the library is what must be linked. _LIBCPP_DISABLE_VISIBILITY_ANNOTATIONS is defined if building with LIBCXX_ENABLE_SHARED disabled, and thus the static library is what should be linked. If defining _LIBCPP_DISABLE_VISIBILITY_ANNOTATIONS manually but wanting to link against the DLL version of the library, that's not a canonical configuration, and then it's probably reasonable to manually define _LIBCPP_NO_AUTO_LINK too, and manually link against the desired library. This fixes, among other issues, running tests for the library if built with LIBCXX_ENABLE_STATIC disabled. Differential Revision: https://reviews.llvm.org/D100539	2021-04-19 22:42:33 +03:00
Nicolas Vasilache	1dc533cea4	[mlir][python] ExecutionEngine can dump to object file Differential Revision: https://reviews.llvm.org/D100786	2021-04-19 19:33:27 +00:00
Jonas Devlieghere	cc68799056	[lldb] Stop unsetting LLDB_DEBUGSERVER_PATH from TestLaunchProcessPosixSpawn We no longer need this after Pavel's change to automatically find debug servers to test. (`3ca7b2d`)	2021-04-19 12:28:22 -07:00
Jinsong Ji	d88d8c5b86	[PowerPC] Disable relative lookup table converter pass for AIX XCOFF hasn't implemented lowerRelativeReference. So we need to disable new pass introduced by https://reviews.llvm.org/D94355 for AIX for now. Reviewed By: gulfem Differential Revision: https://reviews.llvm.org/D100584	2021-04-19 19:28:11 +00:00
Jonas Devlieghere	a7712091ea	[lldb] Update breakpoint_function_callback.test for different error message Adjust for the Lua error message printed by Lua 5.4.3.	2021-04-19 12:23:23 -07:00
Jonas Devlieghere	f7414759d7	[lldb] Print the fixed address if symbolication fails in DumpDataExtractor When formatting memory with as eFormatAddressIn and symbolication fails, fix the code address and print the symbol it points to, if any.	2021-04-19 12:23:23 -07:00
Emily Shi	94ba3b6e3b	[compiler-rt][asan] use full vm range on apple silicon macs We previously shrunk the mmap range size on ios, but those settings got inherited by apple silicon macs. Don't shrink the vm range on apple silicon Mac since we have access to the full range. Also don't shrink vm range for iOS simulators because they have the same range as the host OS, not the simulated OS. rdar://75302812 Reviewed By: delcypher, kubamracek, yln Differential Revision: https://reviews.llvm.org/D100234	2021-04-19 12:12:26 -07:00
madhur13490	6a4d9cb7e0	[AMDGPU] Remove error check for indirect calls and add missing queue-ptr This patch removes -fixed-abi check for indirect calls and also adds queue-ptr which is required for indirect calls to work. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D100633	2021-04-20 00:35:17 +05:30
Pavel Iliin	2ec16103c6	[AArch64] Peephole rule to remove redundant cmp after cset. Comparisons to zero or one after cset instructions can be safely removed in examples like: cset w9, eq cset w9, eq cmp w9, #1 ---> <removed> b.ne .L1 b.ne .L1 cset w9, eq cset w9, eq cmp w9, #0 ---> <removed> b.ne .L1 b.eq .L1 Peephole optimization to detect suitable cases and get rid of that comparisons added. Differential Revision: https://reviews.llvm.org/D98564	2021-04-19 19:58:38 +01:00
Yaxun (Sam) Liu	d8805574c1	[CUDA][HIP] Allow non-ODR use of host var in device Reviewed by: Artem Belevich, Richard Smith Differential Revision: https://reviews.llvm.org/D98193	2021-04-19 14:45:24 -04:00
peter klausler	71d868cf90	[flang] Define missing & needed IEEE_ARITHMETIC symbols Define IEEE_IS_NAN, IEEE_IS_FINITE, & IEEE_REM. Differential Revision: https://reviews.llvm.org/D100599	2021-04-19 11:44:43 -07:00
LLVM GN Syncbot	03b98114ce	[gn build] Port `e0adf7e06a`	2021-04-19 18:35:15 +00:00
Nikita Popov	d440f9a326	[LICM] Make capture check more precise During store promotion, we check whether the pointer was captured to exclude potential reads from other threads. However, we're only interested in captures before or inside the loop. Check this using PointerMayBeCapturedBefore against the loop header. Differential Revision: https://reviews.llvm.org/D100706	2021-04-19 20:34:23 +02:00
zoecarver	e0adf7e06a	[libc++][NFC] Move incrementable_traits and indirectly_readable_traits into separate headers. Differential Revision: https://reviews.llvm.org/D100682	2021-04-19 14:31:30 -04:00
Craig Topper	87afefcd22	[RISCV] Fix mistake in comment. NFC	2021-04-19 11:15:32 -07:00
Philip Reames	89a93889da	Update a test for auto-update format change	2021-04-19 11:14:52 -07:00
Craig Topper	7ed01a420a	[RISCV] Pad v4i1/v2i1/v1i1 stores with 0s to make a full byte. As noted in the FIXME there's a sort of agreement that the any extra bits stored will be 0. The generated code is pretty terrible. I was really hoping we could use a tail undisturbed trick, but tail undisturbed no longer applies to masked destinations in the current draft spec. Fingers crossed that it isn't common to do this. I doubt IR from clang or the vectorizer would ever create this kind of store. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D100618	2021-04-19 11:05:18 -07:00
Arthur Eubanks	5561b48b70	[test] Make global in split-gep-and-gvn.ll not constant An upcoming change will cause loads from a constant zeroinitializer global to be constant folded, breaking this test.	2021-04-19 11:03:19 -07:00
Fangrui Song	03769d9308	[lld] Delete unused includes. NFC	2021-04-19 10:56:49 -07:00
Jessica Paquette	65f257a215	[AArch64][GlobalISel] Implement custom legalization for s32 and s64 G_CTPOP This is a partial port of AArch64TargetLowering::LowerCTPOP. This custom lowering tries to uses NEON instructions to give a more efficient CTPOP lowering when possible. In the non-NEON/noimplicitfloat case, this should use the generic lowering (see: https://godbolt.org/z/GcaPvWe4x). I think that's worth implementing after implementing the widening code for s16/s8 though. Differential Revision: https://reviews.llvm.org/D100399	2021-04-19 10:56:02 -07:00
Nick Desaulniers	c440b97d89	[TargetLowering] move "o" and "X" constraint handling to base class These constraints are machine agnostic; there's no reason to handle these per-arch. If arches don't support these constraints, then they will fail elsewhere during instruction selection. We don't need virtual calls to look these up; TargetLowering::getInlineAsmMemConstraint should only be overridden by architectures with additional unique memory constraints. Reviewed By: echristo, MaskRay Differential Revision: https://reviews.llvm.org/D100416	2021-04-19 10:53:31 -07:00
Jessica Paquette	91bbb914e0	[AArch64][GlobalISel] Regbankselect + select @llvm.aarch64.neon.uaddlv It turns out we actually import a bunch of selection code for intrinsics. The imported code checks that the register banks on the G_INTRINSIC instruction are correct. If so, it goes ahead and selects it. This adds code to AArch64RegisterBankInfo to allow us to correctly determine register banks on intrinsics which have known register bank constraints. For now, this only handles @llvm.aarch64.neon.uaddlv. This is necessary for porting AArch64TargetLowering::LowerCTPOP. Also add a utility for getting the intrinsic ID from a G_INTRINSIC instruction. This seems a little nicer than having to know about how intrinsic instructions are structured. Differential Revision: https://reviews.llvm.org/D100398	2021-04-19 10:47:49 -07:00
Jonas Devlieghere	2cbd3b04fe	[lldb] Support "absolute memory address" images in crashlog.py The binary image list contains the following entry when a frame is not found in any know binary image: { "size" : 0, "source" : "A", "base" : 0, "uuid" : "00000000-0000-0000-0000-000000000000" } Note that this object is missing the name and path keys. This patch makes the JSON parser resilient against their absence.	2021-04-19 10:27:11 -07:00
Sanjay Patel	9d43f6d7ce	[LowerConstantIntrinsics] avoid crashing on alloca with unexpected operand type The test here is reduced from the fuzzer-generated crasher in: https://llvm.org/PR50023 https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=33395 I don't know if this is the best or complete solution, but the zext of the i42 type appears to match the behavior if we run a weird type example like this through the IR optimizer with -O1. Differential Revision: https://reviews.llvm.org/D100766	2021-04-19 13:06:29 -04:00
Nico Weber	0871ce3547	fix comment typo to cycle bots	2021-04-19 12:59:20 -04:00
Wael Yehia	369c0e0f48	[AIX] Diagnose thinLTO usage in clang on AIX. Reviewed By: Xiangling Liao Differential Revision: https://reviews.llvm.org/D100350	2021-04-19 16:39:48 +00:00
Jan Svoboda	6a72ed239c	[clang] NFC: Fix range-based for loop warnings related to decl lookup	2021-04-19 18:31:31 +02:00
Roman Lebedev	2aff4f7f57	[polly] Fix check-polly after SCEVExpander PtrToInt fixes	2021-04-19 19:10:55 +03:00
Roman Lebedev	d746fefb6f	[SCEVExpander] ReuseOrCreateCast(): use IRBuilder to actually create the cast In particular, this allows to create constant expressions instead of IR Instruction's if the argumen is a constant.	2021-04-19 18:38:39 +03:00
Roman Lebedev	ecc9d7e913	[SCEVExpander] Expand explicit PtrToInt casts just like we would implicit ones I.e., use GetOptimalInsertionPointForCastOf() helper to get the insertion point, and try to reuse casts first.	2021-04-19 18:38:39 +03:00
Roman Lebedev	442c408e0e	[SCEVExpander] GetOptimalInsertionPointForCastOf(): gracefully handle Constant's I guess this case hasn't come up thus far, and i'm not sure if it can really happen for the existing usages, thus no test in this commit. But, the following commit adds test coverage, there we'd expirience a crash without this fix.	2021-04-19 18:38:39 +03:00
Roman Lebedev	b8a3705896	[NFCI][SCEVExpander] Extract GetOptimalInsertionPointForCastOf() helper	2021-04-19 18:38:38 +03:00

1 2 3 4 5 ...

385910 Commits All Branches Search

385910 Commits

All Branches