llvm-project

Commit Graph

Author	SHA1	Message	Date
David Green	7d9af03ff7	[Scheduling][ARM] Consistently enable PostRA Machine scheduling In the ARM backend, for historical reasons we have only some targets using Machine Scheduling. The rest use the old list scheduler as they are using itinaries and the list scheduler seems to produce better code (and not crash running out of register on v6m codes). So whether to use the MIScheduler or not is checked at runtime from the subtarget features. This is fine, except for post-ra scheduling. Whether to use the old post-ra list scheduler or the post-ra machine schedule is decided as the pass manager is set up, in arms case from a newly constructed subtarget. Under some situations, like LTO, this won't include the correct cpu so can pick the wrong option. This can have a surprising effect on performance. To fix that, this patch overrides targetSchedulesPostRAScheduling and addPreSched2 in the ARM backend, adding _both_ post-ra schedulers and picking at runtime which to execute. To pick between the two I've had to add a enablePostRAMachineScheduler() method that normally returns enableMachineScheduler() && enablePostRAScheduler(), which can be overridden to enable just one of PostRAMachineScheduler vs PostRAScheduler. Thanks to David Penry for the identifying this problem. Differential Revision: https://reviews.llvm.org/D69775	2019-11-05 10:44:55 +00:00
Pavel Labath	f71e35dc1f	lldb/breakpad: add suppport for the "x86_64h" architecture	2019-11-05 11:41:20 +01:00
serge-sans-paille	9357b5d084	Revert and patch "[Python] Remove readline module" Fix https://bugs.llvm.org/show_bug.cgi?id=43830 while avoiding polluting the global Python namespace. This both reverts r357277 to rebundle a version of Python's readline module based on libedit. However, this patch also provides two improvements over the previous implementation: 1. use PyMem_RawMalloc instead of PyMem_Malloc, as expected by PyOS_Readline (prevents to segfault upon exit of interactive session) 2. patch the readline module upon embedded interpreter loading, instead of patching it globally, which should prevent any side effect on other modules/packages 3. only activate the patched module if libedit is actually linked in lldb Differential Revision: https://reviews.llvm.org/D69793	2019-11-05 11:39:19 +01:00
Sven van Haastregt	0e56b0f94b	[OpenCL] Group builtin functions by prototype The TableGen-generated file containing the function definitions can be reorganized to save some memory in the Clang binary. Functions having the same prototype(s) will point to a shared list of prototype(s). Patch by Pierre Gondois and Sven van Haastregt. Differential Revision: https://reviews.llvm.org/D63557	2019-11-05 10:26:47 +00:00
Sven van Haastregt	9a8d477a0e	[OpenCL] Add builtin function attribute handling Add handling for the "pure", "const" and "convergent" function attributes for OpenCL builtin functions. Patch by Pierre Gondois and Sven van Haastregt. Differential Revision: https://reviews.llvm.org/D64319	2019-11-05 10:26:47 +00:00
Pavel Labath	4ecff91ed1	lldb/minidump: Add support for the alternate ARM64 constant	2019-11-05 11:26:06 +01:00
Pavel Labath	28cf9698ab	MemoryRegion: Print "don't know" permission values as such Summary: The permissions in a memory region have ternary states (yes, no, don't know), but the memory region command only prints in binary, treating "don't know" as "yes", which is particularly confusing as for instance the unwinder will treat an unknown value as "no". This patch makes is so that we distinguish all three states when printing the values, using "?" to indicate the lack of information. It is implemented via a special argument to the format provider for the OptionalBool enumeration. Reviewers: clayborg, jingham Subscribers: lldb-commits Differential Revision: https://reviews.llvm.org/D69106	2019-11-05 11:17:27 +01:00
Roman Lebedev	12c4a71ca9	[LoopUnroll] peel-loop-conditions.ll: add some 'is even/odd' peeling tests	2019-11-05 13:02:57 +03:00
Roman Lebedev	ccf1a5f4bb	[InstCombine] dropRedundantMaskingOfLeftShiftInput(): truncation (PR42563) Summary: That fold keeps growing and growing :( I think this may be one of the last pieces for it. Since D67677/D67725, the fold knowns the general form of the pattern - where some masking is needed: https://rise4fun.com/Alive/F5R https://rise4fun.com/Alive/gslRa But there is one more huge piece missing - if you are extracting some bits, it is not impossible that the origin is wider than the extraction, i.e. there may be a truncation. And we don't deal with that yet. But we can, and the generalization remains fully identical: https://rise4fun.com/Alive/Uar https://rise4fun.com/Alive/5SW After a preparatory cleanup i think the diff looks rather clean. One missing piece is that in some patterns (especially pat. b), `-1` only needs to be `-1` in final type, but that is for later.. https://bugs.llvm.org/show_bug.cgi?id=42563 Reviewers: spatel, nikic Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69125	2019-11-05 12:41:26 +03:00
Luís Marques	0d47c7aba3	[RISCV] Add InstrInfo areMemAccessesTriviallyDisjoint hook Summary: Introduces the `InstrInfo::areMemAccessesTriviallyDisjoint` hook. The test could check for instruction reorderings, but to avoid being brittle it just checks instruction dependencies. Reviewers: asb, lenary Reviewed By: lenary Tags: #llvm Differential Revision: https://reviews.llvm.org/D67046	2019-11-05 09:39:06 +00:00
Pavel Labath	b4c5b8f3f5	DWARFDebugLoclists: Make it possible to read relocated addresses Summary: Handling relocations was not needed when the loclists section was a DWO-only thing. But since DWARF5, it is possible to use it in regular objects too, and the standard permits embedding addresses into the section directly. These addresses need to be relocated in unlinked files. Reviewers: JDevlieghere, dblaikie, probinson Subscribers: aprantl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68271	2019-11-05 10:21:39 +01:00
Simon Atanasyan	0d14656b9d	[mips] Set __OCTEON__ macros	2019-11-05 12:10:58 +03:00
Simon Atanasyan	e578d0fd29	[mips] Fix `__mips_isa_rev` macros value for Octeon CPU	2019-11-05 12:10:58 +03:00
Sjoerd Meijer	92164cf25d	Recommit "[HardwareLoops] Optimisation remarks" With a few things fixed: - initialisaiton of the optimisation remark pass (this was causing the buildbot failures on PPC), - a test case. Differential Revision: https://reviews.llvm.org/D69660	2019-11-05 09:06:22 +00:00
David Green	edfb8eea57	[AArch64] Update test checks on merge-store-dependency.ll. NFC	2019-11-05 09:01:47 +00:00
Raphael Isemann	db5074dc10	[lldb][NFC] Give some parameters in CommandInterpreter more descriptive names	2019-11-05 09:21:10 +01:00
aqjune	92ef101da9	[IR] Remove switch's default block that causes clang 8 raise error	2019-11-05 16:31:51 +09:00
Craig Topper	103968d147	[X86] Lower the cost of avx512 horizontal bool and/or reductions to 2*log2(bitwidth)+1 for legal types. This better represents the kshift+binop we'd get for each stage before the final extract. Its likely we'll do even better by doing a kmov and a cmp with a GPR, but this is a good start. The default handling was costing a worst case single source permute shuffle of the vector before the binop. This worst case assumes the shuffle might have to be emulated with extracts and inserts. But since we know we're doing a reduction we can assume we'll get kshift lowering. There's still some room for improvement here, but this is much better than it was.	2019-11-04 22:58:04 -08:00
aqjune	58acbce3de	[IR] Add Freeze instruction Summary: - Define Instruction::Freeze, let it be UnaryOperator - Add support for freeze to LLLexer/LLParser/BitcodeReader/BitcodeWriter The format is `%x = freeze <ty> %v` - Add support for freeze instruction to llvm-c interface. - Add m_Freeze in PatternMatch. - Erase freeze when lowering IR to SelDag. Reviewers: deadalnix, hfinkel, efriedma, lebedev.ri, nlopes, jdoerfert, regehr, filcab, delcypher, whitequark Reviewed By: lebedev.ri, jdoerfert Subscribers: jfb, kristof.beyls, hiraditya, lebedev.ri, steven_wu, dexonsmith, xbolva00, delcypher, spatel, regehr, trentxintong, vsk, filcab, nlopes, mehdi_amini, deadalnix, llvm-commits Differential Revision: https://reviews.llvm.org/D29011	2019-11-05 15:54:56 +09:00
Yonghong Song	9f34447f3f	[BPF] fix a use after free bug Commit `fff2721286` ("[BPF] Fix CO-RE bugs with bitfields") fixed CO-RE handling bitfield issues. But the implementation introduced a use after free bug. The "Base" of the intrinsic might be freed so later on accessing the Type of "Base" might access the freed memory. The failed test case, CodeGen/BPF/CORE/offset-reloc-middle-chain.ll is exactly used to test such a case. Similarly to previous attempt to remember Metadata etc, remember "Base" pointee Alignment in advance to avoid such use after free bug.	2019-11-04 22:20:23 -08:00
Craig Topper	f65493a83e	[X86] Teach X86MCInstLower to swap operands of commutable instructions to enable 2-byte VEX encoding. Summary: The 2 source operands commutable instructions are encoded in the VEX.VVVV field and the r/m field of the MODRM byte plus the VEX.B field. The VEX.B field is missing from the 2-byte VEX encoding. If the VEX.VVVV source is 0-7 and the other register is 8-15 we can swap them to avoid needing the VEX.B field. This works as long as the VEX.W, VEX.mmmmm, and VEX.X fields are also not needed. Fixes PR36706. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68550	2019-11-04 22:07:46 -08:00
Devin Coughlin	abc04ff401	[analyzer] Require darwin for scan-build tests Let's at least get some coverage from these tests. We can generalize to other platforms later.	2019-11-04 21:17:55 -08:00
Devin Coughlin	48223d92a9	[analyzer] Fixup scan-build tests for non-Darwin platforms. This is a fix to `0aba69eb1a` to address failing bots.	2019-11-04 21:12:11 -08:00
aqjune	31be9f3f7d	Fix clone_constant_impl to correctly deal with null pointers Summary: This patch resolves llvm-c-test's following error ``` LLVM ERROR: LLVMGetValueKind returned incorrect type ``` which arises when the input bitcode contains a null pointer. Reviewers: jdoerfert, CodaFi, deadalnix Reviewed By: jdoerfert Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68928	2019-11-05 13:53:52 +09:00
Devin Coughlin	0aba69eb1a	[analyzer] Add test directory for scan-build. The static analyzer's scan-build script is critical infrastructure but is not well tested. To start to address this, add a new test directory under tests/Analysis for scan-build lit tests and seed it with several tests. The goal is that future scan-build changes will be accompanied by corresponding tests. Differential Revision: https://reviews.llvm.org/D69781	2019-11-04 20:26:35 -08:00
Yaxun (Sam) Liu	4264e7bbfd	[CUDA][HIP] Disable emitting llvm.linker.options in device compilation The linker options (e.g. pragma detect_mismatch) are intended for host compilation only, therefore disable it for device compilation. Differential Revision: https://reviews.llvm.org/D57829	2019-11-04 23:21:39 -05:00
Yonghong Song	fff2721286	[BPF] Fix CO-RE bugs with bitfields bitfield handling is not robust with current implementation. I have seen two issues as described below. Issue 1: struct s { long long f1; char f2; char b1:1; } *p; The current approach will generate an access bit size 56 (from b1 to the end of structure) which will be rejected as it is not power of 2. Issue 2: struct s { char f1; char b1:3; char b2:5; char b3:6: char b4:2; char f2; }; The LLVM will group 4 bitfields together with 2 bytes. But loading 2 bytes is not correct as it violates alignment requirement. Note that sometimes, LLVM breaks a large bitfield groups into multiple groups, but not in this case. To resolve the above two issues, this patch takes a different approach. The alignment for the structure is used to construct the offset of the bitfield access. The bitfield incurred memory access is an aligned memory access with alignment/size equal to the alignment of the structure. This also simplified the code. This may not be the optimal memory access in terms of memory access width. But this should be okay since extracting the bitfield value will have the same amount of work regardless of what kind of memory access width. Differential Revision: https://reviews.llvm.org/D69837	2019-11-04 20:08:05 -08:00
Jorg Brown	586952f4ce	Optimize std::midpoint for integers Same idea as the current algorithm, that is, add (half of the difference between a and b) to a. But we use a different technique for computing the difference: we compute b - a into a pair of integers that are named "sign_bit" and "diff". We have to use a pair because subtracting two 32-bit integers produces a 33-bit result. Computing half of that is a simple matter of shifting diff right by 1, and adding sign_bit shifted left by 31. llvm knows how to do that with one instruction: shld. The only tricky part is that if the difference is odd and negative, then shifting it by one isn't the same as dividing it by two - shifting a negative one produces a negative one, for example. So there's one more adjustment: if the sign bit and the low bit of diff are one, we add one. For a demonstration of the codegen difference, see https://godbolt.org/z/7ar3K9 , which also has a built-in test. Differential Revision: https://reviews.llvm.org/D69459	2019-11-04 19:00:23 -08:00
Vedant Kumar	610f80f7ba	[cmake] Add an option to skip stripping before install The swift build system has support for cross-compiling, installing, and generating symbols for lldb. As the swift symbol-generation step occurs after installation, we need to disable stripping during the install.	2019-11-04 17:38:13 -08:00
Saleem Abdulrasool	6db7a5cd7c	build: explicitly set the linker language for unwind The unwinder should not depend on libc++. In fact, we do not end up with a link against libc++ as we do not have a dependency on libc++ at runtime. This ensures that we link with `clang` rather than `clang++`.	2019-11-04 16:55:31 -08:00
Vedant Kumar	a5c8ec4baa	[CGDebugInfo] Emit subprograms for decls when AT_tail_call is understood Currently, clang emits subprograms for declared functions when the target debugger or DWARF standard is known to support entry values (DW_OP_entry_value & the GNU equivalent). Treat DW_AT_tail_call the same way to allow debuggers to follow cross-TU tail calls. Pre-patch debug session with a cross-TU tail call: ``` * frame #0: 0x0000000100000fa4 main`target at b.c:4:3 [opt] frame #1: 0x0000000100000f99 main`main at a.c:8:10 [opt] ``` Post-patch (note that the tail-calling frame, "helper", is visible): ``` * frame #0: 0x0000000100000fa4 main`target at b.c:4:3 [opt] frame #1: 0x0000000100000f80 main`helper [opt] [artificial] frame #2: 0x0000000100000f99 main`main at a.c:8:10 [opt] ``` rdar://46577651 Differential Revision: https://reviews.llvm.org/D69743	2019-11-04 15:14:24 -08:00
Ron Lieberman	dc34b1c94d	Test commit: adds a . to comment. NFC	2019-11-04 16:51:03 -06:00
Evandro Menezes	4cbe10efc2	[AArch64] Update for Exynos Fix the costs of integer division.	2019-11-04 16:21:28 -06:00
Sam Clegg	1cce82eae8	Add more binutils tools to LLVM_INSTALL_TOOLCHAIN_ONLY target Also add the aliases for these tools so that LLVM_INSTALL_BINUTILS_SYMLINKS and LLVM_INSTALL_TOOLCHAIN_ONLY can work together. Differential Revision: https://reviews.llvm.org/D69635	2019-11-04 14:11:23 -08:00
Mark de Wever	403739b2fd	[AST][NFC] Fixes a comment typo Also a test for commit access.	2019-11-04 22:32:56 +01:00
Stanislav Mekhanoshin	1bfcc60828	[AMDGPU] Added assert in SIFoldOperands before ptr use. NFC.	2019-11-04 13:31:21 -08:00
Alexey Bataev	3eecd601ed	[OPENMP][DOCS]Update list of implemented features, NFC.	2019-11-04 16:29:26 -05:00
James Y Knight	d11a9018b7	Add release notes for commit `ccc4d83cda`. (Which was "[ObjC] Diagnose implicit type coercion from ObjC 'Class' to object pointer types.")	2019-11-04 16:26:53 -05:00
Alexey Bataev	8bbf2e3716	[OPENMP50]Support for imperfectly nested loops. Added support for imperfectly nested loops introduced in OpenMP 5.0.	2019-11-04 16:09:25 -05:00
Lawrence D'Anna	adbf64ccc9	[LLDB][Python] remove ArgInfo::count Summary: This patch updates the last user of ArgInfo::count and deletes it. I also delete `GetNumInitArguments()` and `GetInitArgInfo()`. Classess are callables and `GetArgInfo()` should work on them. On python 3 it already works, of course. `inspect` is good. On python 2 we have to add yet another special case. But hey if python 2 wasn't crufty we wouln't need python 3. I also delete `is_bound_method` becuase it is unused. This path is tested in `TestStepScripted.py` Reviewers: labath, mgorny, JDevlieghere Reviewed By: labath, JDevlieghere Subscribers: lldb-commits Tags: #lldb Differential Revision: https://reviews.llvm.org/D69742	2019-11-04 12:48:49 -08:00
Stanislav Mekhanoshin	4312c4afd4	[AMDGPU] deduplicate tablegen predicates We are duplicating predicates if several parts of the combined predicate list contain the same condition. Added code to deduplicate the list. We have AssemblerPredicates and AssemblerPredicate in the PredicateControl, but we never use AssemblerPredicates with an actual list, so this one is dropped. This addresses the first part of the llvm bug 43886: https://bugs.llvm.org/show_bug.cgi?id=43886 Differential Revision: https://reviews.llvm.org/D69815	2019-11-04 12:19:17 -08:00
Erik Pilkington	af11f417fc	[demangle] NFC: get rid of NodeOrString This class was a bit overengineered, and was triggering some PVS warnings. Instead, put strings into a NameType and let clients unconditionally treat it as a Node.	2019-11-04 12:17:12 -08:00
Alexandre Ganea	efad56b2be	Remove unused variables, as suggested by @mcgov. Fixes warning: unused variable 'XXX' [-Wunused-const-variable]	2019-11-04 14:55:51 -05:00
Alexandre Ganea	9cc3ebf8b7	Fix warning: format specifies type 'unsigned long' but the argument has type 'unsigned long long' [-Wformat]	2019-11-04 14:42:07 -05:00
Duncan P. N. Exon Smith	8112a423a8	clang/Modules: Bring back optimization lost in `31e14f41a2` `31e14f41a2` accidentally dropped caching of failed module loads. This brings it back by making ModuleMap::getCachedModuleLoad return an Optional.	2019-11-04 11:40:03 -08:00
Craig Topper	b2b6a54f84	[X86] Add support for -mvzeroupper and -mno-vzeroupper to match gcc -mvzeroupper will force the vzeroupper insertion pass to run on CPUs that normally wouldn't. -mno-vzeroupper disables it on CPUs where it normally runs. To support this with the default feature handling in clang, we need a vzeroupper feature flag in X86.td. Since this flag has the opposite polarity of the fast-partial-ymm-or-zmm-write we used to use to disable the pass, we now need to add this new flag to every CPU except KNL/KNM and BTVER2 to keep identical behavior. Remove -fast-partial-ymm-or-zmm-write which is no longer used. Differential Revision: https://reviews.llvm.org/D69786	2019-11-04 11:03:54 -08:00
Philip Reames	6ff439b57f	[SimplifyCFG] Use a (trivially) dominanting widenable branch to remove later slow path blocks This transformation is a variation on the GuardWidening transformation we have checked in as it's own pass. Instead of focusing on merge (i.e. hoisting and simplifying) two widenable branches, this transform makes the observation that simply removing a second slowpath block (by reusing an existing one) is often a very useful canonicalization. This may lead to later merging, or may not. This is a useful generalization when the intermediate block has loads whose dereferenceability is hard to establish. As noted in the patch, this can be generalized further, and will be. Differential Revision: https://reviews.llvm.org/D69689	2019-11-04 11:03:28 -08:00
Sanjay Patel	113181e9bd	[DAGCombine][MSP430] use shift amount threshold in DAGCombine (2/2) Continuation of: D69116 Contributes to a fix for PR43559: https://bugs.llvm.org/show_bug.cgi?id=43559 See also D69099 and D69116 Use the TLI hook in DAGCombine.cpp to guard against creating shift nodes that are not optimal for a target. Patch by: @joanlluch (Joan LLuch) Differential Revision: https://reviews.llvm.org/D69120	2019-11-04 13:41:41 -05:00
Michał Górny	6eca4f4691	[lldb] [Process/NetBSD] Add register info for missing register sets Add info for all register sets supported in NetBSD, particularly for all registers 'expected' by LLDB. This is necessary in order to fix python_api/lldbutil/iter/TestRegistersIterator.py test that currently fails due to missing names of register sets (None). This copies fpreg descriptions from Linux, and combines Linux' AVX and MPX registers into a single XState group, to fit NetBSD register group design. Technically, we do not support MPX registers at the moment but gdb-remote insists on passing their errors anyway, and if we do not include it in any group, they end up in a separate anonymous group that breaks the test. While at it, swap the enums for XState and DBRegs to match register set ordering. This also adds a few consts to the lldb-x86-register-enums.h to provide more consistency between user registers and debug registers. Differential Revision: https://reviews.llvm.org/D69667	2019-11-04 19:36:58 +01:00
Julian Lettner	bd14bb42f0	[lit] Move measurement of testing time out of Run.execute	2019-11-04 10:16:24 -08:00

1 2 3 4 5 ...

331101 Commits All Branches Search

331101 Commits

All Branches