llvm-project

Commit Graph

Author	SHA1	Message	Date
Jameson Nash	59a6ab28c4	[GVN] small improvements to comments	2020-11-03 13:21:48 -05:00
Jonas Devlieghere	16dd69347d	[crashlog] Modularize parser Instead of parsing the crashlog in one big loop, use methods that correspond to the different parsing modes. Differential revision: https://reviews.llvm.org/D90665	2020-11-03 10:21:21 -08:00
Simon Pilgrim	6dabc38cce	Cleanup namespace comment to fix clang-tidy warning. NFCI.	2020-11-03 18:13:21 +00:00
Simon Pilgrim	e9b88c754a	[DAG] computeKnownBits - Move ISD::SRA handling into KnownBits::ashr As discussed on D90527, we should be trying to move shift handling functionality into KnownBits to avoid code duplication in SelectionDAG/GlobalISel/ValueTracking.	2020-11-03 18:09:33 +00:00
Craig Topper	00eff96e1d	[RISCV] Add missing patterns for rotr with immediate for Zbb/Zbp extensions. DAGCombine doesn't canonicalize rotl/rotr with immediate so we need patterns for both. Remove the custom matcher for rotl to RORI and just use a SDNodeXForm to convert the immediate instead. Doing this gives priority to the rev32/rev16 versions of grevi over rori since an explicit immediate is more precise than any immediate. I also added rotr patterns for rev32/rev16. And removed the (or (shl), (shr)) patterns that should be combined to rotl by DAG combine. There is at least one other grev pattern that probably needs a another rotr pattern, but we need more test coverage first. Differential Revision: https://reviews.llvm.org/D90575	2020-11-03 10:04:52 -08:00
Simon Pilgrim	cb798f040a	[DAG] computeKnownBits - Move (most) ISD::SRL handling into KnownBits::lshr As discussed on D90527, we should be be trying to move shift handling functionality into KnownBits to avoid code duplication in SelectionDAG/GlobalISel/ValueTracking. The refactor to use the KnownBits fixed/min/max constant helpers allows us to hit a couple of cases that we were missing before. We still need the getValidMinimumShiftAmountConstant case as KnownBits doesn't handle per-element vector cases.	2020-11-03 17:30:36 +00:00
Simon Pilgrim	c06c02bd1f	[AMDGPU] Regenerate load i16 tests to use update_llc_test_checks.py script. NFCI. Necessary for upcoming KnownBits::lshr support.	2020-11-03 17:30:36 +00:00
Louis Dionne	d9a4f936d0	[libc++] Move <memory> helpers outside of std::allocator_traits They don't really belong as members of allocator_traits.	2020-11-03 12:27:26 -05:00
Jonas Devlieghere	4b84682044	[crashlog] Move crash log parsing into its own class Move crash log parsing out of the CrashLog class and into its own class and add more tests. Differential revision: https://reviews.llvm.org/D90664	2020-11-03 09:04:35 -08:00
Nico Weber	af9bf14e6b	Make test/tools/llvm-dlltool/tool-name.test pass, and make it run The test hasn't run since it was added in D71302.	2020-11-03 11:59:15 -05:00
Tony	45bcbe46d7	[NFC][AMDGPU] Minor editorial improvements to AMDGPUUsage.rst Differential Revision: https://reviews.llvm.org/D90661	2020-11-03 16:56:01 +00:00
etiotto	e1af54296c	[compiler-rt][profile][AIX]: Enable compiler-rt profile build on AIX This patch adds support for building the compiler-rt profile library on AIX. Reviewed by: phosek Differential Revision: https://reviews.llvm.org/D90619	2020-11-03 11:46:21 -05:00
Esme-Yi	5053eab890	Revert "[PowerPC] Extend folding RLWINM + RLWINM to post-RA." This reverts commit `119ab2181e`.	2020-11-03 16:34:02 +00:00
Tim Renouf	89d41f3a2b	[AMDGPU] Add gfx1033 target Differential Revision: https://reviews.llvm.org/D90447 Change-Id: If2650fc7f31bbdd49c76e74a9ca8e3734d769761	2020-11-03 16:27:48 +00:00
Tim Renouf	ee3e642627	[AMDGPU] Add gfx90c target This differentiates the Ryzen 4000/4300/4500/4700 series APUs that were previously included in gfx909. Differential Revision: https://reviews.llvm.org/D90419 Change-Id: Ia901a7157eb2f73ccd9f25dbacec38427312377d	2020-11-03 16:27:43 +00:00
Michał Górny	fbc0d41bb0	[lldb] [Process/FreeBSDRemote] Fix "Fix attaching via lldb-server" One of the changes seems to have been lost in rebase. Reapply.	2020-11-03 17:16:57 +01:00
Valentin Clement	6c337945c8	[openmp][openacc][NFC] Simplify access and validation of DirectiveBase information This patch adds some helper in the DirectiveLanguage wrapper to initialize it from the RecordKeeper and validate the records. This simplify arguments in lots of function since only the DirectiveLanguge is passed. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D90358	2020-11-03 11:13:06 -05:00
Sanjay Patel	3c050a597c	[CostModel] fix cost calc bug for sadd/ssub with overflow As noted in D90554, there's an opcode typo in using an easily misused cost model API: getCmpSelInstrCost(). Beyond that, the assumed sequence of ops is questionable, but that would be another patch. My guess is that the x86 test diffs show that we are probably wrong both before and after this change, so there will be no practical difference. As an example, I tried this test which shows a cost of '7' either way: define <4 x i32> @sadd(<4 x i32> %va, <4 x i32> %vb) { %V4I32 = call {<4 x i32>, <4 x i1>} @llvm.sadd.with.overflow.v4i32(<4 x i32> %va, <4 x i32> %vb) %ov = extractvalue {<4 x i32>, <4 x i1>} %V4I32, 1 %r = extractvalue {<4 x i32>, <4 x i1>} %V4I32, 0 %z = select <4 x i1> %ov, <4 x i32> <i32 42, i32 42, i32 42, i32 42>, <4 x i32> %r ret <4 x i32> %z } $ llc -o - sadd.ll -mattr=avx vpaddd %xmm1, %xmm0, %xmm2 vpcmpgtd %xmm2, %xmm0, %xmm0 vpxor %xmm0, %xmm1, %xmm0 vblendvps %xmm0, LCPI0_0(%rip), %xmm2, %xmm0a Differential Revision: https://reviews.llvm.org/D90681	2020-11-03 11:03:47 -05:00
Hans Wennborg	1d3cd7172b	Fix GCC error: specialization of 'template<class LeafTy> struct llvm::LinearPolyBaseTypeTraits' in different namespace	2020-11-03 16:55:32 +01:00
Joachim Protze	eaed9e6b56	[OpenMP][Tools] clang-format Archer (NFC)	2020-11-03 16:32:02 +01:00
Joe Ellis	cf637a69e7	[SVE][InstCombine] Improve specificity of InstCombine TypeSize test The test was using -O2, where -instcombine will suffice. Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D90684	2020-11-03 15:24:44 +00:00
Jay Foad	040c50278c	[AMDGPU] Fix ds_read2/write2 with unaligned offsets These instructions use a scaled offset. We were wrongly selecting them even when the required offset was not a multiple of the scale factor. Differential Revision: https://reviews.llvm.org/D90607	2020-11-03 15:16:10 +00:00
Martin Storsjö	1127ef789c	[libcxx] Error out if __libcpp_mbsrtowcs_l fails in __time_get_storage If __libcpp_mbsrtowcs_l outputs zero wchar_t's for week days or month names (due to errors in the locale function setup), these are matched all the time in __time_get_storage::__analyze, ending up in an infinite loop, allocating more memory until killed. Differential Revision: https://reviews.llvm.org/D69553	2020-11-03 17:15:05 +02:00
Martin Storsjö	8a73aa8c4c	[libcxx] [libcxxabi] Set flags for visibility when statically linking libcxxabi into libcxx for windows Previously, these had to be set manually when building each of the projects standalone, in order to get proper symbol visibility when combining the two libraries. Differential Revision: https://reviews.llvm.org/D90021	2020-11-03 17:13:48 +02:00
Pavel Labath	d2700b7873	[lldb/Utility] Add unit tests for RegisterValue::GetScalarValue Buggy cases are commented out. Also sneak in a modernization of a RegisterValue constructor.	2020-11-03 16:12:32 +01:00
Mircea Trofin	34b0a99cce	[Docs][FileCheck] Small fix.	2020-11-03 07:08:51 -08:00
Jameson Nash	a0ad066ce4	make the AsmPrinterHandler array public This lets external consumers customize the output, similar to how AssemblyAnnotationWriter lets the caller define callbacks when printing IR. The array of handlers already existed, this just cleans up the code so that it can be exposed publically. Replaces https://reviews.llvm.org/D74158 Differential Revision: https://reviews.llvm.org/D89613	2020-11-03 10:02:09 -05:00
Nathan James	97e8da45f9	[ADT] Add SmallVector::pop_back_n Adds a method called pop_back_n to SmallVector. This is more readable and less error prone than the alternatives of using ```lang=c++ Vector.resize(Vector.size() - N); Vector.erase(Vector.end() - N, Vector.end()); for (unsigned I = 0;I<N;++I) Vector.pop_back(); ``` Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D90576	2020-11-03 14:57:10 +00:00
Anton Afanasyev	e8d67ef2dc	[SLP][X86][Test] Extend test coverage for PR47629 Add two cases for `<i32 x 8>`. Precommit for PR47629 and D90445. NFC	2020-11-03 17:51:24 +03:00
Jay Foad	6e008cb554	[AMDGPU] Precommit globalisel tests for ds_read2_b64 with large offset	2020-11-03 14:38:56 +00:00
Nathan James	b091af790f	[ASTMatchers] Made isExpandedFromMacro Polymorphic Made the isExpandedFromMacro matcher work on Stmt's, TypeLocs and Decls in line with the other macro expansion matchers. Also tweaked it to take a `std::string` instead of a `StringRef`. This prevents potential use-after-free bugs if the matcher is created with a string thats destroyed before the matcher finishes matching. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D90303	2020-11-03 14:36:51 +00:00
Simon Pilgrim	cab21d4fa8	[DAG] computeKnownBits - Move (most) ISD::SHL handling into KnownBits::shl As discussed on D90527, we should be be trying to move shift handling functionality into KnownBits to avoid code duplication in SelectionDAG/GlobalISel/ValueTracking. The refactor to use the KnownBits fixed/min/max constant helpers allows us to hit a couple of cases that we were missing before. We still need the getValidMinimumShiftAmountConstant case as KnownBits doesn't handle per-element vector cases.	2020-11-03 14:22:28 +00:00
LLVM GN Syncbot	a5bbefe303	[gn build] Port `1667d23e58`	2020-11-03 13:58:51 +00:00
Nico Weber	47c95f1710	[gn build] (manually) port `1af3cb5424`	2020-11-03 08:58:23 -05:00
Jay Foad	32897c05ab	[AMDGPU] Specify a triple to avoid codegen changes depending on host OS	2020-11-03 13:33:44 +00:00
Lei Zhang	d5bf727bcd	[mlir][spirv] Support for a few more decorations in (de)serialization Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D90655	2020-11-03 08:11:19 -05:00
Sanjay Patel	9af561ec99	[x86] update cost table comments for maxnum; NFC Follow-up suggested in D90613.	2020-11-03 08:09:59 -05:00
Yaxun (Sam) Liu	abd8cd9199	[CUDA][HIP] Fix linkage for -fgpu-rdc Currently for explicit template function instantiation in CUDA/HIP device compilation clang emits instantiated kernel with external linkage and instantiated device function with internal linkage. This is fine for -fno-gpu-rdc since there is only one TU. However this causes duplicate symbols for kernels for -fgpu-rdc if the same instantiation happen in multiple TU. Or missing symbols if a device function calls an explicitly instantiated template function in a different TU. To make explicit template function instantiation work for -fgpu-rdc we need to follow the C++ linkage paradigm, i.e. use weak_odr linkage. Differential Revision: https://reviews.llvm.org/D90311	2020-11-03 08:07:19 -05:00
Roman Lebedev	c009d11bda	[InstCombine] Perform C-(X+C2) --> (C-C2)-X transform before using Negator In particular, it makes it fire for C=0, because negator doesn't want to perform that fold since in general it's not beneficial.	2020-11-03 16:06:52 +03:00
Roman Lebedev	e465f9c303	[InstCombine] Negator: - (C - %x) --> %x - C (PR47997) This relaxes one-use restriction on that `sub` fold, since apparently the addition of Negator broke preexisting `C-(C2-X) --> X+(C-C2)` (with C=0) fold.	2020-11-03 16:06:51 +03:00
Roman Lebedev	f8cf6d027b	[NFC][InstCombine] Negator: add test coverage for `(?? - (%y + C))` pattern (PR47997)	2020-11-03 16:06:51 +03:00
Roman Lebedev	67be050acc	[NFC][InstCombine] Negator: add test coverage for `(?? - (C - %y))` pattern (PR47997)	2020-11-03 16:06:51 +03:00
Roman Lebedev	482d65331b	[NFC][InstCombine] Add test coverage for PR47997	2020-11-03 16:06:50 +03:00
Florian Hahn	d68bed0fa9	[SCCP] Handle bitcast of vector constants. Vectors where all elements have the same known constant range are treated as a single constant range in the lattice. When bitcasting such vectors, there is a mis-match between the width of the lattice value (single constant range) and the original operands (vector). Go to overdefined in that case. Fixes PR47991.	2020-11-03 12:58:39 +00:00
David Green	bd32386410	[ARM] Remove unused variable. NFC	2020-11-03 12:58:10 +00:00
Joachim Protze	71041a8b6b	[OpenMP][libomptarget][Tests] fix failing test D88149 updated `omp_get_initial_device` behavior to conform with OpenMP 5.1. omp_get_initial_device() == omp_get_num_devices()	2020-11-03 13:15:33 +01:00
Joachim Protze	b0eb19bf8a	[OpenMP][OMPT][NFC] Fix flaky test As reported by @ronlieb, the test shows intermittent fails. The test failed, if the dependent task was already finished, when the depending task was to be created. We have other tests to check for the dependences pair.	2020-11-03 13:15:32 +01:00
Joachim Protze	e99207feb4	[OpenMP][Tool] Handle detached tasks in Archer Since detached tasks are supported by clang and the OpenMP runtime, Archer must expect to receive the corresponding callbacks. This patch adds support to interpret the synchronization semantics of omp_fulfill_event and cleans up the handling of task switches.	2020-11-03 13:15:32 +01:00
Hans Wennborg	cbf25fbed5	Revert "[CodeGen] [WinException] Only produce handler data at the end of the function if needed" This caused an explosion in ICF times during linking on Windows when libfuzzer instrumentation is enabled. For a small binary we see ICF time go from ~0 to ~10 s. For a large binary it goes from ~1 s to forevert (I gave up after 30 minutes). See comment on the code review. > If we are going to write handler data (that is written as variable > length data following after the unwind info in .xdata), we need to > emit the handler data immediately, but for cases where no such > info is going to be written, skip emitting it right away. (Unwind > info for all remaining functions that hasn't gotten it emitted > directly is emitted at the end.) > > This does slightly change the ordering of sections (triggering a > bunch of updates to DebugInfo/COFF tests), but the change should be > benign. > > This also matches GCC's assembly output, which doesn't output > .seh_handlerdata unless it actually is needed. > > For ARM64, the unwind info can be packed into the runtime function > entry itself (leaving no data in the .xdata section at all), but > that can only be done if there's no follow-on data in the .xdata > section. If emission of the unwind info is triggered via > EmitWinEHHandlerData (or the .seh_handlerdata directive), which > implicitly switches to the .xdata section, there's a chance of the > caller wanting to pass further data there, so the packed format > can't be used in that case. > > Differential Revision: https://reviews.llvm.org/D87448 This reverts commit `36c64af9d7`.	2020-11-03 13:12:10 +01:00
Stefan Gränitz	b397795f1a	[JITLink][ELF] Implement R_X86_64_PLT32 relocations Basic implementation for call and jmp branches with 32 bit offset. Branches to local targets produce Branch32 edges that are resolved like a regular PCRel32 relocations. Branches to external (undefined) targets produce Branch32ToStub edges and go through a PLT entry by default. If the target happens to get resolved within the 32 bit range from the callsite, the edge is relaxed during post-allocation optimization. There is a test for each of these cases. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D90331	2020-11-03 12:05:54 +00:00

1 2 3 4 5 ...

371003 Commits All Branches Search

371003 Commits

All Branches