llvm-project

Commit Graph

Author	SHA1	Message	Date
Shraiysh Vaishay	35fc666877	[OpenMP][IRBuilder] Add support for taskgroup This patch adds support for generating taskgroup construct. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D128203	2022-07-19 10:49:34 +05:30
Jacques Pienaar	c8598fa22f	[mlir] Add refineReturnTypes to InferTypeOpInterface refineReturnType method shares the same parameters as inferReturnTypes but gets passed in the return types of the op if known that can be used during refinement passes or for more op specific error reporting. Currently the error reporting on failure is generic and doesn't allow for specializing the returned result based on failure, with this change what would previously have been a separate trait with specialized verification can just be handled as part of inferrence rather than duplicated. refineReturnTypes behaves like inferReturnTypes if no result types are fed in, while the current verification is recast as the default implementation for refineReturnTypes with it calling inferReturnTypes (and so the default type verification now goes through refine and allows for more op specific inference mismatch errors). Differential Revision: https://reviews.llvm.org/D129955	2022-07-18 22:18:52 -07:00
Carlos Alberto Enciso	83e922562f	Update the Windows packaging script. As discussed on: https://discourse.llvm.org/t/build-llvm-release-bat-script-options/63146/6 - Refactor the build/test steps into functions. - Exit the script if the build directory already exists. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D129559	2022-07-19 05:55:14 +01:00
Nathan James	6357f1c1aa	[clang-tidy] Remove unnecessary code from ReadabilityModuleTest D56303 added testing code that was then made redundant by the changes in D125026. However this code wasn't completely removed in the latter patch. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D130026	2022-07-19 05:21:19 +01:00
Konstantin Varlamov	8ed702b83f	[libc++][ranges] Implement `ranges::{,stable_}partition`. Differential Revision: https://reviews.llvm.org/D129624	2022-07-18 21:06:17 -07:00
Lang Hames	67220c2ad7	[ORC] Fix serialization / deserialization of default-constructed ArrayRef<char>. Avoids a zero-length memcpy from a null src, which caused errors on some of the sanitizer bots. Also uses null when deserializing an empty ArrayRef (rather than pointing to a zero length range in the middle of the input buffer).	2022-07-18 20:39:01 -07:00
jacquesguan	58dfaaaace	[DAGCombiner] Teach scalarizeBinOpOfSplats handle scalable splat. This revision supports to scalarize a binary operation of two scalable splat vectors. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D122791	2022-07-19 11:20:51 +08:00
jacquesguan	3fcaea176c	[RISCV][test] Precommit test for D122791. Differential Revision: https://reviews.llvm.org/D123362	2022-07-19 10:56:02 +08:00
Kazushi (Jam) Marukawa	469044cfd3	[VE] Support load/store/spill of vector mask registers Support load/store/spill of vector mask registers and add regression tests. Reviewed By: efocht Differential Revision: https://reviews.llvm.org/D129415	2022-07-19 10:29:21 +09:00
zhongyunde	bddf20735e	[AArch64][NFC] Set true for default of subfeature is more readable Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D129960	2022-07-19 09:00:00 +08:00
Jim Ingham	83fab8cee9	Revert "Make hit point counts reliable for architectures that stop before evaluation." This reverts commit `5778ada8e5`. The watchpoint tests all stall on aarch64-ubuntu bots. Reverting till I can get my hands on an system to test this out.	2022-07-18 17:38:43 -07:00
Jim Ingham	4f5707e743	Revert "This is a followup to https://reviews.llvm.org/D129814 " This reverts commit `555ae5b8f5`. Apparently, there's something different about how Linux ARM handles watchpoints, as all the watchpoint tests seem to stall on the Ubuntu aarch64 bots. Reverting till I can get my hands on a linux system and see what is wrong.	2022-07-18 17:37:13 -07:00
ksyx	3198364e6e	[RISCV][Clang] Add support for Zmmul extension This patch implements recently ratified extension Zmmul, a subextension of M (Integer Multiplication and Division) consisting only multiplication part of it. Differential Revision: https://reviews.llvm.org/D103313 Reviewed By: craig.topper, jrtc27, asb	2022-07-18 20:26:08 -04:00
Argyrios Kyrtzidis	d1b58cada6	[unittests/Tooling/DependencyScannerTest] Add a target triple for `ScanDepsWithFS` test This should fix the `clang-ppc64-aix` builder.	2022-07-18 16:55:07 -07:00
Rahman Lavaee	ed93d157de	[llvm-objdump] Support --symbolize-operands when there is a single SHT_LLVM_BB_ADDR_MAP section for all text sections When linking, using `-Wl,-z,keep-text-section-prefix` results in multiple text sections while all `SHT_LLVM_BB_ADDR_MAP` sections are linked into a single one. In such case, we should not read the corresponding section for each text section, and instead read all `SHT_LLVM_BB_ADDR_MAP` sections before disassembly. Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D129924	2022-07-18 16:51:22 -07:00
Jim Ingham	555ae5b8f5	This is a followup to https://reviews.llvm.org/D129814 That was causing hit counts to be double-counted on x86_64 Linux. It looks like StopInfoWatchpoint::ShouldStopSynchronous gets called twice for a give stop on Linux (not on Darwin). I had taken out the "have I been called already" check when I reworked this part of the code because it didn't seem necessary. Putting that back in because it looks like it is on some systems.	2022-07-18 16:24:31 -07:00
Ellis Hoag	3580daacf3	[InstrProf] Allow CSIRPGO function entry coverage The flag `-fcs-profile-generate` for enabling CSIRPGO moves the pass `pgo-instrumentation` after inlining. Function entry coverage works fine with this change, so remove the assert. I had originally left this assert in because I had not tested this at the time. Reviewed By: davidxl, MaskRay Differential Revision: https://reviews.llvm.org/D129407	2022-07-18 15:10:11 -07:00
Jim Ingham	e83d47f6b7	When the module path for `command script import` is invalid, echo the path. We were just emitting "invalid module" w/o saying which module. That's not particularly helpful. Differential Revision: https://reviews.llvm.org/D129338	2022-07-18 14:49:07 -07:00
Jim Ingham	5778ada8e5	Make hit point counts reliable for architectures that stop before evaluation. Since we want to present the "new & old" values for watchpoint hits, on architectures, including the ARM family, that stop before the triggering instruction is run, we need to single step over the instruction before stopping for realz. This was incorrectly done directly in the StopInfoWatchpoint::ShouldStop. That causes problems if more than one thread stops "for a reason" at the same time as the watchpoint, since the other actions didn't expect the process to make progress in this part of the execution control machinery. The correct way to do this is to schedule the step over using ThreadPlans, and then to restore the stop info after that plan stops, so that the rest of the stop info actions can happen when all the other threads have handled their immediate actions as well. Differential Revision: https://reviews.llvm.org/D129814	2022-07-18 14:36:32 -07:00
Matt Arsenault	8d0383eb69	CodeGen: Remove AliasAnalysis from regalloc This was stored in LiveIntervals, but not actually used for anything related to LiveIntervals. It was only used in one check for if a load instruction is rematerializable. I also don't think this was entirely correct, since it was implicitly assuming constant loads are also dereferenceable. Remove this and rely only on the invariant+dereferenceable flags in the memory operand. Set the flag based on the AA query upfront. This should have the same net benefit, but has the possible disadvantage of making this AA query nonlazy. Preserve the behavior of assuming pointsToConstantMemory implying dereferenceable for now, but maybe this should be changed.	2022-07-18 17:23:41 -04:00
Michael Jones	bf7f01d857	[libc] fix strtofloatingpoint on rare edge case Currently, there are two string parsers that can be used in a call to strtofloatingpoint. There is the main parser used by Clinger's fast path and Eisel-Lemire, and the backup parser used by Simple Decimal Conversion. There was a bug in the backup parser where if the number had more than 800 digits (the size of the SDC buffer) before the decimal point, it would just ignore the digits after the 800th and not count them into the exponent. This patch fixes that issue and adds regression tests. Reviewed By: lntue Differential Revision: https://reviews.llvm.org/D130032	2022-07-18 14:23:33 -07:00
zr33	1a1324a303	[BOLT][DWARF] Fix incorrect DW_AT_type offset for unittest Some unit tests has incorrect DW_AT_type offset since they are manual crafted, fix them to the correct offset. Reviewed By: Amir, ayermolo Differential Revision: https://reviews.llvm.org/D129828	2022-07-18 14:20:22 -07:00
zr33	66a41e0807	[BOLT][DWARF] Add Unit test for DW_AT_high_pc [DW_FORM_addr] Reviewed By: ayermolo Differential Revision: https://reviews.llvm.org/D127613	2022-07-18 14:03:53 -07:00
Sam McCall	fa0c7639e9	[pseudo] Add guards for module contextual keywords	2022-07-18 22:38:41 +02:00
Martin Storsjö	315072b450	[clang-tidy] Reduce the dependencies for the "make-confusable-table" tool When cross compiling llvm, a separate recursive native cmake build is generated, for building the tools that generate code (unless they're provided externally by the caller). This reduces the number of build steps for that native build from 1000+ steps to 162. This matches how the clang-pseudo-gen tool is set up in clang-tools-extra/pseudo/gen/CMakeLists.txt. Differential Revision: https://reviews.llvm.org/D129797	2022-07-18 22:50:29 +03:00
Björn Schäpers	d2eda49202	[clang-format] Mark constexpr lambdas as lambda Otherwise the brace was detected as a function brace, not wrong per se, but when directly calling the lambda the calling parens were put on the next line. Differential Revision: https://reviews.llvm.org/D129946	2022-07-18 21:42:34 +02:00
Björn Schäpers	3c18a8b3a3	[clang-format] Indent TT_CtorInitializerColon after requires clauses Fixes https://github.com/llvm/llvm-project/issues/56215 Differential Revision: https://reviews.llvm.org/D129942	2022-07-18 21:41:09 +02:00
Björn Schäpers	2b04c41b28	[clang-format] Fix misannotation of colon in presence of requires clause For clauses without parentheses it was annotated as TT_InheritanceColon. Relates to https://github.com/llvm/llvm-project/issues/56215 Differential Revision: https://reviews.llvm.org/D129940	2022-07-18 21:41:09 +02:00
Stanislav Mekhanoshin	523a99c0eb	[AMDGPU] Support for gfx940 fp8 smfmac Differential Revision: https://reviews.llvm.org/D129908	2022-07-18 12:12:41 -07:00
Stanislav Mekhanoshin	2695f0a688	[AMDGPU] Support for gfx940 fp8 mfma Differential Revision: https://reviews.llvm.org/D129906	2022-07-18 11:49:56 -07:00
Stanislav Mekhanoshin	9fa5a6b7e8	[AMDGPU] Support for gfx940 fp8 conversions Differential Revision: https://reviews.llvm.org/D129902	2022-07-18 11:48:43 -07:00
Florian Hahn	30e53b8c03	[LV] Sink module variable and use State to set it in widenCall. (NFC) Limits the lifetime of the variable and makes it independent of CallInst.	2022-07-18 19:41:48 +01:00
Jay Foad	dbed4326dd	[LiveIntervals] Find better anchoring end points when repairing ranges r175673 changed repairIntervalsInRange to find anchoring end points for ranges automatically, but the calculation of Begin included the first instruction found that already had an index. This patch changes it to exclude that instruction: 1. For symmetry, so that the half open range [Begin,End) only includes instructions that do not already have indexes. 2. As a possible performance improvement, since repairOldRegInRange will scan fewer instructions. 3. Because repairOldRegInRange hits assertion failures in some cases when it sees a def that already has a live interval. (3) fixes about ten tests in the CodeGen lit test suite when -early-live-intervals is forced on. Differential Revision: https://reviews.llvm.org/D110182	2022-07-18 19:34:43 +01:00
Mubariz Afzal	c444f03787	Reland "[SystemZ][z/OS] Fix f32 variadic argument assertion" This patch relands the f32 vararg assertion on z/OS fix that was reverted previously due to the testcase failing on non-z/OS platforms. It is now passing. The tablegen lines that specify the XPLINK64 calling convention for promoting an f32 vararg to an f64 are effectively overwritten by the following tablegen line which bitcast an f64 vararg to an i64 (so that it can be used in the GPRs). Thus it becomes a bitcast from f32 to i64. We don't handle bitcasts for f32s and so this causes an assertion to be thrown. We fix this by simplifying the tablegen lines to explicity show this behaviour, and allow the f32 in the bitcast case by first promoting it to an f64.	2022-07-18 14:25:17 -04:00
Mehdi Amini	d04c2b2fd9	Revert "[MLIR] Generic 'malloc', 'aligned_alloc' and 'free' functions" This reverts commit `3e21fb616d`. A lot of integration tests are failing on the bot.	2022-07-18 18:07:36 +00:00
Craig Topper	0b02752899	[RISCV] Optimize (seteq (i64 (and X, 0xffffffff)), C1) (and X, 0xffffffff) requires 2 shifts in the base ISA. Since we know the result is being used by a compare, we can use a sext_inreg instead of an AND if we also modify C1 to have 33 sign bits instead of 32 leading zeros. This can also improve the generated code for materializing C1. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D129980	2022-07-18 10:54:45 -07:00
Craig Topper	464b3a9d8a	[RISCV] Pre-commit tests for D129980. NFC Differential Revision: https://reviews.llvm.org/D129981	2022-07-18 10:54:45 -07:00
LLVM GN Syncbot	42fdb487d7	[gn build] Port `e24b390dbc`	2022-07-18 17:45:05 +00:00
LLVM GN Syncbot	81a059a8d4	[gn build] Port `0f9d9edd24`	2022-07-18 17:45:04 +00:00
Arnold Schwaighofer	28ebd13d63	[coro async] Fix code to run coro.async.end cleanup like the legacy pass did The code executed for the Switch ABI does not change. rdar://97074714 Differential Revision: https://reviews.llvm.org/D129865	2022-07-18 10:41:29 -07:00
Matt Arsenault	e24b390dbc	llvm-reduce: Add reduction for instruction defs Try to insert an implicit_def to replace the instruction's value, replacing the original instruction's def with a dead register. If all defs are delete the instruction entirely. This is pretty similar to the instruction reduction, but leaves the new defs in the same place as the original instruction. This could possibly replace it. I'm not sure if we should directly delete the instructions here, or leave dead ones behind. This could also further work to replace physical register defs.	2022-07-18 13:41:08 -04:00
Matt Arsenault	0f9d9edd24	llvm-reduce: Add reduction for custom register masks I have a register allocator failure that only reproduces with IPRA enabled, and requires the specific regmask if I want to only run the one relevant pass. The printed custom regmask is enormous and I would like to reduce it. This reduces each individual bit in the mask, but it would probably be better to start at register units and clear all aliasing fields at a time. This would require stricter verification that all aliasing bits are set in regmasks (although I would prefer to switch regmasks to use register units in the first place).	2022-07-18 13:41:08 -04:00
Alex Bradbury	86c4242976	[docs] Remove unmaintained target feature matrix Back in 2017, a table was added to the codegen documentation listing which features various backends support. It received a few updates since then, but not since the end of 2019. Having such a table is a nice idea, but it hasn't been kept up to date, it isn't easy to ensure that it is up to date, and the table probably isn't very discoverable for most users who would be interested in this information anyway (it would be better suited to some kind of "what can LLVM do for me?" page). For all of the above reasons, I believe it makes sense to remove it. Differential Revision: https://reviews.llvm.org/D129996	2022-07-18 18:38:23 +01:00
Daniel Bertalan	1fb9466c6a	[lld-macho] Devirtualize TargetInfo::getRelocAttrs This method is called on each relocation when parsing input files, so the overhead of using virtual functions ends up being quite large. We now have a single non-virtual method, which reads from the appropriate array of relocation attributes set in the TargetInfo constructor. This change results in a modest 2.3% reduction in link time for chromium_framework measured on an x86-64 VPS, and 0.7% on an arm64 Mac. N Min Max Median Avg Stddev x 10 11.869417 12.032609 11.935041 11.938268 0.045802324 + 10 11.581526 11.785265 11.649885 11.659507 0.054634834 Difference at 95.0% confidence -0.278761 +/- 0.0473673 -2.33502% +/- 0.396768% (Student's t, pooled s = 0.0504124) Differential Revision: https://reviews.llvm.org/D130000	2022-07-18 19:32:58 +02:00
Arjun P	477c2c6f4a	[MLIR][Presburger] fix warning under g++ (NFC)	2022-07-18 18:02:20 +01:00
Igor Kudrin	32eed8828e	Reapply "[NVPTX] Use the mask() operator to initialize packed structs with pointers" The original patch revealed an issue of reading incorrect values on BE hosts. That is now changed to use `endian::read32le()` and `endian::read64le()`. Original commit message: The current implementation assumes that all pointers used in the initialization of an aggregate are aligned according to the pointer size of the target; that might not be so if the object is packed. In that case, an array of .u8 should be used and pointers should be decorated with the mask() operator. The operator was introduced in PTX ISA 7.1, so an error is issued if the case is detected for an earlier version. Differential Revision: https://reviews.llvm.org/D127504	2022-07-18 20:56:26 +04:00
Craig Topper	7c0b9b379b	[RISCV] Add isel patterns for ineg+setge/le/uge/ule. setge/le/uge/ule selected by themselves require an xori with 1. If we're negating the setcc, we can fold the xori with the neg to create an addi with -1. This works because xori X, 1 is equivalent to 1 - X if X is either 0 or 1. So we're doing -(1 - X) which is X-1 or X+-1. This improves the code for selecting between 0 and -1 based on a condition for some conditions. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D129957	2022-07-18 09:55:01 -07:00
Nicolai Hähnle	1586075a00	Rerun ./utils/update_cc_test.py on a bunch of tests Due to update script changes; this reduces the size of a later "real" diff.	2022-07-18 18:48:34 +02:00
Joseph Huber	47b0aa5e4b	[LinkerWrapper] Rework passing args to the LLVM backend	2022-07-18 12:44:15 -04:00
Argyrios Kyrtzidis	fbbabd4ca0	[Tooling/DependencyScanning] Enable passing a `vfs::FileSystem` object to `DependencyScanningTool` Also include a unit test to validate that the `vfs::FileSystem` object is properly used. Differential Revision: https://reviews.llvm.org/D129912	2022-07-18 09:37:17 -07:00

... 5 6 7 8 9 ...

430548 Commits All Branches Search

430548 Commits

All Branches