llvm-project

Commit Graph

Author	SHA1	Message	Date
Kadir Cetinkaya	fbf1745722	[clangd] Escape error message in AddUsing Fixes https://github.com/clangd/clangd/issues/900	2021-10-28 14:52:12 +02:00
Nico Weber	5d64bf00ac	[gn build] (manually) port `d736002e90`	2021-10-28 08:48:54 -04:00
Alexey Bataev	07ef9f513f	[SLP]Improve/fix reordering of the gathered graph nodes. Gathered loads/extractelements/extractvalue instructions should be checked if they can represent a vector reordering node too and their order should ve taken into account for better graph reordering analysis/ Also, if the gather node has reused scalars, they must be reordered instead of the scalars themselves. Differential Revision: https://reviews.llvm.org/D112454	2021-10-28 05:45:09 -07:00
Hans Wennborg	4d2765e994	Re-instate -Wweak-template-vtables as a no-op flag Follow-up to `8c13680524` to allow a less abrupt migration for users. Differential revision: https://reviews.llvm.org/D112704	2021-10-28 14:40:59 +02:00
Uday Bondhugula	57b9b29649	[MLIR][LLVM] Add llvm.mlir.global_ctors/dtors and translation support Add llvm.mlir.global_ctors and global_dtors ops and their translation support to LLVM global_ctors/global_dtors global variables. Differential Revision: https://reviews.llvm.org/D112524	2021-10-28 18:09:34 +05:30
Pavel Labath	349295fcf3	[lldb/test] Allow indentation in inline tests This makes it possible to use for loops (and other language constructs) in inline tests. Differential Revision: https://reviews.llvm.org/D112706	2021-10-28 14:39:02 +02:00
Sanjay Patel	e8535fa784	[InstCombine] allow Negator to fold multi-use select with constant arms The motivating test is reduced from: https://llvm.org/PR52261 Note that the more general problem of folding any binop into a multi-use select of constants is still there. We need to ease the restriction in InstCombinerImpl::FoldOpIntoSelect() to catch those. But these examples never reach that code because Negator exclusively handles negation patterns within visitSub(). Differential Revision: https://reviews.llvm.org/D112657	2021-10-28 08:35:58 -04:00
Peter Waller	98f08752f7	[InstCombine][ConstantFolding] Make ConstantFoldLoadThroughBitcast TypeSize-aware The newly added test previously caused the compiler to fail an assertion. It looks like a strightforward TypeSize upgrade. Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D112142	2021-10-28 12:15:15 +00:00
David Green	0a2708d2ae	[InstSimplify] Add tests for the range of a half float. NFC	2021-10-28 12:58:13 +01:00
Konstantin Schwarz	c09f1fc74c	[GlobalISel][Tablegen] Fix SameOperandMatcher's isIdentical check During rule optimization, identical SameOperandMatchers are hoisted into a common group, however previously only one operand index was considered. Commutable patterns can introduce SameOperandMatcher checks where the second index is commuted, resulting in a different check that cannot be hoisted. Reviewed By: qcolombet Differential Revision: https://reviews.llvm.org/D111506	2021-10-28 13:37:12 +02:00
Jon Chesterfield	4d50803ce4	[libomptarget] Build DeviceRTL for amdgpu Passes same tests as the current deviceRTL. Includes cmake change from D111987. CI is showing a different set of pass/fails to local, committing this without the tests enabled by default while debugging that difference. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D112227	2021-10-28 12:34:01 +01:00
Dmitry Vyukov	d736002e90	tsan: move memory access functions to a separate file tsan_rtl.cpp is huge and does lots of things. Move everything related to memory access and tracing to a separate tsan_rtl_access.cpp file. No functional changes, only code movement. Reviewed By: vitalybuka, melver Differential Revision: https://reviews.llvm.org/D112625	2021-10-28 13:31:10 +02:00
Abinav Puthan Purayil	2da6ef3664	[AMDGPU] Add 24-bit mulhi intrinsics in INTRINSIC_WO_CHAIN combine. mul24 intrinsic's operands are simplified by AMDGPUTargetLowering::performIntrinsicWOChainCombine(). This change adds the mul24hi intrinsics in the combine since its operands can be simplified like that of the mul24 intrinsics. Differential Revision: https://reviews.llvm.org/D112702	2021-10-28 16:57:48 +05:30
Abinav Puthan Purayil	9f8e779b42	[AMDGPU] Fix rhs of the tests in amdgpu-codegenprepare-mul24.ll. Differential Revision: https://reviews.llvm.org/D112685	2021-10-28 16:57:48 +05:30
Guillaume Chatelet	00c943a548	[libc] automemcpy	2021-10-28 11:10:15 +00:00
Emil Kieri	848cca6c5b	[flang] Checks for pointers to intrinsic functions Check that when a procedure pointer is initialised or assigned with an intrinsic function, or when its interface is being defined by one, that intrinsic function is unrestricted specific (listed in Table 16.2 of F'2018). Mark intrinsics LGE, LGT, LLE, and LLT as restricted specific. Getting their classifications right helps in designing the tests. Differential Revision: https://reviews.llvm.org/D112381	2021-10-28 12:30:29 +02:00
Kirill Bobyrev	f9201c70ad	[clangd] NFC: Use more idiomatic way of checking for definition	2021-10-28 12:25:12 +02:00
Kirill Bobyrev	56a8aee100	[clangd] NFC: Match function signature in the header and source file	2021-10-28 12:13:18 +02:00
OCHyams	b07d59c495	[dexter] XFAIL feature_test source-root-dir.cpp Test is failing for unknown reasons and needs investigating.	2021-10-28 11:13:01 +01:00
Jay Foad	c6b4fb87c0	[AMDGPU] Add gfx10 uaddsat test coverage. NFC.	2021-10-28 10:24:12 +01:00
Max Kazantsev	8daf76935d	[Test] Regenerate some of llc test checks using auto updater	2021-10-28 16:18:30 +07:00
Balazs Benics	49285f43e5	[analyzer] sprintf is a taint propagator not a source Due to a typo, `sprintf()` was recognized as a taint source instead of a taint propagator. It was because an empty taint source list - which is the first parameter of the `TaintPropagationRule` - encoded the unconditional taint sources. This typo effectively turned the `sprintf()` into an unconditional taint source. This patch fixes that typo and demonstrated the correct behavior with tests. Reviewed By: martong Differential Revision: https://reviews.llvm.org/D112558	2021-10-28 11:03:02 +02:00
Shraiysh Vaishay	30bd11fab4	[MLIR][OpenMP] Fixed the missing inclusive clause in omp.wsloop and fix order clause This patch adds the inclusive clause (which was missed in previous reorganization - https://reviews.llvm.org/D110903) in omp.wsloop operation. Added a test for validating it. Also fixes the order clause, which was not accepting any values. It now accepts "concurrent" as a value, as specified in the standard. Reviewed By: kiranchandramohan, peixin, clementval Differential Revision: https://reviews.llvm.org/D112198	2021-10-28 14:18:05 +05:30
Sebastian Neubauer	fd1cfc9094	[AMDGPU][GlobalISel] Fix waterfall loops - Move the `s_and exec` to its correct position before the content of the waterfall loop - Use the SI_WATERFALL pseudo instruction, like for sdag, to benefit from optimizations - Add support for indirect function calls To support indirect calls, add a G_SI_CALL instruction without register class restrictions and insert a waterfall loop when applying register banks. Differential Revision: https://reviews.llvm.org/D109052	2021-10-28 10:30:55 +02:00
Neubauer, Sebastian	50d8d963e3	[GlobalISel] Simplify RegBankSelect Save the instruction list of a block before selecting banks. This allows to cope with moved instructions, even if they are reordered or splitted into multiple basic blocks. Differential Revision: https://reviews.llvm.org/D111223	2021-10-28 10:30:55 +02:00
Pavel Labath	5f4980f004	[lldb] Remove ConstString from Process, ScriptInterpreter and StructuredData plugin names	2021-10-28 10:15:03 +02:00
Max Kazantsev	21adcdb712	[Test] Regenerate checks using auto-update script	2021-10-28 15:13:43 +07:00
Caroline Concatto	2186b011e9	[Driver][AArch64]Add driver support for neoverse-512tvb target The support for neoverse-512tvb mirrors the same option available in GCC[1]. There is no functional effect for this option yet. This patch ensures the driver accepts "-mcpu=neoverse-512tvb", and enough plumbing is in place to allow the new option to be used in the future. [1]https://gcc.gnu.org/onlinedocs/gcc/AArch64-Options.html Differential Revision: https://reviews.llvm.org/D112406	2021-10-28 09:08:40 +01:00
Michał Górny	073c5d0e47	[lldb] [Host/Socket] Make DecodeHostAndPort() return a dedicated struct Differential Revision: https://reviews.llvm.org/D112629	2021-10-28 09:57:50 +02:00
Diana Picus	824bf90819	[flang] runtime: Read environment variables directly Add support for reading environment variables directly, via std::getenv. This needs to allocate a C-style string to pass into std::getenv. If the memory allocation for that fails, we terminate. This also changes the interface for EnvVariableLength to receive the source file and line so we can crash gracefully. Note that we are now completely ignoring the envp pointer passed into ProgramStart, since that could go stale if the environment is modified during execution. Differential Revision: https://reviews.llvm.org/D111785	2021-10-28 07:49:30 +00:00
Martin Storsjö	177176f75c	[Support] [Windows] Manually clean up temp files if not setting delete disposition Since D81803 / `79657e2339`, temp files created on network shares don't set "Disposition.DeleteFile = true". This flag normally takes care of removing the temp file both if the process exits abnormally (either crashing or killed externally), and when the file is closed cleanly. For network shares, we voluntarily choose to not set the flag, and if the operation to inspect the file handle (as a prerequisite to setting the flag since `79657e2339`) fails we also error out. In both of these cases, we can at least make sure to remove the temp files when they are closed cleanly. Adjust the semantics of "OF_Delete" to not set the delete disposition, but only set the access mode for allowing deletion. Move the call to setDeleteDisposition into TempFile::create, where we can check if it failed, and if it did, set a flag noting that the file should be removed manually at the end. This does leak files on crash, but at least doesn't leak files in regular successful runs. (Technically, the alternative codepath could use the RemoveFileOnSignal function, but that might complicate the TempFile implementation further.) This fixes https://github.com/mstorsjo/llvm-mingw/issues/233 and https://bugs.llvm.org/show_bug.cgi?id=52080. Differential Revision: https://reviews.llvm.org/D111875	2021-10-28 10:33:37 +03:00
Martin Storsjö	897c86dec5	[clang] [MinGW] Rename the 'Arch' member to 'SubdirName'. NFC. This string isn't a plain architecture name, but contains the whole subdir name used for the sysroot, which often is equal to the target triple. Differential Revision: https://reviews.llvm.org/D112387	2021-10-28 10:26:54 +03:00
YunQiang Su	284c2ebc5e	[clang][MIPS] Fix search path for Debian multilib O32 In the situation of multilib, the gcc objects are in a /32 directory. On Debian, the libraries is under /libo32 to avoid confliction. This patch enables clang find gcc in /32, and C lib in /libo32. Differential Revision: https://reviews.llvm.org/D112158	2021-10-28 10:23:06 +03:00
Sam McCall	73453e7ade	[clangd] Avoid expensive checks of buffer names in IncludeCleaner This changes the handling of special buffers (<command-line> etc) that SourceManager treats as files but FileManager does not. We now include them in findReferencedFiles() and drop them as part of translateToHeaderIDs(). This pairs more naturally with the data representations we're using, and so avoids a bunch of converting between representations for filtering. Differential Revision: https://reviews.llvm.org/D112652	2021-10-28 08:00:57 +02:00
Hongtao Yu	259e4c5658	[CSSPGO] Trim cold base profiles for the CS preinliner. Adding support to the CS preinliner to trim cold base profiles. This makes trimming consistent with the inline decision made by the preinliner. Also disable the existing profile merger when preinliner is on unless explicitly specified. Reviewed By: wenlei, wlei Differential Revision: https://reviews.llvm.org/D112489	2021-10-27 22:50:27 -07:00
Hsiangkai Wang	7051f73d69	[RISCV] Sync Zvlsseg register order as the same as vector registers. Sync the order of Zvlsseg registers with vector registers to avoid unnecessary register copies between vector instructions and zvlsseg instructions. Differential Revision: https://reviews.llvm.org/D110250	2021-10-28 13:34:53 +08:00
Greg Clayton	1300556479	Add unix signal hit counts to the target statistics. Android and other platforms make wide use of signals when running applications and this can slow down debug sessions. Tracking this statistic can help us to determine why a debug session is slow. The new data appears inside each target object and reports the signal hit counts: "signals": [ { "SIGSTOP": 1 }, { "SIGUSR1": 1 } ], Differential Revision: https://reviews.llvm.org/D112683	2021-10-27 22:31:14 -07:00
thomasraoux	eacd6e1ebe	[mlir][GPUtoNVVM] Relax restriction on wmma op lowering Allow lowering of wmma ops with 64bits indexes. Change the default version of the test to use default layout. Differential Revision: https://reviews.llvm.org/D112479	2021-10-27 21:31:55 -07:00
Kazu Hirata	cee3419d65	[AMDGPU] Remove unused declaration findNumUsedRegistersSI (NFC)	2021-10-27 21:24:02 -07:00
Max Kazantsev	4024ca8922	[Test] Add test showing missing simplifycfg opportunity for Phi with undef inputs	2021-10-28 11:23:07 +07:00
Phoebe Wang	2bc28c6f82	[X86] Add a dependency breaking xor before any gathers with an undef passthru value. In the instruction encoding, the passthru register is always tied to the destination register. The CPU scheduler has to wait for the last writer of this register to finish executing before the gather can start. This is true even if the initial mask is all ones so that the passthru will never be used. By explicitly zeroing the register we can break the false dependency. The zero idiom is executed completing by the register renamer and so is immedately considered ready. Authored by Craig. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D112505	2021-10-28 11:44:52 +08:00
Hsiangkai Wang	0a9b82960c	[RISCV] Use vmv.v.[v\|i] if we know COPY is under the same vl and vtype. If we know the source operand of COPY is defined by a vector instruction with tail agnostic and the same LMUL and there is no vsetvli between COPY and the define instruction to change the vl and vtype, we could use vmv.v.v or vmv.v.i to copy vector registers to get better performance than the whole vector register move instructions. If the source of COPY is from vmv.v.i, we could use vmv.v.i for the COPY. This patch only considers all these instructions within one basic block. Case 1: ``` bb.0: ... VSETVLI # The first VSETVLI before COPY and VOP. ... # Use this VSETVLI to check LMUL and tail agnostic. ... vy = VOP va, vb # Define vy. ... # There is no vsetvli between VOP and COPY. vx = COPY vy ``` Case 2: ``` bb.0: ... VSETVLI # The first VSETVLI before VOP. ... # Use this VSETVLI to check LMUL and tail agnostic. ... vy = VOP va, vb # Define vy. ... # There is no vsetvli to change vl between VOP and COPY. ... VSETVLI # The first VSETVLI before COPY. ... # This VSETVLI does not change vl and vtype. ... vx = COPY vy ``` Co-Authored-by: Zakk Chen <zakk.chen@sifive.com> Co-Authored-by: Kito Cheng <kito.cheng@sifive.com> Differential Revision: https://reviews.llvm.org/D103510	2021-10-28 11:39:04 +08:00
Michael Benfield	15e3d39110	[clang] Fortify warning for scanf calls with field width too big. Differential Revision: https://reviews.llvm.org/D111833	2021-10-28 02:52:03 +00:00
Abinav Puthan Purayil	fa592180b3	[AMDGPU] Add more llc tests for 48-bit mul generation. Differential Revision: https://reviews.llvm.org/D112554	2021-10-28 08:10:04 +05:30
Max Kazantsev	513914e1f3	[SCEV] Invalidate user SCEVs along with operand SCEVs to avoid cache corruption Following discussion in D110390, it seems that we are suffering from unability to traverse users of a SCEV being invalidated. The result of that is that ScalarEvolution's inner caches may store obsolete data about SCEVs even if their operands are forgotten. It creates problems when we try to verify the contents of those caches. It's also a frequent situation when messing with cache causes very sneaky and hard-to-analyze bugs related to corruption of memory when dealing with cached data. They are lurking there because ScalarEvolution's veirfication is not powerful enough and misses many problematic cases. I plan to make SCEV's verification much stricter in follow-ups, and this requires dangling-pointers-free caches. This patch makes sure that, whenever we forget cached information for a SCEV, we also forget it for all SCEVs that (transitively) use it. This may have negative compile time impact. It's a sacrifice we are more than willing to make to enforce correctness. We can also save some time by reworking invokers of forgetMemoizedResults (maybe we can forget multiple SCEVs with single query). Differential Revision: https://reviews.llvm.org/D111533 Reviewed By: reames	2021-10-28 09:39:24 +07:00
Craig Topper	1387483e72	[RISCV] Replace most uses of RISCVSubtarget::hasStdExtV. NFCI Add new hasVInstructions() which is currently equivalent. Replace vector uses of hasStdExtZfh/F/D with new vector specific versions. The vector spec no longer requires that the vectors implement the same types as scalar. It only requires that the scalar type is the maximum size the vectors can support. This is currently implemented using the scalar rule we were using before. Add new hasVInstructionsI64() begin using to qualify code that requires i64 vector elements. This is all NFC for now, but we can start using this to better implement D112408 which introduces the Zve extensions. Reviewed By: frasercrmck, eopXD Differential Revision: https://reviews.llvm.org/D112496	2021-10-27 19:33:48 -07:00
Florian Mayer	dd943ebc6d	[hwasan] print exact mismatch offset for short granules. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D104463	2021-10-28 03:31:11 +01:00
Kai Luo	6ea2431d3f	[clang][compiler-rt][atomics] Add `__c11_atomic_fetch_nand` builtin and support `__atomic_fetch_nand` libcall Add `__c11_atomic_fetch_nand` builtin to language extensions and support `__atomic_fetch_nand` libcall in compiler-rt. Reviewed By: theraven Differential Revision: https://reviews.llvm.org/D112400	2021-10-28 02:18:43 +00:00
Johannes Doerfert	6cf6fa6ef1	[OpenMP] Declare variants for templates need to match # template args A declare variant template is only compatible with a base when the number of template arguments is equal, otherwise our instantiations will produce nonsensical results. Exposes as part of D109344. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D109770	2021-10-27 21:04:32 -05:00
Johannes Doerfert	acf3093117	[Attributor][FIX] Do not ignore memory writes in AAMemoryBehavior Even if we look for `nocapture` we need to bail on escaping pointers. The crucial thing is that we might not look at a big enough scope when we derive the memory behavior. Thus, it might be `nocapture` in a larger context while it is "captured" in a smaller context.	2021-10-27 21:04:32 -05:00

1 2 3 4 5 ...

403089 Commits All Branches Search

403089 Commits

All Branches