llvm-project

Commit Graph

Author	SHA1	Message	Date
Guillaume Chatelet	87065c0d24	[libc] add benchmarks for memcmp and bzero Differential Revision: https://reviews.llvm.org/D104511	2021-06-23 14:19:40 +00:00
Jinsong Ji	c125af82a5	[DAGCombine] Check reassoc flags in aggressive fsub fusion This is from discussion in https://reviews.llvm.org/D104247#inline-993387 The contract and reassoc flags shouldn't imply each other. All the aggressive fsub fusion reassociate operations, we should guard them with reassoc flag check. Reviewed By: mcberg2017 Differential Revision: https://reviews.llvm.org/D104723	2021-06-23 13:59:40 +00:00
Joel E. Denny	9fa5e3280d	[OpenMP] Fix delete map type in ref count debug messages For example, without this patch: ``` $ cat test.c int main() { int x; #pragma omp target enter data map(alloc: x) #pragma omp target enter data map(alloc: x) #pragma omp target enter data map(alloc: x) #pragma omp target exit data map(delete: x) ; return 0; } $ clang -fopenmp -fopenmp-targets=nvptx64-nvidia-cuda test.c $ LIBOMPTARGET_DEBUG=1 ./a.out |& grep 'Creating\|Mapping exists\|last' Libomptarget --> Creating new map entry with HstPtrBegin=0x00007ffddf1eaea8, TgtPtrBegin=0x00000000013bb040, Size=4, RefCount=1, Name=unknown Libomptarget --> Mapping exists with HstPtrBegin=0x00007ffddf1eaea8, TgtPtrBegin=0x00000000013bb040, Size=4, RefCount=2 (incremented), Name=unknown Libomptarget --> Mapping exists with HstPtrBegin=0x00007ffddf1eaea8, TgtPtrBegin=0x00000000013bb040, Size=4, RefCount=3 (incremented), Name=unknown Libomptarget --> Mapping exists with HstPtrBegin=0x00007ffddf1eaea8, TgtPtrBegin=0x00000000013bb040, Size=4, RefCount=2 (decremented) Libomptarget --> There are 4 bytes allocated at target address 0x00000000013bb040 - is not last ``` `RefCount` is reported as decremented to 2, but it ought to be reset because of the `delete` map type, and `is not last` is incorrect. This patch migrates the reset of reference counts from `DeviceTy::deallocTgtPtr` to `DeviceTy::getTgtPtrBegin`, which then correctly reports the reset. Based on the `IsLast` result from `DeviceTy::getTgtPtrBegin`, `targetDataEnd` then correctly reports `is last` for any deletion. `DeviceTy::deallocTgtPtr` is responsible only for the final reference count decrement and mapping removal. An obscure side effect of this patch is that a `delete` map type when the reference count is infinite yields `DelEntry=IsLast=false` in `targetDataEnd` and so no longer results in a `DeviceTy::deallocTgtPtr` call. Without this patch, that call is a no-op anyway besides some unnecessary locking and mapping table lookups. Reviewed By: grokos Differential Revision: https://reviews.llvm.org/D104560	2021-06-23 09:57:19 -04:00
Joel E. Denny	48421ac441	[OpenMP] Improve ref count debug messages For example, without this patch: ``` $ cat test.c int main() { int x; #pragma omp target enter data map(alloc: x) #pragma omp target exit data map(release: x) ; return 0; } $ clang -fopenmp -fopenmp-targets=nvptx64-nvidia-cuda test.c $ LIBOMPTARGET_DEBUG=1 ./a.out |& grep 'Creating\|Mapping exists' Libomptarget --> Creating new map entry with HstPtrBegin=0x00007ffcace8e448, TgtPtrBegin=0x00007f12ef600000, Size=4, Name=unknown Libomptarget --> Mapping exists with HstPtrBegin=0x00007ffcace8e448, TgtPtrBegin=0x00007f12ef600000, Size=4, updated RefCount=1 ``` There are two problems in this example: * `RefCount` is not reported when a mapping is created, but it might be 1 or infinite. In this case, because it's created by `omp target enter data`, it's 1. Seeing that would make later `RefCount` messages easier to understand. * `RefCount` is still 1 at the `omp target exit data`, but it's reported as `updated`. The reason it's still 1 is that, upon deletions, the reference count is generally not updated in `DeviceTy::getTgtPtrBegin`, where the report is produced. Instead, it's zeroed later in `DeviceTy::deallocTgtPtr`, where it's actually removed from the mapping table. This patch makes the following changes: * Report the reference count when creating a mapping. * Where an existing mapping is reported, always report a reference count action: * `update suppressed` when `UpdateRefCount=false` * `incremented` * `decremented` * `deferred final decrement`, which replaces the misleading `updated` in the above example * Add comments to `DeviceTy::getTgtPtrBegin` to explain why it does not zero the reference count. (Please advise if these comments miss the point.) * For unified shared memory, don't report confusing messages like `RefCount=` or `RefCount= updated` given that reference counts are irrelevant in this case. Instead, just report `for unified shared memory`. * Use `INFO` not `DP` consistently for `Mapping exists` messages. * Fix device table dumps to print `INF` instead of `-1` for an infinite reference count. Reviewed By: jhuber6, grokos Differential Revision: https://reviews.llvm.org/D104559	2021-06-23 09:57:19 -04:00
Louis Dionne	0c0628c92c	[libc++] Remove ad-hoc modules tests that are now unnecessary Since we now have modules-enabled CI, it is now redundant to have ad-hoc tests that check arbitrary things about our modules support. Instead, the whole test suite should pass with modules enabled, period. This patch also removes the module cache path workaround: one would expect that modules work properly without that workaround. If that isn't the case and we do run into flaky test failures, we can re-enable the workaround temporarily (but that would be very vexing and we should fix Clang ASAP if that's the case). Differential Revision: https://reviews.llvm.org/D104746	2021-06-23 09:42:56 -04:00
Roman Lebedev	707224ea16	[NFC] Update arm_function_name.ll after `4de0c40031`	2021-06-23 16:41:43 +03:00
serge-sans-paille	a0d05ed848	Handle interactions between reserved identifier and user-defined suffixes According to https://eel.is/c++draft/over.literal > double operator""_Bq(long double); // OK: does not use the reserved identifier _Bq ([lex.name]) > double operator"" _Bq(long double); // ill-formed, no diagnostic required: uses the reserved identifier _Bq ([lex.name]) Obey that rule by keeping track of the operator literal name status wrt. leading whitespace. Fix: https://bugs.llvm.org/show_bug.cgi?id=50644 Differential Revision: https://reviews.llvm.org/D104299	2021-06-23 15:38:42 +02:00
Jay Foad	a16cb95a3a	[AMDGPU] Remove unused multiclass MUBUF_Real_gfx10_with_name	2021-06-23 14:37:28 +01:00
Roman Lebedev	eb7ce97870	[NFC][ARM] Fix update_llc_test_checks for thumbv7-apple-darwin, autogenerate thumb2-ifcvt1.ll	2021-06-23 16:31:19 +03:00
Roman Lebedev	b77972ac4f	[NFC][AArch64] Autogenerate a few more tests	2021-06-23 16:31:19 +03:00
Roman Lebedev	3c94869632	[NFC][ARM] Fix update_llc_test_checks for aarch64-apple-ios/thumbv7s-apple-darwin, autogenerate a few tests	2021-06-23 16:31:19 +03:00
Roman Lebedev	15be15073e	[NFC][ARM] Fix update_llc_test_checks for thumbv7-apple-ios, autogenerate switch-minsize.ll	2021-06-23 16:31:19 +03:00
Roman Lebedev	4de0c40031	[NFC][ARM] Fix update_llc_test_checks for armv7-apple-ios, autogenerate ifcvt5.ll/ifcvt6.ll	2021-06-23 16:31:19 +03:00
Nikita Popov	8c01deb8e6	[ARMParallelDSP] Remove unnecessary wrapper function (NFC) AreSequentialAccesses() forwards directly to isConsecutiveAccess() and has an unnecessary template parameter to boot.	2021-06-23 15:27:54 +02:00
David Spickett	fe63db25bc	[lldb] Remove asserts in CommandReturnObject SetError and AppendError I added asserts to these in https://reviews.llvm.org/D104525. They are available (directly or otherwise) via the API so we should not assert. Restore the previous behaviour. If the message is empty, we return early before printing anything. For SetError don't assert that the error is a failure. The remaining assert is in AppendRawError which is not part of the API. Reviewed By: teemperor Differential Revision: https://reviews.llvm.org/D104778	2021-06-23 13:11:14 +00:00
Raphael Isemann	8a5165b3b9	[lldb][NFC] Remove some redundant semicolons on HostInfoMacOSX	2021-06-23 15:06:12 +02:00
Rosie Sumpter	12cb8ca668	[AArch64] Add CodeGen tests for vector reduction intrinsics. NFC Tests are added for vector reduce OR, AND and XOR. Differential Revision: https://reviews.llvm.org/D104771	2021-06-23 13:46:16 +01:00
owenca	ca7f471585	[clang-format] Fix a bug that indents else-comment-if incorrectly PR50809 Differential Revision: https://reviews.llvm.org/D104774	2021-06-23 04:57:45 -07:00
Zarko Todorovski	76c931ae42	[AIX][PowerPC] Remove error when specifying mabi=vec-default on AIX The default Altivec ABI was implemented but the clang error for specifying its use still remains. Users could get around this by not specifying the type of Altivec ABI, but we need to remove the error. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D102094	2021-06-23 07:40:38 -04:00
Roman Lebedev	ff4b1d379f	[NFCI-ish][SimplifyCFGPass] Rework and generalize `ret` block tail-merging This changes the approach taken to tail-merge the blocks to always create a new block instead of trying to reuse some block, and generalizes it to support dealing not with just the `ret` in the future. This effectively lifts the CallBr restriction, although this isn't really intentional. That is the only non-NFC change here, i'm not sure if it's reasonable/feasible to temporarily retain it. Other restrictions of the transform remain. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D104598	2021-06-23 14:33:18 +03:00
Hans Wennborg	24037c37b6	Add support for #pragma system_header with -fms-extensions Clang already supports the pragma prefixed by "GCC" or "clang". MSVC has more recently added support for the pragma, but without any prefix; see https://devblogs.microsoft.com/cppblog/broken-warnings-theory/#external-headers Differential revision: https://reviews.llvm.org/D104770	2021-06-23 13:26:03 +02:00
Juneyoung Lee	5af8bacc94	[InstSimplify] Add more poison folding optimizations This adds more poison folding optimizations to InstSimplify. Since all binary operators propagate poison, these are fine. Also, the precondition of `select cond, undef, x` -> `x` is relaxed to allow the case when `x` is undef. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D104661	2021-06-23 20:25:24 +09:00
David Spickett	1b1c8e4a98	[lldb] Remove CommandReturnObject's SetError(StringRef) Replacing existing uses with AppendError. SetError is also part of the SBI API. This remains but instead of calling the underlying SetError it will call AppendError. Reviewed By: teemperor Differential Revision: https://reviews.llvm.org/D104768	2021-06-23 11:25:10 +00:00
Joe Ellis	3c4dbf6ea9	[Verifier] Fail on overrunning and invalid indices for {insert,extract} vector intrinsics With regards to overrunning, the langref (llvm/docs/LangRef.rst) specifies: (llvm.experimental.vector.insert) Elements ``idx`` through (``idx`` + num_elements(``subvec``) - 1) must be valid ``vec`` indices. If this condition cannot be determined statically but is false at runtime, then the result vector is undefined. (llvm.experimental.vector.extract) Elements ``idx`` through (``idx`` + num_elements(result_type) - 1) must be valid vector indices. If this condition cannot be determined statically but is false at runtime, then the result vector is undefined. For the non-mixed cases (e.g. inserting/extracting a scalable into/from another scalable, or inserting/extracting a fixed into/from another fixed), it is possible to statically check whether or not the above conditions are met. This was previously missing from the verifier, and if the conditions were found to be false, the result of the insertion/extraction would be replaced with an undef. With regards to invalid indices, the langref (llvm/docs/LangRef.rst) specifies: (llvm.experimental.vector.insert) ``idx`` represents the starting element number at which ``subvec`` will be inserted. ``idx`` must be a constant multiple of ``subvec``'s known minimum vector length. (llvm.experimental.vector.extract) The ``idx`` specifies the starting element number within ``vec`` from which a subvector is extracted. ``idx`` must be a constant multiple of the known-minimum vector length of the result type. Similarly, these conditions were not previously enforced in the verifier. In some circumstances, invalid indices were permitted silently, and in other circumstances, an undef was spawned where a verifier error would have been preferred. This commit adds verifier checks to enforce the constraints above. Differential Revision: https://reviews.llvm.org/D104468	2021-06-23 10:33:22 +00:00
Nikita Popov	cfb1cb4491	[TTI] Make assertion compatible with opaque pointers Dropping the TODO here because it applies to all uses of this method.	2021-06-23 12:21:54 +02:00
Nikita Popov	3ee6f1a4fa	[LLParser] Remove special handling for call address space Spin-off from D104740: I don't think this special handling is needed anymore. Calls in textual IR are annotated with addrspace(N) (which defaults to the program address space from data layout) and specifies the expected pointer address space of the callee. There is no need to special-case the program address space on top of that, as it already is the default expected address space, and we shouldn't allow use of the program address space if the call was explicitly annotated with some other address space. The IsCall parameter is retained because it will be used again soon. Differential Revision: https://reviews.llvm.org/D104752	2021-06-23 12:07:44 +02:00
Nicolas Vasilache	f0d43a29e3	[mlir][LLVMIR] Fold ExtractValueOp coming from InsertValueOp Differential Revision: https://reviews.llvm.org/D104769	2021-06-23 10:04:24 +00:00
Jay Foad	dfb8c08739	[AMDGPU] Stop using LegacyLegalizerInfo. NFCI. Differential Revision: https://reviews.llvm.org/D103684	2021-06-23 10:50:32 +01:00
Jay Foad	157473a58f	[IR] Simplify createReplacementInstr NFCI, although the test change shows that ConstantExpr::getAsInstruction is better than the old implementation of createReplacementInstr because it propagates things like the sdiv "exact" flag. Differential Revision: https://reviews.llvm.org/D104124	2021-06-23 10:47:43 +01:00
Tobias Gysi	f1844f15c1	[mlir][linalg] Change the FillOp library call signature. Adapt the FillOp library call signature to the updated operand order introduced in https://reviews.llvm.org/D10412. The patch reverts the special treatment of FillOp in LinalgToStandard. Differential Revision: https://reviews.llvm.org/D104360	2021-06-23 09:37:14 +00:00
Florian Hahn	aa58fdb396	[llvm] Update tests that got missed in `adee485adf`.	2021-06-23 10:29:58 +01:00
Florian Hahn	adee485adf	[SCEV] Support signed predicates in applyLoopGuards. This adds handling for signed predicates, similar to how unsigned predicates are already handled. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D104732	2021-06-23 10:21:05 +01:00
Florian Hahn	5ab96fa16b	[SCEV] Add tests with single-cond range check generated by InstComb.	2021-06-23 10:16:57 +01:00
Jay Foad	c65f3f562b	[AMDGPU] Simplify collectReachableCallees. NFCI. Don't use SCC iterators when we're only interested in reachability. Use df_begin/df_end inline to find reachable nodes. Differential Revision: https://reviews.llvm.org/D104704	2021-06-23 09:11:29 +01:00
Tobias Gysi	7cef24ee83	[mlir][linalg] Adapt the FillOp builder signature. Change the build operand order from output, value to value, output. The patch makes the argument order consistent with the pretty printed order updated by https://reviews.llvm.org/D104356. Differential Revision: https://reviews.llvm.org/D104359	2021-06-23 08:06:43 +00:00
Stanislav Mekhanoshin	2b43209ee3	[AMDGPU] Propagate LDS align into to instructions Differential Revision: https://reviews.llvm.org/D104316	2021-06-23 00:57:16 -07:00
Martin Storsjö	f1a18fb699	[LLD] [MinGW] Silence the printouts in one test. NFC. This particular linker invocation is only run to check that we accept options, but we don't inspect the generated command line. As all other commands in the file have their output piped to FileCheck, the lit test doesn't print any other output; therefore silence this one for consistency as well.	2021-06-23 10:44:01 +03:00
Fangrui Song	011b502ce8	[llvm-objcopy][MachO] Fix namespace style issues	2021-06-23 00:31:52 -07:00
Martin Storsjö	fdf54f5c50	[LLD] [MinGW] Print the lld-link command to stderr This is consistent with how clang prints its internal commands with -### and -v. When linking with -verbose, we get log messages from the actual linking written to stderr. By printing the command to the same stream, we make sure they appear in a sensible chronological order. Differential Revision: https://reviews.llvm.org/D104527	2021-06-23 10:21:42 +03:00
Tobias Gysi	a21a6f51bc	[mlir][linalg] Change the pretty printed FillOp operand order. The patch changes the pretty printed FillOp operand order from output, value to value, output. The change is a follow up to https://reviews.llvm.org/D104121 that passes the fill value using a scalar input instead of the former capture semantics. Differential Revision: https://reviews.llvm.org/D104356	2021-06-23 07:03:00 +00:00
Vinayaka Bandishti	a873b6d466	[MLIR] Generalize detecting mods during slice computing During slice computation of affine loop fusion, detect one id as the mod of another id w.r.t a constant in a more generic way. Restrictions on coefficients of the ids are removed. Also, information from the previously calculated ids is used for simplification of affine expressions, e.g., If `id1` = `id2`, `id_n - divisor * id_q - id_r + id1 - id2 = 0`, is simplified to: `id_n - divisor * id_q - id_r = 0`. If `c` is a non-zero integer, `c*id_n - c*divisor * id_q - c*id_r = 0`, is simplified to: `id_n - divisor * id_q - id_r = 0`. Reviewed By: bondhugula, ayzhuang Differential Revision: https://reviews.llvm.org/D104614	2021-06-23 12:29:34 +05:30
Vinayaka Bandishti	0e55112242	[NFC][PDL] Fix documentation typo, redundant test Correct a documentation typo, and delete a duplicate test in `pdl-to-pdl-interp-rewriter.mlir`. Reviewed By: pr4tgpt, bondhugula, rriddle Differential Revision: https://reviews.llvm.org/D104688	2021-06-23 12:27:12 +05:30
Martin Storsjö	1cb7849a55	Revert "[AArch64LoadStoreOptimizer] Recommit: Generate more STPs by renaming registers earlier" This reverts commit `ea011ec5ed`. This still causes some miscompiles, I'll follow up in the phabricator review with a sample of that issue (which is part of the sample of the previous issue).	2021-06-23 09:54:16 +03:00
Igor Kudrin	36111f28ed	[TableGen] Fix printing second PC-relative operand If an instruction has several operands and a PC-relative one is not the first of them, the generator may produce the code that does not pass the 'Address' parameter to the printout method. For example, for an Arm instruction 'LE LR, $imm', it reuses the same code as for other instructions where the second operand is not PC-relative: void ARMInstPrinter::printInstruction(...) { ... case 11: // BF16VDOTI_VDOTD, BF16VDOTI_VDOTQ, BF16VDOTS_VDOTD, ... printOperand(MI, 1, STI, O); O << ", "; printOperand(MI, 2, STI, O); break; ... The patch fixes that by considering 'PCRel' when comparing 'AsmWriterOperand' values. Differential Revision: https://reviews.llvm.org/D104698	2021-06-23 13:27:37 +07:00
Min-Yih Hsu	dfafd56daa	[M68k] Fix incorrect #include-ed file in M68kSubtarget In https://reviews.llvm.org/rG2193347e72fa , a cpp file is accidentally included instead of its header file counterpart. This patch fixes this error.	2021-06-22 23:02:21 -07:00
Jim Lin	0365af1a87	[M68k] Add testcases for shift and rotate instructions Add codegen testcases for lsl, lsr, asr, rol and ror instructions. Reviewed By: myhsu Differential Revision: https://reviews.llvm.org/D104685	2021-06-23 13:26:58 +08:00
Jim Lin	5cb5225cf5	[M68k] Refactor codegen patterns for logic operations and add tests for it Refactor pat for and, or and xor operation and add missing tests for it Reviewed By: myhsu Differential Revision: https://reviews.llvm.org/D104626	2021-06-23 13:25:24 +08:00
Max Kazantsev	842b4c83cb	[LoopDeletion] Exploit undef Phi inputs when symbolically executing 1st iteration Follow-up on Roman's idea expressed in D103959. - If a Phi has undefined inputs from live blocks: - and no other inputs, assume it is undef itself; - and exactly one non-undef input, we can assume that all undefs are equal to this input. Differential Revision: https://reviews.llvm.org/D104618 Reviewed By: lebedev.ri, nikic	2021-06-23 11:53:48 +07:00
Zequan Wu	f681fd927e	Revert "[CodeGen] Don't create fake FunctionDecls when generating block/byref" That commit causes crash with error "!dbg attachment points at wrong subprogram for function" on iOS platforms. This reverts commit `f4c06bcb67`.	2021-06-22 21:48:00 -07:00
Max Kazantsev	976926e8ee	[Test] Clear out br i1 undef from tests to avoid UB We don't want to test possible unexpected impact of such branches. Replacing them with regular conditions. Idea by Nikita Popov.	2021-06-23 11:33:57 +07:00
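
The static index constraints quoted in the [Verifier] commit (3c4dbf6ea9) in the table above can be illustrated with a small sketch. This is not the LLVM verifier code: `VecType` and `check_subvector_index` are hypothetical names made up for this illustration, and the sketch assumes a non-negative constant `idx`. It models only the two rules the commit message cites from the LangRef: `idx` must be a multiple of the subvector's known minimum length, and in the non-mixed cases (fixed into fixed, scalable into scalable) the subvector must not overrun the vector.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class VecType:
    """Hypothetical stand-in for a vector type (not an LLVM class)."""
    min_elements: int  # known minimum number of elements
    scalable: bool     # True for <vscale x N x ty>, False for <N x ty>


def check_subvector_index(vec: VecType, sub: VecType, idx: int) -> List[str]:
    """Return the violated constraints; an empty list means idx is acceptable.

    For an insert, `sub` is the subvector operand; for an extract, `sub`
    is the result type. Assumes idx is a non-negative constant.
    """
    errors = []
    # Rule 1: idx must be a multiple of the subvector's known minimum length.
    if idx % sub.min_elements != 0:
        errors.append("idx is not a multiple of the known minimum length")
    # Rule 2: overrun is only decidable statically when both types are fixed
    # or both are scalable (the "non-mixed" cases).
    if vec.scalable == sub.scalable and idx + sub.min_elements > vec.min_elements:
        errors.append("subvector overruns the vector")
    return errors
```

For example, with an 8-element fixed vector and a 4-element fixed subvector, `idx = 4` passes both checks, `idx = 8` overruns, and `idx = 2` fails the multiple-of-length rule. In the mixed case the overrun rule is skipped, since it cannot be decided statically.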

391839 Commits
