llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjay Patel	75e8eb2b10	[InstCombine] update code/test comments; NFC Follow-up for post-commit suggestion on: `28afaed691` The comments were partly copied from the original code, but not updated to match the new code.	2021-09-11 10:53:53 -04:00
Nikita Popov	f5806830e0	[ARM] Support neon.vld auto-upgrade with opaque pointers This code manually constructs the intrinsic name, so we need to use p0 instead of p0i8 in opaque pointer mode.	2021-09-11 16:34:32 +02:00
Kazu Hirata	e030d31fda	[GlobalOpt] Use make_early_inc_range (NFC)	2021-09-11 07:23:22 -07:00
Sanjay Patel	28afaed691	[InstCombine] fold sub of min/max intrinsics with invertible ops This is a translation of the existing code to handle the intrinsics and another step towards D98152. https://alive2.llvm.org/ce/z/jA7eBC This pattern is already handled by underlying folds if there are less uses, so the minimal tests in this case have extra uses. The larger cmyk tests show the motivation - when combined with other folds, we invert a larger sequence and eliminate 'not' ops.	2021-09-11 09:18:46 -04:00
Muhammad Omair Javaid	c55e021de4	[LLDB] Skip TestDyldLaunchLinux.py on Arm/Linux TestDyldLaunchLinux.py has been recently added and is failing on LLDB Arm/Linux buildbot. I am marking it skip till I come back and look at it in more detail.	2021-09-11 16:18:22 +05:00
guopeilin	749ddd25e9	[BitcodeReader] Delay select until all constants resolved Like the shuffle, we should treat the select delayed so that all constants can be resolved. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D109053	2021-09-11 18:51:35 +08:00
Simon Pilgrim	df975e4590	[X86][SLM] Fix PSAD/MPSAD uops, latency and throughput Noticed while trying to improve generic reduction costs via the D103695 helper script. Confirmed with Intel AoM / Agner / InstLatX64.	2021-09-11 11:44:09 +01:00
Simon Pilgrim	484944ac3b	[X86][SLM] Fix HADD/HSUB uops, latency and throughput Noticed while trying to improve generic reduction costs via the D103695 helper script. Confirmed with Intel AoM / Agner / InstLatX64.	2021-09-11 11:44:09 +01:00
Simon Pilgrim	51d04e2268	[X86][SLM] Swap LoadLat and LoadUOps in the SLMWriteResPair<> helper. NFC. We set the LoadUOps argument a lot more frequently that LoadLat, by swapping them we can simplify a number of declarations.	2021-09-11 11:44:09 +01:00
Wang, Pengfei	2aaa6466fe	[X86] Support *_set1_pch(Float16 _Complex h) Reviewed By: LuoYuanke Differential Revision: https://reviews.llvm.org/D109487	2021-09-11 17:47:31 +08:00
LLVM GN Syncbot	6aacc69338	[gn build] Port `2269a941a4`	2021-09-11 09:02:52 +00:00
Lang Hames	2269a941a4	Revert `5629afea91` and `bb27e45643` while I look into bot failures. This reverts commit `5629afea91` ("[ORC] Add missing include."), and `bb27e45643` ("[ORC] Add SimpleRemoteEPC: ExecutorProcessControl over SPS + abstract transport."). The SimpleRemoteEPC patch currently assumes availability of threads, and needs to be rewritten with LLVM_ENABLE_THREADS guards.	2021-09-11 19:02:11 +10:00
LLVM GN Syncbot	26f1cab44a	[gn build] Port `bb27e45643`	2021-09-11 08:51:51 +00:00
Lang Hames	5629afea91	[ORC] Add missing include.	2021-09-11 18:51:16 +10:00
Lang Hames	bb27e45643	[ORC] Add SimpleRemoteEPC: ExecutorProcessControl over SPS + abstract transport. SimpleRemoteEPC is an ExecutorProcessControl implementation (with corresponding new server class) that uses ORC SimplePackedSerialization (SPS) to serialize and deserialize EPC-messages to/from byte-buffers. The byte-buffers are sent and received via a new SimpleRemoteEPCTransport interface that can be implemented to run SimpleRemoteEPC over whatever underlying transport system (IPC, RPC, network sockets, etc.) best suits your use case. The SimpleRemoteEPCServer class provides executor-side support. It uses a customizable SimpleRemoteEPCServer::Dispatcher object to dispatch wrapper function calls to prevent the RPC thread from being blocked (a problem in some earlier remote-JIT server implementations). Almost all functionality (beyond the bare basics needed to bootstrap) is implemented as wrapper functions to keep the implementation simple and uniform. Compared to previous remote JIT utilities (OrcRemoteTarget, OrcRPCExecutorProcessControl), more consideration has been given to disconnection and error handling behavior: Graceful disconnection is now always initiated by the ORC side of the connection, and failure at either end (or in the transport) will result in Errors being delivered to both ends to enable controlled tear-down of the JIT and Executor (in the Executor's case this means "as controlled as the JIT'd code allows"). The introduction of SimpleRemoteEPC will allow us to remove other remote-JIT support from ORC (including the legacy OrcRemoteTarget code used by lli, and the OrcRPCExecutorProcessControl and OrcRPCEPCServer classes), and then remove ORC RPC itself. The llvm-jitlink and llvm-jitlink-executor tools have been updated to use SimpleRemoteEPC over file descriptors. Future commits will move lli and other tools and example code to this system, and remove ORC RPC.	2021-09-11 18:16:38 +10:00
Matt Beardsley	c6bf8b8592	Fix python 2-vs-3 issues in add_new_check.py and rename_check.py As of this commit: https://github.com/llvm/llvm-project/commit/307b1fdd If either of those scripts are invoked with python 2, neither works due to: "TypeError: write() argument 1 must be unicode, not str" And if rename_check.py is invoked with python 3: "ValueError: binary mode doesn't take an encoding argument" (referring to `with io.open(filename, 'wb', encoding='utf8') as f:`), and Another issue in rename_check.py in python 2: "TypeError: list object is not an iterator" (referring to `next(filter( ... os.listdir(old_module_path)))`) (so, rename_check doesn't work with either 2 or 3, and add_new_check doesn't work with 2, but does work with 3) I ran these steps to test both python versions: (manually - appears to be the "status quo" for these files) python3 clang-tools-extra/clang-tidy/add_new_check.py readability ggggg python3 clang-tools-extra/clang-tidy/rename_check.py readability-ggggg readability-hhhhh git checkout HEAD -- clang-tools-extra/clang-tidy/readability/CMakeLists.txt clang-tools-extra/clang-tidy/readability/ReadabilityTidyModule.cpp clang-tools-extra/docs/ReleaseNotes.rst clang-tools-extra/docs/clang-tidy/checks/list.rst rm -f clang-tools-extra/clang-tidy/readability/GggggCheck.cpp clang-tools-extra/clang-tidy/readability/GggggCheck.h clang-tools-extra/docs/clang-tidy/checks/readability-ggggg.rst clang-tools-extra/test/clang-tidy/checkers/readability-ggggg.cpp clang-tools-extra/clang-tidy/readability/HhhhhCheck.cpp clang-tools-extra/clang-tidy/readability/HhhhhCheck.h clang-tools-extra/docs/clang-tidy/checks/readability-hhhhh.rst python2 clang-tools-extra/clang-tidy/add_new_check.py readability ggggg python2 clang-tools-extra/clang-tidy/rename_check.py readability-ggggg readability-hhhhh git checkout HEAD -- clang-tools-extra/clang-tidy/readability/CMakeLists.txt clang-tools-extra/clang-tidy/readability/ReadabilityTidyModule.cpp clang-tools-extra/docs/ReleaseNotes.rst clang-tools-extra/docs/clang-tidy/checks/list.rst rm -f clang-tools-extra/clang-tidy/readability/GggggCheck.cpp clang-tools-extra/clang-tidy/readability/GggggCheck.h clang-tools-extra/docs/clang-tidy/checks/readability-ggggg.rst clang-tools-extra/test/clang-tidy/checkers/readability-ggggg.cpp clang-tools-extra/clang-tidy/readability/HhhhhCheck.cpp clang-tools-extra/clang-tidy/readability/HhhhhCheck.h clang-tools-extra/docs/clang-tidy/checks/readability-hhhhh.rst Reviewed By: kbobyrev Differential Revision: https://reviews.llvm.org/D109127	2021-09-11 09:52:50 +02:00
Jessica Paquette	4e408aae2c	[AArch64][GlobalISel] Select full-fp16 s16 G_FCONSTANT as a constant pool load When we have full-fp16 support, we should (manually select) s16 G_FCONSTANT to a constant pool load. Add support for that to `emitLoadFromConstantPool` + the existing constant selection code. Also tidy up the constant selection code a little. There were some out-of-date comments + some dead code. Differential Revision: https://reviews.llvm.org/D108957	2021-09-10 19:36:34 -07:00
Keith Smiley	e972e49b11	[llvm-cov] Add error for invalid -path-equivalence format Differential Revision: https://reviews.llvm.org/D109042	2021-09-10 18:34:37 -07:00
Lang Hames	6c56b13331	[JITLink] Working memory shouldn't be subject to alignment constraints. Refactors copyBlockContentToWorkingMemory to use offsets rather than direct pointers to working memory. This simplifies the problem of maintaining alignments between blocks in working memory, without requiring the working memory itself to be aligned.	2021-09-11 11:26:38 +10:00
Lang Hames	3828ab086a	[ORC] Fix missing newline in debugging output.	2021-09-11 11:24:01 +10:00
Lang Hames	a0a91ed3dd	[ORC] Merge LLVMSPSSerializers.h into SimplePackedSerialization.h. Since the ORC runtime and LLVM are no longer sharing SPS code (the ORC runtime has its own copy) there is no reason to keep these separate.	2021-09-11 11:24:00 +10:00
Lang Hames	22641f5853	[ORC] Use EPC for EPCGeneric MemoryAccess / JITLinkMemoryManager construction. This allows these classes to be created during EPC construction, before an ExecutionSession is available.	2021-09-11 11:24:00 +10:00
Usman Nadeem	ab111e982f	Revert "Revert "[AArch64][SVE][InstCombine] Canonicalize aarch64_sve_dup_x intrinsic to IR splat operation"" This reverts commit `eee7d225de`. Effectively relanding `98c37247d8` after fixing the failing tests. Change-Id: I5d7461aeb820a2d5f1895457d824a8de4d316ee5	2021-09-10 18:11:24 -07:00
Eric Christopher	2d26a72f82	nullptr initialize variables, spotted on msan bots.	2021-09-10 18:10:53 -07:00
Keith Smiley	f7062f6dfb	[docs] Improve description of LLVM_BUILD_TESTS This makes it clear that this only has an effect if you use the all build target. Differential Revision: https://reviews.llvm.org/D109113	2021-09-10 18:01:38 -07:00
Jason Molenda	89ed21a8f8	Recognize namespaced all_image_infos symbol name from dyld In macOS 12, the symbol name for the dyld_all_image_infos struct in dyld has a namespace qualifier. Search for it without qualification, then with qualification when doing a by-name search. (lldb will only search for it by name when loading a user process Mach-O corefile) rdar://76270013	2021-09-10 16:56:48 -07:00
owenca	3205dd3d59	[clang-format] Restrict the special handling for K&R C to C/C++ Commits `58494c856a`, `f6bc614546`, and `0fc27ef196` added special handlings for K&R C function definitions and caused some JavaScript/TypeScript regressions which were addressed in D107267, D108538, and D108620. This patch would have prevented these known regressions and will fix any unknown ones. Differential Revision: https://reviews.llvm.org/D109582	2021-09-10 15:51:35 -07:00
Joseph Huber	29b44ca896	[OpenMP] Add flag for setting debug in the offloading device This patch introduces the flags `-fopenmp-target-debug` and `-fopenmp-target-debug=` to set the value of a global in the device. This will be used to enable or disable debugging features statically in the device runtime library. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D109544	2021-09-10 18:19:19 -04:00
Joseph Huber	7eb899cbcd	[OpenMP] Add more verbose remarks for runtime folding We peform runtime folding, but do not currently emit remarks when it is performed. This is because it comes from the runtime library and is beyond the users control. However, people may still wish to view this and similar information easily, so we can enable this behaviour using a special flag to enable verbose remarks. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D109627	2021-09-10 17:36:06 -04:00
Alex Langford	8dae35527f	[lldb] Remove unused typedefs from lldb-forward.h	2021-09-10 14:16:47 -07:00
Ye Luo	2187cbf56f	[OpenMP][libomptarget] Add __tgt_target_return_t enum for __tgt_target_XXX return int The defintion of OFFLOAD_SUCCESS and OFFLOAD_FAIL used in plugin APIs and libomptarget public APIs are not consistent. Create __tgt_target_return_t for libomptarget public APIs. Differential Revision: https://reviews.llvm.org/D109304	2021-09-10 16:11:08 -05:00
Johannes Doerfert	99ea8ac9f1	Reapply "[OpenMP] Group side-effects to improve guarding efficiency" This reapplies `ca134c3963`, effectively reverting commit `d2f206e0af`. Minor test changes to make the test pass.	2021-09-10 15:22:57 -05:00
Johannes Doerfert	c09fbbdcfb	Reapply "[GlobalOpt][FIX] Do not embed initializers into AS!=0 globals"" This reapplies commit `7dbba3376f`, or, put differently, this reverts commit `d9a8d20827`. The test now requires the amdgpu and nvptx backend explicitly as it won't work without properly.	2021-09-10 15:22:56 -05:00
Rob Suderman	b0532286fe	[mlir][tosa] Add shape inference for tosa.while Tosa.while shape inference requires repeatedly running shape inference across the body of the loop until the types become static as we do not know the number of iterations required by the loop body. Once the least specific arguments are known they are propagated to both regions. To determine the final end type, the least restrictive types are determined from all yields. Differential Revision: https://reviews.llvm.org/D108801	2021-09-10 13:11:53 -07:00
Mark Schimmel	7c82db3634	[ARC] Improve code generated for i32 ADDC/ADDE and SUBC/SUBE This change improves the code generated for long long addition and subtraction Differential Revision: https://reviews.llvm.org/D109615	2021-09-10 13:04:08 -07:00
Usman Nadeem	eee7d225de	Revert "[AArch64][SVE][InstCombine] Canonicalize aarch64_sve_dup_x intrinsic to IR splat operation" This reverts commit `98c37247d8`.	2021-09-10 13:01:48 -07:00
Usman Nadeem	98c37247d8	[AArch64][SVE][InstCombine] Canonicalize aarch64_sve_dup_x intrinsic to IR splat operation Differential Revision: https://reviews.llvm.org/D109118 Change-Id: I47adc1984a54bea02bf5a0a767b765afe7e16aa3	2021-09-10 12:52:14 -07:00
Jan Svoboda	285ec53730	[clang][deps] Move tests to the Clang subdirectory	2021-09-10 21:36:49 +02:00
Sanjay Patel	188375f478	[InstCombine] add tests for sub of min/max intrinsics; NFC	2021-09-10 14:53:05 -04:00
Joseph Huber	9e2fc0ba37	[OpenMP] Check OpenMP assumptions on call-sites as well This patch adds functionality to check assumption attributes on call sites as well. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D109376	2021-09-10 14:52:47 -04:00
Joseph Huber	f28e710db7	[OpenMP] Make CUDA math library functions SPMD amenable This patch adds the SPMD amenable assumption to the CUDA math library defintions in Clang. Previously these functions would block SPMD execution on the device because they're intrinsic calls into the library and can't be calculated. These functions don't have side-effects so they are safe to execute in SPMD mode. Depends on D105937 Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D108958	2021-09-10 14:52:45 -04:00
Siva Chandra Reddy	0da5ac1a75	[libc] Add extension functions fedisableexcept, feenableexcept and fegetexcept. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D109613	2021-09-10 18:44:53 +00:00
Florian Mayer	09391e7e50	[hwasan] Do not instrument accesses to uninteresting allocas. This leads to a statistically significant improvement when using -hwasan-instrument-stack=0: https://bit.ly/3AZUIKI. When enabling stack instrumentation, the data appears gets better but not statistically significantly so. This is consistent with the very moderate improvements I have seen for stack safety otherwise, so I expect it to improve when the underlying issue of that is resolved. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D108457	2021-09-10 19:28:28 +01:00
David Carlier	8fdd821aef	[Sanitizers] intercept netent, protoent and mincore on FreeBSD. netent on Linux in addition as well. Reviewd By: vitalybuka Differential Revision: https://reviews.llvm.org/D109287	2021-09-10 19:24:51 +01:00
Florian Mayer	57335b6e2e	[stack-safety] Allow to determine safe accesses. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D109503	2021-09-10 19:23:54 +01:00
Nico Weber	23f256f2b1	[clang] Fix typo in test from `a723310b4` We want the driver-level flag here, else the test passes for the wrong reasons. See comments on https://reviews.llvm.org/D99901.	2021-09-10 14:16:45 -04:00
Kazu Hirata	c9fca53af1	[CodeGen, Target] Use pred_empty and succ_empty (NFC)	2021-09-10 11:11:31 -07:00
Rumeet Dhindsa	03df971012	[lldb] Add support for debugging via the dynamic linker. This patch adds support for shared library load when the executable is called through ld.so. Differential Revision:https://reviews.llvm.org/D108061	2021-09-10 10:59:31 -07:00
Roman Lebedev	f3c2094d8c	[clang] `aligned_alloc` allocation function specifies alignment in first arg, manifest that knowledge Mainly, if a constant value was passed as an alignment, then we correctly annotate the alignment of the returned value of @aligned_alloc. And if it wasn't constant, then we also don't loose that, but emit an assumption.	2021-09-10 20:49:29 +03:00
Roman Lebedev	85ba583eba	[NFCI][clang] Move allocation alignment manifestation for malloc-like into Sema from Codegen ... so that it happens right next to `AddKnownFunctionAttributesForReplaceableGlobalAllocationFunction()`, which is good for consistency.	2021-09-10 20:49:28 +03:00

1 2 3 4 5 ...

398764 Commits All Branches Search

398764 Commits

All Branches