llvm-project

Commit Graph

Author	SHA1	Message	Date
Sanjay Patel	5d7d689edf	[InstCombine] fix propagation of FMF through select-of-fnegs The existing code was unquestionably wrong - it looked at one fneg and ignored the other 2 instructions. It was also untested, so it didn't make the list of bugs flagged by Alive2. This is an unusual propagation, but Alive2 agress that we can intersect the fnegs and union that with the select, then apply the results to both new instructions: https://alive2.llvm.org/ce/z/SF8_dt	2021-08-31 09:52:17 -04:00
LLVM GN Syncbot	22efb9d364	[gn build] Port `3285c7a436`	2021-08-31 13:47:38 +00:00
Xiang Xiao	3285c7a436	[libcxx] Remove the locale fallback for NuttX Since these functions can handled by NuttX's libc now Reviewed By: #libc, ldionne Differential Revision: https://reviews.llvm.org/D108895	2021-08-31 09:46:55 -04:00
Anton Afanasyev	aaae726afb	[SLPVectorizer][Test] Add test for extractelements with (non)const indices (NFC) Add test for an issue discussed here: https://reviews.llvm.org/D108703#2974289	2021-08-31 16:14:26 +03:00
Sanjay Patel	027de5c7d4	[InstCombine] add tests for FMF propagation for select-of-fneg; NFC	2021-08-31 09:12:03 -04:00
Sanjay Patel	d59ae12d58	[InstCombine] fix typo; NFC	2021-08-31 09:02:14 -04:00
Anton Afanasyev	077d4cb3ab	Revert "[SLP]No need to schedule/check parent for extract{element/value} instruction." Revert since introduced issure reported here: https://lists.llvm.org/pipermail/llvm-dev/2021-August/152411.html Discussed starting from here: https://reviews.llvm.org/D108703#2974289 This reverts commit `a36bc873a2`.	2021-08-31 15:29:06 +03:00
Michał Górny	8307869a22	[lldb] [gdb-remote client] Remove breakpoints in forked processes Remove software breakpoints from forked processes in order to restore the original program code before detaching it. Differential Revision: https://reviews.llvm.org/D100263	2021-08-31 13:41:35 +02:00
Andy Yankovsky	1f986f6057	Revert "[lldb] Add minidump save-core functionality to ELF object files" This reverts commit `aafa05e03d`. Broke builder on aarch64 -- https://lab.llvm.org/buildbot/#/builders/96/builds/10926	2021-08-31 13:36:53 +02:00
Simon Pilgrim	7ec7272b80	[MCA][X86] Add basic coverage for icelake arch Copy the skylake-avx512 tests for icelake-server coverage. Add icelake/rocketlake/tigerlake test coverage to the relevent generic tests as well.	2021-08-31 12:20:09 +01:00
Andrej Korman	aafa05e03d	[lldb] Add minidump save-core functionality to ELF object files This change adds save-core functionality into the ObjectFileELF that enables saving minidump of a stopped process. This change is mainly targeting Linux running on x86_64 machines. Minidump should contain basic information needed to examine state of threads, local variables and stack traces. Full support for other platforms is not so far implemented. API tests are using LLDB's MinidumpParser. Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D108233	2021-08-31 13:04:38 +02:00
Simon Pilgrim	9e2d14c285	[X86] Copy X86SchedSkylakeServer.td to X86SchedIceLake.td Icelake, Rocketlake and Tigerlake targets currently use the SkylakeServer scheduler model, despite being a later microarchitecture, leading to both reported bugs (PR48110) and discrepancies when comparing llvm-mca reports to other profiling tools (OSACA, uops, uica, etc.). And tbh I'm getting sick of llvm-mca getting blamed for what are backend scheduler model issues :-( This patch doesn't attempt to fix any of these discrepancies - there should be no changes in codegen - its a setup patch that copies the skx model, renames all the resources, adds the additional ports (but doesn't reference them yet) and updates the llvm-exegesis pfm counter mappings (based off https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/lib/events/intel_icl_events.h). This should make it trivial for anyone with hardware access to use llvm-exegesis reports to iteratively improve the model (my attempts to get hold of a cheap tiger lake box haven't been fruitful yet....). I will copy the SkylakeServer llvm-mca resource tests as follow up commits - the diff should entirely be the resource renames. Differential Revision: https://reviews.llvm.org/D108914	2021-08-31 11:57:20 +01:00
Douglas Yung	76a1a41530	Fix test by adding REQUIRES: x86-registered-target to skip test in configurations that do not include x86.	2021-08-31 03:29:17 -07:00
Tres Popp	44485fcd97	[mlir] Prevent assertion failure in DropUnitDims Don't assert fail on strided memrefs when dropping unit dims. Instead just leave them unchanged. Differential Revision: https://reviews.llvm.org/D108205	2021-08-31 12:15:13 +02:00
marina kolpakova a.k.a. geexie	0080d2aa55	[mlir][gpu] folds memref.dim of gpu.alloc implements canonicalization which folds memref.dim(gpu.alloc(%size), %idx) -> %size Differential Revision: https://reviews.llvm.org/D108892	2021-08-31 12:33:10 +03:00
Justas Janickas	f9bc1b3bee	[OpenCL] Defines helper function for kernel language compatible OpenCL version This change defines a helper function getOpenCLCompatibleVersion() inside LangOptions class. The function contains mapping between C++ for OpenCL versions and their corresponding compatible OpenCL versions. This mapping function should be updated each time a new C++ for OpenCL language version is introduced. The helper function is expected to simplify conditions on OpenCL C and C++ for OpenCL versions inside compiler code. Code refactoring performed. Differential Revision: https://reviews.llvm.org/D108693	2021-08-31 10:08:38 +01:00
Jason Molenda	c1184ca6eb	Use dSYM's file addr for Sections when it doesn't match binary When adding a dSYM to a Module and it has different file addresses from the already-present ObjectFile binary, change the Sections to use the dSYM's file addresses so the symbol table and DWARF are properly contained in the Sections. Previously this was only done for IsInMemory ObjectFiles, but it's more common than that. Differential Revision: https://reviews.llvm.org/D108889 rdar://81504400	2021-08-31 01:35:07 -07:00
Shivam Gupta	0d02aa6e43	[NFC] Correct typo in CodeGenMapTable.cpp, patch by Jordi CodeGenMapTable.cpp refers to TableGen as TabelGen in the comments. This appears to be a typo. This patch fixes the typo. Differential Revision: https://reviews.llvm.org/D76343	2021-08-31 13:04:33 +05:30
Simon Wallis	f417b660ee	[Arm] Add assert in T2 Imm7s code emitter Add assert to provoke failure in object file output, not just in disassembly output. Reviewed By: yroux Differential Revision: https://reviews.llvm.org/D107259	2021-08-31 08:16:48 +01:00
Shivam Gupta	e01ac501af	Fix typo in two files in Clang, patch by FusionBolt Reviewed By: xgupta Differential Revision: https://reviews.llvm.org/D98254	2021-08-31 12:33:37 +05:30
Shivam Gupta	4a6d8a11f8	[clang] Fix Typo in AST Matcher Reference In [[ https://clang.llvm.org/docs/LibASTMatchersReference.html \| AST Matcher Reference]], the example of matcher `hasDeclContext` contained a typo. `cxxRcordDecl` was changed to `cxxRecordDecl`. Differential Revision: https://reviews.llvm.org/D102836	2021-08-31 12:21:47 +05:30
Kai Luo	a594362436	[AIX] Rename shared_libraries_to_archive -> objects_to_archive. NFC.	2021-08-31 06:47:06 +00:00
Doug Beck	ed6cff667e	Fix typo s/beloinging/belonging Differential Revision: https://reviews.llvm.org/D107099	2021-08-31 12:01:50 +05:30
Alexander Pivovarov	eb946cc5b6	Fix typo in comments Reviewed By: MaskRay, jsji Differential Revision: https://reviews.llvm.org/D108857	2021-08-31 11:55:40 +05:30
Shivam Gupta	654e8d6c31	[LLDB][Docs] Move best-practices.txt contain to resources/test.rst This file contain some old reference to files those are now either renamed or replaced. Also this .txt file didn't generate to html during the sphnix documentation build so I send its contents to resources/test.rst file. Signed-off-by: Shivam Gupta <shivam98.tkg@gmail.com> Reviewed By: teemperor, mgorny, JDevlieghere Differential Revision: https://reviews.llvm.org/D108812	2021-08-31 11:45:24 +05:30
Shivam Gupta	3af9847a95	[LLDB][Docs] Convert some .txt files to .rst Upadate some .txt files to .rst for consistency as most of the documentation is written in reStructuredText format. Signed-off-by: Shivam Gupta <shivam98.tkg@gmail.com> Differential Revision: https://reviews.llvm.org/D108807	2021-08-31 11:45:24 +05:30
Shivam Gupta	8254f4afcb	[Docs][Phabricator] Mention how to create a draft revision https://llvm.org/docs/Phabricator.html have two links to Arcnist guide but none of them mention how to create a draft revision. It would create some less noise if developers create draft revisoin in this(--draft) way instead of [WIP] tag way. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D108970	2021-08-31 11:43:04 +05:30
Shivam Gupta	387a8dea72	[Docs] Remove subversion reference from MyFirstTypoFix.rst	2021-08-31 11:33:30 +05:30
David Blaikie	4f3a92ca0a	DebugInfo: Refactor/deduplicate various template argument list emission Streamline template arguments across types, variables, and functions - for convenient reuse in experiments related to template argument list reconstitution (not including template argument lists in the "name" of those entities, and leaving it to debug info consumers to rebuild the full template name from the semantic descriptions of the argument lists) But the change seems like a good refactoring/cleanup anyway. I'd certainly be open to suggestions about how this might be more streamlined - like is there no generic way to query template argument lists across the 3 kinds of entities, rather than needing special case code?	2021-08-30 22:39:46 -07:00
Stella Laurenzo	f05ff4f757	[mlir][python] Apply py::module_local() to all classes. * This allows multiple MLIR-API embedding downstreams to co-exist in the same process. * I believe this is the last thing needed to enable isolated embedding. Differential Revision: https://reviews.llvm.org/D108605	2021-08-30 22:18:43 -07:00
Heejin Ahn	3419e85b15	[WebAssembly] Free setjmpTable before exiting calls in EmSjLj This is an improvement over D107852. We don't need to enumerate specific function names; we can just check for `noreturn` attribute. This also requires us to make sure `__resumeExeption` and `emscripten_longjmp` have `noreturn` attribute too; one of them is a JS function and the other calls a JS function so Clang does not have a way to deduce they don't return. This is effectively NFC, because I'm not sure if there is an additional case this case covers; if we add a custom function call that has `noreturn` attribute, it will be processed within the SjLj handling and turned into `__invoke` call. So this really applies to some special functions like `emscripten_longjmp`. Reviewed By: dschuff Differential Revision: https://reviews.llvm.org/D108955	2021-08-30 21:46:25 -07:00
Heejin Ahn	b8fc71b7ae	[WebAssembly] Share rethrowing BBs in LowerEmscriptenEHSjLj There are three kinds of "rethrowing" BBs in this pass: 1. In Emscripten SjLj, after a possibly longjmping function call, we check if the thrown longjmp corresponds to one of setjmps within the current function. If not, we rethrow the longjmp by calling `emscripten_longjmp`. 2. In Emscripten EH, after a possibly throwing function call, we check if the thrown exception corresponds to the current `catch` clauses. If not, we rethrow the exception by calling `__resumeException`. 3. When both Emscripten EH and SjLj are used, when we check for an exception after a possibly throwing function call, it is possible that we get not an exception but a longjmp. In this case, we shouldn't swallow it; we should rethrow the longjmp by calling `emscripten_longjmp`. 4. When both Emscripten EH and SjLj are used, when we check for a longjmp after a possibly longjmping function call, it is possible that we get not a longjmp but an exception. In this case, we shouldn't swallot it; we should rethrow the exception by calling `__resumeException`. Case 1 is in Emscripten SjLj, 2 is in Emscripten EH, and 3 and 4 are relevant when both Emscripten EH and SjLj are used. 3 and 4 were first implemented in D106525. We create BBs for 1, 3, and 4 in this pass. We create those BBs for every throwing/longjmping function call, along with other BBs that contain condition checks. What this CL does is to create a single BB within a function for each of 1, 3, and 4 cases. These BBs are exiting BBs in the function and thus don't have successors, so easy to be shared between calls. The names of BBs created are: Case 1: `call.em.longjmp` Case 3: `rethrow.exn` Case 4: `rethrow.longjmp` For the case 2 we don't currently create BBs; we only replace the existing `resume` instruction with `call @__resumeException`. And Clang already creates only a single `resume` BB per function and reuses it, so we don't need to optimize this case. Not sure what are good benchmarks for EH/SjLj, but this decreases the size of the object file for `grfmt_jpeg.bc` (presumably from opencv) we got from one of our users by 8.9%. Even after running `wasm-opt -O4` on them, there is still 4.8% improvement. Reviewed By: dschuff Differential Revision: https://reviews.llvm.org/D108945	2021-08-30 21:44:34 -07:00
Hongtao Yu	b9db70369b	[CSSPGO] Split context string to deduplicate function name used in the context. Currently context strings contain a lot of duplicated function names and that significantly increase the profile size. This change split the context into a series of {name, offset, discriminator} tuples so function names used in the context can be replaced by the index into the name table and that significantly reduce the size consumed by context. A follow-up improvement made in the compiler and profiling tools is to avoid reconstructing full context strings which is time- and memory- consuming. Instead a context vector of `StringRef` is adopted to represent the full context in all scenarios. As a result, the previous prevalent profile map which was implemented as a `StringRef` is now engineered as an unordered map keyed by `SampleContext`. `SampleContext` is reshaped to using an `ArrayRef` to represent a full context for CS profile. For non-CS profile, it falls back to use `StringRef` to represent a contextless function name. Both the `ArrayRef` and `StringRef` objects are underpinned by real array and string objects that are stored in producer buffers. For compiler, they are maintained by the sample reader. For llvm-profgen, they are maintained in `ProfiledBinary` and `ProfileGenerator`. Full context strings can be generated only in those cases of debugging and printing. When it comes to profile format, nothing has changed to the text format, though internally CS context is implemented as a vector. Extbinary format is only changed for CS profile, with an additional `SecCSNameTable` section which stores all full contexts logically in the form of `vector<int>`, which each element as an offset points to `SecNameTable`. All occurrences of contexts elsewhere are redirected to using the offset of `SecCSNameTable`. Testing This is no-diff change in terms of code quality and profile content (for text profile). For our internal large service (aka ads), the profile generation is cut to half, with a 20x smaller string-based extbinary format generated. The compile time of ads is dropped by 25%. Differential Revision: https://reviews.llvm.org/D107299	2021-08-30 20:09:29 -07:00
MaheshRavishankar	2dfb66833f	Fix unused variable in release build. Differential Revision: https://reviews.llvm.org/D108963	2021-08-30 19:34:52 -07:00
Xu Mingjie	f10d003b0c	[tsan] Add environment variable TSAN_SYMBOLIZER_PATH as we do in other sanitizers ASan, LSan, MSan and UBSan all allow to use environment variable `*SAN_SYMBOLIZER_PATH` to pass the symbolizer path, this patch add `TSAN_SYMBOLIZER_PATH` to TSan. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D108911	2021-08-31 10:18:52 +08:00
Nico Weber	86c8f395ae	[lld/mac] Leave more room for thunks in thunk placement code Fixes PR51578 in practice. Currently there's only enough room for a single thunk, which for real-life code isn't enough. The error case only happens when there are many branch statements very close to each other (0 or 1 instructions apart), with the function at the finalization barrier small. There's a FIXME on what to do if we hit this case, but that suggestion sounds complicated to me (see end of PR51578 comment 5 for why). Instead, just leave more room for thunks. Chromium's unit_tests links fine with room for 3 thunks. Leave room for 100, which should fix this for most cases in practice. There's little cost for leaving lots of room: This slop value only determines when we finalize sections, and we insert thunks for forward jumps into unfinalized sections. So leaving room means we'll need a few more thunks, but the thunk jump range is 128 MiB while a single thunk is just 12 bytes. For Chromium's unit_tests: With a slop of 3: thunk calls = 355418, thunks = 10903 With a slop of 100: thunk calls = 355426, thunks = 10904 Chances are 100 is enough for all use cases we'll hit in practice, but even bumping it to 1000 would probably be fine. Differential Revision: https://reviews.llvm.org/D108930	2021-08-30 22:09:05 -04:00
Volodymyr Sapsai	93764ff6e2	[modules] Fix miscompilation when using two RecordDecl definitions with the same name. When deserializing a RecordDecl we don't enforce that redeclaration chain contains only a single definition. So if the canonical decl is not a definition itself, `RecordType::getDecl` can return different objects before and after an include. It means we can build CGRecordLayout for one RecordDecl with its set of FieldDecl but try to use it with FieldDecl belonging to a different RecordDecl. With assertions enabled it results in > Assertion failed: (FieldInfo.count(FD) && "Invalid field for record!"), > function getLLVMFieldNo, file llvm-project/clang/lib/CodeGen/CGRecordLayout.h, line 199. and with assertions disabled a bunch of fields are treated as their memory is located at offset 0. Fix by keeping the first encountered RecordDecl definition and marking the subsequent ones as non-definitions. Also need to merge FieldDecl properly, so that `getPrimaryMergedDecl` works correctly and during name lookup we don't treat fields from same-name RecordDecl as ambiguous. rdar://80184238 Differential Revision: https://reviews.llvm.org/D106994	2021-08-30 17:51:38 -07:00
Keith Smiley	b5da3120b8	[llvm-cov][NFC] Add test for coverage-prefix-map remappings This test covers acts as a regression test for these fixes: `c75a0a1e9d` `dd388ba3e0` Differential Revision: https://reviews.llvm.org/D108805	2021-08-30 17:19:57 -07:00
MaheshRavishankar	ba72cfe734	[mlir] Add an interface to allow operations to specify how they can be tiled. An interface to allow for tiling of operations is introduced. The tiling of the linalg.pad_tensor operation is modified to use this interface. Differential Revision: https://reviews.llvm.org/D108611	2021-08-30 16:31:18 -07:00
peter klausler	3fefebabe5	[flang] Fold EOSHIFT Implement constant folding for the transformational intrinsic function EOSHIFT. Differential Revision: https://reviews.llvm.org/D108941	2021-08-30 16:27:35 -07:00
Chris Lattner	faf1c22408	[Builder] Eliminate the StringRef/StringAttr forms of getSymbolRefAttr. The StringAttr version doesn't need a context, so we can just use the existing `SymbolRefAttr::get` form. The StringRef version isn't preferred so we want to encourage people to use StringAttr. There is an additional form of getSymbolRefAttr that takes a (SymbolTrait implementing) operation. This should also be moved, but I'll do that as a separate patch. Differential Revision: https://reviews.llvm.org/D108922	2021-08-30 16:05:36 -07:00
Michael Jones	7f2ce19d1c	[libc][nfc][obvious] fix typos in FPUtil Fix minor typos in FPUtil comments. Reviewed By: michaelrj Differential Revision: https://reviews.llvm.org/D108952	2021-08-30 22:39:02 +00:00
Keno Fischer	ea8539111d	[COFF] Force Symbols containing '.' to be quoted In D87099, the mangler learned to quote export directives that contain special characters. Only alhpanumerical characters as well as '_', '$', '.' and '@' were exmpt from this quoting. However, at least binutils considers an unquoted '.' to be syntax and object files containing such symbols will cause errors during linking. Fix that by removing '.' from the list of allowed exemptions. Differential Revision: https://reviews.llvm.org/D100359	2021-08-30 17:26:57 -04:00
Artem Belevich	30dfd3449e	[MemCpyOpt] Allow specifying --enable-memcpyopt-without-libcalls more than once so we can override it via clang's CLI if necessary.	2021-08-30 13:55:55 -07:00
Siva Chandra Reddy	7a2a765745	[libc] Add mtx_destroy which does nothing. There is not cleanup to be done for the mutex type so mtx_destroy does nothing.	2021-08-30 20:43:46 +00:00
Andrew Litteken	c58d4c4bd3	[IROutliner] Changing outliner to prioritize reductions on assembly rather than IR instruction Currently, the IROutliner uses a simple metric to outline the largest amount of IR possible to outline first if it fits the cost model. This is model loses out on smaller blocks of code that have higher reductions in cost that are contained within larger blocks of IR. This reverses the order, where we calculate all of the costs first, and then reorder and extract items based on the calculated results. Reviewers: paquette Differential Revision: https://reviews.llvm.org/D106440	2021-08-30 13:43:08 -07:00
Nikita Popov	c1b7540645	[TTI] Sink IVDescriptors.h include (NFC) Forward declare RecurrenceDescriptor and include IVDescritor.h only in implementation code that actually needs it.	2021-08-30 22:41:58 +02:00
natashaknk	203d38b234	[mlir][tosa] Small refactor to the functionality of Conv2D and Fully_connected to add the bias at the end of the convolution Made to adjust for a modification to the tiling algorithm Reviewed By: rsuderman Differential Revision: https://reviews.llvm.org/D108746	2021-08-30 13:18:43 -07:00
Craig Topper	201f6446da	[LegalizeTypes][X86] Improve ExpandIntRes_FP_TO_SINT/ExpandIntRes_FP_TO_UINT when input is SoftPromoteHalf. Instead of splitting off the fp16 to float conversion and generating a libcall, we should split the operation into fp16 to float and float to integer operations. This will allow the float to integer conversion to go through any custom handling the target has. If the target doesn't have custom handling then we should come back to ExpandIntRes_FP_TO_SINT/ ExpandIntRes_FP_TO_UINT automatically to create the libcall. This avoids generating libcalls on 32-bit X86. These library functions may not exist in 32-bit libgcc. At least for LLVM, we never generate them when hardware floating point instructions are available. Differential Revision: https://reviews.llvm.org/D108933	2021-08-30 13:12:59 -07:00
Bjorn Pettersson	789f01283d	[SelectionDAG] Fix miscompile bugs related to smul.fix.sat with scale zero When expanding a SMULFIXSAT ISD node (usually originating from a smul.fix.sat intrinsic) we've applied some optimizations for the special case when the scale is zero. The idea has been that it would be cheaper to use an SMULO instruction (if legal) to perform the multiplication and at the same time detect any overflow. And in case of overflow we could use some SELECT:s to replace the result with the saturated min/max value. The only tricky part is to know if we overflowed on the min or max value, i.e. if the product is positive or negative. Unfortunately the implementation has been incorrect as it has looked at the product returned by the SMULO to determine the sign of the product. In case of overflow that product is truncated and won't give us the correct sign bit. This patch is adding an extra XOR of the multiplication operands, which is used to determine the sign of the non truncated product. This patch fixes PR51677. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D108938	2021-08-30 22:08:26 +02:00

1 2 3 4 5 ...

397796 Commits All Branches Search

397796 Commits

All Branches