llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	46aa3c6c33	[DAG] visitVECTOR_SHUFFLE - MergeInnerShuffle - improve shuffle(shuffle(x,y),shuffle(x,y)) merging MergeInnerShuffle currently attempts to merge shuffle(shuffle(x,y),z) patterns into a single shuffle, using 1 or 2 of the x,y,z ops. However if we already match 2 ops we might be able to handle the third op if its also a shuffle that references one of the previous ops, allowing us to handle some cases like: shuffle(shuffle(x,y),shuffle(x,y)) shuffle(shuffle(shuffle(x,z),y),z) shuffle(shuffle(x,shuffle(x,y)),z) etc. This isn't an exhaustive match and is dependent on the order the candidate ops are encountered - if one of the matched ops was a shuffle that was peek-able we don't go back and try to split that, I haven't found much need for that amount of analysis yet. This is a preliminary patch that will allow us to later improve x86 HADD/HSUB matching - but needs to be reviewed separately as its in generic code and affects existing Thumb2 tests. Differential Revision: https://reviews.llvm.org/D94671	2021-01-15 15:08:31 +00:00
Jamie Schmeiser	17d0fb7f57	Set option default for enabling memory ssa for new pass manager loop sink pass to true. Summary: Set the default for the option enabling memory ssa use in the loop sink pass to true for the new pass manager. Author: Jamie Schmeiser <schmeise@ca.ibm.com> Reviewed By: asbirlea (Alina Sbirlea) Differential Revision: https://reviews.llvm.org/D92486	2021-01-15 09:56:44 -05:00
Simon Pilgrim	5183a13d37	[X86] Add umin knownbits/demandedbits ult test for D94532	2021-01-15 14:42:55 +00:00
Jan Svoboda	791634b999	[clang][cli] Parse & generate options necessary for LangOptions defaults manually It turns out we need to handle `LangOptions` separately from the rest of the options. `LangOptions` used to be conditionally parsed only when `!(DashX.getFormat() == InputKind::Precompiled \|\| DashX.getLanguage() == Language::LLVM_IR)` and we need to restore this order (for more info, see D94682). D94682 moves the parsing of marshalled `LangOpts` from `parseSimpleArgs` back to `ParseLangArgs`. We need to parse marshalled `LangOpts` after `ParseLangArgs` calls `setLangDefaults`. This will enable future patches, where values of some `LangOpts` depend on the defaults. However, two language options (`-finclude-default-header` and `-fdeclare-opencl-builtins`) need to be parsed before `ParseLangArgs` calls `setLangDefaults`, because they are necessary for setting up OpenCL defaults correctly. This patch implements this by removing their marshalling info and manually parsing (and generating) them exactly where necessary. Reviewed By: Bigcheese Differential Revision: https://reviews.llvm.org/D94678	2021-01-15 15:38:43 +01:00
Anastasia Stulova	d1862a1631	[OpenCL][Docs] Fixed malformed table in OpenCLSupport Tags: #clang	2021-01-15 14:27:26 +00:00
Stephan Herhut	061d152085	[SVE] Fix unused variable. Introduced by [SVE] Restrict the usage of REINTERPRET_CAST. Differential Revision: https://reviews.llvm.org/D94773	2021-01-15 15:10:33 +01:00
Lei Zhang	0acc260b57	[mlir][linalg] Support generating builders for named op attributes This commit adds support to generate an additional builder for each named op that has attributes. This gives better experience when creating the named ops. Along the way adds support for i64. Reviewed By: hanchung Differential Revision: https://reviews.llvm.org/D94733	2021-01-15 09:00:30 -05:00
Sam Tebbs	5e4480b6c0	[ARM] Don't run the block placement pass at O0 The block placement pass shouldn't run unless optimisations are enabled. Differential Revision: https://reviews.llvm.org/D94691	2021-01-15 13:59:29 +00:00
Andy Wingo	e9f1ed2306	[WebAssembly] MC layer writes table symbols to object files Now that the linker handles table symbols, we can allow the frontend to produce them. Depends on D91870. Differential Revision: https://reviews.llvm.org/D92215	2021-01-15 14:55:55 +01:00
Simon Pilgrim	1dfd5c9ad8	[X86][AVX] combineHorizOpWithShuffle - support target shuffles in HOP(SHUFFLE(X,Y),SHUFFLE(X,Y)) -> SHUFFLE(HOP(X,Y)) Be more aggressive on (AVX2+) folds of lane shuffles of 256-bit horizontal ops by working on target/faux shuffles as well.	2021-01-15 13:55:30 +00:00
Raphael Isemann	4017c6fe7f	[lldb][docs] Translate ASCII art to restructured text formatting This translates most of the old ASCII art in our documentation to the equivalent in restructured text (which the new version of the LLDB docs is using).	2021-01-15 14:43:27 +01:00
Adam Czachorowski	c77c3d1d18	[clangd] Set correct CWD when using compile_flags.txt This fixes a bug where clangd would attempt to set CWD to the compile_flags.txt file itself. Differential Revision: https://reviews.llvm.org/D94699	2021-01-15 14:26:24 +01:00
Adam Czachorowski	aeaeb9e6bd	[clangd] Make ExpandAutoType not available on template params. We cannot expand auto when used inside a template param (C++17 feature), so do not offer it there. Differential Revision: https://reviews.llvm.org/D94719	2021-01-15 14:19:05 +01:00
Raphael Isemann	9d2053f61a	Revert "[lldb][docs] Use sphinx instead of epydoc to generate LLDB's Python reference" This reverts commit `bab121a1b6`. It seems the docs buildbot doesn't have the required Python headers.	2021-01-15 14:07:45 +01:00
Stefan Gränitz	6edc3fe598	[Orc] Fix OrcV2Examples after D94690 Differential Revision: https://reviews.llvm.org/D94690	2021-01-15 13:45:46 +01:00
Raphael Isemann	bab121a1b6	[lldb][docs] Use sphinx instead of epydoc to generate LLDB's Python reference Currently LLDB uses epydoc to generate the Python API reference for the website. epydoc however is unmaintained since more than a decade and no longer works with Python 3. Also whatever setup we had once for generating the documentation on the website server no longer seems to work, so the current website documentation has been stale since more than a year. This patch replaces epydoc with sphinx and its automodapi plugin that can generate Python API references. LLVM already uses sphinx for the rest of the documentation, so this way we are more consistent with the rest of LLVM. The only new dependency is the automodapi plugin for sphinx. This patch effectively does the following things: * Remove the epydoc code. * Make a new dummy Python API page in our website that just calls the Sphinx command for generated the API documentation. * Add a mock _lldb module that is only used when generating the Python API. This way we don't have to build all of LLDB to generate the API reference. Some notes: * The long list of skips is necessary due to boilerplate functions that SWIG is generating. Sadly automodapi is not really scriptable from what I can see, so we have to blacklist this stuff manually. * The .gitignore change because automodapi wants a subfolder of our documentation directory to place generated documentation files there. The path is also what is used on the website, so we can't really workaround this (without copying the whole `docs` dir somewhere else when we build). * We have to use environment variables to pass our build path to our sphinx configuration. Sphinx doesn't support passing variables onto that script. Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D94489	2021-01-15 13:26:42 +01:00
Hsiangkai Wang	619eb14775	[NFC][RISCV] Remove useless code in RISCVRegisterInfo.td. Differential Revision: https://reviews.llvm.org/D94750	2021-01-15 20:08:51 +08:00
Stefan Gränitz	cf905274c6	[Orc] Allow LLJITBuilder's CreateObjectLinkingLayer to return errors It can be useful for an ObjectLinkingLayerCreator to allow callee errors to get propagated to the builder. Specifically, this is the case when the ObjectLayer uses the EHFrameRegistrationPlugin, because it requires a TPCEHFrameRegistrar and instantiation for it may fail (e.g. if the required registration symbols are missing in the target process). Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D94690	2021-01-15 12:53:41 +01:00
Stefan Gränitz	a5eb9df1e3	[Orc][NFC] Turn LLJIT member ObjTransformLayer into unique_ptr All other layers in LLJIT are stored as unique_ptr's already. At this point, it is not strictly necessary for ObjTransformLayer, but it makes a follow-up change more straightforward. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D94689	2021-01-15 12:53:24 +01:00
Paul Walker	2b8db40c92	[SVE] Restrict the usage of REINTERPRET_CAST. In order to limit the number of combinations of REINTERPRET_CAST, whilst at the same time prevent overlap with BITCAST, this patch establishes the following rules: 1. The operand and result element types must be the same. 2. The operand and/or result type must be an unpacked type. Differential Revision: https://reviews.llvm.org/D94593	2021-01-15 11:32:13 +00:00
Sam Elliott	141e45b99c	[RISCV] Optimize Branch Comparisons I noticed in D94450 that there were quite a few places where we generate the sequence: ``` xN <- comparison ... xN <- xor xN, 1 bnez xN, symbol ``` Given we know the XOR will be used by BRCOND, which only looks at the lowest bit, I think we can remove the XOR and just invert the branch condition in these cases? The case mostly seems to come up in floating point tests, where there is often more logic to combine the results of multiple SETCCs, rather than a single (BRCOND (SETCC ...) ...). Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D94535	2021-01-15 11:28:19 +00:00
Muhammad Omair Javaid	b9993fcbf5	DynamicRegisterInfo calculate offsets in separate function This patch pull offset calculation logic out of DynamicRegisterInfo::Finalize into a separate function. We are going to call this function whenever we update SVE register sizes. Reviewed By: labath Differential Revision: https://reviews.llvm.org/D94008	2021-01-15 16:21:18 +05:00
Muhammad Omair Javaid	4fd77668b2	[LLDB] Add per-thread register infos shared pointer in gdb-remote In gdb-remote process we have register infos defind as a refernce object of GDBRemoteDynamicRegisterInfo class. In past register infos have remained constant througout the life time of a process. This has changed after AArch64 SVE support where register infos will have per-thread configuration. SVE registers will have per-thread size and can be updated while running. This patch aims to build up for that support by changing GDBRemoteDynamicRegisterInfo reference to a shared pointer deinfed per-thread. Reviewed By: labath Differential Revision: https://reviews.llvm.org/D82857	2021-01-15 16:11:17 +05:00
Ilya Golovenko	9cc221b99b	[clangd] exclude symbols from document outline which do not originate from the main file Differential Revision: https://reviews.llvm.org/D94753	2021-01-15 13:23:12 +03:00
Georgii Rymar	45ef053bd7	[llvm-readobj][test] - Remove excessive YAML fields from tests. This removes excessive YAML keys from `SHT_GNU_verdef` sections. Those keys are set by default. Differential revision: https://reviews.llvm.org/D94660	2021-01-15 12:46:39 +03:00
Georgii Rymar	d9afe8588e	[yaml2obj/obj2yaml] - Refine handling of SHT_GNU_verdef sections. This patch: 1) Makes `Version`, `Flags`, `VersionNdx` and `Hash` fields to be `Optional<>`. 2) Disallows dumping version definitions that have `vd_version != 1`. `vd_version` identifies the version of the structure itself. (https://refspecs.linuxfoundation.org/LSB_5.0.0/LSB-Core-generic/LSB-Core-generic/symversion.html, https://docs.oracle.com/cd/E19683-01/816-7777/chapter6-80869/index.html) 3) Stops dumping default values for `Version`, `Flags`, `VersionNdx` and `Hash` fields. 4) Refines testing. Differential revision: https://reviews.llvm.org/D94659	2021-01-15 12:40:42 +03:00
Oliver Stannard	3676ef1053	[ARM][GISel] Treat calls as variadic even if only fixed arguments provided For the ARM hard-float calling convention, calls to variadic functions need to be treated diffrently, even if only the fixed arguments are provided. This fixes GCC-C-execute-pr68390 in the test-suite, which is failing on the ARM GlobaISel bot.	2021-01-15 09:37:01 +00:00
Georgii Rymar	021ea78a97	[llvm-nm] - Simplify the code in dumpSymbolNamesFromObject. NFC. It is possible to simplify the logic that extracts symbol names. D94667 made the `NMSymbol::Name` to be `std::string`, what allowed this simplification. Differential revision: https://reviews.llvm.org/D94669	2021-01-15 12:29:49 +03:00
Guillaume Chatelet	a10300a2b2	[libc] Allow customization of memcpy via flags. - Adds LLVM_LIBC_IS_DEFINED macro to libc/src/__support/common.h - Adds a few knobs to memcpy to help with experimentations: - LLVM_LIBC_MEMCPY_X86_USE_ONLY_REPMOVSB replaces the implementation with a single call to rep;movsb - LLVM_LIBC_MEMCPY_X86_USE_REPMOVSB_FROM_SIZE customizes where the usage of rep;movsb Differential Revision: https://reviews.llvm.org/D94692	2021-01-15 09:26:45 +00:00
Georgii Rymar	bfb8f45ef3	[llvm-nm] - Move MachO specific logic out from the dumpSymbolNamesFromObject(). NFC. `dumpSymbolNamesFromObject` is the method that dumps symbol names. It has 563 lines, mostly because of huge piece of MachO specific code. In this patch I move it to separate helper method. The new size of `dumpSymbolNamesFromObject` is 93 lines. With it it becomes much easier to maintain it. I had to change the type of 2 name fields to `std::string`, because MachO logic uses temporarily buffer strings (e.g `ExportsNameBuffer`, `BindsNameBuffer` etc): ``` std::string ExportsNameBuffer; raw_string_ostream EOS(ExportsNameBuffer); ``` these buffers were moved to `dumpSymbolsFromDLInfoMachO` by this patch and invalidated after return. Technically, before this patch we had a situation when local pointers (symbol names) were assigned to members of global static `SymbolList`, what is dirty by itself. Differential revision: https://reviews.llvm.org/D94667	2021-01-15 12:18:37 +03:00
Alok Kumar Sharma	104a9f99cc	[Debuginfo][DW_OP_implicit_pointer] (1/7) Support for DW_OP_LLVM_implicit_pointer New dwarf operator DW_OP_LLVM_implicit_pointer is introduced (present only in LLVM IR) This operator is required as it is different than DWARF operator DW_OP_implicit_pointer in representation and specification (number and types of operands) and later can not be used as multiple level. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D84113	2021-01-15 14:45:04 +05:30
Igor Kudrin	7803636057	[libcxx testing] Fix UB in tests for std::lock_guard If mutex::try_lock() is called in a thread that already owns the mutex, the behavior is undefined. The patch fixes the issue by creating another thread, where the call is allowed. Differential Revision: https://reviews.llvm.org/D94656	2021-01-15 16:11:45 +07:00
Amara Emerson	89e84dec18	[AArch64][GlobalISel] Fix fallbacks introduced for G_SITOFP in `8f283cafdd` If we have an integer->fp convert that has differing sizes, e.g. s32 to s64, then don't try to convert it to AArch64::G_SITOF since it won't select.	2021-01-15 01:10:49 -08:00
Georgii Rymar	1185d3f43d	[llvm-readobj] - Fix the compilation with GCC < 7.0. This addressed post commit comments for D93900. GCC had an issue and requires placing a specialization of `printUnwindInfo` to a namespace to compile: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=56480	2021-01-15 11:58:04 +03:00
Qiu Chaofan	168be42083	[Clang] Mutate long-double math builtins into f128 under IEEE-quad Under -mabi=ieeelongdouble on PowerPC, IEEE-quad floating point semantic is used for long double. This patch mutates call to related builtins into f128 version on PowerPC. And in theory, this should be applied to other targets when their backend supports IEEE 128-bit style libcalls. GCC already has these mutations except nansl, which is not available on PowerPC along with other variants (nans, nansf). Reviewed By: RKSimon, nemanjai Differential Revision: https://reviews.llvm.org/D92080	2021-01-15 16:56:20 +08:00
Nikita Popov	33be50daa9	Revert "Reapply "ADT: Fix reference invalidation in SmallVector::push_back and single-element insert"" This reverts commit `260a856c2a`. This reverts commit `3043e5a5c3`. This reverts commit `49142991a6`. This change had a larger than anticipated compile-time impact, possibly because the small value optimization is not working as intended. See D93779.	2021-01-15 09:28:42 +01:00
Andy Wingo	38dfce706f	[WebAssembly] Add support for table linking to wasm-ld This patch adds support to wasm-ld for linking multiple table references together, in a manner similar to wasm globals. The indirect function table is synthesized as needed. To manage the transitional period in which the compiler doesn't yet produce TABLE_NUMBER relocations and doesn't residualize table symbols, the linker will detect object files which have table imports or definitions, but no table symbols. In that case it will synthesize symbols for the defined and imported tables. As a change, relocatable objects are now written with table symbols, which can cause symbol renumbering in some of the tests. If no object file requires an indirect function table, none will be written to the file. Note that for legacy ObjFile inputs, this test is conservative: as we don't have relocs for each use of the indirecy function table, we just assume that any incoming indirect function table should be propagated to the output. Differential Revision: https://reviews.llvm.org/D91870	2021-01-15 09:21:52 +01:00
KAWASHIMA Takahiro	b54337070b	[AArch64] Add Fujitsu A64FX scheduling model Basic support of A64FX was added in D75594 but its scheduling model was missing. This commit adds the scheduling model. Also, this commit amends/adds some subtarget parameters of A64FX. The A64FX Microarchitecture Manual, which is source information of this commit, is on GitHub. https://github.com/fujitsu/A64FX/ Differential Revision: https://reviews.llvm.org/D93791	2021-01-15 17:14:04 +09:00
Jan Svoboda	b6575bfd0e	[clang][cli] Specify KeyPath prefixes via TableGen classes It turns out we need to handle `LangOptions` separately from the rest of the options. `LangOptions` used to be conditionally parsed only when `!(DashX.getFormat() == InputKind::Precompiled \|\| DashX.getLanguage() == Language::LLVM_IR)` and we need to restore this order (for more info, see D94682). We could do this similarly to how `DiagnosticOptions` are handled: via a counterpart to the `IsDiag` mix-in (e.g. `IsLang`). These mix-ins would prefix the option key path with the appropriate `CompilerInvocation::XxxOpts` member. However, this solution would be problematic, as we'd now have two kinds of options (`Lang` and `Diag`) with seemingly incomplete key paths in the same file. To understand what `CompilerInvocation` member an option affects, one would need to read the whole option definition and notice the `IsDiag` or `IsLang` class. Instead, this patch introduces more robust way to handle different kinds of options separately: via the `KeyPathAndMacroPrefix` class. We have one specialization of that class per `CompilerInvocation` member (e.g. `LangOpts`, `DiagnosticOpts`, etc.). Now, instead of specifying a key path with `"LangOpts->UndefPrefixes"`, we use `LangOpts<"UndefPrefixes">`. This keeps the readability intact (you don't have to look for the `IsLang` mix-in, the key path is complete on its own) and allows us to specify a custom macro prefix within `LangOpts`. Reviewed By: Bigcheese Differential Revision: https://reviews.llvm.org/D94676	2021-01-15 08:42:59 +01:00
Jan Svoboda	1a49944b59	[clang][cli] NFC: Decrease the scope of ParseCodeGenArgs parameters Instead of passing the whole `TargetOptions` and `FrontendOptions` to `ParseCodeGenArgs` give it only the necessary members. This makes tracking the dependencies between various parsers and option groups easier. Reviewed By: Bigcheese Differential Revision: https://reviews.llvm.org/D94675	2021-01-15 08:42:30 +01:00
Jan Svoboda	c495dfe026	[clang][cli] NFC: Decrease the scope of ParseLangArgs parameters Instead of passing the whole `TargetOptions` and `PreprocessorOptions` to `ParseLangArgs` give it only the necessary members. This makes tracking the dependencies between various parsers and option groups easier. Reviewed By: Bigcheese Differential Revision: https://reviews.llvm.org/D94674	2021-01-15 08:41:50 +01:00
Aart Bik	5508516b06	[mlir][sparse] retry sparse-only for cyclic iteration graphs This is a very minor improvement during iteration graph construction. If the first attempt considering the dimension order of all tensors fails, a second attempt is made using the constraints of sparse tensors only. Dense tensors prefer dimension order (locality) but provide random access if needed, enabling the compilation of more sparse kernels. Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D94709	2021-01-14 22:39:29 -08:00
Yashaswini	39665d9aab	Add Semantic check for Flang OpenMP 4.5 - 2.7.1 Do Loop restrictions on single directive and firstprivate clause. Semantic checks added to check the worksharing 'single' region closely nested inside a worksharing 'do' region. And also to check whether the 'do' iteration variable is a variable in 'Firstprivate' clause. Files: check-directive-structure.h check-omp-structure.h check-omp-structure.cpp Testcases: omp-do01-positivecase.f90 omp-do01.f90 omp-do05-positivecase.f90 omp-do05.f90 Reviewed by: Kiran Chandramohan @kiranchandramohan , Valentin Clement @clementval Differential Revision: https://reviews.llvm.org/D93205	2021-01-15 11:16:25 +05:30
Kazu Hirata	7dc3575ef2	[llvm] Remove redundant return and continue statements (NFC) Identified with readability-redundant-control-flow.	2021-01-14 20:30:34 -08:00
Kazu Hirata	2efcbe24a7	[llvm] Use llvm::drop_begin (NFC)	2021-01-14 20:30:33 -08:00
Kazu Hirata	9bcc0d1040	[CodeGen, Transforms] Use llvm::sort (NFC)	2021-01-14 20:30:31 -08:00
Cheng Wang	2423ec5837	[libc] Add memmove implementation. Use `memcpy` rather than copying bytes one by one, for there might be large size structs to move. Reviewed By: gchatelet, sivachandra Differential Revision: https://reviews.llvm.org/D93195	2021-01-15 12:08:25 +08:00
Amara Emerson	8f283cafdd	[AArch64][GlobalISel] Add selection support for fpr bank source variants of G_SITOFP and G_UITOFP. In order to import patterns for these, we need to define new ops that can map to the AArch64ISD::[SU]ITOF nodes. We then transform fpr->fpr variants of the generic opcodes to these custom opcodes in preisel-lowering. We have to do it here and not the PostLegalizer combiner because this has to run after regbankselect. Differential Revision: https://reviews.llvm.org/D94702	2021-01-14 19:31:19 -08:00
Yitzhak Mandelbaum	1fabe6e519	[libTooling] Change `addInclude` to use expansion locs. This patch changes the default range used to anchor the include insertion to use an expansion loc. This ensures that the location is valid, when the user relies on the default range. Driveby: extend a FIXME for a problem that was emphasized by this change; fix some spellings. Differential Revision: https://reviews.llvm.org/D93703	2021-01-15 03:08:56 +00:00
Jon Chesterfield	214387c2c6	[libomptarget][nvptx] Reduce calls to cuda header [libomptarget][nvptx] Reduce calls to cuda header Remove use of clock_t in favour of a builtin. Drop a preprocessor branch. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D94731	2021-01-15 02:16:33 +00:00

1 2 3 4 5 ...

377131 Commits All Branches Search

377131 Commits

All Branches