llvm-project

Commit Graph

Author	SHA1	Message	Date
wlei	c2e08aba1a	[llvm-profgen] Compute and show profile density AutoFDO performance is sensitive to profile density, i.e., the amount of samples in the profile relative to the program size, because profiles with insufficient samples could be inaccurate due to statistical noise and thus hurt AutoFDO performance. A previous investigation showed that AutoFDO performed better on MySQL with increased amount of samples. Therefore, we implement a profile-density computation feature to give hints about profile density to users and the compiler. We define the density of a profile Prof as follows: - For each function A in the profile, density(A) = total_samples(A) / sizeof(A). - density(Prof) = min(density(A)) for all functions A that are warm (defined below). A function is considered warm if its total-samples is within top N percent of the profile. For implementation, we reuse the `ProfileSummaryBuilder::getHotCountThreshold(..)` as threshold which can be set by percent(`--profile-summary-cutoff-hot`) or by value(`--profile-summary-hot-count`). We also introduce `--hot-function-density-threshold` to set hot function density threshold and will give suggestion if profile density is below it which implies we should increase samples. This also applies for CS profile with all profiles merged into base. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D113781	2021-11-29 23:54:31 -08:00
Zarko Todorovski	e714394ab8	[LLVM][llvm-cov] Inclusive language: rename option -name-whitelist to -name-allowlist Renamed the option for llvm-cov and changed variable names to use more inclusive terms. Also changed the binary for the test. Reviewed By: alanphipps Differential Revision: https://reviews.llvm.org/D112816	2021-11-26 11:08:01 -05:00
Florian Hahn	fb46e64a01	Revert "[ThreadPool] Do not return shared futures." This reverts commit `a5fff58781`. The offending commit broke building with LLVM_ENABLE_THREADS=OFF.	2021-11-24 19:01:47 +00:00
Paul Robinson	f3bfe1b418	Have yaml2obj describe all options in --help Differential Revision: https://reviews.llvm.org/D114538	2021-11-24 07:44:52 -08:00
Djordje Todorovic	e3d8ebe158	[llvm-dwarfdump][Statistics] Handle LTO cases with cross CU referencing With link-time optimizations enabled, resulting DWARF mayend up containing cross CU references (through the DW_AT_abstract_origin attribute). Consider the following example: // sum.c __attribute__((always_inline)) int sum(int a, int b) { return a + b; } // main.c extern int sum(int, int); int main() { int a = 5, b = 10, c = sum(a, b); return 0; } Compiled as follows: $ clang -g -flto -fuse-ld=lld main.c sum.c -o main Results in the following DWARF: -- sum.c CU: abstract instance tree ... 0x000000b0: DW_TAG_subprogram DW_AT_name ("sum") DW_AT_decl_file ("sum.c") DW_AT_decl_line (1) DW_AT_prototyped (true) DW_AT_type (0x000000d3 "int") DW_AT_external (true) DW_AT_inline (DW_INL_inlined) 0x000000bc: DW_TAG_formal_parameter DW_AT_name ("a") DW_AT_decl_file ("sum.c") DW_AT_decl_line (1) DW_AT_type (0x000000d3 "int") 0x000000c7: DW_TAG_formal_parameter DW_AT_name ("b") DW_AT_decl_file ("sum.c") DW_AT_decl_line (1) DW_AT_type (0x000000d3 "int") ... -- main.c CU: concrete inlined instance tree ... 0x0000006d: DW_TAG_inlined_subroutine DW_AT_abstract_origin (0x00000000000000b0 "sum") DW_AT_low_pc (0x00000000002016ef) DW_AT_high_pc (0x00000000002016f1) DW_AT_call_file ("main.c") DW_AT_call_line (5) DW_AT_call_column (0x19) 0x00000081: DW_TAG_formal_parameter DW_AT_location (DW_OP_reg0 RAX) DW_AT_abstract_origin (0x00000000000000bc "a") 0x00000088: DW_TAG_formal_parameter DW_AT_location (DW_OP_reg2 RCX) DW_AT_abstract_origin (0x00000000000000c7 "b") ... Note that each entry within the concrete inlined instance tree in the main.c CU has a DW_AT_abstract_origin attribute which refers to a corresponding entry within the abstract instance tree in the sum.c CU. llvm-dwarfdump --statistics did not properly report DW_TAG_formal_parameters/DW_TAG_variables from concrete inlined instance trees which had 0% location coverage and which referred to a different CU, mainly because information about abstract instance trees and their parameters/variables was stored locally - just for the currently processed CU, rather than globally - for all CUs. In particular, if the concrete inlined instance tree from the example above was to look like this (i.e. parameter b has 0% location coverage, hence why it's missing): 0x0000006d: DW_TAG_inlined_subroutine DW_AT_abstract_origin (0x00000000000000b0 "sum") DW_AT_low_pc (0x00000000002016ef) DW_AT_high_pc (0x00000000002016f1) DW_AT_call_file ("main.c") DW_AT_call_line (5) DW_AT_call_column (0x19) 0x00000081: DW_TAG_formal_parameter DW_AT_location (DW_OP_reg0 RAX) DW_AT_abstract_origin (0x00000000000000bc "a") llvm-dwarfdump --statistics would have not reported b as such. Patch by Dimitrije Milosevic. Differential revision: https://reviews.llvm.org/D113465	2021-11-24 13:50:47 +01:00
Florian Hahn	8ef460fc51	[llvm-reduce] Add parallel chunk processing. This patch adds parallel processing of chunks. When reducing very large inputs, e.g. functions with 500k basic blocks, processing chunks in parallel can significantly speed up the reduction. To allow modifying clones of the original module in parallel, each clone needs their own LLVMContext object. To achieve this, each job parses the input module with their own LLVMContext. In case a job successfully reduced the input, it serializes the result module as bitcode into a result array. To ensure parallel reduction produces the same results as serial reduction, only the first successfully reduced result is used, and results of other successful jobs are dropped. Processing resumes after the chunk that was successfully reduced. The number of threads to use can be configured using the -j option. It defaults to 1, which means serial processing. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D113857	2021-11-24 09:23:52 +00:00
Bill Wendling	2975f37d8d	[llvm-diff] Implement diff of PHI nodes Implement diff of PHI nodes Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D114211	2021-11-22 13:23:10 -08:00
Nico Weber	1718fe4643	[llvm-objcopy] Fix some comment typos	2021-11-17 13:43:30 -05:00
Keith Smiley	68311f21eb	[llvm-objcopy][MachO] Add llvm-strip support for newer load commands Previously llvm-strip would fail because of unknown commands. Fixes https://bugs.llvm.org/show_bug.cgi?id=50044 Differential Revision: https://reviews.llvm.org/D113734	2021-11-17 10:36:35 -08:00
Keith Smiley	693b02023e	[llvm-objdump/mac] Add support for new load commands Differential Revision: https://reviews.llvm.org/D113733	2021-11-17 09:53:25 -08:00
Leonard Chan	25bcd94234	[llvm-objcopy] Add --update-section This is another attempt at D59351 which attempted to add --update-section, but with some heuristics for adjusting segment/section offsets/sizes in the event the data copied into the section is larger than the original size of the section. We are opting to not support this case. GNU's objcopy was able to do this because the linker and objcopy are tightly coupled enough that segment reformatting was simpler. This is not the case with llvm-objcopy and lld where they like to be separated. This will attempt to copy data into the section without changing any other properties of the parent segment (if the section is part of one). Differential Revision: https://reviews.llvm.org/D112116	2021-11-16 14:10:40 -08:00
Duncan P. N. Exon Smith	fd6018072a	DebugInfo: Make DWARFExpression::iterator a const iterator `3d1d8c767b` made DWARFExpression::iterator's Operation member `mutable`. After a few prep commits, the iterator can instead be made a `const` iterator since no caller can change the Operation. Differential Revision: https://reviews.llvm.org/D113958	2021-11-16 10:25:10 -08:00
Florian Hahn	be56ece918	[llvm-reduce] Move code to check chunk to function, to enable reuse (NFC). This patch moves the logic to clone and check a new chunk into a new function, to allow re-use in a follow-up patch that implements parallel reductions. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D113856	2021-11-16 15:39:13 +00:00
Florian Hahn	97b9b6f565	[llvm-reduce] Add new BitWriter dependency after `28d95a2610`.	2021-11-16 12:48:21 +00:00
Florian Hahn	28d95a2610	[llvm-reduce] Allow writing temporary files as bitcode. Textual LLVM IR files are much bigger and take longer to write to disk. To avoid the extra cost incurred by serializing to text, this patch adds an option to save temporary files as bitcode instead. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D113858	2021-11-16 12:39:42 +00:00
Wenlei He	f7976edc1e	[llvm-profgen] Add switch to allow use of first loadable segment for calculating offset Adding `-use-loadable-segment-as-base` to allow use of first loadable segment for calculating offset. By default first executable segment is used for calculating offset. The switch helps compatibility with unsymbolized profile generated from older tools. Differential Revision: https://reviews.llvm.org/D113727	2021-11-15 19:00:27 -08:00
Arthur Eubanks	0b5051cede	[llvm-reduce] Don't reuse SmallVector across calls to getAllMetadata() The SmallVector is not cleared in calls to getAllMetadata(). Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D113808	2021-11-15 14:53:48 -08:00
Steven Wan	351870720f	[AIX][llvm-go] AIX linker does not recognize `-rpath` The existing logic adds `-rpath` to CGO_LDFLAGS, which is not a valid linker option on AIX. This patch substitutes it with `-blibpath` on AIX. Reviewed By: daltenty Differential Revision: https://reviews.llvm.org/D113704	2021-11-15 13:13:08 -05:00
Lang Hames	55751f5f63	[llvm-jitlink] Add an explicit -debugger-support option. Commit `69be352a19` restricted the MachO debugger support testcase to run on Darwin only, but we still need to disable debugger support by default for other noexec tests. This patch introduces a -debugger-support option to llvm-jitlink that is on-by-default when executing code, and off-by-default for noexec tests. This should prevent regression tests from trying (and failing) to set up MachO debugging support when running on non-Darwin platforms. to explicitly enable/disable support.	2021-11-14 15:46:00 -08:00
Lang Hames	69be352a19	Reapply "[ORC] Initial MachO debugging support (via GDB JIT debug.." with fixes. This reapplies `e1933a0488` (which was reverted in `f55ba3525e` due to bot failures, e.g. https://lab.llvm.org/buildbot/#/builders/117/builds/2768). The bot failures were due to a missing symbol error: We use the input object's mangling to decide how to mangle the debug-info registration function name. This caused lookup of the registration function to fail when the input object mangling didn't match the host mangling. Disbaling the test on non-Darwin platforms is the easiest short-term solution. I have filed https://llvm.org/PR52503 with a proposed longer term solution.	2021-11-14 14:44:07 -08:00
Florian Hahn	4081df43b6	[llvm-reduce] Remove unnecessary loop. After `cd8aa234fd`, there's no need to collect a vector of basic blocks to keep first. Remove the first loop.	2021-11-14 21:03:21 +00:00
Lang Hames	f55ba3525e	Revert "[ORC] Initial MachO debugging support (via GDB JIT debug..." This reverts commit `e1933a0488` until I can look into bot failures.	2021-11-14 00:14:39 -08:00
Lang Hames	e1933a0488	[ORC] Initial MachO debugging support (via GDB JIT debug registration interface) This commit adds a new plugin, GDBJITDebugInfoRegistrationPlugin, that checks for objects containing debug info and registers any debug info found via the GDB JIT registration API. To enable this registration without redundantly representing non-debug sections this plugin synthesizes a new embedded object within a section of the LinkGraph. An allocation action is used to make the registration call. Currently MachO only. ELF users can still use the DebugObjectManagerPlugin. The two are likely to be merged in the near future.	2021-11-13 13:21:01 -08:00
Duncan P. N. Exon Smith	75c86c9935	Support: Make VarStreamArrayIterator iterate over const values VarStreamArrayIterator returns a reference to a just-computed internal value. Change it to iterate over `const ValueType` to avoid allowing clients to mutate the internal state, and to drop the non-`const`-qualified operator(). The removed operator() was from `175d70ee5c` to get iterator_facade_base::operator->() working, and this fixes the root cause instead: setting `T` to `const ValueType` causes iterator_facade_base to infer `PointerT` as `const ValueType*`. Ironically, this is the last blocker for removing the const-incorrect overload of `iterator_facade_base::operator->()`, whose presence triggered adding the workaround in the first place :). Differential Revision: https://reviews.llvm.org/D113797	2021-11-12 20:37:36 -08:00
Keith Smiley	47bb456b2f	[llvm-obcopy][MachO] Add error for MH_PRELOAD Previously this would crash. Fixes https://bugs.llvm.org/show_bug.cgi?id=51877 Differential Revision: https://reviews.llvm.org/D113819	2021-11-12 19:18:34 -08:00
wlei	aab1810006	[llvm-profgen] Fix bug of setting function entry Previously we set `isFuncEntry` flag to true when the funcName from DWARF is equal to the name in symbol table and we use this flag to ignore reporting callsite sample that's from an intra func branch. However, in HHVM, it appears that the symbol table name is inconsistent with the dwarf info func name, it's likely due to `OptimizeGlobalAliases`. This change is a workaround in llvm-profgen side to mark the only one range as the function entry and add warnings for the remaining inconsistence. This also fixed a missing `getCanonicalFnName` for symbol name which caused the mismatching as well. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D113492	2021-11-12 12:18:43 -08:00
Lang Hames	3fb641618f	[ORC-RT][llvm-jitlink] Fix a buggy check in ORC-RT MachO TLV deregistration. The check was failing because it was matching against the end of the range, not the start. This bug wasn't causing the ORC-RT MachO TLV regression test to fail because we were only logging deallocation errors (including TLV deregistration errors) and not actually returning a failure code. This commit updates llvm-jitlink to report the errors properly.	2021-11-12 10:36:17 -08:00
Tomasz Miąsko	c3e07df607	[llvm-nm] Demangle Rust symbols Add support for demangling Rust v0 symbols to llvm-nm by reusing nonMicrosoftDemangle which supports both Itanium and Rust mangling. Reviewed By: dblaikie, jhenderson Differential Revision: https://reviews.llvm.org/D111937	2021-11-12 12:46:59 +01:00
Arthur Eubanks	87687b4ff7	[llvm-reduce] Fix build after D113537 Forgot to amend D113537 with these changes before committing.	2021-11-11 18:53:34 -08:00
Arthur Eubanks	6f288bd772	[llvm-reduce] Count chunks by running a preliminary reduction Having a separate counting method runs the risk of a mismatch between the actual reduction method and the counting method. Instead, create an Oracle that always returns true for shouldKeep(), run the reduction, and count how many times shouldKeep() was called. The module should not be modified if shouldKeep() always returns true. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D113537	2021-11-11 18:46:09 -08:00
Arthur Eubanks	be0b47d530	[llvm-reduce] Skip replacing metadata and callee operands Metadata operands tend to require special conditions, especially on dbg intrinsics. We also don't have a zero value for metadata. Replacing callee operands is a little weird, since calling undef/null doesn't make sense. It also causes tons of invalid reductions when reducing calls to intrinsics since only arguments to intrinsics can be of the metadata type. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D113532	2021-11-11 18:42:16 -08:00
Michael Kruse	c15f930e96	[llvm-reduce] Introduce operands-skip pass. Add a new "operands-skip" pass whose goal is to remove instructions in the middle of dependency chains. For instance: ``` %baseptr = alloca i32 %arrayidx = getelementptr i32, i32* %baseptr, i32 %idxprom store i32 42, i32* %arrayidx ``` might be reducible to ``` %baseptr = alloca i32 %arrayidx = getelementptr ... ; now dead, together with the computation of %idxprom store i32 42, i32* %baseptr ``` Other passes would either replace `%baseptr` with undef (operands, instructions) or move it to become a function argument (operands-to-args), both of which might fail the interestingness check. In principle the implementation allows operand replacement with any value or instruction in the function that passes the filter constraints (same type, dominance, "more reduced"), but is limited in this patch to values that are directly or indirectly used to compute the current operand value, motivated by the example above. Additionally, function arguments are added to the candidate set which helps reducing the number of relevant arguments mitigating a concern of too many arguments mentioned in https://reviews.llvm.org/D110274#3025013. Possible future extensions: * Instead of requiring the same type, bitcast/trunc/zext could be automatically inserted for some more flexibility. * If undef is added to the candidate set, "operands-skip"is able to produce any reduction that "operands" can do. Additional candidates might be zero and one, where the "reductive power" classification can prefer one over the other. If undefined behaviour should not be introduced, undef can be removed from the candidate set. Recommit after resolving conflict with D112651 and reusing shouldReduceOperand from D113532. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D111818	2021-11-11 20:16:34 -06:00
Michael Kruse	ed7b37155b	Revert "[llvm-reduce] Introduce operands-skip pass." This reverts commit `fa4210a9a0`. It causes compile failures, presumably because conflicting with another patch landed after I checked locally.	2021-11-11 19:25:39 -06:00
Michael Kruse	fa4210a9a0	[llvm-reduce] Introduce operands-skip pass. Add a new "operands-skip" pass whose goal is to remove instructions in the middle of dependency chains. For instance: ``` %baseptr = alloca i32 %arrayidx = getelementptr i32, i32* %baseptr, i32 %idxprom store i32 42, i32* %arrayidx ``` might be reducible to ``` %baseptr = alloca i32 %arrayidx = getelementptr ... ; now dead, together with the computation of %idxprom store i32 42, i32* %baseptr ``` Other passes would either replace `%baseptr` with undef (operands, instructions) or move it to become a function argument (operands-to-args), both of which might fail the interestingness check. In principle the implementation allows operand replacement with any value or instruction in the function that passes the filter constraints (same type, dominance, "more reduced"), but is limited in this patch to values that are directly or indirectly used to compute the current operand value, motivated by the example above. Additionally, function arguments are added to the candidate set which helps reducing the number of relevant arguments mitigating a concern of too many arguments mentioned in https://reviews.llvm.org/D110274#3025013. Possible future extensions: * Instead of requiring the same type, bitcast/trunc/zext could be automatically inserted for some more flexibility. * If undef is added to the candidate set, "operands-skip"is able to produce any reduction that "operands" can do. Additional candidates might be zero and one, where the "reductive power" classification can prefer one over the other. If undefined behaviour should not be introduced, undef can be removed from the candidate set. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D111818	2021-11-11 18:54:01 -06:00
Florian Hahn	cd8aa234fd	[llvm-reduce] Use DenseSet instead of std::set (NFC). When reducing functions with very large basic blocks (~ almost 1 million BBs), the majority of time is spent maintaining the order in the std::set for the basic blocks to keep. In those cases, DenseSet<> is much more efficient. Use it instead.	2021-11-10 13:56:22 +00:00
Martin Storsjö	91350eb151	[llvm-objdump] Remove a trailing semicolon, fixing GCC warnings. NFC.	2021-11-10 09:39:47 +02:00
Arthur Eubanks	b394ba5d7f	[llvm-reduce] Print extra newline when encountering unknown pass	2021-11-09 15:20:16 -08:00
Luís Ferreira	9af467ed8b	[Tools] Add a fuzzing tool to help fuzzing D demangler This patch adds a fuzzing helper tool for D demangler by feeding the demangler API with pseudo-random null terminated strings with the help of libfuzzer heuristics. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D111432	2021-11-09 12:45:25 -08:00
Dwight Guth	16c3db8def	[llvm-reduce] Fix invalid reduction in basic-blocks delta pass Previously, if the basic-blocks delta pass tried to remove a basic block that was the last basic block in a function that did not have external or weak linkage, the resulting IR would become invalid. Since removing the last basic block in a function is effectively identical to removing the function body itself, we check explicitly for this case and if we detect it, we run the same logic as in ReduceFunctionBodies.cpp Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D113486	2021-11-09 10:43:38 -08:00
Dwight Guth	fbfd327fdf	[llvm-reduce] Add flag to start at finer granularity Sometimes if llvm-reduce is interrupted in the middle of a delta pass on a large file, it can take quite some time for the tool to start actually doing new work if it is restarted again on the partially-reduced file. A lot of time ends up being spent testing large chunks when these large chunks are very unlikely to actually pass the interestingness test. In cases like this, the tool will complete faster if the starting granularity is reduced to a finer amount. Thus, we introduce a command line flag that automatically divides the chunks into smaller subsets a fixed, user-specified number of times prior to beginning the core loop. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D112651	2021-11-09 10:14:08 -08:00
Fangrui Song	5f1e509579	[llvm-objdump] -p: Dump PE header for PE/COFF For a trivial DLL built with `clang --target=x86_64-windows -O2 -c a.c; lld-link -subsystem:console -dll a.o -out:a.dll`, `objdump -p` vs `llvm-objdump -p`: ``` -a.dll: file format pei-x86-64 - +a.dll: file format coff-x86-64 Characteristics 0x2022 executable large address aware @@ -57,4 +56,4 @@ Entry d 0000000000000000 00000000 Delay Import Directory Entry e 0000000000000000 00000000 CLR Runtime Header Entry f 0000000000000000 00000000 Reserved - +Export Table: ``` For a Linux image (`vmlinuz-5.10.76-gentoo-r1`) built with `CONFIG_EFI_STUB=y` ``` -vmlinuz-5.10.76-gentoo-r1: file format pei-x86-64 - -Characteristics 0x20e +vmlinuz-5.10.76-gentoo-r1: file format coff-x86-64 +Characteristics 0x206 executable line numbers stripped - symbols stripped debugging information removed Time/Date Wed Dec 31 16:00:00 1969 @@ -55,10 +53,4 @@ Entry d 0000000000000000 00000000 Delay Import Directory Entry e 0000000000000000 00000000 CLR Runtime Header Entry f 0000000000000000 00000000 Reserved - - -PE File Base Relocations (interpreted .reloc section contents) - -Virtual Address: 000037ca Chunk size 10 (0xa) Number of fixups 1 - reloc 0 offset 0 [37ca] ABSOLUTE - +Export Table: ``` `symbols stripped` looks like a GNU objdump problem. Reviewed By: jhenderson, alexander-shaposhnikov Differential Revision: https://reviews.llvm.org/D113356	2021-11-09 10:08:41 -08:00
Paul Robinson	38be8f4057	Add llvm-tli-checker A new tool that compares TargetLibraryInfo's opinion of the availability of library function calls against the functions actually exported by a specified set of libraries. Can be helpful in verifying the correctness of TLI for a given target, and avoid mishaps such as had to be addressed in D107509 and `94b4598d`. The tool currently supports ELF object files only, although it's unlikely to be hard to add support for other formats. Re-commits `62dd488` with changes to use pre-generated objects, as not all bots have ld.lld available. Differential Revision: https://reviews.llvm.org/D111358	2021-11-08 16:29:28 -08:00
Paul Robinson	1297c21406	Revert "Add llvm-tli-checker" Not all bots have ld.lld available. This reverts commit `62dd488164`.	2021-11-08 15:48:29 -08:00
Jessica Clarke	a9a510f217	[bugpoint] Fix repeated off-by-one error in debug output This resulted in the final argument being dropped from the output, which can be rather important.	2021-11-08 23:44:45 +00:00
Paul Robinson	62dd488164	Add llvm-tli-checker A new tool that compares TargetLibraryInfo's opinion of the availability of library function calls against the functions actually exported by a specified set of libraries. Can be helpful in verifying the correctness of TLI for a given target, and avoid mishaps such as had to be addressed in D107509 and `94b4598d`. The tool currently supports ELF object files only, although it's unlikely to be hard to add support for other formats. Differential Revision: https://reviews.llvm.org/D111358	2021-11-08 14:59:13 -08:00
Adrian Prantl	fae440974a	Attempt to work around type checking error on older compilers	2021-11-08 12:42:33 -08:00
Adrian Prantl	8bd8dd16e2	Extend obj2yaml to optionally preserve raw __LINKEDIT/__DATA segments. I am planning to upstream MachOObjectFile code to support Darwin chained fixups. In order to test the new parser features we need a way to produce correct (and incorrect) chained fixups. Right now the only tool that can produce them is the Darwin linker. To avoid having to check in binary files, this patch allows obj2yaml to print a hexdump of the raw LINKEDIT and DATA segment, which both allows to bootstrap the parser and enables us to easily create malformed inputs to test error handling in the parser. This patch adds two new options to obj2yaml: -raw-data-segment -raw-linkedit-segment Differential Revision: https://reviews.llvm.org/D113234	2021-11-08 11:30:12 -08:00
Roger Kim	1658980a1c	[NFC][llvm-libtool-darwin] Clean up names Removing unclear abbreviations. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D113215	2021-11-08 10:33:59 -08:00
Roger Kim	c51f947a13	[NFC][llvm-libtool-darwin] Remove unnecessary conditionals around errors The existing code has unnecessary logic to indirectly pass errors through function calls. This diff gets rid of the fluff. Test Plan: Existing unit tests Reviewed By: jhenderson, drodriguez, alexander-shaposhnikov Differential Revision: https://reviews.llvm.org/D113301	2021-11-08 10:33:52 -08:00
Zarko Todorovski	c4396b77ae	[LLVM][llvm-cfi] Inclusive language: replace uses of blacklist with ignorelist Replace the description and file names for this argument. As far as I understand this is a positional argument and I don't believe this changes breaks any existing interfaces. Reviewed By: hctim, MaskRay Differential Revision: https://reviews.llvm.org/D113316	2021-11-08 10:05:52 -05:00
Esme-Yi	9b6f264d2b	[XCOFF][llvm-readobj] improve the relocation output. Summary: 1. implemented the unexpanded relocations output. 2. modified the expanded output format to align. Reviewed By: shchenz, jhenderson Differential Revision: https://reviews.llvm.org/D111700	2021-11-08 03:15:52 +00:00
Fangrui Song	859a6d973f	[llvm-objdump] Remove untested diagnostic "missing data dir for TLS table"	2021-11-06 11:18:29 -07:00
Kazu Hirata	87e53a0ad8	[llvm] Use make_early_inc_range (NFC)	2021-11-05 19:39:07 -07:00
wlei	5bf191a381	[llvm-profgen] Fix index out of bounds error while using ip.advance Previously we assume there're some non-executing sections at the bottom of the text section so that we won't hit the array's bound. But on BOLTed binary, it turned out .bolt section is at the bottom of text section which can be profiled, then it crash llvm-profgen. This change try to fix it. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D113238	2021-11-05 18:38:40 -07:00
Fangrui Song	26a8ceba3e	[llvm-readobj] Display DT_RELRSZ/DT_RELRENT as " (bytes)" to match RELSZ/RELENT. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D113206	2021-11-05 10:02:49 -07:00
Roman Lebedev	7a98761d74	[NFC] Move CombinationGenerator from Exegesis to ADT Reviewed By: courbet Differential Revision: https://reviews.llvm.org/D113213	2021-11-05 16:53:46 +03:00
Quinn Pham	c71fbdd87b	[NFC] Inclusive language: Remove instances of master in URLs [NFC] This patch fixes URLs containing "master". Old URLs were either broken or redirecting to the new URL. Reviewed By: #libc, ldionne, mehdi_amini Differential Revision: https://reviews.llvm.org/D113186	2021-11-05 08:48:41 -05:00
Arthur Eubanks	13317286f8	[NewPM] Use the default AA pipeline by default We almost always want to use the default AA pipeline. It's very easy for users of PassBuilder to forget to customize the AAManager to use the default AA pipeline (for example, the NewPM C API forgets to do this). If somebody wants a custom AA pipeline, similar to what is being done now with the default AA pipeline registration, they can FAM.registerPass([&] { return std::move(MyAA); }); before calling PB.registerFunctionAnalyses(FAM); For example, LTOBackend.cpp and NewPMDriver.cpp do this. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D113210	2021-11-04 15:10:34 -07:00
Ben Langmuir	a2639dcbe6	[ORC] Add a utility for adding missing "self" relocations to a Symbol If a tool wants to introduce new indirections via stubs at link-time in ORC, it can cause fidelity issues around the address of the function if some references to the function do not have relocations. This is known to happen inside the body of the function itself on x86_64 for example, where a PC-relative address is formed, but without a relocation. ``` _foo: leaq -7(%rip), %rax ## form pointer to '_foo' without relocation _bar: leaq (%rip), %rax ## uses X86_64_RELOC_SIGNED to '_foo' ``` The consequence of introducing a stub for such a function at link time is that if it forms a pointer to itself without relocation, it will not have the same value as a pointer from outside the function. If the function pointer is used as a key, this can cause problems. This utility provides best-effort support for adding such missing relocations using MCDisassembler and MCInstrAnalysis to identify the problematic instructions. Currently it is only implemented for x86_64. Note: the related issue with call/jump instructions is not handled here, only forming function pointers. rdar://83514317 Differential revision: https://reviews.llvm.org/D113038	2021-11-04 15:01:05 -07:00
Noah Shutty	d788c44f5c	[Support] Improve Caching conformance with Support library behavior This diff makes several amendments to the local file caching mechanism which was migrated from ThinLTO to Support in rGe678c51177102845c93529d457b020f969125373 in response to follow-up discussion on that commit. Patch By: noajshu Differential Revision: https://reviews.llvm.org/D113080	2021-11-04 13:00:44 -07:00
Rahman Lavaee	f533ec37eb	Make the BBAddrMap struct binary-format-agnostic. The only binary-format-related field in the BBAddrMap structure is the function address (`Addr`), which will use uint64_t in 64B format and uint32_t in 32B format. This patch changes it to use uint64_t in both formats. This allows non-templated use of the struct, at the expense of a marginal additional size overhead for the 32-bit format. The size of the BB address map section does not change. Differential Revision: https://reviews.llvm.org/D112679	2021-11-04 10:27:24 -07:00
gbreynoo	ced9287c2d	[llvm-objdump] Fix the Assertion failure when providing invalid --debug-vars or --dwarf values As seen in https://bugs.llvm.org/show_bug.cgi?id=52213 llvm-objdump asserts if either the --debug-vars or the --dwarf options are provided with invalid values. As suggested, this fix adds use of a default value to these options and errors when given bad input. Differential Revision: https://reviews.llvm.org/D112183	2021-11-04 11:01:32 +00:00
Jakub Kuderski	3348b841d3	Make enum iteration with seq safe by default By default `llvm::seq` would happily iterate over enums, which may be unsafe if the enum values are not continuous. This patch disable enum iteration with `llvm::seq` and `llvm::seq_inclusive` and adds two new functions: `enum_seq` and `enum_seq_inclusive`. To make sure enum iteration is safe, we require users to declare their enum types as iterable by specializing `enum_iteration_traits<SomeEnum>`. Because it's not always possible to add these traits next to enum definition (e.g., for enums defined in external libraries), we provide an escape hatch to allow iteration on per-callsite basis by passing `force_iteration_on_noniterable_enum`. The main benefit of this approach is that these global declarations via traits can appear just next to enum definitions, making easy to spot when enums are miss-labeled, e.g., after introducing new enum values, whereas `force_iteration_on_noniterable_enum` should stand out and be easy to grep for. This emerged from a discussion with gchatelet@ about reusing llvm's `Sequence.h` in lieu of https://github.com/GPUOpen-Drivers/llpc/blob/dev/lgc/interface/lgc/EnumIterator.h. Reviewed By: dblaikie, gchatelet, aaron.ballman Differential Revision: https://reviews.llvm.org/D107378	2021-11-03 20:52:21 -04:00
Kirill Stoimenov	a55c4ec1ce	[ASan] Process functions in Asan module pass This came up as recommendation while reviewing D112098. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D112732	2021-11-03 20:27:53 +00:00
Vitaly Buka	3131714f8d	[NFC][asan] Use AddressSanitizerOptions in ModuleAddressSanitizerPass Reviewed By: kstoimenov Differential Revision: https://reviews.llvm.org/D113072	2021-11-03 11:32:14 -07:00
Kirill Stoimenov	b3145323b5	Revert "[ASan] Process functions in Asan module pass" This reverts commit `76ea87b94e`. Reviewed By: kstoimenov Differential Revision: https://reviews.llvm.org/D113129	2021-11-03 18:01:01 +00:00
Kirill Stoimenov	76ea87b94e	[ASan] Process functions in Asan module pass This came up as recommendation while reviewing D112098. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D112732	2021-11-03 17:51:01 +00:00
wlei	dc9f037955	[llvm-profgen] Refactor the code of getHashCode Refactor to generate hash code lazily. Tested on clang self build, no observable generating time regression. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D113059	2021-11-02 19:56:20 -07:00
wlei	138202a8c3	[llvm-profgen] Warn on invalid range and show warning summary Two things in this diff: 1) Warn on the invalid range, currently three types of checking, see the detailed message in the code. 2) In some situation, llvm-profgen gives lots of warnings on the truncated stacks which is noisy. This change provides a switch to `--show-detailed-warning` to skip the warnings. Alternatively, we use a summary for those warning and show the percentage of cases with those issues. Example of warning summary. ``` warning: 0.05%(1120/2428958) cases with issue: Profile context truncated due to missing probe for call instruction. warning: 0.00%(2/178637) cases with issue: Range does not belong to any functions, likely from external function. ``` Reviewed By: hoy Differential Revision: https://reviews.llvm.org/D111902	2021-11-02 19:55:55 -07:00
Med Ismail Bennani	797b50d4be	Revert "Use `GNUInstallDirs` to support custom installation dirs. -- LLVM" This reverts commit `6fd2db04d0` since it broke GreenDragon LLDB-Incremental bot: https://green.lab.llvm.org/green/job/lldb-cmake/37560/console Signed-off-by: Med Ismail Bennani <medismail.bennani@gmail.com>	2021-11-02 19:11:44 +01:00
Arthur Eubanks	e2024d72fa	Revert "[NFC] Remove LinkAll*.h" This reverts commit `fe364e5dc7`. Causes breakages, e.g. https://lab.llvm.org/buildbot/#/builders/188/builds/5266	2021-11-02 09:08:09 -07:00
Arthur Eubanks	f54a8759f0	[llvm-reduce] Reduce more GlobalValue properties Reviewed By: hans Differential Revision: https://reviews.llvm.org/D112885	2021-11-02 08:47:41 -07:00
Arthur Eubanks	80ba72b07b	[llvm-reduce] Reduce some GlobalObject properties Specifically, the section and the alignment. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D112884	2021-11-02 08:47:32 -07:00
Arthur Eubanks	fe364e5dc7	[NFC] Remove LinkAll*.h These were added to prevent functions from being removed by WPO. But that doesn't make sense, correct WPO will not remove functions we actually use. I noticed these because compiling cc1_main.cpp was pulling in random LLVM pass headers. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D112971	2021-11-02 08:43:17 -07:00
John Ericson	6fd2db04d0	Use `GNUInstallDirs` to support custom installation dirs. -- LLVM This is a new draft of D28234. I previously did the unorthodox thing of pushing to it when I wasn't the original author, but since this version - Uses `GNUInstallDirs`, rather than mimics it, as the original author was hesitant to do but others requested. - Is much broader, effecting many more projects than LLVM itself. I figured it was time to make a new revision. I am using this patch (and many back-ports) as the basis of https://github.com/NixOS/nixpkgs/pull/111487 for my distro (NixOS). It looked like people were generally on board in D28234, but I make note of this here in case extra motivation is useful. --- As pointed out in the original issue, a central tension is that LLVM already has some partial support for these sorts of things. For example `LLVM_LIBDIR_SUFFIX`, or `COMPILER_RT_INSTALL_PATH`. Because it's not quite clear yet what to do about those, we are holding off on changing libdirs and `compiler-rt`. for this initial PR. --- On the advice of @lebedev.ri, I am splitting this up a bit per subproject, starting with LLVM. To allow it to be more easily reviewed. This and the subsequent patch must be landed together, as this will not build alone. But the rest can be landed on their own. Reviewed By: compnerd Differential Revision: https://reviews.llvm.org/D100810	2021-11-02 10:23:30 -04:00
Frederic Cambus	650311737e	[llvm-readobj] Add support for reading OpenBSD ELF core notes. Notes generated in OpenBSD core files provide additional information about the kernel state and CPU registers. These notes are described in core.5, which can be viewed here: https://man.openbsd.org/core.5 Differential Revision: https://reviews.llvm.org/D111966	2021-11-02 10:18:54 +01:00
Markus Lavin	fd41738e2c	Recommit "[llvm-reduce] Add MIR support" (Second try. Need to link against CodeGen and MC libs.) The llvm-reduce tool has been extended to operate on MIR (import, clone and export). Current limitation is that only a single machine function is supported. A single reducer pass that operates on machine instructions (while on SSA-form) has been added. Additional MIR specific reducer passes can be added later as needed. Differential Revision: https://reviews.llvm.org/D110527	2021-11-02 10:16:42 +01:00
Markus Lavin	aee7f3384b	Revert "[llvm-reduce] Add MIR support" This reverts commit `bc2773cb1b`. Broke the clang-ppc64le-linux-multistage build. Reverting while I investigate.	2021-11-02 09:41:02 +01:00
Markus Lavin	bc2773cb1b	[llvm-reduce] Add MIR support The llvm-reduce tool has been extended to operate on MIR (import, clone and export). Current limitation is that only a single machine function is supported. A single reducer pass that operates on machine instructions (while on SSA-form) has been added. Additional MIR specific reducer passes can be added later as needed. Differential Revision: https://reviews.llvm.org/D110527	2021-11-02 09:14:56 +01:00
wlei	3f3103c6a9	[llvm-profgen] Fill zero count for all function ranges Allow filling zero count for all the function ranges even there is no samples hitting that function. Add a switch for this. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D112858	2021-11-01 09:57:05 -07:00
Yi Kong	c060457ec6	Revert "[opt-viewer] Use safe yaml load_all" This reverts commit `1123e03a9d`. Broken on the AIX platform.	2021-11-01 17:18:49 +08:00
Kazu Hirata	4db2e4cebe	Use {DenseSet,SetVector,SmallPtrSet}::contains (NFC)	2021-10-30 19:00:19 -07:00
Duncan P. N. Exon Smith	9902362701	Support: Use sys::path::is_style_{posix,windows}() in a few places Use the new sys::path::is_style_posix() and is_style_windows() in a few places that need to detect the system's native path style. In llvm/lib/Support/Path.cpp, this patch removes most uses of the private `real_style()`, where is_style_posix() and is_style_windows() are just a little tidier. Elsewhere, this removes `_WIN32` macro checks. Added a FIXME to a FileManagerTest that seemed fishy, but maintained the existing behaviour. Differential Revision: https://reviews.llvm.org/D112289	2021-10-29 12:09:41 -07:00
wlei	f5537643b8	[llvm-profgen] Update total samples by accumulating all its body samples Like probe-based profile, the total samples is the sum of all its body samples. This patch fix it by a post-processing update for the line-number based profile. Tested it on our internal services, results showed no performance change. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D112672	2021-10-29 10:36:57 -07:00
Kazu Hirata	3b285ff517	[llvm-profgen] Fix a set-but-unused warning This patch fixes: llvm/tools/llvm-profgen/ProfiledBinary.cpp:357:12: error: variable 'EndOffset' set but not used [-Werror,-Wunused-but-set-variable] The last use of the variable was removed on Oct 26 in commit `40ca411251`.	2021-10-29 10:19:44 -07:00
Dwight Guth	2f16173627	[llvm-reduce] optimize extractFromModule functions The extractBasicBlocksFromModule, extractInstrFromModule, and other similar functions previously performed very poorly when the number of such elements in the program to reduce was very high. Previously, we were creating the set which caches elements to keep by looping through all elements in the module and adding them to the set. However, since std::set is an ordered set, this introduces a massive amount of rebalancing if the order of elements in the program and the order of their pointers in memory are not the same. The solution is straightforward: first put all the elements to be kept in a vector, then use the constructor for std::set which takes a pair of iterators over a collection. This constructor is optimized to avoid doing unnecessary work when initializing large sets. Also in this change, we pass BBsToKeep set to functions replaceBranchTerminator and removeUninterestingBBsFromSwitch as a const reference rather than passing it by value. This ought to prevent the need to copy the collection each time these functions are called, which is expensive if the collection is large. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D112757	2021-10-29 10:06:26 -07:00
wlei	2f8196db92	[llvm-profgen] Fix bug of populating profile symbol list Previous implementation of populating profile symbol list is wrong, it only included the profiled symbols. Actually it should use all symbols, here this switches to use the symbols from debug info. Also turned the flag off by default. Reviewed By: wenlei, hoy Differential Revision: https://reviews.llvm.org/D111824	2021-10-29 09:59:12 -07:00
wlei	40ca411251	[llvm-profgen] Switch to DWARF-based symbol and ranges It happened a bug that some callsite name in the profile is not a real function, it turned out that there're some non-function symbol from the ELF text section, e.g. the global accessible branch label and also recalled that we can have one function being split into multiple ranges. We shouldn't count samples for those are not the entry of the real function. So this change tried to fix this issue by switching to use the name or ranges from DWARF-based debug info, the range of which assure it's the real function start. For the split functions, we assume that the real entry function's DWARF name should always match the symbol table name. The switching is also consistent with the body samples' symbol which is from DWARF. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D112282	2021-10-29 09:59:12 -07:00
Arthur Eubanks	177a703710	[llvm-reduce] Actually skip invalid candidates in operands-to-args This was checked while counting but not actually when doing the reduction, resulting in crashes. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D112766	2021-10-29 09:14:18 -07:00
Daniel Rodríguez Troitiño	8fbe1e7602	[llvm-objcopy] Fix misaligned access to load command data. It seems that llvm-objcopy stores data temporarily misaligned with the requirements of the underlaying struct from libBinaryFormat, and UBSan generates a runtime error. Instead of trying to reinterpret the memory as the struct itself, simply access the `char *` pointer that we are interested in, and that do not have alignment restrictions. This problem was pointed out in a comment of D111164. Differential Revision: https://reviews.llvm.org/D112744	2021-10-28 22:14:39 -07:00
Hongtao Yu	259e4c5658	[CSSPGO] Trim cold base profiles for the CS preinliner. Adding support to the CS preinliner to trim cold base profiles. This makes trimming consistent with the inline decision made by the preinliner. Also disable the existing profile merger when preinliner is on unless explicitly specified. Reviewed By: wenlei, wlei Differential Revision: https://reviews.llvm.org/D112489	2021-10-27 22:50:27 -07:00
Nuri Amari	a299b24712	Regenerate LC_CODE_SIGNATURE during llvm-objcopy operations Context: This is a second attempt at introducing signature regeneration to llvm-objcopy. In this diff: https://reviews.llvm.org/D109840, a script was introduced to test the validity of a code signature. In this diff: https://reviews.llvm.org/D109803 (now reverted), an effort was made to extract the signature generation behavior out of LLD into a common location for use in llvm-objcopy. In this diff: https://reviews.llvm.org/D109972 it was decided that there was no appropriate common location and that a small amount of duplication to bring signature generation to llvm-objcopy would be better. This diff introduces this duplication. Summary Prior to this change, if a LC_CODE_SIGNATURE load command was included in the binary passed to llvm-objcopy, the command and associated section were simply copied and included verbatim in the new binary. If rest of the binary was modified at all, this results in an invalid Mach-O file. This change regenerates the signature rather than copying it. The code_signature_lc.test test was modified to include the yaml representation of a small signed MachO executable in order to effectively test the signature generation. Reviewed By: alexander-shaposhnikov, #lld-macho Differential Revision: https://reviews.llvm.org/D111164	2021-10-26 14:51:13 -07:00
zhijian	158083f0de	[AIX][XCOFF] parsing xcoff object file auxiliary header Summary: The patch supports parsing the xcoff object file auxiliary header with llvm-readobj with option "auxiliary-headers" the format of auxiliary header as https://www.ibm.com/support/knowledgecenter/en/ssw_aix_72/filesreference/XCOFF.html#XCOFF__fyovh386shar Reviewers: James Henderson, Jason Liu, Hubert Tong, Esme yi, Sean Fertile. Differential Revision: https://reviews.llvm.org/D82549	2021-10-26 10:40:25 -04:00
wlei	a5f411b7f8	[llvm-profgen] Allow unsymbolized profile as perf input This change allows the unsymbolized profile as input. The unsymbolized profile is created by `llvm-profgen` with `--skip-symbolization` and it's after the sample aggregation but before symbolization , so it has much small file size. It can be used for sample merging and trimming, also is useful for debugging or adding test cases. A switch `--unsymbolized-profile=file-patch` is added for this. Format of unsymbolized profile: ``` [context stack1] # If it's a CS profile number of entries in RangeCounter from_1-to_1:count_1 from_2-to_2:count_2 ...... from_n-to_n:count_n number of entries in BranchCounter src_1->dst_1:count_1 src_2->dst_2:count_2 ...... src_n->dst_n:count_n [context stack2] ...... ``` Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D111750	2021-10-25 23:58:08 -07:00
Kazu Hirata	d8e4170b0a	Ensure newlines at the end of files (NFC)	2021-10-23 08:45:29 -07:00
Kazu Hirata	4e3eebc6bd	[tools, utils] Use StringRef::contains (NFC)	2021-10-22 17:22:13 -07:00
Florian Hahn	d465315679	[LLVM-C]Add LLVMAddMetadataToInst, deprecated LLVMSetInstDebugLocation. IRBuilder has been updated to support preserving metdata in a more general manner. This patch adds `LLVMAddMetadataToInst` and deprecates `LLVMSetInstDebugLocation` in favor of the more general function. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D93454	2021-10-22 11:21:28 +01:00
Yi Kong	1123e03a9d	[opt-viewer] Use safe yaml load_all Differential Revision: https://reviews.llvm.org/D112075	2021-10-21 14:00:03 +08:00
Wenlei He	e8c245dcd3	[llvm-profgen] Skip duplication factor outside of body sample computation We incorrectly use duplication factor for total samples even though we already accumulate samples instead of taking MAX. It causes profile to have bloated total samples for functions with loop unrolled or vectorized. The change fix the issue for total sample, head sample and call target samples. Differential Revision: https://reviews.llvm.org/D112042	2021-10-19 23:10:45 -07:00
Arthur Eubanks	9660563950	[llvm-reduce] Add reduction passes to reduce operands to undef/1/0 Having non-undef constants in a final llvm-reduce output is nicer than having undefs. This splits the existing reduce-operands pass into three, one which does the same as the current pass of reducing to undef, and two more to reduce to the constant 1 and the constant 0. Do not reduce to undef if the operand is a ConstantData, and do not reduce 0s to 1s. Reducing GEP operands very frequently causes invalid IR (since types may not match up if we index differently into a struct), so don't touch GEPs. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D111765	2021-10-19 15:25:21 -07:00
Lasse Folger	134e1817f6	[lldb] change name demangling to be consistent between windows and linx When printing names in lldb on windows these names contain the full type information while on linux only the name is contained. This change introduces a flag in the Microsoft demangler to control if the type information should be included. With the flag enabled demangled name contains only the qualified name, e.g: without flag -> with flag int (array2d)[10] -> array2d int (abc::array2d)[10] -> abc::array2d const int *x -> x For globals there is a second inconsistency which is not yet addressed by this change. On linux globals (in global namespace) are prefixed with :: while on windows they are not. Reviewed By: teemperor, rnk Differential Revision: https://reviews.llvm.org/D111715	2021-10-19 12:04:37 +02:00
Qiaojin.Bao	cf65271e46	[llvm-shlib] Fix windows build failed while llvm non-standalone building. While build llvm-project as a sub-project on windows, met a build error: libllvm-c.exports /llvm/bin\llvm-nm.exe: error: ...builds/rel64ninja/./lib/LLVMDemangle.lib: no such file or directory The libllvm-c.exports, libllvm-c.args, and lib/*.lib should under LLVM_BINARY_DIR, using CMAKE_BINARY_DIR will cause 'no such file' error while llvm-project built as a sub-project.	2021-10-19 09:10:11 +01:00
Fangrui Song	8189c4eee7	[tools] Delete redundant 'static' from namespace scope 'static const'. NFC	2021-10-18 22:38:42 -07:00
Fangrui Song	b68bf98c0a	[llvm-readobj] Delete redundant 'static' from namespace scope 'static const'. NFC By default, such a non-template variable of non-volatile const-qualified type having namespace-scope has internal linkage ([basic.link]), so no need for `static`.	2021-10-18 22:21:54 -07:00
Noah Shutty	e678c51177	[Support][ThinLTO] Move ThinLTO caching to LLVM Support library We would like to move ThinLTO’s battle-tested file caching mechanism to the LLVM Support library so that we can use it elsewhere in LLVM. Patch By: noajshu Differential Revision: https://reviews.llvm.org/D111371	2021-10-18 18:57:25 -07:00
Arthur Eubanks	15fefcb9eb	[opt] Directly translate -O# to -passes='default<O#>' Right now when we see -O# we add the corresponding 'default<O#>' into the list of passes to run when translating legacy -pass-name. This has the side effect of not using the default AA pipeline. Instead, treat -O# as -passes='default<O#>', but don't allow any other -passes or -pass-name. I think we can keep `opt -O#` as shorthand for `opt -passes='default<O#>` but disallow anything more than just -O#. Tests need to be updated to not use `opt -O# -pass-name`. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D112036	2021-10-18 16:48:10 -07:00
Petr Hosek	8e46e34d24	Revert "[Support][ThinLTO] Move ThinLTO caching to LLVM Support library" This reverts commit `92b8cc52bb` since it broke the gold plugin.	2021-10-18 12:24:05 -07:00
Noah Shutty	92b8cc52bb	[Support][ThinLTO] Move ThinLTO caching to LLVM Support library We would like to move ThinLTO’s battle-tested file caching mechanism to the LLVM Support library so that we can use it elsewhere in LLVM. Patch By: noajshu Differential Revision: https://reviews.llvm.org/D111371	2021-10-18 12:08:49 -07:00
Tomasz Miąsko	a3813438ae	[llvm-cxxfilt] Use nonMicrosoftDemangle for demangling NFC Reviewed By: dblaikie, jhenderson Part of https://reviews.llvm.org/D110664	2021-10-16 13:32:17 +02:00
Sam Clegg	659a08399a	[WebAssembly] Add import info to `dylink` section of shared libraries See https://github.com/WebAssembly/tool-conventions/pull/175 Differential Revision: https://reviews.llvm.org/D111345	2021-10-15 11:49:16 -07:00
gbreynoo	a64e6ecfe1	[llvm-readelf] Make -W an alias of --wide Currently -W and --wide are treated as two options as they are only included for gnu readelf compatibility and ignored. This change makes -W an alias of --wide to be consistent with other option aliases. Differential Revision: https://reviews.llvm.org/D111731	2021-10-15 16:27:53 +01:00
djtodoro	c450e47a8c	[llvm-dwarfdump] Fix unsigned overflow when calculating stats This fixes https://bugs.llvm.org/show_bug.cgi?id=51652. The idea is to bump all the stat fields to 64-bit wide unsigned integers. I've confirmed this resolves the use case for chromium. Differential Revision: https://reviews.llvm.org/D109217	2021-10-15 12:15:58 +02:00
Shao-Ce SUN	7c704c0f53	[NFC] fix a typo	2021-10-15 14:51:49 +08:00
Daniel Sanders	0a869ef3a8	[llvm-mca][timeline] Indicate output was stopped due to cycle limit. It can be a bit confusing to stop with no explanation so we should indicate when further output was prevented by the cycle limit. Differential Revision: https://reviews.llvm.org/D111753	2021-10-14 11:10:09 -07:00
Wenlei He	a316343e19	[llvm-profgen] Allow generating AutoFDO profile from CSSPGO binary Add `-use-dwarf-correlation` switch to allow llvm-profgen to generate AutoFDO profile for binaries built with CSSPGO (pseudo-probe). Differential Revision: https://reviews.llvm.org/D111776	2021-10-14 09:11:56 -07:00
wlei	30ca33eab0	[llvm-profgen] Ignore the whole trace with the leading external branch The first LBR entry can be an external branch, we should ignore the whole trace. ``` 7f7448e889e4 0x7f7448e889e4/0x7f7448e88826/P/-/-/1 0x7f7448e8899f/0x7f7448e889d8/P/-/-/4 ... ``` Reviewed By: wenlei, hoy Differential Revision: https://reviews.llvm.org/D111749	2021-10-13 16:52:29 -07:00
wlei	ab5d65e685	[llvm-profgen] Ignore stack samples before aggregation With `ignore-stack-samples`, We can ignore the call stack before the samples aggregation which could reduce some redundant computations. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D111577	2021-10-13 16:52:29 -07:00
Lang Hames	4fcc0ac15e	[ORC] Use a Setup object for SimpleRemoteEPC construction. SimpleRemoteEPC notionally allowed subclasses to override the createMemoryManager and createMemoryAccess methods to use custom objects, but could not actually be subclassed in practice (The construction process in SimpleRemoteEPC::Create could not be re-used). Instead of subclassing, this commit adds a SimpleRemoteEPC::Setup class that can be used by clients to set up the memory manager and memory access members. A default-constructed Setup object results in no change from previous behavior (EPCGeneric* memory manager and memory access objects used by default).	2021-10-13 16:47:00 -07:00
Lang Hames	92bec0e970	[llvm-jitlink] Don't use thread pool task dispatch when LLVM_ENABLE_THREADS=Off This should fix compile errors in llvm-jitlink.cpp in LLVM_ENABLE_THREADS=Off builds due to `f341161689`.	2021-10-13 10:19:55 -07:00
Michael Kruse	dd71b65ca8	[llvm-reduce] Introduce operands-to-args pass. Instead of setting operands to undef as the "operands" pass does, convert the operands to a function argument. This avoids having to introduce undef values into the IR which have some unpredictability during optimizations. For instance, define void @func() { entry: %val = add i32 32, 21 store i32 %val, i32* null ret void } is reduced to define void @func(i32 %val) { entry: %val1 = add i32 32, 21 store i32 %val, i32* null ret void } (note that the instruction %val is renamed to %val1 when printing the IR to avoid ambiguity; ideally %val1 would be removed by dce or the instruction reduction pass) Any call to @func is replaced with a call to the function with the new signature and filled with undef. This is not ideal for IPA passes, but those out-of-scope for now. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D111503	2021-10-13 09:54:03 -05:00
Lang Hames	962a2479b5	Re-apply `e50aea58d5`, "Major JITLinkMemoryManager refactor". with fixes. Adds explicit narrowing casts to JITLinkMemoryManager.cpp. Honors -slab-address option in llvm-jitlink.cpp, which was accidentally dropped in the refactor. This effectively reverts commit `6641d29b70`.	2021-10-11 21:39:00 -07:00
Lang Hames	b7c1ccd422	[llvm-jitlink] Fix a broken warning. This warning should only be issued if -slab-page-size has not been used.	2021-10-11 20:54:12 -07:00
Lang Hames	6641d29b70	Revert "[JITLink][ORC] Major JITLinkMemoryManager refactor." This reverts commit `e50aea58d5` while I investigate bot failures.	2021-10-11 19:23:41 -07:00
Lang Hames	e50aea58d5	[JITLink][ORC] Major JITLinkMemoryManager refactor. This commit substantially refactors the JITLinkMemoryManager API to: (1) add asynchronous versions of key operations, (2) give memory manager implementations full control over link graph address layout, (3) enable more efficient tracking of allocated memory, and (4) support "allocation actions" and finalize-lifetime memory. Together these changes provide a more usable API, and enable more powerful and efficient memory manager implementations. To support these changes the JITLinkMemoryManager::Allocation inner class has been split into two new classes: InFlightAllocation, and FinalizedAllocation. The allocate method returns an InFlightAllocation that tracks memory (both working and executor memory) prior to finalization. The finalize method returns a FinalizedAllocation object, and the InFlightAllocation is discarded. Breaking Allocation into InFlightAllocation and FinalizedAllocation allows InFlightAllocation subclassses to be written more naturally, and FinalizedAlloc to be implemented and used efficiently (see (3) below). In addition to the memory manager changes this commit also introduces a new MemProt type to represent memory protections (MemProt replaces use of sys::Memory::ProtectionFlags in JITLink), and a new MemDeallocPolicy type that can be used to indicate when a section should be deallocated (see (4) below). Plugin/pass writers who were using sys::Memory::ProtectionFlags will have to switch to MemProt -- this should be straightworward. Clients with out-of-tree memory managers will need to update their implementations. Clients using in-tree memory managers should mostly be able to ignore it. Major features: (1) More asynchrony: The allocate and deallocate methods are now asynchronous by default, with synchronous convenience wrappers supplied. The asynchronous versions allow clients (including JITLink) to request and deallocate memory without blocking. (2) Improved control over graph address layout: Instead of a SegmentRequestMap, JITLinkMemoryManager::allocate now takes a reference to the LinkGraph to be allocated. The memory manager is responsible for calculating the memory requirements for the graph, and laying out the graph (setting working and executor memory addresses) within the allocated memory. This gives memory managers full control over JIT'd memory layout. For clients that don't need or want this degree of control the new "BasicLayout" utility can be used to get a segment-based view of the graph, similar to the one provided by SegmentRequestMap. Once segment addresses are assigned the BasicLayout::apply method can be used to automatically lay out the graph. (3) Efficient tracking of allocated memory. The FinalizedAlloc type is a wrapper for an ExecutorAddr and requires only 64-bits to store in the controller. The meaning of the address held by the FinalizedAlloc is left up to the memory manager implementation, but the FinalizedAlloc type enforces a requirement that deallocate be called on any non-default values prior to destruction. The deallocate method takes a vector<FinalizedAlloc>, allowing for bulk deallocation of many allocations in a single call. Memory manager implementations will typically store the address of some allocation metadata in the executor in the FinalizedAlloc, as holding this metadata in the executor is often cheaper and may allow for clean deallocation even in failure cases where the connection with the controller is lost. (4) Support for "allocation actions" and finalize-lifetime memory. Allocation actions are pairs (finalize_act, deallocate_act) of JITTargetAddress triples (fn, arg_buffer_addr, arg_buffer_size), that can be attached to a finalize request. At finalization time, after memory protections have been applied, each of the "finalize_act" elements will be called in order (skipping any elements whose fn value is zero) as ((char()(const char , size_t))fn)((const char )arg_buffer_addr, (size_t)arg_buffer_size); At deallocation time the deallocate elements will be run in reverse order (again skipping any elements where fn is zero). The returned char * should be null to indicate success, or a non-null heap-allocated string error message to indicate failure. These actions allow finalization and deallocation to be extended to include operations like registering and deregistering eh-frames, TLS sections, initializer and deinitializers, and language metadata sections. Previously these operations required separate callWrapper invocations. Compared to callWrapper invocations, actions require no extra IPC/RPC, reducing costs and eliminating a potential source of errors. Finalize lifetime memory can be used to support finalize actions: Sections with finalize lifetime should be destroyed by memory managers immediately after finalization actions have been run. Finalize memory can be used to support finalize actions (e.g. with extra-metadata, or synthesized finalize actions) without incurring permanent memory overhead.	2021-10-11 19:12:42 -07:00
Arthur Eubanks	337cf0a5ab	[llc] Support -time-trace in llc Mostly copied from opt.cpp. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D111466	2021-10-11 10:16:46 -07:00
Esme-Yi	a00ff71668	[XCOFF] Improve error message context. Summary: This patch improves the error message context of the XCOFF interfaces by providing more details. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D110320	2021-10-11 02:52:20 +00:00
Lang Hames	f341161689	[ORC] Add TaskDispatch API and thread it through ExecutorProcessControl. ExecutorProcessControl objects will now have a TaskDispatcher member which should be used to dispatch work (in particular, handling incoming packets in the implementation of remote EPC implementations like SimpleRemoteEPC). The GenericNamedTask template can be used to wrap function objects that are callable as 'void()' (along with an optional name to describe the task). The makeGenericNamedTask functions can be used to create GenericNamedTask instances without having to name the function object type. In a future patch ExecutionSession will be updated to use the ExecutorProcessControl's dispatcher, instead of its DispatchTaskFunction.	2021-10-10 18:39:55 -07:00
Arthur Eubanks	77bc3ba365	[NFC][llvm-reduce] Cleanup types Use Module& wherever possible. Since every reduction immediately turns Chunks into an Oracle, directly pass Oracle instead. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D111122	2021-10-10 18:07:28 -07:00
Wenlei He	9978e0e475	[llvm-profdata] Allow overlap/similarity comparison to use custom hot threshold cutoff Allow overlap/similarity comparison to use custom hot threshold cutoff, instead of using hard coded 990000 as hot cutoff. Differential Revision: https://reviews.llvm.org/D111385	2021-10-10 13:30:18 -07:00
Wenlei He	da4e5fc861	[llvm-profgen] Deduplicate PID when processing perf input When parsing mmap to retrieve PID, deduplicate them before passing PID list to perf script. Perf script would error out when there's duplicated PID in the input, however raw perf data may main duplicated PID for large binary where more than one mmap is needed to load executable segment. Differential Revision: https://reviews.llvm.org/D111384	2021-10-10 13:30:17 -07:00
william woodruff	e7fc254875	[BitcodeAnalyzer] allow a motivated user to dump BLOCKINFO This adds the `--dump-blockinfo` flag to `llvm-bcanalyzer`, allowing a sufficiently motivated user to dump (parts of) the `BLOCKINFO_BLOCK` block. The default behavior is unchanged, and `--dump-blockinfo` only takes effect in the same context as other flags that control dump behavior (i.e., requires that `--dump` is also passed). Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D107536	2021-10-10 10:15:14 +05:30
Dávid Bolvanský	3649fb14d1	Fixed some errors detected by PVS Studio	2021-10-09 17:20:04 +02:00
John Ericson	59ae182bc2	Remove unnecessary StringRef convesion in llvm-config We have a string litteral (via CPP) used to construct `StringRef`, which is used to construct a `SmallString`. Just construct the latter directly. Differential Revision: https://reviews.llvm.org/D111322	2021-10-08 21:16:32 -04:00
Reid Kleckner	b3a6d096d7	Fix shlib builds for all lib/Target/*/TargetInfo libs They all must depend on MC now that the target registry is in MC. Also fix llvm-cxxdump	2021-10-08 15:21:13 -07:00
Reid Kleckner	89b57061f7	Move TargetRegistry.(h\|cpp) from Support to MC This moves the registry higher in the LLVM library dependency stack. Every client of the target registry needs to link against MC anyway to actually use the target, so we might as well move this out of Support. This allows us to ensure that Support doesn't have includes from MC/*. Differential Revision: https://reviews.llvm.org/D111454	2021-10-08 14:51:48 -07:00
Lang Hames	8fe3d9df0e	Revert "[ORC] Move SimpleRemoteEPCServer::Dispatcher into OrcShared." This reverts commit `dfd74db981`. SimpleRemoteEPC should share dispatch with the ExecutionSession, rather than having two different dispatch systems on the controller side. SimpleRemoteEPCServer::Dispatch doesn't need to be shared.	2021-10-08 13:43:42 -07:00
Nikita Popov	cfb53d8e6d	[NFC] Make some includes explicit Avoid relying on a number of indirect includes that currently happen through the Hashing.h header in DenseMapInfo.h.	2021-10-08 20:34:48 +02:00
Lang Hames	dfd74db981	[ORC] Move SimpleRemoteEPCServer::Dispatcher into OrcShared. Renames SimpleRemoteEPCServer::Dispatcher to SimpleRemoteEPCDispatcher and moves it into OrcShared. SimpleRemoteEPCServer::ThreadDispatcher is similarly moved and renamed to DynamicThreadPoolSimpleRemoteEPCDispatcher. This will allow these classes to be reused by SimpleRemoteEPC on the controller side of the connection.	2021-10-08 11:29:57 -07:00
Qiongsi Wu	856a07e47a	[NFC] Including <string> in llvm-cxxdump/Error.cpp A [[ https://reviews.llvm.org/rGf6fa95b77f33c3690e4201e505cb8dce1433abd9 \| recent commit ]] removed `<string>` from `ErrorHandling.h`. The removal caused `<string>` to be no longer included for `llvm/tools/llvm-cxxdump/Error.cpp` which uses the string type. This patch adds `<string>` to `llvm/tools/llvm-cxxdump/Error.cpp`. Reviewed By: jsji Differential Revision: https://reviews.llvm.org/D111354	2021-10-07 18:11:56 -04:00
wlei	b1a45c62f0	[llvm-profgen] Ignore branch count against outline function For some transformations like hot-cold split or coro split, it can outline its part of function ranges. Since sample loader is the early stage of backend and no split happens at that time, compiler can't recognize those function, so in llvm-profgen we should attribute the sample to the original function. This is already done for the body range samples since we use the symbols from dwarf which is created before the split. But for branch samples, the call from master function to its outlined function is actually not a call to the original function, we shouldn't add head/callsie samples for it. So instead of dwarf symbol, we use the symbols from symbol table and ignore those functions with special suffixes(like `.cold` ,`.resume`) for accumulating the callsite/head samples. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D110864	2021-10-07 14:03:34 -07:00
gbreynoo	14d76a376a	[llvm-readelf][docs] Add missing options and details to the help output and the command guide This change is to keep the help text and command guide of llvm-readelf in tandem. - In the help text mention that --section-data, --section-relocations, --section-symbols and --stack-sizes have no effect on GNU style output; give the accepted values for --elf-output-style and update the description of --gnu-hash-table to use the command guide description. - In the command guide add the missing options -a, --dependant-libraries,--no-demangle, --wide and -W. Also update the description of --symbols so it matches the help text. Differential Revision: https://reviews.llvm.org/D111240	2021-10-07 17:11:02 +01:00
gbreynoo	3a5aa57c9b	[llvm-objdump][docs] Add details to the help output and command guide This change is to add some missing details, clarifies some options and brings the help text and command guide of objdump closer together. - Added to the help that --all-headers also outputs symbols and relocations to match the command guide. - Added to the help that --debug-vars accepts an optional ascii/unicode format to match the command guide. - Changed the help descriptions for --disassemble, --disassemble-all, --dwarf=<value>, --fault-map-section, --line-numbers, --no-leading-addr and --source descriptions to match the command guide. - Added to the help that --start-address and --stop-address also effect relocation entries and the symbol table output to match the command guide. - Added a note to the command guide that --unwind-info and -u are not available for the elf format. Differential Revision: https://reviews.llvm.org/D110633	2021-10-07 16:30:12 +01:00
gbreynoo	9072183cb6	[llvm-objdump] Fix --prefix and --prefix-strip In the command guide --prefix and --prefix-strip is used in the form --prefix=<prefix> however currently it is used in the form --prefix <prefix>. This change fixes these options to match the command guide. Differential Revision: https://reviews.llvm.org/D110551	2021-10-07 15:53:45 +01:00
Itay Bookstein	40ec1c0f16	[IR][NFC] Rename getBaseObject to getAliaseeObject To better reflect the meaning of the now-disambiguated {GlobalValue, GlobalAlias}::getBaseObject after breaking off GlobalIFunc::getResolverFunction (D109792), the function is renamed to getAliaseeObject.	2021-10-06 19:33:10 -07:00
wlei	16516f8925	[llvm-profgen] Support symbol list for accurate profile Differential Revision: https://reviews.llvm.org/D110859	2021-10-06 11:41:39 -07:00
Simon Pilgrim	21661607ca	[llvm] Replace report_fatal_error(std::string) uses with report_fatal_error(Twine) As described on D111049, we're trying to remove the <string> dependency from error handling and replace uses of report_fatal_error(const std::string&) with the Twine() variant which can be forward declared.	2021-10-06 12:04:30 +01:00
Heejin Ahn	3ec1760d91	[WebAssembly] Remove WasmTagType This removes `WasmTagType`. `WasmTagType` contained an attribute and a signature index: ``` struct WasmTagType { uint8_t Attribute; uint32_t SigIndex; }; ``` Currently the attribute field is not used and reserved for future use, and always 0. And that this class contains `SigIndex` as its property is a little weird in the place, because the tag type's signature index is not an inherent property of a tag but rather a reference to another section that changes after linking. This makes tag handling in the linker also weird that tag-related methods are taking both `WasmTagType` and `WasmSignature` even though `WasmTagType` contains a signature index. This is because the signature index changes in linking so it doesn't have any info at this point. This instead moves `SigIndex` to `struct WasmTag` itself, as we did for `struct WasmFunction` in D111104. In this CL, in lib/MC and lib/Object, this now treats tag types in the same way as function types. Also in YAML, this removes `struct Tag`, because now it only contains the tag index. Also tags set `SigIndex` in `WasmImport` union, as functions do. I think this makes things simpler and makes tag handling more in line with function handling. These two shares similar properties in that both of them have signatures, but they are kind of nominal so having the same signature doesn't mean they are the same element. Also a drive-by fix: the reserved 'attirubute' part's encoding changed from uleb32 to uint8 a while ago. This was fixed in lib/MC and lib/Object but not in YAML. This doesn't change object files because the field's value is always 0 and its encoding is the same for the both encoding. This is effectively NFC; I didn't mark it as such just because it changed YAML test results. Reviewed By: sbc100, tlively Differential Revision: https://reviews.llvm.org/D111086	2021-10-05 17:11:22 -07:00
Simon Pilgrim	2e5daac217	[llvm] Update report_fatal_error calls from raw_string_ostream to use Twine(OS.str()) As described on D111049, we're trying to remove the <string> dependency from error handling and replace uses of report_fatal_error(const std::string&) with the Twine() variant which can be forward declared. We can use the raw_string_ostream::str() method to perform the implicit flush() and return a reference to the std::string container that we can then wrap inside Twine().	2021-10-05 18:42:12 +01:00
Simon Pilgrim	e463b69736	[Support] Change fatal_error_handler_t to take a const char* instead of std::string https://commondatastorage.googleapis.com/chromium-browser-clang/llvm-include-analysis.html Excessive use of the <string> header has a massive impact on compile time; its most commonly included via the ErrorHandling.h header, which has to be included in many key headers, impacting many source files that have no need for std::string. As an initial step toward removing the <string> include from ErrorHandling.h, this patch proposes to update the fatal_error_handler_t handler to just take a raw const char* instead. The next step will be to remove the report_fatal_error std::string variant, which will involve a lot of cleanup and better use of Twine/StringRef. Differential Revision: https://reviews.llvm.org/D111049	2021-10-05 10:55:40 +01:00
wlei	31a5cb3292	[llvm-profgen] Filter out invalid debug line Differential Revision: https://reviews.llvm.org/D110081	2021-10-04 19:09:06 -07:00
wlei	46cf7d75d9	[llvm-profgen] Add duplication factor for line-number based profile This change adds duplication factor multiplier while accumulating body samples for line-number based profile. The body sample count will be `duplication-factor * count`. Base discriminator and duplication factor is decoded from the raw discriminator, this requires some refactor works. Differential Revision: https://reviews.llvm.org/D109934	2021-10-04 19:08:55 -07:00
wlei	fb29d812e4	[CSSPGO] Rename the field of SampleContextFrame Differential Revision: https://reviews.llvm.org/D110980	2021-10-04 19:06:59 -07:00
Sam Clegg	c0039de295	[Object][WebAssemlby] Report function types (signatures). NFC This simplifies the code in a number of ways and avoids having to track functions and their types separately. Differential Revision: https://reviews.llvm.org/D111104	2021-10-04 17:33:56 -07:00
David Spickett	8692d07e58	[llvm-objdump] Fix common symbol output on 32 bit platforms Since https://reviews.llvm.org/D109452 symbol-table.test has been failing on our Arm32 bots. https://lab.llvm.org/buildbot/#/builders/171/builds/4201 This is because in that change an implicit widening cast of the alignment from 32 bit to 64 bit was removed and the format string expects a 64 bit number.	2021-10-04 14:24:03 +00:00
Lang Hames	d9152a8571	[llvm-jitlink] Sink getPageSize call in Session::Create. The page size for the host process is only needed in the in-process use case.	2021-10-02 11:28:14 -07:00
Tomasz Miąsko	f33274c7bf	[llvm-cxxfilt] Replace isalnum with isAlnum from StringExtras D104366 introduced a new llvm-cxxfilt test with non-ASCII characters, which caused a failure on llvm-clang-x86_64-expensive-checks-win builder, with a stack trace suggesting issue in a call to isalnum. The argument to isalnum should be either EOF or a value that is representable in the type unsigned char. The llvm-cxxfilt does not perform a cast from char to unsigned char before the call, so the value might be out of valid range. Replace the call to isalnum with isAlnum from StringExtras, which takes a char as the argument. This also makes the check independent of the current locale. Differential Revision: https://reviews.llvm.org/D110986	2021-10-02 08:54:04 +02:00
Lang Hames	33dd98e9e4	[ORC] Remove ORC RPC. With the removal of OrcRPCExecutorProcessControl and OrcRPCTPCServer in `6aeed7b19c` the ORC RPC library no longer has any in-tree users. Clients needing serialization for ORC should move to Simple Packed Serialization (usually by adopting SimpleRemoteEPC for remote JITing).	2021-10-01 11:17:33 -07:00
Arthur Eubanks	a7b4ce9cfd	[NFC][AttributeList] Replace index_begin/end with an iterator We expose the fact that we rely on unsigned wrapping to iterate through all indexes. This can be confusing. Rather, keeping it as an implementation detail through an iterator is less confusing and is less code. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D110885	2021-10-01 10:17:41 -07:00
zhijian	5b44c716ee	[AIX]implement the --syms and using "symbol index and qualname" for --sym --symbol--description for llvm-objdump for xcoff Summary: for xcoff : implement the getSymbolFlag and getSymbolType() for option --syms. llvm-objdump --sym , if the symbol is label, print the containing section for the symbol too. when using llvm-objdump --sym --symbol--description, print the symbol index and qualname for symbol. for example: --symbol-description 00000000000000c0 l .text (csect: (idx: 2) .foov[PR]) (idx: 3) .foov and without --symbol-description 00000000000000c0 l .text (csect: .foov) .foov Reviewers: James Henderson,Esme Yi Differential Revision: https://reviews.llvm.org/D109452	2021-10-01 12:37:51 -04:00
Lang Hames	d908118b8a	[llvm-jitlink] Fix a FIXME. ORC errors preserve the SymbolStringPool since `6fe2e9a9cc`, so we can stop bailing out early.	2021-10-01 08:49:51 -07:00
Marcelo Juchem	dfb213c2df	Fix ambiguous overload build failure LLVM (llvmorg-14-init) under Debian sid using latest gcc (Debian 10.3.0-9) 10.3.0 fails due to ambiguous overload on operators == and !=: /root/src/llvm/src/llvm/tools/obj2yaml/elf2yaml.cpp:212:22: error: ambiguous overload for 'operator!=' (operand types are 'llvm::ELFYAML::ELF_SHF' and 'int') /root/src/llvm/src/llvm/tools/obj2yaml/elf2yaml.cpp:204:32: error: ambiguous overload for 'operator!=' (operand types are 'const llvm::yaml::Hex64' and 'int') /root/src/llvm/src/llvm/lib/CodeGen/LiveDebugValues/VarLocBasedImpl.cpp:629:35: error: ambiguous overload for 'operator==' (operand types are 'const uint64_t' {aka 'const long unsigned int'} and 'llvm::Register') Reviewed by: StephenTozer, jmorse, Higuoxing Differential Revision: https://reviews.llvm.org/D109534	2021-10-01 14:19:57 +01:00
Florian Hahn	57fbb9ed0e	[llvm-reduce] Skip updating calls where OldF isn't the called fn. When replacing function calls, skip call instructions where the old function is not the called function, but e.g. the old function is passed as an argument. This fixes a crash due to trying to construct invalid IR for the test case. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D109759	2021-10-01 10:52:48 +01:00
Wenlei He	47d66355ef	[llvm-profgen] Fix alignment in preferred based calculation We used the segment alignment in elf header to assume the loader alignment. However this is incorrect because loader alignment is always the same as page size. If segment needs to be aligned at load time, linker will set aligned address as virtual address in elf header. Differential Revision: https://reviews.llvm.org/D110795	2021-09-29 23:01:10 -07:00
Wenlei He	1f0bc617bd	[llvm-porfgen] Allow perf data as input This change enables llvm-profgen to take raw perf data as alternative input format. Sometimes we need to retrieve evenets for processes with matching binary. Using perf data as input allows us to retrieve process Ids from mmap events for matching binary, then filter by process id during perf script generation. Differential Revision: https://reviews.llvm.org/D110793	2021-09-29 22:57:35 -07:00
Wenlei He	941191aae4	[llvm-profgen] Refactor and better diagnostics This change contains diagnostics improvments, refactoring and preparation for consuming perf data directly. Diagnostics: - We now have more detailed diagnostics when no mmap is found. - We also print warning for abnormal transition to external code. Refactoring: - Simplify input perf trace processing to only allow a single input file. This is because 1) using multiple input perf trace (perf script) is error prone because we may miss key mmap events. 2) the functionality is not really being used anyways. - Make more functions private for Readers, move non-trivial definitions out of header. Cleanup some inconsistency. - Prepare for consuming perf data as input directly. Differential Revision: https://reviews.llvm.org/D110729	2021-09-29 22:55:50 -07:00
Fangrui Song	8971b99c83	[llvm-objdump/llvm-readobj/obj2yaml/yaml2obj] Support STO_RISCV_VARIANT_CC and DT_RISCV_VARIANT_CC STO_RISCV_VARIANT_CC marks that a symbol uses a non-standard calling convention or the vector calling convention. See https://github.com/riscv/riscv-elf-psabi-doc/pull/190 Differential Revision: https://reviews.llvm.org/D107949	2021-09-29 16:56:52 -07:00
Wael Yehia	8b8da01d88	Revert "[LTO][Legacy] Add -debug-pass-manager option to enable pass run/skip trace." This reverts commit `a60405cf03`.	2021-09-29 19:43:35 +00:00
Michael Kruse	d9562a8e45	[llvm-reduce] Reduce metadata references. The ReduceMetadata pass before this patch removed metadata on a per-MDNode (or NamedMDNode) basis. Either all references to an MDNode are kept, or all of them are removed. However, MDNodes are uniqued, meaning that references to MDNodes with the same data become references to the same MDNodes. As a consequence, e.g. tbaa references to the same type will all have the same MDNode reference and hence make it impossible to reduce only keeping metadata on those memory access for which they are interesting. Moreover, MDNodes can also be referenced by some intrinsics or other MDNodes. These references were not considered for removal leading to the possibility that MDNodes are not actually removed even if selected to be removed by the oracle. This patch changes ReduceMetadata to reduces based on removable metadata references instead. MDNodes without references implicitly dropped anyway. References by intrinsic calls should be removed by ReduceOperands or ReduceInstructions. References in other MDNodes cannot be removed as it would violate the immutability of MDNodes. Additionally, ReduceMetadata pass before this patch used `setMetadata(I, NULL)` to remove references, where `I` is the index in the array returned by `getAllMetadata`. However, `setMetadata` expects a MDKind (such as `MD_tbaa`) as first argument. `getAllMetadata` does not return those in consecutive order (otherwise it would not need to be a `std::pair` with `first` representing the MDKind). Reviewed By: aeubanks, swamulism Differential Revision: https://reviews.llvm.org/D110534	2021-09-29 11:25:35 -05:00
Wael Yehia	a60405cf03	[LTO][Legacy] Add -debug-pass-manager option to enable pass run/skip trace. Reviewed by: steven_wu, fhahn, tejohnson Differential Revision: https://reviews.llvm.org/D110075	2021-09-29 12:17:53 +00:00
Igor Kudrin	7b424b9333	[llvm-objcopy] Rename relocation sections together with their targets. As for now, llvm-objcopy renames only sections that are specified explicitly in --rename-section, while GNU objcopy keeps names of relocation sections in sync with their targets. For example: > readelf -S test.o ... [ 1] .foo PROGBITS [ 2] .rela.foo RELA > objcopy --rename-section .foo=.bar test.o gnu.o > readelf -S gnu.o ... [ 1] .bar PROGBITS [ 2] .rela.bar RELA > llvm-objcopy --rename-section .foo=.bar test.o llvm.o > readelf -S llvm.o ... [ 1] .bar PROGBITS [ 2] .rela.foo RELA This patch makes llvm-objcopy to match the behavior of GNU objcopy better. Differential Revision: https://reviews.llvm.org/D110352	2021-09-29 16:36:37 +07:00
wlei	a03cf331e1	[llvm-profgen] Strip context to support non-CS profile generation for hybrid sample Differential Revision: https://reviews.llvm.org/D109769	2021-09-28 12:20:23 -07:00
Lang Hames	ab5e6e7434	[llvm-jitlink] Add a -slab-page-size option to override process page size. The slab allocator is frequently used in -noexec tests where we want a consistent memory layout. In this context we also want to set the effective page size, rather than using the page size of the host process, since not all systems use the same page size. The -slab-page-size option allows us to set the page size for such tests. The -slab-page-size option will also be honored in exec mode when using the slab allocator, but will trigger an error if the requested size is not a multiple of the actual process page size. This option was motivated by test failures on a ppc64 bot that was returning zero from sys::Process::getPageSize(), so it also contains a check for errors and zero results from that function if the -slab-page-size option is absent. Existing slab allocator tests will be updated to use this option in a follow-up commit so that we can point the failing bot at this commit and observe errors associated with sys::Process::getPageSize().	2021-09-28 10:43:46 -07:00
Fangrui Song	74a47e54be	[llvm-objdump] Fix -R display and support ET_EXEC * Add a newline before `DYNAMIC RELOCATION RECORDS` (see D101796) * Add the missing `OFFSET TYPE VALUE` line * Align columns Note: llvm-readobj/ELFDumper.cpp `loadDynamicTable` has sophisticated PT_DYNAMIC code which is unavailable in llvm-objdump. Reviewed By: jhenderson, Higuoxing Differential Revision: https://reviews.llvm.org/D110595	2021-09-28 09:58:27 -07:00
wlei	ce40843a3f	[llvm-profgen][CSSPGO] On-demand function size computation for preinliner Similar to https://reviews.llvm.org/D110465, we can compute function size on-demand for the functions that's hit by samples. Here we leverage the raw range samples' address to compute a set of sample hit function. Then `BinarySizeContextTracker` just works on those function range for the size. Reviewed By: hoy Differential Revision: https://reviews.llvm.org/D110466	2021-09-28 09:09:38 -07:00
wlei	091c16f76b	[llvm-profgen] On-demand symbolization Previously we do symbolization for all the functions and actually we only need the symbols that's hit by the samples. This can significantly speed up the time for large size binary. Optimization for per-inliner will come along with next patch. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D110465	2021-09-28 09:09:25 -07:00
Lang Hames	61e25d2550	clang-format	2021-09-27 18:02:06 -07:00
Lang Hames	22f8276fe4	[llvm-jitlink] Add more information about allocation failures. Slab allocator failures will now report requested size and remaining capacity.	2021-09-27 18:02:06 -07:00
Lang Hames	21a06254a3	[ORC] Switch from JITTargetAddress to ExecutorAddr for EPC-call APIs. Part of the ongoing move to ExecutorAddr.	2021-09-27 16:53:09 -07:00
Jozef Lawrynowicz	6cfb4d46ba	[llvm-readobj] Support dumping of MSP430 ELF attributes The MSP430 ABI supports build attributes for specifying the ISA, code model, data model and enum size in ELF object files. Differential Revision: https://reviews.llvm.org/D107969	2021-09-28 00:56:11 +03:00
gbreynoo	05b1c7aebf	[llvm-dwarfdump][docs] Add missing options to the help output and the command guide This change is to add some missing details to the help text and command guide: - Added a note to the command guide that --debug-macro also dumps .debug_macinfo. - Added a note to the command guide that --debug-frame and --eh_frame are aliases, and in cases where both sections are present one command outputs both. - Changed the wording in the help output for --ignore-case and --regex to closer match the command guide.	2021-09-27 14:28:31 +01:00
Lang Hames	a12c0d5ea6	[ORC] Export process symbols in lli-child-target. We want this behavior for future testing infrastructure anyway, and it may help with the failure in https://lab.llvm.org/buildbot/#/builders/98/builds/6401: /b/fuchsia-x86_64-linux/llvm.obj/tools/clang/stage2-bins/bin/lli: warning: remote mcjit does not support lazy compilation Finalization error: could not register eh-frame: __register_frame function not found /b/fuchsia-x86_64-linux/llvm.obj/tools/clang/stage2-bins/bin/lli: disconnecting	2021-09-26 11:22:49 -07:00
Lang Hames	6498b0e991	Reintroduce "[ORC] Introduce EPCGenericRTDyldMemoryManager." This reintroduces "[ORC] Introduce EPCGenericRTDyldMemoryManager." (`bef55a2b47`) and "[lli] Add ChildTarget dependence on OrcTargetProcess library." (`7a219d801b`) which were reverted in `99951a5684` due to bot failures. The root cause of the bot failures should be fixed by "[ORC] Fix uninitialized variable." (`0371049277`) and "[ORC] Wait for handleDisconnect to complete in SimpleRemoteEPC::disconnect." (`320832cc9b`).	2021-09-27 03:24:33 +10:00
Lang Hames	175c1a39e8	[ORC][llvm-jitlink] Add debugging output to SimpleRemoteEPC (and Server). Also adds an optional 'debug' argument to the llvm-jitlink-executor tool to enable debug-logging.	2021-09-26 10:00:29 -07:00
Lang Hames	99951a5684	Revert "[ORC] Introduce EPCGenericRTDyldMemoryManager." This reverts commit `bef55a2b47` while I investigate failures on some bots. Also reverts "[lli] Add ChildTarget dependence on OrcTargetProcess library." (`7a219d801b`) which was a fallow-up to `bef55a2b47`.	2021-09-25 11:19:14 -07:00
Lang Hames	7a219d801b	[lli] Add ChildTarget dependence on OrcTargetProcess library. ChildTarget depends on OrcTargetProcess after `bef55a2b47`.	2021-09-25 10:51:29 -07:00
Lang Hames	bef55a2b47	[ORC] Introduce EPCGenericRTDyldMemoryManager. EPCGenericRTDyldMemoryMnaager is an EPC-based implementation of the RuntimeDyld::MemoryManager interface. It enables remote-JITing via EPC (backed by a SimpleExecutorMemoryManager instance on the executor side) for RuntimeDyld clients. The lli and lli-child-target tools are updated to use SimpleRemoteEPC and SimpleRemoteEPCServer (rather than OrcRemoteTargetClient/Server), and EPCGenericRTDyldMemoryManager for MCJIT tests. By enabling remote-JITing for MCJIT and RuntimeDyld-based ORC clients, EPCGenericRTDyldMemoryManager allows us to deprecate older remote-JITing support, including OrcTargetClient/Server, OrcRPCExecutorProcessControl, and the Orc RPC system itself. These will be removed in future patches.	2021-09-25 10:42:10 -07:00
modimo	ce6ed64a69	[llvm-profdata] Extend support of --topn to sample profiles Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D110449	2021-09-24 16:42:46 -07:00
wlei	1422fa5fab	[llvm-profgen] Unify output format of different unsymbolized profiles Differential Revision: https://reviews.llvm.org/D110080	2021-09-24 14:18:00 -07:00
wlei	28277e9b48	[AutoFDO][llvm-profgen] Report zero count for unexecuted part of function code In order to be consistent with compiler that interprets zero count as unexecuted(cold), this change reports zero-value count for unexecuted part of function code. For the implementation, it leverages the range counter, initializes all the executed function range with the zero-value. After all ranges are merged and converted into disjoint ranges, the remaining zero count will indicates the unexecuted(cold) part of the function. This change also extends the current `findDisjointRanges` method which now can support adding zero-value range. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D109713	2021-09-24 14:15:05 -07:00
wlei	d5f2013004	[AutoFDO][llvm-profgen] Profile generation for LBR(non-CS) sample This patch introduces non-CS AutoFDO profile generation into LLVM. The profile is supposed to be well consumed by compiler using `-fprofile-sample-use=[profile]`. After range and branch counters are extracted from the LBR sample, here we go through each addresses for symbolization, create FunctionSamples and populate its sub fields like TotalSamples, BodySamples and HeadSamples etc. For inlined code, as we need to map back to original code, so we always add body samples to the leaf frame's function sample. Reviewed By: wenlei, hoy Differential Revision: https://reviews.llvm.org/D109551	2021-09-24 13:55:34 -07:00
wlei	a7cdcf25c1	[llvm-profgen] Ignore invalid perf line in LBR record Similar to https://reviews.llvm.org/D109637, there is a whole invalid line of message in perfscript. ``` warning: Invalid address in LBR record at line 14118674: Processed 14138923 events and lost 1 chunks! warning: Invalid address in LBR record at line 14118676: Check IO/CPU overload! ``` This only happened for LBR only perfscript, hybridperfscript have a check of " 0x" to make sure it's the LBR perf line. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D110424	2021-09-24 13:44:57 -07:00
Teresa Johnson	b5bfbb4da2	Fix bot failure by adding needed dependence Fix bot failure from `96cb97c453`, e.g.: https://lab.llvm.org/buildbot/#/builders/61/builds/15203 llvm-lto now needs to link in IPO.	2021-09-24 12:43:10 -07:00
Teresa Johnson	96cb97c453	[ThinLTO] Update combined index for SamplePGO indirect calls to locals In ThinLTO for locals we normally compute the GUID from the name after prepending the source path to get a unique global id. SamplePGO indirect call profiles contain the target GUID without this uniquification, however (unless compiling with -funique-internal-linkage-names). In order to correctly handle the call edges added to the combined index for these indirect calls, during importing and bitcode writing we consult a map of original to full GUID to identify the actual callee. However, for a large application this was consuming a lot of compile time as we need to do this repeatedly (especially during importing where we may traverse call edges multiple times). To fix this implement a suggestion in one of the FIXME comments, and actually modify the call edges during a single traversal after the index is built to perform the fixups once. I combined this fixup with the dead code analysis performed on the index in order to avoid adding an additional walk of the index. The dead code analysis is the first analysis performed on the index. This reduced the time required for a large thin link with SamplePGO by about 20%. No new test added, but I confirmed that there are existing tests that will fail when no fixup is performed. Differential Revision: https://reviews.llvm.org/D110374	2021-09-24 12:29:49 -07:00
Igor Kudrin	6dda6c49ce	[llvm-objcopy][NFC] Add a helper method RelocationSectionBase::getNamePrefix() Refactor handleArgs() to use that method. Differential Revision: https://reviews.llvm.org/D110350	2021-09-24 22:02:36 +07:00
gbreynoo	3bad9616aa	[llvm-objcopy][docs] Add missing options to the help output and the command guide This change is to keep the help text and command guide of objcopy in tandem. - In the help output the options --rename-section and --set-section-flags were missing the flag exclude, which is found in the command guide. - In the command guide the alias -G for --keep-global-symbol was missing, which is found in the help output. Differential Revision: https://reviews.llvm.org/D110340	2021-09-24 09:44:46 +01:00
Simon Pilgrim	5f2c53bdf4	Pass some DataLayout arguments by const-ref Avoid unnecessary copies, reported by MSVC static analyzer.	2021-09-23 15:50:31 +01:00
wlei	1ed69bb86e	[llvm-profgen] Fix a dangling vector reference in CS line number based generator It seems we missed one spot to persist `SampleContextFrameVector` into the global table (CSProfileGenerator::populateFunctionBoundarySamples:340) which causes a crash. This change tried to fix it in a centralized way i. e. where we generate the `FunctionSamples`. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D110275	2021-09-22 18:33:28 -07:00
wlei	686cc00067	[llvm-profgen] Fix an out-of-range error during unwinding It happened that the LBR entry target can be the first address of text section which causes an out-of-range crash. So here add a boundary check. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D110271	2021-09-22 18:33:27 -07:00
wlei	c2be2d3284	[llvm-profgen] Fix a bug of assertion The assertion should work on the entire context. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D110268	2021-09-22 18:33:27 -07:00
Wenlei He	81c249784f	[llvm-profgen] Use hot threshold for context merging and trimming Without preinliner, we need to tune down the cold count cutoff to merge/trim more context to limit profile size for large components. However it doesn't make sense for cold threshold to be higher than hot threshold, so we now change to use hot threshold as merging/trimming cut off instead. Differential Revision: https://reviews.llvm.org/D110212	2021-09-22 15:01:51 -07:00
Hongtao Yu	734f4d832c	[llvm-profgen] An option to dump disasm of specified symbols For large app, dumping disasm of the whole program can be slow and result in gianant output. Adding a switch to dump specific symbols only. Reviewed By: wlei Differential Revision: https://reviews.llvm.org/D110079	2021-09-22 10:32:59 -07:00
Craig Topper	d85e347a28	[RISCV] Add a pass to recognize VLS strided loads/store from gather/scatter. For strided accesses the loop vectorizer seems to prefer creating a vector induction variable with a start value of the form <i32 0, i32 1, i32 2, ...>. This value will be incremented each loop iteration by a splat constant equal to the length of the vector. Within the loop, arithmetic using splat values will be done on this vector induction variable to produce indices for a vector GEP. This pass attempts to dig through the arithmetic back to the phi to create a new scalar induction variable and a stride. We push all of the arithmetic out of the loop by folding it into the start, step, and stride values. Then we create a scalar GEP to use as the base pointer for a strided load or store using the computed stride. Loop strength reduce will run after this pass and can do some cleanups to the scalar GEP and induction variable. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D107790	2021-09-20 09:39:44 -07:00
Samuel	f18c0739b3	[llvm-reduce] Add reduce operands pass Add reduction to set operands to default values Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D108903	2021-09-17 12:32:15 -07:00
Lang Hames	78b083dbb7	[ORC] Add finalization & deallocation actions, SimpleExecutorMemoryManager class Finalization and deallocation actions are a key part of the upcoming JITLinkMemoryManager redesign: They generalize the existing finalization and deallocate concepts (basically "copy-and-mprotect", and "munmap") to include support for arbitrary registration and deregistration of parts of JIT linked code. This allows us to register and deregister eh-frames, TLV sections, language metadata, etc. using regular memory management calls with no additional IPC/RPC overhead, which should both improve JIT performance and simplify interactions between ORC and the ORC runtime. The SimpleExecutorMemoryManager class provides executor-side support for memory management operations, including finalization and deallocation actions. This support is being added in advance of the rest of the memory manager redesign as it will simplify the introduction of an EPC based RuntimeDyld::MemoryManager (since eh-frame registration/deregistration will be expressible as actions). The new RuntimeDyld::MemoryManager will in turn allow us to remove older remote allocators that are blocking the rest of the memory manager changes.	2021-09-17 09:55:45 +10:00
Nico Weber	646299d183	[Support] Convert BinaryStream class zoo to 64-bit offsets Most PDB fields on disk are 32-bit but describe the file in terms of MSF blocks, which are 4 kiB by default. So PDB files can be a bit larger than 4 GiB, and much larger if you create them with a block size > 4 kiB. This is a first (necessary, but by far not not sufficient) step towards supporting such PDB files. Now we don't truncate in-memory file offsets (which are in terms of bytes, not in terms of blocks). No effective behavior change. lld-link will still error out if it were to produce PDBs > 4 GiB. Differential Revision: https://reviews.llvm.org/D109923	2021-09-16 19:14:52 -04:00
Wenlei He	446e21623c	[llvm-profgen] Use context-sensitive byte size cost for preinliner decisions by default Turn on `use-context-cost-for-preinliner` to use context-sensitive byte size cost for preinliner decisions by default. This is a more accurate proxy of inline cost than profile size. We tested on our large workload that it delivers measureable CPU improvement. Differential Revision: https://reviews.llvm.org/D109893	2021-09-16 10:36:12 -07:00
Alok Kumar Sharma	a5b72abc9e	[DebugInfo] Enhance DIImportedEntity to accept children entities New field `elements` is added to '!DIImportedEntity', representing list of aliased entities. This is needed to dump optimized debugging information where all names in a module are imported, but a few names are imported with overriding aliases. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D109343	2021-09-16 10:41:55 +05:30
Esme-Yi	945df8bc4c	[obj2yaml][XCOFF] Dump sections Summary: This patch implements parsing sections for obj2yaml on AIX. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D98003	2021-09-15 05:16:33 +00:00
Hongtao Yu	0057c7185d	[CSSPGO][llvm-profgen] Truncate stack samples with invalid return address. Invalid frame addresses exist in call stack samples due to bad unwinding. This could happen to frame-pointer-based unwinding and the callee functions that do not have the frame pointer chain set up. It isn't common when the program is built with the frame pointer omission disabled, but can still happen with third-party static libs built with frame pointer omitted. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D109638	2021-09-14 21:56:22 -07:00
Hongtao Yu	8cbbd7e0b2	[llvm-profgen] Ignore broken LBR samples Perf script can sometimes give disordered LBR samples like below. ``` b022500 32de0044 3386e1d1 7f118e05720c 7f118df2d81f 0x2a0b9622/0x2a0b9610/P/-/-/1 0x2a0b79ff/0x2a0b9618/P/-/-/2 0x2a0b7a4a/0x2a0b79e8/P/-/-/1 0x2a0b7a33/0x2a0b7a46/P/-/-/1 0x2a0b7a42/0x2a0b7a23/P/-/-/1 0x2a0b7a21/0x2a0b7a37/P/-/-/2 0x2a0b79e6/0x2a0b7a07/P/-/-/1 0x2a0b79d4/0x2a0b79dc/P/-/-/2 0x2a0b7a03/0x2a0b79aa/P/-/-/1 0x2a0b79a8/0x2a0b7a00/P/-/-/234 0x2a0b9613/0x2a0b7930/P/-/-/1 0x2a0b9622/0x2a0b9610/P/-/-/1 0x2a0b79ff/0x2a0b9618/P/-/-/2 0x2a0b7a4a/0x2aWarning: Processed 10263226 events and lost 1 chunks! ``` Note that the last LBR record `0x2a0b7a4a/0x2aWarning:` . Currently llvm-profgen does not detect that and as a result an uninitialized branch target value will be used. The uninitialized value can cause creepy instruction ranges created which which in turn will result in a completely wrong profile. An example is like ``` .... @ _ZN5folly13loadUnalignedIsEET_PKv]:18446744073709551615:18446744073709551615 1: 18446744073709551615 !CFGChecksum: 4294967295 !Attributes: 0 ``` Reviewed By: wenlei, wlei Differential Revision: https://reviews.llvm.org/D109637	2021-09-14 12:11:17 -07:00
Sam Clegg	ef8c9135ef	[WebAssembly] Allow import and export of TLS symbols between DSOs We previously had a limitation that TLS variables could not be exported (and therefore could also not be imported). This change removed that limitation. Differential Revision: https://reviews.llvm.org/D108877	2021-09-14 06:47:37 -07:00
Martin Storsjö	63784b9a75	[llvm-readobj] [COFF] Resolve relocations pointing at section symbols for arm64 too This syncs parts from the x86 implementation to the ARMWinEH implementation. Currently, neither of the compilers targeting COFF/arm64 (MSVC, LLVM) produce such relocations, but LLVM might after a later patch. Differential Revision: https://reviews.llvm.org/D109650	2021-09-14 11:04:46 +03:00
Martin Storsjö	197084fcee	[llvm-readobj] [COFF] Try to resolve symbols in unwind info on x86 This is the same as we do on arm64 already for the MSVC style label symbols, but also handle the way GCC produces it - with all relocations pointing at the .text section symbol, with various offsets. Differential Revision: https://reviews.llvm.org/D109649	2021-09-14 11:04:46 +03:00
Arthur Eubanks	096d9814aa	[opt] Remove some legacy PM flags Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D109664	2021-09-13 15:50:03 -07:00
Sam Clegg	b78c85a44a	[WebAssembly] Convert to new "dylink.0" section format This format is based on sub-sections (like the "linking" and "name" sections) and is therefore easier to extend going forward. spec change: https://github.com/WebAssembly/tool-conventions/pull/170 binaryen change: https://github.com/WebAssembly/binaryen/pull/4141 wabt change: https://github.com/WebAssembly/wabt/pull/1707 emscripten change: https://github.com/emscripten-core/emscripten/pull/15019 Differential Revision: https://reviews.llvm.org/D109595	2021-09-12 05:30:38 -07:00
Lang Hames	bb72f07380	Re-apply `bb27e45643` and `5629afea91` with fixes. This reapplies `bb27e45643` (SimpleRemoteEPC support) and `2269a941a4` (#include <mutex> fix) with further fixes to support building with LLVM_ENABLE_THREADS=Off.	2021-09-12 14:23:22 +10:00
Martin Storsjö	314b5a0efd	[llvm-shlib] Fix the i686 MSVC triple check for listing symbols to export in LLVM-C.dll https://reviews.llvm.org/D47381 / `eb46c95c3e` changed the triples set up by GetHostTriple.cmake for i686 MSVC from i686-pc-win32 to i686-pc-windows-msvc without changing the corresponding condition in llvm-shlib. Since then, the 32 bit x86 build of LLVM-C.dll has contained no exported symbols at all. Differential Revision: https://reviews.llvm.org/D109493	2021-09-11 19:50:03 +03:00
Lang Hames	2269a941a4	Revert `5629afea91` and `bb27e45643` while I look into bot failures. This reverts commit `5629afea91` ("[ORC] Add missing include."), and `bb27e45643` ("[ORC] Add SimpleRemoteEPC: ExecutorProcessControl over SPS + abstract transport."). The SimpleRemoteEPC patch currently assumes availability of threads, and needs to be rewritten with LLVM_ENABLE_THREADS guards.	2021-09-11 19:02:11 +10:00
Lang Hames	bb27e45643	[ORC] Add SimpleRemoteEPC: ExecutorProcessControl over SPS + abstract transport. SimpleRemoteEPC is an ExecutorProcessControl implementation (with corresponding new server class) that uses ORC SimplePackedSerialization (SPS) to serialize and deserialize EPC-messages to/from byte-buffers. The byte-buffers are sent and received via a new SimpleRemoteEPCTransport interface that can be implemented to run SimpleRemoteEPC over whatever underlying transport system (IPC, RPC, network sockets, etc.) best suits your use case. The SimpleRemoteEPCServer class provides executor-side support. It uses a customizable SimpleRemoteEPCServer::Dispatcher object to dispatch wrapper function calls to prevent the RPC thread from being blocked (a problem in some earlier remote-JIT server implementations). Almost all functionality (beyond the bare basics needed to bootstrap) is implemented as wrapper functions to keep the implementation simple and uniform. Compared to previous remote JIT utilities (OrcRemoteTarget, OrcRPCExecutorProcessControl), more consideration has been given to disconnection and error handling behavior: Graceful disconnection is now always initiated by the ORC side of the connection, and failure at either end (or in the transport) will result in Errors being delivered to both ends to enable controlled tear-down of the JIT and Executor (in the Executor's case this means "as controlled as the JIT'd code allows"). The introduction of SimpleRemoteEPC will allow us to remove other remote-JIT support from ORC (including the legacy OrcRemoteTarget code used by lli, and the OrcRPCExecutorProcessControl and OrcRPCEPCServer classes), and then remove ORC RPC itself. The llvm-jitlink and llvm-jitlink-executor tools have been updated to use SimpleRemoteEPC over file descriptors. Future commits will move lli and other tools and example code to this system, and remove ORC RPC.	2021-09-11 18:16:38 +10:00
Keith Smiley	e972e49b11	[llvm-cov] Add error for invalid -path-equivalence format Differential Revision: https://reviews.llvm.org/D109042	2021-09-10 18:34:37 -07:00
Alfonso Sánchez-Beato	b25ab4f313	[llvm-objcopy][COFF] Fix test for debug dir presence If the number of directories was 6 (equal to the DEBUG_DIRECTORY index), patchDebugDirectory() was run even though the debug directory is actually the 7th entry. Use <= in the comparison to fix that. This fixes https://llvm.org/PR51243 Differential Revision: https://reviews.llvm.org/D106940 Reviewed by: jhenderson	2021-09-10 09:57:18 +01:00
Chris Lattner	735f46715d	[APInt] Normalize naming on keep constructors / predicate methods. This renames the primary methods for creating a zero value to `getZero` instead of `getNullValue` and renames predicates like `isAllOnesValue` to simply `isAllOnes`. This achieves two things: 1) This starts standardizing predicates across the LLVM codebase, following (in this case) ConstantInt. The word "Value" doesn't convey anything of merit, and is missing in some of the other things. 2) Calling an integer "null" doesn't make any sense. The original sin here is mine and I've regretted it for years. This moves us to calling it "zero" instead, which is correct! APInt is widely used and I don't think anyone is keen to take massive source breakage on anything so core, at least not all in one go. As such, this doesn't actually delete any entrypoints, it "soft deprecates" them with a comment. Included in this patch are changes to a bunch of the codebase, but there are more. We should normalize SelectionDAG and other APIs as well, which would make the API change more mechanical. Differential Revision: https://reviews.llvm.org/D109483	2021-09-09 09:50:24 -07:00
Alfonso Sánchez-Beato	b33fd31772	[yaml2obj][COFF] Allow variable number of directories Allow variable number of directories, as allowed by the specification. NumberOfRvaAndSize will default to 16 if not specified, as in the past. Reviewed by: jhenderson Differential Revision: https://reviews.llvm.org/D108825	2021-09-09 11:16:56 +01:00
Alexey Lapshin	50467c0852	[llvm-objcopy][NFC] Refactor CopyConfig structure - categorize options. This patch continues refactoring done by D99055. It puts format specific options into the correponding CopyConfig structures. Differential Revision: https://reviews.llvm.org/D102277	2021-09-08 19:16:38 +03:00
Nikita Popov	f5832eaaad	[UseListOrder] Fix use list order for function operands Functions can have a personality function, as well as prefix and prologue data as additional operands. Unused operands are assigned a dummy value of i1* null. This patch addresses multiple issues in use-list order preservation for these: * Fix verify-uselistorder to also enumerate the dummy values. This means that now use-list order values of these values are shuffled even if there is no other mention of i1* null in the module. This results in failures of Assembler/call-arg-is-callee.ll, Assembler/opaque-ptr.ll and Bitcode/use-list-order2.ll. * The use-list order prediction in ValueEnumerator does not take into account the fact that a global may use a value more than once and leaves uses in the same global effectively unordered. We should be comparing the operand number here, as we do for the more general case. * While we enumerate all operands of a function together (which seems sensible to me), the bitcode reader would first resolve prefix data for all function, then prologue data for all functions, then personality functions for all functions. Change this to resolve all operands for a given function together instead. Differential Revision: https://reviews.llvm.org/D109282	2021-09-07 20:59:12 +02:00
Maksim Panchenko	6300e4ac58	[llvm-objdump] Fix 'llvm-objdump -dr' for executables with relocations Print relocations interleaved with disassembled instructions for executables with relocatable sections, e.g. those built with "-Wl,-q". Differential Revision: https://reviews.llvm.org/D109016	2021-09-07 11:24:24 -07:00
Roman Lebedev	e030f808ec	[Exegesis] Native clusterization: sub-partition by sched class id Currently native clusterization simply groups all benchmarks by the opcode of key instruction, but that is suboptimal in certain cases, e.g. where we can already tell that the particular instructions already resolve into different sched classes.	2021-09-07 17:54:37 +03:00
Peter Smith	5e71839f77	[MC] Add MCSubtargetInfo to MCAlignFragment In preparation for passing the MCSubtargetInfo (STI) through to writeNops so that it can use the STI in operation at the time, we need to record the STI in operation when a MCAlignFragment may write nops as padding. The STI is currently unused, a further patch will pass it through to writeNops. There are many places that can create an MCAlignFragment, in most cases we can find out the STI in operation at the time. In a few places this isn't possible as we are in initialisation or finalisation, or are emitting constant pools. When possible I've tried to find the most appropriate existing fragment to obtain the STI from, when none is available use the per module STI. For constant pools we don't actually need to use EmitCodeAlign as the constant pools are data anyway so falling through into it via an executable NOP is no better than falling through into data padding. This is a prerequisite for D45962 which uses the STI to emit the appropriate NOP for the STI. Which can differ per fragment. Note that involves an interface change to InitSections. It is now called initSections and requires a SubtargetInfo as a parameter. Differential Revision: https://reviews.llvm.org/D45961	2021-09-07 15:46:19 +01:00
Roman Lebedev	03512ae9bf	[exegesis][X86] ParallelSnippetGenerator: don't accidentally create serialized instructions In the case of no tied variables, we pick random defs, and then random uses that don't alias with defs we just picked. Sounds good, except that an X86 instruction may have implicit reg uses, e.g. for `MULX` it's `EDX`/`RDX`: `Intel SDM, 4-162 Vol. 2B MULX — Unsigned Multiply Without Affecting Flags` > Performs an unsigned multiplication of the implicit source operand (EDX/RDX) and the specified source operand > (the third operand) and stores the low half of the result in the second destination (second operand), the high half > of the result in the first destination operand (first operand), without reading or writing the arithmetic flags. And indeed, every once in a while `llvm-exegesis` happened to pick EDX as a def while measuring throughput, and producing garbage output: ``` $ ./bin/llvm-exegesis -num-repetitions=1000000 -mode=inverse_throughput -repetition-mode=min --loop-body-size=4096 -dump-object-to-disk=false -opcode-name=MULX32rr --max-configs-per-opcode=65536 --- mode: inverse_throughput key: instructions: - 'MULX32rr EDX R11D R12D' config: '' register_initial_values: - 'R12D=0x0' - 'EDX=0x0' cpu_name: znver3 llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 1000000 measurements: - { key: inverse_throughput, value: 4.00014, per_snippet_value: 4.00014 } error: '' info: instruction has no tied variables picking Uses different from defs assembled_snippet: 415441BC00000000BA00000000C4C223F6D4C4C223F6D4C4C223F6D4C4C223F6D4415CC3415441BC00000000BA0000000049B80200000000000000C4C223F6D4C4C223F6D44983C0FF75F0415CC3 ... ``` ``` $ ./bin/llvm-exegesis -num-repetitions=1000000 -mode=inverse_throughput -repetition-mode=min --loop-body-size=4096 -dump-object-to-disk=false -opcode-name=MULX32rr --max-configs-per-opcode=65536 --- mode: inverse_throughput key: instructions: - 'MULX32rr R13D EDX ECX' config: '' register_initial_values: - 'ECX=0x0' - 'EDX=0x0' cpu_name: znver3 llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 1000000 measurements: - { key: inverse_throughput, value: 3.00013, per_snippet_value: 3.00013 } error: '' info: instruction has no tied variables picking Uses different from defs assembled_snippet: 4155B900000000BA00000000C4626BF6E9C4626BF6E9C4626BF6E9C4626BF6E9415DC34155B900000000BA0000000049B80200000000000000C4626BF6E9C4626BF6E94983C0FF75F0415DC3 ... ``` Oops! Not only does that not look fun, i did hit that pitfail during AMD Zen 3 enablement. While i have since then addressed this in rGd4d459e7475b4bb0d15280f12ed669342fa5edcd, i suspect there may be other buggy results lying around, so we should at least stop producing them. Reviewed By: courbet Differential Revision: https://reviews.llvm.org/D109275	2021-09-07 12:39:23 +03:00
Jinsong Ji	878c2a42ec	[RuntimeDyld] Guard UsedTLSStorage to x86 ELF only UsedTLSStorage is only used in allocateTLSSection, guarded in x87 ELF only. So clang will emit error with -Werror on. .../llvm/tools/llvm-rtdyld/llvm-rtdyld.cpp:288:12: error: private field 'UsedTLSStorage' is not used [-Werror,-Wunused-private-field] unsigned UsedTLSStorage = 0; ^	2021-09-07 01:20:38 +00:00
Moritz Sichert	a0a5964499	[RuntimeDyld] Implemented relocation of TLS symbols in ELF Differential Revision: https://reviews.llvm.org/D105466	2021-09-06 10:27:43 +02:00
Nikita Popov	ab79ffdb74	[verify-uselistorder] Support -force-opaque-pointers By creating LLVMContext after parsing parameters.	2021-09-04 22:41:31 +02:00
Wenlei He	a5d3cac033	[llvm-profgen] Turn off cold context trimming by default We merge cold context by default to save profile size. However trimming cold context after merging doesn't save size much, so default to off to reflect how it's commonly used. Differential Revision: https://reviews.llvm.org/D109166	2021-09-02 12:29:06 -07:00
Wenlei He	6eca242e09	[llvm-profgen] Deduplicate and improve warning for truncated context This change improves the warning for truncated context by: 1) deduplicate them as one call without probe can appear in many different context leading to duplicated warnings , 2) rephrase the message to make it easier to understand. The term "untracked frame" can be confusing. Differential Revision: https://reviews.llvm.org/D109115	2021-09-02 09:15:38 -07:00
Kazu Hirata	e1bb54b593	[clangd, llvm] Remove redundant calls to c_str() (NFC) Identified with readability-redundant-string-cstr.	2021-09-02 09:07:13 -07:00
Markus Lavin	304f2bd21d	[NPM] Added opt option -print-pipeline-passes. Added opt option -print-pipeline-passes to print a -passes compatible string describing the built pass pipeline. As an example: $ opt -enable-new-pm=1 -adce -licm -simplifycfg -o /dev/null /dev/null -print-pipeline-passes verify,function(adce),function(loop-mssa(licm)),function(simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;no-switch-to-lookup;keep-loops;no-hoist-common-insts;no-sink-common-insts>),verify,BitcodeWriterPass At the moment this is best-effort only and there are some known limitations: - Not all passes accepting parameters will print their parameters (currently only implemented for simplifycfg). - Some ClassName to pass-name mappings are not unique. - Some ClassName to pass-name mappings are missing (e.g. BitcodeWriterPass). Differential Revision: https://reviews.llvm.org/D108298	2021-09-02 08:23:33 +02:00
Markus Lavin	645af79e8e	Revert "[NPM] Added opt option -print-pipeline-passes." This reverts commit `c71869ed4c`.	2021-09-02 08:22:17 +02:00
Markus Lavin	c71869ed4c	[NPM] Added opt option -print-pipeline-passes. Added opt option -print-pipeline-passes to print a -passes compatible string describing the built pass pipeline. As an example: $ opt -enable-new-pm=1 -adce -licm -simplifycfg -o /dev/null /dev/null -print-pipeline-passes verify,function(adce),function(loop-mssa(licm)),function(simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;no-switch-to-lookup;keep-loops;no-hoist-common-insts;no-sink-common-insts>),verify,BitcodeWriterPass At the moment this is best-effort only and there are some known limitations: - Not all passes accepting parameters will print their parameters (currently only implemented for simplifycfg). - Some ClassName to pass-name mappings are not unique. - Some ClassName to pass-name mappings are missing (e.g. BitcodeWriterPass).	2021-09-02 08:16:51 +02:00
Wenlei He	f10004e7dd	[CSSPGO] Add stats for pre-inliner Add some stats to help tuning pre-inliner. Differential Revision: https://reviews.llvm.org/D109098	2021-09-01 20:03:50 -07:00
Wenlei He	4ef88031f5	[llvm-profdata] Fix assertion from invalid iterator Differential Revision: https://reviews.llvm.org/D109096	2021-09-01 14:42:00 -07:00
Hongtao Yu	7ca8030030	[CSSPGO] Enable loading MD5 CS profile. Adding the compiler support of MD5 CS profile based on pervious context split work D107299. A MD5 CS profile is about 40% smaller than the string-based extbinary profile. As a result, the compilation is 15% faster. There are a few conversion from real names to md5 names that have been made on the sample loader and context tracker side to get it work. Reviewed By: wenlei, wmi Differential Revision: https://reviews.llvm.org/D108342	2021-09-01 09:19:47 -07:00
Vy Nguyen	3afa2151f8	[llvm-ar][nfc] Reword help message to be less ambiguous on what p and t do. The current help msg isn't super clear on whether t prints the content of the files or just the list of files. (I'd certainly thought it'd print the list of files, and accidentally had a bunch of "gargabe" printed to my terminal). Similarly, t sounded like it'd do what p actually did. Differential Revision: https://reviews.llvm.org/D109018	2021-08-31 17:48:04 -04:00
wlei	964053d56f	[llvm-profgen] Support LBR only perf script This change aims at supporting LBR only sample perf script which is used for regular(Non-CS) profile generation. A LBR perf script includes a batch of LBR sample which starts with a frame pointer and a group of 32 LBR entries is followed. The FROM/TO LBR pair and the range between two consecutive entries (the former entry's TO and the latter entry's FROM) will be used to infer function profile info. An example of LBR perf script(created by `perf script -F ip,brstack -i perf.data`) ``` 40062f 0x40062f/0x4005b0/P/-/-/9 0x400645/0x4005ff/P/-/-/1 0x400637/0x400645/P/-/-/1 ... 4005d7 0x4005d7/0x4005e5/P/-/-/8 0x40062f/0x4005b0/P/-/-/6 0x400645/0x4005ff/P/-/-/1 ... ... ``` For implementation: - Extended a new child class `LBRPerfReader` for the sample parsing, reused all the functionalities in `extractLBRStack` except for an extension to parsing leading instruction pointer. - `HybridSample` is reused(just leave the call stack empty) and the parsed samples is still aggregated in `AggregatedSamples`. After that, range samples, branch sample, address samples are computed and recorded. - Reused `ContextSampleCounterMap` to store the raw profile, since it's no need to aggregation by context, here it just registered one sample counter with a fake context key. - Unified to use `show-raw-profile` instead of `show-unwinder-output` to dump the intermediate raw profile, see the comments of the format of the raw profile. For CS profile, it remains to output the unwinder output. Profile generation part will come soon. Differential Revision: https://reviews.llvm.org/D108153	2021-08-31 13:28:17 -07:00
Hongtao Yu	b9db70369b	[CSSPGO] Split context string to deduplicate function name used in the context. Currently context strings contain a lot of duplicated function names and that significantly increase the profile size. This change split the context into a series of {name, offset, discriminator} tuples so function names used in the context can be replaced by the index into the name table and that significantly reduce the size consumed by context. A follow-up improvement made in the compiler and profiling tools is to avoid reconstructing full context strings which is time- and memory- consuming. Instead a context vector of `StringRef` is adopted to represent the full context in all scenarios. As a result, the previous prevalent profile map which was implemented as a `StringRef` is now engineered as an unordered map keyed by `SampleContext`. `SampleContext` is reshaped to using an `ArrayRef` to represent a full context for CS profile. For non-CS profile, it falls back to use `StringRef` to represent a contextless function name. Both the `ArrayRef` and `StringRef` objects are underpinned by real array and string objects that are stored in producer buffers. For compiler, they are maintained by the sample reader. For llvm-profgen, they are maintained in `ProfiledBinary` and `ProfileGenerator`. Full context strings can be generated only in those cases of debugging and printing. When it comes to profile format, nothing has changed to the text format, though internally CS context is implemented as a vector. Extbinary format is only changed for CS profile, with an additional `SecCSNameTable` section which stores all full contexts logically in the form of `vector<int>`, which each element as an offset points to `SecNameTable`. All occurrences of contexts elsewhere are redirected to using the offset of `SecCSNameTable`. Testing This is no-diff change in terms of code quality and profile content (for text profile). For our internal large service (aka ads), the profile generation is cut to half, with a 20x smaller string-based extbinary format generated. The compile time of ads is dropped by 25%. Differential Revision: https://reviews.llvm.org/D107299	2021-08-30 20:09:29 -07:00
Nikita Popov	ae5e5f2011	[llc] Initialize context for parsing options This will allow using -force-opaque-pointers in codegen tests.	2021-08-28 22:37:26 +02:00
Haowei Wu	31e61c58b0	[ifs] Add option to hide undefined symbols This change add an option to llvm-ifs to hide undefined symbols from its output. Differential Revision: https://reviews.llvm.org/D108428	2021-08-27 11:15:56 -07:00
Andrea Di Biagio	0dc5dc6531	[MCA][NFC] Removed unused method, and fixed a coverity issue. The coverity issue was reported agaist class MCAOperand due to the lack of proper initialization for field Index. No functional change intended.	2021-08-27 12:49:49 +01:00
Lang Hames	b749ef9e22	[ORC][ORC-RT] Reapply "Introduce ELF/*nix Platform and runtime..." with fixes. This reapplies `e256445bff`, which was reverted in `45ac5f5441` due to bot errors (e.g. https://lab.llvm.org/buildbot/#/builders/112/builds/8599). The issue that caused the bot failure was fixed in `2e6a4fce35`.	2021-08-27 14:41:58 +10:00
Esme-Yi	b21ed75e10	[llvm-readobj][XCOFF] Add support for `--needed-libs` option. Summary: This patch is trying to add support for llvm-readobj --needed-libs option under XCOFF. For XCOFF, the needed libraries can be found from the Import File ID Name Table of the Loader Section. Currently, I am using binary inputs in the test since yaml2obj does not yet support for writing the Loader Section and the import file table. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D106643	2021-08-26 07:17:06 +00:00
Wenlei He	a45d72e024	[CSSPGO] Add switch for sample loader to honor global pre-inliner decision from llvm-profgen The change adds a switch to allow sample loader to use global pre-inliner's decision instead. The pre-inliner in llvm-profgen makes inline decision globally based on whole program profile and function byte size as cost proxy. Since pre-inliner also adjusts/merges context profile based on its inline decision, honoring its inline decision in sample loader would lead to better post-inline profile quality especially for thinlto where cross module profile merging isn't possible without pre-inliner. Minor fix in profile reader is also included. When pre-inliner is use, we now also turn off the default merging and trimming logic unless it's explicitly asked. Differential Revision: https://reviews.llvm.org/D108677	2021-08-25 17:20:15 -07:00

... 3 4 5 6 7 ...

13448 Commits