llvm-project

Commit Graph

Author	SHA1	Message	Date
Keith Smiley	e972e49b11	[llvm-cov] Add error for invalid -path-equivalence format Differential Revision: https://reviews.llvm.org/D109042	2021-09-10 18:34:37 -07:00
Alfonso Sánchez-Beato	b25ab4f313	[llvm-objcopy][COFF] Fix test for debug dir presence If the number of directories was 6 (equal to the DEBUG_DIRECTORY index), patchDebugDirectory() was run even though the debug directory is actually the 7th entry. Use <= in the comparison to fix that. This fixes https://llvm.org/PR51243 Differential Revision: https://reviews.llvm.org/D106940 Reviewed by: jhenderson	2021-09-10 09:57:18 +01:00
Chris Lattner	735f46715d	[APInt] Normalize naming on keep constructors / predicate methods. This renames the primary methods for creating a zero value to `getZero` instead of `getNullValue` and renames predicates like `isAllOnesValue` to simply `isAllOnes`. This achieves two things: 1) This starts standardizing predicates across the LLVM codebase, following (in this case) ConstantInt. The word "Value" doesn't convey anything of merit, and is missing in some of the other things. 2) Calling an integer "null" doesn't make any sense. The original sin here is mine and I've regretted it for years. This moves us to calling it "zero" instead, which is correct! APInt is widely used and I don't think anyone is keen to take massive source breakage on anything so core, at least not all in one go. As such, this doesn't actually delete any entrypoints, it "soft deprecates" them with a comment. Included in this patch are changes to a bunch of the codebase, but there are more. We should normalize SelectionDAG and other APIs as well, which would make the API change more mechanical. Differential Revision: https://reviews.llvm.org/D109483	2021-09-09 09:50:24 -07:00
Alfonso Sánchez-Beato	b33fd31772	[yaml2obj][COFF] Allow variable number of directories Allow variable number of directories, as allowed by the specification. NumberOfRvaAndSize will default to 16 if not specified, as in the past. Reviewed by: jhenderson Differential Revision: https://reviews.llvm.org/D108825	2021-09-09 11:16:56 +01:00
Alexey Lapshin	50467c0852	[llvm-objcopy][NFC] Refactor CopyConfig structure - categorize options. This patch continues refactoring done by D99055. It puts format specific options into the correponding CopyConfig structures. Differential Revision: https://reviews.llvm.org/D102277	2021-09-08 19:16:38 +03:00
Nikita Popov	f5832eaaad	[UseListOrder] Fix use list order for function operands Functions can have a personality function, as well as prefix and prologue data as additional operands. Unused operands are assigned a dummy value of i1* null. This patch addresses multiple issues in use-list order preservation for these: * Fix verify-uselistorder to also enumerate the dummy values. This means that now use-list order values of these values are shuffled even if there is no other mention of i1* null in the module. This results in failures of Assembler/call-arg-is-callee.ll, Assembler/opaque-ptr.ll and Bitcode/use-list-order2.ll. * The use-list order prediction in ValueEnumerator does not take into account the fact that a global may use a value more than once and leaves uses in the same global effectively unordered. We should be comparing the operand number here, as we do for the more general case. * While we enumerate all operands of a function together (which seems sensible to me), the bitcode reader would first resolve prefix data for all function, then prologue data for all functions, then personality functions for all functions. Change this to resolve all operands for a given function together instead. Differential Revision: https://reviews.llvm.org/D109282	2021-09-07 20:59:12 +02:00
Maksim Panchenko	6300e4ac58	[llvm-objdump] Fix 'llvm-objdump -dr' for executables with relocations Print relocations interleaved with disassembled instructions for executables with relocatable sections, e.g. those built with "-Wl,-q". Differential Revision: https://reviews.llvm.org/D109016	2021-09-07 11:24:24 -07:00
Roman Lebedev	e030f808ec	[Exegesis] Native clusterization: sub-partition by sched class id Currently native clusterization simply groups all benchmarks by the opcode of key instruction, but that is suboptimal in certain cases, e.g. where we can already tell that the particular instructions already resolve into different sched classes.	2021-09-07 17:54:37 +03:00
Peter Smith	5e71839f77	[MC] Add MCSubtargetInfo to MCAlignFragment In preparation for passing the MCSubtargetInfo (STI) through to writeNops so that it can use the STI in operation at the time, we need to record the STI in operation when a MCAlignFragment may write nops as padding. The STI is currently unused, a further patch will pass it through to writeNops. There are many places that can create an MCAlignFragment, in most cases we can find out the STI in operation at the time. In a few places this isn't possible as we are in initialisation or finalisation, or are emitting constant pools. When possible I've tried to find the most appropriate existing fragment to obtain the STI from, when none is available use the per module STI. For constant pools we don't actually need to use EmitCodeAlign as the constant pools are data anyway so falling through into it via an executable NOP is no better than falling through into data padding. This is a prerequisite for D45962 which uses the STI to emit the appropriate NOP for the STI. Which can differ per fragment. Note that involves an interface change to InitSections. It is now called initSections and requires a SubtargetInfo as a parameter. Differential Revision: https://reviews.llvm.org/D45961	2021-09-07 15:46:19 +01:00
Roman Lebedev	03512ae9bf	[exegesis][X86] ParallelSnippetGenerator: don't accidentally create serialized instructions In the case of no tied variables, we pick random defs, and then random uses that don't alias with defs we just picked. Sounds good, except that an X86 instruction may have implicit reg uses, e.g. for `MULX` it's `EDX`/`RDX`: `Intel SDM, 4-162 Vol. 2B MULX — Unsigned Multiply Without Affecting Flags` > Performs an unsigned multiplication of the implicit source operand (EDX/RDX) and the specified source operand > (the third operand) and stores the low half of the result in the second destination (second operand), the high half > of the result in the first destination operand (first operand), without reading or writing the arithmetic flags. And indeed, every once in a while `llvm-exegesis` happened to pick EDX as a def while measuring throughput, and producing garbage output: ``` $ ./bin/llvm-exegesis -num-repetitions=1000000 -mode=inverse_throughput -repetition-mode=min --loop-body-size=4096 -dump-object-to-disk=false -opcode-name=MULX32rr --max-configs-per-opcode=65536 --- mode: inverse_throughput key: instructions: - 'MULX32rr EDX R11D R12D' config: '' register_initial_values: - 'R12D=0x0' - 'EDX=0x0' cpu_name: znver3 llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 1000000 measurements: - { key: inverse_throughput, value: 4.00014, per_snippet_value: 4.00014 } error: '' info: instruction has no tied variables picking Uses different from defs assembled_snippet: 415441BC00000000BA00000000C4C223F6D4C4C223F6D4C4C223F6D4C4C223F6D4415CC3415441BC00000000BA0000000049B80200000000000000C4C223F6D4C4C223F6D44983C0FF75F0415CC3 ... ``` ``` $ ./bin/llvm-exegesis -num-repetitions=1000000 -mode=inverse_throughput -repetition-mode=min --loop-body-size=4096 -dump-object-to-disk=false -opcode-name=MULX32rr --max-configs-per-opcode=65536 --- mode: inverse_throughput key: instructions: - 'MULX32rr R13D EDX ECX' config: '' register_initial_values: - 'ECX=0x0' - 'EDX=0x0' cpu_name: znver3 llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 1000000 measurements: - { key: inverse_throughput, value: 3.00013, per_snippet_value: 3.00013 } error: '' info: instruction has no tied variables picking Uses different from defs assembled_snippet: 4155B900000000BA00000000C4626BF6E9C4626BF6E9C4626BF6E9C4626BF6E9415DC34155B900000000BA0000000049B80200000000000000C4626BF6E9C4626BF6E94983C0FF75F0415DC3 ... ``` Oops! Not only does that not look fun, i did hit that pitfail during AMD Zen 3 enablement. While i have since then addressed this in rGd4d459e7475b4bb0d15280f12ed669342fa5edcd, i suspect there may be other buggy results lying around, so we should at least stop producing them. Reviewed By: courbet Differential Revision: https://reviews.llvm.org/D109275	2021-09-07 12:39:23 +03:00
Jinsong Ji	878c2a42ec	[RuntimeDyld] Guard UsedTLSStorage to x86 ELF only UsedTLSStorage is only used in allocateTLSSection, guarded in x87 ELF only. So clang will emit error with -Werror on. .../llvm/tools/llvm-rtdyld/llvm-rtdyld.cpp:288:12: error: private field 'UsedTLSStorage' is not used [-Werror,-Wunused-private-field] unsigned UsedTLSStorage = 0; ^	2021-09-07 01:20:38 +00:00
Moritz Sichert	a0a5964499	[RuntimeDyld] Implemented relocation of TLS symbols in ELF Differential Revision: https://reviews.llvm.org/D105466	2021-09-06 10:27:43 +02:00
Nikita Popov	ab79ffdb74	[verify-uselistorder] Support -force-opaque-pointers By creating LLVMContext after parsing parameters.	2021-09-04 22:41:31 +02:00
Wenlei He	a5d3cac033	[llvm-profgen] Turn off cold context trimming by default We merge cold context by default to save profile size. However trimming cold context after merging doesn't save size much, so default to off to reflect how it's commonly used. Differential Revision: https://reviews.llvm.org/D109166	2021-09-02 12:29:06 -07:00
Wenlei He	6eca242e09	[llvm-profgen] Deduplicate and improve warning for truncated context This change improves the warning for truncated context by: 1) deduplicate them as one call without probe can appear in many different context leading to duplicated warnings , 2) rephrase the message to make it easier to understand. The term "untracked frame" can be confusing. Differential Revision: https://reviews.llvm.org/D109115	2021-09-02 09:15:38 -07:00
Kazu Hirata	e1bb54b593	[clangd, llvm] Remove redundant calls to c_str() (NFC) Identified with readability-redundant-string-cstr.	2021-09-02 09:07:13 -07:00
Markus Lavin	304f2bd21d	[NPM] Added opt option -print-pipeline-passes. Added opt option -print-pipeline-passes to print a -passes compatible string describing the built pass pipeline. As an example: $ opt -enable-new-pm=1 -adce -licm -simplifycfg -o /dev/null /dev/null -print-pipeline-passes verify,function(adce),function(loop-mssa(licm)),function(simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;no-switch-to-lookup;keep-loops;no-hoist-common-insts;no-sink-common-insts>),verify,BitcodeWriterPass At the moment this is best-effort only and there are some known limitations: - Not all passes accepting parameters will print their parameters (currently only implemented for simplifycfg). - Some ClassName to pass-name mappings are not unique. - Some ClassName to pass-name mappings are missing (e.g. BitcodeWriterPass). Differential Revision: https://reviews.llvm.org/D108298	2021-09-02 08:23:33 +02:00
Markus Lavin	645af79e8e	Revert "[NPM] Added opt option -print-pipeline-passes." This reverts commit `c71869ed4c`.	2021-09-02 08:22:17 +02:00
Markus Lavin	c71869ed4c	[NPM] Added opt option -print-pipeline-passes. Added opt option -print-pipeline-passes to print a -passes compatible string describing the built pass pipeline. As an example: $ opt -enable-new-pm=1 -adce -licm -simplifycfg -o /dev/null /dev/null -print-pipeline-passes verify,function(adce),function(loop-mssa(licm)),function(simplifycfg<bonus-inst-threshold=1;no-forward-switch-cond;no-switch-to-lookup;keep-loops;no-hoist-common-insts;no-sink-common-insts>),verify,BitcodeWriterPass At the moment this is best-effort only and there are some known limitations: - Not all passes accepting parameters will print their parameters (currently only implemented for simplifycfg). - Some ClassName to pass-name mappings are not unique. - Some ClassName to pass-name mappings are missing (e.g. BitcodeWriterPass).	2021-09-02 08:16:51 +02:00
Wenlei He	f10004e7dd	[CSSPGO] Add stats for pre-inliner Add some stats to help tuning pre-inliner. Differential Revision: https://reviews.llvm.org/D109098	2021-09-01 20:03:50 -07:00
Wenlei He	4ef88031f5	[llvm-profdata] Fix assertion from invalid iterator Differential Revision: https://reviews.llvm.org/D109096	2021-09-01 14:42:00 -07:00
Hongtao Yu	7ca8030030	[CSSPGO] Enable loading MD5 CS profile. Adding the compiler support of MD5 CS profile based on pervious context split work D107299. A MD5 CS profile is about 40% smaller than the string-based extbinary profile. As a result, the compilation is 15% faster. There are a few conversion from real names to md5 names that have been made on the sample loader and context tracker side to get it work. Reviewed By: wenlei, wmi Differential Revision: https://reviews.llvm.org/D108342	2021-09-01 09:19:47 -07:00
Vy Nguyen	3afa2151f8	[llvm-ar][nfc] Reword help message to be less ambiguous on what p and t do. The current help msg isn't super clear on whether t prints the content of the files or just the list of files. (I'd certainly thought it'd print the list of files, and accidentally had a bunch of "gargabe" printed to my terminal). Similarly, t sounded like it'd do what p actually did. Differential Revision: https://reviews.llvm.org/D109018	2021-08-31 17:48:04 -04:00
wlei	964053d56f	[llvm-profgen] Support LBR only perf script This change aims at supporting LBR only sample perf script which is used for regular(Non-CS) profile generation. A LBR perf script includes a batch of LBR sample which starts with a frame pointer and a group of 32 LBR entries is followed. The FROM/TO LBR pair and the range between two consecutive entries (the former entry's TO and the latter entry's FROM) will be used to infer function profile info. An example of LBR perf script(created by `perf script -F ip,brstack -i perf.data`) ``` 40062f 0x40062f/0x4005b0/P/-/-/9 0x400645/0x4005ff/P/-/-/1 0x400637/0x400645/P/-/-/1 ... 4005d7 0x4005d7/0x4005e5/P/-/-/8 0x40062f/0x4005b0/P/-/-/6 0x400645/0x4005ff/P/-/-/1 ... ... ``` For implementation: - Extended a new child class `LBRPerfReader` for the sample parsing, reused all the functionalities in `extractLBRStack` except for an extension to parsing leading instruction pointer. - `HybridSample` is reused(just leave the call stack empty) and the parsed samples is still aggregated in `AggregatedSamples`. After that, range samples, branch sample, address samples are computed and recorded. - Reused `ContextSampleCounterMap` to store the raw profile, since it's no need to aggregation by context, here it just registered one sample counter with a fake context key. - Unified to use `show-raw-profile` instead of `show-unwinder-output` to dump the intermediate raw profile, see the comments of the format of the raw profile. For CS profile, it remains to output the unwinder output. Profile generation part will come soon. Differential Revision: https://reviews.llvm.org/D108153	2021-08-31 13:28:17 -07:00
Hongtao Yu	b9db70369b	[CSSPGO] Split context string to deduplicate function name used in the context. Currently context strings contain a lot of duplicated function names and that significantly increase the profile size. This change split the context into a series of {name, offset, discriminator} tuples so function names used in the context can be replaced by the index into the name table and that significantly reduce the size consumed by context. A follow-up improvement made in the compiler and profiling tools is to avoid reconstructing full context strings which is time- and memory- consuming. Instead a context vector of `StringRef` is adopted to represent the full context in all scenarios. As a result, the previous prevalent profile map which was implemented as a `StringRef` is now engineered as an unordered map keyed by `SampleContext`. `SampleContext` is reshaped to using an `ArrayRef` to represent a full context for CS profile. For non-CS profile, it falls back to use `StringRef` to represent a contextless function name. Both the `ArrayRef` and `StringRef` objects are underpinned by real array and string objects that are stored in producer buffers. For compiler, they are maintained by the sample reader. For llvm-profgen, they are maintained in `ProfiledBinary` and `ProfileGenerator`. Full context strings can be generated only in those cases of debugging and printing. When it comes to profile format, nothing has changed to the text format, though internally CS context is implemented as a vector. Extbinary format is only changed for CS profile, with an additional `SecCSNameTable` section which stores all full contexts logically in the form of `vector<int>`, which each element as an offset points to `SecNameTable`. All occurrences of contexts elsewhere are redirected to using the offset of `SecCSNameTable`. Testing This is no-diff change in terms of code quality and profile content (for text profile). For our internal large service (aka ads), the profile generation is cut to half, with a 20x smaller string-based extbinary format generated. The compile time of ads is dropped by 25%. Differential Revision: https://reviews.llvm.org/D107299	2021-08-30 20:09:29 -07:00
Nikita Popov	ae5e5f2011	[llc] Initialize context for parsing options This will allow using -force-opaque-pointers in codegen tests.	2021-08-28 22:37:26 +02:00
Haowei Wu	31e61c58b0	[ifs] Add option to hide undefined symbols This change add an option to llvm-ifs to hide undefined symbols from its output. Differential Revision: https://reviews.llvm.org/D108428	2021-08-27 11:15:56 -07:00
Andrea Di Biagio	0dc5dc6531	[MCA][NFC] Removed unused method, and fixed a coverity issue. The coverity issue was reported agaist class MCAOperand due to the lack of proper initialization for field Index. No functional change intended.	2021-08-27 12:49:49 +01:00
Lang Hames	b749ef9e22	[ORC][ORC-RT] Reapply "Introduce ELF/*nix Platform and runtime..." with fixes. This reapplies `e256445bff`, which was reverted in `45ac5f5441` due to bot errors (e.g. https://lab.llvm.org/buildbot/#/builders/112/builds/8599). The issue that caused the bot failure was fixed in `2e6a4fce35`.	2021-08-27 14:41:58 +10:00
Esme-Yi	b21ed75e10	[llvm-readobj][XCOFF] Add support for `--needed-libs` option. Summary: This patch is trying to add support for llvm-readobj --needed-libs option under XCOFF. For XCOFF, the needed libraries can be found from the Import File ID Name Table of the Loader Section. Currently, I am using binary inputs in the test since yaml2obj does not yet support for writing the Loader Section and the import file table. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D106643	2021-08-26 07:17:06 +00:00
Wenlei He	a45d72e024	[CSSPGO] Add switch for sample loader to honor global pre-inliner decision from llvm-profgen The change adds a switch to allow sample loader to use global pre-inliner's decision instead. The pre-inliner in llvm-profgen makes inline decision globally based on whole program profile and function byte size as cost proxy. Since pre-inliner also adjusts/merges context profile based on its inline decision, honoring its inline decision in sample loader would lead to better post-inline profile quality especially for thinlto where cross module profile merging isn't possible without pre-inliner. Minor fix in profile reader is also included. When pre-inliner is use, we now also turn off the default merging and trimming logic unless it's explicitly asked. Differential Revision: https://reviews.llvm.org/D108677	2021-08-25 17:20:15 -07:00
Alfonso Sánchez-Beato	cdd407286a	[llvm-objcopy] [COFF] Consider section flags when adding section The --set-section-flags option was being ignored when adding a new section. Take it into account if present. Fixes https://llvm.org/PR51244 Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D106942	2021-08-25 23:11:41 +03:00
Patrick Holland	fe01014faa	[MCA] Moved View.h and View.cpp from /tools/llvm-mca/ to /lib/MCA/. Moved View.h and View.cpp from /tools/llvm-mca/Views/ to /lib/MCA/ and /include/llvm/MCA/. This is so that targets can define their own Views within the /lib/Target/ directory (so that the View can use backend functionality). To enable these Views within mca, targets will need to add them to the vector of Views returned by their target's CustomBehaviour::getViews() methods. Differential Revision: https://reviews.llvm.org/D108520	2021-08-25 12:12:47 -07:00
Wenlei He	a6f15e9a49	[CSSPGO] Use probe inline tree to track zero size fully optimized context for pre-inliner This is a follow up diff for BinarySizeContextTracker to track zero size for fully optimized inlinee. When an inlinee is fully optimized away, we won't be able to get its size through symbolizing instructions, hence we will treat the corresponding context size as unknown. However by traversing the inlined probe forest, we know what're original inlinees regardless of optimization. If a context show up in inlined probes, but not during symbolization, we know that it's fully optimized away hence its size is zero instead of unknown. It should provide more accurate size cost estimation for pre-inliner to make better inline decisions in llvm-profgen. Differential Revision: https://reviews.llvm.org/D108350	2021-08-25 09:01:11 -07:00
Nico Weber	67ffce68bc	Make WindowsManifestMerger::merge() take a MemoryBufferRef No behavior change.	2021-08-24 16:39:20 -04:00
Fangrui Song	9b96b0865d	llvm-xray {convert,extract}: Add --demangle No demangling may be a better default in the future. Add `--demangle` for migration convenience. Reviewed By: Enna1 Differential Revision: https://reviews.llvm.org/D108100	2021-08-24 13:35:19 -07:00
Kirill Stoimenov	b97ca3aca1	Revert "[asan] Implemented intrinsic for the custom calling convention similar used by HWASan for X86." This reverts commit `9588b685c6`. Breaks a bunch of builds. Reviewed By: GMNGeoffrey Differential Revision: https://reviews.llvm.org/D108658	2021-08-24 13:21:20 -07:00
Kirill Stoimenov	9588b685c6	[asan] Implemented intrinsic for the custom calling convention similar used by HWASan for X86. The implementation uses the int_asan_check_memaccess intrinsic to instrument the code. The intrinsic is replaced by a call to a function which performs the access check. The generated function names encode the input register name as a number using Reg - X86::NoRegister formula. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D107850	2021-08-24 19:34:34 +00:00
Arthur Eubanks	2d8a2a91b1	[llvm-reduce] Check if module data strings are empty before attempting to reduce	2021-08-24 10:23:00 -07:00
Arthur Eubanks	d2e103644b	[llvm-reduce] Remove various module data This removes the data layout, target triple, source filename, and module identifier when possible. Reviewed By: swamulism Differential Revision: https://reviews.llvm.org/D108568	2021-08-24 09:45:31 -07:00
Christian Fetzer	9116211d18	[Coverage][llvm-cov] Correctly export branch coverage in LCOV format Commit `9f2967bcfe` introduced support for branch coverage including export to the LCOV format. This commit corrects the LCOV field name for branches from BFH to BRH. The mistake seems to have slipped in as typo because the correct field name BRH is used in the comment section at the beginning of the file. Differential Revision: https://reviews.llvm.org/D108358	2021-08-20 13:44:25 -05:00
Bjorn Pettersson	d52f506192	[NewPM] Use parameterized syntax for a couple of more passes A couple of passes that are parameterized in new-PM used different pass names (in cmd line interface) while using the same pass class name. This patch updates the PassRegistry to model pass parameters more properly using PASS_WITH_PARAMS. Reason for the change is to ensure that we have a 1-1 mapping between class name and pass name (when disregarding the params). With a 1-1 mapping it is more obvious which pass name to use in options such as -debug-only, -print-after etc. The opt -passes syntax is changed for the following passes: early-cse-memssa => early-cse<memssa> post-inline-ee-instrument => ee-instrument<post-inline> loop-extract-single => loop-extract<single> lower-matrix-intrinsics-minimal => lower-matrix-intrinsics<minimal> This patch is not updating pass names in docs/Passes.rst. Not quite sure what the status is for that document (e.g. when it comes to listing pass paramters). It is only loop-extract-single that is mentioned in Passes.rst today, out of the passes mentioned above. Differential Revision: https://reviews.llvm.org/D108362	2021-08-20 14:59:21 +02:00
Maryam Benimmar	2cdfd0b259	[AIX][XCOFF] 64-bit relocation reading support Support XCOFFDumper relocation reading support This patch is part of D103696 partition Reviewed By: daltenty, Helflym Differential Revision: https://reviews.llvm.org/D104646	2021-08-19 21:56:57 -04:00
Bjorn Pettersson	36d5138619	[NewPM] Make some sanitizer passes parameterized in the PassRegistry Refactored implementation of AddressSanitizerPass and HWAddressSanitizerPass to use pass options similar to passes like MemorySanitizerPass. This makes sure that there is a single mapping from class name to pass name (needed by D108298), and options like -debug-only and -print-after makes a bit more sense when (despite that it is the unparameterized pass name that should be used in those options). A result of the above is that some pass names are removed in favor of the parameterized versions: - "khwasan" is now "hwasan<kernel;recover>" - "kasan" is now "asan<kernel>" - "kmsan" is now "msan<kernel>" Differential Revision: https://reviews.llvm.org/D105007	2021-08-19 12:43:37 +02:00
Andrzej Warzynski	dcc6b7b1d5	[OptTable] Refine how `printHelp` treats empty help texts Currently, `printHelp` behaves differently for options that: * do not define `HelpText` (such options _are not printed_), and * define its `HelpText` as `HelpText<"">` (such options _are printed_). In practice, both approaches lead to no help text and `printHelp` should treat them consistently. This patch addresses that by making `printHelpt` check the length of the help text to be printed. All affected tests have been updated accordingly. The option definitions for llvm-cvtres have been updated with a short description or "Not implemented" for options that are ignored by the tool. Differential Revision: https://reviews.llvm.org/D107557	2021-08-19 09:30:15 +00:00
Wenlei He	eca03d2768	[CSSPGO] Track and use context-sensitive post-optimization function size to drive global pre-inliner in llvm-profgen This change enables llvm-profgen to use accurate context-sensitive post-optimization function byte size as a cost proxy to drive global preinline decisions. To do this, BinarySizeContextTracker is introduced to track function byte size under different inline context during disassembling. In preinliner, we can not query context byte size under switch `context-cost-for-preinliner`. The tracker uses a reverse trie to keep size of functions under different context (callee as parent, caller as child), and it can give best/longest possible matching context size for given input context. The new size cost is off by default. There're a few TODOs that needs to addressed: 1) avoid dangling string from `Offset2LocStackMap`, which will be addressed in split context work; 2) using inlinee's entry probe to make sure we have correct zero size for inlinee that's completely optimized away after inlining. Some tuning is also needed. Differential Revision: https://reviews.llvm.org/D108180	2021-08-18 22:50:57 -07:00
Rong Xu	5fdaaf7fd8	[SampleFDO] Flow Sensitive Sample FDO (FSAFDO) profile loader This patch implements Flow Sensitive Sample FDO (FSAFDO) profile loader. We have two profile loaders for FS profile, one before RegAlloc and one before BlockPlacement. To enable it, when -fprofile-sample-use=<profile> is specified, add "-enable-fs-discriminator=true \ -disable-ra-fsprofile-loader=false \ -disable-layout-fsprofile-loader=false" to turn on the FS profile loaders. Differential Revision: https://reviews.llvm.org/D107878	2021-08-18 18:37:35 -07:00
Maryam Benimmar	7151a8aada	[PowerPC][AIX] llvm-readobj: Convert some errors to warnings. Report warnings rather than errors, so that llvm-readobj doesn't bail out on malformed inputs. Differential Revision: https://reviews.llvm.org/D106783	2021-08-18 11:04:08 -04:00
Lang Hames	45ac5f5441	Revert "[ORC-RT][ORC] Introduce ELF/*nix Platform and runtime support." This reverts commit `e256445bff`. This commit broke some of the bots (see e.g. https://lab.llvm.org/buildbot/#/builders/112/builds/8599). Reverting while I investigate.	2021-08-18 20:42:23 +10:00
Lang Hames	e256445bff	[ORC-RT][ORC] Introduce ELF/*nix Platform and runtime support. This change adds support to ORCv2 and the Orc runtime library for static initializers, C++ static destructors, and exception handler registration for ELF-based platforms, at present Linux and FreeBSD on x86_64. It is based on the MachO platform and runtime support introduced in `bb5f97e3ad`. Patch by Peter Housel. Thanks very much Peter! Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D108081	2021-08-18 15:00:22 +10:00
Xu Mingjie	168ee72718	[NFC][llvm-xray] add a llvm-xray convert option `no-demangle` When option `--symbolize` is true, llvm-xray convert will demangle function name on default. This patch adds a llvm-xray convert option `no-demangle` to determine whether to demangle function name when symbolizing function ids from the input log. Reviewed By: MaskRay, smeenai Differential Revision: https://reviews.llvm.org/D108019	2021-08-18 12:22:04 +08:00
Arthur Eubanks	de0ae9e89e	[NFC] Cleanup more AttributeList::addAttribute()	2021-08-17 21:05:41 -07:00
wlei	9af46710fe	[llvm-profgen] Move profiled binary loading out of PerfReader Change to use unique pointer of profiled binary to unblock asan. At same time, I realized we can decouple to move the profiled binary loading out of PerfReader, so I made some other related refactors. Reviewed By: hoy Differential Revision: https://reviews.llvm.org/D108254	2021-08-17 17:28:01 -07:00
Jozef Lawrynowicz	108ba4f4a4	[llvm-readobj] Refactor ELFDumper::printAttributes() The current implementation of printAttributes makes it fiddly to extend attribute support for new targets. By refactoring the code so all target specific variables are initialized in a switch/case statement, it becomes simpler to extend attribute support for new targets. Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D107968	2021-08-17 13:28:31 -07:00
wlei	f812c19253	[llvm-profgen] Clean up code dealing with multiple binaries As we decided to support only one binary each time, this patch cleans up the related code dealing with multiple binaries. We can use `llvm-profdata` to merge profile from multiple binaries. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D108002	2021-08-17 12:16:07 -07:00
Fangrui Song	c56b4cfd4b	[llvm-objdump] -T: print symbol versions Similar to D94907 (llvm-nm -D). The output will match GNU objdump 2.37. Older versions don't use ` (version)` for undefined symbols. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D108097	2021-08-17 09:10:50 -07:00
Fangrui Song	78cb1adc5c	[Object] Move llvm-nm's symbol version utility to ELFObjectFile::readDynsymVersions The utility can be reused by llvm-objdump -T. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D108096	2021-08-17 09:06:39 -07:00
Fangrui Song	54e76cb17a	[split-file] Default to --no-leading-lines It turns out that the --leading-lines may be a bad default. [[#@LINE+-num]] is rarely used.	2021-08-16 19:23:11 -07:00
Arthur Eubanks	10bbf41737	Add missed rename of getFnAttributes() -> getFnAttrs()	2021-08-13 11:29:20 -07:00
Arthur Eubanks	80ea2bb574	[NFC] Rename AttributeList::getParam/Ret/FnAttributes() -> get*Attributes() This is more consistent with similar methods.	2021-08-13 11:16:52 -07:00
Vyacheslav Zakharin	15497e62f6	[openmp][ELF] Recognize LLVM OpenMP offload specific notes The new ELF notes are added in clang-offload-wrapper, and llvm-readobj has to visualize them properly. Differential Revision: https://reviews.llvm.org/D99552	2021-08-12 13:47:48 -07:00
Bill Wendling	b61359f92b	[llvm-diff] correct variable typo	2021-08-12 11:29:48 -07:00
Bill Wendling	39809eb1b3	[llvm-diff] add "reset" method to DiffConsumer A DiffConsumer object may be reused, but we'd like to reset it before the next use. No functionality change intended. Differential Revision: https://reviews.llvm.org/D107985	2021-08-12 11:25:47 -07:00
Hongtao Yu	ccb5b9bbfb	[CSSPGO] Allow the use of debug-info-for-profiling and pseudo-probe-for-profiling together Previoulsy debug-info-for-profiling and pseudo-probe-for-profiling are mutual exclusive because they compete the dwarf discrimnator for callsites on the IR. This changes allows to use the two switches together. The side effect is that callsite discriminators will be taken by pseudo probe, while discriminators for other instructions are still available for AutoFDO use. This is less than ideal, however, it still allows us a chance to smoothly transition from AutoFDO to CSSPGO, by collecting both profiles from a CSSPGO binary. Reviewed By: wenlei, wmi Differential Revision: https://reviews.llvm.org/D107876	2021-08-12 08:52:49 -07:00
Igor Kudrin	68616584c3	[llvm-objcopy][ELF] Avoid reordering section headers As for now, llvm-objcopy sorts section headers according to the offsets of the sections in the input file. That can corrupt section references in the dynamic symbol table because it is a loadable section and as such is not updated by the tool. Even though the section references are not required for loading the binary correctly, they are still handy for a user who analyzes the file. While the patch removes global reordering of section headers, it layouts the sections in the same way as before, i.e. according to their original offsets. All that helps the output file to resemble the input better. Note that the patch removes sorting SHT_GROUP sections to the start of the list, which was introduced in D62620 in order to ensure that they come before the group members, along with the corresponding test. The original issue was caused by the sorting of section headers, so dropping the sorting also resolves the issue. Differential Revision: https://reviews.llvm.org/D107653	2021-08-12 17:12:09 +07:00
wlei	856a6a5041	[CSSPGO][llvm-profgen] Trim and merge context beforehand to reduce memory usage Currently we use a centralized string map(StringMap<FunctionSamples> ProfileMap) to store the profile while populating the sample, which might cause the memory usage bottleneck. I saw in an extreme case, there are thousands of samples whose context stack depth is >= 100. The memory consumption can be greater than 100GB. As here the context is used for inlining, we can assume we won't have so many of inlinees keeping inlined at the same root function, so this change tried to cap the context stack and merge the samples for peak memory reduction and this is done after recursion compression. The default value is -1 meaning no depth limit, in the future we can tune to a smaller one. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D107800	2021-08-11 16:02:35 -07:00
Christopher Di Bella	c874dd5362	[llvm][clang][NFC] updates inline licence info Some files still contained the old University of Illinois Open Source Licence header. This patch replaces that with the Apache 2 with LLVM Exception licence. Differential Revision: https://reviews.llvm.org/D107528	2021-08-11 02:48:53 +00:00
Igor Kudrin	2bb4ebb19e	[llvm-objcopy][ELF][NFC] Remove unneeded methods of Object The patch removes mutable accessor methods for sections and segments. As for now, const variants of them are not used because all callers have mutable access to an instance of Object. On the other hand, they do not actually modify the sets, so it looks better to keep only const ones. Differential Revision: https://reviews.llvm.org/D107652	2021-08-09 15:44:03 +07:00
Esme-Yi	aefdce8b39	fix the error caused by D107333: llvm/tools/llvm-readobj/XCOFFDumper.cpp:464:5: error: call to member function 'printNumber' is ambiguous W.printNumber("Length", StrTabSize);	2021-08-09 08:36:39 +00:00
Esme-Yi	f49c3a6882	[llvm-readobj][XCOFF] Print the length of the string table. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D107333	2021-08-09 06:47:15 +00:00
Andrea Di Biagio	45685a1fc4	[MCA] Simplify the rounding logic used in TimelineView::printWaitTimeEntry. This is related to PR51392. Before this patch, the timeline view was rounding doubles to the first decimal, using a logic similar to this: ``` double AverageTime = (double)Input / CumulativeExecutions; double Result = floor((AverageTime * 10) + 0.5) / 10 ``` Here, Input and CumulativeExecutions are both unsigned integers. The last operation is what effectively performs the rounding of AverageTime. PR51392 has been raised because - under specific -m32 configurations of GCC - one of the timeline tests reports slighlty different values (due to a different rounding choice). This patch tries to minimise the propagation of floating-point error by hoisting the multiply by 10, so that it is performed on the unsigned. ``` double AverageTime = (double)(Input * 10) / CumulativeExecutions; floor(AverageTime + 0.5) / 10 ``` So we are trading a floating point multiply for a integer multiply (which can be expanded using a simple MUL or using an `ADD + LEA` sequence). This decrease in floating point operations executed should also help with decreasing the error in the computation.. Strictly speaking, that computation will always be potentially subject to error (depending on what values are passed in input). However, this patch should improve the situation and make bug like PR51392 less frequent.	2021-08-07 11:59:41 +01:00
Pirama Arumuga Nainar	16ebb7ab5c	[llvm-objcopy] [COFF] Do not patch debug entries if PointerToRawData is zero Fix an edge case missed by https://reviews.llvm.org/D78921. For e.g., the Repro debug entry (generated with the /Brepro linker flag) does not have a debug-directory payload. Do not attempt to patch Debug entries without a payload. Differential Revision: https://reviews.llvm.org/D107324	2021-08-06 09:23:25 -07:00
Esme-Yi	2919ac8971	[llvm-readobj][XCOFF] Warn about invalid offset Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D107398	2021-08-06 08:54:02 +00:00
wlei	a8a38ef3d9	[llvm-profgen] Fix bug of loop scope mismatch One performance issue happened in profile generation and it turned out the line 525 loop is the bottleneck. Moving the code outside of loop scope can fix this issue. The run time is improved from 30+mins to ~30s. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D107529	2021-08-05 16:52:57 -07:00
Bill Wendling	4d293f215d	[llvm-diff] Create libLLVMDiff library Some tools may want to use the LLVM "diff" code. Move the code into a library for easy use. No functionality change intende. Differential Revision: https://reviews.llvm.org/D107392	2021-08-05 12:05:50 -07:00
Matt Morehouse	ec5137029b	Revert "[llvm-diff] Create libLLVMDiff library" This reverts commit `9854f2f30f` since it broke all the builds.	2021-08-05 11:10:58 -07:00
Bill Wendling	fec8f1a008	Remove unintended commit.	2021-08-05 10:51:37 -07:00
Bill Wendling	9854f2f30f	[llvm-diff] Create libLLVMDiff library Some tools may want to use the LLVM "diff" code. Move the code into a library for easy use. No functionality change intende. Differential Revision: https://reviews.llvm.org/D107392	2021-08-05 10:36:01 -07:00
Lang Hames	13c8ec44e6	[llvm-jitlink] Don't try to guess the ORC runtime path. ORC-runtime regression tests will now explicitly specify the runtime path.	2021-08-05 18:46:34 +10:00
Martin Storsjö	46020f6f0c	[llvm-rc] Allow specifying language with a leading 0x prefix This option is always interpreted strictly as a hexadecimal string, even if it has no prefix that indicates the number format, hence the existing call to StringRef::getAsInteger(16, ...). StringRef::getAsInteger(0, ...) consumes a leading "0x" prefix is present, but when the radix is specified, the radix shouldn't be included. Both MS rc.exe and GNU windres accept the language with that prefix. Also allow specifying the codepage to llvm-windres with a different radix, as GNU windres allows that (but MS rc.exe doesn't). This fixes https://llvm.org/PR51295. Differential Revision: https://reviews.llvm.org/D107263	2021-08-05 10:19:55 +03:00
Igor Kudrin	2c14798ead	[ARM][llvm-objdump] Annotate PC-relative memory operands of VLDR instructions This extends D105979 and adds support for VLDR instructions. Differential Revision: https://reviews.llvm.org/D105980	2021-08-05 14:11:11 +07:00
jamesluox	ee7d20e846	[CSSPGO] Migrate and refactor the decoder of Pseudo Probe Migrate pseudo probe decoding logic in llvm-profgen to MC, so other LLVM-base program could reuse existing codes. Redesign object layout of encoded and decoded pseudo probes. Reviewed By: hoy Differential Revision: https://reviews.llvm.org/D106861	2021-08-04 09:21:34 -07:00
Esme-Yi	737e27f623	[llvm-readobj][XCOFF] dump the string table only if the size is bigger than 4.	2021-08-04 06:28:26 +00:00
Vitaly Buka	3df1e7e6f0	[llvm-readobj][XCOFF] Warn about invalid offset Followup for D105522 Differential Revision: https://reviews.llvm.org/D107398	2021-08-03 20:11:26 -07:00
wlei	f1affe8dc8	[llvm-profgen][CSSPGO] Support count based aggregated type of hybrid perf script This change tried to integrate a new count based aggregated type of perf script. The only difference of the format is that an aggregated count is added at the head of the original sample which means the same samples are repeated to the given count times. This is used to reduce the perf script size. e.g. ``` 2 4005dc 400634 400684 7f68c5788793 0x4005c8/0x4005dc/P/-/-/0 .... ``` Implemented by a dedicated PerfReader `AggregatedHybridPerfReader`. Differential Revision: https://reviews.llvm.org/D107192	2021-08-03 17:56:35 -07:00
Vitaly Buka	033ca45d4f	[llvm-readobj] Fix UB in pointer arithmetics after D105522	2021-08-03 10:38:29 -07:00
wlei	fe3ba90830	[llvm-profgen] Support perf script without parsing MMap events This change supports to run without parsing MMap binary loading events instead it always assumes binary is loaded at the preferred address. This is used when we have assured no binary load address changes or we have pre-processed the addresses resolution. Warn if there's interior mmap event but without leading mmap events. Reviewed By: hoy Differential Revision: https://reviews.llvm.org/D107097	2021-08-03 10:01:07 -07:00
Simon Pilgrim	43ff058e78	[llvm-objcopy] IHexELFBuilder::addDataSections - fix evaluation ordering static analyzer warning As detailed on https://pvs-studio.com/en/blog/posts/cpp/0771/ and raised on D62583, the SecNo++ increment is not guaranteed to occur before the second use of SecNo in the same addSection() call. This patch pulls out the increment (just for clarity) and replaces the second use of SecNo with a constant zero value (we're using stable_sort so the value isn't critical). Differential Revision: https://reviews.llvm.org/D107273	2021-08-03 12:16:59 +01:00
Esme-Yi	69396896fb	[llvm-readobj][XCOFF] Fix the error dumping for the first item of StringTable. Summary: For the string table in XCOFF, the first 4 bytes contains the length of the string table, so we should print the string entries from fifth bytes. This patch also adds tests for llvm-readobj dumping the string table. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D105522	2021-08-03 09:08:58 +00:00
wlei	6da9241aab	[llvm-profgen] Refactor PerfReader to allow different types of perf scripts In order to support different types of perf scripts, this change tried to refactor `PerfReader` by adding the base class `PerfReaderBase` and current HybridPerfReader is derived from it for CS profile generation. Common functions like, passMM2PEvents, extract_lbrs, extract_callstack, etc. can be reused. Next step is to add LBR only reader(for non-CS profile) and aggregated perf scripts reader(do a pre-aggregation of scripts). Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D107014	2021-08-02 17:18:47 -07:00
Alexander Yermolovich	5a865b0b1e	[DWARF] Don't process .debug_info relocations for DWO Context When we build with split dwarf in single mode the .o files that contain both "normal" debug sections and dwo sections, along with relocaiton sections for "normal" debug sections. When we create DWARF context in DWARFObjInMemory we process relocations and store them in the map for .debug_info, etc section. For DWO Context we also do it for non dwo dwarf sections. Which I believe is not necessary. This leads to a lot of memory being wasted. We observed 70GB extra memory being used. I went with context sensitive approach, flag is passed in. I am not sure if it's always safe not to process relocations for regular debug sections if Obj contains .dwo sections. If it is alternatvie might be just to scan, in constructor, sections and if there are .dwo sections not to process regular debug ones. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D106624	2021-08-02 10:41:47 -07:00
Tarindu Jayatilaka	7a797b2902	Take OptimizationLevel class out of Pass Builder Pulled out the OptimizationLevel class from PassBuilder in order to be able to access it from within the PassManager and avoid include conflicts. Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D107025	2021-07-29 21:57:23 -07:00
Daniel Rodríguez Troitiño	d6704e5ed9	[llvm-objcopy][MachO] Ignore all LC_SUB_* commands. The LC_SUB_FRAMEWORK, LC_SUB_UMBRELLA, LC_SUB_CLIENT, and LC_SUB_LIBRARY are used to indicate related libraries, binaries or framework names. Their only payload is the string with the name of the object. Adding those commands to the list of ignored/skipped load commands will avoid an error that stop the process of copying/stripping and will copy their contents verbatim. Additionally, in order to have a test for this case, `yaml2obj` now allows those four commands to contain a `Content`. Differential Revision: https://reviews.llvm.org/D106412	2021-07-28 17:35:26 -07:00
Patrick Holland	dbed061bf1	[MCA] Moving the target specific CustomBehaviour impl. from /tools/llvm-mca/ to /lib/Target/. Differential Revision: https://reviews.llvm.org/D106775	2021-07-28 11:23:18 -07:00
Fangrui Song	6da3d8b19c	[llvm] Replace LLVM_ATTRIBUTE_NORETURN with C++11 [[noreturn]] [[noreturn]] can be used since Oct 2016 when the minimum compiler requirement was bumped to GCC 4.8/MSVC 2015. Note: the definition of LLVM_ATTRIBUTE_NORETURN is kept for now.	2021-07-28 09:31:14 -07:00
Wael Yehia	9559bd1990	[LTO][Legacy] Add new API to check presence of ctor/dtor functions. On AIX, the linker needs to check whether a given lto_module_t contains any constructor/destructor functions, in order to implement the behavior of the -bcdtors:all flag. See https://www.ibm.com/docs/en/aix/7.2?topic=l-ld-command for the flag's documentation. In llvm IR, constructor (destructor) functions are added to a special global array @llvm.global_ctors (@llvm.global_dtors). However, because these two symbols are artificial, they are not visited during the symbol traversal (using the lto_module_get_[num_symbols\|symbol_name\|symbol_attribute] API). This patch adds a new function to the libLTO interface that checks the presence of one or both of these two symbols. Reviewed By: steven_wu Differential Revision: https://reviews.llvm.org/D106887	2021-07-28 12:41:56 +00:00
Lang Hames	2487db1f28	[ORC] Require ExecutorProcessControl when constructing an ExecutionSession. Wrapper function call and dispatch handler helpers are moved to ExecutionSession, and existing EPC-based tools are re-written to take an ExecutionSession argument instead. Requiring an ExecutorProcessControl instance simplifies existing EPC based utilities (which only need to take an ES now), and should encourage more utilities to use the EPC interface. It also simplifies process termination, since the session can automatically call ExecutorProcessControl::disconnect (previously this had to be done manually, and carefully ordered with the rest of JIT tear-down to work correctly).	2021-07-27 16:53:49 +10:00
Tom Stellard	dbefcde6da	Merge all the llvm-exegesis unit tests into a single binary These tests access private symbols in the backends, so they cannot link against libLLVM.so and must be statically linked. Linking these tests can be slow and with debug builds the resulting binaries use a lot of disk space. By merging them into a single test binary means we now only need to statically link 1 test instead of 6, which helps reduce the build times and saves disk space. Reviewed By: courbet Differential Revision: https://reviews.llvm.org/D106464	2021-07-26 22:07:07 -07:00
Lang Hames	25986a21ef	[llvm-jitlink] Don't hardcode LLVM version number into the runtime path. This should unbreak builders that were failing due to different patch numbers.	2021-07-27 13:04:50 +10:00
Fangrui Song	c5d8bd5a35	[llvm-objcopy] Fix section group flag read/write when operating on a cross-endian object file	2021-07-26 15:09:15 -07:00
Fangrui Song	792c206e2b	[llvm-objcopy] Drop GRP_COMDAT if the group signature is localized See [GRP_COMDAT group with STB_LOCAL signature](https://groups.google.com/g/generic-abi/c/2X6mR-s2zoc) objcopy PR: https://sourceware.org/bugzilla/show_bug.cgi?id=27931 GRP_COMDAT deduplication is purely based on the signature symbol name in ld.lld/GNU ld/gold. The local/global status is not part of the equation. If the signature symbol is localized by --localize-hidden or --keep-global-symbol, the intention is likely to make the group fully localized. Drop GRP_COMDAT to suppress deduplication. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D106782	2021-07-26 09:05:18 -07:00
Fangrui Song	c0da287c30	[yaml2obj][MachO] Rename PayloadString to Content The new name is conciser and matches yaml2obj ELF & DWARF. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D106759	2021-07-26 09:04:51 -07:00
gbreynoo	87ed73fe6e	[llvm-readobj] Display multiple function names for stack size entries The current implementation of displaying .stack_size information presumes that each entry represents a single function but this is not always the case. For example with the use of ICF multiple functions can be represented with the same code, meaning that the address found in a .stack_size entry corresponds to multiple function symbols. This change allows multiple function names to be displayed when appropriate. Differential Revision: https://reviews.llvm.org/D105884	2021-07-26 14:49:53 +01:00
Martin Storsjö	0a1683f8cc	[llvm-rc] Allow dashes as part of resource name strings This matches what MS rc.exe allows in practice. I'm not aware of any legal syntax case that are broken by allowing dashes as part of what the tokenizer considers an Identifier - but I'm not very well versed in the RC syntax either, can @amccarth think of any case that would be broken by this? This fixes downstream bug https://github.com/msys2/MINGW-packages/issues/9180. Additionally, rc.exe allows such resource name strings to be surrounded by quotes, ending up with e.g. Resource name (string): "QUOTEDNAME" (i.e., the quotes end up as part of the string), which llvm-rc doesn't support yet either. (I'm not aware of such cases in the wild though, but resource string names with dashes do exist.) This also allows including files with unquoted paths, with filenames containing dashes (which fixes https://github.com/msys2/MINGW-packages/issues/9130, which has been worked around differently so far). Differential Revision: https://reviews.llvm.org/D106598	2021-07-23 23:05:20 +03:00
Fangrui Song	31677c6481	[llvm-symbolizer] Remove one-dash long options Most modern tools only accept two-dash long options. Remove one-dash long options which are not recognized by GNU style `getopt_long`. This ensures long options cannot collide with grouped short options. Note: llvm-symbolizer has `-demangle={true,false}` for pprof compatibility (for a while). They are kept. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D106377	2021-07-23 08:35:45 -07:00
Gulfem Savrun Yeniceri	e50a38840d	[profile] Add binary id into profiles This patch adds binary id into profiles to easily associate binaries with the corresponding profiles. There is an RFC that discusses the motivation, design and implementation in more detail: https://lists.llvm.org/pipermail/llvm-dev/2021-June/151154.html Differential Revision: https://reviews.llvm.org/D102039	2021-07-23 00:19:12 +00:00
Alexander Yermolovich	f8c6515554	[DWP] Refactoring llvm-dwp in to a library part 2 This is follow up to https://reviews.llvm.org/D106198 where llvm-dwp was refactored in to multiple files. In this patch moving them in to lib/include directories. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D106493	2021-07-22 14:23:29 -07:00
Timm Bäder	924d62ca4a	[llvm][tools] Hide remaining unrelated llvm- tool options Differential Revision: https://reviews.llvm.org/D106430	2021-07-22 09:47:55 +02:00
Hsiangkai Wang	0fe7531b75	[llvm-mc-assemble-fuzzer] Initialize MCTargetOptions. When run the command in the llvm-mc-assemble-fuzzer document, ``` llvm-mc-fuzzer --triple=aarch64-linux-gnu --fuzzer-args -max_len=4 ``` it triggers the following assertion: ``` llvm-mc-assemble-fuzzer: llvm-project/llvm/lib/MC/MCTargetOptionsCommandFlags.cpp:38: bool llvm::mc::getRelaxAll(): Assertion `RelaxAllView && "RegisterMCTargetOptionsFlags not created."' failed. ``` It is caused by no global RegisterMCTargetOptionsFlags object to initialize the MC target options. Differential Revision: https://reviews.llvm.org/D106417	2021-07-22 14:36:37 +08:00
Bill Wendling	635288d215	[llvm-diff] Check for recursive initialiers We need to check for recursive initializers in the "ConstantStruct" case. Differential Revision: https://reviews.llvm.org/D105616	2021-07-21 14:21:21 -07:00
Gulfem Savrun Yeniceri	fd895bc81b	Revert "[profile] Add binary id into profiles" Revert "[profile] Change linkage type of a compiler-rt func" This reverts commits `f984ac2715` and `467c719124` because it broke some builds.	2021-07-21 19:15:18 +00:00
Gulfem Savrun Yeniceri	f984ac2715	[profile] Add binary id into profiles This patch adds binary id into profiles to easily associate binaries with the corresponding profiles. There is an RFC that discusses the motivation, design and implementation in more detail: https://lists.llvm.org/pipermail/llvm-dev/2021-June/151154.html Differential Revision: https://reviews.llvm.org/D102039	2021-07-21 17:55:43 +00:00
Eric Astor	5fba605896	[ms] [llvm-ml] Support built-in text macros Add support for all built-in text macros supported by ML64: @Date, @Time, @FileName, @FileCur, and @CurSeg. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D104965	2021-07-21 11:44:09 -04:00
Guillaume Chatelet	d6da02d952	[llvm] Add enum iteration to Sequence This patch allows iterating typed enum via the ADT/Sequence utility. It also changes the original design to better separate concerns: - `StrongInt` only deals with safe `intmax_t` operations, - `SafeIntIterator` presents the iterator and reverse iterator interface but only deals with safe `StrongInt` internally. - `iota_range` only deals with `SafeIntIterator` internally. This design ensures that operations are always valid. In particular, "Out of bounds" assertions fire when: - the `value_type` is not representable as an `intmax_t` - iterator operations make internal computation underflow/overflow - the internal representation cannot be converted back to `value_type` Differential Revision: https://reviews.llvm.org/D106279	2021-07-21 12:48:53 +00:00
Timm Bäder	d16f154240	[llvm][tools] Hide more unrelated LLVM tool options Differential Revision: https://reviews.llvm.org/D106366	2021-07-21 09:14:04 +02:00
Alexander Yermolovich	51db236324	[DWP] Fix for Refactoring llvm-dwp in to a library Fix build for https://reviews.llvm.org/D106198 when -DBUILD_SHARED_LIBS=ON. Test Plan: Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D106414	2021-07-20 18:17:24 -07:00
Alexander Yermolovich	403e67d34d	[DWP] Refactoring llvm-dwp in to a library. This is a step1, mechanical refactor, of moving the bulk of llvm-dwp functionality in to a library. This should allow other tools, like BOLT, to re-use some of the llvm-dwp functionality. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D106198	2021-07-20 17:19:26 -07:00
Fangrui Song	db5e078690	[LTO] Add SelectionKind to IRSymtab and use it in ld.lld/LLVMgold In PGO, a C++ external linkage function `foo` has a private counter `__profc_foo` and a private `__profd_foo` in a `comdat nodeduplicate`. A `__attribute__((weak))` function `foo` has a weak hidden counter `__profc_foo` and a private `__profd_foo` in a `comdat nodeduplicate`. In `ld.lld a.o b.o`, say a.o defines an external linkage `foo` and b.o defines a weak `foo`. Currently we treat `comdat nodeduplicate` as `comdat any`, ld.lld will incorrectly consider `b.o:__profc_foo` non-prevailing. In the worst case when `b.o:__profd_foo` is retained and `b.o:__profc_foo` isn't, there will be dangling reference causing an `undefined hidden symbol` error. Add SelectionKind to `Comdat` in IRSymtab and let linkers ignore nodeduplicate comdat. Differential Revision: https://reviews.llvm.org/D106228	2021-07-20 13:22:00 -07:00
Timm Bäder	669275f8a0	[llvm][tools] Hide more unrelated tool options Differential Revision: https://reviews.llvm.org/D106271	2021-07-20 13:27:33 +02:00
Arthur Eubanks	5de114b650	[NewPM][opt] Add -debug-pass-manager=quiet to not print analysis info Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D106307	2021-07-19 15:08:26 -07:00
Haowei Wu	a2cc67eaf0	[ifs] Fix linking errors on some llvm builders This change fixes linking errors on some llvm builders.	2021-07-19 12:01:34 -07:00
Haowei Wu	6103fdfab4	[ifs][elfabi] Merge llvm-ifs/elfabi tools This change merges llvm-elfabi and llvm-ifs tools. Differential Revision: https://reviews.llvm.org/D100139	2021-07-19 11:23:19 -07:00
Haowei Wu	61fa9afe4c	[ifs] Prepare llvm-ifs for elfabi/ifs merging. This diff changes llvm-ifs to use unified IFS file format and perform other renaming changes in preparation for the merging between elfabi/ifs. Differential Revision: https://reviews.llvm.org/D99810	2021-07-19 11:23:00 -07:00
Haowei Wu	8b4acb067f	[elfabi] Prepare elfabi/ifs merging. This change implements unified text stub format and command line interface proposed in the elfabi/ifs merge plan. Differential Revision: https://reviews.llvm.org/D99399	2021-07-19 11:22:43 -07:00
Lang Hames	249510a004	[ORC] Add missing std::move. This should fix the build failure at https://lab.llvm.org/buildbot/#/builders/58/builds/11428.	2021-07-19 20:37:53 +10:00
Lang Hames	bb5f97e3ad	[ORC][ORC-RT] Introduce ORC-runtime based MachO-Platform. Adds support for MachO static initializers/deinitializers and eh-frame registration via the ORC runtime. This commit introduces cooperative support code into the ORC runtime and ORC LLVM libraries (especially the MachOPlatform class) to support macho runtime features for JIT'd code. This commit introduces support for static initializers, static destructors (via cxa_atexit interposition), and eh-frame registration. Near-future commits will add support for MachO native thread-local variables, and language runtime registration (e.g. for Objective-C and Swift). The llvm-jitlink tool is updated to use the ORC runtime where available, and regression tests for the new MachOPlatform support are added to compiler-rt. Notable changes on the ORC runtime side: 1. The new macho_platform.h / macho_platform.cpp files contain the bulk of the runtime-side support. This includes eh-frame registration; jit versions of dlopen, dlsym, and dlclose; a cxa_atexit interpose to record static destructors, and an '__orc_rt_macho_run_program' function that defines running a JIT'd MachO program in terms of the jit- dlopen/dlsym/dlclose functions. 2. Replaces JITTargetAddress (and casting operations) with ExecutorAddress (copied from LLVM) to improve type-safety of address management. 3. Adds serialization support for ExecutorAddress and unordered_map types to the runtime-side Simple Packed Serialization code. 4. Adds orc-runtime regression tests to ensure that static initializers and cxa-atexit interposes work as expected. Notable changes on the LLVM side: 1. The MachOPlatform class is updated to: 1.1. Load the ORC runtime into the ExecutionSession. 1.2. Set up standard aliases for macho-specific runtime functions. E.g. ___cxa_atexit -> ___orc_rt_macho_cxa_atexit. 1.3. Install the MachOPlatformPlugin to scrape LinkGraphs for information needed to support MachO features (e.g. eh-frames, mod-inits), and communicate this information to the runtime. 1.4. Provide entry-points that the runtime can call to request initializers, perform symbol lookup, and request deinitialiers (the latter is implemented as an empty placeholder as macho object deinits are rarely used). 1.5. Create a MachO header object for each JITDylib (defining the __mh_header and __dso_handle symbols). 2. The llvm-jitlink tool (and llvm-jitlink-executor) are updated to use the runtime when available. 3. A `lookupInitSymbolsAsync` method is added to the Platform base class. This can be used to issue an async lookup for initializer symbols. The existing `lookupInitSymbols` method is retained (the GenericIRPlatform code is still using it), but is deprecated and will be removed soon. 4. JIT-dispatch support code is added to ExecutorProcessControl. The JIT-dispatch system allows handlers in the JIT process to be associated with 'tag' symbols in the executor, and allows the executor to make remote procedure calls back to the JIT process (via __orc_rt_jit_dispatch) using those tags. The primary use case is ORC runtime code that needs to call bakc to handlers in orc::Platform subclasses. E.g. __orc_rt_macho_jit_dlopen calling back to MachOPlatform::rt_getInitializers using __orc_rt_macho_get_initializers_tag. (The system is generic however, and could be used by non-runtime code). The new ExecutorProcessControl::JITDispatchInfo struct provides the address (in the executor) of the jit-dispatch function and a jit-dispatch context object, and implementations of the dispatch function are added to SelfExecutorProcessControl and OrcRPCExecutorProcessControl. 5. OrcRPCTPCServer is updated to support JIT-dispatch calls over ORC-RPC. 6. Serialization support for StringMap is added to the LLVM-side Simple Packed Serialization code. 7. A JITLink::allocateBuffer operation is introduced to allocate writable memory attached to the graph. This is used by the MachO header synthesis code, and will be generically useful for other clients who want to create new graph content from scratch.	2021-07-19 19:50:16 +10:00
Andy Wingo	db69ea40a9	[llvm-objdump][WebAssembly] Fix llvm-objdump on files without symbols If a file has no symbols, perhaps because it is a linked executable, synthesize some symbols by walking the code section. Otherwise the disassembler will try to treat the whole code section as a function, which won't parse. Fixes https://bugs.llvm.org/show_bug.cgi?id=50957. Differential Revision: https://reviews.llvm.org/D105539	2021-07-19 08:59:26 +02:00
Lang Hames	89aa11ed28	[ORC] Remove LLVM-side MachO Platform runtime support. Support for this functionality is moving to the ORC runtime.	2021-07-17 14:25:31 +10:00
Fangrui Song	3f9004c19c	[llvm-readelf/llvm-readobj] Remove one-dash long options llvm-readelf is a user-facing tool which emulates GNU readelf. Remove one-dash long options which are not recognized by GNU style `getopt_long`. This ensures long options cannot collide with grouped short options. Note: the documentation (D63719)/help messages have recommended the double-dash forms since LLVM 9.0.0. llvm-readobj is intended as an internal tool which has some flexibility. llvm-readelf/llvm-readobj use the same option parsing code and llvm-readobj's one-dash long options aren't used after test migration. Differential Revision: https://reviews.llvm.org/D106037	2021-07-16 12:03:08 -07:00
Timm Bäder	69a5684531	[llvm][tools] Hide unrelated llvm-cfi-verify options Differential Revision: https://reviews.llvm.org/D106055	2021-07-16 10:43:52 +02:00
Mehdi Amini	76374573ce	Use ManagedStatic and lazy initialization of cl::opt in libSupport to make it free of global initializer We can build it with -Werror=global-constructors now. This helps in situation where libSupport is embedded as a shared library, potential with dlopen/dlclose scenario, and when command-line parsing or other facilities may not be involved. Avoiding the implicit construction of these cl::opt can avoid double-registration issues and other kind of behavior. Reviewed By: lattner, jpienaar Differential Revision: https://reviews.llvm.org/D105959	2021-07-16 07:38:16 +00:00
Mehdi Amini	8d051d8546	Revert "Use ManagedStatic and lazy initialization of cl::opt in libSupport to make it free of global initializer" This reverts commit `af9321739b`. Still some specific config broken in some way that requires more investigation.	2021-07-16 07:35:13 +00:00
Timm Bäder	3d3dc9523f	Revert "[llvm][tools] Hide unrelated llvm-cfi-verify options" This reverts commit `7c63726072`.	2021-07-16 09:30:57 +02:00
Timm Bäder	7c63726072	[llvm][tools] Hide unrelated llvm-cfi-verify options Differential Revision: https://reviews.llvm.org/D106055	2021-07-16 09:25:54 +02:00
Marcos Horro	77f2f0f9b7	[llvm-mca][JSON] Store extra information about driver flags used for the simulation Added information stored in PipelineOptions and the MCSubtargetInfo. Bug: https://bugs.llvm.org/show_bug.cgi?id=51041 Reviewed By: andreadb Differential Revision: https://reviews.llvm.org/D106077	2021-07-16 09:18:40 +02:00
Mehdi Amini	af9321739b	Use ManagedStatic and lazy initialization of cl::opt in libSupport to make it free of global initializer We can build it with -Werror=global-constructors now. This helps in situation where libSupport is embedded as a shared library, potential with dlopen/dlclose scenario, and when command-line parsing or other facilities may not be involved. Avoiding the implicit construction of these cl::opt can avoid double-registration issues and other kind of behavior. Reviewed By: lattner, jpienaar Differential Revision: https://reviews.llvm.org/D105959	2021-07-16 06:54:26 +00:00
Mehdi Amini	16b5e9d6a2	Revert "Use ManagedStatic and lazy initialization of cl::opt in libSupport to make it free of global initializer" This reverts commit `42f588f39c`. Broke some buildbots	2021-07-16 03:46:53 +00:00
Mehdi Amini	42f588f39c	Use ManagedStatic and lazy initialization of cl::opt in libSupport to make it free of global initializer We can build it with -Werror=global-constructors now. This helps in situation where libSupport is embedded as a shared library, potential with dlopen/dlclose scenario, and when command-line parsing or other facilities may not be involved. Avoiding the implicit construction of these cl::opt can avoid double-registration issues and other kind of behavior. Reviewed By: lattner, jpienaar Differential Revision: https://reviews.llvm.org/D105959	2021-07-16 03:33:20 +00:00
Vy Nguyen	a35480f859	[llvm-exegesis] Fix missing-headers build errors. Details: Switch all #includes to use <> because that is consistent with what happens in the cmake checks. Otherwise, we could be in the situation where cmake checks see that headers exist at <perfmon/...> but in llvm-exegesis code, we use "perfmon/...", which may not exist. Related PR/revisions: D84076, PR51017+D105615 Differential Revision: https://reviews.llvm.org/D105861	2021-07-15 13:20:25 -04:00
Fangrui Song	96e9bc4244	[llvm-nm] Remove one-dash long options except -arch The documentation and help messages have recommended the double-dash forms for quite a while. Remove one-dash long options which are not recognized by GNU style `getopt_long`. `-arch` is kept as it is in the manpage of classic nm https://keith.github.io/xcode-man-pages/nm.1.html Note: the dyldinfo related options don't have a test. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D105948	2021-07-15 09:50:37 -07:00
Timm Bäder	d9cdcfb069	[llvm][tools] Hide unrelated llvm-bcanalyzer options They otherwise show up when we link against the dynamic libLLVM.so. Differential Revision: https://reviews.llvm.org/D105893	2021-07-15 10:43:15 +02:00
Chuanqi Xu	8a1727ba51	[Coroutines] Run coroutine passes by default This patch make coroutine passes run by default in LLVM pipeline. Now the clang and opt could handle IR inputs containing coroutine intrinsics without special options. It should be fine. On the one hand, the coroutine passes seems to be stable since there are already many projects using coroutine feature. On the other hand, the coroutine passes should do nothing for IR who doesn't contain coroutine intrinsic. Test Plan: check-llvm Reviewed by: lxfind, aeubanks Differential Revision: https://reviews.llvm.org/D105877	2021-07-15 14:33:40 +08:00
Derek Schuff	7cb25f5387	[llvm-strip][WebAssembly] Support strip flags Summary: Add support for the basic section stripping (and keeping) flags for wasm: strip with no flags, --strip-all, --strip-debug, --only-section, --keep-section, and --only-keep-debug. Factor section removal into a function and use a predicate chain like the ELF implementation. Reviewers: jhenderson, sbc100 Differential Revision: https://reviews.llvm.org/D73820	2021-07-14 14:17:02 -07:00
Hongtao Yu	6b04ecaab3	[CSSPGO][llvm-profgen] Fix a missing initalization Fixing a missing initalization that accidentaly caused by https://reviews.llvm.org/D103178 .	2021-07-13 19:49:55 -07:00
Hongtao Yu	597e9c61ce	Revert "[CSSPGO][llvm-profgen] Fix a missing initalization" This reverts commit `fef5f4456a`.	2021-07-13 19:48:58 -07:00
Hongtao Yu	fef5f4456a	[CSSPGO][llvm-profgen] Fix a missing initalization Fixing a missing initalization that accidentaly caused by https://reviews.llvm.org/D103178 .	2021-07-13 19:46:18 -07:00
Hongtao Yu	cda2394d97	[NFC][CSSPGO] Rename the name of an enum value.	2021-07-13 18:30:16 -07:00
Hongtao Yu	0712038458	[CSSPGO][llvm-profgen] Allow multiple executable load segments. The linker or post-link optimizer can create an ELF image with multiple executable segments each of which will be loaded separately at run time. This breaks the assumption of llvm-profgen that currently only supports one base load address. What it ends up with is that the subsequent mmap events will be treated as an overwrite of the first mmap event which will in turn screw up address mapping. While it is non-trivial to support multiple separate load addresses and given that on x64 those segments will always be loaded at consecutive addresses (though via separate mmap sys calls), I'm adding an error checking logic to bail out if that's violated and keep using a single load address which is the address of the first executable segment. Also changing the disassembly output from printing section offset to printing the virtual address instead, which matches the behavior of objdump. Differential Revision: https://reviews.llvm.org/D103178	2021-07-13 18:22:24 -07:00
Alexander Yermolovich	24129fbc9a	[LLD] Adding support for RELA for CG Profile. This is a follow up to https://reviews.llvm.org/D104080, and `ca3bdb57fa (diff-e64a48fabe31db213a631fdc5f2acb51bdddf3f16a8fb2928784f4c579229585)`. The implementation of call graph profile was changed from a black box section to relocation approach. This was done to be compatible with post processing tools like strip/objcopy, and llvm equivalent. When they are invoked on object file before the final linking step with this new approach the symbol indices correctness is preserved. The GNU binutils tools change the REL section to RELA section, unlike llvm tools. For example when strip -S is run on the ELF object files, as an intermediate step before linking. To preserve compatibility this patch extends implementation in LLD and ELFDumper to support both REL and RELA sections for call graph profile. Reviewed By: MaskRay, jhenderson Differential Revision: https://reviews.llvm.org/D105217	2021-07-13 13:56:30 -07:00
Marcos Horro	14f77576c9	[llvm-mca] [NFC] Formatting code Applied clang-format to all files. Discarded BottleneckAnalysis.h 80-column width violation since it contains an example of report. Caught some typos and minor style details. Reviewed By: andreadb Differential Revision: https://reviews.llvm.org/D105900	2021-07-13 19:13:59 +02:00
Guillaume Chatelet	2c47b8847e	Revert "[llvm] Add enum iteration to Sequence" This reverts commit `a006af5d6e`.	2021-07-13 16:44:42 +00:00
Guillaume Chatelet	a006af5d6e	[llvm] Add enum iteration to Sequence This patch allows iterating typed enum via the ADT/Sequence utility. Differential Revision: https://reviews.llvm.org/D103900	2021-07-13 16:22:19 +00:00
Fangrui Song	fada2782cf	[llc] Default MCUseDwarfDirectory to true For Clang, `MCUseDwarfDirectory` is true by default for the majority cases (-fintegrated-as or -gdwarf-5; most targets use -fintegrated-as by default). Defaulting MCUseDwarfDirectory to true can reduce the differences between clang and llc. Reviewed By: #debug-info, dblaikie Differential Revision: https://reviews.llvm.org/D105856	2021-07-12 17:44:02 -07:00
Fangrui Song	46580d43fc	[llvm-readobj] Switch command line parsing from llvm::cl to OptTable Users should generally observe no difference as long as they don't use unintended option forms. Behavior changes: * `-t=d` is removed. Use `-t d` instead. * `--demangle=false` and `--demangle=0` cannot be used. Omit the option or use `--no-demangle`. Other flag-style options don't have `--no-` forms. * `--help-list` is removed. This is a `cl::` specific option. * llvm-readobj now supports grouped short options as well. * `--color` is removed. This is generally not useful (only apply to errors/warnings) but was inherited from Support. Some adjustment to the canonical forms (usually from GNU readelf; currently llvm-readobj has too many redundant aliases): * --dyn-syms is canonical. --dyn-symbols is a hidden alias * --file-header is canonical. --file-headers is a hidden alias * --histogram is canonical. --elf-hash-histogram is a hidden alias * --relocs is canonical. --relocations is a hidden alias * --section-groups is canonical. --elf-section-groups is a hidden alias OptTable avoids global option collision if we decide to support multiplexing for binary utilities. * Most one-dash long options are still supported. `-dt, -sd, -st, -sr` are dropped due to their conflict with grouped short options. * `--section-mapping=false` (D57365) is strange but is kept for now. * Many `cl::opt` variables were unnecessarily external. I added `static` whenever appropriate. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D105532	2021-07-12 10:14:42 -07:00
Fangrui Song	0144e625b9	[llvm-objcopy] Improve performance of long pattern lists Some users use a long list of fixed patterns (PR50404) and O(\|patterns\|*\|symbols\|) can be too slow. Such usage typically does not use --regex or --wildcard. We can use a DenseSet<CachedHashStringRef> to optimize name lookups. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D105218	2021-07-12 09:03:34 -07:00
Clement Courbet	04f8ffd983	[llvm-exegesis] Fix compilation with old libpfm versions. Do not try include `perfmon/perf_event.h` when we are not sure that it exists. Fixes PR51017. Differential Revision: https://reviews.llvm.org/D105615	2021-07-12 07:48:29 +02:00
Andrea Di Biagio	4fe0fcd1c0	[llvm-mca][JSON] Teach the PipelinePrinter how to deal with anonymous code regions (PR51008) This patch addresses the last remaining problems reported in PR51008. Previous fixes for PR51008 worked under the wrong assumption that code regions are always named (except maybe for the default region, which was automatically named "main"). In reality, it is quite common for users to declare multiple anonymous regions. So we cannot really use the region name as the key string of a JSON object. In practice, code region names are completely optional. Using "main" for the default region was also problematic because there can be another region with that same name. This patch fixes these issues by introducing a json::array of regions. Each region has a "Name" field, which would default to the empty string for anonymous regions. Added a few more tests to verify that the JSON file format is still valid, and that multiple anonymous regions all appear in the final output.	2021-07-10 13:57:52 +01:00
Andrea Di Biagio	d919bca875	[llvm-mca][JSON] Further refactoring of the JSON printing logic. This patch renames object "Resources" to "TargetInfo". Moved the getJSONTargetInfo method from class InstructionView to the PipelinePrinter. Removed uses of std::stringstream. Removed unused method View::printViewJSON().	2021-07-10 12:38:19 +01:00
Andrea Di Biagio	10cb036223	[llvm-mca] Refactor the logic that prints JSON files. Moved most of the printing logic into the PipelinePrinter. This patch also fixes the JSON output when flag -instruction-tables is specified.	2021-07-09 22:56:39 +01:00
Fangrui Song	a2860b8b04	[llvm-mca] Fix -Wunused-private-field after D105618	2021-07-09 10:30:16 -07:00
Fangrui Song	47db32e542	[llvm-size] Switch command line parsing from llvm::cl to OptTable Part of https://lists.llvm.org/pipermail/llvm-dev/2021-July/151622.html "Binary utilities: switch command line parsing from llvm::cl to OptTable" * `--totals=false` and `--totals=0` cannot be used. Omit the option. * `--help-list` is removed. This is a `cl::` specific option. OptTable avoids global option collision if we decide to support multiplexing for binary utilities. Note: because the tool is simple, and its long options are uncommon, I just drop the one-dash forms except `-arch <value>` (Darwin style). Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D105598	2021-07-09 10:26:53 -07:00
Fangrui Song	48de8bb0d3	[llvm-cxxfilt] Switch command line parsing from llvm::cl to OptTable Similar to D104889. The tool is very simple and its long options are uncommon, so just drop the one-dash form in this patch. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D105605	2021-07-09 10:10:45 -07:00
Marcos Horro	b11d31eb73	[llvm-mca] Fix JSON format for multiple regions Instead of printing each region individually when using JSON format, this patch creates a JSON object which is updated with the values of each region, printing them at the end. New test is added for JSON output with multiple regions. Bug: https://bugs.llvm.org/show_bug.cgi?id=51008 Reviewed By: andreadb Differential Revision: https://reviews.llvm.org/D105618	2021-07-09 18:04:16 +02:00
Martin Storsjö	beb0e7e338	[llvm-rc] Make commas in user data structs optional This matches what rc.exe tolerates in this type. This fixes cases like this: 1 24 BEGIN "<?xml version=""1.0""?>\n" "<assembly>\n" "</assembly>\n" END Differential Revision: https://reviews.llvm.org/D105621	2021-07-09 14:31:12 +03:00
Bjorn Pettersson	472462c472	[NewPM] Consistently use 'simplifycfg' rather than 'simplify-cfg' There was an alias between 'simplifycfg' and 'simplify-cfg' in the PassRegistry. That was the original reason for this patch, which effectively removes the alias. This patch also replaces all occurrances of 'simplify-cfg' by 'simplifycfg'. Reason for choosing that form for the name is that it matches the DEBUG_TYPE for the pass, and the legacy PM name and also how it is spelled out in other passes such as 'loop-simplifycfg', and in other options such as 'simplifycfg-merge-cond-stores'. I for some reason the name should be changed to 'simplify-cfg' in the future, then I think such a renaming should be more widely done and not only impacting the PassRegistry. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D105627	2021-07-09 09:47:03 +02:00
David Blaikie	1def2579e1	PR51018: Remove explicit conversions from SmallString to StringRef to future-proof against C++23 C++23 will make these conversions ambiguous - so fix them to make the codebase forward-compatible with C++23 (& a follow-up change I've made will make this ambiguous/invalid even in <C++23 so we don't regress this & it generally improves the code anyway)	2021-07-08 13:37:57 -07:00
Patrick Holland	d38b9f1f31	Revert "[MCA] [AMDGPU] Adding an implementation to AMDGPUCustomBehaviour for handling s_waitcnt instructions." Build failures when building with shared libraries. Reverting until I can fix. Differential Revision: https://reviews.llvm.org/D104730	2021-07-07 20:48:42 -07:00
Patrick Holland	af3baf1761	[MCA] [AMDGPU] Adding an implementation to AMDGPUCustomBehaviour for handling s_waitcnt instructions. This commit also makes some slight changes to the scheduling model for AMDGPU to set the RetireOOO flag for all scheduling classes. This flag is only used by llvm-mca and allows instructions to retire out of order. See the differential link below for a deeper explanation of everything. Differential Revision: https://reviews.llvm.org/D104730	2021-07-07 14:17:54 -07:00
Fangrui Song	cae3b831f4	[llvm-nm] Switch command line parsing from llvm::cl to OptTable Part of https://lists.llvm.org/pipermail/llvm-dev/2021-July/151622.html "Binary utilities: switch command line parsing from llvm::cl to OptTable" Users should generally observe no difference as long as they only use intended option forms. Behavior changes: * `-t=d` is removed. Use `-t d` instead. * `--demangle=0` cannot be used. Omit the option or use `--no-demangle` instead. * `--help-list` is removed. This is a `cl::` specific option. Note: * `-t` diagnostic gets improved. * This patch avoids cl::opt collision if we decide to support multiplexing for binary utilities * One-dash long options are still supported. * The `-s` collision (`-s segment section` for Mach-O) is unfortunate. `-s` means `--print-armap` in GNU nm. * This patch removes the last `cl::multi_val` use case from the `llvm/lib/Support/CommandLine.cpp` library `-M` (`--print-armap`), `-U` (`--defined-only`), and `-W` (`--no-weak`) are now deprecated. They could conflict with future GNU nm options. (--print-armap has an existing alias -s, so GNU will unlikely add a new one. --no-weak (not in GNU nm) is rarely used anyway.) `--just-symbol-name` is now deprecated in favor of `--format=just-symbols` and `-j`. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D105330	2021-07-07 13:34:33 -07:00
Martin Storsjö	9bcce1c9aa	[CodeView] Add missing cases for new enum values This fixes warnings while building llvm-pdbutil after `d20b013b49`.	2021-07-07 22:15:33 +03:00
Fangrui Song	98f078324f	[llvm-strings] Switch command line parsing from llvm::cl to OptTable Some behavior changes: * `-t=d` is removed. Use `-t d` instead. * one-dash long options like `-all` are supported. Use `--all` instead. * `--all=0` or `--all=false` cannot be used. (Note: `--all` is silently ignored anyway) * `--help-list` is removed. This is a `cl::` specific option. Nobody is likely leveraging any of the above. Advantages: * `-t` diagnostic gets improved. * in the absence of `HideUnrelatedOptions`, `--help` will not list unrelated options if linking against libLLVM-13git.so or linker GC is not used. * Decrease the probability of cl::opt collision if we do decide to support multiplexing Note: because the tool is so simple, used more for forensics instead of a building tool, and its long options are unlikely used in one-dash form, I just drop the one-dash form in this patch. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D104889	2021-07-05 10:46:17 -07:00
Esme-Yi	0dad3f6ee2	[llvm-readobj][XCOFF] Add support for printing the String Table. Summary: The patch adds the StringTable dumping to llvm-readobj. Currently only XCOFF is supported. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D104613	2021-07-05 04:16:58 +00:00
Nico Weber	7a2c1acd5f	[llvm-dwarfdump] Add comment saying where DumpDebugFrame comes from	2021-07-02 09:56:21 -04:00
Fangrui Song	5efffac71a	[llvm-symbolizer] Move setGroupedShortOptions and don't ignore case setGroupedShortOptions in the ctor seems more popular.	2021-07-01 19:43:49 -07:00
Daniel Rodríguez Troitiño	b77533fb70	[llvm-strip] Support grouped options in llvm-strip GNU and Apple `strip` implementations seems to support grouped options. Enable the support for grouped options introduced in https://reviews.llvm.org/D83639 for `llvm-strip` invocations. Includes test that checks that both the grouped and non grouped invocations produces the same result. Reviewed By: alexander-shaposhnikov, MaskRay Differential Revision: https://reviews.llvm.org/D105249	2021-07-01 13:36:45 -07:00
Marcos Horro	aa13e4fe7e	[llvm-mca] Fix JSON output (PR50922) Based on the discussion in PR50922, minor changes have been done to properly output a valid JSON. Removed "not implemented" keys. Differential Revision: https://reviews.llvm.org/D105064	2021-07-01 12:53:20 +01:00
Lang Hames	662c55442f	[ORC] Rename TargetProcessControl to ExecutorProcessControl. NFC. This is a first step towards consistently using the term 'executor' for the process that executes JIT'd code. I've opted for 'executor' as the preferred term over 'target' as target is already heavily overloaded ("the target machine for the executor" is much clearer than "the target machine for the target").	2021-07-01 13:31:12 +10:00
Jez Ng	6875165123	[llvm-objdump/mac] Print symbols at the same address deterministically By using stable_sort. Added a test case which previously failed when expensive checks were enabled. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D105240	2021-06-30 22:38:07 -04:00
Martin Storsjö	bf6770f9bd	[CMake] Don't use -Bsymbolic-functions for MinGW targets This is an ELF specific option which isn't supported for Windows/MinGW targets, even if the MinGW linker otherwise uses an ld.bfd like linker interface. Differential Revision: https://reviews.llvm.org/D105148	2021-06-30 22:54:26 +03:00
Fangrui Song	814dffa4b7	[llvm-objcopy][MachO] Support LC_LINKER_OPTIMIZATION_HINT load command The load command is currently specific to arm64 and holds information for instruction rewriting, e.g. converting a GOT load to an ADR to compute a local address. (On ELF the information is usually conveyed by relocations, e.g. R_X86_64_REX_GOTPCRELX, R_PPC64_TOC16_HA) Reviewed By: alexander-shaposhnikov Differential Revision: https://reviews.llvm.org/D104968	2021-06-29 18:47:55 -07:00
Alexander Shaposhnikov	6d72845a85	[llvm-objcopy][MachO] Code cleanup 1. Remove unnecessary templates. 2. Fix potentially unaligned reads inside constructSection. Test plan: make check-all Differential revision: https://reviews.llvm.org/D105089	2021-06-29 14:08:23 -07:00
Fangrui Song	d4dcb55c70	[llvm-readobj] Make -s and -t match llvm-readelf llvm-readobj is an internal testing tool for binary formats. Its output and command line options do not need to be stable. It isn't supposed to be part of a build process. llvm-readelf was created as a user-facing utility and its interface intends to be compatible with GNU readelf (unless there are good reasons not to). The two tools have mostly compatible options. -s and -t are noticeable exceptions due to history. I think the cost of keeping the inconsistency overweighs the little history-compatible benefit and hinders transition from cl::opt to OptTable, so let's change it. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D105055	2021-06-29 11:56:26 -07:00
Fangrui Song	69937a8080	[llvm-objcopy][MachO] Support ARM64_RELOC_ADDEND An ARM64_RELOC_ADDEND relocation reuses the symbol field for the addend value. We should pass through such relocations. Reviewed By: alexander-shaposhnikov Differential Revision: https://reviews.llvm.org/D104967	2021-06-29 11:23:30 -07:00
Alexander Shaposhnikov	6229369e50	Revert "[llvm-objcopy][MachO] Minor code cleanup" This reverts commit `c94cf97b53` since it appears to have broken linaro-clang-armv7-quick build bot and needs further investigation.	2021-06-29 01:18:48 -07:00
Alexander Shaposhnikov	c94cf97b53	[llvm-objcopy][MachO] Minor code cleanup Remove unnecessary template in MachOReader.cpp. NFC.	2021-06-28 22:51:02 -07:00
Igor Kudrin	d25e572421	[llvm-objdump] Print memory operand addresses as regular comments The patch reuses the common code to print memory operand addresses as instruction comments. This helps to align the comments and enables using target-specific comment markers when `evaluateMemoryOperandAddress()` is implemented for them. Differential Revision: https://reviews.llvm.org/D104861	2021-06-28 14:25:22 +07:00
Igor Kudrin	e7fffa6f03	[llvm-objdump] Prefix memory operand addresses with '0x' This helps to avoid ambiguity when the address contains only digits 0..9. Differential Revision: https://reviews.llvm.org/D104909	2021-06-28 14:25:21 +07:00
Igor Kudrin	c2e6bcb494	[llvm-objdump] Prevent variable locations to overlap short comments For now, the source variable locations are printed at about the same space as the comments for disassembled code, which can make some ranges for variables disappear if a line contains comments, for example: ┠─ bar = W1 0: add x0, x2, #2, lsl #12 // =8192┃ 4: add z31.d, z31.d, #65280 // =0xff00 8: nop ┻ The patch shifts the report a bit to allow printing comments up to approximately 16 characters without interferences. Differential Revision: https://reviews.llvm.org/D104700	2021-06-28 14:25:21 +07:00
Igor Kudrin	abe0fa4352	[llvm-objdump] Print comments for the disassembled code LLVM disassembler can generate comments for disassembled instructions. The patch enables printing these comments for 'llvm-objdump -d'. Differential Revision: https://reviews.llvm.org/D104699	2021-06-28 14:25:20 +07:00
Martin Storsjö	bdb03557c0	[llvm-rc] Don't rewrite the arch in the default triple unless necessary When the default target arch isn't one that is supported as a windows target, we want to set a suitable architecture (so that Clang tests that run plain 'llvm-rc' succeed checks for e.g. "#ifdef _WIN32" even for llvm builds that default to e.g. ppc64). But if the default target architecture is usable, don't rewrite it. (Rewriting it, by e.g. "T.setArch(T.getArch())", normalizes the spelling of the architecture, e.g. changing i686 to i386. Such a change can make clang unable to find the right sysroot.) This can't, unfortunately, practically be tested very well because it is entirely dependent on the default triple of the llvm build. Differential Revision: https://reviews.llvm.org/D104589	2021-06-25 22:59:09 +03:00
Fangrui Song	ca3bdb57fa	[MC][ELF] Change SHT_LLVM_CALL_GRAPH_PROFILE relocations from SHT_RELA to SHT_REL ... even on targets preferring RELA. The section is only consumed by ld.lld which can handle REL. Follow-up to D104080 as I explained in the review. There are two advantages: * The D104080 code only handles RELA, so arm/i386/mips32 etc may warn for -fprofile-use=/-fprofile-sample-use= usage. * Decrease object file size for RELA targets While here, change the relocation to relocate weights, instead of 0,1,2,3,.. I failed to catch the issue during review.	2021-06-24 21:35:48 -07:00
Fangrui Song	f1e2d5851b	[OptTable] Rename PrintHelp to printHelp To be consistent with other member functions and match the coding standard.	2021-06-24 14:47:03 -07:00
Martin Storsjö	42f74e8249	[llvm] Rename StringRef _lower() method calls to _insensitive() This is a mechanical change. This actually also renames the similarly named methods in the SmallString class, however these methods don't seem to be used outside of the llvm subproject, so this doesn't break building of the rest of the monorepo.	2021-06-25 00:22:01 +03:00
Arthur Eubanks	4c8174f54b	[OpaquePtr] Introduce option to force all pointers to be opaque pointers We don't want to start updating tests to use opaque pointers until we're close to the opaque pointer transition. However, before the transition we want to run tests as if pointers are opaque pointers to see if there are any crashes. At some point when we have a flag to only create opaque pointers in the bitcode and textual IR readers, and when we have fixed all places that try to read a pointee type, this flag will be useless. However, until then, this can help us find issues more easily. Since the cl::opt is read into LLVMContext, we need to make sure LLVMContext is created after cl::ParseCommandLineOptions(). Previously ValueEnumerator would visit the value types of global values via the pointer type, but with opaque pointers we have to manually visit the value type. Reviewed By: nikic, dexonsmith Differential Revision: https://reviews.llvm.org/D103503	2021-06-24 13:32:31 -07:00
Aakanksha Patil	3453f3dd46	[AMDGPU] Add gfx1035 target Differential Revision: https://reviews.llvm.org/D104804	2021-06-24 14:32:41 -04:00
Alexander Yermolovich	a224c5199b	[LLD][LLVM] CG Graph profile using relocations Currently when .llvm.call-graph-profile is created by llvm it explicitly encodes the symbol indices. This section is basically a black box for post processing tools. For example, if we run strip -s on the object files the symbol table changes, but indices in that section do not. In non-visible behavior indices point to wrong symbols. The visible behavior indices point outside of Symbol table: "invalid symbol index". This patch changes the format by using R_*_NONE relocations to indicate the from/to symbols. The Frequency (Weight) will still be in the .llvm.call-graph-profile, but symbol information will be in relocation section. In LLD information from both sections is used to reconstruct call graph profile. Relocations themselves will never be applied. With this approach post processing tools that handle relocations correctly work for this section also. Tools can add/remove symbols and as long as they handle relocation sections with this approach information stays correct. Doing a quick experiment with clang-13. The size went up from 107KB to 322KB, aggregate of all the input sections. Size of clang-13 binary is ~118MB. For users of -fprofile-use/-fprofile-sample-use the size of object files will go up slightly, it will not impact final binary size. Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D104080	2021-06-24 09:09:33 -07:00
Jay Foad	beebe5a056	[MCA] Allow unlimited cycles in the timeline view Change --max-timeline-cycles=0 to mean no limit on the number of cycles. Use this in AMDGPU tests to show all instructions in the timeline view instead of having it arbitrarily truncated. Differential Revision: https://reviews.llvm.org/D104846	2021-06-24 12:54:57 +01:00
Bill Wendling	826947080b	[llvm-diff] Explicitly check ConstantStructs for differences A ConstantStruct is renamed when the LLVM context sees a new one. This makes global variable initializers appear different when they aren't. Instead, check the ConstantStruct for equivalence. Differential Revision: https://reviews.llvm.org/D104734	2021-06-23 16:26:34 -07:00
Patrick Holland	70040de32d	[MCA][TimelineView] Fixed a bug that was causing instructions outside of the timeline-max-cycles to still be printed. Differential Revision: https://reviews.llvm.org/D104815	2021-06-23 15:05:49 -07:00
Andrew Litteken	9e73f7c8d2	[IRSim] Adding basic implementation of llvm-sim. This is a similarity visualization tool that accepts a Module and passes it to the IRSimilarityIdentifier. The resulting SimilarityGroups are output in a JSON file. Tests are found in test/tools/llvm-sim and check for the file not found, a bad module, and that the JSON is created correctly. Reviewers: paquette, jroelofs, MaskRay Recommit of: `15645d044b` to fix linking errors and GN build system. Differential Revision: https://reviews.llvm.org/D86974	2021-06-23 14:38:58 -05:00
Cyndy Ishida	fb9f9497dc	[llvm-tapi-diff] Wrap empty string around StringLiteral NFC This prevents invalid implicit conversation which caused buildbot failure.	2021-06-23 11:41:03 -07:00
Cyndy Ishida	56709b8695	[TextAPI] add symbol name prefixes to central location, NFC These prefixes are used for printing the symbols coming from tbd files and they were redundant across locations	2021-06-23 11:21:00 -07:00
Adrian Prantl	7b81fdf984	clang-format llvm-dwarfdump.cpp	2021-06-23 10:44:13 -07:00
Adrian Prantl	072f5180f2	Improve error handling in llvm-dwarfdump. Without this patch we're only showing a generic error message derived from the error code to the end user. rdar://79378794 Differential Revision: https://reviews.llvm.org/D104483	2021-06-23 10:44:13 -07:00
Fangrui Song	011b502ce8	[llvm-objcopy][MachO] Fix namespace style issues	2021-06-23 00:31:52 -07:00
Hongtao Yu	5c8659801a	[CSSPGO][llvm-profgen] Handle return to external transition. In a callback case, a return from internal code, say A, to external runtime can happen. The external runtime can then call back to another internal routine, say B. Making an artificial branch that looks like a return from A to B can confuse the unwinder to treat the instruction before B as the call instruction. Reviewed By: wenlei, wmi Differential Revision: https://reviews.llvm.org/D104546	2021-06-22 16:24:59 -07:00
Bill Wendling	46db43240f	[llvm-diff] Explicitly check ConstantArrays Global initializers may be ConstantArrays. They need to be checked explicitly, because different-yet-still-equivalent type names may be used for each, and/or a GEP instruction may appear in one.	2021-06-22 12:23:38 -07:00
Bill Wendling	ab6002871d	[llvm-diff] Add support for diffing the callbr instruction The only wrinkle is that we can't process the "blockaddress" arguments of the callbr until the blocks have been equated. So we force them to be "unified" before checking. This was left out when the callbr instruction was added. Differential Revision: https://reviews.llvm.org/D104606	2021-06-22 12:23:37 -07:00
Patrick Holland	d03736455c	[MCA] [In-order pipeline] Fix for 0 latency instruction causing assertion to fail. 0 latency instructions now get processed and retired properly within the in-order pipeline. Had to fix a bug within TimelineView.cpp as well that would show up when a 0 latency instruction was the first instruction in the source. Differential Revision: https://reviews.llvm.org/D104675	2021-06-22 10:18:39 -07:00
Fangrui Song	3accff2553	[llvm-objcopy] Fix some namespace style issues https://llvm.org/docs/CodingStandards.html#use-namespace-qualifiers-to-implement-previously-declared-functions Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D104693	2021-06-22 09:19:48 -07:00
Bill Wendling	dd1b121c99	[llvm-diff] Constify APIs so that there aren't conflicts Some APIs work with const variables while others don't. This can cause conflicts when calling one from the other. This is NFC. Differential Revision: https://reviews.llvm.org/D104719	2021-06-22 09:17:04 -07:00
Martin Storsjö	703b0ed8e2	[ADT] Add StringRef consume_front_lower and consume_back_lower These serve as a convenient combination of consume_front/back and startswith_lower/endswith_lower, consistent with other existing case insensitive methods named <operation>_lower. Differential Revision: https://reviews.llvm.org/D104218	2021-06-22 12:38:08 +03:00
Fangrui Song	3f873e9b51	[llvm-objcopy] Internalize some symbols	2021-06-21 23:49:25 -07:00
Fangrui Song	f14e6e4451	[llvm-objcopy] Delete empty namespace. NFC	2021-06-21 23:44:07 -07:00
Rong Xu	8c68eb8306	[SampleFDO] Make FSDiscriminator flag part of function parameters Add a parameter of IsFSDiscriminator to function getBaseDiscriminatorFromDiscriminator(). This function currently checks the internal flag of --enable-fs-discriminator. This is not good because we might change the default value of the internal flag. Note that we have a default parameter. This is just because create_afdo_tool has a call-site to it. I will remove the default parameter in a later patch. Differential Revision: https://reviews.llvm.org/D104584	2021-06-21 14:37:45 -07:00
Langston Barrett	a240358833	[llvm-reduce] Don't delete arguments of intrinsics The argument reduction pass shouldn't remove arguments of intrinsics, because the resulting module is ill-formed, and so inherently uninteresting. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D103129	2021-06-21 12:43:58 -07:00
Fangrui Song	ea23c38d06	[llvm-profdata] Allow omission of -o for --text output This makes it more convenient to get a text format profile. Add an error for printing non-text format output to a terminal for instrumentation profile. (It cannot be portably tested. For sample profile, raw_fd_ostream is hidden deeply so it's inconvenient to add a diagnostic.) Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D104600	2021-06-21 12:01:57 -07:00
Fangrui Song	8ea2a58a2e	[llvm-profdata] Make diagnostics consistent with the (no capitalization, no period) style The format is currently inconsistent. Use the https://llvm.org/docs/CodingStandards.html#error-and-warning-messages style. And add `error:` or `warning:` to CHECK lines wherever appropriate.	2021-06-19 14:54:25 -07:00
Fangrui Song	0f558db742	[llvm-profdata] Delete unneeded empty output filename check	2021-06-19 12:20:45 -07:00
Fangrui Song	59d90fe817	Simplify some typedef struct	2021-06-19 11:36:44 -07:00
Hongtao Yu	bd52495518	[CSSPGO] Undoing the concept of dangling pseudo probe As a follow-up to https://reviews.llvm.org/D104129, I'm cleaning up the danling probe related code in both the compiler and llvm-profgen. I'm seeing a 5% size win for the pseudo_probe section for SPEC2017 and 10% for Ciner. Certain benchmark such as 602.gcc has a 20% size win. No obvious difference seen on build time for SPEC2017 and Cinder. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D104477	2021-06-18 15:14:11 -07:00
Hongtao Yu	fb19aa0c74	[CSSPGO][llvm-profgen] Fix an issue in findDisjointRanges We were using 0 as an indicator of invalid offset when computing disjoint ranges. In reality, 0 can be an valid code offset which stands for the first function in .text section. I'm using UINT64_MAX as an invalid code offset instead. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D104497	2021-06-18 14:38:48 -07:00
Hongtao Yu	8c2c97287e	[CSSPGO][llvm-profgen] Ignore LBR records after interrupt transition If we have seen an inwards transition from external code to internal code, but not a following outwards transition, the inwards transition is likely due to interrupt which is usually unpaired. Ignore current and subsequent entries since they are likely from an unrelated pre-interrupt context. LBR records from different interrupt context are unrelated and they should not be mixed together. Currenlty the OS does this for task-scheduling interrupt but not for all interrupts. Reviewed By: wenlei, wlei Differential Revision: https://reviews.llvm.org/D104276	2021-06-18 12:13:53 -07:00
Hongtao Yu	c60f1d5d98	[CSSPGO] Fix an invalid hash table reference issue in the CS preinliner. We were using a `StringMap` object to store all profiles to be emitted. The object is basically an unordered hash table, therefore updating it in the process of trasvering it may cause issue since the underlying bucket array could change. I'm also moving the `csspgo-preinliner` switch around so that no context tri will be constructed (by the constructor of `CSPreInliner`) when the switch is off. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D104267	2021-06-18 11:54:23 -07:00
Heejin Ahn	1d891d44f3	[WebAssembly] Rename event to tag We recently decided to change 'event' to 'tag', and 'event section' to 'tag section', out of the rationale that the section contains a generalized tag that references a type, which may be used for something other than exceptions, and the name 'event' can be confusing in the web context. See - https://github.com/WebAssembly/exception-handling/issues/159#issuecomment-857910130 - https://github.com/WebAssembly/exception-handling/pull/161 Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D104423	2021-06-17 20:34:19 -07:00
Min-Yih Hsu	c29555342c	[MCA] Anchoring the vtable of CustomBehaviour Put the dtor of mca::CustomBehaviour into the cpp file to avoid undefined vtable when linking libLLVMMCACustomBehaviourAMDGPU as shared library. Differential Revision: https://reviews.llvm.org/D104401	2021-06-16 12:43:58 -07:00
Fangrui Song	d619cf5ac5	[llvm-objcopy][MachO] Copy LC_LINKER_OPTIMIZATION_HINT This fixes `error: unsupported load command (cmd=0x2e)`	2021-06-16 12:09:50 -07:00
Hongtao Yu	cef9b96b01	[CSSPGO] Report zero-count probe in profile instead of dangling probes. Previously dangling samples were represented by INT64_MAX in sample profile while probes never executed were not reported. This was based on an observation that dangling probes were only at a smaller portion than zero-count probes. However, with compiler optimizations, dangling probes end up becoming at large portion of all probes in general and reporting them does not make sense from profile size point of view. This change flips sample reporting by reporting zero-count probes instead. This enabled dangling probe to be represented by none (missing entry in profile). This has a couple benefits: 1. Reducing sample profile size in optimize mode, even when the number of non-executed probes outperform the number of dangling probes, since INT64_MAX takes more space over 0 to encode. 2. Binary size savings. No need to encode dangling probe anymore, since missing probes are treated as dangling in the profile reader. 3. Reducing compiler work to track dangling probes. However, for probes that are real dead and removed, we still need the compiler to identify them so that they can be reported as zero-count, instead of mistreated as dangling probes. 4. Improving counts quality by respecting the counts already collected on the non-dangling copy of a probe. A probe, when duplicated, gets two copies at runtime. If one of them is dangling while the other is not, merging the two probes at profile generation time will cause the real samples collected on the non-dangling one to be discarded. Not reporting the dangling counterpart will keep the real samples. 5. Better readability. 6. Be consistent with non-CS dwarf line number based profile. Zero counts are trusted by the compiler counts inferencer while missing counts will be inferred by the compiler. Note that the current patch does include any work for #3. There will be follow-up changes. For #1, I've seen for a large Facebook service, the text profile is reduced by 7%. For extbinary profile, the size of LBRProfileSection is reduced by 35%. For #4, I have seen general counts quality for SPEC2017 is improved by 10%. Reviewed By: wenlei, wlei, wmi Differential Revision: https://reviews.llvm.org/D104129	2021-06-16 11:45:29 -07:00
Fangrui Song	1de18ad8d7	[llvm-objcopy] Make ihex writer similar to binary writer There is no need to differentiate whether `UseSegments` is true or false. Unifying the cases makes the behavior closer to BinaryWriter. This improves compatibility with objcopy because SHF_ALLOC sections not in a PT_LOAD will not be skipped. Such cases are usually erroneous input, though. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D104186	2021-06-16 10:08:20 -07:00
Patrick Holland	ef16c8eaa5	Reapply "[MCA] Adding the CustomBehaviour class to llvm-mca". The original change was pushed in main as commit `f7a23ecece`. It was then reverted by commit `a04f01bab2` because it caused linker failures on buildbots that don't build the AMDGPU target. -- Some instructions are not defined well enough within the target’s scheduling model for llvm-mca to be able to properly simulate its behaviour. The ideal solution to this situation is to modify the scheduling model, but that’s not always a viable strategy. Maybe other parts of the backend depend on that instruction being modelled the way that it is. Or maybe the instruction is quite complex and it’s difficult to fully capture its behaviour with tablegen. The CustomBehaviour class (which I will refer to as CB frequently) is designed to provide intuitive scaffolding for developers to implement the correct modelling for these instructions. More details are available in the original commit log message (`f7a23ecece`). Differential Revision: https://reviews.llvm.org/D104149	2021-06-16 16:54:48 +01:00
James Henderson	b9ce8ea454	[obj2yaml] Address D104035 review comments Accidentally missed from commit `5c1639fe06`. Differential Revision: https://reviews.llvm.org/D104035	2021-06-16 15:01:54 +01:00
James Henderson	5c1639fe06	[yaml2obj][obj2yaml] Support custom ELF section header string table name This patch adds support for a new field in the FileHeader, which states the name to use for the section header string table. This also allows combining the string table with another string table in the object, e.g. the symbol name string table. The field is optional. By default, .shstrtab will continue to be used. This partially fixes https://bugs.llvm.org/show_bug.cgi?id=50506. Reviewed by: Higuoxing Differential Revision: https://reviews.llvm.org/D104035	2021-06-16 10:02:23 +01:00
Andrea Di Biagio	a04f01bab2	Revert "[MCA] Adding the CustomBehaviour class to llvm-mca" This reverts commit `f7a23ecece`. It appears to breaks buildbots that don't build the AMDGPU backend.	2021-06-15 21:41:36 +01:00
Patrick Holland	f7a23ecece	[MCA] Adding the CustomBehaviour class to llvm-mca Some instructions are not defined well enough within the target’s scheduling model for llvm-mca to be able to properly simulate its behaviour. The ideal solution to this situation is to modify the scheduling model, but that’s not always a viable strategy. Maybe other parts of the backend depend on that instruction being modelled the way that it is. Or maybe the instruction is quite complex and it’s difficult to fully capture its behaviour with tablegen. The CustomBehaviour class (which I will refer to as CB frequently) is designed to provide intuitive scaffolding for developers to implement the correct modelling for these instructions. Implementation details: llvm-mca does its best to extract relevant register, resource, and memory information from every MCInst when lowering them to an mca::Instruction. It then uses this information to detect dependencies and simulate stalls within the pipeline. For some instructions, the information that gets captured within the mca::Instruction is not enough for mca to simulate them properly. In these cases, there are two main possibilities: 1. The instruction has a dependency that isn’t detected by mca. 2. mca is incorrectly enforcing a dependency that shouldn’t exist. For the rest of this discussion, I will be focusing on (1), but I have put some thought into (2) and I may revisit it in the future. So we have an instruction that has dependencies that aren’t picked up by mca. The basic idea for both pipelines in mca is that when an instruction wants to be dispatched, we first check for register hazards and then we check for resource hazards. This is where CB is injected. If no register or resource hazards have been detected, we make a call to CustomBehaviour::checkCustomHazard() to give the target specific CB the chance to detect and enforce any custom dependencies. The return value for checkCustomHazaard() is an unsigned int representing the (minimum) number of cycles that the instruction needs to stall for. It’s fine to underestimate this value because when StallCycles gets down to 0, we’ll end up checking for all the hazards again before the instruction is actually dispatched. However, it’s important not to overestimate the value and the more accurate your estimate is, the more efficient mca’s execution can be. In general, for checkCustomHazard() to be able to detect these custom dependencies, it needs information about the current instruction and also all of the instructions that are still executing within the pipeline. The mca pipeline uses mca::Instruction rather than MCInst and the current information encoded within each mca::Instruction isn’t sufficient for my use cases. I had to add a few extra attributes to the mca::Instruction class and have them get set by the MCInst during instruction building. For example, the current mca::Instruction doesn’t know its opcode, and it also doesn’t know anything about its immediate operands (both of which I had to add to the class). With information about the current instruction, a list of all currently executing instructions, and some target specific objects (MCSubtargetInfo and MCInstrInfo which the base CB class has references to), developers should be able to detect and enforce most custom dependencies within checkCustomHazard. If you need more information than is present in the mca::Instruction, feel free to add attributes to that class and have them set during the lowering sequence from MCInst. Fortunately, in the in-order pipeline, it’s very convenient for us to pass these arguments to checkCustomHazard. The hazard checking is taken care of within InOrderIssueStage::canExecute(). This function takes a const InstRef as a parameter (representing the instruction that currently wants to be dispatched) and the InOrderIssueStage class maintains a SmallVector<InstRef, 4> which holds all of the currently executing instructions. For the out-of-order pipeline, it’s a bit trickier to get the list of executing instructions and this is why I have held off on implementing it myself. This is the main topic I will bring up when I eventually make a post to discuss and ask for feedback. CB is a base class where targets implement their own derived classes. If a target specific CB does not exist (or we pass in the -disable-cb flag), the base class is used. This base class trivially returns 0 from its checkCustomHazard() implementation (meaning that the current instruction needs to stall for 0 cycles aka no hazard is detected). For this reason, targets or users who choose not to use CB shouldn’t see any negative impacts to accuracy or performance (in comparison to pre-patch llvm-mca). Differential Revision: https://reviews.llvm.org/D104149	2021-06-15 21:30:48 +01:00
Simon Pilgrim	941188e965	[llvm-exegesis] Fix X86LbrCounter destructor to correctly unmap memory and not double-close fd (PR50620) As was reported on PR50620, the X86LbrCounter destructor was double-closing the filedescriptor and not unmapping the buffer. Differential Revision: https://reviews.llvm.org/D104201	2021-06-15 14:24:35 +01:00
wlei	863184dd69	[CSSPGO] Aggregation by the last K context frames for cold profiles This change provides the option to merge and aggregate cold context by the last k frames instead of context-less name. By default K = 1 means the context-less one. This is for better perf tuning. The more selective merging and trimming will rely on llvm-profgen's preinliner. Reviewed By: wenlei, hoy Differential Revision: https://reviews.llvm.org/D104131	2021-06-14 10:33:43 -07:00
David Blaikie	02c718301b	llvm-objcopy: fix section size truncation/extension when dumping sections Since this only comes up with inputs containing sections at least 4GB large (I guess I could use a bzero section or something, so the input file doesn't have to be 4GB, but even then the output file would have to be 4GB, right?) I've skipped testing this. If there's a nice way to test this without needing 4GB inputs or output files. The subtlety here is demonstrated by this code: struct t { operator uint64_t(); }; static_assert(std::is_same_v<int, decltype(std::declval<bool>() ? 0 : std::declval<t>())>); static_assert(std::is_same_v<uint64_t, decltype(std::declval<bool>() ? 0 : std::declval<uint64_t>())>); Because of this difference, the original source code was getting an int type (truncating the actual size) and then extending it again, resulting in bogus values (I haven't thought through this hard enough to explain why the resulting value was 0xffff... - sign extension, possible UB, but in any case it's the wrong answer - in this particular case I was looking at that resulted in a size so large that we couldn't open a file large enough to write to and ended up with a rather vague: error: 'file_name.o': Invalid argument	2021-06-12 19:00:10 -07:00
Ian McIntyre	5899278758	[llvm-objcopy] Exclude empty sections in IHexWriter output IHexWriter was evaluating a section's physical address when deciding if that section should be written to an output. This approach does not account for a zero-sized section that has the same physical address as a sized section. The behavior varies from GNU objcopy, and may result in a HEX file that does not include all program sections. The IHexWriter now excludes zero-sized sections when deciding what should be written to the output. This affects the contents of the writer's `Sections` collection; we will not try to insert multiple sections that could have the same physical address. The behavior seems consistent with GNU objcopy, which always excludes empty sections, no matter the address. The new test case evaluates the IHexWriter behavior when provided a variety of empty sections that overlap or append a filled section. See the input file's comments for more information. Given that test input, and the change to the IHexWriter, GNU objcopy and llvm-objcopy produce the same output. Reviewed By: jhenderson, MaskRay, evgeny777 Differential Revision: https://reviews.llvm.org/D101332	2021-06-12 12:23:07 -07:00
Alexander Shaposhnikov	0276cc742b	[llvm-objcopy][MachO] Do not strip symbols with the flag REFERENCED_DYNAMICALLY set Do not strip symbols having the flag REFERENCED_DYNAMICALLY set. Test plan: make check-all Differential revision: https://reviews.llvm.org/D104092	2021-06-11 16:34:59 -07:00
Andrew Litteken	8bc0eb4011	Revert "[IRSim] Adding basic implementation of llvm-sim." This reverts commit `f47d00c54b`.	2021-06-11 15:44:19 -05:00
Andrew Litteken	f47d00c54b	[IRSim] Adding basic implementation of llvm-sim. This is a similarity visualization tool that accepts a Module and passes it to the IRSimilarityIdentifier. The resulting SimilarityGroups are output in a JSON file. Tests are found in test/tools/llvm-sim and check for the file not found, a bad module, and that the JSON is created correctly. Reviewers: paquette, jroelofs, MaskRay Recommit of: `15645d044b` to fix linking errors. Differential Revision: https://reviews.llvm.org/D86974	2021-06-11 14:56:41 -05:00
Simon Pilgrim	61cdaf66fe	[ADT] Remove APInt/APSInt toString() std::string variants <string> is currently the highest impact header in a clang+llvm build: https://commondatastorage.googleapis.com/chromium-browser-clang/llvm-include-analysis.html One of the most common places this is being included is the APInt.h header, which needs it for an old toString() implementation that returns std::string - an inefficient method compared to the SmallString versions that it actually wraps. This patch replaces these APInt/APSInt methods with a pair of llvm::toString() helpers inside StringExtras.h, adjusts users accordingly and removes the <string> from APInt.h - I was hoping that more of these users could be converted to use the SmallString methods, but it appears that most end up creating a std::string anyhow. I avoided trying to use the raw_ostream << operators as well as I didn't want to lose having the integer radix explicit in the code. Differential Revision: https://reviews.llvm.org/D103888	2021-06-11 13:19:15 +01:00
Simon Pilgrim	646e970d44	[llvm-stress] Fix dead code preventing us generating per-element vector selects This has been reported several times by the PVS Studio team as well as coming up in some static analysis. getRandom() % 1 always returns 0 so we never actually test this codepath, (git blame suggests this has always been like this) - given that we have plenty of other "getRandom() & 1" the typo is pretty obvious, and matches the intention in the comment above - with this change we generate a nice mixture of scalar/vector condition selects of vectors. I don't know llvm-stress that well - but I don't think we guarantee that the same seed value will always generate the same IR for later versions of the program - just that the same binary would. Differential Revision: https://reviews.llvm.org/D104022	2021-06-11 10:56:19 +01:00
Simon Pilgrim	d789ed11ea	Fix implicit dependency on <string> header. NFCI.	2021-06-11 10:24:14 +01:00
David Tenty	75d4f55d15	[AIX] Build libLTO as MODULE rather than SHARED On CMake versions greater that >= 3.16 on AIX, shared libraries are created as archives (which is the normal form for the platform). However plugins libraries which are passed directly to a executable, like libLTO to the linker, are usual build as plain `.so`, so this patch restores this behaviour for libLTO on AIX (and adjust the name if need be to account for the fact that llvm_add_library likes to force an empty name prefix on modules), so we end up with the expected libLTO.so Reviewed By: w2yehia Differential Revision: https://reviews.llvm.org/D103824	2021-06-10 12:08:59 -04:00
Sam Powell	5b5ab80e31	Reland "[llvm] llvm-tapi-diff" This is relanding commit `d1d36f7ad2` . This patch additionally addresses failures found in buildbots due to unstable build ordering & post review comments. This patch introduces a new tool, llvm-tapi-diff, that compares and returns the diff of two TBD files. Reviewed By: ributzka, JDevlieghere Differential Revision: https://reviews.llvm.org/D101835	2021-06-09 21:17:34 -07:00
Eric Astor	4b5317e937	[ms] [llvm-ml] Add support for INCLUDE environment variable Also adds support for the ML.exe command-line flag /X, which ignores the INCLUDE environment variable. This relands commit `c43f413b01` using lit's cross-platform `env` support. Differential Revision: https://reviews.llvm.org/D103989	2021-06-09 17:54:40 -04:00
Cyndy Ishida	e7b755ecb1	Revert "Reland "[llvm] llvm-tapi-diff"" This reverts commit `20126c9fd4`. The sorting fixes failed to have stable output on different platforms.	2021-06-09 13:48:09 -07:00
Cyndy Ishida	1899cb7d0e	Revert "[llvm-tapi-diff] Apply stable sorting to output" This reverts commit `90a26a41e9`. This failed to fix ubuntu failures.	2021-06-09 13:48:09 -07:00
Sam Powell	90a26a41e9	[llvm-tapi-diff] Apply stable sorting to output * For the output, the attributes within the target slice should be grouped by the input order, then sorted by value ordering. This is to fix current ubuntu buildbot inconsistences.	2021-06-09 13:09:47 -07:00
Eric Astor	68d0db0b6d	Revert "[ms] [llvm-ml] Add support for INCLUDE environment variable" This reverts commit `c43f413b01` due to Windows environment build breaks	2021-06-09 15:49:51 -04:00
Eric Astor	c43f413b01	[ms] [llvm-ml] Add support for INCLUDE environment variable Also adds support for the ML.exe command-line flag /X, which ignores the INCLUDE environment variable.	2021-06-09 15:25:26 -04:00
Sam Powell	20126c9fd4	Reland "[llvm] llvm-tapi-diff" This is relanding commit `d1d36f7ad2` . This patch additionally addresses failures found in buildbots & post review comments. This patch introduces a new tool, llvm-tapi-diff, that compares and returns the diff of two TBD files. Reviewed By: ributzka, JDevlieghere Differential Revision: https://reviews.llvm.org/D101835	2021-06-09 10:35:41 -07:00
Florian Hahn	e978f6bc97	[LTO] Support new PM in ThinLTOCodeGenerator. This patch adds initial support for using the new pass manager when doing ThinLTO via libLTO. Reviewed By: steven_wu Differential Revision: https://reviews.llvm.org/D102627	2021-06-09 10:05:14 +01:00
Brendon Cahoon	294efbbd3e	Reland "[AMDGPU] Add gfx1013 target" This reverts commit `211e584fa2`. Fixed a use-after-free error that caused the sanitizers to fail.	2021-06-08 21:15:35 -04:00
Brendon Cahoon	211e584fa2	Revert "[AMDGPU] Add gfx1013 target" This reverts commit `ea10a86984`. A sanitizer buildbot reports an error.	2021-06-08 16:29:41 -04:00
Brendon Cahoon	ea10a86984	[AMDGPU] Add gfx1013 target Differential Revision: https://reviews.llvm.org/D103663	2021-06-08 12:49:49 -04:00
David Blaikie	c5d56fec50	NFC: .clang-tidy: Inherit configs from parents to improve maintainability In the interests of disabling misc-no-recursion across LLVM (this seems like a stylistic choice that is not consistent with LLVM's style/development approach) this NFC preliminary change adjusts all the .clang-tidy files to inherit from their parents as much as possible. This change specifically preserves all the quirks of the current configs in order to make it easier to review as NFC. I validatad the change is NFC as follows: for X in `cat ../files.txt`; do mkdir -p ../tmp/$(dirname $X) touch $(dirname $X)/blaikie.cpp clang-tidy -dump-config $(dirname $X)/blaikie.cpp > ../tmp/$(dirname $X)/after rm $(dirname $X)/blaikie.cpp done (similarly for the "before" state, without this patch applied) for X in `cat ../files.txt`; do echo $X diff \ ../tmp/$(dirname $X)/before \ <(cat ../tmp/$(dirname $X)/after \ \| sed -e "s/,readability-identifier-naming$.$,-readability-identifier-naming/\1/" \ \| sed -e "s/,-llvm-include-order$.$,llvm-include-order/\1/" \ \| sed -e "s/,-misc-no-recursion$.$,misc-no-recursion/\1/" \ \| sed -e "s/,-clang-diagnostic-\$.$,clang-diagnostic-\/\1/") done (using sed to strip some add/remove pairs to reduce the diff and make it easier to read) The resulting report is: .clang-tidy clang/.clang-tidy 2c2 < Checks: 'clang-diagnostic-,clang-analyzer-,-,clang-diagnostic-,llvm-,misc-,-misc-unused-parameters,-misc-non-private-member-variables-in-classes,-readability-identifier-naming,-misc-no-recursion' --- > Checks: 'clang-diagnostic-,clang-analyzer-,-,clang-diagnostic-,llvm-,misc-,-misc-unused-parameters,-misc-non-private-member-variables-in-classes,-misc-no-recursion' compiler-rt/.clang-tidy 2c2 < Checks: 'clang-diagnostic-,clang-analyzer-,-,clang-diagnostic-,llvm-,-llvm-header-guard,misc-,-misc-unused-parameters,-misc-non-private-member-variables-in-classes' --- > Checks: 'clang-diagnostic-,clang-analyzer-,-,clang-diagnostic-,llvm-,misc-,-misc-unused-parameters,-misc-non-private-member-variables-in-classes,-llvm-header-guard' flang/.clang-tidy 2c2 < Checks: 'clang-diagnostic-,clang-analyzer-,-,llvm-,-llvm-include-order,misc-,-misc-no-recursion,-misc-unused-parameters,-misc-non-private-member-variables-in-classes' --- > Checks: 'clang-diagnostic-,clang-analyzer-,-,llvm-,misc-,-misc-unused-parameters,-misc-non-private-member-variables-in-classes,-llvm-include-order,-misc-no-recursion' flang/include/flang/Lower/.clang-tidy flang/include/flang/Optimizer/.clang-tidy flang/lib/Lower/.clang-tidy flang/lib/Optimizer/.clang-tidy lld/.clang-tidy lldb/.clang-tidy llvm/tools/split-file/.clang-tidy mlir/.clang-tidy The `clang/.clang-tidy` change is a no-op, disabling an option that was never enabled. The compiler-rt and flang changes are no-op reorderings of the same flags. (side note, the .clang-tidy file in parallel-libs is broken and crashes clang-tidy because it uses "lowerCase" as the style instead of "lower_case" - so I'll deal with that separately) Differential Revision: https://reviews.llvm.org/D103842	2021-06-08 08:25:59 -07:00
jasonliu	8e84311a84	[XCOFF][AIX] Enable tooling support for 64 bit symbol table parsing Add in the ability of parsing symbol table for 64 bit object. Reviewed By: jhenderson, DiggerLin Differential Revision: https://reviews.llvm.org/D85774	2021-06-07 17:24:13 +00:00
Simon Pilgrim	551a697c5c	xray-color-helper.cpp - add missing implicit cmath header dependency. NFCI. Noticed while investigating if we can remove an unnecessary MathExtras.h include from SmallVector.h (necessary for gcc builds but not MSVC)	2021-06-05 21:33:24 +01:00
Simon Pilgrim	6ff62d7e17	xray-color-helper.h - sort includes. NFCI.	2021-06-05 21:33:23 +01:00
Rong Xu	8d581857d7	[SampleFDO] New hierarchical discriminator for FS SampleFDO (llvm-profdata part) This patch was split from https://reviews.llvm.org/D102246 [SampleFDO] New hierarchical discriminator for Flow Sensitive SampleFDO This is for llvm-profdata part of change. It sets the bit masks for the profile reader in llvm-profdata. Also add an internal option "-fs-discriminator-pass" for show and merge command to process the profile offline. This patch also moved setDiscriminatorMaskedBitFrom() to SampleProfileReader::create() to simplify the interface. Differential Revision: https://reviews.llvm.org/D103550	2021-06-04 11:22:06 -07:00
Cyndy Ishida	5337c7550d	Revert "[llvm] llvm-tapi-diff" This reverts commit `d1d36f7ad2`. Reverting this patch to investigate linux bot failures + fix with author offline	2021-06-03 21:10:51 -07:00
Wenlei He	aaa826fac1	[CSSPGO][llvm-profgen] Make extended binary the default output format Make extended binary the default output format for CSSPGO. This avoids having to pass flag every time when generating profile. It also matches llvm-profdata where binary profile is the default (should we switch to extbinary as default for llvm-profdata?). We plan to compress name table for context profile, which depends on the built-in compression of extbinary. Differential Revision: https://reviews.llvm.org/D103650	2021-06-03 17:58:16 -07:00
Sam Powell	d1d36f7ad2	[llvm] llvm-tapi-diff This patch introduces a new tool, llvm-tapi-diff, that compares and returns the diff of two TBD files. Reviewed By: ributzka, JDevlieghere Differential Revision: https://reviews.llvm.org/D101835	2021-06-03 11:38:00 -07:00
Nikita Popov	983565a6fe	[ADT] Move DenseMapInfo for ArrayRef/StringRef into respective headers (NFC) This is a followup to D103422. The DenseMapInfo implementations for ArrayRef and StringRef are moved into the ArrayRef.h and StringRef.h headers, which means that these two headers no longer need to be included by DenseMapInfo.h. This required adding a few additional includes, as many files were relying on various things pulled in by ArrayRef.h. Differential Revision: https://reviews.llvm.org/D103491	2021-06-03 18:34:36 +02:00
Kim-Anh Tran	de51c48ed3	[llvm-dwp] Add support for rnglists and loclists This patch updates llvm-dwp to include rnglists and loclists when parsing debug sections. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D101894	2021-06-02 12:31:35 -07:00
Kim-Anh Tran	316da543af	[llvm-dwp] Add support for DWARFv5 type units ... This patch adds support for DWARFv5 type units: parsing from the .debug_info section, and writing index to the type unit index. Previously, the type units were part of the .debug_types section which is no longer used in DWARFv5. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D101818	2021-06-02 12:24:08 -07:00
Kim-Anh Tran	6e2d3049d2	[llvm-dwp] Adding support for v5 index writing This patch adds general support for DWARFv5 index writing. In particular, this means only allowing inputs with one version, either DWARFv5 or DWARFv4. This patch adds the .debug_macro section as an example, but the DWARFv5 type support and loc and rangelists are still missing (and upcoming). Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D102315	2021-06-02 12:21:31 -07:00
Kim-Anh Tran	595b1683b7	[llvm-dwp] Skip type unit debug info sections This patch makes llvm-dwp skip debug info sections that may not be encoding a compile unit. In DWARF5, debug info sections are also used for type units. As in preparation to support type units, make llvm-dwp aware of other uses of debug info sections but skip them for now. The patch first records all .debug_info sections, then goes through them one by one and records the cu debug info section for writing the index unit, and copies that section to the final dwp output info section. If it's not a compile unit, skip. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D102312	2021-06-02 11:48:10 -07:00
Rahman Lavaee	616ac1b961	[llvm-readobj] Print function names with `--bb-addr-map`. This patch uses the `getSymbolIndexForFunctionAddress` helper function to print function names for BB address map entries. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D102900	2021-06-01 18:40:42 -07:00
gbreynoo	e60f147324	[llvm-dwarfdump][test] Add missing dedicated tests for some options This change adds tests specifically for --parent-recurse-depth, --quiet and -o. The test for -o found a typo in an error message which is also fixed in this change. Differential Revision: https://reviews.llvm.org/D103250	2021-06-01 14:57:00 +01:00
Andrea Di Biagio	9853d0db1e	[MCA][NFCI] Minor changes to InstrBuilder and Instruction. This is based on the assumption that most simulated instructions don't define more than one or two registers. This is true for example on x86, where most instruction definitions don't declare more than one register write. The default code region size has been increased from 8 to 16. This is based on the assumption that, for small microbenchmarks, the typical code snippet size is often less than 16 instructions. mca::Instruction now uses bitfields to pack flags. No functional change intended.	2021-05-31 17:05:13 +01:00
Alexey Lapshin	83cc4478a0	[llvm-objcopy][NFC] Refactor CopyConfig structure - remove lazy options processing. During reviewing D102277 it was decided to remove lazy options processing from llvm-objcopy CopyConfig structure. This patch transforms processing of ELF lazy options into the in-place processing. Differential Revision: https://reviews.llvm.org/D103260	2021-05-31 14:40:27 +03:00
Andrea Di Biagio	50770d8de5	[MCA] Refactor the InOrderIssueStage stage. NFCI Moved the logic that checks for RAW hazards from the InOrderIssueStage to the RegisterFile. Changed how the InOrderIssueStage keeps track of backend stalls. Stall events are now generated from method notifyStallEvent(). No functional change intended.	2021-05-27 22:28:04 +01:00
Simon Giesecke	5f2d4b23b4	Add --quiet option to llvm-gsymutil to suppress output of warnings. Differential Revision: https://reviews.llvm.org/D102829	2021-05-27 12:36:34 +00:00
Esme-Yi	d82f2a123f	[llvm-objdump] Print the DEBUG type under `--section-headers`. Summary: Under the option --section-headers, we can only print the section types of TEXT, DATA, and BSS for now. This patch adds the DEBUG type. Reviewed By: jhenderson, Higuoxing Differential Revision: https://reviews.llvm.org/D102603	2021-05-27 04:53:14 +00:00
Rahman Lavaee	6505c63040	[llvm-readobj] Optimize printing stack sizes to linear time. Currently, each function name lookup is a linear iteration over all symbols defined in the object file which makes the total running time quadratic. This patch optimizes the function name lookup by populating an address to index map upon the first function name lookup which is used to lookup each function name in O(1). impact: For the clang binary built with `-fstack-size-section`, this improves the running time of `llvm-readobj --stack-size` from 7 minutes to 0.25 seconds. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D103072	2021-05-26 13:14:33 -07:00
Fangrui Song	73a1179535	[llvm-mc] Add -M to replace -riscv-no-aliases and -riscv-arch-reg-names In objdump, many targets support `-M no-aliases`. Instead of having a `-*-no-aliases` for each target when LLVM adds the support, it makes more sense to introduce objdump style `-M`. -riscv-arch-reg-names is removed. -riscv-no-aliases has too many uses and thus is retained for now. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D103004	2021-05-26 10:43:32 -07:00
Esme-Yi	bf809cd165	[NFC][object] Change the input parameter of the method isDebugSection. Summary: This is a NFC patch to change the input parameter of the method SectionRef::isDebugSection(), by replacing the StringRef SectionName with DataRefImpl Sec. This allows us to determine if a section is debug type in more ways than just by section name. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D102601	2021-05-26 08:47:53 +00:00
Wenlei He	fa14fd30ce	[CSSPGO][llvm-profgen] Change default cold threshold for context merging llvm-profgen uses profile summary based cold threshold to merge and trim cold context profile. This is to strike a good balance between profile size and performance. We've been using 99.9% as the cutoff to save profile size without affecting performance. This change switch to use 99.9% instead of 99.9999% as default cold threshold cutoff for llvm-profgen. Redundant switch csprof-cold-thres is also removed and tests cleaned up. Differential Revision: https://reviews.llvm.org/D103071	2021-05-25 10:41:10 -07:00
Langston Barrett	472c009139	[llvm-reduce] Exit when input module is malformed The parseInputFile function returns an empty unique_ptr to signal an error, like when the input file doesn't exist, or is malformed. In this case, the tool should exit immediately rather than segfault by dereferencing the unique_ptr later. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D102891	2021-05-25 10:01:12 -07:00
Roman Lebedev	78eaff2ef8	[llvm-exegesis] Loop unrolling for loop snippet repetitor mode I really needed this, like, factually, yesterday, when verifying dependency breaking idioms for AMD Zen 3 scheduler model. Consider the following example: ``` $ ./bin/llvm-exegesis --mode=inverse_throughput --snippets-file=/tmp/snippet.s --num-repetitions=1000000 --repetition-mode=duplicate Check generated assembly with: /usr/bin/objdump -d /tmp/snippet-4a7e50.o --- mode: inverse_throughput key: instructions: - 'VPXORYrr YMM0 YMM0 YMM0' config: '' register_initial_values: [] cpu_name: znver3 llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 1000000 measurements: - { key: inverse_throughput, value: 0.31025, per_snippet_value: 0.31025 } error: '' info: '' assembled_snippet: C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C5FDEFC0C3 ... ``` What does it tell us? So wait, it can only execute ~3 x86 AVX YMM PXOR zero-idioms per cycle? That doesn't seem right. That's even less than there are pipes supporting this type of op. Now, second example: ``` $ ./bin/llvm-exegesis --mode=inverse_throughput --snippets-file=/tmp/snippet.s --num-repetitions=1000000 --repetition-mode=loop Check generated assembly with: /usr/bin/objdump -d /tmp/snippet-2418b5.o --- mode: inverse_throughput key: instructions: - 'VPXORYrr YMM0 YMM0 YMM0' config: '' register_initial_values: [] cpu_name: znver3 llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 1000000 measurements: - { key: inverse_throughput, value: 1.00011, per_snippet_value: 1.00011 } error: '' info: '' assembled_snippet: 49B80800000000000000C5FDEFC0C5FDEFC04983C0FF75F2C3 ... ``` Now that's just worse. Due to the looping, the throughput completely plummeted, and now we can only do a single instruction/cycle!? That's not great. And final example: ``` $ ./bin/llvm-exegesis --mode=inverse_throughput --snippets-file=/tmp/snippet.s --num-repetitions=1000000 --repetition-mode=loop --loop-body-size=1000 Check generated assembly with: /usr/bin/objdump -d /tmp/snippet-c402e2.o --- mode: inverse_throughput key: instructions: - 'VPXORYrr YMM0 YMM0 YMM0' config: '' register_initial_values: [] cpu_name: znver3 llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 1000000 measurements: - { key: inverse_throughput, value: 0.167087, per_snippet_value: 0.167087 } error: '' info: '' assembled_snippet: 49B80800000000000000C5FDEFC0C5FDEFC04983C0FF75F2C3 ... ``` So if we merge the previous two approaches, do duplicate this single-instruction snippet 1000x (loop-body-size/instruction count in snippet), and run a loop with 1000 iterations over that duplicated/unrolled snippet, the measured throughput goes through the roof, up to 5.9 instructions/cycle, which finally tells us that this idiom is zero-cycle! Reviewed By: courbet Differential Revision: https://reviews.llvm.org/D102522	2021-05-25 12:08:27 +03:00
Jonas Devlieghere	1ec03f3de5	[dsymutil] Emit an error when the Mach-O exceeds the 4GB limit. The Mach-O object file format is limited to 4GB because its used of 32-bit offsets in the header. It is possible for dsymutil to (silently) emit an invalid binary. Instead of having consumers deal with this, emit an error instead.	2021-05-24 16:29:06 -07:00
Jonas Devlieghere	7bf7b80b19	[dsymutil] Use EXIT_SUCCESS and EXIT_FAILURE (NFC)	2021-05-24 16:29:05 -07:00
Jonas Devlieghere	aab488ac2a	[dsymutil] Compute the output location once per input file (NFC) Compute the location of the output file just once outside the loop over the different architectures.	2021-05-24 16:29:05 -07:00
Hongtao Yu	00bfde723b	[NFC][CSSPGO]llvm-profge] Fix Build warning dueo to an attrbute usage.	2021-05-24 12:59:02 -07:00
Hongtao Yu	3b51b51877	[CSSPGO][llvm-profgen] Report samples for untrackable frames. Fixing an issue where samples collected for an untrackable frame is not reported. An untrackable frame refers to a frame whose caller is untrackable due to missing debug info or pseudo probe. Though the frame is connected to its parent frame through the frame pointer chain at runtime, the compiler cannot build the connection without debug info or pseudo probe. In such case we just need to report the untrackable frame as the base frame and all of its child frames. With more samples reported I'm seeing this improves the performance of an internal benchmark by 2.5%. Reviewed By: wenlei, wlei Differential Revision: https://reviews.llvm.org/D102961	2021-05-24 12:39:12 -07:00
Philipp Krones	c2f819af73	[MC] Refactor MCObjectFileInfo initialization and allow targets to create MCObjectFileInfo This makes it possible for targets to define their own MCObjectFileInfo. This MCObjectFileInfo is then used to determine things like section alignment. This is a follow up to D101462 and prepares for the RISCV backend defining the text section alignment depending on the enabled extensions. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D101921	2021-05-23 14:15:23 -07:00
Sergey Dmitriev	1fb5278882	[llvm-strip] Add support for '--' for delimiting options from input files This will allow to use llvm-strip with file names that begin with dashes. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D102825	2021-05-20 03:33:51 -07:00
Alexey Lapshin	081c62501e	[llvm-objcopy] Refactor CopyConfig structure. This patch prepares llvm-objcopy to move its implementation into a separate library. To make it possible it is necessary to minimize internal dependencies. Differential Revision: https://reviews.llvm.org/D99055	2021-05-20 13:14:51 +03:00
Simon Giesecke	0ddc75fd08	Add option to llvm-gsymutil to read addresses from stdin. Differential Revision: https://reviews.llvm.org/D102224	2021-05-20 06:10:35 +00:00
Patrick Holland	e5d59db469	[MCA] llvm-mca MCTargetStreamer segfault fix In order to create the code regions for llvm-mca to analyze, llvm-mca creates an AsmCodeRegionGenerator and calls AsmCodeRegionGenerator::parseCodeRegions(). Within this function, both an MCAsmParser and MCTargetAsmParser are created so that MCAsmParser::Run() can be used to create the code regions for us. These parser classes were created for llvm-mc so they are designed to emit code with an MCStreamer and MCTargetStreamer that are expected to be setup and passed into the MCAsmParser constructor. Because llvm-mca doesn’t want to emit any code, an MCStreamerWrapper class gets created instead and passed into the MCAsmParser constructor. This wrapper inherits from MCStreamer and overrides many of the emit methods to just do nothing. The exception is the emitInstruction() method which calls Regions.addInstruction(Inst). This works well and allows llvm-mca to utilize llvm-mc’s MCAsmParser to build our code regions, however there are a few directives which rely on the MCTargetStreamer. llvm-mc assumes that the MCStreamer that gets passed into the MCAsmParser’s constructor has a valid pointer to an MCTargetStreamer. Because llvm-mca doesn’t setup an MCTargetStreamer, when the parser encounters one of those directives, a segfault will occur. In x86, each one of these 7 directives will cause this segfault if they exist in the input assembly to llvm-mca: .cv_fpo_proc .cv_fpo_setframe .cv_fpo_pushreg .cv_fpo_stackalloc .cv_fpo_stackalign .cv_fpo_endprologue .cv_fpo_endproc I haven’t looked at other targets, but I wouldn’t be surprised if some of the other ones also have certain directives which could result in this same segfault. My proposed solution is to simply initialize an MCTargetStreamer after we initialize the MCStreamerWrapper. The MCTargetStreamer requires an ostream object, but we don’t actually want any of these directives to be emitted anywhere, so I use an ostream created with the nulls() function. Since this needs to happen after the MCStreamerWrapper has been initialized, it needs to happen within the AsmCodeRegionGenerator::parseCodeRegions() function. The MCTargetStreamer also needs an MCInstPrinter which is easiest to initialize within the main() function of llvm-mca. So this MCInstPrinter gets constructed within main() then passed into the parseCodeRegions() function as a parameter. (If you feel like it would be appropriate and possible to create the MCInstPrinter within the parseCodeRegions() function, then feel free to modify my solution. That would stop us from having to pass it into the function and would limit its scope / lifetime.) My solution stops the segfault from happening and still passes all of the current (expected) llvm-mca tests. I also added a new test for x86 that checks for this segfault on an input that includes one of the .cv_fpo directives (this test fails without my solution, but passes with it). As far as I can tell, all of the functions that I modified are only called from within llvm-mca so there shouldn’t be any worries about breaking other tools. Differential Revision: https://reviews.llvm.org/D102709	2021-05-19 18:36:10 +01:00
Mariusz Ceier	9383e9c1e6	Fix lld macho standalone build by including llvm/Config/llvm-config.h instead of llvm/Config/config.h lld/MachO/Driver.cpp and lld/MachO/SyntheticSections.cpp include llvm/Config/config.h which doesn't exist when building standalone lld. This patch replaces llvm/Config/config.h include with llvm/Config/llvm-config.h just like it is in lld/ELF/Driver.cpp and HAVE_LIBXAR with LLVM_HAVE_LIXAR and moves LLVM_HAVE_LIBXAR from config.h to llvm-config.h Also it adds LLVM_HAVE_LIBXAR to LLVMConfig.cmake and links liblldMachO2.so with XAR_LIB if LLVM_HAVE_LIBXAR is set. Differential Revision: https://reviews.llvm.org/D102084	2021-05-19 11:15:07 -04:00
Sergey Dmitriev	f24f140290	[llvm-objcopy] Add support for '--' for delimiting options from input/output files This will allow to use llvm-objcopy with file names that begin with dashes. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D102665	2021-05-19 01:56:46 -07:00
Arthur Eubanks	0c509dbc7e	[NewPM] Add options to PrintPassInstrumentation To bring D99599's implementation in line with the existing PrintPassInstrumentation, and to fix a FIXME, add more customizability to PrintPassInstrumentation. Introduce three new options. The first takes over the existing "-debug-pass-manager-verbose" cl::opt. The second and third option are specific to -fdebug-pass-structure. They allow indentation, and also don't print analysis queries. To avoid more golden file tests than necessary, prune down the -fdebug-pass-structure tests. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D102196	2021-05-18 20:59:35 -07:00
Lang Hames	49cdd62db5	[llvm-jitlink] Link libnetwork on Haiku in llvm-jitlink The system's network API is in libnetwork.so, so we explicitly need to link to them on Haiku. This patch is similar to https://reviews.llvm.org/D97633. Patch by Niels Reedijk. Thanks Niels! Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D98405	2021-05-14 20:49:03 -07:00
Fangrui Song	4f05f4c8e6	[CMake][ELF] Link libLLVM.so and libclang-cpp.so with -Bsymbolic-functions llvm-dev message: https://lists.llvm.org/pipermail/llvm-dev/2021-May/150465.html In an ELF shared object, a default visibility defined symbol is preemptible by default. This creates some missed optimization opportunities. -Bsymbolic-functions is more aggressive than our current -fvisibility-inlines-hidden (present since 2012) as it applies to all function definitions. It can * avoid PLT for cross-TU function calls && reduce dynamic symbol lookup * reduce dynamic symbol lookup for taking function addresses and optimize out GOT/TOC on x86-64/ppc64 In a -DLLVM_TARGETS_TO_BUILD=X86 build, the number of JUMP_SLOT decreases from 12716 to 1628, and the number of GLOB_DAT decreases from 1918 to 1313 The built clang with `-DLLVM_LINK_LLVM_DYLIB=on -DCLANG_LINK_CLANG_DYLIB=on` is significantly faster. See the Linux kernel build result https://bugs.archlinux.org/task/70697 Note: the performance of -fno-semantic-interposition -Bsymbolic-functions libLLVM.so and libclang-cpp.so is close to a PIE binary linking against `libLLVM.a` and `libclang.a`. When the host compiler is Clang, -Bsymbolic-functions is the major contributor. On x86-64 (with GOTPCRELX) and ppc64 ELFv2, the GOT/TOC relocations can be optimized. Some implication: Interposing a subset of functions is no longer supported. (This is fragile on ELF and unsupported on Mach-O at all. For Mach-O we don't use `ld -interpose` or `-flat_namespace`) Compiling a program which takes the address of any LLVM function with `{gcc,clang} -fno-pic` and expects the address to equal to the address taken from libLLVM.so or libclang-cpp.so is unsupported. I am fairly confident that llvm-project shouldn't have different behaviors depending on such pointer equality (as we've been using -fvisibility-inlines-hidden which applies to inline functions for a long time), but if we accidentally do, users should be aware that they should not make assumption on pointer equality in `-fno-pic` mode. See more on https://maskray.me/blog/2021-05-09-fno-semantic-interposition Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D102090	2021-05-13 13:44:57 -07:00
Martin Storsjö	b42fb6811e	[llvm-nm] Support the -V option, print that the tool is compatible with GNU nm This unlocks some codepaths in libtool. Differential Revision: https://reviews.llvm.org/D102321	2021-05-13 22:36:25 +03:00
Aakanksha Patil	464e4dc50f	[AMDGPU] Add gfx1034 target Differential Revision: https://reviews.llvm.org/D102306	2021-05-13 14:25:18 -04:00

... 4 5 6 7 8 ...

13279 Commits