llvm-project

Commit Graph

Author	SHA1	Message	Date
Kirill Bobyrev	eefd620a25	[llvm] NFC: Cleanup llvm-yaml-numeric-parser-fuzzer * Use static variables instead of non-trivially destructible global ones. * Remove unused header. Differential Revision: https://reviews.llvm.org/D91600	2021-02-15 14:52:53 +01:00
Florian Hahn	c70737ba1d	Recommit "[LTO] Use lto::backend for code generation." This version of the patch includes a fix for the cfi failures. (undoes the revert commit `7db390cc77`) It also undoes reverts of follow-up patches that also needed reverting originally: * [LTO] Add option enable NewPM with LTOCodeGenerator. (undoes revert commit `0a17664b47`) * [LTOCodeGenerator] Use lto::Config for options (NFC)." (undoes revert commit `b0a8e41cff`)	2021-02-15 10:05:42 +00:00
Kazu Hirata	910e2d1e57	[llvm] Use llvm::is_contained (NFC)	2021-02-14 08:36:20 -08:00
Jian Cai	c2a84771bb	[llvm-objcopy] preserve file ownership when overwritten by root As of binutils 2.36, GNU strip calls chown(2) for "sudo strip foo" and "sudo strip foo -o foo", but no "sudo strip foo -o bar" or "sudo strip foo -o ./foo". In other words, while "sudo strip foo -o bar" creates a new file bar with root access, "sudo strip foo" will keep the owner and group of foo unchanged. Currently llvm-objcopy and llvm-strip behave differently, always changing the owner and gropu to root. The discrepancy prevents Chrome OS from migrating to llvm-objcopy and llvm-strip as they change file ownership and cause intended users/groups to lose access when invoked by sudo with the following sequence (recommended in man page of GNU strip). 1.<Link the executable as normal.> 1.<Copy "foo" to "foo.full"> 1.<Run "strip --strip-debug foo"> 1.<Run "objcopy --add-gnu-debuglink=foo.full foo"> This patch makes llvm-objcopy and llvm-strip follow GNU's behavior. Link: crbug.com/1108880	2021-02-12 18:01:43 -08:00
wlei	afd8bd601e	[CSSPGO][llvm-profgen] Filter out the instructions without location info for symbolizer It appears some instructions doesn't have the debug location info and the symbolizer will return an empty call stack for them which will cause some crash later in profile unwinding. Actually we do not record the sample info for them, so this change just filter out those instruction. As those instruction would appears at the begin and end of the instruction list, without them we need to add the boundary check for IP `advance` and `backward`. Also for pseudo probe based profile, we actually don't need the symbolized location info, so here just change to use an empty stack for it. This could save half of the binary loading time. Differential Revision: https://reviews.llvm.org/D96434	2021-02-12 16:47:49 -08:00
James Y Knight	8bd8534aa3	LLVM-C: Allow LLVM{Get/Set}Alignment on an atomicrmw/cmpxchg instruction. (Now that these can have alignment specified.)	2021-02-12 18:31:18 -05:00
wlei	426e326a19	[CSSPGO][llvm-profgen] Renovate perfscript check and command line input validation This include some changes related with PerfReader's the input check and command line change: 1) It appears there might be thousands of leading MMAP-Event line in the perfscript for large workload. For this case, the 4k threshold is not eligible to determine it's a hybrid sample. This change renovated the `isHybridPerfScript` by going through the script without threshold limitation checking whether there is a non-empty call stack immediately followed by a LBR sample. It will stop once it find a valid one. 2) Added several input validations for the command line switches in PerfReader. 3) Changed the command line `show-disassembly` to `show-disassembly-only`, it will print to stdout and exit early which leave an empty output profile. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D96387	2021-02-12 15:18:50 -08:00
Lukas Sommer	6577cef9b0	[CodeGen] New pass: Replace vector intrinsics with call to vector library This patch adds a pass to replace calls to vector intrinsics (i.e., LLVM intrinsics operating on vector operands) with calls to a vector library. Currently, calls to LLVM intrinsics are only replaced with calls to vector libraries when scalar calls to intrinsics are vectorized by the Loop- or SLP-Vectorizer. With this pass, it is now possible to replace calls to LLVM intrinsics already operating on vector operands, e.g., if such code was generated by MLIR. For the replacement, information from the TargetLibraryInfo, e.g., as specified via -vector-library is used. This is a re-try of the original commit `2303e93e66` that was reverted due to pass manager problems. Other minor changes have also been made. Differential Revision: https://reviews.llvm.org/D95373	2021-02-12 12:53:27 -05:00
Hongtao Yu	0b1914e83a	[ThinLTO][gold] Fix filenaming scheme for tasks. The gold LTO plugin uses a set of hooks to implements emit-llvm and capture intermediate file generated during LTO. The hooks are called by each lto backend thread with a taskID as argument to differentiate between threads and tasks. Currently, all threads are overwriting the same file which results into only the intermediate output of the last backend thread to be preserved. This diff encodes the taskID into the filename. Reviewed By: tejohnson, wenlei Differential Revision: https://reviews.llvm.org/D96173	2021-02-12 09:40:08 -08:00
Abhina Sreeskantharajan	fdb640ea30	Mark output as text if it is really text This is a continuation of https://reviews.llvm.org/D67696. The following places need to set the OF_Text flag correctly. Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D96363	2021-02-12 07:14:21 -05:00
Maxim Kuvyrkov	06f53f2f09	Fix exegesis build on aarch64-windows-msvc host Include x86 intrinsics only when compiling for x86_64 or i386. _MSC_VER no longer implies x86. Reviewed By: gchatelet Differential Revision: https://reviews.llvm.org/D96498	2021-02-12 09:50:22 +00:00
wlei	c3aeabaea1	[CSSPGO][llvm-profgen] Add brackets for context id to support extended binary format To align with https://reviews.llvm.org/D95547, we need to add brackets for context id before initializing the `SampleContext`. Also added test cases for extended binary format from llvm-profgen side. Differential Revision: https://reviews.llvm.org/D95929	2021-02-12 01:14:53 -08:00
Arthur Eubanks	cee9869c4e	[opt] Add helpful alternatives for -analyze under new PM Reviewed By: reames Differential Revision: https://reviews.llvm.org/D96449	2021-02-10 14:09:17 -08:00
Jameson Nash	a7db680183	Renovate CMake files in the `llvm-exegesis` tool. This attempts to move all tools over to using `add_llvm_library` for better consistency. After doing this, I noticed it ended up as nearly a reimplementation of https://reviews.llvm.org/rL342148, which later got reverted in r342336 (`b09a8c9bd9`). With ccache and ninja on a large core machine (40), I haven't run into build errors, so I'm hopeful it's better now, though it doesn't seem to be any different / new. Reviewed By: stephenneuendorffer Differential Revision: https://reviews.llvm.org/D90970	2021-02-10 14:22:55 -05:00
Arthur Eubanks	5d960cba34	[opt][NewPM] Add a --print-passes flag to print all available passes It seems nicer to list passes given a flag rather than displaying all passes in opt --help. This is awkwardly structured because a PassBuilder is required, but reusing the PassBuilder in runPassPipeline() doesn't work because we read the input IR before getting to runPassPipeline(). So printing the list of passes needs to happen before reading the input IR. If we remove the legacy PM code in main() and move everything from NewPMDriver.cpp into opt.cpp, we can create the PassBuilder before reading IR and check if we should print the list of passes and exit. But until then this hack seems fine. Compared to the legacy PM, the new PM passes are lacking descriptions. We'll need to figure out a way to add descriptions if we think this is important. Also, this only works for passes specified in PassRegistry.def. If we want to print other custom registered passes, we'll need a different mechanism. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D96101	2021-02-10 11:22:12 -08:00
Arthur Eubanks	c2c977ce50	Specify that some flags are legacy PM-specific Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D96100	2021-02-10 10:53:04 -08:00
Fangrui Song	4f30a3d3d2	[llvm-cfi-verify] Set UseSymbolTable to false parseSectionContents expects to skip regions not described by DWARF. With my pending DebugInfo/Symbolize change, the filename can be recovered and there will be more IndirectInstructions entries.	2021-02-10 09:44:13 -08:00
Todd Lipcon	747c450e6f	Fix JSON formatting when converting to trace event format Reviewed By: dberris Differential Revision: https://reviews.llvm.org/D96384	2021-02-10 13:00:28 +11:00
Alex Richardson	7dc3136033	[llvm-readobj] Add support for decoding FreeBSD ELF notes The current support only printed coredump notes, but most binaries also contain notes. This change adds names for four FreeBSD-specific notes and pretty-prints three of them: NT_FREEBSD_ABI_TAG: This note holds a 32-bit (decimal) integer containing the value of the __FreeBSD_version macro, which is defined in crt1.o and will hold a value such as 1300076 for a binary build on a FreeBSD 13 system. NT_FREEBSD_ARCH_TAG: A string containing the value of the build-time MACHINE_ARCH NT_FREEBSD_FEATURE_CTL: A 32-bit flag that indicates to the kernel that the binary wants certain bevahiour. Examples include setting NT_FREEBSD_FCTL_ASLR_DISABLE which tells the kernel to disable ASLR. After this change llvm-readobj also no longer decodes coredump-only FreeBSD notes in non-coredump files. I've also converted the note-freebsd.s test to use yaml2obj instead of llvm-mc. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D74393	2021-02-09 16:59:22 +00:00
Alex Richardson	135df21248	[llvm-readelf] Print raw ELF note contents if we can't parse it Currently, if the note name is known, but the value isn't we don't print the contents. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D74367	2021-02-09 16:59:22 +00:00
Alex Richardson	d613d8eb0e	[yaml2obj] Handle NT_* string values in for ELF note types This is required for D74393. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D95953	2021-02-09 16:59:22 +00:00
Alex Richardson	f4670fbfff	[llvm-readobj] Print empty line between note sections in GNU mode This matches GNU binutils. Reviewed By: rupprecht, jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D96010	2021-02-09 16:59:21 +00:00
Jameson Nash	10c1d290d9	Revert "Renovate CMake files in the `llvm-exegesis` tool." This reverts commit `549a1e2e59`. I see some buildbot failures, so reverting while I look into them.	2021-02-08 19:12:08 -05:00
Jameson Nash	16e7973c5d	Renovate CMake file for the `llvm-cfi-verify` tool Hopefully this is the non-problematic part from https://reviews.llvm.org/rL342148, which later got reverted in r342336 (`b09a8c9bd9`) due to problems with the llvm-exegesis part of the change. That part would also still be desirable, but currently appears not to be possible (https://reviews.llvm.org/D81922). I think this should replace https://reviews.llvm.org/D44650, per Keno's comment there. Reviewed By: hctim Differential Revision: https://reviews.llvm.org/D90969	2021-02-08 18:20:38 -05:00
Jameson Nash	549a1e2e59	Renovate CMake files in the `llvm-exegesis` tool. This attempts to move all tools over to using `add_llvm_library` for better consistency. After doing this, I noticed it ended up as nearly a reimplementation of https://reviews.llvm.org/rL342148, which later got reverted in r342336 (`b09a8c9bd9`). With ccache and ninja on a large core machine (40), I haven't run into build errors, so I'm hopeful it's better now, though it doesn't seem to be any different / new. Reviewed By: stephenneuendorffer Differential Revision: https://reviews.llvm.org/D90970	2021-02-08 18:06:07 -05:00
Sanjay Patel	c981f6f8e1	Revert "[Codegen][ReplaceWithVecLib] add pass to replace vector intrinsics with calls to vector library" This reverts commit `2303e93e66`. Investigating bot failures.	2021-02-05 15:10:11 -05:00
Lukas Sommer	2303e93e66	[Codegen][ReplaceWithVecLib] add pass to replace vector intrinsics with calls to vector library This patch adds a pass to replace calls to vector intrinsics (i.e., LLVM intrinsics operating on vector operands) with calls to a vector library. Currently, calls to LLVM intrinsics are only replaced with calls to vector libraries when scalar calls to intrinsics are vectorized by the Loop- or SLP-Vectorizer. With this pass, it is now possible to replace calls to LLVM intrinsics already operating on vector operands, e.g., if such code was generated by MLIR. For the replacement, information from the TargetLibraryInfo, e.g., as specified via -vector-library is used. Differential Revision: https://reviews.llvm.org/D95373	2021-02-05 14:25:19 -05:00
James Henderson	b0f4ffbfaa	[llvm-objdump] Fix missing first line of license in header file	2021-02-05 08:45:50 +00:00
Dan Gohman	698c6b0a09	[WebAssembly] Support single-floating-point immediate value As mentioned in TODO comment, casting double to float causes NaNs to change bits. To avoid the change, this patch adds support for single-floating-point immediate value on MachineCode. Patch by Yuta Saito. Differential Revision: https://reviews.llvm.org/D77384	2021-02-04 18:05:06 -08:00
wlei	dd9e219014	[CSSPGO][llvm-profgen] Fix bug with parsing hybrid sample trace line when we skip the call stack starting with an external address, we should also skip the bottom LBR entry, otherwise it will cause a truncated context issue. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D95480	2021-02-04 16:15:05 -08:00
wlei	e10b73f646	[CSSPGO][llvm-profgen] Merge and trim profile for cold context to reduce profile size This change allows merging and trimming cold context profile in llvm-profgen to solve profile size bloat problem. Currently when the profile's total sample is below threshold(supported by a switch), it will be considered cold and merged into a base context-less profile, which will at least keep the profile quality as good as the baseline(non-cs). For example, two input profiles: [main @ foo @ bar]:60 [main @ bar]:50 Under threshold = 100, the two profiles will be merge into one with the base context, get result: [bar]:110 Added two switches: `--csprof-cold-thres=<value>`: Specified the total samples threshold for a context profile to be considered cold, with 100 being the default. Any cold context profiles will be merged into context-less base profile by default. `--csprof-keep-cold`: Force profile generation to keep cold context profiles instead of dropping them. By default, any cold context will not be written to output profile. Results: Though not yet evaluating it with the latest CSSPGO, our internal branch shows neutral on performance but significantly reduce the profile size. Detailed evaluation on llvm-profgen with CSSPGO will come later. Differential Revision: https://reviews.llvm.org/D94111	2021-02-04 11:05:03 -08:00
Peng Guo	91e7a17133	[NFC][llvm-mca] Fix compiler warning Fix clang compiler warning from `-Wrange-loop-analysis`. Reviewed By: andreadb Differential Revision: https://reviews.llvm.org/D95997	2021-02-04 09:44:36 -08:00
Fangrui Song	eecbb1c776	[llvm-objdump] --source: drop the warning when there is no debug info Warnings have been added for three cases (PR41905): (1) missing debug info, (2) the source file cannot be found, (3) the debug info points at a line beyond the end of the file. (1) is probably less useful. This was brought up once on http://lists.llvm.org/pipermail/llvm-dev/2020-April/141264.html and two internal users mentioned it to me that it was annoying. (I personally find the warning confusing, too.) Users specify --source to get additional information if sources happen to be available. If sources are not available, it should be obvious as the output will have no interleaved source lines. The warning can be especially annoying when using llvm-objdump -S on a bunch of files. This patch drops the warning when there is no debug info. (If LLVMSymbolizer::symbolizeCode returns an `Error`, there will still be an error. There is currently no test for an `Error` return value. The only code path is probably a broken symbol table, but we probably already emit a warning in that case) `source-interleave-prefix.test` has an inappropriate "malformed" test - the test simply has no .debug_* because new llc does not produce debug info when the filename is empty (invalid). I have tried tampering the header of .debug_info/.debug_line but llvm-symbolizer does not warn. This patch does not intend to add the missing test coverage. Differential Revision: https://reviews.llvm.org/D88715	2021-02-04 09:07:44 -08:00
wlei	3869309a0c	[CSSPGO][llvm-profgen] Aggregate samples on call frame trie to speed up profile generation For CS profile generation, the process of call stack unwinding is time-consuming since for each LBR entry we need linear time to generate the context( hash, compression, string concatenation). This change speeds up this by grouping all the call frame within one LBR sample into a trie and aggregating the result(sample counter) on it, deferring the context compression and string generation to the end of unwinding. Specifically, it uses `StackLeaf` as the top frame on the stack and manipulates(pop or push a trie node) it dynamically during virtual unwinding so that the raw sample can just be recoded on the leaf node, the path(root to leaf) will represent its calling context. In the end, it traverses the trie and generates the context on the fly. Results: Our internal branch shows about 5X speed-up on some large workloads in SPEC06 benchmark. Differential Revision: https://reviews.llvm.org/D94110	2021-02-04 08:43:21 -08:00
wlei	ac14bb14e7	[CSSPGO][llvm-profgen] Compress recursive cycles in calling context This change compresses the context string by removing cycles due to recursive function for CS profile generation. Removing recursion cycles is a way to normalize the calling context which will be better for the sample aggregation and also make the context promoting deterministic. Specifically for implementation, we recognize adjacent repeated frames as cycles and deduplicated them through multiple round of iteration. For example: Considering a input context string stack: [“a”, “a”, “b”, “c”, “a”, “b”, “c”, “b”, “c”, “d”] For first iteration,, it removed all adjacent repeated frames of size 1: [“a”, “b”, “c”, “a”, “b”, “c”, “b”, “c”, “d”] For second iteration, it removed all adjacent repeated frames of size 2: [“a”, “b”, “c”, “a”, “b”, “c”, “d”] So in the end, we get compressed output: [“a”, “b”, “c”, “d”] Compression will be called in two place: one for sample's context key right after unwinding, one is for the eventual context string id in the ProfileGenerator. Added a switch `compress-recursion` to control the size of duplicated frames, default -1 means no size limit. Added unit tests and regression test for this. Differential Revision: https://reviews.llvm.org/D93556	2021-02-03 22:16:07 -08:00
wlei	6bccdcdb35	Revert "[CSSPGO][llvm-profgen] Compress recursive cycles in calling context" This reverts commit `0609f257dc`.	2021-02-03 22:16:05 -08:00
wlei	08e8bb60cf	Revert "[CSSPGO][llvm-profgen] Aggregate samples on call frame trie to speed up profile generation" This reverts commit `1714ad2336`.	2021-02-03 22:16:05 -08:00
wlei	1714ad2336	[CSSPGO][llvm-profgen] Aggregate samples on call frame trie to speed up profile generation For CS profile generation, the process of call stack unwinding is time-consuming since for each LBR entry we need linear time to generate the context( hash, compression, string concatenation). This change speeds up this by grouping all the call frame within one LBR sample into a trie and aggregating the result(sample counter) on it, deferring the context compression and string generation to the end of unwinding. Specifically, it uses `StackLeaf` as the top frame on the stack and manipulates(pop or push a trie node) it dynamically during virtual unwinding so that the raw sample can just be recoded on the leaf node, the path(root to leaf) will represent its calling context. In the end, it traverses the trie and generates the context on the fly. Results: Our internal branch shows about 5X speed-up on some large workloads in SPEC06 benchmark. Differential Revision: https://reviews.llvm.org/D94110	2021-02-03 18:50:14 -08:00
wlei	0609f257dc	[CSSPGO][llvm-profgen] Compress recursive cycles in calling context This change compresses the context string by removing cycles due to recursive function for CS profile generation. Removing recursion cycles is a way to normalize the calling context which will be better for the sample aggregation and also make the context promoting deterministic. Specifically for implementation, we recognize adjacent repeated frames as cycles and deduplicated them through multiple round of iteration. For example: Considering a input context string stack: [“a”, “a”, “b”, “c”, “a”, “b”, “c”, “b”, “c”, “d”] For first iteration,, it removed all adjacent repeated frames of size 1: [“a”, “b”, “c”, “a”, “b”, “c”, “b”, “c”, “d”] For second iteration, it removed all adjacent repeated frames of size 2: [“a”, “b”, “c”, “a”, “b”, “c”, “d”] So in the end, we get compressed output: [“a”, “b”, “c”, “d”] Compression will be called in two place: one for sample's context key right after unwinding, one is for the eventual context string id in the ProfileGenerator. Added a switch `compress-recursion` to control the size of duplicated frames, default -1 means no size limit. Added unit tests and regression test for this. Differential Revision: https://reviews.llvm.org/D93556	2021-02-03 18:50:14 -08:00
wlei	c82b24f475	[CSSPGO][llvm-profgen] Pseudo probe based CS profile generation This change implements profile generation infra for pseudo probe in llvm-profgen. During virtual unwinding, the raw profile is extracted into range counter and branch counter and aggregated to sample counter map indexed by the call stack context. This change introduces the last step and produces the eventual profile. Specifically, the body of function sample is recorded by going through each probe among the range and callsite target sample is recorded by extracting the callsite probe from branch's source. Please refer https://groups.google.com/g/llvm-dev/c/1p1rdYbL93s and https://reviews.llvm.org/D89707 for more context about CSSPGO and llvm-profgen. Implementation - Extended `PseudoProbeProfileGenerator` for pseudo probe based profile generation. - `populateBodySamplesWithProbes` reading range counter is responsible for recording function body samples and inferring caller's body samples. - `populateBoundarySamplesWithProbes` reading branch counter is responsible for recording call site target samples. - Each sample is recorded with its calling context(named `ContextId`). Remind that the probe based context key doesn't include the leaf frame probe info, so the `ContextId` string is created from two part: one from the probe stack strings' concatenation and other one from the leaf frame probe. - Added regression test Test Plan: ninja & ninja check-llvm Differential Revision: https://reviews.llvm.org/D92998	2021-02-03 16:21:53 -08:00
Florian Hahn	7db390cc77	Revert "[LTO] Use lto::backend for code generation." This reverts commit `6a59f05606`, because it is causing failures on green dragon.	2021-02-03 22:49:30 +00:00
Florian Hahn	0a17664b47	Revert "[LTO] Add option enable NewPM with LTOCodeGenerator." This reverts commit `7a6a2cc81a` because it is causing failures on green dragon.	2021-02-03 22:49:20 +00:00
Fangrui Song	1560a00032	[yaml2obj/obj2yaml/llvm-readobj] Support SHF_GNU_RETAIN In binutils, the flag is defined for ELFOSABI_GNU and ELFOSABI_FREEBSD. It can be used to mark a section as a GC root. In practice, the flag has generic semantics and can be applied to many EI_OSABI values, so we consider it generic. Differential Revision: https://reviews.llvm.org/D95728	2021-02-02 09:19:53 -08:00
Rahman Lavaee	f1ff6d210a	[obj2yaml, yaml2obj] Use Hex64 for BBAddressMap fields. This patch let the yaml encoding use Hex64 values for NumBlocks, BB AddressOffset, BB Size, and BB Metadata. Additionally, it changes the decoded values in elf2yaml to uint64_t to match DataExtractor::getULEB128 return type. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D95767	2021-02-01 15:37:30 -08:00
Patrick Oppenlander	93345e825a	[llvm-objcopy] -O binary: consider SHT_NOBITS sections to be empty This is consistent with BFD objcopy. Previously llvm objcopy would allocate space for SHT_NOBITS sections often resulting in enormous binary files. New test case (binary-paddr.test %t6). Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D95569	2021-02-01 15:01:25 -08:00
Kazu Hirata	3d1200b9f6	[llvm] Drop unnecessary const from return types (NFC) Identified with const-return-type.	2021-01-31 10:23:43 -08:00
Alexey Lapshin	fb244ffb9f	[dsymutil][DWARFLinker][NFC] make AddressManager not depending on the order of checks for relocations. Current dsymutil implementation of hasLiveMemoryLocation()/hasLiveAddressRange() and applyValidRelocs() assume that calls should be done in certain order (from first Dies to last). Multi-thread implementation might call these methods in other order(it might process compilation units in order other than they are physically located), so we remove restriction that searching for relocations should be done in ascending order. This change does not introduce noticable performance degradation. The testing results for clang binary: golden-dsymutil/dsymutil 23787992 clang MD5: 5efa8fd9355ebf81b65f24db5375caa2 elapsed time=91sec build-Release/bin/dsymutil 23855616 clang MD5: 5efa8fd9355ebf81b65f24db5375caa2 elapsed time=91sec Differential Revision: https://reviews.llvm.org/D93106	2021-01-31 16:34:10 +03:00
Georgii Rymar	d221406875	[llvm-symbolizer] - Fix the crash in GNU output style with --no-inlines and missing input file. Fixes https://bugs.llvm.org/show_bug.cgi?id=48882. If the input file does not exist (or has a reading error), the following code will crash if there are two or more input addresses. ``` auto ResOrErr = Symbolizer.symbolizeInlinedCode( ModuleName, {Offset, object::SectionedAddress::UndefSection}); Printer << (error(ResOrErr) ? DILineInfo() : ResOrErr.get().getFrame(0)); ``` For the first address, `symbolizeInlinedCode` returns an error. For the second address, `symbolizeInlinedCode` returns an empty result (not an error) and `.getFrame(0)` will crash. Differential revision: https://reviews.llvm.org/D95609	2021-01-30 18:36:38 +03:00
Florian Hahn	7a6a2cc81a	[LTO] Add option enable NewPM with LTOCodeGenerator. This patch adds an option to enable the new pass manager in LTOCodeGenerator. It also updates a few tests with legacy PM specific tests, which started failing after `6a59f05606` when LLVM_ENABLE_NEW_PASS_MANAGER=true.	2021-01-30 11:54:20 +00:00
Florian Hahn	6a59f05606	[LTO] Use lto::backend for code generation. This patch updates LTOCodeGenerator to use the utilities provided by LTOBackend to run middle-end optimizations and backend code generation. This is a first step towards unifying the code used by libLTO's C API and the newer, C++ interface (see PR41541). The immediate motivation is to allow using the new pass manager when doing LTO using libLTO's C API, which is used on Darwin, among others. With the changes, there are no codegen/stats differences when building MultiSource/SPEC2000/SPEC2006 on Darwin X86 with LTO, compared to without the patch. Reviewed By: steven_wu Differential Revision: https://reviews.llvm.org/D94487	2021-01-30 10:09:55 +00:00
Kazu Hirata	1a2d67fa23	[llvm] Use llvm::lower_bound and llvm::upper_bound (NFC)	2021-01-29 23:23:36 -08:00
Kazu Hirata	7728cc003a	[llvm] Use append_range (NFC)	2021-01-29 23:23:34 -08:00
Greg McGary	61a5502a93	[llvm-objdump-macho] print per-second-level-page encodings for option --unwind-info Compact unwind entries have 8 bits for the encoding-table offset: * offsets 0..126 reference the global commmon-encodings table, while * offsets 127..255 reference a per-second-level-page table. This diff teaches `llvm-objdump` to print this per-page encodings table. Differential Revision: https://reviews.llvm.org/D93265	2021-01-29 21:59:07 -07:00
Florian Hahn	f3a710cade	[LTO] Update splitCodeGen to take a reference to the module. (NFC) splitCodeGen does not need to take ownership of the module, as it currently clones the original module for each split operation. There is an ~4 year old fixme to change that, but until this is addressed, the function can just take a reference to the module. This makes the transition of LTOCodeGenerator to use LTOBackend a bit easier, because under some circumstances, LTOCodeGenerator needs to write the original module back after codegen. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D95222	2021-01-29 11:53:11 +00:00
Rafik Zurob	cba2552bfe	[llvm-jitlink] Replace use of deprecated gethostbyname by getaddrinfo. This patch replaces use of deprecated gethostbyname by getaddrinfo. Author: Rafik Zurob Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D95477	2021-01-29 03:11:16 -06:00
Georgii Rymar	a5154ab9b0	[llvm-readobj/elf] - Report "bitcode files are not supported" warning for bitcode files. Fixes https://bugs.llvm.org/show_bug.cgi?id=43543 Currently we report "The file was not recognized as a valid object file" for BC files. Also, we terminate dumping. Instead we could report a better warning and try to continue dumping other files. This is what this patch implements. Differential revision: https://reviews.llvm.org/D95605	2021-01-29 12:04:41 +03:00
Yang Fan	d6d0c09e84	[NFC][llvm-nm] Fix unused variable warning	2021-01-29 11:42:23 +08:00
Fangrui Song	b3af96d07b	[llvm-nm] Display defined weak STT_GNU_IFUNC symbols as 'i' This patch makes the behavior match GNU nm. Note: undefined STT_GNU_IFUNC symbols use 'U'. Differential Revision: https://reviews.llvm.org/D95461	2021-01-28 09:46:05 -08:00
Hongtao Yu	7e99bddfea	[CSSPGO] Support of CS profiles in extended binary format. This change brings up support of context-sensitive profiles in the format of extended binary. Existing sample profile reader/writer/merger code is being tweaked to reflect the fact of bracketed input contexts, like (`[...]`). The paired brackets are also needed in extbinary profiles because we don't yet have an otherwise good way to tell calling contexts apart from regular function names since the context delimiter `@` can somehow serve as a part of the C++ mangled names. Reviewed By: wmi, wenlei Differential Revision: https://reviews.llvm.org/D95547	2021-01-27 21:29:46 -08:00
Teresa Johnson	1487747e99	[LTO] Prevent devirtualization for symbols dynamically exported Identify dynamically exported symbols (--export-dynamic[-symbol=], --dynamic-list=, or definitions needed to preempt shared objects) and prevent their LTO visibility from being upgraded. This helps avoid use of whole program devirtualization when there may be overrides in dynamic libraries. Differential Revision: https://reviews.llvm.org/D91583	2021-01-27 15:54:13 -08:00
Craig Topper	0b50fa9945	[FaultsMaps][llvm-objdump] Move FaultMapParser to Object/. Remove CodeGen dependency from llvm-objdump FaultsMapParser lived in CodeGen and was forcing llvm-objdump to link CodeGen and everything CodeGen depends on. This was previously attempted in r240364 to fix a link failure. The CodeGen dependency was independently added to fix the same link failure, and that ended up being kept. Removing the dependency seems like the correct layering for llvm-objdump. Reviewed By: MaskRay, jhenderson Differential Revision: https://reviews.llvm.org/D95414	2021-01-27 10:39:59 -08:00
Kazu Hirata	48bdd676a1	[llvm-objdump] Use append_range (NFC)	2021-01-26 20:00:19 -08:00
Fangrui Song	4d28f0a6a4	[llc] Add reportError helper and canonicalize error messages	2021-01-26 15:33:37 -08:00
Fangrui Song	34b60d8a56	Add -fbinutils-version= to gate ELF features on the specified binutils version There are two use cases. Assembler We have accrued some code gated on MCAsmInfo::useIntegratedAssembler(). Some features are supported by latest GNU as, but we have to use MCAsmInfo::useIntegratedAs() because the newer versions have not been widely adopted (e.g. SHF_LINK_ORDER 'o' and 'unique' linkage in 2.35, --compress-debug-sections= in 2.26). Linker We want to use features supported only by LLD or very new GNU ld, or don't want to work around older GNU ld. We currently can't represent that "we don't care about old GNU ld". You can find such workarounds in a few other places, e.g. Mips/MipsAsmprinter.cpp PowerPC/PPCTOCRegDeps.cpp X86/X86MCInstrLower.cpp AArch64 TLS workaround for R_AARCH64_TLSLD_MOVW_DTPREL_* (PR ld/18276), R_AARCH64_TLSLE_LDST8_TPREL_LO12 (https://bugs.llvm.org/show_bug.cgi?id=36727 https://sourceware.org/bugzilla/show_bug.cgi?id=22969) Mixed SHF_LINK_ORDER and non-SHF_LINK_ORDER components (supported by LLD in D84001; GNU ld feature request https://sourceware.org/bugzilla/show_bug.cgi?id=16833 may take a while before available). This feature allows to garbage collect some unused sections (e.g. fragmented .gcc_except_table). This patch adds `-fbinutils-version=` to clang and `-binutils-version` to llc. It changes one codegen place in SHF_MERGE to demonstrate its usage. `-fbinutils-version=2.35` means the produced object file does not care about GNU ld<2.35 compatibility. When `-fno-integrated-as` is specified, the produced assembly can be consumed by GNU as>=2.35, but older versions may not work. `-fbinutils-version=none` means that we can use all ELF features, regardless of GNU as/ld support. Both clang and llc need `parseBinutilsVersion`. Such command line parsing is usually implemented in `llvm/lib/CodeGen/CommandFlags.cpp` (LLVMCodeGen), however, ClangCodeGen does not depend on LLVMCodeGen. So I add `parseBinutilsVersion` to `llvm/lib/Target/TargetMachine.cpp` (LLVMTarget). Differential Revision: https://reviews.llvm.org/D85474	2021-01-26 12:28:23 -08:00
Martin Storsjö	510b3d4b3e	[llvm-nm] Silence a gcc warning about a stray semicolon. NFC.	2021-01-26 12:29:14 +02:00
Georgii Rymar	db92d47cf7	[llvm-nm][ELF] - Use @@ prefix when printing default versions. llvm-readelf prints default versions with `@@` prefix. This patch does the same for llvm-nm. Differential revision: https://reviews.llvm.org/D94912	2021-01-26 12:16:38 +03:00
Georgii Rymar	e98d5c3192	[libObject,llvm-readelf/obj] - Don't use @@ when printing versions of undefined symbols. A default version (@@) is only available for defined symbols. Currently we use "@@" for undefined symbols too. This patch fixes the issue and improves our test case. Differential revision: https://reviews.llvm.org/D95219	2021-01-26 12:05:59 +03:00
Philip Pfaffe	da489946a9	[llvm-dwp] Automatically set the target triple The llvm-dwp tool hard-codes the target triple to x86. Instead, deduce the target triple from the object files being read. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D93749	2021-01-25 11:58:54 +01:00
Georgii Rymar	9c89dcf807	[yaml2obj, obj2yaml] - Implement section header table as a special Chunk. This was discussed in D93678 thread. Currently we have one special chunk - Fill. This patch re implements the "SectionHeaderTable" key to become a special chunk too. With that we are able to place the section header table at any location, just like we place sections. Differential revision: https://reviews.llvm.org/D95140	2021-01-25 13:08:08 +03:00
Florian Hahn	f959d8195d	[LTO] Move DisableVerify setting to LTOCodeGenerator class (NFC). To simplify the transition to using LTOBackend, move DisableVerify to the LTOCodeGenerator class, like most/all other options. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D95223	2021-01-24 14:14:40 +00:00
Arthur Eubanks	c37dd3b6d5	[NewPM][opt] Make -enable-new-pm default to LLVM_ENABLE_NEW_PASS_MANAGER This is controlled by the ENABLE_EXPERIMENTAL_NEW_PASS_MANAGER CMake flag. https://lists.llvm.org/pipermail/llvm-dev/2021-January/147993.html Reviewed By: ychen, asbirlea Differential Revision: https://reviews.llvm.org/D95254	2021-01-23 12:36:09 -08:00
Florian Hahn	166d40f2ed	[FuzzMutate] Add mutator to modify instruction flags. This patch adds a new InstModificationIRStrategy to mutate flags/options for instructions. For example, it may add or remove nuw/nsw flags from add, mul, sub, shl instructions or change the predicate for icmp instructions. Subtle changes such as those mentioned above should lead to a more interesting range of inputs. The presence or absence of overflow flags can expose subtle bugs, for example. Reviewed By: bogner Differential Revision: https://reviews.llvm.org/D94905	2021-01-23 19:05:20 +00:00
Florian Hahn	08dbcc14e2	[LTO] Store target attributes as vector of strings (NFC). The target features are obtained as a list of features/attributes. Instead of storing them in a single string, store the vector. This matches lto::Config's behavior and simplifies the transition to lto::backend(). Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D95224	2021-01-23 12:11:58 +00:00
Florian Hahn	2a8cbdd830	[LTO] Add support for existing Config::Freestanding option. lto::Config has a field to control whether the build is "freestanding" (no builtins) or not, but it is not hooked up to the code actually running the passes. This patch adds support for the flag to both the code that runs optimization with the new and old pass managers, by explicitly adding a TargetLibraryInfo instance. If Freestanding is true, all library functions are disabled. Reviewed By: steven_wu Differential Revision: https://reviews.llvm.org/D94630	2021-01-22 13:45:39 +00:00
Arthur Eubanks	6699029b67	[NewPM][opt] Run the "default" AA pipeline by default We tend to assume that the AA pipeline is by default the default AA pipeline and it's confusing when it's empty instead. PR48779 Initially reverted due to BasicAA running analyses in an unspecified order (multiple function calls as parameters), fixed by fetching analyses before the call to construct BasicAA. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D95117	2021-01-21 21:08:54 -08:00
Arthur Eubanks	ba9b4ea4ee	Revert "[NewPM][opt] Run the "default" AA pipeline by default" This reverts commit `be611431cd`. Other/new-pm-lto-defaults.ll failing	2021-01-21 20:16:34 -08:00
Kazu Hirata	cfa241680f	[llvm] Don't include StringSwitch.h where unnecessary (NFC)	2021-01-21 19:59:48 -08:00
Arthur Eubanks	be611431cd	[NewPM][opt] Run the "default" AA pipeline by default We tend to assume that the AA pipeline is by default the default AA pipeline and it's confusing when it's empty instead. PR48779 Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D95117	2021-01-21 19:46:38 -08:00
Wolfgang Pieb	c6e8f81410	[llvm-mca] Addressing build failures due to missing override specifiers	2021-01-21 17:32:18 -08:00
Wolfgang Pieb	04af1ca2e9	[llvm-mca] Forgot a couple of override specifiers. Differential Revision: https://reviews.llvm.org/D86644	2021-01-21 15:44:14 -08:00
Wolfgang Pieb	d38be2ba0e	[llvm-mca] Initial implementation of serialization using JSON. The views implemented at this time are Summary, Timeline, ResourcePressure and InstructionInfo. Use --json on the command line to obtain JSON output.	2021-01-21 15:15:54 -08:00
Georgii Rymar	dd5c982804	[llvm-nm][ELF] - Make -D display symbol versions. This fixes https://bugs.llvm.org/show_bug.cgi?id=48670. Since binutils 2.35, nm -D displays symbol versions by default. This patch teaches llvm-nm to do the same. Differential revision: https://reviews.llvm.org/D94907	2021-01-21 11:23:45 +03:00
Georgii Rymar	51f4958057	[yaml2obj/obj2yaml] - Improve dumping/creating of ELF versioning sections. This makes the following improvements. For `SHT_GNU_versym`: * yaml2obj: set `sh_link` to index of `.dynsym` section automatically. For `SHT_GNU_verdef`: * yaml2obj: set `sh_link` to index of `.dynstr` section automatically. * yaml2obj: set `sh_info` field automatically. * obj2yaml: don't dump the `Info` field when its value matches the number of version definitions. For `SHT_GNU_verneed`: * yaml2obj: set `sh_link` to index of `.dynstr` section automatically. * yaml2obj: set `sh_info` field automatically. * obj2yaml: don't dump the `Info` field when its value matches the number of version dependencies. Also, simplifies few test cases. Differential revision: https://reviews.llvm.org/D94956	2021-01-21 10:36:48 +03:00
Jonas Devlieghere	f354b87df2	[dsymutil] Compare object modification times using second precision The modification time in the debug map is expressed using second precision, while the modification time returned by the filesystem could be more precise. Avoid spurious warnings about timestamp mismatches by truncating the modification time reported by the system to seconds.	2021-01-20 18:45:30 -08:00
Kazu Hirata	978c754076	[llvm] Use llvm::any_of (NFC)	2021-01-19 20:19:16 -08:00
wlei	daeea961a6	[llvm-profgen][NFC] Fix the incorrect computation of callsite sample count Differential Revision: https://reviews.llvm.org/D95009	2021-01-19 17:50:48 -08:00
Sergey Dmitriev	233106269d	[llvm-link] Improve link time for bitcode archives [NFC] Linking large bitcode archives currently takes a lot of time with llvm-link, this patch adds couple improvements which reduce link time for archives - Use one Linker instance for archive instead of recreating it for each member - Lazy load archive members Reviewed By: tra, jdoerfert Differential Revision: https://reviews.llvm.org/D94643	2021-01-19 16:41:28 -08:00
Arthur Eubanks	cabe1b1124	[polly][NewPM][test] Fix polly tests under -enable-new-pm In preparation for turning on opt's -enable-new-pm by default, this pins uses of passes via the legacy "opt -passname" with pass names beginning with "polly-" and "polyhedral-info" to the legacy PM. Many of these tests use -analyze, which isn't supported in the new PM. (This doesn't affect uses of "opt -passes=passname"). rL240766 accidentally removed `-polly-prepare` in phi_not_grouped_at_top.ll, and it also doesn't use the output of -analyze. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D94266	2021-01-19 12:38:58 -08:00
Kazu Hirata	23b0ab2acb	[llvm] Use the default value of drop_begin (NFC)	2021-01-18 10:16:36 -08:00
Georgii Rymar	b9ce772b8f	[Object, llvm-readelf] - Move the API for retrieving symbol versions to ELF.h `ELFDumper.cpp` implements the functionality that allows to get symbol versions. It is used for dumping versioned symbols. This helps to implement https://bugs.llvm.org/show_bug.cgi?id=48670 ("make llvm-nm -D print version names"): we can move out and reuse the code from `ELFDumper.cpp`. This is what this patch do: it moves the related functionality to `ELFFile<ELFT>`. Differential revision: https://reviews.llvm.org/D94771	2021-01-18 12:50:29 +03:00
Kazu Hirata	352fcfc697	[llvm] Use llvm::sort (NFC)	2021-01-17 10:39:45 -08:00
Kazu Hirata	2082b10d10	[llvm] Use *::empty (NFC)	2021-01-16 09:40:55 -08:00
Florian Hahn	bca16e2fbb	[LTO] Remove options to disable inlining, vectorization & GVNLoadPRE. This patch removes some ancient options as a clean-up before moving code-gen to use LTOBackend in D94487. I think it would preferable to remove those ancient options, because 1. There are no corresponding options in LTOBackend based tools, 2. There are no unit tests for them, 3. They are not passed through by Clang, 4. At least for GNVLoadPRE, users could just use GVN's `enable-load-pre`. Alternatively we could add support for those options to lto::Config & co, but I think it would be better to remove them, unless they are actually used in practice. Reviewed By: steven_wu, tejohnson Differential Revision: https://reviews.llvm.org/D94783	2021-01-16 16:29:15 +00:00
Georgii Rymar	d9afe8588e	[yaml2obj/obj2yaml] - Refine handling of SHT_GNU_verdef sections. This patch: 1) Makes `Version`, `Flags`, `VersionNdx` and `Hash` fields to be `Optional<>`. 2) Disallows dumping version definitions that have `vd_version != 1`. `vd_version` identifies the version of the structure itself. (https://refspecs.linuxfoundation.org/LSB_5.0.0/LSB-Core-generic/LSB-Core-generic/symversion.html, https://docs.oracle.com/cd/E19683-01/816-7777/chapter6-80869/index.html) 3) Stops dumping default values for `Version`, `Flags`, `VersionNdx` and `Hash` fields. 4) Refines testing. Differential revision: https://reviews.llvm.org/D94659	2021-01-15 12:40:42 +03:00
Georgii Rymar	021ea78a97	[llvm-nm] - Simplify the code in dumpSymbolNamesFromObject. NFC. It is possible to simplify the logic that extracts symbol names. D94667 made the `NMSymbol::Name` to be `std::string`, what allowed this simplification. Differential revision: https://reviews.llvm.org/D94669	2021-01-15 12:29:49 +03:00
Georgii Rymar	bfb8f45ef3	[llvm-nm] - Move MachO specific logic out from the dumpSymbolNamesFromObject(). NFC. `dumpSymbolNamesFromObject` is the method that dumps symbol names. It has 563 lines, mostly because of huge piece of MachO specific code. In this patch I move it to separate helper method. The new size of `dumpSymbolNamesFromObject` is 93 lines. With it it becomes much easier to maintain it. I had to change the type of 2 name fields to `std::string`, because MachO logic uses temporarily buffer strings (e.g `ExportsNameBuffer`, `BindsNameBuffer` etc): ``` std::string ExportsNameBuffer; raw_string_ostream EOS(ExportsNameBuffer); ``` these buffers were moved to `dumpSymbolsFromDLInfoMachO` by this patch and invalidated after return. Technically, before this patch we had a situation when local pointers (symbol names) were assigned to members of global static `SymbolList`, what is dirty by itself. Differential revision: https://reviews.llvm.org/D94667	2021-01-15 12:18:37 +03:00
Georgii Rymar	1185d3f43d	[llvm-readobj] - Fix the compilation with GCC < 7.0. This addressed post commit comments for D93900. GCC had an issue and requires placing a specialization of `printUnwindInfo` to a namespace to compile: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=56480	2021-01-15 11:58:04 +03:00
Kazu Hirata	7dc3575ef2	[llvm] Remove redundant return and continue statements (NFC) Identified with readability-redundant-control-flow.	2021-01-14 20:30:34 -08:00
Kazu Hirata	2efcbe24a7	[llvm] Use llvm::drop_begin (NFC)	2021-01-14 20:30:33 -08:00
Andy Wingo	53e3b81faa	[lld][WebAssembly] Add support for handling table symbols This commit adds table symbol support in a partial way, while still including some special cases for the __indirect_function_table symbol. No change in tests. Differential Revision: https://reviews.llvm.org/D94075	2021-01-14 11:13:13 +01:00
Kazu Hirata	4c1617dac8	[llvm] Use std::any_of (NFC)	2021-01-13 19:14:44 -08:00
wlei	35debdfcac	[NFC] Fix build break by a initializer list converting error	2021-01-13 14:28:02 -08:00
wlei	33a8466531	[NFC] fix missing SectionName declaration	2021-01-13 11:30:09 -08:00
wlei	c681400b25	[CSSPGO][llvm-profgen] Virtual unwinding with pseudo probe This change extends virtual unwinder to support pseudo probe in llvm-profgen. Please refer https://groups.google.com/g/llvm-dev/c/1p1rdYbL93s and https://reviews.llvm.org/D89707 for more context about CSSPGO and llvm-profgen. Implementation - Added `ProbeBasedCtxKey` derived from `ContextKey` for sample counter aggregation. As we need string splitting to infer the profile for callee function, string based context introduces more string handling overhead, here we just use probe pointer based context. - For linear unwinding, as inline context is encoded in each pseudo probe, we don't need to go through each instruction to extract range sharing same inliner. So just record the range for the context. - For probe based context, we should ignore the top frame probe since it will be extracted from the address range. we defer the extraction in `ProfileGeneration`. - Added `PseudoProbeProfileGenerator` for pseudo probe based profile generation. - Some helper function to get pseduo probe info(call probe, inline context) from profiled binary. - Added regression test for unwinder's output The pseudo probe based profile generation will be in the upcoming patch. Test Plan: ninja & ninja check-llvm Differential Revision: https://reviews.llvm.org/D92896	2021-01-13 11:02:58 -08:00
wlei	414930b91b	[CSSPGO][llvm-profgen] Refactor to unify hashable interface for trace sample and context-sensitive counter As we plan to support both CSSPGO and AutoFDO for llvm-profgen, we will have different kinds of perf sample and different kinds of sample counter(cs/non-cs, with/without pseudo probe) which both need to do aggregation in hash map. This change implements the hashable interface(`Hashable`) and the unified base class for them to have better extensibility and reusability. Currently perf trace sample and sample counter with context implemented this `Hashable` and the class hierarchy is like: ``` \| Hashable \| PerfSample \| HybridSample \| LBRSample \| ContextKey \| StringBasedCtxKey \| ProbeBasedCtxKey \| CallsiteBasedCtxKey \| ... ``` - Class specifying `Hashable` should implement `getHashCode` and `isEqual`. Here we make `getHashCode` a non-virtual function to avoid vtable overhead, so derived class should calculate and assign the base class's HashCode manually. This also provides the flexibility for calculating the hash code incrementally(like rolling hash) during frame stack unwinding - `isEqual` is a virtual function, which will have perf overhead. In the future, if we redesign a better hash function, then we can just skip this or switch to non-virtual function. - Added `PerfSample` and `ContextKey` as base class for perf sample and counter context key, leveraging llvm-style RTTI for this. - Added `StringBasedCtxKey` class extending `ContextKey` to use string as context id. - Refactor `AggregationCounter` to take all kinds of `PerfSample` as key - Refactor `ContextSampleCounter` to take all kinds of `ContextKey` as key - Other refactoring work: - Create a wrapper class `SampleCounter` to wrap `RangeCounter` and `BranchCounter` - Hoist `ContextId` and `FunctionProfile` out of `populateFunctionBodySamples` and `populateFunctionBoundarySamples` to reuse them in ProfileGenerator Differential Revision: https://reviews.llvm.org/D92584	2021-01-13 11:02:57 -08:00
wlei	b3154d11bc	[CSSPGO][llvm-profgen] Pseudo probe decoding and disassembling This change implements pseudo probe decoding and disassembling for llvm-profgen/CSSPGO. Please see https://groups.google.com/g/llvm-dev/c/1p1rdYbL93s and https://reviews.llvm.org/D89707 for more context about CSSPGO and llvm-profgen. ELF section format Please see the encoding patch(https://reviews.llvm.org/D91878) for more details of the format, just copy the example here: Two section(`.pseudo_probe_desc` and `.pseudoprobe` ) is emitted in ELF to support pseudo probe. The format of `.pseudo_probe_desc` section looks like: ``` .section .pseudo_probe_desc,"",@progbits .quad 6309742469962978389 // Func GUID .quad 4294967295 // Func Hash .byte 9 // Length of func name .ascii "_Z5funcAi" // Func name .quad 7102633082150537521 .quad 138828622701 .byte 12 .ascii "_Z8funcLeafi" .quad 446061515086924981 .quad 4294967295 .byte 9 .ascii "_Z5funcBi" .quad -2016976694713209516 .quad 72617220756 .byte 7 .ascii "_Z3fibi" ``` For each `.pseudoprobe` section, the encoded binary data consists of a single function record corresponding to an outlined function (i.e, a function with a code entry in the `.text` section). A function record has the following format : ``` FUNCTION BODY (one for each outlined function present in the text section) GUID (uint64) GUID of the function NPROBES (ULEB128) Number of probes originating from this function. NUM_INLINED_FUNCTIONS (ULEB128) Number of callees inlined into this function, aka number of first-level inlinees PROBE RECORDS A list of NPROBES entries. Each entry contains: INDEX (ULEB128) TYPE (uint4) 0 - block probe, 1 - indirect call, 2 - direct call ATTRIBUTE (uint3) reserved ADDRESS_TYPE (uint1) 0 - code address, 1 - address delta CODE_ADDRESS (uint64 or ULEB128) code address or address delta, depending on ADDRESS_TYPE INLINED FUNCTION RECORDS A list of NUM_INLINED_FUNCTIONS entries describing each of the inlined callees. Each record contains: INLINE SITE GUID of the inlinee (uint64) ID of the callsite probe (ULEB128) FUNCTION BODY A FUNCTION BODY entry describing the inlined function. ``` Disassembling A switch `--show-pseudo-probe` is added to use along with `--show-disassembly` to print disassembly code with pseudo probe directives. For example: ``` 00000000002011a0 <foo2>: 2011a0: 50 push rax 2011a1: 85 ff test edi,edi [Probe]: FUNC: foo2 Index: 1 Type: Block 2011a3: 74 02 je 2011a7 <foo2+0x7> [Probe]: FUNC: foo2 Index: 3 Type: Block [Probe]: FUNC: foo2 Index: 4 Type: Block [Probe]: FUNC: foo Index: 1 Type: Block Inlined: @ foo2:6 2011a5: 58 pop rax 2011a6: c3 ret [Probe]: FUNC: foo2 Index: 2 Type: Block 2011a7: bf 01 00 00 00 mov edi,0x1 [Probe]: FUNC: foo2 Index: 5 Type: IndirectCall 2011ac: ff d6 call rsi [Probe]: FUNC: foo2 Index: 4 Type: Block 2011ae: 58 pop rax 2011af: c3 ret ``` Implementation - `PseudoProbeDecoder` is added in ProfiledBinary as an infra for the decoding. It decoded the two section and generate two map: `GUIDProbeFunctionMap` stores all the `PseudoProbeFunction` which is the abstraction of a general function. `AddressProbesMap` stores all the pseudo probe info indexed by its address. - All the inline info is encoded into binary as a trie(`PseudoProbeInlineTree`) and will be constructed from the decoding. Each pseudo probe can get its inline context(`getInlineContext`) by traversing its inline tree node backwards. Test Plan: ninja & ninja check-llvm Differential Revision: https://reviews.llvm.org/D92334	2021-01-13 11:02:57 -08:00
Jonas Devlieghere	48d2068fb7	[dsymutil] Warn on timestmap mismatch between object file and debug map This re-lands `e5553b9a6a` with two small fixes to the tests: - Don't touch the source directory in debug-map-parsing.test but instead copy everything over in a temporary directory in timestamp-mismatch.test. - Don't redirect stderr to stdout to avoid the output getting intertwined in extern-alias.test.	2021-01-13 09:15:30 -08:00
David Zarzycki	c6e341c899	Revert "[dsymutil] Warn on timestmap mismatch between object file and debug map" This reverts commit `e5553b9a6a`. Tests are not allowed to modify the source. Please figure out a way to use %t rather than dynamically modifying the inputs.	2021-01-13 07:23:34 -05:00
Georgii Rymar	6d3098e7ff	[obj2yaml,yaml2obj] - Refine how we set/dump the sh_entsize field. This reuses the code from yaml2obj (moves it to ELFYAML.h). With it we can set the `sh_entsize` in a single place in `obj2yaml`. Note that it also fixes a bug of `yaml2obj`: we do not set the `sh_entsize` field for the `SHT_ARM_EXIDX` section properly. Differential revision: https://reviews.llvm.org/D93858	2021-01-13 11:52:40 +03:00
Georgii Rymar	141906fa14	[llvm-readelf/obj] - Add support of multiple SHT_SYMTAB_SHNDX sections. Currently we don't support multiple SHT_SYMTAB_SHNDX sections and the DT_SYMTAB_SHNDX tag currently. This patch implements it and fixes the https://bugs.llvm.org/show_bug.cgi?id=43991. I had to introduce the `struct DataRegion` to ELF.h, it is used to represent a region that might have no known size. It is needed, because we don't know the size of the extended section indices table when it is located via DT_SYMTAB_SHNDX. In this case we still want to validate that we don't read past the end of the file. Differential revision: https://reviews.llvm.org/D92923	2021-01-13 11:36:43 +03:00
Jonas Devlieghere	f1d5cbbdee	[dsymutil] Add preliminary support for DWARF 5. Currently dsymutil will silently fail when processing binaries with Dwarf 5 debug info. This patch adds rudimentary support for Dwarf 5 in dsymutil. - Recognize relocations in the debug_addr section. - Recognize (a subset of) Dwarf 5 form values. - Emits valid Dwarf 5 compile unit header chains. To simplify things (and avoid having to emit indexed sections) I decided to emit the relocated addresses directly in the debug info section. - DW_FORM_strx gets relocated and rewritten to DW_FORM_strp - DW_FORM_addrx gets relocated and rewritten to DW_FORM_addr Obviously there's a lot of work left, but this should be a step in the right direction. rdar://62345491 Differential revision: https://reviews.llvm.org/D94323	2021-01-12 21:55:41 -08:00
Kazu Hirata	12fc9ca3a4	[llvm] Remove redundant string initialization (NFC) Identified with readability-redundant-string-init.	2021-01-12 21:43:46 -08:00
Jonas Devlieghere	8a47d875b0	[dsymutil] Copy eh_frame content into the dSYM companion file. Copy over the __eh_frame from the binary into the dSYM. This helps kernel developers that are working with only dSYMs (i.e. no binaries) when debugging a core file. This only kicks in when the __eh_frame exists in the linked binary. Most of the time ld64 will remove the section in favor of compact unwind info. When it is emitted, it's generally small enough and should not bloat the dSYM. rdar://69774935 Differential revision: https://reviews.llvm.org/D94460	2021-01-12 19:50:34 -08:00
Jonas Devlieghere	e5553b9a6a	[dsymutil] Warn on timestmap mismatch between object file and debug map Add a warning when the timestmap doesn't match between the object file and the debug map entry. We were already emitting such warnings for archive members and swift interface files. This patch also unifies the warning across all three. rdar://65614640 Differential revision: https://reviews.llvm.org/D94536	2021-01-12 18:58:10 -08:00
Georgii Rymar	c15a57cc1a	[obj2yaml] - Don't crash when an object has an empty symbol table. Currently we crash when we have an object with SHT_SYMTAB/SHT_DYNSYM sections of size 0. With this patch instead of the crash we start to dump them properly. Differential revision: https://reviews.llvm.org/D93697	2021-01-12 14:08:59 +03:00
Georgii Rymar	60df7c08b1	[obj2yaml,yaml2obj] - Fix issues with creating/dumping group sections. We have the following issues related to group sections: 1) yaml2obj is unable to set the custom `sh_entsize` value, because the `EntSize` key is currently ignored. 2) obj2yaml is unable to dump the group section which `sh_entsize != 4`. 3) obj2yaml always dumps the "EntSize" for group sections, though usually we are trying to omit dumping default values when dumping keys. I.e. we should not print the "EntSize" key when `sh_entsize` == 4. This patch fixes (1),(3) and adds the test case to document the behavior of (2). Differential revision: https://reviews.llvm.org/D93854	2021-01-12 14:07:42 +03:00
Georgii Rymar	891b4873c1	[llvm-readobj] - One more attempt to fix BB. Add `this->` for `W`, which is the member of `ObjDumper` An example of error: readobj/ELFDumper.cpp:738:13: error: use of undeclared identifier 'W' assert(&W.getOStream() == &llvm::fouts());	2021-01-12 13:17:59 +03:00
Georgii Rymar	cc91efdabe	[llvm-readobj] - An attempt to fix BB. This adds the `template` keyword for 'getAsArrayRef' calls. An example of error: /b/1/openmp-gcc-x86_64-linux-debian/llvm.src/llvm/tools/llvm-readobj/ELFDumper.cpp:4491:50: error: use 'template' keyword to treat 'getAsArrayRef' as a dependent template name for (const Elf_Rel &Rel : this->DynRelRegion.getAsArrayRef<Elf_Rel>())	2021-01-12 13:09:49 +03:00
Georgii Rymar	1e11402aa8	[llvm-readobj] - Add 'override' to fix build bots. This should fix bots after landing D93900. An example of error is: /home/worker/2.0.1/lldb-x86_64-debian/llvm-project/llvm/tools/llvm-readobj/ELFDumper.cpp:883:8: warning: 'printSectionMapping' overrides a member function but is not marked 'override' [-Winconsistent-missing-override] void printSectionMapping() {}	2021-01-12 13:01:15 +03:00
Georgii Rymar	9ec72cfc61	[llvm-readef/obj] - Change the design structure of ELF dumper. NFCI. This is a refactoring for design of stuff in `ELFDumper.cpp`. The current design of ELF dumper is far from ideal. Currently most overridden functions (inherited from `ObjDumper`) in `ELFDumper` just forward to the functions of `ELFDumperStyle` (which can be either `GNUStyle` or `LLVMStyle`). A concrete implementation may be in any of `ELFDumper`/`DumperStyle`/`GNUStyle`/`LLVMStyle`. This patch reorganizes the classes by introducing `GNUStyleELFDumper`/`LLVMStyleELFDumper` which inherit from `ELFDumper`. The implementations are moved: `DumperStyle` -> `ELFDumper` `GNUStyle` -> `GNUStyleELFDumper` `LLVMStyle` -> `LLVMStyleELFDumper` With that we can avoid having a lot of redirection calls and helper methods. The number of code lines changes from 7142 to 6922 (reduced by ~3%) and the code overall looks cleaner. Differential revision: https://reviews.llvm.org/D93900	2021-01-12 12:36:17 +03:00
Kazu Hirata	e5b4dbab04	[llvm] Simplify string comparisons (NFC) Identified with readability-string-compare.	2021-01-11 18:48:09 -08:00
Kazu Hirata	8590a3e3ad	[llvm] Use *Set::contains (NFC)	2021-01-11 18:48:07 -08:00
Kazu Hirata	89e8eb946d	[llvm] Use llvm::find_if (NFC)	2021-01-11 18:48:06 -08:00
Abhina Sreeskantharajan	8ad998a611	[tools] Mark output of tools as text if it is really text This is a continuation of https://reviews.llvm.org/D67696. The following tools also need to set the OF_Text flag correctly. - llvm-profdata - llvm-link Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D94313	2021-01-11 15:14:03 -05:00
Georgii Rymar	a6db7cf1ce	[llvm-readelf/obj] - Index phdrs and relocations from 0 when reporting warnings. As was mentioned in comments here: https://reviews.llvm.org/D92636#inline-864967 we are not consistent and sometimes index things from 0, but sometimes from 1 in warnings. This patch fixes 2 places: messages reported for program headers and messages reported for relocations. Differential revision: https://reviews.llvm.org/D93805	2021-01-11 15:13:54 +03:00
Georgii Rymar	c74751d4b5	[obj2yaml] - Fix the crash in getUniquedSectionName(). `getUniquedSectionName(const Elf_Shdr *Sec)` assumes that `Sec` is not `nullptr`. I've found one place in `getUniquedSymbolName` where it is not true (because of that we crash when trying to dump unnamed null section symbols). Patch fixes the crash and changes the signature of the `getUniquedSectionName` section to accept a reference. Differential revision: https://reviews.llvm.org/D93754	2021-01-11 15:04:00 +03:00
Kazu Hirata	6a6e382161	[llvm] Drop unnecessary make_range (NFC)	2021-01-09 09:25:00 -08:00
Martin Storsjö	7a91dad9e5	[llvm-readobj] [ARMWinEH] Clearly print an invalid case of packed unwind info as such As the actual windows unwinder doesn't support this case, don't pretend that it is supported when dumping the generated unwind info either, even if it would be possible to interpret it as something sensible. This should reduce the risk of us emitting such a case in code (although it's unlikely as long as the unwind info is generated through the SEH opcodes, as the opcodes can't describe this case). Differential Revision: https://reviews.llvm.org/D91529	2021-01-08 10:04:44 +02:00
Arthur Eubanks	9ccf13c36d	[NewPM][NVPTX] Port NVPTX opt passes There are only two used in the IR optimization pipeline. Port these and add them to the default pipeline. Similar to https://reviews.llvm.org/D93863. I added -mtriple to some tests since under the new PM, the passes are only available when the TargetMachine is specified. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D93930	2021-01-07 15:12:35 -08:00
Alexandre Ganea	ce7f30b2a8	[llvm-pdbutil] Don't crash when printing unknown CodeView type records Differential Revision: https://reviews.llvm.org/D93720	2021-01-07 15:44:55 -05:00
Roman Lebedev	8dee0b4bd6	[llvm-reduce] ReduceGlobalVarInitializers delta pass: fix handling of globals w/ comdat/non-external linkage Much like with ReduceFunctionBodies delta pass, we need to remove comdat and set linkage to external, else verifier will complain, and our deltas are invalid.	2021-01-07 18:05:03 +03:00
Simon Pilgrim	a9a8caf2ce	[llvm-objdump] Pass Twine by const reference instead of by value. NFCI.	2021-01-07 12:53:29 +00:00
Kazu Hirata	cd088ba7e6	[llvm] Use llvm::lower_bound and llvm::upper_bound (NFC)	2021-01-05 21:15:59 -08:00
Kazu Hirata	441650d589	[tools] Use llvm::append_range (NFC)	2021-01-05 21:15:56 -08:00
Christudasan Devadasan	d68458bd56	[GlobalISel] Base implementation for sret demotion. If the return values can't be lowered to registers SelectionDAG performs the sret demotion. This patch contains the basic implementation for the same in the GlobalISel pipeline. Furthermore, targets should bring relevant changes during lowerFormalArguments, lowerReturn and lowerCall to make use of this feature. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D92953	2021-01-06 10:30:50 +05:30
Sergey Dmitriev	761aca1e2e	[llvm-link] fix linker behavior when linking archives with --only-needed option This patch fixes linker behavior when archive is linked with other inputs as a library (i.e. when --only-needed option is specified). In this case library is expected to be normally linked first into a separate module and only after that linker should import required symbols from the linked library module. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D92535	2021-01-05 10:02:51 -08:00
Alan Phipps	9f2967bcfe	[Coverage] Add support for Branch Coverage in LLVM Source-Based Code Coverage This is an enhancement to LLVM Source-Based Code Coverage in clang to track how many times individual branch-generating conditions are taken (evaluate to TRUE) and not taken (evaluate to FALSE). Individual conditions may comprise larger boolean expressions using boolean logical operators. This functionality is very similar to what is supported by GCOV except that it is very closely anchored to the ASTs. Differential Revision: https://reviews.llvm.org/D84467	2021-01-05 09:51:51 -06:00
Arthur Eubanks	4e838ba9ea	[NewPM][AMDGPU] Port amdgpu-always-inline And add to AMDGPU opt pipeline. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D94025	2021-01-04 12:27:01 -08:00
Arthur Eubanks	fd323a897c	[NewPM][AMDGPU] Port amdgpu-printf-runtime-binding And add to AMDGPU opt pipeline. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D94026	2021-01-04 12:25:50 -08:00
Arthur Eubanks	e1833e7493	[NewPM][AMDGPU] Port amdgpu-unify-metadata And add to AMDGPU opt pipeline. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D94023	2021-01-04 11:57:46 -08:00
Arthur Eubanks	a5f863e076	[NewPM][AMDGPU] Port amdgpu-propagate-attributes-early/late And add to AMDGPU opt pipeline. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D94022	2021-01-04 11:53:37 -08:00
Kazu Hirata	eb198f4c3c	[llvm] Use llvm::any_of (NFC)	2021-01-04 11:42:47 -08:00
Roman Lebedev	5799fc79c3	[llvm-reduce] Refactor global variable delta pass The limitation of the current pass that it skips initializer-less GV's seems arbitrary, in all the reduced cases i (personally) looked at, the globals weren't needed, yet they were kept. So let's do two things: 1. allow reducing initializer-less globals 2. before reducing globals, reduce their initializers, much like we do function bodies	2021-01-03 01:45:47 +03:00
Roman Lebedev	19ab1817b6	[llvm-reduce] Fix removal of unused llvm intrinsics declarations `ee6e25e439` changed the delta pass to skip intrinsics, which means we may end up being left with declarations of intrinsics, that aren't otherwise referenced in the module. This is obviously unwanted, do drop them.	2021-01-03 01:45:47 +03:00
Hongtao Yu	01f0d162d6	Moving UniqueInternalLinkageNamesPass to the start of IR pipelines. `UniqueInternalLinkageNamesPass` is useful to CSSPGO, especially when pseudo probe is used. It solves naming conflict for static functions which otherwise will share a merged profile and likely have a profile quality issue with mismatched CFG checksums. Since the pseudo probe instrumentation happens very early in the pipeline, I'm moving `UniqueInternalLinkageNamesPass` right before it. This is being done only to the new pass manager. Reviewed By: dblaikie, aeubanks Differential Revision: https://reviews.llvm.org/D93656	2021-01-02 14:26:21 -08:00
Kazu Hirata	171c5fd43e	[llvm] Use llvm::erase_value and llvm::erase_if (NFC)	2021-01-02 09:24:15 -08:00
Roman Lebedev	e6b1a27fb9	[NFC][CodeGen] Split DwarfEHPrepare pass into an actual transform and an legacy-PM wrapper This is consistent with the layout of other passes, and simplifies further refinements regarding DomTree handling. This is indended to be a NFC commit.	2021-01-02 01:01:19 +03:00
Kazu Hirata	9a90c4ea8a	[llvm] Use isa instead of dyn_cast (NFC)	2021-01-01 12:44:56 -08:00
Kazu Hirata	bea8d021a3	[llvm] Use *Map::lookup (NFC)	2021-01-01 12:44:54 -08:00
Kazu Hirata	f904b46b1a	[llvm-objcopy] Use llvm::erase_if (NFC)	2020-12-31 09:39:09 -08:00
Bogdan Graur	2016f2c8a7	Fixes warning 'enumeration value not handled in switch'. This was introduced in commit: `981a0bd858`. Differential Revision: https://reviews.llvm.org/D93944	2020-12-30 06:56:29 -08:00
Haowei Wu	a1d0589266	[llvm-elfabi] Add flag to preserve timestamp when output is the same This change adds '--write-if-changed' flag to llvm-elfabi tool. When enabled, llvm-elfabi will not overwrite the existing file if the content of the file will not be changed, which preserves the timestamp. Differential Revision: https://reviews.llvm.org/D92902	2020-12-29 20:27:06 -08:00
Lang Hames	5efc71e119	[ORC] Move Orc RPC code into Shared, rename some RPC types. Moves all headers from Orc/RPC to Orc/Shared, and from the llvm::orc::rpc namespace into llvm::orc::shared. Also renames RPCTypeName to SerializationTypeName and Function to RPCFunction. In addition to being a more reasonable home for this code, this will make it easier for the upcoming Orc runtime to re-use the Serialization system for creating and parsing wrapper-function binary blobs.	2020-12-30 12:48:20 +11:00
Haowei Wu	d034a94e7b	Revert "[llvm-elfabi] Add flag to preserve timestamp when output is the same" This reverts commit `fddb417449`. which causes test failures on Mac builders.	2020-12-29 17:26:22 -08:00
Haowei Wu	fddb417449	[llvm-elfabi] Add flag to preserve timestamp when output is the same This change adds '--write-if-changed' flag to llvm-elfabi tool. When enabled, llvm-elfabi will not overwrite the existing file if the content of the file will not be changed, which preserves the timestamp. Differential Revision: https://reviews.llvm.org/D92902	2020-12-29 14:43:47 -08:00
Arthur Eubanks	7ecbe0c7a0	[NewPM][AMDGPU] Port amdgpu-lower-kernel-attributes And add it to the AMDGPU opt pipeline. This is a function pass instead of a module pass (like the legacy pass) because it's getting added to a CGSCCPassManager, and you can't put a module pass in a CGSCCPassManager. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D93885	2020-12-29 10:26:06 -08:00
Arthur Eubanks	c2ef06d3dd	[NewPM] Port infer-address-spaces And add it to the AMDGPU opt pipeline. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D93880	2020-12-28 19:58:12 -08:00
Arthur Eubanks	0e9abcfc19	[AMDGPU][NewPM] Port amdgpu-promote-alloca(-to-vector) And add to AMDGPU opt pipeline. Don't pin an opt run to the legacy PM when -enable-new-pm=1 if these passes (or passes introduced in https://reviews.llvm.org/D93863) are in the list of passes. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D93875	2020-12-28 17:52:31 -08:00
Kazu Hirata	079923309c	[llvm-cov] Use is_contained (NFC)	2020-12-27 09:57:25 -08:00
Kazu Hirata	b676f2fee1	[llvm-cov, llvm-symbolizer] Use llvm::erase_if (NFC)	2020-12-26 12:06:27 -08:00
Kazu Hirata	9c9bca45f0	[llvm-pdbutil] Use llvm::is_contained (NFC)	2020-12-26 12:06:24 -08:00
Kazu Hirata	e334c52add	[llvm-objcopy] Use llvm::erase_if (NFC)	2020-12-25 10:13:18 -08:00
Kazu Hirata	ea39991251	[llvm-nm, llvm-objdump] Use llvm::is_contained (NFC)	2020-12-25 09:22:37 -08:00
Georgii Rymar	893c84d71c	[obj2yaml] - Dump the content of a broken hash table properly. This is similar to D93760. When something is wrong with the hash table header we dump its context as a raw data. Currently we have the calculation overflow issue and it is possible to bypass the validation we have (and crash). The patch fixes it. Differential revision: https://reviews.llvm.org/D93799	2020-12-25 11:51:28 +03:00
Georgii Rymar	177779e8dd	[llvm-readelf/obj] - Improve the warning reported when unable to read the stack size. It was discussed in D92545 that we might want to improve messages reported when something is wrong with the stack size section. This patch does it. Differential revision: https://reviews.llvm.org/D93802	2020-12-25 11:40:35 +03:00
Georgii Rymar	438bc157a4	[libObject] - Add more ELF types to LLVM_ELF_IMPORT_TYPES_ELFT define (ELFTypes.h). This allows to get rid of lots for typedefs/usings from many places. Differential revision: https://reviews.llvm.org/D93801	2020-12-25 11:39:05 +03:00
Kazu Hirata	d6ff5cf995	[Target] Use llvm::any_of (NFC)	2020-12-24 19:43:26 -08:00
Georgii Rymar	b8cb1802a8	[obj2yaml] - Dump the content of a broken GNU hash table properly. When something is wrong with the GNU hash table header we dump its context as a raw data. Currently we have the calculation overflow issue and it is possible to bypass the validation we have (and crash). The patch fixes it. Differential revision: https://reviews.llvm.org/D93760	2020-12-24 11:16:31 +03:00
Georgii Rymar	bdef1f87ab	[llvm-readobj] - Dump the ELF file type better. Currently llvm-readelf might print "OS Specific/Processor Specific/<unknown>" hint when dumping the ELF file type. The patch teaches llvm-readobj to do the same. This fixes https://bugs.llvm.org/show_bug.cgi?id=40868 I am removing `Object/elf-unknown-type.test` test because it is not in the right place, it is outdated and very limited. The `readobj/ELF/file-types.test` checks the functionality much better. Differential revision: https://reviews.llvm.org/D93689	2020-12-23 11:13:19 +03:00
Arthur O'Dwyer	22cf54a7fb	Replace `T(x)` with `reinterpret_cast<T>(x)` everywhere it means reinterpret_cast. NFC. Differential Revision: https://reviews.llvm.org/D76572	2020-12-22 19:54:29 -05:00
Tom Stellard	4ad0cfd4de	llvm-profgen: Parse command line arguments after initializing targets I am experimenting with turning backends into loadable modules and in that scenario, target specific command line arguments won't be available until after the targets are initialized. Also, most other tools initialize targets before parsing arguments. Reviewed By: wlei Differential Revision: https://reviews.llvm.org/D93348	2020-12-21 15:13:10 -08:00
Georgii Rymar	8590b5ccd5	[libObject, llvm-readobj] - Reimplement `ELFFile<ELFT>::getEntry`. Currently, `ELFFile<ELFT>::getEntry` does not check an index of an entry. Because of that the code might read past the end of the symbol table silently. I've added a test to `llvm-readobj\ELF\relocations.test` to demonstrate the possible issue. Also, I've added a unit test for this method. After this change, `getEntry` stops reporting the section index and reuses the `getSectionContentsAsArray` method, which already has all the validation needed. Our related warnings now provide more and better context sometimes. Differential revision: https://reviews.llvm.org/D93209	2020-12-18 16:52:27 +03:00
Adhemerval Zanella	e04dc5f557	[llvm-readobj/elf] - AArch64: Handle AARCH64_VARIANT_PCS for GNUStyle It mimics the GNU readelf where it prints a [VARIANT_PCS] for symbols with st_other with STO_AARCH64_VARIANT_PCS. Reviewed By: grimar, MaskRay Differential Revision: https://reviews.llvm.org/D93044	2020-12-17 11:09:53 -03:00
dfukalov	9ed8e0caab	[NFC] Reduce include files dependency and AA header cleanup (part 2). Continuing work started in https://reviews.llvm.org/D92489: Removed a bunch of includes from "AliasAnalysis.h" and "LoopPassManager.h". Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D92852	2020-12-17 14:04:48 +03:00
Barry Revzin	92310454bf	Make LLVM build in C++20 mode Part of the <=> changes in C++20 make certain patterns of writing equality operators ambiguous with themselves (sorry!). This patch goes through and adjusts all the comparison operators such that they should work in both C++17 and C++20 modes. It also makes two other small C++20-specific changes (adding a constructor to a type that cases to be an aggregate, and adding casts from u8 literals which no longer have type const char*). There were four categories of errors that this review fixes. Here are canonical examples of them, ordered from most to least common: // 1) Missing const namespace missing_const { struct A { #ifndef FIXED bool operator==(A const&); #else bool operator==(A const&) const; #endif }; bool a = A{} == A{}; // error } // 2) Type mismatch on CRTP namespace crtp_mismatch { template <typename Derived> struct Base { #ifndef FIXED bool operator==(Derived const&) const; #else // in one case changed to taking Base const& friend bool operator==(Derived const&, Derived const&); #endif }; struct D : Base<D> { }; bool b = D{} == D{}; // error } // 3) iterator/const_iterator with only mixed comparison namespace iter_const_iter { template <bool Const> struct iterator { using const_iterator = iterator<true>; iterator(); template <bool B, std::enable_if_t<(Const && !B), int> = 0> iterator(iterator<B> const&); #ifndef FIXED bool operator==(const_iterator const&) const; #else friend bool operator==(iterator const&, iterator const&); #endif }; bool c = iterator<false>{} == iterator<false>{} // error \|\| iterator<false>{} == iterator<true>{} \|\| iterator<true>{} == iterator<false>{} \|\| iterator<true>{} == iterator<true>{}; } // 4) Same-type comparison but only have mixed-type operator namespace ambiguous_choice { enum Color { Red }; struct C { C(); C(Color); operator Color() const; bool operator==(Color) const; friend bool operator==(C, C); }; bool c = C{} == C{}; // error bool d = C{} == Red; } Differential revision: https://reviews.llvm.org/D78938	2020-12-17 10:44:10 +00:00
Fangrui Song	c70f36865e	Use basic_string::find(char) instead of basic_string::find(const char *s, size_type pos=0) Many (StringRef) cannot be detected by clang-tidy performance-faster-string-find.	2020-12-16 23:28:32 -08:00
Hongtao Yu	ac068e014b	[CSSPGO] Consume pseudo-probe-based AutoFDO profile This change enables pseudo-probe-based sample counts to be consumed by the sample profile loader under the regular `-fprofile-sample-use` switch with minimal adjustments to the existing sample file formats. After the counts are imported, a probe helper, aka, a `PseudoProbeManager` object, is automatically launched to verify the CFG checksum of every function in the current compilation against the corresponding checksum from the profile. Mismatched checksums will cause a function profile to be slipped. A `SampleProfileProber` pass is scheduled before any of the `SampleProfileLoader` instances so that the CFG checksums as well as probe mappings are available during the profile loading time. The `PseudoProbeManager` object is set up right after the profile reading is done. In the future a CFG-based fuzzy matching could be done in `PseudoProbeManager`. Samples will be applied only to pseudo probe instructions as well as probed callsites once the checksum verification goes through. Those instructions are processed in the same way that regular instructions would be processed in the line-number-based scenario. In other words, a function is processed in a regular way as if it was reduced to just containing pseudo probes (block probes and callsites). Adjustment to profile format A CFG checksum field is being added to the existing AutoFDO profile formats. So far only the text format and the extended binary format are supported. For the text format, a new line like ``` !CFGChecksum: 12345 ``` is added to the end of the body sample lines. For the extended binary profile format, we introduce a metadata section to store the checksum map from function names to their CFG checksums. Differential Revision: https://reviews.llvm.org/D92347	2020-12-16 15:57:18 -08:00
Georgii Rymar	8c2cf89834	[yaml2obj/obj2yaml] - Make Value/Size fields of Symbol optional. When a field is optional we can use the `=<none>` syntax in macros. This patch makes `Value`/`Size` fields of `Symbol` optional and adds test cases for them. Differential revision: https://reviews.llvm.org/D93010	2020-12-16 13:49:57 +03:00
Georgii Rymar	407d420029	[lib/Object] - Make ELFObjectFile::getSymbol() return Expected<>. This was requested in comments for D93209: https://reviews.llvm.org/D93209#inline-871192 D93209 fixes an issue with `ELFFile<ELFT>::getEntry`, after what `getSymbol` starts calling `report_fatal_error` for previously missed invalid cases. This patch makes it return `Expected<>` and updates callers. For few of them I had to add new `report_fatal_error` calls. But I see no way to avoid it currently. The change would affects too many places, e.g: `getSymbolBinding` and other methods are used from `ELFSymbolRef` which is used in too many places across LLVM. Differential revision: https://reviews.llvm.org/D93297	2020-12-16 13:14:23 +03:00
Georgii Rymar	78aea98308	[llvm-readelf/obj] - Handle out-of-order PT_LOADs better. This is https://bugs.llvm.org/show_bug.cgi?id=45698. Specification says that "Loadable segment entries in the program header table appear in ascending order, sorted on the p_vaddr member." Our `toMappedAddr()` relies on this condition. This patch adds a warning when the sorting order of loadable segments is wrong. In this case we force segments sorting and that allows `toMappedAddr()` to work as expected. Differential revision: https://reviews.llvm.org/D92641	2020-12-16 12:59:32 +03:00
Amy Huang	aa7ae25613	[llvm-symbolizer] Add missing include for config.h The cmake variable LLVM_ENABLE_DIA_SDK was being used here but was undefined because config.h wasn't included. Differential Revision: https://reviews.llvm.org/D93309	2020-12-15 09:20:31 -08:00
Georgii Rymar	83aea14ed6	[llvm-readelf] - Don't print OS/Processor specific prefix for known ELF file types. This is a change suggested in post commit comments for D93096 (https://reviews.llvm.org/D93096#2451796). Imagine we want to add a custom OS specific ELF file type. For that we can update the `ElfObjectFileType` array: ``` static const EnumEntry<unsigned> ElfObjectFileType[] = { ... {"Core", "CORE (Core file)", ELF::ET_CORE}, {"MyType", "MyType (my description)", 0xfe01}, }; ``` The current code then might print: ``` OS Specific: (MyType (my description)) ``` Though instead we probably would like to see a nicer output, e.g: ``` Type: MyType (my description) ``` To achieve that we can reorder the code slightly. It is impossible to add a test I think, because we have no custom values in the `ElfObjectFileType` array in LLVM. Differential revision: https://reviews.llvm.org/D93217	2020-12-15 10:56:25 +03:00
David Spickett	aabaca3363	[llvm-objdump] Use "--" for long options in --help text Single dash for these options is not recognised. Changes found by running this on the --help output and the user guide: grep -e ' -[a-zA-Z]\{2,\}' The user guide was updated in https://reviews.llvm.org/D92305 so no change there. Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D92310	2020-12-14 13:11:29 +00:00
Georgii Rymar	98a4289810	[llvm-readobj] - For SHT_REL relocations, don't display an addend. This is https://bugs.llvm.org/show_bug.cgi?id=44257. In LLVM style we always print `0` as addend when dumping SHT_REL relocations. It is confusing, this patch stops printing it as the first comment on the bug page suggests. Differential revision: https://reviews.llvm.org/D93033	2020-12-14 12:03:00 +03:00
Georgii Rymar	4e2e785ddd	[llvm-readelf] - Improve ELF type field dumping. This is related to https://bugs.llvm.org/show_bug.cgi?id=40868. Currently we don't print `OS Specific`/``Processor Specific`/`<unknown>` prefixes when dumping the ELF file type. This is not consistent with GNU readelf. The patch fixes it. Also, this patch removes the `types.test`, because we already have `file-types.test`, which tests more cases and this patch revealed that we have such a duplicate. Differential revision: https://reviews.llvm.org/D93096	2020-12-14 11:24:08 +03:00
Arthur Eubanks	655011c713	[opt][NPM] Pin -lower-amx-type to legacy PM This is part of the codegen pipeline.	2020-12-13 19:16:20 -08:00
Lang Hames	04795ab836	Re-apply `8904ee8ac7` with missing header included this time.	2020-12-14 13:39:33 +11:00
Nico Weber	5b112bcc0d	Revert "[JITLink] Add JITLinkDylib type, thread through JITLinkMemoryManager APIs." This reverts commit `8904ee8ac7`. Didn't `git add` llvm/ExecutionEngine/JITLink/JITLinkDylib.h and hence doesn't build anywhere.	2020-12-13 21:30:38 -05:00
Lang Hames	8904ee8ac7	[JITLink] Add JITLinkDylib type, thread through JITLinkMemoryManager APIs. JITLinkDylib represents a target dylib for a JITLink link. By representing this explicitly we can: - Enable JITLinkMemoryManagers to manage allocations on a per-dylib basis (e.g by maintaining a seperate allocation pool for each JITLinkDylib). - Enable new features and diagnostics that require information about the target dylib (not implemented in this patch).	2020-12-14 12:29:16 +11:00
Martin Storsjö	879c15e890	[llvm-rc] Handle driveless absolute windows paths when loading external files When llvm-rc loads an external file, it looks for it relative to a number of include directories and the current working directory. If the path is considered absolute, llvm-rc tries to open the filename as such, and doesn't try to open it relative to other paths. On Windows, a path name like "\dir\file" isn't considered absolute as it lacks the drive name, but by appending it on top of the search dirs, it's not found. LLVM's sys::path::append just appends such a path (same with a properly absolute posix path) after the paths it's supposed to be relative to. This fix doesn't handle the case if the resource script and the external file are on a different drive than the current working directory; to fix that, we'd have to make LLVM's sys::path::append handle appending fully absolute and partially absolute paths (ones lacking a drive prefix but containing a root directory), or switch to C++17's std::filesystem. Differential Revision: https://reviews.llvm.org/D92558	2020-12-10 14:11:06 +02:00
Alexey Lapshin	693da9df74	[dsymutil][DWARFLinker][NFC] Make interface of AddressMap more general. Current interface of AddressMap assumes that relocations exist. That is correct for not-linked object file but is not correct for linked executable. This patch changes interface in such way that AddressMap could be used not only with not-linked object files: hasValidRelocationAt() replaced with: hasLiveMemoryLocation() hasLiveAddressRange() Differential Revision: https://reviews.llvm.org/D87723	2020-12-10 14:57:08 +03:00
Sergey Dmitriev	025d4faadb	[llvm-link][NFC] Minor cleanup llvm::Linker::linkModules() is a static member, so there is no need to pass reference to llvm::Linker instance to loadArFile() function. Reviewed By: tra Differential Revision: https://reviews.llvm.org/D92918	2020-12-09 23:16:13 -08:00
Fangrui Song	7adcacda06	Rename -plugin-opt=no-new-pass-manager to -plugin-opt=legacy-pass-manager	2020-12-09 16:43:30 -08:00
Fangrui Song	68ff3b3376	[LLD][gold] Add -plugin-opt=no-new-pass-manager -DENABLE_EXPERIMENTAL_NEW_PASS_MANAGER=on configured LLD and LLVMgold.so will use the new pass manager by default. Add an option to use the legacy pass manager. This will also be used by the Clang driver when -fno-new-pass-manager (D92915) / -fno-experimental-new-pass-manager is set. Reviewed By: aeubanks, tejohnson Differential Revision: https://reviews.llvm.org/D92916	2020-12-09 13:31:03 -08:00
Sam Clegg	9a72d3e3e4	[WebAssembly] Add support for named data sections in wasm binaries Followup to https://reviews.llvm.org/D91769 which added support for names globals. Differential Revision: https://reviews.llvm.org/D92909	2020-12-09 12:57:07 -08:00
Arthur Eubanks	664b187160	Reland Pin -loop-reduce to legacy PM This was accidentally reverted by a later change. LSR currently only runs in the codegen pass manager. There are a couple issues with LSR and the NPM. 1) Lots of tests assume that LCSSA isn't run before LSR. This breaks a bunch of tests' expected output. This is fixable with some time put in. 2) LSR doesn't preserve LCSSA. See llvm/test/Analysis/MemorySSA/update-remove-deadblocks.ll. LSR's use of SCEVExpander is the only use of SCEVExpander where the PreserveLCSSA option is off. Turning it on causes some code sinking out of loops to fail due to SCEVExpander's inability to handle the newly created trivial PHI nodes in the broken critical edge (I was looking at llvm/test/Transforms/LoopStrengthReduce/X86/2011-11-29-postincphi.ll). I also tried simply just calling formLCSSA() at the end of LSR, but the extra PHI nodes cause regressions in codegen tests. We'll delay figuring these issues out until later. This causes the number of check-llvm failures with -enable-new-pm true by default to go from 60 to 29. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D92796	2020-12-09 09:57:57 -08:00
Georgii Rymar	bdfafc4613	[llvm-readelf/obj] - Improve diagnostics when printing NT_FILE notes. This changes the `printNotesHelper` to report warnings on its side when there are errors when dumping notes. With that we can provide more content when reporting warnings about broken notes. Differential revision: https://reviews.llvm.org/D92636	2020-12-09 12:31:46 +03:00
Georgii Rymar	abae3c1196	[obj2yaml] - Support dumping objects that have multiple SHT_SYMTAB_SHNDX sections. It is allowed to have multiple `SHT_SYMTAB_SHNDX` sections, though we currently don't implement it. The current implementation assumes that there is a maximum of one SHT_SYMTAB_SHNDX section and that it is always linked with .symtab section. This patch drops this limitations. Differential revision: https://reviews.llvm.org/D92644	2020-12-09 12:14:58 +03:00
Arthur Eubanks	f0e89e69d6	[gold][NPM] Use NPM with ENABLE_EXPERIMENTAL_NEW_PASS_MANAGER Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D92869	2020-12-08 15:13:34 -08:00
Anna Thomas	29356e3279	[ScalarizeMaskedMemIntrin] Add new PM support This patch adds new PM support for the pass and the pass can be now used during middle-end transforms. The old pass is remamed to ScalarizeMaskedMemIntrinLegacyPass. Reviewed-By: skatkov, aeubanks Differential Revision: https://reviews.llvm.org/D92743	2020-12-08 17:15:22 -05:00

... 2 3 4 5 6 ...

12656 Commits