llvm-project

Commit Graph

Author	SHA1	Message	Date
Simon Pilgrim	1c1335a10d	[X86][BMI1] Fix BLSI/BLSMSK/BLSR BMI1 scheduling on btver2 These have the same behaviour as tzcnt on btver2 - confirmed with AMD 16h SOG, Agner and instlatx64. llvm-svn: 342235	2018-09-14 13:31:14 +00:00
Richard Smith	3164fcfd27	Add flag to llvm-profdata to allow symbols in profile data to be remapped, and add a tool to generate symbol remapping files. Summary: The new tool llvm-cxxmap builds a symbol mapping table from a file containing a description of partial equivalences to apply to mangled names and files containing old and new symbol tables. Reviewers: davidxl Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D51470 llvm-svn: 342168	2018-09-13 20:22:02 +00:00
Vedant Kumar	2963c49087	[llvm-cov] Delete custom JSON serialization code (NFC) Teach llvm-cov to use the new llvm JSON library, and remove some redundant/brittle JSON serialization tests. llvm-svn: 342088	2018-09-12 21:59:38 +00:00
Julie Hockett	468722ee9f	[objcopy] make objcopy follow program header standards Submitted on behalf of Armando Montanez (amontanez@google.com). Objects with unused program headers copied by objcopy would always have nonzero values for program header offset and program header entry size. While technically valid, this atypical behavior triggers warnings in some tools. This change sets the two fields to zero when the program header is unused, better fitting the general expectations for unused program header data. Section headers behaved somewhat similarly (though only with the entry size), and are fixed in this revision as well. Differential Revision: https://reviews.llvm.org/D51961 llvm-svn: 342065	2018-09-12 17:56:31 +00:00
Nico Weber	f48e961d23	Make malformed-machos.test pass on my Mac. For some reason, llvm-objdump defaults to -arch=i386 on this system while the test checks x86_64 output. Explicitly pass -arch=x86_64. llvm-svn: 341944	2018-09-11 14:10:33 +00:00
Dean Michael Berris	d2c50408d4	[XRay] Add TSC to NewCPUId Records Summary: This more correctly reflects the data written by the FDR mode runtime. This is a continuation of the work in D50441. Reviewers: mboerger, eizan Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D51911 llvm-svn: 341905	2018-09-11 06:36:51 +00:00
Dean Michael Berris	dd01efc56d	[XRay] Add the `llvm-xray fdr-dump` implementation Summary: In this change, we implement a `BlockPrinter` which orders records in a Block that's been indexed by the `BlockIndexer`. This is used in the `llvm-xray fdr-dump` tool which ties together the various types and utilities we've been working on, to allow for inspection of XRay FDR mode traces both with and without verification. This change is the final step of the refactoring of D50441. Reviewers: mboerger, eizan Subscribers: mgorny, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D51846 llvm-svn: 341887	2018-09-11 00:22:53 +00:00
David Carlier	0efae196dd	[XRay] Fix buildbot failure llvm-svn: 341774	2018-09-10 05:29:49 +00:00
David Carlier	07cc5a8df9	[Xray] tooling allow MachO format support Getting writable xray __DATA sections from MachO as well. Reviewers: dberris Reviewed By: dberris Differential Revision: https://reviews.llvm.org/D51758 llvm-svn: 341772	2018-09-10 05:00:43 +00:00
Fangrui Song	91c95a35c1	[llvm-dwp] Clean up tests X86/*.test llvm-svn: 341688	2018-09-07 18:29:20 +00:00
Puyan Lotfi	99124cc082	[llvm-objcopy] Dwarf .debug section compression support (zlib, zlib-gnu). Third Attempt: - Alignment issues resolved. - zlib::isAvailable() detected. - ArrayRef misuse fixed. Usage: llvm-objcopy --compress-debug-sections=zlib foo.o llvm-objcopy --compress-debug-sections=zlib-gnu foo.o In both cases the debug section contents is compressed with zlib. In the GNU style case the header is the "ZLIB" magic string followed by the uint64 big-endian decompressed size. In the non-GNU mode the header is the Elf(32|64)_Chdr. Decompression support is coming soon. Differential Revision: https://reviews.llvm.org/D49678 llvm-svn: 341635	2018-09-07 08:10:22 +00:00
Jordan Rupprecht	470f745275	[llvm-strip] -p test fix for windows buildbots Windows ls prints dates as "1997-05-05" instead of "May 05 1997", so only check for a leading space. llvm-svn: 341614	2018-09-07 00:28:54 +00:00
Puyan Lotfi	5be060e341	Revert: [llvm-objcopy] Dwarf .debug section compression (Second Attempt). Various bots still fail for unknown reason. llvm-svn: 341613	2018-09-07 00:28:25 +00:00
Puyan Lotfi	f0954dd275	[llvm-objcopy] Dwarf .debug section compression support (zlib, zlib-gnu). Second Attempt. Alignment issues resolved. zlib::isAvailable() detected. Usage: llvm-objcopy --compress-debug-sections=zlib foo.o llvm-objcopy --compress-debug-sections=zlib-gnu foo.o In both cases the debug section contents is compressed with zlib. In the GNU style case the header is the "ZLIB" magic string followed by the uint64 big-endian decompressed size. In the non-GNU mode the header is the Elf(32|64)_Chdr. Decompression support is coming soon. Differential Revision: https://reviews.llvm.org/D49678 llvm-svn: 341607	2018-09-06 23:59:50 +00:00
Jordan Rupprecht	29f1ce7dcc	[llvm-strip] Fix -p test to check for explicit spaces around dates, to avoid when the filename happens to contain 1995/1997. llvm-svn: 341595	2018-09-06 22:34:48 +00:00
Fangrui Song	a373582169	Reland rL341509: "[llvm-dwp] Use buffer_stream if output file is not seekable (e.g. "-")" It caused ambiguity between llvm::cl::Optional and llvm::Optional, which has been fixed by dropping `using namespace cl;` in favor of explicit cl:: qualified names. llvm-svn: 341586	2018-09-06 20:26:54 +00:00
Martin Storsjo	1e8edd13ee	[llvm-ar] Support * as comment char in MRI scripts MRI scripts have two comment chars, * and ;, but only the latter was supported before. Also allow leading spaces before comment chars (and before any command string), and allow comments after a command. Differential Revision: https://reviews.llvm.org/D51338 llvm-svn: 341571	2018-09-06 18:10:45 +00:00
Max Kazantsev	eb410f79b3	Revert rL341509 to fix massive failures on buildbots llvm-svn: 341515	2018-09-06 04:40:49 +00:00
Fangrui Song	26f23f8c25	[llvm-dwp] Fix `UN:` lines (supposed to be `RUN:`) in X86/simple.test and adjust check lines for TYPES: Reviewers: dblaikie, aprantl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D51704 llvm-svn: 341510	2018-09-06 00:46:30 +00:00
Fangrui Song	57575e11d1	[llvm-dwp] Use buffer_stream if output file is not seekable (e.g. "-") Reviewers: dblaikie, pcc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D51707 llvm-svn: 341509	2018-09-06 00:06:25 +00:00
Jordan Rupprecht	591d889006	[llvm-strip] Support stripping multiple input files Summary: Allow strip to be called on multiple input files, which is interpreted as stripping N files in place. Using multiple input files is incompatible with -o. To allow this, create a `DriverConfig` struct which just wraps a list of `CopyConfigs`. objcopy will only ever have a single `CopyConfig`, but strip will have N (where N >= 1) CopyConfigs. Reviewers: alexshap, jakehehrlich Reviewed By: alexshap, jakehehrlich Subscribers: MaskRay, jakehehrlich, llvm-commits Differential Revision: https://reviews.llvm.org/D51660 llvm-svn: 341464	2018-09-05 13:10:03 +00:00
Jordan Rupprecht	ec277a8278	[llvm-strip] Allow copying relocation sections without symbol tables. Summary: Fixes the error "Link field value 0 in section .rela.plt is invalid" when copying/stripping certain binaries. Minimal repro: ``` $ cat /tmp/a.c int main() { return 0; } $ clang -static /tmp/a.c -o /tmp/a $ llvm-strip /tmp/a -o /tmp/b llvm-strip: error: Link field value 0 in section .rela.plt is invalid. ``` Reviewers: jakehehrlich, alexshap Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D51493 llvm-svn: 341419	2018-09-04 22:28:49 +00:00
Francis Visoiu Mistrih	2d3f01c5dc	[MachO] Fix inconsistency between error messages when validating LC_DYSYMTAB llvm-svn: 341379	2018-09-04 16:31:53 +00:00
Francis Visoiu Mistrih	7690af4da9	[MachO] Fix LC_DYSYMTAB validation for external symbols We were validating the same index (ilocalsym) twice, while iextdefsym was never validated. llvm-svn: 341378	2018-09-04 16:31:48 +00:00
Jonas Devlieghere	881452384a	[dwarfdump] Improve -diff option by hiding more data. The -diff option makes it easy to diff dwarf by hiding addresses and offsets. However not all of them were hidden, which should be fixed by this patch. Differential revision: https://reviews.llvm.org/D51593 llvm-svn: 341377	2018-09-04 16:21:37 +00:00
Chandler Carruth	163222f569	Revert r341342: Dwarf .debug section compression support (zlib, zlib-gnu). Also reverts follow-up commits r341343 and r341344. The primary commit continues to break some build bots even after the fixes in r341343 for UBSan issues: http://lab.llvm.org:8011/builders/clang-cmake-aarch64-full/builds/5823 It is also failing for me locally (linux, x86-64). llvm-svn: 341360	2018-09-04 11:55:57 +00:00
Puyan Lotfi	5a40cd5b50	[llvm-objcopy] Dwarf .debug section compression support (zlib, zlib-gnu). Usage: llvm-objcopy --compress-debug-sections=zlib foo.o llvm-objcopy --compress-debug-sections=zlib-gnu foo.o In both cases the debug section contents is compressed with zlib. In the GNU style case the header is the "ZLIB" magic string followed by the uint64 big-endian decompressed size. In the non-GNU mode the header is the Elf(32|64)_Chdr. Decompression support is coming soon. Differential Revision: https://reviews.llvm.org/D49678 llvm-svn: 341342	2018-09-03 22:25:56 +00:00
Andrea Di Biagio	fb3d9e1449	[X86] Remove wrong ReadAdvance from multiclass sse_fp_unop_s. A ReadAdvance was incorrectly added to the SchedReadWrite list associated with the following SSE instructions: sqrtss sqrtsd rsqrtss rcpss As a consequence, a wrong operand latency was computed for the register operand used as the base address of the folded load operand. This patch removes the wrong ReadAdvance, and updates the llvm-mca test cases. There is still a problem with correctly modeling partial register writes on XMM registers. This other problem is currently tracked here: https://bugs.llvm.org/show_bug.cgi?id=38813 Differential Revision: https://reviews.llvm.org/D51542 llvm-svn: 341326	2018-09-03 16:47:34 +00:00
Jonas Devlieghere	6e5c7e6037	[DebugInfo] Have the verifier accept missing linkage names. According to the standard, for the .debug_names (the "dwarf accelerator tables"): > If a subprogram or inlined subroutine is included, and has a > DW_AT_linkage_name attribute, there will be an additional index entry > for the linkage name. For Swift we generate DW_structure_types with a linkage name and the verifier was incorrectly rejecting this. This patch fixes that by only considering the linkage name in those particular cases. The test is the "reduced" debug info of the failing swift test on swift.org. Differential revision: https://reviews.llvm.org/D51420 llvm-svn: 341311	2018-09-03 12:12:17 +00:00
Alexandre Ganea	6a7efef4af	[DebugInfo] Common behavior for error types Following D50807, and heading towards D50664, this intermediary change does the following: 1. Upgrade all custom Error types in llvm/trunk/lib/DebugInfo/ to use the new StringError behavior (D50807). 2. Implement std::is_error_code_enum and make_error_code() for DebugInfo error enumerations. 3. Rename GenericError -> PDBError (the file will be renamed in a subsequent commit) 4. Update custom error messages to follow the same formatting: (\w\s*)+\. 5. Keep generic "file not found" (ENOENT) errors as they are in PDB code. Previously, there used to be a custom enumeration for that purpose. 6. Remove a few extraneous LF in log() implementations. Printing LF is a responsibility at a higher level, not at the error level. Differential Revision: https://reviews.llvm.org/D51499 llvm-svn: 341228	2018-08-31 17:41:58 +00:00
Andrea Di Biagio	a59ec4efa0	[X86][BtVer2] Remove wrong ReadAdvance from AVX vbroadcast(ss|sd|f128) instructions. The presence of a ReadAdvance for input operand #0 is problematic because it changes the input latency of the register used as the base address for the folded load. A broadcast cannot start executing if the load address hasn't been computed yet. In the llvm-mca example, the VBROADCASTSS is dependent on the address generated by the LEAQ. That means, it cannot start until LEAQ reaches the write-back stage. If we apply ReadAdvance, then we wrongly assume that the load can start 3 cycles in advance. Differential Revision: https://reviews.llvm.org/D51534 llvm-svn: 341222	2018-08-31 16:05:48 +00:00
Andrea Di Biagio	69da3f3df6	[X86] Add llvm-mca tests that show how operand latency is wrongly computed for SSE sqrtss/sd and rcpss. According to the timeline view, sqrtss/sd/rcpss start executing before the load address for the memory operand is available. This problem is caused by the presence of a ReadAfterLd (a ReadAdvance). Those unary operations should not specify a ReadAdvance at all. llvm-svn: 341213	2018-08-31 14:12:13 +00:00
Francis Visoiu Mistrih	8e864be70a	[llvm-objdump] Keep the memory buffer from the dSYM alive when using -g -dsym When using -g and -dsym, llvm-objdump opens the dsym file and keeps the MachOObjectFile alive, while the memory buffer that the MachOObjectFile was based on gets destroyed. Differential Revision: https://reviews.llvm.org/D51365 llvm-svn: 341209	2018-08-31 13:10:54 +00:00
Andrea Di Biagio	0e21ca1278	[X86][BtVer2] Add an llvm-mca test that shows how the read latency of AVX broadcastss on ymm registers is incorrectly set. llvm-svn: 341197	2018-08-31 10:39:33 +00:00
Andrea Di Biagio	b998eae2f2	[X86][BtVer2] Fix WriteFShuffle256 schedule write info. This patch fixes the number of micro opcodes, and processor resource cycles for the following AVX instructions: vinsertf128rr/rm vperm2f128rr/rm vbroadcastf128 Tests have been regenerated using the usual scripts in the llvm/utils directory. Differential Revision: https://reviews.llvm.org/D51492 llvm-svn: 341185	2018-08-31 08:30:47 +00:00
Matt Arsenault	0da6350dc8	AMDGPU: Remove remnants of old address space mapping llvm-svn: 341165	2018-08-31 05:49:54 +00:00
Adrian Prantl	bdffea12d0	dsymutil: Avoid pruning non-type forward declarations inside DW_TAG_module forward declarations. Especially with template instantiations, there are legitimate reasons why for declarations might be emitted into a DW_TAG_module skeleton / forward-declaration sub-tree, that are not forward declarations in the sense that there is a more complete definition over in a .pcm file. The example in the testcase is a constant DW_TAG_member of a DW_TAG_class template instantiation. rdar://problem/43623196 llvm-svn: 341123	2018-08-30 21:21:16 +00:00
Andrea Di Biagio	8b647dcf4b	[llvm-mca] Report the number of dispatched micro opcodes in the DispatchStatistics view. This patch introduces the following changes to the DispatchStatistics view: * DispatchStatistics now reports the number of dispatched opcodes instead of the number of dispatched instructions. * The "Dynamic Dispatch Stall Cycles" table now also reports the percentage of stall cycles against the total simulated cycles. This change allows users to easily compare dispatch group sizes with the processor DispatchWidth. Before this change, it was difficult to correlate the two numbers, since DispatchStatistics view reported numbers of instructions (instead of opcodes). DispatchWidth defines the maximum size of a dispatch group in terms of number of micro opcodes. The other change introduced by this patch is related to how DispatchStage generates "instruction dispatch" events. In particular: * There can be multiple dispatch events associated with the same instruction * Each dispatch event now encapsulates the number of dispatched micro opcodes. The number of micro opcodes declared by an instruction may exceed the processor DispatchWidth. Therefore, we cannot assume that instructions are always fully dispatched in a single cycle. DispatchStage knows already how to handle instructions declaring a number of opcodes bigger than DispatchWidth. However, DispatchStage always emitted a single instruction dispatch event (during the first simulated dispatch cycle) for instructions dispatched. With this patch, DispatchStage now correctly notifies multiple dispatch events for instructions that cannot be dispatched in a single cycle. A few views had to be modified. Views can no longer assume that there can only be one dispatch event per instruction. Tests (and docs) have been updated. Differential Revision: https://reviews.llvm.org/D51430 llvm-svn: 341055	2018-08-30 10:50:20 +00:00
Andrew V. Tischenko	62f7a3207b	[X86] Improved sched model for X86 CMPXCHG* instructions. Differential Revision: https://reviews.llvm.org/D50070 llvm-svn: 341024	2018-08-30 06:26:00 +00:00
Jordan Rupprecht	7481540fd9	[llvm-strip] Fix -p|--preserve-dates to not truncate output when used in-place. The restoreDateOnFile() method used to preserve dates uses sys::fs::openFileForWrite(). That method defaults to opening files with CD_CreateAlways, which truncates the output file if it exists. Use CD_OpenExisting instead to open it and not truncate it, which also has the side benefit of erroring if the file does not exist (it should always exist, because we just wrote it out). Also, fix the test case to make sure the output is a valid output file, and not empty. The extra test assertions are enough to catch this regression. llvm-svn: 340996	2018-08-29 23:21:56 +00:00
Andrea Di Biagio	a2eee47450	[llvm-mca] Add fields "Total uOps" and "uOps Per Cycle" to the report generated by the SummaryView. This patch adds two new fields to the perf report generated by the SummaryView. Fields are now logically organized into two small groups; only the second group contains throughput indicators. Example: ``` Iterations: 100 Instructions: 300 Total Cycles: 414 Total uOps: 700 Dispatch Width: 4 uOps Per Cycle: 1.69 IPC: 0.72 Block RThroughput: 4.0 ``` This patch also updates the docs for llvm-mca. Due to the nature of this change, several tests in the tools/llvm-mca directory were affected, and had to be updated using script `update_mca_test_checks.py`. llvm-svn: 340946	2018-08-29 17:56:39 +00:00
Andrea Di Biagio	5221e17fd6	[llvm-mca] Don't disable the SummaryView if flag `-all-stats` is false. llvm-svn: 340945	2018-08-29 17:40:04 +00:00
Andrea Di Biagio	d17d371c40	[llvm-mca][TimelineView] Force the same number of executions for every entry in the 'wait-times' table. This patch also uses colors to highlight problematic wait-time entries. A problematic entry is an entry with a high wait time that tends to match (or exceed) the size of the scheduler's buffer. Color RED is used if an instruction had to wait an average number of cycles which is bigger than (or equal to) the size of the underlying scheduler's buffer. Color YELLOW is used if the time (in cycles) spent waiting for the operands or pipeline resources is bigger than half the size of the underlying scheduler's buffer. Color MAGENTA is used if an instruction does not consume buffer resources according to the scheduling model. llvm-svn: 340825	2018-08-28 14:27:01 +00:00
Kit Barton	7c80f98b69	[PPC] Remove Darwin support from POWER backend. This patch issues an error message if Darwin ABI is attempted with the PPC backend. It also cleans up existing test cases, either converting the test to use an alternative triple or removing the test if the coverage is no longer needed. Updated Tests ------------- The majority of test cases were updated to use a different triple that does not include the Darwin ABI. Many tests were also updated to use FileCheck, in place of grep. Deleted Tests ------------- llvm/test/tools/dsymutil/PowerPC/sibling.test was originally added to test specific functionality of dsymutil using an object file created with an old version of llvm-gcc for a Powerbook G4. After a discussion with @JDevlieghere he suggested removing the test. llvm/test/CodeGen/PowerPC/combine_loads_from_build_pair.ll was converted from a PPC test to a SystemZ test, as the behavior is also reproducible there. All other tests that were deleted were specific to the darwin/ppc ABI and no longer necessary. Phabricator Review: https://reviews.llvm.org/D50988 llvm-svn: 340795	2018-08-28 01:18:29 +00:00
Andrea Di Biagio	b89b96c1b2	[llvm-mca] Improved report generated by the SchedulerStatistics view. Before this patch, the SchedulerStatistics only printed the maximum number of buffer entries consumed in each scheduler's queue at a given point of the simulation. This patch restructures the reported table, and adds an extra field named "Average number of used buffer entries" to it. This patch also uses different colors to help identifying bottlenecks caused by high scheduler's buffer pressure. llvm-svn: 340746	2018-08-27 14:52:52 +00:00
Nico Weber	e75fd1b184	fix comment typo llvm-svn: 340744	2018-08-27 14:25:22 +00:00
Joel Galenson	6cc0e63e2f	[cfi-verify] Support cross-DSO When used in cross-DSO mode, CFI will generate calls to special functions rather than trap instructions. For example, instead of generating if (!InlinedFastCheck(f)) abort(); call f CFI generates if (!InlinedFastCheck(f)) __cfi_slowpath(CallSiteTypeId, f); call f This patch teaches cfi-verify to recognize calls to __cfi_slowpath and abort and treat them as trap functions. In addition to normal symbols, we also parse the dynamic relocations to handle cross-DSO calls in libraries. We also extend cfi-verify to recognize other patterns that occur using cross-DSO. For example, some indirect calls are not guarded by a branch to a trap but instead follow a call to __cfi_slowpath. For example: if (!InlinedFastCheck(f)) call f else { __cfi_slowpath(CallSiteTypeId, f); call f } In this case, the second call to f is not marked as protected by the current code. We thus recognize if indirect calls directly follow a call to a function that will trap on CFI violations and treat them as protected. We also ignore indirect calls in the PLT, since on AArch64 each entry contains an indirect call that should not be protected by CFI, and these are labeled incorrectly when debug information is not present. Differential Revision: https://reviews.llvm.org/D49383 llvm-svn: 340612	2018-08-24 15:21:58 +00:00
Joel Galenson	134cf47dcb	[llvm-objdump] Label calls to the PLT. Differential Revision: https://reviews.llvm.org/D50204 llvm-svn: 340611	2018-08-24 15:21:57 +00:00
Richard Smith	c6ba9ca169	Make llvm-profdata show -text work as advertised in the documentation. Per LLVM's CommandGuide, llvm-profdata show -text is supposed to produce textual output that can be passed as input to further llvm-profdata invocations. This previously didn't work for two reasons: 1) -text was not sufficient to enable the machine-readable text format output; instead, -text was effectively ignored if -counts was not also specified. (With this patch, -counts is instead ignored if -text is specified, because the machine-readable text format always includes counts.) 2) When the input data was an IR-level profile, the :ir marker was missing from the output, resulting in a text format output that would not be usable as profiling data due to function hash mismatches. Differential Revision: https://reviews.llvm.org/D51188 llvm-svn: 340592	2018-08-24 01:34:45 +00:00
Fangrui Song	9ba5740ba5	[gold] -thinlto-object-suffix-replace: don't append new suffix if path does not end with old suffix Summary: This is to be consistent with lld behavior since rLLD340364. Reviewers: tejohnson Reviewed By: tejohnson Subscribers: steven_wu, eraman, mehdi_amini, inglorion, dexonsmith, llvm-commits Differential Revision: https://reviews.llvm.org/D51060 llvm-svn: 340380	2018-08-22 02:11:36 +00:00
Zachary Turner	030ad37ef4	[llvm-objdump] Add ability to demangle COFF symbols. llvm-svn: 340221	2018-08-20 22:18:21 +00:00
Jordan Rupprecht	be8ebccaed	[llvm-objcopy] Implement -G/--keep-global-symbol(s). Summary: Port GNU Objcopy -G/--keep-global-symbol(s). This is slightly different than the already-implemented --globalize-symbol, which marks a symbol as global when copying. When --keep-global-symbol (alias -G) is used, only those symbols marked will stay global, and all other globals are demoted to local. (Also note that it doesn't promote a symbol to global). Additionally, there is a pluralized version of the flag --keep-global-symbols, which effectively applies --keep-global-symbol for every non-comment in a file. Reviewers: jakehehrlich, jhenderson, alexshap Reviewed By: jhenderson Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D50589 llvm-svn: 340105	2018-08-17 22:34:48 +00:00
Jordan Rupprecht	bb179a197c	Fix windows buildbots by removing : from filenames llvm-svn: 340071	2018-08-17 19:18:20 +00:00
Jordan Rupprecht	cf67633e66	[llvm-objcopy] Add support for -I binary -B <arch>. Summary: The -I (--input-target) and -B (--binary-architecture) flags exist but are currently silently ignored. This adds support for -I binary for architectures i386, x86-64 (and alias i386:x86-64), arm, aarch64, sparc, and ppc (powerpc:common64). This is largely based on D41687. This is done by implementing an additional subclass of Reader, BinaryReader, which works by interpreting the input file as contents for .data field, sets up a synthetic header, and adds additional sections/symbols (e.g. _binary__tmp_data_txt_start). Reviewers: jakehehrlich, alexshap, jhenderson, javed.absar Reviewed By: jhenderson Subscribers: jyknight, nemanjai, kbarton, fedor.sergeev, jrtc27, kristof.beyls, paulsemel, llvm-commits Differential Revision: https://reviews.llvm.org/D50343 llvm-svn: 340070	2018-08-17 18:51:11 +00:00
Peter Collingbourne	3da2ffb826	Add missing test file from r339799. llvm-svn: 339927	2018-08-16 19:29:01 +00:00
Jordan Rupprecht	d1767dc56f	[llvm-strip] Add support for -p/--preserve-dates Summary: [llvm-strip] Preserve access/modification timestamps when -p is used. Reviewers: jakehehrlich, jhenderson, alexshap Reviewed By: jhenderson Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D50744 llvm-svn: 339921	2018-08-16 18:29:40 +00:00
George Rimar	d2f90ea337	[yaml2obj] - Allow to use numeric sh_link (Link) value for sections. That change allows using numeric values for Link field. It is consistent with the code for other fields in this method. llvm-svn: 339873	2018-08-16 12:44:17 +00:00
George Rimar	17257bb0b5	[yaml2elf] - Use check-next in test. It's a follow-up for rL339870. llvm-svn: 339872	2018-08-16 12:40:27 +00:00
George Rimar	7f2df7df45	[yaml2elf] - Simplify code, add a test. NFC. This simplifies the code allowing to set the sh_info for relocations sections. And adds a missing test. llvm-svn: 339870	2018-08-16 12:23:22 +00:00
Peter Collingbourne	62e4fc48a5	llvm-readobj: Fix addend in relocations for android packed format If a relocation group doesn't have the RELOCATION_GROUP_HAS_ADDEND_FLAG set, then this implies the group's addend equals zero. In this case android packed format won't encode an explicit addend delta, instead we need to set Addend, the "previous addend" variable, to zero by ourselves. Patch by Yi-Yo Chiang! Differential Revision: https://reviews.llvm.org/D50601 llvm-svn: 339799	2018-08-15 17:58:22 +00:00
George Rimar	942e8ed19d	[yaml2obj] - Teach yaml2obj to produce SHT_GROUP section with a custom Info field. This allows to set custom Info field value for SHT_GROUP sections. It is useful to allow this because we would be able to replace at least one binary object committed in LLD and replace it with the yaml2obj based test. Differential revision: https://reviews.llvm.org/D50776 llvm-svn: 339772	2018-08-15 13:55:22 +00:00
Andrea Di Biagio	a03f2a77f8	[llvm-mca] Fix PR38575: Avoid an invalid implicit truncation of a processor resource mask (an uint64_t value) to unsigned. This patch fixes a regression introduced at revision 338702. A processor resource mask was incorrectly implicitly truncated to an unsigned quantity. Later on, the truncated mask was used to initialize an element of a vector of processor resource descriptors. On targets with more than 32 processor resources, some elements of the vector are left uninitialized. As a consequence, this bug might have eventually caused a crash due to null dereference in the Scheduler. This patch fixes PR38575, and adds a test for it. llvm-svn: 339768	2018-08-15 12:53:38 +00:00
George Rimar	5290af8ad9	[yaml2obj] - Teach tool to produce SHT_GROUP section with a custom type. Currently, it is possible to use yaml2obj for producing SHT_GROUP sections of type GRP_COMDAT. For LLD test case I need to produce an object with a broken (different from GRP_COMDAT) type. The patch teaches tool to do such things. Differential revision: https://reviews.llvm.org/D50761 llvm-svn: 339764	2018-08-15 11:43:00 +00:00
Tom Stellard	69bf876b49	[gold] Fix Tests cases on i686 Reviewers: tejohnson Reviewed By: tejohnson Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D50583 llvm-svn: 339492	2018-08-11 01:08:34 +00:00
Jordan Rupprecht	88ed5e59bd	[llvm-objcopy] NFC: Add some color to error() llvm-svn: 339404	2018-08-09 22:52:03 +00:00
Paul Semel	7a3dc2c184	[llvm-objcopy] Add --prefix-symbols option Differential Revision: https://reviews.llvm.org/D50381 llvm-svn: 339362	2018-08-09 17:49:04 +00:00
Paul Semel	a42dec7a1b	[llvm-objcopy] Add --dump-section Differential Revision: https://reviews.llvm.org/D49979 llvm-svn: 339358	2018-08-09 17:05:21 +00:00
Andrew V. Tischenko	1fe3375620	[X86] MCA tests for XCHG, XADD and CMPXCHG* instructions Differential Revision: https://reviews.llvm.org/D49912 llvm-svn: 339145	2018-08-07 14:36:43 +00:00
George Rimar	65a6828b17	[yaml2obj] - Add a support for changing EntSize. I was trying to add a test case for LLD and found that it is impossible to set sh_entsize via yaml. The patch implements the missing part. Differential revision: https://reviews.llvm.org/D50235 llvm-svn: 339113	2018-08-07 08:11:38 +00:00
Stella Stamenova	cc2404c01d	[lit, python] Always add quotes around the python path in lit Summary: The issue with the python path is that the path to python on Windows can contain spaces. To make the tests always work, the path to python needs to be surrounded by quotes. This change updates several configuration files which specify the path to python as a substitution and also remove quotes from existing tests. Reviewers: asmith, zturner, alexshap, jakehehrlich Reviewed By: zturner, alexshap, jakehehrlich Subscribers: mehdi_amini, nemanjai, eraman, kbarton, jakehehrlich, steven_wu, dexonsmith, stella.stamenova, delcypher, llvm-commits Differential Revision: https://reviews.llvm.org/D50206 llvm-svn: 339073	2018-08-06 22:37:44 +00:00
Alexandre Ganea	741cc3531a	[llvm-pdbutil] Support PDBs without a DBI stream Differential Revision: https://reviews.llvm.org/D50258 llvm-svn: 339045	2018-08-06 19:35:00 +00:00
David Bolvansky	b7fcd10700	[NFC] Fixed inliner tests - 2 llvm-svn: 338973	2018-08-05 16:53:36 +00:00
David Bolvansky	2f1f3b10ad	[NFC] Fixed inliner tests llvm-svn: 338972	2018-08-05 16:30:46 +00:00
David Bolvansky	c0aa4b75a4	Enrich inline messages Summary: This patch improves Inliner to provide causes/reasons for negative inline decisions. 1. It adds one new message field to InlineCost to report causes for Always and Never instances. All Never and Always instantiations must provide a simple message. 2. Several functions that used to return the inlining results as boolean are changed to return InlineResult which carries the cause for negative decision. 3. Changed remark printing and debug output messages to provide the additional messages and related inline cost. 4. Adjusted tests for changed printing. Patch by: yrouban (Yevgeny Rouban) Reviewers: craig.topper, sammccall, sgraenitz, NutshellySima, shchenz, chandlerc, apilipenko, javed.absar, tejohnson, dblaikie, sanjoy, eraman, xbolva00 Reviewed By: tejohnson, xbolva00 Subscribers: xbolva00, llvm-commits, arsenm, mehdi_amini, eraman, haicheng, steven_wu, dexonsmith Differential Revision: https://reviews.llvm.org/D49412 llvm-svn: 338969	2018-08-05 14:53:08 +00:00
Dave Lee	3fb120f12e	objdump: Better handling of Mach-O universal binaries Summary: With Mach-O, there is a flag requirement discrepancy between working with universal binaries and thin binaries. Many flags that don't require the `-macho` flag (for example `-private-headers` and `-disassemble`) fail to work on universal binaries unless `-macho` is given. When this happens, the error message is unhelpful, stating: The file was not recognized as a valid object file. Which can lead to confusion. This change allows generic flags to be used on universal binaries with and without the `-macho` flag. This means flags that can be used for thin files can be used consistently with fat files too. To do this, the universal binary support within `ParseInputMachO()` is extracted into a new function. This new function is called directly from `DumpInput()` when the input binary is universal. Additionally the `-arch` flag validation in `ParseInputMachO()` was extracted to be reused. Reviewers: compnerd Reviewed By: compnerd Subscribers: keith, llvm-commits Differential Revision: https://reviews.llvm.org/D48702 llvm-svn: 338792	2018-08-03 00:06:38 +00:00
Ben Dunbobbin	d498dcdbbf	[llvm-ar] Fix help text test. NFC. Missed from @338703 llvm-svn: 338709	2018-08-02 12:27:01 +00:00
Simon Pilgrim	b911d6721d	[llvm-mca][x86] Add CMPXCHG instruction resource tests I've put CMPXCHG8B/CMPXCHG16B in the same file, even though technically they are under separate CPUID bits all targets seem to support both (or neither). llvm-svn: 338595	2018-08-01 17:25:11 +00:00
Simon Pilgrim	5c4fb14e07	[llvm-mca][x86] Add PREFETCHW instruction resource tests These aren't just available via 3DNow! so test for them separately as well. llvm-svn: 338584	2018-08-01 16:34:39 +00:00
Simon Pilgrim	dcfa732b2f	[llvm-mca][x86] Add PCLMUL instruction resource tests Renamed the btver2 file that already contained them - the other targets were only testing the AVX versions llvm-svn: 338583	2018-08-01 16:25:50 +00:00
Jordan Rupprecht	d67c1e129b	[llvm-objcopy] Add support for --rename-section flags from gnu objcopy Summary: Add support for --rename-section flags from gnu objcopy. Not all flags appear to have an effect for ELF objects, but allowing them would allow easier drop-in replacement. Other unrecognized flags are rejected. This was only tested by comparing flags printed by "readelf -e <.o>" against the output of gnu vs llvm objcopy, it hasn't been tested to be valid beyond that. Reviewers: jakehehrlich, alexshap Subscribers: llvm-commits, paulsemel, alexshap Differential Revision: https://reviews.llvm.org/D49870 llvm-svn: 338582	2018-08-01 16:23:22 +00:00
Andrea Di Biagio	7f3bf5c1f9	[llvm-mca] Correctly update the rank in `Scheduler::select()`. Found by inspection. llvm-svn: 338579	2018-08-01 16:06:33 +00:00
Simon Pilgrim	34ac6533f4	[llvm-mca][x86] Add SET/TEST instruction resource tests llvm-svn: 338576	2018-08-01 15:29:47 +00:00
Simon Pilgrim	e364e57ac9	[llvm-mca][x86] Add LEA instruction resource tests We already added these to btver2, now add them to other targets, even though none of their models treat them specially (yet). llvm-svn: 338565	2018-08-01 14:25:33 +00:00
Simon Pilgrim	6754913e95	[llvm-mca][x86] Add more x86-64 system instruction resource tests CPUID, IN/OUT, INS/OUTS, INT, PAUSE, SCAS, UD2, XLAT llvm-svn: 338563	2018-08-01 14:18:09 +00:00
Simon Pilgrim	5f41ab79c0	[llvm-mca][x86] Add CLFLUSHOPT instruction resource tests llvm-svn: 338550	2018-08-01 13:34:17 +00:00
Simon Pilgrim	bd014f4d91	[llvm-mca][x86] Add CMPS/LODS/MOVS/STOS string instruction resource tests llvm-svn: 338532	2018-08-01 13:14:45 +00:00
Simon Pilgrim	18d025a732	[llvm-mca][x86] Add STC + STD instruction resource tests llvm-svn: 338514	2018-08-01 11:00:11 +00:00
David Bolvansky	fbbb83c782	Revert "Enrich inline messages", tests fail llvm-svn: 338496	2018-08-01 08:02:40 +00:00
David Bolvansky	7f36cd9d96	Enrich inline messages Summary: This patch improves Inliner to provide causes/reasons for negative inline decisions. 1. It adds one new message field to InlineCost to report causes for Always and Never instances. All Never and Always instantiations must provide a simple message. 2. Several functions that used to return the inlining results as boolean are changed to return InlineResult which carries the cause for negative decision. 3. Changed remark printing and debug output messages to provide the additional messages and related inline cost. 4. Adjusted tests for changed printing. Patch by: yrouban (Yevgeny Rouban) Reviewers: craig.topper, sammccall, sgraenitz, NutshellySima, shchenz, chandlerc, apilipenko, javed.absar, tejohnson, dblaikie, sanjoy, eraman, xbolva00 Reviewed By: tejohnson, xbolva00 Subscribers: xbolva00, llvm-commits, arsenm, mehdi_amini, eraman, haicheng, steven_wu, dexonsmith Differential Revision: https://reviews.llvm.org/D49412 llvm-svn: 338494	2018-08-01 07:37:16 +00:00
Victor Leschuk	58d3399d8a	[DWARF] Support for .debug_addr (consumer) This patch implements basic support for parsing and dumping DWARFv5 .debug_addr section. llvm-svn: 338447	2018-07-31 22:19:19 +00:00
Fangrui Song	87b4b8f7b4	[llvm-objcopy] Make --strip-debug strip .gdb_index Summary: See binutils-gdb/bfd/elf.c, GNU objcopy also strips .stab* (STABS) .line* (DWARF 1) .gnu.linkonce.wi.* (linkonce section for .debug_info) but I'm not sure we need to be compatible with it. Reviewers: dblaikie, alexshap, jakehehrlich, jhenderson Reviewed By: alexshap, jakehehrlich Subscribers: aprantl, JDevlieghere, jakehehrlich, llvm-commits Differential Revision: https://reviews.llvm.org/D50100 llvm-svn: 338443	2018-07-31 21:26:35 +00:00
Simon Pilgrim	1f4b9cb6fe	[llvm-mca][x86] Add 32-bit instruction resource tests These aren't exhaustive, but cover some instructions that are only available in 32-bit mode (where would we be without good BCD math performance?). llvm-svn: 338404	2018-07-31 17:33:08 +00:00
David Bolvansky	ab79414f7b	Revert Enrich inline messages llvm-svn: 338389	2018-07-31 14:47:22 +00:00
David Bolvansky	b562dbabda	Enrich inline messages Summary: This patch improves Inliner to provide causes/reasons for negative inline decisions. 1. It adds one new message field to InlineCost to report causes for Always and Never instances. All Never and Always instantiations must provide a simple message. 2. Several functions that used to return the inlining results as boolean are changed to return InlineResult which carries the cause for negative decision. 3. Changed remark printing and debug output messages to provide the additional messages and related inline cost. 4. Adjusted tests for changed printing. Patch by: yrouban (Yevgeny Rouban) Reviewers: craig.topper, sammccall, sgraenitz, NutshellySima, shchenz, chandlerc, apilipenko, javed.absar, tejohnson, dblaikie, sanjoy, eraman, xbolva00 Reviewed By: tejohnson, xbolva00 Subscribers: xbolva00, llvm-commits, arsenm, mehdi_amini, eraman, haicheng, steven_wu, dexonsmith Differential Revision: https://reviews.llvm.org/D49412 llvm-svn: 338387	2018-07-31 14:25:24 +00:00
Andrea Di Biagio	a1852b6194	[llvm-mca][BtVer2] Teach how to identify dependency-breaking idioms. This patch teaches llvm-mca how to identify dependency breaking instructions on btver2. An example of dependency breaking instructions is the zero-idiom XOR (example: `XOR %eax, %eax`), which always generates zero regardless of the actual value of the input register operands. Dependency breaking instructions don't have to wait on their input register operands before executing. This is because the computation is not dependent on the inputs. Not all dependency breaking idioms are also zero-latency instructions. For example, `CMPEQ %xmm1, %xmm1` is independent of the value of XMM1, and it generates a vector of all-ones. That instruction is not eliminated at register renaming stage, and its opcode is issued to a pipeline for execution. So, the latency is not zero. This patch adds a new method named isDependencyBreaking() to the MCInstrAnalysis interface. That method takes as input an instruction (i.e. MCInst) and a MCSubtargetInfo. The default implementation of isDependencyBreaking() conservatively returns false for all instructions. Targets may override the default behavior for specific CPUs, and return a value which better matches the subtarget behavior. In future, we should teach Tablegen how to automatically generate the body of isDependencyBreaking from scheduling predicate definitions. This would allow us to expose the knowledge about dependency breaking instructions to the machine schedulers (and, potentially, other codegen passes). Differential Revision: https://reviews.llvm.org/D49310 llvm-svn: 338372	2018-07-31 13:21:43 +00:00
Jonas Devlieghere	ae1727e3dd	[dsymutil] Simplify temporary file handling. Dsymutil's update functionality was broken on Windows because we tried to rename a file while we're holding open handles to that file. TempFile provides a solution for this through its keep(Twine) method. This patch changes dsymutil to make use of that functionality. Differential revision: https://reviews.llvm.org/D49860 llvm-svn: 338216	2018-07-29 14:56:15 +00:00
Stephen Hines	e6e75bf84c	Handle the lack of a symbol table correctly. Summary: These two cases will trigger a dereference on a nullptr, since the SymbolTable can be nonexistent for a given library, in addition to just being empty. Reviewers: alexshap Reviewed By: alexshap Subscribers: meikeb, kongyi, chh, jakehehrlich, llvm-commits, pirama Differential Revision: https://reviews.llvm.org/D49534 llvm-svn: 338062	2018-07-26 20:05:31 +00:00
Jonas Devlieghere	640e790af2	[test] Disable dsymutil update test on windows Apparently, the issue with dsymutil update functionality on Windows was that Windows doesn't like dsymutil renaming files that have open handles to them. This disables the new accelerator test and updates the comment in the other two tests. We should be able to enable the tests again once we have updated the implementation to use TempFile::keep() to keep the temporary files in MachOUtils. A big thank you to Jeremy Morse from Sony for figuring this out and bringing it to my attention. llvm-svn: 338030	2018-07-26 14:16:19 +00:00
Jonas Devlieghere	f290256dfb	[test] Do dsymutil update in place Update the dSYM bundle in place when swapping out the accelerator tables. This should unbreak the windows bots that have been failing with an access denied error. llvm-svn: 338014	2018-07-26 09:23:10 +00:00
Jonas Devlieghere	743d351120	[dsymutil] Add support for generating DWARF5 accelerator tables. This patch adds support for emitting DWARF5 accelerator tables (.debug_names) from dsymutil. Just as with the Apple style accelerator tables, it's possible to update existing dSYMs. This patch includes a test that shows how you can convert back and forth between the two types. If no kind of table is specified, dsymutil will default to generating Apple-style accelerator tables whenever it finds those in its input. The same is true when there are no accelerator tables at all. Finally, in the remaining case, where there's at least one DWARF v5 table and no Apple ones, the output will contain DWARF accelerator tables (.debug_names). Differential revision: https://reviews.llvm.org/D49137 llvm-svn: 337980	2018-07-25 23:01:38 +00:00
Paul Semel	0913dcd747	[llvm-objdump] Add dynamic section printing to private-headers option Differential Revision: https://reviews.llvm.org/D49016 llvm-svn: 337902	2018-07-25 11:09:20 +00:00
Paul Semel	5ce8f1598c	[llvm-readobj] Generic hex-dump option Helpers are available to make this option file format independent. This patch adds the feature for Wasm file format. It doesn't change the behavior of the other file format handling. Differential Revision: https://reviews.llvm.org/D49545 llvm-svn: 337896	2018-07-25 10:04:37 +00:00
Wolfgang Pieb	439801ba1d	[DWARF v5] Refactor range lists dumping by using a more generic way of handling tables of lists. The intent is to use it for location list tables as well. Change is almost NFC with the exception of the spelling of some strings used during dumping (all lowercase now). Reviewer: JDevlieghere Differential Revision: https://reviews.llvm.org/D49500 llvm-svn: 337763	2018-07-23 22:37:17 +00:00
Paul Semel	1dbbfba888	[yaml2obj] Add default sh_entsize for dynamic sections Dynamic section holds a table, so the sh_entsize might be set. As the dynamic section entry size never changes, we can default it to the size of a dynamic entry. Differential Revision: https://reviews.llvm.org/D49619 llvm-svn: 337725	2018-07-23 18:49:04 +00:00
Roman Lebedev	52b85377eb	[NFC][MCA] ZnVer1: Update RegisterFile to identify false dependencies on partially written registers. Summary: Pretty mechanical follow-up for D49196. As microarchitecture.pdf notes, "20 AMD Ryzen pipeline", "20.8 Register renaming and out-of-order schedulers": The integer register file has 168 physical registers of 64 bits each. The floating point register file has 160 registers of 128 bits each. "20.14 Partial register access": The processor always keeps the different parts of an integer register together. ... An instruction that writes to part of a register will therefore have a false dependence on any previous write to the same register or any part of it. Reviewers: andreadb, courbet, RKSimon, craig.topper, GGanesh Reviewed By: GGanesh Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D49393 llvm-svn: 337676	2018-07-23 10:10:13 +00:00
Roman Lebedev	d57bd45acc	[NFC][MCA] ZnVer1: add partial-reg-update tests Reviewers: andreadb, courbet, RKSimon, craig.topper, GGanesh Reviewed By: GGanesh Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D49392 llvm-svn: 337675	2018-07-23 10:10:04 +00:00
Martin Storsjo	a6ffc9c8df	[COFF] Adjust how we flag weak externals This fixes PR36096. Originally based on a patch by Martell Malone. Differential Revision: https://reviews.llvm.org/D44357 llvm-svn: 337613	2018-07-20 20:48:29 +00:00
Jordan Rupprecht	db2036e1f5	[llvm-objcopy] Add basic support for --rename-section Summary: Add basic support for --rename-section=old=new to llvm-objcopy. A full replacement for GNU objcopy requires also modifying flags (i.e. --rename-section=old=new,flag1,flag2); I'd like to keep that in a separate change to keep this simple. Reviewers: jakehehrlich, alexshap Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D49576 llvm-svn: 337604	2018-07-20 19:54:24 +00:00
Simon Pilgrim	5e729dcc03	[llvm-mca][x86] Add movsx/movzx instructions to general x86_64 resource tests llvm-svn: 337586	2018-07-20 17:43:42 +00:00
Stella Stamenova	ca0547c83c	[llvm-objcopy, tests] Fix several llvm-objcopy tests Summary: In Python 3, sys.stdout.write expects a string rather than bytes. In order to be able to write the bytes to stdout, we need to use the buffer directly instead. This change is borrowing the implementation for writing to stdout that cat.py uses. Note that we cannot use cat.py directly because the file we are trying to open is a gzip file. Reviewers: asmith, bkramer, alexshap, jakehehrlich Reviewed By: alexshap, jakehehrlich Subscribers: jakehehrlich, llvm-commits Differential Revision: https://reviews.llvm.org/D49515 llvm-svn: 337567	2018-07-20 16:19:36 +00:00
Andrea Di Biagio	b6022aa8d9	[X86][BtVer2] correctly model the latency/throughput of LEA instructions. This patch fixes the latency/throughput of LEA instructions in the BtVer2 scheduling model. On Jaguar, a 3-operands LEA has a latency of 2cy, and a reciprocal throughput of 1. That is because it uses one cycle of SAGU followed by 1cy of ALU1. An LEA with a "Scale" operand is also slow, and it has the same latency profile as the 3-operands LEA. An LEA16r has a latency of 3cy, and a throughput of 0.5 (i.e. RThroughput of 2.0). This patch adds a new TIIPredicate named IsThreeOperandsLEAFn to X86Schedule.td. The tablegen backend (for instruction-info) expands that definition into this (file X86GenInstrInfo.inc): ``` static bool isThreeOperandsLEA(const MachineInstr &MI) { return ( ( MI.getOpcode() == X86::LEA32r || MI.getOpcode() == X86::LEA64r || MI.getOpcode() == X86::LEA64_32r || MI.getOpcode() == X86::LEA16r ) && MI.getOperand(1).isReg() && MI.getOperand(1).getReg() != 0 && MI.getOperand(3).isReg() && MI.getOperand(3).getReg() != 0 && ( ( MI.getOperand(4).isImm() && MI.getOperand(4).getImm() != 0 ) || (MI.getOperand(4).isGlobal()) ) ); } ``` A similar method is generated in the X86_MC namespace, and included into X86MCTargetDesc.cpp (the declaration lives in X86MCTargetDesc.h). Back to the BtVer2 scheduling model: A new scheduling predicate named JSlowLEAPredicate now checks if either the instruction is a three-operands LEA, or it is an LEA with a Scale value different than 1. A variant scheduling class uses that new predicate to correctly select the appropriate latency profile. Differential Revision: https://reviews.llvm.org/D49436 llvm-svn: 337469	2018-07-19 16:42:15 +00:00
George Rimar	a2b553b4c9	[llvm-readobj] - Do not report invalid amount of sections. When output style is GNU and amount of sections is >= SHN_LORESERVE, llvm-readobj reports zero number of sections instead of actual value. The patch fixes that. Differential revision: https://reviews.llvm.org/D49544 llvm-svn: 337462	2018-07-19 14:52:57 +00:00
Paul Semel	6e13790801	[llvm-readobj] Generic -string-dump option Differential Revision: https://reviews.llvm.org/D49470 llvm-svn: 337408	2018-07-18 18:00:41 +00:00
Paul Semel	007dedbf77	[llvm-objdump] Add -demangle (-C) option Differential Revision: https://reviews.llvm.org/D49043 llvm-svn: 337401	2018-07-18 16:39:21 +00:00
Benjamin Kramer	99ea38f42d	[llvm-objcopy] %python wants to be in quotes, because it might contain spaces llvm-svn: 337399	2018-07-18 16:17:53 +00:00
George Rimar	e35e6448f9	[llvm-objdump] - Stop reporting bogus section IDs. Imagine we have a file with few sections, and one of them is .foo with index N != 0. Problem is that when llvm-objdump is given a -section=.foo parameter it lists .foo as a section at index 0. That makes it impossible to write test cases which need to find the index of the particular section, while ignoring dumping of others. The patch fixes that. Differential revision: https://reviews.llvm.org/D49372 llvm-svn: 337361	2018-07-18 08:34:35 +00:00
George Rimar	6fdac3b23a	[llvm-readobj] - Teach tool to dump objects with >= SHN_LORESERVE of sections. http://www.sco.com/developers/gabi/2003-12-17/ch4.eheader.html says that e_shnum and/or e_shstrndx may have special values if "the number of sections is greater than or equal to SHN_LORESERVE" or "the section name string table section index is greater than or equal to SHN_LORESERVE (0xff00)" Previously llvm-readobj was unable to dump such files, patch changes that. I had to add a precompiled test case because it does not seem possible to prepare a test using yaml2obj or llvm-mc (not clear how to make .shstrtab to have index >= SHN_LORESERVE). Differential revision: https://reviews.llvm.org/D49369 llvm-svn: 337360	2018-07-18 08:19:58 +00:00
Simon Pilgrim	03164dfa5e	[llvm-mca][x86] Add extend, carry-flag and CMP instructions to general x86_64 resource tests llvm-svn: 337306	2018-07-17 17:47:35 +00:00
Simon Pilgrim	92da01fed9	[llvm-mca][x86] Add MOVBE resource tests to all supporting targets SNB doesn't support MOVBE but the numbers in Generic (which use the SNB model) look sane. llvm-svn: 337305	2018-07-17 17:41:45 +00:00
Simon Pilgrim	94049e8b15	[llvm-mca][x86] Add BSWAP resource tests llvm-svn: 337302	2018-07-17 17:10:47 +00:00
Simon Pilgrim	99a4f3195b	[llvm-mca][x86] Add displacement-only and additional scale=1 LEA tests llvm-svn: 337298	2018-07-17 16:17:33 +00:00
Simon Pilgrim	17d89ca70e	[llvm-mca][x86] Add LEA resource tests (PR32326) Add llvm-mca tests demonstrating how LEA instructions are currently modelled. Once this is working on btver2 I'll copy the test file to the other target directories. llvm-svn: 337297	2018-07-17 16:13:29 +00:00
Benjamin Kramer	e6dac13bab	[llvm-objcopy] Run not with any python, but the python configured in lit. llvm-svn: 337262	2018-07-17 10:30:56 +00:00
Jake Ehrlich	c7f8ac7896	[llvm-objcopy] Add support for large indexes This patch is an update of an older patch that never landed (see here: https://reviews.llvm.org/D42516) Recently various users have run into this issue and it just 100% has to be solved at this point. The main difference in this patch is that I use gunzip instead of unzip which should hopefully allow tests to pass. Please review this as if it is a new patch however. I found some issues along the way and made some minor modifications. The binary used in this patch for testing (a zip file to make it small) can be found here: https://drive.google.com/file/d/1UjsnTO9edLttZibbr-2T1bJl92KEQFAO/view?usp=sharing Differential Revision: https://reviews.llvm.org/D49206 llvm-svn: 337204	2018-07-16 19:48:52 +00:00
Joel Galenson	4099b249fb	[cfi-verify] Abort on unsupported targets As suggested in the review for r337007, this makes cfi-verify abort on unsupported targets instead of producing incorrect results. It also updates the design document to reflect this. Differential Revision: https://reviews.llvm.org/D49304 llvm-svn: 337181	2018-07-16 15:26:44 +00:00
Andrea Di Biagio	f84b0a6914	[llvm-mca] Regenerate X86 specific tests. NFC Not all tests were correctly updated by the update script after r336797. llvm-svn: 337124	2018-07-15 11:43:11 +00:00
Andrea Di Biagio	ff630c2cdc	[llvm-mca][BtVer2] teach how to identify false dependencies on partially written registers. The goal of this patch is to improve the throughput analysis in llvm-mca for the case where instructions perform partial register writes. On x86, partial register writes are quite difficult to model, mainly because different processors tend to implement different register merging schemes in hardware. When the code contains partial register writes, the IPC (instructions per cycles) estimated by llvm-mca tends to diverge quite significantly from the observed IPC (using perf). Modern AMD processors (at least, from Bulldozer onwards) don't rename partial registers. Quoting Agner Fog's microarchitecture.pdf: " The processor always keeps the different parts of an integer register together. For example, AL and AH are not treated as independent by the out-of-order execution mechanism. An instruction that writes to part of a register will therefore have a false dependence on any previous write to the same register or any part of it." This patch is a first important step towards improving the analysis of partial register updates. It changes the semantic of RegisterFile descriptors in tablegen, and teaches llvm-mca how to identify false dependences in the presence of partial register writes (for more details: see the new code comments in include/Target/TargetSchedule.h - class RegisterFile). This patch doesn't address the case where a write to a part of a register is followed by a read from the whole register. On Intel chips, high8 registers (AH/BH/CH/DH)) can be stored in separate physical registers. However, a later (dirty) read of the full register (example: AX/EAX) triggers a merge uOp, which adds extra latency (and potentially affects the pipe usage). This is a very interesting article on the subject with a very informative answer from Peter Cordes: https://stackoverflow.com/questions/45660139/how-exactly-do-partial-registers-on-haswell-skylake-perform-writing-al-seems-to In future, the definition of RegisterFile can be extended with extra information that may be used to identify delays caused by merge opcodes triggered by a dirty read of a partial write. Differential Revision: https://reviews.llvm.org/D49196 llvm-svn: 337123	2018-07-15 11:01:38 +00:00
Nico Weber	337e241d58	Attempt to get test/tools/llvm-lib/help.test passing on sanitizer-x86_64-linux-fast The bot has a /b directory, so /? matches against that and gets expanded to it. (Thanks to Hans's r187366, which solved the same problem for clang-cl a while ago and which saved me much head scratching.) llvm-svn: 337092	2018-07-14 11:33:33 +00:00
Nico Weber	17172c6b80	Give llvm-lib rudimentary help output. https://reviews.llvm.org/D49318 llvm-svn: 337084	2018-07-14 02:29:44 +00:00
Jonas Devlieghere	327e7a1608	[dwarfdump] Add pretty printer for accelerator table based on Atom. For instance, When dumping .apple_types, the second atom represents the DW_TAG. In addition to printing the raw value, we now also pretty print the value if the ATOM tells us how. llvm-svn: 337026	2018-07-13 17:21:51 +00:00
Andrea Di Biagio	e86e6efea1	[llvm-mca][BtVer2] Add tests for dependency breaking instructions. llvm-svn: 337024	2018-07-13 16:46:51 +00:00
Joel Galenson	667eac80da	[cfi-verify] Only run AArch64 tests when it is a supported target This stops the tests I added in r337007 from running when AArch64 is not a supported target. llvm-svn: 337012	2018-07-13 16:09:19 +00:00
Joel Galenson	06e7e5798f	[cfi-verify] Support AArch64. This patch adds support for AArch64 to cfi-verify. This required three changes to cfi-verify. First, it generalizes checking if an instruction is a trap by adding a new isTrap flag to TableGen (and defining it for x86 and AArch64). Second, the code that ensures that the operand register is not clobbered between the CFI check and the indirect call needs to allow a single dereference (in x86 this happens as part of the jump instruction). Third, we needed to ensure that return instructions are not counted as indirect branches. Technically, returns are indirect branches and can be covered by CFI, but LLVM's forward-edge CFI does not protect them, and x86 does not consider them, so we keep that behavior. In addition, we had to improve AArch64's code to evaluate the branch target of a MCInst to handle calls where the destination is not the first operand (which it often is not). Differential Revision: https://reviews.llvm.org/D48836 llvm-svn: 337007	2018-07-13 15:19:33 +00:00
Dean Michael Berris	10141261e1	[XRay][compiler-rt] Add PID field to llvm-xray tool and add PID metadata record entry in FDR mode Summary: llvm-xray changes: - account-mode - process-id {...} shows after thread-id - convert-mode - process {...} shows after thread - parses FDR and basic mode pid entries - Checks version number for FDR log parsing. Basic logging changes: - Update header version from 2 -> 3 FDR logging changes: - Update header version from 2 -> 3 - in writeBufferPreamble, there is an additional PID Metadata record (after thread id record and tsc record) Test cases changes: - fdr-mode.cc, fdr-single-thread.cc, fdr-thread-order.cc modified to catch process id output in the log. Reviewers: dberris Reviewed By: dberris Subscribers: hiraditya, llvm-commits, #sanitizers Differential Revision: https://reviews.llvm.org/D49153 llvm-svn: 336974	2018-07-13 05:38:22 +00:00
Bill Wendling	7bd9e94e38	[gold-plugin] Disable section ordering for relocatable links Not all programs want section ordering when compiled with LTO. In particular, the Linux kernel is very sensitive when it comes to linking, and doesn't boot when each function is placed in its own sections. Reviewed By: pcc Differential Revision: https://reviews.llvm.org/D48756 llvm-svn: 336943	2018-07-12 20:35:58 +00:00
Stephen Hines	e8c3c5fe5d	Add --strip-all option back to llvm-strip. Summary: This option appears to have been dropped as part of the refactoring in r331663. Unfortunately, if we want to use llvm-strip as a drop-in replacement for strip, this option should still be available. Reviewers: alexshap Reviewed By: alexshap Subscribers: meikeb, kongyi, chh, jakehehrlich, llvm-commits, pirama Differential Revision: https://reviews.llvm.org/D49226 llvm-svn: 336921	2018-07-12 17:42:17 +00:00
Paul Semel	0f9ca2d960	Revert "[llvm-objdump] Add -demangle (-C) option" This reverts commit 3a44ccd156e0edd2e89226f8ed63928e227900bb. This reverts commit d5cfc836bb5552e20507d3612d13ff66ff9e36a0. llvm-svn: 336829	2018-07-11 18:09:52 +00:00
Paul Semel	569200aab8	Fix llvm-objdump demangle test (added triple option) llvm-svn: 336821	2018-07-11 16:31:33 +00:00
Andrea Di Biagio	483db141e3	[X86] Fix MayLoad/HasSideEffect flag for (V)MOVLPSrm instructions. Before revision 336728, the "mayLoad" flag for instruction (V)MOVLPSrm was inferred directly from the "default" pattern associated with the instruction definition. r336728 removed special node X86Movlps, and all the patterns associated to it. Now instruction (V)MOVLPSrm doesn't have a pattern associated to it, and the 'mayLoad/hasSideEffects' flags are left unset. When the instruction info is emitted by tablegen, method CodeGenDAGPatterns::InferInstructionFlags() sees that (V)MOVLPSrm doesn't have a pattern, and flags are undefined. So, it conservatively sets the "hasSideEffects" flag for it. As a consequence, we were losing the 'mayLoad' flag, and we were gaining a 'hasSideEffect' flag in its place. This patch fixes the issue (originally reported by Michael Holmen). The mca tests show the differences in the instruction info flags. Instructions that were affected by this problem were: MOVLPSrm/VMOVLPSrm/VMOVLPSZ128rm. Differential Revision: https://reviews.llvm.org/D49182 llvm-svn: 336818	2018-07-11 15:27:50 +00:00
Paul Semel	bcf55ab95a	[llvm-objdump] Add -demangle (-C) option Differential Revision: https://reviews.llvm.org/D49043 llvm-svn: 336816	2018-07-11 15:25:39 +00:00
Andrea Di Biagio	d2e2c053cf	[llvm-mca] Use a different character to flag instructions with side-effects in the Instruction Info View. NFC This makes it easier to identify changes in the instruction info flags. It also helps spotting potential regressions similar to the one recently introduced at r336728. Using the same character to mark MayLoad/MayStore/HasSideEffects is problematic for llvm-lit. When pattern matching substrings, llvm-lit consumes tabs and spaces. A change in position of the flag marker may not trigger a test failure. This patch only changes the character used for flag `hasSideEffects`. The reason why I didn't touch other flags is because I want to avoid spamming the mailing list because of the massive diff due to the numerous tests affected by this change. In future, each instruction flag should be associated with a different character in the Instruction Info View. llvm-svn: 336797	2018-07-11 12:44:44 +00:00
Paul Semel	b98f504850	[llvm-readobj] Add -hex-dump (-x) option Differential Revision: https://reviews.llvm.org/D48281 llvm-svn: 336782	2018-07-11 10:00:29 +00:00
Andrea Di Biagio	2b3a4f9c9b	[llvm-mca] Add tests for partial register writes. llvm-mca doesn't know that on modern AMD processors, portions of a general purpose register are not treated independently. So, a partial register write has a false dependency on the super-register. The issue with partial register writes will be addressed by a follow-up patch. llvm-svn: 336778	2018-07-11 09:50:00 +00:00
Jonas Devlieghere	82dee6aca8	[dsymutil] Add support for outputting assembly When implementing the DWARF accelerator tables in dsymutil I ran into an assertion in the assembler. Debugging these kind of issues is a lot easier when looking at the assembly instead of debugging the assembler itself. Since it's only a matter of creating an AsmStreamer instead of a MCObjectStreamer it made sense to turn this into a (hidden) dsymutil feature. Differential revision: https://reviews.llvm.org/D49079 llvm-svn: 336561	2018-07-09 16:58:48 +00:00
Andrea Di Biagio	8834779644	[llvm-mca] report an error if the assembly sequence contains an unsupported instruction. This is a short-term fix for PR38093. For now, we llvm::report_fatal_error if the instruction builder finds an unsupported instruction in the instruction stream. We need to revisit this fix once we start addressing PR38101. Essentially, we need a better framework for error handling. llvm-svn: 336543	2018-07-09 12:30:55 +00:00
Roman Lebedev	0e58dee284	[MCA][X86][NFC] Add BSF/BSR resource tests Reviewers: RKSimon, andreadb, courbet Reviewed By: RKSimon Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D48997 llvm-svn: 336510	2018-07-08 09:50:14 +00:00
Alexander Shaposhnikov	42b5ef0269	[llvm-objcopy] Add support for static libraries This diff adds support for handling static libraries to llvm-objcopy and llvm-strip. Test plan: make check-all Differential revision: https://reviews.llvm.org/D48413 llvm-svn: 336455	2018-07-06 17:51:03 +00:00
Andrea Di Biagio	61c52af9d9	[llvm-mca] improve the instruction issue logic implemented by the Scheduler. This patch modifies the Scheduler heuristic used to select the next instruction to issue to the pipelines. The motivating example is test X86/BtVer2/add-sequence.s, for which llvm-mca wrongly reported an estimated IPC of 1.50. According to perf, the actual IPC for that test should have been ~2.00. It turns out that an IPC of 2.00 for test add-sequence.s cannot possibly be predicted by a Scheduler that only prioritizes instructions based on their "age". A similar issue also affected test X86/BtVer2/dependent-pmuld-paddd.s, for which llvm-mca wrongly estimated an IPC of 0.84 instead of an IPC of 1.00. Instructions in the ReadyQueue are now ranked based on two factors: - The "age" of an instruction. - The number of unique users of writes associated with an instruction. The new logic still prioritizes older instructions over younger instructions to minimize the pressure on the reorder buffer. However, the number of users of an instruction now also affects the overall rank. This potentially increases the ability of the Scheduler to extract instruction level parallelism. This patch fixes the problem with the wrong IPC reported for test add-sequence.s and test dependent-pmuld-paddd.s. llvm-svn: 336420	2018-07-06 08:08:30 +00:00
Dave Lee	390abe4a75	Reapply: "objdump: Support newer ObjC image info flags" Summary: Add support for two additional ObjC image info flags: `IS_SIMULATED` and `HAS_CATEGORY_CLASS_PROPERTIES`. `IS_SIMULATED` indicates a Mach-O binary built for iOS simulator. `HAS_CATEGORY_CLASS_PROPERTIES` indicates a Mach-O binary built by a compiler that supports class properties in categories. Reviewers: enderby, compnerd Reviewed By: compnerd Subscribers: keith, llvm-commits Differential Revision: https://reviews.llvm.org/D48568 llvm-svn: 336411	2018-07-06 05:11:35 +00:00
Dave Lee	e6de96410b	Revert "objdump: Support newer ObjC image info flags" This reverts commit 8c4cc472e7a67bd3b2b20cc4cf32d31af29bc7e9. llvm-svn: 336402	2018-07-06 00:13:21 +00:00
Dave Lee	9e412ec8f2	objdump: Support newer ObjC image info flags Summary: Add support for two additional ObjC image info flags: `IS_SIMULATED` and `HAS_CATEGORY_CLASS_PROPERTIES`. `IS_SIMULATED` indicates a Mach-O binary built for iOS simulator. `HAS_CATEGORY_CLASS_PROPERTIES` indicates a Mach-O binary built by a compiler that supports class properties in categories. Reviewers: enderby, compnerd Reviewed By: compnerd Subscribers: keith, llvm-commits Differential Revision: https://reviews.llvm.org/D48568 llvm-svn: 336399	2018-07-05 23:32:15 +00:00
Paul Semel	91c9d4251c	[llvm-objdump] Removed archive-headers-disas test This test is failing because of the disas part. For the moment, I will just remove it. I will add it again tomorrow with a proper fix. llvm-svn: 336370	2018-07-05 16:49:46 +00:00
Paul Semel	63e4008718	[llvm-objcopy] Fix timezone dependent tests llvm-svn: 336363	2018-07-05 15:24:11 +00:00
Paul Semel	0dc92f6a74	[llvm-objdump] Add --archive-headers (-a) option llvm-svn: 336357	2018-07-05 14:43:29 +00:00
Roman Lebedev	0dd27042c6	[X86][BtVer2][MCA][NFC] Add CMPEQ dependency-breaking one-idioms tests Summary: As per `Agner's Microarchitecture doc (21.8 AMD Bobcat and Jaguar pipeline - Dependency-breaking instructions)`, these, like zero-idioms, are dependency-breaking, although they produce ones and still consume resources. FIXME: as discussed in D48877, llvm-mca handling is broken for these. Reviewers: andreadb Reviewed By: andreadb Subscribers: gbedwell, RKSimon, llvm-commits Differential Revision: https://reviews.llvm.org/D48876 llvm-svn: 336292	2018-07-04 17:32:44 +00:00
Paul Semel	d2af4d6f1b	[llvm-objdump] Add --file-headers (-f) option llvm-svn: 336284	2018-07-04 15:25:03 +00:00
Gabor Buella	da4a966e1c	NFC - Various typo fixes in tests llvm-svn: 336268	2018-07-04 13:28:39 +00:00
Teresa Johnson	50615c72b4	Remove absolute path in test My test change in r336148 accidentally included an absolute path, clean that up to fix bot failures. llvm-svn: 336151	2018-07-02 23:02:07 +00:00
Teresa Johnson	8fc766681d	[ThinLTO] Fix printing of module paths for distributed backend indexes Summary: In the individual index files emitted for distributed ThinLTO backends, the module path ids are not contiguous. Assign slots to module paths in order to handle this better and also to get contiguous numbering in the summary assembly. Reviewers: davidxl, dexonsmith Subscribers: mehdi_amini, inglorion, eraman, llvm-commits, steven_wu Differential Revision: https://reviews.llvm.org/D48698 llvm-svn: 336148	2018-07-02 22:09:23 +00:00
Fangrui Song	f50ad6c311	Replace unused output filenames with /dev/null in tests Similar to rLLD336129 llvm-svn: 336131	2018-07-02 18:16:44 +00:00
Dave Lee	d4f77a523b	nm: Add -no-weak flag for hiding weak symbols Summary: This adds a new -no-weak flag to nm to hide weak symbols in its output. This also adds a -W alias for this which is analogous to -U. Patch by Keith Smiley Reviewers: kastiglione, enderby, compnerd Reviewed By: kastiglione Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D48751 llvm-svn: 336126	2018-07-02 17:24:37 +00:00
Paul Semel	8dabda70af	Revert "[llvm-readobj] Fix printing format" There is a problem with the formatting on the Windows build. I need to investigate this. llvm-svn: 336061	2018-07-01 11:54:09 +00:00
Paul Semel	49997adc88	[llvm-readobj] Fix printing format We were printing every character, even those that weren't printable. It doesn't really make sense for this option. The string content was stuck to its address; added two spaces in between. Differential Revision: https://reviews.llvm.org/D48271 llvm-svn: 336058	2018-07-01 09:51:59 +00:00
Jonas Devlieghere	a0857eaefe	[dsymutil] Make the CachedBinaryHolder the default Replaces all uses of the old binary holder with its cached variant. Differential revision: https://reviews.llvm.org/D48770 llvm-svn: 335991	2018-06-29 16:51:52 +00:00
Sterling Augustine	0cf1f15e83	Require x86 for this test. llvm-svn: 335939	2018-06-28 23:22:14 +00:00
Jake Ehrlich	0f440d832f	[llvm-readobj] Add experimental support for SHT_RELR sections This change adds experimental support for SHT_RELR sections, proposed here: https://groups.google.com/forum/#!topic/generic-abi/bX460iggiKg Definitions for the new ELF section type and dynamic array tags, as well as the encoding used in the new section are all under discussion and are subject to change. Use with caution! Author: rahulchaudhry Differential Revision: https://reviews.llvm.org/D47919 llvm-svn: 335922	2018-06-28 21:07:34 +00:00
Sterling Augustine	052ce120d5	Some targets don't have lld built, so just use a binary copy of the input file. llvm-svn: 335908	2018-06-28 19:47:23 +00:00
Sterling Augustine	bc78b62169	Handle absolute symbols as branch targets in disassembly. https://reviews.llvm.org/D48554 llvm-svn: 335903	2018-06-28 18:57:13 +00:00
Simon Pilgrim	83125594ed	[llvm-mca][x86] Add FMA4 resource tests We should be ensuring we have (near) complete test coverage of instructions, at least for the generic model. llvm-svn: 335870	2018-06-28 16:24:13 +00:00
Simon Pilgrim	12f9503d40	[llvm-mca][x86] Add 3dnow! resource tests We should be ensuring we have (near) complete test coverage of instructions, at least for the generic model. llvm-svn: 335869	2018-06-28 16:21:22 +00:00
Fangrui Song	ee15d3dcdb	Move `REQUIRES:` line to the top llvm-svn: 335635	2018-06-26 17:44:23 +00:00
Tim Northover	f2f9f2f505	ARM: add binary file git swallowed. Should fix bots. llvm-svn: 335596	2018-06-26 12:28:47 +00:00
Tim Northover	bf54858115	ARM: diagnose unpredictable IT instructions IT instructions are allowed to have the 'AL' predicate, but it must never result in an 'NV' predicated instruction. Essentially this means that all branches must be 't' rather than 'e' if the predicate is 'AL'. This patch adds a diagnostic for this during assembly (error because parsing hits an assertion if allowed to continue) and an annotation during disassembly. llvm-svn: 335593	2018-06-26 11:38:41 +00:00
Vedant Kumar	b725c69f12	[SelectionDAG] Remove debug locations from ConstantSD(FP)Nodes This removes debug locations from ConstantSDNode and ConstantSDFPNode. When this kind of node is materialized we no longer create a line table entry which jumps back to the constant's first point of use. This makes single-stepping behavior smoother, and it matches the model used by IR, where Constants have no locations. See this thread for more context: http://lists.llvm.org/pipermail/llvm-dev/2018-June/124164.html I'd like to handle constant BuildVectorSDNodes and to try to eliminate passing SDLocs to SelectionDAG::getConstant*() in follow-up commits. Differential Revision: https://reviews.llvm.org/D48468 llvm-svn: 335497	2018-06-25 17:06:18 +00:00
Jonas Devlieghere	fb54074112	[llvm-mt] Use WithColor for printing errors. Use the WithColor helper from support to print errors. llvm-svn: 335416	2018-06-23 16:49:07 +00:00
Eugene Leviant	da873b5e2e	[LIT] Enable testing of LLVM gold plugin on Mac OS X Differential revision: https://reviews.llvm.org/D48350 llvm-svn: 335136	2018-06-20 15:32:47 +00:00
Andrea Di Biagio	2145b13fc9	[llvm-mca][X86] Teach how to identify register writes that implicitly clear the upper portion of a super-register. This patch teaches llvm-mca how to identify register writes that implicitly zero the upper portion of a super-register. On X86-64, a general purpose register is implemented in hardware as a 64-bit register. Quoting the Intel 64 Software Developer's Manual: "an update to the lower 32 bits of a 64 bit integer register is architecturally defined to zero extend the upper 32 bits". Also, a write to an XMM register performed by an AVX instruction implicitly zeroes the upper 128 bits of the aliasing YMM register. This patch adds a new method named clearsSuperRegisters to the MCInstrAnalysis interface to help identify instructions that implicitly clear the upper portion of a super-register. The rest of the patch teaches llvm-mca how to use that new method to obtain the information, and update the register dependencies accordingly. I compared the kernels from tests clear-super-register-1.s and clear-super-register-2.s against the output from perf on btver2. Previously there was a large discrepancy between the estimated IPC and the measured IPC. Now the differences are mostly in the noise. Differential Revision: https://reviews.llvm.org/D48225 llvm-svn: 335113	2018-06-20 10:08:11 +00:00
Roman Lebedev	d23b6831de	[X86][Znver1] Specify Register Files, RCU; FP scheduler capacity. Summary: First off: i do not have any access to that processor, so this is purely theoretical, no benchmarks. I have been looking into bdver2 scheduling profile, and while cross-referencing the existing btver2, znver1 profiles, and the reference docs (`Software Optimization Guide for AMD Family {15,16,17}h Processors`), i have noticed that only btver2 scheduling profile specifies these. Also, there is no mca test coverage. Reviewers: RKSimon, craig.topper, courbet, GGanesh, andreadb Reviewed By: GGanesh Subscribers: gbedwell, vprasad, ddibyend, shivaram, Ashutosh, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D47676 llvm-svn: 335099	2018-06-20 07:01:14 +00:00
Clement Courbet	e0aa30008f	[X86] Fix r335097 Missed `Generic` test in llvm-mca. llvm-svn: 335098	2018-06-20 06:44:13 +00:00
Clement Courbet	7b9913fb9f	[X86] Add sched class WriteLAHFSAHF and fix values. Summary: I ran llvm-exegesis on SKX, SKL, BDW, HSW, SNB. Atom is from Agner and SLM is a guess. I've left AMD processors alone. Reviewers: RKSimon, craig.topper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D48079 llvm-svn: 335097	2018-06-20 06:13:39 +00:00
Roman Lebedev	ae0527aac9	[MCA][NFC] Add generic XOP resource tests Summary: Based on * [[ https://support.amd.com/TechDocs/43479.pdf \| AMD64 Architecture Programmer’s Manual Volume 6: 128-Bit and 256-Bit XOP and FMA4 Instructions ]], * [[ https://support.amd.com/TechDocs/24594.pdf \| AMD64 Architecture Programmer’s Manual Volume 3: General-Purpose and System Instructions]], * https://en.wikipedia.org/wiki/XOP_instruction_set Appears to be only supported in AMD's 15h generation, so only in bdver[1-4], for which currently llvm has no scheduling profiles. Reviewers: RKSimon, craig.topper, andreadb, spatel Reviewed By: RKSimon Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D48264 llvm-svn: 335034	2018-06-19 09:21:27 +00:00
Roman Lebedev	0d12c1685b	[MCA][NFC] Add generic TBM resource tests Summary: Based on https://support.amd.com/TechDocs/24594.pdf, https://en.wikipedia.org/wiki/Bit_Manipulation_Instruction_Sets#TBM_(Trailing_Bit_Manipulation) Appears to be only supported in AMD's 15h generation, so only in bdver[1-4], for which currently llvm has no scheduling profiles. Reviewers: RKSimon, craig.topper, simark, andreadb Reviewed By: RKSimon Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D48252 llvm-svn: 335033	2018-06-19 09:21:22 +00:00
Andrea Di Biagio	a88281d8ae	[llvm-mca] Use an ordered map to collect hardware statistics. NFC. Histogram entries are now ordered by key. This should improve their readability when statistics are printed. llvm-svn: 334961	2018-06-18 17:04:56 +00:00
Andrea Di Biagio	487da729a2	[llvm-mca] Add tests for XOP and AVX512 instructions that implicitly clear the upper portion of a super-register. When the destination register of an XOP instruction is an XMM register, bits [255:128] of the corresponding YMM register are cleared. When the destination register of an EVEX encoded instruction is an XMM/YMM register, the upper bits of the corresponding ZMM are cleared. On processors that feature AVX512, a write to an XMM register always clears the upper portion of the corresponding ZMM register if the instruction is VEX or EVEX encoded. These new tests show some interesting cases which aren't correctly analyzed by llvm-mca. The lack of knowledge related to the implicit update on the super-registers is addressed by D48225. llvm-svn: 334945	2018-06-18 14:00:30 +00:00
Clement Courbet	0d9da88d18	[X86] Fix NOOP sched overrides on BDW/HSW/SKL. Summary: Noop certainly does not use resources. Reviewers: RKSimon, craig.topper, andreadb Subscribers: gbedwell, llvm-commits, gchatelet Differential Revision: https://reviews.llvm.org/D48028 llvm-svn: 334927	2018-06-18 06:48:22 +00:00
Simon Pilgrim	e930f569f7	[llvm-mca][X86] Add some avx512f/avx512vl resource test placeholders There are a lot of instructions to add under these ISAs (and the other AVX512 variants) but this should demonstrate how to test for the EVEX instructions with different maskings llvm-svn: 334907	2018-06-17 16:25:48 +00:00
Simon Pilgrim	f5ecd8d50d	[llvm-mca][x86] Add Generic cpu resource tests Added a Generic x86 cpu set of resource tests to allow us to check all ISAs. We currently use SandyBridge as our generic CPU model, but it's better if we actually duplicate these tests for if/when we change the model, it also means we don't end up polluting the SandyBridge folder with tests for ISAs it doesn't support. llvm-svn: 334853	2018-06-15 18:35:25 +00:00
Roman Lebedev	9ddf128f79	[MCA] Add -summary-view option Summary: While that is indeed a quite interesting summary stat, there are cases where it does not really add anything other than consuming extra lines. Declutters the output of D48190. Reviewers: RKSimon, andreadb, courbet, craig.topper Reviewed By: andreadb Subscribers: javed.absar, gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D48209 llvm-svn: 334833	2018-06-15 14:01:43 +00:00
Roman Lebedev	7c423001e4	[MCA][x86][NFC] Add tests for -register-file-stats, -scheduler-stats Summary: There does not seem to be any other tests for this. Split off from D47676. Reviewers: RKSimon, craig.topper, courbet, andreadb Reviewed By: andreadb Subscribers: javed.absar, gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D48190 llvm-svn: 334832	2018-06-15 14:01:35 +00:00
Andrea Di Biagio	4cafb297d5	[llvm-mca] Add tests for instructions that implicitly clear the upper portion of a super-register. On x86-64, a write to register EAX implicitly clears the upper half of RAX. 128-bit AVX instructions clear the upper 128 bits of the YMM register that aliases the XMM definition register. llvm-mca doesn't know about register writes that implicitly clear the upper portion of an aliasing super-register. This issue will be fixed in a future patch. llvm-svn: 334742	2018-06-14 17:48:42 +00:00
Andrea Di Biagio	4729d1ff27	[llvm-mca] Add another test for partial register stalls. This test checks that a physical register is correctly allocated for the partial write to register BX. The ADD instruction has to wait for the write to RBX (and BX) before being executed. llvm-svn: 334730	2018-06-14 15:54:34 +00:00
Andrea Di Biagio	0ffb2271a1	[llvm-mca] Fixed a bug in the logic that checks if a memory operation is ready to execute. Fixes PR37790. In some (very rare) cases, the LSUnit (Load/Store unit) was wrongly marking a load (or store) as "ready to execute" effectively bypassing older memory barrier instructions. To reproduce this bug, the memory barrier must be the first instruction in the input assembly sequence, and it doesn't have to perform any register writes. llvm-svn: 334633	2018-06-13 18:30:14 +00:00
Pavel Labath	4adc88ed25	[DWARF/AccelTable] Remove getDIESectionOffset for DWARF v5 entries Summary: This method was not correct for entries in DWO files as it assumed it could just add up the CU and DIE offsets to get the absolute DIE offset. This is not correct for the DWO files, as here the CU offset will reference the skeleton unit, whereas the DIE offset will be the offset in the full unit in the DWO file. Unfortunately, this means that we are not able to determine the absolute DIE offset using the information in the .debug_names section alone, which means we have to offload some of this work to the users of this class. To demonstrate how this can be done, I've added/fixed the ability to lookup entries using accelerator tables in DWO files in llvm-dwarfdump. To make this happen, I've needed to make two extra changes in other classes: - made the DWARFContext method to lookup a CU based on the section offset public. I've needed this functionality to lookup a CU, and this seems like a useful thing in general. - made DWARFUnit::getDWOId call extractDIEsIfNeeded. Before this, the DWOId was filled in only if the root DIE happened to be parsed before we called the accessor. Since the lazy parsing is supposed to happen under the hood, calling extractDIEsIfNeeded seems appropriate. Reviewers: JDevlieghere, aprantl, dblaikie Subscribers: mgrang, llvm-commits Differential Revision: https://reviews.llvm.org/D48009 llvm-svn: 334578	2018-06-13 08:14:27 +00:00
Clement Courbet	7db69cc08a	[X86] Fix skylake server scheduling info. Summary: This fixes most of the scheduling info for SKX vector operations. I had to split a lot of the YMM/ZMM classes into separate classes for YMM and ZMM. The before/after llvm-exegesis analysis are in the phabricator diff. Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47721 llvm-svn: 334407	2018-06-11 14:37:53 +00:00
Simon Pilgrim	89deac6694	[X86][BtVer2] Add support for all SUB/XOR 32/64 scalar instructions that should match the dependency-breaking 'zero-idiom' As detailed on Agner's Microarchitecture doc (21.8 AMD Bobcat and Jaguar pipeline - Dependency-breaking instructions), these instructions are dependency breaking and fast-path zero the destination register (and appropriate EFLAGS bits). llvm-svn: 334303	2018-06-08 17:00:45 +00:00
Simon Pilgrim	efb4806bb9	[X86][BtVer2] Remove SBB tests that were accidentally added in rL334296 These aren't true zero-idiom instructions (just dependency breaking). llvm-svn: 334297	2018-06-08 15:43:00 +00:00
Simon Pilgrim	53766a986d	[X86][BtVer2] Add tests for scalar SUB/XOR instructions that should match the dependency-breaking 'zero-idiom' As detailed on Agner's Microarchitecture doc (21.8 AMD Bobcat and Jaguar pipeline - Dependency-breaking instructions). llvm-svn: 334296	2018-06-08 15:28:43 +00:00
Simon Pilgrim	aafcf9e4a1	[X86][BtVer2] Limit zero idiom tests to a single iteration. Reduces output size and we're only wanting to check that the instructions are fast-path'd (just Dispatch+Retire) anyhow llvm-svn: 334292	2018-06-08 15:01:40 +00:00
Paul Semel	e57bc78324	[llvm-strip] Expose --strip-unneeded option Differential Revision: https://reviews.llvm.org/D47818 llvm-svn: 334182	2018-06-07 10:05:25 +00:00
Peter Collingbourne	cf017ada68	llvm-readobj: fix printing number of relocations in Android packed format. With '-elf-output-style=GNU -relocations', a header containing the number of entries is printed before all the relocation entries in the section. For Android packed format, we need to perform the unpacking first before we can get the actual number of relocations in the section. Patch by Rahul Chaudhry! Differential Revision: https://reviews.llvm.org/D47800 llvm-svn: 334147	2018-06-07 00:02:07 +00:00

2432 Commits