llvm-project

Commit Graph

Author	SHA1	Message	Date
Clement Courbet	7b475f3b41	[llvm-exegesis] Also check latency mode in local lit. Summary: This should avoid failing on old CPUs that do not have a cycle counter. Subscribers: tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D55416 llvm-svn: 348740	2018-12-10 07:29:47 +00:00
Martin Storsjo	e3093808fb	[COFF] Map truncated .eh_frame section name PE/COFF sections can have section names truncated to 8 chars, in order to have the name available at runtime. (The string table, where long untruncated names are stored, isn't loaded at runtime.) This allows various llvm tools to dump the .eh_frame section from such executables. Patch by Peiyuan Song! Differential Revision: https://reviews.llvm.org/D55407 llvm-svn: 348708	2018-12-08 18:15:41 +00:00
Simon Pilgrim	99c139f4dc	[llvm-mca][x86] Add RDSEED instruction resource tests for GLM llvm-svn: 348624	2018-12-07 18:37:40 +00:00
Simon Pilgrim	c703ce35b8	[llvm-mca][x86] Add missing AES instruction resource tests Add missing non-VEX instructions llvm-svn: 348623	2018-12-07 18:35:54 +00:00
Simon Pilgrim	c4e2776f3b	[llvm-mca][x86] Add RDRAND/RDSEED instruction resource tests llvm-svn: 348622	2018-12-07 18:29:47 +00:00
Hans Wennborg	c56cc3a889	Fix test/tools/llvm-mca/AArch64/Exynos/direct-branch.s on Mac It was failing as below. Adding a triple seems to help. -- : 'RUN: at line 2'; /work/llvm.combined/build.release/bin/llvm-mca -march=aarch64 -mcpu=exynos-m1 -resource-pressure=false < /work/llvm.combined/llvm/test/tools/llvm-mca/AArch64/Exynos/direct-branch.s \| /work/llvm.combined/build.release/bin/FileCheck /work/llvm.combined/llvm/test/tools/llvm-mca/AArch64/Exynos/direct-branch.s -check-prefixes=ALL,M1 : 'RUN: at line 3'; /work/llvm.combined/build.release/bin/llvm-mca -march=aarch64 -mcpu=exynos-m3 -resource-pressure=false < /work/llvm.combined/llvm/test/tools/llvm-mca/AArch64/Exynos/direct-branch.s \| /work/llvm.combined/build.release/bin/FileCheck /work/llvm.combined/llvm/test/tools/llvm-mca/AArch64/Exynos/direct-branch.s -check-prefixes=ALL,M3 -- Exit Code: 1 Command Output (stderr): -- /work/llvm.combined/llvm/test/tools/llvm-mca/AArch64/Exynos/direct-branch.s:36:12: error: M1-NEXT: expected string not found in input ^ <stdin>:21:2: note: scanning from here 1 0 0.25 b Ltmp0 ^ -- llvm-svn: 348577	2018-12-07 09:58:33 +00:00
Evandro Menezes	51df880e70	[llvm-mca] Improve test (NFC) Add more instructions to the test for Cortex. llvm-svn: 348565	2018-12-07 03:23:36 +00:00
Evandro Menezes	83beb91450	[llvm-mca] Improve test (NFC) Add a label to make explicit that the branch is short for Exynos. llvm-svn: 348564	2018-12-07 03:23:14 +00:00
George Rimar	f8825577be	[llvm-dwarfdump] - Simplify the test case. The test was fully rewritten for simplification. New test code was suggested by David Blaikie. Differential revision: https://reviews.llvm.org/D55261 llvm-svn: 348464	2018-12-06 08:42:57 +00:00
Jake Ehrlich	85985ed363	[llvm-objcopy] Change --only-keep to --only-section I just hard core goofed when I wrote this and created a different name for no good reason. I'm failry aware of most "fresh" users of llvm-objcopy (that is, users which are not using it as a drop in replacement for GNU objcopy) and can say that only "-j" is being used by such people so this patch should strictly increase compatibility and not remove it. Differential Revision: https://reviews.llvm.org/D52180 llvm-svn: 348446	2018-12-06 02:03:53 +00:00
Evandro Menezes	5d42bc7ce8	[llvm-mca] Simplify test (NFC) llvm-svn: 348395	2018-12-05 18:34:51 +00:00
Evandro Menezes	86953e4350	[llvm-mca] Sort test run lines (NFC) llvm-svn: 348393	2018-12-05 18:30:06 +00:00
Martin Storsjo	d0afe724d1	[llvm-rc] Support not expressions. Patch by Jacek Caban! Differential Revision: https://reviews.llvm.org/D55242 llvm-svn: 348363	2018-12-05 13:22:56 +00:00
Michal Gorny	c6cdc16ad6	[test] Split strip-preserve-time.test, and skip atime test on NetBSD Split timestamp preservation tests into atime and mtime test, and skip the former on NetBSD. When the filesystem is mounted noatime, NetBSD not only inhibits implicit atime updates but also prevents setting atime via utime(), causing the test to fail. Differential Revision: https://reviews.llvm.org/D55271 llvm-svn: 348354	2018-12-05 11:15:46 +00:00
George Rimar	7e981f330b	[llvm-dwarfdump] - Dump the older versions of .eh_frame/.debug_frame correctly. The issue is the following. DWARF 2 used version 1 for .debug_frame. (Appendix G, p. 416 http://dwarfstd.org/doc/DWARF5.pdf) lib/MC now always sets version 1 for .eh_frame (and sets 1-4 versions for .debug_frame correctly): https://github.com/llvm-mirror/llvm/blob/master/lib/MC/MCDwarf.cpp#L1530 https://github.com/llvm-mirror/llvm/blob/master/lib/MC/MCDwarf.cpp#L1562 https://github.com/llvm-mirror/llvm/blob/master/lib/MC/MCDwarf.cpp#L1602 In version 1, return_address_register was defined as ubyte, while other versions switched to uleb128. (p 62, http://www.dwarfstd.org/doc/dwarf-2.0.0.pdf) Patch teaches llvm-dwarfdump about this difference. Differential revision: https://reviews.llvm.org/D54860 llvm-svn: 348242	2018-12-04 10:01:39 +00:00
Jake Ehrlich	8ad7779071	[llvm-objcopy] Add --build-id-link-dir flag This flag does not exist in GNU objcopy but has a major use case. Debugging tools support the .build-id directory structure to find debug binaries. There is no easy way to build this structure up however. One way to do it is by using llvm-readelf and some crazy shell magic. This implements the feature directly. It is most often the case that you'll want to strip a file and send the original to the .build-id directory but if you just want to send a file to the .build-id directory you can copy to /dev/null instead. Differential Revision: https://reviews.llvm.org/D54384 llvm-svn: 348174	2018-12-03 19:49:23 +00:00
George Rimar	6d85c58328	[llvm-dwarfdump] - Stop printing the bogus empty section name on invalid dwarf. When there is no .debug_addr section for some reason, llvm-dwarfdump would print the bogus empty section name when dumping ranges in .debug_info: DW_AT_ranges [DW_FORM_rnglistx] (indexed (0x0) rangelist = 0x00000004 [0x0000000000000000, 0x0000000000000001) "" [0x0000000000000000, 0x0000000000000002) "") That happens because of the code which uses 0 (zero) as a section index as a default value. The code should use -1ULL instead because technically 0 is a valid zero section index in ELF and -1ULL is a special constant used that means "no section available". This is mostly a fix for the overall correctness/safety of the code, but a test case is provided too. Differential revision: https://reviews.llvm.org/D55113 llvm-svn: 348115	2018-12-03 10:33:40 +00:00
Jonas Devlieghere	d1c9751657	[dsymutil] Gather global and local symbol addresses in the main executable. Usually local symbols will have their address described in the debug map. Global symbols have to have their address looked up in the symbol table of the main executable. By playing with 'ld -r' and export lists, you can get a symbol described as global by the debug map while actually being a local symbol as far as the link in concerned. By gathering the address of local symbols, we fix this issue. Also, we prefer a global symbol in case of a name collision to preserve the previous behavior. Note that using the 'ld -r' tricks, people can actually cause symbol names collisions that dsymutil has no way to figure out. This fixes the simple case where there is only one symbol of a given name. rdar://problem/32826621 Differential revision: https://reviews.llvm.org/D54922 llvm-svn: 348021	2018-11-30 18:56:10 +00:00
Alexander Shaposhnikov	6e4dc6f23f	[llvm-objcopy] Move elf-specific tests into subfolder In this diff the elf-specific tests are moved into the subfolder llvm-objcopy/ELF (the change was discussed in the comments on https://reviews.llvm.org/D54674). A separate code reivew wasn't sent for this change since Phabricator is failing to create such a large diff. Test plan: make check-all make check-llvm-tools make check-llvm-tools-llvm-objcopy llvm-svn: 347958	2018-11-30 05:43:39 +00:00
Martin Storsjo	c1410635bf	[obj2yaml] [COFF] Write RVA instead of VA for sections, fix roundtripping executables yaml2obj writes the yaml value as is to the output file. Differential Revision: https://reviews.llvm.org/D54965 llvm-svn: 347916	2018-11-29 20:53:57 +00:00
Martin Storsjo	a876b5c0f5	[llvm-rc] Support EXSTYLE statement. Patch by Jacek Caban! Differential Revision: https://reviews.llvm.org/D55020 llvm-svn: 347858	2018-11-29 12:17:39 +00:00
Andrea Di Biagio	373a4ccf6c	[llvm-mca][MC] Add the ability to declare which processor resources model load/store queues (PR36666). This patch adds the ability to specify via tablegen which processor resources are load/store queue resources. A new tablegen class named MemoryQueue can be optionally used to mark resources that model load/store queues. Information about the load/store queue is collected at 'CodeGenSchedule' stage, and analyzed by the 'SubtargetEmitter' to initialize two new fields in struct MCExtraProcessorInfo named `LoadQueueID` and `StoreQueueID`. Those two fields are identifiers for buffered resources used to describe the load queue and the store queue. Field `BufferSize` is interpreted as the number of entries in the queue, while the number of units is a throughput indicator (i.e. number of available pickers for loads/stores). At construction time, LSUnit in llvm-mca checks for the presence of extra processor information (i.e. MCExtraProcessorInfo) in the scheduling model. If that information is available, and fields LoadQueueID and StoreQueueID are set to a value different than zero (i.e. the invalid processor resource index), then LSUnit initializes its LoadQueue/StoreQueue based on the BufferSize value declared by the two processor resources. With this patch, we more accurately track dynamic dispatch stalls caused by the lack of LS tokens (i.e. load/store queue full). This is also shown by the differences in two BdVer2 tests. Stalls that were previously classified as generic SCHEDULER FULL stalls, are not correctly classified either as "load queue full" or "store queue full". About the differences in the -scheduler-stats view: those differences are expected, because entries in the load/store queue are not released at instruction issue stage. Instead, those are released at instruction executed stage. This is the main reason why for the modified tests, the load/store queues gets full before PdEx is full. Differential Revision: https://reviews.llvm.org/D54957 llvm-svn: 347857	2018-11-29 12:15:56 +00:00
Martin Storsjo	e9af715820	[llvm-objcopy] Hook up the -V alias to --version, output "GNU strip" This allows libtool to detect the presence of llvm-strip and use it with the options --strip-debug and --strip-unneeded. Also hook up the -V alias for objcopy. Differential Revision: https://reviews.llvm.org/D54936 llvm-svn: 347731	2018-11-28 06:51:50 +00:00
Martin Storsjo	5db3d33938	[yaml2obj] Treat COFF/ARM64 as a 64 bit architecture Differential Revision: https://reviews.llvm.org/D54935 llvm-svn: 347703	2018-11-27 20:47:38 +00:00
Andrea Di Biagio	7a7588990b	[llvm-mca] pass -dispatch-stats flag to a couple of tests. NFC This change is in preparation for a patch that fixes PR36666. llvm-mca currently doesn't know if a buffered processor resource describes a load or store queue. So, any dynamic dispatch stall caused by the lack of load/store queue entries is normally reported as a generic SCHEDULER stall. See for example the -dispatch-stats output from the two tests modified by this patch. In future, processor models will be able to tag processor resources that are used to describe load/store queues. That information would then be used by llvm-mca to correctly classify dynamic dispatch stalls caused by the lack of tokens in the LS. llvm-svn: 347662	2018-11-27 15:56:00 +00:00
Vitaly Buka	dfe8474e48	[cfi] Help sanstats to find binary if they are not at the original location Summary: By default sanstats search binaries at the same location where they were when stats was collected. Sometime you can not print report immediately or you need to move post-processing to another workstation. To support this use-case when original binary is missing sanstats will fall-back to directory with sanstats file. Reviewers: pcc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D53857 llvm-svn: 347601	2018-11-26 21:48:55 +00:00
Vitaly Buka	db87ced890	[cfi] Make sanstats print address of the check Summary: Help with off-line symbolization or other type debugging. Reviewers: pcc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D53606 llvm-svn: 347600	2018-11-26 21:48:45 +00:00
Evandro Menezes	56368c6fa5	[AArch64] Refactor the scheduling predicates (2/3) (NFC) Refactor the scheduling predicates based on `MCInstPredicate`. In this case, `AArch64InstrInfo::hasShiftedReg()`. Differential revision: https://reviews.llvm.org/D54820 llvm-svn: 347598	2018-11-26 21:47:41 +00:00
Evandro Menezes	b02ac8bd21	[AArch64] Refactor the scheduling predicates (1/3) (NFC) Refactor the scheduling predicates based on `MCInstPredicate`. In this case, `AArch64InstrInfo::isScaledAddr()` Differential revision: https://reviews.llvm.org/D54777 llvm-svn: 347597	2018-11-26 21:47:28 +00:00
Andrea Di Biagio	36296c0484	[llvm-mca] Add support for instructions with a variadic number of operands. By default, llvm-mca conservatively assumes that a register operand from the variadic sequence is both a register read and a register write. That is because MCInstrDesc doesn't describe extra variadic operands; we don't have enough dataflow information to tell which register operands from the variadic sequence is a definition, and which is a use instead. However, if a variadic instruction is flagged 'mayStore' (but not 'mayLoad'), and it has no 'unmodeledSideEffects', then llvm-mca (very) optimistically assumes that any register operand in the variadic sequence is a register read only. Conversely, if a variadic instruction is marked as 'mayLoad' (but not 'mayStore'), and it has no 'unmodeledSideEffects', then llvm-mca optimistically assumes that any extra register operand is a register definition only. These assumptions work quite well for variadic load/store multiple instructions defined by the ARM backend. llvm-svn: 347522	2018-11-25 12:46:24 +00:00
Evandro Menezes	079bf4b7b4	[TableGen] Emit more variant transitions `llvm-mca` relies on the predicates to be based on `MCSchedPredicate` in order to resolve the scheduling for variant instructions. Otherwise, it aborts the building of the instruction model early. However, the scheduling model emitter in `TableGen` gives up too soon, unless all processors use only such predicates. In order to allow more processors to be used with `llvm-mca`, this patch emits scheduling transitions if any processor uses these predicates. The transition emitted for the processors using legacy predicates is the one specified with `NoSchedPred`, which is based on `MCSchedPredicate`. Preferably, `llvm-mca` should instead assume a reasonable default when a variant transition is not based on `MCSchedPredicate` for a given processor. This issue should be revisited in the future. Differential revision: https://reviews.llvm.org/D54648 llvm-svn: 347504	2018-11-23 21:17:33 +00:00
Andrea Di Biagio	7e32cc8353	[llvm-mca] Refactor some of the logic in InstrBuilder, and add a verifyOperands method. With this change, InstrBuilder emits an error if the MCInst sequence contains an instruction with a variadic opcode, and a non-zero number of variadic operands. Currently we don't know how to correctly analyze variadic opcodes. The problem with variadic operands is that there is no information for them in the opcode descriptor (i.e. MCInstrDesc). That means, we don't know which variadic operands are defs, and which are uses. In future, we could try to conservatively assume that any extra register operands is both a register use and a register definition. This patch fixes a subtle bug in the evaluation of read/write operands for ARM VLD1 with implicit index update. Added test vld1-index-update.s llvm-svn: 347503	2018-11-23 20:26:57 +00:00
Luke Cheeseman	6db3a6a4a7	Revert r347490 as it breaks address sanitizer builds llvm-svn: 347499	2018-11-23 17:13:06 +00:00
Andrea Di Biagio	07a8255a78	[llvm-mca][View] Improved Retire Control Unit Statistics. RetireControlUnitStatistics now reports extra information about the ROB and the avg/maximum number of entries consumed over the entire simulation. Example: Retire Control Unit - number of cycles where we saw N instructions retired: [# retired], [# cycles] 0, 109 (17.9%) 1, 102 (16.7%) 2, 399 (65.4%) Total ROB Entries: 64 Max Used ROB Entries: 35 ( 54.7% ) Average Used ROB Entries per cy: 32 ( 50.0% ) Documentation in llvm/docs/CommandGuide/llvmn-mca.rst has been updated to reflect this change. llvm-svn: 347493	2018-11-23 12:12:57 +00:00
Luke Cheeseman	d6dbd64104	Revert r343341 - Cannot reproduce the build failure locally and the build logs have been deleted. llvm-svn: 347490	2018-11-23 11:01:47 +00:00
Andrea Di Biagio	1cb8a3c690	[llvm-mca] Fix an invalid memory read introduced by r346487. This patch fixes an invalid memory read introduced by r346487. Before this patch, partial register write had to query the latency of the dependent full register write by calling a method on the full write descriptor. However, if the full write is from an already retired instruction, chances are that the EntryStage already reclaimed its memory. In some parial register write tests, valgrind was reporting an invalid memory read. This change fixes the invalid memory access problem. Writes are now responsible for tracking dependent partial register writes, and notify them in the event of instruction issued. That means, partial register writes no longer need to query their associated full write to check when they are ready to execute. Added test X86/BtVer2/partial-reg-update-7.s llvm-svn: 347459	2018-11-22 12:48:57 +00:00
Evandro Menezes	d0792170a3	[llvm-mca] Add test case (NFC) Add test case that will serve as the base for D54820. llvm-svn: 347440	2018-11-22 00:38:36 +00:00
Evandro Menezes	b9f9042648	[llvm-mca] Add test case (NFC) Fix previous commit r347434. llvm-svn: 347437	2018-11-21 23:36:40 +00:00
Evandro Menezes	34b32a3019	[llvm-mca] Add test case (NFC) Add test case that will serve as the base for D54777. llvm-svn: 347434	2018-11-21 22:57:46 +00:00
Xing GUO	2e3364f9c2	[commit-test] Add blank line for test/tools/llvm-objdump/symbol-table-elf.test Summary: Test commit Reviewers: Higuoxing Reviewed By: Higuoxing Subscribers: llvm-commits, Higuoxing Differential Revision: https://reviews.llvm.org/D54562 llvm-svn: 346924	2018-11-15 02:36:20 +00:00
Adrian Prantl	a77eae96bf	Make dsymutil more robust when parsing load commands. rdar://problem/45883463 llvm-svn: 346815	2018-11-13 23:31:25 +00:00
Jordan Rupprecht	c5bae7834e	[llvm-objcopy] Rename --keep to --keep-section. Summary: llvm-objcopy/strip support `--keep` (for sections) and `--keep-symbols` (for symbols). For consistency and clarity, rename `--keep` to `--keep-section`. In fact, for GNU compatability, -K is --keep-symbol, so it's weird that the alias `-K` is not the same as the short-ish `--keep`. Reviewers: jakehehrlich, jhenderson, alexshap, MaskRay, espindola Reviewed By: jakehehrlich, MaskRay Subscribers: emaste, arichardson, llvm-commits Differential Revision: https://reviews.llvm.org/D54477 llvm-svn: 346782	2018-11-13 19:32:27 +00:00
Xing GUO	65051c83b9	[commit test] Add blank line to test/tools/llvm-objdump/full-contents.test llvm-svn: 346729	2018-11-13 02:14:38 +00:00
Jake Ehrlich	ee7ec6ce1f	[libObject] Fix getDesc for Elf_Note_Impl This change fixes a bug in Elf_Note_Impl in which Elf_Word was used where uint8_t should have been used. llvm-svn: 346724	2018-11-13 01:10:35 +00:00
Jordan Rupprecht	dbf552c40f	[llvm-readelf] Make llvm-readelf more compatible with GNU readelf. Summary: This change adds a bunch of options that GNU readelf supports. There is one breaking change when invoked as `llvm-readobj`, and three breaking changes when invoked as `llvm-readelf`: - Add --all (implies --file-header, --program-headers, etc.) - [Breaking] -a is --all instead of --arm-attributes - Add --file-header as an alias for --file-headers - Replace --sections with --sections-headers, keeping --sections as an alias for it - Add --relocs as an alias for --relocations - Add --dynamic as an alias for --dynamic-table - Add --segments as an alias for --program-headers - Add --section-groups as an alias for --elf-section-groups - Add --dyn-syms as an alias for --dyn-symbols - Add --syms as an alias for --symbols - Add --histogram as an alias for --elf-hash-histogram - [Breaking] When invoked as `llvm-readelf`, -s is --symbols instead of --sections - [Breaking] When invoked as `llvm-readelf`, -t is no longer an alias for --symbols Reviewers: MaskRay, phosek, mcgrathr, jhenderson Reviewed By: MaskRay, jhenderson Subscribers: sbc100, aheejin, edd, jhenderson, silvas, echristo, compnerd, kristina, javed.absar, kristof.beyls, llvm-commits, Bigcheese Differential Revision: https://reviews.llvm.org/D54124 llvm-svn: 346685	2018-11-12 18:02:38 +00:00
Andrea Di Biagio	dda9032314	[llvm-mca] Correctly update the resource strategy for processor resources with multiple units. When looking at the tests committed by Roman at r346587, I noticed that numbers reported by the resource pressure for PdAGU01 were wrong. In particular, according to the aut-generated CHECK lines in tests memcpy-like-test.s and store-throughput.s, resource pressure for PdAGU01 was not uniformly distributed among the two AGEN pipes. It turns out that the reason why pressure was not correctly distributed, was because the "resource selection strategy" object associated with PdAGU01 was not correctly updated on the event of AGEN pipe used. As a result, llvm-mca was not simulating a round-robin pipeline allocation for PdAGU01. Instead, PdAGU1 was always prioritized over PdAGU0. This patch fixes the issue; now processor resource strategy objects for resources declaring multiple units, are correctly notified in the event of "resource used". llvm-svn: 346650	2018-11-12 13:09:39 +00:00
Fangrui Song	5014540a63	[llvm-objdump] add more constraints for tests Patch by Higuoxing (Xing) Reviewers: jhenderson Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D54299 llvm-svn: 346636	2018-11-12 08:10:14 +00:00
Jonas Devlieghere	ba17b96bed	[llvm-nm] Use WithColor for error reporting Use helpers from Support/WithError.h to print errors. llvm-svn: 346624	2018-11-11 22:12:21 +00:00
Kristina Brooks	0674f9d739	[llvm-objdump] Add symbol 'O' for object data Improve compatibility with GNU objdump by showing `O` next to global symbol names, instead of a blank space. Patch by Higuoxing (Xing). Reviewers: MaskRay Differential Revision: https://reviews.llvm.org/D54380 llvm-svn: 346610	2018-11-11 17:47:13 +00:00
Jonas Devlieghere	ed75efa5cd	[llvm-cxxdump] Use error reporting helpers from support This patch makes llvm-cxxdump use the error reporting helpers from Support/WithColor.h llvm-svn: 346602	2018-11-11 01:24:02 +00:00
Roman Lebedev	b428b8b214	[X86][BdVer2] Fix loads/stores throughput for Piledriver (PR39465) There are two AGU units, and per 1cy, there can be either two loads, or a load and a store; but not two stores, or two loads and a store. Additionally, loads shouldn't affect the store scheduler and vice versa. (but should affect the PdEX scheduler.) Required rL346545. Fixes https://bugs.llvm.org/show_bug.cgi?id=39465 llvm-svn: 346587	2018-11-10 14:31:43 +00:00
Roman Lebedev	e105b655a2	[NFC][MCA][BdVer2] Add bdver2 runline into register-file-statistics.s test Missed this one by accident when adding the initial version in rL345463 / rL345462 llvm-svn: 346585	2018-11-10 10:56:58 +00:00
Greg Clayton	44487b655d	Add total function byte size and inline function byte size to "llvm-dwarfdump --statistics" Differential Revision: https://reviews.llvm.org/D54217 llvm-svn: 346531	2018-11-09 18:10:02 +00:00
Jordan Rupprecht	dcf1f8e716	[llvm-strings] Fix whitespaces to match strings output. Summary: The current implementation prepends a space on every line, making it difficult to compare against GNU strings. The space appears to have come from handling --radix in rL292707. The space is for making sure there's a space between the radix and the value; however the space is still emitted even when there is no radix. This change fixes that so the space is only emitted when there is a radix. Reviewers: jhenderson Reviewed By: jhenderson Subscribers: llvm-commits, compnerd Differential Revision: https://reviews.llvm.org/D54238 llvm-svn: 346529	2018-11-09 18:03:21 +00:00
Max Moroz	b2091c930b	[llvm-cov] Add lcov tracefile export format. Summary: lcov tracefiles are used by various coverage reporting tools and build systems (e.g., Bazel). It is a simple text-based format to parse and more convenient to use than the JSON export format, which needs additional processing to map regions/segments back to line numbers. It's a little unfortunate that "text" format is now overloaded to refer specifically to JSON for export, but I wanted to avoid making any breaking changes to the UI of the llvm-cov tool at this time. Patch by Tony Allevato (@allevato). Reviewers: Dor1s, vsk Reviewed By: Dor1s, vsk Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D54266 llvm-svn: 346506	2018-11-09 16:10:44 +00:00
Clement Courbet	e6b727e552	[X86] Fix VZEROUPPER scheduling info on SNB,HSW,BDW,SXL,SKX. Summary: Starting from SNB, VZEROUPPER is handled by the renamer and uses no proc resources. After HSW, it also has zero latency. This fixes PR35606. To reproduce: Uops: llvm-exegesis -mode=uops -opcode-name=VZEROUPPER Latency: echo -e '#LLVM-EXEGESIS-DEFREG XMM0 1\n#LLVM-EXEGESIS-DEFREG XMM1 1\nvzeroupper' \| /tmp/llvm-exegesis -mode=latency -snippets-file=- echo -e '#LLVM-EXEGESIS-DEFREG XMM0 1\n#LLVM-EXEGESIS-DEFREG XMM1 1\nvzeroupper\naddps %xmm0, %xmm1' \| /tmp/llvm-exegesis -mode=latency -snippets-file=- Reviewers: RKSimon, craig.topper, andreadb Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D54107 llvm-svn: 346482	2018-11-09 09:49:06 +00:00
Petr Hosek	e2f6896eef	[llvm-rc] Support joined or separate spelling for /fo flag CMake invokes rc using the joined spelling which appears to be supported by Microsoft's rc implementation, so we should support it as well. Differential Revision: https://reviews.llvm.org/D54191 llvm-svn: 346470	2018-11-09 03:16:53 +00:00
Petr Hosek	1f597e6e6b	[llvm-rc] Support absolute filenames in manifests CMake generate manifests that contain absolute filenames and these currently result in assertion error. This change ensures that we handle these correctly. Differential Revision: https://reviews.llvm.org/D54194 llvm-svn: 346450	2018-11-08 23:45:00 +00:00
Roman Lebedev	3817292069	[NFC][BdVer2] Load and store throughput tests: also check sched stats (PR39465) As noted by Andrea Di Biagio in https://bugs.llvm.org/show_bug.cgi?id=39465 both the loads and stores occupy both the store and load queues. This is clearly wrong. llvm-svn: 346425	2018-11-08 18:15:58 +00:00
Adrian Prantl	778fba3188	[dsymutil] Copy the LC_BUILD_VERSION load command into the companion binary. LC_BUILD_VERSION contains platform information that is useful for LLDB to match up dSYM bundles with binaries. This patch copies the load command over into the dSYM. rdar://problem/44145175 rdar://problem/45883463 Differential Revision: https://reviews.llvm.org/D54233 llvm-svn: 346412	2018-11-08 16:54:59 +00:00
Roman Lebedev	2ad16b9371	[NFC][BdVer2] Tests for load and store throughput (PR39465) During review it was noted that while it appears that the Piledriver can do two [consecutive] loads per cycle, it can only do one store per cycle. It was suggested that the sched model incorrectly models that, but it was opted to fix this afterwards. These tests show that the two consecutive loads are modelled correctly, and one consecutive stores is not modelled incorrectly. Unless i'm missing the point. https://bugs.llvm.org/show_bug.cgi?id=39465 llvm-svn: 346404	2018-11-08 14:48:56 +00:00
Jordan Rupprecht	4f36c7ad90	[llvm-readobj] Implement LLVM style printer for --notes Summary: Port the GNU style printNotes method to the LLVMStyle subclass. This is basically just a heavy refactor so that the note parsing/formatting logic from the GNUStyle::printNotes can be shared with LLVMStyle::printNotes. Reviewers: MaskRay Reviewed By: MaskRay Subscribers: dschuff, fedor.sergeev, llvm-commits Differential Revision: https://reviews.llvm.org/D54220 llvm-svn: 346371	2018-11-07 23:53:50 +00:00
Adrian Prantl	85e71733ed	Fix spelling error llvm-svn: 346359	2018-11-07 21:34:33 +00:00
Clement Courbet	c544838f87	[llvm-exegesis] Correclty handle all X86 memory encoding formats. Summary: Add unit tests to check the support for each supported format to avoid regressions such as the one in PR36906. Reviewers: gchatelet Subscribers: tschuett, lebedev.ri, llvm-commits Differential Revision: https://reviews.llvm.org/D54144 llvm-svn: 346330	2018-11-07 16:14:55 +00:00
Heejin Ahn	756b50ea85	[WebAssembly] Update test cases after FixFunctionBitcasts Summary: This updates generated binaries and corresponding test cases up to date after applying FixFunctionBitcasts pass. Reviewers: sbc100 Subscribers: dschuff, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D54070 llvm-svn: 346286	2018-11-07 01:58:50 +00:00
Dean Michael Berris	25f8d204b8	[XRay] Update XRayRecord to support Custom/Typed Events Summary: This change cuts across LLVM and compiler-rt to add support for rendering custom events in the XRayRecord type, to allow for including user-provided annotations in the output YAML (as raw bytes). This work enables us to add custom event and typed event records into the `llvm::xray::Trace` type for user-provided events. This can then be programmatically handled through the C++ API and can be included in some of the tooling as well. For now we support printing the raw data we encounter in the custom events in the converted output. Future work will allow us to start interpreting these custom and typed events through a yet-to-be-defined API for extending the trace analysis library. Reviewers: mboerger Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D54139 llvm-svn: 346214	2018-11-06 08:51:37 +00:00
Wolfgang Pieb	5253cccbd5	[DWARF v5] Verifier: Add checks for DW_FORM_strx* forms. Adding functionality to the DWARF verifier for DWARF v5 strx* forms which index into the string offsets table. Differential Revision: https://reviews.llvm.org/D54049 llvm-svn: 346061	2018-11-03 00:27:35 +00:00
Eli Friedman	d2941b43f4	[AArch64] [Windows] Misc fixes for llvm-readobj -unwind. Use getImageBase() helper to compute the image base. Fix various offsets/addresses/masks so they're actually correct. This allows decoding unwind info from DLLs, and unwind info from object files containing multiple functions. Differential Revision: https://reviews.llvm.org/D54015 llvm-svn: 346036	2018-11-02 19:59:08 +00:00
Leonard Mosescu	4bdbea3ce2	Fix a few small issues in llvm-pdbutil Running "llvm-pdbutil dump -all" on linux (using the native PDB reader), over a few PDBs pulled from the Microsoft public symbol store uncovered a few small issues: - stripped PDBs might not have the strings stream (/names) - stripped PDBs might not have the "module info" stream Differential Revision: https://reviews.llvm.org/D54006 llvm-svn: 346010	2018-11-02 18:00:37 +00:00
Dean Michael Berris	8a3ef6f3c3	[XRay] Fix tests with updated fdr-dump Follow-up to D54022. llvm-svn: 345955	2018-11-02 08:35:46 +00:00
Andrea Di Biagio	fe3bc1b9bf	[llvm-mca] Add extra counters for move elimination in view RegisterFileStatistics. This patch teaches view RegisterFileStatistics how to report events for optimizable register moves. For each processor register file, view RegisterFileStatistics reports the following extra information: - Number of optimizable register moves - Number of register moves eliminated - Number of zero moves (i.e. register moves that propagate a zero) - Max Number of moves eliminated per cycle. Differential Revision: https://reviews.llvm.org/D53976 llvm-svn: 345865	2018-11-01 18:04:39 +00:00
Jordan Rupprecht	30d1b19b0b	[llvm-strip] Support --keep and --strip-all-gnu from llvm-objcopy Summary: Add --keep and --strip-all-gnu from llvm-objcopy into llvm-strip. Reviewers: jakehehrlich, jhenderson, alexshap Reviewed By: jhenderson, alexshap Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D53954 llvm-svn: 345861	2018-11-01 17:48:46 +00:00
Jordan Rupprecht	fc780bbb16	[llvm-objcopy] Support --{enable,disable}-deterministic-archives Summary: ar and objcopy/strip all support configuring whether archives are written deterministically (timestamps/UIDs/GIDs/etc zero'd). This has been ported to llvm-ar (the U/D modifiers) but not yet to llvm-objcopy/strip. Reviewers: jakehehrlich, jhenderson, alexshap Reviewed By: jhenderson Subscribers: ruiu, mgrang, llvm-commits Differential Revision: https://reviews.llvm.org/D53913 llvm-svn: 345859	2018-11-01 17:36:37 +00:00
Jordan Rupprecht	b47475c058	[llvm-objcopy] Don't apply --localize flags to common symbols Summary: --localize-symbol and --localize-hidden will currently localize common symbols. GNU objcopy will not localize these symbols even when explicitly requested, which seems reasonable; common symbols should always be global so they can be merged during linking. See PR39461 Reviewers: jakehehrlich, jhenderson, alexshap, MaskRay, espindola Reviewed By: jakehehrlich, jhenderson, alexshap, MaskRay Subscribers: emaste, arichardson, alexshap, MaskRay, llvm-commits Differential Revision: https://reviews.llvm.org/D53782 llvm-svn: 345856	2018-11-01 17:26:36 +00:00
Kristina Brooks	31579e9d19	[llvm-objdump] Add --reloc alias for -r (PR39407) This addresses PR39407 (https://bugs.llvm.org/show_bug.cgi?id=39407) improving compatibility with GNU binutils counterparts. Reviewed By: kristina Patch by Higuoxing (Xing). Differential Revision: https://reviews.llvm.org/D53804 llvm-svn: 345703	2018-10-31 09:34:08 +00:00
Kristina Brooks	3baa5f79d8	[llvm-objdump] support '--syms' as an alias of -t This adds support for '--syms' as an alias of '-t' for llvm-objdump, fixing PR39406 (https://bugs.llvm.org/show_bug.cgi?id=39406). Patch by Higuoxing (Xing). Differential Revision: https://reviews.llvm.org/D53803 llvm-svn: 345697	2018-10-31 05:45:01 +00:00
Jordan Rupprecht	634820d414	[llvm-objcopy] Fix --keep-global-symbol/--globalize-symbol for undefined symbols. Summary: --keep-global-symbol and --globalize-symbol don't make sense for undefined symbols, so it should be ignored for those symbols. This matches GNU objcopy behavior. Reviewers: jhenderson, alexshap, jakehehrlich, espindola Reviewed By: jhenderson, jakehehrlich Subscribers: emaste, arichardson, llvm-commits Differential Revision: https://reviews.llvm.org/D53733 llvm-svn: 345614	2018-10-30 16:23:38 +00:00
James Henderson	c1608c96f6	[llvm-objdump] Don't crash when using `-a` on non-archives This fixes PR39402. The crash was caused when dereferencing nullptr in DumpObject and printArchiveChild. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D53690 Patch by Xing GUO llvm-svn: 345503	2018-10-29 14:17:08 +00:00
James Henderson	b55b6587a5	[llvm-objdump] Add '--full-contents' as alias for '-s' This fixes PR39404. Reviewed By: jhenderson Patch by Xing Guo Differential Revision: https://reviews.llvm.org/D53576 llvm-svn: 345495	2018-10-29 10:05:39 +00:00
Saleem Abdulrasool	ec77a6517f	Revert "Revert "DebugInfo: reduce DIE range verification on object files"" This reverts commit 836c763dadbd9478fa35b1a291a38bf17aa206ba. Default initialize the values that MSAN caught. llvm-svn: 345482	2018-10-28 22:30:48 +00:00
Roman Lebedev	a5baf86744	AMD BdVer2 (Piledriver) Initial Scheduler model Summary: # Overview This is somewhat partial. * Latencies are good {F7371125} * All of these remaining inconsistencies //appear// to be noise/noisy/flaky. * NumMicroOps are somewhat good {F7371158} * Most of the remaining inconsistencies are from `Ld` / `Ld_ReadAfterLd` classes * Actual unit occupation (pipes, `ResourceCycles`) are undiscovered lands, i did not really look there. They are basically verbatum copy from `btver2` * Many `InstRW`. And there are still inconsistencies left... To be noted: I think this is the first new schedule profile produced with the new next-gen tools like llvm-exegesis! # Benchmark I realize that isn't what was suggested, but i'll start with some "internal" public real-world benchmark i understand - [[ https://github.com/darktable-org/rawspeed \| RawSpeed raw image decoding library ]]. Diff (the exact clang from trunk without/with this patch): ``` Comparing /home/lebedevri/rawspeed/build-old/src/utilities/rsbench/rsbench to /home/lebedevri/rawspeed/build-new/src/utilities/rsbench/rsbench Benchmark Time CPU Time Old Time New CPU Old CPU New ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Canon/EOS 5D Mark II/09.canon.sraw1.cr2/threads:8/real_time_pvalue 0.0000 0.0000 U Test, Repetitions: 25 vs 25 Canon/EOS 5D Mark II/09.canon.sraw1.cr2/threads:8/real_time_mean -0.0607 -0.0604 234 219 233 219 Canon/EOS 5D Mark II/09.canon.sraw1.cr2/threads:8/real_time_median -0.0630 -0.0626 233 219 233 219 Canon/EOS 5D Mark II/09.canon.sraw1.cr2/threads:8/real_time_stddev +0.2581 +0.2587 1 2 1 2 Canon/EOS 5D Mark II/10.canon.sraw2.cr2/threads:8/real_time_pvalue 0.0000 0.0000 U Test, Repetitions: 25 vs 25 Canon/EOS 5D Mark II/10.canon.sraw2.cr2/threads:8/real_time_mean -0.0770 -0.0767 144 133 144 133 Canon/EOS 5D Mark II/10.canon.sraw2.cr2/threads:8/real_time_median -0.0767 -0.0763 144 133 144 133 Canon/EOS 5D Mark II/10.canon.sraw2.cr2/threads:8/real_time_stddev -0.4170 -0.4156 1 0 1 0 Canon/EOS 5DS/2K4A9927.CR2/threads:8/real_time_pvalue 0.0000 0.0000 U Test, Repetitions: 25 vs 25 Canon/EOS 5DS/2K4A9927.CR2/threads:8/real_time_mean -0.0271 -0.0270 463 450 463 450 Canon/EOS 5DS/2K4A9927.CR2/threads:8/real_time_median -0.0093 -0.0093 453 449 453 449 Canon/EOS 5DS/2K4A9927.CR2/threads:8/real_time_stddev -0.7280 -0.7280 13 4 13 4 Canon/EOS 5DS/2K4A9928.CR2/threads:8/real_time_pvalue 0.0004 0.0004 U Test, Repetitions: 25 vs 25 Canon/EOS 5DS/2K4A9928.CR2/threads:8/real_time_mean -0.0065 -0.0065 569 565 569 565 Canon/EOS 5DS/2K4A9928.CR2/threads:8/real_time_median -0.0077 -0.0077 569 564 569 564 Canon/EOS 5DS/2K4A9928.CR2/threads:8/real_time_stddev +1.0077 +1.0068 2 5 2 5 Canon/EOS 5DS/2K4A9929.CR2/threads:8/real_time_pvalue 0.0220 0.0199 U Test, Repetitions: 25 vs 25 Canon/EOS 5DS/2K4A9929.CR2/threads:8/real_time_mean +0.0006 +0.0007 312 312 312 312 Canon/EOS 5DS/2K4A9929.CR2/threads:8/real_time_median +0.0031 +0.0032 311 312 311 312 Canon/EOS 5DS/2K4A9929.CR2/threads:8/real_time_stddev -0.7069 -0.7072 4 1 4 1 Canon/EOS 10D/CRW_7673.CRW/threads:8/real_time_pvalue 0.0004 0.0004 U Test, Repetitions: 25 vs 25 Canon/EOS 10D/CRW_7673.CRW/threads:8/real_time_mean -0.0015 -0.0015 141 141 141 141 Canon/EOS 10D/CRW_7673.CRW/threads:8/real_time_median -0.0010 -0.0011 141 141 141 141 Canon/EOS 10D/CRW_7673.CRW/threads:8/real_time_stddev -0.1486 -0.1456 0 0 0 0 Canon/EOS 40D/_MG_0154.CR2/threads:8/real_time_pvalue 0.6139 0.8766 U Test, Repetitions: 25 vs 25 Canon/EOS 40D/_MG_0154.CR2/threads:8/real_time_mean -0.0008 -0.0005 60 60 60 60 Canon/EOS 40D/_MG_0154.CR2/threads:8/real_time_median -0.0006 -0.0002 60 60 60 60 Canon/EOS 40D/_MG_0154.CR2/threads:8/real_time_stddev -0.1467 -0.1390 0 0 0 0 Canon/EOS 77D/IMG_4049.CR2/threads:8/real_time_pvalue 0.0137 0.0137 U Test, Repetitions: 25 vs 25 Canon/EOS 77D/IMG_4049.CR2/threads:8/real_time_mean +0.0002 +0.0002 275 275 275 275 Canon/EOS 77D/IMG_4049.CR2/threads:8/real_time_median -0.0015 -0.0014 275 275 275 275 Canon/EOS 77D/IMG_4049.CR2/threads:8/real_time_stddev +3.3687 +3.3587 0 2 0 2 Canon/PowerShot G1/crw_1693.crw/threads:8/real_time_pvalue 0.4041 0.3933 U Test, Repetitions: 25 vs 25 Canon/PowerShot G1/crw_1693.crw/threads:8/real_time_mean +0.0004 +0.0004 67 67 67 67 Canon/PowerShot G1/crw_1693.crw/threads:8/real_time_median -0.0000 -0.0000 67 67 67 67 Canon/PowerShot G1/crw_1693.crw/threads:8/real_time_stddev +0.1947 +0.1995 0 0 0 0 Fujifilm/GFX 50S/20170525_0037TEST.RAF/threads:8/real_time_pvalue 0.0074 0.0001 U Test, Repetitions: 25 vs 25 Fujifilm/GFX 50S/20170525_0037TEST.RAF/threads:8/real_time_mean -0.0092 +0.0074 547 542 25 25 Fujifilm/GFX 50S/20170525_0037TEST.RAF/threads:8/real_time_median -0.0054 +0.0115 544 541 25 25 Fujifilm/GFX 50S/20170525_0037TEST.RAF/threads:8/real_time_stddev -0.4086 -0.3486 8 5 0 0 Fujifilm/X-Pro2/_DSF3051.RAF/threads:8/real_time_pvalue 0.3320 0.0000 U Test, Repetitions: 25 vs 25 Fujifilm/X-Pro2/_DSF3051.RAF/threads:8/real_time_mean +0.0015 +0.0204 218 218 12 12 Fujifilm/X-Pro2/_DSF3051.RAF/threads:8/real_time_median +0.0001 +0.0203 218 218 12 12 Fujifilm/X-Pro2/_DSF3051.RAF/threads:8/real_time_stddev +0.2259 +0.2023 1 1 0 0 GoPro/HERO6 Black/GOPR9172.GPR/threads:8/real_time_pvalue 0.0000 0.0001 U Test, Repetitions: 25 vs 25 GoPro/HERO6 Black/GOPR9172.GPR/threads:8/real_time_mean -0.0209 -0.0179 96 94 90 88 GoPro/HERO6 Black/GOPR9172.GPR/threads:8/real_time_median -0.0182 -0.0155 95 93 90 88 GoPro/HERO6 Black/GOPR9172.GPR/threads:8/real_time_stddev -0.6164 -0.2703 2 1 2 1 Kodak/DCS Pro 14nx/D7465857.DCR/threads:8/real_time_pvalue 0.0000 0.0000 U Test, Repetitions: 25 vs 25 Kodak/DCS Pro 14nx/D7465857.DCR/threads:8/real_time_mean -0.0098 -0.0098 176 175 176 175 Kodak/DCS Pro 14nx/D7465857.DCR/threads:8/real_time_median -0.0126 -0.0126 176 174 176 174 Kodak/DCS Pro 14nx/D7465857.DCR/threads:8/real_time_stddev +6.9789 +6.9157 0 2 0 2 Nikon/D850/Nikon-D850-14bit-lossless-compressed.NEF/threads:8/real_time_pvalue 0.0000 0.0000 U Test, Repetitions: 25 vs 25 Nikon/D850/Nikon-D850-14bit-lossless-compressed.NEF/threads:8/real_time_mean -0.0237 -0.0238 474 463 474 463 Nikon/D850/Nikon-D850-14bit-lossless-compressed.NEF/threads:8/real_time_median -0.0267 -0.0267 473 461 473 461 Nikon/D850/Nikon-D850-14bit-lossless-compressed.NEF/threads:8/real_time_stddev +0.7179 +0.7178 3 5 3 5 Olympus/E-M1MarkII/Olympus_EM1mk2__HIRES_50MP.ORF/threads:8/real_time_pvalue 0.6837 0.6554 U Test, Repetitions: 25 vs 25 Olympus/E-M1MarkII/Olympus_EM1mk2__HIRES_50MP.ORF/threads:8/real_time_mean -0.0014 -0.0013 1375 1373 1375 1373 Olympus/E-M1MarkII/Olympus_EM1mk2__HIRES_50MP.ORF/threads:8/real_time_median +0.0018 +0.0019 1371 1374 1371 1374 Olympus/E-M1MarkII/Olympus_EM1mk2__HIRES_50MP.ORF/threads:8/real_time_stddev -0.7457 -0.7382 11 3 10 3 Panasonic/DC-G9/P1000476.RW2/threads:8/real_time_pvalue 0.0000 0.0000 U Test, Repetitions: 25 vs 25 Panasonic/DC-G9/P1000476.RW2/threads:8/real_time_mean -0.0080 -0.0289 22 22 10 10 Panasonic/DC-G9/P1000476.RW2/threads:8/real_time_median -0.0070 -0.0287 22 22 10 10 Panasonic/DC-G9/P1000476.RW2/threads:8/real_time_stddev +1.0977 +0.6614 0 0 0 0 Panasonic/DC-GH5/_T012014.RW2/threads:8/real_time_pvalue 0.0000 0.0000 U Test, Repetitions: 25 vs 25 Panasonic/DC-GH5/_T012014.RW2/threads:8/real_time_mean +0.0132 +0.0967 35 36 10 11 Panasonic/DC-GH5/_T012014.RW2/threads:8/real_time_median +0.0132 +0.0956 35 36 10 11 Panasonic/DC-GH5/_T012014.RW2/threads:8/real_time_stddev -0.0407 -0.1695 0 0 0 0 Panasonic/DC-GH5S/P1022085.RW2/threads:8/real_time_pvalue 0.0000 0.0000 U Test, Repetitions: 25 vs 25 Panasonic/DC-GH5S/P1022085.RW2/threads:8/real_time_mean +0.0331 +0.1307 13 13 6 6 Panasonic/DC-GH5S/P1022085.RW2/threads:8/real_time_median +0.0430 +0.1373 12 13 6 6 Panasonic/DC-GH5S/P1022085.RW2/threads:8/real_time_stddev -0.9006 -0.8847 1 0 0 0 Pentax/645Z/IMGP2837.PEF/threads:8/real_time_pvalue 0.0016 0.0010 U Test, Repetitions: 25 vs 25 Pentax/645Z/IMGP2837.PEF/threads:8/real_time_mean -0.0023 -0.0024 395 394 395 394 Pentax/645Z/IMGP2837.PEF/threads:8/real_time_median -0.0029 -0.0030 395 394 395 393 Pentax/645Z/IMGP2837.PEF/threads:8/real_time_stddev -0.0275 -0.0375 1 1 1 1 Phase One/P65/CF027310.IIQ/threads:8/real_time_pvalue 0.0232 0.0000 U Test, Repetitions: 25 vs 25 Phase One/P65/CF027310.IIQ/threads:8/real_time_mean -0.0047 +0.0039 114 113 28 28 Phase One/P65/CF027310.IIQ/threads:8/real_time_median -0.0050 +0.0037 114 113 28 28 Phase One/P65/CF027310.IIQ/threads:8/real_time_stddev -0.0599 -0.2683 1 1 0 0 Samsung/NX1/2016-07-23-142101_sam_9364.srw/threads:8/real_time_pvalue 0.0000 0.0000 U Test, Repetitions: 25 vs 25 Samsung/NX1/2016-07-23-142101_sam_9364.srw/threads:8/real_time_mean +0.0206 +0.0207 405 414 405 414 Samsung/NX1/2016-07-23-142101_sam_9364.srw/threads:8/real_time_median +0.0204 +0.0205 405 414 405 414 Samsung/NX1/2016-07-23-142101_sam_9364.srw/threads:8/real_time_stddev +0.2155 +0.2212 1 1 1 1 Samsung/NX30/2015-03-07-163604_sam_7204.srw/threads:8/real_time_pvalue 0.0000 0.0000 U Test, Repetitions: 25 vs 25 Samsung/NX30/2015-03-07-163604_sam_7204.srw/threads:8/real_time_mean -0.0109 -0.0108 147 145 147 145 Samsung/NX30/2015-03-07-163604_sam_7204.srw/threads:8/real_time_median -0.0104 -0.0103 147 145 147 145 Samsung/NX30/2015-03-07-163604_sam_7204.srw/threads:8/real_time_stddev -0.4919 -0.4800 0 0 0 0 Samsung/NX3000/_3184416.SRW/threads:8/real_time_pvalue 0.0000 0.0000 U Test, Repetitions: 25 vs 25 Samsung/NX3000/_3184416.SRW/threads:8/real_time_mean -0.0149 -0.0147 220 217 220 217 Samsung/NX3000/_3184416.SRW/threads:8/real_time_median -0.0173 -0.0169 221 217 220 217 Samsung/NX3000/_3184416.SRW/threads:8/real_time_stddev +1.0337 +1.0341 1 3 1 3 Sony/DSLR-A350/DSC05472.ARW/threads:8/real_time_pvalue 0.0001 0.0001 U Test, Repetitions: 25 vs 25 Sony/DSLR-A350/DSC05472.ARW/threads:8/real_time_mean -0.0019 -0.0019 194 193 194 193 Sony/DSLR-A350/DSC05472.ARW/threads:8/real_time_median -0.0021 -0.0021 194 193 194 193 Sony/DSLR-A350/DSC05472.ARW/threads:8/real_time_stddev -0.4441 -0.4282 0 0 0 0 Sony/ILCE-7RM2/14-bit-compressed.ARW/threads:8/real_time_pvalue 0.0000 0.4263 U Test, Repetitions: 25 vs 25 Sony/ILCE-7RM2/14-bit-compressed.ARW/threads:8/real_time_mean +0.0258 -0.0006 81 83 19 19 Sony/ILCE-7RM2/14-bit-compressed.ARW/threads:8/real_time_median +0.0235 -0.0011 81 82 19 19 Sony/ILCE-7RM2/14-bit-compressed.ARW/threads:8/real_time_stddev +0.1634 +0.1070 1 1 0 0 ``` {F7443905} If we look at the `_mean`s, the time column, the biggest win is `-7.7%` (`Canon/EOS 5D Mark II/10.canon.sraw2.cr2`), and the biggest loose is `+3.3%` (`Panasonic/DC-GH5S/P1022085.RW2`); Overall: mean `-0.7436%`, median `-0.23%`, `cbrt(sum(time^3))` = `-8.73%` Looks good so far i'd say. llvm-exegesis details: {F7371117} {F7371125} {F7371128} {F7371144} {F7371158} Reviewers: craig.topper, RKSimon, andreadb, courbet, avt77, spatel, GGanesh Reviewed By: andreadb Subscribers: javed.absar, gbedwell, jfb, llvm-commits Differential Revision: https://reviews.llvm.org/D52779 llvm-svn: 345463	2018-10-27 20:46:30 +00:00
Roman Lebedev	a51921877a	[NFC][X86] Baseline tests for AMD BdVer2 (Piledriver) Scheduler model Adding the baseline tests in a preparatory NFC commit, so that the actual commit shows the diff. Yes, i'm aware that a few of these codegen-based sched tests are testing wrong instructions, i will fix that afterwards. For https://reviews.llvm.org/D52779 llvm-svn: 345462	2018-10-27 20:36:11 +00:00
Vlad Tsyrklevich	50d2683a00	Revert "DebugInfo: reduce DIE range verification on object files" This reverts commits r345441 and r345444, they were causing msan buildbot failures. llvm-svn: 345457	2018-10-27 17:39:13 +00:00
Saleem Abdulrasool	c8e781c511	test: add missing -triple Ensure that the test builds for x86_64 as it is an assembly test. This should repair the buildbots. llvm-svn: 345444	2018-10-27 02:27:38 +00:00
Saleem Abdulrasool	b342446fe0	DebugInfo: reduce DIE range verification on object files Relocatable content may have overlapping ranges until the sections are finalized. This reduces the amount of verification that is done on an object file so that invalid errors are not raised. llvm-svn: 345441	2018-10-27 00:49:33 +00:00
Owen Reynolds	c443e7ef55	[llvm-ar] Access ADDLIB in llvm-ar via command line ADDLIB is called to add the contents of an archive to another archive. Previously this was only accessible through the use of an MRI script. With the use of a new "L" modifier, archive files can treated in the manner above when using quick append. llvm-svn: 345383	2018-10-26 13:34:38 +00:00
George Rimar	581fc63dc0	[llvm-dwarfdump] - Fix incorrect parsing of the DW_LLE_startx_length As was already mentioned in comments for D53364, DWARF 5 spec says about DW_LLE_startx_length: "This is a form of bounded location description that has two unsigned ULEB operands. The first value is an address index (into the .debug_addr section) that indicates the beginning of the address range over which the location is valid. The second value is the length of the range. ") Currently, the length is always parsed as U32. Patch change the behavior to parse DW_LLE_startx_length as ULEB128 for DWARF 5 and keeps it as U32 for DWARF4+(pre-DWARF5) for compatibility. Differential revision: https://reviews.llvm.org/D53564 llvm-svn: 345254	2018-10-25 10:56:44 +00:00
Simon Atanasyan	1993254509	[llvm-readobj] Print ELF header flags names in GNU output GNU readelf tool prints hex value of the ELF header flags field and the flags names. This change adds the same functionality to llvm-readobj. Now llvm-readobj can print MIPS and RISCV flags. New GNUStyle::printFlags() method is a copy of ScopedPrinter::printFlags() routine. Probably we can escape code duplication and / or simplify the printFlags() method. But it's a task for separate commit. Differential revision: https://reviews.llvm.org/D52027 llvm-svn: 345238	2018-10-25 05:39:27 +00:00
Paul Robinson	73766ccfda	Make llvm-dwarfdump -name work on type units. Differential Revision: https://reviews.llvm.org/D53672 llvm-svn: 345203	2018-10-24 21:51:55 +00:00
Reid Kleckner	953bdce68d	[MC] Separate masm integer literal lexer support from inline asm Summary: This renames the IsParsingMSInlineAsm member variable of AsmLexer to LexMasmIntegers and moves it up to MCAsmLexer. This is the only behavior controlled by that variable. I added a public setter, so that it can be set from outside or from the llvm-mc command line. We may need to arrange things so that users can get this behavior from clang, but that's future work. I also put additional hex literal lexing functionality under this flag to fix PR32973. It appears that this hex literal parsing wasn't intended to be enabled in non-masm-style blocks. Now, masm integers (0b1101 and 0ABCh) work in __asm blocks from clang, but 0b label references work when using .intel_syntax in standalone .s files. However, 0b label references will not work from __asm blocks in clang. They will work from GCC inline asm blocks, which it sounds like is important for Crypto++ as mentioned in PR36144. Essentially, we only lex masm literals for inline asm blobs that use intel syntax. If the .intel_syntax directive is used inside a gnu-style inline asm statement, masm literals will not be lexed, which is compatible with gas and llvm-mc standalone .s assembly. This fixes PR36144 and PR32973. Reviewers: Gerolf, avt77 Subscribers: eraman, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D53535 llvm-svn: 345189	2018-10-24 20:23:57 +00:00
Teresa Johnson	c8dba682bb	[hot-cold-split] Name split functions with ".cold" suffix Summary: The current default of appending "_"+entry block label to the new extracted cold function breaks demangling. Change the deliminator from "_" to "." to enable demangling. Because the header block label will be empty for release compile code, use "extracted" after the "." when the label is empty. Additionally, add a mechanism for the client to pass in an alternate suffix applied after the ".", and have the hot cold split pass use "cold."+Count, where the Count is currently 1 but can be used to uniquely number multiple cold functions split out from the same function with D53588. Reviewers: sebpop, hiraditya Subscribers: llvm-commits, erik.pilkington Differential Revision: https://reviews.llvm.org/D53534 llvm-svn: 345178	2018-10-24 18:53:47 +00:00
James Henderson	5b2e968264	Fix llvm-strings crash for negative char values On Windows at least, llvm-strings was crashing if it encountered bytes that mapped to negative chars, as it was passing these into std::isgraph and std::isblank functions, resulting in undefined behaviour. On debug builds using MSVC, these functions verfiy that the value passed in is representable as an unsigned char. Since the char is promoted to an int, a value greater than 127 would turn into a negative integer value, and fail the check. Using the llvm::isPrint function is sufficient to solve the issue. Reviewed by: ruiu, mstorsjo Differential Revision: https://reviews.llvm.org/D53509 llvm-svn: 345137	2018-10-24 13:16:16 +00:00
Andrea Di Biagio	083addf751	[llvm-mca] [llvm-mca] Improved error handling and error reporting from class InstrBuilder. A new class named InstructionError has been added to Support.h in order to improve the error reporting from class InstrBuilder. The llvm-mca driver is responsible for handling InstructionError objects, and printing them out to stderr. The goal of this patch is to remove all the remaining error handling logic from the library code. In particular, this allows us to: - Simplify the logic in InstrBuilder by removing a needless dependency from MCInstrPrinter. - Centralize all the error halding logic in a new function named 'runPipeline' (see llvm-mca.cpp). This is also a first step towards generalizing class InstrBuilder, so that in future, we will be able to reuse its logic to also "lower" MachineInstr to mca::Instruction objects. Differential Revision: https://reviews.llvm.org/D53585 llvm-svn: 345129	2018-10-24 10:56:47 +00:00
Sanjin Sijaric	cd41638292	[ARM64][Windows] Add unwind support to llvm-readobj This patch adds support for dumping the unwind info from ARM64 COFF object files. Differential Revision: https://reviews.llvm.org/D53264 llvm-svn: 345108	2018-10-24 00:03:34 +00:00
Jonas Devlieghere	3ef53e10d3	[dwarfdump] Make incompatibility between -diff and -verbose explicit. Using -diff and -verbose together doesn't work today. We should audit where these two options interact and fix them. In the meantime we error out when the user try to specify both. llvm-svn: 345084	2018-10-23 21:51:44 +00:00
Jordan Rupprecht	aaeaa0a8b3	[llvm-strip] Support -s alias for --strip-all. Make both strip and objcopy case sensitive to support both -s (--strip-all) and -S (--strip-debug). Summary: GNU strip supports both `-s` and `-S` as aliases for `--strip-all` and `--strip-debug`, respectfully. As part of this, it turns out that strip/objcopy were accepting case insensitive command line args. I'm not sure if there was an explicit reason for this. The only others uses of this are llvm-cvtres/llvm-mt/llvm-lib, which are all tools specific for windows support. Forcing case sensitivity allows both aliases to exist, but seems like a good idea anyway. And as a surprise test case adjustment, the llvm-strip unit test was running with `-keep=unavailable_symbol`, despite `keep` not be a valid flag for strip. This is because there is a flag `-K` which, when case insensitivity is permitted, allows it to be interpreted as `-K` = `eep=unavailable_symbol` (e.g. to allow `-Kfoo` == `--keep-symbol=foo`). Reviewers: jakehehrlich, jhenderson, alexshap Reviewed By: jakehehrlich Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D53163 llvm-svn: 345068	2018-10-23 18:46:33 +00:00
David Blaikie	59ac206433	llvm-dwarfdump: Support RLE_addressx and RLE_startx_length in .debug_rnglists llvm-svn: 344835	2018-10-20 06:16:25 +00:00
Petar Jovanovic	8d947bad09	[llvm-objdump] Fix --file-headers (-f) option Changed the format call to match the surrounding code. Previously it was printing an unsigned int while the return type being printed was long unsigned int or wider. This caused problems for big-endian systems which were discovered on mips64. Also, the printed address had less characters than it should because the character count was directly obtained from the number of bytes in the address. The tests were adapted to fit this fix and now use longer addresses. Patch by Milos Stojanovic. Differential Revision: https://reviews.llvm.org/D53403 llvm-svn: 344818	2018-10-19 22:16:49 +00:00
Fedor Sergeev	bd6b2138b9	[NewPM] teach -passes= to emit meaningful error messages All the PassBuilder::parse interfaces now return descriptive StringError instead of a plain bool. It allows to make -passes/aa-pipeline parsing errors context-specific and thus less confusing. TODO: ideally we should also make suggestions for misspelled pass names, but that requires some extensions to PassBuilder. Reviewed By: philip.pfaffe, chandlerc Differential Revision: https://reviews.llvm.org/D53246 llvm-svn: 344685	2018-10-17 10:36:23 +00:00
Simon Pilgrim	7d27cfdcb2	[X86] Fix Skylake ReadAfterLd for PADDrm etc. Missed in rL343868 as due to their custom InstrRW. llvm-svn: 344600	2018-10-16 09:50:16 +00:00
Jordan Rupprecht	6fb774e223	[llvm-objcopy] NFC: update TODO test comment llvm-svn: 344550	2018-10-15 20:15:58 +00:00
Fedor Sergeev	a01be0f217	Revert "[NewPM] teach -passes= to emit meaningful error messages" This reverts r344519 due to failures in pipeline-parsing test. llvm-svn: 344524	2018-10-15 15:36:08 +00:00
Fedor Sergeev	4155a77e98	[NewPM] teach -passes= to emit meaningful error messages Summary: All the PassBuilder::parse interfaces now return descriptive StringError instead of a plain bool. It allows to make -passes/aa-pipeline parsing errors context-specific and thus less confusing. TODO: ideally we should also make suggestions for misspelled pass names, but that requires some extensions to PassBuilder. Reviewed By: philip.pfaffe, chandlerc Differential Revision: https://reviews.llvm.org/D53246 llvm-svn: 344519	2018-10-15 15:00:18 +00:00
Andrea Di Biagio	6eebbe0a97	[tblgen][llvm-mca] Add the ability to describe move elimination candidates via tablegen. This patch adds the ability to identify instructions that are "move elimination candidates". It also allows scheduling models to describe processor register files that allow move elimination. A move elimination candidate is an instruction that can be eliminated at register renaming stage. Each subtarget can specify which instructions are move elimination candidates with the help of tablegen class "IsOptimizableRegisterMove" (see llvm/Target/TargetInstrPredicate.td). For example, on X86, BtVer2 allows both GPR and MMX/SSE moves to be eliminated. The definition of 'IsOptimizableRegisterMove' for BtVer2 looks like this: ``` def : IsOptimizableRegisterMove<[ InstructionEquivalenceClass<[ // GPR variants. MOV32rr, MOV64rr, // MMX variants. MMX_MOVQ64rr, // SSE variants. MOVAPSrr, MOVUPSrr, MOVAPDrr, MOVUPDrr, MOVDQArr, MOVDQUrr, // AVX variants. VMOVAPSrr, VMOVUPSrr, VMOVAPDrr, VMOVUPDrr, VMOVDQArr, VMOVDQUrr ], CheckNot<CheckSameRegOperand<0, 1>> > ]>; ``` Definitions of IsOptimizableRegisterMove from processor models of a same Target are processed by the SubtargetEmitter to auto-generate a target-specific override for each of the following predicate methods: ``` bool TargetSubtargetInfo::isOptimizableRegisterMove(const MachineInstr *MI) const; bool MCInstrAnalysis::isOptimizableRegisterMove(const MCInst &MI, unsigned CPUID) const; ``` By default, those methods return false (i.e. conservatively assume that there are no move elimination candidates). Tablegen class RegisterFile has been extended with the following information: - The set of register classes that allow move elimination. - Maxium number of moves that can be eliminated every cycle. - Whether move elimination is restricted to moves from registers that are known to be zero. This patch is structured in three part: A first part (which is mostly boilerplate) adds the new 'isOptimizableRegisterMove' target hooks, and extends existing register file descriptors in MC by introducing new fields to describe properties related to move elimination. A second part, uses the new tablegen constructs to describe move elimination in the BtVer2 scheduling model. A third part, teaches llm-mca how to query the new 'isOptimizableRegisterMove' hook to mark instructions that are candidates for move elimination. It also teaches class RegisterFile how to describe constraints on move elimination at PRF granularity. llvm-mca tests for btver2 show differences before/after this patch. Differential Revision: https://reviews.llvm.org/D53134 llvm-svn: 344334	2018-10-12 11:23:04 +00:00
Jordan Rupprecht	bb4588e9c1	[llvm-objcopy] Add -F\|--target compatibility Summary: This change adds support for the GNU --target flag, which sets both --input-target and --output-target. GNU objcopy doesn't do any checking for whether both --target and --{input,output}-target are used, and so it allows both, e.g. "--target A --output-target B" is equivalent to "--input-target A --output-target B" since the later command line flag would override earlier ones. This may be error prone, so I chose to implement it as an error if both are used. I'm not sure if anyone is actually using both. Reviewers: jakehehrlich, jhenderson, alexshap Reviewed By: jakehehrlich, alexshap Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D53029 llvm-svn: 344321	2018-10-12 00:36:01 +00:00
Aaron Smith	c5cd5911ec	[llvm-pdbutil] Add missing pdb for test llvm-svn: 344306	2018-10-11 22:25:55 +00:00
Aaron Smith	c66838aee9	[llvm-pdbutil] Pretty print PDBSymbolUsingNamespace symbols Reviewers: rnk, zturner, llvm-commits Differential Revision: https://reviews.llvm.org/D52799 llvm-svn: 344298	2018-10-11 21:37:18 +00:00
Andrea Di Biagio	6a0b319549	[llvm-mca][BtVer2] Add tests for optimizable GPR register moves. NFC llvm-svn: 344253	2018-10-11 14:54:54 +00:00
Martin Storsjo	5239041338	[llvm-nm] Include the text "@FILE" in the output of --help libtool requires this text to be present, in order to conclude that the tool supports response files. Also add an explicit test of using response files with llvm-nm. Differential Revision: https://reviews.llvm.org/D53064 llvm-svn: 344222	2018-10-11 06:53:38 +00:00
Craig Topper	df17244ffa	Change the timestamp of llvmcache-foo file to meet the thinLTO prune policy The case will randomly fail if we test it with command " while llvm-lit test/tools/gold/X86/cache.ll ; do true; done". It is because the llvmcache-foo file is younger than llvmcache-349F039B8EB076D412007D82778442BED3148C4E and llvmcache-A8107945C65C2B2BBEE8E61AA604C311D60D58D6. But due to timestamp precision reason their timestamp is the same. Given the same timestamp, the file prune policy is to remove bigger size file first, so mostly foo file is removed for its bigger size. And the files size is under threshold after deleting foo file. That's what test case expect. However sometimes, the precision is enough to measure that timestamp of llvmcache-349F039B8EB076D412007D82778442BED3148C4E and llvmcache-A8107945C65C2B2BBEE8E61AA604C311D60D58D6 are smaller than foo, so llvmcache-349F039B8EB076D412007D82778442BED3148C4E and llvmcache-A8107945C65C2B2BBEE8E61AA604C311D60D58D6 are deleted first. Since the files size is still above the file size threshold after deleting the 2 files, the foo file is also deleted. And then the test case fails, because it expect only one file should be deleted instead of 3. The fix is to change the timestamp of llvmcache-foo file to meet the thinLTO prune policy. Patch by Luo Yuanke. Differential Revision: https://reviews.llvm.org/D52452 llvm-svn: 344158	2018-10-10 17:37:32 +00:00
Andrea Di Biagio	1b29ec6531	[llvm-mca][BtVer2] Add two more move-elimination tests. NFC These should test all the optimizable moves on Jaguar. A follow-up patch will teach how to recognize these optimizable register moves. llvm-svn: 344144	2018-10-10 14:46:54 +00:00
John Brawn	c616a7236c	[llvm-exegesis] Fix function return generation so it doesn't return register 0 When fillMachineFunction generates a return on targets without a return opcode (such as AArch64) it should pass an empty set of registers as the return registers, not 0 which means register number zero. Differential Revision: https://reviews.llvm.org/D53074 llvm-svn: 344139	2018-10-10 13:03:23 +00:00
Fangrui Song	88478bbc60	[opt] Change the parameter of OptTable::PrintHelp from Name to Usage and don't append "[options] <inputs>" Summary: Before, "[options] <inputs>" is unconditionally appended to the `Name` parameter. It is more flexible to change its semantic to `Usage` and let user customize the usage line. % llvm-objcopy ... USAGE: llvm-objcopy <input> [ <output> ] [options] <inputs> With this patch: % llvm-objcopy ... USAGE: llvm-objcopy input [output] Reviewers: rupprecht, alexshap, jhenderson Reviewed By: rupprecht Subscribers: jakehehrlich, mehdi_amini, steven_wu, dexonsmith, llvm-commits Differential Revision: https://reviews.llvm.org/D51009 llvm-svn: 344097	2018-10-10 00:15:31 +00:00
Jake Ehrlich	5e49846ca6	[llvm-objcopy] Make -S an alias for --strip-all -S should be an alias for --strip-all not --strip-all-gnu llvm-svn: 344080	2018-10-09 21:14:09 +00:00
Adrian Prantl	8aa69e9927	llvm-dwarfdump: Extend --name to also search DW_AT_linkage_name. rdar://problem/45132695 llvm-svn: 344079	2018-10-09 20:51:33 +00:00
Wolfgang Pieb	a9ea9c5034	[DWARF] Make llvm-dwarfdump display the .debug_loc.dwo section. Fixes PR38991. Reviewer: dblaikie Differential Revision: https://reviews.llvm.org/D52444 llvm-svn: 344068	2018-10-09 18:38:55 +00:00
Jordan Rupprecht	34c0e470ae	[llvm-ar] Use POSIX-specified timestamps for 'tv'. Summary: The POSIX spec says: ``` If the −t option is used with the −v option, the standard output format shall be: "%s %u/%u %u %s %d %d:%d %d %s\n", <member mode>, <user ID>, <group ID>, <number of bytes in member>, <abbreviated month>, <day-of-month>, <hour>, <minute>, <year>, <file> where: ... <abbreviated month> Equivalent to the format of the %b conversion specification format in date. <day-of-month> Equivalent to the format of the %e conversion specification format in date. <hour> Equivalent to the format of the %H conversion specification format in date. <minute> Equivalent to the format of the %M conversion specification format in date. <year> Equivalent to the format of the %Y conversion specification format in date. ``` This actually used to be the format printed by llvm-ar. It was apparently accidentally changed (see r207385 followed by comments in r207387). This makes it conform to GNU ar for easier replacement. Reviewers: MaskRay Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52940 llvm-svn: 343901	2018-10-05 23:25:39 +00:00
Petr Hosek	227f25420a	[llvm-nm] Update all tests to redirect stderr to stdout This addresses the breakage introduced in r343887. llvm-svn: 343896	2018-10-05 22:16:37 +00:00
Petr Hosek	62f6462bf9	[llvm-nm] Write "no symbol" output to stderr This matches the output of binutils' nm and ensures that any scripts or tools that use nm and expect empty output in case there no symbols don't break. Differential Revision: https://reviews.llvm.org/D52943 llvm-svn: 343887	2018-10-05 21:10:03 +00:00
Vedant Kumar	5931b4e5b5	[DebugInfo] Add support for DWARF5 call site-related attributes DWARF v5 introduces DW_AT_call_all_calls, a subprogram attribute which indicates that all calls (both regular and tail) within the subprogram have call site entries. The information within these call site entries can be used by a debugger to populate backtraces with synthetic tail call frames. Tail calling frames go missing in backtraces because the frame of the caller is reused by the callee. Call site entries allow a debugger to reconstruct a sequence of (tail) calls which led from one function to another. This improves backtrace quality. There are limitations: tail recursion isn't handled, variables within synthetic frames may not survive to be inspected, etc. This approach is not novel, see: https://gcc.gnu.org/wiki/summit2010?action=AttachFile&do=get&target=jelinek.pdf This patch adds an IR-level flag (DIFlagAllCallsDescribed) which lowers to DW_AT_call_all_calls. It adds the minimal amount of DWARF generation support needed to emit standards-compliant call site entries. For easier deployment, when the debugger tuning is LLDB, the DWARF requirement is adjusted to v4. Testing: Apart from check-{llvm, clang}, I built a stage2 RelWithDebInfo clang binary. Its dSYM passed verification and grew by 1.4% compared to the baseline. 151,879 call site entries were added. rdar://42001377 Differential Revision: https://reviews.llvm.org/D49887 llvm-svn: 343883	2018-10-05 20:37:17 +00:00
Simon Pilgrim	f09fc3bc12	[X86] Move ReadAfterLd functionality into X86FoldableSchedWrite (PR36957) Currently we hardcode instructions with ReadAfterLd if the register operands don't need to be available until the folded load has completed. This doesn't take into account the different load latencies of different memory operands (PR36957). This patch adds a ReadAfterFold def into X86FoldableSchedWrite to replace ReadAfterLd, allowing us to specify the load latency at a scheduler class level. I've added ReadAfterVec*Ld classes that match the XMM/Scl, XMM and YMM/ZMM WriteVecLoad classes that we currently use, we can tweak these values in future patches once this infrastructure is in place. Differential Revision: https://reviews.llvm.org/D52886 llvm-svn: 343868	2018-10-05 17:57:29 +00:00
Adrian Prantl	7875142b5c	Format the dwarfdump --statistics version as an integer instead of a string. llvm-svn: 343864	2018-10-05 17:41:30 +00:00
Simon Pilgrim	6ad03ad34b	[llvm-mca][x86] Add PR36951 ReadAfterLd test case llvm-svn: 343795	2018-10-04 16:26:56 +00:00
Greg Bedwell	dee7bfdb9f	[utils] Ensure that update_mca_test_checks.py writes prefixes in alphabetical order llvm-svn: 343783	2018-10-04 14:42:19 +00:00
Simon Pilgrim	82a3b1c687	[llvm-mca][x86] Add tests demonstrating ReadAfterLd delay llvm-svn: 343773	2018-10-04 13:05:42 +00:00
Clement Courbet	217ed1ffff	[llvm-exegesis][NFC] Test sched class names only in !NDEBUG mode. Sched classes have no names in NDEBUG. llvm-svn: 343755	2018-10-04 07:07:16 +00:00
Fangrui Song	5fbdce131d	[llvm-exegesis] Unbreak analysis-uops-variant.test introduced in D52825 A `defined(NDEBUG) && !defined(LLVM_ENABLE_DUMP)` build does not call writeEscaped and there will be no `SBWriteZeroLatency` in the output. llvm-svn: 343751	2018-10-04 03:32:47 +00:00
Jordan Rupprecht	53cb573564	[llvm-nm] Print an explicit "no symbols" message when an object file has no symbols Summary: GNU nm (and other nm implementations, such as "go tool nm") prints an explicit "no symbols" message when an object file has no symbols. Currently llvm-nm just doesn't print anything. Adding an explicit "no symbols" message will allow llvm-nm to be used in place of nm: some scripts and build processes use `nm <file> \| grep "no symbols"` as a test to see if a file has no symbols. It will also be more familiar to anyone used to nm. That said, the format implemented here is slightly different, in that it doesn't print the tool name in the message (which IMHO is not useful to include). Demo: ``` $ for nm in nm bin/llvm-nm ; do echo "nm implementation: $nm"; $nm /tmp/foo{1,2}.o; echo; done nm implementation: nm /tmp/foo1.o: nm: /tmp/foo1.o: no symbols /tmp/foo2.o: 0000000000000000 T foo2 nm implementation: bin/llvm-nm /tmp/foo1.o: no symbols /tmp/foo2.o: 0000000000000000 T foo2 ``` Reviewers: MaskRay Reviewed By: MaskRay Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52810 llvm-svn: 343742	2018-10-03 23:39:49 +00:00
Simon Pilgrim	0b451a2983	[X86][Btver2] Fix MMX PSHUFB schedule Match AMD Fam16h SOG + llvm-exegesis tests llvm-svn: 343701	2018-10-03 18:18:50 +00:00
Andrea Di Biagio	207e0217f9	[llvm-mca] Add support for move elimination in class RegisterFile. This patch teaches class RegisterFile how to analyze register writes from instructions that are move elimination candidates. In particular, it teaches it how to check if a move can be effectively eliminated by the underlying PRF, and (if necessary) how to perform move elimination. The long term goal is to allow processor models to describe instructions that are valid move elimination candidates. The idea is to let register file definitions in tablegen declare if/when moves can be eliminated. This patch is a non functional change. The logic that performs move elimination is currently disabled. A future patch will add support for move elimination in the processor models, and enable this new code path. llvm-svn: 343691	2018-10-03 15:02:44 +00:00
Clement Courbet	d5a39553ff	[llvm-exegesis] Resolve variant classes in analysis. Summary: See PR38884. Reviewers: gchatelet Subscribers: tschuett, RKSimon, llvm-commits Differential Revision: https://reviews.llvm.org/D52825 llvm-svn: 343680	2018-10-03 11:50:25 +00:00
Simon Pilgrim	c68cc4efbe	[X86][Btver2] Most RMW instructions don't require an additional uop Remove uop on WriteRMW and move it into the few instructions that need it. Match AMD Fam16h SOG + llvm-exegesis tests llvm-svn: 343671	2018-10-03 10:28:43 +00:00
Simon Pilgrim	d11015861c	[X86] ALU/ADC RMW instructions should use the WriteRMW sequence class I was expecting this to be a nfc but Silvermont seems to be setup a little differently: // A folded store needs a cycle on MEC_RSV for the store data, but it does not need an extra port cycle to recompute the address. def : WriteRes<WriteRMW, [SLM_MEC_RSV]>; So moving from WriteStore to WriteRMW reduces predicted port pressure, confirmed by @craig.topper that this is correct. Differential Revision: https://reviews.llvm.org/D52740 llvm-svn: 343670	2018-10-03 10:01:13 +00:00
Simon Pilgrim	860cb5c071	[X86][Btver2] Fix BLENDV and AESDEC schedules Match AMD Fam16h SOG + llvm-exegesis tests llvm-svn: 343597	2018-10-02 15:13:18 +00:00
Zachary Turner	a5e3e02602	[PDB] Add support for dumping Typedef records. These work a little differently because they are actually in the globals stream and are treated as symbol records, even though DIA presents them as types. So this also adds the necessary infrastructure to cache records that live somewhere other than the TPI stream as well. llvm-svn: 343507	2018-10-01 17:55:38 +00:00
Simon Pilgrim	e0d2019052	[X86][Btver2] Fix BT(C\|R\|S)mr & BT(C\|R\|S)mi schedule latency + uop counts Match AMD Fam16h SOG + llvm-exegesis tests llvm-svn: 343494	2018-10-01 16:31:30 +00:00
Simon Pilgrim	6ddc4e821c	[X86][Btver2] Fix BTmr schedule uop counts Match AMD Fam16h SOG + llvm-exegesis tests llvm-svn: 343484	2018-10-01 14:42:16 +00:00
Simon Pilgrim	a982236e59	[X86][Btver2] Fix masked load schedule JFPU01 resource usage should match JFPX Match AMD Fam16h SOG + llvm-exegesis tests llvm-svn: 343468	2018-10-01 13:12:05 +00:00
Puyan Lotfi	06e65cae4a	[NFC] Adding "REQUIRES: zlib" to a llvm-objcopy test for bots without zlib. M test/tools/llvm-objcopy/compress-and-decompress-debug-sections-error.test llvm-svn: 343454	2018-10-01 10:50:23 +00:00
Andrea Di Biagio	24ea163007	[X86][BtVer2] Teach how to identify zero-idiom VPERM2F128rr instructions. This patch adds another variant class to identify zero-idiom VPERM2F128rr instructions. On Jaguar, a VPERM wih bit 3 and 7 of the mask set, is a zero-idiom. Differential Revision: https://reviews.llvm.org/D52663 llvm-svn: 343452	2018-10-01 10:35:13 +00:00
Puyan Lotfi	af048648d3	[llvm-objcopy] Adding support for decompressing zlib compressed dwarf sections. Summary: I had added support for compressing dwarf sections in a prior commit, this one adds support for decompressing. Usage is: llvm-objcopy --decompress-debug-sections input.o output.o Reviewers: jakehehrlich, jhenderson, alexshap Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D51841 llvm-svn: 343451	2018-10-01 10:29:41 +00:00
Clement Courbet	a933fb237e	[X86][Sched] Update scheduling information for VZEROALL on HWS, BDW, SKX, SNB. Summary: While looking at PR35606, I found out that the scheduling info is incorrect. One can check that it's really a P5+P6 and not a 2*P56 with: echo -e 'vzeroall\nvandps %xmm1, %xmm2, %xmm3' \| ./bin/llvm-exegesis -mode=uops -snippets-file=- (vandps executes on P5 only) Reviewers: craig.topper, RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52541 llvm-svn: 343447	2018-10-01 08:37:48 +00:00
Simon Pilgrim	f21083870d	[X86] Fix scheduler class for BTmi instructions This wasn't treated as a folded load instruction llvm-svn: 343424	2018-09-30 20:19:16 +00:00
Simon Pilgrim	b1108399bd	[LLVM-MCA][X86] Add missing VCMPESTR/VCMPESTR tests llvm-svn: 343421	2018-09-30 18:19:00 +00:00
Simon Pilgrim	20623f2343	[LLVM-MCA][X86] Add some AVX512 tests These are going to be necessary to check I don't mess up when I start cleaning up all the remaining vector integer overrides llvm-svn: 343414	2018-09-30 17:01:59 +00:00
Simon Pilgrim	4f5693ac8d	[X86][Btver2] Fix PCmpIStrI/PCmpIStrM schedules Missing JFPU0 pipe and double JFPU1 pipe (to match JVALU1) resources Match AMD Fam16h SOG + llvm-exegesis tests llvm-svn: 343413	2018-09-30 16:38:38 +00:00
Zachary Turner	a1e79e326a	Fix some tests on Windows. I don't actually have a Windows machine at the present moment, so hopefully this fixes it. llvm-svn: 343397	2018-09-30 00:22:21 +00:00
Andrea Di Biagio	6e218d0a57	[llvm-mca] Add a test for zero-idiom VPERM2F128rr. NFC We don't correctly model the latency and resource usage information for zero-idiom VPERM2F128rr on Jaguar. This is demonstrated by the incorrect numbers in the resource pressure view, and the timeline view. A follow up patch will fix this problem. llvm-svn: 343346	2018-09-28 17:47:09 +00:00
Luke Cheeseman	10981cc884	Revert r343317 - asan buildbots are breaking and I need to investigate the issue llvm-svn: 343341	2018-09-28 17:01:50 +00:00
Greg Bedwell	becbbe0383	[utils] Stricter checking from update_mca_test_checks.py If any prefixes have been specified on the RUN lines that do not end up ever actually getting printed, raise an Error. This is either an indication that the run lines just need cleaning up, or that something is more fundamentally wrong with the test. Also raise an Error if there are any blocks which cannot be checked because they are not uniquely covered by a prefix. Fixed up a couple of tests where the extra checking flagged up issues. Differential Revision: https://reviews.llvm.org/D48276 llvm-svn: 343332	2018-09-28 15:39:09 +00:00
Greg Bedwell	2f528f8c1e	[utils] Allow better identification of matching blocks in update_mca_test_checks.py Insert empty blocks to cause the positions of matching blocks to match across lists where possible so that later stages of the algorithm can actually identify them as being identical. Regenerated all tests with this change. Differential Revision: https://reviews.llvm.org/D52560 llvm-svn: 343331	2018-09-28 15:38:56 +00:00
Simon Pilgrim	428c1196d8	[X86][Btver2] PSUBS/PSUBUS instructions are zero-idioms Noticed during llvm-exegesis tests, the PSUBS/PSUBUS instructions have the same zero-idiom behaviour to PSUB llvm-svn: 343321	2018-09-28 14:20:42 +00:00
Simon Pilgrim	3216fd3602	[X86][Btver2] Add zero-idiom tests for PSUBS/PSUBUS instructions Noticed during llvm-exegesis tests, the PSUBS/PSUBUS instructions have the same zero-idiom behaviour to PSUB llvm-svn: 343319	2018-09-28 13:53:11 +00:00
Luke Cheeseman	21f2955bb2	Reapply changes reverted by r343235 - Add fix so that all code paths that create DWARFContext with an ObjectFile initialise the target architecture in the context - Add an assert that the Arch is known in the Dwarf CallFrameString method llvm-svn: 343317	2018-09-28 13:37:27 +00:00
Simon Pilgrim	66da1ed29d	[X86][Btver2] CVTSS2I/CVTSD2I - add missing JFPU0 pipe We issue JFPU1->JSTC then JFPU0->JFPA then -> JALU0 (integer pipe) Match AMD Fam16h SOG + llvm-exegesis tests llvm-svn: 343314	2018-09-28 13:19:22 +00:00
Simon Pilgrim	17e5981ebf	[X86][Btver2] Fix BSF/BSR schedule Double throughput to account for 2 pipes + fix BSF's latency/uop counts Match AMD Fam16h SOG + llvm-exegesis tests llvm-svn: 343311	2018-09-28 10:26:48 +00:00
Simon Pilgrim	280af1c7f0	[X86][BtVer2] Fix PHMINPOS schedule resources typo PHMINPOS can run on either JFPU pipe llvm-svn: 343299	2018-09-28 08:21:39 +00:00
Simon Pilgrim	86c7b07ecd	[X86][Btver2] (V)MPSADBW instructions take 3uops not 1 llvm-svn: 343238	2018-09-27 17:13:57 +00:00
Simon Pilgrim	dd744f158a	[X86][Btver2] BTC/BTR/BTS instructions take 2uops not 1 llvm-svn: 343234	2018-09-27 16:39:52 +00:00
Simon Pilgrim	c2a88ea64e	[X86][Btver2] BLSI/BLSMSK/BLSR instructions take 2uops not 1 (same as TZCNT) llvm-svn: 343227	2018-09-27 14:57:57 +00:00
Clement Courbet	30183093ab	[llvm-exegesis] Fix PR39096. Summary: The key is now the resource name, not the resource id. Reviewers: gchatelet Subscribers: tschuett, RKSimon, llvm-commits Differential Revision: https://reviews.llvm.org/D52607 llvm-svn: 343208	2018-09-27 13:26:37 +00:00
Simon Pilgrim	98f503a326	[X86][Btver2] TZCNT instructions take 2uops not 1 llvm-svn: 343200	2018-09-27 12:28:47 +00:00
Clement Courbet	a5720c4e62	[llvm-exgesis][NFC] Do not pollute buildbots with messages when the exegesis lit tests cannot run. llvm-svn: 343110	2018-09-26 13:58:26 +00:00
Clement Courbet	28d4f85824	[llvm-exegesis] Get rid of debug_string. Summary: THis is a backwards-compatible change (existing files will work as expected). See PR39082. Reviewers: gchatelet Subscribers: tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D52546 llvm-svn: 343108	2018-09-26 13:35:10 +00:00
Guillaume Chatelet	345fae5d56	[llvm-exegesis] Serializes registers initial values. Summary: Adds the registers initial values to the YAML output of llvm-exegesis. Reviewers: courbet Subscribers: tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D52460 llvm-svn: 342982	2018-09-25 15:15:54 +00:00
Guillaume Chatelet	6078f82241	[llvm-exegesis] Fix missing document separator in YAML output. Reviewers: courbet Subscribers: tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D52496 llvm-svn: 342981	2018-09-25 14:48:24 +00:00
Clement Courbet	86baebc5fd	[llvm-exegesis] Add lit tests (v2). Summary: This revisits rL342953 by adding detection of host support. Reviewers: gchatelet, lebedev.ri, alexshap Subscribers: mgorny, tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D52464 llvm-svn: 342975	2018-09-25 13:59:35 +00:00
Simon Pilgrim	b56be79e0c	Revert rL342916: [X86] Remove shift/rotate by CL memory (RMW) overrides As suggested by Craig Topper - I'm going to look at cleaning up the RMW sequences instead. The uops are slightly different to the register variant, so requires a +1uop tweak llvm-svn: 342969	2018-09-25 13:01:26 +00:00
Clement Courbet	6d92c198ac	Revert rL342953 "[llvm-exegesis] Add lit tests." We also need to make sure that we're on the right subtarget. llvm-svn: 342955	2018-09-25 09:36:44 +00:00
Clement Courbet	7f1322dc4d	[llvm-exegesis] Add lit tests. Summary: Right now we only have unit tests. This will allow testing the whole tool. Even though We can't really check actual values, this will avoid regressions such as PR39055. Reviewers: gchatelet, alexshap Subscribers: mgorny, tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D52407 llvm-svn: 342953	2018-09-25 09:27:43 +00:00
Simon Pilgrim	0b4ad7596f	[X86] Remove shift/rotate by CL memory (RMW) overrides The uops are slightly different to the register variant, so requires a +1uop tweak llvm-svn: 342916	2018-09-24 20:11:50 +00:00
Simon Pilgrim	00865a48d1	[X86] Split WriteIMul into 8/16/32/64 implementations (PR36931) Split WriteIMul by size and also by IMUL multiply-by-imm and multiply-by-reg cases. This removes all the scheduler overrides for gpr multiplies and stops WriteMULH being ignored for BMI2 MULX instructions. llvm-svn: 342892	2018-09-24 15:21:57 +00:00
Jonas Devlieghere	8a7cfc6c86	[dsymutil] Set LSan blacklist whenever sanitizers are enabled. LSan can be enabled by itself or as part of the address sanitizer. Rather than checking the enabled sanitizers for both, just set the LSan env options whenever a sanitizer is enabled. llvm-svn: 342881	2018-09-24 13:56:36 +00:00
Matt Arsenault	ce5f203415	llvm-diff: Fix crash on anonymous functions Not sure what the correct behavior is for this. Skip them and report how many there were. llvm-svn: 342857	2018-09-24 04:42:13 +00:00
Simon Pilgrim	9202c9fb47	[X86] RORmCL instruction models should match ROLmCL etc. Confirmed with Craig Topper - fix a typo that was missing a Port4 uop for ROR*mCL instructions on some Intel models. Yet another step on the scheduler model cleanup marathon...... llvm-svn: 342846	2018-09-23 19:16:01 +00:00
Simon Pilgrim	19952add7c	[X86] Added missing RCL/RCR schedule overrides to the generic SNB model The SandyBridge model was missing schedule values for the RCL/RCR values - instead using the (incredibly optimistic) WriteShift (now WriteRotate) defaults. I've added overrides with more realistic (slow) values, based on a mixture of Agner/instlatx64 numbers and what later Intel models do as well. This is necessary to allow WriteRotate to be updated to remove other rotate overrides. It'd probably be a good idea to investigate a WriteRotateCarry class at some point but its not high priority given the unusualness of these instructions. llvm-svn: 342842	2018-09-23 17:40:24 +00:00
Jordan Rupprecht	8d60f9b6d2	[llvm-size] Berkeley formatting: use tabs instead of spaces as field delimeters. This matches GNU behavior for size and allows use of cut to parse the output of llvm-size. llvm-svn: 342791	2018-09-21 23:48:12 +00:00
Adrian Prantl	2e102480ac	llvm-dwarfdump --statistics: Unique abstract origins across multiple CUs. Instead of indexing local variables by DIE offset, use the variable name + the path through the lexical block tree. This makes the lookup key consistent across duplicate abstract origins in different CUs. llvm-svn: 342776	2018-09-21 21:59:34 +00:00
Clement Courbet	8171bd8e0f	[X86][Sched] Add zero idiom sched data to the SNB model. Summary: On SNB, renamer-based zeroing does not work for: - 16 and 8-bit GPRs[1]. - MMX [2]. - ANDN variants [3] [1] echo 'sub %ax, %ax' \| /tmp/llvm-exegesis -mode=uops -snippets-file=- [2] echo 'pxor %mm0, %mm0' \| /tmp/llvm-exegesis -mode=uops -snippets-file=- [3] echo 'andnps %xmm0, %xmm0' \| /tmp/llvm-exegesis -mode=uops -snippets-file=- Reviewers: RKSimon, andreadb Subscribers: gbedwell, craig.topper, llvm-commits Differential Revision: https://reviews.llvm.org/D52358 llvm-svn: 342736	2018-09-21 14:07:20 +00:00
Andrea Di Biagio	4cd5cf9fc8	[X86][BtVer2] Fix latency and resource cycles of AVX 256-bit zero-idioms. This patch introduces a SchedWriteVariant to describe zero-idiom VXORP(S\|D)Yrr and VANDNP(S\|D)Yrr. This is a follow-up of r342555. On Jaguar, a VXORPSYrr is 2 macro opcodes. Only one opcode is eliminated at register-renaming stage. The other opcode has to be executed to set the upper half of the destination YMM. Same for VANDNP(S\|D)Yrr. Differential Revision: https://reviews.llvm.org/D52347 llvm-svn: 342728	2018-09-21 12:43:07 +00:00
Jonas Devlieghere	907ed15f99	[dsymutil] Suppress CoreFoundation leaks in tests. This suppresses CoreFoundation originated leaks in the dsymutil tests. I'm not sure if this is a false positive or not, but either way we don't have control over it and shouldn't keep the bot red. http://lab.llvm.org:8080/green/job/clang-stage2-cmake-RgSan/ llvm-svn: 342724	2018-09-21 11:55:17 +00:00
Jonas Devlieghere	b32274242d	[dwarfdump] Verify DW_AT_type is set and points to a compatible DIE. This extends the verifier to catch three new errors: * Missing DW_AT_type attributes for DW_TAG_formal_parameter, DW_TAG_variable and DW_TAG_array_type. * Valid references for DW_AT_type pointing to a non-type tag. Differential revision: https://reviews.llvm.org/D52223 llvm-svn: 342713	2018-09-21 07:50:21 +00:00
Jonas Devlieghere	7ef2c2021e	[dwarfdump] Verify compatibility of attribute TAGs. Verify that DW_AT_specification and DW_AT_abstract_origin reference a DIE with a compatible tag. Differential revision: https://reviews.llvm.org/D38719 llvm-svn: 342712	2018-09-21 07:49:29 +00:00
Jordan Rupprecht	7b1c8168c7	[llvm-objcopy/llvm-strip]: handle --version Summary: Implement --version for objcopy and strip. I think there are LLVM utilities that automatically handle this, but that doesn't seem to work with custom parsing since this binary handles both objcopy and strip, so it uses custom parsing. This fixes PR38298 Reviewers: jhenderson, alexshap, jakehehrlich Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52328 llvm-svn: 342702	2018-09-21 00:47:31 +00:00
Zachary Turner	4bb55c6a0d	[PDB] Fix failing test. This test was missed on the last run since I only ran a subset of them before commiting. llvm-svn: 342659	2018-09-20 16:12:27 +00:00
Calixte Denizet	0b1fe47e22	[gcov] Fix wrong line hit counts when multiple blocks are on the same line Summary: The goal of this patch is to have the same behaviour than gcc-gcov. Currently the hit counts for a line is the sum of the counts for each block on that line. The idea is to detect the cycles in the graph of blocks in using the algorithm by Hawick & James. The count for a cycle is the min of the counts for each edge in the cycle. Once we've the count for each cycle, we can sum them and add the transition counts of those cycles. Fix both https://bugs.llvm.org/show_bug.cgi?id=38065 and https://bugs.llvm.org/show_bug.cgi?id=38066 Reviewers: marco-c, davidxl Reviewed By: marco-c Subscribers: vsk, lebedev.ri, sylvestre.ledru, dblaikie, llvm-commits Differential Revision: https://reviews.llvm.org/D49659 llvm-svn: 342657	2018-09-20 16:09:30 +00:00
Andrea Di Biagio	0aea310391	[llvm-mca][BtVer2] Modify ANDN tests in zero-idioms-avx-256.s. NFC Two test cases should have tested 256-bit variants of VANDN zero-idioms instead of the 128-bit variants. llvm-svn: 342655	2018-09-20 15:48:23 +00:00
Andrea Di Biagio	8b6c314be1	[TableGen][SubtargetEmitter] Add the ability for processor models to describe dependency breaking instructions. This patch adds the ability for processor models to describe dependency breaking instructions. Different processors may specify a different set of dependency-breaking instructions. That means, we cannot assume that all processors of the same target would use the same rules to classify dependency breaking instructions. The main goal of this patch is to provide the means to describe dependency breaking instructions directly via tablegen, and have the following TargetSubtargetInfo hooks redefined in overrides by tabegen'd XXXGenSubtargetInfo classes (here, XXX is a Target name). ``` virtual bool isZeroIdiom(const MachineInstr MI, APInt &Mask) const { return false; } virtual bool isDependencyBreaking(const MachineInstr MI, APInt &Mask) const { return isZeroIdiom(MI); } ``` An instruction MI is a dependency-breaking instruction if a call to method isDependencyBreaking(MI) on the STI (TargetSubtargetInfo object) evaluates to true. Similarly, an instruction MI is a special case of zero-idiom dependency breaking instruction if a call to STI.isZeroIdiom(MI) returns true. The extra APInt is used for those targets that may want to select which machine operands have their dependency broken (see comments in code). Note that by default, subtargets don't know about the existence of dependency-breaking. In the absence of external information, those method calls would always return false. A new tablegen class named STIPredicate has been added by this patch to let processor models classify instructions that have properties in common. The idea is that, a MCInstrPredicate definition can be used to "generate" an instruction equivalence class, with the idea that instructions of a same class all have a property in common. STIPredicate definitions are essentially a collection of instruction equivalence classes. Also, different processor models can specify a different variant of the same STIPredicate with different rules (i.e. predicates) to classify instructions. Tablegen backends (in this particular case, the SubtargetEmitter) will be able to process STIPredicate definitions, and automatically generate functions in XXXGenSubtargetInfo. This patch introduces two special kind of STIPredicate classes named IsZeroIdiomFunction and IsDepBreakingFunction in tablegen. It also adds a definition for those in the BtVer2 scheduling model only. This patch supersedes the one committed at r338372 (phabricator review: D49310). The main advantages are: - We can describe subtarget predicates via tablegen using STIPredicates. - We can describe zero-idioms / dep-breaking instructions directly via tablegen in the scheduling models. In future, the STIPredicates framework can be used for solving other problems. Examples of future developments are: - Teach how to identify optimizable register-register moves - Teach how to identify slow LEA instructions (each subtarget defining its own concept of "slow" LEA). - Teach how to identify instructions that have undocumented false dependencies on the output registers on some processors only. It is also (in my opinion) an elegant way to expose knowledge to both external tools like llvm-mca, and codegen passes. For example, machine schedulers in LLVM could reuse that information when internally constructing the data dependency graph for a code region. This new design feature is also an "opt-in" feature. Processor models don't have to use the new STIPredicates. It has all been designed to be as unintrusive as possible. Differential Revision: https://reviews.llvm.org/D52174 llvm-svn: 342555	2018-09-19 15:57:45 +00:00
Alexander Shaposhnikov	1de445c71c	[llvm-objcopy] Add missing alias for --strip-all-gnu This diff adds -S as an alias for --strip-all-gnu (for compatibility with binutils' objcopy). Patch by Dmitry Golovin! Test plan: make check-all Differential revision: https://reviews.llvm.org/D52163 llvm-svn: 342364	2018-09-17 09:45:12 +00:00
Simon Pilgrim	1c1335a10d	[X86][BMI1] Fix BLSI/BLSMSK/BLSR BMI1 scheduling on btver2 These have the same behaviour as tzcnt on btver2 - confirmed with AMD 16h SOG, Agner and instlatx64. llvm-svn: 342235	2018-09-14 13:31:14 +00:00
Richard Smith	3164fcfd27	Add flag to llvm-profdata to allow symbols in profile data to be remapped, and add a tool to generate symbol remapping files. Summary: The new tool llvm-cxxmap builds a symbol mapping table from a file containing a description of partial equivalences to apply to mangled names and files containing old and new symbol tables. Reviewers: davidxl Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D51470 llvm-svn: 342168	2018-09-13 20:22:02 +00:00
Vedant Kumar	2963c49087	[llvm-cov] Delete custom JSON serialization code (NFC) Teach llvm-cov to use the new llvm JSON library, and remove some redundant/brittle JSON serialization tests. llvm-svn: 342088	2018-09-12 21:59:38 +00:00
Julie Hockett	468722ee9f	[objcopy] make objcopy follow program header standards Submitted on behalf of Armando Montanez (amontanez@google.com). Objects with unused program headers copied by objcopy would always have nonzero values for program header offset and program header entry size. While technically valid, this atypical behavior triggers warnings in some tools. This change sets the two fields to zero when the program header is unused, better fitting the general expectations for unused program header data. Section headers behaved somewhat similarly (though only with the entry size), and are fixed in this revision as well. Differential Revision: https://reviews.llvm.org/D51961 llvm-svn: 342065	2018-09-12 17:56:31 +00:00
Nico Weber	f48e961d23	Make malformed-machos.test pass on my Mac. For some reason, llvm-objdump defaults to -arch=i386 on this system while the test checks x86_64 output. Explicitly pass -arch=x86_64. llvm-svn: 341944	2018-09-11 14:10:33 +00:00
Dean Michael Berris	d2c50408d4	[XRay] Add TSC to NewCPUId Records Summary: This more correctly reflects the data written by the FDR mode runtime. This is a continuation of the work in D50441. Reviewers: mboerger, eizan Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D51911 llvm-svn: 341905	2018-09-11 06:36:51 +00:00
Dean Michael Berris	dd01efc56d	[XRay] Add the `llvm-xray fdr-dump` implementation Summary: In this change, we implement a `BlockPrinter` which orders records in a Block that's been indexed by the `BlockIndexer`. This is used in the `llvm-xray fdr-dump` tool which ties together the various types and utilities we've been working on, to allow for inspection of XRay FDR mode traces both with and without verification. This change is the final step of the refactoring of D50441. Reviewers: mboerger, eizan Subscribers: mgorny, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D51846 llvm-svn: 341887	2018-09-11 00:22:53 +00:00
David Carlier	0efae196dd	[XRay] Fix buildbot failure llvm-svn: 341774	2018-09-10 05:29:49 +00:00
David Carlier	07cc5a8df9	[Xray] tooling allow MachO format support Getting writable xray __DATA sections from MachO as well. Reviewers: dberris Reviewed By: dberris Differential Revision: https://reviews.llvm.org/D51758 llvm-svn: 341772	2018-09-10 05:00:43 +00:00
Fangrui Song	91c95a35c1	[llvm-dwp] Clean up tests X86/*.test llvm-svn: 341688	2018-09-07 18:29:20 +00:00
Puyan Lotfi	99124cc082	[llvm-objcopy] Dwarf .debug section compression support (zlib, zlib-gnu). Third Attempt: - Alignment issues resolved. - zlib::isAvailable() detected. - ArrayRef misuse fixed. Usage: llvm-objcopy --compress-debug-sections=zlib foo.o llvm-objcopy --compress-debug-sections=zlib-gnu foo.o In both cases the debug section contents is compressed with zlib. In the GNU style case the header is the "ZLIB" magic string followed by the uint64 big- endian decompressed size. In the non-GNU mode the header is the Elf(32\|64)_Chdr. Decompression support is coming soon. Differential Revision: https://reviews.llvm.org/D49678 llvm-svn: 341635	2018-09-07 08:10:22 +00:00
Jordan Rupprecht	470f745275	[llvm-strip] -p test fix for windows buildbots Windows ls prints dates as "1997-05-05" instead of "May 05 1997", so only check for a leading space. llvm-svn: 341614	2018-09-07 00:28:54 +00:00
Puyan Lotfi	5be060e341	Revert: [llvm-objcopy] Dwarf .debug section compression (Second Attempt). Various bots still fail for unknown reason. llvm-svn: 341613	2018-09-07 00:28:25 +00:00
Puyan Lotfi	f0954dd275	[llvm-objcopy] Dwarf .debug section compression support (zlib, zlib-gnu). Second Attempt. Alignment issues resolved. zlib::isAvailable() detected. Usage: llvm-objcopy --compress-debug-sections=zlib foo.o llvm-objcopy --compress-debug-sections=zlib-gnu foo.o In both cases the debug section contents is compressed with zlib. In the GNU style case the header is the "ZLIB" magic string followed by the uint64 big- endian decompressed size. In the non-GNU mode the header is the Elf(32\|64)_Chdr. Decompression support is coming soon. Differential Revision: https://reviews.llvm.org/D49678 llvm-svn: 341607	2018-09-06 23:59:50 +00:00
Jordan Rupprecht	29f1ce7dcc	[llvm-strip] Fix -p test to check for explicit spaces around dates, to avoid when the filename happens to contain 1995/1997. llvm-svn: 341595	2018-09-06 22:34:48 +00:00
Fangrui Song	a373582169	Reland rL341509: "[llvm-dwp] Use buffer_stream if output file is not seekable (e.g. "-")" It caused ambiguity between llvm:🆑:Optional and llvm::Optional, which has been fixed by dropping `using namespace cl;` in favor of explicit cl:: qualified names. llvm-svn: 341586	2018-09-06 20:26:54 +00:00
Martin Storsjo	1e8edd13ee	[llvm-ar] Support * as comment char in MRI scripts MRI scripts have two comment chars, * and ;, but only the latter was supported before. Also allow leading spaces before comment chars (and before any command string), and allow comments after a command. Differential Revision: https://reviews.llvm.org/D51338 llvm-svn: 341571	2018-09-06 18:10:45 +00:00
Max Kazantsev	eb410f79b3	Revert rL341509 to fix massive failures on buildbots llvm-svn: 341515	2018-09-06 04:40:49 +00:00
Fangrui Song	26f23f8c25	[llvm-dwp] Fix `UN:` lines (supposed to be `RUN:`) in X86/simple.test and adjust check lines for TYPES: Reviewers: dblaikie, aprantl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D51704 llvm-svn: 341510	2018-09-06 00:46:30 +00:00
Fangrui Song	57575e11d1	[llvm-dwp] Use buffer_stream if output file is not seekable (e.g. "-") Reviewers: dblaikie, pcc Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D51707 llvm-svn: 341509	2018-09-06 00:06:25 +00:00
Jordan Rupprecht	591d889006	[llvm-strip] Support stripping multiple input files Summary: Allow strip to be called on multiple input files, which is interpreted as stripping N files in place. Using multiple input files is incompatible with -o. To allow this, create a `DriverConfig` struct which just wraps a list of `CopyConfigs`. objcopy will only ever have a single `CopyConfig`, but strip will have N (where N >= 1) CopyConfigs. Reviewers: alexshap, jakehehrlich Reviewed By: alexshap, jakehehrlich Subscribers: MaskRay, jakehehrlich, llvm-commits Differential Revision: https://reviews.llvm.org/D51660 llvm-svn: 341464	2018-09-05 13:10:03 +00:00
Jordan Rupprecht	ec277a8278	[llvm-strip] Allow copying relocation sections without symbol tables. Summary: Fixes the error "Link field value 0 in section .rela.plt is invalid" when copying/stripping certain binaries. Minimal repro: ``` $ cat /tmp/a.c int main() { return 0; } $ clang -static /tmp/a.c -o /tmp/a $ llvm-strip /tmp/a -o /tmp/b llvm-strip: error: Link field value 0 in section .rela.plt is invalid. ``` Reviewers: jakehehrlich, alexshap Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D51493 llvm-svn: 341419	2018-09-04 22:28:49 +00:00
Francis Visoiu Mistrih	2d3f01c5dc	[MachO] Fix inconsistency between error messages when validating LC_DYSYMTAB llvm-svn: 341379	2018-09-04 16:31:53 +00:00
Francis Visoiu Mistrih	7690af4da9	[MachO] Fix LC_DYSYMTAB validation for external symbols We were validating the same index (ilocalsym) twice, while iextdefsym was never validated. llvm-svn: 341378	2018-09-04 16:31:48 +00:00
Jonas Devlieghere	881452384a	[dwarfdump] Improve -diff option by hiding more data. The -diff option makes it easy to diff dwarf by hiding addresses and offsets. However not all of them were hidden, which should be fixed by this patch. Differential revision: https://reviews.llvm.org/D51593 llvm-svn: 341377	2018-09-04 16:21:37 +00:00
Chandler Carruth	163222f569	Revert r341342: Dwarf .debug section compression support (zlib, zlib-gnu). Also reverts follow-up commits r341343 and r341344. The primary commit continues to break some build bots even after the fixes in r341343 for UBSan issues: http://lab.llvm.org:8011/builders/clang-cmake-aarch64-full/builds/5823 It is also failing for me locally (linux, x86-64). llvm-svn: 341360	2018-09-04 11:55:57 +00:00
Puyan Lotfi	5a40cd5b50	[llvm-objcopy] Dwarf .debug section compression support (zlib, zlib-gnu). Usage: llvm-objcopy --compress-debug-sections=zlib foo.o llvm-objcopy --compress-debug-sections=zlib-gnu foo.o In both cases the debug section contents is compressed with zlib. In the GNU style case the header is the "ZLIB" magic string followed by the uint64 big- endian decompressed size. In the non-GNU mode the header is the Elf(32\|64)_Chdr. Decompression support is coming soon. Differential Revision: https://reviews.llvm.org/D49678 llvm-svn: 341342	2018-09-03 22:25:56 +00:00
Andrea Di Biagio	fb3d9e1449	[X86] Remove wrong ReadAdvance from multiclass sse_fp_unop_s. A ReadAdvance was incorrectly added to the SchedReadWrite list associated with the following SSE instructions: sqrtss sqrtsd rsqrtss rcpss As a consequence, a wrong operand latency was computed for the register operand used as the base address of the folded load operand. This patch removes the wrong ReadAdvance, and updates the llvm-mca test cases. There is still a problem with correctly modeling partial register writes on XMM registers This other problem is currently tracked here: https://bugs.llvm.org/show_bug.cgi?id=38813 Differential Revision: https://reviews.llvm.org/D51542 llvm-svn: 341326	2018-09-03 16:47:34 +00:00
Jonas Devlieghere	6e5c7e6037	[DebugInfo] Have the verifier accept missing linkage names. According to the standard, for the .debug_names (the "dwarf accelerator tables"): > If a subprogram or inlined subroutine is included, and has a > DW_AT_linkage_name attribute, there will be an additional index entry > for the linkage name. For Swift we generate DW_structure_types with a linkage name and the verifier was incorrectly rejecting this. This patch fixes that by only considering the linkage name in those particular cases. The test is the "reduced" debug info of the failing swift test on swift.org. Differential revision: https://reviews.llvm.org/D51420 llvm-svn: 341311	2018-09-03 12:12:17 +00:00
Alexandre Ganea	6a7efef4af	[DebugInfo] Common behavior for error types Following D50807, and heading towards D50664, this intermediary change does the following: 1. Upgrade all custom Error types in llvm/trunk/lib/DebugInfo/ to use the new StringError behavior (D50807). 2. Implement std::is_error_code_enum and make_error_code() for DebugInfo error enumerations. 3. Rename GenericError -> PDBError (the file will be renamed in a subsequent commit) 4. Update custom error messages to follow the same formatting: (\w\s*)+\. 5. Keep generic "file not found" (ENOENT) errors as they are in PDB code. Previously, there used to be a custom enumeration for that purpose. 6. Remove a few extraneous LF in log() implementations. Printing LF is a responsability at a higher level, not at the error level. Differential Revision: https://reviews.llvm.org/D51499 llvm-svn: 341228	2018-08-31 17:41:58 +00:00
Andrea Di Biagio	a59ec4efa0	[X86][BtVer2] Remove wrong ReadAdvance from AVX vbroadcast(ss\|sd\|f128) instructions. The presence of a ReadAdvance for input operand #0 is problematic because it changes the input latency of the register used as the base address for the folded load. A broadcast cannot start executing if the load address hasn't been computed yet. In the llvm-mca example, the VBROADCASTSS is dependent on the address generated by the LEAQ. That means, it cannot start until LEAQ reaches the write-back stage. If we apply ReadAdvance, then we wrongly assume that the load can start 3 cycles in advance. Differential Revision: https://reviews.llvm.org/D51534 llvm-svn: 341222	2018-08-31 16:05:48 +00:00
Andrea Di Biagio	69da3f3df6	[X86] Add llvm-mca tests that show how operand latency is wrongly computed for SSE sqrtss/sd and rcpss. According to the timeline view, sqrtss/sd/rcpss start executing before the load address for the memory operand is available. This problem is caused by the presence of a ReadAfterLd (a ReadAdvance). Those unary operations should not specify a ReadAdvance at all. llvm-svn: 341213	2018-08-31 14:12:13 +00:00
Francis Visoiu Mistrih	8e864be70a	[llvm-objdump] Keep the memory buffer from the dSYM alive when using -g -dsym When using -g and -dsym, llvm-objdump opens the dsym file and keeps the MachOObjectFile alive, while the memory buffer that the MachOObjectFile was based on gets destroyed. Differential Revision: https://reviews.llvm.org/D51365 llvm-svn: 341209	2018-08-31 13:10:54 +00:00
Andrea Di Biagio	0e21ca1278	[X86][BtVer2] Add an llvm-mca test that shows how the read latency of AVX broadcastss on ymm registers is incorrectly set. llvm-svn: 341197	2018-08-31 10:39:33 +00:00
Andrea Di Biagio	b998eae2f2	[X86][BtVer2] Fix WriteFShuffle256 schedule write info. This patch fixes the number of micro opcodes, and processor resource cycles for the following AVX instructions: vinsertf128rr/rm vperm2f128rr/rm vbroadcastf128 Tests have been regenerated using the usual scripts in the llvm/utils directory. Differential Revision: https://reviews.llvm.org/D51492 llvm-svn: 341185	2018-08-31 08:30:47 +00:00
Matt Arsenault	0da6350dc8	AMDGPU: Remove remnants of old address space mapping llvm-svn: 341165	2018-08-31 05:49:54 +00:00
Adrian Prantl	bdffea12d0	dsymutil: Avoid pruning non-type forward declarations inside DW_TAG_module forward declarations. Especially with template instantiations, there are legitimate reasons why for declarations might be emitted into a DW_TAG_module skeleton / forward-declaration sub-tree, that are not forward declarations in the sense of that there is a more complete definition over in a .pcm file. The example in the testcase is a constant DW_TAG_member of a DW_TAG_class template instatiation. rdar://problem/43623196 llvm-svn: 341123	2018-08-30 21:21:16 +00:00
Andrea Di Biagio	8b647dcf4b	[llvm-mca] Report the number of dispatched micro opcodes in the DispatchStatistics view. This patch introduces the following changes to the DispatchStatistics view: * DispatchStatistics now reports the number of dispatched opcodes instead of the number of dispatched instructions. * The "Dynamic Dispatch Stall Cycles" table now also reports the percentage of stall cycles against the total simulated cycles. This change allows users to easily compare dispatch group sizes with the processor DispatchWidth. Before this change, it was difficult to correlate the two numbers, since DispatchStatistics view reported numbers of instructions (instead of opcodes). DispatchWidth defines the maximum size of a dispatch group in terms of number of micro opcodes. The other change introduced by this patch is related to how DispatchStage generates "instruction dispatch" events. In particular: * There can be multiple dispatch events associated with a same instruction * Each dispatch event now encapsulates the number of dispatched micro opcodes. The number of micro opcodes declared by an instruction may exceed the processor DispatchWidth. Therefore, we cannot assume that instructions are always fully dispatched in a single cycle. DispatchStage knows already how to handle instructions declaring a number of opcodes bigger that DispatchWidth. However, DispatchStage always emitted a single instruction dispatch event (during the first simulated dispatch cycle) for instructions dispatched. With this patch, DispatchStage now correctly notifies multiple dispatch events for instructions that cannot be dispatched in a single cycle. A few views had to be modified. Views can no longer assume that there can only be one dispatch event per instruction. Tests (and docs) have been updated. Differential Revision: https://reviews.llvm.org/D51430 llvm-svn: 341055	2018-08-30 10:50:20 +00:00
Andrew V. Tischenko	62f7a3207b	[X86] Improved sched model for X86 CMPXCHG* instructions. Differential Revision: https://reviews.llvm.org/D50070 llvm-svn: 341024	2018-08-30 06:26:00 +00:00
Jordan Rupprecht	7481540fd9	[llvm-strip] Fix -p\|--preserve-dates to not truncate output when used in-place. The restoreDateOnFile() method used to preserve dates uses sys::fs::openFileForWrite(). That method defaults to opening files with CD_CreateAlways, which truncates the output file if it exists. Use CD_OpenExisting instead to open it and not truncate it, which also has the side benefit of erroring if the file does not exist (it should always exist, because we just wrote it out). Also, fix the test case to make sure the output is a valid output file, and not empty. The extra test assertions are enough to catch this regression. llvm-svn: 340996	2018-08-29 23:21:56 +00:00
Andrea Di Biagio	a2eee47450	[llvm-mca] Add fields "Total uOps" and "uOps Per Cycle" to the report generated by the SummaryView. This patch adds two new fields to the perf report generated by the SummaryView. Fields are now logically organized into two small groups; only the second group contains throughput indicators. Example: ``` Iterations: 100 Instructions: 300 Total Cycles: 414 Total uOps: 700 Dispatch Width: 4 uOps Per Cycle: 1.69 IPC: 0.72 Block RThroughput: 4.0 ``` This patch also updates the docs for llvm-mca. Due to the nature of this change, several tests in the tools/llvm-mca directory were affected, and had to be updated using script `update_mca_test_checks.py`. llvm-svn: 340946	2018-08-29 17:56:39 +00:00
Andrea Di Biagio	5221e17fd6	[llvm-mca] Don't disable the SummaryView if flag `-all-stats` is false. llvm-svn: 340945	2018-08-29 17:40:04 +00:00
Andrea Di Biagio	d17d371c40	[llvm-mca][TimelineView] Force the same number of executions for every entry in the 'wait-times' table. This patch also uses colors to highlight problematic wait-time entries. A problematic entry is an entry with an high wait time that tends to match (or exceed) the size of the scheduler's buffer. Color RED is used if an instruction had to wait an average number of cycles which is bigger than (or equal to) the size of the underlying scheduler's buffer. Color YELLOW is used if the time (in cycles) spend waiting for the operands or pipeline resources is bigger than half the size of the underlying scheduler's buffer. Color MAGENTA is used if an instruction does not consume buffer resources according to the scheduling model. llvm-svn: 340825	2018-08-28 14:27:01 +00:00
Kit Barton	7c80f98b69	[PPC] Remove Darwin support from POWER backend. This patch issues an error message if Darwin ABI is attempted with the PPC backend. It also cleans up existing test cases, either converting the test to use an alternative triple or removing the test if the coverage is no longer needed. Updated Tests ------------- The majority of test cases were updated to use a different triple that does not include the Darwin ABI. Many tests were also updated to use FileCheck, in place of grep. Deleted Tests ------------- llvm/test/tools/dsymutil/PowerPC/sibling.test was originally added to test specific functionality of dsymutil using an object file created with an old version of llvm-gcc for a Powerbook G4. After a discussion with @JDevlieghere he suggested removing the test. llvm/test/CodeGen/PowerPC/combine_loads_from_build_pair.ll was converted from a PPC test to a SystemZ test, as the behavior is also reproducible there. All other tests that were deleted were specific to the darwin/ppc ABI and no longer necessary. Phabricator Review: https://reviews.llvm.org/D50988 llvm-svn: 340795	2018-08-28 01:18:29 +00:00
Andrea Di Biagio	b89b96c1b2	[llvm-mca] Improved report generated by the SchedulerStatistics view. Before this patch, the SchedulerStatistics only printed the maximum number of buffer entries consumed in each scheduler's queue at a given point of the simulation. This patch restructures the reported table, and adds an extra field named "Average number of used buffer entries" to it. This patch also uses different colors to help identifying bottlenecks caused by high scheduler's buffer pressure. llvm-svn: 340746	2018-08-27 14:52:52 +00:00
Nico Weber	e75fd1b184	fix comment typo llvm-svn: 340744	2018-08-27 14:25:22 +00:00
Joel Galenson	6cc0e63e2f	[cfi-verify] Support cross-DSO When used in cross-DSO mode, CFI will generate calls to special functions rather than trap instructions. For example, instead of generating if (!InlinedFastCheck(f)) abort(); call f CFI generates if (!InlinedFastCheck(f)) __cfi_slowpath(CallSiteTypeId, f); call f This patch teaches cfi-verify to recognize calls to __cfi_slowpath and abort and treat them as trap functions. In addition to normal symbols, we also parse the dynamic relocations to handle cross-DSO calls in libraries. We also extend cfi-verify to recognize other patterns that occur using cross-DSO. For example, some indirect calls are not guarded by a branch to a trap but instead follow a call to __cfi_slowpath. For example: if (!InlinedFastCheck(f)) call f else { __cfi_slowpath(CallSiteTypeId, f); call f } In this case, the second call to f is not marked as protected by the current code. We thus recognize if indirect calls directly follow a call to a function that will trap on CFI violations and treat them as protected. We also ignore indirect calls in the PLT, since on AArch64 each entry contains an indirect call that should not be protected by CFI, and these are labeled incorrectly when debug information is not present. Differential Revision: https://reviews.llvm.org/D49383 llvm-svn: 340612	2018-08-24 15:21:58 +00:00
Joel Galenson	134cf47dcb	[llvm-objdump] Label calls to the PLT. Differential Revision: https://reviews.llvm.org/D50204 llvm-svn: 340611	2018-08-24 15:21:57 +00:00
Richard Smith	c6ba9ca169	Make llvm-profdata show -text work as advertised in the documentation. Per LLVM's CommandGuide, llvm-profdata show -text is supposed to produce textual output that can be passed as input to further llvm-profdata invocations. This previously didn't work for two reasons: 1) -text was not sufficient to enable the machine-readable text format output; instead, -text was effectively ignored if -counts was not also specified. (With this patch, -counts is instead ignored if -text is specified, because the machine-readable text format always includes counts.) 2) When the input data was an IR-level profile, the :ir marker was missing from the output, resulting in a text format output that would not be usable as profiling data due to function hash mismatches. Differential Revision: https://reviews.llvm.org/D51188 llvm-svn: 340592	2018-08-24 01:34:45 +00:00
Fangrui Song	9ba5740ba5	[gold] -thinlto-object-suffix-replace: don't append new suffix if path does not end with old suffix Summary: This is to be consistent with lld behavior since rLLD340364. Reviewers: tejohnson Reviewed By: tejohnson Subscribers: steven_wu, eraman, mehdi_amini, inglorion, dexonsmith, llvm-commits Differential Revision: https://reviews.llvm.org/D51060 llvm-svn: 340380	2018-08-22 02:11:36 +00:00
Zachary Turner	030ad37ef4	[llvm-objdump] Add ability to demangle COFF symbols. llvm-svn: 340221	2018-08-20 22:18:21 +00:00
Jordan Rupprecht	be8ebccaed	[llvm-objcopy] Implement -G/--keep-global-symbol(s). Summary: Port GNU Objcopy -G/--keep-global-symbol(s). This is slightly different than the already-implemented --globalize-symbol, which marks a symbol as global when copying. When --keep-global-symbol (alias -G) is used, only those symbols marked will stay global, and all other globals are demoted to local. (Also note that it doesn't promote a symbol to global). Additionally, there is a pluralized version of the flag --keep-global-symbols, which effectively applies --keep-global-symbol for every non-comment in a file. Reviewers: jakehehrlich, jhenderson, alexshap Reviewed By: jhenderson Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D50589 llvm-svn: 340105	2018-08-17 22:34:48 +00:00
Jordan Rupprecht	bb179a197c	Fix windows buildbots by removing : from filenames llvm-svn: 340071	2018-08-17 19:18:20 +00:00
Jordan Rupprecht	cf67633e66	[llvm-objcopy] Add support for -I binary -B <arch>. Summary: The -I (--input-target) and -B (--binary-architecture) flags exist but are currently silently ignored. This adds support for -I binary for architectures i386, x86-64 (and alias i386:x86-64), arm, aarch64, sparc, and ppc (powerpc:common64). This is largely based on D41687. This is done by implementing an additional subclass of Reader, BinaryReader, which works by interpreting the input file as contents for .data field, sets up a synthetic header, and adds additional sections/symbols (e.g. _binary__tmp_data_txt_start). Reviewers: jakehehrlich, alexshap, jhenderson, javed.absar Reviewed By: jhenderson Subscribers: jyknight, nemanjai, kbarton, fedor.sergeev, jrtc27, kristof.beyls, paulsemel, llvm-commits Differential Revision: https://reviews.llvm.org/D50343 llvm-svn: 340070	2018-08-17 18:51:11 +00:00
Peter Collingbourne	3da2ffb826	Add missing test file from r339799. llvm-svn: 339927	2018-08-16 19:29:01 +00:00
Jordan Rupprecht	d1767dc56f	[llvm-strip] Add support for -p/--preserve-dates Summary: [llvm-strip] Preserve access/modification timestamps when -p is used. Reviewers: jakehehrlich, jhenderson, alexshap Reviewed By: jhenderson Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D50744 llvm-svn: 339921	2018-08-16 18:29:40 +00:00
George Rimar	d2f90ea337	[yaml2obj] - Allow to use numeric sh_link (Link) value for sections. That change allows using numeric values for Link field. It is consistent with the code for another fields in this method. llvm-svn: 339873	2018-08-16 12:44:17 +00:00
George Rimar	17257bb0b5	[yaml2elf] - Use check-next in test. Its a follow up for rL339870. llvm-svn: 339872	2018-08-16 12:40:27 +00:00
George Rimar	7f2df7df45	[yaml2elf] - Simplify code, add a test. NFC. This simplifies the code allowing to set the sh_info for relocations sections. And adds a missing test. llvm-svn: 339870	2018-08-16 12:23:22 +00:00
Peter Collingbourne	62e4fc48a5	llvm-readobj: Fix addend in relocations for android packed format If a relocation group doesn't have the RELOCATION_GROUP_HAS_ADDEND_FLAG set, then this implies the group's addend equals zero. In this case android packed format won't encode an explicit addend delta, instead we need to set Addend, the "previous addend" variable, to zero by ourself. Patch by Yi-Yo Chiang! Differential Revision: https://reviews.llvm.org/D50601 llvm-svn: 339799	2018-08-15 17:58:22 +00:00
George Rimar	942e8ed19d	[yaml2obj] - Teach yaml2obj to produce SHT_GROUP section with a custom Info field. This allows to set custom Info field value for SHT_GROUP sections. It is useful to allow this because we would be able to replace at least one binary object committed in LLD and replace it with the yaml2obj based test. Differential revision: https://reviews.llvm.org/D50776 llvm-svn: 339772	2018-08-15 13:55:22 +00:00
Andrea Di Biagio	a03f2a77f8	[llvm-mca] Fix PR38575: Avoid an invalid implicit truncation of a processor resource mask (an uint64_t value) to unsigned. This patch fixes a regression introduced at revision 338702. A processor resource mask was incorrectly implicitly truncated to an unsigned quantity. Later on, the truncated mask was used to initialize an element of a vector of processor resource descriptors. On targets with more than 32 processor resources, some elements of the vector are left uninitialized. As a consequence, this bug might have eventually caused a crash due to null dereference in the Scheduler. This patch fixes PR38575, and adds a test for it. llvm-svn: 339768	2018-08-15 12:53:38 +00:00
George Rimar	5290af8ad9	[yaml2obj] - Teach tool to produce SHT_GROUP section with a custom type. Currently, it is possible to use yaml2obj for producing SHT_GROUP sections of type GRP_COMDAT. For LLD test case I need to produce an object with a broken (different from GRP_COMDAT) type. The patch teaches tool to do such things. Differential revision: https://reviews.llvm.org/D50761 llvm-svn: 339764	2018-08-15 11:43:00 +00:00
Tom Stellard	69bf876b49	[gold] Fix Tests cases on i686 Reviewers: tejohnson Reviewed By: tejohnson Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D50583 llvm-svn: 339492	2018-08-11 01:08:34 +00:00
Jordan Rupprecht	88ed5e59bd	[llvm-objcopy] NFC: Add some color to error() llvm-svn: 339404	2018-08-09 22:52:03 +00:00
Paul Semel	7a3dc2c184	[llvm-objcopy] Add --prefix-symbols option Differential Revision: https://reviews.llvm.org/D50381 llvm-svn: 339362	2018-08-09 17:49:04 +00:00
Paul Semel	a42dec7a1b	[llvm-objcopy] Add --dump-section Differential Revision: https://reviews.llvm.org/D49979 llvm-svn: 339358	2018-08-09 17:05:21 +00:00
Andrew V. Tischenko	1fe3375620	[X86] MCA tests for XCHG, XADD and CMPXCHG* instructions Differential Revision: https://reviews.llvm.org/D49912 llvm-svn: 339145	2018-08-07 14:36:43 +00:00
George Rimar	65a6828b17	[yaml2obj] - Add a support for changing EntSize. I was trying to add a test case for LLD and found that it is impossible to set sh_entsize via yaml. The patch implements the missing part. Differential revision: https://reviews.llvm.org/D50235 llvm-svn: 339113	2018-08-07 08:11:38 +00:00
Stella Stamenova	cc2404c01d	[lit, python] Always add quotes around the python path in lit Summary: The issue with the python path is that the path to python on Windows can contain spaces. To make the tests always work, the path to python needs to be surrounded by quotes. This change updates several configuration files which specify the path to python as a substitution and also remove quotes from existing tests. Reviewers: asmith, zturner, alexshap, jakehehrlich Reviewed By: zturner, alexshap, jakehehrlich Subscribers: mehdi_amini, nemanjai, eraman, kbarton, jakehehrlich, steven_wu, dexonsmith, stella.stamenova, delcypher, llvm-commits Differential Revision: https://reviews.llvm.org/D50206 llvm-svn: 339073	2018-08-06 22:37:44 +00:00
Alexandre Ganea	741cc3531a	[llvm-pdbutil] Support PDBs without a DBI stream Differential Revision: https://reviews.llvm.org/D50258 llvm-svn: 339045	2018-08-06 19:35:00 +00:00
David Bolvansky	b7fcd10700	[NFC] Fixed inliner tests - 2 llvm-svn: 338973	2018-08-05 16:53:36 +00:00
David Bolvansky	2f1f3b10ad	[NFC] Fixed inliner tests llvm-svn: 338972	2018-08-05 16:30:46 +00:00
David Bolvansky	c0aa4b75a4	Enrich inline messages Summary: This patch improves Inliner to provide causes/reasons for negative inline decisions. 1. It adds one new message field to InlineCost to report causes for Always and Never instances. All Never and Always instantiations must provide a simple message. 2. Several functions that used to return the inlining results as boolean are changed to return InlineResult which carries the cause for negative decision. 3. Changed remark priniting and debug output messages to provide the additional messages and related inline cost. 4. Adjusted tests for changed printing. Patch by: yrouban (Yevgeny Rouban) Reviewers: craig.topper, sammccall, sgraenitz, NutshellySima, shchenz, chandlerc, apilipenko, javed.absar, tejohnson, dblaikie, sanjoy, eraman, xbolva00 Reviewed By: tejohnson, xbolva00 Subscribers: xbolva00, llvm-commits, arsenm, mehdi_amini, eraman, haicheng, steven_wu, dexonsmith Differential Revision: https://reviews.llvm.org/D49412 llvm-svn: 338969	2018-08-05 14:53:08 +00:00
Dave Lee	3fb120f12e	objdump: Better handling of Mach-O universal binaries Summary: With Mach-O, there is a flag requirement discrepancy between working with universal binaries and thin binaries. Many flags that don't require the `-macho` flag (for example `-private-headers` and `-disassemble`) fail to work on universal binaries unless `-macho` is given. When this happens, the error message is unhelpful, stating: The file was not recognized as a valid object file. Which can lead to confusion. This change allows generic flags to be used on universal binaries with and without the `-macho` flag. This means flags that can be used for thin files can be used consistently with fat files too. To do this, the universal binary support within `ParseInputMachO()` is extracted into a new function. This new function is called directly from `DumpInput()` when the input binary is universal. Additionally the `-arch` flag validation in `ParseInputMachO()` was extracted to be reused. Reviewers: compnerd Reviewed By: compnerd Subscribers: keith, llvm-commits Differential Revision: https://reviews.llvm.org/D48702 llvm-svn: 338792	2018-08-03 00:06:38 +00:00
Ben Dunbobbin	d498dcdbbf	[llvm-ar] Fix help text test. NFC. Missed from @338703 llvm-svn: 338709	2018-08-02 12:27:01 +00:00
Simon Pilgrim	b911d6721d	[llvm-mca][x86] Add CMPXCHG instruction resource tests I've put CMPXCHG8B/CMPXCHG16B in the same file, even though technically they are under separate CPUID bits all targets seem to support both (or neither). llvm-svn: 338595	2018-08-01 17:25:11 +00:00
Simon Pilgrim	5c4fb14e07	[llvm-mca][x86] Add PREFETCHW instruction resource tests These aren't just available via 3DNow! so test for them separately as well. llvm-svn: 338584	2018-08-01 16:34:39 +00:00
Simon Pilgrim	dcfa732b2f	[llvm-mca][x86] Add PCLMUL instruction resource tests Renamed the btver2 file that already contained them - the other targets were only testing the AVX versions llvm-svn: 338583	2018-08-01 16:25:50 +00:00
Jordan Rupprecht	d67c1e129b	[llvm-objcopy] Add support for --rename-section flags from gnu objcopy Summary: Add support for --rename-section flags from gnu objcopy. Not all flags appear to have an effect for ELF objects, but allowing them would allow easier drop-in replacement. Other unrecognized flags are rejected. This was only tested by comparing flags printed by "readelf -e <.o>" against the output of gnu vs llvm objcopy, it hasn't been tested to be valid beyond that. Reviewers: jakehehrlich, alexshap Subscribers: llvm-commits, paulsemel, alexshap Differential Revision: https://reviews.llvm.org/D49870 llvm-svn: 338582	2018-08-01 16:23:22 +00:00
Andrea Di Biagio	7f3bf5c1f9	[llvm-mca] Correctly update the rank in `Scheduler::select()`. Found by inspection. llvm-svn: 338579	2018-08-01 16:06:33 +00:00
Simon Pilgrim	34ac6533f4	[llvm-mca][x86] Add SET/TEST instruction resource tests llvm-svn: 338576	2018-08-01 15:29:47 +00:00
Simon Pilgrim	e364e57ac9	[llvm-mca][x86] Add LEA instruction resource tests We already added these to btver2, now add them to other targets, even though none of their models treat them specially (yet). llvm-svn: 338565	2018-08-01 14:25:33 +00:00
Simon Pilgrim	6754913e95	[llvm-mca][x86] Add more x86-64 system instruction resource tests CPUID, IN/OUT, INS/OUTS, INT, PAUSE, SCAS, UD2, XLAT llvm-svn: 338563	2018-08-01 14:18:09 +00:00
Simon Pilgrim	5f41ab79c0	[llvm-mca][x86] Add CLFLUSHOPT instruction resource tests llvm-svn: 338550	2018-08-01 13:34:17 +00:00
Simon Pilgrim	bd014f4d91	[llvm-mca][x86] Add CMPS/LODS/MOVS/STOS string instruction resource tests llvm-svn: 338532	2018-08-01 13:14:45 +00:00
Simon Pilgrim	18d025a732	[llvm-mca][x86] Add STC + STD instruction resource tests llvm-svn: 338514	2018-08-01 11:00:11 +00:00
David Bolvansky	fbbb83c782	Revert "Enrich inline messages", tests fail llvm-svn: 338496	2018-08-01 08:02:40 +00:00
David Bolvansky	7f36cd9d96	Enrich inline messages Summary: This patch improves Inliner to provide causes/reasons for negative inline decisions. 1. It adds one new message field to InlineCost to report causes for Always and Never instances. All Never and Always instantiations must provide a simple message. 2. Several functions that used to return the inlining results as boolean are changed to return InlineResult which carries the cause for negative decision. 3. Changed remark priniting and debug output messages to provide the additional messages and related inline cost. 4. Adjusted tests for changed printing. Patch by: yrouban (Yevgeny Rouban) Reviewers: craig.topper, sammccall, sgraenitz, NutshellySima, shchenz, chandlerc, apilipenko, javed.absar, tejohnson, dblaikie, sanjoy, eraman, xbolva00 Reviewed By: tejohnson, xbolva00 Subscribers: xbolva00, llvm-commits, arsenm, mehdi_amini, eraman, haicheng, steven_wu, dexonsmith Differential Revision: https://reviews.llvm.org/D49412 llvm-svn: 338494	2018-08-01 07:37:16 +00:00
Victor Leschuk	58d3399d8a	[DWARF] Support for .debug_addr (consumer) This patch implements basic support for parsing and dumping DWARFv5 .debug_addr section. llvm-svn: 338447	2018-07-31 22:19:19 +00:00
Fangrui Song	87b4b8f7b4	[llvm-objcopy] Make --strip-debug strip .gdb_index Summary: See binutils-gdb/bfd/elf.c, GNU objcopy also strips .stab* (STABS) .line* (DWARF 1) .gnu.linkonce.wi.* (linkonce section for .debug_info) but I'm not sure we need to be compatible with it. Reviewers: dblaikie, alexshap, jakehehrlich, jhenderson Reviewed By: alexshap, jakehehrlich Subscribers: aprantl, JDevlieghere, jakehehrlich, llvm-commits Differential Revision: https://reviews.llvm.org/D50100 llvm-svn: 338443	2018-07-31 21:26:35 +00:00
Simon Pilgrim	1f4b9cb6fe	[llvm-mca][x86] Add 32-bit instruction resource tests These aren't exhaustive, but cover some instructions that are only available in 32-bit mode (where would we be without good BCD math performance?). llvm-svn: 338404	2018-07-31 17:33:08 +00:00
David Bolvansky	ab79414f7b	Revert Enrich inline messages llvm-svn: 338389	2018-07-31 14:47:22 +00:00
David Bolvansky	b562dbabda	Enrich inline messages Summary: This patch improves Inliner to provide causes/reasons for negative inline decisions. 1. It adds one new message field to InlineCost to report causes for Always and Never instances. All Never and Always instantiations must provide a simple message. 2. Several functions that used to return the inlining results as boolean are changed to return InlineResult which carries the cause for negative decision. 3. Changed remark priniting and debug output messages to provide the additional messages and related inline cost. 4. Adjusted tests for changed printing. Patch by: yrouban (Yevgeny Rouban) Reviewers: craig.topper, sammccall, sgraenitz, NutshellySima, shchenz, chandlerc, apilipenko, javed.absar, tejohnson, dblaikie, sanjoy, eraman, xbolva00 Reviewed By: tejohnson, xbolva00 Subscribers: xbolva00, llvm-commits, arsenm, mehdi_amini, eraman, haicheng, steven_wu, dexonsmith Differential Revision: https://reviews.llvm.org/D49412 llvm-svn: 338387	2018-07-31 14:25:24 +00:00
Andrea Di Biagio	a1852b6194	[llvm-mca][BtVer2] Teach how to identify dependency-breaking idioms. This patch teaches llvm-mca how to identify dependency breaking instructions on btver2. An example of dependency breaking instructions is the zero-idiom XOR (example: `XOR %eax, %eax`), which always generates zero regardless of the actual value of the input register operands. Dependency breaking instructions don't have to wait on their input register operands before executing. This is because the computation is not dependent on the inputs. Not all dependency breaking idioms are also zero-latency instructions. For example, `CMPEQ %xmm1, %xmm1` is independent on the value of XMM1, and it generates a vector of all-ones. That instruction is not eliminated at register renaming stage, and its opcode is issued to a pipeline for execution. So, the latency is not zero. This patch adds a new method named isDependencyBreaking() to the MCInstrAnalysis interface. That method takes as input an instruction (i.e. MCInst) and a MCSubtargetInfo. The default implementation of isDependencyBreaking() conservatively returns false for all instructions. Targets may override the default behavior for specific CPUs, and return a value which better matches the subtarget behavior. In future, we should teach to Tablegen how to automatically generate the body of isDependencyBreaking from scheduling predicate definitions. This would allow us to expose the knowledge about dependency breaking instructions to the machine schedulers (and, potentially, other codegen passes). Differential Revision: https://reviews.llvm.org/D49310 llvm-svn: 338372	2018-07-31 13:21:43 +00:00
Jonas Devlieghere	ae1727e3dd	[dsymutil] Simplify temporary file handling. Dsymutil's update functionality was broken on Windows because we tried to rename a file while we're holding open handles to that file. TempFile provides a solution for this through its keep(Twine) method. This patch changes dsymutil to make use of that functionality. Differential revision: https://reviews.llvm.org/D49860 llvm-svn: 338216	2018-07-29 14:56:15 +00:00
Stephen Hines	e6e75bf84c	Handle the lack of a symbol table correctly. Summary: These two cases will trigger a dereference on a nullptr, since the SymbolTable can be nonexistent for a given library, in addition to just being empty. Reviewers: alexshap Reviewed By: alexshap Subscribers: meikeb, kongyi, chh, jakehehrlich, llvm-commits, pirama Differential Revision: https://reviews.llvm.org/D49534 llvm-svn: 338062	2018-07-26 20:05:31 +00:00
Jonas Devlieghere	640e790af2	[test] Disable dsymutil update test on windows Apparently, the issue with dsymutil update functionality on Windows was that Windows doesn't like dsymutil renaming files that have open handles to them. This disables the new accelerator test and updates the comment in the other two test. We should be able to enable the tests again once we updated the implementation to use TempFile::keep() to keep the temporary files in MachOUtils. A big thank you to Jeremy Morse from Sony for figuring this out and bringing it to my attention. llvm-svn: 338030	2018-07-26 14:16:19 +00:00
Jonas Devlieghere	f290256dfb	[test] Do dsymutil update in place Update the dSYM bundle in place when swapping out the accelerator tables. This should unbreak the windows bot that have been failing with an access denied. llvm-svn: 338014	2018-07-26 09:23:10 +00:00
Jonas Devlieghere	743d351120	[dsymutil] Add support for generating DWARF5 accelerator tables. This patch add support for emitting DWARF5 accelerator tables (.debug_names) from dsymutil. Just as with the Apple style accelerator tables, it's possible to update existing dSYMs. This patch includes a test that show how you can convert back and forth between the two types. If no kind of table is specified, dsymutil will default to generating Apple-style accelerator tables whenever it finds those in its input. The same is true when there are no accelerator tables at all. Finally, in the remaining case, where there's at least one DWARF v5 table and no Apple ones, the output will contains a DWARF accelerator tables (.debug_names). Differential revision: https://reviews.llvm.org/D49137 llvm-svn: 337980	2018-07-25 23:01:38 +00:00
Paul Semel	0913dcd747	[llvm-objdump] Add dynamic section printing to private-headers option Differential Revision: https://reviews.llvm.org/D49016 llvm-svn: 337902	2018-07-25 11:09:20 +00:00
Paul Semel	5ce8f1598c	[llvm-readobj] Generic hex-dump option Helpers are available to make this option file format independant. This patch adds the feature for Wasm file format. It doesn't change the behavior of the other file format handling. Differential Revision: https://reviews.llvm.org/D49545 llvm-svn: 337896	2018-07-25 10:04:37 +00:00
Wolfgang Pieb	439801ba1d	[DWARF v5] Refactor range lists dumping by using a more generic way of handling tables of lists. The intent is to use it for location list tables as well. Change is almost NFC with the exception of the spelling of some strings used during dumping (all lowercase now). Reviewer: JDevlieghere Differential Revision: https://reviews.llvm.org/D49500 llvm-svn: 337763	2018-07-23 22:37:17 +00:00
Paul Semel	1dbbfba888	[yaml2obj] Add default sh_entsize for dynamic sections Dynamic section holds a table, so the sh_entsize might be set. As the dynamic section entry size never changes, we can default it to the size of a dynamic entry. Differential Revision: https://reviews.llvm.org/D49619 llvm-svn: 337725	2018-07-23 18:49:04 +00:00
Roman Lebedev	52b85377eb	[NFC][MCA] ZnVer1: Update RegisterFile to identify false dependencies on partially written registers. Summary: Pretty mechanical follow-up for D49196. As microarchitecture.pdf notes, "20 AMD Ryzen pipeline", "20.8 Register renaming and out-of-order schedulers": The integer register file has 168 physical registers of 64 bits each. The floating point register file has 160 registers of 128 bits each. "20.14 Partial register access": The processor always keeps the different parts of an integer register together. ... An instruction that writes to part of a register will therefore have a false dependence on any previous write to the same register or any part of it. Reviewers: andreadb, courbet, RKSimon, craig.topper, GGanesh Reviewed By: GGanesh Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D49393 llvm-svn: 337676	2018-07-23 10:10:13 +00:00
Roman Lebedev	d57bd45acc	[NFC][MCA] ZnVer1: add partial-reg-update tests Reviewers: andreadb, courbet, RKSimon, craig.topper, GGanesh Reviewed By: GGanesh Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D49392 llvm-svn: 337675	2018-07-23 10:10:04 +00:00
Martin Storsjo	a6ffc9c8df	[COFF] Adjust how we flag weak externals This fixes PR36096. Originally based on a patch by Martell Malone. Differential Revision: https://reviews.llvm.org/D44357 llvm-svn: 337613	2018-07-20 20:48:29 +00:00
Jordan Rupprecht	db2036e1f5	[llvm-objcopy] Add basic support for --rename-section Summary: Add basic support for --rename-section=old=new to llvm-objcopy. A full replacement for GNU objcopy requires also modifying flags (i.e. --rename-section=old=new,flag1,flag2); I'd like to keep that in a separate change to keep this simple. Reviewers: jakehehrlich, alexshap Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D49576 llvm-svn: 337604	2018-07-20 19:54:24 +00:00
Simon Pilgrim	5e729dcc03	[llvm-mca][x86] Add movsx/movzx instructions to general x86_64 resource tests llvm-svn: 337586	2018-07-20 17:43:42 +00:00
Stella Stamenova	ca0547c83c	[llvm-objcopy, tests] Fix several llvm-objcopy tests Summary: In Python 3, sys.stdout.write expects a string rather than bytes. In order to be able to write the bytes to stdout, we need to use the buffer directly instead. This change is borrowing the implementation for writing to stdout that cat.py uses. Note that we cannot use cat.py directly because the file we are trying to open is a gzip file. Reviewers: asmith, bkramer, alexshap, jakehehrlich Reviewed By: alexshap, jakehehrlich Subscribers: jakehehrlich, llvm-commits Differential Revision: https://reviews.llvm.org/D49515 llvm-svn: 337567	2018-07-20 16:19:36 +00:00
Andrea Di Biagio	b6022aa8d9	[X86][BtVer2] correctly model the latency/throughput of LEA instructions. This patch fixes the latency/throughput of LEA instructions in the BtVer2 scheduling model. On Jaguar, A 3-operands LEA has a latency of 2cy, and a reciprocal throughput of 1. That is because it uses one cycle of SAGU followed by 1cy of ALU1. An LEA with a "Scale" operand is also slow, and it has the same latency profile as the 3-operands LEA. An LEA16r has a latency of 3cy, and a throughput of 0.5 (i.e. RThrouhgput of 2.0). This patch adds a new TIIPredicate named IsThreeOperandsLEAFn to X86Schedule.td. The tablegen backend (for instruction-info) expands that definition into this (file X86GenInstrInfo.inc): ``` static bool isThreeOperandsLEA(const MachineInstr &MI) { return ( ( MI.getOpcode() == X86::LEA32r \|\| MI.getOpcode() == X86::LEA64r \|\| MI.getOpcode() == X86::LEA64_32r \|\| MI.getOpcode() == X86::LEA16r ) && MI.getOperand(1).isReg() && MI.getOperand(1).getReg() != 0 && MI.getOperand(3).isReg() && MI.getOperand(3).getReg() != 0 && ( ( MI.getOperand(4).isImm() && MI.getOperand(4).getImm() != 0 ) \|\| (MI.getOperand(4).isGlobal()) ) ); } ``` A similar method is generated in the X86_MC namespace, and included into X86MCTargetDesc.cpp (the declaration lives in X86MCTargetDesc.h). Back to the BtVer2 scheduling model: A new scheduling predicate named JSlowLEAPredicate now checks if either the instruction is a three-operands LEA, or it is an LEA with a Scale value different than 1. A variant scheduling class uses that new predicate to correctly select the appropriate latency profile. Differential Revision: https://reviews.llvm.org/D49436 llvm-svn: 337469	2018-07-19 16:42:15 +00:00
George Rimar	a2b553b4c9	[llvm-readobj] - Do not report invalid amount of sections. When output style is GNU and amount of sections is >= SHN_LORESERVE, llvm-readobj reports zero number of sections instead of actual value. The patch fixes that. Differential revision: https://reviews.llvm.org/D49544 llvm-svn: 337462	2018-07-19 14:52:57 +00:00
Paul Semel	6e13790801	[llvm-readobj] Generic -string-dump option Differential Revision: https://reviews.llvm.org/D49470 llvm-svn: 337408	2018-07-18 18:00:41 +00:00
Paul Semel	007dedbf77	[llvm-objdump] Add -demangle (-C) option Differential Revision: https://reviews.llvm.org/D49043 llvm-svn: 337401	2018-07-18 16:39:21 +00:00
Benjamin Kramer	99ea38f42d	[llvm-objcopy] %python wants to be in quotes, because it might contain spaces llvm-svn: 337399	2018-07-18 16:17:53 +00:00
George Rimar	e35e6448f9	[llvm-objdump] - Stop reporting bogus section IDs. Imagine we have a file with few sections, and one of them is .foo with index N != 0. Problem is that when llvm-objdump is given a -section=.foo parameter it lists .foo as a section at index 0. That makes impossible to write test cases which needs to find the index of the particular section, while ignoring dumping of others. The patch fixes that. Differential revision: https://reviews.llvm.org/D49372 llvm-svn: 337361	2018-07-18 08:34:35 +00:00
George Rimar	6fdac3b23a	[llvm-readobj] - Teach tool to dump objects with >= SHN_LORESERVE of sections. http://www.sco.com/developers/gabi/2003-12-17/ch4.eheader.html says that e_shnum and/or e_shstrndx may have special values if "the number of sections is greater than or equal to SHN_LORESERVE" or "the section name string table section index is greater than or equal to SHN_LORESERVE (0xff00)" Previously llvm-readobj was unable to dump such files, patch changes that. I had to add a precompiled test case because it does not seem possible to prepare a test using yaml2obj or llvm-mc (not clear how to make .shstrtab to have index >= SHN_LORESERVE). Differential revision: https://reviews.llvm.org/D49369 llvm-svn: 337360	2018-07-18 08:19:58 +00:00
Simon Pilgrim	03164dfa5e	[llvm-mca][x86] Add extend, carry-flag and CMP instructions to general x86_64 resource tests llvm-svn: 337306	2018-07-17 17:47:35 +00:00
Simon Pilgrim	92da01fed9	[llvm-mca][x86] Add MOVBE resource tests to all supporting targets SNB doesn't support MOVBE but the numbers in Generic (which use the SNB model) look sane. llvm-svn: 337305	2018-07-17 17:41:45 +00:00
Simon Pilgrim	94049e8b15	[llvm-mca][x86] Add BSWAP resource tests llvm-svn: 337302	2018-07-17 17:10:47 +00:00
Simon Pilgrim	99a4f3195b	[llvm-mca][x86] Add displacement-only and additional scale=1 LEA tests llvm-svn: 337298	2018-07-17 16:17:33 +00:00
Simon Pilgrim	17d89ca70e	[llvm-mca][x86] Add LEA resource tests (PR32326) Add llvm-mca tests demonstrating how LEA instructions are currently modelled. Once this is working on btver2 I'll copy the test file to the other target directories. llvm-svn: 337297	2018-07-17 16:13:29 +00:00
Benjamin Kramer	e6dac13bab	[llvm-objcopy] Run not with any python, but the python configured in lit. llvm-svn: 337262	2018-07-17 10:30:56 +00:00
Jake Ehrlich	c7f8ac7896	[llvm-objcopy] Add support for large indexes This patch is an update of an older patch that never landed (see here: https://reviews.llvm.org/D42516) Recently various users have run into this issue and it just 100% has to be solved at this point. The main difference in this patch is that I use gunzip instead of unzip which should hopefully allow tests to pass. Please review this as if it is a new patch however. I found some issues along the way and made some minor modifications. The binary used in this patch for testing (a zip file to make it small) can be found here: https://drive.google.com/file/d/1UjsnTO9edLttZibbr-2T1bJl92KEQFAO/view?usp=sharing Differential Revision: https://reviews.llvm.org/D49206 llvm-svn: 337204	2018-07-16 19:48:52 +00:00
Joel Galenson	4099b249fb	[cfi-verify] Abort on unsupported targets As suggested in the review for r337007, this makes cfi-verify abort on unsupported targets instead of producing incorrect results. It also updates the design document to reflect this. Differential Revision: https://reviews.llvm.org/D49304 llvm-svn: 337181	2018-07-16 15:26:44 +00:00
Andrea Di Biagio	f84b0a6914	[llvm-mca] Regenerate X86 specific tests. NFC Not all tests were correctly updated by the update script after r336797. llvm-svn: 337124	2018-07-15 11:43:11 +00:00
Andrea Di Biagio	ff630c2cdc	[llvm-mca][BtVer2] teach how to identify false dependencies on partially written registers. The goal of this patch is to improve the throughput analysis in llvm-mca for the case where instructions perform partial register writes. On x86, partial register writes are quite difficult to model, mainly because different processors tend to implement different register merging schemes in hardware. When the code contains partial register writes, the IPC (instructions per cycles) estimated by llvm-mca tends to diverge quite significantly from the observed IPC (using perf). Modern AMD processors (at least, from Bulldozer onwards) don't rename partial registers. Quoting Agner Fog's microarchitecture.pdf: " The processor always keeps the different parts of an integer register together. For example, AL and AH are not treated as independent by the out-of-order execution mechanism. An instruction that writes to part of a register will therefore have a false dependence on any previous write to the same register or any part of it." This patch is a first important step towards improving the analysis of partial register updates. It changes the semantic of RegisterFile descriptors in tablegen, and teaches llvm-mca how to identify false dependences in the presence of partial register writes (for more details: see the new code comments in include/Target/TargetSchedule.h - class RegisterFile). This patch doesn't address the case where a write to a part of a register is followed by a read from the whole register. On Intel chips, high8 registers (AH/BH/CH/DH)) can be stored in separate physical registers. However, a later (dirty) read of the full register (example: AX/EAX) triggers a merge uOp, which adds extra latency (and potentially affects the pipe usage). This is a very interesting article on the subject with a very informative answer from Peter Cordes: https://stackoverflow.com/questions/45660139/how-exactly-do-partial-registers-on-haswell-skylake-perform-writing-al-seems-to In future, the definition of RegisterFile can be extended with extra information that may be used to identify delays caused by merge opcodes triggered by a dirty read of a partial write. Differential Revision: https://reviews.llvm.org/D49196 llvm-svn: 337123	2018-07-15 11:01:38 +00:00
Nico Weber	337e241d58	Attempt to get test/tools/llvm-lib/help.test passing on sanitizer-x86_64-linux-fast The bot has a /b directory, so /? matches against that and gets expanded to it. (Thanks to Hans's r187366, which solved the same problem for clang-cl a while ago and which saved me much head scratching.) llvm-svn: 337092	2018-07-14 11:33:33 +00:00
Nico Weber	17172c6b80	Give llvm-lib rudimentary help output. https://reviews.llvm.org/D49318 llvm-svn: 337084	2018-07-14 02:29:44 +00:00
Jonas Devlieghere	327e7a1608	[dwarfdump] Add pretty printer for accelerator table based on Atom. For instance, When dumping .apple_types, the second atom represents the DW_TAG. In addition to printing the raw value, we now also pretty print the value if the ATOM tells us how. llvm-svn: 337026	2018-07-13 17:21:51 +00:00
Andrea Di Biagio	e86e6efea1	[llvm-mca][BtVer2] Add tests for dependency breaking instructions. llvm-svn: 337024	2018-07-13 16:46:51 +00:00
Joel Galenson	667eac80da	[cfi-verify] Only run AArch64 tests when it is a supported target This stops the tests I added in r337007 from running when AArch64 is not a supported target. llvm-svn: 337012	2018-07-13 16:09:19 +00:00
Joel Galenson	06e7e5798f	[cfi-verify] Support AArch64. This patch adds support for AArch64 to cfi-verify. This required three changes to cfi-verify. First, it generalizes checking if an instruction is a trap by adding a new isTrap flag to TableGen (and defining it for x86 and AArch64). Second, the code that ensures that the operand register is not clobbered between the CFI check and the indirect call needs to allow a single dereference (in x86 this happens as part of the jump instruction). Third, we needed to ensure that return instructions are not counted as indirect branches. Technically, returns are indirect branches and can be covered by CFI, but LLVM's forward-edge CFI does not protect them, and x86 does not consider them, so we keep that behavior. In addition, we had to improve AArch64's code to evaluate the branch target of a MCInst to handle calls where the destination is not the first operand (which it often is not). Differential Revision: https://reviews.llvm.org/D48836 llvm-svn: 337007	2018-07-13 15:19:33 +00:00
Dean Michael Berris	10141261e1	[XRay][compiler-rt] Add PID field to llvm-xray tool and add PID metadata record entry in FDR mode Summary: llvm-xray changes: - account-mode - process-id {...} shows after thread-id - convert-mode - process {...} shows after thread - parses FDR and basic mode pid entries - Checks version number for FDR log parsing. Basic logging changes: - Update header version from 2 -> 3 FDR logging changes: - Update header version from 2 -> 3 - in writeBufferPreamble, there is an additional PID Metadata record (after thread id record and tsc record) Test cases changes: - fdr-mode.cc, fdr-single-thread.cc, fdr-thread-order.cc modified to catch process id output in the log. Reviewers: dberris Reviewed By: dberris Subscribers: hiraditya, llvm-commits, #sanitizers Differential Revision: https://reviews.llvm.org/D49153 llvm-svn: 336974	2018-07-13 05:38:22 +00:00
Bill Wendling	7bd9e94e38	[gold-plugin] Disable section ordering for relocatable links Not all programs want section ordering when compiled with LTO. In particular, the Linux kernel is very sensitive when it comes to linking, and doesn't boot when each function is placed in its own sections. Reviewed By: pcc Differential Revision: https://reviews.llvm.org/D48756 llvm-svn: 336943	2018-07-12 20:35:58 +00:00
Stephen Hines	e8c3c5fe5d	Add --strip-all option back to llvm-strip. Summary: This option appears to have been dropped as part of the refactoring in r331663. Unfortunately, if we want to use llvm-strip as a drop-in replacement for strip, this option should still be available. Reviewers: alexshap Reviewed By: alexshap Subscribers: meikeb, kongyi, chh, jakehehrlich, llvm-commits, pirama Differential Revision: https://reviews.llvm.org/D49226 llvm-svn: 336921	2018-07-12 17:42:17 +00:00
Paul Semel	0f9ca2d960	Revert "[llvm-objdump] Add -demangle (-C) option" This reverts commit 3a44ccd156e0edd2e89226f8ed63928e227900bb. This reverts commit d5cfc836bb5552e20507d3612d13ff66ff9e36a0. llvm-svn: 336829	2018-07-11 18:09:52 +00:00
Paul Semel	569200aab8	Fix llvm-objdump demangle test (added triple option) llvm-svn: 336821	2018-07-11 16:31:33 +00:00
Andrea Di Biagio	483db141e3	[X86] Fix MayLoad/HasSideEffect flag for (V)MOVLPSrm instructions. Before revision 336728, the "mayLoad" flag for instruction (V)MOVLPSrm was inferred directly from the "default" pattern associated with the instruction definition. r336728 removed special node X86Movlps, and all the patterns associated to it. Now instruction (V)MOVLPSrm doesn't have a pattern associated to it, and the 'mayLoad/hasSideEffects' flags are left unset. When the instruction info is emitted by tablegen, method CodeGenDAGPatterns::InferInstructionFlags() sees that (V)MOVLPSrm doesn't have a pattern, and flags are undefined. So, it conservatively sets the "hasSideEffects" flag for it. As a consequence, we were losing the 'mayLoad' flag, and we were gaining a 'hasSideEffect' flag in its place. This patch fixes the issue (originally reported by Michael Holmen). The mca tests show the differences in the instruction info flags. Instructions that were affected by this problem were: MOVLPSrm/VMOVLPSrm/VMOVLPSZ128rm. Differential Revision: https://reviews.llvm.org/D49182 llvm-svn: 336818	2018-07-11 15:27:50 +00:00
Paul Semel	bcf55ab95a	[llvm-objdump] Add -demangle (-C) option Differential Revision: https://reviews.llvm.org/D49043 llvm-svn: 336816	2018-07-11 15:25:39 +00:00
Andrea Di Biagio	d2e2c053cf	[llvm-mca] Use a different character to flag instructions with side-effects in the Instruction Info View. NFC This makes easier to identify changes in the instruction info flags. It also helps spotting potential regressions similar to the one recently introduced at r336728. Using the same character to mark MayLoad/MayStore/HasSideEffects is problematic for llvm-lit. When pattern matching substrings, llvm-lit consumes tabs and spaces. A change in position of the flag marker may not trigger a test failure. This patch only changes the character used for flag `hasSideEffects`. The reason why I didn't touch other flags is because I want to avoid spamming the mailing because of the massive diff due to the numerous tests affected by this change. In future, each instruction flag should be associated with a different character in the Instruction Info View. llvm-svn: 336797	2018-07-11 12:44:44 +00:00
Paul Semel	b98f504850	[llvm-readobj] Add -hex-dump (-x) option Differential Revision: https://reviews.llvm.org/D48281 llvm-svn: 336782	2018-07-11 10:00:29 +00:00
Andrea Di Biagio	2b3a4f9c9b	[llvm-mca] Add tests for partial register writes. llvm-mca doesn't know that on modern AMD processors, portions of a general purpose register are not treated independently. So, a partial register write has a false dependency on the super-register. The issue with partial register writes will be addressed by a follow-up patch. llvm-svn: 336778	2018-07-11 09:50:00 +00:00
Jonas Devlieghere	82dee6aca8	[dsymutil] Add support for outputting assembly When implementing the DWARF accelerator tables in dsymutil I ran into an assertion in the assembler. Debugging these kind of issues is a lot easier when looking at the assembly instead of debugging the assembler itself. Since it's only a matter of creating an AsmStreamer instead of a MCObjectStreamer it made sense to turn this into a (hidden) dsymutil feature. Differential revision: https://reviews.llvm.org/D49079 llvm-svn: 336561	2018-07-09 16:58:48 +00:00
Andrea Di Biagio	8834779644	[llvm-mca] report an error if the assembly sequence contains an unsupported instruction. This is a short-term fix for PR38093. For now, we llvm::report_fatal_error if the instruction builder finds an unsupported instruction in the instruction stream. We need to revisit this fix once we start addressing PR38101. Essentially, we need a better framework for error handling. llvm-svn: 336543	2018-07-09 12:30:55 +00:00
Roman Lebedev	0e58dee284	[MCA][X86][NFC] Add BSF/BSR resource tests Reviewers: RKSimon, andreadb, courbet Reviewed By: RKSimon Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D48997 llvm-svn: 336510	2018-07-08 09:50:14 +00:00
Alexander Shaposhnikov	42b5ef0269	[llvm-objcopy] Add support for static libraries This diff adds support for handling static libraries to llvm-objcopy and llvm-strip. Test plan: make check-all Differential revision: https://reviews.llvm.org/D48413 llvm-svn: 336455	2018-07-06 17:51:03 +00:00
Andrea Di Biagio	61c52af9d9	[llvm-mca] improve the instruction issue logic implemented by the Scheduler. This patch modifies the Scheduler heuristic used to select the next instruction to issue to the pipelines. The motivating example is test X86/BtVer2/add-sequence.s, for which llvm-mca wrongly reported an estimated IPC of 1.50. According to perf, the actual IPC for that test should have been ~2.00. It turns out that an IPC of 2.00 for test add-sequence.s cannot possibly be predicted by a Scheduler that only prioritizes instructions based on their "age". A similar issue also affected test X86/BtVer2/dependent-pmuld-paddd.s, for which llvm-mca wrongly estimated an IPC of 0.84 instead of an IPC of 1.00. Instructions in the ReadyQueue are now ranked based on two factors: - The "age" of an instruction. - The number of unique users of writes associated with an instruction. The new logic still prioritizes older instructions over younger instructions to minimize the pressure on the reorder buffer. However, the number of users of an instruction now also affects the overall rank. This potentially increases the ability of the Scheduler to extract instruction level parallelism. This patch fixes the problem with the wrong IPC reported for test add-sequence.s and test dependent-pmuld-paddd.s. llvm-svn: 336420	2018-07-06 08:08:30 +00:00
Dave Lee	390abe4a75	Reapply: "objdump: Support newer ObjC image info flags" Summary: Add support for two additional ObjC image info flags: `IS_SIMULATED` and `HAS_CATEGORY_CLASS_PROPERTIES`. `IS_SIMULATED` indicates a Mach-O binary built for iOS simulator. `HAS_CATEGORY_CLASS_PROPERTIES` indicates a Mach-O binary built by a compiler that supports class properties in categories. Reviewers: enderby, compnerd Reviewed By: compnerd Subscribers: keith, llvm-commits Differential Revision: https://reviews.llvm.org/D48568 llvm-svn: 336411	2018-07-06 05:11:35 +00:00
Dave Lee	e6de96410b	Revert "objdump: Support newer ObjC image info flags" This reverts commit 8c4cc472e7a67bd3b2b20cc4cf32d31af29bc7e9. llvm-svn: 336402	2018-07-06 00:13:21 +00:00
Dave Lee	9e412ec8f2	objdump: Support newer ObjC image info flags Summary: Add support for two additional ObjC image info flags: `IS_SIMULATED` and `HAS_CATEGORY_CLASS_PROPERTIES`. `IS_SIMULATED` indicates a Mach-O binary built for iOS simulator. `HAS_CATEGORY_CLASS_PROPERTIES` indicates a Mach-O binary built by a compiler that supports class properties in categories. Reviewers: enderby, compnerd Reviewed By: compnerd Subscribers: keith, llvm-commits Differential Revision: https://reviews.llvm.org/D48568 llvm-svn: 336399	2018-07-05 23:32:15 +00:00
Paul Semel	91c9d4251c	[llvm-objdump] Removed archive-headers-disas test This test is failing because of the disas part. For the moment, I will juste remove it. I will add it again tomorrow with a proper fix. llvm-svn: 336370	2018-07-05 16:49:46 +00:00
Paul Semel	63e4008718	[llvm-objcopy] Fix timezone dependant tests llvm-svn: 336363	2018-07-05 15:24:11 +00:00
Paul Semel	0dc92f6a74	[llvm-objdump] Add --archive-headers (-a) option llvm-svn: 336357	2018-07-05 14:43:29 +00:00
Roman Lebedev	0dd27042c6	[X86][BtVer2][MCA][NFC] Add CMPEQ dependency-breaking one-idioms tests Summary: As per `Agner's Microarchitecture doc (21.8 AMD Bobcat and Jaguar pipeline - Dependency-breaking instructions)`, these, like zero-idioms, are dependency-breaking, although they produce ones and still consume resources. FIXME: as discussed in D48877, llvm-mca handling is broken for these. Reviewers: andreadb Reviewed By: andreadb Subscribers: gbedwell, RKSimon, llvm-commits Differential Revision: https://reviews.llvm.org/D48876 llvm-svn: 336292	2018-07-04 17:32:44 +00:00
Paul Semel	d2af4d6f1b	[llvm-objdump] Add --file-headers (-f) option llvm-svn: 336284	2018-07-04 15:25:03 +00:00
Gabor Buella	da4a966e1c	NFC - Various typo fixes in tests llvm-svn: 336268	2018-07-04 13:28:39 +00:00
Teresa Johnson	50615c72b4	Remove absolute path in test My test change in r336148 accidentally included an absolute path, clean that up to fix bot failures. llvm-svn: 336151	2018-07-02 23:02:07 +00:00
Teresa Johnson	8fc766681d	[ThinLTO] Fix printing of module paths for distributed backend indexes Summary: In the individual index files emitted for distributed ThinLTO backends, the module path ids are not contiguous. Assign slots to module paths in order to handle this better and also to get contiguous numbering in the summary assembly. Reviewers: davidxl, dexonsmith Subscribers: mehdi_amini, inglorion, eraman, llvm-commits, steven_wu Differential Revision: https://reviews.llvm.org/D48698 llvm-svn: 336148	2018-07-02 22:09:23 +00:00
Fangrui Song	f50ad6c311	Replace unused output filenames with /dev/null in tests Similar to rLLD336129 llvm-svn: 336131	2018-07-02 18:16:44 +00:00
Dave Lee	d4f77a523b	nm: Add -no-weak flag for hiding weak symbols Summary: This adds a new -no-weak flag to nm to hide weak symbols in its output. This also adds a -W alias for this which is analogous to -U. Patch by Keith Smiley Reviewers: kastiglione, enderby, compnerd Reviewed By: kastiglione Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D48751 llvm-svn: 336126	2018-07-02 17:24:37 +00:00
Paul Semel	8dabda70af	Revert "[llvm-readobj] Fix printing format" There is a problem with the formatting on windows build. I need to investigate on this. llvm-svn: 336061	2018-07-01 11:54:09 +00:00
Paul Semel	49997adc88	[llvm-readobj] Fix printing format We were printing every character, even those that weren't printable. It doesn't really make sense for this option. The string content was sticked to its address, added two spaces in between. Differential Revision: https://reviews.llvm.org/D48271 llvm-svn: 336058	2018-07-01 09:51:59 +00:00
Jonas Devlieghere	a0857eaefe	[dsymutil] Make the CachedBinaryHolder the default Replaces all uses of the old binary holder with its cached variant. Differential revision: https://reviews.llvm.org/D48770 llvm-svn: 335991	2018-06-29 16:51:52 +00:00
Sterling Augustine	0cf1f15e83	Require x86 for this test. llvm-svn: 335939	2018-06-28 23:22:14 +00:00
Jake Ehrlich	0f440d832f	[llvm-readobj] Add experimental support for SHT_RELR sections This change adds experimental support for SHT_RELR sections, proposed here: https://groups.google.com/forum/#!topic/generic-abi/bX460iggiKg Definitions for the new ELF section type and dynamic array tags, as well as the encoding used in the new section are all under discussion and are subject to change. Use with caution! Author: rahulchaudhry Differential Revision: https://reviews.llvm.org/D47919 llvm-svn: 335922	2018-06-28 21:07:34 +00:00
Sterling Augustine	052ce120d5	Some targets don't have lld built, so just use a binary copy of the input file. llvm-svn: 335908	2018-06-28 19:47:23 +00:00
Sterling Augustine	bc78b62169	Handle absolute symbols as branch targets in disassembly. https://reviews.llvm.org/D48554 llvm-svn: 335903	2018-06-28 18:57:13 +00:00
Simon Pilgrim	83125594ed	[llvm-mca][x86] Add FMA4 resource tests We should be ensuring we have (near) complete test coverage of instructions, at least for the generic model. llvm-svn: 335870	2018-06-28 16:24:13 +00:00
Simon Pilgrim	12f9503d40	[llvm-mca][x86] Add 3dnow! resource tests We should be ensuring we have (near) complete test coverage of instructions, at least for the generic model. llvm-svn: 335869	2018-06-28 16:21:22 +00:00
Fangrui Song	ee15d3dcdb	Move `REQUIRES:` line to the top llvm-svn: 335635	2018-06-26 17:44:23 +00:00
Tim Northover	f2f9f2f505	ARM: add binary file git swallowed. Should fix bots. llvm-svn: 335596	2018-06-26 12:28:47 +00:00
Tim Northover	bf54858115	ARM: diagnose unpredictable IT instructions IT instructions are allowed to have the 'AL' predicate, but it must never result in an 'NV' predicated instruction. Essentially this means that all branches must be 't' rather than 'e' if the predicate is 'AL'. This patch adds a diagnostic for this during assembly (error because parsing hits an assertion if allowed to continue) and an annotation during disassembly. llvm-svn: 335593	2018-06-26 11:38:41 +00:00
Vedant Kumar	b725c69f12	[SelectionDAG] Remove debug locations from ConstantSD(FP)Nodes This removes debug locations from ConstantSDNode and ConstantSDFPNode. When this kind of node is materialized we no longer create a line table entry which jumps back to the constant's first point of use. This makes single-stepping behavior smoother, and it matches the model used by IR, where Constants have no locations. See this thread for more context: http://lists.llvm.org/pipermail/llvm-dev/2018-June/124164.html I'd like to handle constant BuildVectorSDNodes and to try to eliminate passing SDLocs to SelectionDAG::getConstant*() in follow-up commits. Differential Revision: https://reviews.llvm.org/D48468 llvm-svn: 335497	2018-06-25 17:06:18 +00:00
Jonas Devlieghere	fb54074112	[llvm-mt] Use WithColor for printing errors. Use the WithColor helper from support to print errors. llvm-svn: 335416	2018-06-23 16:49:07 +00:00
Eugene Leviant	da873b5e2e	[LIT] Enable testing of LLVM gold plugin on Mac OS X Differential revision: https://reviews.llvm.org/D48350 llvm-svn: 335136	2018-06-20 15:32:47 +00:00
Andrea Di Biagio	2145b13fc9	[llvm-mca][X86] Teach how to identify register writes that implicitly clear the upper portion of a super-register. This patch teaches llvm-mca how to identify register writes that implicitly zero the upper portion of a super-register. On X86-64, a general purpose register is implemented in hardware as a 64-bit register. Quoting the Intel 64 Software Developer's Manual: "an update to the lower 32 bits of a 64 bit integer register is architecturally defined to zero extend the upper 32 bits". Also, a write to an XMM register performed by an AVX instruction implicitly zeroes the upper 128 bits of the aliasing YMM register. This patch adds a new method named clearsSuperRegisters to the MCInstrAnalysis interface to help identify instructions that implicitly clear the upper portion of a super-register. The rest of the patch teaches llvm-mca how to use that new method to obtain the information, and update the register dependencies accordingly. I compared the kernels from tests clear-super-register-1.s and clear-super-register-2.s against the output from perf on btver2. Previously there was a large discrepancy between the estimated IPC and the measured IPC. Now the differences are mostly in the noise. Differential Revision: https://reviews.llvm.org/D48225 llvm-svn: 335113	2018-06-20 10:08:11 +00:00
Roman Lebedev	d23b6831de	[X86][Znver1] Specify Register Files, RCU; FP scheduler capacity. Summary: First off: i do not have any access to that processor, so this is purely theoretical, no benchmarks. I have been looking into bdver2 scheduling profile, and while cross-referencing the existing btver2, znver1 profiles, and the reference docs (`Software Optimization Guide for AMD Family {15,16,17}h Processors`), i have noticed that only btver2 scheduling profile specifies these. Also, there is no mca test coverage. Reviewers: RKSimon, craig.topper, courbet, GGanesh, andreadb Reviewed By: GGanesh Subscribers: gbedwell, vprasad, ddibyend, shivaram, Ashutosh, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D47676 llvm-svn: 335099	2018-06-20 07:01:14 +00:00
Clement Courbet	e0aa30008f	[X86] Fix r335097 Missed `Generic` test in llvm-mca. llvm-svn: 335098	2018-06-20 06:44:13 +00:00
Clement Courbet	7b9913fb9f	[X86] Add sched class WriteLAHFSAHF and fix values. Summary: I ran llvm-exegesis on SKX, SKL, BDW, HSW, SNB. Atom is from Agner and SLM is a guess. I've left AMD processors alone. Reviewers: RKSimon, craig.topper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D48079 llvm-svn: 335097	2018-06-20 06:13:39 +00:00
Roman Lebedev	ae0527aac9	[MCA][NFC] Add generic XOP resource tests Summary: Based on * [[ https://support.amd.com/TechDocs/43479.pdf \| AMD64 Architecture Programmer’s Manual Volume 6: 128-Bit and 256-Bit XOP and FMA4 Instructions ]], * [[ https://support.amd.com/TechDocs/24594.pdf \| AMD64 Architecture Programmer’s Manual Volume 3: General-Purpose and System Instructions]], * https://en.wikipedia.org/wiki/XOP_instruction_set Appears to be only supported in AMD's 15h generation, so only in bdver[1-4], for which currently llvm has no scheduling profiles. Reviewers: RKSimon, craig.topper, andreadb, spatel Reviewed By: RKSimon Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D48264 llvm-svn: 335034	2018-06-19 09:21:27 +00:00
Roman Lebedev	0d12c1685b	[MCA][NFC] Add generic TBM resource tests Summary: Based on https://support.amd.com/TechDocs/24594.pdf, https://en.wikipedia.org/wiki/Bit_Manipulation_Instruction_Sets#TBM_(Trailing_Bit_Manipulation) Appears to be only supported in AMD's 15h generation, so only in bdver[1-4], for which currently llvm has no scheduling profiles. Reviewers: RKSimon, craig.topper, simark, andreadb Reviewed By: RKSimon Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D48252 llvm-svn: 335033	2018-06-19 09:21:22 +00:00
Andrea Di Biagio	a88281d8ae	[llvm-mca] Use an ordered map to collect hardware statistics. NFC. Histogram entries are now ordered by key. This should improves their readability when statistics are printed. llvm-svn: 334961	2018-06-18 17:04:56 +00:00
Andrea Di Biagio	487da729a2	[llvm-mca] Add tests for XOP and AVX512 instructions that implicitly clear the upper portion of a super-register. When the destination register of a XOP instruction is an XMM register, bits [255:128] of the corresponding YMM register are cleared. When the destination register of a EVEX encoded instruction is an XMM/YMM register, the upper bits of the corresponding ZMM are cleared. On processors that feature AVX512, a write to an XMM registers always clears the upper portion of the corresponding ZMM register if the instruction is VEX or EVEX encoded. These new tests show some interesting cases which aren't correctly analyzed by llvm-mca. The lack of knowledge related to the implicit update on the super-registers is addressed by D48225. llvm-svn: 334945	2018-06-18 14:00:30 +00:00
Clement Courbet	0d9da88d18	[X86] Fix NOOP sched overrides on BDW/HSW/SKL. Summary: Noop certainly does not use resources. Reviewers: RKSimon, craig.topper, andreadb Subscribers: gbedwell, llvm-commits, gchatelet Differential Revision: https://reviews.llvm.org/D48028 llvm-svn: 334927	2018-06-18 06:48:22 +00:00
Simon Pilgrim	e930f569f7	[llvm-mca][X86] Add some avx512f/avx512vl resource test placeholders There are a lot of instructions to add under these ISAs (and the other AVX512 variants) but this should demonstrate how to test for the EVEX instructions with different maskings llvm-svn: 334907	2018-06-17 16:25:48 +00:00
Simon Pilgrim	f5ecd8d50d	[llvm-mca][x86] Add Generic cpu resource tests Added a Generic x86 cpu set of resource tests to allow us to check all ISAs. We currently use SandyBridge as our generic CPU model, but it's better if we actually duplicate these tests for if/when we change the model, it also means we don't end up polluting the SandyBridge folder with tests for ISAs it doesn't support. llvm-svn: 334853	2018-06-15 18:35:25 +00:00
Roman Lebedev	9ddf128f79	[MCA] Add -summary-view option Summary: While that is indeed a quite interesting summary stat, there are cases where it does not really add anything other than consuming extra lines. Declutters the output of D48190. Reviewers: RKSimon, andreadb, courbet, craig.topper Reviewed By: andreadb Subscribers: javed.absar, gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D48209 llvm-svn: 334833	2018-06-15 14:01:43 +00:00
Roman Lebedev	7c423001e4	[MCA][x86][NFC] Add tests for -register-file-stats, -scheduler-stats Summary: There does not seem to be any other tests for this. Split off from D47676. Reviewers: RKSimon, craig.topper, courbet, andreadb Reviewed By: andreadb Subscribers: javed.absar, gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D48190 llvm-svn: 334832	2018-06-15 14:01:35 +00:00
Andrea Di Biagio	4cafb297d5	[llvm-mca] Add tests for instructions that implicitly clear the upper portion of a super-register. On x86-64, a write to register EAX implicitly clears the upper half or RAX. 128-bit AVX instructions clear the upper 128-bit of the YMM register that aliases the XMM definition register. llvm-mca doesn't know about register writes that implicitly clear the upper portion of an aliasing super-register. This issue will be fixed in a future patch. llvm-svn: 334742	2018-06-14 17:48:42 +00:00
Andrea Di Biagio	4729d1ff27	[llvm-mca] Add another test for partial register stalls. This test checks that a physical register is correctly allocated for the partial write to register BX. The ADD instruction has to wait for the write to RBX (and BX) before being executed. llvm-svn: 334730	2018-06-14 15:54:34 +00:00
Andrea Di Biagio	0ffb2271a1	[llvm-mca] Fixed a bug in the logic that checks if a memory operation is ready to execute. Fixes PR37790. In some (very rare) cases, the LSUnit (Load/Store unit) was wrongly marking a load (or store) as "ready to execute" effectively bypassing older memory barrier instructions. To reproduce this bug, the memory barrier must be the first instruction in the input assembly sequence, and it doesn't have to perform any register writes. llvm-svn: 334633	2018-06-13 18:30:14 +00:00
Pavel Labath	4adc88ed25	[DWARF/AccelTable] Remove getDIESectionOffset for DWARF v5 entries Summary: This method was not correct for entries in DWO files as it assumed it could just add up the CU and DIE offsets to get the absolute DIE offset. This is not correct for the DWO files, as here the CU offset will reference the skeleton unit, whereas the DIE offset will be the offset in the full unit in the DWO file. Unfortunately, this means that we are not able to determine the absolute DIE offset using the information in the .debug_names section alone, which means we have to offload some of this work to the users of this class. To demonstrate how this can be done, I've added/fixed the ability to lookup entries using accelerator tables in DWO files in llvm-dwarfdump. To make this happen, I've needed to make two extra changes in other classes: - made the DWARFContext method to lookup a CU based on the section offset public. I've needed this functionality to lookup a CU, and this seems like a useful thing in general. - made DWARFUnit::getDWOId call extractDIEsIfNeeded. Before this, the DWOId was filled in only if the root DIE happened to be parsed before we called the accessor. Since the lazy parsing is supposed to happen under the hood, calling extractDIEsIfNeeded seems appropriate. Reviewers: JDevlieghere, aprantl, dblaikie Subscribers: mgrang, llvm-commits Differential Revision: https://reviews.llvm.org/D48009 llvm-svn: 334578	2018-06-13 08:14:27 +00:00
Clement Courbet	7db69cc08a	[X86] Fix skylake server scheduling info. Summary: This fixes most of the scheduling info for SKX vector operations. I had to split a lot of the YMM/ZMM classes into separate classes for YMM and ZMM. The before/after llvm-exegesis analysis are in the phabricator diff. Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D47721 llvm-svn: 334407	2018-06-11 14:37:53 +00:00
Simon Pilgrim	89deac6694	[X86][BtVer2] Add support for all SUB/XOR 32/64 scalar instructions that should match the dependency-breaking 'zero-idiom' As detailed on Agner's Microarchitecture doc (21.8 AMD Bobcat and Jaguar pipeline - Dependency-breaking instructions), these instructions are dependency breaking and fast-path zero the destination register (and appropriate EFLAGS bits). llvm-svn: 334303	2018-06-08 17:00:45 +00:00
Simon Pilgrim	efb4806bb9	[X86][BtVer2] Remove SBB tests that were accidentally added in rL334296 These aren't true zero-idiom instructions (just dependency breaking). llvm-svn: 334297	2018-06-08 15:43:00 +00:00
Simon Pilgrim	53766a986d	[X86][BtVer2] Add tests for scalar SUB/XOR instructions that should match the dependency-breaking 'zero-idiom' As detailed on Agner's Microarchitecture doc (21.8 AMD Bobcat and Jaguar pipeline - Dependency-breaking instructions). llvm-svn: 334296	2018-06-08 15:28:43 +00:00
Simon Pilgrim	aafcf9e4a1	[X86][BtVer2] Limit zero idiom tests to a single iteration. Reduces output size and we're only wanting to check that the instructions are fast-path'd (just Dispatch+Retire) anyhow llvm-svn: 334292	2018-06-08 15:01:40 +00:00
Paul Semel	e57bc78324	[llvm-strip] Expose --strip-unneeded option Differential Revision: https://reviews.llvm.org/D47818 llvm-svn: 334182	2018-06-07 10:05:25 +00:00
Peter Collingbourne	cf017ada68	llvm-readobj: fix printing number of relocations in Android packed format. With '-elf-output-style=GNU -relocations', a header containing the number of entries is printed before all the relocation entries in the section. For Android packed format, we need to perform the unpacking first before we can get the actual number of relocations in the section. Patch by Rahul Chaudhry! Differential Revision: https://reviews.llvm.org/D47800 llvm-svn: 334147	2018-06-07 00:02:07 +00:00
Alexander Shaposhnikov	29407f3abe	[llvm-strip] Expose --discard-all option Expose objcopy's --discard-all option in llvm-strip. Test plan: make check-all Differential revision: https://reviews.llvm.org/D47750 llvm-svn: 334131	2018-06-06 21:23:19 +00:00
Simon Pilgrim	aef5bdbea1	[X86][BtVer2] Add support for all vector instructions that should match the dependency-breaking 'zero-idiom' As detailed on Agner's Microarchitecture doc (21.8 AMD Bobcat and Jaguar pipeline - Dependency-breaking instructions), all these instructions are dependency breaking and zero the destination register. llvm-svn: 334119	2018-06-06 19:06:09 +00:00
Simon Pilgrim	7a48bb6e44	[llvm-mca][x86] Fix all resources-x86_64.s tests to use different registers in reg-reg cases I noticed while working on zero-idiom + dependency-breaking support (PR36671) that most of our binary instruction tests were reusing the same src registers, which would cause the tests to fail once we enable scalar zero-idiom support on btver2. Fixed in all targets to keep them in sync. llvm-svn: 334110	2018-06-06 18:20:25 +00:00
Simon Pilgrim	64541ff297	[X86][BtVer2] Add tests for all vector instructions that should match the dependency-breaking 'zero-idiom' As detailed on Agner's Microarchitecture doc (21.8 AMD Bobcat and Jaguar pipeline - Dependency-breaking instructions), all these instructions are dependency breaking and zero the destination register. TODO: Scalar instructions still need to be tested (need to check EFLAGS handling). llvm-svn: 334104	2018-06-06 16:14:37 +00:00
Sanjay Patel	59313be8d3	[CodeGen] assume max/default throughput for unspecified instructions This is a fix for the problem arising in D47374 (PR37678): https://bugs.llvm.org/show_bug.cgi?id=37678 We may not have throughput info because it's not specified in the model or it's not available with variant scheduling, so assume that those instructions can execute/complete at max-issue-width. Differential Revision: https://reviews.llvm.org/D47723 llvm-svn: 334055	2018-06-05 23:34:45 +00:00
Andrea Di Biagio	757600bccb	[llvm-mca] Correctly update the CyclesLeft of a register read in the presence of partial register updates. This patch fixe the logic in ReadState::cycleEvent(). That method was not correctly updating field `TotalCycles`. Added extra code comments in class ReadState to better describe each field. llvm-svn: 334028	2018-06-05 17:12:02 +00:00
Alexander Shaposhnikov	d7eaf27654	[llvm-strip] Add missing aliases for --strip-debug Add missing aliases for --strip-debug: -g, -S, -d. Test plan: make check-all Differential revision: https://reviews.llvm.org/D47674 llvm-svn: 333940	2018-06-04 18:55:41 +00:00
Andrea Di Biagio	39e5a5695f	[RFC][patch 3/3] Add support for variant scheduling classes in llvm-mca. This patch is the last of a sequence of three patches related to LLVM-dev RFC "MC support for variant scheduling classes". http://lists.llvm.org/pipermail/llvm-dev/2018-May/123181.html This fixes PR36672. The main goal of this patch is to teach llvm-mca how to solve variant scheduling classes. This patch does that, plus it adds new variant scheduling classes to the BtVer2 scheduling model to identify so-called zero-idioms (i.e. so-called dependency breaking instructions that are known to generate zero, and that are optimized out in hardware at register renaming stage). Without the BtVer2 change, this patch would not have had any meaningful tests. This patch is effectively the union of two changes: 1) a change that teaches llvm-mca how to resolve variant scheduling classes. 2) a change to the BtVer2 scheduling model that allows us to special-case packed XOR zero-idioms (this partially fixes PR36671). Differential Revision: https://reviews.llvm.org/D47374 llvm-svn: 333909	2018-06-04 15:43:09 +00:00
Alexander Ivchenko	ab60a2823f	[llvm-readobj] Support GNU_PROPERTY_X86_FEATURE_1_AND notes in .note.gnu.property Resubmit of r333424. This version contains the fix for fails found by buildbots on some targets. This patch allows parsing GNU_PROPERTY_X86_FEATURE_1_AND notes in .note.gnu.property sections. These notes indicate that the object file is built to support Intel CET. patch by mike.dvoretsky Differential Revision: https://reviews.llvm.org/D47473 llvm-svn: 333908	2018-06-04 15:14:18 +00:00
Greg Bedwell	bbe64af0a0	[llvm-mca] Regenerate a test to remove a double newline Command used: py update_mca_test_checks.py ..\test\tools\llvm-mca\\.s ..\test\tools\llvm-mca\\\*.s llvm-svn: 333893	2018-06-04 12:30:03 +00:00
Roman Lebedev	7b53d1454f	[llvm-mca] Make sure not to end the test files with an empty line. Summary: It's super irritating. [properly configured] git client then complains about that double-newline, and you have to use `--force` to ignore the warning, since even if you fix it manually, it will be reintroduced the very next runtime :/ Reviewers: RKSimon, andreadb, courbet, craig.topper, javed.absar, gbedwell Reviewed By: gbedwell Subscribers: javed.absar, tschuett, gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D47697 llvm-svn: 333887	2018-06-04 11:48:46 +00:00
Paul Semel	46201fb7bc	[llvm-objcopy] Fix null symbol handling This fixes the bug where strip-all option was leading to a malformed outputted ELF file. Differential Revision: https://reviews.llvm.org/D47414 llvm-svn: 333772	2018-06-01 16:19:46 +00:00
Alexander Shaposhnikov	ecc84834b7	[llvm-strip] Add -o option to llvm-strip This diff implements the option -o for specifying a file to write the output to. Test plan: make check-all Differential revision: https://reviews.llvm.org/D47505 llvm-svn: 333693	2018-05-31 20:42:13 +00:00
Clement Courbet	2e41c5a79c	[X86] Introduce WriteFLDC for x87 constant loads. Summary: {FLDL2E, FLDL2T, FLDLG2, FLDLN2, FLDPI} were using WriteMicrocoded. - I've measured the values for Broadwell, Haswell, SandyBridge, Skylake. - For ZnVer1 and Atom, values were transferred form InstRWs. - For SLM and BtVer2, I've guessed some values :( Reviewers: RKSimon, craig.topper, andreadb Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D47585 llvm-svn: 333656	2018-05-31 14:22:01 +00:00
Clement Courbet	b78ab5097d	[X86] Extract latency of fldz/fld1 in separate classes. Summary: - I've measured the values for Broadwell, Haswell, SandyBridge, Skylake. - For ZnVer1 and Atom, values were transferred form `InstRW`s. - For SLM and BtVer2, values are from Agner. This is split off from https://reviews.llvm.org/D47377 Reviewers: RKSimon, andreadb Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D47523 llvm-svn: 333642	2018-05-31 11:41:27 +00:00
Pavel Labath	59870af66f	DWARFAcceleratorTable: fix equal_range iterators Summary: Both (Apple and DWARF5) implementations of the iterators had bugs which resulted in crashes if one attempted to iterate through the accelerator tables all the way. For the Apple tables, the issue was that we did not clear the DataOffset field when we reached the end, which made our iterator compare unequal to the "end" iterator. For the Dwarf5 tables, the problem was that we incremented the CurrentIndex pointer and then used the incremented (possibly invalid) pointer to check whether we have reached the end of the index list. The reason these bugs went undetected is because their only user (dwarfdump) only ever searched for the first match. Besides allowing us to test this fix, changing llvm-dwarfdump --find to display all matches seems like a good improvement (it makes the behavior consistent with the --name option), so I change llvm-dwarfdump to do that. The existing tests would be sufficient to test this fix with the new llvm-dwarfdump behavior, but I add a special test that demonstrates that the tool indeed displays multiple results. The find.test test needed to be tweaked a bit as the tool now does not print the ".debug_info contents" header (also consistent with how --name works). Reviewers: JDevlieghere, aprantl, dblaikie Subscribers: mgrang, llvm-commits Differential Revision: https://reviews.llvm.org/D47543 llvm-svn: 333635	2018-05-31 08:47:00 +00:00
Vedant Kumar	e3c1fb8b12	[llvm-cov] Use the new PrintHTMLEscaped utility This removes some duplicate logic to escape characters in HTML output. llvm-svn: 333608	2018-05-30 23:35:14 +00:00
Peter Collingbourne	1651ac13be	llvm-objcopy: Set sh_link to 0 on unrecognized symtab-linked sections. Per discussion on the generic-abi mailing list: https://groups.google.com/forum/#!topic/generic-abi/MPr8TVtnVn4 An object file manipulation tool must either write out a symbol table with the same number of entries as the original symbol table and in the same order, or if this is impossible, refuse to operate on the object file if it has unrecognized sections that are linked to the symtab section. However, existing tools (namely GNU strip, GNU objcopy and ld.{bfd,gold,lld} -r) do not comply with this at present: they change symbol table indexes and set sh_link to 0 on the unrecognized symtab-linked sections. We intend to use the latter as a (temporary) signal that a tool has operated on a proposed new symtab-linked section and invalidated the symbol table indexes. However, llvm-objcopy currently keeps sh_link pointing to the new symtab section. This patch changes llvm-objcopy to set sh_link to 0 to match the behaviour of the other tools. Differential Revision: https://reviews.llvm.org/D47404 llvm-svn: 333581	2018-05-30 19:30:39 +00:00
Galina Kistanova	df917811ca	Reverted r333424 as it broke multiple build bots and left unfixed for a long time llvm-svn: 333578	2018-05-30 18:51:08 +00:00
Jonas Devlieghere	f4ce54a123	[dsymutil] Escape HTML special characters in plist. When printing string in the Plist, we weren't escaping the characters which lead to invalid XML. This patch adds the escape logic to StringExtras. rdar://39785334 llvm-svn: 333565	2018-05-30 17:47:11 +00:00
Alexander Ivchenko	6572425462	[llvm-readobj] Support GNU_PROPERTY_X86_FEATURE_1_AND notes in .note.gnu.property This patch allows parsing GNU_PROPERTY_X86_FEATURE_1_AND notes in .note.gnu.property sections. These notes indicate that the object file is built to support Intel CET. patch by mike.dvoretsky Differential Revision: https://reviews.llvm.org/D47473 llvm-svn: 333424	2018-05-29 14:49:51 +00:00
Clement Courbet	07c9ec6f2e	[X86][Sched] Add InstRW for CLC on Intel after SNB. Summary: After SNB, Intel CPUs can rename CF independently of other EFLAGS, so the renamer can zero it for free. Note that STC still consumes resources. To reproduce: `$ llvm-exegesis -mode=uops -opcode-name=CLC` On SNB: ``` --- key: opcode_name: CLC mode: uops config: '' cpu_name: sandybridge llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 10000 measurements: - { key: '3', value: 0.0014, debug_string: SBPort0 } - { key: '4', value: 0.0013, debug_string: SBPort1 } - { key: '5', value: 0.0003, debug_string: SBPort4 } - { key: '6', value: 0.0029, debug_string: SBPort5 } - { key: '10', value: 0.0003, debug_string: SBPort23 } error: '' info: 'instruction is serial, repeating a random one. Snippet: CLC ' ... ``` On HSW: ``` --- key: opcode_name: CLC mode: uops config: '' cpu_name: haswell llvm_triple: x86_64-unknown-linux-gnu num_repetitions: 10000 measurements: - { key: '3', value: 0.001, debug_string: HWPort0 } - { key: '4', value: 0.0009, debug_string: HWPort1 } - { key: '5', value: 0.0004, debug_string: HWPort2 } - { key: '6', value: 0.0006, debug_string: HWPort3 } - { key: '7', value: 0.0002, debug_string: HWPort4 } - { key: '8', value: 0.0012, debug_string: HWPort5 } - { key: '9', value: 0.0022, debug_string: HWPort6 } - { key: '10', value: 0.0001, debug_string: HWPort7 } error: '' info: 'instruction is serial, repeating a random one. Snippet: CLC ' ... ``` Reviewers: craig.topper, RKSimon Subscribers: gchatelet, llvm-commits Differential Revision: https://reviews.llvm.org/D47362 llvm-svn: 333392	2018-05-29 06:19:39 +00:00
Jonas Devlieghere	cb547cbb5c	[dwarfdump] Make -c and -p work together When requesting to dump both the parent chain and children, we used to print the DIE more than once because we propagated the dump options to the parent without clearing the respective flags. This commit fixes this oversight and adds a test. rdar://39415292 Differential revision: https://reviews.llvm.org/D47263 llvm-svn: 333350	2018-05-26 19:39:56 +00:00
Paul Semel	cf51c80bf1	[llvm-objcopy] Add --keep-file-symbols option This option prevent from removing file symbols while removing symbols. Differential Revision: https://reviews.llvm.org/D46830 llvm-svn: 333339	2018-05-26 08:10:37 +00:00
Simon Pilgrim	0155bf0da9	[X86][SNB] Fix differences between vex/non-vex XMM vector moves (PR37286) As confirmed by llvm-exegesis, there is no scheduler difference between MOVDQA/MOVDQU and VMOVDQA/VMOVDQU xmm reg-reg moves Another chapter in the never ending crusade to remove useless InstRW overrides from the x86 scheduler models...... llvm-svn: 333271	2018-05-25 12:18:11 +00:00
Paul Semel	99dda0bab8	[llvm-objcopy] Add --strip-unneeded option Differential Revision: https://reviews.llvm.org/D46896 llvm-svn: 333267	2018-05-25 11:01:25 +00:00
Greg Bedwell	e790f6fb06	[UpdateTestChecks] Improved update_mca_test_checks block analysis Previously update_mca_test_checks worked entirely at "block" level where a block is some sequence of lines delimited by at least one empty line. This generally worked well, but could sometimes lead to excessive repetition of check lines for various prefixes if some block was almost identical between prefixes, but not quite (for example, due to a different dispatch width in the otherwise identical summary views). This new analyis attempts to split blocks further in the case where the following conditions are met: a) There is some prefix common to every RUN line (typically 'ALL'). b) The first line of the block is common to the output with every prefix. c) The block has the same number of lines for the output with every prefix. Also, regenerated all llvm-mca test files with the following command: update_mca_test_checks.py "../test/tools/llvm-mca//.s" "../test/tools/llvm-mca///*.s" The new analysis showed a "multiple lines not disambiguated by prefixes" warning for test "AArch64/Exynos/scheduler-queue-usage.s" so I've also added some explicit prefixes to each of the RUN lines in that test. Differential Revision: https://reviews.llvm.org/D47321 llvm-svn: 333204	2018-05-24 16:36:44 +00:00
Jonas Devlieghere	27126f5260	[Support] Add color cl category. This commit adds a color category so tools can document this option and enables it for dwarfdump and dsymuttil. rdar://problem/40498996 llvm-svn: 333176	2018-05-24 11:36:57 +00:00
Alexander Shaposhnikov	c7277e6e2b	[llvm-strip] Minor fix of the usage of TableGen This is a small follow-up to the revisions r333117 and r331663. 1. Avoid the name conflicts of the generated variables for prefixes. 2. Apply clang-format -i -style=llvm to llvm-objcopy.cpp once again. 3. Add a test for the flag with double dash. Test plan: make check-all llvm-svn: 333120	2018-05-23 20:39:52 +00:00
Alexander Shaposhnikov	35bee3e06b	[llvm-strip] Expose --keep-symbol option Expose --keep-symbol option in llvm-strip. Test plan: make check-all Differential revision: https://reviews.llvm.org/D47222 llvm-svn: 333117	2018-05-23 19:44:19 +00:00
Andrea Di Biagio	3fc20c9c7f	[llvm-mca] Print the "Block RThroughput" in the SummaryView. This patch implements the "block reciprocal throughput" computation in the SummaryView. The block reciprocal throughput is computed as the MAX of: - NumMicroOps / DispatchWidth - Resource Cycles / #Units (for every resource consumed). The block throughput is bounded from above by the hardware dispatch throughput. That is because the DispatchWidth is an upper bound on how many opcodes can be part of a single dispatch group. The block throughput is also limited by the amount of hardware parallelism. The number of available resource units affects how the resource pressure is distributed, and also how many blocks can be delivered every cycle. llvm-svn: 333095	2018-05-23 15:59:27 +00:00
Alexander Shaposhnikov	6e7814c484	[llvm-objcopy] Fix the behavior of --strip-* and --keep-symbol If one runs llvm-objcopy --strip-all --keep-symbol foo and the symbol table indeed contains the symbol "foo" then it should not be removed. Test plan: make check-all Differential revision: https://reviews.llvm.org/D47052 llvm-svn: 333008	2018-05-22 18:24:07 +00:00
Paul Semel	31a212d694	Revert "[llvm-objcopy] Add --strip-unneeded option" There is a use after free I didn't see. Need to investigate. This reverts commit f7624abeb1f0d012309baf2e78cf2499fbfe5e5f. llvm-svn: 332925	2018-05-22 01:04:36 +00:00
Paul Semel	040df77ed6	[llvm-objcopy] Add --strip-unneeded option This option removes symbols that are not needed by relocations. Differential Revision: https://reviews.llvm.org/D46896 llvm-svn: 332915	2018-05-21 22:50:32 +00:00
Peter Collingbourne	c5a9765cea	LTO: Replace split dwarf implementation that uses objcopy with one that uses direct emission. Part of PR37466. Differential Revision: https://reviews.llvm.org/D47091 llvm-svn: 332884	2018-05-21 20:26:49 +00:00
Jonas Devlieghere	c111382aa8	[DebugInfo] Use absolute addresses in location lists Rather than relying on the user to do the address calculating in DW_AT_location we should just dump the absolute address. rdar://problem/38513870 Differential revision: https://reviews.llvm.org/D47152 llvm-svn: 332873	2018-05-21 19:36:54 +00:00
Andrea Di Biagio	cb1ed400a4	[llvm-mca] Removed an empty line generated by the timeline view. NFC. Also, regenerate all tests. llvm-svn: 332853	2018-05-21 17:11:56 +00:00
Andrea Di Biagio	b5757abefb	[X86][BtVer2] Add a 'J' prefix to the PRF/RCU defs. NFC This is to keep the Jaguar model's naming convention. Processor resources all have a 'J' prefix in the BtVer2 scheduling model. llvm-svn: 332851	2018-05-21 16:30:26 +00:00
Nico Weber	da5513b9c4	win: try to fix dia tests with newer msvc versions llvm-svn: 332827	2018-05-21 02:09:57 +00:00
Simon Pilgrim	1273f4ad93	[X86] Add GPR<->XMM Schedule Tags BtVer2 - fix NumMicroOp and account for the Lat+6cy GPR->XMM and Lat+1cy XMm->GPR delays (see rL332737) The high number of MOVD/MOVQ equivalent instructions meant that there were a number of missed patterns in SNB/Znver1: SNB - add missing GPR<->MMX costs (taken from Agner / Intel AOM) Znver1 - add missing GPR<->XMM MOVQ costs (taken from Agner) llvm-svn: 332745	2018-05-18 17:58:36 +00:00
Simon Pilgrim	007b50fd35	[X86][BtVer2] Improve simulation of (V)PINSR values Include the 6cy delay transferring from the GPR to FPU. llvm-svn: 332737	2018-05-18 17:09:41 +00:00
Simon Pilgrim	3ecb0b80f6	[X86][BtVer2] Partial vector stores (inc MMX) have a 2cy latency llvm-svn: 332722	2018-05-18 14:22:22 +00:00
Simon Pilgrim	c4b8d367a8	[X86][SSE] Ensure vector partial load/stores use the WriteVecLoad/WriteVecStore scheduler classes Retag some instructions that were missed when we split off vector load/store/moves - MOVQ/MOVD etc. Fixes BtVer2/SLM which have different behaviours for GPR stores. llvm-svn: 332718	2018-05-18 14:08:01 +00:00
Simon Pilgrim	d749b321b2	[X86][SSE] Ensure float load/stores use the WriteFLoad/WriteFStore scheduler classes Retag some instructions that were missed when we split off vector load/store/moves - MOVSS/MOVSD/MOVHPD/MOVHPD/MOVLPD/MOVLPS etc. Fixes BtVer2/SLM which have different behaviours for GPR stores. llvm-svn: 332714	2018-05-18 13:13:59 +00:00
Simon Pilgrim	e389ea0e3e	[llvm-mca][X86] Add CMOV test files llvm-svn: 332622	2018-05-17 16:29:12 +00:00
Simon Pilgrim	b5741f5c3d	[X86][BtVer2] ADC/SBB take 2cy on an ALU pipe, not 1cy like ADD/SUB llvm-svn: 332616	2018-05-17 15:43:23 +00:00
Andrea Di Biagio	650b5fc6cb	[llvm-mca] add flag -all-views and flag -all-stats. Flag -all-views enables all the views. Flag -all-stats enables all the views that print hardware statistics. llvm-svn: 332602	2018-05-17 12:27:03 +00:00
Simon Pilgrim	b4fd145fc3	[llvm-mca][X86] Add ADX test files llvm-svn: 332595	2018-05-17 11:32:38 +00:00
Simon Pilgrim	d5d77dcb46	[X86] Fix typo in instregex for CVTSI642SDrr llvm-svn: 332510	2018-05-16 18:31:17 +00:00
Andrea Di Biagio	45ccdd1785	[llvm-mca] Regenerate tests after r332381 and r332361. NFC llvm-svn: 332447	2018-05-16 10:12:06 +00:00
Jake Ehrlich	e40398ad98	[llvm-objcopy] Add --only-keep-debug as a noop This option just keeps being a problem and really needs to be implemented in some fashion. Implementing it properly requires some kind of "replaceSectionReference" method because all the existing links need to be maintained. The desired behavior is just for allocated sections to become NOBITS but actually implementing that is rather tricky due to the current design of llvm-objcopy. However converting allocated sections to NOBITS is just an optimization and not something debuggers need. Debuggers can debug a stripped executable and take an unstripped executable for that stripped executable as input. Additionally allocated sections account for a very small part of debug binaries so this optimization is quite small. I propose that for the time being we implement this as a NOP so that people can use llvm-objcopy where they need to, just in a sub-optimal way. This option has already blocked a lot of people and its currently blocking me. llvm-svn: 332396	2018-05-15 20:53:53 +00:00
Martin Storsjo	e241ce6f65	[llvm-rc] Add support for the optional CLASS statement for dialogs Differential Revision: https://reviews.llvm.org/D46875 llvm-svn: 332386	2018-05-15 19:21:28 +00:00
Simon Pilgrim	be9a206883	[X86] Split WriteCvtF2F into F32->F64 and F64->F32 scheduler classes BtVer2 - Fixes schedules for (V)CVTPS2PD instructions A lot of the Intel models still have too many InstRW overrides for these new classes - this needs cleaning up but I wanted to get the classes in first llvm-svn: 332376	2018-05-15 17:36:49 +00:00
Simon Pilgrim	891ebcdbaa	[X86] Split off F16C WriteCvtPH2PS/WriteCvtPS2PH scheduler classes Btver2 - VCVTPH2PSYrm needs to double pump the AGU Broadwell - missing VCVTPS2PH*mr stores extra latency Allows us to remove the WriteCvtF2FSt conversion store class llvm-svn: 332357	2018-05-15 14:12:32 +00:00
Paul Semel	5d97c823a4	[llvm-objcopy] Add --keep-symbol (-K) option This option permits to explicitly keep the specified symbol so that it doesn't get removed. Differential Revision: https://reviews.llvm.org/D46819 llvm-svn: 332356	2018-05-15 14:09:37 +00:00
Pavel Labath	80827f10a1	Reapply "DWARFVerifier: Check "completeness" of .debug_names section" This is a resubmit of r331868 (D46583), which was reverted due to failures on the PS4 bot. These have been resolved with r332246/D46748. llvm-svn: 332349	2018-05-15 13:24:10 +00:00
Simon Pilgrim	2aa395abcf	[llvm-mca][x86] Add F16C instruction tests llvm-svn: 332347	2018-05-15 12:50:06 +00:00
Martin Storsjo	11adbacac8	[llvm-rc] Add support for parsing memory flags Most of the handling is pretty straightforward; fetch the default memory flags for the specific resource type before parsing the flags and apply them on top of that, except that some flags imply others and some flags clear more than one flag. For icons and cursors, the flags set get passed on to all individual single icon/cursor resources, while only some flags affect the icon/cursor group resource. For stringtables, the behaviour is pretty simple; the first stringtable resource of a bundle sets the flags for the whole bundle. The output of these tests match rc.exe byte for byte. The actual use of these memory flags is deprecated and they have no effect since Win16, but some resource script files may still happen to have them in place. Differential Revision: https://reviews.llvm.org/D46818 llvm-svn: 332329	2018-05-15 06:35:29 +00:00
Martin Storsjo	860e5fcdf4	[llvm-rc] Read the Planes/BitCount fields from BITMAPINFOHEADER for icons Previously these fields were only read from this header for cursors, while Planes was hardcoded to 1 for icons (with a comment that it was unknown why this was needed) and BitCount was left at the value read originally in the RESDIRENTRY. This fixes the single byte that was differing for the icon/cursor test compared to rc.exe. This is based on research/testing by Nico Weber. Differential Revision: https://reviews.llvm.org/D46816 llvm-svn: 332328	2018-05-15 06:35:20 +00:00
Martin Storsjo	5556841cd3	[llvm-rc] Add missing inputs for tag-icon-cursor.test. This adds the missing input files used for this test, except for the separate input files for specific error cases; matching test input files were provided by Nico Weber. The extra copying of files into the %t directory doesn't seem to be necessary since that directory only ever is used for output here, not for inputs. Differential Revision: https://reviews.llvm.org/D46813 llvm-svn: 332297	2018-05-14 21:32:47 +00:00
Simon Pilgrim	5bd5e2fd3e	[llvm-mca][X86] Add missing SSE4A test file llvm-svn: 332270	2018-05-14 18:20:40 +00:00

... 7 8 9 10 11 ...

2871 Commits