llvm-project

Commit Graph

Author	SHA1	Message	Date
Saleem Abdulrasool	c6bf547564	llvm-objdump: add coff import library symbol listing support This adds behaviour similar to binutils' objdump which can show symbols in an import library. Differences from that stem around the fact that we do not create section symbols nor the all import import descriptor symbol reference. However, this does mean that the tool can serve as a possible replacement for the existing tool. llvm-svn: 279088	2016-08-18 16:39:19 +00:00
Sam Kolton	c05d7784a6	[AMDGPU] llvm-objdump: Skip amd_kernel_code_t only at the begining of kernel symbol. Summary: This change fix bug in AMDGPU disassembly. Previously, presence of symbols other than kernel symbols caused objdump to skip begining of those symbols. Reviewers: tstellarAMD, vpykhtin, Bigcheese, ruiu Subscribers: kzhuravl, arsenm Differential Revision: http://reviews.llvm.org/D21966 llvm-svn: 278921	2016-08-17 10:17:57 +00:00
Hemant Kulkarni	8dfc0b5541	llvm-objdump: Implement source[line numbers] interleaving Differential Revsion: https://reviews.llvm.org/D22932 llvm-svn: 278725	2016-08-15 19:49:24 +00:00
David Majnemer	42531260b3	Use the range variant of find/find_if instead of unpacking begin/end If the result of the find is only used to compare against end(), just use is_contained instead. No functionality change is intended. llvm-svn: 278469	2016-08-12 03:55:06 +00:00
Kevin Enderby	f4586039f6	The next step along the way to getting good error messages for bad archives. As mentioned in commit log for r276686 this next step is adding a new method in the ArchiveMemberHeader class to get the full name that does proper error checking, and can be use for error messages. To do this the name of ArchiveMemberHeader::getName() is changed to ArchiveMemberHeader::getRawName() to be consistent with Archive::Child::getRawName(). Then the “new” method is the addition of a new implementation of ArchiveMemberHeader::getName() which gets the full name and provides proper error checking. Which is mostly a rewrite of what was Archive::Child::getName() and cleaning up incorrect uses of llvm_unreachable() in the code which were actually just cases of errors in the input Archives. Then Archive::Child::getName() is changed to return Expected<> and use the new implementation of ArchiveMemberHeader::getName() . Also needed to change Archive::getMemoryBufferRef() with these changes to return Expected<> as well to propagate Errors up. As well as changing Archive::isThinMember() to return Expected<> . llvm-svn: 277177	2016-07-29 17:44:13 +00:00
Alexei Starovoitov	cfb51f54ba	BPF: Use official ELF e_machine value The same value for EM_BPF is being propagated to glibc, elfutils, and binutils. Signed-off-by: Richard Henderson <rth@twiddle.net> Signed-off-by: Alexei Starovoitov <ast@kernel.org> llvm-svn: 275633	2016-07-15 22:27:55 +00:00
Lang Hames	fc209623e9	[Object] Re-apply r275316 now that I have the corresponding LLD patch ready. llvm-svn: 275361	2016-07-14 02:24:01 +00:00
Lang Hames	ae610ab528	[Object] Revert r275316, Archive::child_iterator changes, while I update lld. Should fix the bots broken by r275316. llvm-svn: 275353	2016-07-14 00:37:04 +00:00
Lang Hames	c2773e97d2	[Object] Change Archive::child_iterator for better interop with Error/Expected. See http://reviews.llvm.org/D22079 Changes the Archive::child_begin and Archive::children to require a reference to an Error. If iterator increment fails (because the archive header is damaged) the iterator will be set to 'end()', and the error stored in the given Error&. The Error value should be checked by the user immediately after the loop. E.g.: Error Err; for (auto &C : A->children(Err)) { // Do something with archive child C. } // Check the error immediately after the loop. if (Err) return Err; Failure to check the Error will result in an abort() when the Error goes out of scope (as guaranteed by the Error class). llvm-svn: 275316	2016-07-13 21:13:05 +00:00
Kevin Enderby	42398051d8	Finish cleaning up most of the error handling in libObject’s MachOUniversalBinary and its clients to use the new llvm::Error model for error handling. Changed getAsArchive() from ErrorOr<...> to Expected<...> so now all interfaces there use the new llvm::Error model for return values. In the two places it had if (!Parent) this is actually a program error so changed from returning errorCodeToError(object_error::parse_failed) to calling report_fatal_error() with a message. In getObjectForArch() added error messages to its two llvm::Error return values instead of returning errorCodeToError(object_error::arch_not_found) with no error message. For the llvm-obdump, llvm-nm and llvm-size clients since the only binary files in Mach-O Universal Binaries that are supported are Mach-O files or archives with Mach-O objects, updated their logic to generate an error when a slice contains something like an ELF binary instead of ignoring it. And added a test case for that. The last error stuff to be cleaned up for libObject’s MachOUniversalBinary is the use of errorOrToExpected(Archive::create(ObjBuffer)) which needs Archive::create() to be changed from ErrorOr<...> to Expected<...> first, which I’ll work on next. llvm-svn: 274079	2016-06-28 23:16:13 +00:00
Kevin Enderby	931cb65df2	Thread Expected<...> up from libObject’s getSymbolAddress() for symbols to allow a good error message to be produced. This is nearly the last libObject interface that used ErrorOr and the last one that appears in llvm/include/llvm/Object/MachO.h . For Mach-O objects this is just a clean up because it’s version of getSymbolAddress() can’t return an error. I will leave it to the experts on COFF and ELF to actually add meaning full error messages in their tests if they wish. And also leave it to these experts to change the last two ErrorOr interfaces in llvm/include/llvm/Object/ObjectFile.h for createCOFFObjectFile() and createELFObjectFile() if they wish. Since there are no test cases for COFF and ELF error cases with respect to getSymbolAddress() in the test suite this is no functional change (NFC). llvm-svn: 273701	2016-06-24 18:24:42 +00:00
Daniel Sanders	1d14864bb3	[llvm-objdump] Support detection of feature bits from the object and implement this for Mips. Summary: The Mips implementation only covers the feature bits described by the ELF e_flags so far. Mips stores additional feature bits such as MSA in the .MIPS.abiflags section. Also fixed a small bug this revealed where microMIPS wouldn't add the EF_MIPS_MICROMIPS flag when using -filetype=obj. Reviewers: echristo, rafael Subscribers: rafael, mehdi_amini, dsanders, sdardis, llvm-commits Differential Revision: http://reviews.llvm.org/D21125 llvm-svn: 272880	2016-06-16 09:17:03 +00:00
Richard Smith	2ad6d48b0c	Search for llvm-symbolizer binary in the same directory as argv[0], before looking for it along $PATH. This allows installs of LLVM tools outside of $PATH to find the symbolizer and produce pretty backtraces if they crash. llvm-svn: 272232	2016-06-09 00:53:21 +00:00
Kevin Enderby	9acb109930	Change llvm-objdump, llvm-nm and llvm-size when reporting an object file error when the object is from a slice of a Mach-O Universal Binary use something like "foo.o (for architecture i386)" as part of the error message when expected. Also fixed places in these tools that were ignoring object file errors from MachOUniversalBinary::getAsObjectFile() when the code moved on to see if the slice was an archive. To do this MachOUniversalBinary::getAsObjectFile() and MachOUniversalBinary::getObjectForArch() were changed from returning ErrorOr<...> to Expected<...> then that was threaded up to its users. Converting these interfaces to Expected<> from ErrorOr<> does involve touching a number of places. To contain the changes for now the use of errorToErrorCode() is still used in two places yet to be fully converted. llvm-svn: 271332	2016-05-31 20:35:34 +00:00
Benjamin Kramer	82de7d323d	Apply clang-tidy's misc-move-constructor-init throughout LLVM. No functionality change intended, maybe a tiny performance improvement. llvm-svn: 270997	2016-05-27 14:27:24 +00:00
Kevin Enderby	ac9e15551d	Change llvm-objdump, llvm-nm and llvm-size when reporting an object file error when the object is in an archive to use something like libx.a(foo.o) as part of the error message. Also changed llvm-objdump and llvm-size to be like llvm-nm and ignore non-object files in archives and not produce any error message. To do this Archive::Child::getAsBinary() was changed from ErrorOr<...> to Expected<...> then that was threaded up to its users. Converting this interface to Expected<> from ErrorOr<> does involve touching a number of places. To contain the changes for now the use of errorToErrorCode() is still used in one place yet to be fully converted. Again there some were bugs in the existing code that did not deal with the old ErrorOr<> return values. So now with Expected<> since they must be checked and the error handled, I added a TODO and a comments for those. llvm-svn: 269784	2016-05-17 17:10:12 +00:00
Kevin Enderby	b34e3a1877	Clean up the specific error message for a malformed Mach-O files with bad segment load commands. The existing test case in test/Object/macho-invalid.test for macho-invalid-too-small-segment-load-command has a cmdsize of 55, while being too small also it is not a multiple of 4. So when that check is added this test case will produce a different error. So I constructed a new test case that will trigger the intended error. I also changed the error message to be consistent with the other malformed Mach-O file error messages which prints the load command index. I also removed both object_error::macho_load_segment_too_small and object_error::macho_load_segment_too_many_sections from Object/Error.h as they are not needed and can just use object_error::parse_failed and let the error message string distinguish the specific error. llvm-svn: 268652	2016-05-05 17:43:35 +00:00
Kevin Enderby	7bd8d99497	Thread Expected<...> up from libObject’s getType() for symbols to allow llvm-objdump to produce a good error message. Produce another specific error message for a malformed Mach-O file when a symbol’s section index is more than the number of sections. The existing test case in test/Object/macho-invalid.test for macho-invalid-section-index-getSectionRawName now reports the error with the message indicating that a symbol at a specific index has a bad section index and that bad section index value. Again converting interfaces to Expected<> from ErrorOr<> does involve touching a number of places. Where the existing code reported the error with a string message or an error code it was converted to do the same. Also there some were bugs in the existing code that did not deal with the old ErrorOr<> return values. So now with Expected<> since they must be checked and the error handled, I added a TODO and a comment: "// TODO: Actually report errors helpfully" and a call something like consumeError(NameOrErr.takeError()) so the buggy code will not crash since needed to deal with the Error. llvm-svn: 268298	2016-05-02 20:28:12 +00:00
Matt Arsenault	87d80dbfda	AMDGPU: Fix crash when dumping unknown opcode I'm for some reason having a problem producing a test. It should be the same as test/MC/X86/invalid_opcode.s, but llvm-mc seems to ignore random bytes. llvm-svn: 267225	2016-04-22 21:23:41 +00:00
Kevin Enderby	81e8b7d949	Thread Expected<...> up from libObject’s getName() for symbols to allow llvm-objdump to produce a good error message. Produce another specific error message for a malformed Mach-O file when a symbol’s string index is past the end of the string table. The existing test case in test/Object/macho-invalid.test for macho-invalid-symbol-name-past-eof now reports the error with the message indicating that a symbol at a specific index has a bad sting index and that bad string index value. Again converting interfaces to Expected<> from ErrorOr<> does involve touching a number of places. Where the existing code reported the error with a string message or an error code it was converted to do the same. There is some code for this that could be factored into a routine but I would like to leave that for the code owners post-commit to do as they want for handling an llvm::Error. An example of how this could be done is shown in the diff in lib/ExecutionEngine/RuntimeDyld/RuntimeDyldImpl.h which had a Check() routine already for std::error_code so I added one like it for llvm::Error . Also there some were bugs in the existing code that did not deal with the old ErrorOr<> return values. So now with Expected<> since they must be checked and the error handled, I added a TODO and a comment: “// TODO: Actually report errors helpfully” and a call something like consumeError(NameOrErr.takeError()) so the buggy code will not crash since needed to deal with the Error. Note there fixes needed to lld that goes along with this that I will commit right after this. So expect lld not to built after this commit and before the next one. llvm-svn: 266919	2016-04-20 21:24:34 +00:00
Colin LeMahieu	efe3732883	Revert r265817 lld tests need to be addressed. llvm-svn: 265822	2016-04-08 18:15:37 +00:00
Colin LeMahieu	4a1975ba8e	[llvm-objdump] Printing hex instead of dec by default Differential Revision: http://reviews.llvm.org/D18770 llvm-svn: 265817	2016-04-08 17:55:03 +00:00
Valery Pykhtin	8e79f5be0c	fix r265645: target dependent printf formatting flags. llvm-svn: 265649	2016-04-07 08:38:20 +00:00
Valery Pykhtin	de04805e9f	[AMDGPU] llvm-objdump: Minimal HSA Code Object disassembler support. Reenable reverted r265550 with endianness issue fixed. Variables of endian-aware types such as ulittle32_t should be explicitly casted to their natural equivalent types before passing it as vararg to printf like functions (format in my case). Added lit config file depending on AMDGPU target as the testcase uses assembler. Differential revision: http://reviews.llvm.org/D16998 llvm-svn: 265645	2016-04-07 07:24:01 +00:00
Kevin Enderby	3fcdf6ae2a	Thread Expected<...> up from createMachOObjectFile() to allow llvm-objdump to produce a real error message Produce the first specific error message for a malformed Mach-O file describing the problem instead of the generic message for object_error::parse_failed of "Invalid data was encountered while parsing the file”. Many more good error messages will follow after this first one. This is built on Lang Hames’ great work of adding the ’Error' class for structured error handling and threading Error through MachOObjectFile construction. And making createMachOObjectFile return Expected<...> . So to to get the error to the llvm-obdump tool, I changed the stack of these methods to also return Expected<...> : object::ObjectFile::createObjectFile() object::SymbolicFile::createSymbolicFile() object::createBinary() Then finally in ParseInputMachO() in MachODump.cpp the error can be reported and the specific error message can be printed in llvm-objdump and can be seen in the existing test case for the existing malformed binary but with the updated error message. Converting these interfaces to Expected<> from ErrorOr<> does involve touching a number of places. To contain the changes for now use of errorToErrorCode() and errorOrToExpected() are used where the callers are yet to be converted. Also there some were bugs in the existing code that did not deal with the old ErrorOr<> return values. So now with Expected<> since they must be checked and the error handled, I added a TODO and a comment: “// TODO: Actually report errors helpfully” and a call something like consumeError(ObjOrErr.takeError()) so the buggy code will not crash since needed to deal with the Error. Note there is one fix also needed to lld/COFF/InputFiles.cpp that goes along with this that I will commit right after this. So expect lld not to built after this commit and before the next one. llvm-svn: 265606	2016-04-06 22:14:09 +00:00
Valery Pykhtin	1dcb91b4de	Revert "[AMDGPU] llvm-objdump: Minimal HSA Code Object disassembler support." This reverts commit r265550. There're problems with endianness on dumping instruction bytes. Need to find out how to use support::ulittle32_t type properly. llvm-svn: 265554	2016-04-06 16:30:21 +00:00
Valery Pykhtin	bd90c60afb	[AMDGPU] llvm-objdump: Minimal HSA Code Object disassembler support. Differential revision: http://reviews.llvm.org/D16998 llvm-svn: 265550	2016-04-06 15:55:10 +00:00
Kevin Enderby	5afbc1cda7	Fix a crash in running llvm-objdump -t with an invalid Mach-O file already in the test suite. While this is not really an interesting tool and option to run on a Mach-O file to show the symbol table in a generic libObject format it shouldn’t crash. The reason for the crash was in MachOObjectFile::getSymbolType() when it was calling MachOObjectFile::getSymbolSection() without checking its return value for the error case. What makes this fix require a fair bit of diffs is that the method getSymbolType() is in the class ObjectFile defined without an ErrorOr<> so I needed to add that all the sub classes. And all of the uses needed to be updated and the return value needed to be checked for the error case. The MachOObjectFile version of getSymbolType() “can” get an error in trying to come up with the libObject’s internal SymbolRef::Type when the Mach-O symbol symbol type is an N_SECT type because the code is trying to select from the SymbolRef::ST_Data or SymbolRef::ST_Function values for the SymbolRef::Type. And it needs the Mach-O section to use isData() and isBSS to determine if it will return SymbolRef::ST_Data. One other possible fix I considered is to simply return SymbolRef::ST_Other when MachOObjectFile::getSymbolSection() returned an error. But since in the past when I did such changes that “ate an error in the libObject code” I was asked instead to push the error out of the libObject code I chose not to implement the fix this way. As currently written both the COFF and ELF versions of getSymbolType() can’t get an error. But if isReservedSectionNumber() wanted to check for the two known negative values rather than allowing all negative values or the code wanted to add the same check as in getSymbolAddress() to use getSection() and check for the error then these versions of getSymbolType() could return errors. At the end of the day the error printed now is the generic “Invalid data was encountered while parsing the file” for object_error::parse_failed. In the future when we thread Lang’s new TypedError for recoverable error handling though libObject this will improve. And where the added // Diagnostic(… comment is, it would be changed to produce and error message like “bad section index (42) for symbol at index 8” for this case. llvm-svn: 264187	2016-03-23 20:27:00 +00:00
Rafael Espindola	9219fe79b9	Revert "[llvm-objdump] Printing relocations in executable and shared object files. This partially reverts r215844 by removing test objdump-reloc-shared.test which stated GNU objdump doesn't print relocations, it does." This reverts commit r263971. It produces the wrong results for .rela.dyn. I will add a test. llvm-svn: 263987	2016-03-21 20:59:15 +00:00
Colin LeMahieu	cdaf644c48	[llvm-objdump] Printing relocations in executable and shared object files. This partially reverts r215844 by removing test objdump-reloc-shared.test which stated GNU objdump doesn't print relocations, it does. In executable and shared object ELF files, relocations in the file contain the final virtual address rather than section offset so this is adjusted to display section offset. Differential revision: http://reviews.llvm.org/D15965 llvm-svn: 263971	2016-03-21 19:14:50 +00:00
Colin LeMahieu	307a83d76a	[llvm-objdump] Print <unknown> in place of instruction text if it couldn't be disassembled. llvm-svn: 263793	2016-03-18 16:26:48 +00:00
Simon Atanasyan	34223a7e5d	[llvm-objdump] Add '0x' prefix to a target displacement number to accent its hex format It might be hard to recognize a hexadecimal number without '0x' prefix. Besides that '0x' prefix corresponds to GNU objdump behaviour. Differential Revision: http://reviews.llvm.org/D18207 llvm-svn: 263705	2016-03-17 10:43:44 +00:00
Jacques Pienaar	ea9f25a740	[lanai] Add ELF enum value and relocations. Add ELF enum value and relocations for Lanai backed. General Lanai backend discussion on llvm-dev thread "[RFC] Lanai backend" (http://lists.llvm.org/pipermail/llvm-dev/2016-February/095118.html). Differential Revision: http://reviews.llvm.org/D17008 llvm-svn: 262394	2016-03-01 21:21:42 +00:00
Benjamin Kramer	f57c1977c1	Reflect the MC/MCDisassembler split on the include/ level. No functional change, just moving code around. llvm-svn: 258818	2016-01-26 16:44:37 +00:00
Igor Laevsky	03a670c0ec	Re-submit r256008 "Improve DWARFDebugFrame::parse to also handle __eh_frame." Originally this change was causing failures on windows buildbots. But those problems were fixed in r258806. llvm-svn: 258811	2016-01-26 15:09:42 +00:00
Kevin Enderby	0ae163f9ea	For llvm-objdump, add the option -private-header (without the trailing ’s’) to only print the first private header. Which for Mach-O files only prints the Mach header and not the subsequent load commands. Which is used by scripts to match what the darwin otool(1) with the -h flag does without the -l flag. For non-Mach-O files it has the same functionality as -private-headers (with the trailing ’s’). rdar://24158331 llvm-svn: 257548	2016-01-13 00:25:36 +00:00
Dan Gohman	4635017176	[WebAssembly] Add a EM_WEBASSEMBLY value, and several bits of code that use it. A request has been made to the official registry, but an official value is not yet available. This patch uses a temporary value in order to support development. When an official value is recieved, the value of EM_WEBASSEMBLY will be updated. llvm-svn: 257517	2016-01-12 20:56:01 +00:00
Davide Italiano	ed9d95b290	[llvm-objdump] Mark noreturn function as such. Match attribute in the header to make MSVC happy. llvm-svn: 256560	2015-12-29 13:41:02 +00:00
Davide Italiano	140af648f2	[llvm-objdump] Use stderr and not stdout for fatal errors. llvm-svn: 256423	2015-12-25 18:16:45 +00:00
Davide Italiano	e85abf7269	[llvm-objdump] Move COFF function to where it belongs. Ideally much more stuff should be moved out of llvm-objdump.cpp, but that will happen later. llvm-svn: 256118	2015-12-20 09:54:34 +00:00
Davide Italiano	540e92157c	[llvm-objdump] Fail early if we can't parse the object header. llvm-svn: 256108	2015-12-19 22:09:40 +00:00
Pete Cooper	98052537f0	Revert "Improve DWARFDebugFrame::parse to also handle __eh_frame." This reverts commit r256008. Its breaking multiple buildbots, although works for me locally. llvm-svn: 256013	2015-12-18 19:45:38 +00:00
Pete Cooper	6c97f4c7d7	Improve DWARFDebugFrame::parse to also handle __eh_frame. LLVM MC has single methods which can handle the output of EH frame and DWARF CIE's and FDE's. This code improves DWARFDebugFrame::parse to do the same for parsing. This also allows llvm-objdump to support the --dwarf=frames option which objdump supports. This option dumps the .eh_frame section using the new code in DWARFDebugFrame::parse. http://reviews.llvm.org/D15535 Reviewed by Rafael Espindola. llvm-svn: 256008	2015-12-18 18:51:08 +00:00
Davide Italiano	711e495ecf	[llvm-objdump] Use report_fatal_error() for a more uniform error handling. llvm-svn: 255871	2015-12-17 01:59:50 +00:00
Davide Italiano	b13edeb9b5	[llvm-objdump/MachO] Don't cut'n'paste the same code over and over. Use the appropriate helper instead. llvm-svn: 254990	2015-12-08 02:45:59 +00:00
Davide Italiano	bb599e3a4d	[llvm-objdump] Use report_fatal_error() if we can't find a target. llvm-svn: 254654	2015-12-03 22:13:40 +00:00
David Majnemer	153722d2f1	Fix LLD testsuite fallout from r253429 llvm-svn: 253432	2015-11-18 04:35:32 +00:00
David Majnemer	fbb1c3a70b	[llvm-objdump] Use the COFF export table for additional symbols Most linked executables do not have a symbol table in COFF. However, it is pretty typical to have some export entries. Use those entries to inform the disassembler about potential function definitions and call targets. llvm-svn: 253429	2015-11-18 02:49:19 +00:00
Kevin Enderby	7a96942a6a	Reapply r250906 with many suggested updates from Rafael Espindola. The needed lld matching changes to be submitted immediately next, but this revision will cause lld failures with this alone which is expected. This removes the eating of the error in Archive::Child::getSize() when the characters in the size field in the archive header for the member is not a number. To do this we have all of the needed methods return ErrorOr to push them up until we get out of lib. Then the tools and can handle the error in whatever way is appropriate for that tool. So the solution is to plumb all the ErrorOr stuff through everything that touches archives. This include its iterators as one can create an Archive object but the first or any other Child object may fail to be created due to a bad size field in its header. Thanks to Lang Hames on the changes making child_iterator contain an ErrorOr<Child> instead of a Child and the needed changes to ErrorOr.h to add operator overloading for * and -> . We don’t want to use llvm_unreachable() as it calls abort() and is produces a “crash” and using report_fatal_error() to move the error checking will cause the program to stop, neither of which are really correct in library code. There are still some uses of these that should be cleaned up in this library code for other than the size field. The test cases use archives with text files so one can see the non-digit character, in this case a ‘%’, in the size field. These changes will require corresponding changes to the lld project. That will be committed immediately after this change. But this revision will cause lld failures with this alone which is expected. llvm-svn: 252192	2015-11-05 19:24:56 +00:00
Michael Kuperstein	a3b79dd783	[ELF] elfiamcu triple should imply e_machine == EM_IAMCU Differential Revision: http://reviews.llvm.org/D14109 llvm-svn: 252043	2015-11-04 11:21:50 +00:00
Kevin Enderby	da9dd05011	Backing out commit r250906 as it broke lld. llvm-svn: 250908	2015-10-21 17:13:20 +00:00
Kevin Enderby	e3bf4fd546	This removes the eating of the error in Archive::Child::getSize() when the characters in the size field in the archive header for the member is not a number. To do this we have all of the needed methods return ErrorOr to push them up until we get out of lib. Then the tools and can handle the error in whatever way is appropriate for that tool. So the solution is to plumb all the ErrorOr stuff through everything that touches archives. This include its iterators as one can create an Archive object but the first or any other Child object may fail to be created due to a bad size field in its header. Thanks to Lang Hames on the changes making child_iterator contain an ErrorOr<Child> instead of a Child and the needed changes to ErrorOr.h to add operator overloading for * and -> . We don’t want to use llvm_unreachable() as it calls abort() and is produces a “crash” and using report_fatal_error() to move the error checking will cause the program to stop, neither of which are really correct in library code. There are still some uses of these that should be cleaned up in this library code for other than the size field. Also corrected the code where the size gets us to the “at the end of the archive” which is OK but past the end of the archive will return object_error::parse_failed now. The test cases use archives with text files so one can see the non-digit character, in this case a ‘%’, in the size field. llvm-svn: 250906	2015-10-21 16:59:24 +00:00
Pete Cooper	e11c9de83d	Stop linking all target libraries in llvm-nm and llvm-objdump. llvm-nm only needs the target to parse module level assembly in bitcode. It doesn't need a disassembler or codegen. llvm-objdump needs to be able to disassemble a file, but doesn't need asm parsers or codegen. This reduces the sizes of these tools by a few MB each, depending on how many backends are linked in. llvm-svn: 249632	2015-10-07 22:39:17 +00:00
Davide Italiano	f070688ecf	[PATCH] D13360: [llvm-objdump] Teach -d about AArch64 mapping symbols AArch64 uses $d* and $x* to interleave between text and data. llvm-objdump didn't know about this so it ended up printing garbage. This patch is a first step towards a solution of the problem. Differential Revision: http://reviews.llvm.org/D13360 llvm-svn: 249083	2015-10-01 21:57:09 +00:00
Davide Italiano	c50ae36509	[llvm-objdump] Fix time of check to time of use bug. There's already a test that covers this situation, so we should be fine. llvm-svn: 248976	2015-10-01 01:02:37 +00:00
Benjamin Kramer	ac9257b258	[objdump] Make iterator operator* return a reference. This is closer to the expected behavior of an iterator and avoids awkward warnings from clang's -Wrange-loop-analysis below. llvm-svn: 248497	2015-09-24 14:52:52 +00:00
Daniel Sanders	50f17235dd	Revert r247692: Replace Triple with a new TargetTuple in MCTargetDesc/* and related. NFC. Eric has replied and has demanded the patch be reverted. llvm-svn: 247702	2015-09-15 16:17:27 +00:00
Daniel Sanders	153010c52d	Re-commit r247683: Replace Triple with a new TargetTuple in MCTargetDesc/* and related. NFC. Summary: This is the first patch in the series to migrate Triple's (which are ambiguous) to TargetTuple's (which aren't). For the moment, TargetTuple simply passes all requests to the Triple object it holds. Once it has replaced Triple, it will start to implement the interface in a more suitable way. This change makes some changes to the public C++ API. In particular, InitMCSubtargetInfo(), createMCRelocationInfo(), and createMCSymbolizer() now take TargetTuples instead of Triples. The other public C++ API's have been left as-is for the moment to reduce patch size. This commit also contains a trivial patch to clang to account for the C++ API change. Thanks go to Pavel Labath for fixing LLDB for me. Reviewers: rengolin Subscribers: jyknight, dschuff, arsenm, rampitec, danalbert, srhines, javed.absar, dsanders, echristo, emaste, jholewinski, tberghammer, ted, jfb, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D10969 llvm-svn: 247692	2015-09-15 14:08:28 +00:00
Daniel Sanders	c40de48041	Revert r247684 - Replace Triple with a new TargetTuple ... LLDB needs to be updated in the same commit. llvm-svn: 247686	2015-09-15 13:46:21 +00:00
Daniel Sanders	18d4b0dab7	Replace Triple with a new TargetTuple in MCTargetDesc/* and related. NFC. Summary: This is the first patch in the series to migrate Triple's (which are ambiguous) to TargetTuple's (which aren't). For the moment, TargetTuple simply passes all requests to the Triple object it holds. Once it has replaced Triple, it will start to implement the interface in a more suitable way. This change makes some changes to the public C++ API. In particular, InitMCSubtargetInfo(), createMCRelocationInfo(), and createMCSymbolizer() now take TargetTuples instead of Triples. The other public C++ API's have been left as-is for the moment to reduce patch size. This commit also contains a trivial patch to clang to account for the C++ API change. Reviewers: rengolin Subscribers: jyknight, dschuff, arsenm, rampitec, danalbert, srhines, javed.absar, dsanders, echristo, emaste, jholewinski, tberghammer, ted, jfb, llvm-commits, rengolin Differential Revision: http://reviews.llvm.org/D10969 llvm-svn: 247683	2015-09-15 13:17:40 +00:00
Rafael Espindola	a01ff22bb1	Use higher level functions in llvm-objdump. This matches the rest of llvm-objdump better and isolates it from upcoming changes to ELFFile. llvm-svn: 244500	2015-08-10 20:50:40 +00:00
Rafael Espindola	8bab889b0f	Convert getSymbolSection to return an ErrorOr. This function can actually fail since the symbol contains an index to the section and that can be invalid. llvm-svn: 244375	2015-08-07 23:27:14 +00:00
Davide Italiano	7f6c301090	[llvm-objdump] Add missing call to exit(1). Reported by: Rafael Espindola. llvm-svn: 244184	2015-08-06 00:18:52 +00:00
Davide Italiano	ccd53feee2	[llvm-objdump] Call exit(1) on error, i.e. fail early. Previously we kept going on partly corrupted input, which might result in garbage being printed, or even worse, random crashes. Rafael mentioned that this is the GNU behavior as well, but after some discussion we both agreed it's probably better to emit a reasonable error message and exit. As a side-effect of this commit, now we don't rely on global state for error codes anymore. objdump was the last tool in the toolchain which needed to be converted. Hopefully the old behavior won't sneak into the tree again. llvm-svn: 244019	2015-08-05 07:18:31 +00:00
Davide Italiano	95dd3cb091	[llvm-objdump] Range-loopify. NFC intended. llvm-svn: 243905	2015-08-03 21:46:32 +00:00
Colin LeMahieu	da1723f194	[llvm-objdump] Inverting logic to match the word "predicate". Returning true when we want it rather than when we want to discard it. llvm-svn: 243558	2015-07-29 19:21:13 +00:00
Colin LeMahieu	fcc32766bf	[llvm-objdump] Merging MachO DumpSections in to FilterSections. Simplifying some predicate logic. llvm-svn: 243556	2015-07-29 19:08:10 +00:00
Colin LeMahieu	77804bed85	[llvm-objdump] Added -j flag to filter sections that are operated on. llvm-svn: 243526	2015-07-29 15:45:39 +00:00
Colin LeMahieu	f34933e425	[llvm-objdump] Add -D and --disassemble-all flags that attempt disassembly on all sections instead of just text sections. llvm-svn: 243041	2015-07-23 20:58:49 +00:00
Daniel Jasper	0f70ee9017	Add missing 'const'. I don't think this is strictly required, but some compiler configuration is giving me an error and it seems to be recommended anyway. llvm-svn: 241892	2015-07-10 07:09:20 +00:00
David Majnemer	2603a8fa24	[llvm-objdump] Require that jump targets shown in -d are functions Don't let the disassembler pick call <.text> if a function happens to live at the start of the section by only using function symbols. llvm-svn: 241830	2015-07-09 18:11:40 +00:00
Adrian Prantl	437105a4de	llvm-objdump: Replace the -macho -raw option with a generic -raw-clang-ast option that works with all object container formats. Now that clang modules/PCH are object containers this option is useful to to construct pipes like llvm-objdump -raw-clang-ast foo.pcm \| llvm-bcanalyzer - to inspect the AST contents in a PCH container. Will be tested via clang. Belatedly addresses review feedback for r233390. llvm-svn: 241659	2015-07-08 02:04:15 +00:00
David Majnemer	81afca6bf7	[llvm-objdump] Print the call target next to the instruction GNU binutils provides this behavior. objdump -r doesn't really help when you aren't dealing with relocation object files. llvm-svn: 241631	2015-07-07 22:06:59 +00:00
Rafael Espindola	be8b0ea854	Delete UnknownAddress. It is a perfectly valid symbol value. getSymbolValue now returns a value that in convenient for most callers: * 0 for undefined * symbol size for common symbols * offset/address for symbols the rest Code that needs something more specific can check getSymbolFlags. llvm-svn: 241605	2015-07-07 17:12:59 +00:00
Rafael Espindola	704cd841a1	Simplify. NFC. llvm-svn: 241456	2015-07-06 15:47:43 +00:00
Rafael Espindola	ed067c45d4	Return ErrorOr from getSymbolAddress. It can fail trying to get the section on ELF and COFF. This makes sure the error is handled. llvm-svn: 241366	2015-07-03 18:19:00 +00:00
Rafael Espindola	5d0c2ffadf	Return ErrorOr from SymbolRef::getName. This function can really fail since the string table offset can be out of bounds. Using ErrorOr makes sure the error is checked. Hopefully a lot of the boilerplate code in tools/* can go away once we have a diagnostic manager in Object. llvm-svn: 241297	2015-07-02 20:55:21 +00:00
Rafael Espindola	7f162ec2cc	Expose getRel and getRela to reduce code duplication. llvm-svn: 241266	2015-07-02 14:21:38 +00:00
Rafael Espindola	6def304209	Return ErrorOr from getSection. This also improves the logic of what is an error: * getSection(uint_32): only return an error if the index is out of bounds. The index 0 corresponds to a perfectly valid entry. * getSection(Elf_Sym): Returns null for symbols that normally don't have sections and error for out of bound indexes. In many places this just moves the report_fatal_error up the stack, but those can then be fixed in smaller patches. llvm-svn: 241156	2015-07-01 12:56:27 +00:00
Rafael Espindola	41bb43252b	Don't return error_code from a function that doesn't fail. llvm-svn: 241042	2015-06-30 04:08:37 +00:00
Rafael Espindola	0ad71d982c	Move function to the only file that uses it. llvm-svn: 241040	2015-06-30 03:41:26 +00:00
Rafael Espindola	f69ac42ac4	Don't return error_code from a function that doesn't fail. llvm-svn: 241039	2015-06-30 03:33:18 +00:00
Rafael Espindola	96d071cd0c	Don't return error_code from function that never fails. llvm-svn: 241021	2015-06-29 23:29:12 +00:00
Rafael Espindola	44c2871c09	Convert obj->getSymbolName to sym->getName. I doesn't depend on the object anymore. llvm-svn: 240996	2015-06-29 21:24:55 +00:00
Rafael Espindola	6a1bfb2f9b	Factor out the checking of string tables. This moves the error checking for string tables to getStringTable which returns an ErrorOr<StringRef>. This improves error checking, makes it uniform across all string tables and makes it possible to check them once instead of once per name. llvm-svn: 240950	2015-06-29 14:39:25 +00:00
Rafael Espindola	719dc7c436	Remove Elf_Sym_Iter. It was a fairly broken concept for an ELF only class. An ELF file can have two symbol tables, but they have exactly the same format. There is no concept of a dynamic or a static symbol. Storing this on the iterator also makes us do more work per symbol than necessary. To fetch a name we would: * Find if we had a static or a dynamic symbol. * Look at the corresponding symbol table and find the string table section. * Look at the string table section to fetch its contents. * Compute the name as a substring of the string table. All but the last step can be done per symbol table instead of per symbol. This is a step in that direction. llvm-svn: 240939	2015-06-29 12:38:31 +00:00
Rafael Espindola	854038ed1a	Rename getObjectFile to getObject for consistency. llvm-svn: 240785	2015-06-26 14:51:16 +00:00
Rafael Espindola	2fa80cc5fd	Simplify getSymbolType. This is still a really odd function. Most calls are in object format specific contexts and should probably be replaced with a more direct query, but at least now this is not too obnoxious to use. llvm-svn: 240777	2015-06-26 12:18:49 +00:00
Rafael Espindola	dbb6bd3345	Add an ELFSymbolRef type. This allows user code to say Sym.getSize() instead of having to manually fetch the object. llvm-svn: 240708	2015-06-25 22:10:04 +00:00
Rafael Espindola	d7a32ea4b8	Change how symbol sizes are handled in lib/Object. COFF and MachO only define symbol sizes for common symbols. Reflect that in the class hierarchy by having a method for common symbols only in the base and a general one in ELF. This avoids the need of using a magic value for the size, which had a few problems * Most callers didn't check for it. * The ones that did could not tell the magic value from a file actually having that value. llvm-svn: 240529	2015-06-24 10:20:30 +00:00
Sanjoy Das	3f1bc3b2bb	Revert "[FaultMaps] Move FaultMapParser to Object/" This reverts commit r240364 (git c49542e5bb186). The issue r240364 was trying to fix was fixed independently in r240362. llvm-svn: 240448	2015-06-23 20:09:03 +00:00
Rafael Espindola	ae3ac08326	Don't pass a 32 bit value to "%08" PRIx64. Should fix the arm bots. llvm-svn: 240439	2015-06-23 18:34:25 +00:00
Rafael Espindola	5f7ade26d0	objdump: Don't print a (always 0) size for MachO symbols. Only common symbol on MachO and COFF have a size. For COFF we already had a custom format. For MachO, there is no native objdump and we were printing it as ELF. Now we only print the sizes for symbols that actually have them. llvm-svn: 240422	2015-06-23 15:45:38 +00:00
Sanjoy Das	9d95716c15	[FaultMaps] Move FaultMapParser to Object/ Summary: That way llvm-objdump can rely on it without adding an extra dependency on CodeGen. This change duplicates the FaultKind enum and the code that serializes it to a string. I could not figure out a way to get around this without adding a new dependency to Object Reviewers: rafael, ab Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10619 llvm-svn: 240364	2015-06-23 01:05:26 +00:00
Sanjoy Das	6f567a4b79	[FaultMaps] Add a parser for the __llvm__faultmaps section. Summary: The parser is exercised by llvm-objdump using -print-fault-maps. As is probably obvious, the code itself was "heavily inspired" by http://reviews.llvm.org/D10434. Reviewers: reames, atrick, JosephTremoulet Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D10491 llvm-svn: 240304	2015-06-22 18:03:02 +00:00
Rui Ueyama	7d09919534	Remove object_error::success and use std::error_code() instead make_error_code(object_error) is slow because object::object_category() uses a ManagedStatic variable. But the real problem is that the function is called too frequently. This patch uses std::error_code() instead of object_error::success. In most cases, we return "success", so this patch reduces number of function calls to that function. http://reviews.llvm.org/D10333 llvm-svn: 239409	2015-06-09 15:20:42 +00:00
Colin LeMahieu	14ec76eb63	[objdump] Moving PrintImmHex out of MachODump and in to llvm-objdump and setting instprinter appropriately. llvm-svn: 239265	2015-06-07 21:07:17 +00:00
Alexey Samsonov	50d0fbd2b9	llvm-objdump: return non-zero exit code for certain cases of invalid input * If the input file is missing; * If the type of input object file can't be recognized; * If the object file can't be parsed correctly. llvm-svn: 239065	2015-06-04 18:34:11 +00:00
Rafael Espindola	7884c95c7e	Disassemble the start of sections even if there is no symbol there. We already handled a section with no symbols, extend that to also handle a section with symbols that don't include the section start. llvm-svn: 239039	2015-06-04 15:01:05 +00:00
Rafael Espindola	75d5b5495f	Fix the interpretation of a 0 st_name. The ELF spec is very clear: ----------------------------------------------------------------------------- If the value is non-zero, it represents a string table index that gives the symbol name. Otherwise, the symbol table entry has no name. -------------------------------------------------------------------------- In particular, a st_name of 0 most certainly doesn't mean that the symbol has the same name as the section. llvm-svn: 238899	2015-06-03 05:14:22 +00:00
Rafael Espindola	37070a5a3a	Move to llvm-objdump a large amount of code to that is only used there. llvm-svn: 238898	2015-06-03 04:48:06 +00:00
Rafael Espindola	5eb02e45e3	Simplify another function that doesn't fail. llvm-svn: 238703	2015-06-01 00:27:26 +00:00
Rafael Espindola	a4d22472f3	Simplify interface of function that doesn't fail. llvm-svn: 238700	2015-05-31 23:52:50 +00:00
Colin LeMahieu	35436a2634	[Objdump] Removing unused parameter. llvm-svn: 238557	2015-05-29 14:48:25 +00:00
Colin LeMahieu	68d967d92e	[Hexagon] Disassembling, printing, and emitting instructions a whole-bundle at a time which is the semantic unit for Hexagon. Fixing tests to use the new format. Disabling tests in the direct object emission path for a followup patch. llvm-svn: 238556	2015-05-29 14:44:13 +00:00
Aaron Ballman	1196ca2113	Removing a switch statement that only contains a default; NFC. llvm-svn: 238552	2015-05-29 13:00:07 +00:00
Colin LeMahieu	0b5890d411	[llvm] Adding vdtor to fix warning. llvm-svn: 238494	2015-05-28 20:59:08 +00:00
Colin LeMahieu	fb76b007d3	[Objdump] Allow instruction pretty printing to be specialized by the target triple. Differential Revision: http://reviews.llvm.org/D8427 llvm-svn: 238457	2015-05-28 19:07:14 +00:00
Colin LeMahieu	2048ea4056	[llvm] Parameterizing the output stream for dumpbytes and outputting directly to stream. llvm-svn: 238453	2015-05-28 18:39:50 +00:00
Davide Italiano	cd2514dca6	[Object] Teach Object and llvm-objdump about ".hidden" Differential Revision: http://reviews.llvm.org/D9416 Reviewed by: rafael llvm-svn: 236279	2015-04-30 23:08:53 +00:00
Kevin Enderby	0fc1182eed	Add the option -objc-meta-data to llvm-objdump used with -macho to print the Objective-C runtime meta data for Mach-O files. There are three types of Objective-C runtime meta data, Objc2 64-bit, Objc2 32-bit and Objc1 32-bit. This prints the first of these types. The changes to print the others will follow next. llvm-svn: 233840	2015-04-01 20:57:01 +00:00
Eric Christopher	f8019408dc	Replace the MCSubtargetInfo parameter with a Triple when creating an MCInstPrinter. Update all callers and use where we wanted a Triple previously. llvm-svn: 233648	2015-03-31 00:10:04 +00:00
Akira Hatanaka	b46d0234a6	[MCInstPrinter] Enable MCInstPrinter to change its behavior based on the per-function subtarget. Currently, code-gen passes the default or generic subtarget to the constructors of MCInstPrinter subclasses (see LLVMTargetMachine::addPassesToEmitFile), which enables some targets (AArch64, ARM, and X86) to change their instprinter's behavior based on the subtarget feature bits. Since the backend can now use different subtargets for each function, instprinter has to be changed to use the per-function subtarget rather than the default subtarget. This patch takes the first step towards enabling instprinter to change its behavior based on the per-function subtarget. It adds a bit "PassSubtarget" to AsmWriter which tells table-gen to pass a reference to MCSubtargetInfo to the various print methods table-gen auto-generates. I will follow up with changes to instprinters of AArch64, ARM, and X86. llvm-svn: 233411	2015-03-27 20:36:02 +00:00
Colin LeMahieu	fc32b1b874	[Objdump] DumpBytes of uint8_t from ArrayRef<uint8_t> instead of char from StringRef. Removing reinterpret_casts. llvm-svn: 232659	2015-03-18 19:27:31 +00:00
Colin LeMahieu	916c3b423a	[Objdump] Removing size limit on DumpBytes and changing to range based for loop. llvm-svn: 232654	2015-03-18 18:41:23 +00:00
Kevin Enderby	bc847fa4ed	Add the options, -dylibs-used and -dylib-id to llvm-objdump used with -macho to print the Mach-O dynamic shared libraries used by a linked image or the library id of a shared library. llvm-svn: 232406	2015-03-16 20:08:09 +00:00
Kevin Enderby	cd66be5dda	Add the option, -info-plist to llvm-objdump used with -macho to print the Mach-O info plist section as strings. llvm-svn: 231974	2015-03-11 22:06:32 +00:00
Kevin Enderby	f6d258537d	Add the -section option to llvm-objdump used with -macho that takes the argument segname,sectname to specify a Mach-O section to print. The printing is based on the section type or section attributes. The printing of the module initialization and termination section types is printed with this change. Printing of other section types will be added next. llvm-svn: 227649	2015-01-31 00:37:11 +00:00
Kevin Enderby	9a50944ca0	dd the option, -link-opt-hints to llvm-objdump used with -macho to print the Mach-O AArch64 linker optimization hints for ADRP code optimization. llvm-svn: 227246	2015-01-27 21:28:24 +00:00
Colin LeMahieu	bc2f47a76e	[Objdump] Output information about common symbols in a way closer to GNU objdump. llvm-svn: 226932	2015-01-23 20:06:24 +00:00
Kevin Enderby	69fe98da14	Add the option, -data-in-code, to llvm-objdump used with -macho to print the Mach-O data in code table. llvm-svn: 226921	2015-01-23 18:52:17 +00:00
Kevin Enderby	a7bdc7e671	Add the option, -indirect-symbols, used with -macho to print the Mach-O indirect symbol table to llvm-objdump. llvm-svn: 226848	2015-01-22 18:55:27 +00:00
Kevin Enderby	98da6136d0	For llvm-objdump, hook up existing options to work when using -macho (the Mach-O parser). llvm-svn: 226612	2015-01-20 21:47:46 +00:00
Kevin Enderby	13023a1af6	Add the option, -archive-headers, used with -macho to print the Mach-O archive headers to llvm-objdump. llvm-svn: 226228	2015-01-15 23:19:11 +00:00
Kevin Enderby	131d1770f6	Add the option, -universal-headers, used with -macho to print the Mach-O universal headers to llvm-objdump. llvm-svn: 225537	2015-01-09 19:22:37 +00:00
Kevin Enderby	e2297ddd11	Slightly refactor things for llvm-objdump and the -macho option so it can be used with options other than just -disassemble so that universal files can be used with other options combined with -arch options. No functional change to existing options and use. One test case added for the additional functionality with a universal file an a -arch option. llvm-svn: 225383	2015-01-07 21:02:18 +00:00
Rafael Espindola	839353bca0	Remove unused includes and out of date comment. NFC. llvm-svn: 224413	2014-12-17 03:07:20 +00:00
Kevin Enderby	ef3ad2ff32	Re-add support to llvm-objdump for Mach-O universal files and archives with -macho with fixes. Includes the move of tests for llvm-objdump for universal files to an X86 directory. And the fix where it was failing on linux Rafael tracked down with asan. I had both Jim Grosbach and Adam Hemet look over the second fix since I could not set up asan to reproduce with the old version but not with the fix. llvm-svn: 223416	2014-12-04 23:56:27 +00:00
Rafael Espindola	de882cd1c7	This reverts commit r223306 and r223277. The code is using uninitialized memory and failing on linux. llvm-svn: 223315	2014-12-03 23:29:34 +00:00
Kevin Enderby	3f0ffab2b0	Add support to llvm-objdump for Mach-O universal files and archives with -macho. llvm-svn: 223277	2014-12-03 22:29:40 +00:00
Rui Ueyama	98fe58a3a7	Object/COFF: Fix off-by-one error for object having lots of relocations llvm-objdump printed out an error message for this off-by-one error, but because it always exits with 0 whether or not it found an error, the test (llvm-objdump/coff-many-relocs.test) succeeded. I made llvm-objdump exit with EXIT_FAILURE when an error is found. llvm-svn: 222852	2014-11-26 22:17:25 +00:00
David Majnemer	236b0ca790	Object, COFF: Tighten the object file parser We were a little lax in a few areas: - We pretended that import libraries were like any old COFF file, they are not. In fact, they aren't really COFF files at all, we should probably grow some specialized functionality to handle them smarter. - Our symbol iterators were more than happy to attempt to go past the end of the symbol table if you had a symbol with a bad list of auxiliary symbols. llvm-svn: 222124	2014-11-17 11:17:17 +00:00
Aaron Ballman	106fd7bed5	Fixing more -Wcast-qual warnings; NFC. llvm-svn: 221782	2014-11-12 14:01:17 +00:00
Rafael Espindola	7fc5b87480	Pass an ArrayRef to MCDisassembler::getInstruction. With this patch MCDisassembler::getInstruction takes an ArrayRef<uint8_t> instead of a MemoryObject. Even on X86 there is a maximum size an instruction can have. Given that, it seems way simpler and more efficient to just pass an ArrayRef to the disassembler instead of a MemoryObject and have it do a virtual call every time it wants some extra bytes. llvm-svn: 221751	2014-11-12 02:04:27 +00:00
David Majnemer	185b5b1d24	llvm-objdump: Skip empty sections when dumping contents Empty sections are just noise when using objdump. This is similar to what binutils does. llvm-svn: 221680	2014-11-11 09:58:25 +00:00
Rafael Espindola	802912743e	Remove bogus std::error_code returns form SectionRef. There are two methods in SectionRef that can fail: * getName: The index into the string table can be invalid. * getContents: The section might point to invalid contents. Every other method will always succeed and returning and std::error_code just complicates the code. For example, a section can have an invalid alignment, but if we are able to get to the section structure at all and create a SectionRef, we will always be able to read that invalid alignment. llvm-svn: 219314	2014-10-08 15:28:58 +00:00
Kevin Enderby	bf246f5a9d	Flush out enough of llvm-objdump’s SymbolizerSymbolLookUp() for Mach-O files to get the literal string “Hello world” printed as a comment on the instruction that loads the pointer to it. For now this is just for x86_64. So for object files with relocation entries it produces things like: leaq L_.str(%rip), %rax ## literal pool for: "Hello world\n" and similar for fully linked images like executables: leaq 0x4f(%rip), %rax ## literal pool for: "Hello world\n" Also to allow testing against darwin’s otool(1), I hooked up the existing -no-show-raw-insn option to the Mach-O parser code, added the new Mach-O only -full-leading-addr option to match otool(1)'s printing of addresses and also added the new -print-imm-hex option. llvm-svn: 218423	2014-09-24 23:08:22 +00:00
Nick Kledzik	56ebef45ef	[llvm-objdump] for mach-o add -bind, -lazy-bind, and -weak-bind options This finishes the ability of llvm-objdump to print out all information from the LC_DYLD_INFO load command. The -bind option prints out symbolic references that dyld must resolve immediately. The -lazy-bind option prints out symbolc reference that are lazily resolved on first use. The -weak-bind option prints out information about symbols which dyld must try to coalesce across images. llvm-svn: 217853	2014-09-16 01:41:51 +00:00
David Majnemer	4d57159c09	MC: Add support for BigObj Teach WinCOFFObjectWriter how to write -mbig-obj style object files; these object files allow for more sections inside an object file. Our support for BigObj is notably different from binutils and cl: we implicitly upgrade object files to BigObj instead of asking the user to compile the same file again but with another flag. This matches up with how LLVM treats ELF variants. This was tested by forcing LLVM to always emit BigObj files and running the entire test suite. A specific test has also been added. I've lowered the maximum number of sections in a normal COFF file, VS "14" CTP 3 supports no more than 65279 sections. This is important otherwise we might not switch to BigObj quickly enough, leaving us with a COFF file that we couldn't link. yaml2obj support is all that remains to implement. Differential Revision: http://reviews.llvm.org/D5349 llvm-svn: 217812	2014-09-15 19:42:42 +00:00
Nick Kledzik	ac43144e5a	[llvm-objdump] support -rebase option for mach-o to dump rebasing info Similar to my previous -exports-trie option, the -rebase option dumps info from the LC_DYLD_INFO load command. The rebasing info is a list of the the locations that dyld needs to adjust if a mach-o image is not loaded at its preferred address. Since ASLR is now the default, images almost never load at their preferred address, and thus need to be rebased by dyld. llvm-svn: 217709	2014-09-12 21:34:15 +00:00
David Majnemer	44f51e5113	Object: Add support for bigobj This adds support for reading the "bigobj" variant of COFF produced by cl's /bigobj and mingw's -mbig-obj. The most significant difference that bigobj brings is more than 2**16 sections to COFF. bigobj brings a few interesting differences with it: - It doesn't have a Characteristics field in the file header. - It doesn't have a SizeOfOptionalHeader field in the file header (it's only used in executable files). - Auxiliary symbol records have the same width as a symbol table entry. Since symbol table entries are bigger, so are auxiliary symbol records. Write support will come soon. Differential Revision: http://reviews.llvm.org/D5259 llvm-svn: 217496	2014-09-10 12:51:52 +00:00
Sean Silva	888320e9fa	Nuke MCAnalysis. The code is buggy and barely tested. It is also mostly boilerplate. (This includes MCObjectDisassembler, which is the interface to that functionality) Following an IRC discussion with Jim Grosbach, it seems sensible to just nuke the whole lot of functionality, and dig it up from VCS if necessary (I hope not!). All of this stuff appears to have been added in a huge patch dump (look at the timeframe surrounding e.g. r182628) where almost every patch seemed to be untested and not reviewed before being committed. Post-review responses to the patches were never addressed. I don't think any of it would have passed pre-commit review. I doubt anyone is depending on this, since this code appears to be extremely buggy. In limited testing that Michael Spencer and I did, we couldn't find a single real-world object file that wouldn't crash the CFG reconstruction stuff. The symbolizer stuff has O(n^2) behavior and so is not much use to anyone anyway. It seemed simpler to remove them as a whole. Most of this code is boilerplate, which is the only way it was able to scrape by 60% coverage. HEADSUP: Modules folks, some files I nuked were referenced from include/llvm/module.modulemap; I just deleted the references. Hopefully that is the right fix (one was a FIXME though!). llvm-svn: 216983	2014-09-02 22:32:20 +00:00
Nick Kledzik	d04bc35852	Object/llvm-objdump: allow dumping of mach-o exports trie MachOObjectFile in lib/Object currently has no support for parsing the rebase, binding, and export information from the LC_DYLD_INFO load command in final linked mach-o images. This patch adds support for parsing the exports trie data structure. It also adds an option to llvm-objdump to dump that export info. I did the exports parsing first because it is the hardest. The information is encoded in a trie structure, but the standard ObjectFile way to inspect content is through iterators. So I needed to make an iterator that would do a non-recursive walk through the trie and maintain the concatenation of edges needed for the current string prefix. I plan to add similar support in MachOObjectFile and llvm-objdump to parse/display the rebasing and binding info too. llvm-svn: 216808	2014-08-30 00:20:14 +00:00
Rafael Espindola	3fd1e9933f	Modernize raw_fd_ostream's constructor a bit. Take a StringRef instead of a "const char *". Take a "std::error_code &" instead of a "std::string &" for error. A create static method would be even better, but this patch is already a bit too big. llvm-svn: 216393	2014-08-25 18:16:47 +00:00
Kevin Enderby	b76d386d7c	Add the start of the support for llvm-objdump’s -private-headers for Mach-O files. This adds the printing of the mach header. Load command printing will be next. llvm-svn: 216285	2014-08-22 20:35:18 +00:00
Rafael Espindola	48af1c2a1a	Don't own the buffer in object::Binary. Owning the buffer is somewhat inflexible. Some Binaries have sub Binaries (like Archive) and we had to create dummy buffers just to handle that. It is also a bad fit for IRObjectFile where the Module wants to own the buffer too. Keeping this ownership would make supporting IR inside native objects particularly painful. This patch focuses in lib/Object. If something elsewhere used to own an Binary, now it also owns a MemoryBuffer. This patch introduces a few new types. * MemoryBufferRef. This is just a pair of StringRefs for the data and name. This is to MemoryBuffer as StringRef is to std::string. * OwningBinary. A combination of Binary and a MemoryBuffer. This is needed for convenience functions that take a filename and return both the buffer and the Binary using that buffer. The C api now uses OwningBinary to avoid any change in semantics. I will start a new thread to see if we want to change it and how. llvm-svn: 216002	2014-08-19 18:44:46 +00:00
Rafael Espindola	c66d761b97	llvm-objdump: don't print relocations in non-relocatable files. This matches the behavior of GNU objdump. llvm-svn: 215844	2014-08-17 19:09:37 +00:00
Rafael Espindola	e45c740370	Fix an off-by-one bug in the target independent llvm-objdump. It would prevent the display of a single byte instruction before a label. Patch by Steve King! llvm-svn: 215837	2014-08-17 16:31:39 +00:00
Kevin Enderby	c959562092	Add the -mcpu= option to llvm-objdump for use with the disassemblers. Also make the disassembler created with the Mach-O parser (the -m option) pick up the Target specific attributes specified with -mattr option. llvm-svn: 215032	2014-08-06 23:24:41 +00:00
Rafael Espindola	3f6481d0d3	Remove some calls to std::move. Instead of moving out the data in a ErrorOr<std::unique_ptr<Foo>>, get a reference to it. Thanks to David Blaikie for the suggestion. llvm-svn: 214516	2014-08-01 14:31:55 +00:00
Tim Northover	4bd286ab53	llvm-objdump: implement printing for MachO __compact_unwind info. llvm-svn: 214509	2014-08-01 13:07:19 +00:00
Rafael Espindola	3f0549f66b	Move MCObjectSymbolizer.h to MC/MCAnalysis. The cpp file is already in lib/MC/MCAnalysis. llvm-svn: 214424	2014-07-31 19:29:23 +00:00
Rafael Espindola	437b0d5887	Use std::unique_ptr to make the ownership explicit. llvm-svn: 214377	2014-07-31 03:12:45 +00:00
David Majnemer	8f6b04cb57	llvm-objdump: Handle BSS sections larger than the object file The size of the uninitialized sections, like BSS, can exceed the size of the object file. Do not attempt to grab the contents of such sections. llvm-svn: 212953	2014-07-14 16:20:14 +00:00
Rafael Espindola	cbc5ac7a7e	Move CFG building code to a new lib/MC/MCAnalysis library. The new library is 150KB on a Release+Asserts build, so it is quiet a bit of code that regular users of MC don't need to link with now. llvm-svn: 212209	2014-07-02 19:49:34 +00:00
Alp Toker	e69170a110	Revert "Introduce a string_ostream string builder facilty" Temporarily back out commits r211749, r211752 and r211754. llvm-svn: 211814	2014-06-26 22:52:05 +00:00
Alp Toker	614717388c	Introduce a string_ostream string builder facilty string_ostream is a safe and efficient string builder that combines opaque stack storage with a built-in ostream interface. small_string_ostream<bytes> additionally permits an explicit stack storage size other than the default 128 bytes to be provided. Beyond that, storage is transferred to the heap. This convenient class can be used in most places an std::string+raw_string_ostream pair or SmallString<>+raw_svector_ostream pair would previously have been used, in order to guarantee consistent access without byte truncation. The patch also converts much of LLVM to use the new facility. These changes include several probable bug fixes for truncated output, a programming error that's no longer possible with the new interface. llvm-svn: 211749	2014-06-26 00:00:48 +00:00
Rafael Espindola	ae460027a4	Convert the Archive API to use ErrorOr. Now that we have c++11, even things like ErrorOr<std::unique_ptr<...>> are easy to use. No intended functionality change. llvm-svn: 211033	2014-06-16 16:08:36 +00:00
Rafael Espindola	4453e42945	Remove 'using std::error_code' from tools. llvm-svn: 210876	2014-06-13 03:07:50 +00:00
Rafael Espindola	3acea39853	Don't use 'using std::error_code' in include/llvm. This should make sure that most new uses use the std prefix. llvm-svn: 210835	2014-06-12 21:46:39 +00:00
Rafael Espindola	a6e9c3e43a	Remove system_error.h. This is a minimal change to remove the header. I will remove the occurrences of "using std::error_code" in a followup patch. llvm-svn: 210803	2014-06-12 17:38:55 +00:00
Craig Topper	e6cb63e471	[C++] Use 'nullptr'. Tools edition. llvm-svn: 207176	2014-04-25 04:24:47 +00:00
Saleem Abdulrasool	98938f1e0a	objdump: identify WoA WinCOFF/ARM correctly Since LLVM currently only supports WinCOFF, assume that the input is WinCOFF rather than another type of COFF file (ECOFF/XCOFF). If the architecture is detected as thumb (e.g. the file has a IMAGE_FILE_MACHINE_ARMNT magic) then use a triple of thumbv7-windows. This allows for objdump to properly handle WoA object files without having to specify the target triple manually. llvm-svn: 206446	2014-04-17 06:17:23 +00:00
Lang Hames	a1bc0f5662	[MC] Require an MCContext when constructing an MCDisassembler. This patch re-introduces the MCContext member that was removed from MCDisassembler in r206063, and requires that an MCContext be passed in at MCDisassembler construction time. (Previously the MCContext member had been initialized in an ad-hoc fashion after construction). The MCCContext member can be used by MCDisassembler sub-classes to construct constant or target-specific MCExprs. This patch updates disassemblers for in-tree targets, and provides the MCRegisterInfo instance that some disassemblers were using through the MCContext (previously those backends were constructing their own MCRegisterInfo instances). llvm-svn: 206241	2014-04-15 04:40:56 +00:00
Saleem Abdulrasool	13a3f6914b	tools: fix heap-buffer-overrun detected via ASAN Once the auxiliary fields relating to the filename have been inspected, any following auxiliary fields need not be visited as they have been consumed (the following fields comprise the filepath as a single unit). Adjust the test to catch this even if ASAN is not enabled. llvm-svn: 206190	2014-04-14 16:38:25 +00:00
Saleem Abdulrasool	7050eedb61	tools: simplify symbol handling in objdump Rather than switching behaviour on whether a previous symbol has an auxiliary symbol record for the next count of elements, simply iterate over the auxiliary symbols right after processing the current symbol entry. This makes the behaviour much simpler to follow and similar to llvm-readobj and yaml2obj. llvm-svn: 206146	2014-04-14 02:37:28 +00:00
Saleem Abdulrasool	d38c6b1e4b	tools: address possible non-null terminated filenames If a filename is a multiple of 18 characters, there will be no null-terminator. This will result in an invalid access by the constructed StringRef. Add a test case to exercise this and fix that handling. Address this same vulnerability in llvm-readobj as well. llvm-svn: 206145	2014-04-14 02:37:23 +00:00
Saleem Abdulrasool	63a0dd6540	tools: avoid a string duplication The auxiliary file records are contiguous and only contain the filename. Construct a StringRef directly rather than copying to a temporary buffer. Suggested by majnemer on IRC! llvm-svn: 206139	2014-04-13 22:54:11 +00:00
Saleem Abdulrasool	9ede5c7dd0	tools: teach objdump about FILE aux records Add support for file auxiliary symbol entries in COFF symbol tables. A COFF symbol table with a FILE entry is followed by sizeof(__FILE__) / 18 auxiliary symbol records which contain the filename. Read them and form the original filename that the record contains. Then display the name in the output. llvm-svn: 206126	2014-04-13 03:11:08 +00:00
Lang Hames	eb37092342	Update MCSymbolizer and its subclasses' constructors to reflect the fact that they take ownership of the RelocationInfo they're constructed with. llvm-svn: 204891	2014-03-27 02:39:01 +00:00
Greg Fitzgerald	1843227551	llvm-objdump output hex to match binutils' objdump Patch by Ted Woodward llvm-svn: 204409	2014-03-20 22:55:15 +00:00
David Majnemer	ddf28f2b79	Object: Provide a richer means of describing auxiliary symbols The current state of affairs has auxiliary symbols described as a big bag of bytes. This is less than satisfying, it detracts from the YAML file as being human readable. Instead, allow for symbols to optionally contain their auxiliary data. This allows us to have a much higher level way of describing things like weak symbols, function definitions and section definitions. This depends on D3105. Differential Revision: http://llvm-reviews.chandlerc.com/D3092 llvm-svn: 204214	2014-03-19 04:47:47 +00:00
Rui Ueyama	4e39f717ff	Use early returns to reduce nesting. llvm-svn: 204171	2014-03-18 18:58:51 +00:00
Alexey Samsonov	464d2e448b	[C++11] Introduce ObjectFile::symbols() to use range-based loops. Reviewers: rafael Reviewed By: rafael CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D3081 llvm-svn: 204031	2014-03-17 07:28:19 +00:00
Alexey Samsonov	aa4d29571c	[C++11] Introduce SectionRef::relocations() to use range-based loops Reviewers: rafael Reviewed By: rafael CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D3077 llvm-svn: 203927	2014-03-14 14:22:49 +00:00
Alexey Samsonov	48803e5ca9	[C++11] Use ObjectFile::sections() in commandline llvm tools llvm-svn: 203802	2014-03-13 14:37:36 +00:00
Ahmed Charles	df17c83fa8	Change MCDisassembler::setSymbolizer to take unique_ptr by value. This changes the interface to be more explicit that ownership is being transferred. llvm-svn: 203223	2014-03-07 09:38:02 +00:00
Saleem Abdulrasool	35476334e9	Support: split object format out of environment This is a preliminary setup change to support a renaming of Windows target triples. Split the object file format information out of the environment into a separate entity. Unfortunately, file format was previously treated as an environment with an unknown OS. This is most obvious in the ARM subtarget where the handling for macho on an arbitrary platform switches to AAPCS rather than APCS (as per Apple's needs). llvm-svn: 203160	2014-03-06 20:47:11 +00:00
Ahmed Charles	56440fd820	Replace OwningPtr<T> with std::unique_ptr<T>. This compiles with no changes to clang/lld/lldb with MSVC and includes overloads to various functions which are used by those projects and llvm which have OwningPtr's as parameters. This should allow out of tree projects some time to move. There are also no changes to libs/Target, which should help out of tree targets have time to move, if necessary. llvm-svn: 203083	2014-03-06 05:51:42 +00:00
Simon Atanasyan	2b614e1163	llvm-objdump: Do not attempt to disassemble symbols outside of section boundaries. It is possible to create an ELF executable where symbol from say .text section 'points' to the address outside the section boundaries. It does not have a sense to disassemble something outside the section. Without this fix llvm-objdump prints finite or infinite (depends on the executable file architecture) number of 'invalid instruction encoding' warnings. llvm-svn: 202083	2014-02-24 22:12:11 +00:00
Rafael Espindola	90c7f1cc16	Replace the F_Binary flag with a F_Text one. After this I will set the default back to F_None. The advantage is that before this patch forgetting to set F_Binary would corrupt a file on windows. Forgetting to set F_Text produces one that cannot be read in notepad, which is a better failure mode :-) llvm-svn: 202052	2014-02-24 18:20:12 +00:00
Rafael Espindola	7dbcdd08c2	Don't make F_None the default. This will make it easier to switch the default to being binary files. llvm-svn: 202042	2014-02-24 15:07:20 +00:00
Rafael Espindola	b5155a572f	Change the begin and end methods in ObjectFile to match the style guide. llvm-svn: 201108	2014-02-10 20:24:04 +00:00
Rafael Espindola	20122a436c	Simplify getSymbolFlags. None of the object formats require extra parsing to compute these flags, so the method cannot fail. llvm-svn: 200574	2014-01-31 20:57:12 +00:00
Rafael Espindola	5e812afaeb	Simplify the handling of iterators in ObjectFile. None of the object file formats reported error on iterator increment. In retrospect, that is not too surprising: no object format stores symbols or sections in a linked list or other structure that requires chasing pointers. As a consequence, all error checking can be done on begin() and end(). This reduces the text segment of bin/llvm-readobj in my machine from 521233 to 518526 bytes. llvm-svn: 200442	2014-01-30 02:49:50 +00:00
Mark Seaborn	0929d3d855	Fix "llvm-objdump -d -r" to show relocations inline for ELF files This fixes a regression introduced by r182908, which broke llvm-objdump's ability to display relocations inline in a disassembly dump for ELF object files. That change removed a SectionRelocMap from Object/ELF.h, which we recreate in llvm-objdump.cpp. I discovered this regression via an out-of-tree test (test/NaCl/X86/pnacl-hides-sandbox-x86-64.ll) which used llvm-objdump. Note that the "Unknown" string in the test output on i386 isn't quite right, but this appears to be a pre-existing bug. Differential Revision: http://llvm-reviews.chandlerc.com/D2559 llvm-svn: 200090	2014-01-25 17:38:19 +00:00
Mark Seaborn	eb03ac50ed	llvm-objdump: Some style cleanups to follow LLVM coding style Rename "ec" to "EC", and rename some iterators. Then fix whitespace using clang-format-diff. (As requested in http://llvm-reviews.chandlerc.com/D2559) Differential Revision: http://llvm-reviews.chandlerc.com/D2594 llvm-svn: 200053	2014-01-25 00:32:01 +00:00
Rafael Espindola	23a9750c47	Rename these methods to match the style guide. llvm-svn: 199751	2014-01-21 16:09:45 +00:00
Rafael Espindola	63da295045	Return an ErrorOr<Binary *> from createBinary. I did write a version returning ErrorOr<OwningPtr<Binary> >, but it is too cumbersome to use without std::move. I will keep the patch locally and submit when we switch to c++11. llvm-svn: 199326	2014-01-15 19:37:43 +00:00
Rui Ueyama	c2bed42904	Re-submit r191472 with a fix for big endian. llvm-objdump: Dump COFF import table if -private-headers option is given. llvm-svn: 191557	2013-09-27 21:04:00 +00:00
Rui Ueyama	333d28a0bb	Revert "llvm-objdump: Dump COFF import table if -private-headers option is given." This reverts commit r191472 because it's failing on BE machine. llvm-svn: 191480	2013-09-27 01:29:36 +00:00
Rui Ueyama	5b1adbaad9	llvm-objdump: Dump COFF import table if -private-headers option is given. This is a patch to add capability to llvm-objdump to dump COFF Import Table entries, so that we can write tests for LLD checking Import Table contents. llvm-objdump did not print anything but just file name if the format is COFF and -private-headers option is given. This is a patch adds capability for dumping DLL Import Table, which is specific to the COFF format. In this patch I defined a new iterator to iterate over import table entries. Also added a few functions to COFFObjectFile.cpp to access fields of the entry. Differential Revision: http://llvm-reviews.chandlerc.com/D1719 llvm-svn: 191472	2013-09-27 00:07:01 +00:00
Ahmed Bougacha	d56f705d87	Add basic YAML MC CFG testcase. Drive-by llvm-objdump cleanup (don't hardcode ToolName). llvm-svn: 188904	2013-08-21 16:13:25 +00:00
Ahmed Bougacha	1792647942	MC CFG: Add YAML MCModule representation to enable MC CFG testing. Like yaml ObjectFiles, this will be very useful for testing the MC CFG implementation (mostly MCObjectDisassembler), by matching the output with YAML, and for potential users of the MC CFG, by using it as an input. There isn't much to the actual format, it is just a serialization of the MCModule class. Of note: - Basic block references (pred/succ, ..) are represented by the BB's start address. - Just as in the MC CFG, instructions are MCInsts with a size. - Operands have a prefix representing the type (only register and immediate supported here). - Instruction opcodes are represented by their names; enum values aren't stable, enum names mostly are: usually, a change to a name would need lots of changes in the backend anyway. Same with registers. All in all, an example is better than 1000 words, here goes: A simple binary: Disassembly of section __TEXT,__text: _main: 100000f9c: 48 8b 46 08 movq 8(%rsi), %rax 100000fa0: 0f be 00 movsbl (%rax), %eax 100000fa3: 3b 04 25 48 00 00 00 cmpl 72, %eax 100000faa: 0f 8c 07 00 00 00 jl 7 <.Lend> 100000fb0: 2b 04 25 48 00 00 00 subl 72, %eax .Lend: 100000fb7: c3 ret And the (pretty verbose) generated YAML: --- Atoms: - StartAddress: 0x0000000100000F9C Size: 20 Type: Text Content: - Inst: MOV64rm Size: 4 Ops: [ RRAX, RRSI, I1, R, I8, R ] - Inst: MOVSX32rm8 Size: 3 Ops: [ REAX, RRAX, I1, R, I0, R ] - Inst: CMP32rm Size: 7 Ops: [ REAX, R, I1, R, I72, R ] - Inst: JL_4 Size: 6 Ops: [ I7 ] - StartAddress: 0x0000000100000FB0 Size: 7 Type: Text Content: - Inst: SUB32rm Size: 7 Ops: [ REAX, REAX, R, I1, R, I72, R ] - StartAddress: 0x0000000100000FB7 Size: 1 Type: Text Content: - Inst: RET Size: 1 Ops: [ ] Functions: - Name: __text BasicBlocks: - Address: 0x0000000100000F9C Preds: [ ] Succs: [ 0x0000000100000FB7, 0x0000000100000FB0 ] <snip> ... llvm-svn: 188890	2013-08-21 07:29:02 +00:00
Bill Wendling	bc07a8900c	Use pointers to the MCAsmInfo and MCRegInfo. Someone may want to do something crazy, like replace these objects if they change or something. No functionality change intended. llvm-svn: 184175	2013-06-18 07:20:20 +00:00
NAKAMURA Takumi	d5c2e60b19	llvm-objdump.cpp: Appease MSC16 x64. utostr(n++) causes internal compiler error. llvm-svn: 182722	2013-05-27 00:02:48 +00:00
Ahmed Bougacha	aa79068157	MC: Disassembled CFG reconstruction. This patch builds on some existing code to do CFG reconstruction from a disassembled binary: - MCModule represents the binary, and has a list of MCAtoms. - MCAtom represents either disassembled instructions (MCTextAtom), or contiguous data (MCDataAtom), and covers a specific range of addresses. - MCBasicBlock and MCFunction form the reconstructed CFG. An MCBB is backed by an MCTextAtom, and has the usual successors/predecessors. - MCObjectDisassembler creates a module from an ObjectFile using a disassembler. It first builds an atom for each section. It can also construct the CFG, and this splits the text atoms into basic blocks. MCModule and MCAtom were only sketched out; MCFunction and MCBB were implemented under the experimental "-cfg" llvm-objdump -macho option. This cleans them up for further use; llvm-objdump -d -cfg now generates graphviz files for each function found in the binary. In the future, MCObjectDisassembler may be the right place to do "intelligent" disassembly: for example, handling constant islands is just a matter of splitting the atom, using information that may be available in the ObjectFile. Also, better initial atom formation than just using sections is possible using symbols (and things like Mach-O's function_starts load command). This brings two minor regressions in llvm-objdump -macho -cfg: - The printing of a relocation's referenced symbol. - An annotation on loop BBs, i.e., which are their own successor. Relocation printing is replaced by the MCSymbolizer; the basic CFG annotation will be superseded by more related functionality. llvm-svn: 182628	2013-05-24 01:07:04 +00:00
Ahmed Bougacha	ad1084de84	Add MCSymbolizer for symbolic/annotated disassembly. This is a basic first step towards symbolization of disassembled instructions. This used to be done using externally provided (C API) callbacks. This patch introduces: - the MCSymbolizer class, that mimics the same functions that were used in the X86 and ARM disassemblers to symbolize immediate operands and to annotate loads based off PC (for things like c string literals). - the MCExternalSymbolizer class, which implements the old C API. - the MCRelocationInfo class, which provides a way for targets to translate relocations (either object::RelocationRef, or disassembler C API VariantKinds) to MCExprs. - the MCObjectSymbolizer class, which does symbolization using what it finds in an object::ObjectFile. This makes simple symbolization (with no fancy relocation stuff) work for all object formats! - x86-64 Mach-O and ELF MCRelocationInfos. - A basic ARM Mach-O MCRelocationInfo, that provides just enough to support the C API VariantKinds. Most of what works in otool (the only user of the old symbolization API that I know of) for x86-64 symbolic disassembly (-tvV) works, namely: - symbol references: call _foo; jmp 15 <_foo+50> - relocations: call _foo-_bar; call _foo-4 - __cf?string: leaq 193(%rip), %rax ## literal pool for "hello" Stub support is the main missing part (because libObject doesn't know, among other things, about mach-o indirect symbols). As for the MCSymbolizer API, instead of relying on the disassemblers to call the tryAdding* methods, maybe this could be done automagically using InstrInfo? For instance, even though PC-relative LEAs are used to get the address of string literals in a typical Mach-O file, a MOV would be used in an ELF file. And right now, the explicit symbolization only recognizes PC-relative LEAs. InstrInfo should have already have most of what is needed to know what to symbolize, so this can definitely be improved. I'd also like to remove object::RelocationRef::getValueString (it seems only used by relocation printing in objdump), as simply printing the created MCExpr is definitely enough (and cleaner than string concats). llvm-svn: 182625	2013-05-24 00:39:57 +00:00
Ahmed Bougacha	0835ca12ef	llvm-objdump: Initialize MCDisassembler once instead of for each section. llvm-svn: 182054	2013-05-16 21:28:23 +00:00
Rafael Espindola	227144c23c	Remove the MachineMove class. It was just a less powerful and more confusing version of MCCFIInstruction. A side effect is that, since MCCFIInstruction uses dwarf register numbers, calls to getDwarfRegNum are pushed out, which should allow further simplifications. I left the MachineModuleInfo::addFrameMove interface unchanged since this patch was already fairly big. llvm-svn: 181680	2013-05-13 01:16:13 +00:00
Rafael Espindola	1e48387962	Clarify getRelocationAddress x getRelocationOffset a bit. getRelocationAddress is for dynamic libraries and executables, getRelocationOffset for relocatable objects. Mark the getRelocationAddress of COFF and MachO as not implemented yet. Add a test of ELF's. llvm-readobj -r now prints the same values as readelf -r. llvm-svn: 180259	2013-04-25 12:28:45 +00:00
Rafael Espindola	56f976f6bd	At Jim Grosbach's request detemplate Object/MachO.h. We are still able to handle mixed endian objects by swapping one struct at a time. llvm-svn: 179778	2013-04-18 18:08:55 +00:00
Alexey Samsonov	209095cd9f	llvm-objdump: Don't print contents of BSS sections: it makes no sense and crashes llvm-objdump on relocated objects with large bss llvm-svn: 179589	2013-04-16 10:53:11 +00:00
Rafael Espindola	9b709259e1	Finish templating MachObjectFile over endianness. We are now able to handle big endian macho files in llvm-readobject. Thanks to David Fang for providing the object files. llvm-svn: 179440	2013-04-13 01:45:40 +00:00
Rafael Espindola	c2413f59e4	Convert MachOObjectFile to a template. For now it is templated only on being 64 or 32 bits. I will add little/big endian next. llvm-svn: 179097	2013-04-09 14:49:08 +00:00
Rafael Espindola	b0f76a4b75	Don't fetch pointers from a InMemoryStruct. InMemoryStruct is extremely dangerous as it returns data from an internal buffer when the endiannes doesn't match. This should fix the tests on big endian hosts. llvm-svn: 178875	2013-04-05 15:15:22 +00:00
Eric Christopher	2d4b3a6b94	Don't disassemble symbols with an unknown address or size. Patch by Nico Rieck! llvm-svn: 178678	2013-04-03 18:31:23 +00:00
Guy Benyei	83c74e9fad	Add static cast to unsigned char whenever a character classification function is called with a signed char argument, in order to avoid assertions in Windows Debug configuration. llvm-svn: 175006	2013-02-12 21:21:59 +00:00
Michael J. Spencer	d7e7003e8b	[objdump,readobj] Document the purpose and goals of each tool. llvm-svn: 174439	2013-02-05 20:27:22 +00:00
Jakub Staszak	691860c294	Remove unneeded #include. llvm-svn: 173088	2013-01-21 21:02:47 +00:00
Michael J. Spencer	d857c1c9bf	[llvm-objdump] Emit addresses with the correct number of leading 0's. llvm-svn: 172130	2013-01-10 22:40:50 +00:00
Michael J. Spencer	209565db2d	[objdump] Add --private-headers, -p. This currently prints the ELF program headers. llvm-svn: 171649	2013-01-06 03:56:49 +00:00
Rafael Espindola	a9f810b6b5	Add a function to get the segment name of a section. On MachO, sections also have segment names. When a tool looking at a .o file prints a segment name, this is what they mean. In reality, a .o has only one anonymous, segment. This patch adds a MachO only function to fetch that segment name. I named it getSectionFinalSegmentName since the main use for the name seems to be inform the linker with segment this section should go to. The patch also changes MachOObjectFile::getSectionName to return just the section name instead of computing SegmentName,SectionName. The main difference from the previous patch is that it doesn't use InMemoryStruct. It is extremely dangerous: if the endians match it returns a pointer to the file buffer, if not, it returns a pointer to an internal buffer that is overwritten in the next API call. We should change all of this code to use support::detail::packed_endian_specific_integral like ELF, but since these functions only handle strings, they work with big and little endian machines as is. I have tested this by installing ubuntu 12.10 ppc on qemu, that is why it took so long :-) llvm-svn: 170838	2012-12-21 03:47:03 +00:00
Rafael Espindola	0f00de40dd	Revert 170545 while I debug the ppc failures. llvm-svn: 170547	2012-12-19 14:48:05 +00:00
Rafael Espindola	aa7b27801c	Add r170095 back. I cannot reproduce it the failures locally, so I will keep an eye at the ppc bots. This patch does add the change to the "Disassembly of section" message, but that is not what was failing on the bots. Original message: Add a funciton to get the segment name of a section. On MachO, sections also have segment names. When a tool looking at a .o file prints a segment name, this is what they mean. In reality, a .o has only one anonymous, segment. This patch adds a MachO only function to fetch that segment name. I named it getSectionFinalSegmentName since the main use for the name seems to be infor the linker with segment this section should go to. The patch also changes MachOObjectFile::getSectionName to return just the section name instead of computing SegmentName,SectionName. llvm-svn: 170545	2012-12-19 14:15:04 +00:00
Eric Christopher	c859c2912f	Revert "Add a funciton to get the segment name of a section." This reverts commit r170095 since it appears to be breaking the bots. llvm-svn: 170105	2012-12-13 06:36:18 +00:00
Rafael Espindola	bc8016d062	Add a funciton to get the segment name of a section. On MachO, sections also have segment names. When a tool looking at a .o file prints a segment name, this is what they mean. In reality, a .o has only one, anonymous, segment. This patch adds a MachO only function to fetch that segment name. I named it getSectionFinalSegmentName since the main use for the name seems to be informing the linker with segment this section should go to. The patch also changes MachOObjectFile::getSectionName to return just the section name instead of computing SegmentName,SectionName. llvm-svn: 170095	2012-12-13 04:07:18 +00:00
Michael J. Spencer	0c6ec48d0b	Add dump of Win64 EH unwind data. The new command line option -unwind-info dumps the Win64 EH unwind data to the console. This is a nice feature if you need to debug generated EH data (e.g. from LLVM). Includes a test case. Initial patch by João Matos, extensions and rework by Kai Nacke. llvm-svn: 169415	2012-12-05 20:12:35 +00:00
Chandler Carruth	4d88a1c233	Sort the #include lines for tools/... Again, tools are trickier to pick the main module header for than library source files. I've started to follow the pattern of using LLVMContext.h when it is included as a stub for program source files. llvm-svn: 169252	2012-12-04 10:44:52 +00:00
Eli Bendersky	3a6808cc32	Add the -no-show-raw-insn option to llvm-objdump, thus making it a bit more conformant to binutils objdump. llvm-svn: 168393	2012-11-20 22:57:02 +00:00
Jack Carter	551efd7fd9	Some of the instructions in the Mips instruction set are revision delimited. llvm-mc -disassemble access these through the -mattr option. llvm-objdump -disassemble had no such way to set the attribute so some instructions were just not recognized for disassembly. This patch accepts llvm-mc mechanism for specifying the attributes. llvm-svn: 162781	2012-08-28 19:24:49 +00:00
Jim Grosbach	af9aec0cd7	Tidy up a bit. llvm-svn: 161430	2012-08-07 17:53:14 +00:00
Kevin Enderby	fe3d005ca5	Fix it so llvm-objdump -arch does accept x86 and x86-64 as valid arch names. PR12731. Patch by Meador Inge! llvm-svn: 156444	2012-05-08 23:38:45 +00:00
Pete Cooper	28fb4fc91b	PR12729: Change 'llvm-objdump' to display the available targets. Patch by Meador Inge. llvm-svn: 156128	2012-05-03 23:20:10 +00:00
Craig Topper	54bfde79db	Make MCInstrInfo available to the MCInstPrinter. This will be used to remove getInstructionName and the static data it contains since the same tables are already in MCInstrInfo. llvm-svn: 153860	2012-04-02 06:09:36 +00:00
Benjamin Kramer	a5177e63a6	Include cctype for std::isprint. This should unbreak the msvc build. llvm-svn: 153329	2012-03-23 11:49:32 +00:00
Benjamin Kramer	82803112da	Fix uses of the C99 PRI format macros not to conflict with C++11 UDLs. llvm-svn: 152474	2012-03-10 02:04:38 +00:00
Jim Grosbach	fd93a59557	Make MCRegisterInfo available to the the MCInstPrinter. Used to allow context sensitive printing of super-register or sub-register references. llvm-svn: 152043	2012-03-05 19:33:20 +00:00
David Meyer	7e4b976c36	[Object] Add symbol attribute flags: ST_ThreadLocal, ST_Common, and ST_Undefined. Implement these completely for ELF. Rename ST_External to ST_Unknown, and slightly change its semantics. It now only indicates that the symbol's type is unknown, not that the symbol is undefined. (For that, use ST_Undefined). llvm-svn: 151696	2012-02-29 02:11:55 +00:00
David Meyer	1df4b84db4	In the ObjectFile interface, replace isInternal(), isAbsolute(), isGlobal(), and isWeak(), with a bitset of flags. llvm-svn: 151670	2012-02-28 23:47:53 +00:00
Cameron Zwarich	07f0f77629	Fix llvm-objdump disassembly for interesting Mach-O binaries, e.g. any MacOS dylib. This regressed with r145408. I will try to make a test case and add it so that this doesn't happen again. llvm-svn: 149667	2012-02-03 04:13:37 +00:00
David Blaikie	a379b18173	Unweaken vtables as per http://llvm.org/docs/CodingStandards.html#ll_virtual_anch llvm-svn: 146960	2011-12-20 02:50:00 +00:00
Danil Malyshev	cbe72fc959	Fixed ObjectFile functions: - getSymbolOffset() renamed as getSymbolFileOffset() - getSymbolFileOffset(), getSymbolAddress(), getRelocationAddress() returns same result for ELFObjectFile, MachOObjectFile and COFFObjectFile. - added getRelocationOffset() - fixed MachOObjectFile::getSymbolSize() - fixed MachOObjectFile::getSymbolSection() - fixed MachOObjectFile::getSymbolOffset() for symbols without section data. llvm-svn: 145408	2011-11-29 17:40:10 +00:00
Chandler Carruth	37ab257b88	Revert r145180 as it is causing test failures on all the bots. Original commit message: Fixed ObjectFile functions: - getSymbolOffset() renamed as getSymbolFileOffset() - getSymbolFileOffset(), getSymbolAddress(), getRelocationAddress() returns same result for ELFObjectFile, MachOObjectFile and COFFObjectFile. - added getRelocationOffset() - fixed MachOObjectFile::getSymbolSize() - fixed MachOObjectFile::getSymbolSection() - fixed MachOObjectFile::getSymbolOffset() for symbols without section data. llvm-svn: 145182	2011-11-27 10:37:47 +00:00
Danil Malyshev	2631f93f7d	Fixed ObjectFile functions: - getSymbolOffset() renamed as getSymbolFileOffset() - getSymbolFileOffset(), getSymbolAddress(), getRelocationAddress() returns same result for ELFObjectFile, MachOObjectFile and COFFObjectFile. - added getRelocationOffset() - fixed MachOObjectFile::getSymbolSize() - fixed MachOObjectFile::getSymbolSection() - fixed MachOObjectFile::getSymbolOffset() for symbols without section data. llvm-svn: 145180	2011-11-27 10:12:52 +00:00
Michael J. Spencer	53723de5b8	llvm-objdump: Ignore non-objects in archives. llvm-svn: 144755	2011-11-16 01:24:41 +00:00
Stepan Dyatkovskiy	07c0d4091f	uint64 formatted output: replaced %llx with PRIx64 macro. llvm-svn: 143191	2011-10-28 13:07:32 +00:00
Owen Anderson	bf3bc1db22	Revert r143149, stubbing out symbolic disassembly support. The symbolic disassembly support is too MC-engrained to be useful in llvm-objdump. llvm-svn: 143152	2011-10-27 21:55:13 +00:00
Owen Anderson	8f167d4861	Stub out support for symbol disassembly in llvm-objdump. llvm-svn: 143149	2011-10-27 21:46:31 +00:00
Stepan Dyatkovskiy	4b96dc7167	Fixed llvm-objdump uint64_t formatted output. llvm-svn: 143120	2011-10-27 18:40:45 +00:00
Owen Anderson	fa3e5200b8	Add support for the notion of "hidden" relocations. On MachO, these are relocation entries that are used as additional information for other, real relocations, rather than being relocations themselves. I'm not familiar enough with ELF or COFF to know if they should have any relocations marked hidden. llvm-svn: 142961	2011-10-25 20:35:53 +00:00
Owen Anderson	f20e3e5774	Fix off-by-one error when printing relocations inline with disassembly. llvm-svn: 142952	2011-10-25 20:15:39 +00:00
Michael J. Spencer	bfa067862c	llvm-objdump: Add static symbol table dumping. llvm-svn: 142404	2011-10-18 19:32:17 +00:00
Michael J. Spencer	81c80ddb0c	Revert "llvm-objdump: Add static symbol table dumping." This reverts commit 0c30d4e4f5f9110c5a67bd0ca84444dc58697596. llvm-svn: 142320	2011-10-18 00:17:04 +00:00
Michael J. Spencer	6b22ef8af2	llvm-objdump: Add static symbol table dumping. llvm-svn: 142319	2011-10-17 23:55:22 +00:00
Michael J. Spencer	4e25c02487	llvm-objdump: Add -s, which prints the contents of each section. llvm-svn: 142199	2011-10-17 17:13:22 +00:00
Michael J. Spencer	51862b3890	llvm-object: Add inline relocation information to disassembly. llvm-svn: 141897	2011-10-13 22:17:18 +00:00
Michael J. Spencer	8f67d47d0d	llvm-objdump: Fix whitespace. llvm-svn: 141886	2011-10-13 20:37:20 +00:00
Michael J. Spencer	ee84f64f0b	llvm-objdump: Fix dumping of multiple symbols with the same address. This happens in COFF because there is a symbol for the beginning of each section. llvm-svn: 141885	2011-10-13 20:37:08 +00:00
NAKAMURA Takumi	bd926cbdb5	llvm-objdump.cpp: Use PRIx64 as format specifier for int64_t. llvm-svn: 141664	2011-10-11 12:51:50 +00:00

... 3 4 5 6 7 ...

486 Commits