llvm-project/llvm/docs/PDB/CodeViewSymbols.rst

273 lines
7.3 KiB
ReStructuredText

=====================================
CodeView Symbol Records
=====================================
.. contents::
:local:
.. _symbols_intro:
Introduction
============
This document describes the usage and serialization format of the various
CodeView symbol records that LLVM understands. Like
:doc:`CodeView Type Records <CodeViewTypes>`, we describe only the important
types which are generated by modern C++ toolchains.
Record Categories
=================
Symbol records share one major similarity with :doc:`type records <CodeViewTypes>`:
They start with the same :ref:`record prefix <leaf_types>`, which we will not describe
again (refer to the previous link for a description). As a result of this, a sequence
of symbol records can be processed with largely the same code as that which processes
type records. There are several important differences between symbol and type records:
* Symbol records only appear in the :doc:`PublicStream`, :doc:`GlobalStream`, and
:doc:`Module Info Streams <ModiStream>`.
* Type records only appear in the :doc:`TPI & IPI streams <TpiStream>`.
* While types are referenced from other CodeView records via :ref:`type indices <type_indices>`,
symbol records are referenced by the byte offset of the record in the stream that it appears
in.
* Types can reference types (via type indices), and symbols can reference both types (via type
indices) and symbols (via offsets), but types can never reference symbols.
* There is no notion of :ref:`Leaf Records <leaf_types>` and :ref:`Member Records <member_types>`
as there are with types. Every symbol record describes is own length.
* Certain special symbol records begin a "scope". For these records, all following records
up until the next ``S_END`` record are "children" of this symbol record. For example,
given a symbol record which describes a certain function, all local variables of this
function would appear following the function up until the corresponding ``S_END`` record.
Finally, there are three general categories of symbol record, grouped by where they are legal
to appear in a PDB file. Public Symbols (which appear only in the
:doc:`publics stream <PublicStream>`), Global Symbols (which appear only in the
:doc:`globals stream <GlobalStream>`) and module symbols (which appear in the
:doc:`module info stream <ModiStream>`).
.. _public_symbols:
Public Symbols
--------------
Public symbols are the CodeView equivalent of DWARF ``.debug_pubnames``. There
is one public symbol record for every function or variable in the program that
has a mangled name. The :doc:`Publics Stream <PublicStream>`, which contains these
records, additionally contains a hash table that allows one to quickly locate a
record by mangled name.
S_PUB32 (0x110e)
^^^^^^^^^^^^^^^^
There is only type of public symbol, an ``S_PUB32`` which describes a mangled
name, a flag indicating what kind of symbol it is (e.g. function, variable), and
the symbol's address. The :ref:`dbi_section_map_substream` of the
:doc:`DBI Stream <DbiStream>` can be consulted to determine what module this address
corresponds to, and from there that module's :doc:`module debug stream <ModiStream>`
can be consulted to locate full information for the symbol with the given address.
.. _global_symbols:
Global Symbols
--------------
While there is one :ref:`public symbol <public_symbols>` for every symbol in the
program with `external` linkage, there is one global symbol for every symbol in the
program with linkage (including internal linkage). As a result, global symbols do
not describe a mangled name *or* an address, since symbols with internal linkage
need not have any mangling at all, and also may not have an address. Thus, all
global symbols simply refer directly to the full symbol record via a module/offset
combination.
Similarly to :ref:`public symbols <public_symbols>`, all global symbols are contained
in a single :doc:`Globals Stream <GlobalStream>`, which contains a hash table mapping
fully qualified name to the corresponding record in the globals stream (which as
mentioned, then contains information allowing one to locate the full record in the
corresponding module symbol stream).
Note that a consequence and limitation of this design is that program-wide lookup
by anything other than an exact textually matching fully-qualified name of whatever
the compiler decided to emit is impractical. This differs from DWARF, where even
though we don't necessarily have O(1) lookup by basename within a given scope (including
O(1) scope, we at least have O(n) access within a given scope).
.. important::
Program-wide lookup of names by anything other than an exact textually matching fully
qualified name is not possible.
S_GDATA32
^^^^^^^^^^
S_GTHREAD32 (0x1113)
^^^^^^^^^^^^^^^^^^^^
S_PROCREF (0x1125)
^^^^^^^^^^^^^^^^^^
S_LPROCREF (0x1127)
^^^^^^^^^^^^^^^^^^^
S_GMANDATA (0x111d)
^^^^^^^^^^^^^^^^^^^
.. _module_symbols:
Module Symbols
--------------
S_END (0x0006)
^^^^^^^^^^^^^^
S_FRAMEPROC (0x1012)
^^^^^^^^^^^^^^^^^^^^
S_OBJNAME (0x1101)
^^^^^^^^^^^^^^^^^^
S_THUNK32 (0x1102)
^^^^^^^^^^^^^^^^^^
S_BLOCK32 (0x1103)
^^^^^^^^^^^^^^^^^^
S_LABEL32 (0x1105)
^^^^^^^^^^^^^^^^^^
S_REGISTER (0x1106)
^^^^^^^^^^^^^^^^^^^
S_BPREL32 (0x110b)
^^^^^^^^^^^^^^^^^^
S_LPROC32 (0x110f)
^^^^^^^^^^^^^^^^^^
S_GPROC32 (0x1110)
^^^^^^^^^^^^^^^^^^
S_REGREL32 (0x1111)
^^^^^^^^^^^^^^^^^^^
S_COMPILE2 (0x1116)
^^^^^^^^^^^^^^^^^^^
S_UNAMESPACE (0x1124)
^^^^^^^^^^^^^^^^^^^^^
S_TRAMPOLINE (0x112c)
^^^^^^^^^^^^^^^^^^^^^
S_SECTION (0x1136)
^^^^^^^^^^^^^^^^^^
S_COFFGROUP (0x1137)
^^^^^^^^^^^^^^^^^^^^
S_EXPORT (0x1138)
^^^^^^^^^^^^^^^^^
S_CALLSITEINFO (0x1139)
^^^^^^^^^^^^^^^^^^^^^^^
S_FRAMECOOKIE (0x113a)
^^^^^^^^^^^^^^^^^^^^^^
S_COMPILE3 (0x113c)
^^^^^^^^^^^^^^^^^^^
S_ENVBLOCK (0x113d)
^^^^^^^^^^^^^^^^^^^
S_LOCAL (0x113e)
^^^^^^^^^^^^^^^^
S_DEFRANGE (0x113f)
^^^^^^^^^^^^^^^^^^^
S_DEFRANGE_SUBFIELD (0x1140)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
S_DEFRANGE_REGISTER (0x1141)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
S_DEFRANGE_FRAMEPOINTER_REL (0x1142)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
S_DEFRANGE_SUBFIELD_REGISTER (0x1143)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
S_DEFRANGE_FRAMEPOINTER_REL_FULL_SCOPE (0x1144)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
S_DEFRANGE_REGISTER_REL (0x1145)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
S_LPROC32_ID (0x1146)
^^^^^^^^^^^^^^^^^^^^^
S_GPROC32_ID (0x1147)
^^^^^^^^^^^^^^^^^^^^^
S_BUILDINFO (0x114c)
^^^^^^^^^^^^^^^^^^^^
S_INLINESITE (0x114d)
^^^^^^^^^^^^^^^^^^^^^
S_INLINESITE_END (0x114e)
^^^^^^^^^^^^^^^^^^^^^^^^^
S_PROC_ID_END (0x114f)
^^^^^^^^^^^^^^^^^^^^^^
S_FILESTATIC (0x1153)
^^^^^^^^^^^^^^^^^^^^^
S_LPROC32_DPC (0x1155)
^^^^^^^^^^^^^^^^^^^^^^
S_LPROC32_DPC_ID (0x1156)
^^^^^^^^^^^^^^^^^^^^^^^^^
S_CALLEES (0x115a)
^^^^^^^^^^^^^^^^^^
S_CALLERS (0x115b)
^^^^^^^^^^^^^^^^^^
S_HEAPALLOCSITE (0x115e)
^^^^^^^^^^^^^^^^^^^^^^^^
S_FASTLINK (0x1167)
^^^^^^^^^^^^^^^^^^^
S_INLINEES (0x1168)
^^^^^^^^^^^^^^^^^^^
.. _module_and_global_symbols:
Symbols which can go in either/both of the module info stream & global stream
-----------------------------------------------------------------------------
S_CONSTANT (0x1107)
^^^^^^^^^^^^^^^^^^^
S_UDT (0x1108)
^^^^^^^^^^^^^^
S_LDATA32 (0x110c)
^^^^^^^^^^^^^^^^^^
S_LTHREAD32 (0x1112)
^^^^^^^^^^^^^^^^^^^^
S_LMANDATA (0x111c)
^^^^^^^^^^^^^^^^^^^
S_MANCONSTANT (0x112d)
^^^^^^^^^^^^^^^^^^^^^^