[ORC] Add generic initializer/deinitializer support.
Initializers and deinitializers are used to implement C++ static constructors
and destructors, runtime registration for some languages (e.g. with the
Objective-C runtime for Objective-C/C++ code) and other tasks that would
typically be performed when a shared-object/dylib is loaded or unloaded by a
statically compiled program.
MCJIT and ORC have historically provided limited support for discovering and
running initializers/deinitializers by scanning the llvm.global_ctors and
llvm.global_dtors variables and recording the functions to be run. This approach
suffers from several drawbacks: (1) It only works for IR inputs, not for object
files (including cached JIT'd objects). (2) It only works for initializers
described by llvm.global_ctors and llvm.global_dtors, however not all
initializers are described in this way (Objective-C, for example, describes
initializers via specially named metadata sections). (3) To make the
initializer/deinitializer functions described by llvm.global_ctors and
llvm.global_dtors searchable they must be promoted to extern linkage, polluting
the JIT symbol table (extra care must be taken to ensure this promotion does
not result in symbol name clashes).
This patch introduces several interdependent changes to ORCv2 to support the
construction of new initialization schemes, and includes an implementation of a
backwards-compatible llvm.global_ctor/llvm.global_dtor scanning scheme, and a
MachO specific scheme that handles Objective-C runtime registration (if the
Objective-C runtime is available) enabling execution of LLVM IR compiled from
Objective-C and Swift.
The major changes included in this patch are:
(1) The MaterializationUnit and MaterializationResponsibility classes are
extended to describe an optional "initializer" symbol for the module (see the
getInitializerSymbol method on each class). The presence or absence of this
symbol indicates whether the module contains any initializers or
deinitializers. The initializer symbol otherwise behaves like any other:
searching for it triggers materialization.
(2) A new Platform interface is introduced in llvm/ExecutionEngine/Orc/Core.h
which provides the following callback interface:
- Error setupJITDylib(JITDylib &JD): Can be used to install standard symbols
in JITDylibs upon creation. E.g. __dso_handle.
- Error notifyAdding(JITDylib &JD, const MaterializationUnit &MU): Generally
used to record initializer symbols.
- Error notifyRemoving(JITDylib &JD, VModuleKey K): Used to notify a platform
that a module is being removed.
Platform implementations can use these callbacks to track outstanding
initializers and implement a platform-specific approach for executing them. For
example, the MachOPlatform installs a plugin in the JIT linker to scan for both
__mod_inits sections (for C++ static constructors) and ObjC metadata sections.
If discovered, these are processed in the usual platform order: Objective-C
registration is carried out first, then static initializers are executed,
ensuring that calls to Objective-C from static initializers will be safe.
This patch updates LLJIT to use the new scheme for initialization. Two
LLJIT::PlatformSupport classes are implemented: A GenericIR platform and a MachO
platform. The GenericIR platform implements a modified version of the previous
llvm.global-ctor scraping scheme to provide support for Windows and
Linux. LLJIT's MachO platform uses the MachOPlatform class to provide MachO
specific initialization as described above.
Reviewers: sgraenitz, dblaikie
Subscribers: mgorny, hiraditya, mgrang, ributzka, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D74300
2019-12-16 18:50:40 +08:00
|
|
|
//===----------- Mangling.cpp -- Name Mangling Utilities for ORC ----------===//
|
|
|
|
//
|
|
|
|
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
|
|
|
|
// See https://llvm.org/LICENSE.txt for license information.
|
|
|
|
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
|
|
|
|
//
|
|
|
|
//===----------------------------------------------------------------------===//
|
|
|
|
|
|
|
|
#include "llvm/ExecutionEngine/Orc/Mangling.h"
|
|
|
|
#include "llvm/IR/Constants.h"
|
|
|
|
#include "llvm/IR/Mangler.h"
|
|
|
|
#include "llvm/Object/MachO.h"
|
|
|
|
#include "llvm/Object/ObjectFile.h"
|
|
|
|
#include "llvm/Support/Debug.h"
|
|
|
|
|
|
|
|
#define DEBUG_TYPE "orc"
|
|
|
|
|
|
|
|
namespace llvm {
|
|
|
|
namespace orc {
|
|
|
|
|
|
|
|
MangleAndInterner::MangleAndInterner(ExecutionSession &ES, const DataLayout &DL)
|
|
|
|
: ES(ES), DL(DL) {}
|
|
|
|
|
|
|
|
SymbolStringPtr MangleAndInterner::operator()(StringRef Name) {
|
|
|
|
std::string MangledName;
|
|
|
|
{
|
|
|
|
raw_string_ostream MangledNameStream(MangledName);
|
|
|
|
Mangler::getNameWithPrefix(MangledNameStream, Name, DL);
|
|
|
|
}
|
|
|
|
return ES.intern(MangledName);
|
|
|
|
}
|
|
|
|
|
|
|
|
void IRSymbolMapper::add(ExecutionSession &ES, const ManglingOptions &MO,
|
|
|
|
ArrayRef<GlobalValue *> GVs,
|
|
|
|
SymbolFlagsMap &SymbolFlags,
|
|
|
|
SymbolNameToDefinitionMap *SymbolToDefinition) {
|
|
|
|
if (GVs.empty())
|
|
|
|
return;
|
|
|
|
|
|
|
|
MangleAndInterner Mangle(ES, GVs[0]->getParent()->getDataLayout());
|
|
|
|
for (auto *G : GVs) {
|
|
|
|
assert(G && "GVs cannot contain null elements");
|
|
|
|
if (!G->hasName() || G->isDeclaration() || G->hasLocalLinkage() ||
|
|
|
|
G->hasAvailableExternallyLinkage() || G->hasAppendingLinkage())
|
|
|
|
continue;
|
|
|
|
|
|
|
|
if (G->isThreadLocal() && MO.EmulatedTLS) {
|
|
|
|
auto *GV = cast<GlobalVariable>(G);
|
|
|
|
|
|
|
|
auto Flags = JITSymbolFlags::fromGlobalValue(*GV);
|
|
|
|
|
|
|
|
auto EmuTLSV = Mangle(("__emutls_v." + GV->getName()).str());
|
|
|
|
SymbolFlags[EmuTLSV] = Flags;
|
|
|
|
if (SymbolToDefinition)
|
|
|
|
(*SymbolToDefinition)[EmuTLSV] = GV;
|
|
|
|
|
|
|
|
// If this GV has a non-zero initializer we'll need to emit an
|
|
|
|
// __emutls.t symbol too.
|
|
|
|
if (GV->hasInitializer()) {
|
|
|
|
const auto *InitVal = GV->getInitializer();
|
|
|
|
|
|
|
|
// Skip zero-initializers.
|
|
|
|
if (isa<ConstantAggregateZero>(InitVal))
|
|
|
|
continue;
|
|
|
|
const auto *InitIntValue = dyn_cast<ConstantInt>(InitVal);
|
|
|
|
if (InitIntValue && InitIntValue->isZero())
|
|
|
|
continue;
|
|
|
|
|
|
|
|
auto EmuTLST = Mangle(("__emutls_t." + GV->getName()).str());
|
|
|
|
SymbolFlags[EmuTLST] = Flags;
|
|
|
|
if (SymbolToDefinition)
|
|
|
|
(*SymbolToDefinition)[EmuTLST] = GV;
|
|
|
|
}
|
|
|
|
continue;
|
|
|
|
}
|
|
|
|
|
|
|
|
// Otherwise we just need a normal linker mangling.
|
|
|
|
auto MangledName = Mangle(G->getName());
|
|
|
|
SymbolFlags[MangledName] = JITSymbolFlags::fromGlobalValue(*G);
|
|
|
|
if (SymbolToDefinition)
|
|
|
|
(*SymbolToDefinition)[MangledName] = G;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
Expected<std::pair<SymbolFlagsMap, SymbolStringPtr>>
|
|
|
|
getObjectSymbolInfo(ExecutionSession &ES, MemoryBufferRef ObjBuffer) {
|
|
|
|
auto Obj = object::ObjectFile::createObjectFile(ObjBuffer);
|
|
|
|
|
|
|
|
if (!Obj)
|
|
|
|
return Obj.takeError();
|
|
|
|
|
|
|
|
SymbolFlagsMap SymbolFlags;
|
|
|
|
for (auto &Sym : (*Obj)->symbols()) {
|
|
|
|
// Skip symbols not defined in this object file.
|
|
|
|
if (Sym.getFlags() & object::BasicSymbolRef::SF_Undefined)
|
|
|
|
continue;
|
|
|
|
|
|
|
|
// Skip symbols that are not global.
|
|
|
|
if (!(Sym.getFlags() & object::BasicSymbolRef::SF_Global))
|
|
|
|
continue;
|
|
|
|
|
2020-03-04 08:02:46 +08:00
|
|
|
// Skip symbols that have type SF_File.
|
|
|
|
if (auto SymType = Sym.getType()) {
|
|
|
|
if (*SymType == object::SymbolRef::ST_File)
|
|
|
|
continue;
|
|
|
|
} else
|
|
|
|
return SymType.takeError();
|
|
|
|
|
[ORC] Add generic initializer/deinitializer support.
Initializers and deinitializers are used to implement C++ static constructors
and destructors, runtime registration for some languages (e.g. with the
Objective-C runtime for Objective-C/C++ code) and other tasks that would
typically be performed when a shared-object/dylib is loaded or unloaded by a
statically compiled program.
MCJIT and ORC have historically provided limited support for discovering and
running initializers/deinitializers by scanning the llvm.global_ctors and
llvm.global_dtors variables and recording the functions to be run. This approach
suffers from several drawbacks: (1) It only works for IR inputs, not for object
files (including cached JIT'd objects). (2) It only works for initializers
described by llvm.global_ctors and llvm.global_dtors, however not all
initializers are described in this way (Objective-C, for example, describes
initializers via specially named metadata sections). (3) To make the
initializer/deinitializer functions described by llvm.global_ctors and
llvm.global_dtors searchable they must be promoted to extern linkage, polluting
the JIT symbol table (extra care must be taken to ensure this promotion does
not result in symbol name clashes).
This patch introduces several interdependent changes to ORCv2 to support the
construction of new initialization schemes, and includes an implementation of a
backwards-compatible llvm.global_ctor/llvm.global_dtor scanning scheme, and a
MachO specific scheme that handles Objective-C runtime registration (if the
Objective-C runtime is available) enabling execution of LLVM IR compiled from
Objective-C and Swift.
The major changes included in this patch are:
(1) The MaterializationUnit and MaterializationResponsibility classes are
extended to describe an optional "initializer" symbol for the module (see the
getInitializerSymbol method on each class). The presence or absence of this
symbol indicates whether the module contains any initializers or
deinitializers. The initializer symbol otherwise behaves like any other:
searching for it triggers materialization.
(2) A new Platform interface is introduced in llvm/ExecutionEngine/Orc/Core.h
which provides the following callback interface:
- Error setupJITDylib(JITDylib &JD): Can be used to install standard symbols
in JITDylibs upon creation. E.g. __dso_handle.
- Error notifyAdding(JITDylib &JD, const MaterializationUnit &MU): Generally
used to record initializer symbols.
- Error notifyRemoving(JITDylib &JD, VModuleKey K): Used to notify a platform
that a module is being removed.
Platform implementations can use these callbacks to track outstanding
initializers and implement a platform-specific approach for executing them. For
example, the MachOPlatform installs a plugin in the JIT linker to scan for both
__mod_inits sections (for C++ static constructors) and ObjC metadata sections.
If discovered, these are processed in the usual platform order: Objective-C
registration is carried out first, then static initializers are executed,
ensuring that calls to Objective-C from static initializers will be safe.
This patch updates LLJIT to use the new scheme for initialization. Two
LLJIT::PlatformSupport classes are implemented: A GenericIR platform and a MachO
platform. The GenericIR platform implements a modified version of the previous
llvm.global-ctor scraping scheme to provide support for Windows and
Linux. LLJIT's MachO platform uses the MachOPlatform class to provide MachO
specific initialization as described above.
Reviewers: sgraenitz, dblaikie
Subscribers: mgorny, hiraditya, mgrang, ributzka, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D74300
2019-12-16 18:50:40 +08:00
|
|
|
auto Name = Sym.getName();
|
|
|
|
if (!Name)
|
|
|
|
return Name.takeError();
|
|
|
|
auto InternedName = ES.intern(*Name);
|
|
|
|
auto SymFlags = JITSymbolFlags::fromObjectSymbol(Sym);
|
|
|
|
if (!SymFlags)
|
|
|
|
return SymFlags.takeError();
|
|
|
|
SymbolFlags[InternedName] = std::move(*SymFlags);
|
|
|
|
}
|
|
|
|
|
|
|
|
SymbolStringPtr InitSymbol;
|
|
|
|
|
|
|
|
if (auto *MachOObj = dyn_cast<object::MachOObjectFile>(Obj->get())) {
|
|
|
|
for (auto &Sec : MachOObj->sections()) {
|
|
|
|
auto SecType = MachOObj->getSectionType(Sec);
|
|
|
|
if ((SecType & MachO::SECTION_TYPE) == MachO::S_MOD_INIT_FUNC_POINTERS) {
|
|
|
|
std::string InitSymString;
|
|
|
|
raw_string_ostream(InitSymString)
|
|
|
|
<< "$." << ObjBuffer.getBufferIdentifier() << ".__inits";
|
|
|
|
InitSymbol = ES.intern(InitSymString);
|
2020-03-04 01:32:49 +08:00
|
|
|
SymbolFlags[InitSymbol] = JITSymbolFlags();
|
[ORC] Add generic initializer/deinitializer support.
Initializers and deinitializers are used to implement C++ static constructors
and destructors, runtime registration for some languages (e.g. with the
Objective-C runtime for Objective-C/C++ code) and other tasks that would
typically be performed when a shared-object/dylib is loaded or unloaded by a
statically compiled program.
MCJIT and ORC have historically provided limited support for discovering and
running initializers/deinitializers by scanning the llvm.global_ctors and
llvm.global_dtors variables and recording the functions to be run. This approach
suffers from several drawbacks: (1) It only works for IR inputs, not for object
files (including cached JIT'd objects). (2) It only works for initializers
described by llvm.global_ctors and llvm.global_dtors, however not all
initializers are described in this way (Objective-C, for example, describes
initializers via specially named metadata sections). (3) To make the
initializer/deinitializer functions described by llvm.global_ctors and
llvm.global_dtors searchable they must be promoted to extern linkage, polluting
the JIT symbol table (extra care must be taken to ensure this promotion does
not result in symbol name clashes).
This patch introduces several interdependent changes to ORCv2 to support the
construction of new initialization schemes, and includes an implementation of a
backwards-compatible llvm.global_ctor/llvm.global_dtor scanning scheme, and a
MachO specific scheme that handles Objective-C runtime registration (if the
Objective-C runtime is available) enabling execution of LLVM IR compiled from
Objective-C and Swift.
The major changes included in this patch are:
(1) The MaterializationUnit and MaterializationResponsibility classes are
extended to describe an optional "initializer" symbol for the module (see the
getInitializerSymbol method on each class). The presence or absence of this
symbol indicates whether the module contains any initializers or
deinitializers. The initializer symbol otherwise behaves like any other:
searching for it triggers materialization.
(2) A new Platform interface is introduced in llvm/ExecutionEngine/Orc/Core.h
which provides the following callback interface:
- Error setupJITDylib(JITDylib &JD): Can be used to install standard symbols
in JITDylibs upon creation. E.g. __dso_handle.
- Error notifyAdding(JITDylib &JD, const MaterializationUnit &MU): Generally
used to record initializer symbols.
- Error notifyRemoving(JITDylib &JD, VModuleKey K): Used to notify a platform
that a module is being removed.
Platform implementations can use these callbacks to track outstanding
initializers and implement a platform-specific approach for executing them. For
example, the MachOPlatform installs a plugin in the JIT linker to scan for both
__mod_inits sections (for C++ static constructors) and ObjC metadata sections.
If discovered, these are processed in the usual platform order: Objective-C
registration is carried out first, then static initializers are executed,
ensuring that calls to Objective-C from static initializers will be safe.
This patch updates LLJIT to use the new scheme for initialization. Two
LLJIT::PlatformSupport classes are implemented: A GenericIR platform and a MachO
platform. The GenericIR platform implements a modified version of the previous
llvm.global-ctor scraping scheme to provide support for Windows and
Linux. LLJIT's MachO platform uses the MachOPlatform class to provide MachO
specific initialization as described above.
Reviewers: sgraenitz, dblaikie
Subscribers: mgorny, hiraditya, mgrang, ributzka, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D74300
2019-12-16 18:50:40 +08:00
|
|
|
break;
|
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
|
|
|
return std::make_pair(std::move(SymbolFlags), std::move(InitSymbol));
|
|
|
|
}
|
|
|
|
|
|
|
|
} // End namespace orc.
|
|
|
|
} // End namespace llvm.
|