tracing: Amend documentation in recordmcount.pl to reflect implementation

The documentation currently says we will use the first function in a section
as a reference. The actual algorithm is: choose the first global function we
meet as a reference. If there is none, choose the first local one.
Change the documentation to be consistent with the code.

Also add several other clarifications.

Signed-off-by: Li Hong <lihong.hi@gmail.com>
LKML-Reference: <20091028050138.GA30758@uhli>
Signed-off-by: Steven Rostedt <rostedt@goodmis.org>
This commit is contained in:
Li Hong 2009-10-28 13:01:38 +08:00 committed by Steven Rostedt
parent 9de09ace8d
commit d49f6aa76d
1 changed files with 49 additions and 35 deletions

View File

@ -6,73 +6,89 @@
# all the offsets to the calls to mcount. # all the offsets to the calls to mcount.
# #
# #
# What we want to end up with is a section in vmlinux called # What we want to end up with this is that each object file will have a
# __mcount_loc that contains a list of pointers to all the # section called __mcount_loc that will hold the list of pointers to mcount
# call sites in the kernel that call mcount. Later on boot up, the kernel # callers. After final linking, the vmlinux will have within .init.data the
# will read this list, save the locations and turn them into nops. # list of all callers to mcount between __start_mcount_loc and __stop_mcount_loc.
# When tracing or profiling is later enabled, these locations will then # Later on boot up, the kernel will read this list, save the locations and turn
# be converted back to pointers to some function. # them into nops. When tracing or profiling is later enabled, these locations
# will then be converted back to pointers to some function.
# #
# This is no easy feat. This script is called just after the original # This is no easy feat. This script is called just after the original
# object is compiled and before it is linked. # object is compiled and before it is linked.
# #
# The references to the call sites are offsets from the section of text # When parse this object file using 'objdump', the references to the call
# that the call site is in. Hence, all functions in a section that # sites are offsets from the section that the call site is in. Hence, all
# has a call site to mcount, will have the offset from the beginning of # functions in a section that has a call site to mcount, will have the
# the section and not the beginning of the function. # offset from the beginning of the section and not the beginning of the
# function.
#
# But where this section will reside finally in vmlinx is undetermined at
# this point. So we can't use this kind of offsets to record the final
# address of this call site.
#
# The trick is to change the call offset referring the start of a section to
# referring a function symbol in this section. During the link step, 'ld' will
# compute the final address according to the information we record.
# #
# The trick is to find a way to record the beginning of the section.
# The way we do this is to look at the first function in the section
# which will also be the location of that section after final link.
# e.g. # e.g.
# #
# .section ".sched.text", "ax" # .section ".sched.text", "ax"
# .globl my_func
# my_func:
# [...] # [...]
# call mcount (offset: 0x5) # func1:
# [...]
# call mcount (offset: 0x10)
# [...] # [...]
# ret # ret
# other_func: # .globl fun2
# func2: (offset: 0x20)
# [...] # [...]
# call mcount (offset: 0x1b) # [...]
# ret
# func3:
# [...]
# call mcount (offset: 0x30)
# [...] # [...]
# #
# Both relocation offsets for the mcounts in the above example will be # Both relocation offsets for the mcounts in the above example will be
# offset from .sched.text. If we make another file called tmp.s with: # offset from .sched.text. If we choose global symbol func2 as a reference and
# make another file called tmp.s with the new offsets:
# #
# .section __mcount_loc # .section __mcount_loc
# .quad my_func + 0x5 # .quad func2 - 0x10
# .quad my_func + 0x1b # .quad func2 + 0x10
# #
# We can then compile this tmp.s into tmp.o, and link it to the original # We can then compile this tmp.s into tmp.o, and link it back to the original
# object. # object.
# #
# But this gets hard if my_func is not globl (a static function). # In our algorithm, we will choose the first global function we meet in this
# In such a case we have: # section as the reference. But this gets hard if there is no global functions
# in this section. In such a case we have to select a local one. E.g. func1:
# #
# .section ".sched.text", "ax" # .section ".sched.text", "ax"
# my_func: # func1:
# [...] # [...]
# call mcount (offset: 0x5) # call mcount (offset: 0x10)
# [...] # [...]
# ret # ret
# other_func: # func2:
# [...] # [...]
# call mcount (offset: 0x1b) # call mcount (offset: 0x20)
# [...] # [...]
# .section "other.section"
# #
# If we make the tmp.s the same as above, when we link together with # If we make the tmp.s the same as above, when we link together with
# the original object, we will end up with two symbols for my_func: # the original object, we will end up with two symbols for func1:
# one local, one global. After final compile, we will end up with # one local, one global. After final compile, we will end up with
# an undefined reference to my_func. # an undefined reference to func1 or a wrong reference to another global
# func1 in other files.
# #
# Since local objects can reference local variables, we need to find # Since local objects can reference local variables, we need to find
# a way to make tmp.o reference the local objects of the original object # a way to make tmp.o reference the local objects of the original object
# file after it is linked together. To do this, we convert the my_func # file after it is linked together. To do this, we convert func1
# into a global symbol before linking tmp.o. Then after we link tmp.o # into a global symbol before linking tmp.o. Then after we link tmp.o
# we will only have a single symbol for my_func that is global. # we will only have a single symbol for func1 that is global.
# We can convert my_func back into a local symbol and we are done. # We can convert func1 back into a local symbol and we are done.
# #
# Here are the steps we take: # Here are the steps we take:
# #
@ -86,10 +102,8 @@
# 6) Link together this new object with the list object. # 6) Link together this new object with the list object.
# 7) Convert the local functions back to local symbols and rename # 7) Convert the local functions back to local symbols and rename
# the result as the original object. # the result as the original object.
# End.
# 8) Link the object with the list object. # 8) Link the object with the list object.
# 9) Move the result back to the original object. # 9) Move the result back to the original object.
# End.
# #
use strict; use strict;