OpenCloudOS-Kernel/Documentation/process/7.AdvancedTopics.rst

.. _development_advancedtopics:

Advanced topics
===============

At this point, hopefully, you have a handle on how the development process
works.  There is still more to learn, however!  This section will cover a
number of topics which can be helpful for developers wanting to become a
regular part of the Linux kernel development process.

Managing patches with git
-------------------------

The use of distributed version control for the kernel began in early 2002,
when Linus first started playing with the proprietary BitKeeper
application.  While BitKeeper was controversial, the approach to software
version management it embodied most certainly was not.  Distributed version
control enabled an immediate acceleration of the kernel development
project.  In current times, there are several free alternatives to
BitKeeper.  For better or for worse, the kernel project has settled on git
as its tool of choice.

Managing patches with git can make life much easier for the developer,
especially as the volume of those patches grows.  Git also has its rough
edges and poses certain hazards; it is a young and powerful tool which is
still being civilized by its developers.  This document will not attempt to
teach the reader how to use git; that would be sufficient material for a
long document in its own right.  Instead, the focus here will be on how git
fits into the kernel development process in particular.  Developers who
wish to come up to speed with git will find more information at:

	https://git-scm.com/

	https://www.kernel.org/pub/software/scm/git/docs/user-manual.html

and on various tutorials found on the web.

The first order of business is to read the above sites and get a solid
understanding of how git works before trying to use it to make patches
available to others.  A git-using developer should be able to obtain a copy
of the mainline repository, explore the revision history, commit changes to
the tree, use branches, etc.  An understanding of git's tools for the
rewriting of history (such as rebase) is also useful.  Git comes with its
own terminology and concepts; a new user of git should know about refs,
remote branches, the index, fast-forward merges, pushes and pulls, detached
heads, etc.  It can all be a little intimidating at the outset, but the
concepts are not that hard to grasp with a bit of study.

Using git to generate patches for submission by email can be a good
exercise while coming up to speed.

When you are ready to start putting up git trees for others to look at, you
will, of course, need a server that can be pulled from.  Setting up such a
server with git-daemon is relatively straightforward if you have a system
which is accessible to the Internet.  Otherwise, free, public hosting sites
(Github, for example) are starting to appear on the net.  Established
developers can get an account on kernel.org, but those are not easy to come
by; see https://kernel.org/faq/ for more information.

The normal git workflow involves the use of a lot of branches.  Each line
of development can be separated into a separate "topic branch" and
maintained independently.  Branches in git are cheap, there is no reason to
not make free use of them.  And, in any case, you should not do your
development in any branch which you intend to ask others to pull from.
Publicly-available branches should be created with care; merge in patches
from development branches when they are in complete form and ready to go -
not before.

Git provides some powerful tools which can allow you to rewrite your
development history.  An inconvenient patch (one which breaks bisection,
say, or which has some other sort of obvious bug) can be fixed in place or
made to disappear from the history entirely.  A patch series can be
rewritten as if it had been written on top of today's mainline, even though
you have been working on it for months.  Changes can be transparently
shifted from one branch to another.  And so on.  Judicious use of git's
ability to revise history can help in the creation of clean patch sets with
fewer problems.

Excessive use of this capability can lead to other problems, though, beyond
a simple obsession for the creation of the perfect project history.
Rewriting history will rewrite the changes contained in that history,
turning a tested (hopefully) kernel tree into an untested one.  But, beyond
that, developers cannot easily collaborate if they do not have a shared
view of the project history; if you rewrite history which other developers
have pulled into their repositories, you will make life much more difficult
for those developers.  So a simple rule of thumb applies here: history
which has been exported to others should generally be seen as immutable
thereafter.

So, once you push a set of changes to your publicly-available server, those
changes should not be rewritten.  Git will attempt to enforce this rule if
you try to push changes which do not result in a fast-forward merge
(i.e. changes which do not share the same history).  It is possible to
override this check, and there may be times when it is necessary to rewrite
an exported tree.  Moving changesets between trees to avoid conflicts in
linux-next is one example.  But such actions should be rare.  This is one
of the reasons why development should be done in private branches (which
can be rewritten if necessary) and only moved into public branches when
it's in a reasonably advanced state.

As the mainline (or other tree upon which a set of changes is based)
advances, it is tempting to merge with that tree to stay on the leading
edge.  For a private branch, rebasing can be an easy way to keep up with
another tree, but rebasing is not an option once a tree is exported to the
world.  Once that happens, a full merge must be done.  Merging occasionally
makes good sense, but overly frequent merges can clutter the history
needlessly.  Suggested technique in this case is to merge infrequently, and
generally only at specific release points (such as a mainline -rc
release).  If you are nervous about specific changes, you can always
perform test merges in a private branch.  The git "rerere" tool can be
useful in such situations; it remembers how merge conflicts were resolved
so that you don't have to do the same work twice.

One of the biggest recurring complaints about tools like git is this: the
mass movement of patches from one repository to another makes it easy to
slip in ill-advised changes which go into the mainline below the review
radar.  Kernel developers tend to get unhappy when they see that kind of
thing happening; putting up a git tree with unreviewed or off-topic patches
can affect your ability to get trees pulled in the future.  Quoting Linus:

::

	You can send me patches, but for me to pull a git patch from you, I
	need to know that you know what you're doing, and I need to be able
	to trust things *without* then having to go and check every
	individual change by hand.

(https://lwn.net/Articles/224135/).

To avoid this kind of situation, ensure that all patches within a given
branch stick closely to the associated topic; a "driver fixes" branch
should not be making changes to the core memory management code.  And, most
importantly, do not use a git tree to bypass the review process.  Post an
occasional summary of the tree to the relevant list, and, when the time is
right, request that the tree be included in linux-next.

If and when others start to send patches for inclusion into your tree,
don't forget to review them.  Also ensure that you maintain the correct
authorship information; the git "am" tool does its best in this regard, but
you may have to add a "From:" line to the patch if it has been relayed to
you via a third party.

When requesting a pull, be sure to give all the relevant information: where
your tree is, what branch to pull, and what changes will result from the
pull.  The git request-pull command can be helpful in this regard; it will
format the request as other developers expect, and will also check to be
sure that you have remembered to push those changes to the public server.


Reviewing patches
-----------------

Some readers will certainly object to putting this section with "advanced
topics" on the grounds that even beginning kernel developers should be
reviewing patches.  It is certainly true that there is no better way to
learn how to program in the kernel environment than by looking at code
posted by others.  In addition, reviewers are forever in short supply; by
looking at code you can make a significant contribution to the process as a
whole.

Reviewing code can be an intimidating prospect, especially for a new kernel
developer who may well feel nervous about questioning code - in public -
which has been posted by those with more experience.  Even code written by
the most experienced developers can be improved, though.  Perhaps the best
piece of advice for reviewers (all reviewers) is this: phrase review
comments as questions rather than criticisms.  Asking "how does the lock
get released in this path?" will always work better than stating "the
locking here is wrong."

Different developers will review code from different points of view.  Some
are mostly concerned with coding style and whether code lines have trailing
white space.  Others will focus primarily on whether the change implemented
by the patch as a whole is a good thing for the kernel or not.  Yet others
will check for problematic locking, excessive stack usage, possible
security issues, duplication of code found elsewhere, adequate
documentation, adverse effects on performance, user-space ABI changes, etc.
All types of review, if they lead to better code going into the kernel, are
welcome and worthwhile.
doc: development-process: convert it to ReST markup This document is on good shape for ReST: all it was needed was to fix the section markups, add a toctree, convert the tables and add a few code/quote blocks. While not strictly required, I opted to use lowercase for the titles, just like the other books that were converted to Sphinx. Signed-off-by: Mauro Carvalho Chehab <mchehab@s-opensource.com> 2016-09-19 19:07:36 +08:00			`.. _development_advancedtopics:`

			`Advanced topics`
			`===============`
Add the development process document This is an extended document intended to help interested developers, their managers, and their employers work with the kernel development process. This work was supported by the Linux Foundation. Signed-off-by: Jonathan Corbet <corbet@lwn.net> 2008-10-01 05:15:56 +08:00
			`At this point, hopefully, you have a handle on how the development process`
			`works. There is still more to learn, however! This section will cover a`
			`number of topics which can be helpful for developers wanting to become a`
			`regular part of the Linux kernel development process.`

doc: development-process: convert it to ReST markup This document is on good shape for ReST: all it was needed was to fix the section markups, add a toctree, convert the tables and add a few code/quote blocks. While not strictly required, I opted to use lowercase for the titles, just like the other books that were converted to Sphinx. Signed-off-by: Mauro Carvalho Chehab <mchehab@s-opensource.com> 2016-09-19 19:07:36 +08:00			`Managing patches with git`
			`-------------------------`
Add the development process document This is an extended document intended to help interested developers, their managers, and their employers work with the kernel development process. This work was supported by the Linux Foundation. Signed-off-by: Jonathan Corbet <corbet@lwn.net> 2008-10-01 05:15:56 +08:00
			`The use of distributed version control for the kernel began in early 2002,`
			`when Linus first started playing with the proprietary BitKeeper`
			`application. While BitKeeper was controversial, the approach to software`
			`version management it embodied most certainly was not. Distributed version`
			`control enabled an immediate acceleration of the kernel development`
			`project. In current times, there are several free alternatives to`
			`BitKeeper. For better or for worse, the kernel project has settled on git`
			`as its tool of choice.`

			`Managing patches with git can make life much easier for the developer,`
			`especially as the volume of those patches grows. Git also has its rough`
			`edges and poses certain hazards; it is a young and powerful tool which is`
			`still being civilized by its developers. This document will not attempt to`
			`teach the reader how to use git; that would be sufficient material for a`
			`long document in its own right. Instead, the focus here will be on how git`
			`fits into the kernel development process in particular. Developers who`
			`wish to come up to speed with git will find more information at:`

Replace HTTP links with HTTPS ones: documentation Rationale: Reduces attack surface on kernel devs opening the links for MITM as HTTPS traffic is much harder to manipulate. Deterministic algorithm: For each file: For each line: If doesn't contain `\bxmlns\b`: For each link, `\bhttp://[^# \t\r\n]*(?:\w\|/)`: If both the HTTP and HTTPS versions return 200 OK and serve the same content: Replace HTTP with HTTPS. Signed-off-by: Alexander A. Klimov <grandmaster@al2klimov.de> Link: https://lore.kernel.org/r/20200526060544.25127-1-grandmaster@al2klimov.de Signed-off-by: Jonathan Corbet <corbet@lwn.net> 2020-05-26 14:05:44 +08:00			`https://git-scm.com/`
Add the development process document This is an extended document intended to help interested developers, their managers, and their employers work with the kernel development process. This work was supported by the Linux Foundation. Signed-off-by: Jonathan Corbet <corbet@lwn.net> 2008-10-01 05:15:56 +08:00
Replace HTTP links with HTTPS ones: documentation Rationale: Reduces attack surface on kernel devs opening the links for MITM as HTTPS traffic is much harder to manipulate. Deterministic algorithm: For each file: For each line: If doesn't contain `\bxmlns\b`: For each link, `\bhttp://[^# \t\r\n]*(?:\w\|/)`: If both the HTTP and HTTPS versions return 200 OK and serve the same content: Replace HTTP with HTTPS. Signed-off-by: Alexander A. Klimov <grandmaster@al2klimov.de> Link: https://lore.kernel.org/r/20200526060544.25127-1-grandmaster@al2klimov.de Signed-off-by: Jonathan Corbet <corbet@lwn.net> 2020-05-26 14:05:44 +08:00			`https://www.kernel.org/pub/software/scm/git/docs/user-manual.html`
Add the development process document This is an extended document intended to help interested developers, their managers, and their employers work with the kernel development process. This work was supported by the Linux Foundation. Signed-off-by: Jonathan Corbet <corbet@lwn.net> 2008-10-01 05:15:56 +08:00
			`and on various tutorials found on the web.`

			`The first order of business is to read the above sites and get a solid`
			`understanding of how git works before trying to use it to make patches`
			`available to others. A git-using developer should be able to obtain a copy`
			`of the mainline repository, explore the revision history, commit changes to`
			`the tree, use branches, etc. An understanding of git's tools for the`
			`rewriting of history (such as rebase) is also useful. Git comes with its`
			`own terminology and concepts; a new user of git should know about refs,`
			`remote branches, the index, fast-forward merges, pushes and pulls, detached`
			`heads, etc. It can all be a little intimidating at the outset, but the`
			`concepts are not that hard to grasp with a bit of study.`

			`Using git to generate patches for submission by email can be a good`
			`exercise while coming up to speed.`

			`When you are ready to start putting up git trees for others to look at, you`
			`will, of course, need a server that can be pulled from. Setting up such a`
			`server with git-daemon is relatively straightforward if you have a system`
			`which is accessible to the Internet. Otherwise, free, public hosting sites`
			`(Github, for example) are starting to appear on the net. Established`
			`developers can get an account on kernel.org, but those are not easy to come`
Replace HTTP links with HTTPS ones: documentation Rationale: Reduces attack surface on kernel devs opening the links for MITM as HTTPS traffic is much harder to manipulate. Deterministic algorithm: For each file: For each line: If doesn't contain `\bxmlns\b`: For each link, `\bhttp://[^# \t\r\n]*(?:\w\|/)`: If both the HTTP and HTTPS versions return 200 OK and serve the same content: Replace HTTP with HTTPS. Signed-off-by: Alexander A. Klimov <grandmaster@al2klimov.de> Link: https://lore.kernel.org/r/20200526060544.25127-1-grandmaster@al2klimov.de Signed-off-by: Jonathan Corbet <corbet@lwn.net> 2020-05-26 14:05:44 +08:00			`by; see https://kernel.org/faq/ for more information.`
Add the development process document This is an extended document intended to help interested developers, their managers, and their employers work with the kernel development process. This work was supported by the Linux Foundation. Signed-off-by: Jonathan Corbet <corbet@lwn.net> 2008-10-01 05:15:56 +08:00
			`The normal git workflow involves the use of a lot of branches. Each line`
			`of development can be separated into a separate "topic branch" and`
			`maintained independently. Branches in git are cheap, there is no reason to`
			`not make free use of them. And, in any case, you should not do your`
			`development in any branch which you intend to ask others to pull from.`
			`Publicly-available branches should be created with care; merge in patches`
			`from development branches when they are in complete form and ready to go -`
			`not before.`

			`Git provides some powerful tools which can allow you to rewrite your`
			`development history. An inconvenient patch (one which breaks bisection,`
			`say, or which has some other sort of obvious bug) can be fixed in place or`
			`made to disappear from the history entirely. A patch series can be`
			`rewritten as if it had been written on top of today's mainline, even though`
			`you have been working on it for months. Changes can be transparently`
			`shifted from one branch to another. And so on. Judicious use of git's`
			`ability to revise history can help in the creation of clean patch sets with`
			`fewer problems.`

			`Excessive use of this capability can lead to other problems, though, beyond`
			`a simple obsession for the creation of the perfect project history.`
			`Rewriting history will rewrite the changes contained in that history,`
			`turning a tested (hopefully) kernel tree into an untested one. But, beyond`
			`that, developers cannot easily collaborate if they do not have a shared`
			`view of the project history; if you rewrite history which other developers`
			`have pulled into their repositories, you will make life much more difficult`
			`for those developers. So a simple rule of thumb applies here: history`
			`which has been exported to others should generally be seen as immutable`
			`thereafter.`

			`So, once you push a set of changes to your publicly-available server, those`
			`changes should not be rewritten. Git will attempt to enforce this rule if`
			`you try to push changes which do not result in a fast-forward merge`
			`(i.e. changes which do not share the same history). It is possible to`
			`override this check, and there may be times when it is necessary to rewrite`
			`an exported tree. Moving changesets between trees to avoid conflicts in`
			`linux-next is one example. But such actions should be rare. This is one`
			`of the reasons why development should be done in private branches (which`
			`can be rewritten if necessary) and only moved into public branches when`
			`it's in a reasonably advanced state.`

			`As the mainline (or other tree upon which a set of changes is based)`
			`advances, it is tempting to merge with that tree to stay on the leading`
			`edge. For a private branch, rebasing can be an easy way to keep up with`
			`another tree, but rebasing is not an option once a tree is exported to the`
			`world. Once that happens, a full merge must be done. Merging occasionally`
			`makes good sense, but overly frequent merges can clutter the history`
			`needlessly. Suggested technique in this case is to merge infrequently, and`
			`generally only at specific release points (such as a mainline -rc`
			`release). If you are nervous about specific changes, you can always`
			`perform test merges in a private branch. The git "rerere" tool can be`
			`useful in such situations; it remembers how merge conflicts were resolved`
			`so that you don't have to do the same work twice.`

			`One of the biggest recurring complaints about tools like git is this: the`
			`mass movement of patches from one repository to another makes it easy to`
			`slip in ill-advised changes which go into the mainline below the review`
			`radar. Kernel developers tend to get unhappy when they see that kind of`
			`thing happening; putting up a git tree with unreviewed or off-topic patches`
			`can affect your ability to get trees pulled in the future. Quoting Linus:`

doc: development-process: convert it to ReST markup This document is on good shape for ReST: all it was needed was to fix the section markups, add a toctree, convert the tables and add a few code/quote blocks. While not strictly required, I opted to use lowercase for the titles, just like the other books that were converted to Sphinx. Signed-off-by: Mauro Carvalho Chehab <mchehab@s-opensource.com> 2016-09-19 19:07:36 +08:00			`::`

Add the development process document This is an extended document intended to help interested developers, their managers, and their employers work with the kernel development process. This work was supported by the Linux Foundation. Signed-off-by: Jonathan Corbet <corbet@lwn.net> 2008-10-01 05:15:56 +08:00			`You can send me patches, but for me to pull a git patch from you, I`
			`need to know that you know what you're doing, and I need to be able`
			`to trust things without then having to go and check every`
			`individual change by hand.`

Replace HTTP links with HTTPS ones: documentation Rationale: Reduces attack surface on kernel devs opening the links for MITM as HTTPS traffic is much harder to manipulate. Deterministic algorithm: For each file: For each line: If doesn't contain `\bxmlns\b`: For each link, `\bhttp://[^# \t\r\n]*(?:\w\|/)`: If both the HTTP and HTTPS versions return 200 OK and serve the same content: Replace HTTP with HTTPS. Signed-off-by: Alexander A. Klimov <grandmaster@al2klimov.de> Link: https://lore.kernel.org/r/20200526060544.25127-1-grandmaster@al2klimov.de Signed-off-by: Jonathan Corbet <corbet@lwn.net> 2020-05-26 14:05:44 +08:00			`(https://lwn.net/Articles/224135/).`
Add the development process document This is an extended document intended to help interested developers, their managers, and their employers work with the kernel development process. This work was supported by the Linux Foundation. Signed-off-by: Jonathan Corbet <corbet@lwn.net> 2008-10-01 05:15:56 +08:00
			`To avoid this kind of situation, ensure that all patches within a given`
			`branch stick closely to the associated topic; a "driver fixes" branch`
			`should not be making changes to the core memory management code. And, most`
			`importantly, do not use a git tree to bypass the review process. Post an`
			`occasional summary of the tree to the relevant list, and, when the time is`
			`right, request that the tree be included in linux-next.`

			`If and when others start to send patches for inclusion into your tree,`
			`don't forget to review them. Also ensure that you maintain the correct`
			`authorship information; the git "am" tool does its best in this regard, but`
			`you may have to add a "From:" line to the patch if it has been relayed to`
			`you via a third party.`

			`When requesting a pull, be sure to give all the relevant information: where`
			`your tree is, what branch to pull, and what changes will result from the`
			`pull. The git request-pull command can be helpful in this regard; it will`
			`format the request as other developers expect, and will also check to be`
docs: update the development process document Here's a set of changes updating Documentation/development-process. I have update kernel releases and relevant statistics, added information for a couple of tools, zapped some trailing white space, and generally tried to make it more closely match the current state of affairs. [Typo fixes from Joe Perches and Nicolas Kaiser incorporated] Signed-off-by: Jonathan Corbet <corbet@lwn.net> Acked-by: Greg KH <greg@kroah.com> Cc: Randy Dunlap <rdunlap@xenotime.net> 2011-03-26 02:17:53 +08:00			`sure that you have remembered to push those changes to the public server.`
Add the development process document This is an extended document intended to help interested developers, their managers, and their employers work with the kernel development process. This work was supported by the Linux Foundation. Signed-off-by: Jonathan Corbet <corbet@lwn.net> 2008-10-01 05:15:56 +08:00

doc: development-process: convert it to ReST markup This document is on good shape for ReST: all it was needed was to fix the section markups, add a toctree, convert the tables and add a few code/quote blocks. While not strictly required, I opted to use lowercase for the titles, just like the other books that were converted to Sphinx. Signed-off-by: Mauro Carvalho Chehab <mchehab@s-opensource.com> 2016-09-19 19:07:36 +08:00			`Reviewing patches`
			`-----------------`
Add the development process document This is an extended document intended to help interested developers, their managers, and their employers work with the kernel development process. This work was supported by the Linux Foundation. Signed-off-by: Jonathan Corbet <corbet@lwn.net> 2008-10-01 05:15:56 +08:00
			`Some readers will certainly object to putting this section with "advanced`
			`topics" on the grounds that even beginning kernel developers should be`
			`reviewing patches. It is certainly true that there is no better way to`
			`learn how to program in the kernel environment than by looking at code`
			`posted by others. In addition, reviewers are forever in short supply; by`
			`looking at code you can make a significant contribution to the process as a`
			`whole.`

			`Reviewing code can be an intimidating prospect, especially for a new kernel`
			`developer who may well feel nervous about questioning code - in public -`
			`which has been posted by those with more experience. Even code written by`
			`the most experienced developers can be improved, though. Perhaps the best`
			`piece of advice for reviewers (all reviewers) is this: phrase review`
			`comments as questions rather than criticisms. Asking "how does the lock`
			`get released in this path?" will always work better than stating "the`
			`locking here is wrong."`

			`Different developers will review code from different points of view. Some`
			`are mostly concerned with coding style and whether code lines have trailing`
			`white space. Others will focus primarily on whether the change implemented`
			`by the patch as a whole is a good thing for the kernel or not. Yet others`
			`will check for problematic locking, excessive stack usage, possible`
			`security issues, duplication of code found elsewhere, adequate`
			`documentation, adverse effects on performance, user-space ABI changes, etc.`
			`All types of review, if they lead to better code going into the kernel, are`
			`welcome and worthwhile.`