1 Bionic comes with a set of 'clean' Linux kernel headers that can safely be
2 included by userland applications and libraries without fear of hideous
3 conflicts. for more information why this is needed, see the "RATIONALE"
4 section at the end of this document.
5
6 these clean headers are automatically generated by several scripts located
7 in the 'bionic/kernel/tools' directory, which process a set of original
8 and unmodified kernel headers in order to get rid of many annoying
9 declarations and constructs that usually result in compilation failure.
10
11 the 'clean headers' only contain type and macro definitions, with the
12 exception of a couple static inline functions used for performance
13 reason (e.g. optimized CPU-specific byte-swapping routines)
14
15 they can be included from C++, or when compiling code in strict ANSI mode.
16 they can be also included before or after any Bionic C library header.
17
18 the generation process works as follows:
19
20 * 'external/kernel-headers/original/'
21 contains a set of kernel headers as normally found in the 'include'
22 directory of a normal Linux kernel source tree. note that this should
23 only contain the files that are really needed by Android (use
24 'find_headers.py' to find these automatically).
25
26 * 'bionic/libc/kernel/common'
27 contains the non-arch-specific clean headers and directories
28 (e.g. linux, asm-generic and mtd)
29
30 * 'bionic/libc/kernel/arch-arm/'
31 contains the ARM-specific directory tree of clean headers.
32
33 * 'bionic/libc/kernel/arch-arm/asm'
34 contains the real ARM-specific headers
35
36 * 'bionic/libc/kernel/arch-x86'
37 'bionic/libc/kernel/arch-x86/asm'
38 similarly contains all headers and symlinks to be used on x86
39
40 * 'bionic/libc/kernel/tools' contains various Python and shell scripts used
41 to manage and re-generate the headers
42
43 the tools you can use are:
44
45 * tools/find_users.py
46 scans a list of source files or directories and prints which ones do
47 include Linux headers.
48
49 * tools/find_headers.py
50 scans a list of source files or directories and recursively finds all
51 the original kernel headers they need.
52
53 * tools/clean_header.py
54 prints the clean version of a given kernel header. with the -u option,
55 this will also update the corresponding clean header file if its
56 content has changed. you can also process more than one file with -u
57
58 * tools/update_all.py
59 automatically update all clean headers from the content of
60 'external/kernel-headers/original'. this is the script you're likely going to
61 run whenever you update the original headers.
62
63
64 HOW TO BUILD BIONIC AND OTHER PROGRAMS WITH THE CLEAN HEADERS:
65 ==============================================================
66
67 add bionic/kernel/common and bionic/kernel/arch-<yourarch> to your C
68 include path. that should be enough. Note that Bionic will not compile properly
69 if you don't.
70
71
72 HOW TO SUPPORT ANOTHER ARCHITECTURE:
73 ====================================
74
75 see the content of tools/defaults.py, you will need to make a few updates
76 here:
77
78 - add a new item to the 'kernel_archs' list of supported architectures
79
80 - add a proper definition for 'kernel_known_<arch>_statics' with
81 relevant definitions.
82
83 - update 'kernel_known_statics' to map "<arch>" to
84 'kernel_known_<arch>_statics'
85
86 then, add the new architecture-specific headers to original/asm-<arch>.
87 (please ensure that these are really needed, e.g. with tools/find_headers.py)
88
89 finally, run tools/update_all.py
90
91
92
93 HOW TO UPDATE THE HEADERS WHEN NEEDED:
94 ======================================
95
96 IMPORTANT IMPORTANT:
97
98 WHEN UPDATING THE HEADERS, ALWAYS CHECK THAT THE NEW CLEAN HEADERS DO
99 NOT BREAK THE KERNEL <-> USER ABI, FOR EXAMPLE BY CHANGING THE SIZE
100 OF A GIVEN TYPE. THIS TASK CANNOT BE EASILY AUTOMATED AT THE MOMENT
101
102 copy any updated kernel header into the corresponding location under
103 'bionic/kernel/original'.
104
105 for any new kernel header you want to add, first run tools/find_headers.py to be
106 sure that it is really needed by the Android sources. then add it to
107 'bionic/kernel/original'
108
109 then, run tools/update_all.py to re-run the auto-cleaning
110
111
112
113 HOW THE CLEANUP PROCESS WORKS:
114 ==============================
115
116 this section describes the action performed by the cleanup program(s) when they
117 process the original kernel headers into clean ones:
118
119 1. Optimize well-known macros (e.g. __KERNEL__, __KERNEL_STRICT_NAMES)
120
121 this pass gets rid of everything that is guarded by a well-known macro
122 definition. this means that a block like
123
124 #ifdef __KERNEL__
125 ....
126 #endif
127
128 will be totally omitted from the output. the optimizer is smart enough to
129 handle all complex C-preprocessor conditional expression appropriately.
130 this means that, for example:
131
132 #if defined(__KERNEL__) || defined(FOO)
133 ...
134 #endif
135
136 will be transformed into:
137
138 #ifdef FOO
139 ...
140 #endif
141
142 see tools/defaults.py for the list of well-known macros used in this pass,
143 in case you need to update it in the future.
144
145 note that this also remove any reference to a kernel-specific configuration
146 macro like CONFIG_FOO from the clean headers.
147
148
149 2. remove variable and function declarations:
150
151 this pass scans non-directive text and only keeps things that look like a
152 typedef/struct/union/enum declaration. this allows to get rid of any variable
153 or function declaration that should only be used within the kernel anyway
154 (and which normally *should* be guarded in a #ifdef __KERNEL__ ... #endif
155 block, if the kernel writers were not so messy)
156
157 there are however a few exceptions: it is seldom useful to keep the definition
158 of some static inline functions performing very simple operations. a good
159 example is the optimized 32-bit byte-swap function found in
160 arch-arm/asm/byteorder.h
161
162 the list of exceptions is in tools/defaults.py in case you need to update it
163 in the future.
164
165 note that we do *not* remove macro definitions, including these macro that
166 perform a call to one of these kernel-header functions, or even define other
167 functions. we consider it safe since userland applications have no business
168 using them anyway.
169
170
171 3. whitespace cleanup:
172
173 the final pass remove any comments and empty lines from the final headers.
174
175
176 4. add a standard disclaimer:
177
178 prepended to each generated header, contains a message like
179 "do not edit directly - file was auto-generated by ...."
180
181
182 RATIONALE:
183 ==========
184
185 OVERVIEW OF THE CURRENT KERNEL HEADER MESS:
186 -------------------------------------------
187
188 The original kernel headers are not easily usable from userland applications.
189 they contain many declarations and construct that will result in a compilation
190 failure or even worse, incorrect behaviour. for example:
191
192 - some headers try to define Posix types (e.g. size_t, ssize_t) that can
193 conflict with the corresponding definitions provided by your C library.
194
195 - some headers use constructs that cannot be compiled in ANSI C mode.
196
197 - some headers use constructs do not compile with C++ at all.
198
199 - some headers contain invalid "legacy" definitions for the benefit of old
200 C libraries (e.g. glibc5) but result in incorrect behaviour if used
201 directly.
202
203 e.g. gid_t being defined in <linux/types.h> as a 16-bit type while the
204 kernel uses 32-bit ids. this results in problems when getgroups() or
205 setgroups() are called, since they operate on gid_t arrays.
206
207 unfortunately, these headers are also the only source of some really extensive
208 constant and type definitions that are required by userland applications.
209 think any library/program that need to access ALSA, or Video4Linux, or
210 anything related to a specific device or Linux-specific system interface
211 (e.g. IOCTLS, etc...)
212
213 As a consequence, every Linux distribution provides a set of patched kernel
214 headers to be used by userland applications (which installs in
215 /usr/include/linux/, /usr/include/asm/, etc...). these are manually maintained
216 by distribution packagers, and generated either manually or with various
217 scripts. these headers are also tailored to GNU LibC and cannot be reused
218 easily by Bionic.
219
220 for a really long period, the kernel authors have stated that they don't want
221 to fix the problem, even when someone proposed a patch to start cleaning the
222 official headers. from their point of view this is purely a library author
223 problem.
224
225 fortunately, enlightnment happened, and the kernel now provides a way to
226 install a set of "user-friendly" headers that are generated from the official
227 ones by stripping the __KERNEL__ protected declarations.
228
229 unfortunately, this is not enough for Bionic because the result still contains
230 a few broken declarations that are difficult to route around. (see below for
231 a little bit of details).
232
233 we plan to be able to support these kernel-generated user-land headers in the
234 future, but the priority on this issue is very low.
235
236
237 WHAT WE DO:
238 -----------
239
240 so we're doomed to repeat the same effort than anyone else. the big difference
241 here is that we want to automate as much as possible the generation of the
242 clean headers to easily support additional architectures in the future,
243 and keep current with upstream changes in the header definitions with the
244 least possible hassle.
245
246 of course, this is only a race to the bottom. the kernel maintainers still
247 feel free to randomly break the structure of their headers (e.g. moving the
248 location of some files) occasionally, so we'll need to keep up with that by
249 updating our build script/original headers as these cases happen.
250
251 what we do is keep a set of "original" kernel headers, and process them
252 automatically to generate a set of "clean" headers that can be used from
253 userland and the C library.
254
255 note that the "original" headers can be tweaked a little to avoid some subtle
256 issues. for example:
257
258 - when the location of various USB-related headers changes in the kernel
259 source tree, we want to keep them at the same location in our generated
260 headers (there is no reason to break the userland API for something
261 like that).
262
263 - sometimes, we prefer to take certain things out of blocks guarded by a
264 #ifdef __KERNEL__ .. #endif. for example, on recent kernels <linux/wireless.h>
265 only includes <linux/if.h> when in kernel mode. we make it available to
266 userland as well since some code out there assumes that this is the case.
267
268 - sometimes, the header is simply incorrect (e.g. it uses a type without
269 including the header that defines it before-hand)
270
271