xref: /netbsd-src/external/gpl3/autoconf/dist/doc/standards.texi (revision d874e91932377fc40d53f102e48fc3ee6f4fe9de)
1\input texinfo @c -*-texinfo-*-
2@c %**start of header
3@setfilename standards.info
4@settitle GNU Coding Standards
5@c This date is automagically updated when you save this file:
6@set lastupdate April 7, 2012
7@c %**end of header
8
9@dircategory GNU organization
10@direntry
11* Standards: (standards).       GNU coding standards.
12@end direntry
13
14@c @setchapternewpage odd
15@setchapternewpage off
16
17@c Put everything in one index (arbitrarily chosen to be the concept index).
18@syncodeindex fn cp
19@syncodeindex ky cp
20@syncodeindex pg cp
21@syncodeindex vr cp
22
23@c This is used by a cross ref in make-stds.texi
24@set CODESTD  1
25
26@copying
27The GNU coding standards, last updated @value{lastupdate}.
28
29Copyright @copyright{} 1992, 1993, 1994, 1995, 1996, 1997, 1998, 1999,
302000, 2001, 2002, 2003, 2004, 2005, 2006, 2007, 2008, 2009, 2010,
312011, 2012 Free Software Foundation, Inc.
32
33Permission is granted to copy, distribute and/or modify this document
34under the terms of the GNU Free Documentation License, Version 1.3 or
35any later version published by the Free Software Foundation; with no
36Invariant Sections, with no Front-Cover Texts, and with no Back-Cover
37Texts.  A copy of the license is included in the section entitled
38``GNU Free Documentation License''.
39@end copying
40
41@titlepage
42@title GNU Coding Standards
43@author Richard Stallman, et al.
44@author last updated @value{lastupdate}
45@page
46@vskip 0pt plus 1filll
47@insertcopying
48@end titlepage
49
50@contents
51
52@ifnottex
53@node Top
54@top GNU Coding Standards
55
56@insertcopying
57@end ifnottex
58
59@menu
60* Preface::                     About the GNU Coding Standards.
61* Legal Issues::                Keeping free software free.
62* Design Advice::               General program design.
63* Program Behavior::            Program behavior for all programs
64* Writing C::                   Making the best use of C.
65* Documentation::               Documenting programs.
66* Managing Releases::           The release process.
67* References::                  Mentioning non-free software or documentation.
68* GNU Free Documentation License::  Copying and sharing this manual.
69* Index::
70
71@end menu
72
73@node Preface
74@chapter About the GNU Coding Standards
75
76The GNU Coding Standards were written by Richard Stallman and other GNU
77Project volunteers.  Their purpose is to make the GNU system clean,
78consistent, and easy to install.  This document can also be read as a
79guide to writing portable, robust and reliable programs.  It focuses on
80programs written in C, but many of the rules and principles are useful
81even if you write in another programming language.  The rules often
82state reasons for writing in a certain way.
83
84@cindex where to obtain @code{standards.texi}
85@cindex downloading this manual
86If you did not obtain this file directly from the GNU project and
87recently, please check for a newer version.  You can get the GNU
88Coding Standards from the GNU web server in many
89different formats, including the Texinfo source, PDF, HTML, DVI, plain
90text, and more, at: @uref{http://www.gnu.org/prep/standards/}.
91
92If you are maintaining an official GNU package, in addition to this
93document, please read and follow the GNU maintainer information
94(@pxref{Top, , Contents, maintain, Information for Maintainers of GNU
95Software}).
96
97@cindex @code{gnustandards-commit@@gnu.org} mailing list
98If you want to receive diffs for every change to these GNU documents,
99join the mailing list @code{gnustandards-commit@@gnu.org}, via the web
100interface at
101@url{http://lists.gnu.org/mailman/listinfo/gnustandards-commit}.
102Archives are also available there.
103
104@cindex @code{bug-standards@@gnu.org} email address
105@cindex Savannah repository for gnustandards
106@cindex gnustandards project repository
107Please send corrections or suggestions for this document to
108@email{bug-standards@@gnu.org}.  If you make a suggestion, please
109include a suggested new wording for it, to help us consider the
110suggestion efficiently.  We prefer a context diff to the Texinfo
111source, but if that's difficult for you, you can make a context diff
112for some other version of this document, or propose it in any way that
113makes it clear.  The source repository for this document can be found
114at @url{http://savannah.gnu.org/projects/gnustandards}.
115
116These standards cover the minimum of what is important when writing a
117GNU package.  Likely, the need for additional standards will come up.
118Sometimes, you might suggest that such standards be added to this
119document.  If you think your standards would be generally useful, please
120do suggest them.
121
122You should also set standards for your package on many questions not
123addressed or not firmly specified here.  The most important point is to
124be self-consistent---try to stick to the conventions you pick, and try
125to document them as much as possible.  That way, your program will be
126more maintainable by others.
127
128The GNU Hello program serves as an example of how to follow the GNU
129coding standards for a trivial program.
130@uref{http://www.gnu.org/software/hello/hello.html}.
131
132This release of the GNU Coding Standards was last updated
133@value{lastupdate}.
134
135
136@node Legal Issues
137@chapter Keeping Free Software Free
138@cindex legal aspects
139
140This chapter discusses how you can make sure that GNU software
141avoids legal difficulties, and other related issues.
142
143@menu
144* Reading Non-Free Code::       Referring to proprietary programs.
145* Contributions::               Accepting contributions.
146* Trademarks::                  How we deal with trademark issues.
147@end menu
148
149@node Reading Non-Free Code
150@section Referring to Proprietary Programs
151@cindex proprietary programs
152@cindex avoiding proprietary code
153
154Don't in any circumstances refer to Unix source code for or during
155your work on GNU!  (Or to any other proprietary programs.)
156
157If you have a vague recollection of the internals of a Unix program,
158this does not absolutely mean you can't write an imitation of it, but
159do try to organize the imitation internally along different lines,
160because this is likely to make the details of the Unix version
161irrelevant and dissimilar to your results.
162
163For example, Unix utilities were generally optimized to minimize
164memory use; if you go for speed instead, your program will be very
165different.  You could keep the entire input file in memory and scan it
166there instead of using stdio.  Use a smarter algorithm discovered more
167recently than the Unix program.  Eliminate use of temporary files.  Do
168it in one pass instead of two (we did this in the assembler).
169
170Or, on the contrary, emphasize simplicity instead of speed.  For some
171applications, the speed of today's computers makes simpler algorithms
172adequate.
173
174Or go for generality.  For example, Unix programs often have static
175tables or fixed-size strings, which make for arbitrary limits; use
176dynamic allocation instead.  Make sure your program handles NULs and
177other funny characters in the input files.  Add a programming language
178for extensibility and write part of the program in that language.
179
180Or turn some parts of the program into independently usable libraries.
181Or use a simple garbage collector instead of tracking precisely when
182to free memory, or use a new GNU facility such as obstacks.
183
184
185@node Contributions
186@section Accepting Contributions
187@cindex legal papers
188@cindex accepting contributions
189
190If the program you are working on is copyrighted by the Free Software
191Foundation, then when someone else sends you a piece of code to add to
192the program, we need legal papers to use it---just as we asked you to
193sign papers initially.  @emph{Each} person who makes a nontrivial
194contribution to a program must sign some sort of legal papers in order
195for us to have clear title to the program; the main author alone is not
196enough.
197
198So, before adding in any contributions from other people, please tell
199us, so we can arrange to get the papers.  Then wait until we tell you
200that we have received the signed papers, before you actually use the
201contribution.
202
203This applies both before you release the program and afterward.  If
204you receive diffs to fix a bug, and they make significant changes, we
205need legal papers for that change.
206
207This also applies to comments and documentation files.  For copyright
208law, comments and code are just text.  Copyright applies to all kinds of
209text, so we need legal papers for all kinds.
210
211We know it is frustrating to ask for legal papers; it's frustrating for
212us as well.  But if you don't wait, you are going out on a limb---for
213example, what if the contributor's employer won't sign a disclaimer?
214You might have to take that code out again!
215
216You don't need papers for changes of a few lines here or there, since
217they are not significant for copyright purposes.  Also, you don't need
218papers if all you get from the suggestion is some ideas, not actual code
219which you use.  For example, if someone sent you one implementation, but
220you write a different implementation of the same idea, you don't need to
221get papers.
222
223The very worst thing is if you forget to tell us about the other
224contributor.  We could be very embarrassed in court some day as a
225result.
226
227We have more detailed advice for maintainers of GNU packages.  If you
228have reached the stage of maintaining a GNU program (whether released
229or not), please take a look: @pxref{Legal Matters,,, maintain,
230Information for GNU Maintainers}.
231
232
233@node Trademarks
234@section Trademarks
235@cindex trademarks
236
237Please do not include any trademark acknowledgements in GNU software
238packages or documentation.
239
240Trademark acknowledgements are the statements that such-and-such is a
241trademark of so-and-so.  The GNU Project has no objection to the basic
242idea of trademarks, but these acknowledgements feel like kowtowing,
243and there is no legal requirement for them, so we don't use them.
244
245What is legally required, as regards other people's trademarks, is to
246avoid using them in ways which a reader might reasonably understand as
247naming or labeling our own programs or activities.  For example, since
248``Objective C'' is (or at least was) a trademark, we made sure to say
249that we provide a ``compiler for the Objective C language'' rather
250than an ``Objective C compiler''.  The latter would have been meant as
251a shorter way of saying the former, but it does not explicitly state
252the relationship, so it could be misinterpreted as using ``Objective
253C'' as a label for the compiler rather than for the language.
254
255Please don't use ``win'' as an abbreviation for Microsoft Windows in
256GNU software or documentation.  In hacker terminology, calling
257something a ``win'' is a form of praise.  If you wish to praise
258Microsoft Windows when speaking on your own, by all means do so, but
259not in GNU software.  Usually we write the name ``Windows'' in full,
260but when brevity is very important (as in file names and sometimes
261symbol names), we abbreviate it to ``w''.  For instance, the files and
262functions in Emacs that deal with Windows start with @samp{w32}.
263
264@node Design Advice
265@chapter General Program Design
266@cindex program design
267
268This chapter discusses some of the issues you should take into
269account when designing your program.
270
271@c                         Standard or ANSI C
272@c
273@c In 1989 the American National Standards Institute (ANSI) standardized
274@c C   as  standard  X3.159-1989.    In  December   of  that   year  the
275@c International Standards Organization ISO  adopted the ANSI C standard
276@c making  minor changes.   In 1990  ANSI then  re-adopted  ISO standard
277@c C. This version of C is known as either ANSI C or Standard C.
278
279@c A major revision of the C Standard appeared in 1999.
280
281@menu
282* Source Language::             Which languages to use.
283* Compatibility::               Compatibility with other implementations.
284* Using Extensions::            Using non-standard features.
285* Standard C::                  Using standard C features.
286* Conditional Compilation::     Compiling code only if a conditional is true.
287@end menu
288
289@node Source Language
290@section Which Languages to Use
291@cindex programming languages
292
293When you want to use a language that gets compiled and runs at high
294speed, the best language to use is C.  Using another language is like
295using a non-standard feature: it will cause trouble for users.  Even if
296GCC supports the other language, users may find it inconvenient to have
297to install the compiler for that other language in order to build your
298program.  For example, if you write your program in C++, people will
299have to install the GNU C++ compiler in order to compile your program.
300
301C has one other advantage over C++ and other compiled languages: more
302people know C, so more people will find it easy to read and modify the
303program if it is written in C.
304
305So in general it is much better to use C, rather than the
306comparable alternatives.
307
308But there are two exceptions to that conclusion:
309
310@itemize @bullet
311@item
312It is no problem to use another language to write a tool specifically
313intended for use with that language.  That is because the only people
314who want to build the tool will be those who have installed the other
315language anyway.
316
317@item
318If an application is of interest only to a narrow part of the community,
319then the question of which language it is written in has less effect on
320other people, so you may as well please yourself.
321@end itemize
322
323Many programs are designed to be extensible: they include an interpreter
324for a language that is higher level than C.  Often much of the program
325is written in that language, too.  The Emacs editor pioneered this
326technique.
327
328@cindex Guile
329@cindex GNOME and Guile
330The standard extensibility interpreter for GNU software is Guile
331(@uref{http://www.gnu.org/@/software/@/guile/}), which implements the
332language Scheme (an especially clean and simple dialect of Lisp).
333Guile also includes bindings for GTK+/GNOME, making it practical to
334write modern GUI functionality within Guile.  We don't reject programs
335written in other ``scripting languages'' such as Perl and Python, but
336using Guile is very important for the overall consistency of the GNU
337system.
338
339
340@node Compatibility
341@section Compatibility with Other Implementations
342@cindex compatibility with C and @sc{posix} standards
343@cindex @sc{posix} compatibility
344
345With occasional exceptions, utility programs and libraries for GNU
346should be upward compatible with those in Berkeley Unix, and upward
347compatible with Standard C if Standard C specifies their
348behavior, and upward compatible with @sc{posix} if @sc{posix} specifies
349their behavior.
350
351When these standards conflict, it is useful to offer compatibility
352modes for each of them.
353
354@cindex options for compatibility
355Standard C and @sc{posix} prohibit many kinds of extensions.  Feel
356free to make the extensions anyway, and include a @samp{--ansi},
357@samp{--posix}, or @samp{--compatible} option to turn them off.
358However, if the extension has a significant chance of breaking any real
359programs or scripts, then it is not really upward compatible.  So you
360should try to redesign its interface to make it upward compatible.
361
362@cindex @code{POSIXLY_CORRECT}, environment variable
363Many GNU programs suppress extensions that conflict with @sc{posix} if the
364environment variable @code{POSIXLY_CORRECT} is defined (even if it is
365defined with a null value).  Please make your program recognize this
366variable if appropriate.
367
368When a feature is used only by users (not by programs or command
369files), and it is done poorly in Unix, feel free to replace it
370completely with something totally different and better.  (For example,
371@code{vi} is replaced with Emacs.)  But it is nice to offer a compatible
372feature as well.  (There is a free @code{vi} clone, so we offer it.)
373
374Additional useful features are welcome regardless of whether
375there is any precedent for them.
376
377@node Using Extensions
378@section Using Non-standard Features
379@cindex non-standard extensions
380
381Many GNU facilities that already exist support a number of convenient
382extensions over the comparable Unix facilities.  Whether to use these
383extensions in implementing your program is a difficult question.
384
385On the one hand, using the extensions can make a cleaner program.
386On the other hand, people will not be able to build the program
387unless the other GNU tools are available.  This might cause the
388program to work on fewer kinds of machines.
389
390With some extensions, it might be easy to provide both alternatives.
391For example, you can define functions with a ``keyword'' @code{INLINE}
392and define that as a macro to expand into either @code{inline} or
393nothing, depending on the compiler.
394
395In general, perhaps it is best not to use the extensions if you can
396straightforwardly do without them, but to use the extensions if they
397are a big improvement.
398
399An exception to this rule are the large, established programs (such as
400Emacs) which run on a great variety of systems.  Using GNU extensions in
401such programs would make many users unhappy, so we don't do that.
402
403Another exception is for programs that are used as part of compilation:
404anything that must be compiled with other compilers in order to
405bootstrap the GNU compilation facilities.  If these require the GNU
406compiler, then no one can compile them without having them installed
407already.  That would be extremely troublesome in certain cases.
408
409@node Standard C
410@section Standard C and Pre-Standard C
411@cindex @sc{ansi} C standard
412
4131989 Standard C is widespread enough now that it is ok to use its
414features in new programs.  There is one exception: do not ever use the
415``trigraph'' feature of Standard C.
416
4171999 Standard C is not widespread yet, so please do not require its
418features in programs.  It is ok to use its features if they are present.
419
420However, it is easy to support pre-standard compilers in most programs,
421so if you know how to do that, feel free.  If a program you are
422maintaining has such support, you should try to keep it working.
423
424@cindex function prototypes
425To support pre-standard C, instead of writing function definitions in
426standard prototype form,
427
428@example
429int
430foo (int x, int y)
431@dots{}
432@end example
433
434@noindent
435write the definition in pre-standard style like this,
436
437@example
438int
439foo (x, y)
440     int x, y;
441@dots{}
442@end example
443
444@noindent
445and use a separate declaration to specify the argument prototype:
446
447@example
448int foo (int, int);
449@end example
450
451You need such a declaration anyway, in a header file, to get the benefit
452of prototypes in all the files where the function is called.  And once
453you have the declaration, you normally lose nothing by writing the
454function definition in the pre-standard style.
455
456This technique does not work for integer types narrower than @code{int}.
457If you think of an argument as being of a type narrower than @code{int},
458declare it as @code{int} instead.
459
460There are a few special cases where this technique is hard to use.  For
461example, if a function argument needs to hold the system type
462@code{dev_t}, you run into trouble, because @code{dev_t} is shorter than
463@code{int} on some machines; but you cannot use @code{int} instead,
464because @code{dev_t} is wider than @code{int} on some machines.  There
465is no type you can safely use on all machines in a non-standard
466definition.  The only way to support non-standard C and pass such an
467argument is to check the width of @code{dev_t} using Autoconf and choose
468the argument type accordingly.  This may not be worth the trouble.
469
470In order to support pre-standard compilers that do not recognize
471prototypes, you may want to use a preprocessor macro like this:
472
473@example
474/* Declare the prototype for a general external function.  */
475#if defined (__STDC__) || defined (WINDOWSNT)
476#define P_(proto) proto
477#else
478#define P_(proto) ()
479#endif
480@end example
481
482@node Conditional Compilation
483@section Conditional Compilation
484
485When supporting configuration options already known when building your
486program we prefer using @code{if (... )} over conditional compilation,
487as in the former case the compiler is able to perform more extensive
488checking of all possible code paths.
489
490For example, please write
491
492@smallexample
493  if (HAS_FOO)
494    ...
495  else
496    ...
497@end smallexample
498
499@noindent
500instead of:
501
502@smallexample
503  #ifdef HAS_FOO
504    ...
505  #else
506    ...
507  #endif
508@end smallexample
509
510A modern compiler such as GCC will generate exactly the same code in
511both cases, and we have been using similar techniques with good success
512in several projects.  Of course, the former method assumes that
513@code{HAS_FOO} is defined as either 0 or 1.
514
515While this is not a silver bullet solving all portability problems,
516and is not always appropriate, following this policy would have saved
517GCC developers many hours, or even days, per year.
518
519In the case of function-like macros like @code{REVERSIBLE_CC_MODE} in
520GCC which cannot be simply used in @code{if (...)} statements, there is
521an easy workaround.  Simply introduce another macro
522@code{HAS_REVERSIBLE_CC_MODE} as in the following example:
523
524@smallexample
525  #ifdef REVERSIBLE_CC_MODE
526  #define HAS_REVERSIBLE_CC_MODE 1
527  #else
528  #define HAS_REVERSIBLE_CC_MODE 0
529  #endif
530@end smallexample
531
532@node Program Behavior
533@chapter Program Behavior for All Programs
534
535This chapter describes conventions for writing robust
536software.  It also describes general standards for error messages, the
537command line interface, and how libraries should behave.
538
539@menu
540* Non-GNU Standards::           We consider standards such as POSIX;
541                                  we don't "obey" them.
542* Semantics::                   Writing robust programs.
543* Libraries::                   Library behavior.
544* Errors::                      Formatting error messages.
545* User Interfaces::             Standards about interfaces generally.
546* Graphical Interfaces::        Standards for graphical interfaces.
547* Command-Line Interfaces::     Standards for command line interfaces.
548* Dynamic Plug-In Interfaces::  Standards for dynamic plug-in interfaces.
549* Option Table::                Table of long options.
550* OID Allocations::             Table of OID slots for GNU.
551* Memory Usage::                When and how to care about memory needs.
552* File Usage::                  Which files to use, and where.
553@end menu
554
555@node Non-GNU Standards
556@section Non-GNU Standards
557
558The GNU Project regards standards published by other organizations as
559suggestions, not orders.  We consider those standards, but we do not
560``obey'' them.  In developing a GNU program, you should implement
561an outside standard's specifications when that makes the GNU system
562better overall in an objective sense.  When it doesn't, you shouldn't.
563
564In most cases, following published standards is convenient for
565users---it means that their programs or scripts will work more
566portably.  For instance, GCC implements nearly all the features of
567Standard C as specified by that standard.  C program developers would
568be unhappy if it did not.  And GNU utilities mostly follow
569specifications of POSIX.2; shell script writers and users would be
570unhappy if our programs were incompatible.
571
572But we do not follow either of these specifications rigidly, and there
573are specific points on which we decided not to follow them, so as to
574make the GNU system better for users.
575
576For instance, Standard C says that nearly all extensions to C are
577prohibited.  How silly!  GCC implements many extensions, some of which
578were later adopted as part of the standard.  If you want these
579constructs to give an error message as ``required'' by the standard,
580you must specify @samp{--pedantic}, which was implemented only so that
581we can say ``GCC is a 100% implementation of the standard'', not
582because there is any reason to actually use it.
583
584POSIX.2 specifies that @samp{df} and @samp{du} must output sizes by
585default in units of 512 bytes.  What users want is units of 1k, so
586that is what we do by default.  If you want the ridiculous behavior
587``required'' by POSIX, you must set the environment variable
588@samp{POSIXLY_CORRECT} (which was originally going to be named
589@samp{POSIX_ME_HARDER}).
590
591GNU utilities also depart from the letter of the POSIX.2 specification
592when they support long-named command-line options, and intermixing
593options with ordinary arguments.  This minor incompatibility with
594POSIX is never a problem in practice, and it is very useful.
595
596In particular, don't reject a new feature, or remove an old one,
597merely because a standard says it is ``forbidden'' or ``deprecated''.
598
599
600@node Semantics
601@section Writing Robust Programs
602
603@cindex arbitrary limits on data
604Avoid arbitrary limits on the length or number of @emph{any} data
605structure, including file names, lines, files, and symbols, by allocating
606all data structures dynamically.  In most Unix utilities, ``long lines
607are silently truncated''.  This is not acceptable in a GNU utility.
608
609@cindex @code{NUL} characters
610@findex libiconv
611Utilities reading files should not drop NUL characters, or any other
612nonprinting characters @emph{including those with codes above 0177}.
613The only sensible exceptions would be utilities specifically intended
614for interface to certain types of terminals or printers that can't
615handle those characters.  Whenever possible, try to make programs work
616properly with sequences of bytes that represent multibyte characters;
617UTF-8 is the most important.
618
619@cindex error messages
620Check every system call for an error return, unless you know you wish
621to ignore errors.  Include the system error text (from @code{perror},
622@code{strerror}, or equivalent) in @emph{every} error message
623resulting from a failing system call, as well as the name of the file
624if any and the name of the utility.  Just ``cannot open foo.c'' or
625``stat failed'' is not sufficient.
626
627@cindex @code{malloc} return value
628@cindex memory allocation failure
629Check every call to @code{malloc} or @code{realloc} to see if it
630returned zero.  Check @code{realloc} even if you are making the block
631smaller; in a system that rounds block sizes to a power of 2,
632@code{realloc} may get a different block if you ask for less space.
633
634In Unix, @code{realloc} can destroy the storage block if it returns
635zero.  GNU @code{realloc} does not have this bug: if it fails, the
636original block is unchanged.  Feel free to assume the bug is fixed.  If
637you wish to run your program on Unix, and wish to avoid lossage in this
638case, you can use the GNU @code{malloc}.
639
640You must expect @code{free} to alter the contents of the block that was
641freed.  Anything you want to fetch from the block, you must fetch before
642calling @code{free}.
643
644If @code{malloc} fails in a noninteractive program, make that a fatal
645error.  In an interactive program (one that reads commands from the
646user), it is better to abort the command and return to the command
647reader loop.  This allows the user to kill other processes to free up
648virtual memory, and then try the command again.
649
650@cindex command-line arguments, decoding
651Use @code{getopt_long} to decode arguments, unless the argument syntax
652makes this unreasonable.
653
654When static storage is to be written in during program execution, use
655explicit C code to initialize it.  Reserve C initialized declarations
656for data that will not be changed.
657@c ADR: why?
658
659Try to avoid low-level interfaces to obscure Unix data structures (such
660as file directories, utmp, or the layout of kernel memory), since these
661are less likely to work compatibly.  If you need to find all the files
662in a directory, use @code{readdir} or some other high-level interface.
663These are supported compatibly by GNU.
664
665@cindex signal handling
666The preferred signal handling facilities are the BSD variant of
667@code{signal}, and the @sc{posix} @code{sigaction} function; the
668alternative USG @code{signal} interface is an inferior design.
669
670Nowadays, using the @sc{posix} signal functions may be the easiest way
671to make a program portable.  If you use @code{signal}, then on GNU/Linux
672systems running GNU libc version 1, you should include
673@file{bsd/signal.h} instead of @file{signal.h}, so as to get BSD
674behavior.  It is up to you whether to support systems where
675@code{signal} has only the USG behavior, or give up on them.
676
677@cindex impossible conditions
678In error checks that detect ``impossible'' conditions, just abort.
679There is usually no point in printing any message.  These checks
680indicate the existence of bugs.  Whoever wants to fix the bugs will have
681to read the source code and run a debugger.  So explain the problem with
682comments in the source.  The relevant data will be in variables, which
683are easy to examine with the debugger, so there is no point moving them
684elsewhere.
685
686Do not use a count of errors as the exit status for a program.
687@emph{That does not work}, because exit status values are limited to 8
688bits (0 through 255).  A single run of the program might have 256
689errors; if you try to return 256 as the exit status, the parent process
690will see 0 as the status, and it will appear that the program succeeded.
691
692@cindex temporary files
693@cindex @code{TMPDIR} environment variable
694If you make temporary files, check the @code{TMPDIR} environment
695variable; if that variable is defined, use the specified directory
696instead of @file{/tmp}.
697
698In addition, be aware that there is a possible security problem when
699creating temporary files in world-writable directories.  In C, you can
700avoid this problem by creating temporary files in this manner:
701
702@example
703fd = open (filename, O_WRONLY | O_CREAT | O_EXCL, 0600);
704@end example
705
706@noindent
707or by using the @code{mkstemps} function from Gnulib
708(@pxref{mkstemps,,, gnulib, Gnulib}).
709
710In bash, use @code{set -C} (long name @code{noclobber}) to avoid this
711problem.  In addition, the @code{mktemp} utility is a more general
712solution for creating temporary files from shell scripts
713(@pxref{mktemp invocation,,, coreutils, GNU Coreutils}).
714
715
716@node Libraries
717@section Library Behavior
718@cindex libraries
719
720Try to make library functions reentrant.  If they need to do dynamic
721storage allocation, at least try to avoid any nonreentrancy aside from
722that of @code{malloc} itself.
723
724Here are certain name conventions for libraries, to avoid name
725conflicts.
726
727Choose a name prefix for the library, more than two characters long.
728All external function and variable names should start with this
729prefix.  In addition, there should only be one of these in any given
730library member.  This usually means putting each one in a separate
731source file.
732
733An exception can be made when two external symbols are always used
734together, so that no reasonable program could use one without the
735other; then they can both go in the same file.
736
737External symbols that are not documented entry points for the user
738should have names beginning with @samp{_}.  The @samp{_} should be
739followed by the chosen name prefix for the library, to prevent
740collisions with other libraries.  These can go in the same files with
741user entry points if you like.
742
743Static functions and variables can be used as you like and need not
744fit any naming convention.
745
746@node Errors
747@section Formatting Error Messages
748@cindex formatting error messages
749@cindex error messages, formatting
750
751Error messages from compilers should look like this:
752
753@example
754@var{sourcefile}:@var{lineno}: @var{message}
755@end example
756
757@noindent
758If you want to mention the column number, use one of these formats:
759
760@example
761@var{sourcefile}:@var{lineno}:@var{column}: @var{message}
762@var{sourcefile}:@var{lineno}.@var{column}: @var{message}
763
764@end example
765
766@noindent
767Line numbers should start from 1 at the beginning of the file, and
768column numbers should start from 1 at the beginning of the line.
769(Both of these conventions are chosen for compatibility.)  Calculate
770column numbers assuming that space and all ASCII printing characters
771have equal width, and assuming tab stops every 8 columns.  For
772non-ASCII characters, Unicode character widths should be used when in
773a UTF-8 locale; GNU libc and GNU gnulib provide suitable
774@code{wcwidth} functions.
775
776The error message can also give both the starting and ending positions
777of the erroneous text.  There are several formats so that you can
778avoid redundant information such as a duplicate line number.
779Here are the possible formats:
780
781@example
782@var{sourcefile}:@var{line1}.@var{column1}-@var{line2}.@var{column2}: @var{message}
783@var{sourcefile}:@var{line1}.@var{column1}-@var{column2}: @var{message}
784@var{sourcefile}:@var{line1}-@var{line2}: @var{message}
785@end example
786
787@noindent
788When an error is spread over several files, you can use this format:
789
790@example
791@var{file1}:@var{line1}.@var{column1}-@var{file2}:@var{line2}.@var{column2}: @var{message}
792@end example
793
794Error messages from other noninteractive programs should look like this:
795
796@example
797@var{program}:@var{sourcefile}:@var{lineno}: @var{message}
798@end example
799
800@noindent
801when there is an appropriate source file, or like this:
802
803@example
804@var{program}: @var{message}
805@end example
806
807@noindent
808when there is no relevant source file.
809
810If you want to mention the column number, use this format:
811
812@example
813@var{program}:@var{sourcefile}:@var{lineno}:@var{column}: @var{message}
814@end example
815
816In an interactive program (one that is reading commands from a
817terminal), it is better not to include the program name in an error
818message.  The place to indicate which program is running is in the
819prompt or with the screen layout.  (When the same program runs with
820input from a source other than a terminal, it is not interactive and
821would do best to print error messages using the noninteractive style.)
822
823The string @var{message} should not begin with a capital letter when
824it follows a program name and/or file name, because that isn't the
825beginning of a sentence.  (The sentence conceptually starts at the
826beginning of the line.)  Also, it should not end with a period.
827
828Error messages from interactive programs, and other messages such as
829usage messages, should start with a capital letter.  But they should not
830end with a period.
831
832@node User Interfaces
833@section Standards for Interfaces Generally
834
835@cindex program name and its behavior
836@cindex behavior, dependent on program's name
837Please don't make the behavior of a utility depend on the name used
838to invoke it.  It is useful sometimes to make a link to a utility
839with a different name, and that should not change what it does.
840
841Instead, use a run time option or a compilation switch or both
842to select among the alternate behaviors.
843
844@cindex output device and program's behavior
845Likewise, please don't make the behavior of the program depend on the
846type of output device it is used with.  Device independence is an
847important principle of the system's design; do not compromise it merely
848to save someone from typing an option now and then.  (Variation in error
849message syntax when using a terminal is ok, because that is a side issue
850that people do not depend on.)
851
852If you think one behavior is most useful when the output is to a
853terminal, and another is most useful when the output is a file or a
854pipe, then it is usually best to make the default behavior the one that
855is useful with output to a terminal, and have an option for the other
856behavior.
857
858Compatibility requires certain programs to depend on the type of output
859device.  It would be disastrous if @code{ls} or @code{sh} did not do so
860in the way all users expect.  In some of these cases, we supplement the
861program with a preferred alternate version that does not depend on the
862output device type.  For example, we provide a @code{dir} program much
863like @code{ls} except that its default output format is always
864multi-column format.
865
866
867@node Graphical Interfaces
868@section Standards for Graphical Interfaces
869@cindex graphical user interface
870@cindex interface styles
871@cindex user interface styles
872
873@cindex GTK+
874When you write a program that provides a graphical user interface,
875please make it work with the X Window System and the GTK+ toolkit
876unless the functionality specifically requires some alternative (for
877example, ``displaying jpeg images while in console mode'').
878
879In addition, please provide a command-line interface to control the
880functionality.  (In many cases, the graphical user interface can be a
881separate program which invokes the command-line program.)  This is
882so that the same jobs can be done from scripts.
883
884@cindex CORBA
885@cindex GNOME
886@cindex D-bus
887@cindex keyboard interface
888@cindex library interface
889Please also consider providing a D-bus interface for use from other
890running programs, such as within GNOME.  (GNOME used to use CORBA
891for this, but that is being phased out.)  In addition, consider
892providing a library interface (for use from C), and perhaps a
893keyboard-driven console interface (for use by users from console
894mode).  Once you are doing the work to provide the functionality and
895the graphical interface, these won't be much extra work.
896
897
898@node Command-Line Interfaces
899@section Standards for Command Line Interfaces
900@cindex command-line interface
901
902@findex getopt
903It is a good idea to follow the @sc{posix} guidelines for the
904command-line options of a program.  The easiest way to do this is to use
905@code{getopt} to parse them.  Note that the GNU version of @code{getopt}
906will normally permit options anywhere among the arguments unless the
907special argument @samp{--} is used.  This is not what @sc{posix}
908specifies; it is a GNU extension.
909
910@cindex long-named options
911Please define long-named options that are equivalent to the
912single-letter Unix-style options.  We hope to make GNU more user
913friendly this way.  This is easy to do with the GNU function
914@code{getopt_long}.
915
916One of the advantages of long-named options is that they can be
917consistent from program to program.  For example, users should be able
918to expect the ``verbose'' option of any GNU program which has one, to be
919spelled precisely @samp{--verbose}.  To achieve this uniformity, look at
920the table of common long-option names when you choose the option names
921for your program (@pxref{Option Table}).
922
923It is usually a good idea for file names given as ordinary arguments to
924be input files only; any output files would be specified using options
925(preferably @samp{-o} or @samp{--output}).  Even if you allow an output
926file name as an ordinary argument for compatibility, try to provide an
927option as another way to specify it.  This will lead to more consistency
928among GNU utilities, and fewer idiosyncrasies for users to remember.
929
930@cindex standard command-line options
931@cindex options, standard command-line
932@cindex CGI programs, standard options for
933@cindex PATH_INFO, specifying standard options as
934All programs should support two standard options: @samp{--version}
935and @samp{--help}.  CGI programs should accept these as command-line
936options, and also if given as the @env{PATH_INFO}; for instance,
937visiting @url{http://example.org/p.cgi/--help} in a browser should
938output the same information as invoking @samp{p.cgi --help} from the
939command line.
940
941@menu
942* --version::       The standard output for --version.
943* --help::          The standard output for --help.
944@end menu
945
946@node --version
947@subsection @option{--version}
948
949@cindex @samp{--version} output
950
951The standard @code{--version} option should direct the program to
952print information about its name, version, origin and legal status,
953all on standard output, and then exit successfully.  Other options and
954arguments should be ignored once this is seen, and the program should
955not perform its normal function.
956
957@cindex canonical name of a program
958@cindex program's canonical name
959The first line is meant to be easy for a program to parse; the version
960number proper starts after the last space.  In addition, it contains
961the canonical name for this program, in this format:
962
963@example
964GNU Emacs 19.30
965@end example
966
967@noindent
968The program's name should be a constant string; @emph{don't} compute it
969from @code{argv[0]}.  The idea is to state the standard or canonical
970name for the program, not its file name.  There are other ways to find
971out the precise file name where a command is found in @code{PATH}.
972
973If the program is a subsidiary part of a larger package, mention the
974package name in parentheses, like this:
975
976@example
977emacsserver (GNU Emacs) 19.30
978@end example
979
980@noindent
981If the package has a version number which is different from this
982program's version number, you can mention the package version number
983just before the close-parenthesis.
984
985If you @emph{need} to mention the version numbers of libraries which
986are distributed separately from the package which contains this program,
987you can do so by printing an additional line of version info for each
988library you want to mention.  Use the same format for these lines as for
989the first line.
990
991Please do not mention all of the libraries that the program uses ``just
992for completeness''---that would produce a lot of unhelpful clutter.
993Please mention library version numbers only if you find in practice that
994they are very important to you in debugging.
995
996The following line, after the version number line or lines, should be a
997copyright notice.  If more than one copyright notice is called for, put
998each on a separate line.
999
1000Next should follow a line stating the license, preferably using one of
1001abbreviations below, and a brief statement that the program is free
1002software, and that users are free to copy and change it.  Also mention
1003that there is no warranty, to the extent permitted by law.  See
1004recommended wording below.
1005
1006It is ok to finish the output with a list of the major authors of the
1007program, as a way of giving credit.
1008
1009Here's an example of output that follows these rules:
1010
1011@smallexample
1012GNU hello 2.3
1013Copyright (C) 2007 Free Software Foundation, Inc.
1014License GPLv3+: GNU GPL version 3 or later <http://gnu.org/licenses/gpl.html>
1015This is free software: you are free to change and redistribute it.
1016There is NO WARRANTY, to the extent permitted by law.
1017@end smallexample
1018
1019You should adapt this to your program, of course, filling in the proper
1020year, copyright holder, name of program, and the references to
1021distribution terms, and changing the rest of the wording as necessary.
1022
1023This copyright notice only needs to mention the most recent year in
1024which changes were made---there's no need to list the years for previous
1025versions' changes.  You don't have to mention the name of the program in
1026these notices, if that is inconvenient, since it appeared in the first
1027line.  (The rules are different for copyright notices in source files;
1028@pxref{Copyright Notices,,,maintain,Information for GNU Maintainers}.)
1029
1030Translations of the above lines must preserve the validity of the
1031copyright notices (@pxref{Internationalization}).  If the translation's
1032character set supports it, the @samp{(C)} should be replaced with the
1033copyright symbol, as follows:
1034
1035@ifinfo
1036(the official copyright symbol, which is the letter C in a circle);
1037@end ifinfo
1038@ifnotinfo
1039@copyright{}
1040@end ifnotinfo
1041
1042Write the word ``Copyright'' exactly like that, in English.  Do not
1043translate it into another language.  International treaties recognize
1044the English word ``Copyright''; translations into other languages do not
1045have legal significance.
1046
1047Finally, here is the table of our suggested license abbreviations.
1048Any abbreviation can be followed by @samp{v@var{version}[+]}, meaning
1049that particular version, or later versions with the @samp{+}, as shown
1050above.
1051
1052In the case of exceptions for extra permissions with the GPL, we use
1053@samp{/} for a separator; the version number can follow the license
1054abbreviation as usual, as in the examples below.
1055
1056@table @asis
1057@item GPL
1058GNU General Public License, @url{http://www.gnu.org/@/licenses/@/gpl.html}.
1059
1060@item LGPL
1061GNU Lesser General Public License, @url{http://www.gnu.org/@/licenses/@/lgpl.html}.
1062
1063@item GPL/Ada
1064GNU GPL with the exception for Ada.
1065
1066@item Apache
1067The Apache Software Foundation license,
1068@url{http://www.apache.org/@/licenses}.
1069
1070@item Artistic
1071The Artistic license used for Perl, @url{http://www.perlfoundation.org/@/legal}.
1072
1073@item Expat
1074The Expat license, @url{http://www.jclark.com/@/xml/@/copying.txt}.
1075
1076@item MPL
1077The Mozilla Public License, @url{http://www.mozilla.org/@/MPL/}.
1078
1079@item OBSD
1080The original (4-clause) BSD license, incompatible with the GNU GPL
1081@url{http://www.xfree86.org/@/3.3.6/@/COPYRIGHT2.html#6}.
1082
1083@item PHP
1084The license used for PHP, @url{http://www.php.net/@/license/}.
1085
1086@item public domain
1087The non-license that is being in the public domain,
1088@url{http://www.gnu.org/@/licenses/@/license-list.html#PublicDomain}.
1089
1090@item Python
1091The license for Python, @url{http://www.python.org/@/2.0.1/@/license.html}.
1092
1093@item RBSD
1094The revised (3-clause) BSD, compatible with the GNU GPL,@*
1095@url{http://www.xfree86.org/@/3.3.6/@/COPYRIGHT2.html#5}.
1096
1097@item X11
1098The simple non-copyleft license used for most versions of the X Window
1099System, @url{http://www.xfree86.org/@/3.3.6/@/COPYRIGHT2.html#3}.
1100
1101@item Zlib
1102The license for Zlib, @url{http://www.gzip.org/@/zlib/@/zlib_license.html}.
1103
1104@end table
1105
1106More information about these licenses and many more are on the GNU
1107licensing web pages,
1108@url{http://www.gnu.org/@/licenses/@/license-list.html}.
1109
1110
1111@node --help
1112@subsection @option{--help}
1113
1114@cindex @samp{--help} output
1115
1116The standard @code{--help} option should output brief documentation
1117for how to invoke the program, on standard output, then exit
1118successfully.  Other options and arguments should be ignored once this
1119is seen, and the program should not perform its normal function.
1120
1121@cindex address for bug reports
1122@cindex bug reports
1123Near the end of the @samp{--help} option's output, please place lines
1124giving the email address for bug reports, the package's home page
1125(normally @indicateurl{http://www.gnu.org/software/@var{pkg}}, and the
1126general page for help using GNU programs.  The format should be like this:
1127
1128@example
1129Report bugs to: @var{mailing-address}
1130@var{pkg} home page: <http://www.gnu.org/software/@var{pkg}/>
1131General help using GNU software: <http://www.gnu.org/gethelp/>
1132@end example
1133
1134It is ok to mention other appropriate mailing lists and web pages.
1135
1136
1137@node Dynamic Plug-In Interfaces
1138@section Standards for Dynamic Plug-in Interfaces
1139@cindex plug-ins
1140@cindex dynamic plug-ins
1141
1142Another aspect of keeping free programs free is encouraging
1143development of free plug-ins, and discouraging development of
1144proprietary plug-ins.  Many GNU programs will not have anything like
1145plug-ins at all, but those that do should follow these
1146practices.
1147
1148First, the general plug-in architecture design should closely tie the
1149plug-in to the original code, such that the plug-in and the base
1150program are parts of one extended program.  For GCC, for example,
1151plug-ins receive and modify GCC's internal data structures, and so
1152clearly form an extended program with the base GCC.
1153
1154@vindex plugin_is_GPL_compatible
1155Second, you should require plug-in developers to affirm that their
1156plug-ins are released under an appropriate license.  This should be
1157enforced with a simple programmatic check.  For GCC, again for
1158example, a plug-in must define the global symbol
1159@code{plugin_is_GPL_compatible}, thus asserting that the plug-in is
1160released under a GPL-compatible license (@pxref{Plugins,, Plugins,
1161gccint, GCC Internals}).
1162
1163By adding this check to your program you are not creating a new legal
1164requirement.  The GPL itself requires plug-ins to be free software,
1165licensed compatibly.  As long as you have followed the first rule above
1166to keep plug-ins closely tied to your original program, the GPL and AGPL
1167already require those plug-ins to be released under a compatible
1168license.  The symbol definition in the plug-in---or whatever equivalent
1169works best in your program---makes it harder for anyone who might
1170distribute proprietary plug-ins to legally defend themselves.  If a case
1171about this got to court, we can point to that symbol as evidence that
1172the plug-in developer understood that the license had this requirement.
1173
1174
1175@node Option Table
1176@section Table of Long Options
1177@cindex long option names
1178@cindex table of long options
1179
1180Here is a table of long options used by GNU programs.  It is surely
1181incomplete, but we aim to list all the options that a new program might
1182want to be compatible with.  If you use names not already in the table,
1183please send @email{bug-standards@@gnu.org} a list of them, with their
1184meanings, so we can update the table.
1185
1186@c Please leave newlines between items in this table; it's much easier
1187@c to update when it isn't completely squashed together and unreadable.
1188@c When there is more than one short option for a long option name, put
1189@c a semicolon between the lists of the programs that use them, not a
1190@c period.   --friedman
1191
1192@table @samp
1193@item after-date
1194@samp{-N} in @code{tar}.
1195
1196@item all
1197@samp{-a} in @code{du}, @code{ls}, @code{nm}, @code{stty}, @code{uname},
1198and @code{unexpand}.
1199
1200@item all-text
1201@samp{-a} in @code{diff}.
1202
1203@item almost-all
1204@samp{-A} in @code{ls}.
1205
1206@item append
1207@samp{-a} in @code{etags}, @code{tee}, @code{time};
1208@samp{-r} in @code{tar}.
1209
1210@item archive
1211@samp{-a} in @code{cp}.
1212
1213@item archive-name
1214@samp{-n} in @code{shar}.
1215
1216@item arglength
1217@samp{-l} in @code{m4}.
1218
1219@item ascii
1220@samp{-a} in @code{diff}.
1221
1222@item assign
1223@samp{-v} in @code{gawk}.
1224
1225@item assume-new
1226@samp{-W} in @code{make}.
1227
1228@item assume-old
1229@samp{-o} in @code{make}.
1230
1231@item auto-check
1232@samp{-a} in @code{recode}.
1233
1234@item auto-pager
1235@samp{-a} in @code{wdiff}.
1236
1237@item auto-reference
1238@samp{-A} in @code{ptx}.
1239
1240@item avoid-wraps
1241@samp{-n} in @code{wdiff}.
1242
1243@item background
1244For server programs, run in the background.
1245
1246@item backward-search
1247@samp{-B} in @code{ctags}.
1248
1249@item basename
1250@samp{-f} in @code{shar}.
1251
1252@item batch
1253Used in GDB.
1254
1255@item baud
1256Used in GDB.
1257
1258@item before
1259@samp{-b} in @code{tac}.
1260
1261@item binary
1262@samp{-b} in @code{cpio} and @code{diff}.
1263
1264@item bits-per-code
1265@samp{-b} in @code{shar}.
1266
1267@item block-size
1268Used in @code{cpio} and @code{tar}.
1269
1270@item blocks
1271@samp{-b} in @code{head} and @code{tail}.
1272
1273@item break-file
1274@samp{-b} in @code{ptx}.
1275
1276@item brief
1277Used in various programs to make output shorter.
1278
1279@item bytes
1280@samp{-c} in @code{head}, @code{split}, and @code{tail}.
1281
1282@item c@t{++}
1283@samp{-C} in @code{etags}.
1284
1285@item catenate
1286@samp{-A} in @code{tar}.
1287
1288@item cd
1289Used in various programs to specify the directory to use.
1290
1291@item changes
1292@samp{-c} in @code{chgrp} and @code{chown}.
1293
1294@item classify
1295@samp{-F} in @code{ls}.
1296
1297@item colons
1298@samp{-c} in @code{recode}.
1299
1300@item command
1301@samp{-c} in @code{su};
1302@samp{-x} in GDB.
1303
1304@item compare
1305@samp{-d} in @code{tar}.
1306
1307@item compat
1308Used in @code{gawk}.
1309
1310@item compress
1311@samp{-Z} in @code{tar} and @code{shar}.
1312
1313@item concatenate
1314@samp{-A} in @code{tar}.
1315
1316@item confirmation
1317@samp{-w} in @code{tar}.
1318
1319@item context
1320Used in @code{diff}.
1321
1322@item copyleft
1323@samp{-W copyleft} in @code{gawk}.
1324
1325@item copyright
1326@samp{-C} in @code{ptx}, @code{recode}, and @code{wdiff};
1327@samp{-W copyright} in @code{gawk}.
1328
1329@item core
1330Used in GDB.
1331
1332@item count
1333@samp{-q} in @code{who}.
1334
1335@item count-links
1336@samp{-l} in @code{du}.
1337
1338@item create
1339Used in @code{tar} and @code{cpio}.
1340
1341@item cut-mark
1342@samp{-c} in @code{shar}.
1343
1344@item cxref
1345@samp{-x} in @code{ctags}.
1346
1347@item date
1348@samp{-d} in @code{touch}.
1349
1350@item debug
1351@samp{-d} in @code{make} and @code{m4};
1352@samp{-t} in Bison.
1353
1354@item define
1355@samp{-D} in @code{m4}.
1356
1357@item defines
1358@samp{-d} in Bison and @code{ctags}.
1359
1360@item delete
1361@samp{-D} in @code{tar}.
1362
1363@item dereference
1364@samp{-L} in @code{chgrp}, @code{chown}, @code{cpio}, @code{du},
1365@code{ls}, and @code{tar}.
1366
1367@item dereference-args
1368@samp{-D} in @code{du}.
1369
1370@item device
1371Specify an I/O device (special file name).
1372
1373@item diacritics
1374@samp{-d} in @code{recode}.
1375
1376@item dictionary-order
1377@samp{-d} in @code{look}.
1378
1379@item diff
1380@samp{-d} in @code{tar}.
1381
1382@item digits
1383@samp{-n} in @code{csplit}.
1384
1385@item directory
1386Specify the directory to use, in various programs.  In @code{ls}, it
1387means to show directories themselves rather than their contents.  In
1388@code{rm} and @code{ln}, it means to not treat links to directories
1389specially.
1390
1391@item discard-all
1392@samp{-x} in @code{strip}.
1393
1394@item discard-locals
1395@samp{-X} in @code{strip}.
1396
1397@item dry-run
1398@samp{-n} in @code{make}.
1399
1400@item ed
1401@samp{-e} in @code{diff}.
1402
1403@item elide-empty-files
1404@samp{-z} in @code{csplit}.
1405
1406@item end-delete
1407@samp{-x} in @code{wdiff}.
1408
1409@item end-insert
1410@samp{-z} in @code{wdiff}.
1411
1412@item entire-new-file
1413@samp{-N} in @code{diff}.
1414
1415@item environment-overrides
1416@samp{-e} in @code{make}.
1417
1418@item eof
1419@samp{-e} in @code{xargs}.
1420
1421@item epoch
1422Used in GDB.
1423
1424@item error-limit
1425Used in @code{makeinfo}.
1426
1427@item error-output
1428@samp{-o} in @code{m4}.
1429
1430@item escape
1431@samp{-b} in @code{ls}.
1432
1433@item exclude-from
1434@samp{-X} in @code{tar}.
1435
1436@item exec
1437Used in GDB.
1438
1439@item exit
1440@samp{-x} in @code{xargs}.
1441
1442@item exit-0
1443@samp{-e} in @code{unshar}.
1444
1445@item expand-tabs
1446@samp{-t} in @code{diff}.
1447
1448@item expression
1449@samp{-e} in @code{sed}.
1450
1451@item extern-only
1452@samp{-g} in @code{nm}.
1453
1454@item extract
1455@samp{-i} in @code{cpio};
1456@samp{-x} in @code{tar}.
1457
1458@item faces
1459@samp{-f} in @code{finger}.
1460
1461@item fast
1462@samp{-f} in @code{su}.
1463
1464@item fatal-warnings
1465@samp{-E} in @code{m4}.
1466
1467@item file
1468@samp{-f} in @code{gawk}, @code{info}, @code{make}, @code{mt},
1469@code{sed}, and @code{tar}.
1470
1471@item field-separator
1472@samp{-F} in @code{gawk}.
1473
1474@item file-prefix
1475@samp{-b} in Bison.
1476
1477@item file-type
1478@samp{-F} in @code{ls}.
1479
1480@item files-from
1481@samp{-T} in @code{tar}.
1482
1483@item fill-column
1484Used in @code{makeinfo}.
1485
1486@item flag-truncation
1487@samp{-F} in @code{ptx}.
1488
1489@item fixed-output-files
1490@samp{-y} in Bison.
1491
1492@item follow
1493@samp{-f} in @code{tail}.
1494
1495@item footnote-style
1496Used in @code{makeinfo}.
1497
1498@item force
1499@samp{-f} in @code{cp}, @code{ln}, @code{mv}, and @code{rm}.
1500
1501@item force-prefix
1502@samp{-F} in @code{shar}.
1503
1504@item foreground
1505For server programs, run in the foreground;
1506in other words, don't do anything special to run the server
1507in the background.
1508
1509@item format
1510Used in @code{ls}, @code{time}, and @code{ptx}.
1511
1512@item freeze-state
1513@samp{-F} in @code{m4}.
1514
1515@item fullname
1516Used in GDB.
1517
1518@item gap-size
1519@samp{-g} in @code{ptx}.
1520
1521@item get
1522@samp{-x} in @code{tar}.
1523
1524@item graphic
1525@samp{-i} in @code{ul}.
1526
1527@item graphics
1528@samp{-g} in @code{recode}.
1529
1530@item group
1531@samp{-g} in @code{install}.
1532
1533@item gzip
1534@samp{-z} in @code{tar} and @code{shar}.
1535
1536@item hashsize
1537@samp{-H} in @code{m4}.
1538
1539@item header
1540@samp{-h} in @code{objdump} and @code{recode}
1541
1542@item heading
1543@samp{-H} in @code{who}.
1544
1545@item help
1546Used to ask for brief usage information.
1547
1548@item here-delimiter
1549@samp{-d} in @code{shar}.
1550
1551@item hide-control-chars
1552@samp{-q} in @code{ls}.
1553
1554@item html
1555In @code{makeinfo}, output HTML.
1556
1557@item idle
1558@samp{-u} in @code{who}.
1559
1560@item ifdef
1561@samp{-D} in @code{diff}.
1562
1563@item ignore
1564@samp{-I} in @code{ls};
1565@samp{-x} in @code{recode}.
1566
1567@item ignore-all-space
1568@samp{-w} in @code{diff}.
1569
1570@item ignore-backups
1571@samp{-B} in @code{ls}.
1572
1573@item ignore-blank-lines
1574@samp{-B} in @code{diff}.
1575
1576@item ignore-case
1577@samp{-f} in @code{look} and @code{ptx};
1578@samp{-i} in @code{diff} and @code{wdiff}.
1579
1580@item ignore-errors
1581@samp{-i} in @code{make}.
1582
1583@item ignore-file
1584@samp{-i} in @code{ptx}.
1585
1586@item ignore-indentation
1587@samp{-I} in @code{etags}.
1588
1589@item ignore-init-file
1590@samp{-f} in Oleo.
1591
1592@item ignore-interrupts
1593@samp{-i} in @code{tee}.
1594
1595@item ignore-matching-lines
1596@samp{-I} in @code{diff}.
1597
1598@item ignore-space-change
1599@samp{-b} in @code{diff}.
1600
1601@item ignore-zeros
1602@samp{-i} in @code{tar}.
1603
1604@item include
1605@samp{-i} in @code{etags};
1606@samp{-I} in @code{m4}.
1607
1608@item include-dir
1609@samp{-I} in @code{make}.
1610
1611@item incremental
1612@samp{-G} in @code{tar}.
1613
1614@item info
1615@samp{-i}, @samp{-l}, and @samp{-m} in Finger.
1616
1617@item init-file
1618In some programs, specify the name of the file to read as the user's
1619init file.
1620
1621@item initial
1622@samp{-i} in @code{expand}.
1623
1624@item initial-tab
1625@samp{-T} in @code{diff}.
1626
1627@item inode
1628@samp{-i} in @code{ls}.
1629
1630@item interactive
1631@samp{-i} in @code{cp}, @code{ln}, @code{mv}, @code{rm};
1632@samp{-e} in @code{m4};
1633@samp{-p} in @code{xargs};
1634@samp{-w} in @code{tar}.
1635
1636@item intermix-type
1637@samp{-p} in @code{shar}.
1638
1639@item iso-8601
1640Used in @code{date}
1641
1642@item jobs
1643@samp{-j} in @code{make}.
1644
1645@item just-print
1646@samp{-n} in @code{make}.
1647
1648@item keep-going
1649@samp{-k} in @code{make}.
1650
1651@item keep-files
1652@samp{-k} in @code{csplit}.
1653
1654@item kilobytes
1655@samp{-k} in @code{du} and @code{ls}.
1656
1657@item language
1658@samp{-l} in @code{etags}.
1659
1660@item less-mode
1661@samp{-l} in @code{wdiff}.
1662
1663@item level-for-gzip
1664@samp{-g} in @code{shar}.
1665
1666@item line-bytes
1667@samp{-C} in @code{split}.
1668
1669@item lines
1670Used in @code{split}, @code{head}, and @code{tail}.
1671
1672@item link
1673@samp{-l} in @code{cpio}.
1674
1675@item lint
1676@itemx lint-old
1677Used in @code{gawk}.
1678
1679@item list
1680@samp{-t} in @code{cpio};
1681@samp{-l} in @code{recode}.
1682
1683@item list
1684@samp{-t} in @code{tar}.
1685
1686@item literal
1687@samp{-N} in @code{ls}.
1688
1689@item load-average
1690@samp{-l} in @code{make}.
1691
1692@item login
1693Used in @code{su}.
1694
1695@item machine
1696Used in @code{uname}.
1697
1698@item macro-name
1699@samp{-M} in @code{ptx}.
1700
1701@item mail
1702@samp{-m} in @code{hello} and @code{uname}.
1703
1704@item make-directories
1705@samp{-d} in @code{cpio}.
1706
1707@item makefile
1708@samp{-f} in @code{make}.
1709
1710@item mapped
1711Used in GDB.
1712
1713@item max-args
1714@samp{-n} in @code{xargs}.
1715
1716@item max-chars
1717@samp{-n} in @code{xargs}.
1718
1719@item max-lines
1720@samp{-l} in @code{xargs}.
1721
1722@item max-load
1723@samp{-l} in @code{make}.
1724
1725@item max-procs
1726@samp{-P} in @code{xargs}.
1727
1728@item mesg
1729@samp{-T} in @code{who}.
1730
1731@item message
1732@samp{-T} in @code{who}.
1733
1734@item minimal
1735@samp{-d} in @code{diff}.
1736
1737@item mixed-uuencode
1738@samp{-M} in @code{shar}.
1739
1740@item mode
1741@samp{-m} in @code{install}, @code{mkdir}, and @code{mkfifo}.
1742
1743@item modification-time
1744@samp{-m} in @code{tar}.
1745
1746@item multi-volume
1747@samp{-M} in @code{tar}.
1748
1749@item name-prefix
1750@samp{-a} in Bison.
1751
1752@item nesting-limit
1753@samp{-L} in @code{m4}.
1754
1755@item net-headers
1756@samp{-a} in @code{shar}.
1757
1758@item new-file
1759@samp{-W} in @code{make}.
1760
1761@item no-builtin-rules
1762@samp{-r} in @code{make}.
1763
1764@item no-character-count
1765@samp{-w} in @code{shar}.
1766
1767@item no-check-existing
1768@samp{-x} in @code{shar}.
1769
1770@item no-common
1771@samp{-3} in @code{wdiff}.
1772
1773@item no-create
1774@samp{-c} in @code{touch}.
1775
1776@item no-defines
1777@samp{-D} in @code{etags}.
1778
1779@item no-deleted
1780@samp{-1} in @code{wdiff}.
1781
1782@item no-dereference
1783@samp{-d} in @code{cp}.
1784
1785@item no-inserted
1786@samp{-2} in @code{wdiff}.
1787
1788@item no-keep-going
1789@samp{-S} in @code{make}.
1790
1791@item no-lines
1792@samp{-l} in Bison.
1793
1794@item no-piping
1795@samp{-P} in @code{shar}.
1796
1797@item no-prof
1798@samp{-e} in @code{gprof}.
1799
1800@item no-regex
1801@samp{-R} in @code{etags}.
1802
1803@item no-sort
1804@samp{-p} in @code{nm}.
1805
1806@item no-splash
1807Don't print a startup splash screen.
1808
1809@item no-split
1810Used in @code{makeinfo}.
1811
1812@item no-static
1813@samp{-a} in @code{gprof}.
1814
1815@item no-time
1816@samp{-E} in @code{gprof}.
1817
1818@item no-timestamp
1819@samp{-m} in @code{shar}.
1820
1821@item no-validate
1822Used in @code{makeinfo}.
1823
1824@item no-wait
1825Used in @code{emacsclient}.
1826
1827@item no-warn
1828Used in various programs to inhibit warnings.
1829
1830@item node
1831@samp{-n} in @code{info}.
1832
1833@item nodename
1834@samp{-n} in @code{uname}.
1835
1836@item nonmatching
1837@samp{-f} in @code{cpio}.
1838
1839@item nstuff
1840@samp{-n} in @code{objdump}.
1841
1842@item null
1843@samp{-0} in @code{xargs}.
1844
1845@item number
1846@samp{-n} in @code{cat}.
1847
1848@item number-nonblank
1849@samp{-b} in @code{cat}.
1850
1851@item numeric-sort
1852@samp{-n} in @code{nm}.
1853
1854@item numeric-uid-gid
1855@samp{-n} in @code{cpio} and @code{ls}.
1856
1857@item nx
1858Used in GDB.
1859
1860@item old-archive
1861@samp{-o} in @code{tar}.
1862
1863@item old-file
1864@samp{-o} in @code{make}.
1865
1866@item one-file-system
1867@samp{-l} in @code{tar}, @code{cp}, and @code{du}.
1868
1869@item only-file
1870@samp{-o} in @code{ptx}.
1871
1872@item only-prof
1873@samp{-f} in @code{gprof}.
1874
1875@item only-time
1876@samp{-F} in @code{gprof}.
1877
1878@item options
1879@samp{-o} in @code{getopt}, @code{fdlist}, @code{fdmount},
1880@code{fdmountd}, and @code{fdumount}.
1881
1882@item output
1883In various programs, specify the output file name.
1884
1885@item output-prefix
1886@samp{-o} in @code{shar}.
1887
1888@item override
1889@samp{-o} in @code{rm}.
1890
1891@item overwrite
1892@samp{-c} in @code{unshar}.
1893
1894@item owner
1895@samp{-o} in @code{install}.
1896
1897@item paginate
1898@samp{-l} in @code{diff}.
1899
1900@item paragraph-indent
1901Used in @code{makeinfo}.
1902
1903@item parents
1904@samp{-p} in @code{mkdir} and @code{rmdir}.
1905
1906@item pass-all
1907@samp{-p} in @code{ul}.
1908
1909@item pass-through
1910@samp{-p} in @code{cpio}.
1911
1912@item port
1913@samp{-P} in @code{finger}.
1914
1915@item portability
1916@samp{-c} in @code{cpio} and @code{tar}.
1917
1918@item posix
1919Used in @code{gawk}.
1920
1921@item prefix-builtins
1922@samp{-P} in @code{m4}.
1923
1924@item prefix
1925@samp{-f} in @code{csplit}.
1926
1927@item preserve
1928Used in @code{tar} and @code{cp}.
1929
1930@item preserve-environment
1931@samp{-p} in @code{su}.
1932
1933@item preserve-modification-time
1934@samp{-m} in @code{cpio}.
1935
1936@item preserve-order
1937@samp{-s} in @code{tar}.
1938
1939@item preserve-permissions
1940@samp{-p} in @code{tar}.
1941
1942@item print
1943@samp{-l} in @code{diff}.
1944
1945@item print-chars
1946@samp{-L} in @code{cmp}.
1947
1948@item print-data-base
1949@samp{-p} in @code{make}.
1950
1951@item print-directory
1952@samp{-w} in @code{make}.
1953
1954@item print-file-name
1955@samp{-o} in @code{nm}.
1956
1957@item print-symdefs
1958@samp{-s} in @code{nm}.
1959
1960@item printer
1961@samp{-p} in @code{wdiff}.
1962
1963@item prompt
1964@samp{-p} in @code{ed}.
1965
1966@item proxy
1967Specify an HTTP proxy.
1968
1969@item query-user
1970@samp{-X} in @code{shar}.
1971
1972@item question
1973@samp{-q} in @code{make}.
1974
1975@item quiet
1976Used in many programs to inhibit the usual output.  Every
1977program accepting @samp{--quiet} should accept @samp{--silent} as a
1978synonym.
1979
1980@item quiet-unshar
1981@samp{-Q} in @code{shar}
1982
1983@item quote-name
1984@samp{-Q} in @code{ls}.
1985
1986@item rcs
1987@samp{-n} in @code{diff}.
1988
1989@item re-interval
1990Used in @code{gawk}.
1991
1992@item read-full-blocks
1993@samp{-B} in @code{tar}.
1994
1995@item readnow
1996Used in GDB.
1997
1998@item recon
1999@samp{-n} in @code{make}.
2000
2001@item record-number
2002@samp{-R} in @code{tar}.
2003
2004@item recursive
2005Used in @code{chgrp}, @code{chown}, @code{cp}, @code{ls}, @code{diff},
2006and @code{rm}.
2007
2008@item reference
2009@samp{-r} in @code{touch}.
2010
2011@item references
2012@samp{-r} in @code{ptx}.
2013
2014@item regex
2015@samp{-r} in @code{tac} and @code{etags}.
2016
2017@item release
2018@samp{-r} in @code{uname}.
2019
2020@item reload-state
2021@samp{-R} in @code{m4}.
2022
2023@item relocation
2024@samp{-r} in @code{objdump}.
2025
2026@item rename
2027@samp{-r} in @code{cpio}.
2028
2029@item replace
2030@samp{-i} in @code{xargs}.
2031
2032@item report-identical-files
2033@samp{-s} in @code{diff}.
2034
2035@item reset-access-time
2036@samp{-a} in @code{cpio}.
2037
2038@item reverse
2039@samp{-r} in @code{ls} and @code{nm}.
2040
2041@item reversed-ed
2042@samp{-f} in @code{diff}.
2043
2044@item right-side-defs
2045@samp{-R} in @code{ptx}.
2046
2047@item same-order
2048@samp{-s} in @code{tar}.
2049
2050@item same-permissions
2051@samp{-p} in @code{tar}.
2052
2053@item save
2054@samp{-g} in @code{stty}.
2055
2056@item se
2057Used in GDB.
2058
2059@item sentence-regexp
2060@samp{-S} in @code{ptx}.
2061
2062@item separate-dirs
2063@samp{-S} in @code{du}.
2064
2065@item separator
2066@samp{-s} in @code{tac}.
2067
2068@item sequence
2069Used by @code{recode} to chose files or pipes for sequencing passes.
2070
2071@item shell
2072@samp{-s} in @code{su}.
2073
2074@item show-all
2075@samp{-A} in @code{cat}.
2076
2077@item show-c-function
2078@samp{-p} in @code{diff}.
2079
2080@item show-ends
2081@samp{-E} in @code{cat}.
2082
2083@item show-function-line
2084@samp{-F} in @code{diff}.
2085
2086@item show-tabs
2087@samp{-T} in @code{cat}.
2088
2089@item silent
2090Used in many programs to inhibit the usual output.
2091Every program accepting
2092@samp{--silent} should accept @samp{--quiet} as a synonym.
2093
2094@item size
2095@samp{-s} in @code{ls}.
2096
2097@item socket
2098Specify a file descriptor for a network server to use for its socket,
2099instead of opening and binding a new socket.  This provides a way to
2100run, in a non-privileged process, a server that normally needs a
2101reserved port number.
2102
2103@item sort
2104Used in @code{ls}.
2105
2106@item source
2107@samp{-W source} in @code{gawk}.
2108
2109@item sparse
2110@samp{-S} in @code{tar}.
2111
2112@item speed-large-files
2113@samp{-H} in @code{diff}.
2114
2115@item split-at
2116@samp{-E} in @code{unshar}.
2117
2118@item split-size-limit
2119@samp{-L} in @code{shar}.
2120
2121@item squeeze-blank
2122@samp{-s} in @code{cat}.
2123
2124@item start-delete
2125@samp{-w} in @code{wdiff}.
2126
2127@item start-insert
2128@samp{-y} in @code{wdiff}.
2129
2130@item starting-file
2131Used in @code{tar} and @code{diff} to specify which file within
2132a directory to start processing with.
2133
2134@item statistics
2135@samp{-s} in @code{wdiff}.
2136
2137@item stdin-file-list
2138@samp{-S} in @code{shar}.
2139
2140@item stop
2141@samp{-S} in @code{make}.
2142
2143@item strict
2144@samp{-s} in @code{recode}.
2145
2146@item strip
2147@samp{-s} in @code{install}.
2148
2149@item strip-all
2150@samp{-s} in @code{strip}.
2151
2152@item strip-debug
2153@samp{-S} in @code{strip}.
2154
2155@item submitter
2156@samp{-s} in @code{shar}.
2157
2158@item suffix
2159@samp{-S} in @code{cp}, @code{ln}, @code{mv}.
2160
2161@item suffix-format
2162@samp{-b} in @code{csplit}.
2163
2164@item sum
2165@samp{-s} in @code{gprof}.
2166
2167@item summarize
2168@samp{-s} in @code{du}.
2169
2170@item symbolic
2171@samp{-s} in @code{ln}.
2172
2173@item symbols
2174Used in GDB and @code{objdump}.
2175
2176@item synclines
2177@samp{-s} in @code{m4}.
2178
2179@item sysname
2180@samp{-s} in @code{uname}.
2181
2182@item tabs
2183@samp{-t} in @code{expand} and @code{unexpand}.
2184
2185@item tabsize
2186@samp{-T} in @code{ls}.
2187
2188@item terminal
2189@samp{-T} in @code{tput} and @code{ul}.
2190@samp{-t} in @code{wdiff}.
2191
2192@item text
2193@samp{-a} in @code{diff}.
2194
2195@item text-files
2196@samp{-T} in @code{shar}.
2197
2198@item time
2199Used in @code{ls} and @code{touch}.
2200
2201@item timeout
2202Specify how long to wait before giving up on some operation.
2203
2204@item to-stdout
2205@samp{-O} in @code{tar}.
2206
2207@item total
2208@samp{-c} in @code{du}.
2209
2210@item touch
2211@samp{-t} in @code{make}, @code{ranlib}, and @code{recode}.
2212
2213@item trace
2214@samp{-t} in @code{m4}.
2215
2216@item traditional
2217@samp{-t} in @code{hello};
2218@samp{-W traditional} in @code{gawk};
2219@samp{-G} in @code{ed}, @code{m4}, and @code{ptx}.
2220
2221@item tty
2222Used in GDB.
2223
2224@item typedefs
2225@samp{-t} in @code{ctags}.
2226
2227@item typedefs-and-c++
2228@samp{-T} in @code{ctags}.
2229
2230@item typeset-mode
2231@samp{-t} in @code{ptx}.
2232
2233@item uncompress
2234@samp{-z} in @code{tar}.
2235
2236@item unconditional
2237@samp{-u} in @code{cpio}.
2238
2239@item undefine
2240@samp{-U} in @code{m4}.
2241
2242@item undefined-only
2243@samp{-u} in @code{nm}.
2244
2245@item update
2246@samp{-u} in @code{cp}, @code{ctags}, @code{mv}, @code{tar}.
2247
2248@item usage
2249Used in @code{gawk}; same as @samp{--help}.
2250
2251@item uuencode
2252@samp{-B} in @code{shar}.
2253
2254@item vanilla-operation
2255@samp{-V} in @code{shar}.
2256
2257@item verbose
2258Print more information about progress.  Many programs support this.
2259
2260@item verify
2261@samp{-W} in @code{tar}.
2262
2263@item version
2264Print the version number.
2265
2266@item version-control
2267@samp{-V} in @code{cp}, @code{ln}, @code{mv}.
2268
2269@item vgrind
2270@samp{-v} in @code{ctags}.
2271
2272@item volume
2273@samp{-V} in @code{tar}.
2274
2275@item what-if
2276@samp{-W} in @code{make}.
2277
2278@item whole-size-limit
2279@samp{-l} in @code{shar}.
2280
2281@item width
2282@samp{-w} in @code{ls} and @code{ptx}.
2283
2284@item word-regexp
2285@samp{-W} in @code{ptx}.
2286
2287@item writable
2288@samp{-T} in @code{who}.
2289
2290@item zeros
2291@samp{-z} in @code{gprof}.
2292@end table
2293
2294@node OID Allocations
2295@section OID Allocations
2296@cindex OID allocations for GNU
2297@cindex SNMP
2298@cindex LDAP
2299@cindex X.509
2300
2301The OID (object identifier) 1.3.6.1.4.1.11591 has been assigned to the
2302GNU Project (thanks to Werner Koch).  These are used for SNMP, LDAP,
2303X.509 certificates, and so on.  The web site
2304@url{http://www.alvestrand.no/objectid} has a (voluntary) listing of
2305many OID assignments.
2306
2307If you need a new slot for your GNU package, write
2308@email{maintainers@@gnu.org}.  Here is a list of arcs currently
2309assigned:
2310
2311@example
2312@include gnu-oids.texi
2313@end example
2314
2315
2316@node Memory Usage
2317@section Memory Usage
2318@cindex memory usage
2319
2320If a program typically uses just a few meg of memory, don't bother making any
2321effort to reduce memory usage.  For example, if it is impractical for
2322other reasons to operate on files more than a few meg long, it is
2323reasonable to read entire input files into memory to operate on them.
2324
2325However, for programs such as @code{cat} or @code{tail}, that can
2326usefully operate on very large files, it is important to avoid using a
2327technique that would artificially limit the size of files it can handle.
2328If a program works by lines and could be applied to arbitrary
2329user-supplied input files, it should keep only a line in memory, because
2330this is not very hard and users will want to be able to operate on input
2331files that are bigger than will fit in memory all at once.
2332
2333If your program creates complicated data structures, just make them in
2334memory and give a fatal error if @code{malloc} returns zero.
2335
2336@pindex valgrind
2337@cindex memory leak
2338Memory analysis tools such as @command{valgrind} can be useful, but
2339don't complicate a program merely to avoid their false alarms.  For
2340example, if memory is used until just before a process exits, don't
2341free it simply to silence such a tool.
2342
2343
2344@node File Usage
2345@section File Usage
2346@cindex file usage
2347
2348Programs should be prepared to operate when @file{/usr} and @file{/etc}
2349are read-only file systems.  Thus, if the program manages log files,
2350lock files, backup files, score files, or any other files which are
2351modified for internal purposes, these files should not be stored in
2352@file{/usr} or @file{/etc}.
2353
2354There are two exceptions.  @file{/etc} is used to store system
2355configuration information; it is reasonable for a program to modify
2356files in @file{/etc} when its job is to update the system configuration.
2357Also, if the user explicitly asks to modify one file in a directory, it
2358is reasonable for the program to store other files in the same
2359directory.
2360
2361@node Writing C
2362@chapter Making The Best Use of C
2363
2364This chapter provides advice on how best to use the C language
2365when writing GNU software.
2366
2367@menu
2368* Formatting::                  Formatting your source code.
2369* Comments::                    Commenting your work.
2370* Syntactic Conventions::       Clean use of C constructs.
2371* Names::                       Naming variables, functions, and files.
2372* System Portability::          Portability among different operating systems.
2373* CPU Portability::             Supporting the range of CPU types.
2374* System Functions::            Portability and ``standard'' library functions.
2375* Internationalization::        Techniques for internationalization.
2376* Character Set::               Use ASCII by default.
2377* Quote Characters::            Use "..." or '...' in the C locale.
2378* Mmap::                        How you can safely use @code{mmap}.
2379@end menu
2380
2381@node Formatting
2382@section Formatting Your Source Code
2383@cindex formatting source code
2384
2385@cindex open brace
2386@cindex braces, in C source
2387@cindex function definitions, formatting
2388It is important to put the open-brace that starts the body of a C
2389function in column one, so that they will start a defun.  Several
2390tools look for open-braces in column one to find the beginnings of C
2391functions.  These tools will not work on code not formatted that way.
2392
2393Avoid putting open-brace, open-parenthesis or open-bracket in column
2394one when they are inside a function, so that they won't start a defun.
2395The open-brace that starts a @code{struct} body can go in column one
2396if you find it useful to treat that definition as a defun.
2397
2398It is also important for function definitions to start the name of the
2399function in column one.  This helps people to search for function
2400definitions, and may also help certain tools recognize them.  Thus,
2401using Standard C syntax, the format is this:
2402
2403@example
2404static char *
2405concat (char *s1, char *s2)
2406@{
2407  @dots{}
2408@}
2409@end example
2410
2411@noindent
2412or, if you want to use traditional C syntax, format the definition like
2413this:
2414
2415@example
2416static char *
2417concat (s1, s2)        /* Name starts in column one here */
2418     char *s1, *s2;
2419@{                     /* Open brace in column one here */
2420  @dots{}
2421@}
2422@end example
2423
2424In Standard C, if the arguments don't fit nicely on one line,
2425split it like this:
2426
2427@example
2428int
2429lots_of_args (int an_integer, long a_long, short a_short,
2430              double a_double, float a_float)
2431@dots{}
2432@end example
2433
2434@cindex @code{struct} types, formatting
2435@cindex @code{enum} types, formatting
2436For @code{struct} and @code{enum} types, likewise put the braces in
2437column one, unless the whole contents fits on one line:
2438
2439@example
2440struct foo
2441@{
2442  int a, b;
2443@}
2444@exdent @r{or}
2445struct foo @{ int a, b; @}
2446@end example
2447
2448The rest of this section gives our recommendations for other aspects of
2449C formatting style, which is also the default style of the @code{indent}
2450program in version 1.2 and newer.  It corresponds to the options
2451
2452@smallexample
2453-nbad -bap -nbc -bbo -bl -bli2 -bls -ncdb -nce -cp1 -cs -di2
2454-ndj -nfc1 -nfca -hnl -i2 -ip5 -lp -pcs -psl -nsc -nsob
2455@end smallexample
2456
2457We don't think of these recommendations as requirements, because it
2458causes no problems for users if two different programs have different
2459formatting styles.
2460
2461But whatever style you use, please use it consistently, since a mixture
2462of styles within one program tends to look ugly.  If you are
2463contributing changes to an existing program, please follow the style of
2464that program.
2465
2466For the body of the function, our recommended style looks like this:
2467
2468@example
2469if (x < foo (y, z))
2470  haha = bar[4] + 5;
2471else
2472  @{
2473    while (z)
2474      @{
2475        haha += foo (z, z);
2476        z--;
2477      @}
2478    return ++x + bar ();
2479  @}
2480@end example
2481
2482@cindex spaces before open-paren
2483We find it easier to read a program when it has spaces before the
2484open-parentheses and after the commas.  Especially after the commas.
2485
2486When you split an expression into multiple lines, split it
2487before an operator, not after one.  Here is the right way:
2488
2489@cindex expressions, splitting
2490@example
2491if (foo_this_is_long && bar > win (x, y, z)
2492    && remaining_condition)
2493@end example
2494
2495Try to avoid having two operators of different precedence at the same
2496level of indentation.  For example, don't write this:
2497
2498@example
2499mode = (inmode[j] == VOIDmode
2500        || GET_MODE_SIZE (outmode[j]) > GET_MODE_SIZE (inmode[j])
2501        ? outmode[j] : inmode[j]);
2502@end example
2503
2504Instead, use extra parentheses so that the indentation shows the nesting:
2505
2506@example
2507mode = ((inmode[j] == VOIDmode
2508         || (GET_MODE_SIZE (outmode[j]) > GET_MODE_SIZE (inmode[j])))
2509        ? outmode[j] : inmode[j]);
2510@end example
2511
2512Insert extra parentheses so that Emacs will indent the code properly.
2513For example, the following indentation looks nice if you do it by hand,
2514
2515@example
2516v = rup->ru_utime.tv_sec*1000 + rup->ru_utime.tv_usec/1000
2517    + rup->ru_stime.tv_sec*1000 + rup->ru_stime.tv_usec/1000;
2518@end example
2519
2520@noindent
2521but Emacs would alter it.  Adding a set of parentheses produces
2522something that looks equally nice, and which Emacs will preserve:
2523
2524@example
2525v = (rup->ru_utime.tv_sec*1000 + rup->ru_utime.tv_usec/1000
2526     + rup->ru_stime.tv_sec*1000 + rup->ru_stime.tv_usec/1000);
2527@end example
2528
2529Format do-while statements like this:
2530
2531@example
2532do
2533  @{
2534    a = foo (a);
2535  @}
2536while (a > 0);
2537@end example
2538
2539@cindex formfeed
2540@cindex control-L
2541Please use formfeed characters (control-L) to divide the program into
2542pages at logical places (but not within a function).  It does not matter
2543just how long the pages are, since they do not have to fit on a printed
2544page.  The formfeeds should appear alone on lines by themselves.
2545
2546@node Comments
2547@section Commenting Your Work
2548@cindex commenting
2549
2550Every program should start with a comment saying briefly what it is for.
2551Example: @samp{fmt - filter for simple filling of text}.  This comment
2552should be at the top of the source file containing the @samp{main}
2553function of the program.
2554
2555Also, please write a brief comment at the start of each source file,
2556with the file name and a line or two about the overall purpose of the
2557file.
2558
2559Please write the comments in a GNU program in English, because English
2560is the one language that nearly all programmers in all countries can
2561read.  If you do not write English well, please write comments in
2562English as well as you can, then ask other people to help rewrite them.
2563If you can't write comments in English, please find someone to work with
2564you and translate your comments into English.
2565
2566Please put a comment on each function saying what the function does,
2567what sorts of arguments it gets, and what the possible values of
2568arguments mean and are used for.  It is not necessary to duplicate in
2569words the meaning of the C argument declarations, if a C type is being
2570used in its customary fashion.  If there is anything nonstandard about
2571its use (such as an argument of type @code{char *} which is really the
2572address of the second character of a string, not the first), or any
2573possible values that would not work the way one would expect (such as,
2574that strings containing newlines are not guaranteed to work), be sure
2575to say so.
2576
2577Also explain the significance of the return value, if there is one.
2578
2579Please put two spaces after the end of a sentence in your comments, so
2580that the Emacs sentence commands will work.  Also, please write
2581complete sentences and capitalize the first word.  If a lower-case
2582identifier comes at the beginning of a sentence, don't capitalize it!
2583Changing the spelling makes it a different identifier.  If you don't
2584like starting a sentence with a lower case letter, write the sentence
2585differently (e.g., ``The identifier lower-case is @dots{}'').
2586
2587The comment on a function is much clearer if you use the argument
2588names to speak about the argument values.  The variable name itself
2589should be lower case, but write it in upper case when you are speaking
2590about the value rather than the variable itself.  Thus, ``the inode
2591number NODE_NUM'' rather than ``an inode''.
2592
2593There is usually no purpose in restating the name of the function in
2594the comment before it, because readers can see that for themselves.
2595There might be an exception when the comment is so long that the function
2596itself would be off the bottom of the screen.
2597
2598There should be a comment on each static variable as well, like this:
2599
2600@example
2601/* Nonzero means truncate lines in the display;
2602   zero means continue them.  */
2603int truncate_lines;
2604@end example
2605
2606@cindex conditionals, comments for
2607@cindex @code{#endif}, commenting
2608Every @samp{#endif} should have a comment, except in the case of short
2609conditionals (just a few lines) that are not nested.  The comment should
2610state the condition of the conditional that is ending, @emph{including
2611its sense}.  @samp{#else} should have a comment describing the condition
2612@emph{and sense} of the code that follows.  For example:
2613
2614@example
2615@group
2616#ifdef foo
2617  @dots{}
2618#else /* not foo */
2619  @dots{}
2620#endif /* not foo */
2621@end group
2622@group
2623#ifdef foo
2624  @dots{}
2625#endif /* foo */
2626@end group
2627@end example
2628
2629@noindent
2630but, by contrast, write the comments this way for a @samp{#ifndef}:
2631
2632@example
2633@group
2634#ifndef foo
2635  @dots{}
2636#else /* foo */
2637  @dots{}
2638#endif /* foo */
2639@end group
2640@group
2641#ifndef foo
2642  @dots{}
2643#endif /* not foo */
2644@end group
2645@end example
2646
2647@node Syntactic Conventions
2648@section Clean Use of C Constructs
2649@cindex syntactic conventions
2650
2651@cindex implicit @code{int}
2652@cindex function argument, declaring
2653Please explicitly declare the types of all objects.  For example, you
2654should explicitly declare all arguments to functions, and you should
2655declare functions to return @code{int} rather than omitting the
2656@code{int}.
2657
2658@cindex compiler warnings
2659@cindex @samp{-Wall} compiler option
2660Some programmers like to use the GCC @samp{-Wall} option, and change the
2661code whenever it issues a warning.  If you want to do this, then do.
2662Other programmers prefer not to use @samp{-Wall}, because it gives
2663warnings for valid and legitimate code which they do not want to change.
2664If you want to do this, then do.  The compiler should be your servant,
2665not your master.
2666
2667@pindex clang
2668@pindex lint
2669Don't make the program ugly just to placate static analysis tools such
2670as @command{lint}, @command{clang}, and GCC with extra warnings
2671options such as @option{-Wconversion} and @option{-Wundef}.  These
2672tools can help find bugs and unclear code, but they can also generate
2673so many false alarms that it hurts readability to silence them with
2674unnecessary casts, wrappers, and other complications.  For example,
2675please don't insert casts to @code{void} or calls to do-nothing
2676functions merely to pacify a lint checker.
2677
2678Declarations of external functions and functions to appear later in the
2679source file should all go in one place near the beginning of the file
2680(somewhere before the first function definition in the file), or else
2681should go in a header file.  Don't put @code{extern} declarations inside
2682functions.
2683
2684@cindex temporary variables
2685It used to be common practice to use the same local variables (with
2686names like @code{tem}) over and over for different values within one
2687function.  Instead of doing this, it is better to declare a separate local
2688variable for each distinct purpose, and give it a name which is
2689meaningful.  This not only makes programs easier to understand, it also
2690facilitates optimization by good compilers.  You can also move the
2691declaration of each local variable into the smallest scope that includes
2692all its uses.  This makes the program even cleaner.
2693
2694Don't use local variables or parameters that shadow global identifiers.
2695GCC's @samp{-Wshadow} option can detect this problem.
2696
2697@cindex multiple variables in a line
2698Don't declare multiple variables in one declaration that spans lines.
2699Start a new declaration on each line, instead.  For example, instead
2700of this:
2701
2702@example
2703@group
2704int    foo,
2705       bar;
2706@end group
2707@end example
2708
2709@noindent
2710write either this:
2711
2712@example
2713int foo, bar;
2714@end example
2715
2716@noindent
2717or this:
2718
2719@example
2720int foo;
2721int bar;
2722@end example
2723
2724@noindent
2725(If they are global variables, each should have a comment preceding it
2726anyway.)
2727
2728When you have an @code{if}-@code{else} statement nested in another
2729@code{if} statement, always put braces around the @code{if}-@code{else}.
2730Thus, never write like this:
2731
2732@example
2733if (foo)
2734  if (bar)
2735    win ();
2736  else
2737    lose ();
2738@end example
2739
2740@noindent
2741always like this:
2742
2743@example
2744if (foo)
2745  @{
2746    if (bar)
2747      win ();
2748    else
2749      lose ();
2750  @}
2751@end example
2752
2753If you have an @code{if} statement nested inside of an @code{else}
2754statement, either write @code{else if} on one line, like this,
2755
2756@example
2757if (foo)
2758  @dots{}
2759else if (bar)
2760  @dots{}
2761@end example
2762
2763@noindent
2764with its @code{then}-part indented like the preceding @code{then}-part,
2765or write the nested @code{if} within braces like this:
2766
2767@example
2768if (foo)
2769  @dots{}
2770else
2771  @{
2772    if (bar)
2773      @dots{}
2774  @}
2775@end example
2776
2777Don't declare both a structure tag and variables or typedefs in the
2778same declaration.  Instead, declare the structure tag separately
2779and then use it to declare the variables or typedefs.
2780
2781Try to avoid assignments inside @code{if}-conditions (assignments
2782inside @code{while}-conditions are ok).  For example, don't write
2783this:
2784
2785@example
2786if ((foo = (char *) malloc (sizeof *foo)) == 0)
2787  fatal ("virtual memory exhausted");
2788@end example
2789
2790@noindent
2791instead, write this:
2792
2793@example
2794foo = (char *) malloc (sizeof *foo);
2795if (foo == 0)
2796  fatal ("virtual memory exhausted");
2797@end example
2798
2799This example uses zero without a cast as a null pointer constant.
2800This is perfectly fine, except that a cast is needed when calling a
2801varargs function or when using @code{sizeof}.
2802
2803@node Names
2804@section Naming Variables, Functions, and Files
2805
2806@cindex names of variables, functions, and files
2807The names of global variables and functions in a program serve as
2808comments of a sort.  So don't choose terse names---instead, look for
2809names that give useful information about the meaning of the variable or
2810function.  In a GNU program, names should be English, like other
2811comments.
2812
2813Local variable names can be shorter, because they are used only within
2814one context, where (presumably) comments explain their purpose.
2815
2816Try to limit your use of abbreviations in symbol names.  It is ok to
2817make a few abbreviations, explain what they mean, and then use them
2818frequently, but don't use lots of obscure abbreviations.
2819
2820Please use underscores to separate words in a name, so that the Emacs
2821word commands can be useful within them.  Stick to lower case; reserve
2822upper case for macros and @code{enum} constants, and for name-prefixes
2823that follow a uniform convention.
2824
2825For example, you should use names like @code{ignore_space_change_flag};
2826don't use names like @code{iCantReadThis}.
2827
2828Variables that indicate whether command-line options have been
2829specified should be named after the meaning of the option, not after
2830the option-letter.  A comment should state both the exact meaning of
2831the option and its letter.  For example,
2832
2833@example
2834@group
2835/* Ignore changes in horizontal whitespace (-b).  */
2836int ignore_space_change_flag;
2837@end group
2838@end example
2839
2840When you want to define names with constant integer values, use
2841@code{enum} rather than @samp{#define}.  GDB knows about enumeration
2842constants.
2843
2844@cindex file-name limitations
2845@pindex doschk
2846You might want to make sure that none of the file names would conflict
2847if the files were loaded onto an MS-DOS file system which shortens the
2848names.  You can use the program @code{doschk} to test for this.
2849
2850Some GNU programs were designed to limit themselves to file names of 14
2851characters or less, to avoid file name conflicts if they are read into
2852older System V systems.  Please preserve this feature in the existing
2853GNU programs that have it, but there is no need to do this in new GNU
2854programs.  @code{doschk} also reports file names longer than 14
2855characters.
2856
2857
2858@node System Portability
2859@section Portability between System Types
2860@cindex portability, between system types
2861
2862In the Unix world, ``portability'' refers to porting to different Unix
2863versions.  For a GNU program, this kind of portability is desirable, but
2864not paramount.
2865
2866The primary purpose of GNU software is to run on top of the GNU kernel,
2867compiled with the GNU C compiler, on various types of @sc{cpu}.  So the
2868kinds of portability that are absolutely necessary are quite limited.
2869But it is important to support Linux-based GNU systems, since they
2870are the form of GNU that is popular.
2871
2872Beyond that, it is good to support the other free operating systems
2873(*BSD), and it is nice to support other Unix-like systems if you want
2874to.  Supporting a variety of Unix-like systems is desirable, although
2875not paramount.  It is usually not too hard, so you may as well do it.
2876But you don't have to consider it an obligation, if it does turn out to
2877be hard.
2878
2879@pindex autoconf
2880The easiest way to achieve portability to most Unix-like systems is to
2881use Autoconf.  It's unlikely that your program needs to know more
2882information about the host platform than Autoconf can provide, simply
2883because most of the programs that need such knowledge have already been
2884written.
2885
2886Avoid using the format of semi-internal data bases (e.g., directories)
2887when there is a higher-level alternative (@code{readdir}).
2888
2889@cindex non-@sc{posix} systems, and portability
2890As for systems that are not like Unix, such as MSDOS, Windows, VMS, MVS,
2891and older Macintosh systems, supporting them is often a lot of work.
2892When that is the case, it is better to spend your time adding features
2893that will be useful on GNU and GNU/Linux, rather than on supporting
2894other incompatible systems.
2895
2896If you do support Windows, please do not abbreviate it as ``win''.  In
2897hacker terminology, calling something a ``win'' is a form of praise.
2898You're free to praise Microsoft Windows on your own if you want, but
2899please don't do this in GNU packages.  Instead of abbreviating
2900``Windows'' to ``win'', you can write it in full or abbreviate it to
2901``woe'' or ``w''.  In GNU Emacs, for instance, we use @samp{w32} in
2902file names of Windows-specific files, but the macro for Windows
2903conditionals is called @code{WINDOWSNT}.
2904
2905It is a good idea to define the ``feature test macro''
2906@code{_GNU_SOURCE} when compiling your C files.  When you compile on GNU
2907or GNU/Linux, this will enable the declarations of GNU library extension
2908functions, and that will usually give you a compiler error message if
2909you define the same function names in some other way in your program.
2910(You don't have to actually @emph{use} these functions, if you prefer
2911to make the program more portable to other systems.)
2912
2913But whether or not you use these GNU extensions, you should avoid
2914using their names for any other meanings.  Doing so would make it hard
2915to move your code into other GNU programs.
2916
2917@node CPU Portability
2918@section Portability between @sc{cpu}s
2919
2920@cindex data types, and portability
2921@cindex portability, and data types
2922Even GNU systems will differ because of differences among @sc{cpu}
2923types---for example, difference in byte ordering and alignment
2924requirements.  It is absolutely essential to handle these differences.
2925However, don't make any effort to cater to the possibility that an
2926@code{int} will be less than 32 bits.  We don't support 16-bit machines
2927in GNU.
2928
2929Similarly, don't make any effort to cater to the possibility that
2930@code{long} will be smaller than predefined types like @code{size_t}.
2931For example, the following code is ok:
2932
2933@example
2934printf ("size = %lu\n", (unsigned long) sizeof array);
2935printf ("diff = %ld\n", (long) (pointer2 - pointer1));
2936@end example
2937
29381989 Standard C requires this to work, and we know of only one
2939counterexample: 64-bit programs on Microsoft Windows.  We will leave
2940it to those who want to port GNU programs to that environment to
2941figure out how to do it.
2942
2943Predefined file-size types like @code{off_t} are an exception: they are
2944longer than @code{long} on many platforms, so code like the above won't
2945work with them.  One way to print an @code{off_t} value portably is to
2946print its digits yourself, one by one.
2947
2948Don't assume that the address of an @code{int} object is also the
2949address of its least-significant byte.  This is false on big-endian
2950machines.  Thus, don't make the following mistake:
2951
2952@example
2953int c;
2954@dots{}
2955while ((c = getchar ()) != EOF)
2956  write (file_descriptor, &c, 1);
2957@end example
2958
2959@noindent Instead, use @code{unsigned char} as follows.  (The @code{unsigned}
2960is for portability to unusual systems where @code{char} is signed and
2961where there is integer overflow checking.)
2962
2963@example
2964int c;
2965while ((c = getchar ()) != EOF)
2966  @{
2967    unsigned char u = c;
2968    write (file_descriptor, &u, 1);
2969  @}
2970@end example
2971
2972@cindex casting pointers to integers
2973Avoid casting pointers to integers if you can.  Such casts greatly
2974reduce portability, and in most programs they are easy to avoid.  In the
2975cases where casting pointers to integers is essential---such as, a Lisp
2976interpreter which stores type information as well as an address in one
2977word---you'll have to make explicit provisions to handle different word
2978sizes.  You will also need to make provision for systems in which the
2979normal range of addresses you can get from @code{malloc} starts far away
2980from zero.
2981
2982
2983@node System Functions
2984@section Calling System Functions
2985
2986@cindex C library functions, and portability
2987@cindex POSIX functions, and portability
2988@cindex library functions, and portability
2989@cindex portability, and library functions
2990
2991Historically, C implementations differed substantially, and many
2992systems lacked a full implementation of ANSI/ISO C89.  Nowadays,
2993however, very few systems lack a C89 compiler and GNU C supports
2994almost all of C99.  Similarly, most systems implement POSIX.1-1993
2995libraries and tools, and many have POSIX.1-2001.
2996
2997Hence, there is little reason to support old C or non-POSIX systems,
2998and you may want to take advantage of C99 and POSIX-1.2001 to write
2999clearer, more portable, or faster code.  You should use standard
3000interfaces where possible; but if GNU extensions make your program
3001more maintainable, powerful, or otherwise better, don't hesitate to
3002use them.  In any case, don't make your own declaration of system
3003functions; that's a recipe for conflict.
3004
3005Despite the standards, nearly every library function has some sort of
3006portability issue on some system or another.  Here are some examples:
3007
3008@table @code
3009@item open
3010Names with trailing @code{/}'s are mishandled on many platforms.
3011
3012@item printf
3013@code{long double} may be unimplemented; floating values Infinity and
3014NaN are often mishandled; output for large precisions may be
3015incorrect.
3016
3017@item readlink
3018May return @code{int} instead of @code{ssize_t}.
3019
3020@item scanf
3021On Windows, @code{errno} is not set on failure.
3022@end table
3023
3024@cindex Gnulib
3025@uref{http://www.gnu.org/software/gnulib/, Gnulib} is a big help in
3026this regard.  Gnulib provides implementations of standard interfaces
3027on many of the systems that lack them, including portable
3028implementations of enhanced GNU interfaces, thereby making their use
3029portable, and of POSIX-1.2008 interfaces, some of which are missing
3030even on up-to-date GNU systems.
3031
3032@findex xmalloc, in Gnulib
3033@findex error messages, in Gnulib
3034@findex data structures, in Gnulib
3035Gnulib also provides many useful non-standard interfaces; for example,
3036C implementations of standard data structures (hash tables, binary
3037trees), error-checking type-safe wrappers for memory allocation
3038functions (@code{xmalloc}, @code{xrealloc}), and output of error
3039messages.
3040
3041Gnulib integrates with GNU Autoconf and Automake to remove much of the
3042burden of writing portable code from the programmer: Gnulib makes your
3043configure script automatically determine what features are missing and
3044use the Gnulib code to supply the missing pieces.
3045
3046The Gnulib and Autoconf manuals have extensive sections on
3047portability: @ref{Top,, Introduction, gnulib, Gnulib} and
3048@pxref{Portable C and C++,,, autoconf, Autoconf}.  Please consult them
3049for many more details.
3050
3051
3052@node Internationalization
3053@section Internationalization
3054@cindex internationalization
3055
3056@pindex gettext
3057GNU has a library called GNU gettext that makes it easy to translate the
3058messages in a program into various languages.  You should use this
3059library in every program.  Use English for the messages as they appear
3060in the program, and let gettext provide the way to translate them into
3061other languages.
3062
3063Using GNU gettext involves putting a call to the @code{gettext} macro
3064around each string that might need translation---like this:
3065
3066@example
3067printf (gettext ("Processing file '%s'..."), file);
3068@end example
3069
3070@noindent
3071This permits GNU gettext to replace the string @code{"Processing file
3072'%s'..."} with a translated version.
3073
3074Once a program uses gettext, please make a point of writing calls to
3075@code{gettext} when you add new strings that call for translation.
3076
3077Using GNU gettext in a package involves specifying a @dfn{text domain
3078name} for the package.  The text domain name is used to separate the
3079translations for this package from the translations for other packages.
3080Normally, the text domain name should be the same as the name of the
3081package---for example, @samp{coreutils} for the GNU core utilities.
3082
3083@cindex message text, and internationalization
3084To enable gettext to work well, avoid writing code that makes
3085assumptions about the structure of words or sentences.  When you want
3086the precise text of a sentence to vary depending on the data, use two or
3087more alternative string constants each containing a complete sentences,
3088rather than inserting conditionalized words or phrases into a single
3089sentence framework.
3090
3091Here is an example of what not to do:
3092
3093@smallexample
3094printf ("%s is full", capacity > 5000000 ? "disk" : "floppy disk");
3095@end smallexample
3096
3097If you apply gettext to all strings, like this,
3098
3099@smallexample
3100printf (gettext ("%s is full"),
3101        capacity > 5000000 ? gettext ("disk") : gettext ("floppy disk"));
3102@end smallexample
3103
3104@noindent
3105the translator will hardly know that "disk" and "floppy disk" are meant to
3106be substituted in the other string.  Worse, in some languages (like French)
3107the construction will not work: the translation of the word "full" depends
3108on the gender of the first part of the sentence; it happens to be not the
3109same for "disk" as for "floppy disk".
3110
3111Complete sentences can be translated without problems:
3112
3113@example
3114printf (capacity > 5000000 ? gettext ("disk is full")
3115        : gettext ("floppy disk is full"));
3116@end example
3117
3118A similar problem appears at the level of sentence structure with this
3119code:
3120
3121@example
3122printf ("#  Implicit rule search has%s been done.\n",
3123        f->tried_implicit ? "" : " not");
3124@end example
3125
3126@noindent
3127Adding @code{gettext} calls to this code cannot give correct results for
3128all languages, because negation in some languages requires adding words
3129at more than one place in the sentence.  By contrast, adding
3130@code{gettext} calls does the job straightforwardly if the code starts
3131out like this:
3132
3133@example
3134printf (f->tried_implicit
3135        ? "#  Implicit rule search has been done.\n",
3136        : "#  Implicit rule search has not been done.\n");
3137@end example
3138
3139Another example is this one:
3140
3141@example
3142printf ("%d file%s processed", nfiles,
3143        nfiles != 1 ? "s" : "");
3144@end example
3145
3146@noindent
3147The problem with this example is that it assumes that plurals are made
3148by adding `s'.  If you apply gettext to the format string, like this,
3149
3150@example
3151printf (gettext ("%d file%s processed"), nfiles,
3152        nfiles != 1 ? "s" : "");
3153@end example
3154
3155@noindent
3156the message can use different words, but it will still be forced to use
3157`s' for the plural.  Here is a better way, with gettext being applied to
3158the two strings independently:
3159
3160@example
3161printf ((nfiles != 1 ? gettext ("%d files processed")
3162         : gettext ("%d file processed")),
3163        nfiles);
3164@end example
3165
3166@noindent
3167But this still doesn't work for languages like Polish, which has three
3168plural forms: one for nfiles == 1, one for nfiles == 2, 3, 4, 22, 23, 24, ...
3169and one for the rest.  The GNU @code{ngettext} function solves this problem:
3170
3171@example
3172printf (ngettext ("%d files processed", "%d file processed", nfiles),
3173        nfiles);
3174@end example
3175
3176
3177@node Character Set
3178@section Character Set
3179@cindex character set
3180@cindex encodings
3181@cindex ASCII characters
3182@cindex non-ASCII characters
3183
3184Sticking to the ASCII character set (plain text, 7-bit characters) is
3185preferred in GNU source code comments, text documents, and other
3186contexts, unless there is good reason to do something else because of
3187the application domain.  For example, if source code deals with the
3188French Revolutionary calendar, it is OK if its literal strings contain
3189accented characters in month names like ``Flor@'eal''.  Also, it is OK
3190(but not required) to use non-ASCII characters to represent proper
3191names of contributors in change logs (@pxref{Change Logs}).
3192
3193If you need to use non-ASCII characters, you should normally stick
3194with one encoding, certainly within a single file.  UTF-8 is likely to
3195be the best choice.
3196
3197
3198@node Quote Characters
3199@section Quote Characters
3200@cindex quote characters
3201@cindex locale-specific quote characters
3202@cindex left quote
3203@cindex right quote
3204@cindex opening quote
3205@cindex single quote
3206@cindex double quote
3207@cindex grave accent
3208@set txicodequoteundirected
3209@set txicodequotebacktick
3210
3211In the C locale, the output of GNU programs should stick to plain
3212ASCII for quotation characters in messages to users: preferably 0x22
3213(@samp{"}) or 0x27 (@samp{'}) for both opening and closing quotes.
3214Although GNU programs traditionally used 0x60 (@samp{`}) for opening
3215and 0x27 (@samp{'}) for closing quotes, nowadays quotes @samp{`like
3216this'} are typically rendered asymmetrically, so quoting @samp{"like
3217this"} or @samp{'like this'} typically looks better.
3218
3219It is ok, but not required, for GNU programs to generate
3220locale-specific quotes in non-C locales.  For example:
3221
3222@example
3223printf (gettext ("Processing file '%s'..."), file);
3224@end example
3225
3226@noindent
3227Here, a French translation might cause @code{gettext} to return the
3228string @code{"Traitement de fichier
3229@guilsinglleft{}@tie{}%s@tie{}@guilsinglright{}..."}, yielding quotes
3230more appropriate for a French locale.
3231
3232Sometimes a program may need to use opening and closing quotes
3233directly.  By convention, @code{gettext} translates the string
3234@samp{"`"} to the opening quote and the string @samp{"'"} to the
3235closing quote, and a program can use these translations.  Generally,
3236though, it is better to translate quote characters in the context of
3237longer strings.
3238
3239If the output of your program is ever likely to be parsed by another
3240program, it is good to provide an option that makes this parsing
3241reliable.  For example, you could escape special characters using
3242conventions from the C language or the Bourne shell.  See for example
3243the option @option{--quoting-style} of GNU @code{ls}.
3244
3245@clear txicodequoteundirected
3246@clear txicodequotebacktick
3247
3248
3249@node Mmap
3250@section Mmap
3251@findex mmap
3252
3253Don't assume that @code{mmap} either works on all files or fails
3254for all files.  It may work on some files and fail on others.
3255
3256The proper way to use @code{mmap} is to try it on the specific file for
3257which you want to use it---and if @code{mmap} doesn't work, fall back on
3258doing the job in another way using @code{read} and @code{write}.
3259
3260The reason this precaution is needed is that the GNU kernel (the HURD)
3261provides a user-extensible file system, in which there can be many
3262different kinds of ``ordinary files''.  Many of them support
3263@code{mmap}, but some do not.  It is important to make programs handle
3264all these kinds of files.
3265
3266
3267@node Documentation
3268@chapter Documenting Programs
3269@cindex documentation
3270
3271A GNU program should ideally come with full free documentation, adequate
3272for both reference and tutorial purposes.  If the package can be
3273programmed or extended, the documentation should cover programming or
3274extending it, as well as just using it.
3275
3276@menu
3277* GNU Manuals::                 Writing proper manuals.
3278* Doc Strings and Manuals::     Compiling doc strings doesn't make a manual.
3279* Manual Structure Details::    Specific structure conventions.
3280* License for Manuals::         Writing the distribution terms for a manual.
3281* Manual Credits::              Giving credit to documentation contributors.
3282* Printed Manuals::             Mentioning the printed manual.
3283* NEWS File::                   NEWS files supplement manuals.
3284* Change Logs::                 Recording changes.
3285* Man Pages::                   Man pages are secondary.
3286* Reading other Manuals::       How far you can go in learning
3287                                from other manuals.
3288@end menu
3289
3290@node GNU Manuals
3291@section GNU Manuals
3292
3293The preferred document format for the GNU system is the Texinfo
3294formatting language.  Every GNU package should (ideally) have
3295documentation in Texinfo both for reference and for learners.  Texinfo
3296makes it possible to produce a good quality formatted book, using
3297@TeX{}, and to generate an Info file.  It is also possible to generate
3298HTML output from Texinfo source.  See the Texinfo manual, either the
3299hardcopy, or the on-line version available through @code{info} or the
3300Emacs Info subsystem (@kbd{C-h i}).
3301
3302Nowadays some other formats such as Docbook and Sgmltexi can be
3303converted automatically into Texinfo.  It is ok to produce the Texinfo
3304documentation by conversion this way, as long as it gives good results.
3305
3306Make sure your manual is clear to a reader who knows nothing about the
3307topic and reads it straight through.  This means covering basic topics
3308at the beginning, and advanced topics only later.  This also means
3309defining every specialized term when it is first used.
3310
3311Programmers tend to carry over the structure of the program as the
3312structure for its documentation.  But this structure is not
3313necessarily good for explaining how to use the program; it may be
3314irrelevant and confusing for a user.
3315
3316Instead, the right way to structure documentation is according to the
3317concepts and questions that a user will have in mind when reading it.
3318This principle applies at every level, from the lowest (ordering
3319sentences in a paragraph) to the highest (ordering of chapter topics
3320within the manual).  Sometimes this structure of ideas matches the
3321structure of the implementation of the software being documented---but
3322often they are different.  An important part of learning to write good
3323documentation is to learn to notice when you have unthinkingly
3324structured the documentation like the implementation, stop yourself,
3325and look for better alternatives.
3326
3327For example, each program in the GNU system probably ought to be
3328documented in one manual; but this does not mean each program should
3329have its own manual.  That would be following the structure of the
3330implementation, rather than the structure that helps the user
3331understand.
3332
3333Instead, each manual should cover a coherent @emph{topic}.  For example,
3334instead of a manual for @code{diff} and a manual for @code{diff3}, we
3335have one manual for ``comparison of files'' which covers both of those
3336programs, as well as @code{cmp}.  By documenting these programs
3337together, we can make the whole subject clearer.
3338
3339The manual which discusses a program should certainly document all of
3340the program's command-line options and all of its commands.  It should
3341give examples of their use.  But don't organize the manual as a list
3342of features.  Instead, organize it logically, by subtopics.  Address
3343the questions that a user will ask when thinking about the job that
3344the program does.  Don't just tell the reader what each feature can
3345do---say what jobs it is good for, and show how to use it for those
3346jobs.  Explain what is recommended usage, and what kinds of usage
3347users should avoid.
3348
3349In general, a GNU manual should serve both as tutorial and reference.
3350It should be set up for convenient access to each topic through Info,
3351and for reading straight through (appendixes aside).  A GNU manual
3352should give a good introduction to a beginner reading through from the
3353start, and should also provide all the details that hackers want.
3354The Bison manual is a good example of this---please take a look at it
3355to see what we mean.
3356
3357That is not as hard as it first sounds.  Arrange each chapter as a
3358logical breakdown of its topic, but order the sections, and write their
3359text, so that reading the chapter straight through makes sense.  Do
3360likewise when structuring the book into chapters, and when structuring a
3361section into paragraphs.  The watchword is, @emph{at each point, address
3362the most fundamental and important issue raised by the preceding text.}
3363
3364If necessary, add extra chapters at the beginning of the manual which
3365are purely tutorial and cover the basics of the subject.  These provide
3366the framework for a beginner to understand the rest of the manual.  The
3367Bison manual provides a good example of how to do this.
3368
3369To serve as a reference, a manual should have an Index that list all the
3370functions, variables, options, and important concepts that are part of
3371the program.  One combined Index should do for a short manual, but
3372sometimes for a complex package it is better to use multiple indices.
3373The Texinfo manual includes advice on preparing good index entries, see
3374@ref{Index Entries, , Making Index Entries, texinfo, GNU Texinfo}, and
3375see @ref{Indexing Commands, , Defining the Entries of an
3376Index, texinfo, GNU Texinfo}.
3377
3378Don't use Unix man pages as a model for how to write GNU documentation;
3379most of them are terse, badly structured, and give inadequate
3380explanation of the underlying concepts.  (There are, of course, some
3381exceptions.)  Also, Unix man pages use a particular format which is
3382different from what we use in GNU manuals.
3383
3384Please include an email address in the manual for where to report
3385bugs @emph{in the text of the manual}.
3386
3387Please do not use the term ``pathname'' that is used in Unix
3388documentation; use ``file name'' (two words) instead.  We use the term
3389``path'' only for search paths, which are lists of directory names.
3390
3391Please do not use the term ``illegal'' to refer to erroneous input to
3392a computer program.  Please use ``invalid'' for this, and reserve the
3393term ``illegal'' for activities prohibited by law.
3394
3395Please do not write @samp{()} after a function name just to indicate
3396it is a function.  @code{foo ()} is not a function, it is a function
3397call with no arguments.
3398
3399@node Doc Strings and Manuals
3400@section Doc Strings and Manuals
3401
3402Some programming systems, such as Emacs, provide a documentation string
3403for each function, command or variable.  You may be tempted to write a
3404reference manual by compiling the documentation strings and writing a
3405little additional text to go around them---but you must not do it.  That
3406approach is a fundamental mistake.  The text of well-written
3407documentation strings will be entirely wrong for a manual.
3408
3409A documentation string needs to stand alone---when it appears on the
3410screen, there will be no other text to introduce or explain it.
3411Meanwhile, it can be rather informal in style.
3412
3413The text describing a function or variable in a manual must not stand
3414alone; it appears in the context of a section or subsection.  Other text
3415at the beginning of the section should explain some of the concepts, and
3416should often make some general points that apply to several functions or
3417variables.  The previous descriptions of functions and variables in the
3418section will also have given information about the topic.  A description
3419written to stand alone would repeat some of that information; this
3420redundancy looks bad.  Meanwhile, the informality that is acceptable in
3421a documentation string is totally unacceptable in a manual.
3422
3423The only good way to use documentation strings in writing a good manual
3424is to use them as a source of information for writing good text.
3425
3426@node Manual Structure Details
3427@section Manual Structure Details
3428@cindex manual structure
3429
3430The title page of the manual should state the version of the programs or
3431packages documented in the manual.  The Top node of the manual should
3432also contain this information.  If the manual is changing more
3433frequently than or independent of the program, also state a version
3434number for the manual in both of these places.
3435
3436Each program documented in the manual should have a node named
3437@samp{@var{program} Invocation} or @samp{Invoking @var{program}}.  This
3438node (together with its subnodes, if any) should describe the program's
3439command line arguments and how to run it (the sort of information people
3440would look for in a man page).  Start with an @samp{@@example}
3441containing a template for all the options and arguments that the program
3442uses.
3443
3444Alternatively, put a menu item in some menu whose item name fits one of
3445the above patterns.  This identifies the node which that item points to
3446as the node for this purpose, regardless of the node's actual name.
3447
3448The @samp{--usage} feature of the Info reader looks for such a node
3449or menu item in order to find the relevant text, so it is essential
3450for every Texinfo file to have one.
3451
3452If one manual describes several programs, it should have such a node for
3453each program described in the manual.
3454
3455@node License for Manuals
3456@section License for Manuals
3457@cindex license for manuals
3458
3459Please use the GNU Free Documentation License for all GNU manuals that
3460are more than a few pages long.  Likewise for a collection of short
3461documents---you only need one copy of the GNU FDL for the whole
3462collection.  For a single short document, you can use a very permissive
3463non-copyleft license, to avoid taking up space with a long license.
3464
3465See @uref{http://www.gnu.org/copyleft/fdl-howto.html} for more explanation
3466of how to employ the GFDL.
3467
3468Note that it is not obligatory to include a copy of the GNU GPL or GNU
3469LGPL in a manual whose license is neither the GPL nor the LGPL.  It can
3470be a good idea to include the program's license in a large manual; in a
3471short manual, whose size would be increased considerably by including
3472the program's license, it is probably better not to include it.
3473
3474@node Manual Credits
3475@section Manual Credits
3476@cindex credits for manuals
3477
3478Please credit the principal human writers of the manual as the authors,
3479on the title page of the manual.  If a company sponsored the work, thank
3480the company in a suitable place in the manual, but do not cite the
3481company as an author.
3482
3483@node Printed Manuals
3484@section Printed Manuals
3485
3486The FSF publishes some GNU manuals in printed form.  To encourage sales
3487of these manuals, the on-line versions of the manual should mention at
3488the very start that the printed manual is available and should point at
3489information for getting it---for instance, with a link to the page
3490@url{http://www.gnu.org/order/order.html}.  This should not be included
3491in the printed manual, though, because there it is redundant.
3492
3493It is also useful to explain in the on-line forms of the manual how the
3494user can print out the manual from the sources.
3495
3496@node NEWS File
3497@section The NEWS File
3498@cindex @file{NEWS} file
3499
3500In addition to its manual, the package should have a file named
3501@file{NEWS} which contains a list of user-visible changes worth
3502mentioning.  In each new release, add items to the front of the file and
3503identify the version they pertain to.  Don't discard old items; leave
3504them in the file after the newer items.  This way, a user upgrading from
3505any previous version can see what is new.
3506
3507If the @file{NEWS} file gets very long, move some of the older items
3508into a file named @file{ONEWS} and put a note at the end referring the
3509user to that file.
3510
3511@node Change Logs
3512@section Change Logs
3513@cindex change logs
3514
3515Keep a change log to describe all the changes made to program source
3516files.  The purpose of this is so that people investigating bugs in the
3517future will know about the changes that might have introduced the bug.
3518Often a new bug can be found by looking at what was recently changed.
3519More importantly, change logs can help you eliminate conceptual
3520inconsistencies between different parts of a program, by giving you a
3521history of how the conflicting concepts arose and who they came from.
3522
3523@menu
3524* Change Log Concepts::
3525* Style of Change Logs::
3526* Simple Changes::
3527* Conditional Changes::
3528* Indicating the Part Changed::
3529@end menu
3530
3531@node Change Log Concepts
3532@subsection Change Log Concepts
3533
3534You can think of the change log as a conceptual ``undo list'' which
3535explains how earlier versions were different from the current version.
3536People can see the current version; they don't need the change log
3537to tell them what is in it.  What they want from a change log is a
3538clear explanation of how the earlier version differed.
3539
3540The change log file is normally called @file{ChangeLog} and covers an
3541entire directory.  Each directory can have its own change log, or a
3542directory can use the change log of its parent directory---it's up to
3543you.
3544
3545Another alternative is to record change log information with a version
3546control system such as RCS or CVS.  This can be converted automatically
3547to a @file{ChangeLog} file using @code{rcs2log}; in Emacs, the command
3548@kbd{C-x v a} (@code{vc-update-change-log}) does the job.
3549
3550There's no need to describe the full purpose of the changes or how
3551they work together.  However, sometimes it is useful to write one line
3552to describe the overall purpose of a change or a batch of changes.  If
3553you think that a change calls for explanation, you're probably right.
3554Please do explain it---but please put the full explanation in comments
3555in the code, where people will see it whenever they see the code.  For
3556example, ``New function'' is enough for the change log when you add a
3557function, because there should be a comment before the function
3558definition to explain what it does.
3559
3560In the past, we recommended not mentioning changes in non-software
3561files (manuals, help files, etc.) in change logs.  However, we've been
3562advised that it is a good idea to include them, for the sake of
3563copyright records.
3564
3565The easiest way to add an entry to @file{ChangeLog} is with the Emacs
3566command @kbd{M-x add-change-log-entry}.  An entry should have an
3567asterisk, the name of the changed file, and then in parentheses the name
3568of the changed functions, variables or whatever, followed by a colon.
3569Then describe the changes you made to that function or variable.
3570
3571@node Style of Change Logs
3572@subsection Style of Change Logs
3573@cindex change logs, style
3574
3575Here are some simple examples of change log entries, starting with the
3576header line that says who made the change and when it was installed,
3577followed by descriptions of specific changes.  (These examples are
3578drawn from Emacs and GCC.)
3579
3580@example
35811998-08-17  Richard Stallman  <rms@@gnu.org>
3582
3583* register.el (insert-register): Return nil.
3584(jump-to-register): Likewise.
3585
3586* sort.el (sort-subr): Return nil.
3587
3588* tex-mode.el (tex-bibtex-file, tex-file, tex-region):
3589Restart the tex shell if process is gone or stopped.
3590(tex-shell-running): New function.
3591
3592* expr.c (store_one_arg): Round size up for move_block_to_reg.
3593(expand_call): Round up when emitting USE insns.
3594* stmt.c (assign_parms): Round size up for move_block_from_reg.
3595@end example
3596
3597It's important to name the changed function or variable in full.  Don't
3598abbreviate function or variable names, and don't combine them.
3599Subsequent maintainers will often search for a function name to find all
3600the change log entries that pertain to it; if you abbreviate the name,
3601they won't find it when they search.
3602
3603For example, some people are tempted to abbreviate groups of function
3604names by writing @samp{* register.el (@{insert,jump-to@}-register)};
3605this is not a good idea, since searching for @code{jump-to-register} or
3606@code{insert-register} would not find that entry.
3607
3608Separate unrelated change log entries with blank lines.  When two
3609entries represent parts of the same change, so that they work together,
3610then don't put blank lines between them.  Then you can omit the file
3611name and the asterisk when successive entries are in the same file.
3612
3613Break long lists of function names by closing continued lines with
3614@samp{)}, rather than @samp{,}, and opening the continuation with
3615@samp{(} as in this example:
3616
3617@example
3618* keyboard.c (menu_bar_items, tool_bar_items)
3619(Fexecute_extended_command): Deal with 'keymap' property.
3620@end example
3621
3622When you install someone else's changes, put the contributor's name in
3623the change log entry rather than in the text of the entry.  In other
3624words, write this:
3625
3626@example
36272002-07-14  John Doe  <jdoe@@gnu.org>
3628
3629        * sewing.c: Make it sew.
3630@end example
3631
3632@noindent
3633rather than this:
3634
3635@example
36362002-07-14  Usual Maintainer  <usual@@gnu.org>
3637
3638        * sewing.c: Make it sew.  Patch by jdoe@@gnu.org.
3639@end example
3640
3641As for the date, that should be the date you applied the change.
3642
3643@node Simple Changes
3644@subsection Simple Changes
3645
3646Certain simple kinds of changes don't need much detail in the change
3647log.
3648
3649When you change the calling sequence of a function in a simple fashion,
3650and you change all the callers of the function to use the new calling
3651sequence, there is no need to make individual entries for all the
3652callers that you changed.  Just write in the entry for the function
3653being called, ``All callers changed''---like this:
3654
3655@example
3656* keyboard.c (Fcommand_execute): New arg SPECIAL.
3657All callers changed.
3658@end example
3659
3660When you change just comments or doc strings, it is enough to write an
3661entry for the file, without mentioning the functions.  Just ``Doc
3662fixes'' is enough for the change log.
3663
3664There's no technical need to make change log entries for documentation
3665files.  This is because documentation is not susceptible to bugs that
3666are hard to fix.  Documentation does not consist of parts that must
3667interact in a precisely engineered fashion.  To correct an error, you
3668need not know the history of the erroneous passage; it is enough to
3669compare what the documentation says with the way the program actually
3670works.
3671
3672However, you should keep change logs for documentation files when the
3673project gets copyright assignments from its contributors, so as to
3674make the records of authorship more accurate.
3675
3676@node Conditional Changes
3677@subsection Conditional Changes
3678@cindex conditional changes, and change logs
3679@cindex change logs, conditional changes
3680
3681Source files can often contain code that is conditional to build-time
3682or static conditions.  For example, C programs can contain
3683compile-time @code{#if} conditionals; programs implemented in
3684interpreted languages can contain module imports of function
3685definitions that are only performed for certain versions of the
3686interpreter; and Automake @file{Makefile.am} files can contain
3687variable definitions or target declarations that are only to be
3688considered if a configure-time Automake conditional is true.
3689
3690Many changes are conditional as well: sometimes you add a new variable,
3691or function, or even a new program or library, which is entirely
3692dependent on a build-time condition.  It is useful to indicate
3693in the change log the conditions for which a change applies.
3694
3695Our convention for indicating conditional changes is to use
3696@emph{square brackets around the name of the condition}.
3697
3698Conditional changes can happen in numerous scenarios and with many
3699variations, so here are some examples to help clarify.  This first
3700example describes changes in C, Perl, and Python files which are
3701conditional but do not have an associated function or entity name:
3702
3703@example
3704* xterm.c [SOLARIS2]: Include <string.h>.
3705* FilePath.pm [$^O eq 'VMS']: Import the VMS::Feature module.
3706* framework.py [sys.version_info < (2, 6)]: Make "with" statement
3707  available by importing it from __future__,
3708  to support also python 2.5.
3709@end example
3710
3711Our other examples will for simplicity be limited to C, as the minor
3712changes necessary to adapt them to other languages should be
3713self-evident.
3714
3715Next, here is an entry describing a new definition which is entirely
3716conditional: the C macro @code{FRAME_WINDOW_P} is defined (and used)
3717only when the macro @code{HAVE_X_WINDOWS} is defined:
3718
3719@example
3720* frame.h [HAVE_X_WINDOWS] (FRAME_WINDOW_P): Macro defined.
3721@end example
3722
3723Next, an entry for a change within the function @code{init_display},
3724whose definition as a whole is unconditional, but the changes
3725themselves are contained in a @samp{#ifdef HAVE_LIBNCURSES}
3726conditional:
3727
3728@example
3729* dispnew.c (init_display) [HAVE_LIBNCURSES]: If X, call tgetent.
3730@end example
3731
3732Finally, here is an entry for a change that takes effect only when
3733a certain macro is @emph{not} defined:
3734
3735@example
3736(gethostname) [!HAVE_SOCKETS]: Replace with winsock version.
3737@end example
3738
3739@node Indicating the Part Changed
3740@subsection Indicating the Part Changed
3741
3742Indicate the part of a function which changed by using angle brackets
3743enclosing an indication of what the changed part does.  Here is an entry
3744for a change in the part of the function @code{sh-while-getopts} that
3745deals with @code{sh} commands:
3746
3747@example
3748* progmodes/sh-script.el (sh-while-getopts) <sh>: Handle case that
3749user-specified option string is empty.
3750@end example
3751
3752
3753@node Man Pages
3754@section Man Pages
3755@cindex man pages
3756
3757In the GNU project, man pages are secondary.  It is not necessary or
3758expected for every GNU program to have a man page, but some of them do.
3759It's your choice whether to include a man page in your program.
3760
3761When you make this decision, consider that supporting a man page
3762requires continual effort each time the program is changed.  The time
3763you spend on the man page is time taken away from more useful work.
3764
3765For a simple program which changes little, updating the man page may be
3766a small job.  Then there is little reason not to include a man page, if
3767you have one.
3768
3769For a large program that changes a great deal, updating a man page may
3770be a substantial burden.  If a user offers to donate a man page, you may
3771find this gift costly to accept.  It may be better to refuse the man
3772page unless the same person agrees to take full responsibility for
3773maintaining it---so that you can wash your hands of it entirely.  If
3774this volunteer later ceases to do the job, then don't feel obliged to
3775pick it up yourself; it may be better to withdraw the man page from the
3776distribution until someone else agrees to update it.
3777
3778When a program changes only a little, you may feel that the
3779discrepancies are small enough that the man page remains useful without
3780updating.  If so, put a prominent note near the beginning of the man
3781page explaining that you don't maintain it and that the Texinfo manual
3782is more authoritative.  The note should say how to access the Texinfo
3783documentation.
3784
3785Be sure that man pages include a copyright statement and free license.
3786The simple all-permissive license is appropriate for simple man pages
3787(@pxref{License Notices for Other Files,,,maintain,Information for GNU
3788Maintainers}).
3789
3790For long man pages, with enough explanation and documentation that
3791they can be considered true manuals, use the GFDL (@pxref{License for
3792Manuals}).
3793
3794Finally, the GNU help2man program
3795(@uref{http://www.gnu.org/software/help2man/}) is one way to automate
3796generation of a man page, in this case from @option{--help} output.
3797This is sufficient in many cases.
3798
3799@node Reading other Manuals
3800@section Reading other Manuals
3801
3802There may be non-free books or documentation files that describe the
3803program you are documenting.
3804
3805It is ok to use these documents for reference, just as the author of a
3806new algebra textbook can read other books on algebra.  A large portion
3807of any non-fiction book consists of facts, in this case facts about how
3808a certain program works, and these facts are necessarily the same for
3809everyone who writes about the subject.  But be careful not to copy your
3810outline structure, wording, tables or examples from preexisting non-free
3811documentation.  Copying from free documentation may be ok; please check
3812with the FSF about the individual case.
3813
3814@node Managing Releases
3815@chapter The Release Process
3816@cindex releasing
3817
3818Making a release is more than just bundling up your source files in a
3819tar file and putting it up for FTP.  You should set up your software so
3820that it can be configured to run on a variety of systems.  Your Makefile
3821should conform to the GNU standards described below, and your directory
3822layout should also conform to the standards discussed below.  Doing so
3823makes it easy to include your package into the larger framework of
3824all GNU software.
3825
3826@menu
3827* Configuration::               How configuration of GNU packages should work.
3828* Makefile Conventions::        Makefile conventions.
3829* Releases::                    Making releases
3830@end menu
3831
3832@node Configuration
3833@section How Configuration Should Work
3834@cindex program configuration
3835
3836@pindex configure
3837Each GNU distribution should come with a shell script named
3838@code{configure}.  This script is given arguments which describe the
3839kind of machine and system you want to compile the program for.
3840The @code{configure} script must record the configuration options so
3841that they affect compilation.
3842
3843The description here is the specification of the interface for the
3844@code{configure} script in GNU packages.  Many packages implement it
3845using GNU Autoconf (@pxref{Top,, Introduction, autoconf, Autoconf})
3846and/or GNU Automake (@pxref{Top,, Introduction, automake, Automake}),
3847but you do not have to use these tools.  You can implement it any way
3848you like; for instance, by making @code{configure} be a wrapper around
3849a completely different configuration system.
3850
3851Another way for the @code{configure} script to operate is to make a
3852link from a standard name such as @file{config.h} to the proper
3853configuration file for the chosen system.  If you use this technique,
3854the distribution should @emph{not} contain a file named
3855@file{config.h}.  This is so that people won't be able to build the
3856program without configuring it first.
3857
3858Another thing that @code{configure} can do is to edit the Makefile.  If
3859you do this, the distribution should @emph{not} contain a file named
3860@file{Makefile}.  Instead, it should include a file @file{Makefile.in} which
3861contains the input used for editing.  Once again, this is so that people
3862won't be able to build the program without configuring it first.
3863
3864If @code{configure} does write the @file{Makefile}, then @file{Makefile}
3865should have a target named @file{Makefile} which causes @code{configure}
3866to be rerun, setting up the same configuration that was set up last
3867time.  The files that @code{configure} reads should be listed as
3868dependencies of @file{Makefile}.
3869
3870All the files which are output from the @code{configure} script should
3871have comments at the beginning explaining that they were generated
3872automatically using @code{configure}.  This is so that users won't think
3873of trying to edit them by hand.
3874
3875The @code{configure} script should write a file named @file{config.status}
3876which describes which configuration options were specified when the
3877program was last configured.  This file should be a shell script which,
3878if run, will recreate the same configuration.
3879
3880The @code{configure} script should accept an option of the form
3881@samp{--srcdir=@var{dirname}} to specify the directory where sources are found
3882(if it is not the current directory).  This makes it possible to build
3883the program in a separate directory, so that the actual source directory
3884is not modified.
3885
3886If the user does not specify @samp{--srcdir}, then @code{configure} should
3887check both @file{.} and @file{..} to see if it can find the sources.  If
3888it finds the sources in one of these places, it should use them from
3889there.  Otherwise, it should report that it cannot find the sources, and
3890should exit with nonzero status.
3891
3892Usually the easy way to support @samp{--srcdir} is by editing a
3893definition of @code{VPATH} into the Makefile.  Some rules may need to
3894refer explicitly to the specified source directory.  To make this
3895possible, @code{configure} can add to the Makefile a variable named
3896@code{srcdir} whose value is precisely the specified directory.
3897
3898In addition, the @samp{configure} script should take options
3899corresponding to most of the standard directory variables
3900(@pxref{Directory Variables}).  Here is the list:
3901
3902@example
3903--prefix --exec-prefix --bindir --sbindir --libexecdir --sysconfdir
3904--sharedstatedir --localstatedir --libdir --includedir --oldincludedir
3905--datarootdir --datadir --infodir --localedir --mandir --docdir
3906--htmldir --dvidir --pdfdir --psdir
3907@end example
3908
3909The @code{configure} script should also take an argument which specifies the
3910type of system to build the program for.  This argument should look like
3911this:
3912
3913@example
3914@var{cpu}-@var{company}-@var{system}
3915@end example
3916
3917For example, an Athlon-based GNU/Linux system might be
3918@samp{i686-pc-linux-gnu}.
3919
3920The @code{configure} script needs to be able to decode all plausible
3921alternatives for how to describe a machine.  Thus,
3922@samp{athlon-pc-gnu/linux} would be a valid alias.  There is a shell
3923script called
3924@uref{http://git.savannah.gnu.org/@/gitweb/@/?p=config.git;a=blob_plain;f=config.sub;hb=HEAD,
3925@file{config.sub}} that you can use as a subroutine to validate system
3926types and canonicalize aliases.
3927
3928The @code{configure} script should also take the option
3929@option{--build=@var{buildtype}}, which should be equivalent to a
3930plain @var{buildtype} argument.  For example, @samp{configure
3931--build=i686-pc-linux-gnu} is equivalent to @samp{configure
3932i686-pc-linux-gnu}.  When the build type is not specified by an option
3933or argument, the @code{configure} script should normally guess it using
3934the shell script
3935@uref{http://git.savannah.gnu.org/@/gitweb/@/?p=config.git;a=blob_plain;f=config.guess;hb=HEAD,
3936@file{config.guess}}.
3937
3938@cindex optional features, configure-time
3939Other options are permitted to specify in more detail the software
3940or hardware present on the machine, to include or exclude optional parts
3941of the package, or to adjust the name of some tools or arguments to them:
3942
3943@table @samp
3944@item --enable-@var{feature}@r{[}=@var{parameter}@r{]}
3945Configure the package to build and install an optional user-level
3946facility called @var{feature}.  This allows users to choose which
3947optional features to include.  Giving an optional @var{parameter} of
3948@samp{no} should omit @var{feature}, if it is built by default.
3949
3950No @samp{--enable} option should @strong{ever} cause one feature to
3951replace another.  No @samp{--enable} option should ever substitute one
3952useful behavior for another useful behavior.  The only proper use for
3953@samp{--enable} is for questions of whether to build part of the program
3954or exclude it.
3955
3956@item --with-@var{package}
3957@c @r{[}=@var{parameter}@r{]}
3958The package @var{package} will be installed, so configure this package
3959to work with @var{package}.
3960
3961@c  Giving an optional @var{parameter} of
3962@c @samp{no} should omit @var{package}, if it is used by default.
3963
3964Possible values of @var{package} include
3965@samp{gnu-as} (or @samp{gas}), @samp{gnu-ld}, @samp{gnu-libc},
3966@samp{gdb},
3967@samp{x},
3968and
3969@samp{x-toolkit}.
3970
3971Do not use a @samp{--with} option to specify the file name to use to
3972find certain files.  That is outside the scope of what @samp{--with}
3973options are for.
3974
3975@item @var{variable}=@var{value}
3976Set the value of the variable @var{variable} to @var{value}.  This is
3977used to override the default values of commands or arguments in the
3978build process.  For example, the user could issue @samp{configure
3979CFLAGS=-g CXXFLAGS=-g} to build with debugging information and without
3980the default optimization.
3981
3982Specifying variables as arguments to @code{configure}, like this:
3983@example
3984./configure CC=gcc
3985@end example
3986is preferable to setting them in environment variables:
3987@example
3988CC=gcc ./configure
3989@end example
3990as it helps to recreate the same configuration later with
3991@file{config.status}.  However, both methods should be supported.
3992@end table
3993
3994All @code{configure} scripts should accept all of the ``detail''
3995options and the variable settings, whether or not they make any
3996difference to the particular package at hand.  In particular, they
3997should accept any option that starts with @samp{--with-} or
3998@samp{--enable-}.  This is so users will be able to configure an
3999entire GNU source tree at once with a single set of options.
4000
4001You will note that the categories @samp{--with-} and @samp{--enable-}
4002are narrow: they @strong{do not} provide a place for any sort of option
4003you might think of.  That is deliberate.  We want to limit the possible
4004configuration options in GNU software.  We do not want GNU programs to
4005have idiosyncratic configuration options.
4006
4007Packages that perform part of the compilation process may support
4008cross-compilation.  In such a case, the host and target machines for the
4009program may be different.
4010
4011The @code{configure} script should normally treat the specified type of
4012system as both the host and the target, thus producing a program which
4013works for the same type of machine that it runs on.
4014
4015To compile a program to run on a host type that differs from the build
4016type, use the configure option @option{--host=@var{hosttype}}, where
4017@var{hosttype} uses the same syntax as @var{buildtype}.  The host type
4018normally defaults to the build type.
4019
4020To configure a cross-compiler, cross-assembler, or what have you, you
4021should specify a target different from the host, using the configure
4022option @samp{--target=@var{targettype}}.  The syntax for
4023@var{targettype} is the same as for the host type.  So the command would
4024look like this:
4025
4026@example
4027./configure --host=@var{hosttype} --target=@var{targettype}
4028@end example
4029
4030The target type normally defaults to the host type.
4031Programs for which cross-operation is not meaningful need not accept the
4032@samp{--target} option, because configuring an entire operating system for
4033cross-operation is not a meaningful operation.
4034
4035Some programs have ways of configuring themselves automatically.  If
4036your program is set up to do this, your @code{configure} script can simply
4037ignore most of its arguments.
4038
4039@comment The makefile standards are in a separate file that is also
4040@comment included by make.texinfo.  Done by roland@gnu.ai.mit.edu on 1/6/93.
4041@comment For this document, turn chapters into sections, etc.
4042@lowersections
4043@include make-stds.texi
4044@raisesections
4045
4046@node Releases
4047@section Making Releases
4048@cindex packaging
4049
4050You should identify each release with a pair of version numbers, a
4051major version and a minor.  We have no objection to using more than
4052two numbers, but it is very unlikely that you really need them.
4053
4054Package the distribution of @code{Foo version 69.96} up in a gzipped tar
4055file with the name @file{foo-69.96.tar.gz}.  It should unpack into a
4056subdirectory named @file{foo-69.96}.
4057
4058Building and installing the program should never modify any of the files
4059contained in the distribution.  This means that all the files that form
4060part of the program in any way must be classified into @dfn{source
4061files} and @dfn{non-source files}.  Source files are written by humans
4062and never changed automatically; non-source files are produced from
4063source files by programs under the control of the Makefile.
4064
4065@cindex @file{README} file
4066The distribution should contain a file named @file{README} which gives
4067the name of the package, and a general description of what it does.  It
4068is also good to explain the purpose of each of the first-level
4069subdirectories in the package, if there are any.  The @file{README} file
4070should either state the version number of the package, or refer to where
4071in the package it can be found.
4072
4073The @file{README} file should refer to the file @file{INSTALL}, which
4074should contain an explanation of the installation procedure.
4075
4076The @file{README} file should also refer to the file which contains the
4077copying conditions.  The GNU GPL, if used, should be in a file called
4078@file{COPYING}.  If the GNU LGPL is used, it should be in a file called
4079@file{COPYING.LESSER}.
4080
4081Naturally, all the source files must be in the distribution.  It is
4082okay to include non-source files in the distribution along with the
4083source files they are generated from, provided they are up-to-date
4084with the source they are made from, and machine-independent, so that
4085normal building of the distribution will never modify them.  We
4086commonly include non-source files produced by Autoconf, Automake,
4087Bison, @code{lex}, @TeX{}, and @code{makeinfo}; this helps avoid
4088unnecessary dependencies between our distributions, so that users can
4089install whichever packages they want to install.
4090
4091Non-source files that might actually be modified by building and
4092installing the program should @strong{never} be included in the
4093distribution.  So if you do distribute non-source files, always make
4094sure they are up to date when you make a new distribution.
4095
4096Make sure that all the files in the distribution are world-readable, and
4097that directories are world-readable and world-searchable (octal mode 755).
4098We used to recommend that all directories in the distribution also be
4099world-writable (octal mode 777), because ancient versions of @code{tar}
4100would otherwise not cope when extracting the archive as an unprivileged
4101user.  That can easily lead to security issues when creating the archive,
4102however, so now we recommend against that.
4103
4104Don't include any symbolic links in the distribution itself.  If the tar
4105file contains symbolic links, then people cannot even unpack it on
4106systems that don't support symbolic links.  Also, don't use multiple
4107names for one file in different directories, because certain file
4108systems cannot handle this and that prevents unpacking the
4109distribution.
4110
4111Try to make sure that all the file names will be unique on MS-DOS.  A
4112name on MS-DOS consists of up to 8 characters, optionally followed by a
4113period and up to three characters.  MS-DOS will truncate extra
4114characters both before and after the period.  Thus,
4115@file{foobarhacker.c} and @file{foobarhacker.o} are not ambiguous; they
4116are truncated to @file{foobarha.c} and @file{foobarha.o}, which are
4117distinct.
4118
4119@cindex @file{texinfo.tex}, in a distribution
4120Include in your distribution a copy of the @file{texinfo.tex} you used
4121to test print any @file{*.texinfo} or @file{*.texi} files.
4122
4123Likewise, if your program uses small GNU software packages like regex,
4124getopt, obstack, or termcap, include them in the distribution file.
4125Leaving them out would make the distribution file a little smaller at
4126the expense of possible inconvenience to a user who doesn't know what
4127other files to get.
4128
4129@node References
4130@chapter References to Non-Free Software and Documentation
4131@cindex references to non-free material
4132
4133A GNU program should not recommend, promote, or grant legitimacy to
4134the use of any non-free program.  Proprietary software is a social and
4135ethical problem, and our aim is to put an end to that problem.  We
4136can't stop some people from writing proprietary programs, or stop
4137other people from using them, but we can and should refuse to
4138advertise them to new potential customers, or to give the public the
4139idea that their existence is ethical.
4140
4141The GNU definition of free software is found on the GNU web site at
4142@url{http://www.gnu.org/@/philosophy/@/free-sw.html}, and the definition
4143of free documentation is found at
4144@url{http://www.gnu.org/@/philosophy/@/free-doc.html}.  The terms ``free''
4145and ``non-free'', used in this document, refer to those definitions.
4146
4147A list of important licenses and whether they qualify as free is in
4148@url{http://www.gnu.org/@/licenses/@/license-list.html}.  If it is not
4149clear whether a license qualifies as free, please ask the GNU Project
4150by writing to @email{licensing@@gnu.org}.  We will answer, and if the
4151license is an important one, we will add it to the list.
4152
4153When a non-free program or system is well known, you can mention it in
4154passing---that is harmless, since users who might want to use it
4155probably already know about it.  For instance, it is fine to explain
4156how to build your package on top of some widely used non-free
4157operating system, or how to use it together with some widely used
4158non-free program.
4159
4160However, you should give only the necessary information to help those
4161who already use the non-free program to use your program with
4162it---don't give, or refer to, any further information about the
4163proprietary program, and don't imply that the proprietary program
4164enhances your program, or that its existence is in any way a good
4165thing.  The goal should be that people already using the proprietary
4166program will get the advice they need about how to use your free
4167program with it, while people who don't already use the proprietary
4168program will not see anything likely to lead them to take an interest
4169in it.
4170
4171If a non-free program or system is obscure in your program's domain,
4172your program should not mention or support it at all, since doing so
4173would tend to popularize the non-free program more than it popularizes
4174your program.  (You cannot hope to find many additional users for your
4175program among the users of Foobar, if the existence of Foobar is not
4176generally known among people who might want to use your program.)
4177
4178Sometimes a program is free software in itself but depends on a
4179non-free platform in order to run.  For instance, many Java programs
4180depend on some non-free Java libraries.  To recommend or promote such
4181a program is to promote the other programs it needs.  This is why we
4182are careful about listing Java programs in the Free Software
4183Directory: we don't want to promote the non-free Java libraries.
4184
4185We hope this particular problem with Java will be gone by and by, as
4186we replace the remaining non-free standard Java libraries with free
4187software, but the general principle will remain the same: don't
4188recommend, promote or legitimize programs that depend on non-free
4189software to run.
4190
4191Some free programs strongly encourage the use of non-free software.  A
4192typical example is @command{mplayer}.  It is free software in itself,
4193and the free code can handle some kinds of files.  However,
4194@command{mplayer} recommends use of non-free codecs for other kinds of
4195files, and users that install @command{mplayer} are very likely to
4196install those codecs along with it.  To recommend @command{mplayer}
4197is, in effect, to promote use of the non-free codecs.
4198
4199Thus, you should not recommend programs that strongly encourage the
4200use of non-free software.  This is why we do not list
4201@command{mplayer} in the Free Software Directory.
4202
4203A GNU package should not refer the user to any non-free documentation
4204for free software.  Free documentation that can be included in free
4205operating systems is essential for completing the GNU system, or any
4206free operating system, so encouraging it is a priority; to recommend
4207use of documentation that we are not allowed to include undermines the
4208impetus for the community to produce documentation that we can
4209include.  So GNU packages should never recommend non-free
4210documentation.
4211
4212By contrast, it is ok to refer to journal articles and textbooks in
4213the comments of a program for explanation of how it functions, even
4214though they are non-free.  This is because we don't include such
4215things in the GNU system even if they are free---they are outside the
4216scope of what a software distribution needs to include.
4217
4218Referring to a web site that describes or recommends a non-free
4219program is promoting that program, so please do not make links to (or
4220mention by name) web sites that contain such material.  This policy is
4221relevant particularly for the web pages for a GNU package.
4222
4223Following links from nearly any web site can lead eventually to
4224non-free software; this is inherent in the nature of the web.  So it
4225makes no sense to criticize a site for having such links.  As long as
4226the site does not itself recommend a non-free program, there is no
4227need to consider the question of the sites that it links to for other
4228reasons.
4229
4230Thus, for example, you should not refer to AT&T's web site if that
4231recommends AT&T's non-free software packages; you should not refer to
4232a site that links to AT&T's site presenting it as a place to get some
4233non-free program, because that link recommends and legitimizes the
4234non-free program.  However, that a site contains a link to AT&T's web
4235site for some other purpose (such as long-distance telephone service)
4236is not an objection against it.
4237
4238@node GNU Free Documentation License
4239@appendix GNU Free Documentation License
4240
4241@cindex FDL, GNU Free Documentation License
4242@include fdl.texi
4243
4244@node Index
4245@unnumbered Index
4246@printindex cp
4247
4248@bye
4249
4250Local variables:
4251eval: (add-hook 'write-file-hooks 'time-stamp)
4252time-stamp-start: "@set lastupdate "
4253time-stamp-end: "$"
4254time-stamp-format: "%:b %:d, %:y"
4255compile-command: "cd work.s && make"
4256End:
4257