xref: /csrg-svn/usr.bin/spell/README (revision 62250)
1*62250Sbostic#	@(#)README	8.1 (Berkeley) 06/06/93
230612Smckusick
343667StrentAll files and subdirectories of /usr/share/dict are recommended for
430658Smckusickrdisting except web2 and web2a (because of their size), and some of
530658Smckusickthe files  hlist*  depending on needs of your machine (details below).
630658SmckusickDescriptions of most of these files are given under FILES below.
730613Smckusick
830613SmckusickThe new subdirectory "special" contains lists of words in specialized
930658Smckusickfields, which may be hashed in with the regular lists on machines having
1030658Smckusickmany users working in these fields.  As of this writing, there are two
1130658Smckusicksuch specialized wordlists.
1230613Smckusick
1330613SmckusickIt is advised that system managers also create a directory
1430658Smckusick/usr/local/dict. This can be used to maintain files of particular
1530658Smckusickinterest to users of each machine (e.g., surnames of members of the
1630613Smckusickdepartment on a departmental machine).
1730613Smckusick
1830658SmckusickThe hashed wordlists hlista and hlistb in this distribution include
1930658Smckusickthe words in the file special/4bsd, comprising current 4bsd
2030658Smckusickcommands, system calls, etc. (from "abs" to "zcat").  Machines
2130658Smckusickwhose primary users are programmers should take these files by
2230658Smckusickrdist.  For machines with other user populations, a file "hlist" is
2330658Smckusickprovided which only contains the contents of /usr/dict/words.  Managers
2430658Smckusickof such machines should rdist this file, and use "spellin" to produce
2530658Smckusickfiles hlist{a,b} which contain the words from
2643667Strent/usr/share/dict/{american,british} respectively, plus any other files
2730658Smckusickappropriate to the needs of the majority of their users.  (Some basic
2830658Smckusickunix commands and terms that general users are likely to encounter,
2943667Strente.g. troff, emacs, tty, have been included in /usr/share/dict/words.
3030658SmckusickMore may be added as suggestions are received.)  Here, for instance is a
3130658Smckusickscript that might be used to create the hashlists on a particular
3243667Strentmachine, so as to include the words in /usr/share/dict/special/math, as well
3330658Smckusickas two local lists which we will assume are called
3430658Smckusick/usr/local/dict/surnames and /usr/local/dict/acronyms.
3530613Smckusick
3630613Smckusick	#
3743667Strent	cd /usr/share/dict
3830658Smckusick	cat american special/math /usr/local/dict/{surnames,acronyms} | \
3930658Smckusick		spellin hlist > hlista
4030658Smckusick	cat british  special/math /usr/local/dict/{surnames,acronyms} | \
4130658Smckusick		spellin hlist > hlistb
4230613Smckusick
4330658Smckusick     Hashlists can also be created from scratch using
4430658Smckusick/usr/src/usr.bin/spell/Makefile.  This is now written so that if "make"
4530658Smckusickis run with no options it will produce the hashed files as presently
4630658Smckusickdistributed, but so that the extra wordlists used can be controlled with
4730658Smckusickvariables LOCAL and SPECIAL.  For instance, the results given by the
4830658Smckusickabove script can be obtained by doing:
4930613Smckusick
5030658Smckusick	cd /usr/src/usr.bin/spell
5130658Smckusick	make LOCAL='/usr/local/dict/surnames /usr/local/dict/acronyms' \
5230658Smckusick		SPECIAL=special.math
5330658Smckusick	make install
5430613Smckusick
5530658Smckusick     Returning to the subject of the wordlists in /usr/dict, these are,
5630658Smckusickin general, ordered as in  sort -df.  This makes no difference for
5730658Smckusickspell's hashing process, but makes a difference for other commands,
5830658Smckusicksuch as "look", that perform binary searches on the unhashed lists.
5930658Smckusick
6030658SmckusickComplaints, and any additional suggestions for words or wordlists,
6130658Smckusickshould be sent to me.  I cannot fix bugs involving the code of "spell",
6230658Smckusickbut I am maintaining a list of these bugs, and of other ideas for
6330658Smckusickimprovement.
6430658Smckusick		George Bergman, gbergman@cartan.Berkeley.Edu
6530658Smckusick		18 March, 1987
6630658Smckusick
6730613Smckusick--------------------------------------------------------------------
6843667StrentFILES and subdirectories of /usr/share/dict:
6930658Smckusick
7030658Smckusick    words    -- common words, and important technical terms from all
7130613Smckusick	fields, that are spelled the same in British and American usage.
7230658Smckusick    american -- spellings preferred in American but not British usage.
7330658Smckusick    british  -- spellings preferred in British but not American usage.
7430658Smckusick    stop     -- forms that would otherwise be derivable by "spell" from
7530613Smckusick	words in one of the above files, but should not be accepted.
7630658Smckusick    hlist    -- hashed list, formed from the file "words" only.
7730658Smckusick    hlista   -- hashed list, formed from files {words,american,special/4bsd}.
7830658Smckusick    hlistb   -- hashed list, formed from files {words,british,special/4bsd}.
7930658Smckusick    hstop    -- hashed list, formed from file "stop".
8030658Smckusick    README   -- this file
8130658Smckusick    papers/  -- an (out-of-date specialized) bibliographical database,
8230658Smckusick	used as the default by the program "refer".
8330658Smckusick    special/ -- directory of less common terms from specialized fields.
8430658Smckusick	It presently contains:
8530658Smckusick
8630658Smckusick	special/4bsd -- commands and system calls (from filenames in
8743667Strent	    /usr/share/man/man[1238n]), and builtin csh commands (named in
8843667Strent	    /usr/share/man/man1/csh.1) of the current version of 4bsd Unix.
8930613Smckusick	    (Supersedes old "/usr/src/usr.bin/spell/local".)
9043667Strent	special/math -- some mathematical terms not in /usr/share/dict/words.
91