4.  Printing a Publication List

      There are two commands for printing a publication list, depending on whether you want to print one person's list, or the list of many people. To print a list for one person, use the pub.indiv command:

pub.indiv M Lesk
This runs off the list for M. Lesk and puts it in file ``output''. Note that no `.' is given after the initial. In case of ambiguity two initials can be used. Similarly, to get the list for group of people, say
pub.org xxx
which prints all the publications of the members of organization xxx, taking the names for the list in the file /usr/dict/papers/centlist/xxx. This command should normally be run in the background; it takes perhaps 15 minutes. Two options are available with these commands:
pub.indiv -p M Lesk
prints only the papers, leaving out unpublished notes, patents, etc. Also
pub.indiv -t M Lesk | gcat
prints a typeset copy, instead of a computer printer copy. In this case it has been directed to an alternate typesetter with the `gcat' command. These options may be used together, and may be used with the pub.org command as well. For example, to print only the papers for all of organization zzz and typeset them, you could type
pub.center -t -p zzz | gcat &
These publication lists are printed double column with a citation style taken from a set of publication list macros; the macros, of course, can be changed easily to adjust the format of the lists.

Some Applications of Inverted Indexes on the UNIX System

M. E. Lesk


I. Some Applications of Inverted Indexes - Overview

      This memorandum describes a set of programs which make inverted indexes to UNIX* text files, and their application to retrieving and formatting citations for documents prepared using troff.

      These indexing and searching programs make keyword indexes to volumes of material too large for linear searching. Searches for combinations of single words can be performed quickly. The programs are divided into two phases. The first makes an index from the original data; the second searches the index and retrieves items. Both of these phases are further divided into two parts to separate the data-dependent and algorithm dependent code.

      The major current application of these programs is the troff preprocessor refer. A list of 4300 references is maintained on line, containing primarily papers written and cited by local authors. Whenever one of these references is required in a paper, a few words from the title or author list will retrieve it, and the user need not bother to re-enter the exact citation. Alternatively, authors can use their own lists of papers.

      This memorandum is of interest to those who are interested in facilities for searching large but relatively unchanging text files on the UNIX system, and those who are interested in handling bibliographic citations with UNIX troff.

II. Updating Publication Lists

      This section is a brief note describing the auxiliary programs for managing the updating processing. It is written to aid clerical users in maintaining lists of references. Primarily, the programs described permit a large amount of individual control over the content of publication lists while retaining the usefulness of the files to other users.

III. Manual Pages

      This section contains the pages from the UNIX programmer's manual for the lookall, pubindex, and refer commands. It is useful for reference.

* UNIX is a Trademark of Bell Laboratories.