.TH PREP 1 
.SH NAME
prep \- prepare text for statistical processing
.SH SYNOPSIS
.B prep
[
.B \-diop
]
file ...
.SH DESCRIPTION
.I Prep
reads each
.I file
in sequence
and writes it on the standard output,
one `word' to a line.
A word is a string of alphabetic characters
and imbedded apostrophes,
delimited by space or punctuation.
Hyphented words are broken apart;
hyphens at the end of lines are removed and the
hyphenated parts are joined.
Strings of digits are discarded.
.PP
The following option letters may appear in any order:
.TP
.B \-\^d
Print the word number (in the input
stream) with each word.
.TP
.B \-\^i
Take the next
.I file
as an `ignore' file.
These words will not appear in the output.
(They will be counted, for purposes of the
.B \-d
count.)
.TP
.B \-\^o
Take the next 
.I file
as an `only' file.
Only these words will appear in the output.
(All other words will also be counted for
the 
.B \-d
count.)
.TP
.B \-\^p
Include punctuation marks (single nonalphanumeric characters)
as separate output lines.
The punctuation marks are not counted for the
.B \-d
count.
.PP
Ignore and only files contain words, one per line.
.SH SEE ALSO
deroff(1)