.TH PREP 1
.SH NAME
prep \- prepare text for statistical processing
.SH SYNOPSIS
.B prep
[
.B \-diop
] file ...
.SH DESCRIPTION
.I Prep
reads each
.I file
in sequence and writes it on the standard output,
one `word' to a line.
A word is a string of alphabetic characters
and embedded apostrophes,
delimited by space or punctuation.
Hyphenated words are broken apart;
hyphens at the end of lines are removed
and the hyphenated parts are joined.
Strings of digits are discarded.
.PP
The following option letters may appear in any order:
.TP
.B \-\^d
Print the word number (in the input stream)
with each word.
.TP
.B \-\^i
Take the next
.I file
as an `ignore' file.
These words will not appear in the output.
(They will be counted, for purposes of the
.B \-d
count.)
.TP
.B \-\^o
Take the next
.I file
as an `only' file.
Only these words will appear in the output.
(All other words will also be counted for the
.B \-d
count.)
.TP
.B \-\^p
Include punctuation marks (single nonalphanumeric characters)
as separate output lines.
The punctuation marks are not counted for the
.B \-d
count.
.PP
Ignore and only files contain words, one per line.
.SH SEE ALSO
deroff(1)
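.SH EXAMPLES
The invocations below are an illustrative sketch based only on the
description above.
The file names
.I chapter1
and
.I ignore.words
are hypothetical, and no output is shown because its exact layout
is not specified here.
.PP
.RS
.nf
# number each word by its position in the input stream
prep \-d chapter1

# suppress the words listed, one per line, in ignore.words
prep \-i ignore.words chapter1

# one possible word-frequency pipeline using standard tools
prep chapter1 | sort | uniq \-c | sort \-rn
.fi
.RE
.PP
The last pipeline is one way the one-word-per-line output might be
fed to later statistical processing; it is not part of
.I prep
itself.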