[Date Prev][Date Next] [Thread Prev][Thread Next] [Date Index] [Thread Index]

word list for document index



I wish to add an index to documents I am writing.  I am using XEmacs
and LaTeX.

It appears that makeindex is obsolescent and has been superseded by
xindy, so I plan to use xindy.

>From an article titled "MakeIndex: An Index Processor for LaTeX",
written by Leslie Lamport (17 February 1987), it appears that the
first step is to create an alphabetical list of every word in the
document, with duplicates removed.  Lamport recommends use of the
commands:

    delatex myfile.tex | sort -uf > foo

It would appear that delatex purges from the file all the LaTeX
commands.

However:

(1) It appears that delatex now is obsolete.  I have searched with
Google, but I have not yet discovered how to obtain or create an
executable version of delatex.

(2) I do not know if delatex takes account of "include" files in a
LaTeX document (that is, files specified by the \include command).
However, it would be no great problem to run delatex on each of the
include files, and then concatenate the resulting word lists.

Accordingly, I am trying to start out with the .dvi file of the
document.  Now the problem is that, in the .dvi file, many words are
hyphenated at the end of the line, and there is a newline character
(and perhaps also a carriage return?) at the end of each line.

Using sed and tr, I have attempted to remove combination of [hyphen
and newline character], but I have not been successful.  Two sed
script which I have tried are:  

   sed -e 's/" \n"//g' 

   sed -e 's/( \n)//g' 

Is there a better way to generate a list of words in a LaTeX document?

RLH




Reply to: