[OS X TeX] Binary for detex (convert latex into plain text, see also l2a)

Robert Spence spence at saar.de
Thu Aug 30 20:45:07 EDT 2007


On 31 Aug 2007, at 05:13 , Markus S wrote:

> Or is there a better way to get plain text out of latex? Exporting  
> text from the pdf file using Acrobat works, but all non-ASCII  
> characters (incl. the pervasive ligatures) and the hyphenation make  
> a big mess of the text.

Have you tried adding
\usepackage{cmap}
to the preamble of your LaTeX file?
 From memory it has to go first, i.e. immediately after you specify  
your documentclass.

On my system, doing
texdoc cmap
in Terminal.app tells me:
The cmap package is intended to make the PDF files generated by  
pdflatex both searchable and copyable in acrobat reader and other  
compliant PDF viewers. Just include the \usepackage{cmap} at the  
beginning of your preamble, and it will pre-load the needed CMap  
translations for the fonts used in the document. It works for Type 1  
fonts, and also for Type 3 fonts with recent versions of pdftex. The  
main limitation currently is the inability to work with virtual  
fonts, because of limitations of pdftex, but may be resolved in a  
future version of pdftex. The author is Vladimir Volovich. See also  
pdftex.

HTH a bit

-- Robert Spence
Applied Linguistics
Saarland University
Germany

------------------------- Helpful Info -------------------------
Mac-TeX Website: http://www.esm.psu.edu/mac-tex/
TeX FAQ: http://www.tex.ac.uk/faq
List Archive: http://tug.org/pipermail/macostex-archives/
List Reminders & Etiquette: http://www.esm.psu.edu/mac-tex/list/





More information about the MacOSX-TeX mailing list