[OS X TeX] converting ligatures into text
Lawrence Paulson
lp15 at cam.ac.uk
Fri Apr 22 09:37:09 EDT 2005
I have to extract text from a large number of PDF documents produced
using TeX. Because (I presume) of TeX's non-standard font encodings,
cut and paste often goes wrong. In particular, ligatures get garbled: I
get di±cult instead of difficult.
Does anybody know of a program (or of a definitive set of replacements
that could be given to Perl) for cleaning up such text?
Larry Paulson
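
For illustration only, here is a rough sketch of the kind of replacement table asked about, written in Python rather than Perl. It assumes the garbling follows a pattern often seen with Computer Modern Type 1 fonts, where the OT1 ligature slots surface as the Latin-1 characters 0xAE to 0xB2 (so ffi comes out as ±, matching the di±cult example). That mapping, and the names GARBLED, UNICODE_LIGS and fix_ligatures, are assumptions of this sketch, not a definitive list; other fonts and PDF producers may garble ligatures differently.

```python
import re

# Assumed mapping: OT1 ligature slots 0x0B-0x0F as they commonly appear
# after copy and paste, shifted to Latin-1 code points 0xAE-0xB2.
GARBLED = {
    "\u00ae": "ff",   # registered sign
    "\u00af": "fi",   # macron
    "\u00b0": "fl",   # degree sign
    "\u00b1": "ffi",  # plus-minus sign
    "\u00b2": "ffl",  # superscript two
}

# Unicode's own ligature code points, which some extractors emit instead.
UNICODE_LIGS = {
    "\ufb00": "ff",
    "\ufb01": "fi",
    "\ufb02": "fl",
    "\ufb03": "ffi",
    "\ufb04": "ffl",
}

# Heuristic: only treat a symbol as a garbled ligature when it touches a
# lowercase letter, so that "5 ± 2" is left alone. This can still misfire
# (for example on a superscript two directly after a variable name).
_GARBLED_RE = re.compile(r"(?<=[a-z])[\u00ae-\u00b2]|[\u00ae-\u00b2](?=[a-z])")


def fix_ligatures(text):
    """Replace garbled and Unicode ligatures with plain letter sequences."""
    text = _GARBLED_RE.sub(lambda m: GARBLED[m.group(0)], text)
    for lig, plain in UNICODE_LIGS.items():
        text = text.replace(lig, plain)
    return text
```

Calling fix_ligatures on each line of pasted text would be the intended use. Because the symbols involved are also legitimate characters, the output should be spot-checked rather than trusted blindly.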