| > I see flash parshing is included with the latest version,
> any chance of seeing the pdf link extraction capability
> soon, I have yet to use the program since I just
downloaded
> it but from experience with some offline browsers, I
> haven't yet seen one with pdf parsing capability which
will
> be super useful particularly in academic setting.
Including an external parser is now quite easy to do ; the
problem is the PDF format, a terrible format (compression,
multiple versions..) with a lots of things inside, which
require a huge external library (such as XPDF - but not
freely available in library version)
Remember that PDF is more like a postscript format, each
letters can be placed geometrically (and reforming words is
just a nightmare)..
| |