Re: Links in pdf files.. - HTTrack Website Copier Forum

Subject: Re: Links in pdf files..

Author: Xavier Roche

Date: 10/13/2002 09:25

> I see flash parshing is included with the latest version, 
> any chance of seeing the pdf link extraction capability 
> soon, I have yet to use the program since I just 
downloaded 
> it but from experience with some offline browsers, I 
> haven't yet seen one with pdf parsing capability which 
will 
> be super useful particularly in academic setting.

Including an external parser is now quite easy to do ; the 
problem is the PDF format, a terrible format (compression, 
multiple versions..) with a lots of things inside, which 
require a huge external library (such as XPDF - but not 
freely available in library version)
Remember that PDF is more like a postscript format, each 
letters can be placed geometrically (and reforming words is 
just a nightmare)..

Create subthread

All articles

Subject	Author	Date
Links in pdf files..		11/22/2001 00:51
Re: Links in pdf files..		11/22/2001 09:34
Re: Links in pdf files..		10/13/2002 09:05
Re: Links in pdf files..		10/13/2002 09:25
Re: Links in pdf files..		11/24/2005 02:26
Re: Links in pdf files..		09/14/2007 10:48
Re: Links in pdf files..		02/26/2008 12:56