HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Links in pdf files..
Author: Xavier Roche
Date: 10/13/2002 09:25
 
> I see flash parshing is included with the latest version, 
> any chance of seeing the pdf link extraction capability 
> soon, I have yet to use the program since I just 
downloaded 
> it but from experience with some offline browsers, I 
> haven't yet seen one with pdf parsing capability which 
will 
> be super useful particularly in academic setting.

Including an external parser is now quite easy to do ; the 
problem is the PDF format, a terrible format (compression, 
multiple versions..) with a lots of things inside, which 
require a huge external library (such as XPDF - but not 
freely available in library version)
Remember that PDF is more like a postscript format, each 
letters can be placed geometrically (and reforming words is 
just a nightmare).. 
 
Reply Create subthread


All articles

Subject Author Date
Links in pdf files..

11/22/2001 00:51
Re: Links in pdf files..

11/22/2001 09:34
Re: Links in pdf files..

10/13/2002 09:05
Re: Links in pdf files..

10/13/2002 09:25
Re: Links in pdf files..

11/24/2005 02:26
Re: Links in pdf files..

09/14/2007 10:48
Re: Links in pdf files..

02/26/2008 12:56




a

Created with FORUM 2.0.11