HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: is there a way to just get all pdf file links?
Author: Xavier Roche
Date: 08/31/2013 14:10
 
> I don't need to download the actual file. I just want
> to get all links to the pdf files, for example into
> an excel file. Is this doable with HTTrack?
Err, not directly. You may, however, set the logs to "verbose" mode and
then grep for lines containing "External: " in hts-log.txt, for example:

grep "External: " hts-log.txt | cut -f3 | cut -f2 -d' ' | grep -E "\.pdf$"

(On Windows, if you do not have grep and cut installed, you may
consider installing Cygwin.)
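
Since the goal was a list that opens in Excel, the pipeline above could be wrapped so that it writes a one-column CSV file. This is only a rough sketch: the function name pdf_links_to_csv and the "url" header are made up for illustration and are not part of HTTrack, and it assumes the verbose log lines have the same tab and space layout that the cut commands above rely on.

```shell
# Hypothetical helper (not part of HTTrack): collect the unique PDF links
# found in a verbose hts-log.txt and write them to a one-column CSV file.
# Usage: pdf_links_to_csv hts-log.txt pdf-links.csv
pdf_links_to_csv() {
    log_file="$1"   # path to the HTTrack log
    csv_file="$2"   # path of the CSV file to create
    {
        echo "url"  # header row for the spreadsheet (assumed name)
        # Same pipeline as above, plus "sort -u" to drop duplicate links.
        grep "External: " "$log_file" | cut -f3 | cut -f2 -d' ' \
            | grep -E "\.pdf$" | sort -u
    } > "$csv_file"
}
```

Calling pdf_links_to_csv hts-log.txt pdf-links.csv should then leave a file that Excel can open directly. If the log layout differs from what the cut fields expect, the field numbers need adjusting.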
 