HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Quickly extract only valid urls from hts-log.txt ?
Author: Xavier Roche
Date: 02/10/2002 17:40
 
> 'Extract URLs from the logfile.'
> That's exactly what I don't know how to do quickly. I
> know how to do it one URL after the other (copy and
> paste into a new text file). But as there are more than
> 500 '.rm' files (each one bigger than 10 MB), it will
> take me a lot of time. ;-(

Ah.. on Un*x, one line similar to:
cat hts-log.txt | grep "File too big" | grep -E "http:\/\/.*\.rm" | sed -e 's/.* \(http:\/\/.*\.rm\) .*/\1/g'

But I don't know of any similar tools on Windows, unless you install the standard GNU utilities for DOS.
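
For what it's worth, the one-liner above can probably be shortened with grep's -o option, which prints only the matching part of each line. This is a rough sketch, not a tested recipe: it assumes the URLs in the log contain no spaces, that your grep supports -o (GNU and BSD grep do, but it is not required by POSIX), and the function name extract_rm_urls is just a hypothetical helper, not something HTTrack provides.

```shell
# Sketch: print every .rm URL found on a "File too big" line of a log.
# extract_rm_urls is a hypothetical helper name, not part of HTTrack.
extract_rm_urls() {
    # $1: path to the log file (for example hts-log.txt)
    # First grep keeps only the "File too big" lines; the second prints
    # just the URL part, assuming URLs contain no whitespace.
    grep "File too big" "$1" |
        grep -oE 'http://[^[:space:]]+\.rm'
}
```

Something like `extract_rm_urls hts-log.txt > urls.txt` would then write one URL per line to urls.txt; piping the result through `sort -u` should drop any duplicates.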

 

