HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: html file
Author: William Roeder
Date: 12/25/2008 15:20
 
> I'm using the Linux command-line httrack. I would like
> to download only the main HTML page of the URL and its
> images. I used the following command, but it is also
> downloading the HTML files referenced in "href" attributes.
That is what httrack does by default - it mirrors a site by following its links.

> httrack
> <http://commons.wikimedia.org/wiki/Category:HTML> -O
> /home/test/output/crawler -q -Q -N
> 20081224084727/%n.%t -o0 -X0 -T30 -R1 -b0 -I0 %I0
> -%F "" -F "Mozilla/5.0 Firefox/3.0.3" -r -*.htm* -*
> +*.jpg +*.jpeg +*.css +*.js +*.gif +*.bmp +*.tif*
> +*.png +*.swf -*.exe -*.pdf -*.doc -*.zip

See the command-line guide at <http://httrack.com/html/fcguide.html> and use either:
-g  just get files (saved in the current directory) (--get-files)
or
-rN set the mirror depth to N (* r9999) (--depth[=N])
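
As a rough sketch of the second option: the quoted command passes a bare -r with no number, so the depth limit is probably not being applied. The helper below only prints a candidate command so it can be checked before running it. The function name shallow_cmd is made up for illustration, and the depth value 1 is an assumption; check the guide for which value gives the start page plus its images and nothing further.

```shell
#!/bin/sh
# Hypothetical helper (not part of httrack): print an httrack command
# line that limits the mirror depth. Nothing is downloaded here.
shallow_cmd() {
    url=$1
    out=$2
    depth=${3:-1}   # assumed default; adjust after checking the guide
    # -O sets the output path, -rN sets the mirror depth to N
    printf 'httrack %s -O %s -r%s\n' "$url" "$out" "$depth"
}

# URL and output path are the ones from the question above.
shallow_cmd "http://commons.wikimedia.org/wiki/Category:HTML" /home/test/output/crawler
```

If filters such as +*.jpg or -*.htm* are added back, put them in quotes so the shell does not expand the wildcards before httrack sees them.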
