HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Scan Rules Question: How to get ONLY HTML file
Author: William Roeder
Date: 11/02/2008 02:55
 
> I've tried checking and unchecking the option to get
> non-HTML files, and set the limit to non-HTML files
> to 0. I've even tried setting the scan rules to
> "-*.[file extension]*"
> 
> But, still the program downloads .zip, .rar, and
> .mov, and other large files that I do not want. How
> do I fix this?
checking non-html allows additional files such as off-site images.  So that
you don't want checked.

[file extension] is not valid (see <http://httrack.com/html/fcguide.html>)

Since html files come in many flavors (.htm, .asp, .cgi etc)
you need to filter on the mime type.  Per
<http://httrack.kauler.com/help/Filters#Filters7> you need
-mime:*/* +mime:text/html
 
Reply Create subthread


All articles

Subject Author Date
Scan Rules Question: How to get ONLY HTML files

11/01/2008 22:33
Re: Scan Rules Question: How to get ONLY HTML file

11/02/2008 02:55
Re: Scan Rules Question: How to get ONLY HTML file

11/03/2008 14:40




b

Created with FORUM 2.0.11