HTTrack Website Copier
Free software offline browser - FORUM
Subject: problem with mime-type exclusions
Author: Jacob Langseth
Date: 01/16/2009 17:31
 
Hello,

I am experiencing some unexpected behavior with the mime-type exclusion rules
in HTTrack.  I would like to mirror a web page, but exclude all images via
mime type.

If I execute the following, however:

  $ httrack -%H <http://www.cs.utah.edu/~gk/atwork/> -mime:image/*

HTTrack will still download all of the images on the site.  (The only
difference between this command and one that does not use the -mime parameter
appears to be that with -mime, the files are all named .delayed rather than
.gif/.jpg.)

If I instead exclude images by file extension, rather than mime-type, then I
get the behavior that I would expect:

  $ httrack -%H <http://www.cs.utah.edu/~gk/atwork/> -*.gif -*.jpg

(This command downloads the page, but does not download the images.)

Is there a way to use mime-type exclusions to avoid downloading unwanted
mime-types?  I would rather not rely on the file-extension exclusion method,
as file extensions can be absent or unreliable.

I am using HTTrack version 3.43-2+libhtsjava.so.2 under linux
(debian-unstable).

Thanks for your help,
 Jacob
 
Reply


All articles

Subject Author Date
problem with mime-type exclusions

01/16/2009 17:31
Re: problem with mime-type exclusions

01/16/2009 18:53
Re: problem with mime-type exclusions

01/16/2009 19:28




6

Created with FORUM 2.0.11