HTTrack Website Copier
Free software offline browser - FORUM
Subject: 3.44 Filters and filesize bug?
Author: Daniel Konopka
Date: 11/19/2011 19:08
 
Hello,

I can't for the life of me get HTTrack's filters to work properly.
I want to exclude all *.zip files bigger than 1200 KB from a site.
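
To spell out the behaviour I expect, here is a minimal Python sketch. It only models my reading of the filter rules (the last matching rule wins, and a size condition in KB makes a rule apply only below that size). It is not taken from HTTrack's code, and the function name and rule tuples are made up for illustration.

```python
from fnmatch import fnmatchcase

def allowed(url, size_kb, rules):
    """Return True if the last rule that matches is an include ('+') rule.

    Assumption: rules are checked in order and the last match decides.
    This models the expected behaviour, not HTTrack's actual implementation.
    """
    verdict = False  # nothing is accepted until some rule says so
    for sign, pattern, max_kb in rules:
        if not fnmatchcase(url, pattern):
            continue  # pattern does not match this URL
        if max_kb is not None and not size_kb < max_kb:
            continue  # size condition "[<max_kb]" is not met
        verdict = (sign == "+")
    return verdict

# Hypothetical encoding of the filters from the command line:
# (sign, wildcard pattern, optional "smaller than N KB" condition)
RULES = [
    ("-", "*", None),
    ("+", "*.phpcaptcha.org/*.zip", None),
    ("+", "*.phpcaptcha.org/download*/*", None),
    ("-", "*.zip", None),
    ("+", "*.zip*", 1200),  # stands for "+*.zip*[<1200]"
]

small = allowed("www.phpcaptcha.org/latest.zip", 300, RULES)
large = allowed("www.phpcaptcha.org/latest.zip", 5000, RULES)
```

Under this model a 300 KB zip is accepted and a 5000 KB zip is rejected, which is the result I am trying to get from the real filters.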

I use this command (watch out for the -O path):

httrack.exe http://www.phpcaptcha.org/download/ "-*" "+*.phpcaptcha.org/*.zip"
"+*.phpcaptcha.org/download*/*" "-*.zip" "+*.zip*[<1200]"
-O "V:\htt_temp\filesizetest"
-A90000 -c1 -%c1 -H1 -P 127.0.0.1:8128 -R3 -T300
-F "Mozilla/5.0 (X11; U; FreeBSD; i386; en-US; rv:1.7) Gecko" -u1 -B -%D0
-E600 -K0 -%N0 -p3 -r20 -s0 -U -X0 -b1 -N "%h%p/%n.%t%k" -%q0 -f -z -Z -%v1

I have tried these filter configurations:

>>> "-*.zip*[>1200]"

Does not download any *.zip files at all, and the debug log shows:

19:01:00	Debug: 	upper link authorized: www.phpcaptcha.org/latest.zip
19:01:00	Debug: 	(wizard) explicit forbidden (-*.zip*[>1200]) link: link
www.phpcaptcha.org/latest.zip at www.phpcaptcha.org/download/
19:01:00	Debug: 	(wizard) cancelled foreign domain link: link
www.phpcaptcha.org/latest.zip at www.phpcaptcha.org/download/
19:01:00	Debug: 	result for wizard link test: 1

>>> "-*.zip" "+*.zip*[<1200]"

Does not download any *.zip files at all, and the debug log shows:

18:55:32	Debug: 	upper link authorized: www.phpcaptcha.org/latest.zip
18:55:32	Debug: 	(wizard) explicit forbidden (-*.zip) link: link
www.phpcaptcha.org/latest.zip at www.phpcaptcha.org/download/?replytocom=509
18:55:32	Debug: 	(wizard) cancelled foreign domain link: link
www.phpcaptcha.org/latest.zip at www.phpcaptcha.org/download/?replytocom=509
18:55:32	Debug: 	result for wizard link test: 1
18:55:32	Debug: 	External: www.phpcaptcha.org/latest.zip

>>> "-*.zip" "+*.zip*[< 1200]"
Creates a bogus folder named "1200]" and still downloads the files that were
meant to be excluded.
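
The stray "1200]" folder would be consistent with the filter string being split on whitespace somewhere along the way, so that the second half ends up treated as a separate argument. That is only a guess and has not been confirmed against the HTTrack source. The Python sketch below just shows what such a split would produce; the helper name is hypothetical.

```python
def split_on_whitespace(argument):
    """Split a single argument on whitespace.

    Assumption: something between the command line and the filter parser
    does this. It is a hypothesis about the symptom, not HTTrack's code.
    """
    return argument.split()

# The filter with a space after "<" breaks into two separate tokens.
tokens = split_on_whitespace("+*.zip*[< 1200]")
```

The second token here is exactly "1200]", which matches the name of the bogus folder, while the version without the space stays in one piece.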

I don't need to tell you this is driving me mad: I've already wasted more than
3 hours testing the filters and still have nothing to show for it. On top of
that, HTTrack's source code is poorly commented (in French only), and the
variable names are often cryptic and meaningless, which makes it a PAIN for
anyone who would like to fix it.

Could someone test it and let me know if it can be reproduced?
 