HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: defending a site, bandwidth leeching
Author: D_A
Date: 01/18/2005 17:55
 
You can use javascript to block httrack and any other webbot
but you won't be indexed.
You can use server side techniques but .htaccess will not be
helpfull in your case.

The most effective is usually to block any user (IP or
session) requesting too many pages or sending too many
requests instead of trying to fool a robot.
You can find useful information here :
<http://www.httrack.com/html/abuse.html#WEBMASTERS>
<http://www.garykeith.com/browsers/bad-bots.asp>
webmasterworld has many threads about blocking spiders and
spider traps (using perl, php, asp...)
you can look for web spider traps and and find solutions to
bandwidth usage abuse.
and in French
<http://www.1001bd.com/stop_aspirateurs/>
<http://www.toulouse-renaissance.net/c_outils/c_interdire_aspirateurs.htm>
 
Reply Create subthread


All articles

Subject Author Date
defending a site, bandwidth leeching 01/17/2005 10:01
Re: defending a site, bandwidth leeching 01/17/2005 19:10
Re: defending a site, bandwidth leeching 01/17/2005 21:22
Re: defending a site, bandwidth leeching 01/18/2005 08:14
Re: defending a site, bandwidth leeching 01/18/2005 17:55
I wanty to share my ideas... 01/19/2005 16:18
Re: defending a site, bandwidth leeching 01/22/2005 09:52




e

Created with FORUM 2.0.11