HTTrack Website Copier
Free software offline browser - FORUM
Subject: webhttrack - ext deep 1 doesn't work.
Author: dr_Fell
Date: 07/30/2006 22:07
 
This time I tried to mirror gallery of websites from cssbeauty.com. Setted up
quite standard config - since I wanted only part od websites (not articles)
gallery, spider started from
<http://www.cssbeauty.com/archives/category/business/>. Optnions were: can go
down, can go outside domain (whole web - I wanted it to can get pages from
gallery), external deep 1 (only 1st page of every external website) . 
Filters: -www.cssbeauty.com/* -cssbeauty.com/* (I don't want it to retrieve
whole cssbeauty, which is quite big site)
+www.cssbeauty.com/archives/category/* (I want it can go to every subcategory,
not only business. links look like  
<http://www.cssbeauty.com/archives/category/CATEGORY/).+cssbeauty.com/archives/category/>*
(the same reason) +*.png +*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/*
(standard).

Then... why is it downloading whole w3.org at the moment I am writing ?
www.w3.org dir has 6 mb at the moment and is still growing. 

 
Reply


All articles

Subject Author Date
webhttrack - ext deep 1 doesn't work.

07/30/2006 22:07
Re: webhttrack - ext deep 1 doesn't work.

07/31/2006 02:29
Re: webhttrack - ext deep 1 doesn't work.

07/31/2006 13:38
Re: webhttrack - ext deep 1 doesn't work.

07/31/2006 13:54
Re: webhttrack - ext deep 1 doesn't work.

08/01/2006 00:33




d

Created with FORUM 2.0.11