HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Files aren't downloaded
Author: William Roeder
Date: 03/13/2010 23:58
 
> I've uploaded screenshots of my settings here
> (settings not shown are unchanged from default):
> this is my overall list of websites to download in
> this project:
> <http://www.serebii.net/>
> <http://www.serebii.net/pokedex/>
> <http://www.serebii.net/pokedex-rs/>
> <http://www.serebii.net/pokedex-dp/>
> <http://www.serebii.net/attackdex/>
> <http://www.serebii.net/attackdex-dp/>
> <http://www.serebii.net/abilitydex/>
> <http://www.serebii.net/pokearth/>
> <http://www.serebii.net/games/type.shtml>
> <http://www.serebii.net/pokedex-rs/bug.shtml>

Initial post misdirected me by saying serebii.net

Don't post setting like that. 1) defaults can be changed from the factory
settings. 2) all the scan rules can not be seen but all the -serebii.net...
filters do nothing since that is  not where you are mirroring. 3) you didn't
answer my question about the near option (get non-html) Instead just past the
second line from the log file. That contains Everything.

Now I see that you are limiting mirror depth to two and allowing travel
mode=up and down.
Since you listed the top level (http://www.serebii.net/) then the up and down
does nothing.

The depth to two is different. Other than the top level
you'll get the page listed (Eg. <http://www.serebii.net/pokearth/>) That's level
one, and all images for THAT page (level two) and all other links (level two)
but those pages will NOT get images since those would be level three.
From the top level, you get all those images( Level 2), all other pages (not
mentioned in your mirror) without images.

Any image/css/js etc not next to one of the above pages will not be gotten
since you didn't check get-nonhtml.
Also since you are limiting the mirror, you should check no-external pages so
you know where the mirror ends.

FYI, with non-html I got: 23108 links scanned, 23113 files written (276214686
bytes overall) I didn't see any incorrect pages.
 
Reply Create subthread


All articles

Subject Author Date
Files aren't downloaded

03/12/2010 21:40
Re: Files aren't downloaded

03/12/2010 22:38
Re: Files aren't downloaded

03/12/2010 23:08
Re: Files aren't downloaded

03/13/2010 16:10
Re: Files aren't downloaded

03/13/2010 16:42
Re: Files aren't downloaded

03/13/2010 23:58
Re: Files aren't downloaded

03/14/2010 02:54




7

Created with FORUM 2.0.11