| > I've uploaded screenshots of my settings here
> (settings not shown are unchanged from default):
> this is my overall list of websites to download in
> this project:
> <http://www.serebii.net/>
> <http://www.serebii.net/pokedex/>
> <http://www.serebii.net/pokedex-rs/>
> <http://www.serebii.net/pokedex-dp/>
> <http://www.serebii.net/attackdex/>
> <http://www.serebii.net/attackdex-dp/>
> <http://www.serebii.net/abilitydex/>
> <http://www.serebii.net/pokearth/>
> <http://www.serebii.net/games/type.shtml>
> <http://www.serebii.net/pokedex-rs/bug.shtml>
Initial post misdirected me by saying serebii.net
Don't post setting like that. 1) defaults can be changed from the factory
settings. 2) all the scan rules can not be seen but all the -serebii.net...
filters do nothing since that is not where you are mirroring. 3) you didn't
answer my question about the near option (get non-html) Instead just past the
second line from the log file. That contains Everything.
Now I see that you are limiting mirror depth to two and allowing travel
mode=up and down.
Since you listed the top level (http://www.serebii.net/) then the up and down
does nothing.
The depth to two is different. Other than the top level
you'll get the page listed (Eg. <http://www.serebii.net/pokearth/>) That's level
one, and all images for THAT page (level two) and all other links (level two)
but those pages will NOT get images since those would be level three.
From the top level, you get all those images( Level 2), all other pages (not
mentioned in your mirror) without images.
Any image/css/js etc not next to one of the above pages will not be gotten
since you didn't check get-nonhtml.
Also since you are limiting the mirror, you should check no-external pages so
you know where the mirror ends.
FYI, with non-html I got: 23108 links scanned, 23113 files written (276214686
bytes overall) I didn't see any incorrect pages.
| |