I'm trying to spider some websites to catalog information, and I want to
build URI lists from the data in HTTRACK's hts-cache/new.txt file. I am
using this HTTRACK command:
httrack --ext-depth=3 --depth=3 --can-go-up-and-down --timeout=10 -e
--user-agent "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)"
www.somewebsite.com
The HTTRACK command runs fine, but the hts-cache/new.txt file doesn't
report any of the external links found on the last page spidered.
Does anyone know why hts-cache/new.txt doesn't include the external
links on the last spidered page?
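
For context, here is a minimal sketch of the kind of URI-list extraction described above. It does not rely on a particular column layout in new.txt; it only assumes that URLs appear in the file with an http:// or https:// scheme. The function names and the regular expression are illustrative, not part of HTTRACK.

```python
import re
from urllib.parse import urlsplit

# Assumption: URLs in new.txt are written with an explicit scheme.
# Parentheses are excluded so "(from http://host/)" yields a clean URL.
URL_RE = re.compile(r"https?://[^\s()]+")


def extract_urls(lines):
    """Return every URL found in the given lines, de-duplicated, in first-seen order."""
    seen = set()
    urls = []
    for line in lines:
        for url in URL_RE.findall(line):
            if url not in seen:
                seen.add(url)
                urls.append(url)
    return urls


def external_urls(urls, site_host):
    """Keep only the URLs whose host differs from the spidered site's host."""
    return [u for u in urls if urlsplit(u).hostname != site_host]


# Hypothetical usage against the cache file from the command above:
#   with open("hts-cache/new.txt", encoding="utf-8", errors="replace") as fh:
#       all_urls = extract_urls(fh)
#   print(external_urls(all_urls, "www.somewebsite.com"))
```

Comparing the output of something like this with the links actually present on the last spidered page should show exactly which external URIs are missing from new.txt.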