Thanks, I follow your suggestion.
I did another scan of a different site, which went fine.
One difference between the two scans was that for the problematic one, I had
instructed the crawl to ignore robots.txt, because myfriend.com had blocked
certain content from robots (not wanting it on Google).
I am curious why the filter code you suggest might be necessary to the proper
operation of httrack.