Subject: Getting a 404 from httrack, but not in browser |
Author: Barry Parr |
Date: 05/07/2010 01:29 |
| I'm trying to archive a Wordpress blog.
Although all the pages work correctly in the browser, httrack is getting 404's
for all individual entry pages, as well as archives before January 2008.
I checked and the site doesn't appear to have a robots.txt file at the root
level. Any other thoughts on what might cause this to happen? I'm using the
command line version under Linux:
httrack <http://www.halfmoonbaymemories.com/> -O "/home/barry/websites/hmb"
"+*.halfmoonbaymemories.com/*" -v | |
|
|
|
|