HTTrack Website Copier
Free software offline browser - FORUM
Subject: Getting a 404 from httrack, but not in browser
Author: Barry Parr
Date: 05/07/2010 01:29
 
I'm trying to archive a WordPress blog.

Although all the pages work correctly in the browser, httrack is getting 404s
for all individual entry pages, as well as for archives before January 2008.

I checked and the site doesn't appear to have a robots.txt file at the root
level. Any other thoughts on what might cause this to happen?  I'm using the
command line version under Linux:

httrack http://www.halfmoonbaymemories.com/ -O "/home/barry/websites/hmb" \
  "+*.halfmoonbaymemories.com/*" -v
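
The robots.txt check mentioned above can be repeated from the same Linux shell. This is only a rough sketch, assuming curl is installed; the helper names robots_url and http_status are made up for illustration and are not part of httrack:

```shell
# Build the robots.txt URL for a site root (pure string handling, no network).
robots_url() {
  # Strip one trailing slash, if present, so the result has a single "/"
  # before robots.txt.
  printf '%s/robots.txt\n' "${1%/}"
}

# Print only the HTTP status code the server returns for a URL.
# Assumes curl is available and the machine has network access.
http_status() {
  curl -s -o /dev/null -w '%{http_code}\n' "$1"
}

# Example (not run here):
#   http_status "$(robots_url http://www.halfmoonbaymemories.com/)"
```

A 404 from the example call would agree with there being no robots.txt at the root level; any other status would be worth a closer look.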
 





