HTTrack Website Copier
Free software offline browser - FORUM
Subject: Site using .htaccess 404 redirect to php script
Author: Gary
Date: 04/30/2007 18:30
 
Hi,
I was wondering if anyone has tried saving a site using a custom error page
directive in the .htaccess file.

It's done to process urls such as cisca.org/convention, where "convention" is
actually just a tag that gets looked up and served from a db. 

Looks like winhttrack is hitting the 404 and deciding (logically) there's
nothing there. Is there a way around this?
It saved the top-level links just fine, even though they use the same 404
scheme. For some reason I can't get the inner links to be followed. I tried
adding them as primary targets and updating, to no avail. I have no limits set
on depth, BTW. Any hints would be greatly appreciated.

Thanks,
Gary

Log from run: 

HTTrack3.41-2+htsswf+htsjava launched on Mon, 30 Apr 2007 11:14:05 at
<http://cisca.org/convention> <http://cisca.org/publications>
<http://cisca.org/sponsorship> +*.css +*.js -ad.doubleclick.net/*
-mime:application/foobar +*.gif +*.jpg +*.png +*.tif +*.bmp +*.zip +*.tar
+*.tgz +*.gz +*.rar +*.z +*.exe +*.pdf
(winhttrack -qi%e0C2%Ps0u1%s%uN0%I0p3DaK0H0%kf2A25000%f#f -F "Mozilla/4.5
(compatible; HTTrack 3.0x; Windows 98)" -%F "<!-- Mirrored from %s%s by
HTTrack Website Copier/3.x [XR&CO'2007], %s -->" -%l "en, en, *"
<http://cisca.org/convention> <http://cisca.org/publications>
<http://cisca.org/sponsorship> -O1 "X:\Downloaded Web Sites\CISCA" +*.css +*.js
-ad.doubleclick.net/* -mime:application/foobar +*.gif +*.jpg +*.png +*.tif
+*.bmp +*.zip +*.tar +*.tgz +*.gz +*.rar +*.z +*.exe +*.pdf )
Information, Warnings and Errors reported for this mirror:
note: the hts-log.txt file, and hts-cache folder, may contain sensitive
information,
 such as username/password authentication for websites mirrored in this
project
 do not share these files/folders if you want these information to remain
private
11:14:09 Error:  "Not Found" (404) at link cisca.org/convention (from
primary/primary)
11:14:09 Error:  "Not Found" (404) at link cisca.org/publications (from
primary/primary)
11:14:09 Error:  "Not Found" (404) at link cisca.org/sponsorship (from
primary/primary)
11:14:09 Info:  No data seems to have been transfered during this session! :
restoring previous one!
 
Reply


All articles

Subject Author Date
Site using .htaccess 404 redirect to php script

04/30/2007 18:30
Re: Site using .htaccess 404 redirect to php script

04/30/2007 18:44




c

Created with FORUM 2.0.11