HTTrack Website Copier
Free software offline browser - FORUM
Subject: how to process numbered directories
Author: Kevin
Date: 09/19/2011 19:47
 
I'm trying to mirror a site that has numbered directories off the root, with
PDF and ZIP files under each numbered directory.  Those are the only two
file types I want.

If I simply point HTTrack at the root site, I get a "forbidden" error even
after I log on to the main site (see below for more details).

The site's structure is as follows (these are examples, not the real URLs,
but close):

You log on at <https://sitea.customerhub.net>

The pdfs are from <https://s3.amazonaws.com/sitearesources/63/wcsneray.pdf>

The zip files are from <http://sitearesources.s3.amazonaws.com/63/wcsneray.zip>

Other content varies by directory number and by the filename under the
numbered directory, e.g.
<https://s3.amazonaws.com/sitearesources/110/mycontentb.pdf> and
<http://sitearesources.s3.amazonaws.com/110/mycontentb.zip>.
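To make the pattern concrete, here is a rough Python sketch of how the two
URL shapes above are put together.  The function name resource_urls is made
up for illustration, and the bucket and filenames are the example values
from this post, not the real ones.  Note that the filename differs in every
numbered directory, so the number alone is not enough to build a URL.

```python
# Hypothetical helper: bucket and filenames are the example values from this post.
BUCKET = "sitearesources"


def resource_urls(number: int, basename: str) -> dict:
    """Return the PDF and ZIP URLs for one numbered directory."""
    return {
        # PDFs use the path-style URL (bucket in the path) over HTTPS.
        "pdf": f"https://s3.amazonaws.com/{BUCKET}/{number}/{basename}.pdf",
        # ZIPs use the bucket-as-hostname URL over plain HTTP.
        "zip": f"http://{BUCKET}.s3.amazonaws.com/{number}/{basename}.zip",
    }


if __name__ == "__main__":
    # Print both URLs for the two example directories mentioned above.
    for number, basename in [(63, "wcsneray"), (110, "mycontentb")]:
        for kind, url in resource_urls(number, basename).items():
            print(kind, url)
```

So the PDFs and ZIPs sit on two different hostnames and use two different
schemes, even though they belong to the same numbered directory.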

How can I have HTTrack walk the numbered folders from 1 on up and download the
content?
Thanks.

Kevin

 