| Ok, so I did a bit of searching and found this:
<http://forum.httrack.com/readmsg/10940/index.html?pid=10931&days=10000&js=1&lang=en>
A lot of it is Greek to me, but this seems to be the part I'm interested in:
"
> 2. How to adjust HTTrack so the saved static pages are
> named similar to the original ASP page URL. For example,
> one ASP page has the domain address with the following
> added to the URL "/asp-rel/article.asp?id=realty-times-
> 0958.shtml". I would like to know how to configure
HTTrack
> to save the static page with the filename of "realty-
times-
> 0958.shtml" and not add the random numbers at the end or
> use the html file extension.
"
See 'Set Options' / 'Build Structure' / 'User-Defined
Structure'
with something like:
%r/%h%p/%n%[id:-:::].%t
But what does all that mean? Is there something I'm missing in the help file
that explains that code?
Since it seems HTTrack might actually be able to do what I need, I'll be more
specific.
Say I have a list of URLs like this:
www.example.com/example/abcdefg
www.example.com/example/hijklmn
www.example.com/example/opqrstu
The pages are PHP I think. I know how to get all my URLs into HTTrack. How
do I:
1) get HTTrack to put each page, and all its embedded images, right into one
folder (no site structure, should look just like I saved the complete page
with Firefox or IE)
2) get HTTrack to name the HTML files thusly:
abcdefg.HTML
hijklmn.HTML
opqrstuv.HTML
and the folders (or the images, if HTTrack will only dump them into the same
folder) with the images the same way?
| |