Can I do this?
I need to collect certain archived images. The images are
output from the archive using a query string, in a URL
similar to this:
<http://mysite.com/ImageArchive.nsf?OpenDocument&Subject=category&Image=item1>
This produces a certain image. I know the format of the
image in each instance (mostly .GIF). I have generated a
List file of the links I need to capture (%L), and I can
instruct HTTrack to capture these by telling it not to get
HTML files and assume that other files are of a specified
type, but I can't persuade it to name the files sensibly.
The only success I get is when I allow HTTrack to name the
outputs with 'MD5' hashing codes, whereupon I get a
directory full of files named "image5f6e.gif," etc.
There are too many items to go through by hand renaming,
when I already have the names listed in my list file.
How do I persuade HTTrack to name the link it grabs in a
manner that is at least similar to the link at which it was
pointed?
Something like this would be the best output of all:
ImageArchive.nsf~OpenDocument&Subject=category&Image=item1.gif
but I could live with capturing just the last name/value
pair:
Image=item1.gif
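
To make the two target names concrete, here is a minimal Python sketch that derives both from a link in the list file. It is not an HTTrack feature, only an illustration of the naming I am after; the function names and the default ".gif" extension are my own assumptions.

```python
from urllib.parse import urlsplit

def full_name(url, ext=".gif"):
    """Document name, '~' in place of '?', then the whole query string."""
    parts = urlsplit(url)
    # Keep only the last path segment, e.g. "ImageArchive.nsf".
    base = parts.path.rsplit("/", 1)[-1]
    return f"{base}~{parts.query}{ext}"

def last_pair_name(url, ext=".gif"):
    """Only the final name/value pair of the query string."""
    query = urlsplit(url).query
    # Everything after the last '&', e.g. "Image=item1".
    last = query.rsplit("&", 1)[-1]
    return f"{last}{ext}"

url = "http://mysite.com/ImageArchive.nsf?OpenDocument&Subject=category&Image=item1"
print(full_name(url))
print(last_pair_name(url))
```

Note that '&' and '=' are legal in file names on most systems but awkward in shells, which is part of why an encoded form might be safer.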
*Suggested Feature*
Can we have HTTrack automatically percent-encode the URL?
This would then require far fewer occasions where 'MD5'
hashing is the only sensible course of action. In the
above example, the link would be saved as something like
this:
mysite.com/ImageArchive.nsf%3FOpenDocument%26Subject%3Dcategory%26Image%3Ditem1.gif
with hashing codes being used only where duplicates occur
(or where the option is specified).
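
As a rough sketch of the encoding I have in mind (again plain Python, not anything HTTrack does today), the standard library's `urllib.parse.quote` produces hexadecimal percent-escapes, so '?' becomes %3F, '&' becomes %26 and '=' becomes %3D. The function name and the default ".gif" extension are assumptions for illustration.

```python
from urllib.parse import quote, urlsplit

def encoded_name(url, ext=".gif"):
    """Host + path + query, percent-encoded into a file-system-safe name."""
    parts = urlsplit(url)
    raw = f"{parts.netloc}{parts.path}?{parts.query}"
    # safe="/" leaves directory separators alone; '?', '&' and '='
    # are escaped as %3F, %26 and %3D respectively.
    return quote(raw, safe="/") + ext

url = "http://mysite.com/ImageArchive.nsf?OpenDocument&Subject=category&Image=item1"
print(encoded_name(url))
```

Because the encoding is reversible, distinct URLs give distinct names, so a hash would only be needed where the resulting name is too long for the file system.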
What do you think?
Thanks for listening, and for any help anyone can provide...
Hopefully,
Graham J. Ellis