HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: It is great but should be greater
Author: Xavier Roche
Date: 03/19/2002 07:18
 
> I cannot use httrack for a purpose that would be the
> most useful, e.g. to return documents from a
> documentary database on the web.
> In the database there are generated index files
> containing links (to documents) like this:
> http://www.somewebsite.com/docs/pdfgate.cgi?id=2&filename=somefilename.pdf

Enable 'Force old HTTP/1.0 requests (no 1.1)' in 'Set 
options'/'Spider'; this may give you the correct file 
types. Also make sure that you have not defined any MIME 
type associations (such as cgi -> text/html).

Besides, if you want to use some of the information in 
the query string, such as the 'filename' parameter, go 
to 'Set options'/'Build' and select 'User-defined 
structure'. Then select 'Options' in this subtab, and 
enter the following filesystem mask:
%h%p/%n%[filename].%t

This will name all documents using the embedded 
parameter, such as
http://www.somewebsite.com/docs/pdfgate.cgi?id=2&filename=somefilename.pdf
->
C:\My Web Sites\foobar\www.somewebsite.com\docs\pdfgatesomefilename.pdf.pdf

Yes, the .pdf.pdf is ugly; you could instead use:
%h%p/%n%[filename]

But in this case some HTML files will be badly named, 
because the mask no longer appends the file type.
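
To see what the two masks produce, here is a rough Python sketch that expands them by hand. It is not HTTrack's code, only an illustration: expand_mask and its file_type argument are made-up names, and it assumes %h is the host, %p the path, %n the file name without its extension, %[key] a query-string parameter, and %t the file type.

```python
import posixpath
import re
from urllib.parse import parse_qs, urlsplit

# Matches either %[key] (query parameter) or one of %h, %p, %n, %t.
TOKEN = re.compile(r"%\[([^\]]+)\]|%([hpnt])")


def expand_mask(mask, url, file_type):
    """Expand a small subset of mask tokens for one URL (illustration only)."""
    parts = urlsplit(url)
    directory, filename = posixpath.split(parts.path)
    name, _ext = posixpath.splitext(filename)
    query = parse_qs(parts.query)
    simple = {"h": parts.netloc, "p": directory, "n": name, "t": file_type}

    def substitute(match):
        if match.group(1) is not None:
            # %[key]: value of the query parameter, empty if it is absent
            return query.get(match.group(1), [""])[0]
        return simple[match.group(2)]

    # Single pass, so substituted values are never re-expanded.
    return TOKEN.sub(substitute, mask)


pdf_url = ("http://www.somewebsite.com/docs/pdfgate.cgi"
           "?id=2&filename=somefilename.pdf")
html_url = "http://www.somewebsite.com/docs/index.html"

print(expand_mask("%h%p/%n%[filename].%t", pdf_url, "pdf"))
# www.somewebsite.com/docs/pdfgatesomefilename.pdf.pdf
print(expand_mask("%h%p/%n%[filename]", pdf_url, "pdf"))
# www.somewebsite.com/docs/pdfgatesomefilename.pdf
print(expand_mask("%h%p/%n%[filename]", html_url, "html"))
# www.somewebsite.com/docs/index
```

The last line shows the trade-off: under these assumptions, a page without a 'filename' parameter ends up with no extension at all when the mask has no %t.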

(You cannot yet select specific masks for specific 
files; this may be added in the future.)
 

