HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Writing pages with script and parameters proposal
Author: Xavier Roche
Date: 08/20/2002 19:41
 
> The HTTrack could mirror the pages but it will mess 
> everything actually because the actual stuff is in 
> parameter to servlet.
> So here is my proposal: Could it be possible to add some 
> processing when writing files down to HDD that would allow 
> some changes of originating URL? I mean from this:

Argh, this would require some dirty code (and would require 
hacking the htsname.c routines)

http://delivery.netacad.net/servlet/DeliveryServlet/SessionID=1029853312210791,Engine=theme,Theme=cnamstheme,Client=ccna,CompleteCourse=knet-CRS47051/knet-

Ah.. there is NO query string separator ("?"), which SHOULD be 
included, such as:
http://delivery.netacad.net/servlet/DeliveryServlet/?SessionID=..
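
To illustrate the point about the missing separator, here is a small 
Python sketch (standard library only, not HTTrack code) showing how a 
generic URL parser splits the two forms. The shortened parameter values 
are made up for illustration:

```python
from urllib.parse import urlsplit

# Hypothetical, shortened variants of the URL discussed above.
without_sep = "http://delivery.netacad.net/servlet/DeliveryServlet/SessionID=1,Engine=theme"
with_sep = "http://delivery.netacad.net/servlet/DeliveryServlet/?SessionID=1,Engine=theme"

def describe(url):
    """Return (path, query) as seen by a generic URL parser."""
    parts = urlsplit(url)
    return parts.path, parts.query

# Without "?", the parameters are part of the path, so they end up
# in whatever file name a mirroring tool derives from the URL.
print(describe(without_sep))
# With "?", the parameters are recognised as a query string.
print(describe(with_sep))
```

In the first form the query part is empty and everything lands in the 
path, which is why the saved names come out odd.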

Unfortunately, with such a name, the engine will always 
attempt to use "strange" names

> What I suggest is only (in most simplistic case) to allow 
> some (regular maybe) expression to be replaced in source 
> URL. In my case it would mean replacing

Humm.. the regex routines are rather basic; but I don't 
want (yet) to include more complex ones ("true" regexp, 
like pcre) because many users still have problems using the 
current "basic" ones

You can also use a user-defined structure, and use the 
short names; such as:
%h%p/%sn%q.%t
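
For readers unfamiliar with the template above, here is a rough Python 
sketch of what its fields stand for, as far as the HTTrack documentation 
describes them: %h is the host, %p the path, %sn the short version of the 
file name, %q a small MD5 of the query string, and %t the file type. This 
is not HTTrack's implementation (that lives in htsname.c); the function 
name, the 8-character truncation and the 4-hex-digit hash are assumptions 
made only for illustration:

```python
import hashlib
import posixpath
from urllib.parse import urlsplit

def expand_structure(url, file_type="html"):
    """Hypothetical helper mimicking the template %h%p/%sn%q.%t."""
    parts = urlsplit(url)
    directory, name = posixpath.split(parts.path)
    directory = directory.rstrip("/")  # avoid a double slash for root-level files
    stem = posixpath.splitext(name)[0] or "index"
    short_name = stem[:8]  # assumption: "short" means an 8-character name
    # Assumption: the small query hash is the first 4 hex digits of the MD5.
    if parts.query:
        small_hash = hashlib.md5(parts.query.encode()).hexdigest()[:4]
    else:
        small_hash = ""
    return f"{parts.netloc}{directory}/{short_name}{small_hash}.{file_type}"

# Hypothetical URL with a proper "?" separator.
print(expand_structure(
    "http://www.example.com/servlet/DeliveryServlet?SessionID=1,Engine=theme"))
```

The idea is that two URLs differing only in their query string still get 
distinct, reasonably short local names.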

 