| Hi!
First of all HTTrack is GREAT!!! Thanx to the authors (esp.
Xavier)...
Doing mirrors of some pages I encountered this special
type. The URL looks like this:
<http://delivery.netacad.net/servlet/DeliveryServlet/SessionI>
D=1029853312210791,Engine=theme,Theme=cnamstheme,Client=ccna
,CompleteCourse=knet-CRS47051/knet-
CRS47051/knet/CRS47051/courseframeset.html
I manage to have the offline version which stores this
exact file as: knet-CRS47051/knet-
CRS47051/knet/CRS47051/courseframeset.html
The links inside are of relative form, so they look like
this: <td colspan="3"><img height="110" width="750"
border="0"
src="../../images/ccna/common/intro/top.gif"></td>
The HTTrack could mirror the pages but it will mess
everything actually because the actual stuff is in
parameter to servlet.
So here is my proposal: Could it be possible to add some
processing when writing files down to HDD that would allow
some changes of originating URL? I mean from this:
<http://delivery.netacad.net/servlet/DeliveryServlet/SessionI>
D=1029853312210791,Engine=theme,Theme=cnamstheme,Client=ccna
,CompleteCourse=knet-CRS47051/knet-
CRS47051/knet/CRS47051/courseframeset.html
create something like this:
<http://delivery.netacad.net/knet-CRS47051/knet>-
CRS47051/knet/CRS47051/courseframeset.html
just for the purpose of writing down. The files would be
stored OK, no hassling with parameters, no cryptic
filenames.
What I suggest is only (in most simplistic case) to allow
some (regular maybe) expression to be replaced in source
URL. In my case it would mean replacing
servlet/DeliveryServlet/SessionID=1029853312210791,Engine=th
eme,Theme=cnamstheme,Client=ccna,CompleteCourse=
with nothing. It would also allow some more tweaking I
believe.
Thanx a lot.
Rex
| |