HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: way to grab what's after a ? in the file name
Author: Haudy Kazemi
Date: 12/29/2002 08:06
 
> > the boxscores are organized with an id like:
> > <http://sports.espn.go.com/nfl/boxscore?gameId=221208010>
> > and when saving it stops before the ? in the file name.
> > So i am wondering if there is a way to somehow grab the 
> > info after the ? because it contains pertinant 
> information 
> > such as the date the game was played and the home 
team's 
> > ID number.
> 
> Options / Build / Local structure: options
> %h%p/%n%[gameId].%t
> 
> (in the upcoming release, you'll also be able to do more 
> advanced things with the %[] rule thanks to media-access 
> support)
> 
> 
The relevant portion from Fred Cohen's guide:
---------------------------------------------
This new option is important: you can include query-string 
content when forming the destination filename!

Example: you are mirroring a huge website, with many pages 
named as:
www.foo.com/catalog.php3?page=engineering
www.foo.com/catalog.php3?page=biology
www.foo.com/catalog.php3?page=computing
..


 Then you can use the -N option:

httrack www.foo.com -N "%h%p/%n%[page].%t"


 If found, the "page" parameter will be included after
the filename, and the URLs above will be saved as:

/home/mywebsites/foo/www.foo.com/catalogengineering.php3
/home/mywebsites/foo/www.foo.com/catalogbiology.php3
/home/mywebsites/foo/www.foo.com/catalogcomputing.php3
...
 
Reply Create subthread


All articles

Subject Author Date
way to grab what's after a ? in the file name

12/14/2002 20:07
Re: way to grab what's after a ? in the file name

12/14/2002 20:24
Re: way to grab what's after a ? in the file name

12/29/2002 08:02
Re: way to grab what's after a ? in the file name

12/29/2002 08:06




4

Created with FORUM 2.0.11