HTTrack Website Copier
Free software offline browser - FORUM
Subject: Remove sid parameter from any link on phpbb forum
Author: PeterPan
Date: 07/16/2018 22:50
 
Hello,

thanks for httrack!

I would like to use it to make a copy of my phpbb3 board.

How can I setup httrack to ignore the sid query string parameter in any link
that it scans/finds?
So for example httrack scans my board and finds links like...

www.mysupercoolforum.com/forum/bla.php&sid=23423423
www.mysupercoolforum.com/forum/bla.php&sid=999393

...then I want httrack to ignore "sid=23423423" and "sid=999393" thus treating
the two links as the same link.

I have the feeling the sid causes a lot of trouble and duplication and even
too many links (especially when for some reason a new session id is generated
which makes all links appear to be a new link for httrack). It also makes it
impossible to continue a mirroring session the next day since httrack will
probably get a new session id from my board thus making all links appear as
new links again and so on. I hope you get my point.

So please is there a way to tell httrack to ignore the "sid=somenumber" part
in any scanned link?
Help is really appreciated!
 
Reply


All articles

Subject Author Date
Remove sid parameter from any link on phpbb forum

07/16/2018 22:50




4

Created with FORUM 2.0.11