HTTrack Website Copier
Free software offline browser - FORUM
Subject: Curious about Javascript filtering problem
Author: Steve_W
Date: 12/26/2005 18:10
 
Hi! Just curious about HTTrack's filtering abilities regarding JavaScript.

I've tried downloading part of a site with the following filters:

-*
+www.pinkmonkey.com/studyguides/subjects/geometry/*
+*.png +*.gif +*.jpg +*.css -*.js -ad.doubleclick.net/*
-www.qksrv.net*
-www.lduhtrp.net*
-media.fastclick.net*
-as.casalemedia.com*
-pinkmonkey.com/phpadsnew*
-*google*
-pagead2.googlesyndication.com*

but still end up with the same JavaScript on every page, which tries to
connect to the net and open a popup window. By removing the following script
in Notepad, I found it was to blame.

<script language="Javascript"><!--
var d=new Date();var r=(d.getTime()%8673806982)+Math.random();var
u=escape(window.location.href);
var host=' language="Javascript" src=http://as.casalemedia.com/s?s=';
document.write('<scr'+'ipt'+host+'53806&u='+u+'&f=1&id='+r+'"></scr'+'ipt>');
//--></script>

I also noticed other scripts on the page seem to be modified so that

src=http://as.casalemedia.com/s?s='; 
would be changed to something like
src="../../../../../external.html?link=http://as.casalemedia.com/s?s=';
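Judging only from the one example above, the rewritten form looks like the original address wrapped in a local external.html placeholder. A minimal sketch, assuming every rewritten link has the shape "external.html?link=" followed by the untouched original URL; the function name originalTarget is hypothetical and not part of HTTrack:

```javascript
// Hypothetical helper (not an HTTrack API): recovers the original address
// from a link rewritten to the form ".../external.html?link=<original URL>".
// Assumption: the original URL follows the marker unmodified, as in the
// single example quoted in this post.
function originalTarget(rewritten) {
  const marker = 'external.html?link=';
  const at = rewritten.indexOf(marker);
  // Links that do not have the rewritten form are returned unchanged.
  return at === -1 ? rewritten : rewritten.slice(at + marker.length);
}

console.log(
  originalTarget('../../../../../external.html?link=http://as.casalemedia.com/s?s=')
);
```

This only helps to see which external address a rewritten link pointed to; it does not explain why the inline script itself survived the filters.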

Since I can turn off JavaScript in Firefox anyway, it's no big deal; I'm
just curious as to what's happening here. Also, as a suggestion for future
versions of HTTrack, some sort of tag filter would be great, so you could just
remove anything between, say, <script language="Javascript"> and </script>
as a page is downloaded/parsed. Other than that, from the one day I've been
using it so far, it seems pretty good.
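
Until something like that exists, the same effect can be approximated by post-processing the saved pages. A rough sketch, assuming the page's HTML is already loaded as a string; stripScriptBlocks is a hypothetical name, not an HTTrack feature, and a regular expression is only a crude stand-in for a real HTML parser:

```javascript
// Hypothetical post-processing helper (not part of HTTrack): removes every
// <script ...> ... </script> block from a page's HTML text.
function stripScriptBlocks(html) {
  // [\s\S]*? matches across line breaks and stops at the nearest closing
  // tag; the "i" flag covers <SCRIPT> as well as <script>.
  return html.replace(/<script\b[^>]*>[\s\S]*?<\/script\s*>/gi, '');
}

// Example page fragment modelled on the ad script quoted in this post.
const samplePage =
  '<p>Geometry notes</p>' +
  '<script language="Javascript"><!--\n' +
  "document.write('<scr'+'ipt src=x></scr'+'ipt>');\n" +
  '//--></script>' +
  '<p>More notes</p>';

console.log(stripScriptBlocks(samplePage));
```

Because the ad script splits its own tag into '<scr'+'ipt', the pattern matches only the real opening and closing tags, so the whole block is removed in one piece.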

Thanks
Steve
 