| Hi! Just curious about HTTRACK's filtering abilities Re:Java
I've tried downloading part of a site with the following filters
-*
+www.pinkmonkey.com/studyguides/subjects/geometry/*
+*.png +*.gif +*.jpg +*.css -*.js -ad.doubleclick.net/*
-www.qksrv.net*
-www.lduhtrp.net*
-media.fastclick.net*
-as.casalemedia.com*
-pinkmonkey.com/phpadsnew*
-*google*
-pagead2.googlesyndication.com*
but still end up with the same Java script on every page that tries to link to
the net and run a popup window. By removing the following script in notepad I
found it was to blame.
<script language="Javascript"><!--
var d=new Date();var r=(d.getTime()%8673806982)+Math.random();var
u=escape(window.location.href);
var host=' language="Javascript" src=http://as.casalemedia.com/s?s=';
document.write('<scr'+'ipt'+host+'53806&u='+u+'&f=1&id='+r+'"></scr'+'ipt>');
//--></script>
I also noticed other scripts on the page seem to be modified so that
src=http://as.casalemedia.com/s?s=';
would be changed to something like
src="../../../../../external.html?link=http://as.casalemedia.com/s?s=';
Since I can turn off Java Scripting in Firefox anyway its no big deal, I'm
just curious as to what's happening here. Also, as a suggestion for future
versions of HTTRACK, some sort of tag filter would be great so you could just
remove anything between say <script language="Javascript"> and </script>
as a page is downloaded/parsed. Other than that from the one day I've been
using it so far it seems pretty good.
Thanks
Steve | |