| I regularly backup this site: <https://www.filmweb.pl/user/impactor/blog>
This is the command that used to work:
httrack <http://www.filmweb.pl/user/impactor/blog> -O
/mnt/raid1/backup/filmweb/spis-tresci_`date +%Y-%m-%d` -r4 -m3000000 -c8 -T30
+*http://www.filmweb.pl/user/impactor/blog/556944-Czy+powiniene%C5%9B+wierzy%C4%87+w+boga+Sprawd%C5%BAmy%21#komentarze*
+*http://www.filmweb.pl/user/impactor/blog/*
+*http://www.filmweb.pl/user/impactor/blog/556944-Czy+powiniene%C5%9B+wierzy%C4%87+w+boga+Sprawd%C5%BAmy%21#komentarze*
+*http://www.filmweb.pl/forum/inne* +*http://www.filmweb.pl/film/*discussion*
-*http://www.filmweb.pl/user/* +*.png +*.gif +*.jpg +*.css +*.js
-ad.doubleclick.net/*
However, since about a year, I noticed httrack stopped pulling all the links
from linked topics and they are displayed as external.
I suspect it might have something to do with the full-screen (java script?) ad
displayed when first visiting the site.
Is there a way to fix it? | |