HTTrack Website Copier
Free software offline browser - FORUM
Subject: httrack stopped pulling website reliably
Author: Konrad
Date: 08/07/2019 23:53
 
I regularly back up this site: <https://www.filmweb.pl/user/impactor/blog>

This is the command that used to work:

httrack http://www.filmweb.pl/user/impactor/blog -O
/mnt/raid1/backup/filmweb/spis-tresci_`date +%Y-%m-%d` -r4 -m3000000 -c8 -T30
+*http://www.filmweb.pl/user/impactor/blog/556944-Czy+powiniene%C5%9B+wierzy%C4%87+w+boga+Sprawd%C5%BAmy%21#komentarze*
+*http://www.filmweb.pl/user/impactor/blog/*
+*http://www.filmweb.pl/user/impactor/blog/556944-Czy+powiniene%C5%9B+wierzy%C4%87+w+boga+Sprawd%C5%BAmy%21#komentarze*
+*http://www.filmweb.pl/forum/inne* +*http://www.filmweb.pl/film/*discussion*
-*http://www.filmweb.pl/user/* +*.png +*.gif +*.jpg +*.css +*.js
-ad.doubleclick.net/*
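
For readability, the same command can be laid out one argument per line, with every filter single-quoted so the shell passes `*`, `#` and `%` through to httrack untouched. This is only a sketch of the command as posted, not a fix: the options and filters are copied verbatim (including the repeated filter), the variable names `start_url` and `out_dir` are made up for illustration, and the script prints the command instead of running it.

```shell
#!/bin/sh
# Sketch only: same options and filters as the command above, with quoting added.
# start_url and out_dir are illustrative names, not part of httrack.
start_url='http://www.filmweb.pl/user/impactor/blog'
out_dir="/mnt/raid1/backup/filmweb/spis-tresci_$(date +%Y-%m-%d)"

# Load the full argument list into the positional parameters.
# Filters are kept in their original order, repeated entry included.
set -- "$start_url" -O "$out_dir" -r4 -m3000000 -c8 -T30 \
  '+*http://www.filmweb.pl/user/impactor/blog/556944-Czy+powiniene%C5%9B+wierzy%C4%87+w+boga+Sprawd%C5%BAmy%21#komentarze*' \
  '+*http://www.filmweb.pl/user/impactor/blog/*' \
  '+*http://www.filmweb.pl/user/impactor/blog/556944-Czy+powiniene%C5%9B+wierzy%C4%87+w+boga+Sprawd%C5%BAmy%21#komentarze*' \
  '+*http://www.filmweb.pl/forum/inne*' \
  '+*http://www.filmweb.pl/film/*discussion*' \
  '-*http://www.filmweb.pl/user/*' \
  '+*.png' '+*.gif' '+*.jpg' '+*.css' '+*.js' \
  '-ad.doubleclick.net/*'

# Dry run: print the command with each argument quoted, without executing it.
printf 'httrack'
for arg in "$@"; do
  printf " '%s'" "$arg"
done
printf '\n'

# To actually run the mirror, replace the dry-run lines above with:
# httrack "$@"
```

Keeping the arguments in one quoted list makes it easier to comment out or reorder a single filter while testing which rule causes links to be treated as external.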


However, for about a year now, I have noticed that httrack no longer pulls all
the links from linked topics; they are displayed as external links instead.

I suspect it might have something to do with the full-screen ad (JavaScript?)
displayed when first visiting the site.

Is there a way to fix it?
 