HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Downloading links only from a certain subdomain
Author: Nijaz
Date: 09/08/2020 15:13
 
To remove ads, you can add these lines to the scan rules, after all other
lines, so that they take higher priority:
-[name].adnxs.com
-[name].amazon-adsystem.com
-[name].google-analytics.com
-[name].uservoice.com

That site is more complicated. It should work with the default rules, but it
can't, because it uses URLs that HTTrack does not capture: they are not in an
href attribute but in a data-url attribute.

One extreme way would be to first download what you already can, then use
Notepad++ to search the downloaded files for "data-url" and replace it with
href or something similar, but that would probably break the website.
Also, some MathJax JS files would not download for me, which breaks the website.
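
If you prefer a script to Notepad++, the same search and replace could be
sketched in Python roughly like this. It is only an illustration of the idea
above, not something I have run against that site: the function names and the
assumption that the pages are saved as .html files are mine, and the same
warning applies that renaming the attribute may break the page's scripts.

```python
import re
from pathlib import Path

# Match "data-url" only as a whole attribute name followed by "=",
# so names like "my-data-url" or plain text mentions are left alone.
# A bytes pattern is used so file encodings never need to be guessed.
DATA_URL_ATTR = re.compile(rb'(?<![\w-])data-url(?=\s*=)')


def rewrite_data_url(html: bytes) -> bytes:
    """Return the page with every data-url attribute renamed to href."""
    return DATA_URL_ATTR.sub(b"href", html)


def rewrite_mirror(root: str) -> int:
    """Rewrite all .html files under root in place; return how many changed."""
    changed = 0
    for path in Path(root).rglob("*.html"):
        original = path.read_bytes()
        updated = rewrite_data_url(original)
        if updated != original:
            path.write_bytes(updated)
            changed += 1
    return changed
```

You would call rewrite_mirror("my_mirror") on a copy of the download folder
("my_mirror" is just a placeholder name), so the original files stay untouched
if the result turns out to be broken.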

My advice is to give up on that one and find a book or a similar site that is
easier to download. Also, I can't test everything because I lack the time and
the internet speed.

You can find some math books on b-ok.cc
 