Re: why does "maximum external depth" not work?

Author: Jeremy

Date: 09/02/2019 12:14

PS
To expand on the last couple of paragraphs: I know I could exclude all HTML
and then whitelist particular sites (such as my target site), but (a) this
won't work with Wiki*edia.org (unless there is a scan filter for MIME type,
which I can't find documented), and (b) I should be able to grab the HTML from
external sites when it is directly linked, without fearing that the crawl will
then continue through the entire site from there.
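
To make the behaviour I expected concrete, here is a minimal sketch in Python. It is not HTTrack's code and may not match how HTTrack actually counts depth. The function names and the hosts `target.example` and `other.example` are hypothetical, and I am assuming that "external depth" means the number of links followed since leaving the target site.

```python
from urllib.parse import urlparse


def external_depth_of(link_url, parent_depth, target_host):
    """Depth of a linked page, counted in hops since leaving target_host.

    Pages on the target site have depth 0 (assumed semantics, not HTTrack's).
    """
    host = urlparse(link_url).hostname or ""
    if host == target_host:
        return 0
    # Any page on another host is one hop further out than its parent.
    return parent_depth + 1


def should_fetch(link_url, parent_depth, target_host, max_external_depth=1):
    """Fetch a link only if it stays within the allowed external depth."""
    depth = external_depth_of(link_url, parent_depth, target_host)
    return depth <= max_external_depth
```

With a maximum external depth of 1, an external page linked directly from the target site is fetched, but links found on that external page that lead to other external pages are not followed, so the crawl cannot spread across the whole external site.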
