HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Trying to grab download pdfs
Author: William Roeder
Date: 07/29/2011 16:54
 
> the wildcards in your working scan rules. Where do I
> fall down?> -*
> +edition.tefl.net/category/talking-point/   (1)
> +edition.tefl.net/category/talking-point/*/ (2)
> +edition.tefl.net/wp-content/uploads/*/*/*  (3)
> +*.pdf                                      (4)

(1) only enables the starting url which is always done. Useless filter.
(2) The sub pages are: TP Worksheet: Up in Arms? -
<http://edition.tefl.net/talking-point/arms/> 
No /category/ so the filter does nothing. No sub-pages, no pdfs.
(3) The uploads are like: Talking Point: Up in Arms? -
<http://edition.tefl.net/wp-content/uploads/2010/10/TP_Up-in-Arms.pdf>
+uploads/*/*/* allows anything 3 (or more) levels down from uploads. Would
work except for (2)
(4) allows just pdfs from anywhere. Would work except for (2)
 
Reply Create subthread


All articles

Subject Author Date
Trying to grab download pdfs

07/27/2011 13:48
Re: Trying to grab download pdfs

07/27/2011 13:50
Re: Trying to grab download pdfs

07/27/2011 16:38
Re: Trying to grab download pdfs

07/28/2011 05:46
Re: Trying to grab download pdfs

07/28/2011 11:08
Re: Trying to grab download pdfs

07/28/2011 17:29
Re: Trying to grab download pdfs

07/28/2011 17:29
Re: Trying to grab download pdfs

07/29/2011 09:21
Re: Trying to grab download pdfs

07/29/2011 16:54




3

Created with FORUM 2.0.11