HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: download only pdf
Author: Tiago Paolini
Date: 12/23/2013 17:09
 
You should also add a filter to crawl the domain where the PDFs are, then (at
the page you input the sites) chose the mirroring mode as "Get separated
files" intead of "Download website(s)". It will be something like:

-* +philmat.oxfordjournals.org/* +*.full.pdf

This way HTTrack will crawl the HTML pages, but only save the PDF files it
finds. The MIME type filters only work for pages that where already scheduled
for download.
 
Reply Create subthread


All articles

Subject Author Date
download only pdf

12/23/2013 12:09
Re: download only pdf

12/23/2013 17:09
Re: download only pdf

12/24/2013 09:30
Re: download only pdf

12/25/2013 06:03




0

Created with FORUM 2.0.11