Re: download only pdf - HTTrack Website Copier Forum

Subject: Re: download only pdf

Author: Tiago Paolini

Date: 12/23/2013 17:09

You should also add a filter to crawl the domain where the PDFs are, then (on
the page where you enter the sites) choose the mirroring mode "Get separated
files" instead of "Download website(s)". The filter will be something like:

-* +philmat.oxfordjournals.org/* +*.full.pdf

This way HTTrack will crawl the HTML pages, but only save the PDF files it
finds. The MIME type filters only work for pages that were already scheduled
for download.
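
To make the order of those three rules easier to follow, here is a rough Python sketch of how such a list could be evaluated. It assumes that rules are checked from first to last and that a later matching rule overrides an earlier one, which is how the HTTrack filter documentation describes priority. The function name `allowed`, the example URLs other than the philmat domain, and the use of `fnmatch` wildcards are my own simplifications for illustration, not HTTrack's actual code.

```python
from fnmatch import fnmatchcase

# The same three scan rules as above, in the same order.
RULES = ["-*", "+philmat.oxfordjournals.org/*", "+*.full.pdf"]


def allowed(url, rules, default=False):
    """Return True if `url` is accepted by the rule list.

    Simplified model (an assumption, not HTTrack's implementation):
    every rule is tried in order, and the last rule whose pattern
    matches decides the outcome. URLs are written without the scheme.
    """
    verdict = default
    for rule in rules:
        sign, pattern = rule[0], rule[1:]
        if fnmatchcase(url, pattern):
            verdict = (sign == "+")
    return verdict


if __name__ == "__main__":
    # Hypothetical URLs, chosen only to exercise each rule.
    for url in (
        "philmat.oxfordjournals.org/content/21/3/toc",
        "philmat.oxfordjournals.org/content/21/3/1.full.pdf",
        "www.example.com/index.html",
    ):
        print(url, "->", "accept" if allowed(url, RULES) else "reject")
```

In this model "-*" rejects everything first, the domain rule then lets the crawler follow pages on philmat.oxfordjournals.org, and "+*.full.pdf" accepts the PDF links themselves. Swapping the order so that "-*" comes last would reject every URL, which is why the exclusion goes first.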
