HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Embedded pdf
Author: WHRoeder
Date: 05/06/2013 14:48
 
1) Always post the ACTUAL command line used (or log file line two) so we know
what the site is, what ALL your settings are, etc.
2) Always post the URLs you're not getting and the URL they are referenced
from.
3) Always post anything USEFUL from the log file.
4) If you want everything, use the near flag (get non-html files related), not
filters.
5) I always run with:
   A) No External Pages, so I know where the mirror ends.
   B) Browser ID=msie 6 from the pulldown, as some sites don't like an HTT one.
   C) Attempt to detect all links (for JS/CSS).
   D) Timeout=60, retry=9, so temporary network interruptions don't delete
   files.
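
For anyone on the command line rather than the GUI, here is a rough sketch of
how the settings in 4) and 5) might look. The mapping from GUI options to
short flags is from memory, so treat it as an assumption and check it against
`httrack --help`. The URL, the output directory and the exact browser ID
string are placeholders, not anything from this thread.

```python
import shlex


def build_httrack_args(url, out_dir):
    """Return an httrack argument list approximating points 4) and 5)."""
    return [
        "httrack", url,
        "-O", out_dir,  # output directory (placeholder path)
        "-n",           # near flag: get non-html files related to a page
        "-x",           # assumed equivalent of "No External Pages"
        "-%P",          # extended parsing: attempt to detect all links
        "-F", "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)",  # example MSIE 6 browser ID
        "-T60",         # timeout of 60 seconds
        "-R9",          # up to 9 retries
    ]


args = build_httrack_args("http://www.example.com/", "./mirror")
# Print a copy-and-paste command line; nothing is executed here.
print(shlex.join(args))
```

This only prints the command; run it yourself once you have confirmed the
flags match your HTTrack version.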

> Hello, I've been trying to capture an embedded pdf.

There are NO PDF URLs on the page to capture! Only a SWF.

HTT will capture the SWF just fine. But when you try to use it, the SWF will
try to read its data from the server. In the mirror, the server is your PC -
the file doesn't exist and it fails. (Same problem with video sites.)

Only if the actual PDF URL were passed to the SWF, and you used extended
parsing, would the PDF have been downloaded and the parameter modified so that
it would work.
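
If you want to check a page yourself before mirroring, something like the
following sketch lists any attribute values in the HTML that point at a .pdf or
a .swf. It only looks at the static HTML (so it sees roughly what a parser
sees, not what the SWF requests at run time), and the sample markup and the
name viewer.swf are made up for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class EmbedScanner(HTMLParser):
    """Collect attribute values whose URL path ends in .pdf or .swf."""

    def __init__(self):
        super().__init__()
        self.found = {".pdf": [], ".swf": []}

    def handle_starttag(self, tag, attrs):
        for _name, value in attrs:
            if not value:
                continue
            # Compare on the path only, so query strings are ignored.
            path = urlparse(value).path.lower()
            for ext in self.found:
                if path.endswith(ext):
                    self.found[ext].append(value)


def scan(html):
    """Return a dict mapping '.pdf' and '.swf' to the URLs found in html."""
    scanner = EmbedScanner()
    scanner.feed(html)
    scanner.close()
    return scanner.found


# Hypothetical page: a Flash viewer with no PDF URL anywhere in the markup.
sample = """
<object data="viewer.swf" type="application/x-shockwave-flash">
  <param name="movie" value="viewer.swf">
</object>
"""

result = scan(sample)
print("PDF URLs:", result[".pdf"])
print("SWF URLs:", result[".swf"])
```

An empty PDF list with a non-empty SWF list is the situation described above:
there is nothing for HTT to fetch except the viewer itself.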
 