HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: PDF retrieval from plone website does not work
Author: Hamish Currie
Date: 11/29/2008 04:57
 
Hi,

I had a similar problem mirroring a website running plone. I was getting
errors such as this in the log:

10:59:47	Error: 	Unable to save file
/home/hamish/glazesorg/glazes.org/images/book.jpg : Is a directory

I discovered that I could use wget to fix the problem:

wget -m <http://glazes.org/images/book.jpg>

I automated it by processing the log file, creating a file containing all of
the wget commands and then executing the file. 

#Create a file called fixme.txt whith lines like this:
glazes.org/images/book.jpg

> cat hts-log.txt | grep "Is a directory" | cut -d\  -f6 | cut -d/  -f8- >
fixme.txt

#Create a file called cods.txt with lines like:
wget -m <http://glazes.org/images/book.jpg>

>for f in `cat fixme.txt`; do echo "wget -m <http://$f> >> cmds.txt ; done

#Execute the commands
sh < cmds.txt 


I hope that's useful to someone.

Hamish
 
Reply Create subthread


All articles

Subject Author Date
PDF retrieval from plone website does not work

11/04/2007 13:47
Re: PDF retrieval from plone website does not work

11/05/2007 08:04
Re: PDF retrieval from plone website does not work

11/05/2007 22:45
Re: PDF retrieval from plone website does not work

11/07/2007 11:43
Re: PDF retrieval from plone website does not work

01/25/2008 12:20
Re: PDF retrieval from plone website does not work

11/29/2008 04:57




7

Created with FORUM 2.0.11