HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Used filters but files in subfolder don't download
Author: Grant
Date: 01/06/2013 08:10
 
Looking at the problem... you cannot download the pngs directly. Httrack has to
download htmls first, and parse them to find links that point to the png
files. So both html and png files need to be downloaded, and nothing else.

Looking below, I've specified the filters and the urls-to-download in separate
files, which the httrack command reads. A lot of the httrack options I've used
are generic options that I use most times anyways..  

I've limited the download to depth of 3 (-r3), so httrack downloads the
/Gita/*verse* htmls too. And each html page seems to link to a /Gita/*.png
file. On testing, html and pngs are downloading okay. If you increase the
depth to 4, then you may discover more png files, but take a much longer time.  


Hope it works for you.
----------------------------------

# Create a file "filters-list" 
cat >"filters-list" <<EOF
-*
+*bhagavad-gita.org/Gita/*.html
+*bhagavad-gita.org/Gita/*verse*.png
EOF

# Create a file "urls-list" 
cat >urls-list <<EOF
<http://www.bhagavad-gita.org/Gita/chapter-01.html>
<http://www.bhagavad-gita.org/Gita/chapter-02.html>
<http://www.bhagavad-gita.org/Gita/chapter-03.html>
<http://www.bhagavad-gita.org/Gita/chapter-04.html>
<http://www.bhagavad-gita.org/Gita/chapter-05.html>
<http://www.bhagavad-gita.org/Gita/chapter-06.html>
<http://www.bhagavad-gita.org/Gita/chapter-07.html>
<http://www.bhagavad-gita.org/Gita/chapter-08.html>
<http://www.bhagavad-gita.org/Gita/chapter-09.html>
<http://www.bhagavad-gita.org/Gita/chapter-10.html>
<http://www.bhagavad-gita.org/Gita/chapter-11.html>
<http://www.bhagavad-gita.org/Gita/chapter-12.html>
<http://www.bhagavad-gita.org/Gita/chapter-13.html>
<http://www.bhagavad-gita.org/Gita/chapter-14.html>
<http://www.bhagavad-gita.org/Gita/chapter-15.html>
<http://www.bhagavad-gita.org/Gita/chapter-16.html>
<http://www.bhagavad-gita.org/Gita/chapter-17.html>
<http://www.bhagavad-gita.org/Gita/chapter-18.html>
<http://www.bhagavad-gita.org/Gita/chapter-19.html>
EOF

httrack -w -r3 -o0 -s2 -%v -C -I -z -Z -%k   \
-%B -H1 -B -%P "-#f" -u2 -%u -%l "en, en, *"   \
 -T20 -R2 -%S "filters-list" -O ./. -%L urls-list
 
Reply Create subthread


All articles

Subject Author Date
Used filters but files in subfolder don't download

01/03/2013 06:44
Re: Used filters but files in subfolder don't download

01/03/2013 15:06
Re: Used filters but files in subfolder don't download

01/03/2013 15:43
Re: Used filters but files in subfolder don't download

01/06/2013 00:33
Re: Used filters but files in subfolder don't download

01/06/2013 08:10
Re: Used filters but files in subfolder don't download

01/06/2013 17:26




a

Created with FORUM 2.0.11