| I used the following statement to crawl the webpage. I need only html and
images used in the initial page. But the below statement crawl, all the
links used in the web page. Please advice me on this
"httrack <http://en.wikipedia.org/wiki/Ringtone> -O
'/home/test/httrack-3.43.1/data/websec-1.9.0/1/8/hts-cache' -q -Q -N
20081217100027/%n.%t -o0 -X0 -T30 -R1 -I0 -%F "" -F "Mozilla/5.0
Firefox/3.0.3" -%h/* -* +*.jpg +*.jpeg +*.css +*.js +*.gif +*.bmp +*.tif*
+*.png +*.swf -*.exe -*.pdf -*.doc -*.zip" | |