| First, I would like to compliment the author of httrack for
his work. It's a great program, one to be proud of.
I use WinHttrack on Windows with few problems. However, I
used Fink to install httrack on my OS X Powerbook. With
default settings, it's able to mirror most sites. However,
on some I can't get it working. Howstuffworks.com would be
a good example. From what I've read on the forum, this site
is problematic due to heavy javascript, and possibly spider
filtering with PHP. But I was able to get it working on
WinHttrack by changing "follow robots.txt" to never and
setting the browser ID to "Mozilla/5.0 (Windows; U; Windows
NT 5.0; en-US; rv:1.1) Gecko/20020826".
I've tried this same method with command line flags on my
Mac, obviously because there's no GUI. But I can't get it
working. Perhaps I'm missing some options that are
different in Windows compared to this version? Or maybe I'm
mis-typing something. Here's the exact command I execute:
httrack <http://howstuffworks.com> -O /tmp/howstuffworks
-s0 -F "Mozilla/5.0 (Windows; U; Windows NT 5.0; en-US;
rv:1.1) Gecko/20020826"
Also, can httrack save a config file on OS X so I don't have
to do this over and over again? If it exists, I haven't
been able to find it yet.
Thanks. | |