| Suppose I have a certain pages, e.g. www.abc.def/ghi/a.html, how can I download
all the pages under www.abc.def./ghi (i.e. www.abc.def/ghi/a.html,
www.abc.def/ghi/b, ...)?
Right now, I'm using the simple httrack <http://www.abc.def/ghi/> -O
<output-folder> command, but it fails with some sites.
E.g., when I try it with <http://www.getsidekick.com/blog/> , it doesn't
download anything. However, if I'm using <http://www.getsidekick.com/blog> (note
the missing slash), it downloads not only the content under
<http://www.getsidekick.com/blog/>, but even content inside
<http://www.getsidekick.com/> .
So, there is a standard way to accomplish this goal?Thanks in advace. | |