| > Hi, I would like to crawl Yahoo portal, so I use
> command:
>
> httrack <http://www.yahoo.com> -O
> "/home/user/HTTRACK/yahoo" "*yahoo.com/*" -s0 -r10
> 16:15:54 Error: "Unable to get server's
> address: No such file or directory" (-5) after 2
> I think this is because redirections.. What should I
> do to crawl _only_ Yahoo web page ? (I shouldn't use
> filter: "*yahoo*" because yahoo word can be in get
You didn't use a filter. You used TWO urls (www.yahoo.com and *yahoo.com/*)
The later is not a valid url.
Try "+*.yahoo.com/*"
| |