Re: problems with yahoo - HTTrack Website Copier Forum

Subject: Re: problems with yahoo

Author: William Roeder

Date: 12/05/2011 17:22

> Hi, I would like to crawl Yahoo portal, so I use
> command:
> 
> httrack http://www.yahoo.com -O
> "/home/user/HTTRACK/yahoo" "*yahoo.com/*" -s0 -r10

> 16:15:54    Error:     "Unable to get server's
> address: No such file or directory" (-5) after 2

> I think this is because redirections.. What should I
> do to crawl _only_ Yahoo web page ? (I shouldn't use
> filter: "*yahoo*" because yahoo word can be in get
You didn't use a filter. You passed TWO URLs (www.yahoo.com and *yahoo.com/*).
The latter is not a valid URL. A filter has to start with + (include) or - (exclude).
Try "+*.yahoo.com/*"
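
To illustrate the point, here is a rough sketch of how the arguments end up being read. The classify_arg function is hypothetical and only a simplified model of the rule described above (leading + or - versus anything else); it is not HTTrack's actual parser. The command in the closing comment is the one from the question with only the filter changed; it is left as a comment so the snippet does not start a crawl.

```shell
#!/bin/sh
# Simplified model (an assumption, not HTTrack's real argument parser):
# an argument starting with + or - is a filter or an option,
# anything else is taken as a URL to download.
classify_arg() {
  case "$1" in
    +*) echo "include filter" ;;
    -*) echo "exclude filter or option" ;;
    *)  echo "URL" ;;
  esac
}

# The pattern from the question has no leading sign, so it is read as a URL.
classify_arg "*yahoo.com/*"

# With the leading +, the same kind of pattern is read as an include filter.
classify_arg "+*.yahoo.com/*"

# Command from the question with the suggested filter swapped in:
# httrack http://www.yahoo.com -O "/home/user/HTTRACK/yahoo" "+*.yahoo.com/*" -s0 -r10
```

Keep the filter in quotes, otherwise the shell may expand the * before httrack ever sees it.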

