HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: problems with yahoo
Author: William Roeder
Date: 12/05/2011 17:22
 
> Hi, I would like to crawl Yahoo portal, so I use
> command:
> 
> httrack http://www.yahoo.com -O
> "/home/user/HTTRACK/yahoo" "*yahoo.com/*" -s0 -r10

> 16:15:54    Error:     "Unable to get server's
> address: No such file or directory" (-5) after 2

> I think this is because of redirections. What should I
> do to crawl _only_ Yahoo web pages? (I shouldn't use
> the filter "*yahoo*" because the word yahoo can be in get
You didn't use a filter. You passed TWO URLs (www.yahoo.com and *yahoo.com/*).
The latter is not a valid URL. A filter has to start with + (include) or - (exclude).
Try "+*.yahoo.com/*"
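
For reference, here is a rough sketch of what the corrected command could look like. It keeps the URL, output path and options (-s0 -r10) from the original post and changes only the last pattern into an include filter. The variable names (url, out, filter, cmd) are just for illustration, and the snippet only prints the command so it can be checked before running.

```shell
# Values taken from the original post.
url="http://www.yahoo.com"
out="/home/user/HTTRACK/yahoo"

# The leading "+" marks an include filter. Without it the pattern is
# taken as a second URL to fetch, which is what caused the error.
filter="+*.yahoo.com/*"

# Build and print the command for review. The quotes around each
# argument keep the shell from expanding the "*" wildcards.
cmd="httrack \"$url\" -O \"$out\" \"$filter\" -s0 -r10"
echo "$cmd"

# To actually start the crawl once it looks right:
#   eval "$cmd"
```

Whether this pattern is strict enough depends on what else gets linked from the pages; more "-" filters may be needed to keep unwanted hosts out.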
 