| > It did a good job. The only thing is I wish it
> didn't send the "non-english" pages. That's a waste
> of bandwidth and pages. I only wanted english!
> Perhaps, if there was a way in the future for it to
> detect languages in pages, it could forfit
> downloading them, or somehow otherwise.
Every site is different, you have to filter what you don't want.
All nonEngligh links look like:
<http://wiki.archlinux.org/index.php/> Beginners%27_Guide_(Espa%C3%B1ol)
Add filter -*_(* | |