| I'm trying to copy the current issue of USNews online but
I am having a bit of trouble. This is the link to the
current issue:
<http://www.usnews.com/usnews/issue/040216/home.htm>
First, when it retrieves home.htm, it makes it into a
folder for some reason, and leaves a home.htm.txt file
with HTM in it.
Second, it gets too many pages - I think it's copying all
these other unnecessary links. It is going 3 levels deep
(current page > news item > [if news item is multiple htm
pages long]) and I think it is going outside the issue. Is
there a way to keep it confined? Any tips?
Thank you. | |