| I'm a linguistics grad student, and I'm finding the
HTTrack very helpful for gathering text samples.
However, I find myself also getting lots of pages I
don't need (a problem since I have limited space to
work with).
What would be really helpful would be to have an
option to exclude files of certain types, but still
crawl through them to get to links beyond. In other
words, crawl through the site as HTTrack does now, but
only copy certain files to disk.
Thanks,
Jay | |