| Hi all,
I need to scrape come content, and put the files in a
defined file structure and I'm lost!
This is what I need:
#1. I want to store the site in "/home/kam/content/" and I
use the "-O" option to do that.
#2. I am only copying 1 level files (only the initial file
and what the links point to, not what the links within the
links point to)
#3. I want all the HTML files to be on the "root" level
(/home/kam/content)
#4. I want all image files in the "images" dir
(/home/kam/content/images)
#5. I want all other files (non images, non html) in
the "documents" dir (/home/kam/content/documents)
Any thoughts?!
Thanks!
~Kam (^8* | |