| > Can HTTrack be used to report stats like how many,
> images, embedded be files, html pages there are on a
> site without downloading it?
No. (No tool could do that without downloading, really...)
> As well is it possible to pull the content of site
> by adding a start and end tag of html to pull the
> content so it can be outputted in xml. Something
> like this:
>
> <ThisContent>
> <title>Title Here</title>
> <URL>http://www.url.com</URL>
> <body>content is here</body>
> <content_type>Content</content_type>
> <meta_keywords>keywords here</meta_keywords>
> <meta_description>description
> here</meta_description>
> </ThisContent>
Nope. You'd have to do crazy processing after a mirror is completed.
| |