| Hello!
I was using your really nice HTTrack tool on a website, but I stumbled upon
some problems here:
<http://dt.dgidc.min-edu.pt/>
For example, it doesn't recognize the Javascript-generated links like:
<http://dt.dgidc.min-edu.pt/index.php?id=n1>
Anyway, like this one, there are some other websites where their <body>
contents are entirely Javascript-generated...
It would be really nice to have an option to, instead of saving the original
page's HTML code (which contains only the script), save the *dynamically
generated DOM hierarchy*'s HTML code!
Something similar to doing this:
write( "<html>" , document.getElementsByTagName("html")[0].innerHTML ,
"</html>" );
I think it would be really useful for example when the page uses Ajax requests
to get the webpage's content from a (not-offline-accessible) server-side
database: the target content that we are interested on would be saved and not
the offline-useless script to get it.
What do you think?Thank you very much, and keep up the great work! | |