HTTrack Website Copier
Free software offline browser - FORUM
Subject: Option to save plain HTML in script-generated page
Author: XICO2KX
Date: 11/06/2010 23:32
 
Hello!
I was using your really nice HTTrack tool on a website, but I stumbled upon
some problems here:
<http://dt.dgidc.min-edu.pt/>
For example, it doesn't recognize the Javascript-generated links like:
<http://dt.dgidc.min-edu.pt/index.php?id=n1>

Anyway, like this one, there are some other websites where their <body>
contents are entirely Javascript-generated...
It would be really nice to have an option to, instead of saving the original
page's HTML code (which contains only the script), save the *dynamically
generated DOM hierarchy*'s HTML code!

Something similar to doing this:
write( "<html>" , document.getElementsByTagName("html")[0].innerHTML ,
"</html>" );

I think it would be really useful for example when the page uses Ajax requests
to get the webpage's content from a (not-offline-accessible) server-side
database: the target content that we are interested on would be saved and not
the offline-useless script to get it.

What do you think?Thank you very much, and keep up the great work!
 
Reply


All articles

Subject Author Date
Option to save plain HTML in script-generated page

11/06/2010 23:32




d

Created with FORUM 2.0.11