| Hi! I've read that xml files aren't parsed by HTTrack, because HTTrack doesn't
have access to DTD.
However, maybe it would be possible to parse certain XML documents with
declared known DTDs (especially XHTML DTDs).
For example, the document I'm trying to get with HTTrack has
xhtml-math11-f.dtd:
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.1 plus MathML 2.0//EN"
"../dtd/xhtml-math11-f.dtd">
The DTD file path is relative, but it is published on the web server.
The document is registered in hts-cache/new.txt with text/xml content type.
The problem is that the images from <img> tags in the document aren't
downloaded at all:
<img alt="alt text" src="somefile.PNG" />
| |