|
> Well, I think - er - but the file should have been
> parsed. But If I remember, the "alternate"
file was
> not really helpful
The links in the alternate files should be converted,
so the links points to the local file. When you mirror
this html file and parse it why isn't it "converted".
--- snip ---
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2 Final//EN">
<html>
<!-- Mirrored from www.vfxhq.com by HTTrack Website
Copier/3.x [XR,YP'2001] -->
<head>
<title>Menu for /maps/id4.map</title>
</head><body>
<h1>Menu for /maps/id4.map</h1>
<hr>
<pre> <a
href="http://vfxhq.com/1996/id4-e.html">/1996/id4-e.html</a></pre>
<pre> <a
href="http://vfxhq.com/1996/id4-c.html">/1996/id4-c.html</a></pre>
<pre> <a
href="http://vfxhq.com/1996/id4-d.html">/1996/id4-d.html</a></pre>
<pre> <a
href="http://vfxhq.com/1996/id4-a.html">/1996/id4-a.html</a></pre>
<pre> <a
href="http://vfxhq.com/1996/id4-b.html">/1996/id4-b.html</a></pre>
<pre>(Default) <a
href="http://vfxhq.com/1996/id4.html">/1996/id4.html</a></pre>
</body>
<!-- Mirrored from www.vfxhq.com by HTTrack Website
Copier/3.x [XR,YP'2001] -->
</html>
--- snip -----
This is what httrack saves on the disk. I think it
would be helpful it this links where pointing to the
local files (the above links exist and where mirrored).
Maybe you have a "clever" solution for this. :) Somehow
the spider ignores the above file.
read you,
MrDom
| |