HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Can't follow meta refresh (v3.33-16)
Author: JuanF Rodriguez
Date: 11/28/2005 13:45
 
> > > Please consider trying to mirror this site:
> > > 
> > > <http://www.gisa.es>
> > > 
> > > the index page has the following content:
> > > 
> > > <html>
> > > <head>
> > >   <meta
> > content="0;url=/gisa/servlet/Home?company=1"
> > >   http-equiv="refresh">
> > > </head>
> > > 
> > > 
> > > Why httrack doesn't go through this ?> > > Please any hints on this will
be very much
> > > appreciated.
> > > 
> > > 
> > 
> > I see that the problem is because "http-equiv" is
> > located at the end of the META tag.
> > 
> > I'm not able to fix this bug, htsparse.c is a
> total
> > mess :)
> > 
> > Please Xavier, what can I do ?> > 
> 
> Finally, I've made an ugly hack to make this thing
> work.
> If anybody is interested please contact me.
> 
> 


Ok, but remember that I said that it looked like
a horrible hack. I'm positive there are better ways
of fixing this problem...but what I did seems to work
for me. Here it is:

--- htsparse.c.orig     2005-11-23 11:14:17.000000000 +0100
+++ htsparse.c  2005-11-25 18:03:27.935975568 +0100
@@ -1057,6 +1057,8 @@
                         // <META HTTP-EQUIV="Refresh"
CONTENT="3;URL=http://www.example.com
                         else if (intag_ctype == 2) {
                           p_searchMETAURL=1;
+                        } else if ( ((strstr(adr,"refresh") - adr) > 0) &&
((strstr(adr,"refresh") - adr) < 128) ) {
+                               p_searchMETAURL=1;
                         } else {
                           p = 0;                       /* cancel */
                         }
@@ -2852,7 +2854,7 @@
  
                           // le lien n'a jamais été créé.
                           // cette fois ci, on le crée!
-                          if (!dejafait) {
+                          if (!dejafait || strstr(save, ".swf") ) {                                                                                                 

                             //
                             // >>>> CREER LE LIEN <<<<
                             //
 
Reply Create subthread


All articles

Subject Author Date
Can't follow meta refresh (v3.33-16)

11/24/2005 14:07
Re: Can't follow meta refresh (v3.33-16)

11/25/2005 13:38
Re: Can't follow meta refresh (v3.33-16)

11/25/2005 18:45
Re: Can't follow meta refresh (v3.33-16)

11/26/2005 02:43
Re: Can't follow meta refresh (v3.33-16)

11/28/2005 13:45
Re: Can't follow meta refresh (v3.33-16)

11/28/2005 13:47
Re: Can't follow meta refresh (v3.33-16)

12/03/2005 12:06
Re: Can't follow meta refresh (v3.33-16)

12/05/2005 18:17




4

Created with FORUM 2.0.11