> > > > Please consider trying to mirror this site:
> > > >
> > > > <http://www.gisa.es>
> > > >
> > > > the index page has the following content:
> > > >
> > > > <html>
> > > > <head>
> > > > <meta content="0;url=/gisa/servlet/Home?company=1" http-equiv="refresh">
> > > > </head>
> > > >
> > > >
> > > > Why doesn't httrack follow this?
> > > > Any hints on this will be very much appreciated.
> > > >
> > > >
> > >
> > > I see that the problem is that "http-equiv" is
> > > located at the end of the META tag.
> > >
> > > I'm not able to fix this bug; htsparse.c is a
> > > total mess :)
> > >
> > > Please, Xavier, what can I do?
> > >
> >
> > Finally, I've made an ugly hack to make this thing
> > work.
> > If anybody is interested, please contact me.
> >
> >
>
>
> Ok, but remember that I said that it looked like
> a horrible hack. I'm positive there are better ways
> of fixing this problem... but what I did seems to
> work for me. Here it is:
>
> --- htsparse.c.orig 2005-11-23 11:14:17.000000000 +0100
> +++ htsparse.c 2005-11-25 18:03:27.935975568 +0100
> @@ -1057,6 +1057,8 @@
> // <META HTTP-EQUIV="Refresh" CONTENT="3;URL=http://www.example.com
> else if (intag_ctype == 2) {
> p_searchMETAURL=1;
> + } else if ( ((strstr(adr,"refresh") - adr) > 0) && ((strstr(adr,"refresh") - adr) < 128) ) {
> + p_searchMETAURL=1;
> } else {
> p = 0; /* cancel */
> }
> @@ -2852,7 +2854,7 @@
>
> // le lien n'a jamais été créé.
> // cette fois ci, on le crée!
> - if (!dejafait) {
> + if (!dejafait || strstr(save, ".swf") ) {
>
>
> //
> // >>>> CREER LE LIEN <<<<
> //
>
Sorry, the change on the "dejafait" line is not necessary
for making "meta refresh" work; I posted too quickly.
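
For anyone who wants something a little less fragile than the strstr() hack: strstr() returns NULL when "refresh" is not found, so subtracting adr from its result is not well defined, and the 128-byte window can also match text outside the tag. Below is a minimal sketch of an order-independent check, assuming the caller has a pointer to the '<' that opens the tag. The names is_meta_refresh, find_nocase and has_prefix_nocase are hypothetical helpers written for this post, not part of htsparse.c, and the sketch deliberately ignores edge cases such as a '>' inside a quoted attribute value or "http-equiv" appearing inside another attribute's value.

```c
#include <ctype.h>
#include <stddef.h>
#include <string.h>

/* Return 1 if s (of length len) starts with prefix, ignoring ASCII case. */
static int has_prefix_nocase(const char *s, size_t len, const char *prefix) {
    size_t n = strlen(prefix);
    if (n > len)
        return 0;
    for (size_t i = 0; i < n; i++) {
        if (tolower((unsigned char)s[i]) != tolower((unsigned char)prefix[i]))
            return 0;
    }
    return 1;
}

/* Case-insensitive search for needle in the first len bytes of s.
 * Returns a pointer to the first match, or NULL if there is none. */
static const char *find_nocase(const char *s, size_t len, const char *needle) {
    for (size_t i = 0; i < len; i++) {
        if (has_prefix_nocase(s + i, len - i, needle))
            return s + i;
    }
    return NULL;
}

/* Hypothetical helper: return 1 if tag (pointing at '<') is a META tag whose
 * http-equiv attribute starts with "refresh", wherever that attribute sits. */
int is_meta_refresh(const char *tag) {
    const char *end = strchr(tag, '>');   /* simplification: first '>' ends the tag */
    if (end == NULL)
        return 0;
    size_t len = (size_t)(end - tag);

    /* Must be "<meta" followed by whitespace. */
    if (len < 6 || !has_prefix_nocase(tag, len, "<meta") ||
        !isspace((unsigned char)tag[5]))
        return 0;

    /* Locate the attribute name anywhere inside the tag. */
    const char *p = find_nocase(tag, len, "http-equiv");
    if (p == NULL)
        return 0;
    p += strlen("http-equiv");

    /* Skip optional whitespace, '=', optional whitespace, optional quote. */
    while (p < end && isspace((unsigned char)*p)) p++;
    if (p == end || *p != '=')
        return 0;
    p++;
    while (p < end && isspace((unsigned char)*p)) p++;
    if (p < end && (*p == '"' || *p == '\'')) p++;

    return has_prefix_nocase(p, (size_t)(end - p), "refresh");
}
```

The search is bounded by the closing '>' of the tag, so the check gives the same answer whether http-equiv comes before or after content. How this would be wired into the intag_ctype logic in htsparse.c is left open here, since that depends on parser state not shown in the patch.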