HTTrack Website Copier
Free software offline browser - FORUM
Subject: Try this!
Author: Eckart
Date: 10/08/2004 03:04
 
Hi Fabio,

It really helps that you mentioned the site you want to mirror:
www.fotolog.net/drackon

It's definitely _not_ HTTrack's fault here!

You have to dig a little deeper into things since this site
is rather complex to mirror -- much more complex than most
other sites.

Take a look at the URL of the sunset photo on the main page:
<http://sp1.fotologs.net/?u=drackon&i=2004/10/07/1097184325.jpg&c=f>

Not only is it outside the /drackon directory, it is also on a
different server AND HTTrack cannot recognize it as a JPG image
(the ".jpg" is buried in the query string)!
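
To see what is odd about that URL, here is a small Python sketch that
takes it apart. It is only an illustration of why a guess based on the
URL path finds no ".jpg"; it is NOT HTTrack's actual detection logic,
and the helper name path_extension is made up for this example.

```python
from urllib.parse import urlsplit, parse_qs
import posixpath


def path_extension(url):
    """Return the file extension of the URL path only (query string ignored)."""
    return posixpath.splitext(urlsplit(url).path)[1]


image_url = "http://sp1.fotologs.net/?u=drackon&i=2004/10/07/1097184325.jpg&c=f"
parts = urlsplit(image_url)
query = parse_qs(parts.query)

print("host:     ", parts.hostname)                # a different server than www.fotolog.net
print("path:     ", parts.path)                    # just "/", no /drackon directory
print("extension:", repr(path_extension(image_url)))  # empty, the path has no ".jpg"
print("i param:  ", query["i"][0])                 # the real image name hides here
```

The only place "drackon" and ".jpg" show up is in the query string,
which is why the scan rules have to match on the whole URL.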

Your "All photos" URL is:
<http://www.fotolog.net/all_photos.html?user=drackon>

The good news is that "drackon" seems to appear in all relevant URLs.

So you should try something like: 

-----------------------------------
URL: <http://www.fotolog.net/drackon>

Scan Rules:
-*
+*drackon*
-----------------------------------
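
If you want to check on paper which URLs those two rules let through,
the following Python sketch imitates them. Assumptions: as far as I
understand HTTrack's filter documentation, a rule declared later takes
priority over earlier ones; Python's fnmatch is used here as a stand-in
for HTTrack's own wildcard matching (which has more features); and
is_allowed is a made-up helper, not part of HTTrack.

```python
from fnmatch import fnmatchcase


def is_allowed(url, rules):
    """Apply +/- wildcard rules in order; the last matching rule decides."""
    allowed = False  # assumption for this sketch: nothing is allowed by default
    for rule in rules:
        sign, pattern = rule[0], rule[1:]
        if fnmatchcase(url, pattern):
            allowed = (sign == "+")
    return allowed


rules = ["-*", "+*drackon*"]

candidates = [
    "www.fotolog.net/drackon",
    "www.fotolog.net/all_photos.html?user=drackon",
    "sp1.fotologs.net/?u=drackon&i=2004/10/07/1097184325.jpg&c=f",
    "www.fotolog.net/someone_else",  # hypothetical URL of another user
]

for url in candidates:
    print("keep" if is_allowed(url, rules) else "skip", url)
```

The order matters: with the rules swapped, "-*" would come last and
throw everything out again.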

It's not that hard, is it?! ;)

BUT due to the bad design of the image URLs, e.g.
<http://sp1.fotologs.net/?u=drackon&i=2004/10/07/1097184325.jpg&c=f>,
HTTrack doesn't recognize the images as such :( and saves
them as HTML files. To be able to view them after download,
you may use the Windows Explorer's (NOT Internet Explorer's)
search function and search for ALL FILES in this HTTrack
project that have the string "JFIF" in them. Then use a tool
of your choice (e.g. command prompt or Total Commander) to
rename the resulting files to .jpg.
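
If you would rather script the search-and-rename step, here is a
minimal Python sketch. Assumptions: it is a bit stricter than the
Explorer search and only accepts files that start with the JPEG
marker bytes and have "JFIF" within the first 32 bytes; the function
names and the dry_run switch are made up for this example. Try it on
a copy of your project first.

```python
from pathlib import Path


def looks_like_jfif(path, probe=32):
    """True if the file starts with the JPEG marker and has 'JFIF' near the start."""
    with open(path, "rb") as fh:
        head = fh.read(probe)
    return head.startswith(b"\xff\xd8") and b"JFIF" in head


def rename_jfif_files(project_dir, dry_run=True):
    """Rename mislabelled JPEGs under project_dir to *.jpg.

    Returns a list of (old_path, new_path) pairs. With dry_run=True
    nothing is renamed; the pairs only show what would happen.
    """
    renamed = []
    # sorted() collects all paths first, so renaming does not disturb the walk
    for path in sorted(Path(project_dir).rglob("*")):
        if not path.is_file() or path.suffix.lower() in (".jpg", ".jpeg"):
            continue
        if looks_like_jfif(path):
            target = path.with_suffix(".jpg")
            if target.exists():
                continue  # never overwrite an existing file
            if not dry_run:
                path.rename(target)
            renamed.append((path, target))
    return renamed
```

Call rename_jfif_files() with your project folder and dry_run=True
first to see the list, then with dry_run=False to actually rename.
Note that links inside the mirrored pages still point to the old
file names afterwards.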

You could set the option "Build"/"local structure type" to
"HTML in web/html, images/other in web/images" in order to
avoid having all their different servers (see below) listed.
Unfortunately (due to what I explained above), images will
be among the HTML files in the html folder. :(

Otherwise, due to the site's structure, you will have many of
fotolog's servers in your project as folders:

ff.fotolog.net
my.fotolog.net
sp0.fotologs.net
sp1.fotologs.net
sp2.fotologs.net
sp3.fotologs.net
sp4.fotologs.net
sp6.fotologs.net
sp7.fotologs.net
sp8.fotologs.net
sp9.fotologs.net
spa.fotologs.net
spb.fotologs.net
spc.fotologs.net
spe.fotologs.net
www.fotolog.net

HTH! -- Sorry, but _that_ site is not the easiest to start
with... :/

Greetings,

Eckart
 