HTTrack Website Copier
Free software offline browser - FORUM
Subject: A site difficult to grab
Author: Barbara
Date: 01/07/2005 12:29
 
Hi!

I need to mirror a lot of web pages whose URLs look like this:

http://goofy/mov/Mickey Mouse/2/001/MTAwNTg0OjU6MTI

to get the *.mpg files their links point to:

http://goofy/galleries/Mickey Mouse/001/afile.mpg
http://goofy/galleries/Mickey Mouse/001/anotherone.mpg
...
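
Since these URLs contain a space, their percent-encoded form may be worth trying as well. A minimal sketch in Python of what that form looks like, using the placeholder URL from above (the helper name encode_url is made up for illustration):

```python
from urllib.parse import quote

def encode_url(url):
    # Keep ':' and '/' literal so the scheme and the path separators
    # survive; every other unsafe character (here, the space) is escaped.
    return quote(url, safe=":/")

print(encode_url("http://goofy/mov/Mickey Mouse/2/001/MTAwNTg0OjU6MTI"))
```

The space becomes %20 and the rest of the URL is left unchanged.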

Running Windows 98 SE, I tried all of the following in vain:

HTTrack.exe %1 -O %2 %e5 -r5
HTTrack.exe %1 -O %2 %e5 -r5 "+*.png +*.gif +*.jpg +*.m* +*.avi"
HTTrack.exe %1 -O %2 %e5 -r5 "+*.png +*.gif +*.jpg +*.m* +*.avi" -e -b
HTTrack.exe %1 -O %2 %e5 -r5 "+*.png +*.gif +*.jpg +*.m* +*.avi" -e -b --robots=0
HTTrack.exe %1 -O %2 %e5 -r5 "+*.png +*.gif +*.jpg +*.m* +*.avi" -e -b --robots=0 --mirror
HTTrack.exe %1 -O %2 %e10 -r10 "+*.png +*.gif +*.jpg +*.m* +*.avi" -e -b --robots=0 --mirror --mirrorlinks -n
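
One thing that may matter in the commands above is that all the filters sit inside a single pair of quotes. A rough sketch of how such a line is split into arguments, using Python's shlex with POSIX-style quoting rules as a stand-in (this is an assumption: Windows 98 parses command lines somewhat differently, and URL and OUTDIR are placeholders for %1 and %2):

```python
import shlex

def split_args(command_line):
    # Tokenize the way a POSIX shell would: text inside double quotes
    # stays together as one argument, spaces included.
    return shlex.split(command_line, posix=True)

# Filters grouped in one pair of quotes, as in the commands above.
grouped = split_args(
    'HTTrack.exe URL -O OUTDIR %e5 -r5 "+*.png +*.gif +*.jpg +*.m* +*.avi"'
)

# Each filter quoted on its own.
separate = split_args(
    'HTTrack.exe URL -O OUTDIR %e5 -r5 "+*.png" "+*.gif" "+*.jpg" "+*.m*" "+*.avi"'
)

print(len(grouped), grouped[-1])
print(len(separate), separate[-5:])
```

In the first form the five filters arrive as one argument containing spaces; in the second they arrive as five separate arguments. Whether HTTrack treats the two forms differently is not something this sketch shows.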

The main page was always downloaded, but no *.mpg file was received.

Could you please give me some hints on how to get this task
done correctly?
Best regards

P. S.

I was successful in grabbing the *.mpg files from a single web
page using WinHTTrack, and in hts-log.txt I could read:

HTTrack3.32-2+swf launched on Fri, 07 Jan 2005 11:41:52 at
http://goofy/mov/Mickey Mouse/2/001/MTAwNTg0OjU6MTI +*.css
+*.js -ad.doubleclick.net/* +*.gif +*.jpg +*.png +*.tif
+*.bmp +*.zip +*.tar +*.tgz +*.gz +*.rar +*.z +*.exe +*.mov
+*.mpg +*.mpeg +*.avi +*.asf +*.mp3 +*.mp2 +*.rm +*.wav
+*.vob +*.qt +*.vid +*.ac3 +*.wma +*.wmv
(winhttrack -qwr3%e3C2%Ps2u1%H%sN0%I0p3BeH0%kf2A25000%f#f -F
"Mozilla/4.5 (compatible; HTTrack 3.0x; Windows 98)" -%F
"<!-- Mirrored from %s%s by HTTrack Website Copier/3.x
[XR&CO'2004], %s -->" -%l "it, en, *"
http://goofy/mov/Mickey Mouse/2/001/MTAwNTg0OjU6MTI -O
d:\z3\z,d:\z3\z +*.css +*.js -ad.doubleclick.net/* +*.gif
+*.jpg +*.png +*.tif +*.bmp +*.zip +*.tar +*.tgz +*.gz
+*.rar +*.z +*.exe +*.mov +*.mpg +*.mpeg +*.avi +*.asf
+*.mp3 +*.mp2 +*.rm +*.wav +*.vob +*.qt +*.vid +*.ac3 +*.wma
+*.wmv -%A php3,php,php2,asp,jsp,pl,cfm,nsf=text/html )

Information, Warnings and Errors reported for this mirror:
note: the hts-log.txt file, and hts-cache folder, may
contain sensitive information,
such as username/password authentication for websites
mirrored in this project
do not share these files/folders if you want these
information to remain private

11:42:45	Warning: 	File has moved from goofy/robots.txt to
http://www.Mickey Mouse/tr/index.php
11:43:03	Warning: 	File has moved from ww2.Mickey
Mouse/track/MTAwNTg0OjU6MTI/ to http://www.Mickey
Mouse/tr/index.php/?nats=MTAwNTg0OjU6MTI
11:47:46	Warning: 	File has moved from www.Mickey
Mouse/robots.txt to http://www.Mickey Mouse/tr/tour1.php

HTTrack Website Copier/3.32-2 mirror complete in 5 minutes
54 seconds : 24 links scanned, 20 files written (6709794
bytes overall) [6712223 bytes received at 18961 bytes/sec],
7716 bytes transfered using HTTP compression in 3 files,
ratio 30%, 1.9 requests per connection
(No errors, 3 warnings, 0 messages)
 