HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Downloading only main domain web site
Author: JAA149
Date: 11/11/2009 13:11
 
Dear William,

OK, maybe I have not been able to make you understand, which is not helped by
the fact that I am new to all this. Let me try again.

1 - When I try to download this site
    a - <http://xhtml.com>

2 - I get the following in the error log

HTTrack3.43-7+htsswf+htsjava launched on Wed, 11 Nov 2009 15:40:49 at
<http://xhtml.com/> +*.png +*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/*
-mime:application/foobar
(winhttrack -qwC2%Ps2u1%s%uN0%I0p3DaK0c8H0%kf2%c10%f#f -F "Mozilla/4.5
(compatible; HTTrack 3.0x; Windows 98)" -%F "<!-- Mirrored from %s%s by
HTTrack Website Copier/3.x [XR&CO'2008], %s -->" -%l "en, en, *"
<http://xhtml.com/> -O1 "C:\Documents and Settings\Administrator\My
Documents\Downloads\My Web Sites\XHTML" +*.png +*.gif +*.jpg +*.css +*.js
-ad.doubleclick.net/* -mime:application/foobar )
Information, Warnings and Errors reported for this mirror:
note: the hts-log.txt file, and hts-cache folder, may contain sensitive
information,
 such as username/password authentication for websites mirrored in this
project
 do not share these files/folders if you want these information to remain
private
15:40:49 Warning:  * security warning: maximum number of simultaneous
connections limited to 4 to avoid server overload
15:40:49 Warning:  * security warning: maximum number of connections per
second limited to 5.000000 to avoid server overload
15:40:51 Warning:  File has moved from xhtml.com/ to /en/xhtml/reference
15:41:22 Warning:  File has moved from xhtml.com/en/ to /en/xhtml/reference
15:49:26 Error:  "Not Found" (404) at link xhtml.com/images/slashdot.gif (from
xhtml.com/screen.css?27)
15:49:26 Error:  "Not Found" (404) at link xhtml.com/images/required.gif (from
xhtml.com/screen.css?27)
HTTrack Website Copier/3.43-7 mirror complete in 19 minutes 22 seconds : 436
links scanned, 431 files written (4841574 bytes overall) [4950930 bytes
received at 4260 bytes/sec]
(2 errors, 4 warnings, 0 messages)

3 - I get the whole site perfectly with the following directory structure.

<http://i38.tinypic.com/30at89w.jpg>

4 - Now when I try to download
    a - <http://www.htmldog.com/>

5 - I get the following in the error log

HTTrack3.43-7+htsswf+htsjava launched on Wed, 11 Nov 2009 16:20:25 at
<http://www.htmldog.com/> +*.png +*.gif +*.jpg +*.css +*.js
-ad.doubleclick.net/* -mime:application/foobar
(winhttrack -qwC2%Ps2u1%s%uN0%I0p3DaK0c8H0%kf2%f#f -F "Mozilla/4.5
(compatible; HTTrack 3.0x; Windows 98)" -%F "<!-- Mirrored from %s%s by
HTTrack Website Copier/3.x [XR&CO'2008], %s -->" -%l "en, en, *"
<http://www.htmldog.com/> -O1 "C:\Documents and Settings\Administrator\My
Documents\Downloads\My Web Sites\HTMLDOG" +*.png +*.gif +*.jpg +*.css +*.js
-ad.doubleclick.net/* -mime:application/foobar )
Information, Warnings and Errors reported for this mirror:
note: the hts-log.txt file, and hts-cache folder, may contain sensitive
information,
 such as username/password authentication for websites mirrored in this
project
 do not share these files/folders if you want these information to remain
private
16:20:25 Warning:  * security warning: maximum number of simultaneous
connections limited to 4 to avoid server overload
16:38:19 Warning:  File not parsed, looks like binary:
www.csszengarden.com/robots.txt
16:45:30 Error:  "Not Found" (404) at link
www.htmldog.com/articles/suckerfish/dropdowns/example/) (from
www.htmldog.com/ptg/archives/000051.php)
16:45:30 Error:  "Not Found" (404) at link
www.htmldog.com/ptg/archives/www.egeekcentral.com (from
www.htmldog.com/ptg/archives/000051.php)
16:45:30 Error:  "Not Found" (404) at link www.csszengarden.com/sample.css
(from www.csszengarden.com/zengarden-sample.html)
16:45:49 Error:  "Not Found" (404) at link www.csszengarden.com/blossoms.jpg
(from www.csszengarden.com/zengarden-sample.css)
16:45:49 Error:  "Not Found" (404) at link www.csszengarden.com/zen-bg.jpg
(from www.csszengarden.com/zengarden-sample.css)
16:45:49 Error:  "Not Found" (404) at link www.csszengarden.com/h1.gif (from
www.csszengarden.com/zengarden-sample.css)
16:45:49 Error:  "Not Found" (404) at link www.csszengarden.com/h2.gif (from
www.csszengarden.com/zengarden-sample.css)
16:45:49 Error:  "Not Found" (404) at link www.csszengarden.com/paper-bg.jpg
(from www.csszengarden.com/zengarden-sample.css)
16:45:49 Error:  "Not Found" (404) at link www.csszengarden.com/h3.gif (from
www.csszengarden.com/zengarden-sample.css)
16:45:49 Error:  "Not Found" (404) at link www.csszengarden.com/h4.gif (from
www.csszengarden.com/zengarden-sample.css)
16:45:49 Error:  "Not Found" (404) at link www.csszengarden.com/h5.gif (from
www.csszengarden.com/zengarden-sample.css)
16:45:49 Error:  "Not Found" (404) at link www.csszengarden.com/h6.gif (from
www.csszengarden.com/zengarden-sample.css)
16:45:49 Error:  "Not Found" (404) at link www.csszengarden.com/cr1.gif (from
www.csszengarden.com/zengarden-sample.css)
16:45:49 Error:  "Not Found" (404) at link www.csszengarden.com/cr2.gif (from
www.csszengarden.com/zengarden-sample.css)
16:46:18 Error:  "Not Found" (404) at link
www.31tigersqn.be/nieuw/images/arrow.gif (from
www.htmldog.com/ptg/archives/000050.php)
16:46:18 Error:  "Not Found" (404) at link www.31tigersqn.be/nieuw/default.css
(from www.htmldog.com/ptg/archives/000050.php)
16:46:18 Error:  "Not Found" (404) at link www.ahrens.cx/styles.css (from
www.htmldog.com/ptg/archives/000050.php)
16:46:18 Error:  "Not Found" (404) at link
www.ideabankmarketing.com/org/styles/org_styles2.css (from
www.htmldog.com/ptg/archives/000050.php)
16:46:18 Error:  "Object Not Found" (404) at link
www.cbonline.org.au/ejournal/sos.css (from
www.htmldog.com/ptg/archives/000050.php)
16:46:18 Error:  "Not Found" (404) at link
www.eternalsphere.net/images/badmenuss.jpg (from
www.htmldog.com/ptg/archives/000050.php)
16:46:18 Error:  "Not Found" (404) at link
www.eternalsphere.net/images/ffmenuss.jpg (from
www.htmldog.com/ptg/archives/000050.php)
16:46:18 Error:  "Not Found" (404) at link
www.djaztek.com/Roehampton/styles/reviewstyle.css (from
www.htmldog.com/ptg/archives/000050.php)
16:46:18 Error:  "Not Found" (404) at link
img365.imageshack.us/img365/5071/inff0kj.jpg (from
www.htmldog.com/ptg/archives/000050.php)
16:46:18 Error:  "Not Found" (404) at link gracebiblesouderton.org/style.css
(from www.htmldog.com/ptg/archives/000050.php)
16:46:18 Error:  "Not Found" (404) at link
www.htmldog.com/www.design.toddhiestand.com (from
www.htmldog.com/ptg/archives/000050.php)
16:49:20 Error:  "Not Found" (404) at link marcustucker.com/temp/p800cap1.jpg
(from www.htmldog.com/ptg/archives/000055.php)
16:49:20 Error:  "Not Found" (404) at link marcustucker.com/temp/p800cap2.jpg
(from www.htmldog.com/ptg/archives/000055.php)
16:49:20 Error:  "Not Found" (404) at link
www.adampage.net/htmldog/htmldog_test_mpx220_sp2003_ie_01.jpg (from
www.htmldog.com/ptg/archives/000055.php)
16:49:20 Error:  "Not Found" (404) at link
www.adampage.net/htmldog/htmldog_test_mpx220_sp2003_ie_02.jpg (from
www.htmldog.com/ptg/archives/000055.php)
16:49:20 Error:  "Not Found" (404) at link
www.adampage.net/htmldog/htmldog_test_mpx220_sp2003_opera_706_01.jpg (from
www.htmldog.com/ptg/archives/000055.php)
16:49:20 Error:  "Not Found" (404) at link
www.adampage.net/htmldog/htmldog_test_mpx220_sp2003_opera_706_02.jpg (from
www.htmldog.com/ptg/archives/000055.php)
16:49:20 Error:  "Not Found" (404) at link
www.adampage.net/htmldog/htmldog_test_mpx220_sp2003_opera_706_03.jpg (from
www.htmldog.com/ptg/archives/000055.php)
16:53:53 Error:  "Not Found" (404) at link www.baekdal.com/x/email.gif (from
www.htmldog.com/ptg/archives/000073.php)
16:54:51 Error:  "Unable to get server's address: Unknown error" (-5) after 2
retries at link www.barkhamcreative.com/trouble/styles/style.css (from
www.htmldog.com/ptg/archives/000050.php)
16:55:13 Error:  "Connect Error" (-4) after 2 retries at link
jdesigns.homeftp.org:8080/rutgersalumni/html/includes/main.css (from
www.htmldog.com/ptg/archives/000050.php)
HTTrack Website Copier/3.43-7 mirror complete in 35 minutes 26 seconds : 912
links scanned, 802 files written (8355840 bytes overall) [8737687 bytes
received at 4109 bytes/sec], 165094 bytes transfered using HTTP compression in
19 files, ratio 34%, 1.9 requests per connection
(35 errors, 2 warnings, 0 messages)

6 - I get the following

<http://i34.tinypic.com/29omxw8.jpg>

You can see that when I download
   a - <http://xhtml.com>
   I get only these 3 folders:
   1 - hts-cache
   2 - www.w3c.org
   3 - www.htmldog.com

But when I download
   a - <http://www.htmldog.com/>
   I get 25 folders.

It has downloaded files from other sites, such as www.csszengarden.com.

Is this normal? I only want www.htmldog.com.
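
In case it helps to see which other sites are involved, here is a rough Python sketch that tallies the hosts named in the Error lines of an hts-log.txt like the one above. It is not part of HTTrack; the function name errors_by_host and the short sample text are only assumptions for illustration, and the pattern is based purely on the log lines pasted in this post.

```python
import re
from collections import Counter

# Matches one error entry and captures the link that follows "at link".
# \s+ is used between words because the pasted log wraps long entries,
# so the link (or "retries") may start on the next line.
ERROR_RE = re.compile(
    r'Error:\s+".*?"\s+\(-?\d+\)(?:\s+after\s+\d+\s+retries)?\s+at\s+link\s+(\S+)'
)

def errors_by_host(log_text):
    """Count error entries per host (the part of the link before the first '/')."""
    counts = Counter()
    for match in ERROR_RE.finditer(log_text):
        host = match.group(1).split("/", 1)[0]
        counts[host] += 1
    return counts

# A few entries copied from the second log above (illustrative sample only).
SAMPLE = '''16:45:30 Error:  "Not Found" (404) at link www.csszengarden.com/sample.css
(from www.csszengarden.com/zengarden-sample.html)
16:45:49 Error:  "Not Found" (404) at link www.csszengarden.com/h1.gif (from
www.csszengarden.com/zengarden-sample.css)
16:46:18 Error:  "Not Found" (404) at link
www.31tigersqn.be/nieuw/images/arrow.gif (from
www.htmldog.com/ptg/archives/000050.php)
16:54:51 Error:  "Unable to get server's address: Unknown error" (-5) after 2
retries at link www.barkhamcreative.com/trouble/styles/style.css (from
www.htmldog.com/ptg/archives/000050.php)
'''

if __name__ == "__main__":
    target = "www.htmldog.com"  # the only site I actually wanted
    for host, count in errors_by_host(SAMPLE).most_common():
        label = "wanted" if host == target else "external"
        print(f"{count:3d}  {host}  ({label})")
```

Run against the full log, this only shows the hosts that produced errors, not every host that was fetched, so it is a starting point rather than a complete list of the extra folders.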

Thank you again for taking the time to help and for your continued
patience.

Regards,

JJ


 