HTTrack Website Copier
Free software offline browser - FORUM
Subject: Difficulty with edu indexes
Author: RP
Date: 02/18/2022 19:27
 
Is anyone able to mirror these indexes? No matter how I configure the
settings to limit bandwidth and throughput, I can't seem to download the pages
linked from these indexes, even just one level deep. I can access all of them
freely in my browser. In fact, I have a browser extension that gathers all the
links and lets me download them manually. But why can't HTTrack mimic a batch
"save as" for the links on a page? I ignore robots.txt and limit the rate, to
no avail. I'm attempting HTML only. I'm hoping someone can reproduce this and
provide insight.
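
A minimal sketch of the batch "save as" behavior described above, using only
the Python standard library. It is an illustration of the goal, not a
description of how HTTrack or the browser extension works. The names
collect_links, fetch and save_one_level, the User-Agent string, and the
page_NNN.html file naming are all assumptions made for this example.

```python
import os
import urllib.request
from html.parser import HTMLParser
from urllib.parse import urljoin, urldefrag


class LinkCollector(HTMLParser):
    """Collect the href targets of <a> tags as absolute URLs."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value:
                # Resolve relative links and drop any #fragment part.
                absolute, _fragment = urldefrag(urljoin(self.base_url, value))
                is_web = absolute.startswith(("http://", "https://"))
                if is_web and absolute not in self.links:
                    self.links.append(absolute)


def collect_links(html, base_url):
    """Return the unique http(s) links found in html, in page order."""
    parser = LinkCollector(base_url)
    parser.feed(html)
    parser.close()
    return parser.links


def fetch(url):
    """Download one URL and return its raw bytes (needs network access)."""
    request = urllib.request.Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urllib.request.urlopen(request, timeout=30) as response:
        return response.read()


def save_one_level(index_url, out_dir):
    """Save every page linked from index_url, one level deep."""
    os.makedirs(out_dir, exist_ok=True)
    index_html = fetch(index_url).decode("utf-8", errors="replace")
    for number, link in enumerate(collect_links(index_html, index_url)):
        # Hypothetical naming scheme: page_000.html, page_001.html, ...
        path = os.path.join(out_dir, f"page_{number:03d}.html")
        with open(path, "wb") as handle:
            handle.write(fetch(link))
```

Calling save_one_level with one of the index URLs listed below and an output
folder would attempt the download. It makes no attempt at rate limiting, so
a delay between requests would be a sensible addition.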

=================================================

<https://cs.stanford.edu/people/nick/py/>
<https://web.stanford.edu/class/me200c/tutorial_77/>
<https://web.stanford.edu/class/me200c/tutorial_77/index.html>

HTTrack3.48-21+htsswf+htsjava launched on Fri, 18 Feb 2022 13:16:43 at
<https://cs.stanford.edu/people/nick/py/> +*.png +*.gif +*.jpg +*.jpeg +*.css
+*.js -ad.doubleclick.net/* -mime:application/foobar
(winhttrack -qir1C2%Pxs0u1%s%uN0L2%I0p1DaK0H0%kf2A25000%f#f -F "Mozilla/4.78
[en] (Windows NT 5.0; U)" -%F  -%l "en, *"
<https://cs.stanford.edu/people/nick/py/> -O1
"C:\Users\XLK12\Desktop\temp2\https___cs.stanford.edu_people_nick_py_python-interpreter.html"
+*.png +*.gif +*.jpg +*.jpeg +*.css +*.js -ad.doubleclick.net/*
-mime:application/foobar )
Information, Warnings and Errors reported for this mirror:
note: the hts-log.txt file, and hts-cache folder, may contain sensitive
information,
 such as username/password authentication for websites mirrored in this
project
 do not share these files/folders if you want these information to remain
private
HTTrack Website Copier/3.48-21 mirror complete in 1 seconds : 1 links scanned,
1 files written (944 bytes overall), 1 files updated [942 bytes received at
942 bytes/sec], 944 bytes transferred using HTTP compression in 1 files, ratio
59%
(No errors, 0 warnings, 0 messages)

HTTrack3.48-21+htsswf+htsjava launched on Fri, 18 Feb 2022 13:20:59 at
<https://web.stanford.edu/class/me200c/tutorial_77/> +*.png +*.gif +*.jpg
+*.jpeg +*.css +*.js -ad.doubleclick.net/* -mime:application/foobar
(winhttrack -qir1C2%Pxs0u1%s%uN0L2%I0p1DaK0H0%kf2A5000%f#f -F "Mozilla/4.78
[en] (Windows NT 5.0; U)" -%F  -%l "en, *"
<https://web.stanford.edu/class/me200c/tutorial_77/> -O1
"C:\Users\XLK12\Desktop\temp2\https___web.stanford.edu_class_me200c_tutorial_77_"
+*.png +*.gif +*.jpg +*.jpeg +*.css +*.js -ad.doubleclick.net/*
-mime:application/foobar )
Information, Warnings and Errors reported for this mirror:
note: the hts-log.txt file, and hts-cache folder, may contain sensitive
information,
 such as username/password authentication for websites mirrored in this
project
 do not share these files/folders if you want these information to remain
private
HTTrack Website Copier/3.48-21 mirror complete in 1 seconds : 1 links scanned,
1 files written (2093 bytes overall), no files updated [235 bytes received at
235 bytes/sec]
(No errors, 0 warnings, 0 messages)

+index.html
HTTrack3.48-21+htsswf+htsjava launched on Fri, 18 Feb 2022 13:22:34 at
<https://web.stanford.edu/class/me200c/tutorial_77/index.html> +*.png +*.gif
+*.jpg +*.jpeg +*.css +*.js -ad.doubleclick.net/* -mime:application/foobar
(winhttrack -qir1C2%Pxs0u1%s%uN0L2%I0p1DaK0H0%kf2A5000%f#f -F "Mozilla/4.78
[en] (Windows NT 5.0; U)" -%F  -%l "en, *"
<https://web.stanford.edu/class/me200c/tutorial_77/index.html> -O1
"C:\Users\XLK12\Desktop\temp2\https___web.stanford.edu_class_me200c_tutorial_77_"
+*.png +*.gif +*.jpg +*.jpeg +*.css +*.js -ad.doubleclick.net/*
-mime:application/foobar )
Information, Warnings and Errors reported for this mirror:
note: the hts-log.txt file, and hts-cache folder, may contain sensitive
information,
 such as username/password authentication for websites mirrored in this
project
 do not share these files/folders if you want these information to remain
private
HTTrack Website Copier/3.48-21 mirror complete in 1 seconds : 1 links scanned,
1 files written (2093 bytes overall), 1 files updated [2328 bytes received at
2328 bytes/sec]
(No errors, 0 warnings, 0 messages)


 





