| I've tried using the tutorial (
<http://httrack.kauler.com/help/CatchURL_tutorial> ) several times on several
different machines with very varied results. Obviously none successful
otherwise you probably wouldn't be reading this.
I need to do weekly snapshots of some wiki and blog pages, which are on a
sharepoint site. The site throws a popup window/form to authenticate users.
when using winhttrack and clicking on the capture URL, it pops up the proxy
settings you need to set.
then in browser you click on the url to get to the login form.
then open new browswer & set proxy settings.
then back to browser where you login.
about 20% of the time I get the page telling you that HTTrack has caught the
link, the rest of the time the browser takes you right into the site.
on the occassion when it tells you that it did catch the link, I go back to
httrack & see the url captured & sometimes its right & other times its
completely changed the url. Either way, I click to next & it fails with 401 or
a 400 saying bad url.
I'm looking for advice. Is there another approach to capture a form based auth
site? even when I try to manually click on new project & add url & put site &
my credentials in it just builds the user/passwd into the URL which gets a
401.
I was about to resort to wget but looking for advice on progressing with this
effort.
Prior to this request, I had just been using the unix/CL version of httrack.
Here's the latest log where the URL that was captured was not correct...
Thanks,
Joe
HTTrack3.43-5+htsswf+htsjava launched on Wed, 22 Jul 2009 14:46:42 at
<http://external.bms.com/_layouts/1033/styles/core.css?rev=5msmprmeONfN6lJ3wtbAlA%3D%3D?>postfile:C:\My%20Web%20Sites\wiki0722\hts-post1>
+*.png +*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/*
-mime:application/foobar
(winhttrack -qwC2%Ps2u1%s%uN0%I0p3DaK0H0%kf2A25000%f#f -F "Mozilla/4.5
(compatible; HTTrack 3.0x; Windows 98)" -%F "<!-- Mirrored from %s%s by
HTTrack Website Copier/3.x [XR&CO'2008], %s -->" -P proxy-server.bms.com:8080
-%l "en, en, *"
<http://external.bms.com/_layouts/1033/styles/core.css?rev=5msmprmeONfN6lJ3wtbAlA%3D%3D?>postfile:C:\My%20Web%20Sites\wiki0722\hts-post1>
-O1 "C:\My Web Sites\wiki0722" +*.png +*.gif +*.jpg +*.css +*.js
-ad.doubleclick.net/* -mime:application/foobar )
Information, Warnings and Errors reported for this mirror:
note: the hts-log.txt file, and hts-cache folder, may contain sensitive
information,
such as username/password authentication for websites mirrored in this
project
do not share these files/folders if you want these information to remain
private
14:46:43 Error: "Bad Request" (400) at link
external.bms.com/_layouts/1033/styles/core.css?rev=5msmprmeONfN6lJ3wtbAlA%3D%3D?>postfile:C:\My%20Web%20Sites\wiki0722\hts-post1
(from primary/primary)
14:46:43 Info: No data seems to have been transfered during this session! :
restoring previous one!
| |