|
How to load cookies.txt prior to spider? When I run the following command
cookies.txt get's overwritten by htt:
httrack <http://www.google.com/> -O "/tmp/www.google.com"
Background info:
I generated a cookies.txt file using firefox add-in
<https://addons.mozilla.org/en-US/firefox/addon/export-cookies/>
Steps to reproduce:
1) clear all cookies from Firefox
2) navigate to www.google.com
3) select Tools->Export Cookies and save c:\winhttrack\cookies.txt
4) from command prompt open c:\winhttrack
5) type this in to the command prompt:
D:\WinHTTrack>httrack <http://www.google.com/> -O "/tmp/www.google.com"
Expected: www.google.com is opened using the existing cookies in cookies.txt
Actual: cookies.txt is overwritten and new cookies appear
Note: i generalized this example for www.google.com. I'm actually trying to
process another site, and within the cookies.txt is login cookie for auth.
Because the auth is overwritten the spider fails on auth protected urls.
Thanks | |