HTTrack Website Copier
Free software offline browser - FORUM
Subject: How to download files that the server returns from a link
Author: MIchel
Date: 11/02/2008 15:25
 
Suppose I would like to download all the zipped subtitles from a site.

If there is CAPTCHA protection, I don't think web crawlers can get past it (maybe
in the future they will be able to).
But even without CAPTCHA protection, there is one annoying thing that
prevents the crawler from ripping the site.
To give an example, let's look at the site tvsubtitles.net.
When we get a page with a "download" link, that link points to something like
"sitename/download-123.html".
But when the server gets this request, it returns raw data (the zip file)
instead of an HTML document.
Browsers cope with this situation easily, i.e. they show a dialog box asking
where to save the file.

But crawlers can't do that. How would I download such files?
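
To make the situation concrete: what the browser does here is generally driven
by the HTTP response headers rather than by the ".html" ending of the link. Below
is a minimal Python sketch of that decision, not how HTTrack itself works. It
assumes the server sends a Content-Type header and possibly a
Content-Disposition header with a suggested file name; I have not checked what
tvsubtitles.net actually sends. The function names (is_html, pick_filename,
fetch), the example.com address and the "download.bin" fallback are made up for
the illustration.

```python
import re
import urllib.request
from pathlib import PurePosixPath
from urllib.parse import urlparse


def is_html(content_type):
    """Return True when the Content-Type header describes an HTML page."""
    media_type = (content_type or "").split(";")[0].strip().lower()
    return media_type in ("text/html", "application/xhtml+xml")


def pick_filename(url, content_disposition):
    """Prefer the name suggested by the server; fall back to the URL path."""
    match = re.search(r'filename="?([^";]+)"?', content_disposition or "", re.I)
    if match:
        # Keep only the last path component of the suggested name.
        suggested = match.group(1).strip().replace("\\", "/")
        return PurePosixPath(suggested).name
    # No usable header: use the last part of the URL path, or a
    # hypothetical default name when the path is empty.
    return PurePosixPath(urlparse(url).path).name or "download.bin"


def fetch(url):
    """Download url and save it to disk unless the server says it is HTML."""
    with urllib.request.urlopen(url) as response:
        body = response.read()
        if is_html(response.headers.get("Content-Type")):
            return None  # an ordinary page; a crawler would parse it for links
        name = pick_filename(url, response.headers.get("Content-Disposition"))
    with open(name, "wb") as out:
        out.write(body)
    return name
```

With this sketch, a call such as fetch("http://example.com/download-123.html")
would save the body under the server's suggested name if the response is not
HTML, and under "download-123.html" if no name is suggested.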
 