HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Archiving Websites - Questions
Author: Bo Hovgaard Thomasen
Date: 11/29/2007 09:48
 
> 1) Can WinHTTrack 3.42 be used to archive websites
> that change daily? For example, let's say I want to
> create a news archive from a news site that updates
> daily. At the moment I do this by saving every
> single article manually in Internet Explorer using
> the "Page" --> "Save As" feature, but this is
> extremely time consuming. I surmise that WinHTTrack
> can do this with one click (or even automatically),
> but what if the website changes daily and the old
> articles disappear from the website. When WinHTTrack
> updates my offline download of the site, what
> happens to the old articles - do they also
> disappear? 

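You can use HTTrack to make a fresh archive of the desired web pages every
day. Name each archive after the current date (and perhaps the time), e.g.
071129_httrack_com_0915 for an archive of httrack.com made today at 9:15.
Because each day's copy goes into its own folder, articles that later
disappear from the site are still there in the earlier archives.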

This can be automated via scripts (e.g. using MS-DOS batch scripts or
Microsoft PowerShell).   
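
As a rough illustration of the dated-archive idea, here is a minimal Python
sketch rather than a batch or PowerShell script. The function names, the
stripping of a leading "www.", and the base folder are my own assumptions;
the only HTTrack feature used is the -O option, which sets the output path.
It also assumes the httrack executable is on your PATH.

```python
import subprocess
from datetime import datetime
from pathlib import Path
from urllib.parse import urlparse


def archive_name(url, when):
    """Build a name such as 071129_httrack_com_0915 from a URL and a timestamp."""
    host = urlparse(url).hostname or "site"
    # Assumption: drop a leading "www." so www.httrack.com becomes httrack_com.
    if host.startswith("www."):
        host = host[4:]
    return f"{when:%y%m%d}_{host.replace('.', '_')}_{when:%H%M}"


def build_command(url, base_dir, when):
    """Return the httrack command line that mirrors url into a dated folder."""
    target = Path(base_dir) / archive_name(url, when)
    # -O sets the output path for the mirror.
    return ["httrack", url, "-O", str(target)]


def run_archive(url, base_dir):
    """Run one dated archive; assumes the httrack executable is on the PATH."""
    command = build_command(url, base_dir, datetime.now())
    return subprocess.run(command, check=True)
```

Calling run_archive("http://www.httrack.com/", "archives") once a day, for
instance from the Windows Task Scheduler, would then give one folder per run.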

> 2) Can archived files be browsed in Internet
> Explorer or do they have to be browsed by some
> specialized browser? I tried using Webaroo some
> months ago and found that the downloaded websites
> can only be browsed in Webaroo. 

Yes, the archived files are saved so that they can be browsed offline in
your standard web browser.
 
> 3) Can the archived data be moved... say to an
> archive harddrive? As I already mentioned, I tried
> using Webaroo some months ago, but it was not
> possible to move the files. 

Yes. By default, the hyperlinks in the archived files are rewritten so that
they refer to the local (archived) copy of the web pages. In other words, you
can copy the files to any drive, and the browsing experience will be the
same. The archives are portable.

> 4) Is it possible to use WinHTTrack to download data
> from RSS feeds? RSS feeds send me the headlines with
> links to articles, so would WinHTTrack be able to
> use these links to download the articles themselves?

If you have the links, yes. You can tell HTTrack to download the web pages
they refer to. Use filters to limit the archiving to just the desired pages.
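
To make the RSS idea concrete, here is a minimal Python sketch that pulls the
article links out of an RSS 2.0 feed and turns them into an httrack command
line. The function names are hypothetical, the feed is assumed to be plain
RSS 2.0 with a <link> inside each <item>, and any filter strings are passed
through unchanged, so check them against the HTTrack filter documentation.

```python
import xml.etree.ElementTree as ET


def article_links(rss_xml):
    """Return the <link> text of every <item> in an RSS 2.0 document."""
    root = ET.fromstring(rss_xml)
    links = []
    for item in root.iter("item"):
        link = item.findtext("link")
        # Skip items that have no usable link.
        if link and link.strip():
            links.append(link.strip())
    return links


def build_feed_command(links, target_dir, filters=()):
    """Return an httrack command line that downloads the given article URLs.

    filters is an optional sequence of HTTrack filter strings (assumption:
    they are already written in HTTrack's +pattern / -pattern form).
    """
    # -O sets the output path; URLs and filters are plain arguments.
    return ["httrack", *links, "-O", target_dir, *filters]
```

You would feed article_links() the text of the downloaded feed and hand the
resulting list to subprocess.run(), assuming httrack is on your PATH.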

> 5) Is it possible to have WinHTTrack download data
> from some sites in terms of .mht files and others in
> terms of .html and .htm files? 

I believe .mht output is possible if you use the -%M command-line
parameter.


- Bo
 