HTTrack Website Copier
Free software offline browser - FORUM
Subject: Filters and External Files
Author: Dan
Date: 01/16/2012 21:35
 
I have a website with the following structure:  

twiki/bin/view/SIG/index.html->Documentation.html->XXX.html->YYYYYPreviousVersions.html
twiki/pub/SIG/XXX/*.pdf
twiki/pub/SIG/YYYYYPreviousVersions/*.pdf


All of the html files are in the “twiki/bin/view/SIG” directory.  The links
indicated above indicate the site hierarchy where no directory structure
exists.  There are many other links in the various pages which aren’t to be
copied.  All of the data files (.pdf) are organized in the “twiki/pub/SIG”
sub-directories.  I only want to copy links and data within the
“twiki/bin/view/SIG” directory, “twiki/pub/SIG” directory, and
“twiki/pub/SIG” sub-directories.  Additionally, I want to limit the mirror
to portions of the site, specifically, SIG->Documentation,
SIG->Documentation->XXX and SIG->Documentation->XXX->YYYYYPreviousVersions.

I’m starting with the Documentation page and trying to use filters to select
one XXX page and linked YYYYYPreviousVersion pages. The problem I’m having is
the “twiki/pub/SIG/YYYYYPreviousVersions/*.pdf” files are being interpreted
as external files and are not copied.  

Here’s the filter:  
-*
+*.png +*.gif +*.jpg +*.css +*.js -ad.doubleclick.net/*
-mime:application/foobar
+*/twiki/bin/view/SIG/Documentation*
+*/twiki/bin/view/SIG/XXX*[]
+*/twiki/bin/view/SIG/XXX/*
+*/twiki/bin/view/SIG/*[0-9]PreviousVersions*
+*.pdf
 
Reply


All articles

Subject Author Date
Filters and External Files

01/16/2012 21:35
Re: Filters and External Files

01/16/2012 23:45
Re: Filters and External Files

01/20/2012 23:25
Re: Filters and External Files

01/21/2012 16:43




7

Created with FORUM 2.0.11