HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: Filter of query string
Author: Ryuu
Date: 06/11/2013 20:13
 
5)
- Will 'No External pages' exclude other servers in the same domain for us? If
the URL is <http://pokemon.wikia.com/wiki/Pok%C3%A9mon_Wiki>,
will it exclude <http://community.wikia.com/>* or not?
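
To make the question concrete: the two URLs sit on different hosts that share
the same parent domain. A minimal Python sketch of that distinction follows. It
is not HTTrack code, and the "last two labels" rule is a naive assumption that
only happens to fit these host names.

```python
from urllib.parse import urlsplit

start = "http://pokemon.wikia.com/wiki/Pok%C3%A9mon_Wiki"
other = "http://community.wikia.com/"

start_host = urlsplit(start).hostname
other_host = urlsplit(other).hostname


def last_two_labels(host):
    """Naive parent-domain guess: keep only the last two dot-separated labels."""
    return ".".join(host.split(".")[-2:])


same_host = start_host == other_host
same_domain = last_two_labels(start_host) == last_two_labels(other_host)
print(same_host, same_domain)  # False True
```

So the question is whether 'No External pages' works at the host level or at
the domain level.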

And sorry for causing the misunderstanding.
1-3)
- It's the real case. I only added filters to make it smaller for both of us,
e.g. to leave out the image servers. I also tested with these filters, so they
do not interfere with the real result. They just cut out unrelated factors that
I know well. I (and I think others too) don't want to load images because they
are not related to the problem.
- The page really exists. And I only use those filters, to test what
*[file] really matches, so I don't use +*.js or -*.js or anything else.
Just mirror that URL with those filter rules and you will get the same
result.
<http://pokemon.wikia.com/wiki/Pok%C3%A9mon_Wiki> links to
<http://pokemon.wikia.com/wiki/Pok%C3%A9mon_Wiki?action=edit>, which I thought
should not match the filter at first, but it seems it does. That is what you
helped me with: *[file]*[]
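
The only difference between the two links is the query string. A short Python
sketch of where the split falls, using the standard library rather than
HTTrack's own matcher:

```python
from urllib.parse import parse_qs, urlsplit

page = "http://pokemon.wikia.com/wiki/Pok%C3%A9mon_Wiki"
edit = page + "?action=edit"

parts = urlsplit(edit)
print(parts.path)             # /wiki/Pok%C3%A9mon_Wiki
print(parts.query)            # action=edit
print(parse_qs(parts.query))  # {'action': ['edit']}

# The plain page has the same path but an empty query string.
assert urlsplit(page).path == parts.path
assert urlsplit(page).query == ""
```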

** %C3%A9 is é, but HTT handles %C3%A9 correctly because the link in the page
source is also written as %C3%A9
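
For reference, the %C3%A9 / é equivalence can be checked with Python's standard
library (UTF-8 percent-encoding). This is only an illustration of the encoding,
not of anything HTT does internally.

```python
from urllib.parse import quote, unquote

encoded = "Pok%C3%A9mon_Wiki"

decoded = unquote(encoded)  # percent-decoding, UTF-8 by default
print(decoded)              # Pokémon_Wiki

# Encoding the decoded name again gives back the form used in the page source.
assert quote(decoded) == encoded
```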

4) If you want everything, use the near flag (get non-html files related), not
filters.
- I always use that, but I still need filters because I don't want to mirror
the whole website. I want to trim it down to only what is required, to keep
both the server load and the used disk space as small as possible.
Even if the website has no limits, or HTT already limits the load, I think we
should still filter properly, not just download everything.

**Also, it's not a 'get them all' or 'reject all' job. Whatever the answer to
5) is, my job still needs filters.
- If it only covers the exact domain, I need to include
<http://images.wikia.com/>* too if I want images (and
<http://images*[0-9].wikia.nocookie.net>, which is in another domain).
Meanwhile it should exclude <http://community.wikia.com/>* and some of the
images, which are unwanted. So 'get non-html files related' alone isn't enough
for this type of website :P
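
The include/exclude mix described above can be sketched in Python. This is not
HTTrack's filter engine: the regular expressions are a hand translation of the
hosts named in this post, "last matching rule wins" is an assumption about rule
priority, and names such as RULES and wanted are made up for the illustration.

```python
import re

# (include?, regex on the URL without its scheme). Hypothetical translation
# of the hosts named in the post; not real HTTrack filter syntax.
RULES = [
    (True, r"^pokemon\.wikia\.com/"),
    (True, r"^images\.wikia\.com/"),
    (True, r"^images\d*\.wikia\.nocookie\.net/"),
    (False, r"^community\.wikia\.com/"),
]


def wanted(url, default=False):
    """Return the verdict of the last matching rule, or default if none match."""
    target = re.sub(r"^https?://", "", url)
    verdict = default
    for include, pattern in RULES:
        if re.search(pattern, target):
            verdict = include  # a later rule overrides an earlier one
    return verdict


print(wanted("http://pokemon.wikia.com/wiki/Pok%C3%A9mon_Wiki"))  # True
print(wanted("http://community.wikia.com/"))                      # False
```

The point is that neither "accept everything nearby" nor "reject everything
external" gives this result on its own; some hosts outside the start host have
to be let in while others are kept out.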
 