HTTrack Website Copier
Free software offline browser - FORUM
Subject: indext.txt: ignored
Author: steffen
Date: 07/18/2005 19:32
 
Hi!

I am very happy to have the searchable index that httrack generates.
Unfortunately, not all words get indexed. Some words appear only as follows:
-----------
home
        ignored (53)
-----------
This is a very good idea for common stopwords like 'and'. But since httrack
indexes all words in every HTML document, words that appear in the menu,
header or footer quickly end up ignored.
Is there an option to change this behaviour? It would be great if httrack
would at least list the first 1000 documents in which the word is found and
ignore only the rest, rather than ignoring all documents.
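
To illustrate the suggestion, here is a minimal Python sketch of the capped behaviour. It is not httrack's code and says nothing about how httrack builds index.txt internally; build_index, MAX_DOCS_PER_WORD and the input format are all assumptions made up for this example.

```python
from collections import defaultdict

# Hypothetical cap, taken from the "first 1000 documents" suggestion.
MAX_DOCS_PER_WORD = 1000


def build_index(documents, max_docs=MAX_DOCS_PER_WORD):
    """Build a capped word index.

    documents: dict mapping file name -> plain text of that file.
    Returns a dict mapping each word to a tuple of
    (list of (file name, hit count), number of documents skipped).
    """
    postings = defaultdict(list)
    skipped = defaultdict(int)

    for name, text in documents.items():
        # Count how often each word occurs in this document.
        counts = {}
        for word in text.lower().split():
            counts[word] = counts.get(word, 0) + 1

        for word, hits in counts.items():
            if len(postings[word]) < max_docs:
                # Still under the cap: list this document.
                postings[word].append((name, hits))
            else:
                # Over the cap: ignore only this document, keep the earlier ones.
                skipped[word] += 1

    return {word: (postings[word], skipped[word]) for word in postings}
```

With this approach a word like 'home' would still list the first documents it was found in, plus a count of the ones left out, instead of a bare "ignored" entry.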

Any help is very welcome,
Steffen
 