I am looking for someone to develop an HTTrack application.
Here are the objectives:
* Load or read 10,000 to 50,000 website URLs (this can be done in batches of
100 or 1,000 at a time, whatever is reasonable).
* Search 2-3 levels deep in each website for key vendor names (up to 500),
such as Cisco, Juniper, Symantec, etc., usually found on the "Partners" or
"Solutions" pages.
* Extract each of those key names only once, even if it appears many times.
* Place those key vendor names into a spreadsheet column (or text file) under
the URL title.
* Basically, the objective is to build a list of vendor names for each URL.
Please let me know whether this can be done with HTTrack and whether you are
interested.
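
To make the matching and de-duplication objectives concrete, here is a rough Python sketch of what I have in mind. It is only an illustration under assumptions: it presumes the page text for each site has already been fetched (for example, read from a local mirror), the names `VENDORS`, `find_vendors`, and `write_report` are hypothetical, and it writes one CSV row per URL rather than one column per URL.

```python
import csv
import io
import re

# Hypothetical sample; the real vendor list would hold up to 500 names.
VENDORS = ["Cisco", "Juniper", "Symantec"]


def build_pattern(vendors):
    # Whole-word, case-insensitive match; longer names are tried first.
    names = sorted(vendors, key=len, reverse=True)
    body = "|".join(re.escape(name) for name in names)
    return re.compile(r"\b(" + body + r")\b", re.IGNORECASE)


def find_vendors(pages, vendors):
    """Return each vendor found in any page text, once, in vendor-list order."""
    pattern = build_pattern(vendors)
    canonical = {v.lower(): v for v in vendors}
    found = set()
    for text in pages:
        for match in pattern.finditer(text):
            found.add(canonical[match.group(1).lower()])
    return [v for v in vendors if v in found]


def write_report(site_pages, vendors, out):
    """Write one CSV row per URL: the URL followed by its vendor names."""
    writer = csv.writer(out)
    for url, pages in site_pages.items():
        writer.writerow([url] + find_vendors(pages, vendors))


if __name__ == "__main__":
    # Assumed input shape: URL -> list of page texts gathered for that site.
    sample = {
        "http://example.com": [
            "<h1>Partners</h1> Cisco, cisco and Juniper",
            "Solutions by CISCO",
        ]
    }
    buffer = io.StringIO()
    write_report(sample, VENDORS, buffer)
    print(buffer.getvalue())
```

The whole-word match is there so that a name like "Cisco" is not picked up inside "Francisco"; vendor names that end in punctuation would need a looser pattern.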