HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: source code of httrack
Author: Yushatak
Date: 08/29/2011 16:20
 
Yeah, that's really easy: download the HTML file from the URL that gets input
to your program, then look through it for anchor tags. Split the text on the
opening <a ...> tags, then split each of those pieces on the closing </a> tag,
so each piece holds one link (its href attribute and its link text). Then
enumerate through the results or just list/save them. I was building a crawler
once and that's as far as I got.
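
For anyone who wants to try it, here is a rough Python sketch of the splitting approach described above. It is not HTTrack's code and not a real HTML parser. The function names fetch_html and extract_links are made up for illustration, and it assumes the page decodes as UTF-8 and that href values are quoted.

```python
import re
from urllib.request import urlopen


def fetch_html(url):
    """Download a page and return its text (assumes UTF-8, replaces bad bytes)."""
    with urlopen(url) as response:
        return response.read().decode("utf-8", errors="replace")


def extract_links(html):
    """Return (href, link_text) pairs found by splitting on anchor tags."""
    links = []
    # Split on the start of each opening anchor tag, case-insensitively.
    # The first piece is everything before the first anchor, so skip it.
    pieces = re.split(r"<a\s", html, flags=re.IGNORECASE)[1:]
    for piece in pieces:
        # Keep only the part before the closing </a> tag.
        anchor = re.split(r"</a>", piece, maxsplit=1, flags=re.IGNORECASE)[0]
        # anchor now looks like: href="..." other-attrs>link text
        attrs, _, text = anchor.partition(">")
        # Assumption: the href value is wrapped in single or double quotes.
        match = re.search(r"""href\s*=\s*["']([^"']*)["']""", attrs, flags=re.IGNORECASE)
        if match:
            links.append((match.group(1), text.strip()))
    return links
```

To list the links on a page you would call extract_links(fetch_html(url)) and loop over the result. Splitting like this trips over unquoted attributes, a ">" inside an attribute value, and anchors inside comments or scripts, so for anything beyond a quick experiment the standard library's html.parser module is probably a safer starting point.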
 

