HTTrack Website Copier
Free software offline browser - FORUM
Subject: HTTrack a website analyzing on the fly
Author: debuti
Date: 02/23/2015 09:54
 
I need to know a way to run httrack so it do not store anything downloaded but
run a script on the retrieved htm. For example, a case of use, it would be
useful to collect tracking codes of google analytics that exists in a page
outputting them in csv format.

What I've tried so far is

httrack --skeleton <http://www.example.com> \
-V "if grep -iEo \"UA-[0-9]+-[0-9]+\" \"\$0\">/dev/null; then UA=\`grep -iEo
\"UA-[0-9]+-[0-9]+\" \"\$0\"\` echo \"\$UA \$0\"; fi" \
-*.gif \
-*.jpg \
-*.png \
-*.jpeg \
-*.tif \
-*.tiff
But that code saves files..
 
Reply


All articles

Subject Author Date
HTTrack a website analyzing on the fly

02/23/2015 09:54




d

Created with FORUM 2.0.11