| I need to know a way to run httrack so it do not store anything downloaded but
run a script on the retrieved htm. For example, a case of use, it would be
useful to collect tracking codes of google analytics that exists in a page
outputting them in csv format.
What I've tried so far is
httrack --skeleton <http://www.example.com> \
-V "if grep -iEo \"UA-[0-9]+-[0-9]+\" \"\$0\">/dev/null; then UA=\`grep -iEo
\"UA-[0-9]+-[0-9]+\" \"\$0\"\` echo \"\$UA \$0\"; fi" \
-*.gif \
-*.jpg \
-*.png \
-*.jpeg \
-*.tif \
-*.tiff
But that code saves files.. | |