HTTrack Website Copier
Free software offline browser - FORUM
Subject: Re: DL set # of pages? and edit pages?
Author: Iain Elder
Date: 02/11/2014 01:52
 
The site is organized by days and months. I'm guessing you just want the
months.

Every month page on the site has a URL like this:

<http://alohaki.jugem.jp/?month=YYYYMM>

Where YYYY is the year and MM is the month. The current month is 201402.

I'm guessing that you mean the current month is page 1, the previous month is
page 2, and so on.

So if you want pages 1 thru 12, you want these months:

01 -> 201402
02 -> 201401
03 -> 201312
04 -> 201311
05 -> 201310
06 -> 201309
07 -> 201308
08 -> 201307
09 -> 201306
10 -> 201305
11 -> 201304
12 -> 201303

Use the filter syntax to download just these HTML pages and JPG images they
contain. <http://www.httrack.com/html/filters.html>

Something like this should work:

httrack <http://alohaki.jugem.jp/> -* +*month=20140[1-2] +*month=20130[3-9]
-*month=20131[0-2] +*.jpg

If you want just pages 1 thru 5 and and 8 thru 10, use the conversion table
above to write filters for just those pages.

You can view them one after the other by opening the offline copy of the site
in your browser.

The pictures contain no metadata (e.g. "surfing" in the title or the EXIF
tags) so there's no easy automatic way to keep just the surfing pictures.

You can manually browse the offline copies and delete the ones you don't want.
 
Reply Create subthread


All articles

Subject Author Date
DL set # of pages? and edit pages?

02/03/2014 14:09
Re: DL set # of pages? and edit pages?

02/11/2014 01:52




c

Created with FORUM 2.0.11