Hello,
I am trying to download all the pdf transit schedules from this website:
<http://www.riderta.com/>
They used to just be listed on one page and I would download them all, but now
they have a new site :(
I understand that I must let it spider the site, with filters such as:
-* +*.pdf +*.html
The problem is that each route has a separate page with a link to the pdf, but
the route page URLs have no file extension, hence +*.html does not match them.
example:
<http://www.riderta.com/routes/14>
All the PDFs are in one directory, though there is not a page where they are
all listed.
<http://www.riderta.com/sites/default/files/schedule-pdfs/>
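To make it clearer what I am after, here is a rough sketch of the link extraction I would like the spider to do on each route page. It only uses the Python standard library and does no downloading. The function name extract_pdf_links and the file name example.pdf are made up for illustration; I do not know the real file names or how the route pages are actually marked up.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class PdfLinkParser(HTMLParser):
    """Collect the targets of <a href="..."> links that end in .pdf."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.pdf_links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        for name, value in attrs:
            # Ignore any query string when checking the extension.
            if name == "href" and value and value.split("?")[0].lower().endswith(".pdf"):
                # Resolve relative links against the page they were found on.
                self.pdf_links.append(urljoin(self.base_url, value))


def extract_pdf_links(html, base_url):
    """Return absolute URLs of all PDF links found in one page's HTML."""
    parser = PdfLinkParser(base_url)
    parser.feed(html)
    return parser.pdf_links


if __name__ == "__main__":
    # Hypothetical markup; the real route pages may look different.
    sample = '<a href="/sites/default/files/schedule-pdfs/example.pdf">Schedule</a>'
    print(extract_pdf_links(sample, "http://www.riderta.com/routes/14"))
```

In other words, the route pages need to be fetched and parsed even though their URLs do not end in .html, and only the PDF links found on them should be kept.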
Any help is appreciated. Thanks!