HTTrack Website Copier
Free software offline browser - FORUM
Subject: httrack for parsing & mining a phpBB-forum
Author: metabo
Date: 08/04/2006 19:28
 
Hello all HTTrack users,

I am looking for a good tool for parsing and mining a phpBB forum.


If your application is supposed to work over a long period of time, or
gather data from a variety of forum sites with different themes, then you will
need to modify it.

First off, I have to explain something: I have to grab some data out of a
phpBB forum in order to do some field research. I need the data out of a forum
that is run by a user community. I need the data to analyze the discussions.

To give an example, let us take this forum here. How can I grab and harvest
all the data out of this forum, get it onto my local machine, and afterwards
put it into a local database of a phpBB forum? Is this possible?

What I have in mind is nothing harmful, nothing bad, and nothing dangerous.
But the issue is that I have to get the data.
I need to extract the forum messages and other data (forum topics, users) into
a database. Purpose: create a copy of the forum for text analysis. Does anyone
have an approximate solution?
The data has to be fetched through HTTP for further analysis and put into CSV,
in order to get a dump that can fill a local database of a phpBB board.

For an example, see here:

<http://www.nukeforums.com/forums/viewforum.php?f=3&sid=3b8a53170356aad30f52d3ba1f449ece>


I need the data in an almost full and complete format. So I need all the data,
such as:

username
forum
thread
topic
text of the posting, and so on.
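
To make the wish list above more concrete, here is a rough sketch in Python of
the kind of extraction I have in mind. It is only an illustration under
assumptions: the class names `name` and `postbody` are guesses at what a phpBB
theme might use, the sample HTML is made up, and `PostParser`, `parse_posts`
and `posts_to_csv` are hypothetical helper names. Fetching the pages over HTTP
is left out.

```python
import csv
import io
from html.parser import HTMLParser

# Assumption: the theme marks author and message with these span classes.
# A different theme will need a different mapping.
FIELD_BY_CLASS = {"name": "username", "postbody": "text"}


class PostParser(HTMLParser):
    """Collect username/text records from the HTML of one topic page."""

    def __init__(self):
        super().__init__()
        self.posts = []      # finished records
        self._current = {}   # record being built
        self._field = None   # field currently being read, if any
        self._depth = 0      # <span> nesting depth inside that field

    def handle_starttag(self, tag, attrs):
        if tag != "span":
            return
        if self._field is not None:
            self._depth += 1  # nested span inside a field we are reading
            return
        field = FIELD_BY_CLASS.get(dict(attrs).get("class"))
        if field is not None:
            self._field = field
            self._current[field] = ""
            self._depth = 1

    def handle_endtag(self, tag):
        if tag != "span" or self._field is None:
            return
        self._depth -= 1
        if self._depth == 0:
            finished, self._field = self._field, None
            self._current[finished] = self._current[finished].strip()
            if finished == "text":  # the message text closes a record
                self.posts.append(self._current)
                self._current = {}

    def handle_data(self, data):
        if self._field is not None:
            self._current[self._field] += data


def parse_posts(html):
    parser = PostParser()
    parser.feed(html)
    parser.close()
    return parser.posts


def posts_to_csv(posts, forum, topic):
    """Return the records as CSV text, one row per posting."""
    buf = io.StringIO()
    writer = csv.DictWriter(
        buf, fieldnames=["forum", "topic", "username", "text"], restval=""
    )
    writer.writeheader()
    for post in posts:
        writer.writerow({"forum": forum, "topic": topic, **post})
    return buf.getvalue()


# Made-up sample markup, not taken from a real forum page.
SAMPLE_HTML = """
<tr><td><span class="name"><b>alice</b></span></td>
<td><span class="postbody">Hello forum</span></td></tr>
<tr><td><span class="name"><b>bob</b></span></td>
<td><span class="postbody">Hi <span style="font-weight: bold">alice</span></span></td></tr>
"""

print(posts_to_csv(parse_posts(SAMPLE_HTML), forum="General", topic="Welcome"))
```

The forum and topic names are passed in by hand here; in a real run they would
have to come from the pages themselves.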

How can I do that?
I need some kind of grabbing tool. Can I do it with that kind of tool? How
do I solve the issue of storing the data in the local MySQL database?
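
For the storing step, a sketch along the following lines might be a starting
point. To keep it self-contained it uses Python's built-in `sqlite3` module as
a stand-in for MySQL, so it is not the MySQL setup itself. The `posts` table
and its columns are hypothetical and are not phpBB's real schema, and the rows
are made-up sample data.

```python
import sqlite3

# Hypothetical flat table for text analysis; not the phpBB schema.
SCHEMA = """
CREATE TABLE IF NOT EXISTS posts (
    id       INTEGER PRIMARY KEY,
    forum    TEXT NOT NULL,
    topic    TEXT NOT NULL,
    username TEXT NOT NULL,
    text     TEXT NOT NULL
)
"""


def store_posts(conn, rows):
    """Insert (forum, topic, username, text) tuples in one transaction."""
    conn.execute(SCHEMA)
    with conn:  # commits on success, rolls back on error
        conn.executemany(
            "INSERT INTO posts (forum, topic, username, text) "
            "VALUES (?, ?, ?, ?)",
            rows,
        )


def posts_per_user(conn):
    """Example analysis query: number of postings per user."""
    return conn.execute(
        "SELECT username, COUNT(*) FROM posts "
        "GROUP BY username ORDER BY username"
    ).fetchall()


# In-memory database so the sketch leaves no file behind.
conn = sqlite3.connect(":memory:")
store_posts(
    conn,
    [
        ("General", "Welcome", "alice", "Hello forum"),
        ("General", "Welcome", "bob", "Hi alice"),
        ("General", "Rules", "alice", "Please read the rules"),
    ],
)
print(posts_per_user(conn))
```

The placeholders (`?`) keep the posting text from being interpreted as SQL,
which matters because forum messages can contain quotes and other special
characters.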

Well, you see, this is tricky work, and I am pretty sure that I will get help
here. I am very thankful for any and all help.

Many thanks in advance.


Regards,
metabo
 



