Mirule front page

Sisältö

TODO: "Spider" for automatic retrieval of protocols

Mikael LönnrothMikael LönnrothOver 2 years ago
Main features:
- Take starting URL and drill down through Groups, Meetings to retrieve information
- Match groups with Mirule Organizations
- Match meetings with Mirule meetings
- Match people with Mirule people

DONE:
- Store web pages for later mining/reruns with better algorithms

I just ran the spider on the Sibbo web site for 2009 and downloaded all protocols + attachments (2.5 GB). Yesterday's 0.5GB was just for the council data, 2.5GB includes all data.
Theme:
Views: 622Show/hide versionsShow selected diff

Muutokset versiosta

versioon

Comment
Minor
4/3/2010 4:36:06 PM
4/3/2010 4:36:06 PM
4/2/2010 8:04:08 PM
4/2/2010 8:04:08 PM
4/2/2010 12:39:20 PM
4/2/2010 12:39:20 PM
Add Comment
Anonymous
OR
Subject (optional):

Content: