Mirule-etusivu

Sisältö

TODO: "Spider" for automatic retrieval of protocols

Mikael LönnrothMikael LönnrothYli 2 vuotta sitten
Main features:
- Take starting URL and drill down through Groups, Meetings to retrieve information
- Match groups with Mirule Organizations
- Match meetings with Mirule meetings
- Match people with Mirule people

DONE:
- Store web pages for later mining/reruns with better algorithms

I just ran the spider on the Sibbo web site for 2009 and downloaded all protocols + attachments (2.5 GB). Yesterday's 0.5GB was just for the council data, 2.5GB includes all data.
Teema:
Näytetty: 621Show/hide versionsShow selected diff

Muutokset versiosta

versioon

Comment
Minor
3.4.2010 16:36:06
3.4.2010 16:36:06
2.4.2010 20:04:08
2.4.2010 20:04:08
2.4.2010 12:39:20
2.4.2010 12:39:20
Lisää kommentti
Anonyymi
TAI
Otsikko (tai jätä tyhjäksi):

Teksti: