Archive for September 15th, 2009

  • Sep
    15
    2009

    Getting tables out of PDFs in Italy

    The Italian Parliament annoys me tremendously. Not for substantial reasons (though it might also annoy me for that reason), but for technical reasons.
    They have some nicely formatted XML files for the resoconti (minutes) of each parliamentary sitting.
    But their voting information is stuck in crappy PDFs.
    Grrr.
    So, I have to

    download all the PDF files using a horrible [...]

 
Powered by Wordpress and MySQL. Theme by openark.org