Is it possible to mash up data from Direct.gov?

January 26, 2008 by Tim
Filed under: Transfered 

There is useful data in local direct gov – but can we get at it with a web service to create mash-ups for public benefit?

<warning – slightly geeky post coming up>

Last year we tried to raise awareness of the Youth Opportunity Fund and Youth Capital Fund with the Actions Speak Louder campaign. The campaign, targetted at young people – involved a national awards ceremony and publicity campaign – but the goal was to help young people find their local Youth Opportunity Fund grant making panel.

YOF/YCF serach on MySpace from Direct Gov?

The only place we could find a directory of Local Youth Opportunity fund websites was through Local.Direct.gov.uk, and it seems like you can only search on the Local Direct Gov orange website. However, we wanted to be able to pull a search of local Youth Opportunity Funds into a widget on the Actions Speak Louder mySpace website instead of pointing people off to Direct Gov.

Direct Gov Youth Funds

As far as I knew at the time, that couldn't be done. Local Direct Gov doesn't appear to provide an XML feed, or web service API. However, here at the BarCampUKGovWeb Paul Clarke who works with Direct Gov has suggested it might just be possible. He notes:

Hantsweb gave an interesting presentation at the last Directgov Open Day about how one local authority has used the Local Directgov functionality to enhance the way it routes interested citizen to relevant local services in its area (and close surroundings outside the county).

So I'm hopefuly. But with a bit more searching I'm little further forward.

So, I still have three questions:

1) Does anyone know if its possible to query the Local Direct Gov data as a webservice? Or do I have to always direct users off to the dreaded orange pages?

2) If it is – does anyone know how?

3) If it isn't – what would it take to make it possible?


Update: follow up post here

Comments

31 Responses to “Is it possible to mash up data from Direct.gov?”

  1. Anonymous on January 31st, 2008 3:25 pm

    I’m not clear on what you’re actually trying to do…

  2. Tim on January 31st, 2008 6:59 pm

    For example:

    I’d like to have access to the Local Direct Gov data in a way that made it easy for me to create a Widget that would let a visitor to the MySpace page we used in the campaign noted above, to enter their town or postcode, and to either get taken direct to their local Youth Opportunity Fund web pages, or to have the details of their local Youth Opportunity Fund displayed back to them in the widget.

  3. Tim on May 15th, 2008 9:22 pm

    Update: This post from Simon Berry’s secondment to DCLG might show one way for my particular YOF/YCF suggestion to move forward:

    http://www.web24gov.org.uk/node/11

    A government data ‘widget factory’…

  4. sunilm on August 9th, 2008 7:10 pm

    i read the post thrice without reading the comments and was not able to get to the head or tail of it… but why would you need a widget to get govt data…

  5. Diseño web on August 9th, 2008 7:46 pm

    Thanks for this work!!! Its great.

    Very thanks.

  6. Daniel K on August 9th, 2008 9:40 pm

    Some nice questions. I would love to answer even one of those but i cant, hehe. But nevertheless, thx for the info

  7. Barbarism on August 10th, 2008 3:47 pm

    This may sound silly, but did you actually make an attempt to contact them over the matter?

  8. John London on August 10th, 2008 7:18 pm

    Hi there,
    Like some of the other comments here. I think your a bit unclear about what you wnated. But, I am pretty sure I get it.
    There are many ways to accomplish what you seek. Perhaps one of the easiest ways maybe to get a current version of FileMaker and use it to extract the data you wish from the target website or any website. Using Filemaker, you can teach it to red and extract what you need.
    However, please remember that while governemt websites are generally open for content extraction. They do often contain data that is not FREE to extract without PERMISSION.
    Naturally, data from any non-government website, should be assumed not for reprint without permission. Cool.
    Now frankly, for the project you seem to be dealing with, I would think that any request for re-print rights will be granted, but always offer a backlink and credit. Also, before investing in Filemaker, you may find that upon request, most source will be happy to make it easy for you to use RSS or some means that they provide to do what you wish.
    The moral of the story, asking the website’s webmaster is always a great place to start. 
    Thanks and Best wishes with your project
    John

  9. Warhammer Online on August 10th, 2008 9:38 pm

    I have a feeling this is not possible but good luck :)

  10. Beatrice on August 10th, 2008 11:37 pm

    I read John London’s comment and I think he’s right about contacting the webmaster, put i think there is a way to parse what you need.

  11. Hunter on August 11th, 2008 12:25 am

    Hey there,

    It’s an interesting possibility–although scraping XML is much easier than trying to scrape something else. It’ll be interesting to see if you can figure it out…:P

  12. Tim on August 11th, 2008 9:12 am

    Hey all

    Thanks for the many comments. There is a follow up post from around the time of this one here: http://www.timdavies.org.uk/2008/01/29/getting-data-out-directgov

    (Not quite sure why this particular post is gaining so much interest now – what brought you all here?)

    For the bigger picture on government data – take a look at http://www.showusabetterway.com

    There are two issues here:

    1) The technical issue – how to get the data out. Parsing and getting all the data into a separate database is less than idea – as it relies on regular parsing to keep the data up-to-date, and is not a stable long-term solution.

    2) The issue of principle – Government data should be collected for public benefit – and so should be provided in free and open ways that maximise public benefit from its collection. At present, Local Direct Gov does not make that easy.

  13. Hunter on August 11th, 2008 1:58 pm

    “(Not quite sure why this particular post is gaining so much interest now – what brought you all here?)”

    You’re such a nice guy!

    And your site has good backlinks for commentators. Who wouldn’t love that :P

  14. Dawn on August 11th, 2008 5:55 pm

    I came acrossed your blog while searching for an article I had read about an international gesture to thank soldiers for their services. So many people are a little too shy to walk up to and thank a soldier. So a gesture was created to say thank you without actually having to walk up and initiate a conversation. Anyhow, I was searching for actions speak louder than words and followed a few links and here I am. :)

    This was interesting reading. What would one do when they found their local grant making panel ?

  15. Snakewood on August 11th, 2008 9:24 pm

    The issue of principle might be the biggest issue.
    Well done! Good article.

  16. Yohan on August 11th, 2008 9:27 pm

    On the lines of what John London suggested above… I think you may try using a web data extractor application such as webscraper plus which basically takes data from web and puts it into a spreadsheet or database. You may then use that data to build the widget.

    As long as you have the government permission to use the data off a .gov site… I think it could be done.

  17. Self Defense Rob on August 11th, 2008 11:09 pm

    I am pretty sure anything on a US .gov site is free distribution, but only state websites need permission. Has to do with the funding sources for compiling the original information.

  18. Canadian hosting on August 12th, 2008 9:07 am

    As per the whole copyright comments, the following link states “may be reproduced provided it is reproduced accurately and not used in a misleading context”, except material that is “identified as being the copyright of a third party”.

    http://www.direct.gov.uk/en/SiteInformation/DG_020460

    Providing links directing users to the orange pages at the same time can never hurt :)

  19. Fern on August 14th, 2008 3:42 pm

    Many states will allow you to use their copyrighted data if you give them credit.

  20. Delicious on August 14th, 2008 5:12 pm

    You should try and contact the webmaster … he’ll help you for sure if you give them credit.

  21. seoweb2 on August 15th, 2008 3:43 am

    The moral of the story, asking the website’s webmaster is always a great place to start.
    Thanks and Best wishes with your project

  22. powerpoint on January 28th, 2009 11:28 am

    well tim, i think its fully depend on the type of the data, whether its allowed to be publish or its a private .gov data

  23. Andrea on February 4th, 2009 3:29 pm

    Hi, Tim
    I am a researcher of Institute of Education University in London and we are doing some project about young bloggers.
    Would you be please so kind and write me:

    what audience would you like to address by your blog : young people, your friends, collegues…
    Thank you very much for your cooperation
    have a nice day

    Andrea Miklovicova
    andreafmaatgmaildotcom

  24. Orlando on February 13th, 2009 5:24 pm

    Has anyone had any luck making heads ort ails of this?

  25. Tim Davies on February 14th, 2009 11:48 am

    For developments with getting data from Direct Gov check out the new Direct Gov Innovation site: http://innovate.direct.gov.uk/

  26. terjemahan on August 25th, 2009 4:43 pm

    wow, thanks for sharing..

  27. John Darlington on September 7th, 2009 10:58 am

    Hi,

    I thought you might be interested in OpenPSI, a collaboration between the University of Southampton and the UK government, lead by the National Archive, to trial a new form of community provisioned information service.

    we are trying to spark interaction between government information providers, academic researchers and information intermediaries, specifically to bridge the gap between those researchers who may not have all
    the technical skills or data knowledge to answer important research questions.

    We are exposing UK government data in the Semantic Web standards, RDF. We have SPARQL end point so data mashers can issue via query requests. This is a form of open API that allows mashups to be created quite quickly.

    There is a current request for information dealing with all forms of data about youth activities which we may try and satisfy.

  28. Tim on September 10th, 2009 5:12 pm

    Hey John

    Thanks for pointing out OpenPSI. Looks very interesting in deed – I shall investigate more soon.

    Re: the request around Youth Activities – where can I find more on that? Is that activities as in ‘Positive Activities’ – i.e. the sort of things the http://www.plings.net project is looking at?

  29. John Darlington on September 11th, 2009 10:42 am

    Tim,

    Yes it is about the sort of things the http://www.plings.net project is listing but from a different direction. It is not about the immediate, what can a person do today but about understanding the long term effect that access has. I think what the social science community is trying to understand is impact. They suffer from small survey issue. If we can use technology to build a systematic picture of opportunities for youth and maintain it over time then we can understand how it’s provision helps social need.

    So a good research question might be, in a recession if we cut spending on services what is the social impact. We could use the UK government admin data as a backbone for understanding this.

  30. John Darlington on September 11th, 2009 10:43 am

    This is just one of the types of question we could investigate with a linked data web of UK government admin data as a backbone.

    Many other questions could be investigated.

  31. Tim on September 11th, 2009 6:24 pm

    Hey John

    I’d be really interested to talk more about anything you are doing on youth-activity related questions. It would be interesting to see what datasets might help here – as, so far as I’m aware, Plings and Parent Know How are the only government (owner or sponsored) data sources with any reasonable coverage of data on activities – and then the coverage is still problematic…

Feel free to leave a comment...
and oh, if you want a pic to show with your comment, go get a gravatar!