Just finishing up at my first hackday, aka mashed08. I am proud to say the hack was actually complete and ready to be presented. Everyone who knows me knows this is an achievement in itself.

Tim Nash and I, following his idea, developed an accessible and indexable media service.

Following a talk by BBC R&D, we were enlightened about ID3 tags embedded within audio and video files, and how the BBC is starting to use version two of these tags (ID3v2) to put information about the programmes directly within the file. These tags hold metadata, can carry rich information about the file, and are directly extractable. However, there are few tools for doing this, and the example we were shown used Flash™ and had, we felt, a number of limitations:

  • You had to read the whole file before you got the data.
  • The current method of extraction did not expose this data in a way that search engines could find it (Tim is an SEO specialist).
  • The data in all the examples we were given was not presented in a format that was portable.
  • The data was not accessible.
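
To illustrate the first point: an ID3v2 tag sits at the very start of the file, and its 10-byte header says how long the tag is, so there is no need to read the audio that follows. The snippet below is a minimal sketch of that idea in Python. It is not the code from our hack, the function names are made up for the example, and it ignores optional extras such as extended headers and footers.

```python
import io


def synchsafe_to_int(data: bytes) -> int:
    """Decode a synchsafe integer: only the low 7 bits of each byte are used."""
    value = 0
    for byte in data:
        value = (value << 7) | (byte & 0x7F)
    return value


def read_id3v2_tag(stream):
    """Read just the ID3v2 tag from the start of a stream.

    Returns (major_version, tag_bytes), or None if no ID3v2 tag is present.
    """
    header = stream.read(10)  # "ID3", 2 version bytes, 1 flags byte, 4 size bytes
    if len(header) < 10 or header[:3] != b"ID3":
        return None
    major_version = header[3]
    tag_size = synchsafe_to_int(header[6:10])
    # Stop after the tag; the audio data that follows is never touched.
    return major_version, stream.read(tag_size)


# Usage with a made-up in-memory "file": a 257-byte tag followed by audio.
fake_file = io.BytesIO(
    b"ID3" + bytes([3, 0, 0]) + bytes([0, 0, 2, 1]) + b"x" * 257 + b"AUDIO"
)
version, tag = read_id3v2_tag(fake_file)
print(version, len(tag))
```

The same approach should work over HTTP by requesting only the first few kilobytes of a remote file, although that part is not shown here.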

So in a mere 24 hours we set out to address these issues, hacking away through the night at Alexandra Palace. We put together a service that extracted the data before the whole file was loaded, exposed that data as HTML on the page and as an ATOM RSS feed, and then cached it so people could easily get data out of files. OK, it is still a bit of a prototype, and there are not many files around with enhanced data like the ones the BBC provided (Chris Moyles podcasts), but I give you We Do ID3. So zip over and have a look. Below I have embedded our slides (unfortunately no embedded data in them, and not very accessible unless you use a hack done this year by Christian Heilmann (EasyslideShare); I have a hosted version of accessible slides from mashed here), and we even got a small mention in the Guardian News blogs.
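
For anyone wondering what "exposing the data as a feed" might look like, here is a rough sketch in Python that turns already-extracted metadata into an Atom document using only the standard library. It is an illustration, not the We Do ID3 code: the function name, the dictionary keys and the sample values are all invented for the example.

```python
import xml.etree.ElementTree as ET

ATOM_NS = "http://www.w3.org/2005/Atom"


def build_atom_feed(feed_title, feed_id, updated, entries):
    """Build an Atom feed string from a list of metadata dictionaries.

    Each entry dict is assumed (for this sketch) to have the keys
    "title", "id", "updated" and "summary".
    """
    ET.register_namespace("", ATOM_NS)  # serialise Atom as the default namespace
    feed = ET.Element(f"{{{ATOM_NS}}}feed")
    ET.SubElement(feed, f"{{{ATOM_NS}}}title").text = feed_title
    ET.SubElement(feed, f"{{{ATOM_NS}}}id").text = feed_id
    ET.SubElement(feed, f"{{{ATOM_NS}}}updated").text = updated
    for item in entries:
        entry = ET.SubElement(feed, f"{{{ATOM_NS}}}entry")
        for key in ("title", "id", "updated", "summary"):
            ET.SubElement(entry, f"{{{ATOM_NS}}}{key}").text = item[key]
    return ET.tostring(feed, encoding="unicode")


# Made-up sample data standing in for values pulled out of an ID3v2 tag.
sample_entries = [
    {
        "title": "Example chapter",
        "id": "urn:example:chapter-1",
        "updated": "2008-06-22T15:52:00Z",
        "summary": "Keywords: example, placeholder",
    }
]
print(build_atom_feed("Example media feed", "urn:example:feed",
                      "2008-06-22T15:52:00Z", sample_entries))
```

Because the output is plain XML (and the same data can be written into the page as HTML), search engines and screen readers can get at it without running Flash.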

The crew at the BBC were great and I had a brilliant weekend, with some very cool and clever things being created. I am definitely going to go back next year. Well done Matthew Cashmore for organising it.

Just look for tagged info (mashed08) on Flickr, mashed08.backnetwork.com and Twitter to hear how good it was. Oh, and yes, I got a free bean bag. That was interesting to carry back on our mammoth 5hr journey from a location that is normally only 2ish hours away; gotta love Sunday trains.

Posted Sunday, June 22nd, 2008 at 3:52 pm
Filed Under Category: Technical

4 Responses to “Accessible media mashed08”

Phil Wilson

I don’t really understand whether you’re just pulling the normal description fields etc. or using the chapter data?

In any case, mind bunging me the code? Cheers.

shaun

Phil

We were extracting information embedded within the media. The sample from the BBC (the Chris Moyles podcast) had keywords, chapter information and images. Unfortunately we did not get around to the image data extraction. This information is stored in what is called the ID3 tag, or ID3v2 tag, in enhanced media. I will get the code together and drop you a link. In the meantime it is worth looking at the BBC sample on the backnetwork site as well. It differs from ours in that you had to load the whole file before the information was available, and the interface was in Flash.
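
As an illustration only (this is not the code from the hack), here is a small Python sketch of how the body of a chapter frame could be unpacked. It assumes the layout given in the ID3v2 chapter frame addendum as I understand it: a null-terminated element ID, then start time, end time, start offset and end offset as 32-bit big-endian numbers, with any sub-frames (such as a title) after that. The function name and the sample bytes are made up.

```python
import struct


def parse_chap_body(body: bytes) -> dict:
    """Unpack the body of a CHAP frame (frame header already stripped)."""
    end_of_id = body.index(b"\x00")  # the element ID is null-terminated
    element_id = body[:end_of_id].decode("latin-1")
    start_ms, end_ms, start_offset, end_offset = struct.unpack_from(
        ">IIII", body, end_of_id + 1
    )
    return {
        "element_id": element_id,
        "start_ms": start_ms,
        "end_ms": end_ms,
        "start_offset": start_offset,
        "end_offset": end_offset,
        # Embedded sub-frames (for example a title) are left unparsed here.
        "sub_frames": body[end_of_id + 1 + 16:],
    }


# Made-up frame body: chapter "ch1" running from 0 to 60 seconds.
sample_body = b"ch1\x00" + struct.pack(">IIII", 0, 60000, 0xFFFFFFFF, 0xFFFFFFFF)
print(parse_chap_body(sample_body))
```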

Its all in the mp3s • Tim Nash UK SEO Blog

[…] it worked not the hack my hat tip to Shaun who coded amazingly well considering at one point we were talking about the intricacies of binary […]

Phil Wilson

Thanks Shaun. To clarify, I was at Mashed too and had a quick go at something like this, focussed on extracting the chapter information, but found practically no libraries which actually supported it. Presumably you wrote one yourselves?
