Auto-transcribing podcasts?

I think it would be really cool if someone could come up with a way to automatically derive written transcriptions from podcasts. This would work similar to speech recognition programs but it needs to be speaker independent to be effective, kind of like closed-captioning used by TV stations. The purpose for producing written transcripts would be two-fold- make podcasts available to those in the community who are deaf and unable to access the information shared in the many podcasts now available and to make the shows searchable once the transcripts are published.

I would love to hear from those in a position who have thoughts about possible solutions that could be used from existing technology. I am willing to work with anyone using any of the podcasts I produce as a test case, with the hope of eventually using this on a wide range of podcasts. Please contact me if you have any ideas along these lines, or post a comment here. Let’s see what we can come up with as a community effort.


Comments have been disabled for this post