Speech to Text

From BootstrapWiki
Revision as of 12:50, 28 February 2006 by Bmassey (talk | contribs)
Jump to: navigation, search

February 2006

In answer to my question:

Can anyone recommend a speech to text solution? We're looking for an accuracy that is equivalent to taking notes, that is, not perfect.

I received the following advice:

I use the voice recognition built into Windows XP with very good results. Spend the money to get a good USB DSP headset. I bought the Planatronics xx-500 from Amazon for about $60 (it is about $130 retail) and have had very good accuracy. Plan to spend about 30 minutes training it for your voice. Integration into MS word is seamless.

--Joshua


There is a company that is call idictate.com -- they will transcribe for 1cent per word.


Here's an open source speech recognition package: http://cmusphinx.sourceforge.net/sphinx4/

And, since it's open source, you could easily put together something to automatically transcribe your podcasts, if you wanted to do that (and if you know Java).

There are other versions available at http://cmusphinx.sourceforge.net/html/cmusphinx.php.


Two of the best programs for a single user including multiple voices recording on the same PC:

1. Dragon Naturally Speaking

2. IBM Via Voice

They both take a bit of time to learn. Not only do you need to learn the program but it needs to learn to recognize your voice. Each time you correct a mis-interpreted word the system learns for the future.

They can learn to recognize multiple voices as well. They can translate real time or from a digital recording.

These do require a learning curve so if you are not using it a lot of times you may want to hire a dictation service.


I don't know what your budget is - but the best product on the market today is the IBM Via Voice which was acquired by Dragon Systems Google Dragon Naturalluy Speaking. they have an entire array of products ranging from consumer to very elaborate industry specific fro medical, legal and much more and ranging in price from the 100 or so - to the several thousands - depending on industry, licensing, etc...