What Voice Recognition Technology Could Mean for Apple — and All of Us

21 Comments

Credit: Flickr / Lazurite

Voice recognition has long been billed as a kind of holy grail of mobile computing, but the reality is that the technology has been awkward, inaccurate and often unusable, resulting in misdialed phone calls and incomprehensible messages. So it’s no surprise it’s failed to garner much usage in mobile.

Apple (s aapl) may be positioned to change all that, though, with the iOS 5 platform it will outline at next week’s WWDC in San Francisco. The company has reportedly been in discussions to license Nuance’s effective voice technology – dubbed Dragon – and may integrate it with the new version of iOS 5. Apple could make the technology available to developers as a built-in API in iOS 5, handing app creators a valuable new tool. Such a move would not only give voice recognition a much-needed push into the mobile mainstream, it would give Apple the chance to once again transform the way we interact with our phones. Here’s why:

1) Voice recognition technology is finally ready for prime time. Dragon powers Nuance’s FlexT9 (s nuan) for Android (s goog), a dictation app that sells for a mere $5 and enjoys a four-star user rating after more than 1,100 reviews. And there is no shortage of compelling use cases, from accessing a navigation app while driving (when your hands should be on the wheel) to dictating lengthy messages rather than typing on a miniature keyboard.

2) Apple knows how to educate the consumer. Voice recognition has come a long way, but using it still isn’t always intuitive. Google’s technology, for example, requires users to say the words “period” or “comma” if they want to add punctuation to their messages. But Apple’s marketing genius lies in showing consumers how to use technology: The first iPhone commercials were essentially tutorials in how to surf the Web, access email and find nearby businesses on the handset. A similar campaign could illustrate how to do all those things and more by talking, not typing.

3) Apple is a master of the user interface. The touchscreen was nothing new when the iPhone came to market; Apple’s true innovation was in simplifying the technology with an interface that made it easy for users to navigate their phones. The company could do the same with voice by integrating Dragon closely with iOS, making it easy to send messages or navigate the Safari browser by speaking. And the legions of iOS developers will surely find innovative new ways to leverage voice in everything from messaging to gaming to social networking.

For more thoughts on how Apple could leverage voice recognition technology to change the way we use our phones, see my latest Weekly Update at GigaOM Pro (subscription required).

Image courtesy Flickr user Lazurite.

21 Comments

Detlev Artelt

Even knowing that Android has voice recognition a while, the push from Apple will move any kind of voice apps into the heads of the mass market.

BUT if they do it right, they will integrate it INTO the device and not only in a cloud service like the Dragon App and like Android did it. Have a look f.e. at the Jibbigo App, which offers speech to speech translation ON the different devices. This works even if you don’t have a network or 3G connection.

The question is not Android or iOS, it’s much more the HOW.

For all who like to read more, have a look into my voice compass books at http://www.voice-compass.com/english.

Greets
Detlev

FarmerBob

As far back as 1995 (OS9) Mac’s have had Speech Recognition and Action and it worked very well. Although, IBM had better and more human like voices at the time. I don’t use it much these days. But Text-to-Speech I use all the time to help proofread or read to me so I can multitask. And partnering with Nuance was a good move. We have their software on several of our cellphones and for the most part it works well. It saves from having to completely set up phones for people that can’t grasp contact navigation. And their speech-to-text in reciting the number to dial works well also. DragonSoft and a couple other earlier mergers as it was, I have installed for many clients that needed Medical and Legal versions that saved them time and me grief.

Abhiroop Basu

My biggest problem (and the reason I don’t use it) is because I don’t want to talk to my phone in a robotic monotone so that it navigates to a specific website or so that it enters text. Until I can say “which bus will take me to X road” and it automatically opens up a bus schedule or directions on Google Maps, I won’t be using voice search. Right now it just takes too many steps (for example saying “period”) to get something done and it is far easier for me just to use my fingers.

Abhiroop Basu

My biggest problem (and the reason I don’t use it) is because I don’t want to talk to my phone in a robotic monotone so that it navigates to a specific website or so that it enters text. Until I can say “which bus will take me to X road” and it automatically opens up a bus schedule or directions on Google Maps, I won’t be using voice search. Right now it just takes too many steps (for example saying “period”) to get something done and it is far easier for me just to use my fingers.

mike

So this article is irritating, android uses voice search often, in all fields of all apps.

weird that this says something i have never used called dragon costs 4 bucks? Press and hold the search key

EntrepreNerd

@Raymond Padilla, who asked, “Do you have any stats that back up your assertion that very few people know about and use Android’s voice recognition features?”

No, he does not. The reason he does hot is because there is no such statistic. The fact of the matter is that as of Aug 12, 2010 25%, or 1 in ever 4 US searches are being done using Google Voice Commands. That is HUGE, MASSIVE, and MAGICAL number of users using the service.

Mat

If Apple do this right then they will make it happen, just as they did with the iPhone and with the iPad, these technologies were both around before in multiple different incarnations, but Apple manages to make it hugely popular, ultimately I feel partly because they focus on the user experience which others seem to miss and partly because people just seem to want to buy from them.

The problem with voice recognition is that it has never worked that well. Once the technology reaches the tipping point then it will become useful. At the moment I use a blackberry only because of the keyboard, if a device comes along which lets me type at a similar speed and have a touch screen then I am buying it.

Dominick

I agree with what everyone is saying. I’ve been a Droid (original, still the best!) owner since November 2009 and have used Android’s built-in Voice Search functionality since it was added (built-in) to Froyo 2.1. It’s nothing short of fantastic. what makes the service even better is that it’s constantly improving “in the cloud” because Google takes the voice input string and translates it OTA.

I agree with Roger, Apple’s simply playing catch-up. Seriously, this site needs to get off of Steve Job’s crotch.

Colin Gibbs

Thanks to all for the comments; y’all are right to note that Android’s voice recognition is solid. (I discuss that in the more expansive piece at GigaOM Pro.) But very few people know about it and use it because it isn’t marketed. EntrepreNerd is right to point out that even Android’s voice recognition technology has a learning curve. Apple (and a lot of iOS developers) could give the entire voice recognition market a big boost by educating consumers what it is and how to use it.

Ben

My gf bought an android phone bc of the marketing she saw. Specifically, I think it was Tmobile. Perhaps you mean wasnt marketed in that the huge arm of verizon or apple or the ilk didnt market it. In that regard, you’re right. As soon as apple even announces voice, there will be some sort of ‘droid does’ commercial that will help the overall public awareness of voice recg.

Raymond Padilla

Do you have any stats that back up your assertion that very few people know about and use Android’s voice recognition features?

Yacko

Do any of these mobile voice recognition solutions work offline? Aren’t they all front ends to cloud cpu?

Vivek Iyer

If this is going to be true then i would prepare my self for

“The Magical,Revolutionary Voice recognition is here on iOS.Ur Voice Would be never like before??!!!!!
and @android(Pokes Fun @)

Their Voice recognition is like giving a glass of ice water to somebody in hell!

Tim

iPhone voice recognition is pretty pathetic compared to Android’s. Another reason why iPhone is eating Android’s dust.

EntrepreNerd

I agree with Ralph on this one. I use Google Voice Actions on my Android device about 50 times a day, for everything from writing email and text messages to searching driving routes, making calls and starting music, and it rarely fails me.

That being said, I think it is partly due to the great voice technology Google has and partly due to me learning how to speak clearly when using the service. And when I say “speak clearly” I do not mean sound robotic. I speak normally, with one difference, I speak to it as though it is one of my sons (ages 1 and 4). That is to say I pronunciate.

On that note, and slightly unrelated, as a parent you know you have done a great job when your children can speak clearly enough to use Voice Action on your Android device.

– Proud Father

Ralph Haygood

I find this post curious. For over a year, I’ve been using a Nexus One, on which any text field can be filled in by voice (if there’s a network connection, which presumably Apple too will require). I find it works quite well. And no, I don’t mind saying “comma” or “period” on the rare occasions I need to.

It’s amusing that this page is currently showing me four ads for the “PayPal Developer Challenge for Android.”

Roger

So this article is saying that Apple will find a way to eliminate the need to say punctuation marks and that will make voice control magical now? I have been using the original Droid and the voice control for a long time. I can tell you voice recognition is already ready for prime time. Apple will merely be catching up and not evolving anything.

Patrick Rafter

Apple’s used voice recognition technology for a long time. Beyond what they’re doing with Dragon, Apple has used Interactive Voice Response (IVR) technology to provide automated voice response for customer service. To their credit, Apple was one of the first companies to deploy IVR platform from Interactions, Inc. (www.interactions.net) whose “HumanTouch” technology combines the best of IVR automation (cost savings) with simultaneous connections with live human agents (when necessary). Having endured countless IVR nightmares, I found the Interactions/Apple “interactions” much more like a conversation with a human being.

Comments are closed.