<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>GigaOM &#187; speech technology</title>
	<atom:link href="http://gigaom.com/tag/speech-technology/feed/" rel="self" type="application/rss+xml" />
	<link>http://gigaom.com</link>
	<description></description>
	<lastBuildDate>Mon, 20 May 2013 19:43:32 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='gigaom.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/0db8f6557d022075dbbf010c54d46d93?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>GigaOM &#187; speech technology</title>
		<link>http://gigaom.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://gigaom.com/osd.xml" title="GigaOM" />
	<atom:link rel='hub' href='http://gigaom.com/?pushpress=hub'/>
		<item>
		<title>Text-to-speech startup iSpeech launches publisher platform; Evernote, Pearson first clients</title>
		<link>http://gigaom.com/2013/02/27/text-to-speech-startup-ispeech-launches-publisher-platform-evernote-pearson-first-clients/</link>
		<comments>http://gigaom.com/2013/02/27/text-to-speech-startup-ispeech-launches-publisher-platform-evernote-pearson-first-clients/#comments</comments>
		<pubDate>Wed, 27 Feb 2013 14:00:25 +0000</pubDate>
		<dc:creator>Laura Hazard Owen</dc:creator>
				<category><![CDATA[Evernote]]></category>
		<category><![CDATA[iSpeech]]></category>
		<category><![CDATA[pearson]]></category>
		<category><![CDATA[speech technology]]></category>
		<category><![CDATA[text-to-speech technology]]></category>
		<category><![CDATA[Yaron Oren]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=614130</guid>
		<description><![CDATA[Text-to-speech startup iSpeech is rolling out tools for publishers that let them convert books and articles to audio. The first two clients are Pearson and Evernote.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=614130&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>iSpeech, a Newark, N.J.-based startup that specializes in lifelike text-to-speech apps and previously <a href="http://gigaom.com/2012/07/19/ispeech-develops-a-siri-for-our-living-rooms/">rolled out voice technology for the connected home</a>, is launching a <a href="http://www.ispeech.org/publishers">platform for publishers</a>, the company plans to announce Tuesday. The tools are designed to help publishers quickly and inexpensively convert books and articles into audio. iSpeech&#8217;s first two publishing clients are Evernote and Pearson.</p>
<p>iSpeech gives publishers three options for creating content. They can convert PDFs to audio files; they can add a widget to a website that essentially adds a &#8220;play&#8221; button to an article; or they can use more sophisticated developer tools built on iSpeech&#8217;s API and add them directly to their web pages. Pearson is using the PDF option for its textbooks. Evernote is using the developer tools to integrate speech technology into its web reading platform <a href="http://evernote.com/clearly/">Evernote Clearly</a>. &#8220;The natural evolution of this is to potentially bring this functionality into all of Evernote&#8217;s products,&#8221; iSpeech COO Yaron Oren told me. &#8220;One of the things we hear directly from Evernote customers is they want to be able to listen to their Evernote notes in the car, so it would be great to have this kind of of functionality.&#8221;</p>
<p>The publishing platform&#8217;s business model is basically pay-per-use, Oren said, and the cost usually ends up totaling &#8220;less than a tenth of of the cost of professional narration.&#8221; For websites, iSpeech charges by the word, which varies depending on volume but ranges from $0.01 to a fraction of a cent per word. For books, the company charges by the page; there are volume discounts, but Oren said that in general, the maximum cost to convert a 250-page book to audio with iSpeech would cost around $1,000. &#8220;We&#8217;ve heard from publishers that a book with voice talent tends to cost in the order of $15,000 per book,&#8221; he said.</p>
<p>Amazon ran into legal trouble when, in 2009, it automatically added text-to-speech technology to ebooks. The company insisted that text-to-speech features don&#8217;t violate copyright, <a href="http://www.businesswire.com/news/home/20090227005816/en/Statement-Amazon.com-Kindle-2%E2%80%99s-Experimental-Text-to-Speech-Feature">but said at the time</a>, &#8220;We strongly believe many rights-holders will be more comfortable with the text-to-speech feature if they are in the driver’s seat,&#8221; and decided to let rights-holders &#8220;decide on a title by title basis whether they want text-to-speech enabled or disabled for any particular title.&#8221; Oren says iSpeech will avoid those issues by leaving the decision to publishers &#8212; though it seems as if Evernote Clearly could potentially run into trouble, since it doesn&#8217;t hold the copyright to the articles that users save to its platform. (Evernote says it&#8217;s &#8220;comfortable&#8221; with the feature and is only running it on the article pages, not on articles saved into Evernote.)</p>
<p>For now, iSpeech&#8217;s publisher tools are primarily going to be of interest to nonfiction publishers &#8212; not publishers of, say, novels. &#8220;It&#8217;s a viable alternative to nonfiction, textbooks, or more straightforward news content,&#8221; Oren said. &#8220;For fiction, or other content where there&#8217;s more emotion and differences in reading style, this is not an alternative [yet].&#8221; But, he said, &#8220;this is about making more content available [as audio]. Professional voice talent is very expensive, and as a result, most books never get made into an audio format. Now there&#8217;s an option.&#8221;</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=614130&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=350956"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=350956" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=614130+text-to-speech-startup-ispeech-launches-publisher-platform-evernote-pearson-first-clients&utm_content=laurahowen38">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2009/08/evolution-of-the-e-book-market/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=614130+text-to-speech-startup-ispeech-launches-publisher-platform-evernote-pearson-first-clients&utm_content=laurahowen38">Evolution of the E-book Market</a></li><li><a href="http://pro.gigaom.com/2012/07/the-wearable-computing-market-a-global-analysis/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=614130+text-to-speech-startup-ispeech-launches-publisher-platform-evernote-pearson-first-clients&utm_content=laurahowen38">Analyzing the wearable computing market</a></li><li><a href="http://pro.gigaom.com/2011/09/what-amazons-new-kindle-line-means-for-apple-netflix-and-online-media/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=614130+text-to-speech-startup-ispeech-launches-publisher-platform-evernote-pearson-first-clients&utm_content=laurahowen38">What Amazon&#8217;s new Kindle line means for Apple, Netflix and online media</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/02/27/text-to-speech-startup-ispeech-launches-publisher-platform-evernote-pearson-first-clients/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/02/ispeech-for-publishers_210x140-gigaom.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/02/ispeech-for-publishers_210x140-gigaom.jpg?w=150" medium="image">
			<media:title type="html">iSpeech for Publishers</media:title>
		</media:content>

		<media:content url="http://2.gravatar.com/avatar/83965de6c2033ee5ab075123394cec0a?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">laurahowen38</media:title>
		</media:content>
	</item>
		<item>
		<title>Speech recognition anyone? AT&amp;T opens up Watson API</title>
		<link>http://gigaom.com/2012/07/09/speech-recognition-anyone-att-opens-up-watson-api/</link>
		<comments>http://gigaom.com/2012/07/09/speech-recognition-anyone-att-opens-up-watson-api/#comments</comments>
		<pubDate>Mon, 09 Jul 2012 22:50:48 +0000</pubDate>
		<dc:creator>Kevin Fitchard</dc:creator>
				<category><![CDATA[Glenn Lurie]]></category>
		<category><![CDATA[John Donovan]]></category>
		<category><![CDATA[language libraries]]></category>
		<category><![CDATA[natural language understanding]]></category>
		<category><![CDATA[speech technology]]></category>
		<category><![CDATA[voice dictation]]></category>
		<category><![CDATA[voice recognition]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=540836</guid>
		<description><![CDATA[Look out Nuance. there’s a new speech recognition player in town, AT&#038;T. Ma Bell has taken the locks off of its Watson speech application programming interfaces, allowing any developer to use them to add voice commands and natural language understanding to their apps.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=540836&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom.com/mobile/what-voice-recognition-technology-could-mean-for-apple-and-all-of-us/voice-recognition/" rel="attachment wp-att-353308"><img  title="voice recognition" src="http://gigaom2.files.wordpress.com/2011/06/voice-recognition-e1306930304143.jpg?w=300&#038;h=199" alt="" width="300" height="199" class="alignleft size-medium wp-image-353308" /></a>Look out Nuance Communications, there’s a new speech recognition player in town. Nuance can’t simply buy up this potential competitor as it <a href="http://gigaom.com/2011/12/20/nuance-buys-vlingo-builds-a-voice-technology-giant/">has in the past</a> since this company happens to be AT&amp;T. Ma Bell has taken the locks off of its Watson speech application programming interfaces (APIs), allowing any developer to access them.</p>
<p>AT&amp;T <a href="http://gigaom.com/2012/04/19/at-att-labs-universal-translators-and-wearable-keys/">promised in April to open Watson up</a> to general development, but on Monday AT&amp;T SVP for technology and network operations John Donovan <a href="http://www.attinnovationspace.com/innovation/story/a7782925">made the launch official in a blog post</a>, revealing seven different “contexts,” or language libraries, that app builders can access to speechify their products.</p>
<p>Like Nuance, AT&amp;T has developed specific taxonomies for specific use cases, allowing its servers to anticipate the words and phrasing consumers are likely to speak in a particular context. Those tailored lexicons allow Watson (not to be confused with IBM’s <a href="http://gigaom.com/2011/11/16/misconceptions-in-ai-or-why-watson-cant-talk-to-siri/">artificial intelligence project Watson</a>) to better capture meaning as well as intent from any given phrase. For instance, the question “where is Trenton?” would be interpreted completely differently if spoken to a Q&amp;A app versus a mapping app.</p>
<p>The contexts are Web Search, Business Search, Voicemail-to-Text transcription, SMS speech transcription, Question and Answer, TV and a category called Generic used for general dictation purposes. The TV context is of particular note because it ties into AT&amp;T’s U-Verse program guide, recognizing queries about particular actors, movies and TV shows.</p>
<p>In general, though, it appears AT&amp;T is only licensing out fairly basic speech recognition and natural language understanding to general developers – features that could be used to build voice search into an app, but nothing that could be used to develop an expansive virtual assistant like Siri.</p>
<p>Watson, however, is likely much more powerful than these contexts let on. Vlingo licenses Watson’s core speech technology for its language servers (though that relationship will likely end after the Nuance acquisition closes), and Research in Motion&#8217;s QNX is working with Watson’s API to develop a conceptual connected car assistant. AT&amp;T also has big plans to leverage Watson’s capabilities for its own products. In a recent interview, AT&amp;T emerging devices president Glenn Lurie hinted that <a href="http://gigaom.com/cleantech/interview-atts-lurie-on-building-the-ios-of-the-connected-home/">Watson could soon make it into its Digital Life connected home technology</a>, becoming the voice-controlled virtual butler we see in science fiction.</p>
<p>Watson does have one big advantage to the average dev: it’s cheap – at least until the end of the year. For the remainder of 2012, AT&amp;T will give free and unlimited access to the Watson APIs to any developer paying AT&amp;T’s $99 annual registration dues. In 2013 <a href="http://developer.att.com/developer/forward.jsp?passedItemId=10100457">a point-based fee structure will kick in</a>. Registered developers will get 5000 points a month and any speech transaction of a minute or less will cost one point. Developers that exceed that amount will be charged $20 for each 2000-point increment, which works out to about 1 cent per transaction.</p>
<p><em>Image courtesy Flickr user <a href="http://www.flickr.com/photos/lazurite/4302673769/">Lazurite</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=540836&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=78635"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=78635" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=540836+speech-recognition-anyone-att-opens-up-watson-api&utm_content=kfitchard">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2011/09/the-future-of-mobile-a-segment-analysis-by-gigaom-pro/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=540836+speech-recognition-anyone-att-opens-up-watson-api&utm_content=kfitchard">The future of mobile: a segment analysis by GigaOM Pro</a></li><li><a href="http://pro.gigaom.com/2012/07/the-wearable-computing-market-a-global-analysis/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=540836+speech-recognition-anyone-att-opens-up-watson-api&utm_content=kfitchard">Analyzing the wearable computing market</a></li><li><a href="http://pro.gigaom.com/2012/07/mobile-second-quarter-2012-analysis-and-outlook/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=540836+speech-recognition-anyone-att-opens-up-watson-api&utm_content=kfitchard">Takeaways from mobile&#8217;s second quarter</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/07/09/speech-recognition-anyone-att-opens-up-watson-api/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2011/06/voice-recognition-e1306930304143.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2011/06/voice-recognition-e1306930304143.jpg?w=150" medium="image">
			<media:title type="html">voice recognition</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/0544c4b228f8fa80e31bb952501cd7a4?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">kfitchard</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2011/06/voice-recognition-e1306930304143.jpg?w=300" medium="image">
			<media:title type="html">voice recognition</media:title>
		</media:content>
	</item>
		<item>
		<title>Swype’s new “living keyboard” doesn’t just predict: It learns</title>
		<link>http://gigaom.com/2012/06/20/nuance-swype-living-keyboard-predicts-learns/</link>
		<comments>http://gigaom.com/2012/06/20/nuance-swype-living-keyboard-predicts-learns/#comments</comments>
		<pubDate>Wed, 20 Jun 2012 12:00:05 +0000</pubDate>
		<dc:creator>Kevin Fitchard</dc:creator>
				<category><![CDATA[contextual understanding]]></category>
		<category><![CDATA[language context]]></category>
		<category><![CDATA[multi-modal]]></category>
		<category><![CDATA[natural language]]></category>
		<category><![CDATA[predictive text]]></category>
		<category><![CDATA[semantic search]]></category>
		<category><![CDATA[semantics]]></category>
		<category><![CDATA[speech recognition]]></category>
		<category><![CDATA[speech technology]]></category>
		<category><![CDATA[voice recognition]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=534315</guid>
		<description><![CDATA[Swype just got a whole lot smarter. Nuance is updating Swype with the same sophisticated contextual-anguage technologies it uses in its speech-recognition products. The result is what Nuance is calling a “living keyboard” -- one that can learn both its user’s vocabulary and his habits.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=534315&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom.com/mobile/nuance-swype-living-keyboard-predicts-learns/screen-shot-2012-06-20-at-12-24-57-am/" rel="attachment wp-att-534318"><img  title="New Swype screenshot feature" src="http://gigaom2.files.wordpress.com/2012/06/screen-shot-2012-06-20-at-12-24-57-am-e1340170400939.png?w=300&#038;h=200" alt="" width="300" height="200" class="alignleft size-medium wp-image-534318" /></a><strong>Updated.</strong> Swype, the text-input interface now common on Android phones, just got a whole lot smarter. <a href="http://gigaom.com/2011/10/07/why-nuance-a-speech-company-bought-swype/">New owner Nuance Communications</a> is updating Swype with the same sophisticated contextual-language technologies it uses in its speech-recognition products. The result is what Nuance is calling a “living keyboard” &#8212; one that can learn both its user’s vocabulary and his habits.</p>
<p>For starters, Nuance is fully implementing into Swype the XT9 predictive text technologies <a href="http://paidcontent.org/tech/419-time-warner-aol-sells-mobile-software-firm-tegic-to-nuance-for-265-mill/">originally developed by Tegic</a>. Nuance has built on Tegic’s algorithms, though, expanding its prognostication abilities beyond just words to the formation of sentences and phrases.</p>
<p>For instance, the words “pit,” “pot” and “put” follow the same trace path on the Swype keyboard, forcing you to select from the right word from a menu. But by analyzing the words that come before or after the entry, Swype now can place the right word by context. It knows to use the word “pit” after avocado and “put” before the phrase “that away.”</p>
<p>The platform’s intelligence goes beyond just common language structure. Nuance says Swype will adapt and customize itself to its user. It learns to predict phrases and word groupings frequently typed by users. For instance, if I enter the words “Grandpa Fitchard” enough times into text messages, Swype will begin automatically filling in the last name every time I type “Grandpa.”</p>
<p><iframe src="http://www.youtube.com/embed/4H-Ci8jEYXc" frameborder="0" width="560" height="315"></iframe></p>
<p>The new version will also learn from all of your missives, not just the ones composed through Swype. If you give it permission, the software will cull through your emails, SMS messages, Twitter feed and Facebook posts, searching for new words to add to your phone’s personal dictionary.</p>
<p>Nuance has already integrated its <a href="http://gigaom.com/mobile/dragon-dictate-hits-the-iphone-for-free/">Dragon Dictation speech technology</a> &#8211; and in some cases <a href="http://gigaom.com/mobile/swype-hits-android-4-0-with-dragon-go-integration/">Dragon Go semantic search</a> &#8212; into Swype, but with this latest release of its software it’s fully bridging the gap between spoken and tactile input methods. Dragon’s speech capabilities use the phone’s data connection to connect to Nuance’s language servers, while text recognition remains local to the device. But Nuance is now linking the two, allowing you to generate a new vocabulary of spoken words synced to your personal dictionary.</p>
<p><a href="http://gigaom.com/mobile/nuance-swype-living-keyboard-predicts-learns/screen-shot-2012-06-20-at-12-24-20-am/" rel="attachment wp-att-534319"><img  title="New Swype screenshot " src="http://gigaom2.files.wordpress.com/2012/06/screen-shot-2012-06-20-at-12-24-20-am.png?w=708" alt=""   class="alignright size-full wp-image-534319" /></a>For instance, if you spoke an uncommon last name, say “Fitchard,” into the phone, Nuance’s servers simply wouldn’t recognize the word. The Swype interface would then prompt you to type the letters of my last name into the keyboard. It then forevermore associates that spoken word with the written text, creating a unique lexicon of recognizable spoken words for each user.</p>
<p>In addition, Swype now has support for 55 languages &#8212; both written and spoken &#8212; and offers a third input method (fourth if you count simple key tapping): writing out words by finger, which it then turns into text.</p>
<p>Nuance is selling the new Swype release to Android handset and tablet makers as well as making the technology available to other OSes via a software developer’s kit. If you don’t want to wait for an update from your manufacturer, you can try <a href="http://beta.swype.com/">downloading a beta version for Android from the Nuance site</a>.</p>
<p>A warning though: The beta installer doesn’t recognize all phones, and it won’t load on devices in which a manufacturer has preinstalled Swype in the factory. That involves a lot of Samsung and HTC phones. As with many things Android-related these days, you may be at the whim of your carrier or device maker when it comes to getting the latest version of Swype’s software.</p>
<p><strong>Update: </strong>Nuance has told me that the company updated its device policies Wednesday when it released the beta to the general public this morning, making it now possible to download the software on devices that already have Swype factory pre-installed. I tried to download the software on my HTC MyTouch again this afternoon, and, true to Nuance&#8217;s word, it worked.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=534315&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=405639"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=405639" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=534315+nuance-swype-living-keyboard-predicts-learns&utm_content=kfitchard">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/07/the-wearable-computing-market-a-global-analysis/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=534315+nuance-swype-living-keyboard-predicts-learns&utm_content=kfitchard">Analyzing the wearable computing market</a></li><li><a href="http://pro.gigaom.com/2012/05/the-living-room-reinvented-trends-technologies-and-companies-to-watch/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=534315+nuance-swype-living-keyboard-predicts-learns&utm_content=kfitchard">Who and what to watch in the new era of the living room</a></li><li><a href="http://pro.gigaom.com/2011/09/the-future-of-mobile-a-segment-analysis-by-gigaom-pro/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=534315+nuance-swype-living-keyboard-predicts-learns&utm_content=kfitchard">The future of mobile: a segment analysis by GigaOM Pro</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/06/20/nuance-swype-living-keyboard-predicts-learns/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/06/screen-shot-2012-06-20-at-12-24-57-am-e1340170400939.png?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/06/screen-shot-2012-06-20-at-12-24-57-am-e1340170400939.png?w=150" medium="image">
			<media:title type="html">New Swype screenshot feature</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/0544c4b228f8fa80e31bb952501cd7a4?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">kfitchard</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/06/screen-shot-2012-06-20-at-12-24-57-am-e1340170400939.png?w=300" medium="image">
			<media:title type="html">New Swype screenshot feature</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/06/screen-shot-2012-06-20-at-12-24-20-am.png" medium="image">
			<media:title type="html">New Swype screenshot </media:title>
		</media:content>
	</item>
		<item>
		<title>At AT&amp;T Labs, universal translators and wearable keys</title>
		<link>http://gigaom.com/2012/04/19/at-att-labs-universal-translators-and-wearable-keys/</link>
		<comments>http://gigaom.com/2012/04/19/at-att-labs-universal-translators-and-wearable-keys/#comments</comments>
		<pubDate>Thu, 19 Apr 2012 16:14:14 +0000</pubDate>
		<dc:creator>Ryan Kim</dc:creator>
				<category><![CDATA[AT&T]]></category>
		<category><![CDATA[AT&T Labs]]></category>
		<category><![CDATA[speech technology]]></category>
		<category><![CDATA[voice recognition]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=512630</guid>
		<description><![CDATA[Perhaps not surprising for a telephone operator, AT&#038;T has some cool stuff in the works around speech. The company showed off some of its newest stuff out its AT&#038;T Labs Thursday and some of the biggest news was around the work being done in voice-related technologies.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=512630&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom2.files.wordpress.com/2012/04/img_2552.jpg"><img  title="IMG_2552" src="http://gigaom2.files.wordpress.com/2012/04/img_2552-e1334849597547.jpg?w=300&#038;h=200" alt="" width="300" height="200" class="alignleft size-medium wp-image-512649" /></a>AT&amp;T may spend a lot of time talking about mobile data but it still  has some cool stuff in the works around voice and speech. The company showed off some of its newest stuff at an AT&amp;T Labs event in New York Thursday, and some of the most interesting work is being done on voice-related technologies.</p>
<p>AT&amp;T announced that in June it would be <a href="http://www.attinnovationspace.com/innovation/story/a7782318">opening up its Watson speech recognition APIs,</a> which will enable developers to build their own speech-enabled apps and services. AT&amp;T has licensed that technology in the past to companies such as Vlingo but this represents an opportunity for all kinds of developers to speechify their apps. It also brings a challenge to Nuance, which has <a href="http://gigaom.com/2011/01/19/nuance-releases-mobile-sdk-to-speechify-apps/">been lining up developers</a>.</p>
<p>The APIs will focus initially on web search, local business search, Q&amp;A, voice mail to text, SMS, U-Verse&#8217;s electronic programming guide, and a dictation API for THE general use of speech recognition. Other APIs for gaming and social networking will be available in the future.</p>
<p>At the event, AT&amp;T demonstrated how its translation technology allows for simultaneous translated conversations between two people speaking Spanish and English on different devices. Users can speak into a VoIP app using one language and the AT&amp;T technology is able to simultaneously transcribe and then translate the content into another language. AT&amp;T already has a Translator mobile app that works with six languages but that requires both users to speak into one device. This holds the promise of people carrying on translated conversations anywhere in the world, similar to how TDD works for the speech or hearing impaired. The translation API is not yet available but will be released at a later date for developers.</p>
<p><img  title="IMG_2554" src="http://gigaom2.files.wordpress.com/2012/04/img_2554.jpg?w=300&#038;h=225" alt="" width="300" height="225" class="alignright size-medium wp-image-512653" /></p>
<p>One of the other cool demonstrations at the AT&amp;T Labs event involved bio-acoustic data transfer. The technology uses sensors in a phone to recognize and transmit signals through a body using bone conduction. When touching a door handle, a user could put their finger on a transducer on their phone and push out a digital key that gets transmitted through their body and is received by a sensor in the door, which unlocks when it recognizes the key. The door would only unlock when it receives the unique signal that&#8217;s created by that person&#8217;s skeletal structure transmitting the original key. That could be an interesting alternative to other access methods that rely on biometrics or NFC.</p>
<p>But another intriguing scenario is being able to transfer data back and forth between two users with a handshake. If both people have their hand on their phone&#8217;s sensor and an app running, they can transfer a small amount of data back and forth through their touch. The two devices would recognize each other when the two people come in contact and would negotiate the transfer. Only a small amount of data &#8212; enough for a business card or small image &#8212; could be transferred in the time it takes to shake a hand. But it could an alternative to a Bump or NFC data transfer. Some companies are working on transferring data through galvanic skin response but AT&amp;T said that requires daily calibration and can be more inconsistent.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=512630&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=165061"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=165061" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=512630+at-att-labs-universal-translators-and-wearable-keys&utm_content=oryankim">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/04/supporting-startup-growth-with-the-new-recruiting-ecosystem/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=512630+at-att-labs-universal-translators-and-wearable-keys&utm_content=oryankim">Startup growth and the new recruiting ecosystem</a></li><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=512630+at-att-labs-universal-translators-and-wearable-keys&utm_content=oryankim">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2012/02/ces-2012-a-recap-and-analysis/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=512630+at-att-labs-universal-translators-and-wearable-keys&utm_content=oryankim">CES 2012: a recap and analysis</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/04/19/at-att-labs-universal-translators-and-wearable-keys/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/04/img_2552-e1334849597547.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/04/img_2552-e1334849597547.jpg?w=150" medium="image">
			<media:title type="html">IMG_2552</media:title>
		</media:content>

		<media:content url="http://2.gravatar.com/avatar/81c4fca1b2d82a7fb9c8657de52386d1?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">oryankim</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/04/img_2552-e1334849597547.jpg?w=300" medium="image">
			<media:title type="html">IMG_2552</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/04/img_2554.jpg?w=300" medium="image">
			<media:title type="html">IMG_2554</media:title>
		</media:content>
	</item>
		<item>
		<title>Microsoft banks on speech to help WP7 catch up</title>
		<link>http://gigaom.com/2011/08/10/microsoft-banks-on-speech-to-help-wp7-catch-up/</link>
		<comments>http://gigaom.com/2011/08/10/microsoft-banks-on-speech-to-help-wp7-catch-up/#comments</comments>
		<pubDate>Wed, 10 Aug 2011 22:00:22 +0000</pubDate>
		<dc:creator>Ryan Kim</dc:creator>
				<category><![CDATA[iPhone]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[speech recognition]]></category>
		<category><![CDATA[speech technology]]></category>
		<category><![CDATA[Windows Phone 7]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=391035</guid>
		<description><![CDATA[Microsoft is looking at speech technology in the Mango update of WP7 to help the platform get back into the smartphone game. But with Apple also positioned to announce Nuance technology in iOS 5, it shows that speech is becoming table stakes in the smartphone wars. <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=391035&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom2.files.wordpress.com/2011/08/img_1168.jpg"><img  title="IMG_1168" src="http://gigaom2.files.wordpress.com/2011/08/img_1168-e1313007226279.jpg?w=300&#038;h=200" alt="" width="300" height="200" class="alignleft size-medium wp-image-391138" /></a>Microsoft&#8217;s Windows Phone 7 platform, if you haven&#8217;t noticed, hasn&#8217;t really been a player in the smartphone races yet, registering an<a href="http://wmpoweruser.com/canalys-only-1-5-million-windows-phones-shipped-in-q2-2011/"> estimated 1.5 million in unit shipped</a> in the second quarter. But Microsoft&#8217;s leaders have high hopes for the <a href="http://gigaom.com/2011/05/24/microsoft-works-to-catch-up-with-mango-windows-phone-7-update/">upcoming Mango software update</a> and in particular, are looking to the improved speech technology in the update to help WP7 claw back into the game.</p>
<p>I sat down with Ilya Bukshteyn, senior director of speech marketing, sales and solutions for Microsoft and he said that while Apple helped define the modern user interface of smartphones with the iPhone, expectations are shifting to an interface that moves beyond touch to deeply integrated speech. And that, he said, favors Microsoft, which had integrated speech into WP7 at its launch and is upping the ante with Mango improvements including voice-to-text messaging, which allows people to hear text messages read aloud to them and respond back with a dictated text message, all without pressing a button. Microsoft will also enable map navigations, application launches or turning on the speakerphone with voice commands and is adding support for more languages.</p>
<p>&#8220;If you look at the success of the iPhone, it was about the usability it brought to the screen and touch experience and that resonated,&#8221; said Bukshteyn. &#8220;We&#8217;re now hitting another inflection point where speech is going to matter just as much. Moving forward, having some sort of speech natural user interface does become something users expect. And how well you do it will determine what sells and what doesn&#8217;t.&#8221;</p>
<p>Now Apple appears poised to announce new<a href="http://gigaom.com/apple/text-to-speech-could-be-apples-next-mobile-trump-card/"> speech technology integration in iOS 5.</a> <a href="http://9to5mac.com/2011/08/06/ios-5s-nuance-powered-speech-to-text-feature-revealed-screenshot/">9to5Mac</a> reported recently that iOS 5 contains a text-to-speech system powered by Nuance, which will allow users to switch from keyboard to speech input in any text field. That, along with other learnings Apple picked up from its purchase of personal assistant app Siri, could help Apple catch up quickly on the speech front compared to Windows Phone 7 and Android, which also has some impressive speech capabilities embedded.</p>
<p>But Bukshteyn said it&#8217;s not just about building speech into iOS, it&#8217;s about the intelligence behind it. He said Microsoft&#8217;s cloud-based approach to speech is able to learn from all billions of utterances Microsoft gathers from Xbox Kinect, its Bing Mobile apps, Windows Phone 7 and its other productivity software, to constantly improve the performance of the overall speech product, so the system as a whole is continually learning. The volume and diversity of utterances helps train Tellme and creates a product that learns over time. He said Apple will also be behind in perfecting and implementing speech, something that takes years to tune.</p>
<p>&#8220;Apple doesn&#8217;t have the IP, the patents; they can gain that over time but they&#8217;re starting at a different point,&#8221; Bukshteyn said. &#8220;Speech is something we&#8217;ve been investing in for 10 years and the stuff we&#8217;re seeing now was planted a decade ago.&#8221;</p>
<p><a href="http://gigaom2.files.wordpress.com/2011/08/wp7integratedmessaging_web.jpg"><img  title="wp7integratedMessaging_web" src="http://gigaom2.files.wordpress.com/2011/08/wp7integratedmessaging_web.jpg?w=180&#038;h=300" alt="" width="180" height="300" class="alignright size-medium wp-image-391142" /></a>Bukshteyn is quick to point out that smartphone success involves a lot of factors, not just speech. But he believes that as smartphones start to move down into the masses, the need for elegant interfaces with speech will only grow. He said designs that are intuitive and not intimidating will be what wins over new smartphone users.</p>
<p>We still have to see what Apple does with its Nuance integration, which it has to yet to announce. And as my colleague Darrell wrote, it could be<a href="http://gigaom.com/apple/text-to-speech-could-be-apples-next-mobile-trump-card/"> a moment for Apple to leap frog its competitors</a>. We know that Google is only going to continue to push speech into its Android platform.</p>
<p>But Microsoft&#8217;s heavy emphasis on speech shows that the technology will be, if not a decider in the smartphone race, a key component that all modern smartphone platforms will need to compete. As Vlad Sejnoha, CTO of Nuance told me yesterday, <a href="http://gigaom.com/2011/08/09/nuance-cto-speech-tech-will-be-mobile-tech/">speech technology is really becoming mobile technology</a> and will grow more powerful as it gets further embedded into smartphones.</p>
<p>It&#8217;s not surprising that Microsoft would tout its speech technology. It&#8217;s one area in which it&#8217;s trying to leverage its assets to help WP7 catch up. I&#8217;m skeptical that speech will be a major catalyst for WP7 all by itself but it is a differentiator for now and another way to show how polished the platform is becoming.</p>
<p>And I do think that speech will become more of a popular tool over time. Currently, Google and Microsoft say about 25 percent of their mobile searches are conducted by voice. Bukshteyn said it&#8217;s actually higher than that for Microsoft when you look at just Windows Phone 7 devices. With the way people are learning to talk to their devices, including powerful examples like Xbox Kinect, it&#8217;s seems like a good bet for Microsoft to push speech. But if all the platforms start to emphasize the technology, as we&#8217;re seeing more and more, WP7 may enjoy a temporary boost but it will need more than that to really close the gap on iOS and Android.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=391035&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=411204"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=411204" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=391035+microsoft-banks-on-speech-to-help-wp7-catch-up&utm_content=oryankim">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2011/08/what-the-google-motorola-deal-means-for-android-microsoft-and-the-mobile-industry/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=391035+microsoft-banks-on-speech-to-help-wp7-catch-up&utm_content=oryankim">What the Google-Motorola deal means for Android, Microsoft and the mobile industry</a></li><li><a href="http://pro.gigaom.com/2013/01/the-2013-task-management-tools-market/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=391035+microsoft-banks-on-speech-to-help-wp7-catch-up&utm_content=oryankim">The 2013 task management tools market</a></li><li><a href="http://pro.gigaom.com/2012/10/the-state-of-cross-platform-measurement-across-tv-online-and-social/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=391035+microsoft-banks-on-speech-to-help-wp7-catch-up&utm_content=oryankim">The state of cross-platform media measurement</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2011/08/10/microsoft-banks-on-speech-to-help-wp7-catch-up/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2011/08/img_1168-e1313007226279.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2011/08/img_1168-e1313007226279.jpg?w=150" medium="image">
			<media:title type="html">IMG_1168</media:title>
		</media:content>

		<media:content url="http://2.gravatar.com/avatar/81c4fca1b2d82a7fb9c8657de52386d1?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">oryankim</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2011/08/img_1168-e1313007226279.jpg?w=300" medium="image">
			<media:title type="html">IMG_1168</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2011/08/wp7integratedmessaging_web.jpg?w=180" medium="image">
			<media:title type="html">wp7integratedMessaging_web</media:title>
		</media:content>
	</item>
		<item>
		<title>Nuance CTO: Speech tech will be mobile tech</title>
		<link>http://gigaom.com/2011/08/09/nuance-cto-speech-tech-will-be-mobile-tech/</link>
		<comments>http://gigaom.com/2011/08/09/nuance-cto-speech-tech-will-be-mobile-tech/#comments</comments>
		<pubDate>Wed, 10 Aug 2011 00:00:07 +0000</pubDate>
		<dc:creator>Ryan Kim</dc:creator>
				<category><![CDATA[natural language processing]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[smartphones]]></category>
		<category><![CDATA[Speech]]></category>
		<category><![CDATA[speech recognition]]></category>
		<category><![CDATA[speech technology]]></category>
		<category><![CDATA[voice recognition]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=390163</guid>
		<description><![CDATA[Speech technology is poised to be a game-changer for smartphones, especially as they get embedded into operating systems and hardware. Nuance CTO Vlad Sejnoha said speech is transforming from an alternative to text input into a powerful tool that can connect users more quickly to information.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=390163&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom2.files.wordpress.com/2011/08/img_1165.jpg"><img  title="IMG_1165" src="http://gigaom2.files.wordpress.com/2011/08/img_1165.jpg?w=300&#038;h=225" alt="" width="300" height="225" class="alignleft size-medium wp-image-390214" /></a>We&#8217;re still waiting to see the public unveiling of <a href="http://gigaom.com/apple/text-to-speech-could-be-apples-next-mobile-trump-card/">built-in speech technology in iOS 5</a>, something that has been uncovered but has yet to be acknowledged by Apple. Nuance Chief Technology Officer Vlad Sejnoha, whose company is apparently providing the speech technology in iOS 5, isn&#8217;t sharing any information about the implementation.</p>
<p>But Sejnoha sat down with me today at the SpeechTek conference in New York City and painted a broader picture of how speech technology is poised to be a game-changer for smartphones, especially as they get embedded deeper into the operating systems and hardware. He said speech is transforming from an alternative to text input into a much more powerful tool that can understand user intent and connect them more quickly to information, using natural language processing, semantic analysis and cloud computing. In essence, speech is becoming the smart short cut for mobile.</p>
<p>&#8220;Speech can be built to complement and enhance other interfaces helping you find existing applications or information you&#8217;re familiar with but it&#8217;s hard to launch or find. Speech can get you there. It’s really powerful direct access; we’re just entering an amazing era of speech,&#8221; said Sejnoha.</p>
<p>We&#8217;re seeing some of that with Nuance&#8217;s <a href="http://gigaom.com/apple/dragon-go-app-gets-smart-with-voice-search/">latest Dragon Go app</a>, which answers voice activated searches by pulling up a host of websites and apps that complete a user’s query. Instead of search result links, Go allows people to complete actions faster. Vlingo and <a href="http://gigaom.com/2010/02/04/siri-make-artificial-intelligence-your-slave/">Siri, the Nuance-powered app</a> bought by Apple, are also pursuing this <a href="http://gigaom.com/2011/02/11/the-next-evolution-in-mobile-search-will-be-built-on-voice/">concept of a smart voice-activated assistant</a>.</p>
<p>But the real power will be in allowing users to make queries of unstructured data and get back answers immediately. He said <a href="http://www-03.ibm.com/press/us/en/pressrelease/33726.wss">Nuance&#8217;s new partnership with IBM&#8217;s Watson program</a>, which uses IBM’s deep question answering, natural language processing, and machine learning capabilities, will be useful for mobile users, helping them seek answers that are currently hard to come by.</p>
<p><a href="http://gigaom2.files.wordpress.com/2011/08/dragongo-e1310663017264.png"><img  title="dragongo-e1310663017264" src="http://gigaom2.files.wordpress.com/2011/08/dragongo-e1310663017264.png?w=708" alt=""   class="alignright size-full wp-image-390228" /></a>For instance, a user could ask what friends thought of a particular movie or restaurant, and Nuance and IBM could digest their social media information and come up with an answer. Or a vacationer driving around in a new city could ask what time local restaurants or businesses close. While this will type of technology will be useful in enterprise settings, Sejnoha said it will be very valuable to mobile users, who are more intent-driven and want a focused distillation of answers.</p>
<p>He said it makes more sense to move speech technology deeper into smartphones, integrating it from the ground up, which will open even more opportunities for speech to aid in the user experience. Sejnoha said handset manufacturers are very interested in integrating speech as a differentiator, to make their hardware stand out. Ultimately, he believes all smartphones will include some form of this deep speech integration working in concert with the visual interface. We&#8217;re already seeing that with <a href="http://www.theinquirer.net/inquirer/news/2099488/microsoft-windows-phone-mango-voice">Microsoft&#8217;s </a> <a href="http://www.theinquirer.net/inquirer/news/2099488/microsoft-windows-phone-mango-voice">use of speech in Windows Phone 7</a> and <a href="http://googlemobile.blogspot.com/2010/12/voice-search-gets-personal.html">Google&#8217;s </a> <a href="http://googlemobile.blogspot.com/2010/12/voice-search-gets-personal.html">use of voice in Android.</a></p>
<p>&#8220;It&#8217;s a new kind of control, like a natural language overlay over the visual framework,&#8221; Sejnoha said. &#8220;We will have this ability to jump and short-cut through the mini desktop to grab and control things and the best way to do that is getting that capacity up front and having the visual stuff play with that.&#8221;</p>
<p>The combination of speech and mobile isn&#8217;t just going to change the user experience. It&#8217;s already transforming Nuance, which is increasingly finding most of its work deals with mobile.</p>
<p>&#8220;We used to have a mobile unit, but that&#8217;s losing meaning now because all of our business is turning mobile,&#8221; Sejnoha.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=390163&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=784664"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=784664" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=390163+nuance-cto-speech-tech-will-be-mobile-tech&utm_content=oryankim">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2011/10/siri-say-hello-to-the-coming-invisible-interface/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=390163+nuance-cto-speech-tech-will-be-mobile-tech&utm_content=oryankim">Siri: Say hello to the coming &#8220;invisible interface&#8221;</a></li><li><a href="http://pro.gigaom.com/2013/01/ces-2013-flash-analysis-disruptions-and-disappointments-from-consumer-techs-biggest-show/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=390163+nuance-cto-speech-tech-will-be-mobile-tech&utm_content=oryankim">GigaOM Research highs and lows from CES 2013</a></li><li><a href="http://pro.gigaom.com/2013/01/the-2013-task-management-tools-market/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=390163+nuance-cto-speech-tech-will-be-mobile-tech&utm_content=oryankim">The 2013 task management tools market</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2011/08/09/nuance-cto-speech-tech-will-be-mobile-tech/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2011/08/img_1165.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2011/08/img_1165.jpg?w=150" medium="image">
			<media:title type="html">IMG_1165</media:title>
		</media:content>

		<media:content url="http://2.gravatar.com/avatar/81c4fca1b2d82a7fb9c8657de52386d1?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">oryankim</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2011/08/img_1165.jpg?w=300" medium="image">
			<media:title type="html">IMG_1165</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2011/08/dragongo-e1310663017264.png" medium="image">
			<media:title type="html">dragongo-e1310663017264</media:title>
		</media:content>
	</item>
	</channel>
</rss>
