<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>GigaOM &#187; voice search</title>
	<atom:link href="http://gigaom.com/tag/voice-search/feed/" rel="self" type="application/rss+xml" />
	<link>http://gigaom.com</link>
	<description></description>
	<lastBuildDate>Sat, 25 May 2013 02:30:40 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='gigaom.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/0db8f6557d022075dbbf010c54d46d93?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>GigaOM &#187; voice search</title>
		<link>http://gigaom.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://gigaom.com/osd.xml" title="GigaOM" />
	<atom:link rel='hub' href='http://gigaom.com/?pushpress=hub'/>
		<item>
		<title>Google explains how more data means better speech recognition</title>
		<link>http://gigaom.com/2012/10/31/google-explains-how-more-data-means-better-speech-recognition/</link>
		<comments>http://gigaom.com/2012/10/31/google-explains-how-more-data-means-better-speech-recognition/#comments</comments>
		<pubDate>Wed, 31 Oct 2012 21:03:37 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[data science]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[natural language processing]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[speech recognition]]></category>
		<category><![CDATA[voice search]]></category>
		<category><![CDATA[YouTube]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=579350</guid>
		<description><![CDATA[A new research paper from Google highlights the importance of big data in creating consumer-friendly services such as voice search on smartphones. More data helps train smarter models, which can then better predict what someone say next -- letting you keep your eyes on the road.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=579350&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>A <a href="http://research.google.com/pubs/pub40491.html">new research paper out of Google</a> describes in some detail the data science behind the the company&#8217;s speech recognition applications, such as voice search and adding captions or tags to YouTube videos. And although the math might be beyond most people&#8217;s grasp, the concepts are not. The paper underscores why everyone is so excited about the prospect of &#8220;big data&#8221; and also how important it is to choose the right data set for the right job.</p>
<p>Google has always been a fan of the idea that more data is better, as exemplified by Research Director Peter Norvig&#8217;s stance that, generally speaking, more data trumps better algorithms (see, e.g., his 2009 paper titled &#8220;<a href="http://googleresearch.blogspot.com/2009/03/unreasonable-effectiveness-of-data.html">The Unreasonable Effectiveness of Data</a>&#8220;). Although <a href="http://technocalifornia.blogspot.com/2012/07/more-data-or-better-models.html">some hair-splitting does occur</a> about the relative value (or lack thereof) of algorithms in Norvig&#8217;s assessment, it&#8217;s pretty much an accepted truth at this point and drives much of the discussion around big data. The more data your models have from which to learn, the more accurate they become &#8212; <a href="http://gigaom.com/data/5-ideas-to-help-everyone-make-the-most-of-big-data/">even if they weren&#8217;t cutting-edge stuff to begin with</a>.</p>
<p>No surprise, then, it turns out that more data is also better for training speech-recognition systems. The researchers found that data sets and larger language models (here&#8217;s <a href="http://en.wikipedia.org/wiki/Language_model#N-gram_models">a Wikipedia explanation of the n-gram type</a> involved in Google&#8217;s research) result in fewer errors predicting the next word based on the words that precede it. Discussing the research <a href="http://googleresearch.blogspot.com/2012/10/large-scale-language-modeling-in.html">in a blog post on Wednesday</a>, Google research scientist Ciprian Chelba gives the example that a good model will attribute a higher probability to &#8220;pizza&#8221; as the next word than to &#8220;granola&#8221; if the previous two words were &#8220;New York.&#8221; When it comes to voice search, his team found that &#8220;increasing the model size by two orders of magnitude reduces the [word error rate] by 10% relative.&#8221;</p>
<p>The real key, however &#8212; as any data scientist will tell you &#8212; is knowing what type of data is best to train your models, whatever they are. For the voice search tests, the Google researchers used 230 billion words that came from &#8220;a random sample of anonymized queries from google.com that did not trigger spelling correction.&#8221; However, because people speak and write prose differently than they type searches, the YouTube models were fed data from transcriptions of news broadcasts and large web crawls.</p>
<p>&#8220;As far as language modeling is concerned, the variety of topics and speaking styles makes a language model built from a web crawl a very attractive choice,&#8221; they write.</p>
<p>This research isn&#8217;t necessarily groundbreaking, but helps drive home the reasons that topics such as big data and data science get so much attention these days. As consumers demand ever smarter applications and more frictionless user experiences, every last piece of data and every decision about how to analyze it matters.</p>
<p><em>Feature image courtesy of <a href="http://www.shutterstock.com/gallery-709726p1.html">Shutterstock user watcharakun</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=579350&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=638405"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=638405" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=579350+google-explains-how-more-data-means-better-speech-recognition&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/sector-roadmap-social-customer-service-in-2013/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=579350+google-explains-how-more-data-means-better-speech-recognition&utm_content=dharrisstructure">Sector RoadMap: Social customer service in 2013</a></li><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=579350+google-explains-how-more-data-means-better-speech-recognition&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2011/11/connected-world-the-consumer-technology-revolution/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=579350+google-explains-how-more-data-means-better-speech-recognition&utm_content=dharrisstructure">Connected world: the consumer technology revolution</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/10/31/google-explains-how-more-data-means-better-speech-recognition/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/10/shutterstock_110381039.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/10/shutterstock_110381039.jpg?w=150" medium="image">
			<media:title type="html">Smart tablet</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>
	</item>
		<item>
		<title>T-Mobile’s Genius voice-command button just got a lot smarter</title>
		<link>http://gigaom.com/2012/08/09/t-mobiles-genius-voice-command-button-just-got-a-lot-smarter/</link>
		<comments>http://gigaom.com/2012/08/09/t-mobiles-genius-voice-command-button-just-got-a-lot-smarter/#comments</comments>
		<pubDate>Thu, 09 Aug 2012 22:57:24 +0000</pubDate>
		<dc:creator>Kevin Fitchard</dc:creator>
				<category><![CDATA[natural language understanding]]></category>
		<category><![CDATA[semantic search]]></category>
		<category><![CDATA[speech recognition]]></category>
		<category><![CDATA[voice command]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=551613</guid>
		<description><![CDATA[T-Mobile is giving its rather pathetic MyTouch voice-command feature a much-needed overhaul. It's incorporating the same semantic-search technology Nuance uses in Dragon Go into Genius, allowing the voice assistant to search over 200 different content providers and understand intent rather than just words.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=551613&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Anyone who has ever <a href="http://gigaom.com/2010/06/03/mytouch-3g-slide-proves-software-can-differentiate-android-hardware/">owned a T-Mobile MyTouch</a> is familiar with the little button on the lower right-hand corner labeled with stylized “G.” It’s T-Mobile’s Genius button, which, once pushed, allows you to issue basic voice commands from calling or texting a contact to searching the web or Google Maps. If you’ve used it before then you know: Siri it’s not.</p>
<p>The service’s vocabulary and contextual understanding is pretty limited. For instance, if you ask Genius to “find a restaurant” it will pull up the nearest eatery on Google Maps. But if you ask it to find “nearby restaurants” it searches Maps for a joint named “Nearby.” If you’re like me, you’ve probably fiddled with the button a few times and never touched it again, despite its relative convenience on the phone’s faceplate.</p>
<p>But T-Mobile has given Genius a much-needed overhaul, at least on the latest versions of the <a href="http://gigaom.com/mobile/making-t-mos-mytouch-is-just-step-1-of-huaweis-master-plan/">MyTouch manufactured by Huawei</a>. Nuance Communications, which powers the voice-recognition features on T-Mobile devices, is upgrading Genius’ capabilities and the features of the Genius service, providing a deeper level of natural-language understanding and integrating the service with a much broader array of content sources beyond Maps and Google Search.</p>
<p>For instance, if I were to ask the new Genius for nearby restaurants, it would not only understand my intent but also do its searching on Yelp, pulling up dining options nearby and displaying their rankings and reviews. If I were to change that command to “make a reservation at nearby restaurants,” it would bring me to OpenTable’s website and display eateries in the vicinity that accept online bookings.</p>
<p><a href="http://gigaom.com/apple/dragon-go-app-gets-smart-with-voice-search/dragongo/" rel="attachment wp-att-376328"><img title="dragongo" src="http://gigaom2.files.wordpress.com/2011/07/dragongo-e1310663017264.png?w=300&#038;h=218" alt="" width="300" height="218" class="alignleft size-medium wp-image-376328"></a>If this sounds familiar then you’ve probably used <a href="http://gigaom.com/mobile/a-siri-for-other-phones-nuance-brings-dragon-go-to-android/">Nuance’s consumer semantic-search app, Dragon Go</a>, available for Android and iOS smartphones. In fact, if you look at the long list of 200 content partners the Genius can access, they’re the same used by Dragon Go. Nuance wouldn’t acknowledge specifically that T-Mobile is white-labeling the semantic-search app, but it’s pretty obvious that’s exactly what it’s doing — and it’s by no means a bad thing. I’m a big fan of Nuance’s intuitive little search app, and being able to access it in fewer steps is a bonus.</p>
<p>The official line, though, is that T-Mobile has basically upgraded its relationship with Nuance to a kind of platinum status. The old Genius tapped into Nuance’s basic speech-recognition APIs, but it had none of the rules-based language-parsing abilities of Nuance’s more-sophisticated offerings. By adding greater contextual understanding and a host of content providers, T-Mobile may be able to turn a pretty lame voice-command feature into something quite useful. So far, though, it’s only available on the MyTouch and MyTouch Q.</p>
<p>Both T-Mobile and Nuance will be represented at <a href="http://event.gigaom.com/mobilize/?utm_source=mobile&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=551613+t-mobiles-genius-voice-command-button-just-got-a-lot-smarter&amp;utm_content=kfitchard">GigaOM’s Mobilize conference</a> next month. Nuance CTO Vlad Sejnoha will be speaking on a panel about the future smartphone interface on Sept. 20, while T-Mobile SVP of marketing Brad Duea will discuss the evolution of voice services on Sept. 21.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=551613&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=43871"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=43871" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=551613+t-mobiles-genius-voice-command-button-just-got-a-lot-smarter&utm_content=kfitchard">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/02/ces-2012-a-recap-and-analysis/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=551613+t-mobiles-genius-voice-command-button-just-got-a-lot-smarter&utm_content=kfitchard">CES 2012: a recap and analysis</a></li><li><a href="http://pro.gigaom.com/2011/12/carrier-iq-and-the-continued-erosion-of-operator-trust/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=551613+t-mobiles-genius-voice-command-button-just-got-a-lot-smarter&utm_content=kfitchard">Carrier IQ and the continued erosion of operator trust</a></li><li><a href="http://pro.gigaom.com/2011/09/the-future-of-mobile-a-segment-analysis-by-gigaom-pro/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=551613+t-mobiles-genius-voice-command-button-just-got-a-lot-smarter&utm_content=kfitchard">The future of mobile: a segment analysis by GigaOM Pro</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/08/09/t-mobiles-genius-voice-command-button-just-got-a-lot-smarter/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/03/t-mobile-mytouch-huawei.jpeg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/03/t-mobile-mytouch-huawei.jpeg?w=150" medium="image">
			<media:title type="html">T-Mobile-myTouch-Huawei</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/0544c4b228f8fa80e31bb952501cd7a4?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">kfitchard</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2011/07/dragongo-e1310663017264.png?w=300" medium="image">
			<media:title type="html">dragongo</media:title>
		</media:content>
	</item>
		<item>
		<title>Speech recognition anyone? AT&amp;T opens up Watson API</title>
		<link>http://gigaom.com/2012/07/09/speech-recognition-anyone-att-opens-up-watson-api/</link>
		<comments>http://gigaom.com/2012/07/09/speech-recognition-anyone-att-opens-up-watson-api/#comments</comments>
		<pubDate>Mon, 09 Jul 2012 22:50:48 +0000</pubDate>
		<dc:creator>Kevin Fitchard</dc:creator>
				<category><![CDATA[Glenn Lurie]]></category>
		<category><![CDATA[John Donovan]]></category>
		<category><![CDATA[language libraries]]></category>
		<category><![CDATA[natural language understanding]]></category>
		<category><![CDATA[speech technology]]></category>
		<category><![CDATA[voice dictation]]></category>
		<category><![CDATA[voice recognition]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=540836</guid>
		<description><![CDATA[Look out Nuance. there’s a new speech recognition player in town, AT&#038;T. Ma Bell has taken the locks off of its Watson speech application programming interfaces, allowing any developer to use them to add voice commands and natural language understanding to their apps.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=540836&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom.com/mobile/what-voice-recognition-technology-could-mean-for-apple-and-all-of-us/voice-recognition/" rel="attachment wp-att-353308"><img  title="voice recognition" src="http://gigaom2.files.wordpress.com/2011/06/voice-recognition-e1306930304143.jpg?w=300&#038;h=199" alt="" width="300" height="199" class="alignleft size-medium wp-image-353308" /></a>Look out Nuance Communications, there’s a new speech recognition player in town. Nuance can’t simply buy up this potential competitor as it <a href="http://gigaom.com/2011/12/20/nuance-buys-vlingo-builds-a-voice-technology-giant/">has in the past</a> since this company happens to be AT&amp;T. Ma Bell has taken the locks off of its Watson speech application programming interfaces (APIs), allowing any developer to access them.</p>
<p>AT&amp;T <a href="http://gigaom.com/2012/04/19/at-att-labs-universal-translators-and-wearable-keys/">promised in April to open Watson up</a> to general development, but on Monday AT&amp;T SVP for technology and network operations John Donovan <a href="http://www.attinnovationspace.com/innovation/story/a7782925">made the launch official in a blog post</a>, revealing seven different “contexts,” or language libraries, that app builders can access to speechify their products.</p>
<p>Like Nuance, AT&amp;T has developed specific taxonomies for specific use cases, allowing its servers to anticipate the words and phrasing consumers are likely to speak in a particular context. Those tailored lexicons allow Watson (not to be confused with IBM’s <a href="http://gigaom.com/2011/11/16/misconceptions-in-ai-or-why-watson-cant-talk-to-siri/">artificial intelligence project Watson</a>) to better capture meaning as well as intent from any given phrase. For instance, the question “where is Trenton?” would be interpreted completely differently if spoken to a Q&amp;A app versus a mapping app.</p>
<p>The contexts are Web Search, Business Search, Voicemail-to-Text transcription, SMS speech transcription, Question and Answer, TV and a category called Generic used for general dictation purposes. The TV context is of particular note because it ties into AT&amp;T’s U-Verse program guide, recognizing queries about particular actors, movies and TV shows.</p>
<p>In general, though, it appears AT&amp;T is only licensing out fairly basic speech recognition and natural language understanding to general developers – features that could be used to build voice search into an app, but nothing that could be used to develop an expansive virtual assistant like Siri.</p>
<p>Watson, however, is likely much more powerful than these contexts let on. Vlingo licenses Watson’s core speech technology for its language servers (though that relationship will likely end after the Nuance acquisition closes), and Research in Motion&#8217;s QNX is working with Watson’s API to develop a conceptual connected car assistant. AT&amp;T also has big plans to leverage Watson’s capabilities for its own products. In a recent interview, AT&amp;T emerging devices president Glenn Lurie hinted that <a href="http://gigaom.com/cleantech/interview-atts-lurie-on-building-the-ios-of-the-connected-home/">Watson could soon make it into its Digital Life connected home technology</a>, becoming the voice-controlled virtual butler we see in science fiction.</p>
<p>Watson does have one big advantage to the average dev: it’s cheap – at least until the end of the year. For the remainder of 2012, AT&amp;T will give free and unlimited access to the Watson APIs to any developer paying AT&amp;T’s $99 annual registration dues. In 2013 <a href="http://developer.att.com/developer/forward.jsp?passedItemId=10100457">a point-based fee structure will kick in</a>. Registered developers will get 5000 points a month and any speech transaction of a minute or less will cost one point. Developers that exceed that amount will be charged $20 for each 2000-point increment, which works out to about 1 cent per transaction.</p>
<p><em>Image courtesy Flickr user <a href="http://www.flickr.com/photos/lazurite/4302673769/">Lazurite</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=540836&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=577255"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=577255" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=540836+speech-recognition-anyone-att-opens-up-watson-api&utm_content=kfitchard">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2011/09/the-future-of-mobile-a-segment-analysis-by-gigaom-pro/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=540836+speech-recognition-anyone-att-opens-up-watson-api&utm_content=kfitchard">The future of mobile: a segment analysis by GigaOM Pro</a></li><li><a href="http://pro.gigaom.com/report/mobile-first-quarter-2013-analysis-and-outlook/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=540836+speech-recognition-anyone-att-opens-up-watson-api&utm_content=kfitchard">Mobile first-quarter 2013: analysis and outlook</a></li><li><a href="http://pro.gigaom.com/2012/07/the-wearable-computing-market-a-global-analysis/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=540836+speech-recognition-anyone-att-opens-up-watson-api&utm_content=kfitchard">Analyzing the wearable computing market</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/07/09/speech-recognition-anyone-att-opens-up-watson-api/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2011/06/voice-recognition-e1306930304143.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2011/06/voice-recognition-e1306930304143.jpg?w=150" medium="image">
			<media:title type="html">voice recognition</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/0544c4b228f8fa80e31bb952501cd7a4?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">kfitchard</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2011/06/voice-recognition-e1306930304143.jpg?w=300" medium="image">
			<media:title type="html">voice recognition</media:title>
		</media:content>
	</item>
		<item>
		<title>Dolphin Browser adds Sonar: Useful voice search</title>
		<link>http://gigaom.com/2012/02/29/dolphin-browser-adds-sonar-useful-voice-search/</link>
		<comments>http://gigaom.com/2012/02/29/dolphin-browser-adds-sonar-useful-voice-search/#comments</comments>
		<pubDate>Wed, 29 Feb 2012 17:00:55 +0000</pubDate>
		<dc:creator>Kevin C. Tofel</dc:creator>
				<category><![CDATA[@CNN]]></category>
		<category><![CDATA[Android]]></category>
		<category><![CDATA[browsers]]></category>
		<category><![CDATA[Dolphin]]></category>
		<category><![CDATA[Dolphin Browser]]></category>
		<category><![CDATA[Mobile Apps]]></category>
		<category><![CDATA[siri]]></category>
		<category><![CDATA[smartphones]]></category>
		<category><![CDATA[tablets]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=491645</guid>
		<description><![CDATA[Dolphin Sonar is now part of the popular Dolphin Browser for Android, making it easy to search the web, share links or open specific web pages by speaking. I tested Sonar and see much to like now, and in the future, thanks to web services.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=491645&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom2.files.wordpress.com/2012/02/dolphin-sonar-featured.jpg"><img  title="dolphin-sonar-featured" src="http://gigaom2.files.wordpress.com/2012/02/dolphin-sonar-featured.jpg?w=210&#038;h=140" alt="" width="210" height="140" class="size-thumbnail wp-image-491694 alignleft" /></a>Voice search on mobiles is becoming all the rage, and the folks behind the popular Dolphin Browser are adding more voice functionality in their latest version. Dolphin Sonar is part of t<a href="https://market.android.com/details?id=mobi.mgeek.TunnyBrowser&amp;feature=search_result#?t=W251bGwsMSwyLDEsIm1vYmkubWdlZWsuVHVubnlCcm93c2VyIl0.">he 7.4 release of Dolphin Browser for Android</a>, making it easy to search the web, share links or open specific web apps and pages by speaking.</p>
<p>Here&#8217;s a short promo video on the new Sonar feature that&#8217;s clearly taking a fun shot at Apple&#8217;s Siri functionality, which launched with the iPhone 4S last October.</p>
<span class='embed-youtube' style='text-align:center; display: block;'><iframe class='youtube-player' type='text/html' width='604' height='370' src='http://www.youtube.com/embed/JD3EN48gwhY?version=3&#038;rel=1&#038;fs=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;wmode=transparent' frameborder='0'></iframe></span>
<p>As a regular user of the Dolphin Browser &#8212; <a href="http://gigaom.com/mobile/8-reasons-android-owners-should-try-dolphin-browser/">here are eight reasons why</a> &#8212; I like the concept of integrated voice actions to use the web. Dolphin has long supported gestures to help navigate the Internet, and the new Sonar feature extends the usefulness of the browser. I&#8217;ve been using Sonar for the past day and generally like where the Dolphin team is headed.</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/02/dolphin-sonar.jpeg"><img  title="dolphin-sonar" src="http://gigaom2.files.wordpress.com/2012/02/dolphin-sonar.jpeg?w=168&#038;h=300" alt="" width="168" height="300" class="alignright size-medium wp-image-491672" /></a>You can tap your keyboard microphone button to use Sonar, but I&#8217;m a fan of the keyboard-less option: simply shake your handset, and Dolphin Sonar is ready to listen. Obviously, you have to have the browser open when shaking the phone; this functionality isn&#8217;t integrated into the operating system.</p>
<p>Most of the time, Sonar simply searches Google for what you need. However, if you want to open a specific website, you can do so through speech. And that&#8217;s handy for certain web apps. For example, using Sonar, I could say, &#8220;Open weather dot com,&#8221; and the mobile site opens immediately, complete with my local forecast. Note that I had previously used the website, so it knew my location. If not, the browser wouldn&#8217;t know where I was. Saying &#8220;eBay iPad 2&#8243; took me right to the eBay mobile site-results for iPad 2.</p>
<p>I wasn&#8217;t able to test the link sharing in Facebook feature as it&#8217;s not quite live yet. The concept is great, and I hope the Dolphin folks can extend that functionality to other web services such as Twitter and Google+. All in all, I like what I see in Dolphin Sonar. Some may find it limited, but it&#8217;s the type of service that can scale to allow for more functionality in the future as our phones become intelligent assistants.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=491645&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=745765"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=745765" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=491645+dolphin-browser-adds-sonar-useful-voice-search&utm_content=kevintofel">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/07/mobile-second-quarter-2012-analysis-and-outlook/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=491645+dolphin-browser-adds-sonar-useful-voice-search&utm_content=kevintofel">Takeaways from mobile&#8217;s second quarter</a></li><li><a href="http://pro.gigaom.com/2012/04/mobile-q1-the-fight-for-spectrum-goes-to-washington-the-tablet-wars-continue/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=491645+dolphin-browser-adds-sonar-useful-voice-search&utm_content=kevintofel">A look back at mobile in Q1</a></li><li><a href="http://pro.gigaom.com/2012/07/the-wearable-computing-market-a-global-analysis/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=491645+dolphin-browser-adds-sonar-useful-voice-search&utm_content=kevintofel">Analyzing the wearable computing market</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/02/29/dolphin-browser-adds-sonar-useful-voice-search/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/02/dolphin-sonar-featured.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/02/dolphin-sonar-featured.jpg?w=150" medium="image">
			<media:title type="html">dolphin-sonar-featured</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/6cbb45abac59965c2626e40155358d1b?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">Kevin C. Tofel</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/02/dolphin-sonar-featured.jpg?w=210" medium="image">
			<media:title type="html">dolphin-sonar-featured</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/02/dolphin-sonar.jpeg?w=168" medium="image">
			<media:title type="html">dolphin-sonar</media:title>
		</media:content>
	</item>
		<item>
		<title>The promise of a truly voice activated world: How real is it</title>
		<link>http://gigaom.com/2011/08/03/the-promise-of-a-truly-voice-activated-world-how-real-is-it/</link>
		<comments>http://gigaom.com/2011/08/03/the-promise-of-a-truly-voice-activated-world-how-real-is-it/#comments</comments>
		<pubDate>Wed, 03 Aug 2011 12:00:46 +0000</pubDate>
		<dc:creator>Ryan Kim</dc:creator>
				<category><![CDATA[@CNN]]></category>
		<category><![CDATA[Mobile]]></category>
		<category><![CDATA[Mobile Apps]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[Sensory]]></category>
		<category><![CDATA[speech recognition]]></category>
		<category><![CDATA[voice activation]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=387104</guid>
		<description><![CDATA[Sensory's TrulyHandsfree Voice Control 2.0 software recognizes and responds to dozens of keywords and phrases and lets people perform actions without the need for a button press. It enables a much more natural interface with a device that can be listening for an array of triggers.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=387104&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom2.files.wordpress.com/2011/08/sensory.jpg"><img  title="sensory" src="http://gigaom2.files.wordpress.com/2011/08/sensory-e1312321712426.jpg?w=300&#038;h=198" alt="" width="300" height="198" class="alignleft size-medium wp-image-387202" /></a>While the world is slowly being controlled by <a href="http://gigaom.com/2011/01/12/google-translate-is-where-speech-meets-the-cloud/">voice activation and speech recognition</a>, it usually requires a physical trigger or button press, which seems to undercut the power of voice control. But that distinction should be fading away as we get more sophisticated technology that allows devices to recognize more voice commands without a physical prompt.</p>
<p>That&#8217;s the promise behind the<a href="http://www.sensoryinc.com/company/pr11_01.html"> release of Sensory&#8217;s TrulyHandsfree Voice Control 2.0 software Tuesday.</a> The latest speech detection technology builds upon the 1.0 version, which allowed users to trigger a device with a small set of words without a prompt. Now version 2.0 recognizes and responds to many more keywords and longer phrases, dozens of them, that can be analyzed in the course of a regular noisy conversation. The latest update increases the accuracy significantly by 60 percent and picks out words better in loud environments. What it enables is a much more natural interface with a device that can be listening for a wide array of triggers.</p>
<p>TrulyHandsfree is already at work in car Bluetooth kits from BlueAnt and Kensington and phones like the Samsung Galaxy S II. And it&#8217;s finding its way into sophisticated toys like Mattel&#8217;s Fijit Friends, which can respond to words spoken to it.</p>
<h2><strong>A voice-activated world</strong></h2>
<p><a href="http://gigaom2.files.wordpress.com/2011/08/screen-shot-2011-08-02-at-3-55-49-pm.png"><img  title="Screen shot 2011-08-02 at 3.55.49 PM" src="http://gigaom2.files.wordpress.com/2011/08/screen-shot-2011-08-02-at-3-55-49-pm.png?w=300&#038;h=179" alt="" width="300" height="179" class="alignright size-medium wp-image-387272" /></a>But that&#8217;s just the start. Sensory&#8217;s CEO Todd Mozer said he&#8217;s been contacted by four TV companies in the last month who are interested in using TrulyHandsfree as a front end for voice controls. He said there could be any number of devices from coffee machines and stereos to home automation systems and automobiles that can utilize the technology. It&#8217;s already starting to happen slowly, but now with more sophisticated and accurate software, we should see this integrated into a lot more products.</p>
<blockquote><p>&#8220;Almost any new product can we use this Truly Handsfree approach, it’s so accurate,&#8221; Mozer said.</p></blockquote>
<h2><strong>Removing buttons</strong></h2>
<p>Mozer said Truly Handsfree is transforming Sensory&#8217;s speech technology business. Since launching a two years ago, the product quickly got integrated into 1/3 of all Sensory&#8217;s voice-enabled products. This year, 2/3 of all products using Sensory&#8217;s technology are utilizing Truly Handsfree.</p>
<p>When combined with smarter speech recognition technology used by <a href="http://gigaom.com/apple/dragon-go-app-gets-smart-with-voice-search/">companies like Nuance</a> , Vlingo, Google and Microsoft, it could really help bring speech technology into the mainstream. We&#8217;re seeing some of that promise now <a href="http://gigaom.com/2010/12/06/microsoft-claims-its-place-in-a-voice-enabled-world/">with devices like the Xbox Kinect</a>, which can respond to some voice commands. But with a front end that can respond to a wide array of triggers and work in a lot of different environments, we could see users start to feel comfortable with speech as an input.</p>
<h2><strong>Mainstream catalyst</strong></h2>
<p>That&#8217;s one of things that still holds me up at times. I like Android Voice Actions, but even just the act of pressing the microphone icon to start an action is an extra step that sometimes deters me. Google recently said 25 percent of searches on Android devices are conducted by voice. Imagine how much that would go up if people could just talk to their phones without waking it up. A solution has to be accurate though, not firing on the wrong words. Mozer said Truly Handsfree 2.0 registers a 10 percent false reject rate and has a false acceptance rate of a word every three to five hours, and it can register words from 20 feet away. It&#8217;s not perfect, but it is getting very good.</p>
<p><a href="http://gigaom2.files.wordpress.com/2011/08/halimages.jpeg"><img  title="HALimages" src="http://gigaom2.files.wordpress.com/2011/08/halimages.jpeg?w=708" alt=""   class="alignleft size-full wp-image-387269" /></a>I think speech technology has a ways to go to the point where we&#8217;re talking to computers like HAL or the system on the Star Trek Enterprise. But it&#8217;s coming together. We still need computers to figure out the harder task of understanding our speech and our intent but that&#8217;s coming around as we&#8217;re seeing with <a href="http://gigaom.com/cloud/what-watson-taught-us-humans-are-very-smart/">projects like IBM&#8217;s Watson</a> . We increasingly have so much data stored in the cloud and on our devices that it makes sense to also tap tools like voice to get at that. Achieving that future involves a lot of progress including the work being done by Sensory. I, for one, can&#8217;t wait to do less button pressing and more voice activating.</p>
<p><span class='embed-youtube' style='text-align:center; display: block;'><iframe class='youtube-player' type='text/html' width='604' height='370' src='http://www.youtube.com/embed/1peRYXmgiS8?version=3&#038;rel=1&#038;fs=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;wmode=transparent' frameborder='0'></iframe></span></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=387104&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=553519"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=553519" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=387104+the-promise-of-a-truly-voice-activated-world-how-real-is-it&utm_content=oryankim">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2011/01/bluetooth-to-feel-blue-as-personal-area-network-battles-loom/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=387104+the-promise-of-a-truly-voice-activated-world-how-real-is-it&utm_content=oryankim">Bluetooth to Feel Blue as Personal Area Network Battles Loom</a></li><li><a href="http://pro.gigaom.com/2012/04/google-doesnt-like-walled-gardens-except-its-own/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=387104+the-promise-of-a-truly-voice-activated-world-how-real-is-it&utm_content=oryankim">Google doesn&#8217;t like walled gardens &#8212; except its own</a></li><li><a href="http://pro.gigaom.com/2012/02/netflix-may-suffer-from-limited-mobility/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=387104+the-promise-of-a-truly-voice-activated-world-how-real-is-it&utm_content=oryankim">Netflix may suffer from limited mobility</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2011/08/03/the-promise-of-a-truly-voice-activated-world-how-real-is-it/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2011/08/sensory-e1312321712426.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2011/08/sensory-e1312321712426.jpg?w=150" medium="image">
			<media:title type="html">sensory</media:title>
		</media:content>

		<media:content url="http://2.gravatar.com/avatar/81c4fca1b2d82a7fb9c8657de52386d1?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">oryankim</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2011/08/sensory-e1312321712426.jpg?w=300" medium="image">
			<media:title type="html">sensory</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2011/08/screen-shot-2011-08-02-at-3-55-49-pm.png?w=300" medium="image">
			<media:title type="html">Screen shot 2011-08-02 at 3.55.49 PM</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2011/08/halimages.jpeg" medium="image">
			<media:title type="html">HALimages</media:title>
		</media:content>
	</item>
	</channel>
</rss>
