<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>GigaOM &#187; speech recognition</title>
	<atom:link href="http://gigaom.com/tag/speech-recognition/feed/" rel="self" type="application/rss+xml" />
	<link>http://gigaom.com</link>
	<description></description>
	<lastBuildDate>Wed, 19 Jun 2013 20:25:50 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='gigaom.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/0db8f6557d022075dbbf010c54d46d93?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>GigaOM &#187; speech recognition</title>
		<link>http://gigaom.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://gigaom.com/osd.xml" title="GigaOM" />
	<atom:link rel='hub' href='http://gigaom.com/?pushpress=hub'/>
		<item>
		<title>Say those magic words: Xbox One, Google Glass and the rise of the always-on microphone</title>
		<link>http://gigaom.com/2013/05/21/say-those-magic-words-xbox-one-google-glass-and-the-rise-of-the-always-on-microphone/</link>
		<comments>http://gigaom.com/2013/05/21/say-those-magic-words-xbox-one-google-glass-and-the-rise-of-the-always-on-microphone/#comments</comments>
		<pubDate>Tue, 21 May 2013 22:07:48 +0000</pubDate>
		<dc:creator>Janko Roettgers</dc:creator>
				<category><![CDATA[Google Glass]]></category>
		<category><![CDATA[speech recognition]]></category>
		<category><![CDATA[Xbox One]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=647953</guid>
		<description><![CDATA[Microsoft's new Xbox One is listening to every word uttered in your living room. But it's only interested in very few of them .<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=647953&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Microsoft’s new Xbox One has many new features, but one in particular raised some eyebrows Tuesday: The new game console will always be on, and users will be able to launch games, live TV or even a Skype call with simple voice commands, and without ever picking up a controller or remote control. Does that mean, <a href="http://www.theverge.com/2013/5/21/4352596/the-xbox-one-is-always-listening">as the Verge mused</a>, that Microsoft will always be listening to each and every word spoken in your living room?</p>
<p>The answer is yes, no, and better get used to it. Microsoft hasn’t actually said how many aspects of the Xbox One are going to work, but the demo it gave at its campus in Redmond, Wash. Tuesday contained some solid hints on the particulars of its voice control. To wake up the device and launch live TV, play a game or do anything at all with it, users will first have to say “Xbox on.”</p>
<p>That’s what people who work on speech recognition call “hot words” &#8211; easily recognizable phrases that can be detected by a system without too much effort. Once a user says that magic word or phrase, the actual speech recognition kicks into high gear.</p>
<p>That means that the Xbox One continuously listens for someone to say “Xbox on,” and that everything else that’s spoken is automatically disregarded. Listening for these hot words is done locally and doesn’t require much in terms of system resources. For example, there&#8217;s no need to record anything, since all that matters are the hot words. But once those words are uttered, the Xbox One is going to use advanced speech recognition to figure out what users are actually talking about.</p>
<p>Again, Microsoft hasn&#8217;t said exactly how this is going to work, but a spokesperson told me that some of the personalization offered by the device is &#8220;one of the benefits of Xbox One being connected to and powered by the cloud.&#8221; I&#8217;d expect that the same is true for speech recognition, much in the same way that Google uploads everything you say to its servers when you use voice search on your Android phone.</p>
<p><a href="http://gigaom.com/2013/05/21/xbox-one-announcement/xbox-one-feature-art/" rel="attachment wp-att-647818"><img src="http://gigaom2.files.wordpress.com/2013/05/xbox-one-feature-art.jpg?w=708" alt="xbox one feature art"    class="aligncenter size-full wp-image-647818" /></a></p>
<p>The use of hot wording to wake up technology from a state of low-level listening to launch active speech recognition isn’t new. It’s also at work in Google Glass, where users get the device’s attention by saying “okay glass.” Google Now simply uses “Google” as a hot word to launch voice input. And the Xbox 360 starts to accept voice commands once users yell “Xbox” at the device’s Kinect sensor.</p>
<p>The difference between how the Xbox 360 and the Xbox One approach voice recognition isn’t so much about technology, even though Xbox users probably hope that the new iteration <a href="http://www.geek.com/games/the-technology-behind-the-kinects-voice-recognition-is-ingenious-but-doesnt-work-very-well-1275099/">is going to work better</a>. What makes people feel uncomfortable is that the Xbox One, and with it its microphone, are meant to be always on.</p>
<p>However, the always-on microphone of the Xbox One is just a sign of things to come. Voice input is going to become a key component of a growing number of internet-connected devices and appliances in your home, car and office, and many of them will use hot words to switch from low-level listening to active speech recognition.</p>
<p>In fact, you are likely looking at one of those devices right now: Laptops, tablets and mobile phones all contain microphones, and they’re all waiting to become hot words-aware any day now. <a href="http://www.youtube.com/watch?v=W7NdrPju9YE">Google just demonstrated how it is going to add hot wording</a> to search on the desktop at last week’s Google I/O conference, allowing users to start a voice search query by simply saying “okay Google” without touching a single button.</p>
<p>Of course, all of this doesn’t mean that there are no privacy issues around hot wording and always-on microphones. Companies should make it clear how exactly they’re using the technology as it is becoming more widely distributed, and there should always be a way to opt out and rely on alternative input methods. It may also be a good idea to indicate to users  when exactly a device is reverting back from active speech recognition to a state of passive listening. But I’d expect that most consumers quickly get used to the constantly running mic, always listening for those magic words.</p>
<p><em>Image <a href="http://creativecommons.org/licenses/by/2.0/">courtesy of</a> Flickr user <a href="http://www.flickr.com/photos/visual_dichotomy/3623619145/">visual.dichotomy.</a></em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=647953&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=771673"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=771673" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=647953+say-those-magic-words-xbox-one-google-glass-and-the-rise-of-the-always-on-microphone&utm_content=jroettgers">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/05/the-living-room-reinvented-trends-technologies-and-companies-to-watch/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=647953+say-those-magic-words-xbox-one-google-glass-and-the-rise-of-the-always-on-microphone&utm_content=jroettgers">Who and what to watch in the new era of the living room</a></li><li><a href="http://pro.gigaom.com/2012/01/connected-consumer-q4-sopa-and-the-future-of-digital-content/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=647953+say-those-magic-words-xbox-one-google-glass-and-the-rise-of-the-always-on-microphone&utm_content=jroettgers">Q4 Wrap-up: SOPA and the future of digital content</a></li><li><a href="http://pro.gigaom.com/2011/12/connected-consumer-2012-a-year-of-consolidation-and-integration/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=647953+say-those-magic-words-xbox-one-google-glass-and-the-rise-of-the-always-on-microphone&utm_content=jroettgers">Connected Consumer 2012: A year of consolidation and integration</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/21/say-those-magic-words-xbox-one-google-glass-and-the-rise-of-the-always-on-microphone/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/3623619145_2daf5d53c4_o.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/3623619145_2daf5d53c4_o.jpg?w=150" medium="image">
			<media:title type="html">microphone</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/08bc62ecf138202f06b74dfa01376e74?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">jroettgers</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/xbox-one-feature-art.jpg" medium="image">
			<media:title type="html">xbox one feature art</media:title>
		</media:content>
	</item>
		<item>
		<title>Why stop at talking phones? Nuance intros voice ads</title>
		<link>http://gigaom.com/2013/04/01/why-stop-at-talking-phones-nuance-intros-voice-ads/</link>
		<comments>http://gigaom.com/2013/04/01/why-stop-at-talking-phones-nuance-intros-voice-ads/#comments</comments>
		<pubDate>Mon, 01 Apr 2013 12:00:26 +0000</pubDate>
		<dc:creator>Kevin Fitchard</dc:creator>
				<category><![CDATA[advertising]]></category>
		<category><![CDATA[Digital Assistant]]></category>
		<category><![CDATA[Mike McSherry]]></category>
		<category><![CDATA[speech recognition]]></category>
		<category><![CDATA[voice-interface]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=625825</guid>
		<description><![CDATA[Nuance powers speech interfaces in phones, TVs, cars and websites. Now its creating advertising that will both listen to your questions and respond to them.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=625825&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Personal assistants like Siri already hear and respond in spoken language. Nuance Communications, the <a href="http://gigaom.com/2011/12/20/nuance-buys-vlingo-builds-a-voice-technology-giant/">company that powers the speech understanding technologies</a> behind Siri and other digital assistants think the ads displayed in our phones should listen to our voices as well.</p>
<p><a href="http://gigaom.com/?attachment_id=625827" rel="attachment wp-att-625827"><img  alt="Nuance Voice Ads screen shot" src="http://gigaom2.files.wordpress.com/2013/03/mlszi8fmlfbiwglzpopy1v5z_bl7s3nmvvvmhvbjxs4.png?w=168&#038;h=300" width="168" height="300" class="alignleft size-medium wp-image-625827" /></a>Nuance on Monday announced it’s expanding its speech interface expertise into <a href="http://gigaom.com/2013/01/07/nuance-to-create-a-universal-voice-assistant-bridging-phones-tvs-and-cars/">yet another industry</a>, advertising, launching a new service for app developers called Voice Ads. Yes, we’re talking about talking ads here – well, at least listening ads. Whether the ad responds via text, video or spoken word is entirely up to the ad creator. The key is that advertisers now have a new way of engaging with their audience rather than just mere splashy images, said Mike McSherry, VP of Advertising and former CEO of Swype, the predicative keyboard company <a href="http://gigaom.com/2011/10/07/why-nuance-a-speech-company-bought-swype/">Nuance acquired in 2011</a>.</p>
<p>Say Jeff Daniels is the spokesman for your product. Instead of just flashing up an image of Daniels holding your widget in the confined space of an app banner ad, your ad could let you virtually ask Daniels questions about it and your company. Answers are pre-recorded, of course, but as we’ve witnessed with the depth and variety of conversational answers Apple has given Siri (try asking Siri if she believes in God), a committed ad agency could have some fun with the medium.</p>
<p>McSherry said that mobile app developers can embed Voice Ad technology into their apps through an SDK, which will allow ad networks like JumpTap, Millennial and Ad Marvel to serve up the new formats in those apps, either as banner or interstitial ads. Nuance is currently working with three agency partners to develop the new ads: Digitas, OMD and Leo Burnett.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=625825&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=269466"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=269466" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=625825+why-stop-at-talking-phones-nuance-intros-voice-ads&utm_content=kfitchard">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/survey-how-apps-can-solve-photo-management/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=625825+why-stop-at-talking-phones-nuance-intros-voice-ads&utm_content=kfitchard">Survey: How apps can solve photo management</a></li><li><a href="http://pro.gigaom.com/report/sector-roadmap-social-customer-service-in-2013/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=625825+why-stop-at-talking-phones-nuance-intros-voice-ads&utm_content=kfitchard">Sector RoadMap: Social customer service in 2013</a></li><li><a href="http://pro.gigaom.com/report/frenemy-mine-the-pros-and-cons-of-social-partnerships-for-online-media-companies/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=625825+why-stop-at-talking-phones-nuance-intros-voice-ads&utm_content=kfitchard">Frenemy mine: The pros and cons of social partnerships for online media companies</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/04/01/why-stop-at-talking-phones-nuance-intros-voice-ads/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/01/shutterstock_8805310-e1357582414559.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/01/shutterstock_8805310-e1357582414559.jpg?w=150" medium="image">
			<media:title type="html">Speech recognition, voice analyzer</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/0544c4b228f8fa80e31bb952501cd7a4?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">kfitchard</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/03/mlszi8fmlfbiwglzpopy1v5z_bl7s3nmvvvmhvbjxs4.png?w=168" medium="image">
			<media:title type="html">Nuance Voice Ads screen shot</media:title>
		</media:content>
	</item>
		<item>
		<title>Joyride asks: Why invest in a connected car when a smartphone will do?</title>
		<link>http://gigaom.com/2013/03/25/joyride-asks-why-invest-in-a-connected-car-when-a-smartphone-will-do/</link>
		<comments>http://gigaom.com/2013/03/25/joyride-asks-why-invest-in-a-connected-car-when-a-smartphone-will-do/#comments</comments>
		<pubDate>Mon, 25 Mar 2013 15:54:58 +0000</pubDate>
		<dc:creator>Kevin Fitchard</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[connected car]]></category>
		<category><![CDATA[hands-free]]></category>
		<category><![CDATA[James Zhang]]></category>
		<category><![CDATA[Jeff Chen]]></category>
		<category><![CDATA[Seth Goldstein]]></category>
		<category><![CDATA[speech recognition]]></category>
		<category><![CDATA[voice-interface]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=623857</guid>
		<description><![CDATA[A new startup backed Turntable's Seth Goldstein plans to make connected software without the car. Joyride is building a gaming and entertainment suite for the smartphone designed to be entirely hands-free.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=623857&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Automakers are coaxing entertainment apps out of the smartphone and into the dashboard, <a href="http://gigaom.com/2013/02/06/the-connected-car-of-the-future-infographic/">turning the connected car into the next big mobile services platform</a>. But a new startup in San Francisco called <a href="http://www.getjoyride.com/">Joyride</a> is wondering why you would even bother with dashboard software development if the tools for making a good in-car app are already in the smartphone.</p>
<p>Joyride is creating a voice user interface for the smartphone designed to function much like the voice command-and-control systems in your car. Founders Jeff Chen and James Zhang created Skyvi, <a href="https://play.google.com/store/apps/details?id=com.bluetornadosf.smartypants&amp;hl=en">a voice-assistant app for Android</a> that received 5 million downloads. With the help of $1 million in seed funding, they’re now building similar voice-interaction technology into Joyride with the aim of creating a game, entertainment and education platform that is fully hands-free. Though you could use its app anywhere, Joyride CEO Chen said, it’s most useful within the restrictive confines of the car.</p>
<p><a href="http://gigaom.com/2013/03/25/joyride-asks-why-invest-in-a-connected-car-when-a-smartphone-will-do/screenshot_joyride/" rel="attachment wp-att-623859"><img  alt="Joyride screenshot" src="http://gigaom2.files.wordpress.com/2013/03/screenshot_joyride.png?w=153&#038;h=300" width="153" height="300" class="alignleft size-medium wp-image-623859" /></a>Joyride plans to launch an Android app in the next few months &#8212; right now it’s in a private beta &#8212; and its first service will be a trivia game Chen described as “a hands-free version of Words With Friends.” But Joyride plans to layer more games and services on top of the app, and will eventually invite outside developers to embed their own applications into the Joyride app framework.</p>
<p>“Think of Joyride as a portal,” Chen said. “It’s an enabling technology intended for people to build things on top of.” Joyride would be the overarching brand and its voice technology would not only power the apps themselves, but allow you to navigate between them. The first Joyride apps will be games, but Chen hopes to layer on any manner of service that lends itself to voice interaction, from audiobooks and music streaming to highly interactive language-learning apps.</p>
<p>Joyride makes use of the car’s stereo system through an auxiliary jack or a Bluetooth connection, though the app doesn’t actually tap into any <a href="http://gigaom.com/2013/01/12/at-ces-the-connected-car-became-truly-connected/">connected car software or OS</a> &#8212; it’s just using the speakers. From there, all interactions are done via voice &#8212; it makes use of Google’s speech API &#8212; requiring  no actual physical input, Chen said. The approach also has the advantage of making the app car agnostic. If you can connect your iPod to the car, you can connect Joyride.</p>
<p>Joyride’s $1 million seed round comes Cowboy Ventures and Freestyle Capital as well as from angel investor and <a href="http://gigaom.com/2011/09/13/turntable-fm-funding-iphone-app/">Turnable.fm</a> co-founder Seth Goldstein, who is also Joyride’s executive chairman.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=623857&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=775369"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=775369" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=623857+joyride-asks-why-invest-in-a-connected-car-when-a-smartphone-will-do&utm_content=kfitchard">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2011/12/carrier-iq-and-the-continued-erosion-of-operator-trust/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=623857+joyride-asks-why-invest-in-a-connected-car-when-a-smartphone-will-do&utm_content=kfitchard">Carrier IQ and the continued erosion of operator trust</a></li><li><a href="http://pro.gigaom.com/2011/09/the-future-of-mobile-a-segment-analysis-by-gigaom-pro/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=623857+joyride-asks-why-invest-in-a-connected-car-when-a-smartphone-will-do&utm_content=kfitchard">The future of mobile: a segment analysis by GigaOM Pro</a></li><li><a href="http://pro.gigaom.com/2011/08/what-the-google-motorola-deal-means-for-android-microsoft-and-the-mobile-industry/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=623857+joyride-asks-why-invest-in-a-connected-car-when-a-smartphone-will-do&utm_content=kfitchard">What the Google-Motorola deal means for Android, Microsoft and the mobile industry</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/03/25/joyride-asks-why-invest-in-a-connected-car-when-a-smartphone-will-do/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/03/founders-e1364225225404.png?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/03/founders-e1364225225404.png?w=150" medium="image">
			<media:title type="html">Joyride founders Jeff Chen and James Zhang</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/0544c4b228f8fa80e31bb952501cd7a4?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">kfitchard</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/03/screenshot_joyride.png?w=153" medium="image">
			<media:title type="html">Joyride screenshot</media:title>
		</media:content>
	</item>
		<item>
		<title>Real &#8220;touch&#8221; screens and tasteful computers: IBM predicts the future</title>
		<link>http://gigaom.com/2012/12/16/ibms-new-5-in-5-techs-are-about-bringing-computers-to-our-senses/</link>
		<comments>http://gigaom.com/2012/12/16/ibms-new-5-in-5-techs-are-about-bringing-computers-to-our-senses/#comments</comments>
		<pubDate>Mon, 17 Dec 2012 05:01:11 +0000</pubDate>
		<dc:creator>Barb Darrow</dc:creator>
				<category><![CDATA[Bernie Myerson]]></category>
		<category><![CDATA[General Electric]]></category>
		<category><![CDATA[IBM]]></category>
		<category><![CDATA[Pavegen]]></category>
		<category><![CDATA[speech recognition]]></category>
		<category><![CDATA[Watson]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=594862</guid>
		<description><![CDATA[IBM's latest "5 in 5" list picks technologies the IT giant feels will hit the mainstream in the next 5 years. Its 2012 choices all center on richer data input and output that give computers the ability to taste, smell, touch, and feel.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=594862&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>What if a computer could let us &#8220;feel&#8221; the texture of a fabric before we buy clothes online? Or gives us a whiff &#8212; or even a taste &#8212; of a meal we&#8217;re thinking of preparing? That&#8217;s pretty game-changing stuff. And, it&#8217;s also within the realm of possibility in the next 5 years, according to IBM&#8217;s list of technologies it thinks are on the cusp of adoption.</p>
<p>Every year IBM polls its R&amp;D braintrust about what technologies that may have &#8220;been at the hairy edge before but are now closer to the scalp,&#8221; IBM fellow and VP of innovation Bernie Meyerson told me recently. This year those &#8220;closer-to-the-scalp&#8221; technologies converge around computers&#8217; growing ability to handle richer, more diverse data and churn out more valuable output &#8212; such as the feel of cloth, the smell or taste of food. The general premise is that these sensory and cognitive technologies will convert computers from glorified calculators into true thinking machines.</p>
<p>So, without further ado, here&#8217;s IBM&#8217;s sixth annual <a href="http://www.ibm.com/smarterplanet/us/en/ibm_predictions_for_future/ideas/index.html">5 in 5</a> technology picks.</p>
<h2>1: Computers with a sense of touch</h2>
<p><a href="http://gigaom.com/2012/12/16/ibms-new-5-in-5-techs-are-about-bringing-computers-to-our-senses/shutterstock_112467080/" rel="attachment wp-att-594880"><img  alt="Velvet texture" src="http://gigaom2.files.wordpress.com/2012/12/shutterstock_112467080.jpg?w=300&#038;h=200" width="300" height="200" class="alignright size-medium wp-image-594880" /></a>Even people who love shopping online say that it&#8217;s hard to get a good read on the finished product from a digital image alone. Most of us want to <em>feel</em> the fabric before we buy a big-ticket item. So what if you could sample that cashmere coat from your cell phone before adding it to your shopping cart? Texture data fed into a machine&#8217;s piezoelectric drivers can re-create vibrations and temperature on a touch screen can simulate that feel, Meyerson said. &#8220;Imagine you have very fine pixels and that each can be heated and vibrated to mimic the sensation of the cloth,&#8221; he said.</p>
<p>Some of this capability is available now in rudimentary form in computer games where the controller shakes to indicate an on-screen car collision.</p>
<h2>2: Seeing the forest, not just the trees</h2>
<p>If you have to rasterize an image in order to analyze it, any sort of correlation will take a long time. If the computer can instead really see and understand that image for what it represents &#8212; say, a child, as opposed to a bunch of pixels &#8212; it can accelerate the whole process of analysis. That in turn will make the parsing of things like medical images and traffic video much faster. The difference here is between the computer viewing an image and understanding that image without having to break it down into myriad components. That&#8217;s the way humans deal with the world. Computers could monitor scanned images of a person over time to watch for and detect changes that indicate a health condition before it gets too serious for example.</p>
<p><a href="http://gigaom.com/data/how-trifacta-wants-to-teach-humans-and-data-to-work-together/shutterstock_110597087/" rel="attachment wp-att-569798"><img  alt="Humans and data as one" src="http://gigaom2.files.wordpress.com/2012/10/shutterstock_110597087.jpg?w=300&#038;h=225" width="300" height="225" class="alignleft size-medium wp-image-569798" /></a></p>
<h2>3: Hearing the whole story</h2>
<p>Just as computers need to see images as whole entities, IBM thinks they also need to hear total sounds &#8212; ambient noise, words, music, a lot of inputs to get the full story. &#8220;It&#8217;s not necessarily just hearing words, hearing is also background noise &#8230; if a cell phone caller is in a car with an engine running at 2,000 rpm, you might even be able to tell if the driver is stuck in traffic or moving smoothly,&#8221; Meyerson said.</p>
<p>By embedding sensors in flood prone areas, this technology could warn users based on what it&#8217;s learned from past sounds, as to whether a mud slide is likely. Computers could also likewise learn based on past experience when a baby&#8217;s cry is due to a wet diaper, teething, or something more serious.</p>
<h2>4: Digitized taste buds</h2>
<p>IBM&#8217;s brainiacs think that machines will increasingly be able to taste things &#8212; like chocolate or eggplant &#8211; and figure out why people do or don&#8217;t like that taste. As Kevin Fitchard, GigaOM&#8217;s resident foodie, recently reported, <a href="http://gigaom.com/cleantech/hacking-meat-can-technology-make-us-eat-fewer-animals/">some of this is happening now</a>.  For example, researcher and app developer <a href="https://www.foodpairing.com/">Foodpairing</a></p>
<blockquote><p>&#8220;has broken down flavor to its molecular components and has compiled databases that can match the flavor of those ingredients against other completely different ingredients. By compiling “foodpairing trees” its technology can identify vegetable or seafood ingredients that reinforce the flavor of different meats, or in some cases, can act as a substitute for a meat entirely.&#8221;</p></blockquote>
<p>This understanding of the chemical elements of food could help people get healthier by subbing in something that tastes like milk chocolate but is better for them.</p>
<h2>5: A nose that knows</h2>
<p><a href="http://gigaom.com/2012/12/16/ibms-new-5-in-5-techs-are-about-bringing-computers-to-our-senses/11660993_707cac1059_z/" rel="attachment wp-att-594884"><img  alt="Dog's nose" src="http://gigaom2.files.wordpress.com/2012/12/11660993_707cac1059_z.jpg?w=300&#038;h=225" width="300" height="225" class="alignright size-medium wp-image-594884" /></a>Breath analysis can do more than <a href="http://www.pcmag.com/article2/0,2817,2413113,00.asp">keep drunk drivers off the road</a>. What if your smartphone could tell from your breath that you&#8217;re about to get a cold? It&#8217;s conceivable that your doctor would be able to diagnose you remotely based on that information and prescribe treatment. This technology could also sniff out minuscule amounts of environmental toxins before they hit critical mass, which could have broad public health ramifications.</p>
<p>And then there&#8217;s just the quality of life aspect. &#8220;You can paint chemical sensors on a surface and when they detect a pattern, they give off a smell &#8212; you could make a rich paint with all sorts of sensors that mimic things that you like,&#8221; Meyerson said.</p>
<h2>So, how&#8217;s IBM doing as a sooth sayer?</h2>
<p>Since I&#8217;m still waiting for the jet packs we were promised decades ago, I&#8217;m skeptical about technology predictions, but IBM&#8217;s list provides a good starting point to track tech progress and priorities. It&#8217;s also fun to grade its prognostication skills.</p>
<p>Looking at <a href="http://gigaom.com/2011/12/19/take-a-look-at-ibms-5-innovations-for-next-five-years/">last year&#8217;s 5 in 5</a> predictions, it&#8217;s fair to say there are hits and misses. For example, last year it said junk mail will get so targeted it will actually cease to be junk at all. If that&#8217;s happening, I&#8217;m not seeing it.</p>
<p>Another 2011 prediction was we&#8217;d get much better at capturing and using wasted kinetic energy  &#8211; from people walking, riding bikes,from  running water etc. There is early traction there. Los Angeles is testing advanced flywheel technology as a way to <a href="http://www.sustainablebusiness.com/index.cfm/go/news.display/id/24305">reap wasted energy from braking trains</a> and re-apply it when trains accelerate.  And <a href="http://www.pavegen.com/">Pavegen</a> is building sidewalk tiles designed to <a href="http://www.cnn.com/2011/10/13/tech/innovation/pavegen-kinetic-pavements/index.html">capture energy from walking pedestrians</a>.</p>
<p><a href="http://gigaom.com/cloud/better-medicine-brought-to-you-by-big-data/watsonpower7/" rel="attachment wp-att-542635"><img  alt="WatsonPower7" src="http://gigaom2.files.wordpress.com/2012/07/watsonpower7.jpeg?w=300&#038;h=199" width="300" height="199" class="alignleft size-medium wp-image-542635" /></a>As for mind-reading headsets that measure our brain activity and recognize our facial expressions: Um, no, don&#8217;t think so. But to be fair, IBM has 4 more years to make good.</p>
<p>Taking the longer view, looking at IBM&#8217;s inaugural list in 2006, it does better. It was on the money with its call that people would be able to access healthcare remotely. There are lots of tele-radiology options and doctors can even perform surgery remotely. IBM also predicted real-time speech translation now exemplified by products like <a href="http://www.youtube.com/watch?v=YjqwRPUMhBU">Samsung&#8217;s Galaxy speech translation</a>. Meyerson admits to some less successful calls &#8212; especially one about hydrogen-powered vehicles &#8212; but he&#8217;s pretty happy overall with IBM&#8217;s effort.</p>
<p>For more check out the video below:<br />
<span class='embed-youtube' style='text-align:center; display: block;'><iframe class='youtube-player' type='text/html' width='604' height='370' src='http://www.youtube.com/embed/wXkfrBJqVcQ?version=3&#038;rel=0&#038;fs=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;wmode=transparent' frameborder='0'></iframe></span></p>
<p><em>Fabric feature photo courtesy of Shutterstock user <a href="http://www.shutterstock.com/gallery-960088p1.html">sbko</a>; </em><em><a title="Attribution License" href="http://creativecommons.org/licenses/by/2.0/">Nose photograph courtesy of </a> Flickr user <a href="http://www.flickr.com/photos/springfieldhomer/">Slideshow Bruce</a></em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=594862&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=51434"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=51434" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=594862+ibms-new-5-in-5-techs-are-about-bringing-computers-to-our-senses&utm_content=gigabarb">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2011/04/smart-grid-apps-six-trends-that-will-shape-grid-evolution/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=594862+ibms-new-5-in-5-techs-are-about-bringing-computers-to-our-senses&utm_content=gigabarb">Smart Grid Apps: Six Trends That Will Shape Grid Evolution</a></li><li><a href="http://pro.gigaom.com/2010/12/9-companies-that-pushed-the-infrastructure-discussion-in-2010/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=594862+ibms-new-5-in-5-techs-are-about-bringing-computers-to-our-senses&utm_content=gigabarb">9 Companies that Pushed the Infrastructure Discussion in 2010</a></li><li><a href="http://pro.gigaom.com/report/building-energy-management-systems-overview-and-forecast/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=594862+ibms-new-5-in-5-techs-are-about-bringing-computers-to-our-senses&utm_content=gigabarb">Building energy management systems: overview and forecast</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/12/16/ibms-new-5-in-5-techs-are-about-bringing-computers-to-our-senses/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/12/img_0090.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/12/img_0090.jpg?w=150" medium="image">
			<media:title type="html">plants</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/4af03439988d64f816da72496325cb73?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">gigabarb</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/12/shutterstock_112467080.jpg?w=300" medium="image">
			<media:title type="html">Velvet texture</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/10/shutterstock_110597087.jpg?w=300" medium="image">
			<media:title type="html">Humans and data as one</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/12/11660993_707cac1059_z.jpg?w=300" medium="image">
			<media:title type="html">Dog&#039;s nose</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/07/watsonpower7.jpeg?w=300" medium="image">
			<media:title type="html">WatsonPower7</media:title>
		</media:content>
	</item>
		<item>
		<title>Want to speak Chinese without learning it? Here&#8217;s how</title>
		<link>http://gigaom.com/2012/11/09/want-to-speak-chinese-without-learning-it-heres-how/</link>
		<comments>http://gigaom.com/2012/11/09/want-to-speak-chinese-without-learning-it-heres-how/#comments</comments>
		<pubDate>Fri, 09 Nov 2012 14:25:29 +0000</pubDate>
		<dc:creator>Barb Darrow</dc:creator>
				<category><![CDATA[Deep Neural Networks]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[speech recognition]]></category>
		<category><![CDATA[University of Toronto]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=582704</guid>
		<description><![CDATA[New "deep neural networks" technology out of Microsoft and the University of Toronto claims to not only translate your English into Mandarin but to speak it in your voice. Microsoft head researcher Richard Rashid demonstrated the technology in China recently.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=582704&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Talk about your time savers. New research from Microsoft and the University of Toronto may make it possible for non-Chinese speakers to &#8220;speak&#8221; the language in their own voices without having to learn the language. Given the trade relationships between the US and China, this could be a really big deal if it works as advertised.</p>
<p>While great strides have been made in speech recognition over the past decades, the current systems still carry word error rates of 20 percent to 25 percent when handling &#8220;arbitrary speech,&#8221; Microsoft&#8217;s Richard Rashid wrote in a <a href="http://blogs.technet.com/b/next/archive/2012/11/08/microsoft-research-shows-a-promising-new-breakthrough-in-speech-translation-technology.aspx#.UJ0H4OOe-aE">blog post. </a> (Do you hear that Siri?)</p>
<p>But now, new technology called <a href="http://research.microsoft.com/apps/pubs/?id=144412">Deep Neural Networks,</a> which mimics the way the human brain operates, enables much more discriminating speech recognition, according to Rashid, Microsoft&#8217;s chief research officer.</p>
<p>Rashid, who demonstrated the technology at a Microsoft conference in Tianjin, China in late October said the process takes text from the subject&#8217;s speech, runs it through a translator, first finding the Chinese (I&#8217;m assuming Mandarin) equivalents for his words, then rearranges the words in a way that is appropriate to the new language.</p>
<p>In addition, a text-to-speech system uses samples from a native Chinese speaker and from the English speaker&#8217;s own voice from pre-recorded English data to model the sound of the speaker&#8217;s voice. Watching a video of Rashid&#8217;s demo, it&#8217;s clear that the technology impressed the audience of Tianjin students. The system is not perfect, Rashid said, but it does cut word error rate by 30 percent.</p>
<p>Microsoft, by the way, last week inked a deal with 21Vianet, a Shanghai-based ISP, <a href="http://gigaom.com/cloud/microsoft-takes-azure-to-china/">to bring both Windows Azure public cloud services and Office 365 to China</a>.</p>
<p>More here about the translation technology from <a href="http://thenextweb.com/microsoft/2012/11/08/microsoft-demos-amazing-english-to-mandarin-translation-allowing-for-real-time-audible-translations">TheNextWeb</a> and <a href="http://7bd45512.zxxo.net/url/http://feeds.slashgear.com/~r/slashgear/~3/MBNTKyPcEnQ/">SlashGear</a>. And now, check out the video.</p>
<p><iframe src="http://www.youtube.com/embed/Nu-nlQqFCKg?rel=0" height="315" width="420"></iframe></p>
<p><em> </em></p>
<div></div>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=582704&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=312023"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=312023" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=582704+want-to-speak-chinese-without-learning-it-heres-how&utm_content=gigabarb">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/sector-roadmap-social-customer-service-in-2013/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=582704+want-to-speak-chinese-without-learning-it-heres-how&utm_content=gigabarb">Sector RoadMap: Social customer service in 2013</a></li><li><a href="http://pro.gigaom.com/report/connected-consumer-first-quarter-2013-analysis-and-outlook/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=582704+want-to-speak-chinese-without-learning-it-heres-how&utm_content=gigabarb">Connected consumer first-quarter 2013: Analysis and outlook</a></li><li><a href="http://pro.gigaom.com/report/social-first-quarter-2013-analysis-and-outlook/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=582704+want-to-speak-chinese-without-learning-it-heres-how&utm_content=gigabarb">Social first-quarter 2013: analysis and outlook</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/11/09/want-to-speak-chinese-without-learning-it-heres-how/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
	
		<media:thumbnail url="http://earth2tech.files.wordpress.com/2009/11/obama-hu-beijing.jpg?w=150" />
		<media:content url="http://earth2tech.files.wordpress.com/2009/11/obama-hu-beijing.jpg?w=150" medium="image">
			<media:title type="html">Obama-Hu-Beijing</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/4af03439988d64f816da72496325cb73?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">gigabarb</media:title>
		</media:content>
	</item>
		<item>
		<title>Google explains how more data means better speech recognition</title>
		<link>http://gigaom.com/2012/10/31/google-explains-how-more-data-means-better-speech-recognition/</link>
		<comments>http://gigaom.com/2012/10/31/google-explains-how-more-data-means-better-speech-recognition/#comments</comments>
		<pubDate>Wed, 31 Oct 2012 21:03:37 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[data science]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[natural language processing]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[speech recognition]]></category>
		<category><![CDATA[voice search]]></category>
		<category><![CDATA[YouTube]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=579350</guid>
		<description><![CDATA[A new research paper from Google highlights the importance of big data in creating consumer-friendly services such as voice search on smartphones. More data helps train smarter models, which can then better predict what someone say next -- letting you keep your eyes on the road.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=579350&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>A <a href="http://research.google.com/pubs/pub40491.html">new research paper out of Google</a> describes in some detail the data science behind the the company&#8217;s speech recognition applications, such as voice search and adding captions or tags to YouTube videos. And although the math might be beyond most people&#8217;s grasp, the concepts are not. The paper underscores why everyone is so excited about the prospect of &#8220;big data&#8221; and also how important it is to choose the right data set for the right job.</p>
<p>Google has always been a fan of the idea that more data is better, as exemplified by Research Director Peter Norvig&#8217;s stance that, generally speaking, more data trumps better algorithms (see, e.g., his 2009 paper titled &#8220;<a href="http://googleresearch.blogspot.com/2009/03/unreasonable-effectiveness-of-data.html">The Unreasonable Effectiveness of Data</a>&#8220;). Although <a href="http://technocalifornia.blogspot.com/2012/07/more-data-or-better-models.html">some hair-splitting does occur</a> about the relative value (or lack thereof) of algorithms in Norvig&#8217;s assessment, it&#8217;s pretty much an accepted truth at this point and drives much of the discussion around big data. The more data your models have from which to learn, the more accurate they become &#8212; <a href="http://gigaom.com/data/5-ideas-to-help-everyone-make-the-most-of-big-data/">even if they weren&#8217;t cutting-edge stuff to begin with</a>.</p>
<p>No surprise, then, it turns out that more data is also better for training speech-recognition systems. The researchers found that data sets and larger language models (here&#8217;s <a href="http://en.wikipedia.org/wiki/Language_model#N-gram_models">a Wikipedia explanation of the n-gram type</a> involved in Google&#8217;s research) result in fewer errors predicting the next word based on the words that precede it. Discussing the research <a href="http://googleresearch.blogspot.com/2012/10/large-scale-language-modeling-in.html">in a blog post on Wednesday</a>, Google research scientist Ciprian Chelba gives the example that a good model will attribute a higher probability to &#8220;pizza&#8221; as the next word than to &#8220;granola&#8221; if the previous two words were &#8220;New York.&#8221; When it comes to voice search, his team found that &#8220;increasing the model size by two orders of magnitude reduces the [word error rate] by 10% relative.&#8221;</p>
<p>The real key, however &#8212; as any data scientist will tell you &#8212; is knowing what type of data is best to train your models, whatever they are. For the voice search tests, the Google researchers used 230 billion words that came from &#8220;a random sample of anonymized queries from google.com that did not trigger spelling correction.&#8221; However, because people speak and write prose differently than they type searches, the YouTube models were fed data from transcriptions of news broadcasts and large web crawls.</p>
<p>&#8220;As far as language modeling is concerned, the variety of topics and speaking styles makes a language model built from a web crawl a very attractive choice,&#8221; they write.</p>
<p>This research isn&#8217;t necessarily groundbreaking, but helps drive home the reasons that topics such as big data and data science get so much attention these days. As consumers demand ever smarter applications and more frictionless user experiences, every last piece of data and every decision about how to analyze it matters.</p>
<p><em>Feature image courtesy of <a href="http://www.shutterstock.com/gallery-709726p1.html">Shutterstock user watcharakun</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=579350&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=989218"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=989218" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=579350+google-explains-how-more-data-means-better-speech-recognition&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/sector-roadmap-social-customer-service-in-2013/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=579350+google-explains-how-more-data-means-better-speech-recognition&utm_content=dharrisstructure">Sector RoadMap: Social customer service in 2013</a></li><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=579350+google-explains-how-more-data-means-better-speech-recognition&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2011/11/connected-world-the-consumer-technology-revolution/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=579350+google-explains-how-more-data-means-better-speech-recognition&utm_content=dharrisstructure">Connected world: the consumer technology revolution</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/10/31/google-explains-how-more-data-means-better-speech-recognition/feed/</wfw:commentRss>
		<slash:comments>10</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/10/shutterstock_110381039.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/10/shutterstock_110381039.jpg?w=150" medium="image">
			<media:title type="html">Smart tablet</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>
	</item>
		<item>
		<title>Lexee gives devs a new Siri-like tool for apps</title>
		<link>http://gigaom.com/2012/08/13/lexee-gives-devs-a-new-siri-like-tool-for-apps/</link>
		<comments>http://gigaom.com/2012/08/13/lexee-gives-devs-a-new-siri-like-tool-for-apps/#comments</comments>
		<pubDate>Mon, 13 Aug 2012 13:01:51 +0000</pubDate>
		<dc:creator>Ryan Kim</dc:creator>
				<category><![CDATA[angel]]></category>
		<category><![CDATA[Lexee]]></category>
		<category><![CDATA[Nuance]]></category>
		<category><![CDATA[personal assistant]]></category>
		<category><![CDATA[siri]]></category>
		<category><![CDATA[speech recognition]]></category>
		<category><![CDATA[voice-technology]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=552236</guid>
		<description><![CDATA[Developers are getting another Siri-like tool to embed speech-enabled virtual assistants in their apps. Lexee, a new product from interactive voice response provider Angel, offers an SDK for iOS and Android users. Lexee has a point and click system for building conversation flows and user analytics.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=552236&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom2.files.wordpress.com/2012/08/lexee.jpg"><img  title="lexee" src="http://gigaom2.files.wordpress.com/2012/08/lexee.jpg?w=153&#038;h=300" alt="" width="153" height="300" class="alignleft size-medium wp-image-552243" /></a>Mobile developers and businesses are increasingly getting the tools to embed Siri-like personal assistants into their mobile apps. Following the launch <a href="http://gigaom.com/mobile/nuance-takes-siri-down-to-the-app-level-with-new-voice-assistant/">earlier this month of Nuance&#8217;s Nina,</a> Angel, an interactive voice response provider is releasing its own mobile SDK for iOS and Android called Lexee, which will enable developers to voice-enable their apps.</p>
<p>Lexee actually builds off Nuance&#8217;s technology for online speech recognition and speech to text, so it&#8217;s similar in actual performance. The system is designed to help users accomplish tasks and interact using conversational speech. But Angel believes Lexee can stand out because of the ease with which developers can create their conversation flows and the analytical data that they can get back from Lexee.</p>
<p>Lexee uses a simple point-and-click system called SiteBuilder, a toolkit that Angel first developed for IVR and call center applications. It allows non-developers to easily apply voice to any customer interaction. Lexee will also rely on Angel&#8217;s analytics, so developers will get back information on how consumers are interacting with Lexee and where any problems might arise in conversation flows.</p>
<p>Angel President Dave Rennyson said neither Nina nor AT&amp;T&#8217;s Watson, which <a href="http://gigaom.com/2012/04/19/at-att-labs-universal-translators-and-wearable-keys/">also has its own SDK</a>, provides that mix of ease of use and reporting. He said Lexee was used to voice-enable a Salesforce.com application in just four days. Angel, a division of MicroStrategy, currently has about 1,000 enterprise customers using its existing customer experience management tools including Best Buy, Pfizer, Barnes &amp; Noble and Facebook. He said those companies could be interested in re-using some of their work and applying Lexee to their mobile applications.</p>
<p>Lexee is just another sign of how our future will be <a href="http://gigaom.com/2012/02/08/the-coming-age-of-the-smart-personal-assistant/">filled with voice-enabled personal assistants. </a>Siri has opened a lot of people&#8217;s eyes as to how speech can be used to get at complex tasks and can be combined with other tools to make us a lot more efficient.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=552236&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=648762"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=648762" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=552236+lexee-gives-devs-a-new-siri-like-tool-for-apps&utm_content=oryankim">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2011/10/siri-say-hello-to-the-coming-invisible-interface/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=552236+lexee-gives-devs-a-new-siri-like-tool-for-apps&utm_content=oryankim">Siri: Say hello to the coming &#8220;invisible interface&#8221;</a></li><li><a href="http://pro.gigaom.com/2012/01/newnet-q4-platform-mania-and-social-commerce-shakeout/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=552236+lexee-gives-devs-a-new-siri-like-tool-for-apps&utm_content=oryankim">NewNet Q4: Platform mania and social commerce shakeout</a></li><li><a href="http://pro.gigaom.com/2012/01/newnet-q4-platform-mania-and-social-commerce-shakeout/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=552236+lexee-gives-devs-a-new-siri-like-tool-for-apps&utm_content=oryankim">NewNet Q4: Platform mania and social commerce shakeout</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/08/13/lexee-gives-devs-a-new-siri-like-tool-for-apps/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/08/lexee2-e1344862208437.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/08/lexee2-e1344862208437.jpg?w=150" medium="image">
			<media:title type="html">lexee2</media:title>
		</media:content>

		<media:content url="http://2.gravatar.com/avatar/81c4fca1b2d82a7fb9c8657de52386d1?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">oryankim</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/08/lexee.jpg?w=153" medium="image">
			<media:title type="html">lexee</media:title>
		</media:content>
	</item>
		<item>
		<title>T-Mobile’s Genius voice-command button just got a lot smarter</title>
		<link>http://gigaom.com/2012/08/09/t-mobiles-genius-voice-command-button-just-got-a-lot-smarter/</link>
		<comments>http://gigaom.com/2012/08/09/t-mobiles-genius-voice-command-button-just-got-a-lot-smarter/#comments</comments>
		<pubDate>Thu, 09 Aug 2012 22:57:24 +0000</pubDate>
		<dc:creator>Kevin Fitchard</dc:creator>
				<category><![CDATA[natural language understanding]]></category>
		<category><![CDATA[semantic search]]></category>
		<category><![CDATA[speech recognition]]></category>
		<category><![CDATA[voice command]]></category>
		<category><![CDATA[voice search]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=551613</guid>
		<description><![CDATA[T-Mobile is giving its rather pathetic MyTouch voice-command feature a much-needed overhaul. It's incorporating the same semantic-search technology Nuance uses in Dragon Go into Genius, allowing the voice assistant to search over 200 different content providers and understand intent rather than just words.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=551613&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Anyone who has ever <a href="http://gigaom.com/2010/06/03/mytouch-3g-slide-proves-software-can-differentiate-android-hardware/">owned a T-Mobile MyTouch</a> is familiar with the little button on the lower right-hand corner labeled with stylized “G.” It’s T-Mobile’s Genius button, which, once pushed, allows you to issue basic voice commands from calling or texting a contact to searching the web or Google Maps. If you’ve used it before then you know: Siri it’s not.</p>
<p>The service’s vocabulary and contextual understanding is pretty limited. For instance, if you ask Genius to “find a restaurant” it will pull up the nearest eatery on Google Maps. But if you ask it to find “nearby restaurants” it searches Maps for a joint named “Nearby.” If you’re like me, you’ve probably fiddled with the button a few times and never touched it again, despite its relative convenience on the phone’s faceplate.</p>
<p>But T-Mobile has given Genius a much-needed overhaul, at least on the latest versions of the <a href="http://gigaom.com/mobile/making-t-mos-mytouch-is-just-step-1-of-huaweis-master-plan/">MyTouch manufactured by Huawei</a>. Nuance Communications, which powers the voice-recognition features on T-Mobile devices, is upgrading Genius’ capabilities and the features of the Genius service, providing a deeper level of natural-language understanding and integrating the service with a much broader array of content sources beyond Maps and Google Search.</p>
<p>For instance, if I were to ask the new Genius for nearby restaurants, it would not only understand my intent but also do its searching on Yelp, pulling up dining options nearby and displaying their rankings and reviews. If I were to change that command to “make a reservation at nearby restaurants,” it would bring me to OpenTable’s website and display eateries in the vicinity that accept online bookings.</p>
<p><a href="http://gigaom.com/apple/dragon-go-app-gets-smart-with-voice-search/dragongo/" rel="attachment wp-att-376328"><img title="dragongo" src="http://gigaom2.files.wordpress.com/2011/07/dragongo-e1310663017264.png?w=300&#038;h=218" alt="" width="300" height="218" class="alignleft size-medium wp-image-376328"></a>If this sounds familiar then you’ve probably used <a href="http://gigaom.com/mobile/a-siri-for-other-phones-nuance-brings-dragon-go-to-android/">Nuance’s consumer semantic-search app, Dragon Go</a>, available for Android and iOS smartphones. In fact, if you look at the long list of 200 content partners the Genius can access, they’re the same used by Dragon Go. Nuance wouldn’t acknowledge specifically that T-Mobile is white-labeling the semantic-search app, but it’s pretty obvious that’s exactly what it’s doing — and it’s by no means a bad thing. I’m a big fan of Nuance’s intuitive little search app, and being able to access it in fewer steps is a bonus.</p>
<p>The official line, though, is that T-Mobile has basically upgraded its relationship with Nuance to a kind of platinum status. The old Genius tapped into Nuance’s basic speech-recognition APIs, but it had none of the rules-based language-parsing abilities of Nuance’s more-sophisticated offerings. By adding greater contextual understanding and a host of content providers, T-Mobile may be able to turn a pretty lame voice-command feature into something quite useful. So far, though, it’s only available on the MyTouch and MyTouch Q.</p>
<p>Both T-Mobile and Nuance will be represented at <a href="http://event.gigaom.com/mobilize/?utm_source=mobile&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=551613+t-mobiles-genius-voice-command-button-just-got-a-lot-smarter&amp;utm_content=kfitchard">GigaOM’s Mobilize conference</a> next month. Nuance CTO Vlad Sejnoha will be speaking on a panel about the future smartphone interface on Sept. 20, while T-Mobile SVP of marketing Brad Duea will discuss the evolution of voice services on Sept. 21.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=551613&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=528264"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=528264" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=551613+t-mobiles-genius-voice-command-button-just-got-a-lot-smarter&utm_content=kfitchard">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/02/ces-2012-a-recap-and-analysis/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=551613+t-mobiles-genius-voice-command-button-just-got-a-lot-smarter&utm_content=kfitchard">CES 2012: a recap and analysis</a></li><li><a href="http://pro.gigaom.com/2011/12/carrier-iq-and-the-continued-erosion-of-operator-trust/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=551613+t-mobiles-genius-voice-command-button-just-got-a-lot-smarter&utm_content=kfitchard">Carrier IQ and the continued erosion of operator trust</a></li><li><a href="http://pro.gigaom.com/2011/09/the-future-of-mobile-a-segment-analysis-by-gigaom-pro/?utm_source=mobile&utm_medium=editorial&utm_campaign=auto3&utm_term=551613+t-mobiles-genius-voice-command-button-just-got-a-lot-smarter&utm_content=kfitchard">The future of mobile: a segment analysis by GigaOM Pro</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/08/09/t-mobiles-genius-voice-command-button-just-got-a-lot-smarter/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/03/t-mobile-mytouch-huawei.jpeg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/03/t-mobile-mytouch-huawei.jpeg?w=150" medium="image">
			<media:title type="html">T-Mobile-myTouch-Huawei</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/0544c4b228f8fa80e31bb952501cd7a4?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">kfitchard</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2011/07/dragongo-e1310663017264.png?w=300" medium="image">
			<media:title type="html">dragongo</media:title>
		</media:content>
	</item>
		<item>
		<title>For Google, keeping search relevant means baking big data into everything</title>
		<link>http://gigaom.com/2012/08/08/for-google-keeping-search-relevant-means-baking-big-data-into-everything/</link>
		<comments>http://gigaom.com/2012/08/08/for-google-keeping-search-relevant-means-baking-big-data-into-everything/#comments</comments>
		<pubDate>Wed, 08 Aug 2012 22:05:32 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[graph database]]></category>
		<category><![CDATA[Knowledge Graph]]></category>
		<category><![CDATA[machine-learning]]></category>
		<category><![CDATA[Pregel]]></category>
		<category><![CDATA[search engines]]></category>
		<category><![CDATA[semantic search]]></category>
		<category><![CDATA[speech recognition]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=551119</guid>
		<description><![CDATA[Google has opened its Knowledge Graph to the English-speaking world and has made intelligent voice search possible on mobile phones. Underneath it all, of course, are ever more-complex methods of analyzing data to make search smarter and easier than it has any business being.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=551119&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>It&#8217;s a fashionable practice in the Valley to write off Google&#8217;s search business, but the company is putting its big data chops to the test to prove doubters wrong. In a Wednesday morning blog post, Google SVP of Search Amit Singhal <a href="http://googleblog.blogspot.com/2012/08/building-search-engine-of-future-one.html">announced that Google&#8217;s Knowledge Graph is now live</a> across every English-speaking country in the world, and that voice search on mobile phones has been improved to understand user intent. Useful, yes, but the real story is the technology that makes these features work.</p>
<p>For Google, it&#8217;s all about collecting and analyzing billions of data points to learn what each one really means. With Knowledge Graph, for example, Google uses a &#8220;database of more than 500 million real-world people, places and things with 3.5 billion attributes and connections among them.&#8221; It&#8217;s those connections that are the key, as they&#8217;re what make the system smart enough to know what you&#8217;re looking for that wouldn&#8217;t naturally show up in a standard keyword search.</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/08/paris.jpg"><img  title="paris" src="http://gigaom2.files.wordpress.com/2012/08/paris.jpg?w=708" alt=""   class="aligncenter size-full wp-image-551163" /></a></p>
<p>Although Google hasn&#8217;t come out and said so, I&#8217;d imagine the Knowledge Graph utilizes <a href="http://googleresearch.blogspot.com/2009/06/large-scale-graph-computing-at-google.html">Google&#8217;s Pregel graph processing engine</a>. Graph processing and databases are catching on in social networks and other large-scale environments because they organize pieces of data by how they&#8217;re connected to one another. Those connections are called edges, and they&#8217;d keep Knowledge Graph results both informative and focused because the system knows how closely they&#8217;re related in any given circumstance.</p>
<p>This example of a personalized interest graph from Gravity Labs illustrates how one might visualize a graph, in this case <a href="http://gigaom.com/cloud/the-personalized-web-is-just-an-interest-graph-away/">the connections between a reader&#8217;s perceived interests</a>:</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/08/canvas-copy.jpeg"><img  title="canvas-copy" src="http://gigaom2.files.wordpress.com/2012/08/canvas-copy.jpeg?w=708" alt=""   class="aligncenter size-full wp-image-551162" /></a></p>
<p>Of course, Google has another tool at its disposal, which is the collective wisdom it&#8217;s able to glean from billions of searches every day. So, as Singhal wrote when <a href="http://googleblog.blogspot.com/2012/05/introducing-knowledge-graph-things-not.html">first explaining Knowledge Graph in May</a>, &#8220;[W]e can now sometimes help answer your next question before you’ve asked it, because the facts we show are informed by what other people have searched for. For example, the information we show for Tom Cruise answers 37 percent of next queries that people ask about him.&#8221;</p>
<p>Google&#8217;s other big announcement today is improved voice search on mobile phones, both Android and iOS. Here&#8217;s how Singhal describes the new capability:</p>
<blockquote><p>You just need to tap the microphone icon and ask your question, the same way you’d ask a friend. For example, ask “What movies are playing this weekend?” and you’ll see your words streamed back to you quickly as you speak. Then Google will show you a list of the latest movies in theaters near you, with schedules and even trailers. &#8230; When Google can supply a direct answer to your question, you’ll get a spoken response too.</p></blockquote>
<p>On Monday, a Google Research blog post noted how the company&#8217;s work on neural networks &#8212; which it <a href="http://gigaom.com/2012/06/25/how-google-is-teaching-computers-to-see/">famously used to train a system capable of detecting cats and human faces</a> in video streams &#8212; is <a href="http://googleresearch.blogspot.com/2012/08/speech-recognition-and-deep-learning.html">being used to power speech recognition</a> in the Jelly Bean release of Android. Seventeen-year-old Brittany Wenger recently won the Google Science Fair by building an application atop Google App Engine that <a href="http://googleappengine.blogspot.com/2012/08/neural-network-for-breast-cancer-data.html">uses a neural network to help detect breast cancer</a>.</p>
<p>As one might imagine, however, the big challenge for Google, <a href="http://searchengineland.com/bing-britannica-partnership-123930">Microsoft</a> , <a href="http://gigaom.com/2011/11/16/misconceptions-in-ai-or-why-watson-cant-talk-to-siri/">Apple</a> and everyone else trying to provide intelligent but intuitive user experiences is figuring out how to shape high computer science into easily digestible formats on ever-smaller devices. Search would certainly be a more effective tool if everyone could write complex queries directly against a company&#8217;s database, but the trick is making products good enough that we don&#8217;t have to. It&#8217;s boiling years of machine learning, natural-language processing and neural network research into &#8220;you ask a question and your phone spits back the right answer.&#8221;</p>
<p><em>Feature image courtesy of <a href="http://www.shutterstock.com/gallery-65904p1.html">Shutterstock user Sebastian Kaulitzki</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=551119&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=981509"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=981509" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=551119+for-google-keeping-search-relevant-means-baking-big-data-into-everything&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/how-to-use-big-data-to-make-better-business-decisions/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=551119+for-google-keeping-search-relevant-means-baking-big-data-into-everything&utm_content=dharrisstructure">How to use big data to make better business decisions</a></li><li><a href="http://pro.gigaom.com/2012/07/cloud-and-data-second-quarter-2012-analysis-and-outlook-2/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=551119+for-google-keeping-search-relevant-means-baking-big-data-into-everything&utm_content=dharrisstructure">Takeaways from the second quarter in cloud and data</a></li><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=551119+for-google-keeping-search-relevant-means-baking-big-data-into-everything&utm_content=dharrisstructure">The importance of putting the U and I in visualization</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/08/08/for-google-keeping-search-relevant-means-baking-big-data-into-everything/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/08/ai.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/08/ai.jpg?w=150" medium="image">
			<media:title type="html">AI</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/08/paris.jpg" medium="image">
			<media:title type="html">paris</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/08/canvas-copy.jpeg" medium="image">
			<media:title type="html">canvas-copy</media:title>
		</media:content>
	</item>
		<item>
		<title>Siri&#8217;s speech-to-text makes Apple target of yet another patent suit</title>
		<link>http://gigaom.com/2012/07/30/siris-speech-to-text-makes-apple-target-of-yet-another-patent-suit/</link>
		<comments>http://gigaom.com/2012/07/30/siris-speech-to-text-makes-apple-target-of-yet-another-patent-suit/#comments</comments>
		<pubDate>Mon, 30 Jul 2012 14:42:03 +0000</pubDate>
		<dc:creator>Erica Ogg</dc:creator>
				<category><![CDATA[Apple]]></category>
		<category><![CDATA[lawsuit]]></category>
		<category><![CDATA[siri]]></category>
		<category><![CDATA[speech recognition]]></category>
		<category><![CDATA[voice command]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=547959</guid>
		<description><![CDATA[Apple's Siri voice commands for iPhone have made it the target of a patent lawsuit from a university in Taiwan. The school's lawyers say they are also looking into whether to target similar technology from Microsoft and Google next.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=547959&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Apple is the target of yet another patent suit, this time over Siri&#8217;s voice-to-text capability. On Friday, Taiwan&#8217;s National Cheng Kung University sued Apple in the U.S. District Court for the Eastern District of Texas, <a href="http://www.reuters.com/article/2012/07/30/us-apple-patent-taiwan-idUSBRE86T0A920120730">according to Reuters</a>. The school says it owns two U.S. patents for the technology that allows Siri to translate voice commands into texts and notes.</p>
<p>The patents owned by National Cheng Kung University were issued in 2007 and 2010. Siri began life as a U.S. government-funded artificial intelligence project at <a href="http://en.wikipedia.org/wiki/SRI_International">SRI International</a>, but was spun out as its own company, Siri, in 2007 and commercialized. Apple bought Siri in 2010.</p>
<p>The calculation of the damages the university is seeking &#8220;would be based on Apple&#8217;s U.S. sales of devices that use Siri,&#8221; according to its lawyer. That basically means sales of all iPhone 4S and new iPads with iOS 6 software, which is coming this fall. So the school is looking for a very large chunk of change as compensation.</p>
<p>Apple is involved in dozens of lawsuits regarding its mobile technology, including the very high-profile Samsung trial that <a href="http://gigaom.com/2012/07/29/judge-in-apple-samsung-case-says-patent-drawings-can-speak-for-themselves/">kicks off Monday in California</a>. Is the overall climate surrounding handset makers and the mobile industry encouraging more of these kind of lawsuits from international companies? Or, as my colleague <a href="http://gigaom.com/apple/apple-plays-with-fire-in-chinese-trademark-stick-up/">Jeff Roberts has argued</a>, has Apple&#8217;s decision to settle with the near-bankrupt Proview over the iPad trademark in China last month for $60 million emboldened more potential plaintiffs?</p>
<p>Apple is the only target of the school&#8217;s suit for now, but Siri competitors are in the crosshairs as well: the school&#8217;s lawyer told Reuters it is also looking at both Microsoft and Google&#8217;s speech-to-text products for possible violations of the same two patents.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=547959&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=847952"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=847952" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=apple&utm_medium=editorial&utm_campaign=auto3&utm_term=547959+siris-speech-to-text-makes-apple-target-of-yet-another-patent-suit&utm_content=ericaogg">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/05/the-living-room-reinvented-trends-technologies-and-companies-to-watch/?utm_source=apple&utm_medium=editorial&utm_campaign=auto3&utm_term=547959+siris-speech-to-text-makes-apple-target-of-yet-another-patent-suit&utm_content=ericaogg">Who and what to watch in the new era of the living room</a></li><li><a href="http://pro.gigaom.com/2012/04/connected-consumer-q1-controversy-courtrooms-and-the-cloud/?utm_source=apple&utm_medium=editorial&utm_campaign=auto3&utm_term=547959+siris-speech-to-text-makes-apple-target-of-yet-another-patent-suit&utm_content=ericaogg">Controversy, courtrooms and the cloud in Q1</a></li><li><a href="http://pro.gigaom.com/2012/02/ces-2012-a-recap-and-analysis/?utm_source=apple&utm_medium=editorial&utm_campaign=auto3&utm_term=547959+siris-speech-to-text-makes-apple-target-of-yet-another-patent-suit&utm_content=ericaogg">CES 2012: a recap and analysis</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/07/30/siris-speech-to-text-makes-apple-target-of-yet-another-patent-suit/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2011/10/siri-use-1.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2011/10/siri-use-1.jpg?w=150" medium="image">
			<media:title type="html">siri-use (1)</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/f8c30e1552769600b61214d57219220b?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">ericaogg</media:title>
		</media:content>
	</item>
	</channel>
</rss>
