<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>GigaOM &#187; Hilary Mason</title>
	<atom:link href="http://gigaom.com/tag/hilary-mason/feed/" rel="self" type="application/rss+xml" />
	<link>http://gigaom.com</link>
	<description></description>
	<lastBuildDate>Thu, 20 Jun 2013 09:56:05 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='gigaom.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/0db8f6557d022075dbbf010c54d46d93?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>GigaOM &#187; Hilary Mason</title>
		<link>http://gigaom.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://gigaom.com/osd.xml" title="GigaOM" />
	<atom:link rel='hub' href='http://gigaom.com/?pushpress=hub'/>
		<item>
		<title>Why 3 celebrity data scientists are willing to work for free &#8212; for you</title>
		<link>http://gigaom.com/2013/05/08/why-3-celebrity-data-scientists-are-willing-to-work-for-free-for-you/</link>
		<comments>http://gigaom.com/2013/05/08/why-3-celebrity-data-scientists-are-willing-to-work-for-free-for-you/#comments</comments>
		<pubDate>Wed, 08 May 2013 16:58:30 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[data science]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hilary Mason]]></category>
		<category><![CDATA[Mortar Data]]></category>
		<category><![CDATA[recommendation engines]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=643353</guid>
		<description><![CDATA[Hadoop startup Mortar Data is offering to build recommendation systems for 10 companies, with help from Hilary Mason, Drew Conway and Max Shron. It's part of a bigger plan to democratize the science behind online recommendations.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=643353&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Hadoop-in-the-cloud startup Mortar Data is on a mission to bring recommendation engines to the masses, and it has recruited three well-known data scientists to aid its cause. On Wednesday, the company will start accepting applications <a href="http://mortardata.com/">on its website</a> from companies that would like to have Mortar Data &#8212; as well as Bit.ly&#8217;s <a href="http://www.hilarymason.com/">Hilary Mason</a>, IA Ventures Scientist-in-Residence <a href="http://drewconway.com/">Drew Conway</a> and freelancer (and former OKCupid data scientist) <a href="http://shron.net/about">Max Shron</a> &#8212; build a custom recommendation system for them.</p>
<p>The way it works, said Mortar Co-founder and CEO K Young, is that his company will choose eight companies (in addition to the two it has been working with already) to implement custom systems based on their specific needs and businesses. Mason, Conway and Shron will split their time among the 10 total companies, but will be much more than advisers &#8212; they&#8217;ll actually dig into the data and work hands-on to ensure the right techniques and algorithms are applied in the right places.</p>
<p>The applicant companies will keep any custom code, but the ultimate goal from Mortar&#8217;s perspective is to learn some best practices and create reusable building blocks that will let anyone create recommendation engines without pre-existing data science knowledge. Recommendation engines <a href="http://gigaom.com/2013/01/29/you-might-also-like-to-know-how-online-recommendations-work/">are commonplace on large web sites</a> (Netflix, Spotify, iTunes, Google, Amazon, <a href="http://gigaom.com/2013/03/03/how-and-why-linkedin-is-becoming-an-engineering-powerhouse/">LinkedIn</a>, Eventbrite and the list goes on) but smaller companies can sometimes struggle to do them, or to do them well. Young hopes Mortar can establish an open source reference architecture of sorts that makes it easy to implement everything from building data pipelines to the actual algorithms that power recommendations.</p>
<p>&#8220;They&#8217;re really common and they&#8217;re really useful, but they&#8217;re really hard,&#8221; he said. &#8220;That&#8217;s why [a reference implementation] hasn&#8217;t been done before.&#8221;</p>
<div id="attachment_643436" class="wp-caption aligncenter" style="width: 718px"><a href="http://gigaom2.files.wordpress.com/2013/05/gernres-support-1.jpg"><img  alt="They can get pretty complex, as evidence by this Netflix example." src="http://gigaom2.files.wordpress.com/2013/05/gernres-support-1.jpg?w=708&#038;h=358" width="708" height="358" class="size-large wp-image-643436" /></a><p class="wp-caption-text">They can get pretty complex, as evidence by this Netflix example.</p></div>
<p>Presently, Young explained, anyone wanting to build a recommendation system probably knows some of the algorithms to begin with and then gets to work researching how to implement them with specific processing frameworks (e.g., MapReduce) and on their specific data. Alternatively, they might have to hire a consultant that helps them build the recommendation engine. Either way, he noted, they&#8217;re probably not open sourcing it at the end because it&#8217;s presumed too valuable a competitive edge.</p>
<p>Mortar Data&#8217;s recommendation framework will be based on Pig, Python and Java, <a href="http://gigaom.com/2012/11/28/mortar-data-wants-to-become-a-hadoop-developers-best-friend/">just like the company&#8217;s flagship platform</a> for creating Hadoop jobs. Those languages will make the implementation more accessible and customizable by more people, Young said.</p>
<p>Really, he added, any web site or service that has multiple customers and deals with multiple entities &#8212; be they restaurants, songs, dating profiles, artisan necklaces, what have you &#8212; should have some sort of recommendation engine to help provide a more-intelligent customer experience. &#8220;It should become so ubiquitous that any service you go to knows enough about you to put forward the things you actually want to see,&#8221; Young said.</p>
<p>There is, however, one catch to Mortar&#8217;s plans as they stand: Because the service is hosted on Amazon Web Services, anyone interested in having Mason, Conway, Shron and Mortar work on their systems must have their data in AWS or be able to move it there. The initial reference implementation will likely be AWS-centric, too, but Young hopes contributors will use it and share methods for running it atop other platforms.</p>
<p><em>Feature image of Hilary Mason at Structure: Data 2011 courtesy of Pinar Ozger (www.pinarozger.com).</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=643353&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=57757"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=57757" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=643353+why-3-celebrity-data-scientists-are-willing-to-work-for-free-for-you&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/08/why-3-celebrity-data-scientists-are-willing-to-work-for-free-for-you/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/05/hilarymason.jpeg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/05/hilarymason.jpeg?w=150" medium="image">
			<media:title type="html">hilarymason</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/gernres-support-1.jpg?w=708" medium="image">
			<media:title type="html">They can get pretty complex, as evidence by this Netflix example.</media:title>
		</media:content>
	</item>
		<item>
		<title>Bit.ly says CEO Peter Stern is stepping down</title>
		<link>http://gigaom.com/2013/03/11/bit-ly-says-ceo-peter-stern-is-stepping-down-2/</link>
		<comments>http://gigaom.com/2013/03/11/bit-ly-says-ceo-peter-stern-is-stepping-down-2/#comments</comments>
		<pubDate>Mon, 11 Mar 2013 19:50:15 +0000</pubDate>
		<dc:creator>Eliza Kern</dc:creator>
				<category><![CDATA[bit.ly]]></category>
		<category><![CDATA[Hilary Mason]]></category>
		<category><![CDATA[Peter Stern]]></category>
		<category><![CDATA[Structure Data 2013]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=619328</guid>
		<description><![CDATA[Bit.ly, the company that works to help customers understand who is clicking on their links and measure news in real time, announced Monday that the company's CEO will be stepping down.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=619328&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="https://bitly.com/" target="_blank">Bit.ly</a>, the data company best known for <a href="http://gigaom.com/2012/11/15/datasift-teams-with-bitly-to-show-what-social-content-really-gets-clicks/" target="_blank">helping its customers understand who is clicking on their links</a>, announced Monday that <a href="http://www.linkedin.com/in/iampeter" target="_blank">CEO Peter Stern</a> will be stepping down. The company did not give much explanation for Stern’s departure in a short blog post Monday:</p>
<blockquote id="quote-bitly-is-announcing-"><p>“Bitly is announcing today that Peter Stern has resigned to pursue other interests. “Peter has been a key leader and contributor to the Company,” said Bitly Board member Sam Mandel. “In particular he has been instrumental in transforming Bitly into a successful business while growing its unparalleled data set. We are very happy that he will remain a shareholder and supporter.””</p></blockquote>
<p>Stern was previously the founder and CEO of Zenbe, a mobile and web-based collaboration <a href="http://allthingsd.com/20101116/one-more-new-york-acqhire-for-facebook-zenbi/" target="_blank">service that was acquired for talent by Facebook in 2010</a>.</p>
<p><a href="http://gigaom.com/2013/01/08/bit-ly-wants-to-help-developers-play-with-data-under-new-real-time-search-api/" target="_blank">Bit.ly released a new API in January</a> to allow companies to track the flow of news and information on Twitter and across the web, and has been focused on providing more real-time analysis. We’ll be talking about some of the challenges associated with real-time data at <a href="http://event.gigaom.com/structuredata/?utm_source=data&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=619328+bit-ly-says-ceo-peter-stern-is-stepping-down-2&amp;utm_content=elizakern" target="_blank">our Structure Data conference next week in New York City</a>.</p>
<p>“Bit.ly is in a great position to see this kind of social data. We see a slice of what people are sharing and reading across the social web. So this is a way to start to put that power in the hands of people who can build interesting things with it,”  <a href="http://www.linkedin.com/in/hilarymason" target="_blank">Hilary Mason, Bit.ly’s chief scientist</a>, said in an interview in January.</p>
<p>Bit.ly is a company that grew out of <a href="http://gigaom.com/2012/07/12/digg-this-former-social-sharing-superstar-sold-for-500k/" target="_blank">Betaworks</a>, and it <a href="http://blog.bitly.com/post/39056042835/2012-in-the-bitlyverse" target="_blank">shortened 8.4 billion links in 2012</a>. <a href="http://gigaom.com/2012/07/10/social-discovery-service-bitly-raises-15m-round-led-by-khosla-ventures/" target="_blank">Bit.ly raised $15 million led by Khosla Ventures this summer</a> and has rasied more than $28 million in total.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=619328&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=441693"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=441693" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=619328+bit-ly-says-ceo-peter-stern-is-stepping-down-2&utm_content=elizakern">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/frenemy-mine-the-pros-and-cons-of-social-partnerships-for-online-media-companies/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=619328+bit-ly-says-ceo-peter-stern-is-stepping-down-2&utm_content=elizakern">Frenemy mine: The pros and cons of social partnerships for online media companies</a></li><li><a href="http://pro.gigaom.com/2012/10/the-state-of-cross-platform-measurement-across-tv-online-and-social/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=619328+bit-ly-says-ceo-peter-stern-is-stepping-down-2&utm_content=elizakern">The state of cross-platform media measurement</a></li><li><a href="http://pro.gigaom.com/2012/07/social-tv-apps-understanding-consumer-behavior-and-the-evolving-ecosystem/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=619328+bit-ly-says-ceo-peter-stern-is-stepping-down-2&utm_content=elizakern">Social-TV apps and consumer behavior</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/03/11/bit-ly-says-ceo-peter-stern-is-stepping-down-2/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/03/1c3c6ae.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/03/1c3c6ae.jpg?w=150" medium="image">
			<media:title type="html">Peter Stern, Bit.ly CEO (former)</media:title>
		</media:content>

		<media:content url="http://2.gravatar.com/avatar/bd7905cba2440e49d86bd328573730f7?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">elizakern</media:title>
		</media:content>
	</item>
		<item>
		<title>Bit.ly wants to help developers play with data under new real-time search API</title>
		<link>http://gigaom.com/2013/01/08/bit-ly-wants-to-help-developers-play-with-data-under-new-real-time-search-api/</link>
		<comments>http://gigaom.com/2013/01/08/bit-ly-wants-to-help-developers-play-with-data-under-new-real-time-search-api/#comments</comments>
		<pubDate>Tue, 08 Jan 2013 19:00:10 +0000</pubDate>
		<dc:creator>Eliza Kern</dc:creator>
				<category><![CDATA[Hilary Mason]]></category>
		<category><![CDATA[realtime search]]></category>
		<category><![CDATA[social web]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=599992</guid>
		<description><![CDATA[On the heels of the release of its realtime search product for consumers this summer, Bit.ly is releasing an API Tuesday that will make the capabilities available to developers interested in incorporating Bit.ly's realtime data into their apps and products.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=599992&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Want to check out what foodies are reading in Brooklyn today? Or what football fans are reading in Tuscaloosa this morning? Bit.ly wants to help you track the movement of news and information in real time, and with the release of an API Tuesday, the company wants to help developers move this into their apps as well.</p>
<p><a href="http://gigaom.com/2013/01/08/bit-ly-wants-to-help-developers-play-with-data-under-new-real-time-search-api/giant_squid_story/" rel="attachment wp-att-600118"><img  alt="Giant Squid story Bit.ly realtime data analytics" src="http://gigaom2.files.wordpress.com/2013/01/giant_squid_story.png?w=284&#038;h=300" width="284" height="300" class="alignleft size-medium wp-image-600118" /></a>&#8220;The idea is that people will start to build real-time search into their applications,&#8221; said <a href="http://www.linkedin.com/in/hilarymason" target="_blank">Hilary Mason, Bit.ly&#8217;s chief scientist</a>, in an interview explaining the release. The company announced <a href="http://blog.bitly.com/post/40026085295/announcing-the-bitly-social-data-apis" target="_blank">full details of the API in a blog post on its website Tuesday</a>.</p>
<p>The company <a href="http://blog.bitly.com/post/28157667321/from-bitly-labs-say-hello-to-realtime" target="_blank">launched its real-time product to users this summer</a>. It is available to users with Bit.ly accounts at <a href="http://rt.ly/" target="_blank">rt.ly</a>, where they can fill out various information fields to see which news stories are getting the most hits across the globe, or see which topics are popular. For instance, this is where you could search for food articles in Brooklyn, New York, United States, and see that <a href="http://blogs.villagevoice.com/forkintheroad/2013/01/simit_--_a_turk.php" target="_blank">something called a &#8220;Turkish bagel&#8221;</a> is riling people up.</p>
<p>Now, with the release of this API, developers will be able to integrate the real-time search features into their apps, showing what information sent through Bit.ly is being read, where it&#8217;s being read, and how often. For instance, a publisher could show which of its articles are the most popular in a particular geographic area, or companies could track where around the globe consumers are reading about its product. Bit.ly&#8217;s real-time product also allows for the tracking of particular topics or phrases that are getting a good deal of attention via social media.</p>
<p>Of course, all of the tracked trends on what&#8217;s being read right now are necessarily stories tweeted from Bit.ly links, but that&#8217;s no insignificant measure. The company, which came <a href="http://gigaom.com/2012/07/12/digg-this-former-social-sharing-superstar-sold-for-500k/" target="_blank">from Betaworks</a>, <a href="http://blog.bitly.com/post/39056042835/2012-in-the-bitlyverse" target="_blank">shortened 8.4 billion links in 2012</a>. <a href="http://gigaom.com/2012/07/10/social-discovery-service-bitly-raises-15m-round-led-by-khosla-ventures/" target="_blank">The company raised $15 million led by Khosla Ventures this summer</a>, bringing its total funding to more than $28 million.</p>
<p>&#8220;Bit.ly is in a great position to see this kind of social data. We see a slice of what people are sharing and reading across the social web. So this is a way to start to put that power in the hands of people who can build interesting things with it,&#8221; she said.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=599992&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=686530"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=686530" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=599992+bit-ly-wants-to-help-developers-play-with-data-under-new-real-time-search-api&utm_content=elizakern">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/sector-roadmap-content-personalization-in-2013/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=599992+bit-ly-wants-to-help-developers-play-with-data-under-new-real-time-search-api&utm_content=elizakern">Sector RoadMap: Content personalization in 2013</a></li><li><a href="http://pro.gigaom.com/report/frenemy-mine-the-pros-and-cons-of-social-partnerships-for-online-media-companies/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=599992+bit-ly-wants-to-help-developers-play-with-data-under-new-real-time-search-api&utm_content=elizakern">Frenemy mine: The pros and cons of social partnerships for online media companies</a></li><li><a href="http://pro.gigaom.com/2012/10/the-state-of-cross-platform-measurement-across-tv-online-and-social/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=599992+bit-ly-wants-to-help-developers-play-with-data-under-new-real-time-search-api&utm_content=elizakern">The state of cross-platform media measurement</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/01/08/bit-ly-wants-to-help-developers-play-with-data-under-new-real-time-search-api/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/01/screen-shot-2013-01-08-at-9-06-19-am.png?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/01/screen-shot-2013-01-08-at-9-06-19-am.png?w=150" medium="image">
			<media:title type="html">Bit.ly pufferfish logo</media:title>
		</media:content>

		<media:content url="http://2.gravatar.com/avatar/bd7905cba2440e49d86bd328573730f7?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">elizakern</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/01/giant_squid_story.png?w=284" medium="image">
			<media:title type="html">Giant Squid story Bit.ly realtime data analytics</media:title>
		</media:content>
	</item>
		<item>
		<title>Publishers intimidated by big data should think about &#8220;little data&#8221; first</title>
		<link>http://paidcontent.org/2012/05/22/book-publishers-big-data/</link>
		<comments>http://paidcontent.org/2012/05/22/book-publishers-big-data/#comments</comments>
		<pubDate>Tue, 22 May 2012 17:24:11 +0000</pubDate>
		<dc:creator>Laura Hazard Owen</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[bitly]]></category>
		<category><![CDATA[Dominique Raccah]]></category>
		<category><![CDATA[e-books]]></category>
		<category><![CDATA[Google Ads]]></category>
		<category><![CDATA[Hilary Mason]]></category>
		<category><![CDATA[Joe Wikert]]></category>
		<category><![CDATA[sourcebooks]]></category>
		<category><![CDATA[Tools of Change]]></category>

		<guid isPermaLink="false">http://paidcontent.org/?p=209533</guid>
		<description><![CDATA[Big data is an intimidating concept for publishers who don't collect much info about their customers already. A first step is to think about "data-driven decision making," and little data, says bitly's Hilary Mason.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=524437&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaompaidcontent.files.wordpress.com/2012/04/tower-of-books_mlibrary1-e1335379517413.jpg"><img  title="tower of books_MLibrary" src="http://gigaompaidcontent.files.wordpress.com/2012/04/tower-of-books_mlibrary1-e1335379517413.jpg?w=300&#038;h=199" alt="" width="300" height="199" class="alignleft size-medium wp-image-204878" /></a>Big data is an intimidating concept for publishers who don&#8217;t collect much info about their customers already. A first step is to think about &#8220;data-driven decision making,&#8221; said <a href="http://bitly.com/">bitly</a> chief scientist Hilary Mason at a <a href="http://oreilly.com/toc-roundtable.html?imm_mid=084ff4&amp;cmp=em-toc-tc12-5-22executive-rountable-reminder">Tools of Change roundtable</a> this morning.</p>
<p>In the next couple of months, URL shortening service bitly is <a href="http://gigaom.com/cloud/big-apple-the-new-big-data-central/">launching</a> a real-time search engine that would let publishers see how their content &#8212; including books on Amazon &#8212; is being shared. And TOC&#8217;s Joe Wikert thinks publishers should track data like how far readers get in an e-book before they stop reading. &#8220;Amazon is probably gathering this data already,&#8221; he said. &#8220;At some point publishers will probably be able to buy access to that information,&#8221; but it&#8217;s even better if they can collect it themselves.</p>
<p>Audience member Dominique Raccah, publisher of the innovative Sourcebooks &#8212; whose e-book revenues <a href="http://paidcontent.org/2012/01/19/419-one-publishers-e-book-revenues-up-795-in-2011/">were up</a> 795 percent in 2011 &#8212; said publishers should use data to answer questions like how many of their books fail and how much it costs them to publish a book.</p>
<p>Sourcebooks is collecting data about their books and about how readers use their website. The biggest success they&#8217;ve had is using data to position books, Raccah said &#8212; &#8220;where we think the book is about one thing, and the consumer thinks the book is about something else.&#8221; Publishers can poll readers to find &#8220;what&#8217;s exciting somebody, what&#8217;s really turning them on &#8212; what it is that moves the needle for them.&#8221;</p>
<p>Bitly&#8217;s Mason agreed that data doesn&#8217;t have to be big or &#8220;require a huge amount of infrastructure&#8221; to be useful. And publishers can collect data through resources that weren&#8217;t explicitly designed for that purpose. For example, Mason said, if a publisher can&#8217;t decide what a book&#8217;s title should be, they should buy Google Ads for two different titles and see which title gets more clicks. The product you&#8217;re advertising, she said, &#8220;doesn&#8217;t actually have to exist.&#8221;</p>
<p><em>Correction: An earlier version of this story incorrectly stated Hilary Mason&#8217;s last name. Her last name is Mason, not Rosen. I apologize for the error.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=524437&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=470900"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=470900" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=media&utm_medium=editorial&utm_campaign=auto3&utm_term=524437+book-publishers-big-data&utm_content=laurahowen38">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/how-big-data-analytics-drives-competitive-advantage/?utm_source=media&utm_medium=editorial&utm_campaign=auto3&utm_term=524437+book-publishers-big-data&utm_content=laurahowen38">How big data analytics drives competitive advantage</a></li><li><a href="http://pro.gigaom.com/report/the-new-economics-of-enterprise-data-warehousing/?utm_source=media&utm_medium=editorial&utm_campaign=auto3&utm_term=524437+book-publishers-big-data&utm_content=laurahowen38">How data warehousing is now a cost-effective solution for businesses</a></li><li><a href="http://pro.gigaom.com/report/how-to-manage-big-data-without-breaking-the-bank/?utm_source=media&utm_medium=editorial&utm_campaign=auto3&utm_term=524437+book-publishers-big-data&utm_content=laurahowen38">How to manage big data without breaking the bank</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://paidcontent.org/2012/05/22/book-publishers-big-data/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:thumbnail url="http://gigaompaidcontent.files.wordpress.com/2012/04/tower-of-books_mlibrary1.jpg?w=150" />
		<media:content url="http://gigaompaidcontent.files.wordpress.com/2012/04/tower-of-books_mlibrary1.jpg?w=150" medium="image">
			<media:title type="html">tower of books_MLibrary</media:title>
		</media:content>

		<media:content url="http://2.gravatar.com/avatar/83965de6c2033ee5ab075123394cec0a?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">laurahowen38</media:title>
		</media:content>

		<media:content url="http://gigaompaidcontent.files.wordpress.com/2012/04/tower-of-books_mlibrary1-e1335379517413.jpg?w=300" medium="image">
			<media:title type="html">tower of books_MLibrary</media:title>
		</media:content>
	</item>
		<item>
		<title>Big Apple &#8211; the new Big Data Central?</title>
		<link>http://gigaom.com/2012/05/07/big-apple-the-new-big-data-central/</link>
		<comments>http://gigaom.com/2012/05/07/big-apple-the-new-big-data-central/#comments</comments>
		<pubDate>Mon, 07 May 2012 21:30:31 +0000</pubDate>
		<dc:creator>Om Malik</dc:creator>
				<category><![CDATA[bit.ly]]></category>
		<category><![CDATA[Dennis Crowley]]></category>
		<category><![CDATA[ebay]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Hilary Mason]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[Yahoo]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=518592</guid>
		<description><![CDATA[When it comes to NYC's tech scene, digital media and commerce companies get all the attention, but city's big opportunity could very well be as the hub for the "data centric" economy. From startups to web giants, many are making a beeline for the Big Apple.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=518592&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom.com/cloud/big-apple-the-new-big-data-central/hilarymason/" rel="attachment wp-att-518595"><img title="hilarymason" src="http://gigaom2.files.wordpress.com/2012/05/hilarymason.jpeg?w=300&#038;h=200" alt="" width="300" height="200" class="alignright size-medium wp-image-518595"></a>When it comes to New York’s booming tech startup scene, digital media and new commerce companies get all the attention, but in reality, the Big Apple’s big opportunity might well be as the hub for “data-centric” businesses. In taking advantage of this opportunity, the city wouldn’t be going too far from its traditional strengths.</p>
<p>I was reminded of New York’s role in the emerging big data ecosystem over the weekend, when I was catching up with my weekly reading.</p>
<ul><li>eBay, which had acquired Hunch in November 2011, is now building out its operations in New York with a new 35,000 square-foot office, <a href="http://gigaom.com/2012/05/03/ebay-puts-down-roots-in-nyc-with-data-focused-tech-center/">Ryan Kim reported last week</a>. ”When fully built out, a majority of the workers will be developers, data scientists and statisticians,” he added.</li>
<li>Microsoft has <a href="http://allthingsd.com/20120502/microsoft-hires-14-yahoo-researchers-to-kickstart-new-nyc-research-lab/">hired a bunch of former Yahoo</a> researchers in New York.</li>
</ul><p>Steve Lohr of <a href="http://bits.blogs.nytimes.com/2012/05/03/microsoft-taps-yahoo-scientists-for-new-york-research-lab/"><em>The New York Times</em> wrote</a> about Microsoft:</p>
<blockquote><p>The group’s research focuses in large part on the application of advanced computing tools to the social sciences. It is a fast-growing field fueled by the vast new data sources of the Web, social-network communications and from sensor-equipped devices like smartphones. The potential is enormous, as Google and Facebook prove. But Microsoft has trailed so far.</p></blockquote>
<p>Beyond these two recent announcements, there are several other startups that are experimenting with data. Take URL shortener, <a href="http://bit.ly">Bit.ly</a>, for example. The service, which is based in Manhattan, is creating new news reader experiences based on the data it collects and from the social context that data carries. <a href="http://www.businessweek.com/articles/2012-04-26/hilary-mason-from-tiny-links-big-insights">Businessweek writes</a>:</p>
<blockquote><p>This year, Bitly is introducing a suite of data products for professionals developed in part by Mason and her team of six scientists and engineers. One, dubbed Bitly Realtime, tracks terms that receive sudden bursts of attention. Another is a reputation-monitoring system. The goal of the products is “to give people a Spidey sense about what’s going on on the Internet that’s relevant to them,” says Mason.</p></blockquote>
<p>Hilary Mason, who is the chief scientist at Bit.ly told me that in “New York we are more interested in telling stories” from of data as opposed to how “big” the data is, or what database technology you happen to use, Mason told us. “This is how businesses, marketers, and social scientists need to think about data to make rational decisions.”</p>
<p>New York has a long history of learning from data, thanks to the quantitive revolution that swept Wall Street. Financial services, was the first real big data vertical, and quants were Wall Street’s data scientists. The markets for tradable instruments, high-volume and high-velocity data streams all came from Wall Street. (Actually, that was the rationale behind why we host our <a href="http://event.gigaom.com/structuredata/?utm_source=cloud&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=518592+big-apple-the-new-big-data-central&amp;utm_content=om">Structure: Data conference</a> in New York.)</p>
<p><a href="http://gigaom.com/cloud/big-apple-the-new-big-data-central/explore-search-1/" rel="attachment wp-att-518743"><img title="Explore-search-1" src="http://gigaom2.files.wordpress.com/2012/05/explore-search-1.jpg?w=300&#038;h=220" alt="" width="300" height="220" class="alignleft size-medium wp-image-518743"></a>Mason believes that New York can leverage big data to its advantage. From art to fashion to media, New York has enough creative talent to be able to ask the right questions from the data. A good example is the Explore feature on <a href="http://foursquare.com">Foursquare</a>, which co-founder Dennis Crowley calls the “<a href="http://gigaom.com/2012/01/12/foursquare-still-mobile-first-but-with-more-love-for-the-web/">big data driven recommendation engine for the real world</a>.” (Here is a presentation of technology <a href="http://engineering.foursquare.com/2011/08/03/foursquares-data-and-the-explore-recommendation-engine/">behind Explore that is pretty cool</a>.)</p>
<p>Similar startups such as <a href="http://knod.es/">Knod.es</a> are helping make sense of social data as well. For instance, the startup sucks in your social data across platforms and let you ask questions like “Who do I know who likes cheeseburgers in Paris?”</p>
<p>“There are a bunch of interesting things happening in NYC, not all of which are startups,” Mason says, singling out Wes McKinney’s <a href="http://pandas.pydata.org/">the Pandas Project</a>, which is a time-series analysis system drawn from his experience in finance. Mason calls it “awesome.”</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=518592&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=200617"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=200617" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=518592+big-apple-the-new-big-data-central&utm_content=om">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/how-the-mega-data-center-is-changing-the-hardware-and-data-center-markets/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=518592+big-apple-the-new-big-data-central&utm_content=om">How the mega data center is changing the hardware and data center markets</a></li><li><a href="http://pro.gigaom.com/2012/10/the-state-of-cross-platform-measurement-across-tv-online-and-social/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=518592+big-apple-the-new-big-data-central&utm_content=om">The state of cross-platform media measurement</a></li><li><a href="http://pro.gigaom.com/2012/10/social-third-quarter-2012-analysis-and-outlook/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=518592+big-apple-the-new-big-data-central&utm_content=om">Social third-quarter 2012: analysis and outlook</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/05/07/big-apple-the-new-big-data-central/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/05/hilarymason.jpeg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/05/hilarymason.jpeg?w=150" medium="image">
			<media:title type="html">hilarymason</media:title>
		</media:content>

		<media:content url="http://2.gravatar.com/avatar/89c6ff98059617751fcf312690965fa0?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">om</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/05/hilarymason.jpeg?w=300" medium="image">
			<media:title type="html">hilarymason</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/05/explore-search-1.jpg?w=300" medium="image">
			<media:title type="html">Explore-search-1</media:title>
		</media:content>
	</item>
	</channel>
</rss>
