<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>GigaOM &#187; big data</title>
	<atom:link href="http://gigaom.com/tag/big-data/feed/" rel="self" type="application/rss+xml" />
	<link>http://gigaom.com</link>
	<description></description>
	<lastBuildDate>Fri, 24 May 2013 23:25:10 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='gigaom.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/0db8f6557d022075dbbf010c54d46d93?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>GigaOM &#187; big data</title>
		<link>http://gigaom.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://gigaom.com/osd.xml" title="GigaOM" />
	<atom:link rel='hub' href='http://gigaom.com/?pushpress=hub'/>
		<item>
		<title>Steering clear of the iceberg: three ways we can fix the data-credibilty crisis in science</title>
		<link>http://gigaom.com/2013/05/24/steering-clear-of-the-iceberg-three-ways-we-can-fix-the-data-credibilty-crisis-in-science/</link>
		<comments>http://gigaom.com/2013/05/24/steering-clear-of-the-iceberg-three-ways-we-can-fix-the-data-credibilty-crisis-in-science/#comments</comments>
		<pubDate>Fri, 24 May 2013 19:41:49 +0000</pubDate>
		<dc:creator>Amanda Alvarez</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[data management]]></category>
		<category><![CDATA[data visualization]]></category>
		<category><![CDATA[Mendeley]]></category>
		<category><![CDATA[Public Library of Science]]></category>
		<category><![CDATA[reproducibility]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[research data alliance]]></category>
		<category><![CDATA[Science]]></category>
		<category><![CDATA[Trifacta]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=649037</guid>
		<description><![CDATA[Science has a data problem, There's been a rash of experiments that no one can reproduce and studies that have to be retracted, But there are some nascent efforts to address this credibility crisis by changing the way the data is handled. <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=649037&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom.com/2013/05/23/dodgy-data-the-iceberg-to-sciences-titanic/">As I detailed yesterday</a>, science has a data-credibility problem. There&#8217;s been a rash of experiments that no one can reproduce and studies that have to be retracted, all of which threatens to undermine the health and integrity of a fundamental driver of medical and economic progress. For the sake of the researchers, their funders and the public, we need to boost the power of the science community to self-correct and confirm its results.</p>
<p>In the eight years since John Ioannidis dropped the bomb that <a href="http://www.plosmedicine.org/article/info:doi/10.1371/journal.pmed.0020124">“most published research findings are false,”</a> pockets of activist scientists from both academia and industry have been forming to address this problem, and it seems this year that some of those efforts are finally bearing fruit.</p>
<h3 id="the-research-auditors"><em><b>The research auditors</b></em></h3>
<p>One interesting development is that a <a href="http://www.sciencemag.org/content/340/6134/787.full">group of scientists</a> is threatening to topple the <a href="http://en.wikipedia.org/wiki/Impact_factor">impact factor</a>, which ranks studies based on the journals in which they appear. This filter for quality research is based on journal prestige, but some scientists and startups are beginning to use <a href="http://altmetrics.org/manifesto/">alternative metrics</a> in an effort to refocus on the science itself (rather than the publishing journal).</p>
<p>Taking a cue from the internet, they are citing the number of clicks, downloads, and page views that the research gets as better measures of “impact.” One group leading that charge is the <a href="http://scienceexchange.com/reproducibility">Reproducibility Initiative</a>, an alliance that includes an open-access journal (the Public Library of Science’s PLOS ONE) and three startups (data repository Figshare, experiment marketplace Science Exchange, and reference manager Mendeley). The Initiative isn’t trying to solve fraud, says Mendeley’s head of academic outreach William Gunn. Rather, it wants to address the rest of the dodgy data iceberg: the selective reporting of data, the vague methods for performing experiments, and the culture that contributes to so many scientific studies being irreproducible.</p>
<p><img  alt="Stamp of Approval" src="http://gigaompaidcontent.files.wordpress.com/2012/02/stamp-approval-o.jpg?w=300&#038;h=248" width="300" height="248" class="size-medium wp-image-610830 alignleft" />The Initiative will leverage Science Exchange’s network of outside labs and contract research organizations to do what its name says: try to reproduce published scientific studies. They have 50 studies lined up for their first batch. The authors of these studies have opted in for the additional scrutiny, so there is a good chance much of their research will turn out to be solid.</p>
<p>Whatever the outcome, though, the Initiative wants to use this first test batch to show the scientific community and funders that this kind of exercise is value-adding despite the costs, which are estimated to be $20,000 per study (<a href="http://www.nature.com/nature/journal/v483/n7391/full/483531a.html">about 10% of the original research price tag</a>, depending on the study).</p>
<p>Gunn likens the process to a tax audit: not all studies can or should be tested for reproducibility, but the likely offenders may be among those that have high &#8220;impact factors,&#8221; much like high-income earners with many deductions warrant suspicion.</p>
<p>A stumbling block may be the researchers themselves, who like many successful people have egos to protect; no one wants to be branded “irreproducible.” The Initiative stresses that the replication effort is about setting a standard for what counts as a good method, and finding predictors of research quality that supersede journal, institution or individual.</p>
<h3 id="the-plumbers-and-librarians-of"><em><b>The plumbers and librarians of big data </b></em></h3>
<p>While the Reproducibility Initiative is trying to accelerate science’s natural self-correction process, another nascent group is working on improving the plumbing that serves data. The <a href="http://rd-alliance.org/">Research Data Alliance</a> (RDA), which is partially funded by the National Science Foundation, is barely a few months old, but it is already uniting global researchers who are passionate about improving infrastructure for data-driven innovation. <a href="http://www.businessweek.com/stories/2004-05-11/the-superwoman-of-supercomputing">“The superwoman of supercomputing”</a> Francine Berman, a professor at Rensselaer Polytechnic Institute, heads up the U.S. division of RDA.</p>
<p>The RDA is structured like the World Wide Web Consortium, with working groups that produce code, policies for data interoperability, and data infrastructure solutions. As of yet there is no working group for data integrity, but it is within RDA’s scope, says Berman. While the effort is still in its infancy, the broad goals would be to come up with a way to make sure that the data contained in a study is more accessible to more people, and also that it doesn&#8217;t simply disappear at a certain point because of, say, storage issues.  She says with data it&#8217;s like we&#8217;re back in the  Industrial Revolution, when we had to create a new social contract to guide how we do research and commerce.</p>
<h3 id="the-men-who-stare-at-data"><em>The men who stare at data</em></h3>
<p><img  alt="visualization-examples" src="http://gigaom2.files.wordpress.com/2013/04/visualization-examples.png?w=383&#038;h=236" width="383" height="236" class="alignright  wp-image-641109" />You can build places for data to live and spot-check it once it’s published, but there are also things researchers can do earlier, while they’re &#8220;interrogating&#8221; the data. After all, says Berman, you’re careful around strangers in real life, so why jump into bed with your data before you’re familiar with it?</p>
<p>Visualization is one of the most effective ways of inspecting the quality of your data, and getting different views of its potential. Automated processing is fast, but it can also produce spurious results if you don’t sanity-check your data first with visual and statistical techniques.</p>
<p>Stanford University computer scientist Jeff Heer, who also co-founded the <a href="http://gigaom.com/2012/10/04/how-trifacta-wants-to-teach-humans-and-data-to-work-together/">data munging startup Trifacta</a>, says visualization can help spot errors or extreme values. It can also test the user’s domain expertise (do you know what you’re doing and can you tell what a complete or faulty data set looks like?) and prior hypotheses about the data. “Skilled people are at the heart of the process of making sense of data,” says Heer. Someone with domain expertise who brings their memories and skills to the data can spot new insights, and in this way combat the determinism of blindly collected and reported data sets. Context, in the form of metadata, is rich and omni-present, Heer argues, as long as we’ve collected the right data the right way. Context can aid in interpretation and combat the determinism of blindly collected and reported data sets.</p>
<p>The three-pronged approach &#8212; better auditing, preservation and visualization &#8212; will help steer science away from the iceberg of unreliable data.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=649037&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=639409"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=639409" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=649037+steering-clear-of-the-iceberg-three-ways-we-can-fix-the-data-credibilty-crisis-in-science&utm_content=neuroamanda">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/how-big-data-analytics-drives-competitive-advantage/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=649037+steering-clear-of-the-iceberg-three-ways-we-can-fix-the-data-credibilty-crisis-in-science&utm_content=neuroamanda">How big data analytics drives competitive advantage</a></li><li><a href="http://pro.gigaom.com/report/the-new-economics-of-enterprise-data-warehousing/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=649037+steering-clear-of-the-iceberg-three-ways-we-can-fix-the-data-credibilty-crisis-in-science&utm_content=neuroamanda">How data warehousing is now a cost-effective solution for businesses</a></li><li><a href="http://pro.gigaom.com/report/cloud-and-data-first-quarter-2013-analysis-and-outlook/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=649037+steering-clear-of-the-iceberg-three-ways-we-can-fix-the-data-credibilty-crisis-in-science&utm_content=neuroamanda">Cloud and data first-quarter 2013: analysis and outlook</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/24/steering-clear-of-the-iceberg-three-ways-we-can-fix-the-data-credibilty-crisis-in-science/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaompaidcontent.files.wordpress.com/2013/04/shutterstock_131163173.jpg?w=150" />
		<media:content url="http://gigaompaidcontent.files.wordpress.com/2013/04/shutterstock_131163173.jpg?w=150" medium="image">
			<media:title type="html">icebergahead</media:title>
		</media:content>

		<media:content url="http://2.gravatar.com/avatar/e37323b74d1f383817d82c9f906b7bcf?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">neuroamanda</media:title>
		</media:content>

		<media:content url="http://gigaompaidcontent.files.wordpress.com/2012/02/stamp-approval-o.jpg?w=300" medium="image">
			<media:title type="html">Stamp of Approval</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/04/visualization-examples.png?w=708" medium="image">
			<media:title type="html">visualization-examples</media:title>
		</media:content>
	</item>
		<item>
		<title>BYOD is for amateurs. Try bring-your-own-laboratory</title>
		<link>http://gigaom.com/2013/05/23/byod-is-for-amateurs-try-bring-your-own-laboratory/</link>
		<comments>http://gigaom.com/2013/05/23/byod-is-for-amateurs-try-bring-your-own-laboratory/#comments</comments>
		<pubDate>Thu, 23 May 2013 22:36:58 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[geodata]]></category>
		<category><![CDATA[internet of things]]></category>
		<category><![CDATA[iPhone]]></category>
		<category><![CDATA[location data]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[sensors]]></category>
		<category><![CDATA[smartphones]]></category>
		<category><![CDATA[Structure]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=648925</guid>
		<description><![CDATA[University of Illinois researchers have created an app and a sensor-filled cradle that turn an iPhone into a mobile spectrophotometer. The combination of that mobile lab data and metadata such as location might prove very valuable.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648925&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Smartphones never cease to amaze me. I’m still impressed by how productive I’m able to be on my Android device no matter where I am (often to the chagrin of my wife), and I’m still surprised every time I see someone pull out a Square when it comes time to pay (like happened last night at Fat Choy in Las Vegas, a way-off-strip place you should totally check out if you’re in town). But neither of those situations really compare with busting out a phone in order to detect the levels of toxins in the air.</p>
<p>Yet that’s exactly <a href="http://news.illinois.edu/news/13/0523iphone_biosensor_BrianCunningham.html">what a group of researchers at the University of Illinois have created</a> — a cradle that wraps around an iPhone and turns it into a biosensor that can detect, according to a university press release, “toxins, proteins, bacteria, viruses and other molecules.” Inside that cradle are about $200 worth of mirrors, lenses and a photonic crystal that the researchers claim can identify these substances as accurately as a $50,000 spectrophotometer in the lab.</p>
<p>The cradle is essentially there for support, though, while the phone’s camera and processor do the real work. With everything firmly aligned in front of the camera, a scientist would simply snaps a photo and the CPU processes the result. What it’s processing is the difference in wavelength that the photonic crystal, primed to react to a specific molecule, reflects. The team demonstrates the device and app in the video embedded below.</p>
<span class="embed-youtube" style="text-align:center; display: block;"><iframe class="youtube-player" type="text/html" width="604" height="370" src="http://www.youtube.com/embed/Kh7MUjIYuyw?version=3&amp;rel=1&amp;fs=1&amp;showsearch=0&amp;showinfo=1&amp;iv_load_policy=1&amp;wmode=transparent" frameborder="0"></iframe></span>
<p>And if you’re into this type of mobile data collection, another group of University of Illinois researchers actually <a href="http://gigaom.com/2013/04/23/mobosens-a-square-like-tool-for-eco-warriors-lets-you-crowdsource-water-pollutants/">created a smartphone-powered water-pollution device called MoboSens</a></p>
<p>Like all things mobile or sensor, though — from <a href="http://gigaom.com/2012/02/03/skin-scan-wants-to-fight-cancer-using-iphones-and-big-data/">SkinScan</a> (now <a href="https://skinvision.com">SkinVision</a>) to health care apps like <a href="http://ginger.io/the-science/">Ginger.io</a> — the biggest value might come from data that has nothing to do with what the app is primarily measuring. Rather, when data about a certain condition, air quality or what have you is tagged with time and geodata, for example, it becomes the basis for mapping how situations are spreading or where there might be safe haven.</p>
<p>Imagine a team of scientists with iPhones dispersed throughout a city after a disaster, painting a real-time picture of what areas are most affected by a particular toxin (<a href="http://www.laboratoryequipment.com/news/2013/05/crowd-sourcing-helps-monitor-japans-radiation">or maybe radiation</a>). Taking a longer term approach, researchers could track how situations are evolving over time. Throw in even more data that smartphones are capable of detecting — temperature, ambient noise, vibration, etc. — and we might unlock entirely new ways to think about how diseases spread through the air or what conditions tend to favor the spread of foodborne bacteria.</p>
<p>In some ways, though, this is more than another cool thing you can do with a smartphone. It’s the furtherance of something we’ll discussing in depth at our <a href="http://event.gigaom.com/structure?utm_source=tech&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=648925+byod-is-for-amateurs-try-bring-your-own-laboratory&amp;utm_content=dharrisstructure">Structure conference</a> next month, which is how we rethink IT when computation and data are no longer bound within a single server or even the corporate network somewhere. The biological data this app will collect isn’t much use locked inside the phone; it needs a way to reliably and securely connect with other datasets and other services, likely distributed across the country or even the world. That’s where the real opportunity lies.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648925&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=386523"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=386523" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=648925+byod-is-for-amateurs-try-bring-your-own-laboratory&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2011/11/connected-world-the-consumer-technology-revolution/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=648925+byod-is-for-amateurs-try-bring-your-own-laboratory&utm_content=dharrisstructure">Connected world: the consumer technology revolution</a></li><li><a href="http://pro.gigaom.com/2012/01/12-tech-leaders-resolutions-for-2012/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=648925+byod-is-for-amateurs-try-bring-your-own-laboratory&utm_content=dharrisstructure">12 tech leaders’ resolutions for 2012</a></li><li><a href="http://pro.gigaom.com/2011/11/dissecting-the-data-5-issues-for-our-digital-future/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=648925+byod-is-for-amateurs-try-bring-your-own-laboratory&utm_content=dharrisstructure">Dissecting the data: 5 issues for our digital future</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/23/byod-is-for-amateurs-try-bring-your-own-laboratory/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/smartphonebiosensor-x-e1369346453624.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/smartphonebiosensor-x-e1369346453624.jpg?w=150" medium="image">
			<media:title type="html">SmartphoneBiosensor-x</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>
	</item>
		<item>
		<title>WibiData gets $15M to help it become the Hadoop application company</title>
		<link>http://gigaom.com/2013/05/23/wibidata-gets-15m-to-help-it-become-the-hadoop-application-company/</link>
		<comments>http://gigaom.com/2013/05/23/wibidata-gets-15m-to-help-it-become-the-hadoop-application-company/#comments</comments>
		<pubDate>Thu, 23 May 2013 11:31:17 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hbase]]></category>
		<category><![CDATA[machine-learning]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[OPower]]></category>
		<category><![CDATA[predictive analytics]]></category>
		<category><![CDATA[WibiData]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=648663</guid>
		<description><![CDATA[Startup WibiData has raised another $15 million and wants to turn the lessons it has learned in the field into generic software that can let anyone build predictive applications on Hadoop.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648663&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.wibidata.com/">WibiData</a> &#8212; the big data startup from Cloudera Co-founder Christophe Bisciglia and Aaron Kimball &#8212; doesn&#8217;t have <em>overly</em> big plans. It only wants to become one of the first, if not the first, company selling off-the-shelf software that lets other companies build valuable, customer-facing applications on Hadoop. On Thursday, WibiData announced $15 million in Series B funding from Canaan Partners, as well as existing investors NEA and Google Chairman Eric Schmidt, to help make the goal a reality. </p>
<p>Kidding aside, that&#8217;s actually quite an ambitious goal in a Hadoop market that&#8217;s big and growing, but that&#8217;s exemplified by expensive consulting arrangements and purpose-built applications. Even more so for companies that want to do something other than transforming unstructured data into structured data (often called ETL) or run back-office analytics jobs. In fact, WibiData has spent the last 18 months doing just this type of deal, and Bisciglia says every single customer has already engaged with one of the big three Hadoop vendors (Cloudera, Hortonworks and MapR). </p>
<p>Home energy-management startup <a href="http://gigaom.com/2012/11/19/opower-the-big-data-energy-player-to-beat/">Opower</a> is a good example of this process. It&#8217;s actually one of Cloudera&#8217;s banner customers, but &#8220;when they wanted to take [their software-as-a-service tool] beyond batch analysis and ETL workloads,&#8221; Bisciglia said, Opower came to WibiData. So whereas the Opower service was originally focused on nightly data analysis comparing users&#8217; energy usage against that of other users, it&#8217;s now working on dynamic recommendations for users and letting them engage with the application in new ways.</p>
<div id="attachment_648685" class="wp-caption alignright" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2013/05/wibi-kiji.jpg"><img  alt="The WibiData architecture" src="http://gigaom2.files.wordpress.com/2013/05/wibi-kiji.jpg?w=300&#038;h=224" width="300" height="224" class="size-medium wp-image-648685" /></a><p class="wp-caption-text">The WibiData architecture</p></div>
<p>During these engagements, WibiData <a href="http://gigaom.com/2012/03/22/wibidata-structure-data-2012/">has been building up its core technology</a> for connecting those brawny back-office Hadoop environments to predictive customer-facing applications &#8211; a collection of HBase, data-formatting tools and machine learning algorithms that the company <a href="http://gigaom.com/2012/11/14/wibidata-open-sources-kiji-to-make-hbase-more-useful/">has been slowly open-sourcing under the Kiji banner</a>. It has also been learning the similarities among the applications it&#8217;s building for customers in the same field, figuring out what&#8217;s repeatable. What does any given company in the retail space, for example, need to get started on <a href="http://gigaom.com/2013/05/08/why-3-celebrity-data-scientists-are-willing-to-work-for-free-for-you/">its own recommendation engine</a>? </p>
<p>And now, Bisciglia says, WibiData is going to double down on building application software based on what it has learned. The first two industries it targets will likely be financial services and retail, two areas where the company has seen a lot of traction. He envisions the finished product including some pre-defined schema for formatting data and some pre-built predictive models, both broadly applicable across that industry rather than specific to a single user. </p>
<p>There will also be different interfaces that allow different types of users (e.g., data scientists, systems engineers and business users) to interact with the data in the ways they need to. </p>
<p>Time will tell if WibiData can actually accomplish its goal of turning Hadoop into a collection of somewhat specialized software packages, but someone has to. Even industry heavyweights like Cloudera see the need, but their hands are full just getting Hadoop integrated into existing environments and getting those early uses up and running. As Cloudera CEO Mike Olson <a href="http://gigaom.com/2012/03/21/cloudera-structure-data-2012/">said at Structure: Data in 2012</a> to anyone ambitious enough to tackle the Hadoop-application gap, &#8220;Call me, I’ll connect you with funding. The money is out there.&#8221; </p>
<p>If you want to hear more about the need for Hadoop applications, check out this panel from Structure: Data 2013, where I speak with WibiData&#8217;s Omer Trajman, Continuuity&#8217;s Jonathan Gray and Pivotal&#8217;s Muddu Sudhakar. <span class='embed-youtube' style='text-align:center; display: block;'><iframe class='youtube-player' type='text/html' width='604' height='370' src='http://www.youtube.com/embed/z7BhGEQX9BQ?version=3&#038;rel=1&#038;fs=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;wmode=transparent' frameborder='0'></iframe></span></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648663&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=801262"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=801262" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=648663+wibidata-gets-15m-to-help-it-become-the-hadoop-application-company&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=648663+wibidata-gets-15m-to-help-it-become-the-hadoop-application-company&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2012/07/cloud-and-data-second-quarter-2012-analysis-and-outlook-2/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=648663+wibidata-gets-15m-to-help-it-become-the-hadoop-application-company&utm_content=dharrisstructure">Takeaways from the second quarter in cloud and data</a></li><li><a href="http://pro.gigaom.com/2011/12/why-the-big-data-startup-boom-will-likely-be-short-lived/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=648663+wibidata-gets-15m-to-help-it-become-the-hadoop-application-company&utm_content=dharrisstructure">Why the big data startup boom will likely be short-lived</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/23/wibidata-gets-15m-to-help-it-become-the-hadoop-application-company/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/wibi-founders.png?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/wibi-founders.png?w=150" medium="image">
			<media:title type="html">wibi founders</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/wibi-kiji.jpg?w=300" medium="image">
			<media:title type="html">The WibiData architecture</media:title>
		</media:content>
	</item>
		<item>
		<title>Concurrent is building a Hadoop assembly line in open source</title>
		<link>http://gigaom.com/2013/05/22/concurrent-is-building-a-hadoop-assembly-line-in-open-source/</link>
		<comments>http://gigaom.com/2013/05/22/concurrent-is-building-a-hadoop-assembly-line-in-open-source/#comments</comments>
		<pubDate>Wed, 22 May 2013 19:21:16 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[Cascading]]></category>
		<category><![CDATA[Concurrent]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Lingual]]></category>
		<category><![CDATA[machine-learning]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[Pattern]]></category>
		<category><![CDATA[predictive analytics]]></category>
		<category><![CDATA[SQL]]></category>
		<category><![CDATA[statistical analysis]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=648186</guid>
		<description><![CDATA[Cascading creator Concurrent has developed a new open source tool called Pattern for running machine learning models on Hadoop clusters. When combined with its SQL tool called Lingual, users can move data from one stage to another easily.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648186&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>If you know Java, R or SAS, doing machine learning on Hadoop data just got a lot easier. <a href="http://www.concurrentinc.com/">Concurrent</a> <em>(</em><em>see disclosure)</em>, the company behind the popular <a href="http://www.cascading.org/">Cascading</a> framework for writing big data jobs, has developed a new open source tool called <a href="http://www.cascading.org/pattern/">Pattern</a> that lets users export their models from statistical analysis applications and run THEM? at scale on Hadoop data with little to no code change.</p>
<p>The reason for creating Pattern is pretty simple, according to Concurrent founder and CTO Chris Wensel: &#8220;Hadoop is never used alone.&#8221; It&#8217;s always part of a data environment that also includes databases, visualization tools, analytics software and/or statistical analysis tools that arguably do the really valuable work. Hadoop&#8217;s real value is an integration platform that can feed data into these other systems and, ideally, put their outputs to work across much larger datasets.</p>
<p>Developers <em>can</em> use the Pattern Java API to create machine learning jobs, but they can also simply export a Predictive Model Markup Language (PMML) file from software like R, SAS and MicroStrategy that Pattern will read and run them as a Cascading workflow. Models are useless unless you can run them in production, Wensel said, and Pattern lets them run across more data, stored in Hadoop, than you can use to build them with those other tools.</p>
<p>However, Wensel noted, &#8220;The real takeaway isn&#8217;t Pattern itself.&#8221;</p>
<p>From his perspective, the real story is Pattern plus Cascading plus <a href="http://www.cascading.org/lingual/">Lingual</a>, the open source SQL-to-Hadoop tool that Concurrent recently developed and released. Lingual is the tie that binds everything together, creating a sort of assembly line for data as it works its way from generation to delivering some value. For example, someone might create a Cascading job that adds structure to incoming data, and then pull some of the data into R using Lingual. Once a model is created in R and exported to the Hadoop cluster using Pattern, Lingual can feed the MapReduce output file back to R so a data scientist can test the model&#8217;s accuracy.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/05/arch-diagram.png"><img  alt="arch-diagram" src="http://gigaom2.files.wordpress.com/2013/05/arch-diagram.png?w=708"   class="aligncenter size-full wp-image-648347" /></a></p>
<p>And actually, Wensel said, Lingual could have a positive effect on companies&#8217; bottom lines. Airbnb recently replaced a departed engineer with Lingual for monthly migrations of data from Hadoop and into SQL environments. Climate Corporation, <a href="http://gigaom.com/2012/05/02/how-climate-corp-is-pitting-big-data-against-mother-nature/">a massive Hadoop and Cascading user</a>, could use Lingual to let its crop-and-weather insurance customers access their data from the company&#8217;s Hadoop store.</p>
<p>Lingual and Pattern should help Concurrent finally make some money, too. Both of them, as well as the Cascading framework that underpins them, will always be open source, Wensel said, but it plans to create &#8220;a suite of products that will make your life much better if &#8230; you standardize on Cascading.&#8221;</p>
<p>For example, the company has the ability to monitor jobs at the application level rather than the cluster level, meaning it can tell you the details of that job that&#8217;s locking up all the resources and whether you really want to kill it (it might be an important report for the CFO &#8230;). &#8220;We can do some really interesting things,&#8221; Wensel said.</p>
<p><em><strong>Disclosure</strong>: Concurrent is backed by True Ventures, a venture capital firm that is an investor in the parent company of this blog, Giga Omni Media. Om Malik, the founder of Giga Omni Media, is also a venture partner at True.</em></p>
<p><em>This post was updated at 2:48pm PT to correct Chris Wensel&#8217;s title. He is CTO.</em></p>
<p><em>Feature image courtesy of <a href="http://www.shutterstock.com/gallery-908242p1.html">Shutterstock user PENGYOU91</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648186&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=537772"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=537772" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=648186+concurrent-is-building-a-hadoop-assembly-line-in-open-source&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/sql-on-hadoop-roadmap-2013/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=648186+concurrent-is-building-a-hadoop-assembly-line-in-open-source&utm_content=dharrisstructure">Sector RoadMap: SQL-on-Hadoop platforms in 2013</a></li><li><a href="http://pro.gigaom.com/report/cloud-and-data-first-quarter-2013-analysis-and-outlook/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=648186+concurrent-is-building-a-hadoop-assembly-line-in-open-source&utm_content=dharrisstructure">Cloud and data first-quarter 2013: analysis and outlook</a></li><li><a href="http://pro.gigaom.com/report/how-to-use-big-data-to-make-better-business-decisions/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=648186+concurrent-is-building-a-hadoop-assembly-line-in-open-source&utm_content=dharrisstructure">How to use big data to make better business decisions</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/22/concurrent-is-building-a-hadoop-assembly-line-in-open-source/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/shutterstock_98915513.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/shutterstock_98915513.jpg?w=150" medium="image">
			<media:title type="html">assembly line</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/arch-diagram.png" medium="image">
			<media:title type="html">arch-diagram</media:title>
		</media:content>
	</item>
		<item>
		<title>New algorithm maps cancer cells like nodes on a social network</title>
		<link>http://gigaom.com/2013/05/20/new-algorithm-maps-cancer-cells-like-nodes-on-a-social-network/</link>
		<comments>http://gigaom.com/2013/05/20/new-algorithm-maps-cancer-cells-like-nodes-on-a-social-network/#comments</comments>
		<pubDate>Mon, 20 May 2013 20:58:53 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[algorithms]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[cancer research]]></category>
		<category><![CDATA[graph analysis]]></category>
		<category><![CDATA[health care]]></category>
		<category><![CDATA[medical research]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=647256</guid>
		<description><![CDATA[A group of researchers from Columbia and Stanford have created a method for turning complex cellular datasets into visualizations that map the similarities between tens of thousands of cells within a tissue sample.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=647256&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Often times, the best way to to get a sense of your data is to look at it. A bunch of of numbers or words might not mean anything sitting within a table, but they start to make a lot more sense when they’re turned into a chart. In fields like mass cytometry, though, where doctors might want to analyze dozens of biological markers for each of tends of thousands of cells in a tissue sample, creating an easy-to-understand chart is easier said than done.</p>
<p>That’s why a group of researchers from Columbia University and Stanford University developed an algorithm that can do just that, turning those cells into something that resembles your social graph. This lets researchers see how the various cells are related to each other so they know , for example, where to focus cancer treatment and what to track as that treatment progresses.</p>
<p>The idea of representing large or complex data as a graph is nothing new, but it has taken on more prominence thanks to the rise of social media and those ubiquitous social graphs that map out who’s connected to whom. As we highlighted recently, however, <a href="http://gigaom.com/2013/05/14/were-witnessing-the-rise-of-the-graph-in-big-data/">graph analysis is becoming more popular</a> outside the realm of social networks, and is being applied to problems that are more complex than just figuring out simple relationships within a network. In cases such as medical research, especially, graphs can provide a very effective way of seeing how potentially hundreds of thousands of data points spanning perhaps hundreds of variables are similar to each other.</p>
<p>That’s exactly what the team at Columbia and Stanford has done with a new algorithm that they’ve demonstrated within the realm of mass cytometry. According to <a href="http://newsroom.cumc.columbia.edu/2013/05/20/computational-tool-translates-complex-data-into-simplified-2-dimensional-images/">a press release announcing the research</a> (which is <a href="http://www.nature.com/nbt/journal/vaop/ncurrent/full/nbt.2594.html">available via paid download</a> at Nature Biotechnology):</p>
<blockquote id="quote-the-method-called-vi"><p>“The method, called viSNE (visual interactive Stochastic Neighbor Embedding), is based on a sophisticated algorithm that translates high-dimensional data (e.g., a dataset that includes many different simultaneous measurements from single cells) into visual representations similar to two-dimensional ‘scatter plots’ ….</p>
<p>“The viSNE software can analyze measurements of dozens of molecular markers. In the two-dimensional maps that result, the distance between points represents the degree of similarity between single cells. The maps can reveal clearly defined groups of cells with distinct behaviors (e.g., drug resistance) even if they are only a tiny fraction of the total population. This should enable the design of ways to physically isolate and study these cell subpopulations in the laboratory.”</p></blockquote>
<p>I assume they say <em>similar</em> to scatter plots because the algorithm is analyzing data across more than two dimensions, although the resulting chart is essentially the same (i.e., data points with similar characteristics will form clusters).</p>
<div id="attachment_647346" class="wp-caption aligncenter" style="width: 718px"><a href="http://gigaom2.files.wordpress.com/2013/05/screen-shot-2013-05-20-at-9-42-09-am.png"><img alt="The results of viSNE, showing cell densities in diagnosis and relapse samples." src="http://gigaom2.files.wordpress.com/2013/05/screen-shot-2013-05-20-at-9-42-09-am.png?w=708&#038;h=403" width="708" height="403" class="size-large wp-image-647346"></a><p class="wp-caption-text">The results of viSNE, showing cell densities in diagnosis and relapse samples.</p></div>
<p>Whether or not they’re technically similar, this research <a href="http://gigaom.com/2013/01/16/has-ayasdi-turned-machine-learning-into-a-magic-bullet/">seems similar to what Ayasdi is doing</a> with its new data-analysis software based on a technique called topological data analysis. In both cases, though, the algorithms aren’t necessarily concerned with how data points interact with one another (like in network graphs), but rather what similar characteristics the points share. Ayasdi’s software has been used in cancer research, too, including on datasets spanning hundreds of patients and tens of thousands of variables.</p>
<p>In theory — although not likely in practice considering the complexity of the datasets medical researchers are dealing with — these approaches are similar to clustering approaches that are also popular among data scientists working with web companies. In areas such as e-commerce or <a href="http://gigaom.com/2013/05/05/how-mailchimp-learned-to-treat-data-like-orange-juice-and-rethink-email-in-the-process/">email management</a>, for example, where there isn’t a strong social element, companies can broadly break customers into distinct groups based on their behavior or interests.</p>
<div id="attachment_642360" class="wp-caption aligncenter" style="width: 718px"><a href="http://gigaom2.files.wordpress.com/2013/05/marriedknit-tiff.jpg"><img alt="A sample cluster of subscribers." src="http://gigaom2.files.wordpress.com/2013/05/marriedknit-tiff.jpg?w=708&#038;h=427" width="708" height="427" class="size-large wp-image-642360"></a><p class="wp-caption-text">A sample cluster of MailChimp subscribers.</p></div>
<p>Of course, curing cancer is a slightly more compelling — and difficult — goal than targeted advertising. The algorithms have to be precise so as not to miss similarities hidden within the mass of data. In the case of viSNE, the researchers say they’ve been able to spot small groups of cells (like 20 out of tens of thousands) that might be able to survive chemotherapy and increase the likelihood of a recurring tumor.</p>
<p>But we probably shouldn’t bee too quick to discount the work that web companies do as somehow less valuable than that of cancers researchers, for example. The big data era arguably started with the web, and web companies have generated some of the most important data-analysis techniques and technologies around today (see, for example, Google’s Jeff Dean, with whom I’ll be speaking at our <a href="http://event.gigaom.com/structure/schedule/?utm_source=data&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=647256+new-algorithm-maps-cancer-cells-like-nodes-on-a-social-network&amp;utm_content=dharrisstructure">Structure conference</a> next month). As <a href="http://gigaom.com/2012/11/27/why-data-is-the-key-to-better-medicine-and-maybe-a-cure-for-cancer/">medical researchers start generating more and more data</a> via cytometry, genome sequencing and even electronic medical records, it will be critical for individuals in all fields to keep track of what data scientists in other fields are doing and <a href="http://gigaom.com/2013/03/26/how-researchers-are-fighting-lung-cancer-using-pagerank/">figure out how that might apply to their own work</a>.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=647256&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=22282"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=22282" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=647256+new-algorithm-maps-cancer-cells-like-nodes-on-a-social-network&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2011/11/connected-world-the-consumer-technology-revolution/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=647256+new-algorithm-maps-cancer-cells-like-nodes-on-a-social-network&utm_content=dharrisstructure">Connected world: the consumer technology revolution</a></li><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=647256+new-algorithm-maps-cancer-cells-like-nodes-on-a-social-network&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2012/01/newnet-q4-platform-mania-and-social-commerce-shakeout/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=647256+new-algorithm-maps-cancer-cells-like-nodes-on-a-social-network&utm_content=dharrisstructure">NewNet Q4: Platform mania and social commerce shakeout</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/20/new-algorithm-maps-cancer-cells-like-nodes-on-a-social-network/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/screen-shot-2013-05-20-at-9-42-09-am1-e1369079018409.png?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/screen-shot-2013-05-20-at-9-42-09-am1-e1369079018409.png?w=150" medium="image">
			<media:title type="html">Screen-Shot-2013-05-20-at-9.42.09-AM</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/screen-shot-2013-05-20-at-9-42-09-am.png?w=708" medium="image">
			<media:title type="html">The results of viSNE, showing cell densities in diagnosis and relapse samples.</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/marriedknit-tiff.jpg?w=708" medium="image">
			<media:title type="html">A sample cluster of subscribers.</media:title>
		</media:content>
	</item>
		<item>
		<title>Alteryx raises $12M to make predictive analytics user-friendly</title>
		<link>http://gigaom.com/2013/05/20/alteryx-raises-12m-to-make-predictive-analytics-user-friendly/</link>
		<comments>http://gigaom.com/2013/05/20/alteryx-raises-12m-to-make-predictive-analytics-user-friendly/#comments</comments>
		<pubDate>Mon, 20 May 2013 14:55:03 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[Alteryx]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[predictive analytics]]></category>
		<category><![CDATA[statistical analysis]]></category>
		<category><![CDATA[Visualization]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=647059</guid>
		<description><![CDATA[Analytics provider Alteryx has raised another $12 million as it tries to make statistical analysis a more consumer-friendly experience. <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=647059&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.alteryx.com/">Alteryx</a>, an Irvine, Calif.-based startup trying to be a hybrid of Tableau and statistical analysis software like SAS or R, raised $12 million in an extended Series A round. Newcomer firm Toba Capital led the round, with existing investor SAP Capital also contributing.</p>
<p>President and COO George Mathew says the company&#8217;s mission is to be a one-stop shop for statistical analysis. It wants to be the one place where analysts and data scientists can blend their data, model it on it and then visualize it. Often, he noted, that same process might require two or three separate products.</p>
<p>Another feature that Alteryx hopes will set it apart is its collection of prebuilt models in what the company calls an analytics gallery. Users can share their own work or find models others have built for tackling similar issues. Alteryx also offers up its own pre-formatted datasets for analysis, often public data <a href="http://www.alteryx.com/module-exchange-details/614">such as the U.S. census</a>.</p>
<p>&#8220;The canvas for creating an analytics application should never be blank for the analyst when they&#8217;re getting started,&#8221; Mathew explained. They often need to understand external data as well as their internal data, so Alteryx&#8217;s software gives them easy access to it.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/05/gallery.jpg"><img src="http://gigaom2.files.wordpress.com/2013/05/gallery.jpg?w=708&#038;h=392" alt="gallery" width="708" height="392"  class="aligncenter size-large wp-image-647099" /></a></p>
<p>Because it&#8217;s based on the R statistical-programming language, heavy R user Walmart has been able to transition some workloads to Alteryx when employees need an easier user experience. McDonald&#8217;s uses it to analyze data about franchisees and about its growth strategy in China, and Bloomin&#8217; Brands (parent of company of Outback Steakhouse and other restaurants) is using it to help build menus that take into account what diners in various parts of the country prefer to eat. Nine of the 10 leading top wireless providers providers are also users, Mathew said, trying to blend actual call data with traditional sources such as customer service data.</p>
<p>Mathew compares Alteryx&#8217;s current growth as analogous to software-as-a-service applications like Salesforce.com in the CRM space, or even <a href="http://gigaom.com/2013/05/17/tableau-closes-day-1-as-a-2-9-billion-public-company-up-64-percent/">Tableau in the traditional business-intelligence space</a>. In a business world increasingly driven by at least the idea of big data, one might expect any vendor pushing a more consumer-like purchase and consumption experience to get interest from companies tired of dealing with legacy software or never wanting to experience it in the first place.</p>
<p>&#8220;The disruption that&#8217;s happening is creating a new space for ourselves,&#8221; Mathew said, &#8220;without having to go head to head, frankly, with the a status quo out there.&#8221;</p>
<p><em>Feature image courtesy of <a href="http://www.shutterstock.com/gallery-896311p1.html">Shutterstock user ramcreations</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=647059&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=918323"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=918323" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=647059+alteryx-raises-12m-to-make-predictive-analytics-user-friendly&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/sector-roadmap-social-customer-service-in-2013/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=647059+alteryx-raises-12m-to-make-predictive-analytics-user-friendly&utm_content=dharrisstructure">Sector RoadMap: Social customer service in 2013</a></li><li><a href="http://pro.gigaom.com/2011/11/the-internet-of-things-creating-tomorrows-health-care/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=647059+alteryx-raises-12m-to-make-predictive-analytics-user-friendly&utm_content=dharrisstructure">The Internet of things: creating tomorrow&#8217;s health care</a></li><li><a href="http://pro.gigaom.com/2011/11/dissecting-the-data-5-issues-for-our-digital-future/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=647059+alteryx-raises-12m-to-make-predictive-analytics-user-friendly&utm_content=dharrisstructure">Dissecting the data: 5 issues for our digital future</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/20/alteryx-raises-12m-to-make-predictive-analytics-user-friendly/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/shutterstock_114471748.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/shutterstock_114471748.jpg?w=150" medium="image">
			<media:title type="html">analytics</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/gallery.jpg?w=708" medium="image">
			<media:title type="html">gallery</media:title>
		</media:content>
	</item>
		<item>
		<title>How big data analytics drives competitive advantage</title>
		<link>http://pro.gigaom.com/report/how-big-data-analytics-drives-competitive-advantage/</link>
		<comments>http://pro.gigaom.com/report/how-big-data-analytics-drives-competitive-advantage/#comments</comments>
		<pubDate>Mon, 20 May 2013 06:55:26 +0000</pubDate>
		<dc:creator>Editor</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[analytics]]></category>
		<category><![CDATA[barack obama]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[CPU technologies]]></category>
		<category><![CDATA[CRM]]></category>
		<category><![CDATA[data management]]></category>
		<category><![CDATA[data warehouse]]></category>
		<category><![CDATA[database management systems]]></category>
		<category><![CDATA[enterprise IT]]></category>
		<category><![CDATA[information technology]]></category>
		<category><![CDATA[Infrastructure optimization]]></category>
		<category><![CDATA[machine-learning]]></category>
		<category><![CDATA[Network performance]]></category>
		<category><![CDATA[rdbms]]></category>
		<category><![CDATA[Relational database]]></category>
		<category><![CDATA[social media]]></category>

		<guid isPermaLink="false">http://pro.gigaom.com/?post_type=go-report&#038;p=176801/</guid>
		<description><![CDATA[Few companies today have the time or the analytics expertise to apply statistics and complex data modeling to regular or even daily business decisions and operations. Now, however, an ecosystem of companies is emerging to fill this need.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648489&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Few companies today have the time or the analytics expertise to apply statistics and complex data modeling to regular or even daily business decisions and operations. Now, however, an ecosystem of companies is emerging to fill this need.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=648489&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=796877"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=796877" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=648489+how-big-data-analytics-drives-competitive-advantage&utm_content=benwoony">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/the-new-economics-of-enterprise-data-warehousing/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=648489+how-big-data-analytics-drives-competitive-advantage&utm_content=benwoony">How data warehousing is now a cost-effective solution for businesses</a></li><li><a href="http://pro.gigaom.com/report/how-to-use-big-data-to-make-better-business-decisions/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=648489+how-big-data-analytics-drives-competitive-advantage&utm_content=benwoony">How to use big data to make better business decisions</a></li><li><a href="http://pro.gigaom.com/report/how-to-manage-big-data-without-breaking-the-bank/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=648489+how-big-data-analytics-drives-competitive-advantage&utm_content=benwoony">How to manage big data without breaking the bank</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://pro.gigaom.com/report/how-big-data-analytics-drives-competitive-advantage/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://2.gravatar.com/avatar/b58d6d331ef3725b56f64faa3d21be12?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">benwoony</media:title>
		</media:content>
	</item>
		<item>
		<title>Tableau closes Day 1 as a $2.9B public company, up 64 percent</title>
		<link>http://gigaom.com/2013/05/17/tableau-closes-day-1-as-a-2-9-billion-public-company-up-64-percent/</link>
		<comments>http://gigaom.com/2013/05/17/tableau-closes-day-1-as-a-2-9-billion-public-company-up-64-percent/#comments</comments>
		<pubDate>Fri, 17 May 2013 22:59:24 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[analytics]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[ipo]]></category>
		<category><![CDATA[tableau]]></category>
		<category><![CDATA[Visualization]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=646748</guid>
		<description><![CDATA[Tableau had a successful IPO, closing the trading day up 64 percent and raking in $254 million. CEO Christian Chabot says the company is now set to make itself known around the world.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=646748&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Data analytics star Tableau had a successful initial public offering on Friday, <a href="http://data.cnbc.com/quotes/DATA">closing the day up nearly 64 percent</a> at $50.75 per share. That means the company brought in about $254 million (it sold 5 million shares, while stockholders sold 3.4 million) and has a market cap of $2.9 billion. Shares have remained relatively steady in after-hours trading, trending down only slightly.</p>
<p>&#8220;We&#8217;re thrilled,&#8221; Tableau co-founder and CEO Christian Chabot told me during a call after the market closed. One should hope so.</p>
<p>Chabot and his fellow co-founders stand to make a lot of money if today&#8217;s closing price holds up, as does its sole investor NEA. The firm put $15 million into Tableau since it launched in 2003, and has rode that sum to profitability and more than $127 million in annual revenue.</p>
<p>Here&#8217;s a quick chart (made using Tableau Public) showing who owns how many share and what they&#8217;re potentially worth.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/05/tabipo.jpg"><img src="http://gigaom2.files.wordpress.com/2013/05/tabipo.jpg?w=708&#038;h=443" alt="tabipo" width="708" height="443"  class="aligncenter size-large wp-image-646811" /></a></p>
<p>The company didn&#8217;t really need more capital to operate, Chabot said, but one of the primary drivers was to raise awareness of the company. It has about 12,000 customers, he said, but there are millions more possible users. As part of attracting them, the company is going to expand globally and is working to improve its reach across mobile devices, the cloud and the Mac operating system.</p>
<p>&#8220;I don&#8217;t believe in the this whole &#8216;or&#8217; philosophy with computers,&#8221; Chabot said. &#8220;It&#8217;s &#8216;and&#8217;&#8221; &#8212; meaning people will use desktops and tablets and smartphones.</p>
<p>More prominence and more users singing its praises might also dispel the notion that Tableau is just about visualization. It has some fairly advanced features under the covers (as a commenter <a href="http://gigaom.com/2013/05/16/tableau-prices-its-stock-at-31-per-share-for-fridays-ipo/">to my earlier post</a> about the company&#8217;s influence pointed out), even if they&#8217;re hidden by the relatively simple user experience. </p>
<p>&#8220;Tableau is not a visualization company, per se, it&#8217;s really an analytics company,&#8221; Chabot said.</p>
<p>However, if the company really wants to expand its reach to everyone one who wants to gain knowledge from data &#8212; something Chabot calls a &#8220;timeless human need&#8221; &#8212; <a href="http://gigaom.com/2013/04/07/we-need-a-data-democracy-not-a-benevolent-data-dictatorship/">it might actually need to get simpler</a>. More marketing can let potential business users know about new features like forecasting and data-extraction, but it won&#8217;t make a dentist is Des Moines better at formatting his data.</p>
<p>After raising $254 million in its IPO, though, Tableau is in a good place to do whatever it has to.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=646748&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=741419"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=741419" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=646748+tableau-closes-day-1-as-a-2-9-billion-public-company-up-64-percent&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/01/newnet-q4-platform-mania-and-social-commerce-shakeout/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=646748+tableau-closes-day-1-as-a-2-9-billion-public-company-up-64-percent&utm_content=dharrisstructure">NewNet Q4: Platform mania and social commerce shakeout</a></li><li><a href="http://pro.gigaom.com/2012/01/newnet-q4-platform-mania-and-social-commerce-shakeout/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=646748+tableau-closes-day-1-as-a-2-9-billion-public-company-up-64-percent&utm_content=dharrisstructure">NewNet Q4: Platform mania and social commerce shakeout</a></li><li><a href="http://pro.gigaom.com/report/the-new-economics-of-enterprise-data-warehousing/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=646748+tableau-closes-day-1-as-a-2-9-billion-public-company-up-64-percent&utm_content=dharrisstructure">How data warehousing is now a cost-effective solution for businesses</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/17/tableau-closes-day-1-as-a-2-9-billion-public-company-up-64-percent/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/tabyahoo.png?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/tabyahoo.png?w=150" medium="image">
			<media:title type="html">tabyahoo</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/tabipo.jpg?w=708" medium="image">
			<media:title type="html">tabipo</media:title>
		</media:content>
	</item>
		<item>
		<title>Database startup Drawn to Scale is closing down</title>
		<link>http://gigaom.com/2013/05/17/database-startup-drawn-to-scale-is-closing-down/</link>
		<comments>http://gigaom.com/2013/05/17/database-startup-drawn-to-scale-is-closing-down/#comments</comments>
		<pubDate>Fri, 17 May 2013 21:24:03 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[Drawn to Scale]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Hbase]]></category>
		<category><![CDATA[SQL on Hadoop]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=646718</guid>
		<description><![CDATA[Database startup Drawn to Scale, creator of the SQL-on-Hadoop technology called Spire, is closing down. The company's product, Spire, was one of the first SQL-on-Hadoop technologies.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=646718&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Database startup Drawn to Scale, creator of the SQL-on-Hadoop technology called Spire, is closing down. Co-founder and CEO Bradford Stephens officially <a href="http://www.roadtofailure.com/?p=11">announced the closure in a blog post</a> on Friday.</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/05/spirearchitecture-015-e1361407038325.png"><img  alt="spirearchitecture-015-e1361407038325" src="http://gigaom2.files.wordpress.com/2013/05/spirearchitecture-015-e1361407038325.png?w=300&#038;h=185" width="300" height="185" class="alignleft size-medium wp-image-646740" /></a>The company&#8217;s product, Spire, which provided full SQL support on top of the HBase NoSQL database, was one of the first products to <a href="http://gigaom.com/2012/07/24/how-one-startup-wants-to-inject-hadoop-into-your-sql/">try to blend Hadoop&#8217;s scalability with the robustness and familiarity of SQL</a>. That&#8217;s now <a href="http://gigaom.com/2013/03/05/the-hadoop-ecosystem-the-welcome-elephant-in-the-room-infographic/">an increasingly crowded space</a> (and has grown since that linked graphic was created). In March, Drawn to Scale <a href="http://gigaom.com/2013/03/19/drawn-to-scale-wants-to-solve-your-mongodb-scalability-problems/">expanded its support to MongoDB</a>, as well.</p>
<p>I wasn&#8217;t shocked when Stephens told me the news &#8212; questions about the four-year-old company&#8217;s financial health had been swirling for a while &#8212; but to hear of its financial woes was a bit surprising. His account in the post pretty much echoes what I had heard from others:</p>
<blockquote id="quote-it-seemed-we-had-eve"><p>&#8220;It seemed we had everything going for us — paid customers such as American Express, Orange Telecom, Flurry, and 4 others. Our technology worked brilliantly, we had a big hiring pipeline, and we had great media presence against our competitors who raised 10-100x more cash.&#8221;</p></blockquote>
<p>He added:</p>
<blockquote id="quote-yet-five-days-before2"><p>&#8220;Yet five days before we signed term sheets for a big A round or sold the company, we started getting hit by a series of black swans — and we just didn’t have what we needed to recover. I’ll leave the public detail at that level, but I will say that paying employees’ health insurance out of your meager savings is a powerful incentive to change course.&#8221;</p></blockquote>
<p>Up to this point, the company <a href="http://gigaom.com/2012/03/08/drawn-to-scale-raises-money-to-make-sql-big-data-ready/">had raised $925,000</a> from RTP Ventures, IA Ventures and SK Ventures. There&#8217;s no word yet on what will come of the company&#8217;s intellectual property.</p>
<p>As Stephens &#8212; who&#8217;s now doing an entrepreneur-in-residence gig at Ping Identity and helping out other startups (including popular wardrobe app <a href="http://www.clothapp.com/">Cloth</a>) &#8212; succinctly put it during a phone discussion, &#8220;We just don&#8217;t have the horsepower to keep running the company.&#8221;</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=646718&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=558916"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=558916" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=646718+database-startup-drawn-to-scale-is-closing-down&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/the-new-economics-of-enterprise-data-warehousing/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=646718+database-startup-drawn-to-scale-is-closing-down&utm_content=dharrisstructure">How data warehousing is now a cost-effective solution for businesses</a></li><li><a href="http://pro.gigaom.com/report/sql-on-hadoop-roadmap-2013/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=646718+database-startup-drawn-to-scale-is-closing-down&utm_content=dharrisstructure">Sector RoadMap: SQL-on-Hadoop platforms in 2013</a></li><li><a href="http://pro.gigaom.com/report/how-to-use-big-data-to-make-better-business-decisions/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=646718+database-startup-drawn-to-scale-is-closing-down&utm_content=dharrisstructure">How to use big data to make better business decisions</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/17/database-startup-drawn-to-scale-is-closing-down/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/dtsdragon.png?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/dtsdragon.png?w=150" medium="image">
			<media:title type="html">dtsdragon</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/spirearchitecture-015-e1361407038325.png?w=300" medium="image">
			<media:title type="html">spirearchitecture-015-e1361407038325</media:title>
		</media:content>
	</item>
		<item>
		<title>Tableau prices its stock at $31 per share for Friday&#8217;s IPO</title>
		<link>http://gigaom.com/2013/05/16/tableau-prices-its-stock-at-31-per-share-for-fridays-ipo/</link>
		<comments>http://gigaom.com/2013/05/16/tableau-prices-its-stock-at-31-per-share-for-fridays-ipo/#comments</comments>
		<pubDate>Fri, 17 May 2013 00:03:48 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[analytics]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Business Intelligence]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[tableau]]></category>
		<category><![CDATA[Visualization]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=646412</guid>
		<description><![CDATA[Tableau's initial public offering is on Friday, and expectations are high. The company has inspired much of the next-generation analytics space, and how it fares could be telling about just how powerful the data movement is.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=646412&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.tableausoftware.com/">Tableau Software</a> has priced shares for its initial public offering on Friday at $31. The company is offering up 5 million shares, while stockholders are offering 3.2 million shares. Tableau co-founder and CEO Christian Chabot will ring the opening bell on the New York Stock Exchange, where the company will list under the symbol &#8220;DATA.&#8221;</p>
<p>That&#8217;s an apt ticker symbol for a company that is in some ways a bellwether for the current fascination with all things data. Tableau isn&#8217;t a big data company, per se, but its visualization software breathes life into many big data calculations. Its <a href="http://gigaom.com/2013/04/07/we-need-a-data-democracy-not-a-benevolent-data-dictatorship/">focus on making software that&#8217;s easy to use</a> and that creates visually captivating charts has turned people from numerous professions into amateur data analysts. (I&#8217;ve even used it in the past, <a href="http://gigaom.com/2011/10/25/google-shows-the-limits-of-a-free-web/">including for the first time</a> in 2011.)</p>
<div id="attachment_646423" class="wp-caption alignright" style="width: 298px"><a href="http://gigaom2.files.wordpress.com/2013/05/und-leadership-christian-small.jpg"><img  alt="Christian Chabot" src="http://gigaom2.files.wordpress.com/2013/05/und-leadership-christian-small.jpg?w=708"   class="size-full wp-image-646423" /></a><p class="wp-caption-text">Christian Chabot</p></div>
<p>As Chabot <a href="http://gigaom.com/2012/02/23/thanks-to-consumerization-its-ipo-season-in-analytics/">told me during a conversation in 2011</a>, &#8220;In any field of human endeavor &#8230; there are a hundred to a thousand more people who understand the data of that field more than they understand reporting and analytics.&#8221;</p>
<p>Anytime you read about a hot new visualization or analytics startup promising the moon, you&#8217;re also seeing the results of what Tableau has sown in terms of the user experience. Many of those same companies will be quick to tell you how limited Tableau&#8217;s capabilities are. It&#8217;s memory-bound, it doesn&#8217;t have a database, it&#8217;s not available in the cloud (or on the Mac operating system), it can&#8217;t do predictive analytics. All true.</p>
<p>Of course, if it raises the kind of capital it expects to by going public, it can build and buy a lot of those capabilities. If pricing stays flat all day Friday, Tableau stands to make $155 million from its 5 million shares. Previous estimates <a href="http://www.forbes.com/sites/tomiogeron/2013/05/16/tableau-software-raises-ipo-price-range/">had Tableau&#8217;s market cap at around $1.7 billion</a> at a price of $29 per share (the company&#8217;s S-1 filing <a href="http://edgar.sec.gov/Archives/edgar/data/1303652/000119312513138700/d469057ds1.htm#rom469057_17">is available here</a>).</p>
<p>If investors have really bought into the company and the concept of a data-driven world, then who knows. Machine-data expert Splunk wnet public in 2012, flying the big data banner, and <a href="http://gigaom.com/2012/04/19/splunk-ipo-kills-lives-up-to-expectations/">saw shares peak at 91 percent above</a> its original asking price of $17.</p>
<p>I&#8217;m not suggesting Tableau is the biggest name in data, or even that it will some day become it. This next-generation analytics field is very young, with startups and larger vendors alike sometimes competing against themselves to win wholly new accounts than trying to displace legacy vendors within large enterprises. And every month, it seems, <a href="http://gigaom.com/2013/05/13/visualization-is-the-future-6-startups-re-imagining-how-we-consume-data/">I come across some new startup</a> that was built with the same principles in mind as Tableau, but with the advantage of having today&#8217;s best practices baked into its software.</p>
<p>But Tableau definitely commands a lot of the mindshare. How it fares as a public company <a href="http://gigaom.com/2013/04/03/a-tableau-ipo-could-validate-the-big-data-visualization-push-or-not/">could be a strong indicator</a> of just how powerful the data movement is, and how well it capitalizes on a new influx of cash will determine how long it stays on the top of customers&#8217; minds.</p>
<p><em>This post was updated at 7:01 p.m. to include previous estimates of the company&#8217;s market capitalization and a link to its S-1 filing.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=646412&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=257586"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=257586" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=646412+tableau-prices-its-stock-at-31-per-share-for-fridays-ipo&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/the-new-economics-of-enterprise-data-warehousing/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=646412+tableau-prices-its-stock-at-31-per-share-for-fridays-ipo&utm_content=dharrisstructure">How data warehousing is now a cost-effective solution for businesses</a></li><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=646412+tableau-prices-its-stock-at-31-per-share-for-fridays-ipo&utm_content=dharrisstructure">The importance of putting the U and I in visualization</a></li><li><a href="http://pro.gigaom.com/2012/03/4-ipad-apps-to-help-wrangle-data/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=646412+tableau-prices-its-stock-at-31-per-share-for-fridays-ipo&utm_content=dharrisstructure">4 iPad apps to help wrangle data</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/05/16/tableau-prices-its-stock-at-31-per-share-for-fridays-ipo/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/05/products_desktop.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/05/products_desktop.jpg?w=150" medium="image">
			<media:title type="html">products_desktop</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/05/und-leadership-christian-small.jpg" medium="image">
			<media:title type="html">Christian Chabot</media:title>
		</media:content>
	</item>
	</channel>
</rss>