<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>GigaOM &#187; Genomics</title>
	<atom:link href="http://gigaom.com/tag/genomics/feed/" rel="self" type="application/rss+xml" />
	<link>http://gigaom.com</link>
	<description></description>
	<lastBuildDate>Wed, 19 Jun 2013 12:00:11 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='gigaom.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/0db8f6557d022075dbbf010c54d46d93?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>GigaOM &#187; Genomics</title>
		<link>http://gigaom.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://gigaom.com/osd.xml" title="GigaOM" />
	<atom:link rel='hub' href='http://gigaom.com/?pushpress=hub'/>
		<item>
		<title>How coding contests can be better at solving problems than Harvard</title>
		<link>http://gigaom.com/2013/03/20/how-coding-contests-can-be-better-at-solving-problems-than-harvard/</link>
		<comments>http://gigaom.com/2013/03/20/how-coding-contests-can-be-better-at-solving-problems-than-harvard/#comments</comments>
		<pubDate>Wed, 20 Mar 2013 18:57:34 +0000</pubDate>
		<dc:creator>Kevin Fitchard</dc:creator>
				<category><![CDATA[coding]]></category>
		<category><![CDATA[contests]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[Genomics]]></category>
		<category><![CDATA[Karim Lakhani]]></category>
		<category><![CDATA[problem solving]]></category>
		<category><![CDATA[Structure Data 2013]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=622488</guid>
		<description><![CDATA[Harvard recently threw a tough genomics problem to TopCoder's crowdsourced community and discovered the contest not only revealed a much broader field of investigation but provided a high level of motivation to get the problem solved.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=622488&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Harvard Business School and <a href="http://www.topcoder.com/">TopCoder</a> recently performed a study where they took a big genomics problem being worked on by Harvard Medical School, broke it into discreet abstract parts, and the threw the problem’s parts to the crowdsourced coding community to solve. What they gleaned was an interesting insight: the smartest guy in the room isn’t always your best problem solver.</p>
<p>Speaking at <a href="http://event.gigaom.com/structuredata/livestream/?utm_source=data&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=622488+how-coding-contests-can-be-better-at-solving-problems-than-harvard&amp;utm_content=kfitchard">GigaOM’s Structure:Data conference Wednesday</a>, Harvard Associate Professor Karim Lakhani said crowdsourcing the genomics project did several things. First and foremost it generated dozens of different approaches to tackling the same problem.</p>
<p>Before the TopCoder contest was created, researchers were considering two different paths of investigation. The contest revealed 89 differing approaches to the problem, 20 of which were extreme values — possibilities Harvard and its National Institutes of Health counterparts had never considered.</p>
<p>Second, the contest was able to create motivation to solve problems that you wouldn’t necessarily find in a group of researchers. An institution like Harvard may have brainpower in spades, Lakhani said, but throwing a bunch of geniuses at a problem doesn’t necessarily lead to result if they’re unmotivated to solve it.</p>
<p>“When you go into a self-selection model you don’t have to worry about motivation,” Lakhani said. There are monetary rewards for winning a TopCoder contest, of course, but taking home a prize is not guaranteed. Everyone has their own motivation for participating, whether it’s cash, experience, scoring reputation points or even the free T-shirts given to each participant. “Because there are large numbers of people participating, there’s a greater chance you’ll find the right skills and the right motivation,” Lakhani said.</p>
<p>Ultimately crowdsourcing your science isn’t a replacement for having smart people of your own, Lakhani said, but it certainly helps, something <a href="http://gigaom.com/2012/03/06/space-hackathon-coders-set-to-compete-on-nasadarpa-project/">Harvard’s counterparts at NASA have discovered</a>.</p>
<p>CHeck out <a href="http://gigaom.com/2013/03/20/structuredata-2013-live-coverage/">the rest of our Structure Data 2013 coverage here</a>, and a video embed of the session follows below:</p>
<p><iframe src="http://new.livestream.com/accounts/74987/events/1927733/videos/14311287/player?autoPlay=false&amp;height=360&amp;mute=false&amp;width=640" width="640" height="360" frameborder="0" scrolling="no"></iframe><br>
A transcription of the video follows on the next page</p>
<p><a href="http://gigaom.com/2013/03/20/how-coding-contests-can-be-better-at-solving-problems-than-harvard/2/">Go to page 2 (of 2) on GigaOM .</a></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=622488&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=828478"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=828478" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=622488+how-coding-contests-can-be-better-at-solving-problems-than-harvard&utm_content=kfitchard">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/11/sector-roadmap-crowd-labor-platforms-in-2012/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=622488+how-coding-contests-can-be-better-at-solving-problems-than-harvard&utm_content=kfitchard">Examining the rise of crowd labor platforms in 2012</a></li><li><a href="http://pro.gigaom.com/report/best-practices-in-optimizing-content-for-social-engagement/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=622488+how-coding-contests-can-be-better-at-solving-problems-than-harvard&utm_content=kfitchard">Best practices in optimizing content for social engagement</a></li><li><a href="http://pro.gigaom.com/2012/05/the-quantified-self-hacking-the-body-for-better-health-and-performance/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=622488+how-coding-contests-can-be-better-at-solving-problems-than-harvard&utm_content=kfitchard">The quantified self: hacking the body for better health</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/03/20/how-coding-contests-can-be-better-at-solving-problems-than-harvard/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/03/eqyssg-mcref5p0heer9lfrap8ow0ti82xlfnlsija8.jpeg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/03/eqyssg-mcref5p0heer9lfrap8ow0ti82xlfnlsija8.jpeg?w=150" medium="image">
			<media:title type="html">Mike Lydon TopCoder Karim Lakhani Harvard Business School Structure Data 2013</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/0544c4b228f8fa80e31bb952501cd7a4?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">kfitchard</media:title>
		</media:content>
	</item>
		<item>
		<title>Big data bioinformatics startup Spiral Genetics raises $3M</title>
		<link>http://gigaom.com/2013/03/12/big-data-bioinformatics-startup-spiral-genetics-raises-3m/</link>
		<comments>http://gigaom.com/2013/03/12/big-data-bioinformatics-startup-spiral-genetics-raises-3m/#comments</comments>
		<pubDate>Tue, 12 Mar 2013 16:00:53 +0000</pubDate>
		<dc:creator>Ki Mae Heussner</dc:creator>
				<category><![CDATA[bioinformatics tools]]></category>
		<category><![CDATA[bioniformatics]]></category>
		<category><![CDATA[Genomics]]></category>
		<category><![CDATA[health technology]]></category>
		<category><![CDATA[sequence data analysis]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=619596</guid>
		<description><![CDATA[Seattle-based Spiral Genetics has raised $3 million for its service that helps researchers in academia and industry more quickly analyze raw sequence data.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=619596&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.spiralgenetics.com">Spiral Genetics</a>, a Seattle-based startup that helps researchers and others quickly analyze DNA sequence data, has raised $3 million in its first institutional round of funding.</p>
<p>The Series A round was led by venture firm DFJ and brings the startup’s total amount raised to $3.7 million. With the new funding, Adina Mangubat, Spiral Genetics co-founder and CEO, said her eight-person team plans to expand product development, as well as sales and marketing.</p>
<p>Mangubat said that when she and one of her co-founders, Becky Drees, first looked at the field of genomics, their plan was to launch a consumer-focused genetic testing service like <a href="http://www.23andme.com">23andme</a>. But as that company started launching its services, they decided to switch tacks.</p>
<p>“We were looking at the industry and we wanted to do something really impactful that involved genomics and computing,” she said. When they realized the speed and volume with which raw sequence data was being generated, she said, they spotted an opportunity in offering high-performance bioinformatics tools for analyzing it.</p>
<p>Companies like <a href="https://dnanexus.com">DNANexus</a> also offer sequence analysis, and others might conduct the analysis in-house, but Mangubat said they envisioned a service that could shrink the turnaround time for researchers and others in industry deluged by data. Last month, Redwood City, Calif.-based <a href="http://www.binatechnologies.com/">Bina Technologies</a> announced the <a href="http://www.sfgate.com/business/prweb/article/Bina-Technologies-Launches-Platform-Today-4288691.php">commercial launch of its own genomic analysis platform</a> and <a href="http://gigaom.com/2012/04/30/straight-outta-stanford-bina-wants-to-remake-genome-analysis/">similarly touts a faster-than-ever service</a>.</p>
<p>Mangubat and Drees teamed up with their third co-founder Jeremy Bruestle and started building a computing platform specifically intended to solve this kind of big data problem. Now, the company says, it can analyze a whole human genome in 3 hours, which is about 40 times faster than what it might take others.</p>
<p>Spiral Genetics’ customers run the gamut from academic researchers to corporations, Mangubat said. For example, while some clients may use their bioinformatics tool to tackle childhood cancer, others in agrigenomics could use it to sequence different strains of corn.</p>
<p>Along with the new funding, Spiral Genetics announced a new partnership with <a href="http://www.omicia.com">Omicia</a>, an Emeryville, Calif.-based provider of clinical genome sequence interpretation tools.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=619596&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=77622"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=77622" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=619596+big-data-bioinformatics-startup-spiral-genetics-raises-3m&utm_content=kimaeheussner">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2011/12/whats-driving-the-next-phase-of-the-e-commerce-evolution/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=619596+big-data-bioinformatics-startup-spiral-genetics-raises-3m&utm_content=kimaeheussner">What&#8217;s driving the next phase of the e-commerce evolution</a></li><li><a href="http://pro.gigaom.com/report/survey-how-apps-can-solve-photo-management/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=619596+big-data-bioinformatics-startup-spiral-genetics-raises-3m&utm_content=kimaeheussner">Survey: How apps can solve photo management</a></li><li><a href="http://pro.gigaom.com/report/social-networks-will-displace-business-processes-not-socialize-them/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=619596+big-data-bioinformatics-startup-spiral-genetics-raises-3m&utm_content=kimaeheussner">Social networks will displace business processes, not socialize them</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/03/12/big-data-bioinformatics-startup-spiral-genetics-raises-3m/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/11/dnakit_illustration-300dpi-rgb.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/11/dnakit_illustration-300dpi-rgb.jpg?w=150" medium="image">
			<media:title type="html">Double Helix</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/7467db695203dccb9119d2430d0c5246?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">kimaeheussner</media:title>
		</media:content>
	</item>
		<item>
		<title>Bina launches box to analyze genomes; cloud on the way</title>
		<link>http://gigaom.com/2013/02/18/bina-launches-box-to-analyze-genomes-cloud-on-the-way/</link>
		<comments>http://gigaom.com/2013/02/18/bina-launches-box-to-analyze-genomes-cloud-on-the-way/#comments</comments>
		<pubDate>Tue, 19 Feb 2013 05:01:15 +0000</pubDate>
		<dc:creator>Jordan Novet</dc:creator>
				<category><![CDATA[big data analytics]]></category>
		<category><![CDATA[Bina Technologies]]></category>
		<category><![CDATA[Genomics]]></category>
		<category><![CDATA[on-premise hardware]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=611236</guid>
		<description><![CDATA[New hardware from Bina Technologies gives analysts another example of a use case in which the public cloud isn't always the most appropriate solution.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=611236&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://binatechnologies.com/">Bina Technologies</a> is launching its Bina Box for on-premise genome processing, enabling researchers to quickly and cheaply analyze genomes and give doctors data-driven suggestions for custom treatments. </p>
<p>Use a genome sequencer to see one person’s DNA profile, and you’ll get 6 billion unique characters, or half a terabyte of data, said Bina co-founder and CEO Narges Bani Asadi. Start processing it to find mutations and variations, and you’ll find yourself with more than one terabyte. It’s not small data. As the price of sequencing a genome keeps dropping, scientists will want to do this more and more. It’s a big data problem, Bani Asadi said. The company wants to solve the problem on premises, with hardware and software.</p>
<p>The Bina Box will run on “high-end Intel processors and very high-bandwidth memory,” Bani Asadi said, and can scale out with additional Bina Boxes as customers processing needs change. Price depends on how much processing customers have in mind. If a customer wants to process 100 samples a month, for instance, it would cost $12,500 per month, or $125 per sample, said Mark Sutherland, Bina’s senior vice president of business development.</p>
<p>A Bina Cloud to tie in with the Bina Box will come later this year. The Bina Cloud will host just the needle of genomic data isolated from among the haystack of the entire genome, and it will enable scientists to aggregate many genomes, run data visualizations and collaborate to derive big-picture insights. Early customers are already using a pilot version of the cloud.</p>
<p>The box offering contributes more proof of the notion that, for certain uses, public clouds might not make sense, not yet anyway. (It remains a largely popular perspective in financial services, as my colleague Barb Darrow <a href="http://gigaom.com/2012/12/24/financial-services-and-the-public-cloud-go-or-no-go/">reported</a> a couple of months ago.) The Bina Box, for its part, “provides security that on-premise solutions have, versus cloud solutions, (which) sometimes people in this industry are not completely ready to move into,” Bani Asadi said. Big pharmaceutical companies are a perfect example, as a breach could hamper product development using genomes. Aside from security, there’s the matter of performance. “It’s impossible to send (half a terabyte of raw data from a sequencer) to the cloud easily,” Bani Asadi said.</p>
<p>Meanwhile, other genomics-focused startups, including <a href="http://gigaom.com/2011/10/12/dnanexus-cloudant-biotech-deals/">DNAnexus</a> and <a href="http://gigaom.com/2012/03/22/appistry-structure-data-2012/">Appistry</a>, are eschewing hardware and relying exclusively on cloud resources. </p>
<p>Whether hardware is involved or not, as my colleague Derrick Harris <a href="http://gigaom.com/2012/04/30/straight-outta-stanford-bina-wants-to-remake-genome-analysis/">mentioned</a> when he wrote about Bina last year, it’s clear that the rise of big genomics inherently equates to a rise in data. </p>
<p>The practice of merging life sciences and other industries with big data will come up in conversation when Ayasdi CEO Gurjeet Singh hits the stage at <a href="http://event.gigaom.com/structuredata/?utm_source=data&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=611236+bina-launches-box-to-analyze-genomes-cloud-on-the-way&amp;utm_content=gigajordan">GigaOM’s Structure:Data conference</a> on March 20 in New York.</p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=611236&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=316224"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=316224" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=611236+bina-launches-box-to-analyze-genomes-cloud-on-the-way&utm_content=gigajordan">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/sector-roadmap-social-customer-service-in-2013/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=611236+bina-launches-box-to-analyze-genomes-cloud-on-the-way&utm_content=gigajordan">Sector RoadMap: Social customer service in 2013</a></li><li><a href="http://pro.gigaom.com/2012/12/sector-roadmap-health-care-and-big-data-in-2012/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=611236+bina-launches-box-to-analyze-genomes-cloud-on-the-way&utm_content=gigajordan">Health care and big data in 2012</a></li><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=611236+bina-launches-box-to-analyze-genomes-cloud-on-the-way&utm_content=gigajordan">The importance of putting the U and I in visualization</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/02/18/bina-launches-box-to-analyze-genomes-cloud-on-the-way/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/02/bina-111.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/02/bina-111.jpg?w=150" medium="image">
			<media:title type="html">Bina 1[1][1]</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/c00ab753df107b639e76ed4c3ab07ba7?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">gigajordan</media:title>
		</media:content>
	</item>
		<item>
		<title>Biotech startup Syapse wants to be Salesforce.com for our genomes</title>
		<link>http://gigaom.com/2013/01/22/biotech-startup-syapse-wants-to-be-salesforce-com-for-our-genomes/</link>
		<comments>http://gigaom.com/2013/01/22/biotech-startup-syapse-wants-to-be-salesforce-com-for-our-genomes/#comments</comments>
		<pubDate>Tue, 22 Jan 2013 15:45:32 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data. graph processing]]></category>
		<category><![CDATA[biotechnology]]></category>
		<category><![CDATA[Genomics]]></category>
		<category><![CDATA[graph databases]]></category>
		<category><![CDATA[omics]]></category>
		<category><![CDATA[Syapse]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=603080</guid>
		<description><![CDATA[A startup called Syapse is trying to bring the world of "omics" -- the study of all our genomes, biomes, proteomes and other "omes" -- under control with a new data management platform based on some of the general techniques that also power Facebook's Graph Search.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=603080&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Think about Facebook’s <a href="http://gigaom.com/2013/01/15/a-really-tiny-explanation-of-how-facebooks-graph-search-works/">new Graph Search feature</a> — only infinitely more complex — and you have a rough understanding of what Palo Alto, Calif.-based startup <a href="http://www.syapse.com/">Syapse</a> is trying to do. The company, which on Tuesday announced $3 million in Series A led by The Social+Capital Partnership (and previously raised a $1.6 million seed round), is building a data-management platform designed to let researchers and physicians easily pore through mountains of complicated molecular data in order to better diagnose a whole range of potential illnesses.</p>
<p>But to understand how Syapse works, you have to understand the problem it’s trying to solve. A condensed version of the situation is this: Sequencing genomes, proteomes, biomes and other microscopic, but very important, biological players generates a lot of data. However, we’re not just talking about the <a href="http://gigaom.com/2012/01/23/as-genomics-pushes-big-data-limits-cloud-could-save-the-day/">terabytes of data that a fully sequenced genome</a> (or perhaps the <a href="http://www.newyorker.com/reporting/2012/10/22/121022fa_fact_specter">tens of thousands sequenced gut bacteria</a>, which can change composition hourly) will produce, but also patient data (e.g., name, date of birth, smoker or non-smoker, etc.) and process data (i.e., everything that happens from the time a lab gets a sample to the time a doctor gets a report on his desk).</p>
<p><a href="http://gigaom2.files.wordpress.com/2013/01/shutterstock_67144993.jpg"><img alt="lab worker" src="http://gigaom2.files.wordpress.com/2013/01/shutterstock_67144993.jpg?w=300&#038;h=199" width="300" height="199" class="alignleft size-medium wp-image-603147"></a>The complexity and perpetually changing nature of both the field of <a href="http://en.wikipedia.org/wiki/Omics">“omics”</a> as it’s called, and the data itself, further complicates things. According to Syapse Co-founder and President Jonathan Hirsch, diagnostics labs and workers are always using new and different processes trying to optimally extract, tag and analyze samples. Furthermore, expert knowledge of what any particular genetic or other signature means is always changing (for example, Hirsch said, we only really understand about 1 percent of the human genome), as are the <a href="http://www.openclinical.org/ontologies.html">ontologies</a> that lab workers, researchers and physician specialists use as their particular fields evolve.</p>
<p>“There is basically a wholes set of measurements that go beyond just sequencing the genome,” he explained. Analyzing genomes, proteomes and anything else is “like a very, very complicated recipe” that involves much more than swabbing someone’s cheek and getting back a comprehensive, understandable report. Syapse doesn’t actually do any of the sequencing work (like a <a href="http://gigaom.com/2011/10/12/dnanexus-cloudant-biotech-deals/">DNAnexus</a> or <a href="http://gigaom.com/2012/04/30/straight-outta-stanford-bina-wants-to-remake-genome-analysis/">Bina Technologies</a> does,) but just captures the metadata from those lab processes and connects to those hefty sequenced data via an API so the platform has access to everything it needs.</p>
<h2 id="organizing-complex-data-requir">Organizing complex data requires a graph</h2>
<p>Using semantic-analysis and graph-processing techniques, Syapse thinks it can bring the <a href="http://www.nytimes.com/2012/06/19/science/studies-of-human-microbiome-yield-new-insights.html?pagewanted=all&amp;_r=1">world of “omics”</a> under control. Although it’s currently working with research centers that analyze the data in order to better hone their processes, Hirsch expects the company will eventually make most of its money from doctors and hospitals using Syapse to help better diagnose their patients. “[We're] trying to fill the gap and be the company that cracks the physician side of this,” he said.</p>
<p>This is where the Graph Search comparison comes into play. The Syapse platform is continuously updated with the latest ontologies from various fields and the changing meanings of the metadata associated with the various lab processes. All this information is stored based on its relationship to other pieces, and semantic analysis means the Syapse software knows that Term X in one field might actually mean Term Y in another.</p>
<p>Syapse has essentially created a “huge <a href="http://gigaom.com/2012/08/08/for-google-keeping-search-relevant-means-baking-big-data-into-everything/">knowledge graph</a>” of clinical, diagnosis and omics data, Hirsch explained, and doctors and researchers can mine it using whatever terms they use in their daily lives. They can easily search, for example, by patients they’ve treated for breast cancer whose genes showed certain specific markers and were processed using particular techniques in the lab in order to find connections among them.</p>
<p>Syapse Co-founder and CEO Glenn Winokur — an admitted “IT guy” compared with his biotech-focused partners — likes to put the platform’s promise in the terms of business software. ”Think of this entire workflow as similar to a sales or marketing workflow,” he said, adding that Syapse is trying to make mining omics data as simple for its users as Salesforce.com makes CRM for its users.</p>
<p>That’s probably a good analogy for selling the software to hospital administrators who might be more concerned with budgets than with big data technology. As we’ll discuss in more detail at our <a href="http://event.gigaom.com/structuredata/?utm_source=data&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=603080+biotech-startup-syapse-wants-to-be-salesforce-com-for-our-genomes&amp;utm_content=dharrisstructure">Structure: Data conference</a> on March 20-21, business people are increasingly concerned with using data to make better decisions, but they need applications that make it easier and faster to find stuff out than is possible with many open source packages targeting engineers and statisticians. If Syapse can deliver on this promise for making sense of our complex biological systems, it could make a big difference.</p>
<p><em>Feature image courtesy of <a href="http://www.shutterstock.com/gallery-168430p1.html">Shutterstock user kentoh</a>; lab image courtesy of <a href="http://www.shutterstock.com/gallery-332422p1.html">Shutterstock user VILevi</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=603080&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=489979"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=489979" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=603080+biotech-startup-syapse-wants-to-be-salesforce-com-for-our-genomes&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=603080+biotech-startup-syapse-wants-to-be-salesforce-com-for-our-genomes&utm_content=dharrisstructure">The importance of putting the U and I in visualization</a></li><li><a href="http://pro.gigaom.com/2012/04/aws-storage-gateway-jolts-cloud-storage-ecosystem/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=603080+biotech-startup-syapse-wants-to-be-salesforce-com-for-our-genomes&utm_content=dharrisstructure">AWS Storage Gateway jolts cloud-storage ecosystem</a></li><li><a href="http://pro.gigaom.com/2010/12/9-companies-that-pushed-the-infrastructure-discussion-in-2010/?utm_source=data&utm_medium=editorial&utm_campaign=auto3&utm_term=603080+biotech-startup-syapse-wants-to-be-salesforce-com-for-our-genomes&utm_content=dharrisstructure">9 Companies that Pushed the Infrastructure Discussion in 2010</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2013/01/22/biotech-startup-syapse-wants-to-be-salesforce-com-for-our-genomes/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2013/01/shutterstock_52453933.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2013/01/shutterstock_52453933.jpg?w=150" medium="image">
			<media:title type="html">genome</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2013/01/shutterstock_67144993.jpg?w=300" medium="image">
			<media:title type="html">lab worker</media:title>
		</media:content>
	</item>
		<item>
		<title>Scientists show what 100M computing hours on Google&#8217;s cloud can do</title>
		<link>http://gigaom.com/2012/12/17/scientists-show-what-100m-computing-hours-on-googles-cloud-can-do/</link>
		<comments>http://gigaom.com/2012/12/17/scientists-show-what-100m-computing-hours-on-googles-cloud-can-do/#comments</comments>
		<pubDate>Mon, 17 Dec 2012 18:35:12 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[Amazon Web Services]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Genomics]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Microsoft Windows Azure]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[supercomputing]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=595059</guid>
		<description><![CDATA[In the latest case of researchers using the cloud for good, Google is highlighting the six projects to which it awarded grants via its Exacycle for Visiting Faculty program. Ranging from genomic research to astronomy, the researchers received 100 million computing hours apiece.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=595059&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Deep in the bowels of Google&#8217;s offices in Mountain View, Calif., and Seattle, a group of researchers has been consuming an incredible amount of computing resources trying to make scientific discoveries that they hope will help change the world.</p>
<p>On Monday, Google <a href="http://googleresearch.blogspot.com/2012/12/millions-of-core-hours-awarded-to.html">announced on its Research Blog</a> the six projects which it granted 100 million core-computing hours apiece as part of the Google Exacycle for Visiting Faculty program last year. The projects, most of which are led by university researchers (and one by a Google researcher), tackle a variety of pharmaceutical and biological challenges, as well as the problem of analyzing petabytes of data generated by a forthcoming astronomical survey project. The latter hopes to &#8220;[reduce] the time required to simulate one night of [Large Synoptic Survey Telescope] observing, roughly 5 million images, from 3 months down to a few days.&#8221;</p>
<p>Although these projects require a lot of horsepower to tackle enormous datasets and tough compute problems, Google&#8217;s Exacycle program is hardly the first attempt to use to cloud for good. In 2010, for example, Microsoft <a href="http://www.microsoft.com/en-us/news/press/2010/nov10/11-16msblastcloudpr.aspx">launched a version of the NCBI BLAST data-analysis tool</a> on its Windows Azure cloud and gave away free access to researchers. Amazon Web Services also <a href="http://aws.amazon.com/grants/">hosts its own research grant program</a> that awards access to its cloud computing infrastructure. Both companies also host large genomic and other datasets that are available to anyone using their services.</p>
<p>Free or not, though, cloud computing has proven a boon for researchers hungry for computing power but not keen on waiting in line for supercomputer resources or sending massive data across the network. Google <a href="http://gigaom.com/cloud/what-google-compute-engine-means-for-cloud-computing/">pimped its Compute Engine cloud as a scientist&#8217;s best friend</a> when launching the service in June, and AWS recently <a href="http://gigaom.com/cloud/gene-research-in-the-cloud-could-help-cure-diseases-in-the-lab/">highlighted how one customer built an 8,000-core system</a> that churned out 115 years of work in just one week.</p>
<p><em>Feature image courtesy of <a href="http://www.shutterstock.com/gallery-363508p1.html">Shutterstock user Sashkin</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=595059&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=294333"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=294333" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=595059+scientists-show-what-100m-computing-hours-on-googles-cloud-can-do&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/06/cloud-computing-infrastructure-2012-and-beyond/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=595059+scientists-show-what-100m-computing-hours-on-googles-cloud-can-do&utm_content=dharrisstructure">Cloud computing infrastructure: 2012 and beyond</a></li><li><a href="http://pro.gigaom.com/2010/12/9-companies-that-pushed-the-infrastructure-discussion-in-2010/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=595059+scientists-show-what-100m-computing-hours-on-googles-cloud-can-do&utm_content=dharrisstructure">9 Companies that Pushed the Infrastructure Discussion in 2010</a></li><li><a href="http://pro.gigaom.com/report/cloud-and-data-first-quarter-2013-analysis-and-outlook/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=595059+scientists-show-what-100m-computing-hours-on-googles-cloud-can-do&utm_content=dharrisstructure">Cloud and data first-quarter 2013: analysis and outlook</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/12/17/scientists-show-what-100m-computing-hours-on-googles-cloud-can-do/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/12/shutterstock_114059095.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/12/shutterstock_114059095.jpg?w=150" medium="image">
			<media:title type="html">servers</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>
	</item>
		<item>
		<title>Better medicine, brought to you by big data</title>
		<link>http://gigaom.com/2012/07/15/better-medicine-brought-to-you-by-big-data/</link>
		<comments>http://gigaom.com/2012/07/15/better-medicine-brought-to-you-by-big-data/#comments</comments>
		<pubDate>Sun, 15 Jul 2012 13:00:53 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[appistry]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[DNAnexus]]></category>
		<category><![CDATA[Genomics]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[IBM]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=542509</guid>
		<description><![CDATA[Slowly but surely, health care is becoming a killer app for big data. Whether it's Hadoop, machine learning or natural-language processing, folks in the worlds of medicine and hospital administration understand that data is the key to helping them take their fields to the next level.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=542509&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom2.files.wordpress.com/2012/07/stethoscope.jpg"><img  title="stethoscope" src="http://gigaom2.files.wordpress.com/2012/07/stethoscope.jpg?w=300&#038;h=200" alt="" width="300" height="200" class="alignleft size-medium wp-image-542631" /></a>Slowly but surely, <a href="http://gigaom.com/2012/03/09/healthcare-needs-a-big-data-infusion/">health care is becoming a killer app for big data</a>. Whether it&#8217;s Hadoop, machine learning, natural-language processing or some other technique, folks in the worlds of medicine and hospital administration understand that new types of data analysis are the key to helping them take their fields to the next level.</p>
<p>Here are some of the interesting use cases we&#8217;ve written about over the past year or so, and a few others I&#8217;ve just come across recently. If you have a cool one &#8212; or a suggestion for a new use of big data within the healthcare space &#8212; share it in the comments:</p>
<ul>
<li><strong>Genomics.</strong> This is the epitomic case for big data and health care. Genome sequencing is <a href="http://gigaom.com/cloud/as-genomics-pushes-big-data-limits-cloud-could-save-the-day/">getting cheaper by the day</a> and produces mountains of data. Companies such as <a href="http://gigaom.com/2012/03/22/dnanexus-structure-data-2012/">DNAnexus</a>, <a href="http://gigaom.com/cloud/straight-outta-stanford-bina-wants-to-remake-genome-analysis/">Bina Technologies</a>, <a href="http://gigaom.com/2012/03/22/appistry-structure-data-2012/">Appistry</a> and <a href="http://www.nextbio.com/b/nextbioCorp.nb">NextBio</a> want to make analyzing that data to discover cures for diseases faster, easier and cheaper than ever using lots cutting-edge algorithms and lots of cloud computing cores. Dell is <a href="http://content.dell.com/us/en/gen/d/corp-comm/pediatric-cancer">providing computing power for two research centers</a> to try and treat a particular form of pediatric cancer based on each child&#8217;s specific genetic profile.</li>
</ul>
<div><a href="http://gigaom2.files.wordpress.com/2012/07/pediatric-cancer-infographi.jpeg"><img  title="pediatric-cancer-infographi" src="http://gigaom2.files.wordpress.com/2012/07/pediatric-cancer-infographi.jpeg?w=708" alt=""   class="aligncenter size-full wp-image-542674" /></a></div>
<ul>
<li><strong>BI for doctors. </strong>Doctors and staff at Seattle Children&#8217;s Hospital are <a href="http://gigaom.com/cloud/data-for-doctors-big-data-meets-a-big-business/">using Tableau to analyze and visualize terabytes of data</a> dispersed across the institution&#8217;s servers and databases. Not only does visualizing the data help reduce medical errors and help the hospital plan trials but, as of this time last year, its focus on data had saved the hospital $3 million on supply chain costs.</li>
</ul>
<div><a href="http://gigaom2.files.wordpress.com/2012/07/health-costs.jpg"><img  title="health costs" src="http://gigaom2.files.wordpress.com/2012/07/health-costs.jpg?w=708" alt=""   class="aligncenter size-full wp-image-542634" /></a></div>
<ul>
<li><strong>Semantic search. </strong>Imagine you&#8217;re a doctor trying to learn about a new patient or figure out who among your patients might benenfit from a new technique. But patient records have been scattered throughout departments, vary in format and, perhaps worst of all, all use the ontologies of the department that created the record. A startup called Apixio is trying to fix this by <a href="http://gigaom.com/cloud/apixio-is-bringing-big-data-to-medical-records-in-the-cloud/">centralizing records in the cloud and applying semantic analysis</a> to uncover everything doctors need, regardless who wrote it.</li>
</ul>
<div><a href="http://gigaom2.files.wordpress.com/2012/07/mine3-officenotes-semantic-smaller.jpg"><img  title="mine3-officenotes-semantic-smaller" src="http://gigaom2.files.wordpress.com/2012/07/mine3-officenotes-semantic-smaller.jpg?w=708" alt=""   class="aligncenter size-full wp-image-542675" /></a></div>
<ul>
<li><strong>Hadoop for everything.</strong> Cloudera is <a href="http://gigaom.com/cloud/hadoop-meets-healthcare-in-new-partnership/">partnering with the Mount Sinai School of Medicine</a> to help it develop new methods and systems for analyzing biological data. But that&#8217;s just the latest of Cloudera medical efforts, which also include working with the Food and Drug Administration to detect unsuspected adverse side effects from multi-drug combinations, and Emory University on helping pathologists more accurately analyze medical images. One of Cloudera&#8217;s customers, <a href="https://www.explorys.com">Explorys</a>, built a business around aggregating and analyzing medical records, and Intel and NextBio are <a href="http://www.genomeweb.com/blog/intel%E2%80%94nextbio-collaboration-aims-perfect-hadoop-genomics">teaming to tune Hadoop for processing genomic datasets</a>.</li>
</ul>
<div><a href="http://gigaom2.files.wordpress.com/2012/07/datagrid.jpg"><img  title="datagrid" src="http://gigaom2.files.wordpress.com/2012/07/datagrid.jpg?w=708" alt=""   class="aligncenter size-full wp-image-542676" /></a></div>
<ul>
<li><strong>Watson. </strong>IBM has dozens of irons in the healthcare fire, but its coolest might well be <a href="http://www-03.ibm.com/press/us/en/pressrelease/35402.wss">a partnership with WellPoint</a> to put the <em>Jeopardy!</em> champion question-answering system in doctors&#8217; offices. Watson could help doctors answer questions posed in natural language by analyzing them against mountains of medical research data that no individual doctor could possibly read and digest.</li>
</ul>
<div><a href="http://gigaom2.files.wordpress.com/2012/07/watsonpower7.jpeg"><img  title="WatsonPower7" src="http://gigaom2.files.wordpress.com/2012/07/watsonpower7.jpeg?w=708" alt=""   class="aligncenter size-full wp-image-542635" /></a></div>
<ul>
<li><strong>Getting ahead of disease. </strong>It&#8217;s always good if you figure out how to diagnose diseases early without expensive tests, and that&#8217;s<a href="http://gigaom.com/cloud/the-biggest-obstacle-to-embracing-big-data-you/"> just what Seton Healthcare was able to do</a> thanks to its big data efforts. Trying to find better ways to detect congestive heart failure early in order to save the exorbitant costs of treatment as the disease progresses, a team found that a distended jugular vein — something that can be spotted during any routine physical exam — is a particularly high risk factor.</li>
</ul>
<div><a href="http://gigaom2.files.wordpress.com/2012/07/shutterstock_68642137-1.jpg"><img  title="shutterstock_68642137 (1)" src="http://gigaom2.files.wordpress.com/2012/07/shutterstock_68642137-1.jpg?w=708" alt=""   class="aligncenter size-full wp-image-542678" /></a></div>
<ul>
<li><strong>Data scientist in residence. </strong>Here&#8217;s a new title for a healthcare organization &#8212; chief data scientist. Yet, that&#8217;s <a href="http://www.marketwatch.com/story/alliance-health-networks-hires-data-analytics-expert-deep-dhillon-as-new-chief-data-scientist-2012-05-30">exactly the position Alliance Health Networks just added in May</a>. The company, which provides social networks focused on specific medical conditions, acquired medical research database <a href="https://www.medify.com/">Medify</a> and decided it needed someone to lead the effort of analyzing all that data and providing valuable feedback to network users.</li>
</ul>
<div><a href="http://gigaom2.files.wordpress.com/2012/07/medify.jpg"><img  title="medify" src="http://gigaom2.files.wordpress.com/2012/07/medify.jpg?w=708" alt=""   class="aligncenter size-full wp-image-542679" /></a></div>
<ul>
<li><strong>Crowdsourced science. </strong>In a field where controlled experiments can be expensive and sometimes ineffective, it&#8217;s turning out there might be no substitute like the real-world data. Probably the most widely known company in this space is <a href="http://www.patientslikeme.com/">PatientsLikeMe</a>, a social network designed to let individuals share their medical conditions so they can learn from others like themselves what treatments might work best in their particular circumstances. As a side effect, the company is able to conduct observational trials based on data users willingly volunteer.</li>
</ul>
<div><a href="http://gigaom2.files.wordpress.com/2012/07/patients.jpg"><img  title="patients" src="http://gigaom2.files.wordpress.com/2012/07/patients.jpg?w=708" alt=""   class="aligncenter size-full wp-image-542633" /></a></div>
<p><em>Feature image courtesy of <a href="http://www.shutterstock.com/gallery-437830p1.html">Shutterstock user lenetstan</a>; Tableau graph <a href="http://www.perceptualedge.com/articles/visual_business_intelligence/information_visualization_and_art.pdf">courtesy of Perceptual Edge</a>; exam image courtesy of <a href="http://www.shutterstock.com/gallery-185902p1.html">Shutterstock user Blaj Gabriel</a></em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=542509&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=175332"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=175332" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=542509+better-medicine-brought-to-you-by-big-data&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=542509+better-medicine-brought-to-you-by-big-data&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2012/01/infrastructure-q4-big-data-gets-bigger-and-saas-startups-shine/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=542509+better-medicine-brought-to-you-by-big-data&utm_content=dharrisstructure">Infrastructure Q4: Big data gets bigger and SaaS startups shine</a></li><li><a href="http://pro.gigaom.com/2011/03/defining-hadoop-the-players-technologies-and-challenges-of-2011/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=542509+better-medicine-brought-to-you-by-big-data&utm_content=dharrisstructure">Defining Hadoop: the Players, Technologies and Challenges of 2011</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/07/15/better-medicine-brought-to-you-by-big-data/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/07/stethoscope.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/07/stethoscope.jpg?w=150" medium="image">
			<media:title type="html">stethoscope</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/07/stethoscope.jpg?w=300" medium="image">
			<media:title type="html">stethoscope</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/07/pediatric-cancer-infographi.jpeg" medium="image">
			<media:title type="html">pediatric-cancer-infographi</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/07/health-costs.jpg" medium="image">
			<media:title type="html">health costs</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/07/mine3-officenotes-semantic-smaller.jpg" medium="image">
			<media:title type="html">mine3-officenotes-semantic-smaller</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/07/datagrid.jpg" medium="image">
			<media:title type="html">datagrid</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/07/watsonpower7.jpeg" medium="image">
			<media:title type="html">WatsonPower7</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/07/shutterstock_68642137-1.jpg" medium="image">
			<media:title type="html">shutterstock_68642137 (1)</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/07/medify.jpg" medium="image">
			<media:title type="html">medify</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/07/patients.jpg" medium="image">
			<media:title type="html">patients</media:title>
		</media:content>
	</item>
		<item>
		<title>Hadoop meets healthcare in new partnership</title>
		<link>http://gigaom.com/2012/07/03/hadoop-meets-healthcare-in-new-partnership/</link>
		<comments>http://gigaom.com/2012/07/03/hadoop-meets-healthcare-in-new-partnership/#comments</comments>
		<pubDate>Tue, 03 Jul 2012 21:45:08 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[big data]]></category>
		<category><![CDATA[Cloudera]]></category>
		<category><![CDATA[Genomics]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[healthcare]]></category>
		<category><![CDATA[medical research]]></category>
		<category><![CDATA[medicine]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=539447</guid>
		<description><![CDATA[The Mount Sinai School of Medicine is about to get a lesson in big data thanks to Cloudera Chief Scientist Jeff Hammerbacher. He's partnering with the school's clinical and academic faculty to develop new methods and systems for analyzing data in a variety of important areas. <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=539447&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom2.files.wordpress.com/2012/07/medical-research.jpg"><img  title="medical research" src="http://gigaom2.files.wordpress.com/2012/07/medical-research.jpg?w=300&#038;h=200" alt="" width="300" height="200" class="alignleft size-medium wp-image-539453" /></a>The <a href="http://www.mssm.edu/research/institutes/genomics-institute">Mount Sinai School of Medicine</a> is about to get a lesson in big data thanks to Cloudera Chief Scientist Jeff Hammerbacher. He will be partnering with the school&#8217;s clinical and academic faculty to develop new methods and systems for analyzing data in a variety of important areas, including genomics and multiscale biology. As we&#8217;ve pointed out before, the advent of big data and data science (of which Hadoop is just one piece) is <a href="http://gigaom.com/2012/03/22/dnanexus-structure-data-2012/">having significant effects in the world of medical research</a>, as well as <a href="http://gigaom.com/2011/11/08/for-science-big-data-is-the-microscope-of-the-21st-century/">in the world of science, generally</a> .</p>
<p>Starting at about 15:50 in the video below, Hammerbacher tells a room at the HISUM2012 conference about some of the work Cloudera has already done in the healthcare space. Among those efforts was a project along with the Food and Drug Administration to detect unsuspected adverse side effects from multi-drug combinations, and an application developed with Emory University to help pathologists more accurately analyze medical images.</p>
<p><iframe src="http://www.youtube.com/embed/2KdiJOyb4GY" frameborder="0" width="560" height="315"></iframe></p>
<p><em>Feature image courtesy of <a href="http://www.shutterstock.com/gallery-350317p1.html">Shutterstock user 18percentgrey</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=539447&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=17691"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=17691" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=539447+hadoop-meets-healthcare-in-new-partnership&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/03/a-near-term-outlook-for-big-data/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=539447+hadoop-meets-healthcare-in-new-partnership&utm_content=dharrisstructure">A near-term outlook for big data</a></li><li><a href="http://pro.gigaom.com/2010/12/9-companies-that-pushed-the-infrastructure-discussion-in-2010/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=539447+hadoop-meets-healthcare-in-new-partnership&utm_content=dharrisstructure">9 Companies that Pushed the Infrastructure Discussion in 2010</a></li><li><a href="http://pro.gigaom.com/report/cloud-and-data-first-quarter-2013-analysis-and-outlook/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=539447+hadoop-meets-healthcare-in-new-partnership&utm_content=dharrisstructure">Cloud and data first-quarter 2013: analysis and outlook</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/07/03/hadoop-meets-healthcare-in-new-partnership/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/07/medical-research.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/07/medical-research.jpg?w=150" medium="image">
			<media:title type="html">medical research</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/07/medical-research.jpg?w=300" medium="image">
			<media:title type="html">medical research</media:title>
		</media:content>
	</item>
		<item>
		<title>Preventing counterfeits with an iPhone and digital DNA</title>
		<link>http://gigaom.com/2012/05/10/preventing-counterfeits-with-an-iphone-a-qr-code-and-digital-dna/</link>
		<comments>http://gigaom.com/2012/05/10/preventing-counterfeits-with-an-iphone-a-qr-code-and-digital-dna/#comments</comments>
		<pubDate>Thu, 10 May 2012 21:55:42 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[Apple]]></category>
		<category><![CDATA[Applied DNA Sciences]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[DNA]]></category>
		<category><![CDATA[Genomics]]></category>
		<category><![CDATA[iPhone]]></category>
		<category><![CDATA[Mobile Apps]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=520282</guid>
		<description><![CDATA[Applied DNA Sciences thinks it has created the perfect tool for identifying attempts to counterfeit or steal goods along the supply chain. It's mobile meets cloud computing meets big data, and it begins with QR codes that mimic physical DNA signatures. <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=520282&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom2.files.wordpress.com/2012/05/digital-dna_-process1.jpg"><img title="Abstract DNA" src="http://gigaom2.files.wordpress.com/2012/05/digital-dna_-process1.jpg?w=300&#038;h=186" alt="" width="300" height="186" class="alignleft size-medium wp-image-520374"></a> If you’re looking for a foolproof way to secure your supply chain and prevent the spread of counterfeit goods, <a href="http://www.adnas.com/">Applied DNA Sciences</a> (ADNAS) thinks it has created just the tool. Its new product, called digitalDNA, creates unique plant-based DNA signatures that are encrypted onto QR codes readable by an iPhone app. When phones scan the code, data is analyzed by a cloud database to identify possible theft or counterfeiting. It’s mobile meets cloud computing meets big data, with genomics as glue holding them all together.</p>
<p>In order to understand digitalDNA, though, you first must be familiar with ADNAS’s core technology. Its flagship product, called <a href="http://www.adnas.com/signature-dna-marking-authentication-for-anticounterfeiting-diversion-security">SigNature DNA</a>, takes specially created, double-stranded DNA signatures derived from plant DNA and combines them in solution made out of ink or some other material. That solution can be applied directly to a product — anything from textiles to microchips to documents — or applied to an invisible bar code that can be read by scanners capable of detecting the DNA strand. Marks can also be swabbed and sent to ADNAS for verification.</p>
<p>Companies using SigNature can verify the authenticity of shipments by scanning the products they receive. If the products aren’t legit, businesses don’t accept them and, presumably, an investigation ensures. Presently, Miller said, this process is unreproducible, meaning would-be counterfeiters can’t one-up ADNAS customers by replicating their authentication method as well as the product itself. In January, <em>Wired</em> published an article about how the U.S. Department of Defense is <a href="http://www.wired.com/dangerroom/2012/01/dna-counterfeits/">using SigNature to detect bogus microchips in military equipment</a>.</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/05/signature_detect1.jpg"><img title="signature_detect(1)" src="http://gigaom2.files.wordpress.com/2012/05/signature_detect1.jpg?w=708" alt=""   class="aligncenter size-full wp-image-520371"></a></p>
<p>Aside from simply stopping counterfeiting activity, though, SigNature is also used to prosecute criminals because the DNA markers are all-but irrefutable evidence (the false positive rate is 1 in a trillion) that someone is in possession of stolen goods. In the United Kingdom, Miller told me, more than a quarter of all cash in banks is marked with using SigNature in order to catch criminals who steal it from transporters such as ADNAS customer Loomis. ADNAS also sells products that pre-mark certain items in order to transfer DNA to thieves, or that spray fleeing intruders with DNA.</p>
<p>Another company, called <a href="http://www.dnatechnologies.com/">DNA Technologies</a>, claims to use a similar method for anti-counterfeiting and actually tagged the footballs to be used in Super Bowl XLVI. Unlike RFID tags, DNA marks can be placed even on small individual objects, or incorporated into them in the case of clothing, for example, and cannot be easily removed.</p>
<h2>Turning DNA into QR codes</h2>
<p>The <a href="http://www.adnas.com/DigitalDNA-crowd-sourced-anti-counterfeiting-data">new digitalDNA product</a> takes SigNature to the next level by tying it to cloud computing, big data and mobile phones. The unique DNA signature still exists on the physical QR code applied to packages, but it has also been digitally encrypted onto to a 2-dimensional QR code in a way ADNAS claims is not copyable. As packages move along the supply chain, employees equipped with iPhones and the ADNAS app can scan products to chart their progress and verify authenticity. But that’s just the beginning.</p>
<p>With every scan, information is also sent to a cloud-based database where it’s stored and analyzed a set of algorithms specially designed to identify patterns associated with counterfeiting or theft. If something pops up, companies can be proactive in trying to determine the problem or take measures to prevent a crime. And even if there isn’t nefarious activity taking place, digitalDNA users can still use the geospatial data they’re generating to get a better handle on their supply chain dynamics.</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/05/dd-process4.jpg"><img title="dd process4" src="http://gigaom2.files.wordpress.com/2012/05/dd-process4.jpg?w=708" alt=""   class="aligncenter size-full wp-image-520375"></a></p>
<p>Looking to the future, Miller said ADNAS is also experimenting with methods for using the ubiquity of iPhones to bring consumers and retail outlets into the fold. That could mean anything from scanning the DNA-based QR code to ensure the freshness of a product to helping stores identify sales trends. Admittedly, though, those uses are a while out and would require cooperation from ADNAS’s customers, which are the ones dealing directly with resellers and consumers. Presumably, the DNA-based QR codes could provide more granular data because they’re tied to<em> </em>individual units of products.</p>
<p>However digitalDNA usage evolves, even if it never really takes off, the high-level concept behind the product is sound. As we’ll discuss in numerous sessions at our <a href="http://event.gigaom.com/structure?utm_source=cloud&amp;utm_medium=editorial&amp;utm_campaign=intext&amp;utm_term=520282+preventing-counterfeits-with-an-iphone-a-qr-code-and-digital-dna&amp;utm_content=dharrisstructure">Structure conference next month</a> in San Francisco, there’s an undeniable connections between cloud computing, big data and mobile technologies as it relates to capturing, storing and processing entirely new types of data. When literally anybody with a mobile phone and the right app can scan a code and send rich data up to the cloud, it opens up entirely new possibilities around both analytics and application architectures.</p>
<p><em>All images courtesy of Applied DNA Sciences.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=520282&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=181162"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=181162" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=520282+preventing-counterfeits-with-an-iphone-a-qr-code-and-digital-dna&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/01/12-tech-leaders-resolutions-for-2012/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=520282+preventing-counterfeits-with-an-iphone-a-qr-code-and-digital-dna&utm_content=dharrisstructure">12 tech leaders’ resolutions for 2012</a></li><li><a href="http://pro.gigaom.com/2011/11/dissecting-the-data-5-issues-for-our-digital-future/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=520282+preventing-counterfeits-with-an-iphone-a-qr-code-and-digital-dna&utm_content=dharrisstructure">Dissecting the data: 5 issues for our digital future</a></li><li><a href="http://pro.gigaom.com/2011/11/connected-world-the-consumer-technology-revolution/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=520282+preventing-counterfeits-with-an-iphone-a-qr-code-and-digital-dna&utm_content=dharrisstructure">Connected world: the consumer technology revolution</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/05/10/preventing-counterfeits-with-an-iphone-a-qr-code-and-digital-dna/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/05/digital-dna_-process1.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/05/digital-dna_-process1.jpg?w=150" medium="image">
			<media:title type="html">Abstract DNA</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/05/digital-dna_-process1.jpg?w=300" medium="image">
			<media:title type="html">Abstract DNA</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/05/signature_detect1.jpg" medium="image">
			<media:title type="html">signature_detect(1)</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/05/dd-process4.jpg" medium="image">
			<media:title type="html">dd process4</media:title>
		</media:content>
	</item>
		<item>
		<title>Straight outta Stanford, Bina wants to remake genome analysis</title>
		<link>http://gigaom.com/2012/04/30/straight-outta-stanford-bina-wants-to-remake-genome-analysis/</link>
		<comments>http://gigaom.com/2012/04/30/straight-outta-stanford-bina-wants-to-remake-genome-analysis/#comments</comments>
		<pubDate>Tue, 01 May 2012 00:45:44 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[appistry]]></category>
		<category><![CDATA[Apple]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[Bina Technologies]]></category>
		<category><![CDATA[DNAnexus]]></category>
		<category><![CDATA[Genomics]]></category>
		<category><![CDATA[high-performance computing]]></category>
		<category><![CDATA[humane genome]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[Science]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=515950</guid>
		<description><![CDATA[Bina Technologies emerged from stealth mode last week and is bringing an Apple-like business model to genomics. The company relies on its Bina Box to make genome analysis faster than ever before possible without the benefit of having a supercomputer and a research network on hand. <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=515950&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom2.files.wordpress.com/2012/04/dna-sculpture.jpg"><img  title="dna sculpture" src="http://gigaom2.files.wordpress.com/2012/04/dna-sculpture.jpg?w=300&#038;h=269" alt="" width="300" height="269" class="alignleft size-medium wp-image-516125" /></a>The advent of the $1,000 genome is bound to revolutionize researchers&#8217; understanding of human health, but ever-lower prices on DNA sequencing are only half the battle. Researchers <a href="http://gigaom.com/cloud/as-genomics-pushes-big-data-limits-cloud-could-save-the-day/">also need to analyze the raw data that comes off sequencing machines</a>, which can range between many gigabytes to terabytes and can cost well more than the sequencing itself. That&#8217;s why a collection of startups are trying to stake their claims as essential parts of the genomics ecosystem by ensuring that analysis doesn&#8217;t become the bottleneck that slows progress.</p>
<h2>Domain expertise, statistics and HPC, unite!</h2>
<p>The latest is <a href="http://www.binatechnologies.com/">Bina Technologies</a>, which just emerged from stealth mode last week and is bringing an Apple-like business model to genomics. The company, which grew out of a research project at Stanford University, relies on its Bina Box appliance to make genome analysis faster than typically possible <a href="http://gigaom.com/cloud/fighting-cancer-at-100-gigabits-per-second/">without the benefit of having a supercomputer and a research network on hand</a>.</p>
<p>According to Bina CEO Narges Bani Asadi, who co-founded the company while completing her Ph.D. at Stanford, the appliance came about as part of a mission to solve a disconnect among the stakeholders in cancer research. Improving the analysis of cancer data required input from medical researchers, statisticians and high-performance computing experts, &#8220;but people are not speaking even the same language,&#8221; she said. While they&#8217;re all headed in the same direction, their paths rarely converge to harness peak velocity.</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/04/bina_box_01.jpg"><img  title="bina_box_01" src="http://gigaom2.files.wordpress.com/2012/04/bina_box_01.jpg?w=300&#038;h=170" alt="" width="300" height="170" class="alignright size-medium wp-image-516123" /></a>Bani Asadi and her team solved that problem by developing a system that merged the three areas into one. With Bina, researchers can develop analysis pipelines that are optimized at both the algorithmic and silicon levels to run optimally across a mix of CPUs, GPUs and FPGAs, all of which are present within the purpose-built box. Applications are getting what they need in order to perform their best, and Bina says results can be processed 10 to 100 times faster (hours instead of days) than running jobs on the Amazon Web Services cloud, which has proven very popular for genomics workloads <a href="http://gigaom.com/cloud/amazon-gets-graphic-with-cloud-gpu-instances/">thanks to its supercomputer-like performance</a>.</p>
<p>That being said, a chart Bina uses to illustrate the performance difference compares the Bina Box to a single eight-core AWS instance rather than a cluster of those high-performance instances.</p>
<h2>The Apple analogy</h2>
<p>Bani Asadi answers the inevitable question of whether research centers will want to special appliances instead of using the cloud or generic servers by pointing to Apple. That company&#8217;s devices and computers can be a little more expensive and more difficult to tinker with than alternatives, but they&#8217;re also designed specifically with Apple&#8217;s operating system and applications in mind. It&#8217;s an analogy other companies, <a href="http://gigaom.com/cloud/ex-nasa-cto-builds-cloud-dream-team-launches-nebula/">such as cloud computing startup Nebula</a>, also use to justify their appliance-based businesses.</p>
<p>Not that Bina dismisses the cloud. If Bina&#8217;s software is the Mac OS to the Bina Box&#8217;s iMac, the Bina Cloud is the company&#8217;s iCloud. Once the box processes the raw sequencing data and compresses it into a smaller volume (up to 1,000 times smaller), the data is shipped to the Bina Cloud where it&#8217;s stored and can be easily accessed and shared. Actually, Bani Asadi said that&#8217;s where the most-innovative research likely will take place. While Bina&#8217;s appliance handles the necessary first steps of  genome analysis (e.g., determining how it&#8217;s unique), it&#8217;s the resulting data sets that are accessible by doctors, specialists and others to really make sense of it all.</p>
<p><a href="http://gigaom2.files.wordpress.com/2012/04/bina_process_01.jpg"><img  title="bina_process_01" src="http://gigaom2.files.wordpress.com/2012/04/bina_process_01.jpg?w=708" alt=""   class="aligncenter size-full wp-image-516124" /></a></p>
<p>Presumably, Bina is referring to companies such as DNAnexus when it compares its solution to entirely cloud-based approaches. DNAnexus is another Silicon Valley startup trying to democratize genome analysis, <a href="http://gigaom.com/cloud/dnanexus-cloudant-biotech-deals/">relying on the processing power and centralized nature of the cloud</a> to serve as a platform for analyzing and collaborating on DNA data. Another startup, St. Louis-based Appistry, has taken a somewhat different approach, <a href="http://gigaom.com/2012/03/22/appistry-structure-data-2012/">building its own high-powered cloud service</a> and developing its own algorithms specially designed for genome analysis.</p>
<h2>In the end, it&#8217;s all about the data</h2>
<p>Regardless of which approach a researcher takes to solving the problem of sequenced genome data (they all have unique benefits), the underlying trend driving innovation is the deluge of genome data itself. Bani Asadi said the biggest difference now compared with past efforts to analyze health data is that we have so much available. There are 30,000 fully sequenced genomes available right now, and some predict there will be 10 million in five years.</p>
<p>That means researchers can study DNA at a much more-granular level the previously possible, Bani Asadi said, and they can analyze findings across huge data sets to identify previously undetectable patterns. Especially with cancer, she said, each case is relatively unique, shaped by many conditions and factors. If we&#8217;re going to make significant progress on treating it, we&#8217;ll need to know exactly what&#8217;s going on in any given case and how similar cases have played out. The more data, the more accurate the diagnosis and treatment.</p>
<p><em>Feature image <a href="http://www.geograph.org.uk/photo/2848513">courtesy of Keith Edkins</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=515950&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=725587"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=725587" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=515950+straight-outta-stanford-bina-wants-to-remake-genome-analysis&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/05/the-importance-of-putting-the-u-and-i-in-visualization/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=515950+straight-outta-stanford-bina-wants-to-remake-genome-analysis&utm_content=dharrisstructure">The importance of putting the U and I in visualization</a></li><li><a href="http://pro.gigaom.com/2012/01/infrastructure-q4-big-data-gets-bigger-and-saas-startups-shine/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=515950+straight-outta-stanford-bina-wants-to-remake-genome-analysis&utm_content=dharrisstructure">Infrastructure Q4: Big data gets bigger and SaaS startups shine</a></li><li><a href="http://pro.gigaom.com/2011/11/dissecting-the-data-5-issues-for-our-digital-future/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=515950+straight-outta-stanford-bina-wants-to-remake-genome-analysis&utm_content=dharrisstructure">Dissecting the data: 5 issues for our digital future</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/04/30/straight-outta-stanford-bina-wants-to-remake-genome-analysis/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/04/dna-sculpture.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/04/dna-sculpture.jpg?w=150" medium="image">
			<media:title type="html">dna sculpture</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/04/dna-sculpture.jpg?w=300" medium="image">
			<media:title type="html">dna sculpture</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/04/bina_box_01.jpg?w=300" medium="image">
			<media:title type="html">bina_box_01</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/04/bina_process_01.jpg" medium="image">
			<media:title type="html">bina_process_01</media:title>
		</media:content>
	</item>
		<item>
		<title>As genomics data approaches exascale, cloud could save the day</title>
		<link>http://gigaom.com/2012/01/23/as-genomics-pushes-big-data-limits-cloud-could-save-the-day/</link>
		<comments>http://gigaom.com/2012/01/23/as-genomics-pushes-big-data-limits-cloud-could-save-the-day/#comments</comments>
		<pubDate>Mon, 23 Jan 2012 23:51:51 +0000</pubDate>
		<dc:creator>Derrick Harris</dc:creator>
				<category><![CDATA[analytics]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[genetics]]></category>
		<category><![CDATA[Genome]]></category>
		<category><![CDATA[Genomics]]></category>
		<category><![CDATA[Human genome]]></category>
		<category><![CDATA[life sciences]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=474677</guid>
		<description><![CDATA[Now that human genomes will soon take only one day and $1,000 to sequence, life is about to get a lot easier for medical researchers, but a lot more difficult for companies trying to make a buck selling them tools to store and analyze genomic data.<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=474677&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom2.files.wordpress.com/2012/01/dna.jpg"><img  title="dna" src="http://gigaom2.files.wordpress.com/2012/01/dna-e1327359998888.jpg?w=300&#038;h=200" alt="" width="300" height="200" class="alignleft size-medium wp-image-474778" /></a>Life is about to get a lot easier for medical researchers, but a lot more difficult for companies trying to make a buck selling them tools to store and analyze genomic data. When the <a href="http://www.ornl.gov/sci/techresources/Human_Genome/project/about.shtml">Human Genome Project</a> successfully concluded in 2003, it had taken 13 years to complete its goal of fully sequencing the human genome. Earlier this month, two firms &#8212; <a href="javascript:openWindow('http://www.prnewswire.com/news-releases/life-technologies-introduces-the-benchtop-ion-proton-sequencer-designed-to-decode-a-human-genome-in-one-day-for-1000-136990438.html');">Life Technologies</a> and <a href="http://investor.illumina.com/phoenix.zhtml?c=121127&amp;p=irol-newsArticle&amp;ID=1646757&amp;highlight=">Illumina</a>&#8211; announced instruments that can do the same thing in a day, one for only $1,000. That&#8217;s likely going to mean <em>a lot</em> of data.</p>
<h2>1TB times 1 million equals &#8230;</h2>
<p>How much data is anybody&#8217;s guess, but the exponential increases in productivity suggest it will be in the exabyte range within a few years. A fully sequenced human genome results in about 100GB of raw data, although <a href="http://dnanexus.com">DNAnexus</a> Founder and CEO Andreas Sundquist told me that volume increases to about 1TB by the time the genome has been analyzed. He also says we&#8217;re on pace to have 1 million genomes sequenced within the next two years. If that holds true, there will be approximately 1 million terabytes (or 1,000 petabytes, or 1 exabyte) of genome data floating around by 2014.</p>
<p>A few years ago, Complete Genomics <a href="http://singularityhub.com/2010/01/26/exclusive-complete-genomics-to-sequence-1-million-genomes-interview-with-ceo/">publicly announced its plan to sequence a million genomes by 2014</a>, but it has been woefully behind schedule to this point. It was hoping to do 50,000 genomes in 2011, but <a href="http://www.completegenomics.com/news-events/press-releases/Complete-Genomics-Announces-Shipment-of-Approximately-3000-Genomes-in-2011-136913193.html">finished the year at only 3,000</a>.</p>
<div id="attachment_474776" class="wp-caption alignright" style="width: 310px"><a href="http://gigaom2.files.wordpress.com/2012/01/life-sequencer-copy.jpg"><img  title="LIFE TECHNOLOGIES CORPORATION ION PROTON SEQUENCER" src="http://gigaom2.files.wordpress.com/2012/01/life-sequencer-copy.jpg?w=300&#038;h=188" alt="" width="300" height="188" class="size-medium wp-image-474776" /></a><p class="wp-caption-text">Life&#39;s Benchtop Ion Proton Sequencer</p></div>
<p>However, sequencing instruments are evolving in a manner similar to mainstream computers, which is to say they&#8217;re always getting faster and more affordable. Whereas sequencers used to cost more than half a million dollars and take up a room, Life&#8217;s genome-in-a-day instrument, the one that claims a $1,000-per-genome price point, sits on a desk and will cost only $149,000 when it&#8217;s available later this year. Upgrading to Illumina&#8217;s new instrument from the previous model costs only $50,000.</p>
<p>The fast rate of improvement comes from genomics&#8217; own version of Moore&#8217;s Law, Sundquist said: data throughput and cost both improve by tenfold every 18 months. When Life rival Illumina <a href="http://www.bloomberg.com/apps/news?pid=newsarchive&amp;sid=aAgJB2R0Wcqs">set a world record in February 2008</a>, it took &#8220;less than four weeks at a cost of about $100,000.&#8221; At this rate, we&#8217;ll have $100 genome sequencing by 2014.</p>
<p>Sundquist added that medical systems have tens of thousands of patients queued up for sequencing, many of which they might start doing now that it can be done so fast and at such a low cost.</p>
<h2>Hidden costs: &#8216;The quest for the $1,000 genome interpretation&#8217;</h2>
<p>Where things get hairy for IT vendors is figuring out how to make it affordable to store, process and analyze all that data &#8212; something Sundquist calls the quest for the $1,000 genome interpretation. It&#8217;s still not an inexpensive proposition to buy and maintain a system capable of storing and processing potentially petabytes of data. And if doctors or researchers want to collaborate with colleagues, their facilities bandwidth likely won&#8217;t cut it for sending even the raw data for a single genome. That&#8217;s why many research institutions are <a href="http://gigaom.com/cloud/fighting-cancer-at-100-gigabits-per-second/">connecting to high-speed research networks</a> designed solely to move massive scientific data sets.</p>
<p>As Forbes&#8217; Matthew Herper <a href="http://www.forbes.com/sites/matthewherper/2011/01/06/why-you-cant-have-your-1000-genome/">opined early last year</a>, even though research costs for genomes will soon cost only $1,000, it costs a lot more to employ people and pay for software capable of analyzing it. Because research genomes aren&#8217;t accurate enough for medical use, they often must be sequenced multiple times. Herper&#8217;s ultimate analysis:</p>
<blockquote><p>I’d think if we’re talking about actual medical use, $10,000 is a more accurate number. Certainly, it is not going to drop below the $2,000 level for a magnetic resonance imaging scan. And once the technology is in use, I think it is possible that the costs will go back up.</p></blockquote>
<p>So, even if genome sequencing itself becomes less expensive, hospitals and patients will both be paying well more than $1,000 for the procedure. Presently, <a href="http://www.bio-itworld.com/news/02/07/11/10000-dollar-genome-Complete-picture-2011.html">$10,000 <em>is</em> about the going rate</a> from Complete Genomics to sequence, analyze and deliver research results to an individual, although the costs certainly are subject to change if hospitals start performing sequencing workloads themselves.</p>
<h2>Cloud computing to the rescue?</h2>
<p>Sundquist thinks cloud computing is the answer. His company, DNAnexus, provides a cloud-based platform for storing and analyzing genomics data, <a href="http://gigaom.com/cloud/dnanexus-cloudant-biotech-deals/">something we&#8217;ve covered before</a>. &#8220;A 100-megabit connection could more than keep up with about a dozen of these machines,&#8221; he said, and once the data is in DNAnexus&#8217;s cloud platform, institutions no longer have to worry about keeping up with exploding data volumes, sending terabytes of data across the Internet or paying software licenses. Access is centralized and everything takes place on DNAnexus&#8217;s virtual infrastructure.</p>
<p>Additionally, cloud computing is ideal for spiky use cases, as is generally the case with genome sequencing.  A <a href="http://gigaom.com/2008/09/07/the-10-laws-of-cloudonomics/">general rule of &#8220;cloudonomics&#8221;</a> is that the cloud costs more on a per-unit basis, but generally will cost less over time unless it&#8217;s being used for a steady workload flow better suited to an on-premise system.</p>
<p>Whether it&#8217;s DNAnexus or some other cloud service, Sundquist&#8217;s reasoning is sound. As prices for gene sequencing continue to fall, doctors should be increasingly likely to do it, but they&#8217;ll be limited by the infrastructure in place to support them. Unless the costs of doing this on-premise come down significantly, the cloud might be the only place where storing and analyzing potentially petabytes per hospital isn&#8217;t such a daunting undertaking.</p>
<p><em>Feature image courtesy of <a href="http://www.flickr.com/photos/blueace/312036915/">Flickr user Robert Gaal</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=474677&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=867844"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=867844" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=474677+as-genomics-pushes-big-data-limits-cloud-could-save-the-day&utm_content=dharrisstructure">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/report/how-big-data-analytics-drives-competitive-advantage/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=474677+as-genomics-pushes-big-data-limits-cloud-could-save-the-day&utm_content=dharrisstructure">How big data analytics drives competitive advantage</a></li><li><a href="http://pro.gigaom.com/report/the-new-economics-of-enterprise-data-warehousing/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=474677+as-genomics-pushes-big-data-limits-cloud-could-save-the-day&utm_content=dharrisstructure">How data warehousing is now a cost-effective solution for businesses</a></li><li><a href="http://pro.gigaom.com/report/how-to-manage-big-data-without-breaking-the-bank/?utm_source=cloud&utm_medium=editorial&utm_campaign=auto3&utm_term=474677+as-genomics-pushes-big-data-limits-cloud-could-save-the-day&utm_content=dharrisstructure">How to manage big data without breaking the bank</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2012/01/23/as-genomics-pushes-big-data-limits-cloud-could-save-the-day/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2012/01/dna-e1327359998888.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2012/01/dna-e1327359998888.jpg?w=150" medium="image">
			<media:title type="html">dna</media:title>
		</media:content>

		<media:content url="http://0.gravatar.com/avatar/9e48ffa0913f65c577727457dd63023f?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">dharrisstructure</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/01/dna-e1327359998888.jpg?w=300" medium="image">
			<media:title type="html">dna</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2012/01/life-sequencer-copy.jpg?w=300" medium="image">
			<media:title type="html">LIFE TECHNOLOGIES CORPORATION ION PROTON SEQUENCER</media:title>
		</media:content>
	</item>
	</channel>
</rss>
