<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Can we please stop saying “unstructured” data?</title>
	<atom:link href="http://gigaom.com/2013/03/17/can-we-please-stop-saying-unstructured-data/feed/" rel="self" type="application/rss+xml" />
	<link>http://gigaom.com/2013/03/17/can-we-please-stop-saying-unstructured-data/</link>
	<description></description>
	<lastBuildDate>Fri, 24 May 2013 07:04:12 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Grant</title>
		<link>http://gigaom.com/2013/03/17/can-we-please-stop-saying-unstructured-data/#comment-1321292</link>
		<dc:creator><![CDATA[Grant]]></dc:creator>
		<pubDate>Tue, 19 Mar 2013 17:41:43 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=621428#comment-1321292</guid>
		<description><![CDATA[Mark Harwood, 

(for some reason, I can&#039;t reply inline to your last reply)

Excellent points.  I think you are starting to already see a new wave of education happening around it with the rise of machine learning and search via dominant companies like Google, FB, etc. as well as the big data movement such that people are being forced to think in probabilistic terms more and more just to stay afloat in the sea of data.]]></description>
		<content:encoded><![CDATA[<p>Mark Harwood, </p>
<p>(for some reason, I can&#8217;t reply inline to your last reply)</p>
<p>Excellent points.  I think you are starting to already see a new wave of education happening around it with the rise of machine learning and search via dominant companies like Google, FB, etc. as well as the big data movement such that people are being forced to think in probabilistic terms more and more just to stay afloat in the sea of data.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jonathan</title>
		<link>http://gigaom.com/2013/03/17/can-we-please-stop-saying-unstructured-data/#comment-1321255</link>
		<dc:creator><![CDATA[Jonathan]]></dc:creator>
		<pubDate>Tue, 19 Mar 2013 15:38:42 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=621428#comment-1321255</guid>
		<description><![CDATA[I agree. The term &quot;Unstructured&quot; is inappropriate when you think about the data that is likely to be stored. However, when talking about the worst case scenarios of what *could* be stored the term fits exactly. In those fields it is overwhelmingly more often than not that the data needs to be handled as if there were no structure at all. How many if these &quot;Unstructured&quot; fields are filtered by some sort of imposed NLP parsed data type after all?  So we use that term to describe how we need to handle the field, not what the field may contain.   However, when our free text data eventually becomes forced to be human readable then we will need a new term.  I don&#039;t mind &quot;Rich Data&quot; but the problem there is that there are many levels of more ir less structured data that could be considered rich. Today, instead of using the term &quot;Unstructred&quot; I prefer calling it &quot;Unrestricted Data&quot;. That describes both the way we should handle it and the potential content.]]></description>
		<content:encoded><![CDATA[<p>I agree. The term &#8220;Unstructured&#8221; is inappropriate when you think about the data that is likely to be stored. However, when talking about the worst case scenarios of what *could* be stored the term fits exactly. In those fields it is overwhelmingly more often than not that the data needs to be handled as if there were no structure at all. How many if these &#8220;Unstructured&#8221; fields are filtered by some sort of imposed NLP parsed data type after all?  So we use that term to describe how we need to handle the field, not what the field may contain.   However, when our free text data eventually becomes forced to be human readable then we will need a new term.  I don&#8217;t mind &#8220;Rich Data&#8221; but the problem there is that there are many levels of more ir less structured data that could be considered rich. Today, instead of using the term &#8220;Unstructred&#8221; I prefer calling it &#8220;Unrestricted Data&#8221;. That describes both the way we should handle it and the potential content.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mark Harwood</title>
		<link>http://gigaom.com/2013/03/17/can-we-please-stop-saying-unstructured-data/#comment-1321209</link>
		<dc:creator><![CDATA[Mark Harwood]]></dc:creator>
		<pubDate>Tue, 19 Mar 2013 11:54:21 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=621428#comment-1321209</guid>
		<description><![CDATA[As we move away from the trusty database hammer and incorporate more of the unstructured with things like faceted search we create a fundamental usability problem - how do we satisfactorily explain that the beautifully structured results (pie charts, timelines, heatmaps) are all founded on some pretty fuzzy matching underneath (&quot;more like this&quot;, fallible synonyms etc)? 
Database users are used to formal set logic (we either count something or we don&#039;t) while search is more about fuzzy sets (things match to a degree).
Not seen a good answer to this question yet - or even general acknowledgement that it is a problem.]]></description>
		<content:encoded><![CDATA[<p>As we move away from the trusty database hammer and incorporate more of the unstructured with things like faceted search we create a fundamental usability problem &#8211; how do we satisfactorily explain that the beautifully structured results (pie charts, timelines, heatmaps) are all founded on some pretty fuzzy matching underneath (&#8220;more like this&#8221;, fallible synonyms etc)?<br />
Database users are used to formal set logic (we either count something or we don&#8217;t) while search is more about fuzzy sets (things match to a degree).<br />
Not seen a good answer to this question yet &#8211; or even general acknowledgement that it is a problem.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Grant</title>
		<link>http://gigaom.com/2013/03/17/can-we-please-stop-saying-unstructured-data/#comment-1321199</link>
		<dc:creator><![CDATA[Grant]]></dc:creator>
		<pubDate>Tue, 19 Mar 2013 11:05:39 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=621428#comment-1321199</guid>
		<description><![CDATA[I sometimes use multi-structured as well, but it doesn&#039;t exactly roll off the tongue.]]></description>
		<content:encoded><![CDATA[<p>I sometimes use multi-structured as well, but it doesn&#8217;t exactly roll off the tongue.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Grant</title>
		<link>http://gigaom.com/2013/03/17/can-we-please-stop-saying-unstructured-data/#comment-1321198</link>
		<dc:creator><![CDATA[Grant]]></dc:creator>
		<pubDate>Tue, 19 Mar 2013 11:03:43 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=621428#comment-1321198</guid>
		<description><![CDATA[Hi Mark,

I like the &quot;greater freedom of expression&quot; viewpoint and fully appreciate your view.  In my view the word unstructured&#039;s meaning depends on who it is using it.  For people who, like you, embrace it, it probably is the simplest way (although rich data is pretty nice, IMO, too).  Often times, though, I find people use it to mean all the other stuff that we don&#039;t know how to handle very well using our big ol&#039; database hammer.]]></description>
		<content:encoded><![CDATA[<p>Hi Mark,</p>
<p>I like the &#8220;greater freedom of expression&#8221; viewpoint and fully appreciate your view.  In my view the word unstructured&#8217;s meaning depends on who it is using it.  For people who, like you, embrace it, it probably is the simplest way (although rich data is pretty nice, IMO, too).  Often times, though, I find people use it to mean all the other stuff that we don&#8217;t know how to handle very well using our big ol&#8217; database hammer.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Craig</title>
		<link>http://gigaom.com/2013/03/17/can-we-please-stop-saying-unstructured-data/#comment-1321090</link>
		<dc:creator><![CDATA[Craig]]></dc:creator>
		<pubDate>Tue, 19 Mar 2013 00:07:05 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=621428#comment-1321090</guid>
		<description><![CDATA[I agree with the headline, too many people do indeed refer to data in XML, JSON and the like as unstructured, but they are just as structured as the relational tuple model.]]></description>
		<content:encoded><![CDATA[<p>I agree with the headline, too many people do indeed refer to data in XML, JSON and the like as unstructured, but they are just as structured as the relational tuple model.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: ksankar</title>
		<link>http://gigaom.com/2013/03/17/can-we-please-stop-saying-unstructured-data/#comment-1320952</link>
		<dc:creator><![CDATA[ksankar]]></dc:creator>
		<pubDate>Mon, 18 Mar 2013 16:45:44 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=621428#comment-1320952</guid>
		<description><![CDATA[Multi-structured is a term that makes a lot of sense because there are many interpretations depending on the context. I am almost sure we will see &#039;multi-structure&#039; will replace the &#039;structure&#039; conference in a few years ;o)]]></description>
		<content:encoded><![CDATA[<p>Multi-structured is a term that makes a lot of sense because there are many interpretations depending on the context. I am almost sure we will see &#8216;multi-structure&#8217; will replace the &#8216;structure&#8217; conference in a few years ;o)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: JHLundin</title>
		<link>http://gigaom.com/2013/03/17/can-we-please-stop-saying-unstructured-data/#comment-1320924</link>
		<dc:creator><![CDATA[JHLundin]]></dc:creator>
		<pubDate>Mon, 18 Mar 2013 15:37:40 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=621428#comment-1320924</guid>
		<description><![CDATA[...so, what cha gunna call it? ...and how are you gunna to define it?]]></description>
		<content:encoded><![CDATA[<p>&#8230;so, what cha gunna call it? &#8230;and how are you gunna to define it?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: stevewooledge</title>
		<link>http://gigaom.com/2013/03/17/can-we-please-stop-saying-unstructured-data/#comment-1320898</link>
		<dc:creator><![CDATA[stevewooledge]]></dc:creator>
		<pubDate>Mon, 18 Mar 2013 14:04:51 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=621428#comment-1320898</guid>
		<description><![CDATA[Some folks are using the term &quot;multi-structured&quot;. Most, if not all, data types have a structure or &quot;format&quot; of some kind. Being able to process/analyze them in as close to native format as possible has benefits. 
Some thoughts: 
http://www.asterdata.com/blog/2011/07/what-big-data-can-learn-from-the-pc-era-the-need-for-a-multi-structured-big-data-platform/]]></description>
		<content:encoded><![CDATA[<p>Some folks are using the term &#8220;multi-structured&#8221;. Most, if not all, data types have a structure or &#8220;format&#8221; of some kind. Being able to process/analyze them in as close to native format as possible has benefits.<br />
Some thoughts:<br />
<a href="http://www.asterdata.com/blog/2011/07/what-big-data-can-learn-from-the-pc-era-the-need-for-a-multi-structured-big-data-platform/" rel="nofollow">http://www.asterdata.com/blog/2011/07/what-big-data-can-learn-from-the-pc-era-the-need-for-a-multi-structured-big-data-platform/</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Joe</title>
		<link>http://gigaom.com/2013/03/17/can-we-please-stop-saying-unstructured-data/#comment-1320892</link>
		<dc:creator><![CDATA[Joe]]></dc:creator>
		<pubDate>Mon, 18 Mar 2013 13:52:53 +0000</pubDate>
		<guid isPermaLink="false">http://gigaom.com/?p=621428#comment-1320892</guid>
		<description><![CDATA[Structure is semantics, and all data has semantics in a given context, thus all data is structured.

If you don&#039;t know what your data means and how to interpret it, game over.  The terms are nice for marketing, but little more.]]></description>
		<content:encoded><![CDATA[<p>Structure is semantics, and all data has semantics in a given context, thus all data is structured.</p>
<p>If you don&#8217;t know what your data means and how to interpret it, game over.  The terms are nice for marketing, but little more.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
