<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>GigaOM &#187; Apache Lucene</title>
	<atom:link href="http://gigaom.com/tag/apache-lucene/feed/" rel="self" type="application/rss+xml" />
	<link>http://gigaom.com</link>
	<description></description>
	<lastBuildDate>Thu, 23 May 2013 02:27:02 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='gigaom.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://0.gravatar.com/blavatar/0db8f6557d022075dbbf010c54d46d93?s=96&#038;d=http%3A%2F%2Fs2.wp.com%2Fi%2Fbuttonw-com.png</url>
		<title>GigaOM &#187; Apache Lucene</title>
		<link>http://gigaom.com</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://gigaom.com/osd.xml" title="GigaOM" />
	<atom:link rel='hub' href='http://gigaom.com/?pushpress=hub'/>
		<item>
		<title>Emerging trends in the non-relational database market</title>
		<link>http://pro.gigaom.com/2012/09/emerging-trends-in-the-non-relational-database-market/</link>
		<comments>http://pro.gigaom.com/2012/09/emerging-trends-in-the-non-relational-database-market/#comments</comments>
		<pubDate>Thu, 06 Sep 2012 20:15:48 +0000</pubDate>
		<dc:creator>augusttechgroup</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Amazon]]></category>
		<category><![CDATA[apache]]></category>
		<category><![CDATA[Apache Lucene]]></category>
		<category><![CDATA[bigtable]]></category>
		<category><![CDATA[Cassandra]]></category>
		<category><![CDATA[CQL]]></category>
		<category><![CDATA[Cypher]]></category>
		<category><![CDATA[DataStax]]></category>
		<category><![CDATA[Datastax Enterprise]]></category>
		<category><![CDATA[Datomic]]></category>
		<category><![CDATA[DSE]]></category>
		<category><![CDATA[Dynamo]]></category>
		<category><![CDATA[DynamoDB]]></category>
		<category><![CDATA[EC2]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Google BigTable]]></category>
		<category><![CDATA[Hadoop]]></category>
		<category><![CDATA[Heroku]]></category>
		<category><![CDATA[iaas]]></category>
		<category><![CDATA[infrastructure as a service]]></category>
		<category><![CDATA[Neo Technologies]]></category>
		<category><![CDATA[Neo4j]]></category>
		<category><![CDATA[non-relational databases]]></category>
		<category><![CDATA[NoSQL]]></category>
		<category><![CDATA[PaaS]]></category>
		<category><![CDATA[Platform as a Service]]></category>
		<category><![CDATA[Rackspace]]></category>
		<category><![CDATA[relational-databases]]></category>
		<category><![CDATA[relevance]]></category>
		<category><![CDATA[Riak]]></category>
		<category><![CDATA[SpringSource]]></category>
		<category><![CDATA[SpringSource CloudFoundry]]></category>
		<category><![CDATA[Voldemort]]></category>

		<guid isPermaLink="false">http://pro.gigaom.com/?p=122171</guid>
		<description><![CDATA[Observers of database technology should look closely at the non-relational database market to see where the most interesting growth lies in the world of applied information storage and retrieval. <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=560233&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>The market for non-relational databases is a crowded one. Technology leaders looking to extract competitive advantages from their data must now familiarize themselves with this market. This report examines the current marketplace, providing a focused view of three products from across the current non-relational spectrum: Cassandra, Neo4J, and Datomic. </p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=560233&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=406170"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=406170" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=560233+emerging-trends-in-the-non-relational-database-market&utm_content=augusttechgroup">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/01/how-amazons-dynamodb-is-rattling-the-big-data-and-cloud-markets/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=560233+emerging-trends-in-the-non-relational-database-market&utm_content=augusttechgroup">Amazon’s DynamoDB: rattling the cloud market</a></li><li><a href="http://pro.gigaom.com/2011/04/infrastructure-q1-iaas-comes-down-to-earth-big-data-takes-flight/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=560233+emerging-trends-in-the-non-relational-database-market&utm_content=augusttechgroup">Infrastructure Q1: IaaS Comes Down to Earth; Big Data Takes Flight</a></li><li><a href="http://pro.gigaom.com/2012/06/cloud-computing-infrastructure-2012-and-beyond/?utm_source=pro&utm_medium=editorial&utm_campaign=auto3&utm_term=560233+emerging-trends-in-the-non-relational-database-market&utm_content=augusttechgroup">Cloud computing infrastructure: 2012 and beyond</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://pro.gigaom.com/2012/09/emerging-trends-in-the-non-relational-database-market/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:thumbnail url="https://gigaom-pro-files.s3.amazonaws.com/files/2012/09/market1.jpg?w=150" />
		<media:content url="https://gigaom-pro-files.s3.amazonaws.com/files/2012/09/market1.jpg?w=150" medium="image">
			<media:title type="html">market1</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/7ac26c281b592af10443938f2a58a555?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">augusttechgroup</media:title>
		</media:content>
	</item>
		<item>
		<title>The Case for Open Source Search in the Enterprise</title>
		<link>http://gigaom.com/2010/11/01/the-case-for-open-source-search-in-the-enterprise-2/</link>
		<comments>http://gigaom.com/2010/11/01/the-case-for-open-source-search-in-the-enterprise-2/#comments</comments>
		<pubDate>Mon, 01 Nov 2010 17:49:52 +0000</pubDate>
		<dc:creator>Haydn Shaughnessey</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Apache Lucene]]></category>
		<category><![CDATA[open source]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=242525</guid>
		<description><![CDATA[More and more, open-source search looks to be an alternative for organizing unstructured data in the enterprise. The open source community believes it has advantages in scalability, flexibility and speed over its proprietary counterparts, and is better adapted to the ever-changing needs of today's enterprise. <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=242525&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://gigaom2.files.wordpress.com/2010/10/opensource.jpg"><img title="opensource" src="http://gigaom2.files.wordpress.com/2010/10/opensource.jpg?w=300&#038;h=199" alt="" width="300" height="199" class="alignleft size-medium wp-image-238091"></a>With enterprise data predicted to grow 650 percent between 2009 and 2014, the pressing question among IT departments is how to manage this data and make it easily searchable. While Google’s Search Appliance and offerings from search market leader Autonomy are options, these proprietary models often mean an increase in cost, either with the number of seats on a search software license or with the volume of searches in a given year, and some enterprises are seeking alternatives.</p>
<p>Otis Gospodnetic, co-author of Lucene implementations primer “Lucene in Action,” <a href="http://www.jroller.com/otis/?page=1">says</a>, “You would be surprised to know how many services and applications use Lucene. For example, did you know that Apple uses Lucene in Safari and iTunes?” He adds that Lucene powers Spotlight, the search functionality in OSX, and browser history in Safari. CNET, Netflix, Cisco, the Guardian, LinkedIn and MTV use it as well.</p>
<p>What’s driving a wider adoption of open source? The <a href="http://lucene.apache.org/">open source community</a> promotes the increased scalability, flexibility and speed over enterprise-specific behemoths like Autonomy, Microsoft and Google. From a company’s perspective, Enterprise 2.0 and the socialization of knowledge create both unstructured data and a demand for knowledge as content.</p>
<p>Other factors include:</p>
<p><strong>Enterprise 2.0. </strong>Enterprises face a growing array of data types. On top of relational databases, companies now generate data through wikis, legacy bulletin boards, internal online communities, email, customer interaction points, PDFs, video, images and Microsoft Office files. Estimates of these volumes vary, but a ratio of 80 to 20 unstructured-to-structured data in the enterprise <a href="http://www.busmanagement.com/article/Managing-the-Data-Explosion/">seems to be sticking</a>. An inability to organize this unstructured data and simultaneously access different structured data files is a problem for business intelligence, adaptability, responsiveness and compliance.</p>
<p><strong>People and video.</strong><strong> </strong>Cisco’s <a href="http://www.cisco.com/en/US/products/ps10701/">Pulse</a> project, which uses Lucene/Solar as a platform, is a good example of an enterprise search application made possible by open source. It allows employees to find one another based on their current interests, even down to their latest corporate video appearance. Pulse tags content and media across enterprise networks and makes them searchable in real-time. The benefit of Lucence/Solr as a platform for the project, according to Cisco, is its scalability and speed.</p>
<p><strong>The cloud.</strong><strong> </strong>Cloud computing and SaaS will present new challenges for data discovery and management as enterprises migrate data into cloud environments and expect immediate access to their assets through search.</p>
<p>There are barriers to the adoption of open source. Many enterprises have poor search and discovery platforms that don’t index and return data well, often because of poor index or topology management. Many have platforms with price plans that were designed when data access rights belonged to small teams, as opposed to the larger open source community.</p>
<p>However, the ability for customers to open up the search box and modify it for their own purposes and to include advanced features like filtering at no cost should see open-source search emerge as a market force in the enterprise, the cloud, SaaS and in e-commerce. Ultimately, the biggest driver may prove to be demand on the part of employees to make the enterprise environment look and work like the consumer-driven web.</p>
<p>Read the full post <a href="http://pro.gigaom.com/2010/10/the-case-for-open-source-search-in-the-enterprise/?utm_source=cloud&amp;utm_medium=editorial&amp;utm_term=242525+the-case-for-open-source-search-in-the-enterprise-2&amp;utm_content=shigginbotham&amp;utm_campaign=intext">here</a>.</p>
<p><em>Image source: <a href="http://www.flickr.com/photos/95572727@N00/2049233526/">Flickr user stuckincustoms</a></em></p>
<ul><li><a href="http://pro.gigaom.com/2010/10/is-the-future-of-enterprise-completely-open-source/?utm_source=cloud&amp;utm_medium=editorial&amp;utm_term=242525+the-case-for-open-source-search-in-the-enterprise-2&amp;utm_content=shigginbotham&amp;utm_campaign=intext">Is the Future of Enterprise Completely Open Source?</a></li>
<li><a href="http://pro.gigaom.com/2010/07/report-an-open-source-smart-grid-primer/?utm_source=cloud&amp;utm_medium=editorial&amp;utm_term=242525+the-case-for-open-source-search-in-the-enterprise-2&amp;utm_content=shigginbotham&amp;utm_campaign=intext">An Open Source Smart Grid Primer</a></li>
<li><a href="http://pro.gigaom.com/2009/08/report-the-future-of-data-center-storage/?utm_source=cloud&amp;utm_medium=editorial&amp;utm_term=242525+the-case-for-open-source-search-in-the-enterprise-2&amp;utm_content=shigginbotham&amp;utm_campaign=intext">The Future of Data Center Storage</a></li>
</ul>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=242525&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=557950"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=557950" /></a></p>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2010/11/01/the-case-for-open-source-search-in-the-enterprise-2/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:thumbnail url="http://gigaom2.files.wordpress.com/2010/11/opensource.jpg?w=150" />
		<media:content url="http://gigaom2.files.wordpress.com/2010/11/opensource.jpg?w=150" medium="image">
			<media:title type="html">opensource</media:title>
		</media:content>

		<media:content url="http://1.gravatar.com/avatar/aee37121e18bf76bb9fee4494bab237a?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">shigginbotham</media:title>
		</media:content>

		<media:content url="http://gigaom2.files.wordpress.com/2010/10/opensource.jpg?w=300" medium="image">
			<media:title type="html">opensource</media:title>
		</media:content>
	</item>
		<item>
		<title>Will Yahoo Use Xoopit&#039;s CloudQuery to Help Usher in the Real-time Web?</title>
		<link>http://gigaom.com/2009/07/22/will-yahoo-use-xoopits-cloudquery-to-help-usher-in-the-real-time-web/</link>
		<comments>http://gigaom.com/2009/07/22/will-yahoo-use-xoopits-cloudquery-to-help-usher-in-the-real-time-web/#comments</comments>
		<pubDate>Thu, 23 Jul 2009 00:52:37 +0000</pubDate>
		<dc:creator>Geva Perry</dc:creator>
				<category><![CDATA[Web]]></category>
		<category><![CDATA[Apache Lucene]]></category>
		<category><![CDATA[Apache Solr]]></category>
		<category><![CDATA[CloudQuery]]></category>
		<category><![CDATA[Dave Rosenberg]]></category>
		<category><![CDATA[real-time web]]></category>
		<category><![CDATA[Xoopit]]></category>
		<category><![CDATA[Yahoo]]></category>

		<guid isPermaLink="false">http://gigaom.com/?p=60141</guid>
		<description><![CDATA[While Xoopit&#8217;s flagship email product is being touted as the reason why Yahoo agreed to buy the company, I think the underlying technology used by Xoopit is far more interesting. Xoopit has even exposed this infrastructure technology as an on-demand service, which it&#8217;s dubbed CloudQuery, with [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=60141&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><img src="http:///2009/07/xoopit_logo_c1.gif" alt="xoopit_logo_c1" title="xoopit_logo_c1" width="147" height="34"  class=" alignleft" />While Xoopit&#8217;s flagship email product is being touted as the reason why <a href="http://gigaom.com/2009/07/22/yahoo-buying-xoopit-for-20m/">Yahoo agreed to buy the company</a>, I think the underlying technology used by Xoopit is far more interesting. Xoopit has even exposed this infrastructure technology as an on-demand service, which it&#8217;s dubbed <a href="http://developer.xoopit.com/">CloudQuery</a>, with the tagline &#8220;Search as a Service.&#8221; But I suspect that, once the acquisition closes, CloudQuery will get buried deep in the bowels of Yahoo.<span id="more-60141"></span></p>
<p>Essentially Xoopit&#8217;s CloudQuery allows you to search-enable any application. It combines the functionality of full text search with more structured database-like queries, which have the advantage of being aware of the underlying data structure of the application and thus provide more accurate search results and in near real-time. It is very similar to the open-source project <a href="http://lucene.apache.org/solr/">Apache Solr</a>; both are based on the popular <a href="http://lucene.apache.org/java/docs/">Apache Lucene</a>.</p>
<p>Suppose I start a dating site that I want to equip with a sophisticated search engine. I have certain well-defined data fields, which I want to be searched as such (and not with a Google-style free text search); in other words I want people to be able to search for &#8220;gender is female,&#8221; &#8220;age is greater than 30 and less than 40&#8243; and &#8220;location is less than 5 miles from Zip 94107.&#8221; But I also want to enable a full text search that says &#8220;long walks on the beach.&#8221; With the Xoopit CloudQuery service I can quickly index my application and enable these sophisticated search capabilities, something I can&#8217;t do with Google ( s goog) Custom Search, for example.</p>
<p>Because of their near-instant indexing capabilities, technologies such as CloudQuery and Solr will be key in implementing the notion of the <a href="http://gigaom.com/2009/05/17/how-internet-content-distribution-discovery-are-changing/">real-time web</a> that has been popularized by services such as Twitter. Not only will I be able to perform the above query, but I will be able to set a  real-time alert for it. To see a very simple but interesting use of this kind of technology, check out <a href="http://search.twitter.com/search?q=&#038;ands=&#038;phrase=underrated+bands&#038;ors=&#038;nots=&#038;tag=&#038;lang=all&#038;from=daveofdoom&#038;to=&#038;ref=&#038;near=&#038;within=15&#038;units=mi&#038;since=&#038;until=&#038;rpp=30">Dave Rosenberg&#8217;s &#8220;A-Z List of Underrated Bands,&#8221;</a> which he created using Twitter Search. If you click on the link you&#8217;ll see that the search query at the top says: &#8220;underrated bands&#8221; from:daveofdoom. This query essentially says &#8220;find all tweets that contain the words &#8216;underrated bands&#8217; posted by the user &#8216;daveofdoom&#8217;.&#8221; if you were to leave that page open while Dave was adding to the list, you would have seen it updated on-the-fly &#8212; hence, the real-time web.</p>
<p>While Twitter has developed its own solution with significant development resources, Xoopit&#8217;s CloudQuery enables any application owner to very easily index and search-enable their application using its hosted cloud service and APIs. In addition to CloudQuery and Solr, other projects that are attempting to provide similar capabilities include the <a href="http://www.davebalmain.com/">Ferret</a> project for Ruby apps and <a href="http://sphinxsearch.com/">Sphinx Search</a>. As another indication of the level of interest in this area,<a href="http://www.lucidimagination.com/"> Lucid Imagination</a>, the company behind Solr (and Lucene support provider) recently received more than $6 million in funding from Granite Ventures, Walden International and In-Q-Tel, the CIA&#8217;s venture fund. But Xoopit remains the only player that has implemented this type of technology as a cloud service.</p>
<p>Yet it appears Yahoo is acquiring Xoopit as part of its strategy of focusing on a handful of core apps &#8212; among them email &#8212; and <a href="http://gigaom.com/2008/12/08/yahoo-plans-to-launch-a-mail-app-platform/">turn its email application into a platform</a>, around which it hopes developers will create an <a href="http://gevaperry.typepad.com/main/2009/05/hubs-spokes-and-islands.html">ecosystem</a> of complementary apps and plug-ins, utilizing <a href="http://developer.yahoo.com/mail/">the APIs that Yahoo has published.</a></p>
<p>But as happens so often in this business, a lost opportunity for one company could prove to be a new opportunity for another.</p>
<p><em>Geva Perry, who writes the <a href="http://gevaperry.typepad.com/main/">Thinking Out Cloud blog</a>, is an adviser to startups and enterprises on cloud computing strategy and marketing. Follow him on Twitter: <a href="http://twitter.com/gevaperry">@gevaperry</a>.</em></p>
<br />  <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=gigaom.com&#038;blog=14960843&#038;post=60141&#038;subd=gigaom2&#038;ref=&#038;feed=1" width="1" height="1" /><p><a href="http://pubads.g.doubleclick.net/gampad/jump?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=417092"><img src="http://pubads.g.doubleclick.net/gampad/ad?iu=/1008864/GigaOM_RSS_300x250&#038;sz=300x250&#038;c=417092" /></a></p><p><strong>Related research and analysis from GigaOM Pro:</strong><br />Subscriber content. <a href="http://pro.gigaom.com/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=60141+will-yahoo-use-xoopits-cloudquery-to-help-usher-in-the-real-time-web&utm_content=carolynpritchard">Sign up for a free trial</a>.</p><ul><li><a href="http://pro.gigaom.com/2012/10/the-state-of-cross-platform-measurement-across-tv-online-and-social/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=60141+will-yahoo-use-xoopits-cloudquery-to-help-usher-in-the-real-time-web&utm_content=carolynpritchard">The state of cross-platform media measurement</a></li><li><a href="http://pro.gigaom.com/2012/10/social-third-quarter-2012-analysis-and-outlook/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=60141+will-yahoo-use-xoopits-cloudquery-to-help-usher-in-the-real-time-web&utm_content=carolynpritchard">Social third-quarter 2012: analysis and outlook</a></li><li><a href="http://pro.gigaom.com/2012/05/the-discovery-democracy-how-social-discovery-is-transforming-entertainment/?utm_source=tech&utm_medium=editorial&utm_campaign=auto3&utm_term=60141+will-yahoo-use-xoopits-cloudquery-to-help-usher-in-the-real-time-web&utm_content=carolynpritchard">How social discovery is transforming entertainment</a></li></ul>]]></content:encoded>
			<wfw:commentRss>http://gigaom.com/2009/07/22/will-yahoo-use-xoopits-cloudquery-to-help-usher-in-the-real-time-web/feed/</wfw:commentRss>
		<slash:comments>18</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/9bed74f77c070b984c238ab18ddc0dc1?s=96&#38;d=retro&#38;r=PG" medium="image">
			<media:title type="html">carolynpritchard</media:title>
		</media:content>

		<media:content url="http:///2009/07/xoopit_logo_c1.gif" medium="image">
			<media:title type="html">xoopit_logo_c1</media:title>
		</media:content>
	</item>
	</channel>
</rss>
