

First there was open source. And now, here comes open data. Or that’s what Gil Elbaz thinks. He’s returning to the startup arena with a new Los Angeles-based company called Factual, which is building its business around the concept of open data. It wants to help build open databases that will run the gamut, from the addresses of all Thai restaurants in San Francisco to the fauna of Florida. And instead of chasing the elusive consumer, the company is going to woo app developers by targeting its offerings to them.

“We wanted to build an open data repository,” Elbaz explained, “because just as databases on computers are used by apps, we wanted to make the web computable.” Elbaz first earned his chops as the co-founder of Applied Semantics, a company that was acquired by Google in 2003 for about $102 million. Applied’s technology is now part of Google’s AdSense. He left Google in 2007 and since then has been focused on Factual, a company he has so far funded with his own cash.

When I asked him why he started Factual, he said he’d observed that when people didn’t have enough data to use, they couldn’t make smarter decisions. And what if the data available wasn’t accurate or easily accessible — would decisions made be the right ones? All of which are good points, but how does one go about addressing them?

Factual is seeding the system with databases built out of government data, Wikipedia and other such resources. Elbaz is hoping that over time, folks will start their own data projects and leverage their communities to help build these databases. And since these databases will be open source, any app developer could tap into them for their own applications. In many ways Elbaz has taken his cue from Wikipedia.

Elbaz hopes that even larger companies will open up and start sharing their data sets with the community. “Companies will soon realize that they can clean up and grow their data sets without incurring huge costs associated with closed databases,” he said. From the Factual blog:

We think a good route to low cost and high quality data is the open data model.  By making data open to access (read) so that developers can create valuable new applications without complex data licensing restrictions, and by making the data open for opinion, comment, and debate (write) — we believe a groundswell of support for certain data verticals could emerge.

There have been a number of great open structured data projects that have positively impacted the web; ODP, MusicBrainz and OpenStreetMap are just a few examples. But we believe it’s just the beginning. Factual intends to build one of the largest repositories of open data by providing an open, collaborative environment where anyone can easily view, contribute, improve and share data.

We’ve been testing with several partners (see home page for list) who understand that Factual’s open data can help websites offer better data and tools for end users.  For example, we’ve partnered with Demand Media, on a cancer physician table on their Livestrong.com site.

The company plans to eventually charge for a suite of premium services, including access to for-fee premium APIs and quality-of-service guarantees. All that comes next, mostly because the company faces the uphill task of convincing developers and hundreds of data set creators to join Factual. It has competition from San Francisco-based Metaweb (with its Freebase products), amongst several other projects.

Factual’s hard challenges aside, I’m going to keep an eye on this company, mostly because of the pedigree of the founder and also because data, or rather data analysis as a key strategic asset, is the next big thing.

Earlier this month, much to the chagrin of some of our readers, I equated the Hadoop-focused startup, Cloudera, to Red Hat. My argument was that in the late 1990s, open-source operating systems and web software proved to be major disruptors and helped Internet services grow exponentially. About a dozen years later, the future of Internet services revolves around data and data analytics.

And Hadoop, open-source software for distributed data storage and processing, has become a popular choice for everyone who relies on data crunching, from advertising companies to biotech giants. Folks at Google know this all too well, which is why they’ve invested extensively in their infrastructure systems. From that perspective, if Elbaz can convince enough people to sign up, his little Factual has a good chance at carving out a nice niche for itself.

Om’s Note: I am going to play around with their service more extensively before passing judgment on it.


By Om Malik


  1. Friends of Dave (friendsofdave) ‘s status on Tuesday, 13-Oct-09 12:08:09 UTC – Identi.ca Tuesday, October 13, 2009
  2. Structured Search Is On The Table | The Noisy Channel Tuesday, October 13, 2009

    [...] can read more detailed coverage in Search Engine Land, TechCrunch, ReadWriteWeb, GigaOM, and [...]

  3. Looks like wiki spreadsheets for structural data with API. Still has limitation that we can’t add columns, for example latitude and longitude for restaurants.

    On revenue model, it is ok to charge for premium API since it hosts infrastructure. Still hope to have some mechanism like Hg or Git to make open database distributed and synced, so other companies can host the data if they wish.

    1. Johann,

      As a matter of factual…users can add columns to Factual tables–new columns, or joined columns from other tables.

      As for data consistency, syndicated Factual tables on external sites push and pull data from Factual, so any changes made to a Factual table, syndicated or otherwise, affects all instances of that table. The API will operate similarly. Good question!

  4. Links 13/10/2009: Ubuntu 9.10 for Servers, ASUS Back to GNU/Linux? | Boycott Novell Tuesday, October 13, 2009

    [...] Factual Sees Open Data As Its Future Earlier this month, much to the chagrin of some of our readers, I equated the Hadoop-focused startup, Cloudera, to Red Hat. My argument was that in the late 1990s, open-source operating systems and web software proved to be major disruptors and helped Internet services grow exponentially. About a dozen years later, the future of Internet services revolves around data and data analytics. [...]

  5. Data is the Future of Web: Latest Validation from Prominent Investors? « Semantifi Blog Thursday, February 11, 2010

    [...] Here is GigaOM’s  .. [...]

  6. Any info on partitioning of fact tables

    1. Ashkay,

      I’m not sure what you mean by partitioning…you can “split up” a Factual table by using filters. Let’s say, for example, that you have a table of food trucks in Los Angeles and you want to see food trucks that only serve Mexican food; apply a filter and the table is now Mexican food trucks in LA.

      Does this answer your question?

  7. Factual Wants to Be Your Place Database in the Cloud: Tech News « Wednesday, September 22, 2010

    [...] data API provider Factual is making a play for the hot geo-local space today. The company, which launched last fall, offers tools for improving data through crowdsourcing and data sets that web and mobile [...]

  8. Factual Nabs $25 Million to Push Open Data : Tech News « Wednesday, December 8, 2010

    [...] start-up, built by Applied Semantics co-founder Gil Elbaz, got off the ground last year. In September, the company sharpened its focus on local data, releasing a huge amount of geo-coded [...]
