Yahoo's Genome and hosted big data analytics

By:  Jaikumar Vijayan  On: 23 May 2012 For: Computing Canada Creator
 

Several companies have begun offering big data analytics as a service

FRAMINGHAM, Mass. -- Yahoo this week joined a growing list of companies offering big data analytics as a service with the introduction of its Genome offering.

Genome is a service designed to let companies deliver highly targeted online advertising and marketing campaigns. It will let advertisers quickly sift through and analyze terabytes of real-time web data collected from Yahoo's own networks and from those of partners such as AOL.

The service, scheduled to become available in July, will let advertisers mash up their own data with Yahoo's data and run analytics on the combined data set.

Such instant analysis of real-time big data sets is an emerging trend and something that many companies are moving towards. "It is illustrative of the desire by companies of all sizes to capture, synthesize, analyze and share timely information about user behavior," said Jeffrey Kaplan, managing director of ThinkStrategies. The goal: To drive better decision-making and new business opportunities, he said.

Genome is based on technology from interclick, a company that Yahoo acquired last December. At its core is a 20-terabyte in-memory database that pulls in and analyzes real-time behavioral and advertising-related data from Yahoo's multi-petabyte-scale Hadoop clusters.

The company is using a blend of proprietary technology and best-of-breed commercial products from vendors such as Netezza and MicroStrategy to analyze the real-time data, said Michael Katz, CEO of interclick.

"Looking at it through the lens of the business, big data is not just about storing the data," Katz said. "It's about capturing data, putting it into the platform, updating it and propagating it out to the server to be able to do targeting against it in real-time. It's not a trivial task."

Genome is one of a growing number of services that offer companies a way to run sophisticated analytics on their big data without having to invest in a data analytics infrastructure of their own or find scarce data scientists to support it.

In Yahoo's case, the service is targeted specifically at online ad targeting. Others are broader in nature.

One example is Google's BigQuery, launched a few weeks ago, which aims to let enterprises upload their data to Google's infrastructure and run sophisticated analytics against it. Another is ClearStory, a start-up that came out of stealth mode earlier this year. It offers a service that lets companies mash up heterogeneous data from corporate databases, Hadoop environments and public web sources, and then run it through an analytics application.

Other companies with similar offerings include Metamarkets, a venture-funded startup that offers a software-as-a-service (SaaS) solution for big data analytics. The company helps firms analyze clickstream and other online data, and provides visualization and predictive modeling capabilities for customers.


Jaikumar Vijayan is a contributor to the International Data Group (IDG) News Service, which publishes global technology stories from bureaus around the world to more than 300 publications in more than 60 countries.
