SHARE
Follow this article on Twitter Facebook LinkedIn Bookmark and Share
Home >> Information Architecture

Big three database vendors disagree on Hadoop

Big three database vendors disagree on Hadoop

By:  Eric Lai  On: 17 Dec 2009 For: ComputerWorld (US) Creator

Why Microsoft, Oracle and IBM take three different paths on dealing with the open-source data architecture

The three leaders of the relational database market are responding to the sudden mania for the data processing technology Hadoop in three very different ways.

While startups and established data warehousing vendors such as Sybase Inc. and Teradata Inc. are embracing Hadoop and its Google -developed progenitor, MapReduce, Microsoft Corp. is resisting it.

"We'd never bring Hadoop code into one of our products," said Microsoft technical fellow and University of Wisconsin-Madison professor David J. DeWitt.

DeWitt's lack of interest is not surprising. DeWitt is an academic expert in parallel SQL databases, having co-invented three of them. He co-authored a paper this spring that argued that SQL databases still beat MapReduce at most tasks. He hasn't changed his mind.

"Every database vendor wants to claim that they're doing Hadoop because it's the popular thing," he said. "There's too much FUD. SQL databases still work pretty well."

DeWitt leads a database research lab at Madison that is helping Microsoft with R&D for its upcoming Parallel Data Warehousing version of SQL Server 2008 R2, formerly known as Project Madison.

As such, he said that the new edition of SQL Server will add some analytic functions that roughly mimic some of the features of MapReduce/Hadoop.

The additions are the result of incorporating technology from DATAllegro Inc., which Microsoft acquired, not Hadoop, DeWitt said.

He said does acknowledge, however, that MapReduce/Hadoop is better at keeping long-running queries from crashing than SQL.

Because of that, Microsoft may eventually try to incorporate those capabilities into future data warehousing-oriented versions of SQL Server, he said.

That would likely be a Microsoft-led effort, rather than a licensing of Hadoop's open-source code, which is managed by the Apache Software Foundation.

IBM is the leading corporate supporter of Apache. Perhaps unsurprisingly, it is also "very bullish on Hadoop," said Anant Jhingran, CTO of IBM's information management division in the software group.

"I'm not saying that mind-melding Hadoop with a database is the answer for everything," Jhingran said. "But in the end, I think every enterprise will want Hadoop. I'm just not sure in what form."


Sign up for our Newsletters












Print |  Views: 3863   |   Rating:offoffoffoffoff  (0 votes)
Rate this article on a scale of
1 to 5 stars,5 being the best.




eric lai Eric Lai is a contributor to the International Data Group (IDG) News Service, which publishes global technology stories from bureaus around the world to more than 300 publications in more than 60 countries.

Comments (0)

No Comments!
Name: (required) eMail: (optional)

Your email address will not appear online and will be used only if the editor wishes to contact you personally for additional comments.