HP Vertica adds analytics tool for semi-structured data

With organizations keeping more data for sophisticated analysis the number of business intelligence platforms keeps expanding.

One of the lesser known ones is Hewlett-Packard Co.’s Vertica massively parallel data warehouse platform for analyzing structured data on Linux clusters. HP bought Veritca in 2011.

Now it has announced  Vertica 7.0 Enterprise Edition with a number of new capabilities including improvements to the management console and the addition of a Java software development kit.

But it also announced a new product called Vertica Flex Zone for exploring semi-structured and unstructured data before importing it into Vertica platform.

“We think it’s going to go well with people who already use Hadoop and want to accelerate the exploration angle,” said Luis Maldonado, director of product management, HP Vertica. Other will be satisfied with using Flex Zone by itself, he said.

Flex Zone doesn’t need schemas to be defined before structured or unstructured data is loaded for exploration. Its connectors support JSON and CSVs (comma separated values) and other file stores for analytics.

As a result mixed data like application logs and customer information from Web sites can be accessed in several ways. SQL can be run against the data.

“That we believe is going to be a game changing approach for how you do visualization,” Maldonado said.

Once interesting data has been found, he added, it can be exported in one-step into columnar form for real-time analytics.

Flex Zone “is a great on ramp or discovery tool  and a very nice way of making the whole data life cycle — from early storage to early exploration to operational — much smoother,” he said.

Flex Zone is the first of several new Vertica products coming, Maldonado said.

As for Vertica 7.0 (which carried the code name Crane) — a columnar analytics database –its connectors to Hadoop have been broadened with integration with HCatalog, Hadoop’s table and storage management layer.

That joins integration with MapReduce and HDFS (Hadoop Distributed File System).

As a result, Maldonado said, there are four ways Vertica 7 connects to Hadoop — either integrated tightly or a more federated way.

Vertica 7 now also has a Java SDK — joining developer kits for the R statistical language and C++ — which allows developers to link to more data analysis applications.

Finally, the management console has improved diagnostics, tuning and performance capabilities.

Vertica 7.0 and Flex Zone will be released next month.

HP [NYSE: HPC] doesn’t give pricing details, but Maldonado said Vertica 7 is priced the amount of customer data is managed, starting with a minimum 1 TB. There’s a free Community Edition for those who with less than 1TB, or who just want to get used to the product.

Vertica Flex Zone, which is sold separately, will be “closer to Hadoop pricing,” he said.

Would you recommend this article?


Thanks for taking the time to let us know what you think of this article!
We'd love to hear your opinion about this or any other story you read in our publication.

Jim Love, Chief Content Officer, IT World Canada

Featured Download

Howard Solomon
Howard Solomon
Currently a freelance writer, I'm the former editor of ITWorldCanada.com and Computing Canada. An IT journalist since 1997, I've written for several of ITWC's sister publications including ITBusiness.ca and Computer Dealer News. Before that I was a staff reporter at the Calgary Herald and the Brampton (Ont.) Daily Times. I can be reached at hsolomon [@] soloreporter.com

Featured Article

ADaPT connects employers with highly skilled young workers

Help wanted. That’s what many tech companies across Canada are saying, and research shows that as the demand for skilled workers...

Related Tech News

Tech Jobs

Our experienced team of journalists and bloggers bring you engaging in-depth interviews, videos and content targeted to IT professionals and line-of-business executives.

Tech Companies Hiring Right Now