SHARE
Follow this article on Twitter Facebook LinkedIn Bookmark and Share
Home >> Information Architecture

U of A text mining project could help businesses

U of A text mining project could help businesses

By:  Rafael Ruffolo  On: 25 Mar 2010 For: ComputerWorld Canada Creator

Text mining tools which can summarize and look for patterns within large electronic documents are still costly and difficult to use. But a group of researchers, including one at the University of Alberta, is hoping to change that. Find out about the tools they are developing and how they hope to improve text mining technology

 

Another data analysis tool called Zotero, which works as a free Firefox extension, collects and manages research sources.

 

Both TAPoR and Zotero are part of the Old Bailey project.

 

The ultimate goal for Rockwell is to make these tools more accessible to students, consumers, and businesses and have them start appearing on blogs, wikis, discussion boards, and even embedded right into browsers.

 

A search engine like Google, he said, is comparable to a card catalogue that directs you to a piece of information. This is also the way many text mining tools currently work.

 

“Most of these tools assume you have the word you want to find,” Rockwell said. “But instead of looking for a needle in the haystack, an effective text mining tool will try and show you the shape of the haystack and tell you the words you might want to find.”

 

One trend that Rockwell expects in this space is the rise of entity recognition, which could involve tools which recognize proper names, dates, and places. This would be useful to easily classify what a particular body of text is about.

 

For businesses, text mining and analysis of blogs or discussion boards can provide valuable insights into a customer sentiment. The more structured approach of conducting multiple-choice questionnaires “limit the depth of insight and breadth of customer feedback,” Bruce Temkin, vice-president and principal analyst with Cambridge, Mass.-based Forrester Research Inc., said in an interview with ComputerWorld Canada last year.

 

Text mining, on the other hand, is by no means a novel technology, but vendors are increasingly making it accessible for applications like Voice of the Customer (VoC). Unstructured data channels -- in-bound e-mail, call centre conversations, blogs, SMS messages -- by virtue of being text-oriented are prime targets for mining, he said, as are social media.

 

– With files from Kathleen Lau










Sign up for our Newsletters












Print |  Views: 3407   |   Rating:offoffoffoffoff  (0 votes)
Rate this article on a scale of
1 to 5 stars,5 being the best.




Rafael Ruffolo Rafael Ruffolo was a senior writer for ComputerWorld Canada from 2006 to 2011. He was the winner of a Kenneth R. Wilson award for business journalism in 2009.
blog comments powered by Disqus