SHARE
Follow this article on Twitter Facebook LinkedIn Bookmark and Share
Home >> Information Architecture

U of A text mining project could help businesses

U of A text mining project could help businesses

By:  Rafael Ruffolo  On: 25 Mar 2010 For: ComputerWorld Canada Creator

Text mining tools which can summarize and look for patterns within large electronic documents are still costly and difficult to use. But a group of researchers, including one at the University of Alberta, is hoping to change that. Find out about the tools they are developing and how they hope to improve text mining technology

In an effort to highlight the role of data mining in humanities research, a University of Alberta professor is helping to create text analysis tools to deeply examine historical trial accounts from the U.K.’s famous Old Bailey criminal court. While the research project is important to academia, the Edmonton-based researcher said that improving the quality of text mining tools could have benefits for businesses as well.

 

The project is a joint effort between the University of Alberta, U.S.-based George Mason University, and U.K.-based University of Hertfordshire. It is part of the “Digging Into Data Challenge” competition, which aims to show how tools in the digital humanities can improve the effectiveness of text mining large databases.

 

Geoffrey Rockwell, a professor of philosophy and humanities computing at the University of Alberta, said the explosion of blogs, wikis, digitalized books, and discussion forums has led to information overload for many researchers. But the same could also be true for marketers or other business units keeping tabs on what is being written about their companies.

 

“If I were doing this for market research, I could track how we appear in discussion lists, what words are near the brand, and whether they are talking to stock up or down,” he said.

 

In the Old Bailey project, Rockwell hopes the tools he and his international colleagues are developing will revolutionize how users and businesses cull through digital material. The key, he said, is to draw correlations from a body of text through the use of a mining tool.

 

“You can start looking at words most used in cases involving women pick pockets,” he said, referring to the U.K. prison record dataset. “Are people still worried about witchcraft? How are children being talked about? Is poison still popular?”

 

Rockwell and his international colleagues are developing tools like TAPoR, a textual analysis tool that can summarize a body of text, find collocates, identify important dates, and discover the co-occurrences of two target words. Some of the tools in TAPoR use forms of visualization to help researchers grasp the data even clearer.


Sign up for our Newsletters












Print |  Views: 3022   |   Rating:offoffoffoffoff  (0 votes)
Rate this article on a scale of
1 to 5 stars,5 being the best.




Rafael Ruffolo Rafael Ruffolo was a senior writer for ComputerWorld Canada from 2006 to 2011. He was the winner of a Kenneth R. Wilson award for business journalism in 2009.

Comments (0)

No Comments!
Name: (required) eMail: (optional)

Your email address will not appear online and will be used only if the editor wishes to contact you personally for additional comments.