Tagging: It's no longer fun and easy

Most people think that tagging on the Web is pretty easy and fun. Give ‘em a blog or a Web page and a field named “tags,” and they’ll start stuffing in text with wild abandon in the hopes that their content will be easily found by people who are desperately searching for information and opinion on feline hairball cures or cycling in the Ozarks or whatever their particular hobby is.

Alas, all these folks are doing is polluting the Web.

Tags arose out of a need for a way to classify Web page content and blog entries that the big search engines, such as Google, couldn’t find or ignored. Tagging also appealed to people because it was a democratic technique that was fast, easy and had a perceived payoff. If that payoff ever existed, it was back when the blogosphere was smaller and tagging hadn’t gone mainstream. Today, I doubt there is much of a payoff left.

The trouble is, rot has set in: tagging has developed a few significant problems that are making it progressively less valuable. This is not to say tagging is, per se, a bad thing, merely that its popularity and the lack of standards ensure that its usefulness will continue to degrade, and that tagging will become a bigger source of content “noise” with every passing day.

The first problem with tagging is semantic vagueness. For example, does the tag “china” apply to the country or crockery? While you might hope that the distinction between the two would be evident from examination of related data, such as other tags used for the same item, specific words used in the item or in the rest of the site hosting the item, the effort required to resolve the context wipes out the value of tagging in the first place.

A second problem is that the format of tags isn’t standardized. How white space is handled, which characters are legal, and which characters carry special meanings (and what those meanings are) all go undefined.
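
To make the format problem concrete, here is a minimal Python sketch of one raw “tags” field read under two conventions. Both splitting rules and the sample string are assumptions made up for illustration; neither is taken from any real tagging service.

```python
# Hypothetical example: the same raw "tags" field, read under two
# made-up conventions. Neither rule comes from a real tagging service.

RAW_TAGS = "cycling in the ozarks, hairball cures"


def split_on_spaces(raw: str) -> list[str]:
    """Assumed convention A: any run of white space separates tags."""
    return raw.split()


def split_on_commas(raw: str) -> list[str]:
    """Assumed convention B: commas separate tags; spaces are part of a tag."""
    return [part.strip() for part in raw.split(",") if part.strip()]


if __name__ == "__main__":
    print(split_on_spaces(RAW_TAGS))
    # ['cycling', 'in', 'the', 'ozarks,', 'hairball', 'cures']
    print(split_on_commas(RAW_TAGS))
    # ['cycling in the ozarks', 'hairball cures']
```

Under the first rule the author’s two phrases become six tags, one of them carrying a stray comma; under the second they stay as two. Without an agreed format, a site that collects tags from many sources has no way of knowing which reading the author intended.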

The third and perhaps biggest problem is the overuse of tagging. How often have you seen a blog item with a list of tags almost as long as the item itself? This is a direct result of the optimism of tag authors — they want to cover all of the bases so their content can be easily found.

This last problem underlines the messiness of tagging and why the noise generated by tags is growing so rapidly. Any index of tags from a given set of Web or blog pages is gigantic, and each tagged item has scores of closely related tag variants with little or no syntactic distinction. In other words, a big mush of text.
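
A rough Python sketch of how an indexer might try to fold such variants together follows. The normalization rule (lowercase, then drop everything except letters and digits) and the sample tags are assumptions chosen for illustration, not a description of how any real service works.

```python
import re
from collections import defaultdict


def variant_key(tag: str) -> str:
    """Assumed rule: lowercase the tag and drop every non-alphanumeric character."""
    return re.sub(r"[^a-z0-9]", "", tag.lower())


def group_variants(tags: list[str]) -> dict[str, set[str]]:
    """Group tags that share the same normalized key."""
    groups: dict[str, set[str]] = defaultdict(set)
    for tag in tags:
        groups[variant_key(tag)].add(tag)
    return dict(groups)


if __name__ == "__main__":
    # Hypothetical tags as they might appear across several blogs.
    sample = ["web-design", "Web Design", "webdesign", "web_design", "china"]
    for key, variants in group_variants(sample).items():
        print(key, sorted(variants))
```

A rule like this folds the four spellings of “web design” into one index entry, but it does nothing for the semantic problem: “china” the country and “china” the crockery still share a single key.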

The result is that automated systems such as Del.icio.us and Technorati, which find, index and search tags across multiple sites, will continue to become less valuable because they have to deal with ever greater levels of noise. Even so, tagging will survive, but it will have to evolve to retain relevance. I know that those of you who use it for your blogs and Web sites will probably not give up on tagging too soon, but mark my words: in the near future you will either not be bothering with tagging or you’ll have moved on to the next generation of tagging, which will be more complex (probably based on XML) and demand more effort to use. Tagging will no longer be fun and easy.

Gibbs is a columnist for Network World (U.S.). Contact him at backspin@gibbs.com.
