User-Driven Taxonomies
-
Upload
christine-connors -
Category
Technology
-
view
1.931 -
download
1
description
Transcript of User-Driven Taxonomies
© Copyright 2008 Dow Jones and Company, Inc.
User-Driven Taxonomies
Christine Connors
iKMS, Singapore, 13 March 2008
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
The problem with…
Formal taxonomies High cost
Taxonomy creation experts Subject Matter Experts (SMEs) Software & Hardware Purchase & modify Consultants
Scope and timeline Implementation Maintenance Hard to sell an ROI
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
The problem with…
Informal taxonomies Consistency, clarity, context Scope and timeline Implementation Maintenance Hard to sell an ROI
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
The benefits of a hybrid approach
Expertise in taxonomy design User-centered language Contextual variety User-driven prioritization of knowledge modeling Grow the model faster
Guided by taxonomists to avoid chaos Distributed costs
Does require A champion Change Control Board / Taxonomy Advisory Board
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
Literary and user warrant in the enterprise
Object Repositories
Metadata Registries/Repositories
Search & BrowseMechanisms (UI)
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
What is a folksonomy?
“People’s classification management”
Wisdom of the crowd
User-generated tags applied to digital objects
Informal, uncontrolled vocabularies
Usually Subject or Task based
Provide little to no context on their own
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
Examples
Primary examples are del.icio.us and flickr
Blogs are anothergood place to look
© Copyright 2008 Dow Jones and Company, Inc.
Lessons learned: Hybrid methods and Social tagging pilots
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
Evolution
In the beginning… Best Bets were created by the search
administrator Search terms parsed out of the query sent
from the browser to the search engine Terms compared to manually created list of
Best Bet sites Matches were programmatically inserted
into the SERP before the #1 hit, with special formatting to highlight their existence
Intranet site owners called the search administrator to beg inclusion
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
An early pilot
• Each resource can only be placed in one bucket, need to duplicate entries for full coverage
• Not integrated with any other system - ILMS, DMS, CMS, FS
• Administered by Research Librarians
• Rarely used!
• How do we integrate Enterprise Search, Suggested Sites, Public Bookmarks and Social Tagging?
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
Updates to enterprise search
In search, search terms are tagged to bring back certain websitesUsers call, email or submit via web-form sites they would like to see addedTaxonomy team reviewed the submission for appropriateness, accuracy of tags, uniqueness of tags Sites and associated terms are manually entered into a flat fileDuring the regular index refresh cycle the flat file is programmatically converted to XML and ingested into search
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
2006 Social bookmarking pilot
We wanted to see *what* would happen if we “opened” up the tagging
Goal was to help our users find commonly requested information and most useful information by Tagging favorite internal websites
Maintain security by NOT posting intranet URLs to public sites like del.icio.us
Linking directly to a resource, be it internal or external Sharing and searching other user’s bookmarks Removing a bottleneck and relieving resource constraints in
a moderated hybrid system Reviewed available systems
Public sites not an option due to security considerations Connotea, Scuttle, del.irio.us, Freetag
© Copyright 2008 Dow Jones and Company, Inc.
How can folksonomies improve discovery?
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
As Inputs
To taxonomies, thesauri, ontologies? What folksonomy terms are popular? What synonyms can you derive? What relationships can you identify? What entity types are you discovering?
To search Identify Best Bets As inputs to a recommendation engine
To the content management strategy What do they tell you about how your content is
perceived? What do they tell you about how your content is used? Do they tell you when your users go elsewhere for
their content needs?
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
User driven
Enables user warrant Useful for understanding users – how do they think
about the objects you are providing to them? Allows the users to find things their own way, rather
than forcing them to do it the site’s way
Improves user experience Combine with search and web logs to
Improve navigation Improve browse mechanisms Improve search Identify content gaps Prioritize content and UI related tasks
© Copyright 2008 Dow Jones and Company, Inc.
How can you implement folksonomy tools?
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
Sample Pure methods
Install a tagging tool Tools similar to del.icio.us
Connotea Scuttle ConnectBeam Semantic applications such as Annotea (W3C) or
semantic blogging tools Modules for blogs/CMSs, examples:
Taxonomy modules for Drupal Tagging system in Wordpress or Typepad Extensions for MediaWiki
Make sure you review the reports available in the tools you consider Can you get actionable data?
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
Sample implementations of social/hybrid methods
Best bets Allow users to submit sites, along with keywords, to
improve search results
File properties / repository check-in form Encourage (or require!) that users
fill out the properties of the files they create, using any terms they deem appropriate
Automate whenever possible
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
Commercial Example
Buzzillions.com Combines formal taxonomy with folksonomy terms to
guide users to the products right for them
© Copyright 2008 Dow Jones and Company, Inc.
Thank you!
Christine ConnorsGlobal Director, Semantic Technology SolutionsDow Jones & [email protected]
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
Announcing Synaptica 7.0!
Synaptica 7.0 provides standardized, Semantic Web-enabled tools to manage your global business vocabulary in order to add structure and value to existing information assets, improve the online user experience and connect professionals in your organization with the information they need, where and when they need it.
Customer Benefits
Easy configuration Scalable for the enterprise with multi-user permissions Customizable and flexible with audience-centric views Supports collaboration and workgroups Standards based, semantic Web enabled Multiple data formats (HTML,XML,etc.) API level access for simple integration
http://solutions.dowjones.com/djcs/index.asp
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
Synaptica’s new side by side relationship editor makes the creation and editing of terms a one step process.
Easily find and edit a key term and multiple related terms
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
Synaptica drag and drop hierarchical relationship editing provides a simple, convenient way to manage vocabulary hierarchies.
Easily Manage and edit vocabulary hierarchies
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
Term Information Summary Window provides quick views of term details
Gain quick views of term information without leaving current interface
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
In addition to CSV, HTML and XML formats, reports may be created in Microsoft Word and Excel.
Expanded Reporting Functionality for Easier, More Flexible Information Sharing
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
Synaptica User and Administrative Guides are now available online directly from the application to browse and search
Quickly and easily access Help right from the application
Proprietary and Confidential | © Copyright 2008 Dow Jones and Company, Inc.
Dow Jones Client Solutions Offers Comprehensive, Business Taxonomy Solutions for Fast, Relevant Information Retrieval
Industry-focused
integrated solutions
Build & CustomizeTo Suit Your
Information Needs
Stay Informed with
Taxonomywarehouse.com
Optimize and ManageWith Synaptica
License & IntegrateIndustry-focused
Taxonomies