On the Utility of Tags for Search and Navigation in Online Information Systems
-
Upload
christoph-trattner -
Category
Documents
-
view
5.586 -
download
0
Transcript of On the Utility of Tags for Search and Navigation in Online Information Systems
Graz University of Technology
1
. Christoph Trattner Rigorosum 11.10.2012
On the Utility of Tags for Search and Navigation in Online Information Systems
Christoph Trattner
Knowledge Management Institute
Graz University of Technology, Austria
Graz University of Technology
2
. Christoph Trattner Rigorosum 11.10.2012
What will this talk about
Tagging Systems and in particular about
Tags/tag clouds and
their usefulness for the task of search and navigation
Graz University of Technology
3
. Christoph Trattner Rigorosum 11.10.2012
Definitions
Tag =
A short string, term or word that describes or categorizes an online resource and that is applied by a person or a set people
Tagging System =
An online information system that allows the users to apply tags to resources of the system
Graz University of Technology
4
. Christoph Trattner Rigorosum 11.10.2012
What was the motivation of my work?
Graz University of Technology
5
. Christoph Trattner Rigorosum 11.10.2012
Motivation?
What I recognized when I started my PhD work 3 years ago was the fact that a lot of modern online information systems used tagging functionality to categorize or describe content
and…
to build simple user interfaces on the top-of this light-weight meta-data structures
Graz University of Technology
6
. Christoph Trattner Rigorosum 11.10.2012
Graz University of Technology
7
. Christoph Trattner Rigorosum 11.10.2012
Tags
Graz University of Technology
8
. Christoph Trattner Rigorosum 11.10.2012
InterestinglyThere is a lot of work,
Analysis of Tagging Systems
[Hammond et al.] [Golder and Huberman] [Marlow et al.] [Halphin et al.] [Shen and Wu]
Tags vs. Keywords vs. Named Entities (semantic/structure)
[Krause et al.] [Benz et al.] [Heymann et al.]
Tagging Motivation and Behavior
[Heckner et al.] [Ames and Naaman] [Strohmaier et al.]
[Körner et al.]
Tag Cloud Construction & Visualization
[Montero and Solana] [Kautz et al.] [Rivadeneira et al.] [Kaser and Lemire] [Seifert et al.]
Utility of Tag Clouds for Search Result Summarization
[Kuo et al.] [Koutrika et al.] [Sinclair et al.]
There is hardly any research that investigates the usefulness
of tags or tag clouds for the task of search and navigation
in tagging systems
Graz University of Technology
9
. Christoph Trattner Rigorosum 11.10.2012
Problem Statement
The problem we are facing in this dissertation is the lack of knowledge about the usefulness and the efficiency of tags and corresponding state-of-the-art tag-constructs such as tag clouds for the task of search and navigation in tagging systems.
Graz University of Technology
10
. Christoph Trattner Rigorosum 11.10.2012
Research Questions
RQ1: To what extent are tags/tag clouds useful for (efficient) navigation in tagging systems?
RQ2: To what extent are tags/tag clouds useful for search?
RQ3: To what extent are tags/tag clouds more useful/efficient for search/navigation than other tag-alike meta-data such as keywords or search query-terms?
RQ4: To what extent can we build better tag-based browsing constructs that support efficient search/navigation in tagging systems?
Graz University of Technology
11
. Christoph Trattner Rigorosum 11.10.2012
Ok, lets start….
Graz University of Technology
12
. Christoph Trattner Rigorosum 11.10.2012
Research Question 1:
To what extent are tags/tag clouds useful for (efficient) navigation in tagging systems?
Helic, D., Trattner, C., Strohmaier, M. and Andrews, K. 2010. On
the Navigability of Social Tagging Systems. In Proceedings of the Second
IEEE International Conference on Social Computing (SocialCom
2010), Minneapolis, Minnesota, USA, pp. 161-168.
Graz University of Technology
13
. Christoph Trattner Rigorosum 11.10.2012
Modeling tagging systems as graphs
Resources (= Text Documents, Images, URLs)
Tags
To answer the question to what extent tags/tag clouds are useful for navigationin tagging systems we modeled tagging systems as bipartite graphs
Graz University of Technology
14
. Christoph Trattner Rigorosum 11.10.2012
Defining Navigability
A network is navigable iff:There is a short path between all or almost all pairs of
nodes in the network. [Kleinberg 1999]
Formally:1. There exists a giant component (> 90%)2. The effective diameter is low (bounded by log n)
J. Kleinberg. The small-world phenomenon: An algorithmic perspective. Proc. 32nd ACM Symposium on Theory of Computing, 2000. Also appears as Cornell Computer Science Technical Report 99-1776 (October 1999)
Graz University of Technology
15
. Christoph Trattner Rigorosum 11.10.2012
Are tags useful for navigation?
Results:
In general tags form networks which are navigable
Austria-Forum: 32,245 annotations, 12,837 resourcesBibSonomy: 916,495 annotations, 235,339 resourcesCiteULike: 6,328,021 annotations, 1,697,365 resources
Graz University of Technology
16
. Christoph Trattner Rigorosum 11.10.2012
Are tags useful for efficient navigation?
.
Tagging networks are navigable power-law networks. For power law networks, efficient sub-linear decentralised navigation algorithms exist.
Results:
In general tags form networks which are also efficiently navigable
Graz University of Technology
17
. Christoph Trattner Rigorosum 11.10.2012
But how about tag clouds?
Tag Cloud Size ntopN resources
(topN most common algorithm)
Pagination of resources / tagk resources shown / page
(reverse chronological ordering)
Graz University of Technology
18
. Christoph Trattner Rigorosum 11.10.2012
Are tag clouds useful for navigation?
.
Limiting the tag cloud size n to practically feasible sizes (e.g. 5, 10, or more) does not influence navigability (this is not very surprising).
BUT: Limiting the out-degree of high frequency tags k (e.g. through pagination with resources sorted in reverse-chronological order) leaves the network vulnerable to fragmentation. This destroys navigability of prevalent approaches to tag clouds.
Pagination
Tag Cloud Size
Results:
In general tag clouds do not provide the possibility to navigate to all resources in a tagging system
Graz University of Technology
19
. Christoph Trattner Rigorosum 11.10.2012
Research Question 2:
To what extent are tags/tag clouds useful for search?
Trattner, C., Lin, Y., Parra, D., Yue, Z., Real, W. and Brusilovsky, P.
2012. Evaluating Tag-Based Information Access in Image Collections.
In Proceedings of the 23rd ACM Conference on Hypertext and Social
Media (HT 2012), ACM, New York, NY, USA, pp. 113-122.
Graz University of Technology
20
. Christoph Trattner Rigorosum 11.10.2012
Methodolgy
A controlled user study with 24 participants
With three different types of search interfaces
Baseline Tag Cloud Faceted Tag Cloud
1 2 3
Graz University of Technology
21
. Christoph Trattner Rigorosum 11.10.2012
Dataset
~ 2,000 images ~ 4,200 tags ~ 16,000 tag assignments
Interesting Fact:
Tags were generated by ~100 users from Amazon Mechanical Turk
Graz University of Technology
22
. Christoph Trattner Rigorosum 11.10.2012
Evaluation: Look-up Task
Look-up task
Graz University of Technology
23
. Christoph Trattner Rigorosum 11.10.2012
Evaluation: Exploratory Search Task
Exploratroy search task
Graz University of Technology
24
. Christoph Trattner Rigorosum 11.10.2012
Results: Performance
Look-up: no sign. differences between interfaces
Exploratory:- Tag Cloud Interface out-performs baseline- Faceted Tag Cloud Interface almost as slow as baseline
Question: What interface performs best?Variables: • Total Actions• Search Time
1 2 3
Results:
The Tag cloud interface significantly outperforms the baseline (no-tag) interface
Graz University of Technology
25
. Christoph Trattner Rigorosum 11.10.2012
Results: Preference and RatingQuestion: What was the preference of the users?
• Post-questionair was handed out to the subjects with overall 7 questions.
Question: How are the interfaces rated?
Scale: 1 = very bad….5=very good
Results:
The Tag cloud interfaces are significantly higher rated than the non-tag interface
No tags tags
Graz University of Technology
26
. Christoph Trattner Rigorosum 11.10.2012
Research Question 3:
To what extent are tags/tag clouds more useful/efficient for search/navigation than other tag-alike meta-data such as keywords or search query-terms?
Trattner, C. 2011. Linking Related Content in Web Encyclopedias with search query tag clouds. In the International Journal on WWW/Internet, Volume 9, Issue 2 (IJWI), pp. 33-55.
Helic, D., Körner, C., Granitzer, M., Strohmaier, M. and Trattner, C. 2012. Navigational Efficiency of Broad vs. Narrow Folksonomies. In Proceedings of the 23rd ACM Conference on Hypertext and Social Media (HT 2012), ACM, New York, NY, USA, pp. 63-72.
Graz University of Technology
27
. Christoph Trattner Rigorosum 11.10.2012
Example: Austria-Forum
In Austria-Forum tags/tag clouds are used to link related content
Since the tagging system is in a early adopting phase search query terms are used
Graz University of Technology
28
. Christoph Trattner Rigorosum 11.10.2012
Are query terms more navigable than tags?
Results:Both user tag and query tag networks show a large connected componentBoth show an ED that is bounded by log(N)
Results:
On a network-theoretic level we find that the AF user tags and query tags are efficiently navigable
Graz University of Technology
29
. Christoph Trattner Rigorosum 11.10.2012
And how efficient are they for humans?
Tag hierarchy Tag (Cloud) network
To that end, we implemented a decentralized search algorithm that simulates human-like tag-based navigation by inducing a hierarchy out of the tag-resource network [Helic 2011] [Trattner 2012].
Trattner, C., Singer, P., Helic, D. and Strohmaier, M.: Exploring the Differences and Similarities of Hierarchical Decentralized Search and Human Navigation in Information-networks, In Proceedings of the 12th International Conference on Knowledge Management and Knowledge Technologies, ACM, New York, NY, USA, 2012.
Helic, D., Strohmaier, M., Trattner, C., Muhr M. and Lermann, K.:Pragmatic Evaluation of Folksonomies, In Proceedings of the 20th international conference on World wide web, ACM, New York, NY, USA, 417-426, 2011.
Graz University of Technology
30
. Christoph Trattner Rigorosum 11.10.2012
What are the results?
Results:
From simulations we find that the query tags are better suited for navigation than user tags
Additionally to this a user study was conducted to determine the quality of the query tags, showingno sign. difference.
Graz University of Technology
31
. Christoph Trattner Rigorosum 11.10.2012
We
Keywords
Tags
Example: Mendeley
Keyword = A short term or string (typical controlled vocabulary) assigned by a single user
Graz University of Technology
32
. Christoph Trattner Rigorosum 11.10.2012
Tags
Are keywords more navigable than tags?
Keywords
Results: Our Greedy Navigator (= Simulator) needs on average 1-click more with keywords to reach the target node than with tags
Results:
With simulations we find that tags are more efficient for navigation than keywords
#hops success rate #hops success rate
Stretch= #hops/shortest path
Graz University of Technology
33
. Christoph Trattner Rigorosum 11.10.2012
“Since we observed that tagging systems only support efficient navigation through tags if no user interface limitations are considered, we had the idea to invent a number of new approaches that support more efficient navigation with tags.”
Graz University of Technology
34
. Christoph Trattner Rigorosum 11.10.2012
Research Question 4:
To what extent can we build better tag-based browsing constructs that support efficient search/navigation in tagging systems?
Trattner, C., Helic, D. and Strohmaier, M. 2011. On the Construction of Efficiently Navigable Tag Clouds Using Knowledge From Structured Web Content. In the Journal of Universal Computer Science (JUCS), Volume 17, Issue 4, pp. 565-582.
Trattner, C. 2011. Improving the Navigability of Tagging Systems with Hierarchically Constructed Resource Lists: A Comparative Study. In Proceedings of the 33rd International Conference on Information Technology Interfaces (ITI 2011), IEEE, Cavtat / Dubrovnik, Croatia, pp. 173-178.
Trattner, C., Körner, C. and Helic, D. 2011. Enhancing the Navigability of Social Tagging Systems with Tag Taxonomies. In Proceedings of the 11th International Conference on Knowledge Management and Knowledge Technologies (I-Know 2011). ACM, New York, NY, USA, pp. 18:1-18:8.
Trattner, C. 2011. Improving the Navigability of Tagging Systems with Hierarchically Constructed Resource Lists and Tag Trails. In the Journal of Computing and Information Technology (CIT), Volume 19, Issue 3, pp. 155-167.
Graz University of Technology
35
. Christoph Trattner Rigorosum 11.10.2012
How can we enhance Tag Cloud navigability?
Graz University of Technology
36
. Christoph Trattner Rigorosum 11.10.2012
Through dynamic resource list construction!
Idea
Graz University of Technology
37
. Christoph Trattner Rigorosum 11.10.2012
Approach
Instead of calculating the resource list statically, we calculate the resource list in a dynamic and resource-specific manner=> On each click on a particular tag a different resource list is generated
Link1Link2Link4Link5
Link4Link10Link11Link3
Tag Tag
Resource x Resource y
Link1Link2Link4Link5
Tag Tag
Resource x Resource y
Link1Link2Link4Link5
Static Resource List Construction Dynamic Resource List Construction
Graz University of Technology
38
. Christoph Trattner Rigorosum 11.10.2012
Approach: Hierarchical Resource List Construction
Graz University of Technology
39
. Christoph Trattner Rigorosum 11.10.2012
Results
Random
Results: Only the random approach and the hierarchicalresource list calculation approach show a large connected component
Hierarchical
Giant Component
Similarity
Graz University of Technology
40
. Christoph Trattner Rigorosum 11.10.2012
Results
Hierarchically Constructed Resource List
Simulations
User Study
Results:
We find tag clouds calculating the resource list in a hierarchical manner are better suited for navigation
Graz University of Technology
41
. Christoph Trattner Rigorosum 11.10.2012
…ok lets come to an end
Graz University of Technology
42
. Christoph Trattner Rigorosum 11.10.2012
Summary of Contributions
1. The review of the utility of tags for the task of search and navigation in tagging systems
2. The navigational review of tags compared to other tag-alike meta-data structures such as keywords and search query terms
3. The introduction of (a number) of new approach(es) to support more efficient tag-based navigation in tagging systems
Graz University of Technology
43
. Christoph Trattner Rigorosum 11.10.2012
Thank you!
Christoph Trattner
Email: [email protected]: www.christophtrattner.info
Twitter: @ctrattner
Sponsors: