User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04.

18
User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04

description

Interfaces & IR systems: Problems Past emphasis on a working system (not on a user-friendly system) Usually tacked on a front-end for users (not necessarily user-friendly) Users without training were out of luck Interfaces using Boolean queries were common

Transcript of User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04.

Page 1: User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04.

User Interfaces and Information Retrieval

Dina ReitmeyerWIRED (i385d)

10.14.04

Page 2: User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04.

Presentation Outline

• Past problems with interfaces & IR• Why we need good systems & interfaces• How to make things better:

– User-friendly systems (using statistical ranking) – D. Harman– WebMate – L. Chen & K. Sycara– Dynamic queries – C. Williamson & B. Shneiderman– Relevance profiling within documents – D. Harper, S. Coulthard,

& S. Yixing

• Whatever happened to … ?• Conclusion

Page 3: User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04.

Interfaces & IR systems: Problems

• Past emphasis on a working system (not on a user-friendly system)

• Usually tacked on a front-end for users (not necessarily user-friendly)

• Users without training were out of luck

• Interfaces using Boolean queries were common

Page 4: User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04.

Why do we need a good system & a good interface?

• The user of the system is “the customer”!

• If your system doesn’t work well, no one will use it!

• If your interface doesn’t work or is too confusing/difficult to interact with, no one will use it!

• If you want your system to be useful, how the user views it (and its interface) must be a major concern.

Page 5: User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04.

“User-Friendly Systems Instead of User-Friendly Front-Ends”

–D. Harman • Problem: systems not built for users

(even with a front-end, still hard to use well)• Need for user-friendly systems• 1 Solution: use systems with statistical

rankingsimple definition: a system compares a

document & a query and estimates the likelihood of the document’s relevance to the query

Page 6: User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04.

4 Prototypes of Statistical Retrieval Systems

• PRISE- uses weighting formula, adds weights of matching terms, results

are ranked

• CITE- similar ranking system to PRISE, uses relevance feedback*

• MUSCAT- different weighting system, but does use ranking, relevance feedback

• News Retrieval Tool (NRT)- user can use slider to weight a term, relevance feedback

Page 7: User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04.

Harman’s Results

• Users often preferred this type of natural language search to Boolean

• Searches were comparable to Boolean in speed & retrieval of relevant documents

• Easier for novice users; little training required

The Future of Statistical Ranking: -add more complex term weighting-use IDF (measures scarcity of terms)-use relevance feedback

Page 8: User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04.

WebMate: A Personal Agent for Browsing & Searching

L. Chen & K. Sycara • What is it? How does it work?

– A stand-alone proxy between the user’s browser & the web

– Monitors user’s actions & updates itself constantly with this info.

– Uses TF-IDF (term frequency-inverted document frequency) with multiple vector representations to see if a document is relevant based on what you have already deemed relevant

– Uses trigger pairs model to automatically provide more search terms based on the one(s) the user provides

Page 9: User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04.
Page 10: User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04.

What does WebMate do?• It can compile a personal

“newspaper” by:

– Parsing user-designated URLs– Extracting links from each

headline– Fetching the pages from those

links– Analyzing TF-IDF vectors for

each page– Comparing a page’s similarity

with the user’s current profile

Results: 50-60% accuracy in articles returned

• It can refine a search by:

– Taking the user’s search term.

– Using the trigger pairs model to find related terms

– Searching using those terms together

Results: WebMate gives many more relevant results than if you simply type the word into AltaVista or Lycos.

Page 11: User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04.

The Dynamic HomeFinder: Evaluating Dynamic Queries in a Real-Estate Information Exploration

System

C. Williamson & B. Shneiderman

• Direct Manipulation

– Benefits+ graphic representation of objects/actions+ buttons/sliders instead of query syntax+ rapid & reversible operations (immediate results)+ permits use with little/no training

Page 12: User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04.
Page 13: User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04.

The Experiment

• Users were asked to find answers to different types of queries using:

– HomeFinder (a dynamic query interface)

– Q&A (a natural language query interface)

– Paper listings sorted by different fields

Page 14: User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04.

The Results

• HomeFinder rocked the house because:– Its use took little/no training.– There were no error messages.– It was good for viewing trends.– It took less time.– It was easy for novices to use.– No complex query formulation was necessary.– People just liked it better!

Page 15: User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04.

A Language Modeling Approach to Relevance Profiling for Document Browsing

D. Harper, S. Coulthard, & S. Yixing

• With longer documents available online, users need a tool for within-document retrieval.

• SmartSkim: creates a relevance profile for a query about a document using language modeling (a statistical model of text that captures the distribution of text features)

Page 16: User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04.

Why SmartSkim could be cool:• It highlights all query

terms in the text

• It shows a histogram depicting the places in the text where relevant passages are to be found

• It is color coded to show where you have & have not looked

Page 17: User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04.

Whatever Happened to…?

• WebMate

• Dynamic HomeFinder

• Ben Shneiderman - Spotfire

Page 18: User Interfaces and Information Retrieval Dina Reitmeyer WIRED (i385d) 10.14.04.

Conclusion

• If we want our users to utilize our information retrieval systems, we have to have both user-friendly systems & user-friendly interfaces!

• Tools like statistical ranking, personal software agents, relevance profiling, & dynamic queries help us provide what the user needs to interact with a system.