MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System...

15
MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz http://www.ai.mit.edu/projects/infolab/

description

MIT Artificial Intelligence Laboratory — Research Directions What’s Wrong with Keyword Search?

Transcript of MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System...

Page 1: MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz

MIT Artificial Intelligence Laboratory — Research Directions

The START InformationAccess System

Boris Katzhttp://www.ai.mit.edu/projects/infolab/

Page 2: MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz

MIT Artificial Intelligence Laboratory — Research Directions

Finding information on line

Two Approaches: 1. Keyword search (search engines, e.g., AltaVista)

2. Natural language processing

The Problem:

Page 3: MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz

MIT Artificial Intelligence Laboratory — Research Directions

What’s Wrong with Keyword Search?

Page 4: MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz

MIT Artificial Intelligence Laboratory — Research Directions

What’s Right About Natural Language Processing?

Page 5: MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz

MIT Artificial Intelligence Laboratory — Research Directions

What’s Wrong with Natural Language Processing (today)?

1. Too hardFull-text NL understanding still beyond reach•Intersentential reference

•Paraphrasing

•Summarization

•Common sense implication2. Too slow3. Not all information is language

Most Web resources are not textual•Maps and Images

•Sound and Video

•Multimedia

Web resources are distributed across numerous non-traditional databases

Page 6: MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz

MIT Artificial Intelligence Laboratory — Research Directions

START (SynTactic Analysis using Reversible Transformations) provides multimedia information access using natural language.

Natural languageNatural language is human language. You don’t have to learn a special language to use START. Ask your questions in English; enter information using English.

Multimedia access using natural language annotationsSTART lets you use English to access any kind of information: text, pictures, movies, and more.

“Just the right information”START gives you the answer you want without including a thousand others.

Virtual collaborationSTART retrieves information from its own knowledge base and from databases all over the Web.

What is START?

Page 7: MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz

MIT Artificial Intelligence Laboratory — Research Directions

Natural language is human language. You don’t have to learn a special language to use START. Ask your questions in English; enter information using English

Natural Language

Page 8: MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz

MIT Artificial Intelligence Laboratory — Research Directions

START lets you use English to access any kind of information: text, pictures, movies, and more.

Multimedia Access Using Natural Language Annotations

Page 9: MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz

MIT Artificial Intelligence Laboratory — Research Directions

START gives you the answer you want without including a thousand other answers.

Just the Right Information

Page 10: MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz

MIT Artificial Intelligence Laboratory — Research Directions

START retrieves information from its own knowledge base and from databases all over the Web.

Virtual Collaboration

Page 11: MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz

MIT Artificial Intelligence Laboratory — Research Directions

Bridge the gap between our ability to analyze natural language sentences and other information and our desire to access the huge amount of data now available on the Web.

Annotations are collections of natural language sentences and phrases that describe the content of various information segments.

START• analyzes these annotations • creates the necessary representational structures

• produces special pointers to the information segments summarized by the annotations.

Natural Language Annotations

Page 12: MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz

MIT Artificial Intelligence Laboratory — Research Directions

STARTServer

STARTServer

STARTServer

Document

Natural Language Annotations

Annotation

InformationProvider

InformationSeeker

(negotiation)

(submitted)

(retrieved)

Xxx xx xxxx xxxx xxxxx x xxxxxx x xxx x xxx

Xxx xxxx xxx xxxx x

“Neptune was discoveredusing mathematics.”

+

Document

Xxx xx xxxx xxxx xxxxx x xxxxxx x xxx x xxx

Xxx xxxx xxx xxxx x

Question

“How was Neptune discovered?”

STARTServer

Page 13: MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz

MIT Artificial Intelligence Laboratory — Research Directions

HPKB

POTUS

Fortune500

Uniform Access

START

NL questions

Multimediaresponses

OmnibaseQueries

Data

• Local knowledge base of ternary expressions• Core vocabulary

• Uniform interface to multiple database formats (Web, text, etc.)• Extended lexicon

U.S. Census

IMDb

Page 14: MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz

MIT Artificial Intelligence Laboratory — Research Directions

How START WorksWeb browser

START

Parser

Matcher

English

Databaseof T-exps

T-exps from KB

Generator

HTML

English

Annotations

Scripts

Nativeknowledge

Omnibase(externalknowledge)

Scripts

WWW

PotusIMDb

World Factbook

U.S. Census

Input T-exps

Page 15: MIT Artificial Intelligence Laboratory — Research Directions The START Information Access System Boris Katz

MIT Artificial Intelligence Laboratory — Research Directions