Co reference resolution

23
Coreference Resolution USING HOBBS’S ALGORITHM Hemant Kumar (10579) Babul Bhanu (10545) Pratheek Adidela (10583) Shekhar Kumar (09562)

description

Co-reference resolution..Pronominal resolution using Hobb's algorithm

Transcript of Co reference resolution

Page 1: Co reference resolution

Coreference Resolution USING HOBBS’S ALGORITHM

Hemant Kumar (10579)Babul Bhanu (10545)

Pratheek Adidela (10583)

Shekhar Kumar (09562)

Page 2: Co reference resolution

Contents

What is coreference resolution

Anaphoric and Cataphoric references

Direct and indirect anaphora

Hobbs’s algorithm

Hobb’s algorithm for Hindi

Problems in implementation

Applications

Page 3: Co reference resolution

What is Coreference Resolution

In computational linguistics, Coreference resolution is a well-studied problem in discourse. To derive the correct interpretation of text, or even to estimate the relative importance of various mentioned subjects, pronouns and other referring expressions must be connected to the right individuals.

There are two types of references: When the reader must look back to the previous context, Coreference is called anaphoric reference. When the reader must look forward, it is cataphoric reference.

Page 4: Co reference resolution

Anaphoric and Cataphoric references

Cataphora

In linguistics, cataphora is used to first insert an expression or word that co-refers with a later expression in the discourse. An example of strict, sentence-internal cataphora in English is the following sentence

“When he arrived home, John went to sleep.”

Anaphora

In linguistics, anaphora is the use of an expression the interpretation of which depends upon another expression in context (its antecedent or postcedent).In the sentence

“Sally arrived, but nobody saw her”

the pronoun ‘her’ is anaphoric, referring back to Sally.

Page 5: Co reference resolution

Hobbs’s algorithm

Hobbs’s algorithm is mainly focused on “surface parse tree”. By this is means that the tree that exhibits the grammatical structure of the sentences (its division into subject, verb, objects, adverbials, etc.) without permuting or omitting any of the words in a sentence.

The naïve algorithm traverses the surface parse tree in a particular order looking for a noun phrase of the correct gender and number.

Page 6: Co reference resolution
Page 7: Co reference resolution

Hobbs’s algorithm for Hindi

Pronouns in Hindi exhibit a great deal of ambiguity.

Pronouns in the first, second and third person do not convey any information about the gender

Redefined the grammar for Hindi

Page 8: Co reference resolution

S1

NP_erg(SUB) VP2

Nbar

NounMr.Smith

Postpne

NP2_acc(DO) VP1

Nbar Postpko

NounDriver

PP_loc VP

Verbdekha

NP1(PO) postpmein

Pronoun_possuske

Nbar

Nountruck

EXAMPLE 1Non-Reflexive pronoun

Page 9: Co reference resolution

S1

NP_erg(SUB) VP2

Nbar

NounMr.Smith

Postpne

NP2_acc(DO) VP1

Nbar Postpko

NounDriver

PP_loc VP

Verbdekha

NP1(PO) postpmein

Pronoun_possuske

Nbar

Nountruck

Page 10: Co reference resolution

S1

NP_erg(SUB) VP2

Nbar

NounMr.Smith

Postpne

NP2_acc(DO) VP1

Nbar Postpko

NounDriver

PP_loc VP

Verbdekha

NP1(PO) postpmein

Pronoun_possuske

Nbar

Nountruck

Page 11: Co reference resolution

S1

NP_erg(SUB) VP2

Nbar

NounMr.Smith

Postpne

NP2_acc(DO) VP1

Nbar Postpko

NounDriver

PP_loc VP

Verbdekha

NP1(PO) postpmein

Pronoun_possuske

Nbar

Nountruck

Page 12: Co reference resolution

S1

NP_erg(SUB) VP2

Nbar

NounMr.Smith

Postpne

NP2_acc(DO) VP1

Nbar Postpko

NounDriver

PP_loc VP

Verbdekha

NP1(PO) postpmein

Pronoun_possuske

Nbar

Nountruck

Page 13: Co reference resolution

S1

NP_erg(SUB) VP2

Nbar

NounMr.Smith

Postpne

NP2_acc(DO) VP1

Nbar Postpko

NounDriver

PP_loc VP

Verbdekha

NP1(PO) postpmein

Pronoun_possuske

Nbar

Nountruck

Page 14: Co reference resolution

S1

NP_erg(SUB) VP2

Nbar

NounMr.Smith

Postpne

NP2_acc(DO) VP1

Nbar Postpko

NounDriver

PP_loc VP

Verbdekha

NP1(PO) postpmein

Pronoun_possuske

Nbar

Nountruck

Previous sentence

Page 15: Co reference resolution

S1

NP_erg(SUB) VP2

Nbar

NounMr.Smith

Postpne

NP2_acc(DO) VP1

Nbar Postpko

NounDriver

PP_loc VP

Verbdekha

NP1(PO) postpmein

Pronoun_possapne

Nbar

Nountruck

EXAMPLE 2Reflexive pronoun

Page 16: Co reference resolution

S1

NP_erg(SUB) VP2

Nbar

NounMr.Smith

Postpne

NP2_acc(DO) VP1

Nbar Postpko

NounDriver

PP_loc VP

Verbdekha

NP1(PO) postpmein

Pronoun_possapne

Nbar

Nountruck

Page 17: Co reference resolution

S1

NP_erg(SUB) VP2

Nbar

NounMr.Smith

Postpne

NP2_acc(DO) VP1

Nbar Postpko

NounDriver

PP_loc VP

Verbdekha

NP1(PO) postpmein

Pronoun_possapne

Nbar

Nountruck

Page 18: Co reference resolution

S1

NP_erg(SUB) VP2

Nbar

NounMr.Smith

Postpne

NP2_acc(DO) VP1

Nbar Postpko

NounDriver

PP_loc VP

Verbdekha

NP1(PO) postpmein

Pronoun_possapne

Nbar

Nountruck

Page 19: Co reference resolution

S1

NP_erg(SUB) VP2

Nbar

NounMr.Smith

Postpne

NP2_acc(DO) VP1

Nbar Postpko

NounDriver

PP_loc VP

Verbdekha

NP1(PO) postpmein

Pronoun_possapne

Nbar

Nountruck

Page 20: Co reference resolution

S1

NP_erg(SUB) VP2

Nbar

NounMr.Smith

Postpne

NP2_acc(DO) VP1

Nbar Postpko

NounDriver

PP_loc VP

Verbdekha

NP1(PO) postpmein

Pronoun_possapne

Nbar

Nountruck

Page 21: Co reference resolution

Challenges in implementation

Parts of speech tagger

Richness of word database

In Hindi the gender of noun is identified by using verbs

Page 22: Co reference resolution

Applications

English – Hindi machine translation

Machine learning

Page 23: Co reference resolution

THANK YOU