Automated WordNet Construction Using Word Embeddings · (WOLF) [4] Universal Wordnet [5] Extended...

1
(pistol) (shooting) (to shoot) (species) (crossbow) (kind) (family) (plant) (coriander) (cilantro) (celery) (garlic) Isometric mapping of sense-clusters found for "лук" ("bow", "onion") A u t o m a t e d W o r d N e t C o n s t r u c t i o n U s i n g W o r d E m b e d d i n g s Mikhail Khodak, Andrej Risteski, Christiane Fellbaum, Sanjeev Arora Computer Science Department, Princeton University assign a score to each candidate synset flagstone flag slab Target Word get translations of target word using a bilingual dictionary or machine translation (MT) get set of candidate synsets by querying Princeton WordNet (PWN) using translations of Translations MT + PWN Synset Scoring Threshold Matching return all synsets with score above a threshold Candidates Synset Score Threshold 0.41 flag.n.01 flag.n.04 flag.n.06 flag.n.07 iris.n.01 masthead.n.01 pin.n.08 slab.n.01 score = 0.280 score = 0.222 score = 0.360 score = 0.161 score = 0.200 score = 0.195 score = 0.251 score = 0.521 Contributions: Wordnet Libre du Français (WOLF) [4] Universal Wordnet [5] Extended Open Multilingual Wordnet [6] Synset Representation Synset Representation + Sense Clusters F-Score of Synset Matching for French and Russian Isometric mapping of sense-clusters found for "fox" [1] Fellbaum, MIT Press. [2] Arora et al., ICLR 2017. [3] Arora et al., arXiv 2016. [4] Sagot and Fišer, LREC 2008. [5] de Melo and Weikum, CIKM 2009. [6] Bond and Foster, ACL 2013. Princeton WordNet French-English Dictionary French word 'dalle' correct synsets: flag.n.06 - slab.n.01 retrieved synsets: slab.n.01 Synset Representation: Sense Clusters:

Transcript of Automated WordNet Construction Using Word Embeddings · (WOLF) [4] Universal Wordnet [5] Extended...

Page 1: Automated WordNet Construction Using Word Embeddings · (WOLF) [4] Universal Wordnet [5] Extended Open Multilingual Wordnet [6] Synset Representation Synset Representation + Sense

(pistol)

(shooting)

(to shoot)

(species)

(crossbow)(kind)

(family)

(plant)

(coriander)

(cilantro)

(celery)

(garlic)

Isometric mapping of sense-clustersfound for "лук"("bow", "onion")

Automated WordNet ConstructionUsing Word Embeddings

Mikhail Khodak, Andrej Risteski, Christiane Fellbaum, Sanjeev AroraComputer Science Department, Princeton University

assign a scoreto each

candidate synset

flagstone

flag

slab

TargetWord

get translations of target word using a

bilingual dictionary or machine translation (MT)

get set of candidate synsetsby querying PrincetonWordNet (PWN) using

translations of

Translations

MT + PWN SynsetScoring

ThresholdMatching

return all synsetswith score abovea threshold

Candidates SynsetScore

Threshold 0.41

flag.n.01

flag.n.04

flag.n.06

flag.n.07

iris.n.01

masthead.n.01

pin.n.08

slab.n.01

score = 0.280

score = 0.222

score = 0.360

score = 0.161

score = 0.200

score = 0.195

score = 0.251

score = 0.521

Contributions:

Wordnet Libre du Français (WOLF) [4]

Universal Wordnet [5]

Extended Open Multilingual

Wordnet [6]

Synset Representation

Synset Representation

+ Sense Clusters

F-Score of Synset Matching for French and Russian

Isometric mapping of sense-clusters found for "fox"

[1] Fellbaum, MIT Press.[2] Arora et al., ICLR 2017.[3] Arora et al., arXiv 2016.

[4] Sagot and Fišer, LREC 2008.[5] de Melo and Weikum, CIKM 2009.[6] Bond and Foster, ACL 2013.

PrincetonWordNet

French-EnglishDictionary

French word

'dalle'

��������

correct synsets:flag.n.06 -slab.n.01 �

retrieved synsets:slab.n.01 �

Synset Representation:

Sense Clusters: