
Visual Supervision in Bootstrapped Information Extraction
Matthew Berger 1,2, Ajay Nagesh 1, Joshua A. Levine 1, Mihai Surdeanu 1, Hao Helen Zhang 1

1 University of Arizona  2 Vanderbilt University

• 1: Bootstrapping model automatically labels entities, performs update
• 2: Human labels entities in visual interface
• 3: Model updates based on human labels (the full loop is sketched below)
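A minimal sketch of this human-in-the-loop cycle, assuming hypothetical model and interface methods (promote, sample, annotate, and update are our names, not the poster's):

# Hypothetical sketch of the bootstrapping loop; all names illustrative.
def bootstrap(model, unlabeled, interface, rounds=10):
    labeled = []
    for _ in range(rounds):
        # 1: model automatically labels (promotes) confident entities
        labeled += model.promote(unlabeled)
        # 2: human labels a sampled batch in the visual interface
        labeled += interface.annotate(model.sample(unlabeled))
        # 3: model updates its embeddings from all labels gathered so far
        model.update(labeled)
    return model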

Motivation
Data Annotation in Information Extraction: typically a list-based interface of entities ranked by uncertainty.

Is this the best we can do? We study different visual interfaces, sampling criteria, and interactions for entity annotation.

Embedding-Based Bootstrapping
Model: semi-supervised word embeddings capturing the distributional similarity of entities with patterns.

Example: "John said on Saturday" (Entity: "John"; Pattern: the surrounding context, e.g. "said on Saturday")

• Unsupervised objective: Skip-Gram
• Supervised objective: margin-based hinge loss over labeled data (human- or machine-promoted); a plausible form is sketched below
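The poster does not give the exact objective; a plausible form of a margin-based hinge loss over entity-pattern pairs (notation ours) is

\[
\mathcal{L}_{\text{sup}} \;=\; \sum_{(e,\,p^{+},\,p^{-})} \max\!\bigl(0,\; m - s(\mathbf{v}_e, \mathbf{v}_{p^{+}}) + s(\mathbf{v}_e, \mathbf{v}_{p^{-}})\bigr),
\]

where \(\mathbf{v}_e\) is the embedding of a labeled entity, \(p^{+}\) a pattern of the same category, \(p^{-}\) a pattern of a different category, \(s(\cdot,\cdot)\) a similarity (e.g. dot product), and \(m\) a margin.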

Visual Interfaces

List:
• I: labeling interface – list with uncertainty sampling
• II: entity labeling – individual entity selection
• III: bootstrapping – round progression

Scatterplot:
• I: labeling interface – 2D scatterplot with clustering & coverage sampling (both sampling criteria are sketched below)
• II: entity labeling – group-wise selection via area brushing
• III: bootstrapping – view projection of entities during training, determine advancement
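A minimal sketch contrasting the two sampling criteria; the function names and the scikit-learn clustering choice are our assumptions, given entity embeddings and per-category confidence scores:

import numpy as np
from sklearn.cluster import KMeans

def uncertainty_sample(scores, k):
    # scores: (n_entities, n_categories) confidences; pick the k entities
    # whose best-category confidence is lowest (list interface).
    return np.argsort(scores.max(axis=1))[:k]

def coverage_sample(embeddings, k):
    # cluster the entity embeddings and take the entity nearest each
    # cluster center, so the sample covers the space (scatterplot interface).
    km = KMeans(n_clusters=k, n_init=10).fit(embeddings)
    return km.transform(embeddings).argmin(axis=0)  # one index per cluster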

User Study
• User Base: 10 participants, within-subject design, i.e. each participant was presented with both interfaces.
• Dataset: OntoNotes (Weischedel et al. 2013), limited to 4 categories: people, organizations, geopolitical entities, religious/political affiliations.
• Experimental Setup: each participant performs 10 rounds of labeling, labeling entities for up to 1 minute in each round.
• Evaluation: bootstrapping, the throughput of entities promoted in each round; and extrapolation, the throughput of the bootstrapper after obtaining all human labels (a throughput sketch follows this list).
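One reading of the throughput metric, counting correctly promoted entities per round; the gold-label assumption and all names are ours:

def round_throughput(promoted, gold):
    # promoted: list of (entity, category) pairs promoted this round;
    # gold: dict mapping entity -> true category.
    return sum(1 for e, c in promoted if gold.get(e) == c)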

Results

Extrapolation
[Figure: extrapolation throughput; participant performance at the last round]

Takeaway: aim for an interface that yields many annotations at reasonable accuracy, rather than few, higher-quality ones

Bootstrapping
[Figure: bootstrapping throughput over each round; example participant performance]

Takeaway: labeling volume can counter noise in annotations

Labeling Consensus
[Figure: consensus labeling across participants, union vs. majority vote]

Takeaway: the bootstrapper can tolerate label noise, but not too much noise.
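A minimal sketch of the two consensus schemes, assuming each participant contributes a set of (entity, category) labels (names ours):

from collections import Counter

def union_consensus(label_sets):
    # keep a label if ANY participant assigned it
    return set().union(*label_sets)

def majority_consensus(label_sets):
    # keep a label only if more than half of the participants assigned it
    counts = Counter(l for s in label_sets for l in s)
    return {l for l, c in counts.items() if c > len(label_sets) / 2}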

Acknowledgements: NSF TRIPODS grant (NSF-1749858) and the Bill and Melinda Gates Foundation HBGDki Initiative. Mihai Surdeanu discloses a financial interest in lum.ai. This interest has been disclosed to the University of Arizona Institutional Review Committee and is being managed in accordance with its conflict of interest policies.

Contact: [email protected]
