Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural...
Transcript of Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural...
![Page 1: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/1.jpg)
Building Neural NER Models with Structural Prior
2018/09/11
Peng-Hsuan Li
1
![Page 2: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/2.jpg)
Outline
• Named Entity Recognition• Task• Features• Related Work
• Leveraging Linguistic Structures for NER
• Constructing Deep Cross-BLSTM with Self-Attention for NER
• CKIP NER
2
![Page 3: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/3.jpg)
Named Entities
• CoNLL-2003• PER, LOC, ORG, MISC
• OntoNotes 5.0• person, NORP, facility, organization, GPE, location, product, event, work-of-art, law,
language• date, time, percent, money, quantity, ordinal, cardinal
3
The defense secretary Donald Rumsfeld
ORG PER
![Page 4: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/4.jpg)
Gazetteer Features
• Senna• PER
• LOC
• ORG
• MISC
4
![Page 5: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/5.jpg)
Word Features
• Embedding• English -> 840B (common crawl)
• Chinese -> CNA (gigaword) + ASBC (sinica)
• Uppercase
• Upper-initial
• Lowercase
• Mixed
5
![Page 6: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/6.jpg)
Character Features
• Embedding• English -> Random initialization
• Chinese -> CNA (gigaword) + ASBC (sinica)
• Uppercase
• Lowercase
• Digit
6
![Page 7: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/7.jpg)
Sequence Tagging
7
The defense secretary Donald Rumsfeld
ORG PER
The defense secretary Donald Rumsfeld
O S-ORG O B-PER E-PER
Chunk Labels
B (begin)I (inside)O (outside)E (end)S (single)
![Page 8: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/8.jpg)
Bi-LSTM for Sequence Tagging
8
Jason Chiu and Eric Nichols. 2016. Named Entity Recognition with Bidirectional LSTM-CNNs. Transactions of ACL.
![Page 9: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/9.jpg)
Dilated CNN for Sequence Tagging
Emma Strubell, Patrick Verga, David Belanger, and Andrew McCallum. 2017. Fast and Accurate Entity Recognition with Iterated Dilated Convolutions. In Proceedings of EMNLP.
9
![Page 10: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/10.jpg)
Results of Sequence Tagging
10
Model Sources CoNLL-2003 OntoNotes 5.0
BLSTM
Huang et al., 2015Chiu and Nichols, 2016
Ma and Hovy, 2016Lample et al., 2016Strubell et al., 2017
90.67 83.76
BLSTM-CRF 90.94 86.99
BLSTM-CNN 90.98 -
BLSTM-CNN-CRF 91.21 -
Deep BLSTM - 86.19
Deep-BLSTM-CNN - 86.41
ID-CNN-CRF Strubell et al., 2017 90.65 86.84
![Page 11: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/11.jpg)
Outline
• Named Entity Recognition
• Leveraging Linguistic Structures for NER• Joint parsing and NER• Tree-LSTM for NER• Mitigating inconsistencies between parsing and NER
• Constructing Deep Cross-BLSTM with Self-Attention for NER
• CKIP NER
11
![Page 12: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/12.jpg)
Constituent Prediction
12
![Page 13: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/13.jpg)
CRF-CFG for Constituent Prediction
13
Jenny Rose Finkel and Christopher D. Manning. 2009. Joint Parsing and Named Entity Recognition. In Proceedings of HLT-NAACL.
![Page 14: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/14.jpg)
Bi-Tree-LSTM for Constituent Prediction
14
Peng-Hsuan Li, Ruo-Ping Dong, Yu-Siang Wang, Ju-Chieh Chou, and Wei-Yun Ma. 2017. Leveraging Linguistic Structures for Named Entity Recognition with Bidirectional Recursive Neural Networks. In Proceedings of EMNLP.
NNP
NNP
NNP
NP
senator
PER
Edward Kennedy
NPsenator Edward Kennedy
![Page 15: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/15.jpg)
Bi-Tree-LSTM for Constituent Prediction
15
senator Edward Kennedy
![Page 16: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/16.jpg)
Bi-Tree-LSTM for Constituent Prediction
16
senator Edward Kennedy
![Page 17: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/17.jpg)
Bi-Tree-LSTM for Constituent Prediction
17
senator Edward Kennedy
NNP NNP NNP
NP
NP
PER
![Page 18: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/18.jpg)
Inconsistencies between Parse and NER
18
NNPNNP NNP
NP
Taihang Mountain range
NNNNP
NP
NNP
NP
senator Edward Kennedy
LOCPER
Type-1Cross Siblings
Type-2Cross Branches
Peng-Hsuan Li. 2017. Leveraging Linguistic Structures for Named Entity Recognition with Bidirectional Recursive Neural Networks. Master’s Thesis, Department of Computer Science and Information Engineering, National Taiwan University.
![Page 19: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/19.jpg)
Eliminate Type-1: Constituency Tree Binarization
19
NNPNNP NNP
NP
senator Edward Kennedy
PER
Type-1Cross Siblings
Head
NNP
NNP
NNP
NP
senator
PER
Edward Kennedy
NP
Consistent
![Page 20: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/20.jpg)
Eliminate Type-1: Dependency Transformation
20
Ben ate a cat
NNP VBD DT NN
root nobjnsubj
det
DTNNP
nobj
VBD
nsubj
Ben ate a cat
NN
det
No Constituents No Type-1 Inconsistencies
![Page 21: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/21.jpg)
Eliminate Type-2: Pyramid Construction
21
Taihang Mountain range
NNNNP
NP
NNP
NP
Taihang Mountain range
LOCLOC
Type-2Cross Branches No Liguistic Structures
![Page 22: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/22.jpg)
Eliminate Type-2: Pyramid Construction
22
NNNNP
NP
NNP
NP
Taihang Mountain range
LOC
Type-2Cross Branches
NNNNP
NP
NNP
NP
Taihang Mountain range
LOC
No Inconsistencies
![Page 23: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/23.jpg)
Results of Constituent Prediction
23
Method Model Sources CoNLL-2003 OntoNotes 5.0
Sequence Tagging
BLSTM
Huang et al., 2015Chiu and Nichols, 2016
Ma and Hovy, 2016Lample et al., 2016Strubell et al., 2017
90.67 83.76
BLSTM-CRF 90.94 86.99
BLSTM-CNN 90.98 -
BLSTM-CNN-CRF 91.21 -
Deep BLSTM - 86.19
Deep BLSTM-CNN - 86.41
ID-CNN-CRF Strubell et al., 2017 90.65 86.84
Constituent PredictionCRF-CFG Finkel and Manning, 2009 - 82.42
Bi-Tree-RNN-CNN Li et al., 2017 88.91 87.21
![Page 24: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/24.jpg)
Analyses of Constituent Prediction
• Sequence Tagging vs. Constituent Prediction
24
93% Consistency 97%/100% Consistency
Method CoNLL-2003 OntoNotes 5.0
Sequence Tagging 91.21 86.99
Constituent Prediction 88.91 87.21/88.92
![Page 25: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/25.jpg)
Analyses of Constituent Prediction
• Sequence vs Tree
25
the first couple moves out of the White House on January 20th .
OntoNotes 5.0
Model Const-Only Prediction Precision Recall F1
Bi-RNN X the White 85.7 86.5 86.10
Bi-RNN O - 87.2 85.1 86.14
Bi-Tree-RNN O White House 88.0 86.2 87.10
![Page 26: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/26.jpg)
Ablation Study: Constituency Tree Binarization
26
OntoNotes 5.0
Model Binarize Consistency Precision Recall F1
BRNN X 93% 87.3 83.0 85.11
BRNN O 97% 88.0 86.2 87.10
![Page 27: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/27.jpg)
Ablation Study: Dependency Transformation
27
CoNLL 2003
Model Parser Precision Recall F1
BRNN StanfordRNN 88.9 86.9 87.91
BRNN SyntaxNet 90.2 87.7 88.91
![Page 28: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/28.jpg)
Ablation Study: Pyramid Construction
28
CoNLL 2003
Model Pyramid Precision Recall F1
BRNN X 89.1 82.9 85.89
BRNN O 90.2 87.7 88.91
![Page 29: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/29.jpg)
Ablation Study: Bidirectional
29
OntoNotes 5.0
Model Koran Precision Recall F1
Top-Down - 79.2 69.3 73.93
Bottom-Up PERSON 86.6 86.2 86.41
BRNN WORK OF ART 88.0 86.2 87.10
![Page 30: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/30.jpg)
30
He confirmed it by repeating the verses from the noble Koran and the two testimonies.
![Page 31: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/31.jpg)
Outline
• Named Entity Recognition
• Leveraging Linguistic Structures for NER
• Constructing Deep Cross Bi-LSTM with Self-Attention for NER• Deep Cross Bi-LSTM
• Multi-head self-attention
• CKIP NER
31
![Page 32: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/32.jpg)
A Pattern Across Past and Future
• 𝑤𝑙1, 𝑤𝑐 , 𝑤𝑟1 → (𝑂, 𝑂, 𝑂)
• 𝑤𝑙1, 𝑤𝑐 , 𝑤𝑟2 → (𝑂, 𝑂, 𝑂)
• 𝑤𝑙1, 𝑤𝑐 , 𝑤𝑟3 → (𝐵, 𝐼, 𝐸)-ORG
• 𝑤𝑙2, 𝑤𝑐 , 𝑤𝑟3 → (𝑂, 𝑂, 𝑂)
• (𝑤𝑙3, 𝑤𝑐 , 𝑤𝑟3) → (𝑂, 𝑂, 𝑂)
32
![Page 33: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/33.jpg)
Deep Bi-LSTM
33
![Page 34: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/34.jpg)
Self-Attention
34
Peng-Hsuan Li and Wei-Yun Ma. Constructing Deep BLSTM-CNN with Self-Attention for Sequence Labeling NER. Under review.
![Page 35: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/35.jpg)
Results
35
Method Model Sources CoNLL-2003 OntoNotes 5.0
Sequence Tagging
BLSTM
Huang et al., 2015Chiu and Nichols, 2016
Ma and Hovy, 2016Lample et al., 2016Strubell et al., 2017
90.67 83.76
BLSTM-CRF 90.94 86.99
BLSTM-CNN 90.98 -
BLSTM-CNN-CRF 91.21 -
Deep BLSTM - 86.19
Deep BLSTM-CNN - 86.41
ID-CNN-CRF Strubell et al., 2017 90.65 86.84
Deep Parallel BLSTM-CNN 91.44 87.69
Deep Parallel BLSTM-CNN-Attend (5-head) 91.37 88.13
Deep Cross BLSTM-CNN 91.24 88.39
Deep Cross BLSTM-CNN-Attend (5-head) 91.14 88.35
Constituent PredictionCRF-CFG Finkel and Manning, 2009 - 82.42
Bi-Tree-RNN-CNN Li et al., 2017 88.91 87.21
![Page 36: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/36.jpg)
oh, New York was really fun
36
![Page 37: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/37.jpg)
I am Gregor Cragey.
37
![Page 38: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/38.jpg)
Xinhua News Agency, Beijing, March 18th
38
![Page 39: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/39.jpg)
Outline
• Named Entity Recognition
• Leveraging Linguistic Structures for NER
• Constructing Deep Cross Bi-LSTM with Self-Attention for NER
• CKIP NER• Chinese NER
• Ancient Chinese Document NER
39
![Page 42: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/42.jpg)
42
Ancient Chinese NER
![Page 43: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/43.jpg)
43
Entity ID
Mention
Prefix
Ancient Chinese NER
SuffixLabel
Sample
![Page 44: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/44.jpg)
Ancient Chinese NER
• Entity Types• Person, officer, location, organization
• Can be decided by entity ID
• Goal• Sample typing
44
![Page 45: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/45.jpg)
Ancient Chinese NER
• Extract usable data
1. Include all samples
2. Compute entity-mention bipartite map
3. Remove the samples of which labels are not Y or N
4. Exclude the mentions that do not map to one single entity
5. Exclude the entities of which labels are not human-verified
45
![Page 46: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/46.jpg)
Ancient Chinese NER
46
𝐸𝑛𝑡𝑖𝑡𝑦1
𝐸𝑛𝑡𝑖𝑡𝑦2
𝐸𝑛𝑡𝑖𝑡𝑦3
𝐸𝑛𝑡𝑖𝑡𝑦4
𝐸𝑛𝑡𝑖𝑡𝑦5
𝐸𝑛𝑡𝑖𝑡𝑦6
𝑀𝑒𝑛𝑡𝑖𝑜𝑛1
𝑀𝑒𝑛𝑡𝑖𝑜𝑛2
𝑀𝑒𝑛𝑡𝑖𝑜𝑛3
𝑀𝑒𝑛𝑡𝑖𝑜𝑛4
𝑀𝑒𝑛𝑡𝑖𝑜𝑛5
𝑀𝑒𝑛𝑡𝑖𝑜𝑛6
![Page 47: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/47.jpg)
Ancient Chinese NER
47
𝐸𝑛𝑡𝑖𝑡𝑦1
𝐸𝑛𝑡𝑖𝑡𝑦2
𝐸𝑛𝑡𝑖𝑡𝑦3
𝐸𝑛𝑡𝑖𝑡𝑦4
𝐸𝑛𝑡𝑖𝑡𝑦5
𝐸𝑛𝑡𝑖𝑡𝑦6
𝑀𝑒𝑛𝑡𝑖𝑜𝑛1
𝑀𝑒𝑛𝑡𝑖𝑜𝑛2
𝑀𝑒𝑛𝑡𝑖𝑜𝑛3
𝑀𝑒𝑛𝑡𝑖𝑜𝑛4
𝑀𝑒𝑛𝑡𝑖𝑜𝑛5
𝑀𝑒𝑛𝑡𝑖𝑜𝑛6
![Page 48: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/48.jpg)
Ancient Chinese NER
48
Dataset Unique Entities Unique Mentions Samples Characters Y(%)
Person
Train 5,238 6,106 473,766 15,160,443 75.60
Validate 1,559 1,627 156,396 5,006,014 72.91
Test 2,047 2,205 157,241 5,047,811 71.32
Officer
Train 12 44 5,600 186,210 92.96
Validate 8 9 1,861 62,306 98.93
Test 6 6 213 6,796 97.65
Location
Train 62 94 129,059 4,063,204 83.20
Validate 40 48 36,369 1,131,185 84.52
Test 52 68 38,927 1,218,336 93.25
Organization
Train 215 258 117,813 3,668,503 97.76
Validate 47 49 40,958 1,325,299 97.80
Test 63 68 35,047 1,124,819 98.29
![Page 49: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/49.jpg)
Ancient Chinese NER
49
𝑐1 𝑐𝑚 𝑐𝑚+1 𝑐𝑚+𝑛… …
𝑙𝑚𝑒𝑛𝑡𝑖𝑜𝑛Cross-BLSTM-Max
𝑃𝑁, 𝑃𝑌
• 𝑚 prefix characters, 𝑛 suffix characters
• 𝑐𝑖 : character + 2 features indicating prefix/suffix
• 𝑙𝑚𝑒𝑛𝑡𝑖𝑜𝑛: mention length
• 100-100 BLSTM
![Page 50: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/50.jpg)
Ancient Chinese NER
• Character embedding
• Random initialization
• Alphabet: training set occurrences≥30
50
Corpus Training Set Characters Alphabet Size
Person 15,160,443 4,311
Officer 186,210 763
Location 4,063,204 2,836
Organization 3,668,503 2,415
![Page 51: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/51.jpg)
Ancient Chinese NER
51
Dataset Unique Entities Unique Mentions Samples Characters Y (%) Accuracy (%)
Person
Train 5,238 6,106 473,766 15,160,443 75.60 -
Validate 1,559 1,627 156,396 5,006,014 72.91 86.11
Test 2,047 2,205 157,241 5,047,811 71.32 87.91
Officer
Train 12 44 5,600 186,210 92.96 -
Validate 8 9 1,861 62,306 98.93 98.93
Test 6 6 213 6,796 97.65 97.65
Location
Train 62 94 129,059 4,063,204 83.20 -
Validate 40 48 36,369 1,131,185 84.52 85.59
Test 52 68 38,927 1,218,336 93.25 83.91
Organization
Train 215 258 117,813 3,668,503 97.76 -
Validate 47 49 40,958 1,325,299 97.80 97.80
Test 63 68 35,047 1,124,819 98.29 98.29
![Page 52: Building Neural NER Models with Structural Prior€¦ · Building Neural NER Models with Structural Prior 2018/09/11 Peng-Hsuan Li 1](https://reader035.fdocuments.net/reader035/viewer/2022071218/6053adc69047c7251105a1f2/html5/thumbnails/52.jpg)
Outline
• Named Entity Recognition• Task• Features• Related Work
• Leveraging Linguistic Structures for NER• Joint parsing and NER• Tree-LSTM for NER• Mitigating inconsistencies between parsing and NER
• Constructing Deep Cross Bi-LSTM with Self-Attention for NER• Deep Cross Bi-LSTM• Multi-head self-attention
• CKIP NER• Chinese NER• Ancient Chinese Document NER
52